key: cord- -lv mll authors: kim, hyun; hong, yeongjin; shibayama, keigo; suzuki, yasuhiko; wakamiya, nobutaka; kim, youn uck title: functional analysis of the receptor binding domain of sars coronavirus s region and its monoclonal antibody date: - - journal: genes genomics doi: . /s - - - sha: doc_id: cord_uid: lv mll severe acute respiratory syndrome (sars) is caused by the sars coronavirus (cov). the spike protein of sars-cov consists of s and s domains, which are responsible for virus binding and fusion, respectively. the receptor-binding domain (rbd) positioned in s can specifically bind to angiotensin-converting enzyme (ace ) on target cells, and ace regulates the balance between vasoconstrictors and vasodilators within the heart and kidneys. here, a recombinant fusion protein containing -amino acid rbd (residues – ) and glutathione s-transferase were prepared for binding to target cells. additionally, monoclonal rbd antibodies were prepared to confirm rbd binding to target cells through ace . we first confirmed that ace was expressed in various mouse cells such as heart, lungs, spleen, liver, intestine, and kidneys using a commercial ace polyclonal antibody. we also confirmed that the mouse fibroblast (nih t ) and human embryonic kidney cell lines (hek ) expressed ace . we finally demonstrated that recombinant rbd bound to ace on these cells using a cellular enzyme-linked immunosorbent assay and immunoassay. these results can be applied for future research to treat ace -related diseases and sars. severe acute respiratory syndrome (sars) is a fatal emerging infectious disease caused by the sars coronavirus (cov) (baker ; rota et al. ) . sars-covlike virus has been isolated from horseshoe bats in china, and this has been postulated to be the natural reservoir for the virus (lau et al. ; li et al. b ). 
although there have been no recent sars outbreaks, serious concerns remain about its re-emergence from host animals and its potential application as a bioterrorism agent (chan et al. ; peiris and yuen ) . sars-cov mediates infection of target cells via the spike (s) protein, which is a type one transmembrane glycoprotein divided into two functional domains of s ( - a.a.) and s ( - a.a.) (he et al. a; li et al. a ). infection of sars-cov is initiated by binding of the s protein to the angiotensin-converting enzyme (ace ) functional receptor expressed on target cells (li et al. ) . a -amino acid fragment (residues - ) within the s subunit of the s protein has been characterized as the minimal receptor-binding domain (rbd) . in early studies, mice immunized with inactivated sars-cov were used to generate monoclonal antibodies capable of blocking infectivity. of these, several antibodies were directed against the s protein (berry et al. ) . the s protein serves as the main antigen that elicits protective immune responses, including neutralizing antibodies in infected humans and animals (bisht et al. ; buchholz et al. ; greenough et al. ; hofmann et al. ) . several studies have demonstrated that rbd of the s region is a major target for neutralizing sars-cov antibodies (he et al. b (he et al. , zhou et al. ) . ace , also called aceh (ace homologue), is an integral membrane protein and a zinc metalloprotease of the ace family that also includes somatic and germinal ace (komatsu et al. ; tipnis et al. ) . ace has been implicated in the pathology of hartnup's disease, a disorder of amino acid homeostasis, and it has been recently revealed that ace controls intestinal inflammation and diarrhea via its function in amino acid transport; thus, regulating the gut microbiome (kuba et al. ) . mouse ace has about a % amino acid identity to the n-and c-terminal domains of mouse somatic ace. 
the predicted mouse ace protein sequence consists of amino acids, including an n-terminal signal peptide, a single catalytic domain, a c-terminal membrane anchor, and a short cytoplasmic tail. ace is a newly described renin-angiotensin system (ras) component that is sensitive to chloride ion concentration (donoghue et al. ). it is a membrane-bound enzyme that acts as a monocarboxypeptidase and is an essential regulator of heart function. within the ras, ace competes with ace because it is capable of hydrolyzing the inactive decapeptide angiotensin i (ang i) into the nonapeptide ang ( - ), thus decreasing the amount of ang i available for pressor ang ii generation by ace. similarly, ace degrades the vasoconstrictor ang ii into the vasodilator ang ( - ), which may also be produced from ang ( - ) hydrolysis by ace (donoghue et al. ; vickers et al. ; zhong et al. ). the antagonistic relationship between ace and ace modulates the balance between ang ii (vasopressor) and ang - (vasodilator), which plays a significant role in regulating renal and cardiovascular functions. sars-cov infections and the s protein decrease ace expression, and s protein injections into mice worsen acute lung failure in vivo. in contrast, ace and the type ang ii receptor protect mice from severe acute lung injury, but other components of the ras (including ace, ang ii, and the ang ii type a receptor) promote disease occurrence. these findings suggest a possible therapeutic role for ace in acute lung injury, which affects many people worldwide every year. expression of ace is more restricted than ace, which is widely distributed on the endothelial cells of the arteries, arterioles, and venules in the heart and kidneys (tipnis et al. ; oudit et al. ). ace is also expressed in the vascular smooth muscle cells of the intrarenal arteries, the renal tubular epithelium, coronary blood vessels (donoghue et al. ), and adult leydig cells of the testis (douglas et al. ). 
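
The competing conversions described above for the renin-angiotensin system can be written down as a small lookup table. The sketch below is only an illustration of that reasoning, not part of the study. The peptide labels follow standard nomenclature and are an assumption here: Ang (1-9) for the nonapeptide and Ang (1-7) for the vasodilator product. The function name `products` is hypothetical.

```python
# Minimal sketch of the ACE / ACE2 conversions described in the text.
# Peptide labels (Ang (1-9), Ang (1-7)) are assumed from standard nomenclature.
# Each entry maps (substrate, enzyme) -> product.
RAS_STEPS = {
    ("Ang I", "ACE"): "Ang II",        # pressor peptide generation by ACE
    ("Ang I", "ACE2"): "Ang (1-9)",    # decapeptide -> nonapeptide by ACE2
    ("Ang II", "ACE2"): "Ang (1-7)",   # vasoconstrictor -> vasodilator by ACE2
    ("Ang (1-9)", "ACE"): "Ang (1-7)", # alternative route to the vasodilator
}


def products(substrate, enzymes):
    """Return every peptide reachable from `substrate` using only `enzymes`."""
    reached = set()
    frontier = [substrate]
    while frontier:
        current = frontier.pop()
        for (source, enzyme), product in RAS_STEPS.items():
            if source == current and enzyme in enzymes and product not in reached:
                reached.add(product)
                frontier.append(product)
    return reached


if __name__ == "__main__":
    for enzymes in ({"ACE"}, {"ACE2"}, {"ACE", "ACE2"}):
        print(sorted(enzymes), "->", sorted(products("Ang I", enzymes)))
```

With ACE alone, the table yields only the pressor peptide. Once ACE2 is included, Ang I and Ang II are both diverted toward the vasodilator product, which is the balance the passage describes.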
other investigators have shown that ace expression occurs on the surface of lung alveolar epithelial cells and enterocytes of the small intestine (hamming et al. ) . ace mediates binding between a vero e -ace expressing cell line and the recombinant s protein expressed on the surface of cho cells, even under high stringency washing conditions (chou et al. ) . ace was recently reported at various levels in various tissues that also express ace mrna. because ace is a functional receptor for the sars-cov, its tissue distribution seems to be of great importance and appears to be species specific. ace may also contribute to programmed hypertension. here, we report that a recombinant rbd fusion protein induced a high titer of rbd-specific monoclonal antibodies, and effectively operated the antigen protein. our cellular enzyme-linked immunosorbent assay (elisa) and competitive binding assay using a polyclonal ace antibody indicated that our prepared recombinant rbd fusion protein binds to various tissues as well as nih t and hek cells through ace . we think that our prepared recombinant rbd and its monoclonal antibody can be developed to prevent sars and disease pathogenesis. the pgex- t- plasmid was purchased from ge (ge healthcare life sciences, uppsala, sweden) to express the rbd-gst fusion protein. goat anti-mouse igg alkaline phosphatase and -nitrophenyl phosphate (npp) were provided by sigma (st. louis mo, usa) for the elisa. dulbecco's modified eagle's medium (dmem) was purchased from gibco (grand island, ny, usa) for hybridoma cell culture, and hat and ht were obtained from sigma for selecting the hybridoma cells. sp / myeloma, hek , and nih t cells were provided by the american type culture collection (rockville, md, usa) for preparing the hybridoma cell and cellular elisa, respectively. various tissues were lysed from balb/c mice purchased from slc (tokyo, japan) for detecting ace expression. 
alexa fluor reactive dye was purchased from invitrogen (carlsbad, ca, usa) to obtain confocal images. all other chemicals used were of the best grade available from commercial sources. the pcr fragment encoding the -a.a. rbd sequence was amplified by polymerase chain reaction (pcr) with two primers, srar ( -gtccg cgaattcaacatcaccaacctgtg) and sars ( -cttcggctcgagcacggtggcgggcgcgt), using the codon-optimized sars spike protein gene in pcdna . (a gift of dr. m. farzan, harvard medical school, ma) as the template. the pcr fragments were digested with ecori and xhoi and ligated into the same sites of the pgex t- vector (ge healthcare bio-sciences corp, nj, usa). as a result, the expression plasmid carries an n-terminal glutathione-s-transferase (gst)-tag/thrombin/lac promoter. nucleotide sequence analysis was performed using the dye terminator cycle sequencing ready reaction kit with an abi dna sequencer. the recombinant plasmid was transformed into competent e. coli bl codon plus cells, which were grown with constant shaking in yt broth ( g tryptone, g yeast extract, g nacl/l) in the presence of ampicillin ( μg/ml). for induction of the recombinant protein, five ml of cell suspension was inoculated into ml fresh yt medium in a ml flask and incubated at °c until the optical density reached . . the culture suspensions were further incubated for h at °c in the presence of . mm isopropyl-β-d-thiogalactoside with vigorous shaking ( rpm). four ml of bacterial culture was harvested by centrifugation at °c, and the pellet was resuspended in ml of reducing sodium dodecyl sulfate polyacrylamide gel electrophoresis (sds-page) sample buffer. the samples were heated at °c for min, and only the supernatant was applied to % sds-page gels using a mini-protean electrophoresis apparatus (bio-rad, hercules, ca, usa). the gel was soaked in . m cold kcl solution for min until the protein bands appeared as a gray color, and then the bands were cut out with a razor for homogenizing. the chopped gel and . 
ml pbs were added to a microtube for homogenizing, and about strokes were applied to crush the gel. the tube was centrifuged for min at , g to remove the gel pieces, and the supernatant was then filtered through a . μm filter. purity was confirmed by % sds-page, and the purified protein was used to immunize mice to prepare a monoclonal antibody. the purified rbd fusion protein was mixed with an equal volume of complete freund's adjuvant (sigma), and the antigen-adjuvant mixture was injected intraperitoneally into female balb/c mice ( weeks old). the first injection was followed by three booster injections at - or -week intervals. the final injection was administered without adjuvant - days before cell fusion. after confirming the antibody titer in tail blood from immunized mice, b cells were separated from the spleen for fusion with myeloma cells. feeder cells were prepared day before fusion from a week-old mouse. the abdominal skin was carefully removed and feeder cells were collected by centrifugation. the fusion experiments were performed as follows. spleen cells were released by tearing the removed spleen with forceps and the rough side of a glass slide, and the cells were collected in a ml centrifuge tube. the spleen cells and sp / -ag- mouse myeloma cells were mixed in a : ratio, and ml of % polyethylene glycol in serum-free dmem was added slowly. the fusion process was allowed to continue for min at °c, and the cells were centrifuged for min at g. then, . ml dmem was added slowly for the first min, and ml was added over the next min. the fused cells were brought up to ml with dmem and collected by centrifugation at g for min. the cells were carefully resuspended in ml of selective hat medium [dmem supplemented with % fetal bovine serum (fbs), antibiotics, and hat] by swirling, and then incubated under % co for min. the cell suspension was transferred in μl aliquots to -well plates and incubated under % co in an incubator. about weeks after the fusion, culture supernatants were collected and screened by elisa. 
positive clones were transferred to -well plates, and frozen in liquid nitrogen. all positive clones were frozen first and cloned by limiting dilution after thawing. hybridoma cells ( ) were intraperitoneally injected into a balb/c mouse to collect ascites and purify the monoclonal antibody. after weeks, the drained ascites were centrifuged for min at , g to remove residual cells and insoluble aggregates and then applied to a protein g-agarose column (hitrap ml, ge healthcare life sciences). the column was washed with phosphate-buffered saline (pbs) until the absorbance of unbound proteins decreased to background, and then the antibody was eluted with . m glycine-hcl, ph . . the eluted antibody was neutralized by adding m tris and dialyzed against pbs overnight. a -well microtiter plate (costar, boston, ma, usa) was coated with μl ( μg/ml) of purified rbd fusion and gst proteins at °c overnight and then washed three times with deionized distilled water. the wells were blocked with μl blocking buffer (borate buffered saline containing % skim milk, mm edta, . % nan , and . % tween ) for min at room temperature. after three washes with blocking buffer, μl of antiserum or cell supernatant was added and incubated for h at room temperature. the wells were washed three times with water, and then blocked with blocking buffer for min at room temperature. a goat anti-mouse igg antibody coupled with alkaline phosphatase (sigma, μg/ml in blocking buffer) was incubated at room temperature for h to bind the primary antibody. the -well plates were washed with water and developed with μl of mg/ml ( mm) p-npp in . m na co and . mm mgcl . the reaction was stopped with μl of . m naoh, and the absorbance was measured at nm using a microtiter plate reader (bio-rad). for the cellular elisa, various tissues were separated from the balb/c mouse and treated with ammonium sulfate to remove the erythrocytes. 
the cells were washed and cultured in % fbs/dmem medium, then transferred to a -well plate at cells/well to immobilize the cells on the plate. the wells were washed with pbs, and μl of % paraformaldehyde was added (ethanol:methanol = : ). the plate was incubated for min at - °c and washed with water before reaction with the rbd or ace antibody (rabbit igg polyclonal antibody, cat # - - , genway biotech, ca, usa). paraformaldehyde fixation has been used to attach various primary tissue cells and established cell lines to plasticware or glass slides. we used paraformaldehyde fixation because we were concerned about detachment of primary tissue cells and suspension cells from the vessel during the various experimental procedures. for example, in our cellular elisa, intact cells were attached to plasticware in the first step, and then several reactions, such as those with rbd or antibody, followed in succession. the rbd fusion protein or tissue cell lysates were suspended in reducing sds-page sample buffer and heated, then resolved on % sds-page. the gels were electrophoretically transferred to pvdf membranes using a mini-protean ii transfer chamber (bio-rad). the membranes were blocked overnight at °c in pbs containing % skim milk, and then washed three times with pbs containing . % tween (pbs-t). subsequently, the membrane was washed two times with pbs-t and incubated with μg/ml rbd or ace antibody for h. after washing twice with pbs-t, the membrane was reacted with goat anti-mouse igg antibody coupled with alkaline phosphatase for h at room temperature. the membranes were washed five times with pbs-t and five times with distilled water, then developed using mg/ml ( mm) npp in . m na co , . mm mgcl .

immunofluorescence assay

nih t and hek cells were cultured and transferred to a four-chambered flask for fluorescence labeling and imaging. after washing with pbs, paraformaldehyde solution ( %) was added to the cells and incubated for min at °c to fix the cells to the wells. after three washes with pbs, . 
% triton x- was added to the plate for min at °c. the plate was washed three times with pbs, and then % bsa (in pbs) was added and incubated for h at room temperature. after washing with pbs, the rbd fusion protein was incubated for h at room temperature to bind to the ace molecules on the cell membranes. after washing three times with pbs, the monoclonal rbd antibody was reacted with the bound rbd-ace complex. finally, the secondary antibody (goat anti-mouse igg coupled with alexa -green) was applied for h at room temperature, followed by dapi for s. cell imaging was performed using an olympus fv confocal microscope equipped with a four-laser system (multi ar laser, hene g laser, hene r laser, and ld / laser diode) with transmitted light, differential interference contrast, and a complete integrated image analysis software system (olympus america inc., melville, ny, usa). the excitation and emission wavelengths for alexa fluor (green) and dapi (blue) were / and / nm, respectively. composite digital images were then converted to tiff format and imported into adobe photoshop (adobe photoshop cs , version . ; adobe systems inc., san jose, ca, usa), and color balance was adjusted for presentation.

expression and purification of the rbd-gst fusion protein and preparation of the monoclonal rbd antibody

pcr products of bp ( a.a.) long rbd sequences were ligated to the pgex t- expression vector and expressed in e. coli bl (fig. a). main bands of - kda (fig. a, lane ) and kda (fig. a, lane ) were detected in extracts of e. coli transformed with rbd-gst and gst dna, respectively. as shown in fig. b, the two main bands were purified for immunization into five mice each. after the final injection of purified antigen, the antibody titer was checked in tail blood. after - days, mouse spleen cells were released and fused with myeloma cells to prepare hybridoma cells. after limiting dilutions for weeks, we obtained seven rbd fusion and five gst positive colonies, respectively. 
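
Limiting dilution is usually reasoned about with a Poisson model: if cells are seeded at an average of λ cells per well, the chance that a well showing growth came from exactly one cell depends only on λ. The sketch below illustrates that reasoning and is not the authors' procedure. It assumes Poisson-distributed seeding and that every seeded cell grows, and the seeding densities in the usage lines are hypothetical.

```python
import math


def well_probabilities(mean_cells_per_well):
    """Poisson probabilities that a well receives 0, exactly 1, or 2+ cells."""
    lam = mean_cells_per_well
    p_empty = math.exp(-lam)
    p_single = lam * math.exp(-lam)
    p_multiple = 1.0 - p_empty - p_single
    return p_empty, p_single, p_multiple


def fraction_monoclonal(mean_cells_per_well):
    """Among wells that received at least one cell, the fraction seeded by exactly one.

    Assumes every seeded cell grows into a colony (an idealization).
    """
    p_empty, p_single, _ = well_probabilities(mean_cells_per_well)
    return p_single / (1.0 - p_empty)


if __name__ == "__main__":
    # Hypothetical seeding densities, for illustration only.
    for lam in (0.3, 1.0, 3.0):
        print(f"mean {lam} cells/well: monoclonal fraction = {fraction_monoclonal(lam):.3f}")
```

Lower seeding densities leave more wells empty but make each growing well more likely to be clonal, which is why positive wells are typically re-cloned over several rounds.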
among the seven positive colonies against the rbd-gst fusion protein, four colonies reacted only with the rbd but not the gst protein. the two colonies in the best condition were expanded for intraperitoneal injection to obtain a large amount of antibody. the antibodies were purified from ascites, and assayed by elisa and western blotting as shown fig. c , d. the results indicated that the selected monoclonal antibody recognized the rbd but not the gst region of the rbd-gst fusion protein (fig. d) . cells were separated from the heart, spleen, and liver to confirm binding to the cell surface. these cells were fixed to -well plates with paraformaldehyde, and treated using the same procedure as for the general elisa. as shown fig. a , three kinds of cells (heart, spleen, and liver) reacted to the rbd surface molecule. this binding phenomenon was confirmed by western blotting analysis with the same cell lysates (fig. b) . the rbd fusion protein band appeared at * kda but not the gst protein in all three panels for the positive and negative controls. these three blotted membranes were successively treated with purified rbd fusion protein and rbd monoclonal antibody. tissue cell lysates of heart or spleen revealed proteins at * kda. but, liver cells showed a smeared, faint band (fig. b) . this experiment indicated that our prepared rbd fusion protein bound to molecule(s) on the tissue cell surface. various cells were prepared from a balb/c mouse, and the total cell lysates were loaded onto a % gel for western blotting analysis (fig. ) . similar to fig. b , about a kda band was shown in various cell extracts (fig. a) . among the various cells, heart and lung extracts showed strong bands that were two or three fold more intense than some studies have reported that the sars virus is related to ace. thus, we tested ace and ace molecules as rbd receptors, and their antibodies were used as a blocking agent for rbd binding. as shown fig. 
b, the ace antibody was applied for min before the rbd reaction to block the rbd receptor, and the rbd antibody was added to detect residual rbd binding (fig. b). the kda band almost disappeared in the tissue cell lanes, but not from the rbd protein lane. we found that this kda band corresponded to the ace molecule that was detected at about the kda position with an ace antibody (fig. c). these results indicate that rbd bound to the ace molecule in various cells. to confirm whether the rbd would bind to established cell lines, hek and nih t cells were fixed in -well plates as shown in fig. a. the cells were successively treated with rbd and rbd antibody. as shown in fig. a, the rbd protein bound to both cell types to the same extent. the same amounts of rbd protein and ace antibody were simultaneously incubated with the cells for min and detected with the rbd antibody (fig. b; column ). in this competitive assay, the ace antibody suppressed - % of rbd binding in both cell lines within min. we showed that these cell lines express the ace molecule and that these molecules were the rbd receptor molecules, as in the various mouse tissues. however, in this reaction, inhibition decreased to - % after - h incubation (data not shown). it seemed that the ace antibody may have easily separated from the ace molecule or was degraded. next, we examined whether rbd binding was blocked by the ace antibody in a western blot using mouse tissue cell lysates with a pair of membranes. the ace molecule appeared as three or more bands in hek cells and as two bands in nih t cell lysates (fig. c, left panel). the other transferred membrane was pretreated with ace antibody before rbd and rbd antibody to block the rbd binding sites (fig. c, right panel). as expected, ace molecules in hek cells were blocked with the ace antibody, but those in nih t cells were not. we confirmed this by confocal immunofluorescence imaging using the two cell lines that were fixed and treated with rbd and the rbd antibody (fig. ). as a result, the rbd receptor (ace ) was observed in both hek and nih t cells using the rbd antibody. a cocaine monoclonal antibody was used as the negative control in this experiment.

fig. cellular enzyme-linked immunosorbent assay and western blot analysis of mouse tissues. a various mouse tissues were cultured and then fixed with paraformaldehyde. after reacting with receptor-binding domain (rbd)-specific monoclonal antibodies, alkaline phosphatase-conjugated goat anti-mouse igg and npp substrate were successively added to each well and measured in a dynex spectrophotometer at nm. b to confirm the rbd receptor, whole cell lysates from mouse organ tissue cells were loaded on to % sodium dodecyl sulfate polyacrylamide gel electrophoresis and immunoblotted using the rbd monoclonal antibody. lanes are indicated as follows: s, spleen; l, liver; h, heart; g, glutathione-s-transferase (gst); r, rbd-gst

in this study, we used elisa or immunofluorescence instead of immunoprecipitation or frozen section staining because we wanted to investigate the nih t or hek cell lines and primary tissue cells under the same conditions and environment. the s protein of sars-cov is able to induce protective antibodies from infected animals (bisht et al. ; buchholz et al. ). the rbd (residues - ) in the s region of the s protein induces highly potent neutralizing antibodies against the sars-cov (he et al. a, b). here, we expressed the rbd of the s protein and immunized a mouse to prepare monoclonal antibodies. the rbd protein was prepared in the gst-fused form, and the fusion protein was purified without separating the two kinds of proteins, because this fusion protein was efficiently expressed in this system and easily separated from a gel. another benefit is that the fused protein efficiently provided monoclonal antibodies due to its suitable molecular size (see fig. ). 
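
The fusion construct discussed here depends on directional cloning: the methods state that the PCR fragment was digested with EcoRI and XhoI and ligated into the same sites of the vector. As a quick consistency check, the sketch below looks for the two recognition sequences (GAATTC for EcoRI, CTCGAG for XhoI) in the primers as printed in the methods. The labels "forward" and "reverse" and the function name `find_sites` are hypothetical, and the stray space in the first printed primer is treated as whitespace to be removed.

```python
# Recognition sequences of the two restriction enzymes named in the methods.
SITES = {"EcoRI": "GAATTC", "XhoI": "CTCGAG"}

# Primer sequences as printed in the methods; the labels are illustrative only.
PRIMERS = {
    "forward": "gtccg cgaattcaacatcaccaacctgtg",
    "reverse": "cttcggctcgagcacggtggcgggcgcgt",
}


def find_sites(primer):
    """Return {enzyme: 0-based position} for each recognition site found in the primer."""
    seq = "".join(primer.split()).upper()  # drop whitespace, normalise case
    return {name: seq.find(site) for name, site in SITES.items() if site in seq}


if __name__ == "__main__":
    for label, primer in PRIMERS.items():
        print(label, find_sites(primer))
```

Each primer carries one of the two sites a few bases in from its 5' end, so the amplified fragment can only be inserted into the vector in one orientation.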
we also prepared a plural gst antibody for confirming the rbd specific binding experiments (data not shown). we have demonstrated with a cellular elisa assay that rbd bound to cell membranes and that the anti-rbd antibody worked well with its antigen. the first essential step of cov infection is interaction of the s protein via the rbd with a specific cellular receptor. there are many reports that rbd plays a role infecting host cells. the rbds on the s proteins of other covs such as mouse hepatitis virus, transmissible gastroenteritis virus, and human coronavirus also contain major antigenic determinants capable of binding to host cells (bonavia et al. ; godet et al. ) . some studies have reported that rbd-specific antibodies block receptor binding and virus entry (he et al. a) . therefore, the rbd of the s protein may serve as an important target site for developing sars vaccines and immunotherapeutics. in this study, we conducted two kinds of experiments to demonstrate the results of previous reports. the first was cellular binding of rbd using a cellular elisa as an intact cell instead of a traditional protein or antigen, and the second was western blot analysis using the same cell lysates as in the cellular elisa. in the cellular elisa data, we demonstrated that the binding was due to rbd not gst, because the rbd fusion protein included the gst protein. this was shown by using the anti-gst antibody (data not shown), and the rbd binding receptor clearly appeared as a kda protein in the spleen and heart but was smeared in the liver lysate (fig. b) . the smeared band in the liver may indicate degradation by proteolysis or something that occurred during extract preparation, because this band was shown in other preparations (fig. a) . there are many reports of sars s or rbd protein binding to ace on host cell membrane proteins using an antibody against sars-cov and by other methods. 
various neutralizing monoclonal antibodies against sars-cov recognize overlapping sets of residues relative to the ace molecule (zhu et al. ) . ace is predominantly expressed in the heart, kidneys, and testes and at lower levels in a wide variety of tissues, particularly the colon and lungs (lew et al. ) . it seems that this rbd region plays a critical role attaching the host ace molecule. this phenomenon was also supported by chimeric monoclonal antibodies that bind to the ace rbd of the sars s protein (berry et al. ) . experimental support for ace -rbd binding was provided from a study of sars antibody fragments competing with ace for binding to the rbd (prabakaran et al. ; sui et al. ; hwang et al. ) . the crystal structure of the rbd-ace complex has been identified, providing detailed information concerning rbd structure and function (li et al. a ). the structure revealed that the rbd can be further divided into two separate subdomains. one is the rbd core and the other is the rbd loop (rpm) (a.a. - ). the rbd loop is the region that directly contacts the ace molecule. in contrast, the rbd core contacts accessory proteins (li et al. a) . we showed competitive binding of the rbd with the anti-ace antibody (see figs. , , ) . the same weights of tissue were homogenized and lysed by reducing sds-page sample buffer for western blotting analysis. ace was detected by the anti-ace antibody in all of our organ tissue samples, which cross-reacts with the mouse/human ace molecule. ace contains seven potential n linked glycosylation sites and is therefore likely to be glycosylated. overexpressed ace migrates at kda compared with the deglycosylated polypeptide that migrates at kda (lew et al. ) . very low levels of ace were detected in plasma by western blot analysis and in multiples smaller than the full-length enzyme. this likely resulted from proteolytic cleavage (lew et al. ). 
an approximately -kda immunoreactive band was present in the whole-cell lysate, and a slightly smaller band was detected in the conditioned medium of ace -transfected cells, indicating that full-length ace is processed in cho cells to generate a secreted form (donoghue et al. ). we report, for the first time, that our prepared recombinant rbd bound to various mouse tissues and established cell lines.

(figure legend, continued) after reacting with the receptor binding domain (rbd) and rbd monoclonal antibody, alkaline phosphatase-conjugated goat anti-mouse igg was added to each well. the plates were developed with npp substrate solution, and then read at nm in a dynex spectrophotometer. b, c competition assay by cellular elisa (b) and western blot (c). prior to treatment with the rbd protein, the angiotensin-converting enzyme (ace ) polyclonal antibody was applied for min to block the rbd binding site. then, the rbd-specific monoclonal antibody and each secondary antibody were sequentially added to the cells

the ace molecule was detected at approximately kda and expressed strongly in the heart and lungs of mice. this is different from humans, in which expression is stronger in the kidneys than in the lungs. rbd binding to the ace molecule was inhibited by pre-treatment with the ace antibody in mouse tissues (see fig. ). we confirmed these results in nih t and hek cells by cellular elisa and a competitive assay (see fig. a, b). the anti-ace antibody suppressed rbd-ace binding by about - % within min, but the inhibition decreased to - % after a - h incubation (data not shown). it seemed that the ace antibody separated from the ace molecules after a long incubation or degraded during incubation. we confirmed the molecular weight of ace in hek and nih t cells for the competitive assay in gels. expressed ace was kda, but it differs in tissue and cell lysates (donoghue et al. ; lew et al. ). 
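
The suppression values quoted for the competitive cellular ELISA are percent-inhibition figures. The source does not state how they were computed, so the sketch below uses the conventional definition as an assumption: the background-corrected signal with the blocking antibody is compared with the background-corrected signal without it. The function name, parameter names, and absorbance readings are all hypothetical.

```python
def percent_inhibition(od_rbd_only, od_with_blocker, od_background=0.0):
    """Percent inhibition of binding from absorbance readings.

    od_rbd_only     -- absorbance with RBD alone (uninhibited binding)
    od_with_blocker -- absorbance with RBD plus the blocking antibody
    od_background   -- absorbance of a no-RBD control well
    """
    signal = od_rbd_only - od_background
    if signal <= 0:
        raise ValueError("uninhibited signal must exceed background")
    residual = od_with_blocker - od_background
    return 100.0 * (1.0 - residual / signal)


if __name__ == "__main__":
    # Hypothetical readings, for illustration only.
    print(percent_inhibition(1.0, 0.25))       # early time point
    print(percent_inhibition(1.1, 0.6, 0.1))   # later time point with background
```

Under this definition, a blocker that dissociates or degrades over a long incubation raises the residual signal and so lowers the computed inhibition, consistent with the trend described for the longer incubation.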
in our lysates, ace molecules occurred in three or more bands in hek cells and in two or more bands in nih t cells. this seemed to be a proteolytic cleavage pattern resulting from the different tissue lysate preparations (see fig. c , left panel). the ace molecules were completely blocked by the ace antibody before treatment with the rbd protein in hek cells, but not in nih t cells (see fig. c , right panel). we suggest that the rbd binding site on ace molecules in nih t cells may have been destroyed or changed due to cleavage of the ace molecules during sample preparation. this suggestion was supported by our confocal microscope images results in which nih t cells stained well with the rbd fusion protein as in the hek cells. here, we first confirmed that ace was expressed in various mouse cells such as heart, lungs, spleen, liver, intestine, and kidneys using a commercial ace polyclonal antibody. we also confirmed that the mouse fibroblast (nih t ) and human embryonic kidney cell lines (hek ) expressed ace . we finally demonstrated that recombinant rbd bound to ace on these cells. we also demonstrated that our preparing rbd elicited high antibody titers in its immunized mice. therefore, it appears to be an ideal immunizing antigen for generating monoclonal antibodies as well as a possible vaccine candidate. we have also found that the rbd binds to various tissues through ace . these results can be applied for future research to help treat ace related diseases and sars. fig. confocal microscope of established cell lines. nih t and hek cells were reacted with the receptor binding domain (rbd) protein, and then the rbd monoclonal antibody and secondary alexa antibody were sequentially added to the each cell line, respectively. an anti-cocaine monoclonal antibody was used for the negative control. green color indicates rbd binding to the cell surface. 
nucleus was specifically stained by dapi (blue color)
key: cord- -isbqs hg authors: zeng, xin; li, lingfang; lin, jing; li, xinlei; liu, bin; kong, yang; zeng, shunze; du, jianhua; xiao, huahong; zhang, tao; zhang, shelin; liu, jianghai title: isolation of a human monoclonal antibody specific for the receptor binding domain of sars-cov- using a competitive phage biopanning strategy date: - - journal: antib ther doi: . /abt/tbaa sha: doc_id: cord_uid: isbqs hg the infection of the novel coronavirus sars-cov- has caused more than , deaths, but no vaccine or therapeutic monoclonal antibody is currently available. sars-cov- relies on its spike protein, in particular the receptor binding domain (rbd), to bind human cell receptor angiotensin-converting enzyme (ace ) for viral entry, and thus targeting rbd holds the promise for preventing sars-cov- infection. in this work, a competitive biopanning strategy of a phage display antibody library was applied to screen blocking antibodies against rbd. high-affinity antibodies were enriched after the first round using a standard panning process in which rbd-his was immobilized as a bait. at the next two rounds, immobilized ace -fc and free rbd-his were mixed with the enriched phage antibodies. 
antibodies that bound rbd at epitopes outside the ace -binding site were captured, together with rbd, by the immobilized ace -fc, forming a “sandwich” complex. only antibodies that competed with ace could bind the free rbd-his in the supernatant and be subsequently separated by the ni-nta magnetic beads. the top lead from the competitive biopanning of our synthetic antibody library, lib ab , was produced in the full-length igg format. it was shown to competitively block the binding of rbd to ace and to potently inhibit sars-cov- pseudovirus infection with ic( ) values of nm. in contrast, the top lead from the standard biopanning could only bind rbd in vitro and had no blocking or neutralization activity. our strategy can efficiently isolate blocking antibodies against rbd, and it could speed up the discovery of neutralizing antibodies against sars-cov- . the recent outbreak of a novel coronavirus disease (covid- ) has escalated from a public health emergency of international concern to a global pandemic. its pathogen, sars-cov- , is a newly identified β-coronavirus. coronaviruses take their family name from the spike (s) protein on the viral particle. the highly glycosylated s protein stays compact in a trimeric state, recognizes its receptor on the host cell membrane, and undergoes a series of conformational changes, proteolysis events and membrane fusion to complete viral entry. for vaccines, clinical diagnosis, early prevention and medication, the s protein is the most significant target. the primary sequences of the s proteins of severe acute respiratory syndrome coronavirus (sars-cov) and sars-cov- share about % identity and % similarity, which indicates a high likelihood of structural homology and a similar infection pathway. sars-cov and sars-cov- recognize the same host cell receptor, ace , for mediating viral entry into host cells. the sars-cov s protein trimer was reported to bind ace at a : ratio [ , ] . 
before infection, the rbd of each sars-cov s monomer was partially buried in the inactive "down" conformation and unable to bind ace due to steric clash. once infection started, one rbd turned "up" to expose enough space for ace , inducing further conformational opening and loosening for proteolysis [ , ] . atomic-level structural analysis suggested that the spatial interaction and interface between sars-cov- rbd and ace were mostly in accordance with the sars-cov case [ ] . in addition, a recently published cryo-em structure of the sars-cov- s protein trimer showed that one of the three rbds was in the "up" conformation and naturally exposed the whole interaction interface [ ] , while the classic closed symmetric trimer still existed [ ] . that might explain why sars-cov- is much more contagious and problematic than sars-cov worldwide. no effective cure or vaccine is currently available for covid- . based on the structural information above, blocking sars-cov- rbd is a rational therapeutic approach. here we developed a competitive biopanning strategy to efficiently isolate blocking antibodies from phage display antibody libraries. several high-affinity antibodies targeting sars-cov- rbd and blocking its binding to ace were isolated, and the top lead exhibited neutralization activity against sars-cov- pseudotyped vsv infection. recombinant ace -his protein was purchased from novoprotein (shanghai, china). ace -hfc and sars-cov- rbd-his were purchased from sino biological (beijing, china). sars-cov- rbd-mfc was expressed using ablink biotech's hek f expression system. a synthetic human fab antibody library ab (libab ) was constructed according to a procedure previously described [ ] . human germline immunoglobulin variable segments vh - and vl - were employed as templates, and the complementarity-determining regions l (cdr-l ) and h (cdr-h ) were diversified by the designed mutagenic oligonucleotides. 
the oligonucleotides were synthesized using the trimer phosphoramidites mix z (glen research) containing codons for amino acids in the following molar ratios: % each y, s &g, % each t & a, and % each p, h, r, f, w, v & l. the number of positions denoted by z in cdr-l (qq (z)n plt) and -h (ar (z) n (a/g/d/y) fdy) was varied from to and to , respectively. the library size is estimated to be × . antibodies against rbd were screened at the first round using a standard biopanning protocol [ ] . briefly, rbd-his was coated on -well maxisorp plates at °c overnight. after the coating buffer was decanted, the plate was blocked with % polyvinyl alcohol (pva) at room temperature for hour. μl of phage libraries ( pfu/ml) was added per well for -hour binding. after washing eight times with pt buffer ( . % tween- in pbs), bound phages were eluted with mm hcl ( μl per well), followed by -min incubation. the eluent was transferred into a . ml microfuge tube and neutralized with m tris-hcl (ph . ). half the neutralized phage solution was mixed with ml of actively growing e. coli neb -alpha f' (od = . ) in × yt media containing μg/ml tetracycline and incubated at °c for hour. × pfu of m k helper phages were added next and incubated for another hour. the infected bacteria were amplified in ml × yt medium containing μg/ml carbenicillin and μg/ml kanamycin, shaking at rpm and growing overnight at °c. the next day, phages were harvested in precipitant with peg/nacl solution and resuspended in pbs buffer for the following rounds of panning. after the first round of the standard biopanning, a competitive biopanning protocol that included steps of competitive binding, magnetic separation, elution and amplification ( fig. ) , was applied to isolate the epitope-specific antibodies. briefly, μl of ace -hfc protein ( μg/ml) was coated on the -well maxisorp plates. 
the wells were washed and blocked with % pva, and then the mixture of antibody library ( × pfu per well) and free rbd-his protein ( ng per well) was added. after a -hour competitive binding, the supernatant was transferred into a . ml microfuge tube containing the pre-washed ni-nta magnetic beads (genscript) and incubated on a shaker at room temperature for hour. beads were collected using the magnetic separation rack and washed with the pt buffer for times. bound phages were eluted with mm hcl ( μl per tube) after -min incubation. beads were collected using the magnetic separation rack, and the supernatant was transferred into a tube for neutralization. half the neutralized phage solution was mixed with ml of actively growing neb alpha f' cells and amplified as in the standard biopanning protocol. μl of the bacterial culture before infection with helper phages was taken, diluted, and grown on lb plates containing μg/ml carbenicillin at °c overnight. single clones were picked the next day for the phage elisa assay. single clones were inoculated into μl × yt medium containing μg/ml carbenicillin, μg/ml kanamycin and pfu/ml helper phages in -deep-well plates and incubated overnight at °c and rpm. the plates were centrifuged at , rpm and the supernatant was used for phage elisa. the -well maxisorp plates were coated overnight at °c with rbd-mfc ( μg/ml, μl per well). after blocking with % pva, plates were incubated with μl bacterial supernatant containing phages for hours at room temperature. after six washes with pt, bound phages were detected using an hrp-conjugated anti-m antibody (sino biological) and tetramethyl benzidine (tmb) as substrate. absorption at nm was measured. vh and vl of the positive phage were subcloned into pfusess-chig-hg and pfusess-clig-hk (invivogen), respectively. antibodies were transiently expressed in freestyle™ hek -f cells (life technologies) using fectin transfection reagent according to the manufacturer's instructions. 
after transfection, cells were grown in the serum-free medium for an additional days. the supernatant was collected and purified on a mabselect protein a column (ge healthcare). eluted igg was dialyzed against pbs and stored at - °c. recombinant human ace -his ( μg/ml, μl per well) was coated on -well maxisorp plates, followed by addition of a pre-incubated mixture of the anti-rbd antibody titrated into a constant amount of rbd-mfc ( µg/ml). rbd binding to ace was detected using an hrp-conjugated anti-mouse fc antibody. the neutralization assays of antibodies against sars-cov- pseudovirus were performed by genscript inc. (nanjing, china) under a research service contract. briefly, , of the human ace -overexpressing hela monoclonal cells were seeded into each well of a -well plate. sars-cov- pseudovirus and antibodies were incubated at ambient temperature for hour. the mixture was transferred into wells and incubated with cells at °c, % co for hours. the culture medium was replaced with fresh medium, and cells were incubated for another hours. the culture medium was removed, and cells were rinsed with pbs. µl lysis buffer was added and further incubated at ambient temperature for minutes. µl supernatant was transferred to a sterile un-clear -well plate with the bio-glo luciferase substrate added, and the luminescence signal was measured with envision. the dose response curves were plotted as relative luminescence units against the antibody concentration. the assay results were processed with microsoft office excel and graphpad prism .
high-affinity antibodies were identified by the phage elisa
rbd had a high affinity to ace with an ec of around µg/ml (fig. s ). thus µg/ml of rbd was applied in our competitive biopanning strategy to ensure that the immobilized ace could completely capture the "sandwich binding" complex. during the standard biopanning, phages were always applied at a concentration of × pfu. 
however, we changed it to × pfu per well during the competitive biopanning to reduce the non-specific binding of phages to magnetic beads. after rounds of the competitive biopanning, clones were randomly selected. their binding to rbd was measured using phage elisa. positive binding was defined as an od reading two or more times higher than the negative control (pva alone). clones showed positive signals (fig. ) . after dna sequencing, these clones were grouped into groups of unique antibodies. rrbd- , the top lead with the highest od reading isolated from the competitive biopanning, and rrbd- , the top lead isolated from the standard biopanning at round , were expressed as full-length igg antibodies using the f expression system. their binding and blocking abilities against rbd were compared. both rrbd- and rrbd- had high affinities for rbd, with ec at . nm and . nm, respectively. only rrbd- blocked the binding of rbd to ace , with an ic at . nm, while rrbd- did not. as a positive control, the recombinant ace -hfc ( µg/ml) totally inhibited the infection of ace -overexpressing hela monoclonal cells with sars-cov- pseudovirus. the antibody rrbd- showed significant neutralization activity against the sars-cov- pseudovirus with ic values of . nm. however, the antibody rrbd- had no neutralization effect on the pseudovirus, and there were no significant differences between the group with the highest antibody concentration and the blank group without antibody. the rbds of sars-cov and sars-cov- share high sequence identity ( %) and structural homology, so well-established sars-cov antibodies were initially assumed to be shortcut therapeutic candidates for sars-cov- . however, the real situation is more complicated. several independent peer-reviewed studies, as well as preprints, have shown that all structurally characterized sars-cov-specific antibodies, including s , r, m and f g , do not cross-react with sars-cov- [ , , ] . 
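the hit-calling rule used for the phage elisa above (a clone is positive when its od reading is at least twice that of the pva-only negative control) can be sketched as follows. this is a minimal illustration, not the authors' analysis code: the function name, the clone labels and the od values are hypothetical, and "two or more times higher" is read here as od >= 2 × negative control.

```python
def call_positive_clones(od_by_clone, negative_control_od, fold=2.0):
    """Return the names of clones whose OD is at least `fold` times the negative control.

    od_by_clone: dict mapping a clone label to its phage ELISA OD reading.
    negative_control_od: OD reading of the PVA-only well.
    """
    if negative_control_od <= 0:
        raise ValueError("negative control OD must be positive")
    threshold = fold * negative_control_od
    return sorted(name for name, od in od_by_clone.items() if od >= threshold)


# Hypothetical readings for illustration only (not data from the study).
readings = {"clone_a": 1.20, "clone_b": 0.11, "clone_c": 0.25}
print(call_positive_clones(readings, negative_control_od=0.10))
```

keeping the fold threshold as a parameter makes it easy to check how sensitive the number of positive clones is to the cutoff.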
the lack of cross-reactivity of current mabs may result from several factors. these antibodies all compete with ace to bind sars-cov rbd, but their epitopes have only limited overlap. several key residue mutations from sars-cov s to sars-cov- s do not alter the binding of ace , but slight changes could be enough to break antibody recognition. cr is a special case with % conserved key residues in the epitope between sars-cov- and sars-cov. its cross-reactivity was remarkable, but the loss of a single n-glycan site results in a ~ magnitude reduction of binding affinity to sars-cov- rbd [ ] . in humans, rbd-specific monoclonal antibodies derived from covid- recovered individuals showed a similar lack of cross-reactivity with either sars-cov or mers-cov [ ] . the findings using polyclonal antibodies are ambiguous. sera from sars-cov s-immunized mice, but not from rabbits or sars recovered patients, showed modest neutralization activity against sars-cov- [ , ] , while sera from covid- recovered patients had no effect on sars pseudovirus [ ] . in general, structural and functional analysis suggests that targeting sars-cov- rbd could be a direct and promising therapeutic strategy, while focusing on previous sars-cov antibodies is neither ideal nor efficient. no sars-cov- rbd-specific monoclonal antibody has been reported from human antibody libraries (up to april th , ). in the meantime, sars-cov- is spreading unexpectedly fast around the world, and a recent study revised its basic reproductive number (r ) from . to . [ ] . a rapid and effective method of obtaining sars-cov- neutralizing antibodies is urgently needed. naïve antibody libraries derived from natural immune systems have limited capacity, while synthetic libraries with higher diversity offer more opportunities to isolate binders, especially for novel infectious antigens. 
compared to a naïve antibody library of ~ diversity, a synthetic library with additional artificial randomization on cdrs can reach diversity as high as ~ . once the recombinant rbd and ace proteins were ready, it took weeks to isolate, produce and verify the antibodies in this study. using the standard biopanning method, we enriched rbd-specific phages from our synthetic lib ab , but not from our naïve antibody libraries (data not shown). unfortunately, the top lead rrbd- from the standard biopanning of lib ab could not block the rbd-ace interaction (fig. ) , although it bound to rbd with an ec of . nm (fig. ) . the clinical potential and applications of an antibody often depend on the epitopes it binds on the target protein. a high-affinity antibody against the target protein can be screened from a phage display antibody library using the standard biopanning process, but its binding epitopes must then be identified through extra steps, such as epitope mapping and competitive elisa. we therefore developed a new competitive biopanning strategy to efficiently isolate epitope-specific antibodies from libraries. as expected, the top lead rrbd- bound rbd in competition with ace both in solution and on pseudovirus, and its binding affinity was high, at ~ nm depending on the measuring method. further experiments are planned to verify the neutralization effect with live viruses and in animal models, together with cross-reactivity assays against other disease-related coronaviruses. in conclusion, our strategic discovery of human monoclonal antibodies against sars-cov- rbd may fill a gap in antibody-related pharmaceutical development and shed light on new treatments needed to address global health concerns. fig. schematic presentation of a competitive biopanning strategy. a specific binder of the target protein was added during the binding step for the selection of blocking antibodies. 
in this work, the immobilized ace -hfc captured rbd-his and the antibodies binding rbd at different epitopes, forming a complex like a "sandwich". however, when an antibody recognized the same or similar epitopes within rbd as ace did, it could block the rbd-ace interaction. such antibodies would bind to the free rbd-his in the supernatant and be subsequently separated by the ni-nta magnetic beads.
two sars-cov- rbd-specific antibodies selected by different strategies showed different neutralization activities. the luminescence signal on the y-axis indicates the relative proportion of pseudovirus entry into target cells. the antibody rrbd- , which competed with ace , could neutralize sars-cov- pseudovirus, but rrbd- could not.
cryo-em structure of the sars coronavirus spike glycoprotein in complex with its host cell receptor ace
unexpected receptor functional mimicry elucidates activation of coronavirus fusion
cryo-electron microscopy structures of the sars-cov spike glycoprotein reveal a prerequisite conformational state for receptor binding
structural and functional basis of sars-cov- entry by using human ace
cryo-em structure of the sars-cov- spike in the prefusion conformation
structure, function, and antigenicity of the sars-cov- spike glycoprotein
a single-framework synthetic antibody library containing a combination of canonical and variable complementarity-determining regions
identifying specificity profiles for peptide recognition modules from phage-displayed peptide libraries
a highly conserved cryptic epitope in the receptor-binding domains of sars-cov- and sars-cov
potent human neutralizing antibodies elicited by sars-cov- infection
characterization of spike glycoprotein of sars-cov- on virus entry and its immune cross-reactivity with sars-cov
high contagiousness and rapid spread of severe acute respiratory syndrome coronavirus
we thank chengdu zicheng yibo biotechnology co., ltd for providing the laboratory consumables and bovine serum. 
this work was supported by sichuan science and technology program ( rz ), the program of sars-cov- protection (cyhx , kezhi people's air-defense equipment co., ltd) and the program of sars-cov- antibody discovery (jl c- , ablink biotech co., ltd).
key: cord- -z q wo v authors: sang, eric r.; tian, yun; gong, yuanying; miller, laura c.; sang, yongming title: integrate structural analysis, isoform diversity, and interferon-inductive propensity of ace to refine sars-cov susceptibility prediction in vertebrates date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: z q wo v the current new coronavirus disease (covid- ) has caused globally near . / million confirmed deaths/infected cases across more than countries. although the etiological coronavirus (a.k.a. sars-cov ) putatively has a bat origin, its intermediate reservoir between bats and humans, and especially its tropism in wild and domestic animals, remain mostly unknown. this raises major public health concerns for the current pandemic and potential zoonosis. previous reports using structural analysis of the viral spike protein (s) binding its cell receptor, angiotensin-converting enzyme (ace ), indicate a broad sars-cov susceptibility in wild and particularly domestic animals. through integration of key immunogenetic factors, including the existence of s-binding-void ace isoforms and the disparity of ace expression upon early innate immune response, we further refine the sars-cov susceptibility prediction to fit recent experimental validation. in addition to showing a broad susceptibility potential across mammalian species based on structural analysis, our results also reveal that domestic animals including dogs, pigs, cattle and goats may have evolved ace -related immunogenetic diversity to restrict sars-cov infections. thus, we propose that domestic animals are unlikely to play a role as amplifying hosts unless the virus undergoes further species-specific adaptation. 
these findings may relieve relevant public concerns regarding covid- -like risk in domestic animals, highlight virus-host coevolution, and suggest disease interventions that target ace molecular diversity and interferon optimization. erupting in china last december, the novel coronavirus disease (covid- ) has become a worldwide pandemic and caused near . million confirmed deaths and million infected cases across countries by the end of may [ , ] . the etiological virus, designated as severe acute respiratory syndrome coronavirus (sars-cov ), has been identified [ ] and related to the viruses that previously caused sars or middle east respiratory syndrome (mers) in humans in and , respectively [ ] . these three human-pathogenic coronaviruses putatively evolved from bat coronaviruses, but have different animal tropisms and intermediate reservoirs before transmission to humans [ , ] . whereas civet cats and camels were retrospectively identified as reservoirs for sars and mers, respectively, it remains unresolved which animal species passed sars-cov to humans [ , ] . investigations indicated that carnivora animals including raccoon dogs, red foxes, badgers and minks, as well as swine to a lesser extent, are susceptible to sars virus infections [ , ] . although viral nucleic acids and antibodies to mers were detectable in multiple ruminant species including sheep, goats, and donkeys, virus inoculation studies did not result in a productive infection for mers disease in these domestic ruminants, nor in horses [ , ] . as obligate pathogens, viruses need to engage cell receptors to enter cells and must outpace host immunity to replicate and spread effectively enough to initiate a productive infection [ ] . in this context, the spike proteins protruding on the coronavirus surface are responsible for cell receptor binding and mediating viral entry [ ] [ ] [ ] . for example, mers-cov adopts the dipeptidyl peptidase (dpp , a.k.a. 
cd ) and sars-cov uses angiotensin-converting enzyme (ace ) as primary receptors for cell attachment and entry [ ] [ ] [ ] [ ] [ ] [ ] . several groups have reported that sars-cov uses the same ace receptor as sars-cov, but exerts higher receptor affinity to human ace , which may ascribe to the efficacy of sars-cov infection in humans [ , ] . after cell attachment via the receptor binding domain (rbd) in the n-terminal s region of the s protein, the c-terminal s region thus engages in membrane fusion. further cleavage of s from s by a furinlike protease will release and prime the virus entering the recipient cells. several furin-like proteases, especially a broadly expressed trans-membrane serine protease (tmprss ), are adopted for priming sars-cov entry [ , ] . compared with sars-cov, studies showed that sars-cov spike protein also evolutionarily obtains an additional furin-like proteinase cleavage site within the s /s junction region for efficient release from the cell surface and entry into the cells [ , [ ] [ ] [ ] . because tmprss is widely expressed, the tissue-specific expression of ace has been shown to determine sars-cov cell tropism in humans [ , ] . namely, human nasal secretory cells, type ii pneumocytes, and absorptive enterocytes are ace -tmprss double positive and highly permissive to sars-cov infection [ , ] . for cross-species animal tropism, the potential infectivity of sars-cov in both wild and domestic animals raises a big public health concern after the prevalence of sars-cov infections in humans [ , ] . this concern involves two aspects: ( ) screening to identify the animal species that serve as a virus reservoir originally passing sars-cov to humans; and ( ) the existing risk of infected people passing the virus to animals, particularly domestic species, thus potentially amplifying the zoonotic cycle to worsen sars-cov evolution and prevalence [ , ] . 
by diagnosis of animals in close contact with covid- patients or screening of animal samples in some covid- epidemic zones, studies detected that domestic cats and dogs could be virally or serologically positive for sars-cov [ ] [ ] [ ] [ ] [ ] [ ] [ ] , as was a reported infection in a zoo tiger [ ] . using controlled experimental infection of human sars-cov isolates, several studies demonstrated that ferrets, hamsters, domestic cats and some non-human primate species are susceptible to human sars-cov strains [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] . obviously, it is impractical to test sars-cov susceptibility experimentally in all animal species. by adoption of a structural simulation based on published structures of the viral s-rbd/ace complex, studies have predicted a broad spectrum of vertebrate species with high potential for sars-cov susceptibility, which, if true, entails unexpected risks in both public and animal health, and warrants further critical evaluation [ ] [ ] [ ] . ace is a key enzyme catalyzing angiotensin (agt) further conversion into numeral active forms of agt - , which are hormonal mediators in the body's renin-angiotensin system (ras) [ , ] . thus, ace plays a regulatory role in the blood volume/pressure, body fluid balance, sodium and water retention, as well as immune effects on apoptosis, inflammation, and generation of reactive oxygen species (ros) [ , ] . in this line, the expression of ace is also inter-regulated by immune mediators pertinent to its systemic function. multiple physio-pathological factors, including pathogenic inflammation, influence on ras through action on ace expression [ ] [ ] [ ] . interferon (ifn) response, especially that mediated by type i and type iii ifns, comprises a frontline of antiviral immunity to restrict viral spreading from the initial infection sites, and therefore primarily determines if a viral exposure becomes controlled or a productive infection [ ] . 
several recent studies revealed that human ace gene behaves like an interferon-stimulated gene (isg) and is stimulated by a viral infection and ifn treatment; however, mouse ace gene is not [ , , ] . therefore, to determine the cell tropism and animal susceptibility to sars-cov , the cross-species ace genetic and especially epigenetic diversity in regulation of ace expression and functionality should be evaluated [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] . in this study, through integration of structural analysis and key immunogenetic factors that show species-dependent differences, we critically refine the sars-cov susceptibility prediction to fit recent experimental validation [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] . along with showing a broad susceptibility potential across mammalian species based on structural analysis [ ] [ ] [ ] , our results further reveal that domestic animals including dogs, pigs, cattle and goats may evolve previously unexamined immunogenetic diversity to restrict sars-cov infections. protein and promoter sequence extraction and alignment: the amino acid sequences of ace proteins and dna sequences of the proximal promoters of each ace genes were extracted from ncbi gene and relevant databases (https://www.ncbi.nlm.nih.gov/gene). ace genes and corresponding transcripts have been well annotated in most representative vertebrate species. in most cases, the annotations were double verified through the same gene entries at ensembl (https://www.ensembl.org). the protein sequences were collected from all non-redundant transcript variants and further verified for expression using relevant rna-seq data (ncbi geo profiles). the proximal promoter region spans ~ . kb before the predicted transcription (or translation) start site (tss) of ace or other genes. the protein and dna sequences were aligned using the multiple sequence alignment tools of clustalw or muscle through an embl-ebi port (https://www.ebi.ac.uk/). 
other sequence management was conducted using programs at the sequence manipulation suite (http://www.bioinformatics.org). sequence alignments were visualized using jalview (http://www.jalview.org) and megax (https://www.megasoftware.net). sequence similarity calculations and plotting were done using sdt . (http://web.cbio.uct.ac.za/~brejnev). unless otherwise indicated, all programs were run with default parameters.
phylogenic analysis: the phylogenic analysis and tree visualization were performed using megax and an online program, evoview. the evolutionary history was inferred using the neighbor-joining method. the percentage of replicate trees in which the associated taxa clustered together in the bootstrap test ( , replicates) was also computed. the evolutionary distances were computed using the p-distance method and are in units of the number of amino acid differences per site. unless otherwise indicated, all programs were run with the default parameters suggested by the programs.
structural simulation and analysis: the structure files of human ace protein and its interaction with sars-cov s-rbd were extracted from the protein data bank under the files of m and m j. the residue mutation and structure simulation were performed using ucsf chimera and pymol, available at https://www.cgl.ucsf.edu/chimera/ and https://pymol.org/, respectively. structural visualization was done using pymol. the binding affinity energy (Δg), dissociation constant (kd) and interfacial contacts between s-rbd and each ace were calculated using the prodigy algorithm at https://bianca.science.uu.nl/prodigy/.
profiling transcription factor binding sites in ace promoters and pwm scoring: the regulatory elements (and pertinent binding factors) in the ~ . kb proximal promoter regions were examined against both human/animal tfd database using a program nsite (version . , at http://www.softberry.com). 
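the relation between the binding free energy (Δg) and the dissociation constant (kd) reported for each s-rbd/ace complex is the standard thermodynamic one, Δg = r·t·ln(kd). a minimal sketch of the conversion is given below; it is not the prodigy implementation, the function names are hypothetical, and the temperature of 25 °c is an assumption because the text does not state the temperature used.

```python
import math

GAS_CONSTANT_KCAL = 1.987204e-3  # gas constant in kcal / (mol * K)


def kd_from_delta_g(delta_g_kcal_per_mol, temperature_celsius=25.0):
    """Convert a binding free energy (kcal/mol) into a dissociation constant (mol/L)."""
    temperature_kelvin = temperature_celsius + 273.15
    return math.exp(delta_g_kcal_per_mol / (GAS_CONSTANT_KCAL * temperature_kelvin))


def delta_g_from_kd(kd_molar, temperature_celsius=25.0):
    """Convert a dissociation constant (mol/L) into a binding free energy (kcal/mol)."""
    temperature_kelvin = temperature_celsius + 273.15
    return GAS_CONSTANT_KCAL * temperature_kelvin * math.log(kd_molar)


# Hypothetical value for illustration only: a complex with a free energy of -10 kcal/mol.
example_kd = kd_from_delta_g(-10.0)
print(f"kd = {example_kd:.2e} M")
```

a more negative Δg gives a smaller kd, so ranking ace orthologs by either quantity gives the same order at a fixed temperature.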
the mean position weight matrix (pwm) of key cis-elements in the proximal promoters was calculated using pwm tools through https://ccg.epfl.ch/cgi-bin/pwmtools, and the binding motif matrices of examined tfs were extracted from jaspar core vertebrates (http://jaspar.genereg.net/). for expression confirmation, several sets of rna-seq data from ncbi gene databases, and one of our own generated from porcine alveolar macrophages (bioproject with an accession number of srp ), were analyzed to verify the differential expression of ace genes in most annotated animal species, especially the expression of porcine ace isoforms and other relevant genes in the porcine lung macrophage datasets. significantly differentially expressed genes (degs) between two treatments were called using the edger package and visualized using heatmaps or bar charts as previously described [ ].

. . vertebrate ace orthologs share a functional constraint but experience intra-species diversification in livestock with unknown selective pressure

sequence comparison among ace orthologs across representative vertebrate species shows a pairwise identity range of - % ( fig. a and supplemental fig. s and excel sheet), which is - % higher than the average value of - % generated through a similarity analysis of gene orthologs at a genome-wide scale [ ] . this indicates that ace exerts a similar and basic function across species, consistent with its systemic and regulatory role as a key enzyme in ras, an essential regulatory axis underlying the circulatory and excretory systems in vertebrates [ ] [ ] [ ] . a comparison of evolutionary rates of major genes within ras including angiotensinogen (agt), ace, and several receptors of the processed angiotensin hormones showed that ace actually evolves slightly faster than ace [ , and unpublished data] . this implies that ace may be under pressure for adaptive evolution of ras according to species-dependent physiological and pathological requirements [ ] [ ] [ ] .
this evolutionary adaptability of ace genes is demonstrated by the existence of numerous genetic polymorphisms [ ] and several transcript isoforms, particularly in humans and major livestock species ( fig. b and supplemental fig. s and excel sheet). we identified (and verified by rna-seq annotation) four transcripts of ace isoforms in humans (fig. b ) that primarily differ in the c-terminal residues within the collectrin domain. notably, - short ace isoforms were identified in dogs, pigs, cattle, and goats in addition to the longer ace isoforms that correspond to the human one (designated as -s or -l, respectively, after the animal common names in fig. b and thereafter). these livestock ace -s isoforms have a - residue truncation at their n-terminal peptidase domains, which also span the region interacting with the sars-cov spike protein. the selective mechanisms driving the evolution of these short ace isoforms in livestock are unknown, but may relate to previous pathogenic exposure or unprecedented physiopathological pressure. in support of this reasoning, short ace isoforms are detected in both domestic bos taurus and hybrid cattle, but not in the wild buffalo and bison; and ace isoforms from each species are generally paralogous and sister to each other within a clade in the phylogenic tree (fig. b ). phylogenic analysis of vertebrate ace orthologs/paralogs reveals a general relationship aligning with animal cladistics (fig. b) . in this context, homologs from the fish, frog and chicken form a primitive clade. all ungulate homologs form parallel clades next to each other. the homologs from the glires, primates and carnivores cluster into a big clade (marked with a yellow triangle node), which contains all the sars-cov susceptible species that have been verified via natural exposure or experimental infections (fig. b , marked with red/orange circles).
we examined and merged several previous studies that predicted sars-cov susceptibility in vertebrates based on simulated structural analysis of the s-rbd-ace complex [ ] [ ] [ ] . while numerous vertebrate species were predicted to have high or low potential (fig. b , labeled as red h or green l) for sars-cov susceptibility, incongruence between the predicted susceptibility and validated infection is apparent in pangolin, ferret, tiger, cat and horseshoe bat, indicating that some other factors besides ace -rbd affinity should be considered [ , [ ] [ ] [ ] . we, therefore, refined the prediction matrix to include the rbd-binding evasion of some ace orthologs identified in major livestock species and the interferon-stimulated ace expression underlying sars-cov infections [ , [ ] [ ] [ ] . several recent studies have elegantly demonstrated the structural interaction of the viral s protein or its rbd in complex with the human ace receptor [ , ] , showing that the contacting residues at the rbd/ace interface ( fig. a) involve at least residues in ace (fig. b , listed in the table cells and referred to the aligned residue positions in human ace ) and residues in the sars-cov rbd (fig. b , blue circles with residue labels above the table) [ , , ] . the cross-species identity (%) of these interacting residues in ace is dispersed over a broader range ( - %) than the whole ace sequence identity rate of - % [ ] , indicating a faster evolution rate of this virus-interacting region. notably, the s-binding region spans a large part of the n-terminal peptidase domain, and s-binding may competitively block a majority of the active sites of the enzyme (fig. c ). using a similar structural analysis procedure [ , ] , we modeled the ace structures of animal species of interest and simulated their interaction with sars-cov s-rbd based on a published rbd-human ace structure (protein data bank file m j) [ ] . fig.
demonstrates the s-rbd interaction with the simulated structures of ace long isoforms from the dog, pig and cattle, respectively. the major changes at the rbd-ace interacting interfaces come from the residue exchanges in ace from other species compared with human ace (fig. b - d, highlighted in red). in addition, the exchange of n t (in pigs) and n y (in cattle and sheep) would destroy the n-glycosylation site in human ace . ace from goat (supplement fig. s ) exhibits the same amino acid exchanges as in cattle in the rbd-ace interfacial contacts. in contrast, when compared with human ace , ace from cats (supplement fig. s ) conserves all relevant glycosylation sites in human ace [ , ] . we also calculated the interfacial contacts using parameters of protein-protein interaction, including the predicted binding affinity energy (Δg), dissociation constant (kd) and number of different interfacial contacts within the s-rbd and ace contact. although the exact numbers may differ from previous reports [ ] , they provide a comparable matrix generated using the same algorithm ( fig. e ) [ ] . the data show that the ace of most domestic animals, including that from mouse and rat (species known not to be susceptible to human sars-cov ), has a binding affinity (Δg) of - . to - . kcal/mol. this is within the binding affinity range ( . - . kcal/mol) between the rbd and the ace from known susceptible species (fig. e , underlined in the left part of the table). this indicates that other factors, conceivably from genetic divergence and/or natural immunity, also contribute to sars-cov susceptibility in animal species. therefore, an effective prediction matrix should include the critical immunogenetic factors to further determine virus susceptibility in addition to the sequence/structural similarity of ace receptors ( fig. and fig. s ) [ , , ] .
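the binding free energy and the dissociation constant discussed above are linked by a standard thermodynamic relation (Δg equals the gas constant times the absolute temperature times the natural logarithm of kd), so either value can be recovered from the other. the python sketch below shows the conversion; the temperature of 25 °c and the example value of -10 kcal/mol are assumptions chosen for illustration, not values taken from this study, and the function names are hypothetical.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol*K)


def kd_from_dg(dg_kcal_per_mol, temp_c=25.0):
    """Dissociation constant (mol/L) from binding free energy: Kd = exp(dG / (R*T))."""
    temp_k = temp_c + 273.15
    return math.exp(dg_kcal_per_mol / (R_KCAL * temp_k))


def dg_from_kd(kd_molar, temp_c=25.0):
    """Binding free energy (kcal/mol) from a dissociation constant: dG = R*T*ln(Kd)."""
    if kd_molar <= 0:
        raise ValueError("Kd must be positive")
    temp_k = temp_c + 273.15
    return R_KCAL * temp_k * math.log(kd_molar)


# Hypothetical example value, not a result from the study.
example_dg = -10.0
example_kd = kd_from_dg(example_dg)
print(f"dG = {example_dg} kcal/mol -> Kd = {example_kd:.2e} M")
```

because the relation is logarithmic, a difference of roughly 1.4 kcal/mol at 25 °c corresponds to about a tenfold change in kd, which is why a narrow Δg range can still span noticeably different affinities.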
we detected several short ace isoforms in domestic animals including dog, pig, goat and cattle that have an n-terminal truncation spanning - key residues in the contacting network to s-rbd but retain the enzyme active sites (fig. a ). most of the splicing isoforms of ace genes, such as in zebrafish, cats and humans, share a common proximal promoter and encode ace proteins containing all key rbd-interacting residues [ , ] . however, these short ace -s isoforms in domestic animals are truncated by (cattle/goat ace -s) or (dog/pig ace -s) residues at their n-termini compared with the long ace isoforms in the same species ( fig. and fig. s ). therefore, these short ace isoforms lack - key residues in the contacting network to s-rbd but likely retain ace enzymatic function in ras. paired structural comparison between the human ace structure (extracted from m ) and each simulated ace -s structure from the pig, dog, and cattle/goat reveals that all these ace -s orthologs from domestic animals, particularly the porcine one, show high structural similarity to human ace except for the n-terminal truncations ( fig. b- d ). this indicates that these short ace isoforms in domestic animals have little chance of being engaged by the viral s protein, and predicts an unexpected evolutionary advantage that may allay potential covid- risk resulting from viral engagement and functional distortion of the classical long ace isoforms in these animal species [ , ] . sars-cov infection induces a weak ifn response but the production of a high amount of inflammatory cytokines including interleukin (il)- and chemokine cxcl in most severe covid- patients [ ] [ ] [ ] [ ] . studies of sars and mers showed that these pathogenic coronaviruses share similar viral antagonisms, including the endoribonuclease (endou) encoded by nonstructural protein (nsp ), which directly blunts cell receptors responding to viral dsrna and in turn weakens the acute antiviral response [ ] .
several recent studies revealed that sars-cov seems more cunning, not only evading or antagonizing but also exploiting the ifn response for efficient cell attachment [ , , , ] . because ace is a key enzyme in ras, the expression of the ace gene has been primarily investigated for its physiological response to circulatory regulation, and a response to pathological inflammation is also expected [ ] [ ] [ ] . however, the expression of the ace gene was found to be highly responsive to both viral infection and the host ifn response, i.e. the human ace gene appears to be a previously unstudied ifn-stimulated gene (isg) [ , ] . surprisingly, the isg propensity of ace genes is species-dependent; for example, the mouse ace gene is less ifn responsive, which may partly explain the mouse insusceptibility to sars-cov infection [ ] . to categorize the different ifn-inductive propensity of ace genes in vertebrates, particularly in major livestock species, we profiled the regulatory cis-elements and relevant transcription factors in the proximal promoter regions of each ace gene ( . kb before tss or atg). figure illustrates major regulatory cis-elements located in ace genes from major livestock animals and several reference animal species. the data show that animal ace gene promoters differ evolutionarily in containing ifn- or virus-stimulated response elements (isre, prdi, ifrs, and/or stat / factors) and cis-elements responsive to pro-inflammatory mediators. all these cis-elements recruit corresponding transcription factors (tf) to mediate differential ace responses to antiviral ifns and inflammation that is associated with covid- disease [ , , ] . we discovered that ace genes acquire species-different isg propensity in response to ifn and inflammatory stimuli. in most (if not all) of the sars-cov susceptible species, the ace genes acquired an ifn response intermediate between those of the typical robust and tunable ifn-stimulated genes (isg) [ ] .
in general, the robust isgs (isg is an example here) are stimulated in the acute phase of viral infection and play a more antiviral role; in contrast, the later-responding tunable isgs (irf is an example) contribute more to the anti-proliferative activity of ifn [ ] . in addition, unlike the promoters of the short ace isoforms in cattle and goats, which share most promoter regions with their paralogous long isoforms, the short ace isoforms of dogs (dog-s) and pigs (pig-s) have proximal promoter regions (and ifn responsivity) distinct from those of the paralogous long ace isoforms ( fig. and fig ) . the results indicate that the short ace isoforms in pigs and dogs diverge from their long paralogs at the levels of both genetic coding and epigenetic regulation to adapt to some evolutionary pressure, such as that from pathogenic interaction (fig. ) [ , ]. the position weight matrix (pwm) serves as a position-specific scoring model for the binding specificity of a transcription factor (tf) on dna sequences [ ]. using the pwm toolsets online (https://ccg.epfl.ch/cgi-bin/pwmtools), we evaluated the mean pwm of key cis-elements in the proximal promoters of ace genes that contain binding sites for canonical ifn-dependent transcription factors, which include isre/stat, irf , irf / and irf , as well as c/ebp, representing a core transcription factor for pro-inflammation. the binding sites of these ifn-dependent transcription factors, particularly irf / and isre/stat for ifn stimulation, are differentially enriched in the promoter regions of ace genes in a species-dependent way. higher enrichment of isre/stat / and/or irf / binding sites is detected in most sars-cov /covid susceptible species (indicated with solid orange or red circles, respectively).
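to make the idea of pwm scoring concrete, the following python sketch converts a matrix of base counts into log-odds scores and slides it along a promoter sequence. it is a generic illustration rather than the algorithm used by pwmtools or the matrices distributed by jaspar: the counts matrix, the promoter string, the pseudocount and the uniform background are all hypothetical, and only the forward strand is scanned.

```python
import math

BASES = "ACGT"


def log_odds_pwm(counts, pseudocount=0.5, background=0.25):
    """Turn per-position base counts into log2 odds against a uniform background."""
    pwm = []
    for column in counts:
        total = sum(column[b] for b in BASES) + 4 * pseudocount
        pwm.append({
            b: math.log2(((column[b] + pseudocount) / total) / background)
            for b in BASES
        })
    return pwm


def scan(sequence, pwm):
    """Score every forward-strand window; return a list of (start, score)."""
    sequence = sequence.upper()
    width = len(pwm)
    hits = []
    for start in range(len(sequence) - width + 1):
        window = sequence[start:start + width]
        if any(base not in BASES for base in window):
            continue  # skip windows containing ambiguous bases such as N
        score = sum(pwm[i][base] for i, base in enumerate(window))
        hits.append((start, score))
    return hits


def best_hit(sequence, pwm):
    """Highest-scoring window as (start, score)."""
    return max(scan(sequence, pwm), key=lambda hit: hit[1])


# Hypothetical 4-bp motif counts (not a real JASPAR matrix).
toy_counts = [
    {"A": 0, "C": 0, "G": 0, "T": 10},
    {"A": 0, "C": 0, "G": 0, "T": 10},
    {"A": 1, "C": 0, "G": 0, "T": 9},
    {"A": 0, "C": 10, "G": 0, "T": 0},
]
toy_pwm = log_odds_pwm(toy_counts)
toy_promoter = "GGATTTCAGG"  # hypothetical promoter fragment

start, score = best_hit(toy_promoter, toy_pwm)
print(f"best match at offset {start} with log-odds score {score:.2f}")
```

comparing the best or mean window scores for the same motif across promoters from different species is one simple way to express the kind of differential enrichment described here.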
in contrast, the pwm scores for irf and c/ebp, which regulate inflammation, are less differentiated among ace promoters from different animal species, indicating that ace genes are more universally regulated by inflammation than by viral infection or ifn induction, which act in a species-dependent way (fig. ). compared with the promoters of the typical human robust isg and tunable irf genes, these data indicate that ace genes (particularly the primate ones) are not typical robust or tunable isgs as represented by isg or irf , but respond differently to viral infection (through irf / ) or ifn auto-induction (via isre/stat) in a species-dependent manner ( fig. ) [ ]. higher enrichment of isre/stat / and/or irf / corresponds to sars-cov susceptibility in experimentally validated mammalian species, especially primates, but not in phylogenically distant species such as zebrafish, which has very low potential for sars-cov susceptibility due to the high disparity of ace structures ( fig. and fig. s ). in addition, the proximal promoters of the pig and dog ace -s genes differ considerably in their ifn-responsive elements from most ace promoters in mammals ( fig. and fig. ). however, they are phylogenically sister to the ace promoters from the primitive vertebrates (frog, chicken and zebrafish) (fig. , phylogenic tree) . this indicates that the expression of these short ace isoforms is more conserved than that of the long ace isoforms, which represent a more recent evolution that acquired epigenetic regulation of ace by ifn signaling (fig. ) [ ].

studies show that affinity adaptation between the viral s-rbd and the ace receptor determines the cellular permissiveness to the virus [ , , ] . sars-cov not only adapts a high binding affinity to human ace for cell attachment, but also antagonizes the host antiviral interferon (ifn) response and utilizes the ifn-stimulated property of the human ace gene to boost spreading [ , , , ] .
in addition to structural analysis of the simulated s-rbd-ace interaction, we propose that several immunogenetic factors, including the evolution of s-binding-void ace isoforms in some domestic animals, the species-specific ifn system, and epigenetic regulation of the ifn-stimulated property of host ace genes, contribute to viral susceptibility and the development of covid- -like symptoms in certain animal species [ , , , ] . a computational program in development that incorporates this multifactorial prediction matrix, together with in vitro validation of sars-cov susceptibility in major vertebrate species, will be necessary to address public concerns relevant to sars-cov infections in animals (fig. ) . it will also lead to the development of better animal models for anti-covid investigations [ ] . in addition, several ifn-based therapies for treatment of covid have been proposed and are in the process of clinical trials [ - ]. considering the viral exploitation of the ifn-stimulated property of human ace , a timely and subtype-optimized ifn treatment should be delivered rather than a general injection of typical human ifn-α/β subtypes [ - ]. in this line, domestic livestock like pigs and cattle have highly evolved ifn systems containing numerous unconventional ifn subtypes. some of these unconventional ifn subtypes could be utilized for developing effective antiviral therapies; for example, some porcine ifn-ω subtypes exert much higher antiviral activity than ifn-α even in human cells, and most ifn-λ subtypes retain antiviral activity with less pro-inflammatory activity [ , ] . in summary, a prediction matrix, which integrates the structural analysis of the s-rbd-ace interface and the species-specific immunogenetic diversity of ace genes, was used to predict sars-cov susceptibility and fit current knowledge about the infectious potential already validated in different animal species (fig. ) . more extensive validation experiments are needed to further improve this prediction matrix.
our current results demonstrate several previously unstudied immunogenetic properties of animal ace genes and imply that some domestic animals, including dogs, pigs and cattle/goats, may have acquired immunogenetic diversity to confront sars-cov infection and may face less covid- risk than previously thought. however, immediate biosecurity practices should be applied in animal management to reduce animal exposure to the virus and prevent potential species-specific adaptation (fig. ) . for livestock breeding programs targeting disease resistance to respiratory viruses, consideration of the genetic and epigenetic diversity of ace genes as well as antiviral isgs is highly recommended [ , , , ]. in conclusion, sars-cov has evolved to fit well with the human (and non-human primate) ace receptor through structural interfacial affinity, immunogenetic diversity and epigenetic expression regulation, which results in highly efficient infection [ ] [ ] [ ] , , , ] . most mammals, especially those that belong to glires, primates and carnivores, have a higher potential for sars-cov susceptibility, but in a species-different manner based on the s-binding-void ace isoforms and the difference in the ifn-inductive propensity of the major ace genes. most ungulate animals appear to have a low susceptibility potential, whereas horses and sheep have a high potential (fig. ) .
the phylogenic tree of identified ace orthologs/variants from different species was built with a neighbor-joining approach and visualized using the evoview program under default parameter settings. the prediction of sars-cov susceptibility is based on the sequence similarity of each ace to human ace in the s-rbd binding region, is simulated using a published human ace -rbd structure ( m j), and refers to two recent publications using similar procedures but different structural models [ , ] . compared with the currently available experimental data, incongruence of the predicted sars-cov susceptibility is clearly demonstrated in pangolin, ferret, tiger, cat and horseshoe bat, indicating that some other factors besides ace -rbd affinity should be considered. we emphasize the need to integrate other factors, including the rbd-binding evasion of some short ace orthologs identified in some major livestock species and the recently identified ace -interferon association [ ] , to refine the sars-cov susceptibility prediction.

predicted permissiveness by sequence similarity and ace -rbd binding energy

figure : prediction of sars-cov susceptibility in major livestock species based on the conservation of key interacting residues and binding capacity between the viral spike (s) protein and the host ace receptor.
(a) sars-cov- uses the cell receptor angiotensin-converting enzyme (ace ) for entry and the serine protease tmprss and furin for s protein priming. (b) as tmprss is broadly expressed and active with a furin-like cleavage activity, the affinity adaptation between the s receptor binding domain (rbd) and the ace receptor determines the viral permissiveness. the contacting residues of human ace (distance cutoff . Å) at the sars-cov- rbd/ace interfaces are shown, and the contacting network involves at least residues in ace (listed in the table cells and referring to the aligned residue positions in human ace ) and residues in the sars-cov- rbd (blue circles with residue labels), which are listed and connected with black lines (indicating hydrogen bonds) and red lines (representing salt-bridge interactions). the cross-species identity (%) of these interacting residues in ace is listed over a broad range ( - %) [ ] [ ] [ ] . (c) we also detected several short ace isoforms (underlined) in domestic animals including dog, pig, goat and cattle, which have an n-terminal truncation spanning - key residues in the contacting network to s-rbd but keep the enzyme active sites (indicated by yellow triangles), thus resulting in little engagement by the viral s protein and predicting an unexpected evolutionary advantage for relieving potential covid- risk caused by viral engagement and functional distortion of the classical long ace isoforms in these animal species. the ncbi accession numbers of the ace orthologs are listed as in fig. s-rbd with ace orthologs of major livestock species simulated using the human ace /cov -rbd structure ( m j). most residues involved in binding are highlighted as magenta (ace ) or orange (s) sticks and labeled with one-letter amino-acid codes plus residue numbers in bold or regular font for s or ace residues, respectively.
the dotted/blue lines indicate intermolecular salt bridges or hydrogen bonds between interacting residues (generated and visualized with ucsf chimera and pymol from protein data bank file m j). (b) to (d) rbd interaction with the simulated structures of ace long isoforms from the dog, pig and cattle, respectively. amino acid exchanges in ace from another species compared with human ace are highlighted in red. (e) prediction of binding affinity energy (Δg), dissociation constant (kd) and interfacial contacts of the sars-cov s-rbd with ace orthologs of major livestock species. the ace of most domestic animals, including that from mouse and rat (species known not to be susceptible to sars-cov ), has a binding affinity (Δg) of - . to - . kcal/mol, which is within the range ( . - . kcal/mol) between the rbd and the ace from the known susceptible species (underlined in the left part of the table), indicating that some other factors, especially those from genetic divergence and natural immunity, contribute to the sars-cov susceptibility of different animal species.

in contrast to most splicing isoforms such as in cats and humans, which share a common proximal promoter and encode ace proteins with similar sequences containing all key rbd-interacting residues, these short ace -s isoforms in domestic animals are truncated by (cattle/goat ace -s) or (dog/pig ace -s) residues at their n-termini compared with human ace or the long ace isoforms in these species, thus losing - key residues in the contacting network to s-rbd but retaining all enzyme active sites (yellow triangles in the blue ace domain bar). this results in little chance of being engaged by the viral s protein and predicts an unexpected evolutionary advantage for relieving potential covid- risk caused by viral engagement and functional distortion of the classical long ace isoforms in these animal species.
(b), (c) and (d) paired structural comparison between the human ace structure ( m ) and each simulated ace -s structure from pig (b), dog (c) and cattle/goat (d). the human ace structure is in green, and each compared animal ace -s structure is in magenta. the n-terminal residues of both compared structures are in cyan (arrows indicating n-termini of the ace -s isoforms) and shared c-termini are in red.

putative proximal promoter region (ace -p)

figure . categorizing ace genes based on regulatory cis-elements predicted in their proximal promoter regions (< kb before tss or atg). the regulatory elements (and pertinent binding factors) in the ~ kb proximal promoter regions were examined against both human and animal tfd databases using the program nsite (version . , at http://www.softberry.com), including ace genes identified in major livestock animals and several reference animal species. the data show that animal ace gene promoters differ evolutionarily in containing ifn- or virus-stimulated response elements (isre, prdi, ifrs, and/or stat / factors) and cis-elements responsive to proinflammatory mediators, which mediate different ace responses to antiviral interferons (ifns) and inflammation associated with covid- disease. legend: ○, gata- regulating constitutive expression; acute (◊) or secondary (◊) ifn-stimulated response element (isre) and prdi that interact with irf, isgf and stat factors, respectively; □, cis-elements interacting with factors to mediate immune/ inflammatory responses including c/ebp, nf-kb, nf-il , and p ; •, cis-elements reacting with other factors significant in other developmental/physiological responses. the promoter features of two typical human interferon-stimulated genes (isg), the robust isg and tunable irf , are shown as references to indicate that ace genes acquire species-different isg propensity in response to ifn and inflammatory stimuli.
covid- dashboard by the center for systems science and engineering (csse) at johns hopkins university (jhu)
a familial cluster of pneumonia associated with the novel coronavirus indicating person-to-person transmission: a study of a family cluster
a new coronavirus associated with human respiratory disease in china
the emergence of sars, mers and novel sars- coronaviruses in the st century
a genomic perspective on the origin and emergence of sars-cov-
origin and evolution of pathogenic coronaviruses
animal origins of the severe acute respiratory syndrome coronavirus: insight from ace -s-protein interactions
middle east respiratory syndrome coronavirus (mers-cov): animal to human interaction
mers-cov: the intermediate host identified?
how viral and intracellular bacterial pathogens reprogram the metabolism of host cells to allow their intracellular replication
sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor
a highly conserved cryptic epitope in the receptor binding domains of sars-cov- and sars-cov
the proximal origin of sars-cov-
sars-cov- reverse genetics reveals a variable infection gradient in the respiratory tract
sars-cov- receptor ace is an interferon-stimulated gene in human airway epithelial cells and is detected in specific cell subsets across tissues
susceptibility of ferrets, cats, dogs, and other domesticated animals to sars-coronavirus
covid- : animals, veterinary and zoonotic links
transmission of sars-cov- in domestic cats
pathogenesis and transmission of sars-cov- in golden hamsters
serological survey of sars-cov- for experimental, domestic, companion and wild animals excludes intermediate hosts of different species of animals
animal models of mechanisms of sars-cov- infection and covid- pathology
first detection and genome sequencing of sars-cov- in an infected cat in france
simulation of the clinical and pathological manifestations of coronavirus disease (covid- ) in golden syrian hamster model: implications for disease pathogenesis and transmissibility
infection and rapid transmission of sars-cov- in ferrets
complete genome sequence of sars-cov- in a tiger from a u
spike protein recognition of mammalian ace predicts the host range and an optimized ace for sars-cov- infection
predicting the angiotensin converting enzyme (ace ) utilizing capability as the receptor of sars-cov-
covid- : epidemiology, evolution, and cross-disciplinary perspectives
the pivotal link between ace deficiency and sars-cov- infection
physiological and pathological regulation of ace , the sars-cov- receptor
renin-angiotensin system at the heart of covid- pandemic
type i and type iii interferons -induction, signaling, evasion, and application to combat covid-
increasing host cellular receptor-angiotensin-converting enzyme (ace ) expression by coronavirus may facilitate -ncov (or sars-cov- ) infection
sars-cov- entry factors are highly expressed in nasal epithelial cells together with innate immune genes
evolutionary constraints on structural similarity in orthologs and paralogs
a genomic survey of angiotensin-converting enzymes provides novel insights into their molecular evolution in vertebrates
ace receptor polymorphism: susceptibility to sars-cov- , hypertension, multi-organ failure, and covid- disease outcome
structural basis for the recognition of sars-cov- by full-length human ace
structure of the sars-cov- spike receptor-binding domain bound to the ace receptor
prodigy: a web server for predicting the binding affinity of protein-protein complexes
imbalanced host response to sars-cov- drives development of covid-
weak induction of interferon expression by sars-cov- supports clinical trials of interferon lambda to treat early covid-

key: cord- -a cqw kg authors: shi, yuejun; shi, jiale; sun, limeng; tan, yubei; wang, gang; guo, fenglin; hu, guangli; fu, yanan; fu, zhen f.; xiao, shaobo; peng, guiqing title: insight into vaccine development for alpha-coronaviruses based on
structural and immunological analyses of spike proteins date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: a cqw kg coronaviruses that infect humans belong to the alpha-coronavirus (including hcov- e) and beta-coronavirus (including sars-cov and sars-cov- ) genera. in particular, sars-cov- is currently a major threat to public health worldwide. however, no commercial vaccines against the coronaviruses that can infect humans are available. the spike (s) homotrimers bind to their receptors through the receptor-binding domain (rbd), which is believed to be a major target to block viral entry. in this study, we selected alpha-coronavirus (hcov- e) and beta-coronavirus (sars-cov and sars-cov- ) as models. their rbds were observed to adopt two different conformational states (lying or standing). then, structural and immunological analyses were used to explore differences in the immune response with rbds among these coronaviruses. our results showed that more rbd-specific antibodies were induced by the s trimer with the rbd in the “standing” state (sars-cov and sars-cov- ) than the s trimer with the rbd in the “lying” state (hcov- e), and the affinity between the rbd-specific antibodies and s trimer was also higher in the sars-cov and sars-cov- . in addition, we found that the ability of the hcov- e rbd to induce neutralizing antibodies was much lower and the intact and stable s subunit was essential for producing efficient neutralizing antibodies against hcov- e. importantly, our results reveal different vaccine strategies for coronaviruses, and s-trimer is better than rbd as a target for vaccine development in alpha-coronavirus. our findings will provide important implications for future development of coronavirus vaccines. importance outbreak of coronaviruses, especially sars-cov- , poses a serious threat to global public health. development of vaccines to prevent the coronaviruses that can infect humans has always been a top priority. 
coronavirus spike (s) protein is considered as a major target for vaccine development. currently, structural studies have shown that alpha-coronavirus (hcov- e) and beta-coronavirus (sars-cov and sars-cov- ) rbds are in lying and standing state, respectively. here, we tested the ability of s-trimer and rbd to induce neutralizing antibodies among these coronaviruses. our results showed that beta-covs rbds are in a standing state, and their s proteins can induce more neutralizing antibodies targeting rbd. however, hcov- e rbd is in a lying state, and its s protein induces a low level of neutralizing antibody targeting rbd. our results indicate that alpha-coronavirus is more conducive to escape host immune recognition, and also provide novel ideas for the development of vaccines targeting s protein. hcov-nl ) and beta-covs (hcov-oc and hcov-hku ) are well adapted to humans and widely circulate in the human population, with most infections causing mild disease in immunocompetent adults ( , , ). in addition, sars-cov, sars-cov- and mers-cov belong to beta-cov and are highly pathogenic ( - ). as the primary glycoprotein on the surface of the viral envelope, the spike (s) glycoprotein is the major target of neutralizing antibodies (nabs) elicited by natural infection and key antigens in experimental vaccine candidates. the s protein contains two subunits responsible for receptor binding (s subunit) and membrane fusion (s subunit) ( ). in particular, the s subunit of the prefusion s protein is structurally ( , ) . the s subunits of beta-and gamma-cov strains utilize the cross-subunit packing mode, reducing the conformational conflict of the rbd in a standing state ( , , , ). in contrast, alpha-and delta-cov strains both utilize an intrasubunit packing mode, and the s -ctd is limited by the conformational conflict with surrounding domains ( , , - , , ) . hence, the s -rbd in the s trimer was captured in two different states among different coronaviruses. 
in the beta-covs (sars-cov, sars-cov- and mers-cov), the s -rbd adopts a "standing" state, which is believed to be a prerequisite for receptor binding and rbm-specific antibody binding ( , , ) . nevertheless, the s -rbds of alpha-covs all adopt "lying" state, which is considered more conducive to evading antibody recognition ( , , , mers-cov. among them, the s protein or rbd was the major targets ( ) ( ) ( ) . compared with beta-covs, relatively few studies have investigated two alpha-hcovs: hcov- e and hcov-nl . however, their s subunit structure and receptor recognition pattern, especially the structure of the rbd and its state in the s trimer, differ substantially from those of beta-covs, suggesting different s protein immune responses between alpha-and beta-covs. importantly, considering the low homology between different coronavirus genera, related research on alpha-covs can not only help to elucidate the differences between s proteins that adopt different rbd states but can also facilitate the development of coronavirus vaccines. in this study, we selected sars-cov, sars-cov- , and hcov- e as models, which adopt the two rbd states, and evaluated and compared immune responses to the s trimers and rbds of these coronaviruses through immunological and bioinformatics approaches. we also investigated the mechanism through which the hcov- e s trimer produced effective nabs. finally, we provide possible vaccine strategies for alpha- to address this issue, we performed b-cell epitope predictions for the s trimers and rbds of alpha-cov (hcov- e) and beta-covs (sars-cov and sars-cov- ). the predicted positive residues (the corresponding spatial epitope and linear epitope) are displayed on the structural surface ( fig. a, c and e) , and the distribution of positive residues on the rbd is summarized in table . a total of and amino acid residues located on the rbd were predicted to be conformational epitopes for sars-cov and sars-cov- , respectively. 
of these, and residues were located in the sars-cov rbm subdomain and in the sars-cov- rbm subdomain, respectively. the linear b-cell epitope prediction results were similar in sars-cov and sars-cov- . however, in hcov- e, only residues located in the rbm subdomain were predicted to be conformational epitopes, and residues were predicted to be linear epitopes. the same results also appeared in the hcov- e s trimer: fewer positive residues were located in the rbd than in the sars-cov or sars-cov- rbm subdomain ( fig. a, c and sars-cov- -immunized mice had a good neutralizing ability ( fig. i and j) . for hcov- e, the s trimer serum had a comparable neutralizing ability to that of sars-cov or sars-cov- , but the rbd serum had no detectable neutralizing ability (fig. k) . our experimental results indicate that the lying state of the rbd in the hcov- e s-trimer induces the production of very few antibodies targeting the rbd, but the s-trimer still produces strong neutralizing antibody levels. in this study, we found that more rbd-specific antibodies were induced by the s trimer with the rbd in the standing state than the s trimer with the rbd in the lying state, and the affinity between rbd-specific antibodies and the s trimer was also higher in the standing state. however, we also found that fewer nabs were induced by the rbd of hcov- e than by the rbds of sars-cov or sars-cov- . in terms of hcov- e, the distribution of the potential residues in the rbm was lower than that of sars-cov or sars-cov- , which may have been caused by different rbm patterns and exposure degrees. when we compared the reported nab epitopes of sars-cov and alpha-cov tgev with our results ( ), they were basically consistent. therefore, we believe that this finding illustrates the inherent difference between the rbds of alpha-and beta-cov. 
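The epitope tallies above amount to counting how many predicted residues fall inside a subdomain such as the rbm. A minimal sketch of that bookkeeping follows; the function names, residue positions, and region boundaries are hypothetical placeholders and are not taken from the bepipred or discotope output used in the study.

```python
def count_in_region(epitope_positions, region_start, region_end):
    """Count distinct predicted epitope residues inside [region_start, region_end]."""
    unique_positions = set(epitope_positions)
    return sum(region_start <= pos <= region_end for pos in unique_positions)


def region_fraction(epitope_positions, region_start, region_end):
    """Fraction of distinct predicted epitope residues that lie in the region."""
    unique_positions = set(epitope_positions)
    if not unique_positions:
        return 0.0
    return count_in_region(unique_positions, region_start, region_end) / len(unique_positions)


# Hypothetical residue numbers and a hypothetical subdomain range,
# chosen only to make the arithmetic easy to follow.
predicted_positions = [10, 12, 15, 40, 41, 42, 43, 90]
subdomain = (38, 45)

in_subdomain = count_in_region(predicted_positions, *subdomain)
share = region_fraction(predicted_positions, *subdomain)
print(in_subdomain, share)
```

Comparing this count or fraction across viruses is one simple way to express the statement that fewer predicted epitope residues sit in the hcov- e rbm than in the sars-cov or sars-cov- rbm.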
the intact and stable s subunit of hcov- e is a prerequisite for the production of effective nabs

our experimental results showed that the hcov- e s-trimer can induce strong nab levels, while the rbd alone is less immunogenic. we next explored which functional domains of the s-trimer are involved in the generation of nabs. to clarify this issue, we immunized mice with the hcov- e s trimer ( µg), s ( µg), ntd ( µg), rbd ( µg) and ntd+rbd ( µg+ µg). to better confirm our results, the hcov- e strain vr was used for the neutralization assay. the results indicated that the s trimer serum had the best neutralizing ability, followed by the s and ntd+rbd sera, while the ntd and rbd sera alone had no detectable neutralizing effects (fig. a). these results indicate that the s region in the s-trimer is likely the key region for nab induction. to further verify the importance of the complete s structure in the s-trimer, we designed two s trimer mutants, namely, an ntd-deficient s trimer and an s c/t c s trimer, in which the integrity or stability of the s subunit was destroyed (fig. c and f). the mutant proteins disrupted the conformational conflicts that limit rbd standing, which significantly improved their ability to bind hapn (fig. d and g). however, an incomplete or unstable s conformation significantly reduced the level of nabs induced by the s-trimer (fig. e and k). taken together, these results showed that the intact and stable s subunit of hcov- e is a prerequisite for the production of effective nabs. furthermore, our experimental results show that the rbd has a higher ability to bind to the receptor hapn (fig. b), which indicates that the characteristics of the rbd itself may lead to the generation of fewer neutralizing antibodies. in addition, we screened monoclonal antibodies using the s-trimer, and the results showed that few antibodies targeted the s -rbd (fig. a). 
to further determine the ability of the rbd itself to induce antibodies, we screened monoclonal antibodies targeting the s region and found that the proportion of antibodies targeting the rbd was approximately % (fig. b). since the s protein is expressed in a monomeric form, rbd is not restricted by

we compared the structures of s trimers and rbds among alpha-coronaviruses (figs. b and a). we also predicted the potential b-cell epitopes for their rbds (fig. a; table ). in alpha-cov, the s-trimer had a closed s subunit with three "lying" rbds (fig. b). moreover, the rbds consist of a standard β-sandwich fold core and three short discontinuous loops in the same spatial region ( , , , , , , ) (fig. a). we also performed a structural conservation analysis, and the results showed that the rbd structures of hcov-nl , pedv, and fipv are most similar to that of hcov- e, with rmsd values of . , . , and . , respectively (fig. b). in addition, the distribution of potential b-cell epitopes in the rbds of alpha-covs was also similar to that of hcov- e (fig. a and c; table ). based on the above data, inherent differences exist in the rbds between alpha- and beta-covs (figs. and a). within their respective genera, however, the alpha- and beta-covs show high similarity in their rbds and similar potential immune characteristics (figs. , , a and b). accordingly, in alpha-covs such as hcov- e, subunit vaccines should prioritize the s-trimer rather than the rbd. in beta-covs such as sars-cov and sars-cov- , the s trimer and rbd are both good candidates for subunit vaccines (fig. ). in summary, we systematically analyzed the conformational states and

igg ( : , diluted in pbst with % bsa (w/v), boster) was used for detection. signal reading was carried out in the same manner. hbs buffer was used as a mock

then the plates were reacted with the hybridoma culture supernatants at ℃ for h. hrp-conjugated goat anti-mouse igg ( : , diluted in pbst with % bsa (w/v), boster) was used for detection. 
signal reading was carried out in the manner described above. hybridoma culturing medium was used as a mock control. ratification vote on taxonomic proposals to the international committee on taxonomy of viruses origin and evolution of pathogenic coronaviruses genetic recombination, and pathogenesis of coronaviruses clinical features of patients infected with novel coronavirus in wuhan genomic analysis of human coronaviruses oc (hcov-oc s) circulating in france from to reveals a high intra-specific diversity with new recombinant genotypes coronavirus as a possible cause of severe acute respiratory syndrome anonymous. . the species severe acute respiratory syndrome-related coronavirus: classifying -ncov and naming it sars-cov- structure, function, and evolution of coronavirus spike proteins cryo-em analysis of a feline coronavirus spike protein reveals a unique structure and camouflaging glycans cryo-em structure of the -ncov spike in the prefusion conformation the . -angstrom cryo-electron microscopy structure of the porcine epidemic diarrhea virus spike protein in the prefusion structural basis for human coronavirus attachment to sialic acid receptors the human coronavirus hcov- e s-protein glycan shield and fusion activation of a deltacoronavirus spike glycoprotein fine-tuned for enteric infections cryo-electron microscopy structure of porcine deltacoronavirus spike protein in the prefusion state cryo-em structures of mers-cov and sars-cov spike glycoproteins reveal the dynamic receptor binding domains cryo-electron microscopy structures of the sars-cov spike glycoprotein reveal a antigenic and immunogenic characterization of recombinant baculovirus-expressed severe acute respiratory syndrome coronavirus spike protein: implication for vaccine design recombinant receptor binding domain protein induces partial protective immunity in rhesus macaques against middle east respiratory syndrome coronavirus challenge immunogenicity and structures of a rationally designed 
prefusion mers-cov spike antigen structural bases of coronavirus attachment to host aminopeptidase n and its inhibition by neutralizing antibodies the x-ray crystal structure of human aminopeptidase n reveals a novel dimer and the basis for peptide processing comparison of coronaviruses a sequence homology and bioinformatic approach can predict candidate targets for immune responses to sars-cov- clustal w and clustal x version . s : receptor-binding subunit; s : membrane fusion subunit; ntd: n-terminal domain; rbd: receptor-binding domain (magenta). (b) overall structure comparison of coronavirus s trimers structure-based b-cell epitope predictions of beta-cov (sars-cov and sars-cov- ) and alpha-cov (hcov- e). (a, c and e) the predicted b cell epitopes of sars-cov, sars-cov- and hcov- e are shown. the linear (red cartoon) and conformational (yellow sphere) b cell epitopes were bepipred . or discotope . and labeled onto the corresponding structure by and f) the complex structures of the rbds of sars-cov sars-cov- and hcov- e with the receptors (hace and hapn) are shown. the interface area of each complex and the surface area of each rbd were calculated via the rbm region of the rbd and the receptors (hace and hapn) are shown in red and cyan immunological analysis of beta-cov (sars-cov and sars-cov- ) and hcov- e). (a and b) cross-reactivity of the sars-cov s trimer and mice sera of sars-cov s trimer (red) and sars-cov rbd (blue) were -fold serially diluted (starting with -fold dilution) and reacted with the s trimer (a) or rbd (b), respectively cross-reactivity of the sars-cov- s trimer and rbd-specific sera is determined by mice sera of sars-cov- s trimer (magenta) and sars-cov- rbd (slate) were -fold diluted and reacted with sars-cov- s trimer (c) and rbd (d) -reactivity of the hcov- e s trimer and rbd-specific sera is determined by elisa. 
mice sera of hcov- e s trimer (orange) and hcov- e rbd the antibody titers of sera from mice immunized with μg of the hcov- e rbd (brown) and μg of the hcov- e rbd (purple) all data above are presented as the dilution that remained positive. (i, j and k) the neutralization assay of mouse sera from the spike trimer and rbd against sars-cov, sars-cov- and hcov- e pseudoviruses is determined. the data are presented as the mean reciprocal ic titer. the limit of detection for the assay depends on the initial dilution and is represented by the intact and stable s subunit of hcov- e is a prerequisite for the production of effective nabs. (a) the neutralization abilities of mouse sera from the b) determination of the affinity of ntd and rbd with the receptor hapn. (c) structural model of hcov- e-s-△ntd. magenta: rbd; green: sd ; cyan: sd . (d) dose-dependent binding of hcov- e-s-△ntd and hapn. (e) the neutralization ability of mouse sera from hcov- e-s-△ntd was measured via pseudovirus neutralization assay magenta: rbd; blue: ntd; green: sd ; cyan: sd . (g) dose-dependent binding of h) the neutralization ability of mouse sera from hcov- e-s-s c/t c was measured via pseudovirus neutralization assay the limit of detection for the assay depends on the initial dilution and is represented by dotted lines,a reciprocal ic titer of was assigned. besides monoclonal antibody epitope mapping of the hcov- e spike protein monoclonal antibody (mab) epitope regions in the hcov- e spike protein (a) and s domain (b). supernatants of positive hybridomas were reacted with the data are presented as the od (bottom). mabs and their epitope regions are indicated below the schematic of the b cell epitope analysis of the rbd regions of alpha-coronavirus spike proteins. (a) structures of the rbds from alpha-covs (hcov- e the linear (red cartoon) and conformational (yellow sphere) b cell epitopes were predicted by bepipred . or discotope . and labeled onto the corresponding rbd structure by pymol. 
(b) structural comparison of the rbds from alpha-covs. (c) sequence alignment of the rbds from alpha-covs. the rbm or putative rbm region is shown in cyan foundation (program no. py ). the authors declare no competing interests. fig. potential vaccine strategies for alpha-and beta-covs. the model showed that the rbds of the alpha-cov s trimers are in a lying state. in this state, the s protein cannot bind to the receptor, but meanwhile, this state is also conducive to escaping the immune response target the rbd, and the rbds of the alpha-covs also induces fewer nabs; thus, their s-trimers can be an effective potential subunit vaccine. in key: cord- -k iqv jb authors: li, yujun; wang, haimin; tang, xiaojuan; fang, shisong; ma, danting; du, chengzhi; wang, yifei; pan, hong; yao, weitong; zhang, renli; zou, xuan; zheng, jie; xu, liangde; farzan, michael; zhong, guocai title: sars-cov- and three related coronaviruses utilize multiple ace orthologs and are potently blocked by an improved ace -ig date: - - journal: j virol doi: . /jvi. - sha: doc_id: cord_uid: k iqv jb the ongoing coronavirus disease (covid- ) pandemic has caused > million infections and > , deaths. severe acute respiratory syndrome coronavirus (sars-cov- ), the etiological agent of covid- , has been found closely related to the bat coronavirus strain ratg (bat-cov ratg ) and a recently identified pangolin coronavirus (pangolin-cov- ). here, we first investigated the ability of sars-cov- and three related coronaviruses to utilize animal orthologs of angiotensin-converting enzyme (ace ) for cell entry. we found that ace orthologs of a wide range of domestic and wild mammals, including camels, cattle, horses, goats, sheep, cats, rabbits, and pangolins, were able to support cell entry of sars-cov- , suggesting that these species might be able to harbor and spread this virus. 
in addition, the pangolin and bat coronaviruses, pangolin-cov- and bat-cov ratg , were also found to be able to utilize human ace and a number of animal ace orthologs for cell entry, indicating a risk that these viruses could spill over into humans in the future. we then developed potent anticoronavirus ace -ig proteins that are broadly effective against the four distinct coronaviruses. in particular, by truncating ace at its residue but not , introducing a d e mutation, and adopting an antibody-like tetrameric-ace configuration, we generated an ace -ig variant that neutralizes sars-cov- in the picomolar range. these data demonstrate that the improved ace -ig variants developed in this study could potentially be developed to protect from sars-cov- and some other sars-like viruses that might spill over into humans in the future.

importance

the severe acute respiratory syndrome coronavirus (sars-cov- ) is the etiological agent of the currently uncontrolled coronavirus disease (covid- ) pandemic. it is important to study the host range of sars-cov- , because some domestic species might harbor the virus and transmit it back to humans. in addition, insight into the ability of sars-cov- and sars-like viruses to utilize animal orthologs of the sars-cov- receptor ace might provide structural insight into improving ace -based viral entry inhibitors. in this study, we found that ace orthologs of a wide range of domestic and wild animals can support cell entry of sars-cov- and three related coronaviruses, providing insights into identifying animal hosts of these viruses. we also developed recombinant ace -ig proteins that are able to potently block these viral infections, providing a promising approach to developing antiviral proteins broadly effective against these distinct coronaviruses.

cov- whu , pangolin-cov- , bat-cov ratg , and sars-cov bj . 
purified rbd proteins were then used to perform surface staining of t cells transfected with each of the ace orthologs or a vector plasmid control (fig. ). all of the rbd proteins showed binding to a number of ace orthologs.

fig sars-cov- and ace contact residues are conserved among four sars-like viruses and ace orthologs, respectively. (a) interactions between the sars-cov- receptor binding domain (rbd, red) and ace (blue) involve a large number of contact residues (pdb accession no. m j). rbd residues < Å from ace atoms and ace residues < Å from rbd atoms are shown. (b) the sequences of the sars-cov- whu , a pangolin coronavirus identified in manis javanica (pangolin-cov- ), a bat coronavirus identified in r. affinis (bat-cov ratg ), and the sars-cov bj are aligned, with residues different from the corresponding ones in sars-cov- highlighted in blue. the stars indicate rbd residues < Å from ace atoms. the yellow lines indicate the rbm region. n-linked glycosylation motifs are indicated in green. (c) sequences of ace orthologs from the indicated species are aligned, with only residues < Å from rbd atoms shown here. the numbering is based on human ace protein, and the residues different from the corresponding ones in human ace are highlighted in blue.

unexpectedly, although the sars-cov- and sars-cov rbds differ significantly in the rbm region and ace -contact residues, both rbds bound to an identical set of ace orthologs. moreover, the pangolin-cov- rbd, which differs from the sars-cov- rbd by only one amino acid within the rbm region, kept nine sars-cov- rbd-interacting ace orthologs and gained three additional interacting ones, namely those of rat, mouse, and chicken. the rbd of bat-cov ratg , in contrast, showed a binding profile significantly different from and narrower than those of the other three rbds. 
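The "kept" and "gained" language used for the binding profiles is a set comparison, which can be sketched as below. The function name is hypothetical, and the species lists are only a partial transcription of what the text states (the domestic species reported to bind all four rbds, plus the three orthologs gained by the pangolin-cov rbd); they are not the full panel from the heatmap.

```python
def compare_profiles(reference, other):
    """Compare two sets of ace orthologs that support rbd binding.

    Returns orthologs shared by both ("kept"), present only in `other`
    ("gained"), and present only in `reference` ("lost").
    """
    reference, other = set(reference), set(other)
    return {
        "kept": sorted(reference & other),
        "gained": sorted(other - reference),
        "lost": sorted(reference - other),
    }


# Partial, illustrative lists based on species named in the text;
# the complete experimental panel is larger.
sars_cov_2_binders = {
    "human", "camel", "cattle", "horse", "goat", "sheep", "cat", "rabbit",
}
pangolin_cov_binders = sars_cov_2_binders | {"rat", "mouse", "chicken"}

result = compare_profiles(sars_cov_2_binders, pangolin_cov_binders)
print(result["gained"])
print(result["lost"])
```

A narrower profile, such as the one described for the bat-cov rbd, would show up here as a long "lost" list and a short "kept" list.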
note that human ace and ace orthologs of some domestic animals, including camels, cattle, horses, goats, sheep, cats, and rabbits, supported efficient binding to all four tested rbds, suggesting that these ace orthologs might be generally functional in supporting cell entry of the four tested viruses.

fig a wide range of ace orthologs support binding to rbd proteins of sars-cov- and three related coronaviruses. (a) t cells were transfected with adjusted amounts of the indicated ace -ortholog plasmids to have similar expression levels of the ace ortholog proteins. cells were then stained with an rbd-mouse igg fc fusion protein of sars-cov- whu , pangolin-cov- , bat-cov ratg , or sars-cov bj , followed by staining with an alexa -goat anti-mouse igg secondary antibody. rbd-ace binding was detected using flow cytometry. (b) percentages of cells positive for rbd binding in panel a are presented as a heatmap according to the indicated color code. (c) expression levels of the indicated ace orthologs were detected using western blotting. the data shown are representative of two independent experiments performed by two different people with similar results.

a wide range of ace orthologs can support entry of the four coronaviruses.

to evaluate spike protein-mediated entry of these coronaviruses, we generated retrovirus-based luciferase reporter pseudoviral particles (pp) enveloped with one of six different spike proteins, including a wild-type sars-cov- spike (sars-cov- whu pp), a furin site deletion mutant of sars-cov- spike (sars-cov- Δfurin pp), a wild-type sars-cov spike (sars-cov bj pp), a sars-cov spike carrying the pangolin-cov- rbd (pangolin-rbd/bj pp), a wild-type bat-cov spike (bat-cov ratg pp), and a bat-cov spike carrying the pangolin-cov- rbd (pangolin-rbd/ratg pp). these reporter pseudoviruses were used to infect t cells expressing each of the ace orthologs. 
a vesicular stomatitis virus protein g (vsv-g)-pseudotyped reporter retrovirus whose entry is independent of ace was used as a control virus. as expected, all the orthologs that supported rbd binding were also functional in supporting pseudovirus infection. again, ace orthologs of humans and most domestic mammals, including camels, cattle, horses, goats, sheep, cats, and rabbits, supported entry of all the tested pseudoviruses (fig. a to g). it is of note that, although furin-cleaved and uncleaved sars-cov- spike trimers differ significantly in structure ( ), infection with sars-cov- Δfurin pseudovirus produced stronger reporter signals but an almost identical pattern of ace -ortholog usage compared with the wild-type pseudovirus (fig. a and b). these data are consistent with the findings that both furin-cleaved and uncleaved sars-cov- spike trimers can adopt an ace -binding-competent conformation, albeit at different frequencies ( to %), and that furin cleavage reduces the overall stability of the spike protein ( ) ( ) ( ). the ability of ace orthologs to support sars-cov- infection was further confirmed by infection assays using sars-cov- live virus (fig. h). these data indicate that humans and these domestic animals might be generally susceptible to infection by the four distinct coronaviruses.

ace -ig variants that have soluble ace domain truncated at residue but not potently block sars-cov- entry.

recombinant rbd and soluble ace proteins have been shown to potently block sars-cov entry ( , ). to investigate whether similar approaches could also be applied to sars-cov- , we first produced mouse igg a fc fusion proteins of rbd (rbd-ig) and soluble ace (ace -ig) variants (fig. a). specifically, the rbd variants include wild-type rbds of sars-cov, pangolin-cov- , and sars-cov- , and four mutants of sars-cov- rbd that were expected to bind ace better via additional possible aromatic-stacking (f w, y w) or salt-bridge (k r, g d) interactions. 
the ectodomain of cell-surface ace spanning its residues to contains an enzymatic domain ( - ) and a collectrin-like domain (cld). previous crystal-structure studies showed that soluble ace protein truncated at its residue , preceding the cld domain, expresses well and forms a stable complex with the rbd of sars-cov and sars-cov- , respectively ( , ). therefore, the ace -ig variants include human ace truncated at its residue ( -wt) and ( -wt), respectively, and mutants of the - and -version ace -ig proteins that were expected to inactivate the enzymatic activity of ace (nn) or bind sars-cov- rbd better via additional possible hydrophobic (y w, h y, m k) or salt-bridge (d e) interactions. we then evaluated these proteins for their potency in blocking sars-cov- Δfurin pseudovirus infection (fig. b and c). among all the rbd-ig variants, the y w mutant of sars-cov- rbd showed modestly improved potency over the wild type, and the wild-type rbd of pangolin-cov- showed the best neutralization activity among all the tested rbds (fig. b). among the tested ace -ig variants, interestingly, all the -version variants showed significantly better potency than the -version variants (two-tailed two-sample t test, p < . ; fig. c). in addition, d e mutants of both - and -version proteins showed improved potency over the corresponding wild types (two-sample t test, p < . ), and the -d e variant outperformed all the rbd-ig and ace -ig variants.

the -d e variant of ace -ig is a broadly neutralizing immunoadhesin.

we further tested the -wt and -d e variants of ace -ig for their neutralization activities against sars-cov- , sars-cov, pangolin-cov- , and bat-cov ratg pseudotypes (fig. ). interestingly, the d e mutation improved the protein's neutralization activity against sars-cov- and bat-cov ratg pseudoviruses (two-sample t test, p < . ; fig. a and c) but not pangolin-rbd/bj or sars-cov pseudovirus (two-sample t test, p > . ; fig. b and d). 
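The comparisons above rely on a two-sample t test. As a rough illustration of the statistic behind such a test, the sketch below computes a Welch (unequal-variance) t value and its approximate degrees of freedom using only the standard library. Whether the authors used the Welch or the pooled-variance form is not stated, so the Welch choice is an assumption, and the replicate values are hypothetical.

```python
import math
from statistics import mean, variance


def welch_t(sample_a, sample_b):
    """Return (t statistic, Welch-Satterthwaite degrees of freedom)."""
    n_a, n_b = len(sample_a), len(sample_b)
    if n_a < 2 or n_b < 2:
        raise ValueError("each sample needs at least two observations")
    # Squared standard error of each sample mean.
    se2_a = variance(sample_a) / n_a
    se2_b = variance(sample_b) / n_b
    t_stat = (mean(sample_a) - mean(sample_b)) / math.sqrt(se2_a + se2_b)
    dof = (se2_a + se2_b) ** 2 / (
        se2_a ** 2 / (n_a - 1) + se2_b ** 2 / (n_b - 1)
    )
    return t_stat, dof


# Hypothetical replicate readouts for two protein variants (arbitrary units).
variant_a = [1.0, 2.0, 3.0]
variant_b = [2.0, 3.0, 4.0]

t_stat, dof = welch_t(variant_a, variant_b)
print(round(t_stat, 4), round(dof, 2))
```

Turning the statistic into a two-tailed p value requires the cumulative distribution of Student's t with `dof` degrees of freedom, which the standard library does not provide; a statistics package would normally be used for that step.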
the d residue of human ace was consistently found to form a salt-bridge interaction with the k residue of sars-cov- rbd in multiple released structures of sars-cov- rbd in complex with ace ( , ) (fig. e). the residue of bat-cov ratg is also a lysine, while the same residue is an arginine for pangolin-cov- rbd and a valine for sars-cov rbd. thus, the d e-mediated improvement is likely explained by the mutation enhancing the salt-bridge interaction between the residue of ace and the residue of the sars-cov- and bat-cov ratg rbds. these data suggest that the -d e variant of ace -ig is a broadly neutralizing immunoadhesin against sars-cov- , sars-cov, pangolin-cov- , and bat-cov ratg .

an antibody-like ace -ig variant further improves neutralization potency by ~ -fold.

to obtain a more potent ace -ig that may serve as an anti-sars-cov- drug candidate, we generated more variants and used human igg domains to replace the mouse counterparts in the ace -ig fusion proteins. two new variants, which have an antibody-like configuration and contain four or six soluble ace domains in a single molecule, are named ace -ig-v and ace -ig-v , respectively (fig. a). we first tested these variants for their neutralization potency against sars-cov- pseudovirus. both of the new variants showed pronounced improvements over ace -ig-v . , the original -d e dimer variant (fig. a to c). specifically, ace -ig-v and ace -ig-v have estimated % inhibitory concentration (ic ) values of to pm and ic values of to pm, representing > -fold improvement on ic and > -fold improvement on ic compared to those of ace -ig-v . (fig. b and c). because the ace -ig-v configuration significantly impairs protein yield during production (data not shown), we chose to proceed with ace -ig-v to test its neutralization potency against sars-cov- live virus (fig. d). consistent with the pseudovirus neutralization data, ace -ig-v at . 
g/ml showed a more potent inhibition of sars-cov- live virus infection than ace -ig-v at . g/ml. moreover, ace -ig-v at . g/ml ( . nm) already completely abolished the viral nucleocapsid protein (np) immunofluorescent signal. these data demonstrate that ace -ig-v is a markedly improved ace -ig variant and a potent entry inhibitor against sars-cov- virus.

sars-cov- and ratg have a k residue in their spike proteins, while pangolin-cov has an r residue and sars-cov has a v residue in their spike proteins, respectively. thus, a stabilized salt bridge interaction between e of the ace -ig protein and k of the virus spike protein is likely responsible for the d e mutation-mediated neutralization enhancement. the data shown are representative of two or three experiments independently performed by two different people with similar results, and data points in panels a to d represent the means ± the sd of three or four biological replicates.

in this study, we investigated the ability of sars-cov- to utilize animal orthologs of ace for cell entry. we observed that ace orthologs of a wide range of domestic animals, including camels, cattle, horses, goats, sheep, cats, and rabbits, efficiently supported binding and entry of this virus, suggesting that these domestic mammals might be susceptible to this viral infection (fig. b and fig. g). consistent with this, during preparation of the manuscript, two studies independently reported laboratory and natural infection of cats by sars-cov- ( , ). therefore, it is necessary to further investigate the susceptibility of these domestic animals to sars-cov- . in addition to the species investigated in this study, farmed mink has been found susceptible to sars-cov- and able to transmit the virus back to humans ( , ). the currently available information therefore suggests that surveillance of livestock and farmed mammals in marketplaces for sars-cov- infection might be necessary. 
pangolins have been proposed as potential intermediate or natural hosts of sars-cov- ( ) ( ) ( ) . it has been proposed that sars-cov- might originate from recombination of a pangolin-cov-like coronavirus and a ratg -like coronavirus ( , ) . thus, the intermediate host of sars-cov- should also be susceptible to both of the "parental viruses." here, we found that pangolin (manis javanica) ace does not support binding or entry of bat-cov ratg , a coronavirus known to have the highest genome sequence identity ( . %) to sars-cov- so far, suggesting a lower likelihood of malayan pangolins (manis javanica) being intermediate hosts for sars-cov- . on the other hand, we also found that sars-cov- , sars-cov, and pangolin-cov- ( ) can all efficiently utilize pangolin (manis javanica) ace for cellular binding and entry, supporting the hypothesis that pangolins might be natural hosts of sars-cov-like coronaviruses ( , ) . it is also noteworthy that pangolin-cov- can also efficiently utilize human ace , as well as a wide range of domestic-and wild-animal ace orthologs for cell entry (fig. b and g) , indicating that this virus has very broad host range and high risk of spillover into human population in the future. animal models are essential for preclinical evaluation of efficacy and potential toxicity of candidate prophylactic vaccines or therapeutics for covid- , as well as for studying the transmission, pathogenesis, and immunology of this disease. in this study, rabbit ace was found to efficiently support binding and entry of all the four coronaviruses ( fig. b and g ). it is therefore worth exploring whether rabbit, a commonly used laboratory species, could serve as a common model animal for studying covid- , sars, and diseases caused by other sars-related coronaviruses. adaptation of viruses to infect mice is another way of developing small animal models of covid- . 
we found in this study that pangolin-cov- , whose rbm differs from that of sars-cov- by only one amino acid, could efficiently utilize mouse ace for binding and cell entry (fig. b, b, and g). these data suggest that mice might be susceptible to this viral infection, and thus this pangolin coronavirus could be used as a surrogate for sars-cov- in in vivo studies in wild-type mice. effective vaccines or targeted therapeutics against sars-cov- infections are not yet available ( ). ace -ig that has soluble ace truncated at its residue had been proposed as a candidate therapeutic for sars-cov- infection at the beginning of the covid- pandemic ( ). two recently posted experimental studies by lui et al. ( ) and case et al. ( ) have also investigated the use of this -version ace -ig to block sars-cov- infection and showed ic values of ~ and g/ml, respectively. consistent with these studies, we obtained an estimated ic value of ~ g/ml for this variant ( -wt in fig. c). it is noteworthy that our studies here have identified three key improvements over the -version ace -ig. first, the -wt ace -ig variant showed a > -fold potency improvement over the -wt variant, and all the version variants showed markedly enhanced neutralization potency over the version variants (fig. c). the reason for this improvement is not yet clear, but it is possible that the cld domain, included in the version, stabilizes an orientation of the ace enzymatic domain favorable to s-protein binding, or that the cld has an independent anti-sars-cov- activity. second, likely because sars-cov- has adapted in animals (e.g., pangolins) whose ace orthologs have a glutamic acid at position , the d e mutation further improves the -version ace -ig's neutralization potency against sars-cov- . moreover, the d e mutation also enables the protein to neutralize all four distinct sars-like coronaviruses (fig. ).
third, by utilizing an antibody-like structure to build an ace tetramer, we generated a variant ace -ig-v that has at least -fold additional improvement in neutralization potency against sars-cov- pseudotype as well as live virus (fig. ). through these changes, we have therefore improved the originally very modest immunoadhesin inhibitor (ic ≈ nm) into a very potent entry inhibitor (ic ≈ pm) against sars-cov- . recently, a clinical-grade recombinant soluble ace protein has been shown to block sars-cov- infection in engineered human organoids ( ). a phase clinical trial to investigate this protein, injected twice daily, as a treatment for covid- patients has already started recruiting patients in multiple countries (https://clinicaltrials.gov/ct / show/nct ). considering that fusion with igg fc improves soluble ace protein's half-life in mice from < h to over a week ( , ) and that we have markedly improved the neutralization potency of ace -ig, we optimistically expect that the improved ace -ig variants described in this study could potentially be developed to provide effective protection from sars-cov- and other sars-like viruses that might spill over into humans in the future, as well as sars-cov- variants that emerge over the course of the current pandemic.
cells. t cells and vero cells were kindly provided by stem cell bank, chinese academy of sciences, confirmed mycoplasma-free by the provider, and maintained in dulbecco modified eagle medium (dmem; life technologies) at °c in a % co -humidified incubator. growth medium was supplemented with mm glutamax-i (gibco, catalog no. ), m nonessential amino acids (gibco, catalog no. ), u/ml penicillin and g/ml streptomycin (gibco, catalog no. ), and % fbs (gibco, catalog no. c). t-based stable cells expressing human ace were maintained under the same culture conditions as t cells, except that g/ml of puromycin was added to the growth medium.
f cells for recombinant protein production were generously provided by yu j. cao (school of chemical biology and biotechnology, peking university shenzhen graduate school) and maintained in smm -tii serum-free medium (sino biological, catalog no. m tii) at °c, % co , in a shaker incubator at rpm. plasmids. dna fragments encoding spike proteins of sars-cov- whu (genbank accession no. mn . ), sars-cov bj (genbank ay . ), pangolin-cov (national genomics data center gwhabkw ; https://bigd.big.ac.cn/search/?dbid=gwh&q=gwhabkw &page= ) ( ), and bat-cov ratg (genbank mn ) ( ), were synthesized by the beijing genomic institute (bgi, china) and sangon biotech (shanghai, china) and then cloned into pcdna . (+) plasmid or pcaggs plasmid between ecori and xhoi restriction sites. plasmids encoding recombinant rbd and soluble ace variants were generated by cloning each of the gene fragments into a pcaggs-based mouse-igg a or human igg fc fusion protein expression plasmid between noti and bspei sites. the retroviral reporter plasmids encoding a gaussia luciferase or a green fluorescent protein (gfp) reporter gene were constructed by cloning the reporter genes into pqcxip plasmid (clontech), respectively. dna fragments encoding c-terminally s-tagged ace orthologs were synthesized in puc backbone plasmid by sangon biotech (shanghai, china). these fragments were then cloned into pqcxip plasmid (clontech) between sbfi and noti restriction sites. igg fc fusion protein production and purification. f cells at the density of × cells/ml were seeded into ml of smm -tii serum-free medium (sino biological, catalog no. m tii) day before transfection. the cells were then transfected with g of plasmid in complex with g of pei max (polysciences, inc., catalog no. - ). cell culture supernatants were collected at to h posttransfection. recombinant fc fusion proteins were purified using protein a-sepharose cl- b (ge healthcare, catalog no. - - ), eluted with . m citric acid at ph .
, and neutralized with m tris-hcl at ph . . buffers were then exchanged to phosphate-buffered saline (pbs), and proteins were concentrated by -kda cutoff amicon ultra- centrifugal filter units (millipore, catalog no. ufc ). flow cytometry for detecting interactions of rbd-ig proteins with cell surface ace orthologs. t cells were seeded at % density in -well plates at to h before transfection. cells in each well were then transfected with . l of lipofectamine (life technologies, catalog no. ) in complex with ng of plasmid encoding one of the ace orthologs or a d e mutant of the human ace . culture medium was changed at h after transfection. cells were then detached with mm edta (life technologies, catalog no. ) at h posttransfection. the cells were then stained with g/ml rbd-ig proteins at °c for min, washed three times, and then stained with g/ml alexa -conjugated goat anti-mouse igg secondary antibody (invitrogen, catalog no. a- ) at room temperature for min. after another three washes, cells were analyzed by attune nxt flow cytometer (thermo fisher), and signals of , fsc/ssc-gated cells were collected for each sample. western blot to detect s-tagged ace (ace -s-tag) or c -tagged spike (spike-c -tag) expression in t cells. t cells were seeded at % density in -well plates at to h before transfection. cells in each well were then transfected with g of plasmid in complex with l of lipofectamine (life technologies, catalog no. ). at h after transfection, the cells were lysed, and g of total protein was used for western blotting. ace -s-tag expression was detected by using . , a mouse anti-s-tag monoclonal antibody (invitrogen, catalog no. ma - ), and a horseradish peroxidase (hrp)-conjugated goat anti-mouse igg fc secondary antibody (invitrogen, catalog no. ). beta-actin was used as an internal control. spike-c -tag expression was then detected by using d , a mouse anti-c -tag monoclonal antibody (invitrogen, catalog no. 
ma - ), and the hrp-conjugated goat anti-mouse igg fc secondary antibody (invitrogen, catalog no. ). mlv retroviral vector-based coronavirus-spike and vsv-g pseudotypes were produced using a previously described protocol ( ) with some modifications. t cells were seeded at % density in mm dish at to h before transfection. cells were then transfected with . g of pei max (polysciences, inc., catalog no. - ) in complex with . g of plasmid encoding a coronavirus spike protein or vsv-g, . g of plasmid encoding murine leukemia virus (mlv) gag and pol proteins, and . g of a pqcxip-based gfp or luciferase reporter plasmid. eight hours after transfection, cell culture medium was refreshed and changed to growth medium containing % fetal bovine serum (fbs; gibco, catalog no. c) and mm hepes (gibco, catalog no. ). cell culture supernatants were collected at to h posttransfection, spun down at , × g for min, and filtered through . -m filter units to remove cell debris. coronavirus spike-pseudotyped viruses were then concentrated times at , × g using -kda cutoff amicon ultra- centrifugal filter units (millipore, catalog no. ufc ). pseudovirus infection of t cells expressing ace orthologs. t cells were seeded at % density in polylysine precoated -well plates to h before transfection. cells in each well were then transfected with . coronavirus pseudovirus neutralization assay. coronavirus spike protein-pseudotyped luciferase reporter viruses were prediluted in dmem ( % fbs, heat inactivated) containing titrated amounts of rbd-ig or ace -ig variant proteins. an fc fusion protein of an anti-influenza hemagglutinin (ha) antibody, f -scfv ( ), was used as a control protein here. virus-inhibitor mixtures were then added to ace -expressing t or hela cells in polylysine (sigma, catalog no. p - ml) precoated -well plates and incubated overnight at °c. the cells were then washed with serum-free medium and incubated in l of dmem ( % fbs) at °c.
cell culture supernatants were collected for gaussia luciferase assay at h postinfection. sars-cov- live virus infection of ace ortholog-expressing t cells. t cells expressing one of the ace orthologs were inoculated with sars-cov- live virus at % tissue culture infective dose(s) (tcid ) and incubated for h at °c. the cells were then washed with serum-free medium and incubated in l of dmem ( % fbs) at °c for an additional h. the cells were then fixed with % paraformaldehyde in pbs, permeabilized with . % triton x- , and sequentially stained with : -diluted rabbit anti-sars-cov- nucleocapsid polyclonal antibody (sino biological, catalog no. -t ) at °c for min, g/ml of alexa fluor goat anti-rabbit igg (invitrogen, catalog no. a- ) at °c for min, and . g/ml of dapi ( =, =-diamidino- -phenylindole; sigma-aldrich, catalog no. d - mg) at room temperature for min. stained cells were then examined under fluorescence microscope (ix microscope; olympus). sars-cov- live virus neutralization by ace -ig variants. sars-cov- live virus at tcid were prediluted in dmem ( % fbs, heat inactivated) containing titrated amounts of ace -ig variant proteins and incubated at °c for h. virus-inhibitor mixtures were then added to ace -expressing t or hela cells in polylysine-precoated -well plates and incubated for h at °c. the cells were then washed with serum-free medium and incubated in l of dmem ( % fbs) at °c for to h. cells were then fixed for immunofluorescence staining of viral nucleocapsid proteins as described above. data collection and analysis. all the experiments were repeated two to four times. all of the infection assays for fig. were independently performed by two different people, and all the data are reproducible in different hands. the key neutralization assays of fig. to were independently performed by two or three different people, and all of the data are reproducible in different hands. 
attune nxt software (thermo fisher) was used to collect and analyze flow cytometry data. image lab software (bio-rad) was used to collect sds-page and western blot image data. cell sens software (olympus) was used to collect fluorescence microscopy data. ice software (berthold technologies) was used to collect luciferase assay data. graphpad prism . software was used for figure preparation and statistical analyses. statistical analysis. data are expressed as mean values ± the standard deviations (sd). statistical analyses were performed using a two-sided two-sample student t test in graphpad prism . software when applicable. differences were considered significant at p < . . data availability. the study did not generate unique data sets or code. our research resources, including methods, plasmids, and protocols, are available upon reasonable request to qualified academic
the complete genome sequence of severe acute respiratory syndrome coronavirus strain hku- (hk- ) genomic characterization of the severe acute respiratory syndrome coronavirus of amoy gardens outbreak in hong kong epidemiology and cause of severe acute respiratory syndrome people's republic of china fatal swine acute diarrhoea syndrome caused by an hku -related coronavirus of bat origin isolation and characterization of viruses related to the sars coronavirus from animals in southern china origin and evolution of pathogenic coronaviruses broad cross-species infection of cultured cells by bat hku -related swine acute diarrhea syndrome coronavirus and identification of its replication in murine dendritic cells in vivo highlight its potential for diverse interspecies transmission a pneumonia outbreak associated with a new coronavirus of probable bat origin china novel coronavirus investigator research team. .
a novel coronavirus from patients with pneumonia in china genomic characterization and epidemiology of novel coronavirus: implications for virus origins and receptor binding are pangolins the intermediate host of the novel coronavirus (sars-cov- )? identifying sars-cov- related coronaviruses in malayan pangolins isolation of sars-cov- -related coronavirus from malayan pangolins therapeutic options for the novel coronavirus ( -ncov) the hallmarks of covid- disease receptor recognition and cross-species infections of sars coronavirus angiotensin-converting enzyme is a functional receptor for the sars coronavirus functional assessment of cell entry and receptor usage for sars-cov- and other lineage b betacoronaviruses sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor structural basis for the recognition of the sars-cov- by full-length human ace structure of the sars-cov- spike receptor-binding domain bound to the ace receptor structure of sars coronavirus spike receptor-binding domain complexed with receptor sars-cov- and bat ratg spike glycoprotein structures inform on virus evolution and furin-cleavage effects structure, function, and antigenicity of the sars-cov- spike glycoprotein cryo-em structure of the -ncov spike in the prefusion conformation a -amino acid fragment of the sars coronavirus s protein efficiently binds angiotensin-converting enzyme retroviruses pseudotyped with the severe acute respiratory syndrome coronavirus spike protein efficiently infect cells expressing angiotensin-converting enzyme susceptibility of ferrets, cats, dogs, and other domesticated animals to sars-coronavirus sars-cov- neutralizing serum antibodies in cats: a serological investigation coronavirus rips through dutch mink farms, triggering culls sars-cov- infection in farmed minks, the netherlands evidence of recombination in coronaviruses implicating pangolin origins of ncov- therapeutic strategies in an outbreak scenario to 
treat the novel coronavirus originating in wuhan, china trimeric sars-cov- spike interacts with dimeric ace with limited intra-spike avidity neutralizing antibody and soluble ace inhibition of a replication-competent vsv-sars-cov- and a clinical isolate of sars-cov- inhibition of sars-cov- infections in engineered human tissues using clinical-grade soluble human ace fusion proteins for half-life extension of biologics as a strategy to make biobetters novel ace -fc chimeric fusion provides long-lasting hypertension control and organ protection in mouse models of systemic renin angiotensin system activation key: cord- - w caxwu authors: zeng, xin; li, lingfang; lin, jing; li, xinlei; liu, bin; kong, yang; zeng, shunze; du, jianhua; xiao, huahong; zhang, tao; zhang, shelin; liu, jianghai title: blocking antibodies against sars-cov- rbd isolated from a phage display antibody library using a competitive biopanning strategy date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: w caxwu the infection of the novel coronavirus sars-cov- have caused more than , deaths, but no vaccine or specific therapeutic antibody is currently available. sars-cov- relies on its spike protein, in particular the receptor binding domain (rbd), to bind human cell receptor angiotensin-converting enzyme (ace ) for viral entry, and thus targeting rbd holds the promise for preventing sars-cov- infection. in this work, a competitive biopanning strategy of a phage display antibody library was applied to screen blocking antibodies against rbd. high-affinity antibodies were enriched after the first round using a standard panning process in which rbd-his recombinant protein was immobilized as a bait. at the next two rounds, immobilized ace -fc and free rbd-his proteins were mixed with the enriched phage antibodies. antibodies binding to rbd at epitopes different from ace -binding site were captured by the immobilized ace -fc, forming a “sandwich” complex. 
only antibodies that competed with ace for recognizing rbd at the same or similar epitopes could bind to the free rbd-his in the supernatant and be subsequently separated by the ni-nta magnetic beads. the top lead from the competitive biopanning of a synthetic antibody library, lib ab , was produced in the full-length igg format. it was shown to competitively block the binding of rbd to ace protein and to potently inhibit sars-cov- pseudovirus infection of ace -overexpressing hela cells with ic values of nm. in contrast, the top lead from the standard biopanning of lib ab could only bind to rbd in vitro and had no blocking or neutralization activity. our strategy can efficiently isolate blocking antibodies against rbd, and it would speed up the discovery of neutralizing antibodies against sars-cov- . the recent outbreak of a novel coronavirus disease (covid- ) has escalated from a public health emergency of international concern to a global pandemic. its pathogen, sars-cov- , is a newly identified β-coronavirus. coronaviruses take their family name from the spike (s) protein on the viral particle. the highly glycosylated s protein stays compact in a trimeric state, recognizes its receptor on the host cell membrane, and then undergoes a series of conformational changes, proteolysis events and membrane fusion to complete viral entry. for vaccines, clinical diagnosis, early prevention and medication, the s protein is the most significant target. the primary sequences of the s proteins of severe acute respiratory syndrome coronavirus (sars-cov) and sars-cov- share about % identity and % similarity, which indicates a high possibility of structural homology and the same infection pathway. sars-cov and sars-cov- recognize the same host cell receptor, ace , for mediating viral entry into host cells. it was reported that the sars-cov s protein trimer bound to ace at a : ratio [ , ].
before infection, rbd of each sars-cov s monomer was partially buried in the inactive "down" conformation and not able to bind ace due to steric clash. once infection started, one monomer turned "up" its rbd to expose enough space to ace , inducing further conformational open and loose for proteolysis [ , ] . atomic-level structural analysis suggested that the spatial interaction and interface between sars-cov- rbd and ace was mostly in accordance with the sars-cov case [ ] . besides, a cryo-em structure of sars-cov- s protein trimer published recently showed that one of the three rbds was in "up" conformation and naturally exposed the whole interaction interface [ ] , while the classic closed symmetric trimer still existed [ ] . that might explain why sars-cov- is much more contagious than sars-cov and causing tricky problems worldwide. no effective cure or vaccine is currently available for covid- . based on structure information above, blocking sars-cov- rbd is a rational therapeutic approach. here we developed a competitive biopanning strategy to efficiently isolate blocking antibodies from phage display antibody libraries. several high-affinity antibodies targeting sars-cov- rbd and blocking its binding to ace were isolated, and the top lead exhibited a neutralization activity of sars-cov- pseudotyped vsv infection. recombinant proteins ace -his was purchased from novoprotein (shanghai, china). ace -hfc and sars-cov- rbd-his were purchased from sino biological (beijing, china). sars-cov- rbd-mfc was expressed using ablink biotech's hek f expression system. a synthetic human fab antibody library ab (libab ) was constructed according to a procedure previously described [ ] . human germline immunoglobulin variable segments vh - and vl - were employed as templates, the complementarity-determining regions l (cdr-l ) and h (cdr-h ) was diversified by the designed mutagenic oligonucleotides. 
the oligonucleotides were synthesized using the trimer phosphoramidites mix z (glen research) containing codons for amino acids in the following molar ratios: % each y, s &g, % each t & a, and % each p, h, r, f, w, v & l. the number of positions denoted by z in cdr-l (qq (z)n plt) and -h (ar (z) n (a/g/d/y) fdy) was varied from to and to , respectively. the library size is estimated to be × . antibodies against rbd were screened at the first round using a standard biopanning protocol [ ] . briefly, rbd-his was coated on -well maxisorp plates at °c overnight. after the coating buffer was decanted, the plate was blocked with % polyvinyl alcohol (pva) at room temperature for hour. μl of phage libraries ( pfu/ml) was added per well for -hour binding. after washing eight times with pt buffer ( . % tween- in pbs), bound phages were eluted with mm hcl ( μl per well), followed by -min incubation. the eluent was transferred into a . ml microfuge tube and neutralized with m tris-hcl (ph . ). half the neutralized phage solution was mixed with ml of actively growing e. coli neb -alpha f' (od = . ) in ×yt media containing μg/ml tetracycline and incubated at °c for hour. pfu of m k helper phages were added next and incubated for another hour. the infected bacteria were amplified in ml ×yt medium containing μg/ml carbenicillin and μg/ml kanamycin, shaking at rpm and growing overnight at °c. the next day, phages were harvested in precipitant with peg/nacl solution and resuspended in pbs buffer for the following rounds of panning. after the first round of the standard biopanning, a competitive biopanning protocol that included steps of competitive binding, magnetic separation, elution and amplification ( fig. ) , was applied to isolate the epitope-specific antibodies. briefly, μl of ace -hfc protein ( μg/ml) was coated on the -well maxisorp plates. 
the wells were washed and blocked with % pva, and then the mixture of antibody library ( pfu per well) and free rbd-his protein ( ng per well) was added. after a -hour competitive binding, the supernatant was transferred into a . ml microfuge tube containing the pre-washed ni-nta magnetic beads (genscript) and incubated on a shaker at room temperature for hour. beads were collected using the magnetic separation rack and washed with the pt buffer for times. bound phages were eluted with mm hcl ( μl per tube) after -min incubation. beads were collected using the magnetic separation rack, and the supernatant was transferred into a tube for neutralization. half the neutralized phage solution was mixed with ml of actively growing neb alpha f' cells and amplified as in the standard biopanning protocol. μl of the bacterial culture before infection with helper phages was taken, diluted, and grown on the lb plates containing μg/ml carbenicillin at °c overnight. the single clones were picked the next day for the phage elisa assay. fig. schematic presentation of a competitive biopanning strategy. a specific binder of the target protein was added during the binding step for the selection of blocking antibodies. in this work, the immobilized ace -hfc captured rbd-his and the antibodies binding rbd at different epitopes, forming a complex like a "sandwich". however, when an antibody recognized the same or similar epitopes within rbd as ace did, it could block the rbd-ace interaction. such antibodies would bind to the free rbd-his in the supernatant and be subsequently separated by the ni-nta magnetic beads. single clones were inoculated into μl ×yt medium containing μg/ml carbenicillin, μg/ml kanamycin and pfu/ml helper phages in -deep-well plates and incubated overnight at °c and rpm. the plates were centrifuged at , rpm and the supernatant was applied for phage elisa. the -well maxisorp plates were coated overnight at °c with rbd-mfc ( μg/ml, μl per well).
after blocking with % pva, plates were incubated with μl bacterial supernatant containing phages for hours at room temperature. after six times of wash with pt, bound phages were detected using an hrp-conjugated anti-m antibody (sino biological) and tetramethyl benzidine (tmb) as substrate. absorption at nm was measured. vh and vl of the positive phage were subcloned respectively into the pfusess-chig-hg and pfusess-clig-hk (invitrogen). antibodies were transiently expressed in freestyle™ hek -f cells (life technologies) using fectin transfection reagent according to manufacturer's instructions. after transfection, cells were grown in the serum-free medium for an additional days. the supernatant was collected and purified on a mabselect protein a column (ge healthcare). eluted igg was dialyzed against pbs and stored at - °c. recombinant human ace -his ( μg/ml, μl per well) was coated on -well maxisorp plates, followed by a pre-incubated mixture of the anti-rbd antibody titrated into a constant amount of rbd-mfc ( µg/ml). rbd binding to ace was detected using hrp conjugated anti-mouse fc antibody. the neutralization effects of antibodies on sars-cov- pseudovirus were performed by the genscript inc. (nanjing, china) under a research service contract. briefly, , of the human ace -overexpressing hela monoclonal cells were seeded into each well of a -well plate. sars-cov- pseudovirus and antibodies were incubated at ambient temperature for hour. the mixture was transferred into wells and incubated with cells at °c, % co for hours. the culture medium was freshly replaced, and cells were incubated for another hours. the culture medium was removed, and cells were rinsed with pbs. µl lysis buffer was added and further incubated at ambient temperature for minutes. µl supernatant was transferred to a sterile un-clear -well plate with the bio-glo luciferase substrate added, and the luminescence signal was measured with envision. 
the dose response curves were plotted with the relative luminescence unit against the antibody concentration. the assay results were processed by microsoft office excel and graphpad prism . high-affinity antibodies were identified by the phage elisa after rounds of the competitive biopanning, clones were randomly selected. their properties of binding to rbd were measured using phage elisa. positive binding was defined as an od reading two or more times higher than the negative control (pva alone). clones showed positive signals (fig. ) . after the dna sequencing, these clones were summarized into groups of unique antibodies. identification of positive clones to immobilized antigen in competitive manner. taken od readings as measurement, data of each group fluctuated within %. the highest-ranking one was named rrbd- in this work. rrbd- , the top lead with the highest od reading isolated from the competitive biopanning, and rrbd- , the top lead isolated from the standard biopanning at round , were expressed as full-length igg antibodies using the f expression system. their binding and blocking abilities against rbd were compared. both rrbd- and rrbd- had high affinities for rbd, with ec at . nm and . nm, respectively. only rrbd- blocked the binding of rbd to ace with an ic at . nm, while rrbd- did not. as a positive control, the recombinant ace -hfc ( µg/ml) totally inhibited the infection of ace -overexpressing hela monoclonal cells with sars-cov- pseudovirus. the antibody rrbd- showed a significant neutralization activity against the sars-cov- pseudovirus with ic values of . nm. however, the antibody rrbd- had no neutralization effect of the pseudovirus and there were no significant differences between the highest concentration antibody group and the blank group without antibody addition. two sars-cov- rbd-specific antibodies selected from different strategies showed different neutralization activities. 
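the ic values reported above are read from dose-response curves of luminescence against antibody concentration. the following is a minimal python sketch of one simple way to obtain such a value, assuming percent inhibition is computed relative to a virus-only control and the half-maximal point is found by linear interpolation on a log concentration axis. this is a stand-in for illustration only: the source states that graphpad prism was used, and the function names, the interpolation approach, and all readings below are hypothetical.

```python
import math


def percent_inhibition(signal, virus_only, background=0.0):
    """Percent inhibition of a well relative to the no-antibody (virus-only) control."""
    return 100.0 * (1.0 - (signal - background) / (virus_only - background))


def interpolate_ic50(concentrations, inhibitions):
    """Estimate IC50 by linear interpolation on a log10 concentration axis.

    Returns None when inhibition never rises through 50% within the tested range.
    """
    points = sorted(zip(concentrations, inhibitions))
    for (c_lo, i_lo), (c_hi, i_hi) in zip(points, points[1:]):
        if i_lo < 50.0 <= i_hi:
            fraction = (50.0 - i_lo) / (i_hi - i_lo)
            log_ic50 = math.log10(c_lo) + fraction * (math.log10(c_hi) - math.log10(c_lo))
            return 10.0 ** log_ic50
    return None


# Hypothetical readings (relative luminescence units), not data from the study.
virus_only_rlu = 10000.0
concentrations_nm = [0.1, 1.0, 10.0, 100.0]
rlu = [9000.0, 7500.0, 2500.0, 500.0]
inhibitions = [percent_inhibition(s, virus_only_rlu) for s in rlu]
print(interpolate_ic50(concentrations_nm, inhibitions))
```

interpolation needs at least one tested concentration on each side of 50% inhibition; a full four-parameter logistic fit would be the more usual choice when the curve is well sampled.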
the antibody rrbd- , which competed with ace , could neutralize sars-cov- pseudovirus, but rrbd- could not. the rbd-ace interaction initiates viral infection by both sars-cov and sars-cov- . their rbds share high sequence identity ( %) and structural homology, so the well-established sars-cov antibodies were at first assumed to be shortcut therapeutic candidates for sars-cov- . however, the real scenario is much more problematic. several independent peer-reviewed studies, as well as preprints, have shown that all structurally characterized sars-cov-specific antibodies, including s , r, m and f g , have no cross-reactivity with sars-cov- [ , , ]. these antibodies all compete with ace to bind sars-cov rbd, but their epitopes overlap only partially with the full ace -rbd interface, which could be the reason for the lack of cross-reactivity. cr is a special case with % conserved key residues in the epitope between sars-cov- and sars-cov. its cross-reactivity was remarkable, but the loss of one n-glycan site results in a ~ magnitude reduction of binding affinity to sars-cov- rbd [ ]. likewise, in humans, rbd-specific monoclonal antibodies derived from individuals who recovered from covid- showed a similar pattern of no cross-reactivity with either sars-cov or mers-cov [ ]. in general, structural and functional analysis suggests that targeting sars-cov- rbd could be a direct and promising therapeutic strategy, while focusing on previous sars-cov antibodies is neither ideal nor efficient. no sars-cov- rbd-specific monoclonal antibody has been reported from human antibody libraries (up to april th , ). in the meantime, sars-cov- has spread unexpectedly fast around the world, and a recent study revised its basic reproductive number (r ) from . to . [ ]. a rapid and effective method of obtaining sars-cov- neutralizing antibodies is much needed.
naïve antibody libraries derived from natural immune systems have capacity limits, while synthetic libraries with higher diversity offer more opportunities to isolate binders, especially for novel infectious antigens. compared to a naïve antibody library of ~ diversity, a synthetic library with additional artificial randomization on cdrs can reach diversity as high as ~ . once the recombinant rbd and ace proteins were ready, it took weeks to isolate, produce and verify the antibodies in this study. using the standard biopanning method, we enriched rbd-specific phages from our synthetic lib ab but not from our naïve antibody libraries (data not shown). unfortunately, the top lead rrbd- from the standard biopanning of lib ab could not block the rbd-ace interaction (fig. ), although it did bind to rbd with an ec of . nm (fig. ). the clinical potential and applications of an antibody often depend on the epitopes it binds on the target protein. a high-affinity antibody against the target protein can be screened from a phage display antibody library using the standard biopanning process, but its binding epitopes must then be identified by extra steps, such as epitope mapping and competitive elisa. we therefore developed a new competitive biopanning strategy to efficiently isolate epitope-specific antibodies from libraries. as expected, the top lead rrbd- successfully bound to rbd in competition with ace both in solution and in pseudovirus, and its binding affinity is quite high, at ~ nm depending on the measuring method. in conclusion, our strategy for discovering human monoclonal antibodies against sars-cov- rbd may fill gaps in antibody-related pharmaceutical development and shed light on new treatments needed to address global health concerns.
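the diversity argument above can be made concrete with a small count. the following python sketch computes the theoretical sequence diversity of the two randomized cdrs described in the methods, assuming each z position draws from the twelve amino acids listed for the trimer mix (y, s, g, t, a, p, h, r, f, w, v, l) and that the heavy-chain pattern carries one extra four-way position (a/g/d/y). the length ranges passed in below are hypothetical, because the actual ranges are not recoverable from the text, and the function names are illustrative.

```python
# Amino acids encoded by the trimer phosphoramidite mix Z, as listed in the methods.
Z_AMINO_ACIDS = "YSGTAPHRFWVL"


def cdr_diversity(z_lengths, fixed_choices=1):
    """Count distinct CDR sequences over all allowed numbers of Z positions.

    fixed_choices covers any additional position with a small fixed set of
    residues, such as the (a/g/d/y) position in the heavy-chain pattern.
    """
    n = len(set(Z_AMINO_ACIDS))
    return sum(fixed_choices * n ** length for length in z_lengths)


def library_diversity(light_lengths, heavy_lengths):
    """Theoretical diversity when the light and heavy CDRs vary independently."""
    light = cdr_diversity(light_lengths)                   # pattern qq (z)n plt
    heavy = cdr_diversity(heavy_lengths, fixed_choices=4)  # pattern ar (z)n (a/g/d/y) fdy
    return light * heavy


# Hypothetical length ranges, chosen only to illustrate the calculation.
print(library_diversity(range(3, 5), range(2, 4)))
```

this counts distinct sequences only. it ignores the unequal molar ratios of the codon mix, so sequences are not equally likely, and the physical library size reported in the methods is a separate, experimentally limited quantity.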
cryo-em structure of the sars coronavirus spike glycoprotein in complex with its host cell receptor ace unexpected receptor functional mimicry elucidates activation of coronavirus fusion cryo-electron microscopy structures of the sars-cov spike glycoprotein reveal a prerequisite conformational state for receptor binding structural and functional basis of sars-cov- entry by using human ace cryo-em structure of the sars-cov- spike in the prefusion conformation structure, function, and antigenicity of the sars-cov- spike glycoprotein a single-framework synthetic antibody library containing a combination of canonical and variable complementarity-determining regions identifying specificity profiles for peptide recognition modules from phage-displayed peptide libraries potent human neutralizing antibodies elicited by sars-cov- infection high contagiousness and rapid spread of severe acute respiratory syndrome coronavirus we thank chengdu zicheng yibo biotechnology co., ltd for providing the laboratory consumables and bovine serum. this work was supported by sichuan science and technology program ( rz ), the program of sars-cov- protection (cyhx , kezhi people's air-defense equipment co., ltd) and the program of sars-cov- antibody discovery (jl c- , ablink biotech co., ltd). key: cord- -dxmto tu authors: zhao, tom y.; patankar, neelesh a. title: tetracycline as an inhibitor to the coronavirus sars-cov- date: - - journal: nan doi: nan sha: doc_id: cord_uid: dxmto tu the coronavirus sars-cov- remains an extant threat against public health on a global scale. cell infection begins when the spike protein of sars-cov- binds with the cell receptor, angiotensin-converting enzyme (ace ). here, we address the role of tetracycline as an inhibitor for the receptor-binding domain (rbd) of the spike protein. targeted molecular investigation show that tetracycline binds more favorably to the rbd (- . kcal/mol) compared to chloroquine (- . kcal/mol) or doxycycline (- . 
kcal/mol) and inhibits attachment to ace to a greater degree (binding efficiency of . $frac{text{kcal}}{text{mol}cdot text{nm}^ }$ for tetracycline-rbd, . $frac{text{kcal}}{text{mol}cdot text{nm}^ }$ for chloroquine-rbd, . $frac{text{kcal}}{text{mol}cdot text{nm}^ }$ for doxycycline-rbd). stronger tetracycline inhibition is verified with nonequilibrium pmf calculations, for which the tetracycline-rbd complex exhibits the lowest free energy profile along the dissociation pathway from ace . tetracycline appears to target viral residues that are usually involved in significant hydrogen bonding with ace ; this inhibition of cellular infection complements the anti-inflammatory and cytokine suppressing capability of tetracycline, and may further reduce the duration of icu stays and mechanical ventilation induced by the coronavirus sars-cov- . the extreme urgency for therapeutics against the acute respiratory syndrome coronavirus (sars-cov- ) drives the review of existing drugs for their ability to inhibit the function of this virus ; . tetracycline has been proposed as a strong candidate against sars-cov- due to its lipophilic nature, anti-inflammatory response, as well as its ability to chelate zinc species on matrix metalloproteinases (mmps). tetracycline class antibiotics have also been shown to be effective in reducing the duration of ventilatory support and icu stay from acute respiratory distress syndrome , and doxycycline has been suggested to be an important component in combination therapy for its anti-viral properties . tetracycline as well as a broad band of related antibiotics have been approved by the fda ; . in this work, we quantify the performance of to whom correspondence may be addressed. email: tomzhao@u.northwestern.edu ; n-patankar@northwestern.edu tetracycline in inhibiting the binding of the sars-cov spike protein to ace . 
tetracycline is found to bind more favorably to the receptor binding domain (rbd) of the spike protein compared to doxycycline or chloroquine, which was included in this study as a baseline. the tetracycline-rbd complex also displays lower binding efficiency to the human cell receptor ace . the sars-cov rbd, ace , tetracycline, and chloroquine molecular structures were obtained from rcsb pdb ( m j, uxo, v o, xrl) ; ; ; . missing hydrogen atoms were appended, after which structural preparation and molecular docking with full ligand and protein backbone flexibility were carried out using the rosetta suite ; ; . the resulting complexes were inspected manually, after which the binding affinities of the best-scoring complexes were gauged us-ing mm/pbsa calculations after ns equilibrium molecular dynamics simulations ; ; . the potentials of mean force (pmf) along the dissociation pathway of these rbd complexes from ace were found in lammps using steered molecular dynamics after parameterization with charmm . jarzynski's equality was employed to calculate the free energy profile for each rbd complex from statistically independent trajectories . tetracycline exhibits higher binding affinity to the rbd in both blind and site-specific docking (- . kcal/mol) compared to doxycycline (- . kcal/mol) or chloroquine (- . vs . kcal/mol) as delineated in table . the amino acid residues of the rbd involved in hydrogen bonding with the tetracycline molecule are tyr , asn , gly , and tyr (fig. ) , which have been shown to be crucial for the sars-cov rbd in binding to ace for cellular access . these four residues comprise major hot spots that form persistent hydrogen bonds with ace . meanwhile, the amino acids of rbd that interact with chloroquine in the site-specific configuration are lys , arg , arg and arg , of which none are involved in extended hydrogen bonding with ace . 
tetracycline appears to bind preferentially to polar or slightly lipophilic rbd residues, which comprise the majority of amino acids that form persistent hydrogen bonds with ace ; . other tetracycline derivatives such as doxycycline or minocycline are known to be more lipophilic ; ; and may therefore prefer nonpolar residues that are often buried beneath the solvent accessible surface area of the spike protein. indeed, the rbd residues that have the highest binding affinity to doxycycline are tyr , gly , val , gly , of which only two overlap with rbd amino acids that engage in extended hydrogen bonding with ace . at the other end of the spectrum, chloroquine targets clusters of charged residues on the rbd that do not actively participate in hydrogen bonding with the cell receptor ace . the binding efficiency (magnitude of binding energy normalized by contact interface area) of the sars-cov rbd-ace complex was found to be . kcal/(mol·nm ). for the protein-ligand complex tetracycline-rbd, the binding efficiency with ace ( . kcal/(mol·nm )) is significantly lower than that for chloroquine-rbd ( . kcal/(mol·nm )) and doxycycline-rbd ( . kcal/(mol·nm )) as displayed in table . a survey of hydrogen bonding lifetimes between the important binding site residues in the rbd and ace shows that the tetracycline-inhibited rbd exhibits the least hydrogen bonding activity with ace (fig. ). this suggests that tetracycline not only binds more favorably to the receptor binding domain of the spike protein but also inhibits the binding of the rbd to ace to a greater degree. to verify this statement, steered molecular dynamics simulations were carried out to find the potential of mean force (pmf) along a single dissociation pathway for the inhibited and uninhibited rbd-ace complexes.
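as an illustration of how a free energy can be recovered from steered molecular dynamics pulls, the sketch below applies jarzynski's equality, ΔF = -(1/β) ln⟨exp(-βW)⟩, to a set of work values from independent trajectories. it is a minimal sketch and not the authors' lammps workflow: the function name, the default temperature, and the sample work values are assumptions chosen for illustration only.

```python
import math

# Boltzmann constant in kcal/(mol*K), so that work values can be given in kcal/mol.
K_B = 0.0019872041


def jarzynski_free_energy(works, temperature=300.0):
    """Estimate a free energy difference from nonequilibrium work values.

    works: work (kcal/mol) from statistically independent pulling trajectories.
    temperature: simulation temperature in kelvin (300 K is an assumed default).
    """
    if not works:
        raise ValueError("at least one work value is required")
    beta = 1.0 / (K_B * temperature)
    # Shift by the minimum work before exponentiating (log-sum-exp trick)
    # so that large work values do not underflow to zero.
    w_min = min(works)
    mean_exp = sum(math.exp(-beta * (w - w_min)) for w in works) / len(works)
    return w_min - math.log(mean_exp) / beta


# Hypothetical work values (kcal/mol) at one point along a dissociation pathway.
example_works = [4.0, 6.0]
delta_f = jarzynski_free_energy(example_works)
```

repeating this estimate at successive positions along the pulling coordinate gives a pmf profile. the exponential average is dominated by low-work trajectories, so the estimate never exceeds the mean work and converges slowly when the work distribution is broad.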
figure shows that the pmf for unbinding of the tetracycline-rbd complex from ace was the lowest of the three structures tested, which agrees with the binding efficiencies found from equilibrium simulations. this disruption of the rbd-ace interface may therefore inhibit the signaling cascade initiated during binding of the viral spike protein.

complex             binding efficiency to ace (kcal/(mol·nm ))
rbd                 .
chloroquine-rbd     .
doxycycline-rbd     .
tetracycline-rbd    .

table : the binding efficiency (magnitude of binding energy normalized by contact interface area) of the spike protein rbd as well as the tetracycline-rbd, doxycycline-rbd and chloroquine-rbd complexes to the human cell receptor ace . binding efficiency is lowest for the tetracycline-rbd complex, indicating that tetracycline is a more effective inhibitor.

the tetracycline class of antibiotics, including tetracycline, oxytetracycline, and doxycycline, may be helpful in the fight against the coronavirus sars-cov- , due to their preferential association with the important residues in the viral receptor binding domain and the resulting strong inhibition of the rbd-ace complex. further experimental studies are recommended to validate how this reduction of cellular infection complements or enhances the anti-inflammatory and anti-viral properties of tetracyclines in their role as treatment for sars-cov- .

author contributions: t.y.z. conceived and planned the research, as well as performed calculations. n.a.p. and t.y.z. performed analysis and wrote the manuscript.
silico identification of potent inhibitors of covid- main protease (mpro) and angiotensin converting enzyme (ace ) from natural products: quercetin, hispidulin, and cirsimaritin exhibited better potential inhibition than hydroxy-chloroquine against covid- main protease active site and ace docking study of chloroquine and hydroxychloroquine interaction with rna binding domain of nucleocapsid phospho-protein -an in silico insight into the comparative efficacy of repurposing antiviral drugs therapeutic potential for tetracyclines in the treatment of covid- a proposed randomized, double blind, placebo controlled study evaluating doxycycline for the prevention of covid- infection and disease in healthcare workers with ongoing high risk exposure to covid- prophylaxis with tetracyclines in ards: potential therapy for covid- -induced ards? medrxiv doxycycline as a potential partner of covid- therapies fda approved antibacterial drugs structural and simulation analysis of hotspot residues interactions of sars-cov with human ace receptor heuristic molecular lipophilicity potential (hmlp): lipophilicity and hydrophilicity of amino acid side chains the anti-amyloidogenic action of doxycycline: a molecular dynamics study on the interaction with ab screening of chloroquine, hydroxychloroquine and its derivatives for their binding affinity to multiple sars-cov- protein drug targets the molecular docking study of potential drug candidates showing anti-covid- activity by exploring of therapeutic targets of sars-cov- binding efficiency of protein-protein complexes structure of the sars-cov- spike receptor-binding domain bound to the ace receptor crystal structures of multidrug binding protein ttgr in complex with antibiotics and plant antimicrobials cooperativity and stability of the tetracycline repressor (tetr) upon tetracycline binding rosettaligand docking with full ligand and receptor flexibility rosetta ligand docking with flexible xml protocols rosettaligand: protein-small 
molecule docking with full sidechain flexibility gromacs: a message-passing parallel molecular dynamics implementation : a package for molecular simulation and trajectory analysis. molecular modeling annual a. g_mmpbsa-a gromacs tool for high-throughput mm-pbsa calculations accurate determination of the binding free energy for kcsacharybdotoxin complex from the potential of mean force calculations with restraints. biophysical journal fast parallel algorithms for short-range molecular dynamics charmm: the biomolecular simulation program computing equilibrium free energies using non-equilibrium molecular dynamics the authors have no competing financial interests or other interests that might be perceived to influence the results and/or discussion reported in this paper. key: cord- - rfdkuw authors: chen, jiahui; gao, kaifu; wang, rui; wei, guowei title: prediction and mitigation of mutation threats to covid- vaccines and antibody therapies date: - - journal: nan doi: nan sha: doc_id: cord_uid: rfdkuw antibody therapeutics and vaccines are among our last resort to end the raging covid- pandemic.they, however, are prone to over , mutations uncovered by a mutation tracker. it is urgent to understand how vaccines and antibodies in the development would be impacted by mutations. in this work, we first study the mechanism, frequency, and ratio of mutations on the spike (s) protein, which is the common target of most covid- vaccines and antibody therapies. additionally, we build a library of antibody structures and analyze their d and d characteristics. moreover, we predict the mutation-induced binding free energy (bfe) changes for the complexes of s protein and antibodies or ace . 
by integrating genetics, biophysics, deep learning, and algebraic topology, we deduce that some of the mutations such as m i, s f, and s f may weaken the binding of s protein and antibodies, and potentially disrupt the efficacy and reliability of antibody therapies and vaccines under development. we provide a strategy to prioritize the selection of mutations for designing vaccines or antibody cocktails. the expeditious spread of the coronavirus disease pandemic caused by severe acute respiratory syndrome coronavirus (sars-cov- ) has led to , , confirmed cases and , , fatalities as of september , . in the st century, three major outbreaks of deadly pneumonia have been caused by β-coronaviruses: sars-cov ( ) , middle east respiratory syndrome coronavirus (mers-cov) ( ), and sars-cov- ( ) [ ] . similar to sars-cov and mers-cov, sars-cov- causes respiratory infections, and the transmission of viruses occurs among family members or in healthcare settings at the early stages of the outbreak. however, sars-cov- has an unprecedentedly high infection rate compared to sars-cov and mers-cov [ ] . considering the high infection rate, high prevalence rate, long incubation period [ ] , asymptomatic transmission [ , ] , and potential seasonal pattern [ ] of covid- , the development of specific antiviral drugs, antibody therapies, and effective vaccines is of paramount importance. traditional drug discovery takes more than ten years, on average, to bring a new drug to the market [ ]. however, developing potent sars-cov- specific antibodies and vaccines is a relatively more efficient and less time-consuming strategy to combat covid- during the ongoing pandemic [ ] . antibody therapies and vaccines depend on the host immune system. recent studies have examined the host-pathogen interaction, host immune responses, and the pathogen immune evasion strategies [ ] [ ] [ ] [ ] [ ] [ ] , which provide insight into the mechanism of antibody therapies and vaccine development.
the immune system is a host defense system that protects the host from pathogenic microbes, eliminates toxic or allergenic substances, and responds to invading pathogens [ ] . it has two major subsystems: the innate immune system and the adaptive immune system. the innate system provides an immediate but non-specific response, whereas the adaptive immune system provides a highly specific and effective immune response. once the pathogen breaches the first physical barriers, such as epithelial cell layers, the secreted mucus layer, and mucous membranes, the innate system is triggered to identify pathogens by pattern recognition receptors (prrs), which are expressed on dendritic cells, macrophages, or neutrophils [ ] . specifically, prrs identify pathogen-associated molecular patterns (pamps) located on pathogens and then activate complex signaling pathways that induce inflammatory responses mediated by various cytokines and chemokines, which promote the eradication of the pathogen [ , ] . notably, the transmission of sars-cov- occurs even in asymptomatic infected individuals, which may delay the early response of the innate immune system [ ] . another important line of host defense is the adaptive immune system. b lymphocytes (b cells) and t lymphocytes (t cells) are special types of leukocytes that are the acknowledged cellular pillars of the adaptive immune system [ ] . two major subtypes of t cells are involved in the cell-mediated immune response: killer t cells (cd + t cells) and helper t cells (cd + t cells). the killer t cells eradicate cells invaded by pathogens with the help of major histocompatibility complex (mhc) class i. mhc class i molecules are expressed on the surface of all nucleated cells [ ] . when viruses infect nucleated cells, the cells first degrade foreign proteins via antigen processing.
then, the peptide fragments are presented by mhc class i, which activates killer t cells to eliminate these infected cells by releasing cytotoxins [ ] . similarly, helper t cells cooperate with mhc class ii, a type of mhc molecule that is constitutively expressed on antigen-presenting cells, such as macrophages, dendritic cells, monocytes, and b cells [ ] . helper t cells express t cell receptors (tcr) to recognize antigen bound to mhc class ii molecules. however, helper t cells do not have cytotoxic activity and therefore cannot kill infected cells directly. instead, the activated helper t cells release cytokines to enhance the microbicidal function of macrophages and the activity of killer t cells [ ] . notably, an unbalanced response can result in a "cytokine storm," which is the main cause of fatality in covid- patients [ ] . correspondingly, a b cell is involved in the humoral immune response and identifies pathogens by binding to foreign antigens with the b cell receptors (bcrs) located on its surface. the antigens that are recognized by antibodies are degraded to peptides in b cells and displayed by mhc class ii molecules. as mentioned above, helper t cells can recognize the signal provided by mhc class ii and upregulate the expression of cd ligand, which provides extra stimulation signals to activate antibody-producing b cells [ ] , yielding millions of copies of antibodies (ab) that recognize the specific antigen. additionally, when the antigen first enters the body, t cells and b cells are activated, and some of them differentiate into long-lived memory cells, such as memory t cells and memory b cells. if the same antigen is encountered again in the future, these long-lived memory cells quickly and specifically recognize it and initiate a corresponding immune response to eliminate it [ ] .
vaccination works by stimulating the primary immune response of the human body, which activates t cells and b cells to generate the antibodies and long-lived memory cells that prevent infectious diseases. it is one of the most effective and economical means of combating covid- at this stage. as mentioned above, antibodies are secreted by b cells of the adaptive immune system and can recognize and bind to specific antigens. conventional antibodies (immunoglobulins) are y-shaped molecules that have two light chains and two heavy chains [ ] . each light chain is connected to a heavy chain via a disulfide bond, and the heavy chains are connected through two disulfide bonds in the mid-region known as the hinge region. each light and heavy chain contains two distinct regions: constant regions (stem of the y) and variable regions ("arms" of the y) [ ] . an antibody binds the antigenic determinant (also called epitope) through the variable regions at the tips of the heavy and light chains. there is an enormous amount of diversity in the variable regions. therefore, different antibodies can recognize many different types of antigenic epitopes. to be specific, there are three complementarity determining regions (cdrs) that are arranged non-consecutively at the tips of each variable region. cdrs generate most of the diversity between antibodies and determine the specificity of individual antibodies. in addition to conventional antibodies, camelids also produce heavy-chain-only antibodies (hcabs). hcabs, also referred to as nanobodies, or vhhs, contain a single variable domain (vhh) that makes up the equivalent antigen-binding fragment (fab) of conventional immunoglobulin g (igg) antibodies [ ] . this single variable domain typically can acquire affinity and specificity for antigens comparable to conventional antibodies.
nanobodies can easily be constructed into multivalent formats and have higher thermal stability and chemostability than most antibodies do [ ] . another advantage of nanobodies is that they are less susceptible to steric hindrance than large conventional antibodies [ ] . considering the broad specificity of antibodies, seeking potential antibody therapies has become one of the most feasible strategies to fight against sars-cov- . in general, antibody therapy is a form of immunotherapy that uses monoclonal antibodies (mab) to target pathogenic proteins. the binding of antibody and pathogenic antigen can facilitate immune response, direct neutralization, radioactive treatment, the release of toxic agents, or cytokine storm inhibition (aka immune checkpoint therapy). sars-cov- entry into a human cell is facilitated by a series of interactions between its spike (s) protein and the host receptor angiotensin-converting enzyme (ace ), primed by host transmembrane protease, serine (tmprss ) [ ] . as such, most covid- antibody therapeutic developments focus on sars-cov- spike protein antibodies that were initially generated from patient immune response and on t-cell pathway inhibitors that block t-cell responses. a large number of antibody therapeutic drugs are in clinical trials. currently, most antibody therapy developments focus on the use of antibodies isolated from patient convalescent plasma to directly neutralize sars-cov- [ ] [ ] [ ] , although there are efforts to alleviate cytokine storm. a more effective and economical means to fight against sars-cov- is vaccination [ ] , which is the most anticipated approach for preventing the covid- pandemic.
a vaccine is designed to stimulate effective host immune responses and provide active acquired immunity by exploiting the body's immune system, including the production of antibodies. it is made of an antigenic agent that resembles a disease-causing microorganism, its surface protein, or the genetic material that is needed to generate the surface protein. for sars-cov- , the first choice of surface proteins is the spike protein. there are four types of covid- vaccines, as shown in figure . 1) virus vaccines use the virus itself, in a weakened or inactivated form. 2) viral-vector vaccines are designed to genetically engineer a weakened virus, such as measles or adenovirus, to produce coronavirus s proteins in the body. both replicating and non-replicating viral-vector vaccines are being studied now. 3) nucleic-acid vaccines use dna or mrna to produce sars-cov- s proteins inside host cells to stimulate the immune response. 4) protein-based vaccines are designed to directly inject coronavirus proteins, such as s protein or membrane (m) protein, or their fragments, into the body. both protein subunits and viral-like particles (vlps) are under development for covid- [ ] . among these technologies, nucleic-acid vaccines are safe and relatively easy to develop [ ] . however, they have not previously been approved for any human use. moreover, the general population's safety concerns are a major factor hindering the rapid approval of vaccines and antibody therapies. a major potential challenge is antibody-dependent enhancement, in which the binding of a virus to suboptimal antibodies enhances its entry into host cells. all vaccine and antibody therapeutic developments are currently based on the reference viral genome reported on january , [ ] .
sars-cov- belongs to the coronaviridae family and the nidovirales order, which has been shown to have a genetic proofreading mechanism regulated by non-structural protein (nsp ) in synergy with nsp , i.e., rna-dependent rna polymerase (rdrp) [ , ] . therefore, sars-cov- has a higher fidelity in its transcription and replication process than other single-stranded rna viruses, such as the flu virus and hiv. even so, the s protein of sars-cov- has been undergoing many mutations, as reported in [ , ] . as of september , a total of mutations on the s protein have been detected in complete sars-cov- genome sequences. therefore, it is of paramount importance to establish a reliable computational paradigm to predict and mitigate the impact of sars-cov- mutations on vaccines and antibody therapies. moreover, the efficacy of a given covid- vaccine depends on many factors, including sars-cov- biological properties associated with the vaccine, mutation impacts, vaccination schedule (dose and frequency), idiosyncratic response, and assorted factors such as ethnicity, age, gender, or genetic predisposition. the effect of covid- vaccination also depends on the fraction of the population who accept vaccines. it is essentially unknown at this moment how these factors will unfold for covid- vaccines. there is no doubt that any preparation that leads to an improvement in the covid- vaccination effect will be of tremendous significance to human health and the world economy. therefore, in this work, we integrate genetic analysis and computational biophysics, including artificial intelligence (ai), as well as additional enhancement from advanced mathematics to predict and mitigate mutation threats to covid- vaccines and antibody therapies. we perform single nucleotide polymorphism (snp) calling [ , ] to identify sars-cov- mutations. for mutations on the s protein, we analyze their mechanism [ ] , frequency, ratio, and secondary structural traits.
we construct a library of all existing antibody structures from the protein data bank (pdb) and analyze their two-dimensional ( d) and three-dimensional ( d) characteristics. we further predict the mutation-induced binding affinity changes of antibody and s protein complexes using a topology-based network tree (topnettree) [ ] , which is a state-of-the-art model that integrates deep learning and algebraic topology [ ] [ ] [ ] . after identifying mutations that are potentially disruptive to antibody and s protein interactions, we further infer their threats to vaccines based on antibody binding site analysis, mutation-induced disruptive free energy, and mutation occurrence frequency. we combine frequency and free energy change to prioritize mutation threats and guide the development of future vaccines and antibody therapies. as a fundamental biological process, mutagenesis changes the organism's genetic information and serves as a primary source for many kinds of cancer and heritable diseases; it is also a driving force for evolution [ , ] . generally speaking, virus mutations are shaped by natural selection, replication mechanism, cellular environment, polymerase fidelity, gene editing, random genetic drift, recent epidemiology features, host immune responses, etc. [ , ] . notably, understanding how mutations have changed the sars-cov- structure, function, infectivity, activity, and virulence is of great importance for devising life-saving strategies in virus control, containment, prevention, and medication, especially in the development of antibodies and vaccines. genome sequencing, snp calling, and phenotyping provide an efficient means to parse mutations from a large number of viral samples [ , ] (see the supporting material (s )).
in this work, we retrieved over , complete sars-cov- genome sequences from the gisaid database [ ] and created a real-time interactive sars-cov- mutation tracker ( https://users.math.msu.edu/users/weig/sars-cov- mutation tracker.html) to report over , single mutations along with their mutation frequencies on sars-cov- as of september . figure is a screenshot of our online mutation tracker. it describes the distribution of mutations on the complete coding region of sars-cov- . the y-axis shows the natural log frequency for each mutation at a specific position. a reader can download the detailed mutation snp information from our mutation tracker website. as mentioned before, the s protein has become the first choice for antibody and vaccine development. among , complete genome sequences, unique single mutations are detected on the s protein, and the h-index of s protein is [ , ] . the number of unique mutations (n u ) is determined by counting the same type of mutation in different genome isolates only once, whereas the number of non-unique mutations (n nu , i.e., frequency) is calculated by counting the same type of mutation in different genome isolates repeatedly. table lists the distribution of snp types among unique and non-unique mutations on the s protein of sars-cov- worldwide. it can be seen that c>t and a>g are the two dominant snp types, which may be due to the innate host immune response via apobec and adar gene editing [ ] . moreover, non-degenerate mutations occurred on the s protein receptor-binding domain (rbd), which are relevant to the binding of sars-cov- s protein and most antibodies as well as ace . additionally, mutations that occurred on the s protein domain (residue id: to ) are relevant to the binding of another antibody ( a ) and sars-cov- s protein. furthermore, since antibody cdrs are random coils, the complementary antigen-binding domains must involve random coils as well.
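the distinction between unique and non-unique mutation counts defined above can be made concrete with a short sketch. it assumes each genome isolate has already been reduced, by snp calling, to a collection of mutation labels. the function name and the toy labels ("mut_a", "mut_b", "mut_c") are hypothetical placeholders and do not come from the mutation tracker.

```python
from collections import Counter


def count_mutations(isolates):
    """Return (n_unique, n_non_unique, frequency) for a list of isolates.

    isolates: one iterable of mutation labels per genome isolate.
    n_unique counts each mutation type once across all isolates;
    n_non_unique counts it once per isolate that carries it.
    """
    frequency = Counter()
    for mutations in isolates:
        # set() guards against a label being listed twice within one isolate.
        frequency.update(set(mutations))
    n_unique = len(frequency)
    n_non_unique = sum(frequency.values())
    return n_unique, n_non_unique, frequency


# Hypothetical toy data: three isolates carrying placeholder mutation labels.
toy_isolates = [["mut_a", "mut_b"], ["mut_a"], ["mut_a", "mut_c"]]
n_u, n_nu, freq = count_mutations(toy_isolates)
```

in this toy case three distinct mutation types are observed five times in total, so a single widespread mutation dominates the non-unique count while contributing only one to the unique count.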
table lists the statistics of non-degenerate mutations on the secondary structures of sars-cov- s protein. here, the secondary structures are mostly extracted from the crystal structure of c l [ ] , and the missing residues are predicted by raptorx-property [ ] . we can see that for both unique and non-unique cases, the average mutation rates on the random coils of the s protein have the highest values. in particular, the a>g-(d g) mutation on the random coils has the highest frequency of . if we do not consider the a>g-(d g) mutations, then the unique and non-unique average rates on the random coils of s protein still have the highest values ( . and . ), indicating that mutations are more likely to occur on the random coils. consequently, the natural selection of mutations may tend to disrupt antibodies. table : the statistics of non-degenerate mutations on the secondary structure of sars-cov- s protein. the unique and non-unique mutations are considered in the calculation. n u , n nu , ar u , ar nu represent the number of unique mutations, the number of non-unique mutations, the average rate of unique mutations, and the average rate of non-unique mutations on the secondary structure of s protein, respectively. here, the secondary structure is mostly extracted from the crystal structure of c l, and the missing residues are predicted by raptorx-property. we construct a sars-cov- antibody library of d antibody structures deposited in the pdb. among them, the binding sites of antibodies are on the rbd of the s protein, while another antibody, a [ ] , has a distinct binding domain. additionally, mr -k y is a mutant of antibody mr [ ] . we align antibody structures, excluding mr -k y, with sars-cov- s protein in figure . ace is included as a reference. clearly, except for antibody a , all other structures bind to the s protein rbd. it is interesting to note that a is located on a different domain. the pdb ids of these complexes can be found in figure .
except for s [ ] , cr [ ] , ey a [ ] , and a , all the other antibodies have binding sites that spatially clash with that of ace . notably, the paratope of h [ ] does not overlap with that of ace directly, but in terms of d structures, their binding sites still overlap. this suggests that the binding of these antibodies is in direct competition with that of ace . theoretically, this direct competition reduces the viral infection rate. an antibody of this kind with strong binding ability will directly neutralize sars-cov- without the need for antibody-dependent cell cytotoxicity (adcc), antibody-dependent cellular phagocytosis (adcp), or other immune mechanisms. the binding sites of s , cr , and ey a on the rbd are away from that of ace , leading to the absence of binding competition [ , , ] . one study shows that the adcc and adcp mechanisms contribute to the viral control conducted by s in infected individuals [ ] . for cr , one study indicates that it neutralizes the virus in a synergistic fashion [ ] . for ey a, the hypothesis is that the binding of ey a could inhibit the glycosylation of ace [ ] . a more radical example is a [ ] , which binds to the n-terminal domain (ntd) of the s protein (figure (h)), quite far from the rbd. it is speculated that a may neutralize sars-cov- by restraining the conformational changes of the s protein, which are very important for sars-cov- cell entry [ ] . any antibody or drug that can inhibit the serine protease tmprss priming of the s protein can effectively stop the viral cell entry [ ] . figure provides a visual illustration of antibody and ace competitions. it remains to be seen, at the residue level, how these competitions play out.

figure (i): the d structure of s protein rbd. the red, green, and blue represent helix, sheet, and random coils of the rbd, respectively. a darker color represents a higher mutation frequency on a specific residue. the antibodies are s ( m j) [ ] , cc . ( xc ) [ ] , cc . and cr ( xc ) [ ] , cc . ( xc ) [ ] , cc . and cr ( xc ), c ( xcm) [ ] , regn and regn ( xdg) [ ] , cv ( xe ) [ ] , fab - ( xey) [ ] , cr ( yla) [ ] , h -d ( yz ), cr and h -d ( z m) [ ] , h -h ( zbp), ey z and nanobody ( zcz) [ ] , ey z ( zer) [ ] , p b- f ( bwj) [ ] , bd ( byr) [ ] , b ( bz ) [ ] , cb ( c ) [ ] , a ( c l) [ ] , sr ( c v) [ ] , b ( c w), h ( cah) [ ] , mr -k y ( can) [ ] , bd- ( ch ), bd- ( ch ), bd- ( chb), bd- and bd- - ( che), bd- and bd- - ( chf), bd- - ( chh), cova - ( jmo) [ ] , and cova - ( jmp) [ ] .

to better understand the antibody and s protein interactions, we study the residue contacts between antibodies and the s protein. we include ace as a reference but exclude antibodies a and mr -k y. in figure , the paratopes of antibodies and ace were aligned on the s protein rbd d sequence, and their contact regions are highlighted. from the figure, one can see that, except for h , s , cr , and ey a, all the other antibodies have their antigenic epitopes overlapping with the ace binding region, especially on the residues from to of the sars-cov- rbd. therefore, these antibodies bind competitively against ace , as revealed in figure . the next question is whether there is any connection or similarity between the antibody paratopes in our library, particularly for those antibodies that share the same binding sites. to address this question, we carry out multiple sequence alignment (msa) to further study the similarities and differences among existing antibodies. many antibodies are very similar to each other and can be described in a few groups.
the first group includes bd- , cc . [ ] , cova - [ ] , cv [ ] , cc . [ ] , b [ ] , bd- , bd- , ey a, and regn [ ] , as well as cb [ ] . their identity scores to cb are . therefore, multiple sequence alignment suggests that the paratopes of the antibodies bd- , cb , cova - , cv , cc . , cc . , c , bd- , bd- , and b are almost identical. similarly, the paratopes of the antibodies h -h , h -d , nb are highly consistent. so are the antibodies regn , cova - , and p b- f . the above similarity indicates that the adaptive immune systems of individuals have a common way to generate antibodies. on the other hand, the existence of three distinct groups, as well as antibody a suggests the diversity in the immune response. note that we have also included ace in our msa as a reference but none of the existing antibodies is similar to ace , because they were created from entirely different mechanisms. to investigate the influences of existing s protein mutations on the binding free energy (bfe) of s protein and antibodies, we consider mutations occurred on the s protein rbd which are relevant to the binding of sars-cov- s protein and antibodies as well as ace . additionally, mutations occurred on the ntd of the s protein (residue id: to ) which are relevant to the binding of sars-cov- s protein and antibody a (pdb: c l). we predict the free energy changes following existing mutations using our topnettree model [ ] . the rbd mutations are computed which are in the distance of Å to antibodies. our predictions are built from the x-ray crystal structure of sars-cov- s protein and ace (pdb m j) [ ] , and various antibodies (pdbs wps [ ] , xc [ ] , xc [ ] , xc [ ] , xc , xcm [ ] , xdg [ ] , xe [ ] , xey [ ] , yla [ ] , yz , z m, zbp, zcz [ ] , zer [ ] , bwj [ ] , byr [ ] , bz [ ] , c [ ] , c l [ ] , c v [ ] , c w, cah [ ] , can [ ] , ch , ch , chb, che, chf, chh, jmo [ ] , and jmp [ ] ). 
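the restriction of the analysis to rbd mutations within a given distance of an antibody can be illustrated as follows. this is a minimal sketch under stated assumptions, not the authors' code. atoms are passed as plain coordinate tuples rather than parsed from a pdb file, the function name is hypothetical, and the cutoff is left as a parameter because its value is not legible in the text above.

```python
import math


def residues_within_cutoff(antigen_atoms, antibody_atoms, cutoff):
    """Return sorted ids of antigen residues with any atom near the antibody.

    antigen_atoms: list of (residue_id, (x, y, z)) for the s protein atoms.
    antibody_atoms: list of (x, y, z) for the antibody (or ace) atoms.
    cutoff: distance threshold in angstroms (value chosen by the caller).
    """
    close = set()
    for residue_id, xyz in antigen_atoms:
        if residue_id in close:
            continue  # one atom within range is enough for this residue
        for partner_xyz in antibody_atoms:
            if math.dist(xyz, partner_xyz) <= cutoff:
                close.add(residue_id)
                break
    return sorted(close)


# Toy coordinates only; the 4.0 cutoff is a made-up illustrative value.
toy_antigen = [(10, (0, 0, 0)), (10, (1, 0, 0)), (11, (20, 0, 0)), (12, (5, 0, 0))]
toy_antibody = [(0, 0, 3), (8, 0, 0)]
toy_contacts = residues_within_cutoff(toy_antigen, toy_antibody, cutoff=4.0)
```

only mutations on the returned residues would then be passed to the free energy predictor. the brute-force double loop is adequate for a single complex, and a spatial index would be the natural replacement for large batches.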
the bfe change following mutation (∆∆g) is defined as the bfe of the wild type minus the bfe of the mutant type, ∆∆g = ∆g_w − ∆g_m, where ∆g_w is the bfe of the wild type and ∆g_m is the bfe of the mutant type. therefore, a negative bfe change means that the mutation decreases the affinity, making the protein-protein interaction less stable. we first present the bfe changes ∆∆g of the sars-cov- s protein binding domain with antibody a in figure , which is the only complex in our collection of s protein and antibody complexes that is not on the rbd. most mutations induce small changes in the binding free energies, while some of them induce large changes. notably, out of mutations on the binding domain have positive bfe changes, which means that these mutations increase the affinity and would make the protein-protein interaction more stable. however, the majority ( %) of mutations have negative bfe changes, including the high-frequency mutations m i, s f, and s f. it is also noted that many mutations on the binding domain, such as g d and k n, have significant negative free energy changes. the mutations on the binding domain with negative bfe changes reveal that the binding of antibody a and the s protein may be disrupted. next, we study the bfe changes ∆∆g induced by mutations on the sars-cov- s protein rbd for the antibody fab - (pdb: xey) in figure . most mutations induce small changes in the binding free energies, while mutations, g r and s l, have large negative bfe changes. overall, out of mutations on the rbd lead to negative bfe changes, which means that % of mutations will potentially weaken the binding between antibody fab - and the s protein. in particular, mutation s n on the rbd induces a negative bfe change with a high frequency of . while some mutations lead to positive bfe changes, more mutations induce negative bfe changes with large magnitude.
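the sign convention defined above, and the kind of tally reported for each antibody (how many mutations weaken or strengthen binding), can be made concrete with a small sketch. the function names and the mutation labels are hypothetical. the free energies would come from a predictor such as topnettree, which is not reproduced here.

```python
def bfe_change(dg_wild, dg_mutant):
    """ddG = dG_w - dG_m; negative means the mutation weakens binding."""
    return dg_wild - dg_mutant


def summarize_changes(ddg_by_mutation):
    """Split mutations by the sign of ddG and report the weakening fraction."""
    weakening = sorted(m for m, v in ddg_by_mutation.items() if v < 0)
    strengthening = sorted(m for m, v in ddg_by_mutation.items() if v > 0)
    total = len(ddg_by_mutation)
    return {
        "weakening": weakening,
        "strengthening": strengthening,
        "fraction_weakening": len(weakening) / total if total else 0.0,
    }


# Toy values in kcal/mol. A wild type binding at -10 and a mutant binding
# at -8 is a weaker complex, so the change is negative under this convention.
toy_ddg = bfe_change(-10.0, -8.0)
toy_summary = summarize_changes(
    {"mut_a": -2.0, "mut_b": 0.5, "mut_c": -0.1, "mut_d": 0.0}
)
```

a mutation with a change of exactly zero is counted in neither group, which is a design choice of this sketch rather than something stated in the text.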
antibody fab - shares a similar binding domain with ace and thus is a potential candidate for the direct neutralization of sars-cov- . however, bfe change predictions indicate that the mutations on the s protein weaken the binding of fab - to the s protein and make it less competitive with ace . in figure , we illustrate antibody b (pdb: c w), which shares the binding domain with ace as well. one can notice that only four mutations, r s, f s, l f, and s l, have bfe changes with a magnitude larger than kcal/mol, and all four are negative. the remaining mutations induce changes of small magnitude. mutation v a has a frequency of and a small positive bfe change. interestingly, mutation s l induces large bfe changes for both antibodies b and fab - . antibody b will become less competitive with ace if mutations r s, f s, l f, and s l become dominant. finally, we consider the bfe change predictions for the complex of antibody s and the s protein, whose receptor binding motif (rbm) does not overlap with the rbm of ace . the bfe changes induced by mutations are predicted. among them, changes are positive. similar to the aforementioned antibodies, most of the mutations lead to small changes in binding affinity, but three mutations, t s, v i, and k n, induce large negative changes. the binding of antibody s might be disrupted, considering that a majority of mutations induce negative bfe changes with large magnitude. while antibodies serve a variety of functions in the human immune system, such as neutralization of infection, phagocytosis, and antibody-dependent cellular cytotoxicity, their binding with antigens is crucial for all of these functions. our analysis of bfe changes following mutations on the s protein suggests that some antibodies will be less affected by mutations, which is important for developing vaccines and antibody therapies. the bfe change analysis of other antibodies is described in the supporting material (s ).
in this section, we build a library of mutation-induced bfe changes for all mutations and all antibodies. in principle, we could create a library of all possible mutations for all antibodies, as we did for ace [ ] . here, we limit our effort to all existing mutations. antibody a on the ntd has been discussed above. we consider antibodies on the rbd. based on our earlier analysis, three types of sars-cov- s protein secondary structural residues have different mutation rates. among them, the random coils are major components of the rbd and the ntd, as shown in fig. . therefore, mutations on the rbd are split into three categories based on their locations in the secondary structures: helix, sheet, and coil. in figure , we present the bfe changes for ace and antibodies induced by mutations on helix residues of the s protein rbd. the frequency of each mutation is also presented. most mutations on helix residues lead to positive bfe changes (green squares), whereas some mutations induce negative bfe changes (pink squares). the n k mutation, which has the largest frequency, , shows mild bfe changes on ace and antibodies. mutations k n and y c induce positive bfe changes on ace and most antibodies. in particular, antibodies c and bd- have larger bfe changes than ace , which indicates that they become more competitive than ace . antibody cb may potentially be a good therapeutic candidate, as its bfe changes are positive following all mutations, but this needs to be confirmed with the mutations on the coil and sheet residues. in figure , we present the bfe changes for ace and antibodies, along with frequencies, for mutations on sheet residues of the s protein rbd. the mutation r s has a large variance in its bfe changes, such that both positive and negative changes occur on antibodies and ace .
clearly, antibodies bd , bd , cb , mr , and mr -k y have negative bfe changes for mutations on rbd sheet residues, which reduces their ability to compete with ace for binding after these mutations. as for mutations with high frequencies, the mutation r k has negative changes on most antibodies, which poses a danger of disrupting the binding of antibodies and the s protein. figure presents the bfe changes for ace and antibodies along with the log of frequencies for each mutation on coil residues of the s protein rbd. overall, most mutations on coil residues lead to negative bfe changes. interestingly, cv has the most positive bfe changes following mutations, which makes it a good candidate for potential therapy. for the high-frequency mutation s n, the bfe changes are mild on ace and antibodies. however, mutation l f induces negative bfe changes for all antibodies except for cova - and is therefore considered a potentially dangerous mutation for antibody therapies.

statistically, most mutations ( of ) occur on residues whose secondary structure is coil, while out of mutations are on the helix, and out of mutations are on the sheet. here, mutations on the random coils and mutations on the helix are not calculated because they are too far from the antibodies. moreover, residues on the coil have more negative bfe changes ( negative bfe changes vs. positive bfe changes), while residues on the helix or sheet have more positive bfe changes ( and negative bfe changes vs. and positive bfe changes, respectively). lastly, the maximum bfe changes of the helix, sheet, and coils are . kcal/mol, . kcal/mol, and . kcal/mol, while the minimum bfe changes are - . kcal/mol, - . kcal/mol, and - . kcal/mol, respectively.

figure : illustration of sars-cov- mutation-induced maximal and minimal binding free energy changes (in kcal/mol) for the complexes of s protein and antibodies or ace .
here, the maximal change strengthens the binding while the minimal change weakens the binding for each complex.

figure shows the extreme values of the bfe changes (maximal in blue and minimal in red) for the complexes of the s protein with ace or antibodies following mutations. many antibodies, such as cr and cr h -d , are not very sensitive to the current s protein mutations. however, some other antibodies, such as cv , fab - , and ey z nb, can be dramatically affected by sars-cov- mutations.

the increasing number of infections and deaths, the global spread, and the lack of prophylactics and therapeutics give rise to an urgent demand for the prevention of covid- . vaccination is the most effective and economical means to prevent and control pandemics [ ] . currently, vaccines are in various clinical trial stages, as reported in an online covid- treatment and vaccine tracker ( https://covid- tracker.milkeninstitute.org/#vaccines intro). broadly speaking, there are four types of coronavirus vaccines in progress: virus vaccines, viral-vector vaccines, nucleic-acid vaccines, and protein-based vaccines, as shown in figure . the first type of vaccine is the virus vaccine, which injects weakened or inactivated viruses into the human body. a virus is conventionally weakened by altering its genetic code to reduce its virulence and elicit a stronger immune response. the biotechnology company codagenix is currently working on a "codon optimization" technology to weaken viruses, and its weakened-virus vaccine is in progress [ ] . unlike a weakened virus, an inactivated virus cannot replicate in the host cell. a virus is inactivated by heating or by using chemicals. the inactivated virus induces neutralizing antibody titers and has been proven to be safe [ ] .
at this stage, both sinopharm, which works with the beijing institute of biological products and the wuhan institute of biological products, and sinovac, which works with institute butantan and bio farma, are developing inactivated sars-cov- vaccines that are in phase iii clinical trials. the second type of vaccine is the viral-vector vaccine, which is genetically engineered so that it can produce coronavirus surface proteins in the human body without causing disease. there are two subtypes of viral-vector vaccines: the non-replicating viral vector and the replicating viral vector. there are non-replicating viral vector vaccines in phase iii trials. one of them is the vaccine of astrazeneca and the university of oxford, which is in phase iii trials in many countries. it works by taking a chimpanzee virus and coating it with the s proteins of sars-cov- . the chimp virus causes a harmless infection in humans, but the spike proteins will activate the immune system to recognize signs of a future sars-cov- invasion. notably, booster shots may be needed to maintain long-lasting immunity. moreover, at this stage, only one replicating viral-vector vaccine is in phase i. institut pasteur themis, in cooperation with the university of pittsburgh cvr and merck sharp & dohme, is developing such a replicating viral vaccine, which tends to be safe and to provoke a strong immune response [ ] . the third type of vaccine is the nucleic-acid vaccine, which includes two subtypes: dna-based vaccines and rna-based vaccines. at least teams are currently working on nucleic-acid vaccines since they are safe and easy to develop. the dna-based vaccine works by inserting genetically engineered blueprints of the viral gene into small dna molecules such as plasmids for injection. moreover, the electroporation technique is employed to create pores in membranes to increase dna uptake into cells. the injected dna is transcribed into mrna in the nucleus of human cells.
such mrna is translated into viral proteins (mostly spike proteins), which are produced by cells in response to the genes, alert the immune system, and should produce immunity. currently, there are four dna-based vaccines in phase ii. similar to dna-based vaccines, rna-based vaccines provide immunity through the introduction of rna, which is encased in a lipid coat to ensure its entry into cells. two rna-based vaccines are in phase iii, and companies such as moderna, biontech, and pfizer are working on the advanced development of rna-based vaccines. the fourth type of vaccine is the protein-based vaccine, which aims to inject viral proteins directly into the human body to trigger immune readiness. the protein subunit vaccine is one of the subtypes of the protein-based vaccine. more than teams are working on vaccines with viral protein subunits, such as spike proteins and membrane (m) proteins. another subtype of the protein-based vaccine is the virus-like particle (vlp) vaccine. vlp vaccines closely resemble viruses. however, they are not infectious since they do not contain viral genetic material. this non-replicating property provides a safer alternative to weakened virus vaccines. the hpv vaccine and newer flu vaccines are vlp vaccines. currently, teams are working on vlp vaccines for the future prevention of covid- .

since the structural basis of antibody cdrs, or paratopes, is random coils, we hypothesize that cdrs favor antigenic random coils as complementary epitopes, i.e., antigenic determinants [ , ] . figure depicts the d structure of the s protein, where the random coils are drawn as green strings and the other secondary structures are shown as a purple surface. it shows that the rbd and the ntd mostly consist of random coils. the rbd is the antigenic determinant of structurally known sars-cov- antibodies, while the ntd is the binding domain of antibody a , which supports our hypothesis.
figure marks the secondary structure of the s protein. the red, blue, and green colors represent helix, sheet, and random coils of s protein. it can be seen that the s protein mostly consists of random coils, which means there are many other potential antigenic epitopes on the s protein for antibody cdrs. we believe that the emphasis of direct binding competition with ace in the past [ , , ] has led to the neglecting of many important antibodies that do not bind to the rbd. therefore, we suggest that researchers pay more attention to antibodies that do not bind to the rbd. vaccine efficacy is an essential issue for the control of the covid- pandemic. s protein is one of the most popular surface proteins for the vaccine development. however, mutations accumulated on the s protein of sars-cov- , which may reduce the vaccine efficacy. as we found in section , mutations are more likely to happen on the random coils of s protein, which may have a devastating effect on vaccines in the development. as shown in figure , mutations could considerably weaken the binding between the s protein and antibodies and thus pose a direct threat to reduce the efficacy of vaccines. however, there are a few obstacles in determining the exact impacts of mutations to covid- vaccines. firstly, the four types of vaccine platforms can produce very different virus peptides, which will result in different immune responses, as well as antibodies. secondly, even for a given vaccine platform, the different peptides may be produced due to different immune responses caused by gender difference, age difference, race difference, etc. therefore, in this work, we proposed to understand the impact of sars-cov- mutations on covid- vaccines by the statistical analysis. by evaluating the binding affinity changes induced by existing sars-cov- antibodies, as shown in figure to figure , we can notice that the k n, y c, f l, and f l mutations enhance the binding of almost all of the antibodies. 
in contrast, the r k, l f, and p r mutations have weakened the binding of almost all of the existing antibodies. moreover, mutation k n enhances the binding of antibody ey a, whereas mutation v i weakens the binding of antibody s . furthermore, it is noticed that many mutations such as k n, q r, and g r considerably disrupt many antibodies and thus may bring a threat to future vaccines. figure depicts the maximal and minimal binding free energy changes for s protein complexes and antibodies or ace . it can be seen that antibodies cr , cr h -d , bd- , bd- , and bd- - are not very sensitive to the current mutations on the s protein. however, other antibodies, such as cv and ey a, are very sensitive to current mutations. in a nutshell, by setting up a sars-cov- antibody library with the statistical analysis based on the mutation-induced binding free energies changes, we can estimate the impacts of sars-cov- mutations on covid- vaccines, which will provide a way to infer how a specific mutation will pose a threat to vaccines. this approach works better when more antibody structures become available. another important factor in prioritization is mutation frequency. figures , , and have provided frequency information from our snp calling. once a mutation is identified as a potential threat, it can be incorporated into the next generation of vaccines in a cocktail approach. in principle, all four types of vaccine platforms allow the accommodation of new viral strains. coronavirus disease (covid- ) pandemic has gone out of control globally. there is no specific medicine and effective treatment for this viral infection at this point. vaccination is widely anticipated to be the endgame for taming the viral rampant. another promising treatment that is relatively easy to develop is antibody therapies. however, both vaccines and antibody therapies are prone to more than , unique mutations recorded in the mutation tracker. 
we present a prediction of mutation threats to vaccines and antibody therapies. first, we identify existing mutations on the severe acute respiratory syndrome coronavirus (sars-cov- ) spike (s) protein, which is the main target for both vaccines and antibody therapies. we analyze the mechanism, frequency, and ratio of mutations along with the secondary structures of the s protein. additionally, we build a library of antibodies with structures available from the protein data bank (pdb) and analyze their two-dimensional ( d) and three-dimensional ( d) characteristics by employing computational biophysics. we further predict the mutation-induced binding free energy (bfe) changes of s protein and antibody complexes with a model called topnettree, which is based on deep learning and algebraic topology. from these studies, we infer that some s protein mutations may disrupt the binding of antibodies and the s protein, which will further affect the efficacy and reliability of vaccines. to prioritize mutation threats, we also take the mutation occurrence frequency into consideration. the resulting algorithm indicates that some high-frequency mutations with negative bfe changes, such as m i, s f, and s f, may potentially disrupt the efficacy and reliability of vaccines and antibody therapies currently in development. our method can provide an efficient prioritization of mutations to guide the design of the next generation of vaccines and antibody therapies. supporting material is available for: s method; s multiple sequence alignments of the antibodies and pairwise identity scores; and s mutation-induced changes of binding free energies of antibody-sars-cov- spike protein complexes.
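the prioritization idea summarized above, which combines a negative bfe change with the occurrence frequency, can be sketched in a few lines. the text does not specify the exact ranking rule, so the ordering used here (weakening mutations only, highest frequency first, most negative bfe change as the tie-breaker) is an assumption made purely for illustration, and the function name and mutation labels are hypothetical.

```python
def prioritize_threats(records):
    """Rank mutations that weaken binding, most frequent first.

    records: list of (mutation_label, frequency, ddg) tuples, with ddg
        following the convention ddG = dG_w - dG_m (negative = weaker binding).

    Assumed rule: keep ddg < 0, sort by descending frequency, then by
    ascending ddg so the more disruptive mutation wins a frequency tie.
    """
    threats = [r for r in records if r[2] < 0]
    return sorted(threats, key=lambda r: (-r[1], r[2]))


# Toy records only; labels, counts, and energies are made up.
toy_records = [
    ("mut_a", 50, -0.3),
    ("mut_b", 500, 0.2),   # strengthens binding, so it is not a threat
    ("mut_c", 50, -1.5),
    ("mut_d", 900, -0.1),
]
toy_ranking = [label for label, _, _ in prioritize_threats(toy_records)]
```

other weightings, for example a product of log-frequency and the magnitude of the bfe change, would fit the same interface. the lexicographic rule is used here only because it is easy to inspect.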
genomic characterisation and epidemiology of novel coronavirus: implications for virus origins and receptor binding covid- vaccine development and a potential nanomaterial path forward covid- : four fifths of cases are asymptomatic, china figures indicate clinical and immunological assessment of asymptomatic sars-cov- infections projecting the transmission dynamics of sars-cov- through the postpandemic period deployment of convalescent plasma for the prevention and treatment of covid- immune responses in covid- and potential vaccines: lessons learned from sars and mers epidemic neutralizing antibody responses to sars-cov- in a covid- recovered patient cohort and their implications the orf , orf and nucleocapsid proteins of sars-cov- inhibit type i interferon signaling pathway covid- , immune system response, hyperinflammation and repurposing antirheumatic drugs highlight of immune pathogenic response and hematopathologic effect in sars-cov, mers-cov, and sars-cov- infection immune response in covid- : addressing a pharmacological challenge by targeting pathways triggered by sars-cov- overview of the immune response pathogen recognition by the innate immune system pattern recognition receptors and inflammation pathogen recognition in the innate immune response the evolution of adaptive immunity the mhc class i antigen presentation pathway: strategies for viral immune evasion cd + t cell effector mechanisms in resistance to infection genetic control of mhc class ii expression the cytokine storm and covid- cd and cd in cell-mediated immunity immunological memory in humans primary structure of a human iga immunoglobulin. iv.
streptococcal iga protease, digestion, fab and fc fragments, and the complete amino acid sequence of the alpha heavy chain antibody structure, instability, and formulation naturally occurring antibodies devoid of light chains comparison of physical chemical properties of llama vhh antibody fragments and mouse monoclonal antibodies llama antibody fragments with cross-subtype human immunodeficiency virus type (hiv- )-neutralizing properties and high affinity for hiv- gp sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor covid- : immunopathology and its implications for therapy convalescent plasma as a potential therapy for covid- treatment of critically ill patients with covid- with convalescent plasma progress and prospects on vaccine development against sars-cov- . vaccines the race for coronavirus vaccines: a graphical guide a new coronavirus associated with human respiratory disease in china insights into rna synthesis, capping, and proofreading mechanisms of sars-coronavirus structural and molecular basis of mismatch correction and ribavirin excision from coronavirus rna decoding sars-cov- transmission, evolution and ramification on covid- diagnosis, vaccine, and medicine decoding sars-cov- transmission and evolution and ramifications for covid- diagnosis, vaccine, and medicine genotyping coronavirus sars-cov- : methods and implications host immune response driving sars-cov- evolution a topology-based network tree for the prediction of protein-protein binding affinity changes following mutation topological persistence and simplification persistent homology analysis of protein structure, flexibility, and folding. 
international journal for numerical methods in biomedical engineering structural and physico-chemical effects of disease and non-disease nssnps on proteins loss of protein structure stability as a major causative factor in monogenic disease mechanisms of viral mutation making sense of mutation: what d g means for the covid- pandemic remains unclear gisaid: global initiative on sharing all influenza data-from vision to reality a neutralizing human antibody binds to the n-terminal domain of the spike protein of sars-cov- raptorx-property: a web server for protein structure property prediction potent synthetic nanobodies against sars-cov- and molecular basis for neutralization structure of the sars-cov- spike receptor-binding domain bound to the ace receptor structural basis of a shared antibody response to sars-cov- structures of human antibodies bound to sars-cov- spike reveal common epitopes and recurrent features of antibodies studies in humanized mice and convalescent humans yield a sars-cov- antibody cocktail structural basis for potent neutralization of sars-cov- and role of antibody affinity maturation. 
biorxiv structural characterisation of a nanobody derived from a naïve library structural basis for the neutralization of sars-cov- by an antibody from a convalescent patient human neutralizing antibodies elicited by sars-cov- infection potent neutralizing antibodies against sars-cov- identified by high-throughput single-cell sequencing of convalescent patients' b cells a noncompeting pair of human neutralizing antibodies block covid- virus binding to its receptor ace a human neutralizing antibody targets the receptor binding site of sars-cov- structural basis for neutralization of sars-cov- and sars-cov by a potent therapeutic antibody an alternative binding mode of ighv - antibodies to the sars-cov- receptor binding domain structural and functional analysis of a potent sarbecovirus neutralizing antibody potent binding of novel coronavirus spike protein by a sars coronavirusspecific human monoclonal antibody. emerging microbes & infections human monoclonal antibody combination against sars coronavirus: synergy and coverage of escape mutants potent neutralizing antibodies against multiple epitopes on sars-cov- spike mutations strengthened sars-cov- infectivity the sars-cov- vaccine pipeline: an overview. current tropical medicine reports safety and immunogenicity from a phase i trial of inactivated severe acute respiratory syndrome coronavirus vaccine bioinformatic prediction of epitopes in the emy antigen of echinococcus multilocularis. experimental and therapeutic medicine structural analysis of b-cell epitopes in antibody: protein complexes this work was supported in part by nih grant gm , nsf grants dms- , dms- , and iis , michigan economic development corporation, george mason university award pd , bristol-myers squibb, and pfizer. the authors thank the ibm tj watson research center, the covid- high performance computing consortium, nvidia, and msu hpcc for computational assistance. rw thanks dr. changchuan yin for useful discussion. 
the authors declare no competing interests. key: cord- - zcjgu authors: chen, yun; guo, yao; pan, yihang; zhao, zhizhuang joe title: structure analysis of the receptor binding of -ncov date: - - journal: biochem biophys res commun doi: . /j.bbrc. . . sha: doc_id: cord_uid: zcjgu abstract -ncov is a newly identified coronavirus with high similarity to sars-cov. we performed a structural analysis of the receptor binding domain (rbd) of spike glycoprotein responsible for entry of coronaviruses into host cells. the rbds from the two viruses share % identity in amino acid sequences, and molecular simulation reveals highly similar ternary structures. however, -ncov has a distinct loop with flexible glycyl residues replacing rigid prolyl residues in sars-cov. molecular modeling revealed that -ncov rbd has a stronger interaction with angiotensin converting enzyme (ace ). a unique phenylalanine f in the flexible loop likely plays a major role because its penetration into a deep hydrophobic pocket in ace . ace is widely expressed with conserved primary structures throughout the animal kingdom from fish, amphibians, reptiles, birds, to mammals. structural analysis suggests that ace from these animals can potentially bind rbd of -ncov, making them all possible natural hosts for the virus. -ncov is thought to be transmitted through respiratory droplets. however, since ace is predominantly expressed in intestines, testis, and kidney, fecal-oral and other routes of transmission are also possible. finally, antibodies and small molecular inhibitors that can block the interaction of ace with rbd should be developed to combat the virus. a mysterious pneumonia illness was first reported in late december in wuhan, china, and has rapidly spread to a dozen of countries including the united states with thousands of infected individuals and hundreds of deaths within a month [ ] . scientists in china have isolated the virus from patients and determined its genetic code. 
the pathogen responsible for this epidemic is a new coronavirus designated -ncov by the world health organization. -ncov belongs to the same family of viruses as the well-known severe acute respiratory syndrome coronavirus (sars-cov) and middle east respiratory syndrome coronavirus (mers-cov), which have killed hundreds of people in the past years. coronaviruses consist of a large diverse family of viruses. they can be classified into four genera: alpha-, beta-, gamma-, and delta coronavirus [ , ] . representative alphacoronaviruses include human coronavirus nl (hcov-nl ), while the betacoronaviruses include the best-known sars-cov and mers-cov. based on nucleic acid sequence similarity, the newly identified -ncov is a betacoronavirus. the entry of all coronaviruses into host cells is mediated by spike glycoprotein that gives coronaviruses a crownlike appearance by forming spikes on their surface. the amino acid sequence of spike glycoprotein consists of a large ectodomain, a single-pass transmembrane anchor, and a short c-terminal intracellular tail [ ] . the ectodomain contains a receptor-binding unit s and a membrane-fusion unit s . electron microscopic imaging illustrated that spike glycoprotein forms a clove-shaped spike with three s heads and a trimeric s stalk. for a virus to enter a host cell, s binds to a specific cell surface receptor via its receptorbinding domain (rbd), and s fuses the host cell and viral membranes, enabling the entry of viral genomes into host cells. specific rbd-receptor binding determines if a cell or animal can be infected and also serves as a target for therapeutic inventions to treat diseases caused by coronaviruses. previous studies have identified angiotensin converting enzyme (ace ) as a functional receptor for sars-cov [ , ] . in this study, we analyzed the structure of spike glycoprotein rbd of -ncov and identified a unique feature that potentially allows a high affinity binding to ace in human cells. 
we further discuss potential candidates for natural hosts of -ncov, routes of transmission, and strategies to inhibit virus entry for therapeutic applications. the genomic sequence of -ncov as deposited by wang et al. was downloaded from the genbank database (mn . ). dna and protein sequences were compared by using the blast program. multiple sequence alignment was performed by using the clustal omega program. three-dimensional structure was analyzed by using the cn d program from the ncbi. protein structure simulation was performed by using swiss-model based on the cocrystal structure of human ace with the sars-cov spike glycoprotein rbd ( , pdb id ajf). ace and rbd interaction was analyzed by molecular docking using the patchdock and firedock programs. by using the initially reported sequence mn . , a blast search of the ncbi database revealed inputs for the virus with essentially identical sequences (accession nc_ . , mn . , mn . , mn . , mn . , and mn . ). the closest homolog of -ncov is a sars-like coronavirus isolated from bat (mg . ) with a sequence identity of . % at % coverage (fig. a). it also shows % sequence identity with sars coronavirus isolated from human patients or civet with % coverage. throughout the entire , bp genome of -ncov, the least conserved region encodes the spike glycoprotein with sequence identity of e %. spike glycoprotein forms spikes on the surface of coronaviruses and is responsible for entrance of the viruses into the host cells. the rbd in the spike glycoprotein molecule directly binds receptors on the surface of host cells [ ]. in the case of sars-cov and bat/civet sars-like cov, the receptor is ace , an exopeptidase that catalyzes the conversion of angiotensin i to the nonapeptide angiotensin - or the conversion of angiotensin ii to angiotensin - [ e ]. at the protein level, the whole spike glycoprotein and its rbd share % and % sequence identity with sars-cov, respectively.
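The sequence identity figures quoted above come from blast and clustal omega. As a rough illustration of what such a number means, the sketch below computes percent identity over two already-aligned sequences. The function name `percent_identity`, the gap symbol, and the toy sequences are assumptions made for illustration and are not taken from the paper; identity definitions also differ between tools, and this version simply ignores gapped columns.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over aligned columns where neither sequence has a gap.

    Assumption: both inputs come from the same pairwise or multiple alignment,
    so they have equal length and gaps are written with the `gap` character.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # skip columns where either sequence has a gap
        compared += 1
        if a.upper() == b.upper():
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Hypothetical toy alignment, not data from the study.
toy_a = "ACDEFG-HIK"
toy_b = "ACDQFGLHVK"
print(round(percent_identity(toy_a, toy_b), 1))
```

Because gapped columns are excluded from the denominator, this value can be higher than an identity computed over the full alignment length.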
sars-cov spike glycoprotein is known to be glycosylated. a total of predicted n-glycosylation sites is found in spike glycoprotein of -ncov, which are shared by sars-cov except that the latter contains an extra glycosylation site at n . a detailed sequence alignment of the rbd of sars-cov spike glycoprotein with those from closely related coronaviruses at the protein level is shown in fig. b. the crystal structure of sars-cov rbd in complex with its receptor, human ace , has been solved [ ]. by performing molecular simulation, we obtained a tertiary structure for rbd of -ncov that is essentially superimposable with that of sars-cov (fig. a–c), except for a noted structural variation in a loop (loop ). the backbone of the deduced rbd structure consists of beta sheets. peptide segments involved in the formation of this secondary structure are all highly conserved without the presence of secondary structure breakers. four cysteinyl residues that form disulfide bonds (corresponding to c /c and c /c of sars-cov) are also conserved (see also fig. b). we furthermore performed molecular docking to examine the binding of rbd with ace . the deduced complex structure reveals a similar mode of extensive interaction as seen with sars-cov, with a more favorable binding energy (− . vs. − . kcal/mol) (fig. d–f). the contact between ace and rbd involves two β-sheets and three loops (see figs. b and a). there are amino acid residues in sars-cov rbd that are directly in contact with ace , of which are conserved in -ncov (see fig. b). presumably, the substituted amino acids can either reduce or enhance the interaction. to define the contribution of the variant amino acids to the rbd/ace interaction, we compared the sequences of rbd from three other sars-cov-associated viruses (fig. b). these include coronaviruses isolated from patients during a short, weak sars outbreak in e (denoted sarsv here) and from palm civets and bats, possible sources of sars-cov found in humans [ e ].
recombinant proteins containing rbd of these viruses are all known to bind to human ace [ , ]. in comparison with sars-cov (responsible for the major sars outbreak during the e ), binding with sarsv and civet rbds is substantially weaker [ ], while quantitative binding affinity with bat rbd has not been determined [ ]. amino acid residues in the rbd/ace binding interface play a crucial role in determining the binding affinity. among the amino acid residues in rbd of sars that are in contact with ace , , , , and are shared by sarsv, civet, bat, and -ncov, respectively (fig. b). n found in both sars viruses isolated from human patients is changed to k and r in civet and bat, respectively. an earlier study demonstrated that an n to k substitution resulted in significantly lower affinity ( fold increase in kd values) [ ]. interestingly, this amino acid is substituted by a similar amino acid, glutamine (q ), in -ncov, which also contains an amide group, but at an extended position, and can potentially carry out similar functions. in comparison with sars-cov, t is changed to asparagine (n ) in -ncov but alanine or serine in the other viruses. it has been shown that a t to s substitution increased kd by -fold, suggesting the methyl group rather than the hydroxyl group in this threonine residue is more important for the interaction [ ]. it is hard to predict if n with an amide group can confer a better interaction. hydrophobic amino acid l is also important for interaction between rbd and ace . interestingly, it is substituted by proline in sarsv and phenylalanine in -ncov (corresponding to f ). l is located in a loop formed by disulfide bond c /c . interestingly, this loop with ctppalnc in sars-cov is replaced by cngvegfnc in -ncov, containing one extra amino acid residue and a totally different amino acid composition. the replacement of two proline residues by two flexible glycine residues converts a rigid structure to a very flexible one.
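The loop comparison above can be checked directly from the two sequences quoted in the text. The short sketch below counts length, proline, glycine, and phenylalanine residues in each loop; the helper name `loop_summary` and the constant names are illustrative assumptions, while the two sequences are the ones given in the paragraph above.

```python
from collections import Counter

# Loop sequences as quoted in the text (one-letter amino acid codes).
SARS_LOOP = "CTPPALNC"   # loop in sars-cov
NCOV_LOOP = "CNGVEGFNC"  # corresponding loop in the new virus


def loop_summary(seq: str) -> dict:
    """Return length and counts of proline, glycine, and phenylalanine."""
    counts = Counter(seq.upper())
    return {
        "length": len(seq),
        "proline": counts["P"],
        "glycine": counts["G"],
        "phenylalanine": counts["F"],
    }


sars = loop_summary(SARS_LOOP)
ncov = loop_summary(NCOV_LOOP)
print(sars)
print(ncov)
# Length difference between the two loops (the "one extra residue" in the text).
print(ncov["length"] - sars["length"])
```

The counts agree with the description: the second loop is one residue longer, carries two glycines where the first carries two prolines, and contains the single phenylalanine discussed in the text.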
further examination of the deduced rbd/ace complex structure reveals that this unique phenylalanine f in the flexible loop can penetrate deep into a hydrophobic pocket in ace formed by f , l , y , and l (fig. f). the presence of two aromatic amino acids in the pocket may provide additional binding force via π-stacking interactions [ ]. taken together, -ncov likely binds more strongly to ace via its spike glycoprotein. glycosylation may also affect the interaction of rbd with ace . among the glycosylation sites on spike glycoprotein, two are in rbd (fig. b). glycosylation has been detected on one of these residues, asn [ ]. n corresponds to n in the spike glycoprotein of -ncov and is a conserved glycosylation site. since it is well separated from the rbd/ace interaction interface, glycosylation at this site is unlikely to interfere with the interaction [ ]. it should be noted that another potential glycosylation site corresponding to n in sars-cov is not conserved in -ncov because of substitution of t by a in the + position. lack of this glycosylation is not expected to affect the receptor binding. the -ncov outbreak is thought to have started at a seafood market that also sold many live wild animals including snakes, birds, and various mammals. interestingly, a study by ji et al. suggests that snakes might serve as a likely reservoir for the novel ncov- based on the observation that the codon usage of ncov- was more similar to that of snakes than to that of the other potential hosts they investigated [ ]. while the data and premise are being debated, we sought to address the problem by analyzing the structure of ace in different animals. ace is widely expressed in the animal kingdom from fish, amphibians, reptiles, birds, to mammals. remarkably, its structure is highly conserved. comparison of human ace with that of a civet (paguma larvata, aax . ), a bat (rhinolophus sinicus, adn . ), a bird (nipponia nippon, kfq . ), a snake (protobothrops mucrosquamatus, xp_ .
), a frog (xenopus laevis, xp_ . ), and a fish (callorhinchus milii, xp_ . ) revealed amino acid sequence identity of %, %, %, %, %, and %, respectively. fig. aligns parts of ace sequences that contain all the interaction sites in contact with sars-cov rbd according to the published co-crystal structure [ ]. the interaction involves mainly two α-helices of ace . out of amino acid residues involved in the direct interaction, are shared by all seven species of animals analyzed in the study, including f that supposedly interacts with f of spike glycoprotein from -ncov (fig. f). many of the remaining residues in the contact are conserved or replaced by amino acids of similar chemical properties. it is interesting to note that bird ace shares as many conserved contacting amino acid residues as bat and civet ace . ace molecules from any of these animals have the potential to interact with rbd of -ncov with high affinity. therefore, it would not be a surprise if any of these wild animals is found to be a primary or secondary host of -ncov. sars-cov-like coronaviruses have been found in many bats, which are considered natural reservoirs for the viruses. they may well be the host for -ncov. however, the possibility that cold-blooded animals like snakes can serve as a host cannot be ruled out. the flexible interacting loop identified in our study may allow the virus to adapt to both cold-blooded and warm-blooded hosts. by performing immunostaining, earlier studies have demonstrated the expression of ace in lung alveolar epithelial cells as well as arterial and venous endothelial cells, arterial smooth muscle cells, renal tubular epithelium, and epithelia of the small intestine [ , ]. the lung expression provides strong support for infection of sars-cov and -ncov through the airways of the lung. however, by searching the human protein atlas database, we found that ace mrna is mainly detected in small intestine, colon, duodenum, kidney, testis, and gallbladder.
its expression level in the lung is minimal (fig. ). furthermore, by examining data from two single-cell rna-seq studies [ , ], we found that only out of and out of lung epithelial cells expressed a detectable level of ace (www.ebi.ac.uk/gxa/sc). this confirms that the overall expression of ace in the lung is low and may also suggest the presence of selected cells with upregulated ace expression under certain conditions. the tissue expression pattern of ace suggests other modes of virus transmission that may involve the functions of intestine, kidney, testis, and other tissues. particular attention should be paid to the intestines, which express the highest level of ace . earlier studies have demonstrated that diarrhea was present in up to % of patients infected with sars-cov [ ]. more importantly, a recent case report demonstrated the presence of -ncov in feces of a patient with an initial diarrhea episode [ ]. while this finding has been noted in other reports, tests of feces and urine samples for the presence of -ncov are warranted, which may help to reveal alternative routes of virus transmission. since its initial outbreak, the -ncov infection has proven much more contagious than originally thought. we know that the virus is capable of spreading quickly from human to human and that people can spread the virus even before they become symptomatic [ ]. this makes it harder to contain the virus, and many are concerned about the possibility of a new pandemic. our study suggests unique structural features of the spike glycoprotein rbd of -ncov that confer potentially higher affinity binding for its receptor than found with sars-cov. with a higher affinity binding capability, the number of viruses required to infect a cell is much reduced. this may partly explain why -ncov appears to be more aggressive than sars-cov. this also reminds us of a lesser-known coronavirus, hcov-nl , that also uses ace as a receptor.
hcov-nl was initially isolated from a child with bronchiolitis in the netherlands [ ]. it belongs to the alphacoronavirus genus. the rbd of hcov-nl shares no structural homology with that of sars-cov but recognizes the same region in ace . however, the cocrystal structure reveals that rbd of nl -cov has a narrower contact with ace , involving fewer amino acids [ ]. this presumably results in a weaker interaction. evidently, nl -cov does not spread aggressively and only causes mild to moderate respiratory infections [ ]. the exact mode of transmission for -ncov has not been firmly established. sars-cov is thought to be transmitted by respiratory droplets produced when an infected person coughs or sneezes [ ]. respiratory droplet spread can occur only through direct person-to-person contact or at a close distance. presumably, -ncov can be transmitted through respiratory droplets. it may also be transmitted more effectively through the air over a long distance (airborne spread) or by other routes. considering the predominant expression of ace in intestines and kidney, -ncov may infect cells in these tissues and find its way into feces and urine. this makes transmission through the fecal-oral route and body fluids (urine) possible. the presence of -ncov in feces supports such a notion [ ]. specific rbd-receptor binding determines if a cell or animal can be infected and also serves as a target for therapeutic interventions to treat diseases caused by coronaviruses. by binding directly to ace on the surface of host cells, spike glycoprotein plays an essential role in virus infection. an obvious way to stop the virus infection is to block the rbd and ace interaction. this can be achieved by using antibodies or small molecular inhibitors. naturally, antibodies and inhibitors that can disrupt the interaction of rbd with ace are of therapeutic importance.
by using a molecular docking approach, an earlier study identified n-( -aminoethyl)- aziridineethanamine as a novel ace inhibitor that effectively blocks the sars-cov rbd-mediated cell fusion [ ] . this has provided a potential candidate and lead compound for further therapeutic drug development. meanwhile, biochemical and cell-based assays can be established to screen chemical compound libraries to identify novel inhibitors. on the other hand, many ace inhibitors are currently used to treat hypertension and other cardiovascular diseases [ ] . among them are captopril, perindopril, ramipril, lisinopril, benazepril, and moexipril. although these drugs primarily target ace, a homolog of ace with % sequence identity and % sequence similarity in the catalytic domain, they may be effective toward ace as well [ ] . it should be noted that ace inhibitors bind to the catalytic center rather than rbd binding site. nonetheless, these enzymatic inhibitors may indirectly alter conformation of the rbd binding site and thereby affect the interaction of ace with rbd. it is certainly worthwhile to test these drugs for their ability to block the rbd/ace interaction. 
a novel coronavirus outbreak of global health concern coronaviruses: an overview of their replication and pathogenesis structure, function, and evolution of coronavirus spike proteins angiotensin-converting enzyme is a functional receptor for the sars coronavirus the sars-cov s glycoprotein: expression and functional characterization structure of sars coronavirus spike receptor-binding domain complexed with receptor ace of the heart: from angiotensin i to angiotensin ( - ) analysis of multimerization of the sars coronavirus nucleocapsid protein cross-host evolution of severe acute respiratory syndrome coronavirus in palm civet and human isolation and characterization of viruses related to the sars coronavirus from animals in southern china isolation and characterization of a bat sars-like coronavirus that uses the ace receptor receptor and viral determinants of sarscoronavirus adaptation to human ace rapp e, pi-stacking interactions. alive and well in proteins homologous recombination within the spike glycoprotein of the newly identified coronavirus may boost cross-species transmission from snake to human tissue distribution of ace protein, the functional receptor for sars coronavirus. a first step in understanding sars pathogenesis severe acute respiratory syndrome coronavirus infection of human ciliated airway epithelia: role of ciliated cells in viral spread in the conducting airways of the lungs a cellular census of human lungs identifies novel cell states in health and in asthma single-cell rna sequencing identifies diverse roles of epithelial cells in idiopathic pulmonary fibrosis severe acute respiratory syndrome: historical, epidemiologic, and clinical features first case of novel coronavirus in the united states identification of a new human coronavirus crystal structure of nl respiratory coronavirus receptor-binding domain complexed with its human receptor, proc. natl human coronavirus nl : a clinically important virus? 
structure-based discovery of a novel angiotensin-converting enzyme inhibitor intrarenal angiotensin-converting enzyme: the old and the new angiotensin-converting enzyme- (ace ): comparative modeling of the active site, specificity requirements, and chloride dependence this study is supported in part by the top talents program of sun yat-sen university, national natural science foundation of china (nsfc, grant no. ), and the sanming project of medicine in shenzhen (no. szsm ). the authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper. key: cord- -h wrs h authors: liu, xianglei; drelich, aleksandra; li, wei; chen, chuan; sun, zehua; shi, megan; adams, cynthia; mellors, john w.; tseng, chien-te; dimitrov, dimiter s. title: enhanced elicitation of potent neutralizing antibodies by the sars-cov- spike receptor binding domain fc fusion protein in mice date: - - journal: vaccine doi: . /j.vaccine. . . sha: doc_id: cord_uid: h wrs h the development of an effective vaccine against sars-cov- is urgently needed. we generated sars-cov- rbd-fc fusion protein and evaluated its potency to elicit neutralizing antibody response in mice. rbd-fc elicited a higher neutralizing antibodies titer than rbd as evaluated by a pseudovirus neutralization assay and a live virus based microneutralization assay. furthermore, rbd-fc immunized sera better inhibited cell-cell fusion, as evaluated by a quantitative cell-cell fusion assay. the cell-cell fusion assay results correlated well with the virus neutralization potency and could be used for high-throughput screening of large panels of anti-sars-cov- antibodies and vaccines without the requirement of live virus infection in bsl containment. moreover, the anti-rbd sera did not enhance the pseudotyped sars-cov- infection of k cells. 
these results demonstrate that fc fusion can significantly improve the humoral immune response to recombinant rbd immunogen, and suggest that rbd-fc could serve as a useful component of effective vaccines against sars-cov- . the coronavirus disease (covid- ) outbreak has become a global pandemic responsible for over million confirmed cases and over . million deaths worldwide as of aug , . severe acute respiratory syndrome coronavirus (sars-cov- ), the novel betacoronavirus, is the etiological agent of covid- . vaccines are a highly effective strategy in preventing the spread of infectious diseases with advantages of low production and distribution cost; stark reduction in morbidity; and minimal negative long-term effects on overall health. currently, no approved vaccines are available for covid patients although tremendous scientific and industrial efforts have been dedicated. several approaches to the development of covid- vaccines have emerged including dna vaccines, rna vaccines, viral vector vaccines, recombinant subunit vaccines, inactivated and attenuated virus vaccines [ ] . as of aug , , candidate vaccines are undergoing clinical evaluations with more than vaccines in preclinical development stages (draft landscape of covid- candidate vaccines, who). among those, cansino's recombinant adenovirus type- vectored vaccine has achieved the most fast clinical progress as the first-in-human trial vaccine, reporting tolerability and high immunogenicity after days postvaccination [ ] . been demonstrated to be an appropriate immunogen capable of eliciting neutralizing antibodies [ ] . both viruses are phylogenetically close to sars-cov- [ ] , thus supporting the rationale of using the sars-cov- s protein as a subunit vaccine. subunit vaccines contain recombinant antigen proteins with strong immunogenicity capable of efficiently stimulating the host immune system. 
advantages of subunit vaccines include easy manufacture, low cost, and overall safety, since they do not contain genetic material and have a low probability of inducing severe adverse reactions. for example, subunit vaccines are reported to be safer than other vaccines such as virus-like particles, inactivated whole viruses and an rdna expressed s protein, which have been shown to induce the cytokine th -type immunopathology in sars-cov [ ]. several subunit vaccines based on the full-length sars-cov- s proteins are under development, with five candidates in clinical trials. however, the full-length s protein immunogens contain many non-neutralizing epitopes that could result in enhanced infection through the antibody-dependent enhancement (ade) effect, which may occur through the fcγr-mediated internalization of antibody-bound virions in fcγr-expressing cells [ , ]. the risk of ade, observed for sars-cov, mers-cov, hiv- , zika and dengue virus vaccination, has become a major concern in vaccine development [ ] [ ] [ ]. the sars-cov- receptor-binding domain (rbd) is the protruding site of the s protein that mediates viral cell fusion during the initial infection event through binding of the human receptor angiotensin-converting enzyme (hace ) [ , ]. based on its high homology to sars-cov, sars-cov- rbd is corroborated to contain immunodominant epitopes capable of eliciting antibodies that can neutralize viral infection and block viral entry by competing with hace binding. the antigenic regions on sars-cov- were also confirmed by computational studies exploring t cell and b cell epitopes [ ]. compared to the full-length s protein based subunit vaccine, reducing the immunogen to only the neutralizing epitope containing rbd can not only specifically mount neutralizing antibody titers, but could also mitigate the risk of ade, which in most cases is mediated by the non-neutralizing antibodies [ ].
therefore, sars-cov- rbd has been selected as a primary target for vaccine design [ , ]. ravichandran et al recently found that compared to full-length s, sars-cov- rbd immunogen elicited a higher titer of neutralizing antibodies with -fold higher affinity [ ]. yang et al also showed that rbd can induce a potent antibody response in immunized mice, rabbits and non-human primates [ ]. zang et al (preprint) have shown that the anti-rbd sera from mice did not promote sars-cov- infection (either pseudotyped or authentic) of fcγ receptor-bearing cells [ ]. given those advantages, we chose the sars-cov- rbd as a subunit vaccine. to boost the vaccination-induced antibody response, we fused rbd to the igg fc. the fc fragment can serve as a vaccine adjuvant by promoting cellular and humoral immune responses, probably by facilitating antigen delivery and presentation through interacting with fcγ receptors on antigen-presenting cells. in addition, the fc-fusion can also improve the solubility and stability of recombinant immunogens and extend their in vivo half-life after injection by interacting with the human neonatal fc receptor (hfcrn) [ ]. to stimulate anti-rbd antibody titers, we also utilized a well-accepted, highly effective adjuvant, mf ™. mf has been proven to increase neutralizing antibody production and boost the th and th immune responses. several vaccines containing mf as adjuvants are undergoing clinical trials (phases i-iii). notably, mf -adjuvanted seasonal influenza vaccine (fluad™) has already been licensed, showing acceptable safety and tolerability, and improved immunogenicity [ ]. the gene of sars-cov- rbd domain (residues - ) was synthesized by idt (coralville, iowa), then cloned in frame to human igg fc or ×his tag in the mammalian cell expression plasmid. these proteins were expressed in the expi ™ expression system (thermo fisher scientific) and purified by protein a resin (genscript) or ni-nta resin (thermo fisher scientific).
the purified sars-cov- rbd and rbd-fc were analyzed by sds-page and western blot (wb). protein concentration was measured spectrophotometrically (nanovue, ge healthcare), μg of proteins were separated by nupage ® - % bis-tris gels (life technologies), protein purity was estimated as > %. for western blot, the separated proteins were transferred to nitrocellulose membranes. after blocking at room temperature using % non-fat milk in pbst for h, the blots were incubated for h at room temperature with nm hace -mfc (mouse fc, sino biological, beijing, china). after three washes, the blots were incubated with anti-mouse igg-horseradish peroxidase (hrp) conjugated secondary antibody ( : , sigma-aldrich) for h at room temperature. the membranes were reacted with enhanced ecl chemiluminescence reagent (millipore, billerica, usa) and exposed by bio-rad detection system. protein deglycosylation mix ii (neb) was used for deglycosylation reaction following manufacturer's instructions. briefly, μg proteins were mixed with deglycosylation mix buffer (non-denaturing reaction) or buffer (denaturing reaction, with an additional incubation at °c for min), then incubated with protein deglycosylation mix ii for reaction ( °c for min, and then °c for h). the deglycosylated proteins were analyzed by sds-page. the mw of enzymes in protein deglycosylation mix ii is approximately , , and kda. genes of hace (origene, rockville, md) and the full length s protein of sars-cov- (codon optimized and synthesized by idt) were subcloned into our in-house mammalian cell expression plasmid and used to construct stable cell line t-ace and t-s, which were cultured in dulbecco's modified eagle's medium (dmem, gibco) with % fetal bovine serum (fbs), % penicillin/streptomycin (p/s) and μg/ml zeocin (thermo fisher). for the determination of t-s cell line, nm hace -fc (sino biological, beijing, china) were incubated with cells for min at o c. 
cells were washed and then incubated with pe conjugated anti-human fc antibody (sigma-aldrich) for min at o c. bound antibodies were detected by flow cytometry using bd lsr ii (san jose, ca). for the t-ace cell line, nm rbd-his followed by pe conjugated anti-his antibody (sigma-aldrich) were used for analysis. four groups of - week old female balb/c mice (n= ) were immunized twice (day and day ) subcutaneously with rbd proteins ( μg/mouse) with or without adjuvant mf . group was immunized with rbd-fc fusion, group was immunized with rbd-fc fusion in emulsion with mf , group was immunized with rbd in emulsion with mf , group served as a control and was immunized subcutaneously with dulbecco's phosphate-buffered saline dpbs (gibco tm ). sera were collected before (pre-vaccination), after days, and after days vaccination. for evaluation of affinity of rbd and rbd-fc to hace , both proteins were coated on a -well plate (costar) at ng/well in pbs overnight at o c. the plate was blocked using % skim milk for h at room temperature (rt). we then added serially diluted hace -mfc (mouse fc, sino biological, beijing, china) and incubated for h at rt. the plates were washed times with . % tween in phosphate buffered saline (pbst). anti-mouse igg-horseradish peroxidase (hrp) conjugated secondary antibody (sigma-aldrich) was added to the plate followed by incubation for h at rt. after another washes with pbst, the plate was incubated with a , ', , 'tetramethylbenzidine substrate solution (tmb, sigma-aldrich) for min. the reaction was stopped using m h so followed by reading absorbance of each well at nm. for detection of anti-rbd or anti-(s +s ) antibodies in mice serum, the sars-cov- rbd proteins or s +s (sino biological, beijing, china) were coated at ng/well in pbs overnight at o c. after blocking, serially diluted mouse serum were added and incubated for h at rt. 
for antibody isotyping, the bound rbd-specific antibodies were detected by anti-mouse igg, igm, iga hrp conjugated secondary antibody (sigma-aldrich), respectively. for competitive elisa, ~ nm ( μg/ml) biotinylated hace (sino biological, beijing, china) was incubated with serially diluted mouse serum, and the mixtures were added to rbd coated wells. after washing, bound hace was detected by streptavidin-hrp secondary antibody (sigma-aldrich). the pseudovirus neutralization assay was performed based on previous protocols. briefly, hiv- backbone based pseudovirus was packaged in t cells by co-transfecting with plasmid encoding sars-cov- s protein and plasmid encoding luciferase expressing hiv- genome (pnl - .luc.re) using polyethylenimine (pei). pseudovirus-containing supernatants were collected h later and concentrated using lenti-x™ concentrator kit (takara, ca). pseudovirus neutralization assay was then performed by incubation of sars-cov- pseudovirus with serially diluted mice serum for h at °c, followed by addition of the mixture into pre-seeded t-ace cells. the mixture was then centrifuged at × g for h at rt. the medium was replaced hrs later. after h, luciferase expression was determined by bright-glo kits (promega, madison, wi) and read using biotek synergy multi-mode reader (winooski, vt). the % pseudovirus neutralizing antibody titer (nt ) was calculated using the graphpad prism . the standard live virus-based microneutralization (mn) assay was used. briefly, serially five-fold (start from : ) and duplicate dilutions of mice serum were incubated with pfu of sars-cov- at room temperature for h before transferring into designated wells of confluent vero e cells (atcc, crl- ) grown in -well microtiter plates. vero e cells cultured with medium with or without virus were included as positive and negative controls, respectively. 
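The pseudovirus titer described above is read off a serial-dilution curve, which the authors computed in graphpad prism. As a rough stand-in only, and not the curve fit used in the study, the sketch below interpolates linearly in log10(dilution) between the two measured points that bracket a chosen neutralization threshold. The function name `interpolate_titer`, the example readings, and the threshold value passed in are assumptions for illustration; the threshold is left as a parameter because the percentage is not legible in the text. The sketch also assumes that neutralization falls as the serum becomes more dilute.

```python
import math


def interpolate_titer(dilutions, neutralization_pct, threshold_pct):
    """Estimate the dilution fold at which neutralization crosses a threshold.

    dilutions: reciprocal dilution folds in increasing order (e.g. 10, 100, 1000).
    neutralization_pct: percent neutralization measured at each dilution.
    threshold_pct: neutralization level that defines the titer.
    Returns None when the curve never crosses the threshold.
    """
    points = list(zip(dilutions, neutralization_pct))
    for (d_lo, n_lo), (d_hi, n_hi) in zip(points, points[1:]):
        # Look for the pair of neighbouring points that brackets the threshold.
        if n_lo >= threshold_pct >= n_hi and n_lo != n_hi:
            frac = (n_lo - threshold_pct) / (n_lo - n_hi)
            log_d = math.log10(d_lo) + frac * (math.log10(d_hi) - math.log10(d_lo))
            return 10 ** log_d
    return None


# Hypothetical readings, not data from the study.
example = interpolate_titer([10, 100, 1000], [90.0, 50.0, 10.0], threshold_pct=50.0)
print(example)
```

A sigmoidal (four-parameter logistic) fit is the more usual choice for such curves; the interpolation here only shows how a titer relates to the bracketing dilutions.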
after incubation at °c for days, individual wells were observed under the microscope for the status of virus-induced formation of cytopathic effect (cpe). the titer of mice serum (nt ) was expressed as the lowest dilution folds capable of completely preventing virus-induced cpe in % of the wells. to test mice serum mediated inhibition of cell fusions, the β-gal reporter gene based quantitative cell fusion assay was used. briefly, t-s cells were infected with t polymerase-expressing vaccinia virus (vtf - ), while t-ace cells were infected with vaccinia virus (vcb r lac-z) encoding t promoter controlled β-gal. two hours after infection, cells were incubated with fresh medium and transferred to °c for overnight incubation. the next day, t-s cells were pre-mixed with serially diluted mice serum at °c for h followed by incubation with t-ace cells at a : ratio for h at °c. cells were then lysed, and the β-gal activity was measured using β-galactosidase assay kit (substrate cprg, g-biosciences, st. louis, mo) following the manufacturer's protocols. fusion inhibition percentage (sample reading, f) was normalized by maximal fusion (reading, f_max) of t-s and t-ace cells in the absence of inhibitors using this formula: fusion inhibition % = [(f_max − f)/(f_max − f_blank)] × %, in which f_blank refers to the od reading of t-s and t incubation wells. fusion inhibition percentage was plotted against serum dilution folds, from which ic was calculated in graphpad prism . fcγrii expressing cell lines k (atcc, ccl- ) were used to perform ade assays. briefly, mouse sera were serially diluted, mixed with sars-cov- pseudovirus, and incubated at °c for h. then, the mixtures were added to the pre-seeded plates with k cells. the following infection and culturing steps were carried out as described above in the pseudovirus neutralization assay. pseudovirus infected k or t-ace cells were set as the negative and positive controls, respectively.
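The fusion-inhibition normalization given in the methods above can be sketched in a few lines. The function name `fusion_inhibition_percent` and the example readings are hypothetical; the scale factor defaults to 100 on the assumption that the formula expresses a percentage, since the multiplier itself is not legible in the text.

```python
def fusion_inhibition_percent(f, f_max, f_blank, scale=100.0):
    """Normalize a sample reading to percent fusion inhibition.

    f:       od reading of the sample well (serum present).
    f_max:   od reading of maximal fusion (no inhibitor).
    f_blank: od reading of the blank (background) wells.
    scale:   assumed to be 100 so that the result is a percentage.
    """
    if f_max == f_blank:
        raise ValueError("f_max and f_blank must differ")
    return (f_max - f) / (f_max - f_blank) * scale


# Hypothetical od readings, not data from the study.
print(fusion_inhibition_percent(f=0.6, f_max=1.0, f_blank=0.2))
```

With this normalization a sample that reads the same as the maximal-fusion wells gives 0 and a sample that reads the same as the blank wells gives the full scale value.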
all experiments were conducted in duplicate, and data were averaged and presented as the mean ± standard deviation (sd). significant differences were determined by one-way analysis of variance followed by tukey's test, using the graphpad prism (version ) package. statistical significance was defined as p < . . recombinant sars-cov- rbd (without fc) and rbd-fc proteins were produced in expi tm mammalian cells, and then verified by sds-page and western blot. rbd-fc showed a homogenous band (fig. a) while the rbd exhibited relatively heterogeneous bands due to varying extents of glycosylation, which was also found by yang et al [ ] . the rbd protein showed a single band upon deglycosylation, confirming the heterogeneous bands resulting from glycosylation (fig. b) . besides, western blot results showed that rbd with different glycoforms can react with hace (fig. s ) . elisa binding to hace further validated the qualities of both rbd proteins (fig. c) . rbd antigens produced in this study were also used for panning against our in-house phage antibody libraries to retrieve high affinity binders [ ] . four groups of - week old female balb/c mice (n= ) were immunized subcutaneously at day and boosted at day with rbd-fc, rbd-fc in emulsion with mf (rbd-fc+mf ), and rbd in emulsion with mf (rbd+mf ) for each group at a dose of μg of proteins per mouse. the fourth group injection was of dpbs, which served as the negative control (fig. d) . on day (pre-immunization), day and day , mouse sera were collected and analyzed for rbd binding, pseudovirus and live virus neutralization, and cell-cell fusion inhibition. anti-rbd sera from each trial group were firstly evaluated for rbd binding as measured by elisa (fig. a) . the anti-rbd antibody in post immune mouse sera were also isotyped by anti-mouse igg, igm and iga antibodies, respectively (fig. s ) . the rbd antibody titers were calculated as the dilution folds that remain % of maximal binding signal (ec ). 
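as a rough illustration of how a binding titer of this kind can be read off a dilution series, the sketch below interpolates the dilution fold at which the elisa signal drops to a chosen fraction of its maximum. it is a simplified stand-in, not the authors' procedure: the text does not say how the curve was fitted, and linear interpolation on a log10 dilution axis is swapped in here for a fitted dose-response curve. the fraction is left as a parameter because the percentage is missing from this copy of the text, and the function name and data layout are hypothetical.

```python
import math


def titer_at_fraction(dilutions, signals, fraction):
    """Dilution fold at which the signal falls to fraction * max(signals).

    dilutions -- dilution folds (e.g. 100 for a 1:100 dilution)
    signals   -- elisa readings, one per dilution
    fraction  -- target fraction of the maximal signal, between 0 and 1
    Returns None if the signal never crosses the target.
    """
    if len(dilutions) != len(signals) or len(dilutions) < 2:
        raise ValueError("need two or more matched dilution/signal pairs")
    if not 0 < fraction < 1:
        raise ValueError("fraction must be between 0 and 1")
    target = fraction * max(signals)
    pairs = sorted(zip(dilutions, signals))  # increasing dilution fold
    for (d_lo, s_lo), (d_hi, s_hi) in zip(pairs, pairs[1:]):
        if s_lo >= target >= s_hi and s_lo != s_hi:
            # linear interpolation of the signal on a log10 dilution axis
            t = (s_lo - target) / (s_lo - s_hi)
            log_d = math.log10(d_lo) + t * (math.log10(d_hi) - math.log10(d_lo))
            return 10 ** log_d
    return None
```

for a series whose signal never falls to the target within the tested range, the function returns None rather than extrapolating, which mirrors reporting a titer as above the highest dilution tested.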
the recombinant rbd (his tag) was used as the detection antigen to avoid the interference of antihuman fc antibody titers in mice. the impact of his tag on the detection of rbd binding titer is marginal ( figure. s c) . results showed that for the sera collected at and days post immunization, the anti-rbd antibodies were mostly composed of the igg isotype with only marginally detectable igm (fig. s a) and no detectable iga isotype (fig. s b) . the low igm titer detected at day and may correlate to the fact that igm is typically rapidly mounted post infection (within one week) followed by isotype switching into igg isotype [ ] . the lack of iga titer may result from iga usually deriving from mucosa immunity, leading to low titers in sera [ ] . for the igg isotype antibodies, the pre-immunization sera showed no binding to rbd, while the day sera from all three rbd immunized groups exhibited varying extents of binding to the rbd. interestingly, the rbd binding titer elicited by the rbd+m group on day was much less than titers of the rbd-fc (titer : ) and rbd-fc+m (titer : ) groups, indicating the immune stimulation roles of fc fusion. however, for post-boosting sera on day , the rbd binding titers of rbd+mf group was significantly mounted to : . in contrast, on day the titers of rbd-fc (titer : ) and rbd-fc+mf (titer : ) groups were only improved by -fold compared to those of day , indicating distinct humoral response kinetics against rbd and rbd-fc immunogens. interestingly, although to a lesser extent than before receiving the booster, the rbd-fc and rbd-fc+mf groups exhibited . and folds higher titers, over the rbd+mf group (p< . ) respectively, assuring the enhancing role of fc in elicitation antibody response by the rbd immunogen. it is also intriguing that the rbd-fc+mf group exhibited slightly higher titers than the rbd-fc group, probably due to the adjuvant role of mf . 
interestingly, we also correlated the rbd binding titer to the full-length s ectodomain binding titer for the day sera. the elisa showed that the rbd binding sera also bound to s +s with similar titers (fig. s d) , which suggest that the rbd recognition antibody in the sera can also bind to full length s. hace blocking is a surrogate indicator for anti-sars-cov- antibody neutralizing activity. to preliminarily infer the neutralizing titers of post-immunization mouse serum, we performed the hace competitive elisa, in which serially diluted mouse sera in the presence of the biotinylated hace were added into rbd coated plates. bound hace was detected by the streptavidin-hrp secondary antibody. results showed the three rbd immunogen groups developed discernable hace competitive titers on day compared to the pbs control group; further significantly boosted to : , : , : for the rbd-fc, rbd-fc+mf , rbd+mf groups respectively on day (fig. b) . consistent with the above rbd binding titer, the rbd-fc and rbd-fc+mf groups sera showed . folds and . folds higher competitive titers respectively than the rbd+mf group (p< . ), supporting the role of fc in mounting neutralization titers. the competitive elisa results gave the specific hace blocking titers elicited by rbd immunogens, which presumably predict their neutralization activity [ ] . next we exploited the sars-cov- s pseudotyped hiv- to evaluate the neutralization activity of those anti-rbd sera. sars-cov- pseudovirus was packaged by co-transfecting hek t cells with pcdna . -s plasmid encoding codon-optimized full-length sars-cov- s protein and pnl - .luc.re plasmid containing luciferase expressing hiv- genome. serially diluted mouse sera were pre-incubated with pseudovirus followed by infection of t cells stably expressing hace ( t-ace ). as shown in fig. 
a , on day all the rbd immunized groups sera showed substantial % neutralizing antibody titers (nt ) compared to the pre-immune sera on day (titer : , : , : ), which were largely boosted to : , : , : for the rbd-fc, rbd-fc+mf and rbd+mf groups respectively. intriguingly, unlike the marginal differences for the rbd binding and hace competitive titers across the sera of the three rbd immunized groups on day , the pseudovirus neutralization titers were significantly distinct with the rbd-fc+mf group showing highest titers, . -fold higher than the rbd-fc group and . fold than the rbd+mf group (p< . ). in addition to the pseudovirus neutralization, we also evaluated the live virus neutralization potency of those mouse anti-rbd sera (day ) by using a microneutralization (mn) assay. in this assay, the cytopathic effect (cpe) of vero e was observed after days incubation with live virus, which was pre-mixed with the anti-rbd sera. the neutralization titer of mice serum (nt ) was expressed as the lowest dilution folds capable of completely preventing virus-induced cpe in % of the wells. consistently, the nt of serum in rbd-fc+mf group ( : ) was higher than those of the rbd-fc and rbd+mf groups ( : ) (fig. b) . these results clearly demonstrated that fc fusion could significantly augment the elicitation of neutralizing antibody titers by rbd immunogens, and the adjuvant mf can further stimulate the antigenicity of the rbd-fc fusion proteins. interestingly, we found that the pseudovirus neutralization titer positively correlated with the hace competition titer (fig. c) , demonstrating the utility of our hace competition elisa in predicting neutralization titers in convalescent plasma therapy and in detecting of the presence of neutralizing antibodies in serological tests during the covid pandemic. 
to further evaluate whether the anti-rbd sera could prevent sars-cov- s-mediated cell-cell fusion, we established a quantitative cell fusion assay using β-galactosidase (β-gal) as a reporter gene. we constructed t cell lines stably overexpressing sars-cov- s ( t-s) and hace ( t-ace ), respectively (fig. s ). in this assay, the t-s cells were infected with the t polymerase-expressing vtf - vaccinia virus, and the t-ace cells were infected with the vcb r vaccinia virus, which expresses β-gal under the control of the t promoter. β-gal is therefore expressed only after cell-cell fusion, which can be quantified by monitoring β-gal activity. serially diluted mouse sera were pre-mixed with infected t-s cells, followed by incubation with infected t-ace cells. as shown in fig. a , the pre-vaccination sera at day and the pbs control mouse sera did not inhibit cell-cell fusion. however, the day rbd-fc+mf sera showed obvious cell-cell fusion inhibition with a % fusion inhibition antibody titer (ic ) of : , which was . -fold higher than the inhibition titer of the rbd-fc group ( : ) and . -fold higher than that of the rbd+mf group ( : ) (p< . ). interestingly, in this assay, the rbd-fc and rbd+mf groups did not differ significantly (p> . ). we also observed a nearly perfect correlation of the cell-cell fusion inhibition titer with the pseudovirus neutralization titer (r = . , p< . , fig. b ), which may be attributed to their similar mechanisms of action. anti-rbd antibodies typically neutralize virus by blocking viral entry. for sars-cov- , virus entry and cell-cell fusion share similar mechanisms. both are initiated by the s protein binding to the receptor ace , followed by s subunit-triggered susceptibility to protease cleavage, which causes s dissociation and a conformational change of the s subunit.
then, the fusion peptide (fp) in s is exposed for anchoring into the host cell membrane, and the heptad repeats (hr and hr ) form the six-helix bundle, resulting in membrane fusion between the virus and the host cell [ ]. molecules, including antibodies, that interfere with any of the above processes can block viral entry as well as cell-cell fusion [ ]. based on these similar mechanisms, the inhibitory activity of antibodies against cell-cell fusion can be a highly relevant predictor of antibody neutralizing activity. this is further supported by our and others' sars-cov- neutralizing antibodies, which show both potent inhibition of s-mediated cell-cell fusion and neutralization of sars-cov- [ ]. although the two assays are highly correlated, they differ in the environments on the cell and viral surfaces, such as s protein conformation and density and the accessibility of proteases. because it correlates highly with virus neutralization, this method allows for high-throughput screening and is therefore well suited for the characterization of cell-cell fusion mediated by sars-cov- or other viruses. in addition, the assay can screen potential neutralizing antibodies quickly: it can be finished within one day and does not require biosafety level facilities, whereas virus neutralization assays typically require several days. this is of particular relevance in the context of the global pandemic, which urgently needs vaccines and candidate antibody drugs. from the mechanism, one can also envision that ace competition by rbd antibodies is sufficient, but not necessary, for virus neutralization and inhibition of cell-cell fusion, since antibodies that disturb entry steps other than ace /rbd binding can also neutralize virus and inhibit cell-cell fusion, as exemplified by the antibody d [ ].
in this regard, one pertinent result from this study is that we found the extent of the correlation of ace competition elisa titer with the neutralizing titer (r = . , p= . , fig. c ) and competition elisa titer with cell-cell fusion inhibition titer (r = . , p= . , fig. c ) were lower than that of neutralizing titer to cell-cell fusion inhibition titer (r = . , p< . , fig. b ). finally, we evaluated whether the anti-rbd mouse sera can enhance sars-cov- infection of fcγrii expressing k cells [ ] . the results showed that sars-cov- pseudovirus alone cannot infect the k cells (fig. s ) . in addition, treatment with serially diluted (ranging from : to : ) anti-rbd sera did not enhance sars-cov- pseudovirus infection, indicating that the anti-rbd sera may not promote ade. after washing, the binding was detected by hrp conjugated anti-mouse igg antibody. (b) ng of rbd were coated and -fold serially diluted mice serum were added in the presence of ~ nm biotinylated hace followed by pbst washing. for detection, streptavidin-hrp secondary antibody was used. experiments were performed in duplicate and the error bars denote ± sd, n = . statistical significance was defined as *: p< . . 
safety, tolerability, and immunogenicity of a recombinant adenovirus type- vectored covid- vaccine: a dose-escalation, open-label, nonrandomised, first-in-human trial
the spike protein of sars-cov--a target for vaccine and therapeutic development
a highly conserved cryptic epitope in the receptor binding domains of sars-cov- and sars-cov
immunization with sars coronavirus vaccines leads to pulmonary immunopathology on challenge with the sars virus
anti-spike igg causes severe acute lung injury by skewing macrophage responses during acute sars-cov infection
fc receptors in antibody-dependent enhancement of viral infections
the potential for antibody-dependent enhancement of sars-cov- infection: translational implications for vaccine development
molecular mechanism for antibody-dependent enhancement of coronavirus entry
tortoises, hares, and vaccines: a cautionary note for sars-cov- vaccine development
sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor
the secret life of ace as a receptor for the sars virus
immunoinformatics-aided identification of t cell and b cell epitopes in the surface glycoprotein of -ncov
implications of antibody-dependent enhancement of infection for sars-cov- countermeasures
receptor-binding domain as a target for developing sars vaccines
characterization of the receptor-binding domain (rbd) of novel coronavirus: implication for development of rbd protein as a viral attachment inhibitor and vaccine
antibody signature induced by sars-cov- spike protein immunogens in rabbits
a vaccine targeting the rbd of the s protein of sars-cov- induces protective immunity
immunization with the receptor-binding domain of sars-cov- elicits antibodies cross-neutralizing sars-cov- and sars-cov without antibody-dependent enhancement
fc-based recombinant henipavirus vaccines elicit broad neutralizing antibody responses in mice
vaccines with mf adjuvant expand the antibody repertoire to target protective sites of pandemic avian h n influenza virus
potent neutralization of sars-cov- by human antibody heavy-chain variable domains isolated from a large library with a new stable scaffold
antibody response of mice to lactate dehydrogenase-elevating virus during infection and immunization with inactivated virus
the effects of secretory iga in the mucosal immune system
a sars-cov- surrogate virus neutralization test based on antibody-mediated blockage of ace -spike protein-protein interaction
cell entry mechanisms of sars-cov-
a human monoclonal antibody blocking sars-cov- infection
dengue virus neutralization in cells expressing fc gamma receptors

inhibition of cell-cell fusion by mouse serum (a), and correlation analysis of the fusion inhibition assay with pseudovirus neutralization (b) and competitive elisa (c). (a) a β-galactosidase (β-gal) reporter gene-based quantitative cell-cell fusion assay was used, in which t polymerase-expressing t-s cells were pre-incubated with mouse serum and then mixed with t-ace cells expressing β-gal under the control of the t promoter. (b) correlation analysis between cell-cell fusion inhibition (ic ) and pseudo-neutralization antibody titers (nt ) for sera of day and day . (c) correlation analysis between cell-cell fusion inhibition (ic ) and competitive elisa (ec ) for sera of day and day . correlation and linear regression analyses were performed in graphpad prism using pearson's correlation coefficients. statistical significance was calculated using the two-tailed test.

conflict of interest statement. the authors declare no conflict of interest.

we would like to thank the members of the center for antibody therapeutics doncho zhelev, du-san baek, liyong zhang, and xiaojie chu for their helpful discussions. this work was supported by the university of pittsburgh medical center.

key: cord- - reu yz authors: reguera, juan; santiago, césar; mudgal, gaurav; ordoño, desiderio; enjuanes, luis; casasnovas, josé m.
title: structural bases of coronavirus attachment to host aminopeptidase n and its inhibition by neutralizing antibodies date: - - journal: plos pathog doi: . /journal.ppat. sha: doc_id: cord_uid: reu yz the coronaviruses (covs) are enveloped viruses of animals and humans associated mostly with enteric and respiratory diseases, such as the severe acute respiratory syndrome and – % of all common colds. a subset of covs uses the cell surface aminopeptidase n (apn), a membrane-bound metalloprotease, as a cell entry receptor. in these viruses, the envelope spike glycoprotein (s) mediates the attachment of the virus particles to apn and subsequent cell entry, which can be blocked by neutralizing antibodies. here we describe the crystal structures of the receptor-binding domains (rbds) of two closely related cov strains, transmissible gastroenteritis virus (tgev) and porcine respiratory cov (prcv), in complex with their receptor, porcine apn (papn), or with a neutralizing antibody. the data provide detailed information on the architecture of the dimeric papn ectodomain and its interaction with the cov s. we show that a protruding receptor-binding edge in the s determines virus-binding specificity for recessed glycan-containing surfaces in the membrane-distal region of the papn ectodomain. comparison of the rbds of tgev and prcv to those of other related covs, suggests that the conformation of the s receptor-binding region determines cell entry receptor specificity. moreover, the receptor-binding edge is a major antigenic determinant in the tgev envelope s that is targeted by neutralizing antibodies. our results provide a compelling view on cov cell entry and immune neutralization, and may aid the design of antivirals or cov vaccines. apn is also considered a target for cancer therapy and its structure, reported here, could facilitate the development of anti-cancer drugs. the coronaviridae is a large family of enveloped, plus-rna viruses. 
they are involved in respiratory, enteric, hepatic and neuronal infectious diseases in animals and humans that lead to important economic losses [ , ] , as well as to high mortality rates in severe acute respiratory syndrome cov (sars-cov) infections [ ] . the covs are a numerous group of coronaviridae. they have been clustered in the coronavirinae subfamily, which includes three approved genera, alpha-, betaand gammacoronavirus, as well as a tentative new genus, the deltacoronavirus [ ] . representative cov species in each genus are alphacoronavirus (comprising transmissible gastroenteritis virus (tgev), porcine respiratory cov (prcv) and related canine and feline covs), human coronavirus (hcov- e and hcov-nl , genus alphacoronavirus), murine coronavirus (including mouse hepatitis virus (mhv), genus betacoronavirus, cluster a), severe acute respiratory syndrome-related coronavirus (sars-related cov, genus betacoronavirus, cluster b), avian coronavirus (including infectious bronchitis virus (ibv), genus gammacoronavirus), and bulbul-cov (tentative genus deltacoronavirus) [ ] . cov particles display characteristic large surface projections or peplomers ( - nm) comprised of homotrimers of the spike glycoprotein (s), a type i membrane protein [ , ] . the peplomers have a globular portion connected by a protein stalk to the transmembrane domain [ ] . the globular region is formed by the n-terminal s region, whereas the stalk corresponds to the membrane-proximal s region, which mediates virus fusion to host cells and adopts a helical structure characteristic of class i virus fusion proteins [ ] . determinants of cov tropism locate at the s region [ , ] , which mediates attachment of cov particles to cell surface molecules, initiating virus entry into cells and infection. there is considerable variability in receptor usage among the covs. 
most alphacoronavirus such as tgev and hcov- e use apn [ , ] , whereas the related hcov-nl uses a distinct cell entry receptor, the human angiotensin converting enzyme (ace ) [ ] ; sars-cov also recognizes the ace receptor [ ] . sars and nl cov bind to common regions of the ace protein, although the structures of their receptor-binding domains (rbds) are quite distinct [ , ] . mhv uses the cell adhesion molecule ceacam a [ ] ; a recent crystal structure showed that the mhv rbd adopts a galectin-like fold [ ] . the use of alternative receptors that confer extended tropism has been described for sars-cov, mhv and tgev [ , ] . the mammalian apns (cd ) are type ii cell surface metalloproteases whose large glycosylated ectodomain has a zinc metal ion at the active site [ ] . apn is linked to many cell functions, leading it to be termed the ''moonlighting enzyme'' [ ] . animal models confirmed a role for this cell surface enzyme in angiogenesis [ ] . peptides and inhibitors that target apn showed a link between this protein and tumor growth and invasion [ , ] . apn is a target for cancer chemotherapies; drugs that bind this protein have been developed to treat tumors, some of which are in clinical trials [ ] . as mentioned above, apn is also a major cov cell entry receptor [ , , ] . cov recognition of apn is species-specific, and specificity is associated with n-linked glycosylations in the apn protein [ ] . cell tropism and immune neutralization have been extensively studied in some porcine alphacoronavirus, such as the enteropathogenic tgev and porcine respiratory cov (prcv), a nonenteropathogenic virus derived from tgev [ ] . both viruses use porcine apn (papn) for cell entry. the apn-binding domain in tgev, prcv and other alphacoronavirus locates at the c-terminal portion of the s region [ , , ] , which bears epitopes recognized by cov-neutralizing antibodies [ , , , ] . 
most tgev-neutralizing antibodies cluster at antigenic site a [ , ], comprised within the rbd at the s region ( figure a ) [ ]; the other antigenic sites defined in the tgev s region (b through d) are outside the rbd ( figure a ) [ ]. to date, there is no structural information available on antibody neutralization and apn recognition by alphacoronavirus. we determined crystal structures of the prcv rbd in complex with the papn ectodomain, and the tgev rbd in complex with the neutralizing monoclonal antibody (mab) af [ ]. the rbd adopts a b-barrel fold, with a distinct protruding tip engaged in papn recognition. the structures show how these porcine alphacoronaviruses recognize their cell entry papn receptor and how immune neutralization of these covs is achieved by antibody targeting of receptor-binding residues in the s protein. the mechanisms used by tgev to escape immune neutralization and the evolution of receptor recognition in the cov family are discussed.

the cell surface aminopeptidase n (apn), a membrane-bound metalloprotease target for cancer therapy, is a major cell entry receptor for coronaviruses (covs), agents that cause important respiratory and enteric diseases. in some covs, the virus envelope spike glycoprotein (s) mediates attachment of the virus particles to the host apn protein and cell entry, which is blocked by antibodies that prevent cov infections. the crystal structures of the s proteins of two porcine covs in complex with the pig apn (papn) or with a neutralizing antibody, shown here, reveal how some covs bind to their cell surface apn receptor and how antibodies prevent receptor binding and infection. the report uncovers a unique virus-receptor recognition mode that engages a glycan n-linked to the papn ectodomain, revealing structural determinants of the receptor-binding specificity in covs. neutralizing antibodies target viral residues used for binding to the apn receptor and entry into host cells, showing that efficient cov neutralization requires immune responses focused toward key receptor binding motifs in the virus envelope. these structural insights, together with the structure of the apn ectodomain, provide a compelling view of relevant cell membrane processes related to infectious diseases and cancer.

apn-binding domain and epitopes for neutralizing mabs overlap in tgev and prcv s proteins

figure . apn-binding domain and epitopes for neutralizing mabs overlap in tgev and prcv s proteins. a. scheme of tgev and prcv s proteins showing the s , s , transmembrane (t) and cytoplasmic (cy) regions. location of the c, b, d and a antigenic sites [ ], and the papn rbd (bar with n and c-terminal residues) [ ] are shown. length is indicated for mature s regions. b. a short, soluble s protein variant containing the tgev rbd region binds to cell surface papn. binding of a bivalent sa-fc fusion protein (sa) to bhk-papn (open histograms), alone (left) or in the presence of the site a-specific ac mab (right), as analyzed in facs. filled histograms correspond to an unrelated fc fusion protein. c. binding of site a-specific tgev-neutralizing mab to the sa protein. mab binding to plastic-bound sa protein, monitored by optical density (od). site a mabs are specific for the aa ( bb ), ab ( de ) and ac ( ac , af ) subsites [ ]. an anti-ha mab that binds to the ha tag in the sa protein was used as control. d. site a-specific mabs prevent sa protein binding to papn. binding of the sa protein to bhk-papn cells in panel b was monitored alone or in the presence of site a (shown in c) or site d-specific ( dg ) mab, and the binding ratio determined (see materials and methods). c, d. mean and standard deviation for three experiments. doi: . /journal.ppat. .g

apn receptor recognition and envelope s antigenicity are well documented in tgev and related prcv. the papn-binding domain was mapped within residues to of the mature tgev s polypeptide [ ], whereas tgev mab-resistant (mar) mutants defined four antigenic sites (c, b, d and a) [ , ] ( figure a). antigenic sites c and b are not present in the prcv s protein. antigenic site a determinants are located within the papn-binding domain at the c-terminal moiety of the tgev and prcv s regions ( figure a ) [ , ]. we recently reported the modular dissection of the n-terminal s region of tgev and prcv, and the preparation of soluble s length variants with single antigenic sites [ ]. we produced a recombinant short s protein fragment termed sa, which comprises only residues to of the tgev s protein, binds cell surface papn ( figure b ) and displays conformational epitopes for the three antigenic a subsites (aa, ab, and ac) ( figure c ). antibodies clustered at the aa ( bb ), ab ( de ) and ac ( af and ac ) subsites blocked binding of the soluble sa protein to papn ( figure d ). the sa protein therefore includes the papn-binding domain of tgev and epitopes for site a-neutralizing mabs.

we applied x-ray crystallography to s protein variants containing the rbd of the related tgev and prcv, and identified how these alphacoronaviruses bind to cell surface papn and how neutralizing antibodies inhibit this binding. we attempted crystallization of the soluble papn-binding sa protein derived from the tgev s, alone and in complex with several neutralizing mabs. crystals were prepared with the sa protein in complex with the fab fragment of the af mab [ ]; the structure of the complex was determined and refined using diffraction data extending to . Å resolution (materials and methods; table ). the asymmetric unit of the crystals contains two antibody-rbd complexes, one of which is shown in figure . residues pro to val of the tgev s protein, previously identified as the papn-binding domain ( figure a ) [ ], were well defined in the crystal structure.
they folded in a single domain structure, the rbd of tgev (figure a) . the rbd adopts a bbarrel fold formed by two b-sheets with five b-strands each (scheme in figure s a ). n-and c-terminal ends are on the same side of the domain (terminal side), which presumably lies close to other s protein domains; at the opposite side, two b-turns (b -b and b -b ) form the tip of the barrel (figure a) , where the mab binds to the rbd. the immunoglobulin (ig) variable domains of the mab heavy (v h ) and light (v l ) chains contact the b -b , b -b and b -b regions of the tgev rbd ( figure b ), burying a virus protein surface of , Å . the buried surface of the af mab is , Å , with equal contribution by the v h ( %) and v l ( %) ig domains. complementarity determining regions (cdr) of the antibody heavy (h ) and light (l and l ) chains, the n-terminus of the light chain and the c, c and c b-strands of the v h domain contact the viral rbd tip ( figure b ). the cdr-h of the af mab is relatively long, with two-residue insertion (tyr h and asp h ) relative to other homologous h loops in reported mab structures (table s ). the rbd b -b hairpin with tyr at its tip is at the center of the interacting surface and penetrates between the v l and v h ig domains of the af mab ( figure b and c). similar antibodyantigen recognition is described for some peptides and is common for small hapten molecules [ , ] . the rbd b -b region contributed % of the rbd surface buried by the af mab, and docked between the af mab variable domains ( figure b ). the b-turn is fully buried between the mab ig domains ( figure c ), forming a contact network with mab residues ( figure d ). the rbd residue tyr at the bottom of the pocket contacts mab residues trp h and tyr h , whereas its hydroxyl group is hydrogen bonded to the side chain of gln l and main chain carbonyl of tyr h ( figure c and d). 
these structural findings on af recognition of the rbd b -b region correlate with af mab binding to peptides (mkrsgygqpia ) that include this hairpin region [ ] . the rbd b -b and b -b regions are at the periphery of the epitope ( figure b ); their contribution to interaction with af is smaller than that of the b -b region, representing respectively , % and % of the rbd surface buried by the mab. they contact either the v l or v h ig domains ( figure b ). rbd residues leu and trp at the b -b loop contact the n-terminus, cdr-l and cdr-l of the v l domain, whereas the b -b loop contacts the long cdr-h loop ( figure b and c). to characterize cov attachment to its apn receptor, we attempted crystallization of the papn ectodomain in complex with tgev and prcv s protein variants comprising their rbds (materials and methods). crystals were obtained only with a mixture of a prcv s protein (s h) and the papn. using these crystals, we determined the structure of the prcv rbd-papn complex by molecular replacement using previously solved structures of the tgev rbd shown in figure ( % sequence identity) and of the papn ectodomain (materials and methods and table ). the asymmetric unit of the crystals contained two macromolecular rbd-papn complexes ( figure a ). the prcv rbd adopts a b-barrel fold like the tgev rbd ( figure s ). each papn molecule was engaged by the tip of a single prcv rbd molecule, which bears two exposed aromatic residues (tyr and trp) ( figure a , in red), and they bound to a membrane-distal region of the papn ectodomain ( figure a ). the rbd n-and cterminal ends and the remaining cov s are also distant from the papn, and are unlikely to contact the receptor molecule. based on a cryo-em structure of the sars-cov s [ ] , the rbd must be also at the viral-membrane distal side of the s and therefore, the receptor binding edge must be accessible for cov binding to the apn receptor. 
the papn is a type ii membrane protein and the n-terminal end of the ectodomain must be near the cell membrane ( figure a ). the n-terminal residues of the crystallized papn ectodomain are largely disordered in the structure and they might form a flexible region close to the cell membrane. the papn ectodomain is composed of four domains ( figure a ). domain i (orange) is made of b-strands, domain ii (yellow) adopts a thermolysin-like fold bearing a zinc ion at the catalytic site, domain iii (red) is a small b-barrel domain, and the c-terminal domain iv (green) is composed of alpha-helices (domain boundaries are shown in figure s ). the papn molecule structure is closely related to that of the human endoplasmic reticulum aminopeptidase- [ , ] (root-mean-square deviation of . Å for residues sharing % sequence identity, based on dali server). domain ii bearing the enzyme active site is the most related domain ( % identity), whereas domain iv is the most distinct ( % identity). the zinc ion is coordinated to conserved residues at the papn active site in domain ii ( figure s ). the active site conformation is similar to that of other aminopeptidases ( figure s ). the papn crystallized in complex with the prcv rbd had an open conformation [ , , ] , in which domain iv was , - Å from domains i and ii; this creates a central cavity in which the zinc ion at the catalytic site is highly accessible ( figure a ). the mammalian apns are cell surface metalloproteases that form membrane-bound dimers [ ] . the crystallized papn ectodomain also behaved as a dimer in solution ( figure s ). the papn dimeric assembly showed in figure a buried a large accessible surface (, Å ) in each monomer. the dimerization surface comprises residues spread across domain iv, which are distinct from those recognized by cov ( figure s ). similar dimeric assemblies were observed in two crystal structures determined for the papn ectodomain alone (not shown), crystallized using distinct conditions. 
the papn molecular assembly shown here might thus be representative of the dimer described for mammalian apn on membrane surfaces [ ] . in the crystals of the prcv rbd-papn complex, the rbd tip contacts a membrane-distal region of the papn ectodomain ( figure a ). the conformations of the receptor-binding loops (b -b and b -b ) at the tips of the two prcv b-barrel domains in the structure are identical ( figure s b ), suggesting very similar rbd-papn interactions in both complexes of the asymmetric unit. the virus-receptor interaction buried , Å of the virus protein, % of which corresponded to the b -b region ( figure b ) and % to the b -b turn ( figure c ). the size of the papn surface buried by the rbd was similar (, Å ), and included papn residues ranging from alpha helix (a ) to (a ) in domain iv, and a few domain ii residues ( figure s , table s ). the end of the papn helix a and helix a contacted the b -b region of the rbd ( figure b ). the tyr side chain (tyr in tgev), which protrudes at the b-turn in prcv and tgev rbds ( figure b and d) , is almost fully buried in the complex, locating between the first n-acetyl glucosamine (nag ) linked to papn asn , the end of helix a , and the first half of helix a ( figure b ). the hydroxyl group of the rbd tyr was hydrogen bonded to side chains of papn residues glu and trp , and contributed to virus-receptor binding specificity. the preceding rbd gly residue was at the papn proximal side of the b-turn, hydrogen bonded to the papn asn main chain; at the opposite side, the rbd gln side chain formed a network of hydrogen bond interactions with papn nag and asn side chain ( figure b ). the n-acetyl moiety of the glycan also interacted with rbd residues at the b and b strands ( figure b , table s ). the papn n-linked glycan and surrounding residues that contact the cov rbd b -b region in the structure were identified as one of the apn determinants of the cov host range [ ] . 
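The per-region shares of buried surface quoted above are simple ratios over per-residue buried areas. A minimal sketch of that bookkeeping follows, assuming per-residue buried areas (for example as exported from an interface analysis server) are already available as a mapping. The residue labels, region names and numeric values are hypothetical placeholders and are not taken from the structures described here.

```python
from collections import defaultdict


def region_share_of_buried_surface(buried_by_residue, region_of_residue):
    """Return the percentage of the total buried surface contributed by each region.

    buried_by_residue: mapping residue label -> buried area (same units throughout).
    region_of_residue: mapping residue label -> region name; residues without an
    assigned region are grouped under "other" (an assumption of this sketch).
    """
    totals = defaultdict(float)
    for residue, area in buried_by_residue.items():
        totals[region_of_residue.get(residue, "other")] += area
    grand_total = sum(totals.values())
    if grand_total == 0:
        raise ValueError("no buried surface to apportion")
    return {region: 100.0 * area / grand_total for region, area in totals.items()}


# Hypothetical input, for illustration only.
example_buried = {"res_a": 120.0, "res_b": 60.0, "res_c": 20.0}
example_regions = {"res_a": "turn", "res_b": "turn", "res_c": "loop"}
shares = region_share_of_buried_surface(example_buried, example_regions)
```

Grouping unassigned residues under a single label keeps the percentages summing to 100 even when only the receptor-binding regions are annotated.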
the second relevant virus-receptor interacting region engaged a b-turn at the beginning of the rbd b -b loop ( figure c and d). the unique rbd trp residue, which protrudes at the turn, docked in a papn cavity formed by the coils that precede helices a in domain iv and a in domain ii ( figure c and s ). the bulky side chain of the rbd trp residue packed against papn residues his and pro , and its imino group was hydrogen bonded to the main chain carbonyl of asn ( figure c ). the rbd trp as well as the rbd tyr at the b-barrel tip in tgev and prcv appear to be central residues in the virus-receptor interaction, as they contact many papn residues and also contribute to binding specificity by mediating polar interactions with the papn (table s ).
to confirm the contribution of the prcv or tgev rbd b-barrel tip in papn receptor recognition, we analyzed binding of wild type and mutant tgev rbd proteins to cell surface-expressed papn ( figure a ). mutations in the three regions (b -b , b -b and b -b ) that build the receptor binding edge of the b-barrel decreased rbd binding to papn, whereas mutations outside the receptor-binding region (v ngly) had no effect on receptor recognition. deletion of the papn asn glycosylation site also abolished tgev rbd binding to cell surface-expressed papn ( figure b ). deletion of the homologous glycan in feline apn similarly prevents cell infection by feline, canine and porcine covs, all of which share the glycan-binding tyr residue in the b -b turn (see below), whereas addition of this glycan to human apn is sufficient to render it a tgev receptor [ ] .
we determined the crystal structures of the related tgev and prcv rbds bound to two distinct ligands. the rbds adopt b-barrel structures with small differences in the ligand binding loops (figures s ). in the rbd, each of the two highly twisted b-sheets that build the b-barrel is formed by five b-strands ( figure a ).
the bent b-strand (b ) crosses both b-sheets and has a b-bulge at asn ( figure a , magenta). at one side of the b-barrel, all b-strands are antiparallel ( figure a, cyan), whereas on the opposite
a dali search of structural homologs showed the greatest similarity (z score of ) with the rbd of the ace receptor-binding hcov-nl (root-mean-square deviation of . Å for residues), the other alphacoronavirus rbd whose structure is known [ ] . the cores of the tgev and hcov-nl b-barrel domains are structurally similar, but the loops at the tips differ ( figure b and d). the tip region of the hcov-nl rbd is the ace receptor-binding edge and has a "bowl"-shaped conformation ( figure c ) that differs from the tgev rbd protruding edge. aromatic residues protrude from the b-turns at the tip of the b-barrel in tgev, whereas they are partially buried at the center of the "bowl"-shaped edge in hcov-nl ( figure b and c). the distinct rbd tip conformation in ace -binding hcov-nl and in apn-binding tgev might be a determinant of their distinct cell entry receptor specificities.
the degree of sequence identity in the rbd region among members in the species alphacoronavirus (, % identity) suggests a structure closely related to that of tgev, including conformation of the receptor-binding loops (b -b and b -b ) at the b-barrel tip ( figure ). therefore, tgev, prcv, ccov and fcov must recognize the apn receptor in similar fashion. in contrast, the receptor-binding loops at the tip appear to have a different conformation from tgev in the hcov- e rbd, which also binds to the apn. in this cov, the b -b region has two cys, as in hcov-nl , and lacks the apn-binding tyr residue in alphacoronavirus , although it preserves the two gly residues found in the tgev b-turn ( figure ). the b -b loop in hcov- e is markedly shorter than in tgev, but it also has a trp residue.
sequence identities between the rbd of tgev and ibv (gammacoronavirus) or the bulbul-cov (tentative deltacoronavirus) are relatively large (, %), and similarities are found mostly in b-strands and at the rbd c-terminal half ( figure ). these data indicate a conserved rbd fold between alphacoronavirus and gamma- or deltacoronavirus. there is less sequence similarity between the alpha- and betacoronavirus rbd regions (, %), which correlates with notable structural differences between their rbds [ , , ] . the rbds of the sars and mhv betacoronavirus adopt folds unrelated to the b-barrel shown for alphacoronavirus.
detail of the rbd b -b region with the exposed tyr residue interacting with the papn. side chains of rbd and papn residues engaged in the interaction are shown as sticks with carbons in magenta or green, respectively. nag glycan n-linked to papn asn is shown with carbons in yellow and the electron density map, determined without the glycan, shown as a blue mesh contoured at sigma. c. detail of the rbd b -b region with the trp residue interacting with the papn. in b and c, rbd residues are numbered following the tgev sequence shown in d, and intermolecular hydrogen bonds are shown as dashed red lines. d. structure-based sequence alignment of the tgev and prcv rbds. b-strands are marked with bars. tgev sequence is numbered. in red, af mab-(for tgev) and papn receptor-binding residues (for prcv) identified by the structures. residues absent in the rbd structures are in grey, and the thrombin recognition sequences at the end of recombinant porcine cov rbds are in lowercase letters. doi: . /journal.ppat. .g
the most tgev-neutralizing mabs, including af , recognize antigenic site a in the s protein, divided into the aa, ab and ac subsites [ ] . to further characterize site a antigenic determinants in the tgev rbd, we mutated rbd residues targeted by the af mab ( figure ) and some surrounding residues, and analyzed binding to other site a-specific mabs.
the antigenicity of residues in the b -b region, in the center of the epitope for af ( figure c ), was determined by monitoring mab binding to rbd mutants with tgev residue substitutions gly (g d), tyr (y a) and gly (g d) ( figure a ). all three substitutions abolished rbd binding by the ac subsite-specific mabs af and ac . the y a rbd mutant was recognized by aa-( bb ) and ab-specific ( de ) mabs ( figure a) , and mab de also bound the g d mutant. in contrast to the antibody binding profile of the y a rbd mutant, ala substitution of the tgev trp residue (w a), a papn-binding residue in the b -b loop at the periphery of the rbd epitope for af ( figure c ), did not affect binding by the ac-specific mabs ( af and ac ), whereas rbd recognition by bb and de mabs was greatly reduced ( figure a ). deletion of the b -b turn (lwd a mutant) reduced ac mab binding to the rbd markedly, with a partial reduction in af binding ( figure a ); this indicates that mab ac recognizes a broader epitope, which correlates with its higher tgev neutralization activity [ ] . replacement with ala of rbd residues thr and asn at the b -b hairpin, which contacts the af mab in the rbd- af structure ( figure c) , reduced binding by all site a-specific mab ( figure a ). this might be a result of a conformational effect induced on the nearby b -b region of the rbd. results for antibody binding to rbd mutants showed that site a epitopes extend across the tgev rbd tip, although there are some differences among the three a subsites ( figure b ). the epitopes recognized by aa-and ab-specific mabs bear the exposed tgev trp residue at the b -b loop, whereas epitopes for the acspecific mabs center on tyr in the b -b turn. none of the mab tested simultaneously targeted the two aromatic side chains (tyr and trp) at the tip of the tgev rbd that bind to the papn. 
subsite-specific residues defined by mar mutants (lys for aa, arg for ab and gly for ac) might be located at the periphery of their respective epitopes ( figure b ). ab and ac subsites appear to be relatively far apart, with the aa epitope in an intermediate position. the rbd tip, shown here as the papn-binding edge of the domain (figure ), is the main s protein determinant of antigenic site a, recognized by the most effective neutralizing antibodies of tgev and related cov infections [ , ] .
here we show how a group of covs attaches to the cell surface apn metalloprotease for entry into host cells, and how some cov-neutralizing antibodies prevent infection. the rbd-receptor complex structures determined for alphacoronavirus indicate that the conformation of the receptor binding edge in the envelope s proteins probably determines their receptor-binding specificity. the covs that bind apn analyzed here have protruding receptor-binding motifs that engage recessed surfaces on the receptor. this mode of receptor recognition is essentially opposite to that reported for cov binding to the ace receptor, where recessed receptor-binding motifs in the viral rbd cradle exposed surfaces of the ace ectodomain [ , ] . in the case of papn, an n-linked glycan is also engaged in the virus-receptor interaction. the inherent flexibility of this glycan might facilitate the initial contact of the cov tyr residue with apn amino acids, and subsequent virus-receptor interactions could lock the bound tyr between the glycan and an a-helix ( figure b ). the glycan n-linked to asn in papn is also conserved in canine and feline apn proteins ( figure s ), as are the viral s protein residues that interact with this glycan in the rbd b -b and the b -b regions ( figure ). this unique glycan-virus interaction must thus be conserved among the different covs in the species alphacoronavirus , in accordance with the glycan requirement reported for cell infection by ccov, fcov, and tgev/prcv [ ] .
the lack of this glycan in human apn ( figure s ) and the absence of the interacting tyr residue in the b -b region of hcov- e rbd ( figure ) imply distinct virus-apn local contacts in humans. as shown for the alphacoronavirus group, however, hcov- e probably has a protruding receptor-binding edge in the envelope s, responsible for its apn-binding specificity. the structure of the rbd- af complex, together with structure-guided rbd mutagenesis and mab binding data, demonstrated that the receptor-binding region is a major antigenic determinant in the envelope s protein of cov that bind apn. potent tgev-neutralizing antibodies, such as the ac mab [ ] , target key apn-binding residues in the s (figure ), preventing infection. data from antibody neutralization-resistant tgev mar mutants nonetheless show that some substitutions can be accommodated in the receptor-binding region of alphacoronavirus, which confer the ability to escape immune neutralization, while preserving the receptor-binding affinity necessary for cell entry [ , ] . our results thus demonstrate that the receptor-binding region in alphacoronavirus is under selective pressure from the immune system, as described for other viruses [ , , , ] . it is tempting to speculate that immune pressure on exposed receptor-binding residues in the cov s could lead to conformational changes in receptor-binding edges of cov rbds.
substituted residues at the rbd tip that contact papn in the prcv rbd-papn structure are shown in figure , except for the v ngly mutant, with a glycan at rbd position in the b -b b loop, outside the rbd tip (see figure d). b. tgev rbd binding to cell surface papn glycosylation mutants. relative binding of the sa protein and the anti-ha mab to ha-tagged papn proteins with (papn) or without the glycan linked to asn (n a and t v). mean and standard deviation for three experiments. doi: . /journal.ppat. .g
this would result either in changes in the apn-recognition mode observed with hcov- e and tgev, or in conformational changes in the rbd tip that lead to a receptor specificity switch for cell entry, as observed for hcov-nl [ ] . virus use of recessed binding regions, as for hcov-nl , is a well-defined strategy for hiding conserved receptor-binding residues from antibodies [ , ] . like hcov-nl , sars-cov uses a recessed, although broader ace -binding surface, which can accommodate mutations that permit cross-species receptor recognition [ ] . it remains to be understood why, despite major changes in the receptor-binding region, all these cov use metalloproteases as cell entry receptors.
in the course of our studies, we also determined the crystal structure of the cell surface apn, an important target for cancer therapies. the domain architecture of apn resembles that of related aminopeptidases [ , , ] . here we show a unique dimer configuration for the apn, mediated by its domain iv, the most divergent domain among m aminopeptidases [ ] . the implication of these structural findings for apn biology will require further biochemical analysis. knowledge of the structure is leading to research on the mechanism of action of numerous anti-tumor compounds that target mammalian apn [ ] ; these studies will be fundamental for improving drug specificity. the detailed view of the apn-cov interaction shown here might also lead to development of small molecules to block cov infection. we have identified the receptor-binding region as the major antigenic site in the alphacoronavirus envelope s, which could guide the design of immunogens that boost cov-neutralizing immune responses to key motifs for virus cell entry.
b-strands are marked (bars) above or beneath their sequences. tgev sequence is numbered. ace receptor-binding residues reported for hcov-nl [ ] , as well as papn receptor-binding residues for tgev (supplementary table s ) are colored as in c. residues absent in the rbd structures are in grey, and the thrombin recognition sequence at the end of the tgev rbd is in lowercase letters. doi: . /journal.ppat. .g
design of soluble s protein variants of tgev and prcv has been described [ ] . the sa protein containing the rbd of tgev was derived from the sc strain, and contains residues to of the tgev s, an n-terminal influenza hemagglutinin ha peptide, and either a flag mab epitope (monovalent sa-flag variant) or the human igg fc portion (bivalent sa-fc variant) at the c-terminal end. the engineered soluble papn contains residues to (ectodomain) of the cell surface protein fused to ha and flag tags at the n and c terminus, respectively [ ] . the soluble s protein crystallized in complex with the papn was derived from the prcv hol strain (s h in [ ] ), and contains the n-terminal residues of the prcv s protein and the same c-terminus as the tgev-derived sa protein [ ] . a recombinant membrane bound papn with an ha tag at the c-terminal end was engineered for cell surface expression. thrombin recognition sequences were introduced between the tags and the viral or papn protein sequences. proteins were produced in transiently transfected t or stably transfected cho-lec . . . (cho-lec) cells as described [ ] , and concentration in cell supernatants determined by elisa. proteins prepared in cho-lec cells were used in crystallization experiments. hybridoma cells secreting the tgev s mabs were grown in dmem supplemented with % fcs in roller bottles. proteins secreted to culture supernatants were initially purified by affinity chromatography. all protein samples were further purified by size exclusion chromatography in hepes-saline buffer ( mm hepes, mm nacl) ph . . the fab fragment of the af mab was prepared by papain digestion of the purified antibody. the reaction was terminated by the addition of e (sigma) and the fab fragment purified by size exclusion and ion exchange chromatography using hepes-saline buffer ph . .
the polypeptide chains of the ig variable domains of the af mab were determined by sequencing of their cdna prepared from reverse transcribed mrna purified from hybridoma cells. binding of anti-tgev s or -ha (control) mab to wild type and mutant sa proteins was tested in -well plates, using purified mab or hybridoma supernatants. the sa-fc fusion proteins in serum-free (opti-mem, invitrogen) cell supernatants were bound to plastic, and mab binding monitored by optical density (od nm ). at least four sa-fc protein concentrations ranging from to mg/ml were used in duplicate and average binding determined in each experiment. binding ratios were determined after correction for background binding. apn binding assays were also carried out with the sa-fc fusion protein comprising the tgev rbd. bhk-papn cells constitutively expressing cell surface papn were used for binding experiments comparing wild type and mutant rbds, whereas transiently transfected t cells were used for analysis of rbd binding to papn glycosylation mutants. binding was monitored as the percentage of stained cells with the fc fusion proteins and fitc labeled anti-fc antibodies by fluorescence-activated cell sorting (facs), as shown in figure b . the percentage of cells stained was determined for each protein sample and corrected for background staining. papn binding ratios for wild type and mutant rbd proteins shown in figure a were determined from the percentage of bhk-papn cells stained with same concentration of wild type and mutant sa-fc proteins. the binding ratios for wild type and mutant papn glycosylation mutants shown in figure b were determined from the percentage of sa-fc stained t cells expressing similar amounts of ha-tagged papn proteins. cell surface expression of the papn-ha protein was determined with the ha ac mab. the tgev rbd in complex with the af fab fragment was crystallized using the size exclusion-purified complex of a table ). 
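The background-corrected binding ratios described for the mab and papn binding assays above can be written out explicitly. The sketch below is one plausible reading of that calculation, not the authors' actual analysis script: each signal (optical density or percentage of stained cells) has the background subtracted, the mutant signal is expressed as a percentage of the wild type signal measured at the same protein concentration, and the ratios are averaged across concentrations. Clipping negative corrected signals to zero is an assumption of this sketch, and all numeric values are hypothetical.

```python
from statistics import mean


def background_corrected(signal, background):
    """Subtract background; negative results are clipped to zero (sketch assumption)."""
    return max(signal - background, 0.0)


def binding_ratio(mutant_signal, wild_type_signal, background):
    """Mutant binding as a percentage of wild type binding, after background correction."""
    wild_type = background_corrected(wild_type_signal, background)
    if wild_type == 0.0:
        raise ValueError("wild type signal does not exceed background")
    return 100.0 * background_corrected(mutant_signal, background) / wild_type


def mean_binding_ratio(paired_signals, background):
    """Average the ratio over (mutant, wild type) pairs, one pair per protein concentration."""
    return mean(binding_ratio(m, wt, background) for m, wt in paired_signals)


# Hypothetical readings at three protein concentrations, for illustration only.
example_pairs = [(0.6, 1.1), (0.35, 0.6), (0.2, 0.3)]
example_ratio = mean_binding_ratio(example_pairs, background=0.1)
```

Comparing mutant and wild type at matched concentrations, rather than pooling all readings first, keeps a single saturated well from dominating the ratio.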
crystallization of the papn ectodomain in complex with porcine cov s proteins was carried out with mixtures of the receptor protein and several tgev and prcv protein variants comprising the receptor-binding region (sa, s h and s h in [ ] ). crystals appeared only in trials performed with an equimolar mixture of papn and the s h protein derived from the prcv s at a final protein concentration of mg/ml, and with a crystallization solution of % peg- k, . m lithium sulfate and . m tris buffer ph . . crystals were transferred to crystallization solution containing % ethylene glycol and frozen for diffraction data collection at the id beamline (prcv rbd-papn in table ). the structure of the tgev rbd- af fab fragment was initially determined by the molecular replacement (mr) method using the phaser program [ ] , and two search models having either the variable or constant regions of the pdb id aif mab structure. the af fab model structure was built manually following electron density maps determined from the mr solution, after improvement with the dm program [ ] . the af fab structure was refined with the program phenix.refine [ ] , which provided an excellent electron density map for building residues to of the tgev s, as well as four residues of a thrombin recognition site at the c-terminus. final structure refinement of the complex was carried out with data extending to . Å resolution (statistics in table ). three cycles of solvent correction, refinement of individual coordinates and atomic displacement parameters combined with tls were applied in each step of structure refinement with phenix.refine, which was alternated with manual adjustment of the model to the electron density maps. all residues are in allowed regions of the ramachandran plot. sa protein residues included in the structure of the tgev rbd are shown in figure d . 
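The Ramachandran check mentioned above rests on backbone torsion angles, each of which is a dihedral angle defined by four consecutive atoms. As a minimal sketch of that geometry only (it is not the validation code used by any refinement program, and the coordinates are hypothetical), a dihedral can be computed from four points as follows.

```python
import math


def _sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])


def _dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def _scale(a, s):
    return (a[0] * s, a[1] * s, a[2] * s)


def dihedral_degrees(p0, p1, p2, p3):
    """Dihedral angle (degrees, in the range -180 to 180) about the p1-p2 bond."""
    b0 = _sub(p0, p1)
    b1 = _sub(p2, p1)
    b2 = _sub(p3, p2)
    length = math.sqrt(_dot(b1, b1))
    if length == 0.0:
        raise ValueError("central atoms coincide")
    axis = _scale(b1, 1.0 / length)
    # Project the outer bonds onto the plane perpendicular to the central bond.
    v = _sub(b0, _scale(axis, _dot(b0, axis)))
    w = _sub(b2, _scale(axis, _dot(b2, axis)))
    x = _dot(v, w)
    y = _dot(_cross(axis, v), w)
    return math.degrees(math.atan2(y, x))


# Hypothetical coordinates: a planar cis arrangement and a planar trans arrangement.
cis_angle = dihedral_degrees((1, 0, 0), (0, 0, 0), (0, 1, 0), (1, 1, 0))
trans_angle = dihedral_degrees((1, 0, 0), (0, 0, 0), (0, 1, 0), (-1, 1, 0))
```

For a protein backbone, phi would use the atoms C(i-1), N(i), CA(i), C(i) and psi the atoms N(i), CA(i), C(i), N(i+1); classifying the resulting pair as allowed or disallowed requires reference distributions that this sketch does not include.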
the structure of the prcv rbd-papn complex was resolved by the mr method using the papn structure determined alone (manuscript in preparation) and the tgev rbd structure as search models. mr solutions were obtained for the two papn molecules (chains a and b) of the asymmetric unit and for one rbd molecule (chain e). the three molecules were adjusted manually and refined with the phenix.refine program. the second rbd molecule (chain f) bound to papn molecule b was built manually into the electron density map. the residues nterminal to the prcv rbd in the s h protein were largely disordered or degraded during crystallization, and are absent in the structure. the complex structure was refined with the program phenix.refine applying solvent correction, ncs, refinement of individual coordinates and atomic displacement parameters combined with tls ( table ). the current model comprises residues to of the papn ectodomain with a zinc metal ion at the papn enzyme active site, and residues to of the prcv s, homologous to the tgev s residues to that defined the tgev rbd structure ( figure d ). all the residues are in allowed regions of the ramachandran plot. coordinates and structure factors have been deposited in the protein data bank with id codes f m (tgev rbd- af ) and f c (prcv rbd-papn). buried surfaces and residues at the molecular complex interfaces were determined with the pisa server (http://www. ebi.ac.uk/msd-srv/prot_int/pistart.html). only residues with at least % of their surface buried at interfaces in the two independent molecules of the crystal asymmetric units are shown. figure d was prepared with ligplot (http://www.ebi.ac.uk/ thornton-srv/software/ligplot/), figure a with ribbons [ ] and the other structure representations with pymol (pymol.org). structural alignments were carried out with modeller using a gap penalty of [ ] . accession numbers of the alphacoronavirus s proteins mentioned are q pkz (tgev), q (ccov), p (fcov), p figure . 
determinants of tgev s antigenic site a. a. binding of tgev-neutralizing, site a-specific mabs to rbd mutants. relative binding (%) of mutants to wild type sa protein is shown for tgev sspecific mabs (top; described in figure c ) and a control anti-ha antibody (see materials and methods). rbd regions in which mutations locate are shown (bottom; see also figure d ). mean and standard deviation of data from at least three experiments. b. antigenic site a in the tgev rbd and epitopes for antibodies. surface and ribbon representation of the rbd with the af contact regions colored as in figure b . three antibody-binding residues (tyr , trp and asn ) in the loops at the rbd tip, as well as tgev lys , arg and gly residues associated with aa, ab and ac subsites [ ] , respectively. lines indicate epitopes for mabs specific for each of the three antigenic subsites: aa in yellow, ab in green and ac in red. doi: . /journal.ppat. .g (hcov- ), q q s (hcov-nl ), b vdw (bulbul-cov) and q q p (ibv). the prcv hol s protein sequence is reported in reference [ ] . sequence identities among s proteins were determined with psiblast (http://www.ebi.ac.uk/tools/sss/ psiblast/). accession number for the papn protein is p . figure s structures of tgev and prcv rbds. a. secondary structure elements of the rbd structures. b-strands are shown with arrows and colored in blue and cyan, a b-bulge at the b-strand is shown in magenta, helix with a red cylinder, coils with black lines, and disulphide bonds with green lines. b. stereo view of the superimposed asymmetric unit rbd structures of tgev (blue and cyan), complex with the af mab, and of prcv (green and red), complex with the papn protein. view as in figures a and a . locations of n and c terminal ends are indicated in lowercase letters. (tif) figure s mammalian apn ectodomains. sequence alignment of the porcine, canine, feline and human apn proteins with conserved residues highlighted in red. 
secondary structure elements of the papn structure determined in complex with the rbd of prcv are shown above the sequences. cov-binding residues and those engaged in papn dimerization are highlighted in blue and green, respectively, whereas those at the papn catalytic site are in yellow. residues coordinating the zinc ion are marked with an asterisk, and the n-linked glycosylation site recognized by cov is marked with a triangle at the papn asn . the beginning of each of the four apn domains is indicated. (tif) figure s aminopeptidases active site. side chains of residues at the catalytic site of four structurally aligned zinc aminopeptidases based on domain ii are shown with stick representation, and with the coordinated zinc ion as a cyan sphere. human erap- (pdb code xdt) is shown in green, aminopeptidase n of e. coli (pdb code hpt) in magenta, aminopeptidase n of neisseria meningitidis (pdb code gtq) in blue, and papn in yellow. the glutamic acid located in the gamen motif is labeled in blue and those located at the conserved hexxhx e motif are in red (sequence in figure s ). (tif) figure s dimerization of the papn ectodomain in solution. size exclusion chromatography of the soluble papn ectodomain. continuous line shows optical density (od) at nm for the elution volume. papn protein was run through a superdex / column (ge healthcare) with hepessaline buffer ph . . exclusion volume and size (kda) of molecular weight markers are indicated. determined molecular weight for the single recombinant glycosylated papn ectodomain is about kda, whereas the protein elutes with a volume corresponding to , kda. (tif) table s sequence of homologous cdr-h loops in known mab structures. sequence of homologous heavy chain cdr-h loops to that of the af mab, identified by a blast search among protein structures, whose pdb codes are shown. (tif) table s intermolecular contacts in the prcv rbd-papn complex structure. 
rbd and papn residues in close contact (# Å ) in the two complexes of the crystal asymmetric unit, computed with the program ncont [ ] . rbd residues from the b -b , b -b and b -b regions at the tip of the bbarrel domain are shown, with those engaged in hydrogen bonding in red. tgev/prcv numbering is given for the rbd residues. (tif) the molecular biology of coronaviruses encyclopedia of virology coronaviruses post-sars: update on replication and pathogenesis virus taxonomy: ninth report of the international committee on taxonomy of viruses assembly of coronavirus spike protein into trimers and its role in epitope expression architecture of the sars coronavirus prefusion spike the coronavirus spike protein is a class i virus fusion protein: structural and functional characterization of the fusion core complex crystal structure of mouse coronavirus receptor-binding domain complexed with its murine receptor human aminopeptidase n is a receptor for human coronavirus e aminopeptidase n is a major receptor for the entero-pathogenic coronavirus tgev crystal structure of nl respiratory coronavirus receptor-binding domain complexed with its human receptor angiotensinconverting enzyme is a functional receptor for the sars coronavirus structure of sars coronavirus spike receptor-binding domain complexed with receptor cloning of the mouse hepatitis virus (mhv) receptor: expression in human and hamster cell lines confers susceptibility to mhv the moonlighting enzyme cd : old and new functions to target impaired angiogenesis in aminopeptidase n-null mice the neovasculature homing motif ngr: more than meets the eye novel aminopeptidase n (apn/cd ) inhibitor f can suppress invasion of hepatocellular carcinoma cells as well as angiogenesis aminopeptidase n (cd ) as a target for cancer chemotherapy mutational analysis of aminopeptidase n, a receptor for several group coronaviruses, identifies key determinants of viral host range genetic evolution and tropism of transmissible 
gastroenteritis coronaviruses major receptor-binding and neutralization determinants are located within the same domain of the transmissible gastroenteritis virus (coronavirus) spike protein identification of a receptor-binding domain of the spike glycoprotein of human coronavirus hcov- e residues involved in the antigenic sites of transmissible gastroenteritis coronavirus s glycoprotein mechanisms of transmissible gastroenteritis coronavirus neutralization four major antigenic sites of the coronavirus transmissible gastroenteritis virus are located on the amino-terminal half of spike glycoprotein s antigenic modules in the n-terminal s region of the transmissible gastroenteritis virus spike protein crystal structures of an antibody to a peptide and its complex with peptide antigen at . a crystal structures of a quorum-quenching antibody crystal structures of the endoplasmic reticulum aminopeptidase- (erap ) reveal the molecular basis for n-terminal peptide trimming structural basis for antigenic peptide precursor processing by the endoplasmic reticulum aminopeptidase erap structure of aminopeptidase n from escherichia coli suggests a compartmentalized, gated active site reconstitution of purified amphiphilic pig intestinal microvillus aminopeptidase. 
mode of membrane insertion and morphology the canyon hypothesis: hiding the host cell receptor attachment site on a viral surface from immune surveillance evolution subverting essentiality: dispensability of the cell attachment arg-gly-asp motif in multiply passaged foot-and-mouth disease virus structural basis of immune evasion at the site of cd attachment on hiv- gp structure of the measles virus hemagglutinin bound to the cd receptor pushing the boundaries of molecular replacement with maximum likelihood the ccp suite: programs for protein crystallography phenix: a comprehensive python-based system for macromolecular structure solution ribbon models of macromolecules comparative protein modelling by satisfaction of spatial restrains
we thank the esrf for provision of synchrotron radiation facilities through bag-madrid projects, as well as the swiss-sls facility, n. cubells for technical help and c. mark for editorial assistance.
key: cord- -mywhe w authors: clausen, thomas mandel; sandoval, daniel r.; spliid, charlotte b.; pihl, jessica; perrett, hailee r.; painter, chelsea d.; narayanan, anoop; majowicz, sydney a.; kwong, elizabeth m.; mcvicar, rachael n.; thacker, bryan e.; glass, charles a.; yang, zhang; torres, jonathan l.; golden, gregory j.; bartels, phillip l.; porell, ryan; garretson, aaron f.; laubach, logan; feldman, jared; yin, xin; pu, yuan; hauser, blake; caradonna, timothy m.; kellman, benjamin p.; martino, cameron; gordts, philip l.s.m.; chanda, sumit k.; schmidt, aaron g.; godula, kamil; leibel, sandra l.; jose, joyce; corbett, kevin d.; ward, andrew b.; carlin, aaron f.; esko, jeffrey d. title: sars-cov- infection depends on cellular heparan sulfate and ace date: - - journal: cell doi: . /j.cell. . . sha: doc_id: cord_uid: mywhe w
we show that sars-cov- spike protein interacts with both cellular heparan sulfate and angiotensin converting enzyme (ace ) through its receptor binding domain (rbd).
docking studies suggest a heparin/heparan sulfate-binding site adjacent to the ace binding site. both ace and heparin can bind independently to spike protein in vitro and a ternary complex can be generated using heparin as a scaffold. electron micrographs of spike protein suggest that heparin enhances the open conformation of the rbd that binds ace . on cells, spike protein binding depends on both heparan sulfate and ace . unfractionated heparin, non-anticoagulant heparin, heparin lyases, and lung heparan sulfate potently block spike protein binding and/or infection by pseudotyped virus and authentic sars-cov- virus. we suggest a model in which viral attachment and infection involve heparan sulfate-dependent enhancement of binding to ace . manipulation of heparan sulfate or inhibition of viral adhesion by exogenous heparin presents new therapeutic opportunities. the covid- pandemic, caused by the novel respiratory coronavirus (sars-cov- ), has swept across the world, resulting in serious clinical morbidities and mortality, as well as widespread disruption to all aspects of society. as of september , , the virus has spread to countries, causing more than . million confirmed infections and at least , deaths (world health organization). current isolation/social distancing strategies seek to flatten the infection curve to avoid overwhelming hospitals and to give the medical establishment and pharmaceutical companies time to develop and test antiviral drugs and vaccines. currently, only one antiviral agent, remdesivir, has been approved for adult covid- patients (beigel et al., ) and vaccines may be - months away. understanding the mechanism of sars-cov- infection could reveal other targets to interfere with viral infection and spread. the glycocalyx is a complex mixture of glycans and glycoconjugates surrounding all cells.
given its location, viruses and other infectious organisms must pass through the glycocalyx to engage receptors thought to mediate viral entry into host cells. many viral pathogens, including influenza virus, herpes simplex virus, human immunodeficiency virus, and different coronaviruses (sars-cov- and mers-cov), have evolved to utilize glycans as attachment factors, which facilitate the initial interaction with host cells (cagno et al., ; koehler et al., ; stencel-baerenwald et al., ). several viruses interact with sialic acids, which are located on the ends of glycans found in glycolipids and glycoproteins. other viruses interact with heparan sulfate (hs) (milewska et al., ), a highly negatively charged linear polysaccharide that is attached to a small set of membrane or extracellular matrix proteoglycans (lindahl et al., ). in general, glycan-binding domains on membrane proteins of the virion envelope mediate initial attachment of virions to glycan receptors. attachment in this way can lead to the engagement of protein receptors on the host plasma membrane that facilitate membrane fusion or engulfment and internalization of the virion. like other macromolecules, hs can be divided into subunits, which are operationally defined as disaccharides based on the ability of bacterial enzymes or nitrous acid to cleave the chain into disaccharide units (esko and selleck, ). the basic disaccharide subunit consists of α - linked d-glucuronic acid (glca) and α - linked n-acetyl-d-glucosamine (glcnac), which undergo various modifications by sulfation and epimerization as the copolymer assembles on a limited number of membrane and extracellular matrix proteins (only heparan sulfate proteoglycans are known) (lindahl et al., ). the variable length of the modified domains and their pattern of sulfation create unique motifs with which hs-binding proteins interact (xu and esko, ).
different tissues and cell types vary in the structure of hs, and hs structure can vary between individuals and with age (de agostini et al., ; feyzi et al., ; han et al., ; ledin et al., ; vongchan et al., ; warda et al., ; wei et al., ). these differences in hs composition may contribute to the tissue tropism and/or host susceptibility to infection by viruses and other pathogens. in this report, we show that the ectodomain of the sars-cov- spike (s) protein interacts with cell surface hs through the receptor binding domain (rbd) in the s subunit. binding of heparin to sars-cov- s protein shifts the structure to favor the rbd open conformation that binds ace . spike binding to cells requires engagement of both cellular hs and ace , suggesting that hs acts as a coreceptor priming the spike for ace interaction. therapeutic unfractionated heparin (ufh), non-anticoagulant heparin and hs derived from human lung and other tissues block binding. ufh and heparin lyases also block infection of cells by s protein pseudotyped virus and authentic sars-cov- . these findings identify cellular hs as a necessary co-factor for sars-cov- infection and emphasize the potential for targeting s protein-hs interactions to attenuate virus infection. the trimeric s proteins from sars-cov- and sars-cov- viruses are thought to engage human ace with one or more rbd in an "open" active conformation (fig. a) (kirchdoerfer et al., ; walls et al., ; wrapp et al., ). adjacent to the ace binding site and exposed in the rbd lies a group of positively-charged amino acid residues that represents a potential site that could interact with heparin or heparan sulfate (fig. a and suppl. fig. s ). we calculated an electrostatic potential map of the rbd (from pdb id m (yan et al., )), which revealed an extended electropositive surface with dimensions and turns/loops consistent with a heparin-binding site (fig. b) (xu and esko, ).
docking studies using a tetrasaccharide (dp ) fragment derived from heparin demonstrated preferred interactions with this electropositive surface, which based on its dimensions could accommodate a chain of up to monosaccharides (fig. b and c). evaluation of heparin-protein contacts and energy contributions using the molecular operating environment (moe) software suggested strong interactions with the positively charged amino acids r , r , k , r and possibly r (figs. a, d, and e). other amino acids, notably f , s , n , g , y , and y , could coordinate the oligosaccharide through hydrogen bonds and hydrophobic interactions. notably, the putative binding surface for oligosaccharides is adjacent to, but separate from, the ace binding site, suggesting that a single rbd could simultaneously bind both cell surface hs and the ace protein receptor. the putative hs binding site is partially obstructed in the "closed" inactive rbd conformation, while fully exposed in the open state (suppl. fig. s ). the amino acid sequence of s protein rbd of sars-cov- s is % identical to the rbd of sars-cov- s (fig. f), and these domains are highly similar in structure with an overall cα r.m.s.d. of . Å (fig. g). however, an electrostatic potential map of the sars-cov- s rbd does not show an electropositive surface like that observed in sars-cov- (fig. h). most of the positively charged residues comprising this surface are conserved between the two proteins, with the exception of sars-cov- k which is a threonine in sars-cov- (fig. f). additionally, the other amino acid residues predicted to coordinate with the oligosaccharide are conserved with the exception of asn in sars-cov- , which is a negatively charged glutamate residue in sars-cov- . sars-cov- has been shown to interact with cellular hs in addition to its entry receptors ace and transmembrane protease, serine (tmprss ) (lang et al., ).
our analysis suggests that the putative heparin-binding site in sars-cov- s may mediate an enhanced interaction with heparin or hs compared to sars-cov- , and that this change evolved through as few as two amino acid substitutions, thr lys and glu asn. to test experimentally if the sars-cov- s protein interacts with heparin/hs, recombinant ectodomain and rbd proteins were prepared and characterized. initial studies encountered difficulty in stabilizing the s ectodomain protein, a problem that was resolved by raising the concentration of nacl to . m in hepes buffer. under these conditions, the protein could be stored at room temperature, °c or at - °c for at least two weeks. sds-page showed that each protein was ~ % pure. recombinant s ectodomain and rbd proteins were applied to a column of heparin-sepharose. elution with a gradient of sodium chloride showed that the rbd eluted at ~ . m nacl, with a shoulder that eluted with higher salt (fig. a). recombinant s ectodomain also bound to heparin-sepharose, but it eluted across a broader concentration of nacl. the elution profiles suggest that the preparations contained a population of molecules that bind to heparin, but that some heterogeneity in affinity for heparin occurs, which may reflect differences in glycosylation, oligomerization or the number of binding sites in the open conformation. the rbd protein from sars-cov- also bound in a saturable manner to heparin-bsa immobilized on a plate (fig. b). the rbd domain from sars-cov- showed significantly reduced binding to heparin-bsa and a higher k d value ( nm [ % c.i.; - nm] for sars-cov- rbd vs. nm [ % c.i. - nm] for sars-cov- rbd), in accordance with the difference in electropositive potential in the proposed hs binding regions (fig. h). a monomeric form of sars-cov- s ectodomain protein also bound in a saturable manner to heparin immobilized on a plate (suppl. fig. s a).
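The saturable binding and k d values reported for the plate assays are the kind of result usually summarised with a one-site binding model, signal = bmax × [protein] / (k d + [protein]). The following is only a minimal sketch of such a fit, not the authors' analysis: the source does not say which software or model was used, and the function names, the grid-search strategy and the titration values below are all hypothetical.

```python
def one_site_binding(conc, bmax, kd):
    """Signal predicted by a one-site saturable binding model."""
    return bmax * conc / (kd + conc)


def fit_one_site(concs, signals, kd_min=1e-3, kd_max=1e4, steps=2000):
    """Least-squares fit of (bmax, kd) by scanning kd on a log-spaced grid.

    For a fixed kd the model is linear in bmax, so the best bmax has a
    closed form and only kd needs a one-dimensional search.
    """
    best = None
    for i in range(steps + 1):
        kd = kd_min * (kd_max / kd_min) ** (i / steps)
        x = [c / (kd + c) for c in concs]
        bmax = sum(xi * yi for xi, yi in zip(x, signals)) / sum(xi * xi for xi in x)
        sse = sum((yi - bmax * xi) ** 2 for xi, yi in zip(x, signals))
        if best is None or sse < best[0]:
            best = (sse, bmax, kd)
    _, bmax, kd = best
    return bmax, kd


# Hypothetical, noise-free titration: concentrations in nM, signal in
# arbitrary absorbance units, generated with bmax = 2.0 and kd = 50 nM.
concs = [1, 3, 10, 30, 100, 300]
signals = [one_site_binding(c, 2.0, 50.0) for c in concs]
bmax_fit, kd_fit = fit_one_site(concs, signals)
```

A lower fitted k d corresponds to tighter binding, which is how the comparison between the two rbd proteins above should be read. Confidence intervals such as those quoted in the text would need a proper nonlinear regression routine and replicate data, which this sketch omits.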
the trimeric protein bound to heparin-bsa with an apparent k d value of . nm [ % c.i. . - . nm] (fig. c). binding of recombinant s ectodomain, mutated either to lock the rbds into a closed conformation (mut ) or to favor an open conformation (mut ), showed that the heparin binding site in the rbd domain is accessible in both conformations (fig. d). however, the k d value for mut is lower ( . nm [ % c.i. . - . nm] vs. . nm [ % c.i. . - . nm] for mut ), which is in line with the partial obstruction of the site in the closed conformation (suppl. fig. s ). as expected, only trimer with an open rbd conformation bound to ace (fig. e). in contrast to spike protein, ace did not bind to heparin-bsa (fig. c). ace also had no effect on binding of s protein to heparin-bsa at all concentrations that were tested (fig. c, inset). biotinylated ace bound to immobilized s protein (suppl. fig. s b) and a ternary complex of heparin, ace and s protein could be demonstrated by titration of s protein bound to immobilized heparin-bsa with ace (fig. f). binding of ace under these conditions increased in proportion to the amount of s protein bound to the heparin-bsa. collectively, these findings show that (i) spike protein can engage both heparin and ace simultaneously and (ii) the heparin binding site is somewhat occluded in the closed conformation, but it can still bind heparin, albeit with reduced affinity. the simultaneous binding of ace to spike protein and heparin suggested the possibility that heparin binding might affect the conformation of the rbd, possibly increasing the open conformation that can bind ace . to explore this possibility, spike protein was mixed with ace ( -fold molar ratio) with or without dp oligosaccharides derived from heparin ( -fold molar ratio).
the samples were then stained and analyzed by transmission electron microscopy, and the images were deconvoluted and sorted into d reconstructions to determine the number of trimers with , , , or bound ace (fig. g-h and suppl. fig. s c-d). the different populations were counted and the percentage of particles belonging to each d class was calculated. two time points were evaluated after mixing ace and trimeric s: at min , and , particles were analyzed in the absence or presence of dp oligosaccharides, respectively; at min, , and , particles were analyzed in the absence or presence of dp oligosaccharides, respectively. at both time points, the presence of dp increased the total amount of ace protein bound to spike. after minutes in the absence of dp very few of the trimers had conformations with or bound ace ( % each), whereas the inclusion of dp oligosaccharides greatly increased the proportion of trimers bearing one ( %) or two ( %) ace , with a proportional drop in the unbound conformers from % in the absence of heparin to % in its presence (fig. g). extending the incubation to minutes resulted in a mixture of trimers containing ( %), ( %) and ace ( %) in the absence of heparin. inclusion of dp further increased the proportion of bound spike trimers bearing ( %), and ( %) ace (fig. h). the imaging studies suggest that, under these experimental conditions, heparin may stabilize the ace interaction, increasing the proportion of spike bound to ace as well as the occupancy of individual spikes. the sars-cov- spike protein depends on cellular heparan sulfate for cell binding. to extend these studies to hs on the surface of cells, s ectodomain protein was added to human h cells, an adenocarcinoma cell line derived from type alveolar cells (fig. a). spike ectodomains bound to h cells, with half-maximal binding achieved at ~ nm.
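The electron microscopy analysis above reduces to simple bookkeeping: particles are counted per class (number of ace bound per trimer) and each class is expressed as a percentage of the total. A minimal sketch of that arithmetic follows. The particle counts are invented for illustration and are not the values from the study, and the mean-occupancy summary is an extra statistic added here, not one reported in the text.

```python
def class_percentages(counts):
    """Percentage of particles in each class.

    `counts` maps the number of bound ace molecules per trimer to the
    number of particles assigned to that class.
    """
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no particles counted")
    return {k: 100.0 * n / total for k, n in counts.items()}


def mean_ace_per_trimer(counts):
    """Average number of bound ace molecules per spike trimer."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no particles counted")
    return sum(k * n for k, n in counts.items()) / total


# Hypothetical particle counts, without and with heparin oligosaccharide.
counts_without = {0: 800, 1: 150, 2: 50, 3: 0}
counts_with = {0: 400, 1: 400, 2: 150, 3: 50}

pct_without = class_percentages(counts_without)
pct_with = class_percentages(counts_with)
occupancy_without = mean_ace_per_trimer(counts_without)
occupancy_with = mean_ace_per_trimer(counts_with)
```

Reporting both the class percentages and a mean occupancy separates the two effects described above: a larger fraction of trimers with any ace bound, and more ace per bound trimer.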
treatment of the cells with a mixture of heparin lyases (hsase), which degrades cell surface hs, dramatically reduced binding (fig. a). the s ectodomain also bound to human a cells, another type alveolar adenocarcinoma line, as well as human hepatoma hep b cells (fig. b). removal of hs by enzymatic treatment dramatically reduced binding in both of these cell lines as well (fig. b). recombinant rbd protein also bound to all three cell lines dependent on hs (fig. c). a melanoma cell line, a , was tested independently and also showed hs dependent binding (fig. d). the extent of binding across the four cell lines varied ~ -fold. this variation was not due to differences in hs expression as illustrated by staining of cell surface hs with mab e , which recognizes a common epitope in hs. we also measured binding of the s ectodomain and rbd proteins to a library of mutant hep b cells, carrying crispr/cas induced mutations in biosynthetic enzymes essential for synthesizing hs (anower et al., ). inactivation of ext , a subunit of the copolymerase required for synthesis of the backbone of hs, abolished binding to a greater extent than enzymatic removal of the chains with hsases (fig. f and suppl. fig. s ), suggesting that the hsase treatment may underestimate the dependence on hs. targeting ndst , a glcnac n-deacetylase-n-sulfotransferase that n-deacetylates and n-sulfates n-acetylglucosamine residues, and hs st and hs st , which introduce sulfate groups in the c position of glucosamine residues, significantly reduced binding (fig. f and suppl. fig. s ). although experiments with other sulfotransferases have not yet been done, the data suggest that the pattern of sulfation of hs affects binding to s and rbd. to further examine how variation in hs structure affects binding, we isolated hs from human kidney, liver, lung and tonsil.
the samples were depolymerized into disaccharides by treatment with hsases, and the disaccharides were then analyzed by lc-ms (experimental methods). the disaccharide analysis showed that lung hs has a larger proportion of n-deacetylated and n-sulfated glucosamine residues (grey bars) and more -o-sulfated uronic acids (green bars) than hs preparations from the other tissues (fig. a). the different hs preparations also varied in their ability to block binding of rbd to h cells (fig. b). interestingly, hs isolated from lung was more potent than kidney and liver hs, consistent with the greater degree of sulfation of hs from this organ (suppl. table ). hs from tonsil was as potent as hs from lung, but the overall extent of sulfation was not as great, supporting the notion that the patterning of the sulfated domains in the chains may affect binding. unfractionated heparin is derived from porcine mucosa and possesses potent anticoagulant activity due to the presence of a pentasaccharide sequence containing a crucial -o-sulfated n-sulfoglucosamine unit, which confers high affinity binding to antithrombin. heparin is also very highly sulfated compared to hs with an average negative charge of - . per disaccharide (the overall negative charge density of typical hs is - . to - . per disaccharide). mst cells, which were derived from a murine mastocytoma, make heparin-like hs that lacks the key -o-sulfate group and anticoagulant activity (gasimli et al., ; montgomery et al., ). the anticoagulant properties of heparin can also be removed by periodate oxidation, which oxidizes the vicinal hydroxyl groups in the uronic acids, resulting in what is called "split-glycol" heparin (casu et al., ). all of these agents significantly inhibited binding of the s protein to h and a cells (fig. c and d), yielding ic values in the range of . - . µg/ml (suppl. table ).
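Inhibitory potencies such as those quoted above are read off a dose-response curve as the inhibitor concentration that reduces binding to half of the uninhibited level. The sketch below estimates that midpoint by linear interpolation on a log-concentration axis. It illustrates the idea only: the source does not describe how its values were computed (a four-parameter logistic fit is the more common choice), and the function name and data points are hypothetical.

```python
import math


def ic50_by_interpolation(concs, responses):
    """Estimate the half-maximal inhibitory concentration.

    `concs` are inhibitor concentrations in ascending order (> 0) and
    `responses` are binding signals as a percentage of the uninhibited
    control. Returns None if the curve never crosses 50 %.
    """
    pairs = list(zip(concs, responses))
    for (c_lo, r_lo), (c_hi, r_hi) in zip(pairs, pairs[1:]):
        if r_lo >= 50.0 >= r_hi and r_lo != r_hi:
            # Fraction of the way from c_lo to c_hi (on a log axis)
            # at which the response equals 50 %.
            frac = (r_lo - 50.0) / (r_lo - r_hi)
            log_ic50 = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_ic50
    return None


# Hypothetical inhibition curve: concentration in µg/ml, binding in %.
concs = [0.01, 0.1, 1.0, 10.0]
responses = [95.0, 70.0, 30.0, 5.0]
ic50 = ic50_by_interpolation(concs, responses)
```

Interpolating in log space matches how dilution series are spaced and plotted; a weak inhibitor whose curve never reaches 50 % within the tested range simply returns None rather than an extrapolated value.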
interestingly, the lack of -o-sulfation, crucial for the anticoagulant activity of heparin, had little effect on its inhibition of s binding. in contrast, cho cell hs (containing . sulfates per disaccharide) only weakly inhibited binding (ic values of and µg/ml for a and h , respectively) (suppl. table ). these data suggest that inhibition by heparinoids is most likely charge dependent and independent of anticoagulant activity per se. the experiments shown in fig. g-h indicate that binding of heparin to spike protein can increase binding to ace . to explore if hs, ace and spike interact at the cell surface, we investigated the impact of ace expression on s protein cell binding. initial attempts were made to measure ace levels by western blotting or flow cytometry with different mabs and polyclonal antibodies, but a reliable signal was not obtained in any of the cell lines tested (a , a , h , and hep b). nevertheless, expression of ace mrna was observed by rt-qpcr (suppl. fig. a). transfection of a cells with ace cdna resulted in robust expression of ace (fig. a) and an increase in s ectodomain protein binding of ~ fold (fig. b). interestingly, the enhanced binding was hs-dependent, as illustrated by the loss of binding of s protein after hsase-treatment (fig. b). crispr/cas mediated deletion of the b galt gene, which is required for glycosaminoglycan assembly (suppl. fig. s b), also reduced binding of spike protein (fig. b) despite the overexpression of ace (fig. a). to explore the impact of diminished ace expression, we examined spike protein binding to a cells and to two crispr/cas gene targeted clones, c and c , bearing biallelic mutations in ace (suppl. fig. s c). binding of s ectodomain protein was greatly reduced in the ace -/- clones and the residual binding was sensitive to hsases (fig. c).
these findings show that binding of spike protein on cells requires both hs and ace , consistent with the formation of a ternary complex (figs. f-h). assays using purified components provide biochemical insights into binding, but they do not recapitulate the multivalent presentation of the s protein as it occurs on the virion membrane. thus, to extend these studies, pseudotyped vesicular stomatitis virus (vsv) was engineered to express the full-length sars-cov- s protein and gfp or luciferase to monitor infection. vero e cells are commonly used in the study of sars-cov- infection, due to their high susceptibility to infection. spike protein binding to vero cells also depends on cellular hs as binding was sensitive to hsases, heparin and split-glycol heparin (fig. a). interestingly, hsase treatment reduced binding to a lesser extent than the level of reduction observed in a , heparin very potently reduced infection more than ~ -fold at . µg/ml and higher concentrations (fig. g). in contrast, studies of sars-cov- s protein pseudotype virus showed that hsase-treatment actually increased sars-cov- infection by more than -fold, suggesting that hs might interfere with binding of sars-cov- in this cell line (fig. h). infection of h and a cells by sars-cov- s pseudotype virus was too low to obtain accurate measurements, but infection of hep b cells could be readily measured (fig. i). hsase and mutations in ext and ndst dramatically reduced infection - to -fold. inactivation of the -o-sulfotransferases had only a mild effect, unlike its strong effect on s protein binding (fig. f), possibly due to the high valency conferred by multiple copies of s protein on the pseudovirus envelope. hep b cells were not susceptible to infection by sars-cov- s protein pseudotyped virus, but were infected by mers-cov s protein pseudotyped virus, and infection was independent of hs (suppl. fig. s ).
studies of pseudovirus were then extended to authentic sars-cov- virus infection using strain usa-wa / . infection of vero e cells was monitored by double staining of the cells with antibodies against the sars-cov- nucleocapsid (n) and s proteins (heparin inhibition: maroon and blue symbols). to rule out that the treatments caused a decrease in ace expression or a reduction in cell viability, vero cells were treated with heparin lyases and µg/ml ufh, and ace expression was measured by western blotting and cell viability by celltiter-blue® (suppl. fig. s a-b). no effect on ace expression or cell viability was observed. these findings further emphasize the potential for using unfractionated heparin or other non-anticoagulant heparinoids to prevent viral attachment. these findings were then extended to hep b cells and mutants altered in hs biosynthesis using a viral plaque assay. virus was added to wildtype, ndst -/- and hs st / -/- cells for hr, the virus was removed, and after days incubation a serial dilution of the conditioned culture medium was added to monolayers of vero e cells. the number of plaques was then quantitated by staining and visualization. as a control, culture medium from infected vero e cells was tested, which showed robust viral titers. hep b cells also supported viral replication, but to a lesser extent than vero cells. inactivation of ndst in hep b cells abolished virus production, whereas inactivation of hs st / reduced infection more mildly, ~ -fold (fig. d). hsase and ufh reduced infection more than -fold, but had no effect on cell viability (suppl. in this report, we provide compelling evidence that hs is a necessary host attachment factor that promotes sars-cov- infection of various target cells.
the receptor binding domain of the sars-cov- s protein binds to heparin/hs, most likely through a docking site composed of positively charged amino acid residues aligned in a subdomain of the rbd that is separate from the site involved in ace binding (fig. ). competition studies, enzymatic removal of hs, and genetic studies confirm that the s protein, whether presented as a recombinant protein (figs. - ), in a pseudovirus (fig. ), or in authentic sars-cov- virions (fig. ), binds to cell surface hs in a cooperative manner with ace receptors. mechanistically, binding of heparin/hs to spike trimers enhances binding to ace , likely increasing multivalent interactions with the target cell. these data provide crucial insights into the pathogenic mechanism of sars-cov- infection and suggest hs-spike protein complexes as a novel therapeutic target to prevent infection. the glycocalyx is the first point of contact for all pathogens that infect animal cells, and thus it is not surprising that many viruses exploit glycans, such as hs, as attachment factors. for example, the initial interaction of herpes simplex virus with cells involves binding to hs chains on one or more hs proteoglycans (shieh et al., ; wudunn and spear, ) through interactions with the viral glycoproteins gb and gc. viral entry requires the interaction of a specific structure in hs with a third viral glycoprotein, gd (shukla et al., ), working in concert with membrane proteins related to tnf/ngf receptors (montgomery et al., ). similarly, the human immunodeficiency virus binds to hs by way of the v loop of the viral glycoprotein gp (roderiquez et al., ), but infection requires the chemokine receptor ccr (deng et al., ; dragic et al., ). other coronaviruses also utilize hs; for example, nl (hcov-nl ) binds hs via the viral s protein in addition to ace (lang et al., ; milewska et al., ; milewska et al., ; naskalska et al., ).
in these examples, initial tethering of virions to the host cell plasma membrane appears to be mediated by hs, but infection requires transfer to a proteinaceous receptor. the data presented here show that sars-cov- requires hs in addition to ace . we imagine a model in which cell surface hs acts as a "collector" of the virus and a mediator of the rbd-ace interaction, making viral infection more efficient. hs varies in structure across cell types and tissues, as well as with gender and age (de agostini et al., ; feyzi et al., ; ledin et al., ; vongchan et al., ; warda et al., ; wei et al., ). variation in competition by hs from different tissues supports this conclusion and raises the possibility that hs contributes to the tissue tropism and the susceptibility of different patient populations, in addition to levels of expression of ace . coronaviruses can utilize a diverse set of glycoconjugates as attachment factors. human coronavirus oc (hcov-oc ) and bovine coronavirus (bcov) bind to -n-acetyl- -o-acetylneuraminic acid (hulswit et al., ; tortorici et al., ), middle east respiratory syndrome virus (mers-cov) binds -n-acetyl-neuraminic acid (park et al., ), and guinea fowl coronavirus binds biantennary di-n-acetyllactosamine or sialic acid capped glycans (bouwman et al., ). whether sars-cov- s protein binds to sialic acid remains unclear. mapping the binding site for sialic acids in other coronavirus s proteins has proved elusive, but modeling studies suggest a location distinct from the hs binding site shown in fig. (park et al., ; tortorici et al., ). the s protein in murine coronavirus contains both a hemagglutinin domain for binding and an esterase domain that cleaves sialic acids, which aids in the liberation of bound virions (rinninger et al., ; smits et al., ).
whether sars-cov- s protein, another viral envelope protein, or a host protein contributes hs-degrading activity to aid in the release of newly made virions is unknown. the repertoire of proteins in organisms that bind to hs makes up the so-called "hs interactome" and consists of a variety of different hs-binding proteins (hsbps) (xu and esko, ). unlike lectins that have a common fold that helps define the glycan binding site, hsbps do not exhibit a conserved motif that allows accurate predictions of binding sites based on primary sequence. instead, the capacity to bind heparin appears to have emerged through convergent evolution by juxtaposition of several positively charged amino acid residues arranged to accommodate the negatively charged sulfate and carboxyl groups present in the polysaccharide, and hydrophobic and h-bonding interactions stabilize the association. the rbd domains from the sars-cov- and sars-cov- s proteins are highly similar in structure (fig. g), but the electropositive surface in sars-cov- s rbd is not as pronounced in sars-cov- s rbd (fig. h). in accordance with this observation, recombinant rbd protein from sars-cov- showed significantly higher binding to heparin-bsa, compared to rbd from sars-cov- (fig. b). a priori we predicted that the evolution of the hs binding site in the sars-cov- s protein might have occurred by the addition of arginine and lysine residues to its ancestor, sars-cov- . instead, we observed that four of the six predicted positively charged residues that make up the heparin-binding site are present in sars-cov- as well as most of the other amino acid residues predicted to interact with heparin (fig. ). sars-cov- has been shown to interact with cellular hs in addition to its entry receptors ace and transmembrane protease, serine (tmprss ) (lang et al., ).
our analysis suggests that the putative heparin-binding site in sars-cov- s may mediate an enhanced interaction with heparin compared to sars-cov- , and that this change evolved through as few as two amino acid substitutions, thr lys and glu asn. further studies are underway to define the amino acid residues in the combining site for heparin/hs to test this hypothesis. the ability of heparin and hs to compete for binding of the sars-cov- s protein to cell surface hs and the inhibitory activity of heparin towards infection of pseudovirus and authentic sars-cov- illustrate the therapeutic potential of agents that target the virus-hs interaction to control infection and transmission of sars-cov- . there is precedent for targeting protein-glycan interactions as therapeutic agents. for example, tamiflu targets influenza neuraminidase, thus reducing viral transmission, and sialylated human milk oligosaccharides can block sialic acid-dependent rotavirus attachment and subsequent infection in infants (hester et al., ; von itzstein, ). covid- patients typically suffer from thrombotic complications ranging from vascular micro-thromboses to venous thromboembolic disease and stroke, and often receive unfractionated heparin or low molecular weight heparin (thachil, ). the findings presented here and elsewhere suggest that both of these agents can block viral infection (courtney mycroft-west, ; kim et al., ; liu et al., ; mycroft-west et al., ; tandon et al., ; wu et al., ). effective anticoagulation is achieved with plasma levels of heparin of . - . units/ml. this concentration is equivalent to . - µg/ml heparin (assuming that the activity of ufh is units/mg). this concentration is sufficient to block spike protein binding to cells (fig. ); it would not be expected to prevent viral infection, but it should attenuate infection depending on the viral load (fig. ).
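The comparison between anticoagulant plasma levels and antiviral concentrations rests on a unit conversion: heparin activity in units/ml divided by the specific activity in units/mg gives mg/ml, which is then scaled to µg/ml. A minimal sketch of that conversion follows. The numerical values in the example are placeholders chosen for easy arithmetic, not the figures used in the text, and the function name is hypothetical.

```python
def heparin_units_to_ug_per_ml(units_per_ml, specific_activity_units_per_mg):
    """Convert a heparin activity concentration to a mass concentration.

    units/ml divided by units/mg gives mg/ml; multiplying by 1000
    converts mg/ml to µg/ml.
    """
    if specific_activity_units_per_mg <= 0:
        raise ValueError("specific activity must be positive")
    mg_per_ml = units_per_ml / specific_activity_units_per_mg
    return mg_per_ml * 1000.0


# Placeholder values: 0.5 units/ml at an assumed 200 units/mg.
example_ug_per_ml = heparin_units_to_ug_per_ml(0.5, 200.0)
```

Because the result scales inversely with the assumed specific activity, any comparison with the inhibitory concentrations measured on cells is only as good as that assumption for the particular heparin preparation.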
the anticoagulant activity of heparin, which is typically absent in hs, is not critical for its antiviral activity based on the observation that mst derived heparin and split-glycol heparin are nearly as potent as therapeutic heparin (figs. and ). additional studies are needed to address the potential overlap in the dose response profiles for heparin as an anticoagulant and antiviral agent and the utility of non-anticoagulant heparins. antibodies directed to heparan sulfate or the binding site in the rbd might also prove useful for attenuating infection. in conclusion, this work revealed hs as a novel attachment factor for sars-cov- and suggests the possibility of using hs mimetics, hs degrading lyases, and metabolic inhibitors of hs biosynthesis for the development of therapy to combat covid- . further information and request for resources should be directed to the lead contact, thomas mandel clausen (tmandelclausen@health.ucsd.edu). all developed sars-cov- expression plasmids produced in this study can be made available upon request to the lead contact. this study did not generate any unique datasets or code. cell lines nci-h , a , hep b, a and vero e cells were from the american type culture collection (atcc). nci-h and a cells were grown in rpmi medium, whereas the other lines were grown in dmem. hep b cells carrying mutations in hs biosynthetic enzymes were previously derived from the parent hep b line as described (anower et al., ). all cell media were supplemented with % (v/v) fbs, iu/ml of penicillin and µg/ml of streptomycin sulfate, and the cells were grown under an atmosphere of % co and % air. cells were passaged at ~ % confluence and seeded as explained for the individual assays. protein was produced in expicho or hek - e cells that were acquired from thermo fisher and grown according to the manufacturer's specifications. human bronchial epithelial cells were acquired from lonza.
they were cultured in pneumacult-ex plus medium or pneumacult-ali medium according to the manufacturer's instructions (stemcell technologies). specific details on the culture methods are described in the methods section. the collection of human tissue in this study abided by the helsinki principles and the an electrostatic potential map of the sars-cov- spike protein rbd domain was generated from a crystal structure (pdb: m ) and visualized using pymol (version . . by schrödinger). a dp fully sulfated heparin fragment was docked to the sars-cov- spike protein rbd using the cluspro protein docking server (https://cluspro.org/login.php) (kozakov et al., ; kozakov et al., ; vajda et al., ). heparin-protein contacts and energy contributions were evaluated using the molecular operating environment (moe) software (chemical computing group). recombinant sars-cov- spike protein, encoding residues - (wuhan-hu- ; genbank: mn . ) with proline substitutions at amino acid positions and , a "gsas" substitution at the furin cleavage site (amino acids - ), twinstreptag and his x , was produced in expicho cells by transfection of x cells/ml at ºc with . µg/ml of plasmid dna using the expicho expression system transfection kit in expicho expression medium (thermofisher). one day later the cells were refed, then incubated at ºc for days. the conditioned medium was mixed with complete edta-free protease inhibitor (roche). samples of the recombinant trimeric spike protein ectodomain were diluted to . mg/ml in x tbs ph . . carbon coated copper mesh grids were glow discharged and µl of the diluted sample was placed on a grid for sec then blotted off. uniform stain was achieved by depositing µl of uranyl formate ( %) on the grid for sec and then blotting it off. grids were transferred to a thermo fisher morgagni operating at kv. images at , magnification were acquired using a megaview k camera via the radius software.
a dataset of micrographs at , x magnification and - . µm defocus was collected on a fei tecnai spirit ( kev) with a fei eagle k by k ccd camera. the pixel size was . Å per pixel and the dose was e − /Å . the leginon (suloway et al., ) software was used to automate the data collection and the raw micrographs were stored in the appion (lander et al., ) database. particles on the micrographs were picked using dogpicker , stacked with a box size of pixels, and d classified with relion . (scheres, ). secreted human ace was transiently produced in suspension hek - e cells. a plasmid encoding residues − of ace with a c-terminal hrv- c protease cleavage site, a twinstreptag and a his x tag was a gift from jason s. mclellan, university of texas at austin. briefly, ml of hek - e cells were seeded at a cell density of . × cells/ml hr before transfection with polyethyleneimine (pei). for transfection, µg of the ace plasmid and µg of pei ( : ratio) were incubated for min at room temperature. transfected cells were cultured for hr and fed with ml fresh media for an additional hr before harvest. secreted ace was purified from culture medium by ni-nta affinity chromatography (qiagen). filtered media was mixed : (v/v) in x binding buffer ( mm tris-hcl, ph , , , m nacl) and loaded on to a self-packed column, pre-equilibrated with washing buffer ( mm tris-hcl, ph , . m nacl, mm imidazole). bound protein was washed with buffer and eluted with . m imidazole in washing buffer. the protein-containing fractions were identified by sds-page. sars-cov- spike protein in dpbs was applied to a -ml hitrap heparin-sepharose column (ge healthcare). the column was washed with ml of dpbs and bound protein was eluted with a gradient of nacl from mm to m in dpbs. for binding studies, recombinant spike protein and ace were conjugated with ez-link tm sulfo-nhs-biotin ( : molar ratio; thermo fisher) in dulbecco's pbs at room temperature for min. glycine ( .
m) was added to quench the reaction and the buffer was exchanged for pbs using a zeba spin column (thermo fisher). heparin ( and incubated with s protein ( nm). ace binding to bound spike protein was measured as described above. mixtures of stabilized (mut ) spike protein, x molar excess soluble ace ectodomain, with or without x molar excess of an icosasaccharide (dp ) fragment derived from heparin were incubated at °c for min or hr. samples were diluted to . mg/ml with respect to spike protein in x pbs ph . . carbon coated copper mesh grids were glow discharged at ma for s and µl sample was applied for s and blotted off. grids were washed five times in µl x tbs ph . for sec then stained and blotted twice with µl % uranyl formate for sec. grids were imaged with an fei tecnai spirit ( kev) or fei tecnai f ( kev) with an fei eagle ccd ( k) camera. data were collected on the fei tecnai f at , x magnification, - . µm defocus with a pixel size of . Å per pixel. these datasets employed a box size of and comprised to micrographs. data were collected on the fei tecnai spirit as described above. data collection on both microscopes was automated through leginon (suloway et al., ), micrographs were stored in the appion (lander et al., ) database, and particles were picked with dog picker . particles were d classified with relion . (scheres, ). trimeric d classes were selected for iterative d classification with relion . . classifications were performed until d classes demonstrated ace occupancy throughout the relevant threshold-level of the spike protein as visualized using chimerax (goddard et al., ). particle counts of final d classes were obtained with relion . (scheres, ) and the percentages of particles bound to , , , or ace were calculated and visualized in graphpad prism . cells at - % confluence were lifted with pbs containing mm edta (gibco) and fresh human tissue was washed in pbs, frozen, and lyophilized.
the dried tissue was crushed into a fine powder, weighed, and resuspended in pbs containing mg/ml pronase with % ethanol (esko, ). for hs quantification and disaccharide analysis, purified hs was digested with a mixture of heparin lyases i-iii ( mu each) for hr at °c in mm ammonium acetate buffer containing the ace expression plasmid (addgene, plasmid # ) (li et al., ). qpcr mrna was extracted from the cells using trizol (invitrogen) and chloroform and purified using the rneasy kit (qiagen). cdna was synthesized from the mrna using random primers and the superscript iii first-strand synthesis system (invitrogen). sybr green master mix (applied biosystems) was used for qpcr following the manufacturer's instructions, and the expression of tbp was used to normalize the expression of ace between the samples. the qpcr primers used were as follows: ace (human) forward: ' -cgaagccgaagacctgttcta - ' and reverse: ' -gggcaagtgtggactgttcc - '; and tbp (human) forward: ' -aacttcgcttccgctggccc - ' and reverse: ' -gaggggaggccaagccctga - '. to generate the cas lentiviral expression plasmid, . x hek t cells were seeded to a -cm diameter plate in dmem supplemented with % fbs. the following day, the cells were co-transfected with the pspax packaging plasmid (addgene, plasmid # ), pmd .g envelope plasmid (addgene, plasmid # ), and lenti-cas plasmid (addgene, plasmid # ) (sanjana et al., ) in dmem supplemented with fugene ( µl in µl dmem). media containing the lentivirus was collected and used to infect a wt and a wt cells, which were subsequently cultured with µg/ml and µg/ml blasticidin, respectively, to select for stably transduced cells. a single guide rna (sgrna) targeting ace ( '-tggatacatttgggcaagtg - ') and one targeting b galt ( '-tgacctgctccctctcaacg- ') were cloned into the lentiguide-puro plasmid (addgene plasmid # ) following a published procedure (sanjana et al., ).
the lentiviral sgrna construct was generated in hek t cells, using the same protocol as for the cas expression plasmid, and used to infect a -cas and a -cas cells to generate crispr knockout mutant cell lines. after infection, the cells were cultured with µg/ml puromycin to select for cells with stably integrated lentivirus. after d, the cells were serially diluted into -well plates. single colonies were expanded and dna was extracted using the dneasy blood and tissue dna isolation kit (qiagen). proper editing was verified by sequencing (genewiz inc.) and gene analysis using the online ice tool from synthego (suppl. fig. ). vesicular stomatitis virus (vsv) particles pseudotyped with spike proteins of sars-cov- were generated according to a published protocol (whitt, ). briefly, hek t cells, transfected to express full length sars-cov- spike proteins, were inoculated with vsv-g pseudotyped ∆g-luciferase or gfp vsv (kerafast, ma). after hr at °c, the inoculum was removed and cells were refed with dmem supplemented with % fbs, u/ml penicillin, µg/ml streptomycin, and vsv-g antibody (i , mouse hybridoma supernatant from crl- ; atcc). pseudotyped particles were collected hr post-inoculation, centrifuged at , × g to remove cell debris and stored at − °c until use. briefly, µl of luciferin lysis solution was added to the cells and incubated for min at room temperature. the solution was transferred to a black -well plate and luminescence was detected using an enspire multimodal plate reader (perkin elmer). data analysis and statistical analysis were performed in prism . fluor labeling kits (invitrogen), respectively. zombie uv™ was used to gate for live cells in the analysis. cells were then analyzed using an ma cell sorter (sony). for days. fresh medium, µl in the apical chamber and µl in the basal chamber, was added daily. at day , the medium in the apical chambers was removed, and the medium in the basal chambers was changed every - days, with apical washes with pbs every week for days.
the apical side of the hbec ali culture was gently washed three times with µl of phosphate buffered saline without divalent cations (pbs-/-). heparinase was added to the apical side for half an hour prior to infection. an moi of . of authentic sars-cov- live virus (usa-wa / (bei resources, #nr- )) in µl total volume of pbs was added to the apical chamber with either dmso, heparinase ( . mu/ml heparin lyase ii, and mu/ml heparin lyase iii (ibex)) or µg/ml of unfractionated heparin. cells were incubated at °c and % co for hours. unbound virus was removed, the apical surface was washed and the compounds were re-added to the apical chamber. cells were incubated for another hours at °c and % co . after inoculation, cells were washed once with pbs-/- and µl tryple (thermofisher) was added to the apical chamber then incubated for min in the incubator. cells were gently pipetted up and down and transferred into a sterile ml conical tube containing neutralizing medium of dmem + % fbs. tryple was added again for rounds of minutes for a total of min to clear the transwell membrane. cells were spun down and resuspended in pbs with zombie uv viability dye for min at room temperature. cells were washed once with facs buffer then fixed in % pfa for min at room temperature. pfa was washed off and cells were resuspended in pbs. zombie uv™ was used to gate for live cells in the analysis. infection was analyzed by flow cytometry as explained above. cell viability was assessed using the celltiter-blue® assay (promega). briefly, vero cells were seeded into a well plate. the cells were treated with hsase mix ( . mu/ml hsase ii, and mu/ml hsase iii; ibex) or µg/ml ufh for hrs. the viability of the cells was measured using celltiter-blue® according to the manufacturer's protocol. briefly, the celltiter-blue® reagent was added directly to the cell culture and the cells were incubated overnight.
fluorescence was read at excitation nm and emission nm, using an enspire multimodal plate reader (perkin elmer). data analysis was performed in prism. the human bronchial epithelial cells were grown at an air-liquid interface as explained above. cell viability after treatment with hsase mix ( . mu/ml hsase ii, and mu/ml hsase iii; ibex) or µg/ml ufh for hrs was measured by adding celltiter-blue® reagent directly to the transwell inserts and developed as explained above. all statistical analyses were performed in prism (graphpad). all experiments were performed in triplicate and repeated as indicated in the figure legends. data were analyzed statistically using unpaired t-tests when two groups were being compared or by one-way anova without post-hoc correction for multiple comparisons. ic values and confidence intervals were determined by non-linear regression using the inhibitor vs. response least squares fit algorithm. the error bars in the figures refer to mean plus standard deviation (sd) values. the specific statistical tests used are listed in the figure legends and in the methods section. experiments were evaluated by statistical significance according to the following scheme: ns: p > . , *: p ≤ . , **: p ≤ . , ***: p ≤ . , ****: p ≤ . . after hr, cell culture supernatants were collected and stored at - °c. virus titers were determined by plaque assays on vero e monolayers greiner bio-one, # ) and rocked for hr at room temperature. the cells were subsequently overlaid with mem containing % cellulose the plaques were visualized by fixation of the cells with a mixture of % formaldehyde and % methanol (v/v in water) for hr. the monolayer was washed once with pbs and stained with . % crystal violet (millipore sigma # v ) prepared in % ethanol the pennsylvania state university, following the guidelines approved by the institutional biosafety committees.
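the inhibitor vs. response least squares fit mentioned in the statistical analysis can be illustrated with a minimal sketch. it assumes a three-parameter curve with a standard slope, y = bottom + (top - bottom) / (1 + x / ic), which is one common form of this model; the text does not state which variant was used. the function names, the grid scan over candidate inhibitory concentrations, and the synthetic data are all assumptions introduced here. this is a stand-in for, not a reproduction of, the fitting routine in prism, and it does not compute confidence intervals.

```python
import math


def inhibitor_response(x, top, bottom, ic50):
    """Three-parameter inhibitor vs. response curve with a standard slope."""
    return bottom + (top - bottom) / (1.0 + x / ic50)


def _fit_plateaus(xs, ys, ic50):
    """For a fixed IC50 the model is linear in (bottom, top - bottom),
    so the two plateaus follow from ordinary least squares."""
    fs = [1.0 / (1.0 + x / ic50) for x in xs]
    n = len(xs)
    mean_f = sum(fs) / n
    mean_y = sum(ys) / n
    sff = sum((f - mean_f) ** 2 for f in fs)
    sfy = sum((f - mean_f) * (y - mean_y) for f, y in zip(fs, ys))
    span = sfy / sff                      # estimate of top - bottom
    bottom = mean_y - span * mean_f
    sse = sum((y - (bottom + span * f)) ** 2 for f, y in zip(fs, ys))
    return bottom + span, bottom, sse


def fit_ic50(xs, ys, steps=700):
    """Scan IC50 over a log-spaced grid and keep the candidate with the
    smallest sum of squared residuals."""
    positive = [x for x in xs if x > 0]
    lo = math.log10(min(positive)) - 1.0
    hi = math.log10(max(positive)) + 1.0
    best = None
    for i in range(steps + 1):
        ic50 = 10.0 ** (lo + (hi - lo) * i / steps)
        top, bottom, sse = _fit_plateaus(xs, ys, ic50)
        if best is None or sse < best[3]:
            best = (top, bottom, ic50, sse)
    return best[0], best[1], best[2]


# Synthetic, noise-free data generated from made-up parameters
# (top=100, bottom=5, IC50=2); these are NOT measurements from the study.
doses = [0.0, 0.01, 0.1, 1.0, 10.0, 100.0, 1000.0]
signal = [inhibitor_response(d, 100.0, 5.0, 2.0) for d in doses]
top, bottom, ic50 = fit_ic50(doses, signal)
print(round(top, 1), round(bottom, 1), round(ic50, 2))
```

the grid resolution limits how precisely the inhibitory concentration is recovered; an iterative optimizer would refine the estimate further and would be needed to derive confidence intervals.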
human bronchial epithelial cell air-liquid interface generation and infection human bronchial epithelial cells (hbecs, lonza) were cultured in t flasks in plus medium according to manufacturer instructions (stemcell technologies) to generate air-liquid interface (ali) cultures, hbecs were plated on collagen i-coated well transwell inserts with a . -micron pore size (costar, corning) at x cells/ml. cells were maintained for - days in pneumacult-ex plus medium until confluence, then changed to pneumacult-ali medium triglyceride-rich lipoprotein binding and uptake by heparan sulfate proteoglycan receptors in a crispr/cas library of hep b mutants remdesivir for the treatment of covid- -preliminary report guinea fowl coronavirus diversity has phenotypic consequences for glycan and tissue binding heparan sulfate proteoglycans and viral attachment: true receptors or adaptation bias? viruses undersulfated and glycol-split heparins endowed with antiangiogenic activity the coronavirus (sars-cov- ) surface protein (spike) s receptor binding domain undergoes conformational change upon heparin binding identification of a major co-receptor for primary isolates of hiv- hiv- entry into cd + cells is mediated by the chemokine receptor cc-ckr- special considerations for proteoglycans and glycosaminoglycans and their purification order out of chaos: assembly of ligand binding sites in heparan sulfate age-dependent modulation of heparan sulfate structure and function bioengineering murine mastocytoma cells to produce anticoagulant heparin ucsf chimerax: meeting modern challenges in visualization and analysis structural analysis of urinary glycosaminoglycans from healthy human subjects human milk oligosaccharides inhibit rotavirus infectivity in vitro and in acutely infected piglets human coronaviruses oc and hku bind to -o-acetylated sialic acids via a conserved receptor-binding site in spike protein domain a loss of bcl- -expressing t follicular helper cells and germinal centers in 
covid- stabilized coronavirus spikes are resistant to conformational changes induced by receptor recognition or proteolysis initial step of virus entry: virion binding to cell-surface glycans how good is automated protein docking? the cluspro web server for protein-protein docking appion: an integrated, database-driven pipeline to facilitate em image processing inhibition of sars pseudovirus cell entry by lactoferrin binding to heparan sulfate proteoglycans evolutionary differences in glycosaminoglycan fine structure detected by quantitative glycan reductive isotope labeling heparan sulfate structure in mice with genetically modified heparan sulfate production assessing ace expression patterns in lung tissues in the pathogenesis of covid- angiotensin-converting enzyme is a functional receptor for the sars coronavirus proteoglycans and sulfated glycosaminoglycans sars-cov- spike protein binds heparan sulfate in a length-and sequence-dependent manner entry of human coronavirus nl into the cell human coronavirus nl utilizes heparan sulfate proteoglycans for attachment to target cells stable heparin-producing cell lines derived from the furth murine mastocytoma herpes simplex virus- entry into cells mediated by a novel member of the tnf/ngf receptor family heparin inhibits cellular invasion by sars-cov- : structural dependence of the interaction of the surface protein (spike) s receptor binding domain with heparin membrane protein of human coronavirus nl is responsible for interaction with the adhesion receptor structures of mers-cov spike glycoprotein in complex with sialoside attachment receptors localisation and distribution of o-acetylated n-acetylneuraminic acids, the endogenous substrates of the hemagglutinin-esterases of murine coronaviruses, in mouse tissue mediation of human immunodeficiency virus type binding by interaction of cell surface heparan sulfate proteoglycans with the v region of envelope gp -gp improved vectors and genome-wide libraries for crispr 
screening relion: implementation of a bayesian approach to cryo-em structure determination cell surface receptors for herpes simplex virus are heparan sulfate proteoglycans a novel role for -o-sulfated heparan sulfate in herpes simplex virus entry nidovirus sialate-o-acetylesterases: evolution and substrate specificity of coronaviral and toroviral receptor-destroying enzymes the sweet spot: defining virus-sialic acid interactions automated molecular microscopy: the new leginon system effective inhibition of sars-cov- entry by heparin and enoxaparin derivatives. biorxiv the versatile heparin in covid- structural basis for human coronavirus attachment to sialic acid receptors the war against influenza: discovery and development of sialidase inhibitors structural characterization of human liver heparan sulfate dog picker and tiltpicker: software tools to facilitate particle selection in single particle electron microscopy function, and antigenicity of the sars-cov- spike glycoprotein isolation and characterization of heparan sulfate from various murine tissues site-specific glycan analysis of the sars-cov- spike a comprehensive compositional analysis of heparin/heparan sulfate-derived disaccharides from human serum generation of vsv pseudotypes using recombinant deltag-vsv for studies on virus entry, identification of entry inhibitors, and immune responses to vaccines cryo-em structure of the -ncov spike in the prefusion conformation vaccines and therapies in development for sars-cov- infections initial interaction of herpes simplex virus with cells is binding to heparan sulfate demystifying heparan sulfate-protein interactions structural basis for the recognition of sars-cov- by full-length human ace cov- spike protein interacts with heparan sulfate and ace through the rbd • heparan sulfate promotes spike-ace interaction • sars-cov- infection is co-dependent on heparan sulfate and ace • heparin and non-anticoagulant derivatives block sars-cov- binding and infection 
in brief provide evidence that heparin sulfate is a necessary co-factor for sars-cov- infection. they show that heparin sulfate interacts with the receptor binding domain of the sars-cov- spike glycoprotein we thank scott selleck (the pennsylvania state university), eugene yeo (uc san diego), john guatelli (uc san diego), mark fuster (uc san diego) and stephen schoenberger (la jolla institute for immunology) for many helpful discussions, and annamaria naggi and giangiacomo torri from the ronzoni institute for generously providing split-glycol heparin. this key: cord- -yjr ef authors: hotez, peter j.; bottazzi, maria elena title: developing a low-cost and accessible covid- vaccine for global health date: - - journal: plos negl trop dis doi: . /journal.pntd. sha: doc_id: cord_uid: yjr ef nan there is an urgent need to advance safe and affordable covid- vaccines for low-and middle-income countries of asia, africa, and latin america. such vaccines rely on proven technologies such as recombinant protein-based vaccines to facilitate its transfer for emerging market vaccine manufacturers. our group is developing a two-pronged approach to advance recombinant protein-based vaccines to prevent covid- caused by sars-cov- and other coronavirus infections. one vaccine is based on a yeast-derived (pichia pastoris) recombinant protein comprised of the receptor-binding domain (rbd) of the sars-cov formulated on alum and referred to as the cov rbd -n vaccine. potentially, this vaccine could be used as a heterologous vaccine against covid- . a second vaccine specific for covid- is also being advanced using the corresponding rbd of sars-cov- . the first antigen has already undergone current good manufacturing practices (cgmp) manufacture and is therefore "shovel ready" for advancing into clinical trials, following vialing and required good laboratory practice (glp) toxicology testing. 
evidence for its potential efficacy to cross-protect against sars-cov- includes cross-neutralization and binding studies using polyclonal and monoclonal antibodies. evidence in support of its safety profile includes our internal assessments in a mouse challenge model using a lethal mouse-adapted sars strain, which show that sars-cov rbd -n (when adsorbed to aluminum hydroxide) does not elicit eosinophilic lung pathology. together, these findings suggest that recombinant protein-based vaccines based on the rbd warrant further development to prevent sars, covid- , or other coronaviruses of pandemic potential. "the thing we have to think about now that's different is, how do we produce vaccines specifically for the developing world if this is a truly global epidemic."-seth berkley, ceo, gavi as of june , covid- caused by the sars-cov- coronavirus has infected more than million people globally (confirmed cases) and caused almost , deaths [ ]. although the epidemic began in china, europe, and the united states, there are significant concerns about the risks of disease emergence in low- and middle-income nations. there are now almost , cases in brazil, , cases in india, and , cases in south africa, such that covid- will become widespread among the poor living in the group of nations [ , ]. moreover, sars-cov- infection is expected to emerge in the global south [ ]. in the african region of the world health organization (who), covid- is now spreading in the populated areas of ghana, nigeria, and democratic republic of congo, and presumably across the region [ ]. in nations such as india, for example, the feasibility of enforcing social distancing in large and crowded urban centers will be particularly daunting [ ], so that ensuring access to a safe and affordable covid- vaccine will become a global priority. dr.
seth berkley, the ceo of gavi, the vaccine alliance, has highlighted the importance of prioritizing a covid- vaccine specifically for these countries [ ]. at least a dozen covid- candidate vaccines are under development using different technology platforms [ ], with an emphasis on speed, maximizing safety, and avoiding vaccine-induced immunopathology [ ]. many of these will enlist cutting-edge nucleic acid delivery technologies and other innovative approaches. in the meantime, there is urgency to address and rapidly respond to gavi's charge and pursue safe, low-cost, easily administered, and rapidly scalable approaches. for instance, texas children's center for vaccine development (cvd) at baylor college of medicine, in collaboration with its nonprofit product development partners, seattle-based path and infectious disease research institute (idri), has been spearheading a coronavirus vaccine program focusing on recombinant subunit protein vaccines produced in a globally available microbial fermentation platform, and optimized to maximize yield following expression and protein purification [ , ]. towards this goal, we are now also developing the sars-cov- rbd recombinant protein as a potential vaccine candidate, in parallel with the existing cov rbd -n candidate vaccine, which was previously developed and manufactured under cgmp in [ ] [ ] [ ] [ ]. the bulk drug substance has been stored frozen (− ˚c to ˚c) and remains stable through ongoing testing. furthermore, an independent quality assessment confirmed the suitability of the material through phase clinical trials. both rbd vaccine candidates have potential as vaccine antigens to prevent sars-cov- infection and/or covid- . overall, our initial approach relies on advancing the already manufactured cov rbd -n as a heterologous recombinant subunit vaccine to protect against both sars and covid- [ ], and in parallel accelerating the advancement of the sars-cov- rbd candidate as a homologous covid- vaccine (fig ).
our preliminary studies now show that the sars-cov- rbd candidate, which is specific for the sequence of the sars-cov- , can also be highly produced in the yeast p. pastoris. both approaches reinforce each other, as the processes developed for the cov rbd -n candidate also apply to the sars-cov- candidate, and both antigens downstream could be further developed as potentially a bivalent or a universal coronavirus vaccine. the sars-cov protein known as cov rbd -n was selected on its ability to elicit high titers of neutralizing antibodies against both sars-cov pseudotype virus and live sars-cov virus [ , ] , prior to confirmatory testing against sars-cov challenge in animal models. it also induced high-level neutralizing antibodies and protective immunity with minimal immunopathology in mice after a homologous virus challenge with sars-cov (ma strain) [ , ] . there are several advantages of the cov rbd candidate antigens and vaccines for purposes of global health: . high yield and low cost. the antigens are expressed in p. pastoris, a low-cost expression platform, which can be produced and scaled at high yields [ , ] . by deleting an n-linked glycosylated asparagine at the n- position of rbd , both the yield and antigenicity improved. at a -liter scale production process, the cov rbd -n antigen was produced through fermentation at mg/l fermentation supernatant (fs) with purification recovery > % [ , ] . a panel of characterization tests indicates that the process is reproducible and robust and that the purified, tag-free rbd -n protein has high purity and a well-defined structure. it is therefore suitable for both pilot scale manufacturing and for transition into process improvements leading to industrial scale manufacturing. . technology transfer. the process is suitable for technology transfer to emerging market vaccine manufacturers (aka dcvms, developing country vaccine manufacturers) having expertise in fermentation technology (https://www.dcvmn.org/) [ ] . 
the p. pastoris-derived recombinant protein is currently produced by several dcvms, including those in bangladesh, brazil, cuba, india, and indonesia. . shovel ready. the cov rbd -n antigen was manufactured under cgmp and can be vialed to produce between , and , doses, with the possibility of transferring production processes and cell banks to dcvms for large-scale production sufficient to meet global needs. beyond low cost and ease of potential technology transfer to dcvms, an advantage of employing a recombinant protein subunit vaccine is the long-standing safety record of this class of vaccines, and the fact that this technology has been used for the licensure of two other antiviral vaccines, hepatitis b and human papillomavirus, as well as biologics (e.g., insulin) [ ]. in addition to their low cost and suitability for use in public immunization programs in low- and middle-income countries, we pursued rbd recombinant protein-based vaccines as a technology to maximize safety relative to other platforms, such as virus vectors that have previously been found to induce immune enhancement. for instance, immune enhancement in children following a formalin-inactivated respiratory syncytial virus (rsv) vaccine was first reported in the s and later shown to occur in laboratory animals with early prototype sars-cov vaccines using virus-vectored platforms or inactivated virus constructs [ ]. we have recently summarized the major safety concerns of some prototype coronavirus vaccines based on studies conducted in laboratory animals (rodents, ferrets, and nonhuman primates) [ ]. they include the following points. some of the earliest sars-cov vaccine candidates used vector-based platforms, and these were associated with immune enhancement or activation.
in - , scientists at the public health agency of canada's national microbiology laboratory in winnipeg, manitoba (who helped to develop the first successful ebola vaccine), found that a recombinant modified vaccinia ankara (rmva) expressing the s-spike protein resulted in severe liver pathology upon sars-cov virus challenge. similarly, rmva expressing the s-spike also resulted in lung immunopathology in rhesus macaques, as did other virus-vectored constructs. lung immunopathology is also linked to whole inactivated viral vaccines. however, it was determined that in many cases eosinophilic pathology is driven by the sars nucleocapsid (n) protein, although a recent trial in nonhuman primates found that an alum-adjuvanted inactivated sars-cov- vaccine did not induce immunopathology [ ]. among the major conclusions of these studies was that the immunopathology may be driven by t helper- (th ) responses linked to interleukin- [ , ], and that aluminum formulations exhibit greatly reduced immunopathology [ ]. given the history of virus-vector platforms and inactivated vaccines in eliciting eosinophilic immunopathology, our emphasis has been on the evaluation of inexpensive recombinant proteins produced in microbial systems. these comprise the cov rbd -n antigen, encoding amino acids - ( aa) of the sars-cov s-spike protein [ ] [ ] [ ] [ ], and now a second, cov rbd antigen, which is also expressed without the n-terminal amino acid. the rationale for selecting the rbd domain of the s protein includes focusing on the key component that binds to the human angiotensin converting enzyme (ace ) receptor, and removing the known elements of the s protein involved in immune enhancement. supporting studies summarized elsewhere emphasize how s protein peptides outside of the rbd can induce immune enhancement in non-human primates [ ]. moreover, cov rbd -n induces high titers of neutralizing antibodies in mice and % infection against sars cov virus challenge [ ].
alum formulations of cov rbd -n do not induce immunopathology [ ] , a finding consistent with other published studies [ ] [ ] [ ] . there is evidence to justify advancing the cov rbd -n antigen as either a homologous vaccine against sars [ ] [ ] [ ] [ ] or as a heterologous vaccine against covid- [ ] . in parallel, a cov rbd protein candidate is being advanced. regarding the former, against sars cov homologous virus challenge the vaccine formulated on alum exhibits high levels of protective immunity and with evidence of minimal or no immune enhancement [ ] . with regards to cross-protection against sars cov , the rbd of the sars-cov- and cov rbd -n share significant similarity of amino acid sequence (> % identity, > % similarity) and there is evidence that both viruses use the human ace receptor for cell entry [ ] . further published studies indicate strong antigenic similarities between the sars-cov and sars-cov- rbds, and the potential for cross protection. for example, serum from a convalescent sars-cov patient was shown to neutralize sars-cov- driven entry [ ] . moreover, new studies by tai and colleagues find that using pseudotyped sars-cov- , the sars-cov rbd blocks the entry of both sars-cov and sars-cov- pseudovirus into human ace -expressing t cells [ ] . through pseudovirus neutralization activity, it was found that sars-cov rbd-specific antisera could neutralize sars-cov- pseudovirus infection, suggesting that sars-cov rbd-specific antibodies can cross-react with sars-cov- rbd and cross neutralize sars-cov- pseudovirus infection [ ] . additional studies find that multiple (but not all) neutralizing monoclonal antibodies bind to both rbds [ , , ]. an international priority is the scale-up and global access of an affordable and safe recombinant vaccine to prevent emerging coronavirus infections, including covid- . our aspirational goal is to protect global populations at risk for this important emerging virus infection. 
a low-cost recombinant protein antigen expressed in p. pastoris and formulated on aluminum or other accessible adjuvants represents a highly accessible technology to transfer to low-and middle-income countries. it represents one of several key mechanisms for ensuring that populations across the major affected nations of africa, asia, and the americas will benefit from covid- vaccinations. world health organization. covid- situation reports poverty and the impact of covid- : the blue-marble health approach millions in india under coronavirus lockdown as major cities restrict daily life will vaccines reach low-income countries during a global pandemic the outbreak of sars-cov- pneumonia calls for viral vaccines don't rush to deploy covid- vaccines and drugs without sufficient safety guarantees optimization of the production process and characterization of the yeast-expressed sars-cov recombinant receptor-binding domain (rbd -n ), a sars vaccine candidate yeast-expressed recombinant protein of the receptor-binding domain in sars-cov spike protein with deglycosylated forms as a sars vaccine candidate potential for developing a sars-cov receptor-binding domain (rbd) recombinant protein as a heterologous human vaccine against coronavirus infectious disease (covid)- yeast-expressed sars-cov recombinant receptor-binding domain (rbd -n ) formulated with alum induces protective immunity and reduces immune enhancement the yeast stands alone: the future of protein biologic production covid- vaccine design: the janus face of immune enhancement rapid development of an inactivated vaccine candidate for sars-cov- the potential role of th immune responses in coronavirus immunopathology and vaccine-induced immune enhancement. microbes and infection covid- vaccines: neutralizing antibodies and the alum advantage pö hlmann s. 
the novel coronavirus ( -ncov) uses the sars coronavirus receptor ace and the cellular protease tmprss for entry into target cells characterization of the receptor-binding domain (rbd) of novel coronavirus: implication for development of rbd protein as a viral attachment inhibitor and vaccine cryo-em structure of the -ncov spike in the prefusion conformation identification of sars-cov rbd-targeting monoclonal antibodies with cross-reactive or neutralizing activity against sars-cov- key: cord- - qle a authors: basit, abdul; ali, tanveer; rehman, shafiq ur title: truncated human angiotensin converting enzyme ; a potential inhibitor of sars-cov- spike glycoprotein and potent covid- therapeutic agent date: - - journal: j biomol struct dyn doi: . / . . sha: doc_id: cord_uid: qle a the current pandemic of covid- caused by sars-cov- is continued to spread globally and no potential drug or vaccine against it is available. spike (s) glycoprotein is the structural protein of sars-cov- located on the envelope surface, involve in interaction with angiotensin converting enzyme (ace ), a cell surface receptor, followed by entry into the host cell. thereby, blocking the s glycoprotein through potential inhibitor may interfere its interaction with ace and impede its entry into the host cell. here, we present a truncated version of human ace (tace ), comprising the n terminus region of the intact ace from amino acid position - , involved in binding with receptor binding domain (rbd) of sars-cov- . we analyzed the in-silico potential of tace to compete with intact ace for binding with rbd. the protein-protein docking and molecular dynamic simulation showed that tace has higher binding affinity for rbd and form more stabilized complex with rbd than the intact ace . furthermore, prediction of tace soluble expression in e. coli makes it a suitable candidate to be targeted for covid- therapeutics. 
these are the first md simulation-based findings to provide a high-affinity protein inhibitor for the sars-cov-2 s glycoprotein, an important target for drug design against this unprecedented challenge. communicated by ramaswamy h. sarma. the rapid spread of sars coronavirus 2 (sars-cov-2) demands an immediate public health response, and no fda-approved treatments or vaccines are currently available. the sars-cov-2 spike (s) protein ( amino acids) is essential for virus entry: it binds the host receptor angiotensin converting enzyme ii (ace2) and mediates virus-host membrane fusion (boopathi et al., ; sarma et al., ). the s protein contains two functional domains (s1 and s2). the s1 domain (residues - ) attaches the virion to the human ace2 receptor on the epithelial cell surface, which is followed by internalization and hence initiates infection (hasan et al., ). this binding induces conformational changes in the s protein that allow the s2 domain (residues - ) to mediate fusion with the cellular membrane. the receptor-binding domain (rbd) of the sars-cov-2 s protein is highly conserved and is directly involved in binding to human ace2 (yuan et al., ). since ace2 has not evolved to recognize the s protein of sars-cov-2, an ace2 alternative with higher binding affinity for the s protein than the wild-type receptor may inhibit entry of sars-cov-2 into human cells. this strategy can play an important role in devising therapeutics against sars-cov-2. several studies have proposed small-compound inhibitors as therapeutic agents for covid-19 (aanouz et al., ; elmezayen et al., ; gupta et al., ; khan et al., ; wahedi et al., ). small-compound drugs may not efficiently block the entire binding patch of the s protein. peptide-based therapeutics, on the other hand, can block the entire binding interface (rbd) of the s protein (wan et al., b), as reported for the hiv peptide-based drug fuzeon (jenny-avital, ; wójcik & berlicki, ).
there is growing interest in peptide-based therapeutics for covid-19 treatment (pant et al., ), and approximately peptide-based drugs have been evaluated in clinical trials (fosgerau & hoffmann, ). peptide-based drugs have few side effects and induce little drug tolerance compared with chemical drugs (bruno et al., ). to block the fusion of the sars-cov-2 s protein with human cells, a recent study reported a neck- and transmembrane-deficient ace2, called soluble ace2 (sace2), that can block the entry of sars-cov-2 into the host cell (procko, ) and has also been found safe in healthy human subjects (haschke et al., ) and in patients with lung disease (khan et al., ). recombinant sace2 is in clinical trials for covid-19 treatment in guangdong province, china (clinicaltrials.gov #nct ). that study proposed that mutations in the ace2 receptor interface may strengthen the s/ace2 interaction. another study proposed a amino acid peptide derived from ace2 (amino acid positions - ) that binds the sars-cov-2 s protein with low nanomolar affinity and can block the attachment of sars-cov-2 to human ace2. because the ace2 residues involved in the interaction with rbd are located at amino acid positions - (wan et al., a; yan et al., ), we hypothesized that a fragment carrying all the binding residues would have better binding affinity for rbd and could hinder the interaction of sars-cov-2 with human ace2, hence blocking viral entry into epithelial cells. we designed a truncated version (tace2) of the ace2 receptor covering the binding residues and performed protein-protein docking and molecular dynamics simulations to analyze its binding affinity for rbd and the stability of the complex. tace2 is expected to compete with wild-type human ace2 receptors for binding to sars-cov-2 because of its higher binding affinity for the s protein.
this should allow sars-cov-2 viral particles to bind strongly to tace2, occupying the binding sites otherwise available to host ace2 receptors and thus inhibiting viral entry into the cell; the bound particles would then be eliminated through the body's defense mechanisms. we further predicted the soluble expression of tace2 in e. coli, a suitable host for its bulk production. the pdb structure of ace2 in complex with the rbd of the sars-cov-2 s glycoprotein (pdb id: m ) was obtained from the pdb database. to determine the variation in the sars-cov-2 s glycoprotein sequence reported from different regions of the globe, s glycoprotein sequences of sars-cov-2, including the reference sequence (nc_ , reported from wuhan, china), were retrieved from ncbi. multiple sequence alignment of the sequences was performed with mega-x, and the aligned sequences were then analyzed for amino acid variations. the pdb structures of ace2 and rbd were repaired for missing loops and optimized by energy minimization and removal of amino acid side-chain clashes with foldx (schymkowitz et al., ). side chains were optimized with foldx to remove van der waals clashes by mutating residues with bad energy values into new rotamers, followed by energy minimization (van durme et al., ). the optimized three-dimensional (3d) structures of ace2 and rbd were used to design the truncated ace2 and to study protein-protein interactions. based on the protein-protein interactions between ace2 and rbd in the ace2-rbd complex (pdb id m ), a truncated version of ace2 was produced by removing the c-terminal residues from amino acid positions - , leaving a truncated n-terminal fragment, tace2, spanning amino acid positions - . the first residues of ace2 form the signal peptide (huang et al., ; turner & hooper, ) and were therefore also removed. the structure of tace2 was built with i-tasser, which constructs the model by assembling continuous fragments from multiple threading templates through replica exchange monte carlo (remc) simulations (yang et al., ).
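the variation analysis of the aligned s glycoprotein sequences described above can be sketched in a few lines of python. this is a minimal illustration and not the mega-x workflow used in the study: the function name `find_variants`, the sequence identifiers, and the toy sequences are hypothetical, and the alignment is assumed to be already computed, with `-` marking gaps.

```python
from typing import Dict, List, Tuple

Variant = Tuple[str, int, str, str]  # (sequence id, reference position, reference residue, observed residue)


def find_variants(reference: str, aligned: Dict[str, str]) -> List[Variant]:
    """List substitutions relative to an aligned reference sequence.

    Positions are 1-based and counted on the ungapped reference.
    Columns where either sequence has a gap are skipped.
    """
    variants: List[Variant] = []
    for seq_id, seq in aligned.items():
        if len(seq) != len(reference):
            raise ValueError(f"{seq_id} is not aligned to the reference")
        ref_pos = 0
        for ref_res, res in zip(reference, seq):
            if ref_res == "-":
                continue  # insertion relative to the reference
            ref_pos += 1
            if res != "-" and res != ref_res:
                variants.append((seq_id, ref_pos, ref_res, res))
    return variants


if __name__ == "__main__":
    # Toy alignment (hypothetical sequences, not real spike data).
    ref = "MKR-LV"
    others = {"isolate_a": "MKR-LV", "isolate_b": "MKI-LV"}
    for seq_id, pos, old, new in find_variants(ref, others):
        print(f"{seq_id}: {old}{pos}{new}")
```

reporting positions on the ungapped reference keeps the residue numbering comparable with the numbering used for the reference sequence.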
to determine the binding affinity of both intact and truncated ace2 for the sars-cov-2 s glycoprotein, the rigid-body protein-protein docking tools zdock (pierce et al., ), cluspro (kozakov et al., ), and patchdock (schneidman-duhovny et al., ), and the flexible protein-protein docking tool haddock (van zundert et al., ), were used. the energy function used by zdock is the z score, which combines a pairwise shape complementarity function with desolvation and electrostatics terms. zdock ranks the top predicted docking poses on the basis of the z score (chen et al., ). cluspro uses the piper scoring function, which contains terms for shape complementarity, electrostatics, and pairwise potentials; it is applied to the top conformations produced, which are then ranked on the basis of cluster size. patchdock uses the patchdock score as its energy function, which ranks the docked models based on desolvation energy, interface area size, and geometric score (zhang et al., ). haddock is a flexible docking method for protein-protein complexes that drives the docking process with information from experimentally identified protein complex interfaces. the haddock scoring function consists of a combination of various energies and the buried surface area, and the models were scored according to the haddock score. all generated docking poses of ace2 and the spike protein were visualized with pymol (schrodinger, ). based on the haddock score and the docking rmsd value, the docked complexes of ace2 and tace2 with rbd were analyzed for binding affinity, ΔG (kcal mol⁻¹), and stability using the protein binding energy prediction (prodigy) server (xue et al., ). the server predicts the binding affinity and stability on the basis of structural properties of the protein-protein complexes. stability of the protein-protein complex is measured through the dissociation constant kd (m). the run was performed at different temperatures ranging from to °c.
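the predicted binding free energy and the dissociation constant are linked by the standard thermodynamic relation ΔG = RT ln(kd), which is why a temperature has to be specified whenever a kd is reported. the following is a minimal python sketch of that conversion, assuming ΔG in kcal mol⁻¹ and temperature in °c; the function names and the example value of −10 kcal mol⁻¹ are illustrative and are not taken from the study or from the prodigy code.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal mol^-1 K^-1


def dg_to_kd(delta_g_kcal: float, temp_celsius: float) -> float:
    """Dissociation constant (M) from a binding free energy, via dG = RT ln(Kd)."""
    temp_kelvin = temp_celsius + 273.15
    return math.exp(delta_g_kcal / (R_KCAL * temp_kelvin))


def kd_to_dg(kd_molar: float, temp_celsius: float) -> float:
    """Binding free energy (kcal/mol) from a dissociation constant in M."""
    temp_kelvin = temp_celsius + 273.15
    return R_KCAL * temp_kelvin * math.log(kd_molar)


if __name__ == "__main__":
    # Illustrative value only; not a result reported in the text.
    example_dg = -10.0
    for temp in (25.0, 37.0):
        print(f"{temp:.0f} C: Kd = {dg_to_kd(example_dg, temp):.2e} M")
```

under this relation a more negative ΔG gives a smaller kd (tighter binding), and for a fixed negative ΔG the kd increases as the temperature rises.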
the docked protein-protein complex with the minimum rmsd and the highest binding affinity was selected for md simulation to further confirm the stability of the complex. md simulation of the rbd in complex with intact ace2 and with tace2 was performed with gromacs . . (abraham et al., ). the simulation used the charm . force field and the tip p water model in a cubic box. the protein complex in the cubic box was solvated with water molecules to provide an aqueous environment. the system was then neutralized by adding na+ ions, followed by energy minimization to remove steric clashes between atoms. the system was then equilibrated under nvt and npt conditions at constant temperature ( k) and pressure ( bar), respectively. a langevin thermostat was applied to regulate the temperature of the system. the md simulation was then run for ns. to determine post-translational modifications (ptms) in ace2, the protein sequence was submitted to the ptm-ssmp server, which combines the submitted sequence with a site-specific modification profile to predict ptm sites in mammalian proteins (liu et al., ). because glycosylation is the most abundant and diverse post-translational modification of proteins, we further determined the o-glycosylation sites in ace2 using the netoglyc . server, which specifically predicts galnac-type o-glycosylation sites, unique to ser and thr (steentoft et al., ). we further determined the n-glycosylation sites using the netnglyc- . server with a threshold value of . (gupta et al., ). to assess expression of tace2 in e. coli, its soluble expression at °c was predicted with the camsol intrinsic and camsol structurally corrected online solubility prediction tools (sormanni & vendruscolo, ). camsol intrinsic determines solubility on the basis of the amino acid sequence, while the camsol structurally corrected tool determines the solubility profile on the basis of the structure, accounting for the distribution of amino acids in the structure and their solvent exposure.
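for the n-glycosylation step mentioned above, predictors such as netnglyc score asparagines that occur in the asn-xaa-ser/thr sequon, where xaa is any residue except pro. the python sketch below only locates candidate sequons in a sequence. it does not reproduce the neural-network scoring or the threshold used by the server, and the function name and the toy sequence are hypothetical.

```python
import re
from typing import List

# Asn followed by any residue except Pro, then Ser or Thr.
# The lookahead leaves the following residues unconsumed, so overlapping sequons are found.
SEQUON = re.compile(r"N(?=[^P][ST])")


def find_n_glyc_sequons(sequence: str) -> List[int]:
    """Return 1-based positions of Asn residues inside an N-X-S/T sequon."""
    return [match.start() + 1 for match in SEQUON.finditer(sequence.upper())]


if __name__ == "__main__":
    # Toy sequence (hypothetical, not ACE2).
    print(find_n_glyc_sequons("ANGSNPTNAT"))
```

a sequon is a necessary but not sufficient condition for n-glycosylation, which is why dedicated predictors attach a score to each candidate site.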
both camsol runs were performed at ph . . in both methods, solubility profile scores higher than . denote highly soluble regions, while scores lower than − indicate poor solubility in e. coli. in the current study, we propose a truncated version of ace2 that comprises the binding interface for the receptor binding domain (rbd) of the sars-cov-2 spike protein. recent in vitro binding assays have confirmed that rbd is mainly responsible for the initial binding to ace2, which in turn mediates virus entry into the host cell (lan et al., ). variation in the rbd sequence was analyzed across the sars-cov-2 genomes reported so far from various regions of the globe (shu et al., ). the sequence alignment showed more than . % identity for the rbd, with only a single variation, r i, in a sars-cov-2 genome reported from india (figure s ). the remaining sars-cov-2 genome sequences submitted from around the globe have identical rbd sequences, which indicates that the sars-cov-2 rbd is highly conserved globally. structural elucidation has also found the rbd to be highly conserved (lan et al., ). to block spike protein attachment to the cell, the ace2/rbd binding interface comprising residues - of ace2 was selected as the truncated version of ace2. the structure of tace2 was built with i-tasser, with a c-score of . . c-score values range from − to , with higher values indicating greater confidence in the fold, and the high c-score for tace2 suggests a high likelihood that the structure is correct. the tace2 fragment contains almost all the residues involved in binding the rbd of sars-cov-2 (yan et al., ), covering two complete helices (lan et al., ). this suggests that a rationally designed binder based on this interface, with enhanced affinity for rbd, may play a vital role in blocking the interaction of the sars-cov-2 spike protein with ace2, thus inhibiting viral entry into the host cell.
previously, peptide-based strategies have been employed successfully to inhibit fusion of the sars-cov-2 s protein with the membrane receptor (du et al., ). another recent study reported a amino acid peptide, a homologue of the ace2 binding interface, that binds the s protein with low nanomolar affinity. however, because the ace2-binding residues are spread over distant locations on rbd and form a large protein binding site, it is difficult for a small therapeutic peptide to cover all the binding sites on rbd. our proposed tace2 fragment, in contrast, carries almost all the binding residues and can therefore block the attachment of rbd to intact ace2. rbd was docked with intact and truncated ace2 using haddock, a flexible protein-protein docking tool. the method allows the side chains and backbone atoms of both binding partners to remain flexible during the docking run. the haddock scoring function (haddock score) is a linear combination of non-bonded intermolecular van der waals (vdw) and coulomb electrostatic energies and an empirically derived desolvation energy term (vangone et al., ). the haddock scores of ace2 and tace2 were − and − . , respectively (the more negative the better). similarly, the vdw and electrostatic energies of the tace2-rbd complex were also more favorable than those of the ace2-rbd complex, which indicates a higher binding affinity of tace2 for rbd than that of intact ace2 (table ). the rmsd values of ace2 and tace2 in complex with rbd were . and . , respectively, indicating a high likelihood that the docked complexes resemble the native one (vangone et al., ). to further confirm these docking scores, rigid docking was also performed with the patchdock, zdock, and cluspro protein-protein docking tools. the docking results obtained for ace2 were compared with those for tace2 in terms of the energy function of each docking tool (table ). all three docking scores are higher for tace2 than for intact ace2, indicating a high affinity of tace2 for rbd.
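as a worked illustration of the linear combination described above, the python sketch below computes a haddock-style score from its energy terms. the weights used here (1.0, 0.2, 1.0, and 0.1 for the van der waals, electrostatic, desolvation, and ambiguous interaction restraint terms) are the defaults commonly documented for the final refinement stage of haddock 2.x and are an assumption rather than values taken from this study; the class name, the function name, and the example energies are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class HaddockWeights:
    """Weights of the energy terms (assumed HADDOCK 2.x refinement-stage defaults)."""
    vdw: float = 1.0
    elec: float = 0.2
    desolv: float = 1.0
    air: float = 0.1


def haddock_score(e_vdw: float, e_elec: float, e_desolv: float, e_air: float,
                  weights: HaddockWeights = HaddockWeights()) -> float:
    """Weighted sum of the energy terms; more negative means a better-scoring model."""
    return (weights.vdw * e_vdw
            + weights.elec * e_elec
            + weights.desolv * e_desolv
            + weights.air * e_air)


if __name__ == "__main__":
    # Hypothetical energies for two docked models, not values from the study.
    model_a = haddock_score(e_vdw=-50.0, e_elec=-200.0, e_desolv=-10.0, e_air=20.0)
    model_b = haddock_score(e_vdw=-60.0, e_elec=-250.0, e_desolv=-5.0, e_air=30.0)
    best = "model_a" if model_b > model_a else "model_b"
    print(f"model_a={model_a:.1f}, model_b={model_b:.1f}, best={best}")
```

because the terms are weighted differently, two complexes can only be compared through the combined score when the same weights are used for both.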
our docking results showed that seven residues of ace2, glu , thr , asp , glu , tyr , asn , and lys , interact with the rbd residues lys , lys , asn , tyr , gln , tyr , gly , thr , and gly , respectively, which is very similar to the previously reported binding residue profile of the ace2 interface (yan et al., ), with the additional residues thr and glu identified by our docking (figure (a, b)). however, tace2 forms a different network of binding residues than intact ace2. our docking results showed that ser , asn , tyr , glu , gln , gln , and arg of tace2 are involved in binding rbd (figure (c, d)). it appears that the truncation produces conformational changes in the tace2-rbd complex that expose previously buried binding residues and thus facilitate stronger binding of tace2 to rbd compared with native ace2, in agreement with previously reported peptides that inhibit viral attachment to the host cell (koehler et al., ). docking methods alone are not reliable for predicting the binding affinity of protein-protein complexes because of their simple scoring functions (ramírez & caballero, ). the binding affinity of a protein-protein complex also depends on the dissociation constant (kd), ph, and temperature (kastritis & bonvin, ), and these parameters are not included in the benchmarks of docking scoring functions. therefore, we determined the binding affinity of the ace2 variants for rbd with the prodigy server, which estimates binding affinity from structural properties of the protein-protein complexes (vangone & bonvin, ). the ace2 and tace2 complexes showed binding affinities (ΔG) for rbd of − . and − . kcal mol⁻¹, respectively, at temperatures ranging from to °c, indicating a higher binding affinity of tace2 for rbd than that of intact ace2 (table ). similarly, the dissociation constant (kd) of the tace2-rbd complex was more than three-fold lower than that of the intact ace2-rbd complex, showing that tace2 binds rbd more tightly.
a smaller kd value indicates higher stability and stronger binding affinity of a protein-protein complex (johnson et al., ). the ace2 variants showed a significant decline in kd when the temperature was increased from °c to °c, leading to a lower kd ( . × 10^− m) for tace2 (higher affinity) than for intact ace2 ( . × 10^− m) at °c. this kd value of tace2 is lower than the previously reported kd ( nm) of sbp (an ace2-derived peptide of amino acids) for rbd. the optimum stability of the complexes was found at °c (table ). the dramatic changes in binding kinetics might be caused by reduced stability of the ace2 complex below the optimum temperature of °c (zhao et al., ). to determine the structural stability and dynamic behavior of the intact ace2-rbd and tace2-rbd complexes, we performed md simulation for ns using gromacs . . . the haddock docking pose with the lowest energy was selected for each complex for the md simulation run. to investigate the structural stability of each complex, an rmsd plot of the complex backbone was produced. a uniform rmsd plot signifies the structural stability of the tace2-rbd complex. the rmsd of the tace2 complex was . - . nm, while intact ace2 showed an rmsd of . - . nm (figure ). the rmsd of the tace2-rbd complex is lower than that of the previously reported sbp -rbd complex, which is almost . nm, showing the higher stability of the tace2-rbd complex. root mean square fluctuation (rmsf) was determined to evaluate the residue flexibility of both ace2 and rbd in the docked complexes. high rmsf values indicate the mobility of residue side chains relative to their average position (kumar et al., ). the rmsf plot shows that the residues of rbd in the tace2 complex are stable, with a few peaks with rmsf above . nm (figure (b)), while the rbd of the ace2 complex shows many residues with rmsf above . nm (figure (d)). the residues of tace2 at positions , , , and showed a reduction in rmsf because they form binding interactions with rbd (figure (a)).
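the two quantities used here differ in what they average over: rmsd is computed per frame over all atoms against a reference structure, whereas rmsf is computed per atom over all frames against that atom's mean position. the python sketch below shows both definitions on a toy trajectory. it assumes the frames have already been superimposed on the reference, uses hypothetical function names and coordinates, and is not the analysis code of the study; in practice such values are normally obtained with trajectory tools such as `gmx rms` and `gmx rmsf`.

```python
import math
from typing import List, Sequence, Tuple

Coord = Tuple[float, float, float]
Frame = Sequence[Coord]


def rmsd(frame: Frame, reference: Frame) -> float:
    """Root mean square deviation of one frame from a reference (no fitting is done)."""
    if len(frame) != len(reference):
        raise ValueError("frame and reference must contain the same atoms")
    squared = sum((a - b) ** 2 for p, q in zip(frame, reference) for a, b in zip(p, q))
    return math.sqrt(squared / len(frame))


def rmsf(trajectory: Sequence[Frame]) -> List[float]:
    """Per-atom root mean square fluctuation around the time-averaged position."""
    n_frames = len(trajectory)
    n_atoms = len(trajectory[0])
    fluctuations = []
    for i in range(n_atoms):
        mean = [sum(frame[i][k] for frame in trajectory) / n_frames for k in range(3)]
        msd = sum(
            sum((frame[i][k] - mean[k]) ** 2 for k in range(3)) for frame in trajectory
        ) / n_frames
        fluctuations.append(math.sqrt(msd))
    return fluctuations


if __name__ == "__main__":
    # Toy two-atom, two-frame trajectory in nm (hypothetical coordinates).
    traj = [[(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
            [(0.0, 0.0, 0.0), (3.0, 0.0, 0.0)]]
    print("RMSD of frame 2 vs frame 1:", rmsd(traj[1], traj[0]))
    print("per-atom RMSF:", rmsf(traj))
```

a flat rmsd trace therefore reports on the whole complex over time, while low rmsf values at particular residues point to regions that are held in place, for example by interface contacts.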
residues involved in binding the other protein present lower rmsf values, revealing the most stable regions of the complex (ardalan et al., ). similarly, the residue window - of rbd showed higher fluctuation, up to . nm, while fluctuation decreased at the binding residue positions (figure (b)). the largest fluctuation in intact ace2 was observed at the c-terminus, where it exceeded . nm (figure (c)). the overall rmsf values of both tace2 and rbd are below . nm, which indicates that the tace2 complex with rbd is stable, in agreement with a previously reported rmsf value of . nm indicating complex stability (maqsood et al., ). across the trajectories saved every ps during the ns md simulation run, very small backbone deviations were observed for both the intact ace2 and tace2 complexes (figure ). however, the amino acid region - of rbd showed backbone fluctuation, highlighted in yellow (figure (c)), which we suggest is the binding region for ace2. previously, the amino acid region ( - ) of the sars-cov-2 spike protein was also reported as a binding region for ace2 (ibrahim et al., ). table footnotes: * the haddock score is defined as: . evdw + . eelec + . edesol + . eair. ** the z-score produced by haddock indicates standard deviations from the average cluster (the more negative the better). the radius of gyration (rg) of both ace2 complexes describes the overall spread of the molecule during the ns md run. a low rg value indicates better structural integrity and folding behavior (erva et al., ). a slight increase in the rg of the intact ace2-rbd complex was observed during the first ns of the run, with no further drift until the end (figure , red line), whereas the tace2-rbd complex was stable throughout the md run (figure , violet line), which indicates its structural integrity. overall, the md simulation results confirm that tace2 forms a more stable complex with rbd and suggest its inhibitory potential against the sars-cov-2 spike glycoprotein.
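the radius of gyration discussed above is the mass-weighted root mean square distance of the atoms from the centre of mass, so a compact complex gives a small and steady value. the python sketch below evaluates it for a single frame. the function name, coordinates, and masses are hypothetical, and this is not the analysis code of the study; with gromacs the same quantity is normally obtained with `gmx gyrate`.

```python
import math
from typing import Sequence, Tuple

Coord = Tuple[float, float, float]


def radius_of_gyration(coords: Sequence[Coord], masses: Sequence[float]) -> float:
    """Mass-weighted radius of gyration of one frame, in the units of the coordinates."""
    if len(coords) != len(masses) or not coords:
        raise ValueError("coords and masses must be non-empty and of equal length")
    total_mass = sum(masses)
    # Centre of mass, one component at a time.
    com = [sum(m * c[k] for m, c in zip(masses, coords)) / total_mass for k in range(3)]
    # Mass-weighted mean squared distance from the centre of mass.
    weighted_sq = sum(
        m * sum((c[k] - com[k]) ** 2 for k in range(3)) for m, c in zip(masses, coords)
    )
    return math.sqrt(weighted_sq / total_mass)


if __name__ == "__main__":
    # Toy frame with two atoms of equal mass, coordinates in nm (hypothetical).
    frame = [(0.0, 0.0, 0.0), (2.0, 0.0, 0.0)]
    print("Rg:", radius_of_gyration(frame, [12.0, 12.0]))
```

evaluating this per frame and plotting it against time gives the kind of rg trace that is compared between the two complexes.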
post-translational modifications (ptms) play an important role in protein-protein interactions (ppis) (su et al., ). because experimental methods are costly and time-consuming, it is useful to predict computationally the ptm sites on a protein that is to be expressed heterologously. ptm-ssmp predicts ptm sites on human proteins based on the local sequence and a site-specific modification profile (liu et al., ). analysis of ace2 with the ptm-ssmp server predicted ubiquitination at positions and , phosphorylation at , and o-glycosylation at residue position . the ptm site at is important for protein degradation and has no role in ppis (lecker et al., ). in transmembrane proteins, only the extracellular domains may be n-glycosylated (gupta et al., ). however, no n-glycosylation or o-glycosylation site was predicted for tace2. these results indicate that no ptm site important for protein-protein interactions is predicted on tace2. interestingly, an experimental study reported that the lack of glycosylation does not affect the binding of sars-cov-2 rbd to human ace2 (chakraborti et al., ), which supports the expectation that our designed tace2 fragment, if expressed in e. coli, may bind efficiently to the rbd of the sars-cov-2 s glycoprotein.
figure . rmsd plot of the ace2-rbd (red) and tace2-rbd (violet) complex backbone atoms. the tace2 complex shows a lower rmsd than intact ace2, indicating its higher complex stability.
because no ptm site was predicted in tace2, e. coli would be an ideal host for its large-scale expression. e. coli is the easiest, quickest, and cheapest expression host, has a fully known genome, and is the most widely used host for heterologous expression of recombinant proteins (basit et al., ). because ace2 is a eukaryotic protein, its expression in native form in e. coli is uncertain: most eukaryotic proteins are expressed in insoluble form in e. coli and need to be refolded in vitro, which is costly and time-consuming. for this reason, proteins that express in soluble form in e. coli are referred to as "low-hanging fruit", as their bulk production is cost-effective and they are easy to recover (maqsood et al., ). both the sequence-based and the structure-based camsol solubility prediction tools predicted completely insoluble expression of intact ace2 in e. coli, with an intrinsic solubility score of − . , and completely soluble expression of tace2, with a solubility score of . . the software generates a solubility profile with one score per residue, in which scores higher than denote highly soluble regions and scores lower than − denote poorly soluble ones (figure (a, b)). these results suggest e. coli as a suitable host for soluble expression of tace2 using pet a (+) as the expression vector, which favors single-step purification.
structure-based rational design of inhibitory proteins with enhanced affinity for the sars-cov-2 spike glycoprotein may facilitate the development of potential therapeutics. in this study, we designed a truncated version of human angiotensin converting enzyme 2 as a potential inhibitor of the spike glycoprotein. the truncated protein tace2 was extensively studied through protein-protein docking and md simulation for binding to the rbd of the sars-cov-2 spike glycoprotein. we found that tace2 can bind rbd with higher binding affinity and forms a more stable complex than intact ace2. in addition, the tace2 sequence is predicted to express in soluble form in e. coli, which makes it an easy target for rapid large-scale production for sars-cov-2 prevention. we believe that this study narrows down the region of interaction between the sars-cov-2 s glycoprotein and human ace2 and paves the way to further enhance the binding affinity between tace2 and the sars-cov-2 s glycoprotein through rational design. this may open a new path to covid-19 treatment. the authors declare no conflict of interest.
moroccan medicinal plants as inhibitors of covid- : computational investigations gromacs: high performance molecular simulations through multi-level parallelism from laptops to supercomputers novel mutant of escherichia coli asparaginase ii to reduction of the glutaminase activity in treatment of acute lymphocytic leukemia by molecular dynamics simulations and qm-mm studies truncation of the processive cel a of thermotoga maritima results in soluble expression and several fold increase in activity health improvement of human hair and their reshaping using recombinant keratin k improvement in activity of cellulase cel a of thermotoga neapolitana by error prone pcr novel coronavirus structure, mechanism of action, antiviral drug promises and rule out against its treatment basics and recent advances in peptide and protein drug delivery the sars coronavirus s glycoprotein receptor binding domain: fine mapping and functional characterization zdock: an initial-stage protein-docking algorithm the spike protein of sars-cov-a target for vaccine and therapeutic development drug repurposing for coronavirus (covid- ): in silico screening of known drugs against coronavirus cl hydrolase and protease enzymes molecular dynamic simulations of escherichia coli l-asparaginase to illuminate its role in deamination of asparagine and glutamine residues peptide therapeutics: current status and future directions information-driven, ensemble flexible peptide docking using haddock in-silico approaches to detect inhibitors of the human severe acute respiratory syndrome coronavirus envelope protein ion channel netnglyc . server. 
center for biological sequence analysis a review on the cleavage priming of the spike protein on coronavirus by angiotensin-converting enzyme- and furin pharmacokinetics and pharmacodynamics of recombinant human angiotensin-converting enzyme in healthy human subjects novel peptide inhibitors of angiotensin-converting enzyme covid- spike-host cell receptor grp binding site prediction enfuvirtide, an hiv- fusion inhibitor inhibition of human pancreatic ribonuclease by the human ribonuclease inhibitor protein are scoring functions in protein Àprotein docking ready to predict interactomes? clues from a novel binding affinity benchmark a pilot clinical trial of recombinant human angiotensin-converting enzyme in acute respiratory distress syndrome identification of chymotrypsin-like protease inhibitors of sars-cov- via integrated computational approach a fusion-inhibiting peptide against rift valley fever virus inhibits multiple, diverse viruses the cluspro web server for protein-protein docking molecular docking and molecular dynamics studies on b-lactamases and penicillin binding proteins structure of the sars-cov- spike receptorbinding domain bound to the ace receptor protein degradation by the ubiquitin-proteasome pathway in normal and disease states ptm-ssmp: a web server for predicting different types of post-translational modification sites using novel site-specific modification profile characterization of a thermostable, allosteric l-asparaginase from anoxybacillus flavithermus peptide-like and small-molecule inhibitors against covid- zdock server: interactive docking prediction of protein-protein complexes and symmetric multimers the sequence of human ace is suboptimal for binding the s spike protein of sars coronavirus . biorxiv is it reliable to use common molecular docking methods for comparing the binding affinities of enantiomer pairs for their protein target? 
in-silico homology assisted identification of inhibitor of rna binding against -ncov n-protein (n terminal domain) patchdock and symmdock: servers for rigid and symmetric docking the pymol molecular graphics system the foldx web server: an online force field potential inhibitors for targeting mpro and spike of sars-cov- based on sequence and structural pharmacology analysis protein solubility predictions using the camsol method in the study of protein homeostasis precision mapping of the human o-galnac glycoproteome through simplecell technology investigation and identification of functional post-translational modification sites associated with drug binding and protein-protein interactions -angiotensin-converting enzyme a graphical interface for the foldx forcefield the haddock . web server: user-friendly integrative modeling of biomolecular complexes contacts-based prediction of binding affinity in protein-protein complexes. elife, , e sense and simplicity in haddock scoring: lessons from casp-capri round stilbene-based natural compounds as promising drug candidates against covid- receptor recognition by the novel coronavirus from wuhan: an analysis based on decade-long structural studies of sars coronavirus receptor recognition by the novel coronavirus from wuhan: an analysis based on decade-long structural studies of sars coronavirus peptide-based inhibitors of protein-protein interactions prodigy: a web server for predicting the binding affinity of protein-protein complexes structural basis for the recognition of sars-cov- by full-length human ace the i-tasser suite: protein structure and function prediction a highly conserved cryptic epitope in the receptor-binding domains of sars-cov- and sars-cov determination of atomic desolvation energies from the structures of crystallized proteins the first-in-class peptide binder to the sars-cov- spike protein. 
key: cord- -maa c a authors: zhang, yuan; zheng, nan; zhong, yang title: computational characterization and design of sars coronavirus receptor recognition and antibody neutralization date: - - journal: comput biol chem doi: . /j.compbiolchem. . . sha: doc_id: cord_uid: maa c a the sequential determination of crystal structures of the sars coronavirus spike receptor-binding domain (rbd) in complex with its cellular receptor or neutralizing antibody opened a door for the design and development of antiviral competitive inhibitors. based on those complex structures, we conduct computational characterization and design of rbd-mediated receptor recognition and antibody neutralization. the comparisons between computational predictions and experimental evidence validate our structural bioinformatics protocols. the calculations also predict a number of single substitutions on rbd, receptor or antibody that could markedly increase the binding affinities of those complexes. it is reasonable to anticipate that our structure-based, computation-derived hypotheses could inform future biochemical and immunological tests. as an envelope glycoprotein, the spike protein of severe acute respiratory syndrome coronavirus (sars-cov) plays a key role in viral entry and neutralization (bartlam et al., ; denison, ; lau and peiris, ; xu and gao, ; zhu, ) . this structural protein consists of two functional regions: the outer globular s region responsible for the initial attachment to cellular receptor and the inner stalk s region contributing to the subsequent fusion between viral envelope and cellular membrane (beniac et al., ; hofmann and pohlmann, ; lin et al., ; xiao and dimitrov, ) . a membrane-associated zinc metallopeptidase, angiotensin-converting enzyme (ace ), has been identified as the functional receptor for sars-cov (li et al., ) .
and a soluble form of ace could block the association of s region with the permissive vero e cells (li et al., ; moore et al., ) . in addition, a -amino acid fragment (residues - ), located within the s region, was demonstrated as an independently folded receptor-binding domain (rbd) capable of attaching ace more efficiently (ic < nm) compared with the full s region (ic ≈ nm) . besides, this rbd was able to elicit highly potent neutralizing antibodies in the immunized animals, which conferred those animals significant protection from the challenge of pathogenic sars-cov (du et al., ; he et al., he et al., , a he et al., ,b, a zakhartchouk et al., ; zhao et al., ) . moreover, a human monoclonal antibody r, isolated from a nonimmune human antibody library, was shown to potently neutralize sars-cov through targeting the rbd and blocking receptor recognition (sui et al., ) . the epitope mapping illustrated a -amino acid conformationally sensitive fragment (residues - ) within the rbd was the neutralizing epitope of r (sui et al., ) . furthermore, another human monoclonal antibody m also exhibited potent neutralization of sars-cov by competition with ace for binding to rbd (prabakaran et al., ) . together those data suggest the receptor association process of sars-cov is an attractive opportunity for therapeutic intervention (de clercq, ; he and jiang, ; hofmann and pohlmann, ; jiang et al., ; kuhn et al., ; yeung et al., ) . the peptide or peptidomimetic antagonist leads, including the sars-cov spike rbd, the soluble form of ace and the neutralizing antibodies r plus m , should be able to potently abolish viral attachment to host cells. 
in this study, we conducted structural bioinformatics analyses on the crystal structures of the sars-cov rbd complexed with functional receptor or neutralizing antibody (hwang et al., ; li et al., a, ; prabakaran et al., ) to predict single substitutions on spike rbd, receptor or antibodies that might markedly increase the binding affinities of the complexes, for the design and development of anti-sars agents. three coordinate files were retrieved from the protein data bank (pdb) (berman et al., ) . one file is the ace -bound rbd (pdb code: ajf) (li et al., a) , while the others are the rbd complexed with r (pdb code: ghw) (hwang et al., ) or m (pdb code: dd ) (prabakaran et al., ) . both the first and second files harbor a pair of sister complexes, and in the third file the heavy chain and the light chain of m each make their own contacts with the rbd. thus, a total of six complex structures (ae/bf for ace -rbd, ab/cd for r-rbd and hs/ls for m -rbd) were each subjected to computational simulation. first, the program foldx, based on an empirical effective energy function, was employed to calculate the binding free energies of the wild-type complexes. then, computational alanine scanning of the protein-protein interfaces was performed to evaluate the energetic contribution of individual binding sites to complex formation. positions yielding a calculated increase in association energy of more than kcal/mol on alanine substitution were defined as energetic hot (important) spots according to previous criteria (guerois et al., ; guerois and serrano, ; kiel and serrano, ; kiel et al., , ) . the next step was to redesign the interactions between rbd and its binding partners using the software deepview (arnold et al., ; guex and peitsch, ) .
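the alanine-scanning criterion described above can be sketched in a few lines of python. this is a minimal illustration and not the foldx workflow itself: the site labels, the energy values and the cutoff passed as `threshold` are hypothetical placeholders, and the function only applies the stated rule that a position counts as a hot spot when alanine substitution raises the association energy by more than a chosen cutoff.

```python
from typing import Dict, List


def binding_energy_change(wild_type: float, mutant: float) -> float:
    """Change in binding energy (kcal/mol) on mutation; positive means weaker binding."""
    return mutant - wild_type


def hot_spots(wild_type: float,
              alanine_energies: Dict[str, float],
              threshold: float) -> List[str]:
    """Return the sites whose alanine substitution raises the binding
    energy by more than `threshold` kcal/mol, sorted by label."""
    return sorted(
        site
        for site, energy in alanine_energies.items()
        if binding_energy_change(wild_type, energy) > threshold
    )


# Hypothetical numbers for illustration only; they are not taken from the study,
# and the cutoff of 1.0 is a placeholder, not the cutoff used by the authors.
example_wild_type = -10.0
example_scan = {"site_1": -7.5, "site_2": -9.8, "site_3": -8.9}
example_hot = hot_spots(example_wild_type, example_scan, threshold=1.0)
```

passing the cutoff as an argument keeps the sketch independent of any particular criterion, so the same function can be reused with whatever threshold a given protocol specifies.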
each of the binding sites on the rbd, receptor or antibody was saturated with virtual substitutions, i.e., replaced with every natural amino acid residue except the original one. finally, the reconstructed models were fed to the program foldx to compute their binding energies. here, only variants with a binding energy at least kcal/mol lower than that of the wild type were taken into consideration. the calculated binding energy values and hot spots of the wild-type complexes are shown in table . the complexes ace -rbd (ae/bf) and r-rbd (ab/cd) show a close correlation between their interaction energies (− . /− . kcal/mol versus − . /− . kcal/mol) and buried surface area ( Å versus Å ), gap volume ( Å versus Å ), or binding affinity ( . nm versus . nm) (hwang et al., ) . these associations indicate that higher geometric complementarity, corresponding to a larger buried surface area and a smaller gap volume, gives the complex r-rbd a lower interaction energy than the complex ace -rbd and consequently a stronger binding affinity. similarly, a correlation of binding energy with buried surface area is also found for the complex m -rbd, in which the heavy chain and the light chain contribute % and % to the total buried surface (prabakaran et al., ) , and the rbd association energy of the former chain (− . kcal/mol) is markedly lower than that of the latter (− . kcal/mol). the agreement of the computational predictions with structural observations and biochemical evidence strongly suggests the reliability of our protocols. as to the hot spots of the complexes, consistency between computational predictions and experimental evidence is clearly detected for rbd and receptor.
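the saturation step and the selection rule from the methods (every natural residue except the original one, then keeping only variants whose binding energy falls below the wild type by at least a chosen margin) can be sketched as follows. this is a hedged illustration under stated assumptions: the function names, the variant labels and the `min_gain` margin are hypothetical, and the energies would in practice come from an external program such as foldx rather than from this code.

```python
from typing import Dict, List

# One-letter codes of the 20 natural amino acids.
NATURAL_AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def saturation_substitutions(original: str) -> List[str]:
    """List the 19 residues that can replace `original` at one site."""
    if len(original) != 1 or original not in NATURAL_AMINO_ACIDS:
        raise ValueError(f"unknown residue code: {original!r}")
    return [aa for aa in NATURAL_AMINO_ACIDS if aa != original]


def improved_variants(wild_type: float,
                      variant_energies: Dict[str, float],
                      min_gain: float) -> Dict[str, float]:
    """Keep variants whose binding energy is at least `min_gain` kcal/mol
    lower than the wild type; the returned value is the gain for each."""
    return {
        name: wild_type - energy
        for name, energy in variant_energies.items()
        if wild_type - energy >= min_gain
    }
```

for a single interface site this enumerates 19 candidate models, so the number of energy evaluations grows linearly with the number of binding sites examined.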
in ace -rbd complexes, three receptor residues (glu , asp and tyr on the chain a of complex ae or glu , tyr and lys on the chain b of complex bf) form one hot spot cluster interacting with another hot cluster formed by five or six rbd residues (arg , tyr , tyr , tyr and tyr on the chain e of complex ae and the chain f of complex bf, whereas asn only on the chain f). the interactions between the two hot clusters make the major contribution to the binding free energy of ace -rbd complexes. notably, our predictions are in agreement with previous experimental alanine mutagenesis, which identified two hot spots on rbd (arg and asn ) (chakraborti et al., ) and another two on receptor (tyr and lys ) . in addition, computational alanine scanning on the sister complexes ab and cd successfully identified an rbd hot spot (asp ) revealed in mutational binding analyses (sui et al., ) . in sharp contrast to ace , the antibody r possesses four hot residues (tyr , asn , arg and trp ) that are scattered on the binding surface rather than concentrated into a cluster. the difference in the number and distribution of hot spots might account for the large gap between the interaction energies of r-rbd (− . and − . kcal/mol) and those of ace -rbd (− . and − . kcal/mol) for association with rbd, or the higher spike-binding affinity of r compared to that of receptor.
table note: the complexes ae and bf with the chains a, b for ace and the chains e, f for rbd; the complexes ab and cd with the chains a, c for rbd and the chains b, d for r; the complexes hs and ls with the chains h, l and s for heavy and light chains of m plus rbd.
finally, only two neighboring hot spots (trp and asp ) are found on the light chain of m while none are found on the heavy chain. thus, an interesting discovery is that among the five or six ace -binding hot spots of rbd, three (tyr , tyr and tyr ) are simultaneously r-neutralizing hot spots whereas only one (tyr ) is important for m neutralization.
this finding indicates that r might have greater potential than m for inhibition of spike-mediated infection. in summary, the consistency of calculations with experiments mentioned above further validates our approaches to characterizing protein-protein interactions. the predicted replacements on spike rbd, cellular receptor or neutralizing antibody with significant increase in binding affinity are listed in table . the comparisons between virtual mutants derived from sister complexes of ace -rbd or r-rbd consistently identify a number of substitutions worthy of biochemical and immunological experimental tests. for instance, recent experimental evidence revealed the great potential of ace in the protection of several animal models from sars-cov-induced lung injury or severe acute lung failure (kuba et al., , ) . simultaneously, the crystal structures of the native and inhibitor-bound forms of ace (towler et al., ; turner et al., ) laid a solid foundation for the discovery of novel small-molecule inhibitors of its enzymatic activity or spike-mediated virus entry by chemical genetics (huentelman et al., ; kao et al., ) and the identification of its crucial active-site residues by site-directed mutagenesis (guy et al., a,b) . very recently, a modest anti-sars activity (ic ≈ . mm) was observed for an ace -derived peptide containing two segments of receptor (residues - and - ) linked by glycine (han et al., ) . it should be pointed out that both the experimentally confirmed hot spots (tyr and lys ) and the predicted sites for replacements (thr , lys and his ) are nested in those two segments. similarly, a small peptide derived from spike protein (residues - ) also blocks viral receptor recognition with an ic of . nm (ho et al., ) . our two calculated hot spots (tyr and tyr ), in combination with two target positions (tyr and gln ), are also located in this short fragment.
consequently, it is reasonable to anticipate that our blueprint could effectively increase the binding affinity of the two novel peptides to disrupt sars-cov infection. the swiss-model workspace: a web-based environment for protein structure homology modelling structural insights into sars coronavirus proteins architecture of the sars coronavirus prefusion spike the protein data bank the sars coronavirus s glycoprotein receptor binding domain: fine mapping and functional characterization potential antivirals and antiviral strategies against sars coronavirus infections severe acute respiratory syndrome coronavirus pathogenesis, disease and vaccines: an update receptorbinding domain of sars-cov spike protein induces long-term protective immunity in an animal model the sh -fold family: experimental evidence and prediction of variations in the folding pathways predicting changes in the stability of proteins and protein complexes: a study of more than mutations swiss-model and the swiss-pdbviewer: an environment for comparative protein modeling identification of critical active-site residues in angiotensin-converting enzyme- (ace ) by site-directed mutagenesis membrane-associated zinc peptidase families: comparing ace and ace identification of critical determinants on ace for sars-cov entry and development of a potent entry inhibitor vaccine design for severe acute respiratory syndrome coronavirus receptor-binding domain of sars-cov spike protein induces highly potent neutralizing antibodies: implication for developing subunit vaccine receptor-binding domain of severe acute respiratory syndrome coronavirus spike protein contains multiple conformation-dependent epitopes that induce highly potent neutralizing antibodies identification of a critical neutralization determinant of severe acute respiratory syndrome (sars)-associated coronavirus: importance for designing sars vaccines identification and characterization of novel neutralizing epitopes in the receptor-binding 
domain of sars-cov spike protein: revealing the critical antigenic determinants in inactivated sars-cov vaccine antigenic and immunogenic characterization of recombinant baculovirus-expressed severe acute respiratory syndrome coronavirus spike protein: implication for vaccine design a single amino acid substitution (r a) in the receptor-binding domain of sars coronavirus spike protein disrupts the antigenic structure and binding activity crossneutralization of human and palm civet severe acute respiratory syndrome coronaviruses by antibodies targeting the receptor-binding domain of spike protein design and biological activities of novel inhibitory peptides for sars-cov spike protein and angiotensin-converting enzyme interaction cellular entry of the sars coronavirus structure-based discovery of a novel angiotensin-converting enzyme inhibitor structural basis of neutralization by a human anti-severe acute respiratory syndrome spike protein antibody, r angiotensinconverting enzyme protects from severe acute lung failure sars vaccine development identification of novel small-molecule inhibitors of severe acute respiratory syndromeassociated coronavirus by chemical genetics the ubiquitin domain superfold: structure-based sequence alignments and characterization of binding epitopes a detailed thermodynamic analysis of ras/effector complex interfaces recognizing and defining true ras binding domains. ii. 
in silico prediction based on homology modelling and energy calculations a crucial role of angiotensin converting enzyme (ace ) in sars coronavirus-induced lung injury angiotensin-converting enzyme in lung diseases angiotensin-converting enzyme : a functional receptor for sars coronavirus pathogenesis of severe acute respiratory syndrome angiotensin-converting enzyme is a functional receptor for the sars coronavirus structure of sars coronavirus spike receptor-binding domain complexed with receptor receptor and viral determinants of sars-coronavirus adaptation to human ace conformational states of the severe acute respiratory syndrome coronavirus spike protein ectodomain surface ultrastructure of sars coronavirus revealed by atomic force microscopy retroviruses pseudotyped with the severe acute respiratory syndrome coronavirus spike protein efficiently infect cells expressing angiotensin-converting enzyme structure of severe acute respiratory syndrome coronavirus receptor-binding domain complexed with neutralizing antibody the foldx web server: an online force field potent neutralization of severe acute respiratory syndrome (sars) coronavirus by a human mab to s protein that blocks receptor association evaluation of human monoclonal antibody r for immunoprophylaxis of severe acute respiratory syndrome by an animal study, epitope mapping, and analysis of spike variants ace x-ray structures reveal a large hinge-bending motion important for inhibitor binding and catalysis ace : from vasopeptidase to sars virus receptor a -amino acid fragment of the sars coronavirus s protein efficiently binds angiotensinconverting enzyme immunological responses against sars-coronavirus infection in humans severe acute respiratory syndrome coronavirus entry into host cells: opportunities for therapeutic intervention immunogenicity of a receptor-binding domain of sars coronavirus spike protein in mice: implications for a subunit vaccine a study on antigenicity and receptor-binding 
ability of fragment - of the spike protein of sars coronavirus key: cord- - nd ptrg authors: lu, guangwen; hu, yawei; wang, qihui; qi, jianxun; gao, feng; li, yan; zhang, yanfang; zhang, wei; yuan, yuan; bao, jinku; zhang, buchang; shi, yi; yan, jinghua; gao, george f. title: molecular basis of binding between novel human coronavirus mers-cov and its receptor cd date: - - journal: nature doi: . /nature sha: doc_id: cord_uid: nd ptrg the newly emergent middle east respiratory syndrome coronavirus (mers-cov) can cause severe pulmonary disease in humans( , ), representing the second example of a highly pathogenic coronavirus, the first being sars-cov( ). cd (also known as dipeptidyl peptidase , dpp ) was recently identified as the cellular receptor for mers-cov( ). the engagement of the mers-cov spike protein with cd mediates viral attachment to host cells and virus–cell fusion, thereby initiating infection. here we delineate the molecular basis of this specific interaction by presenting the first crystal structures of both the free receptor binding domain (rbd) of the mers-cov spike protein and its complex with cd . furthermore, binding between the rbd and cd is measured using real-time surface plasmon resonance with a dissociation constant of . nm. the viral rbd is composed of a core subdomain homologous to that of the sars-cov spike protein, and a unique strand-dominated external receptor binding motif that recognizes blades iv and v of the cd β-propeller. the atomic details at the interface between the two binding entities reveal a surprising protein–protein contact mediated mainly by hydrophilic residues. sequence alignment indicates, among betacoronaviruses, a possible structural conservation for the region homologous to the mers-cov rbd core, but a high variation in the external receptor binding motif region for virus-specific pathogenesis such as receptor recognition. supplementary information: the online version of this article (doi: . 
/nature ) contains supplementary material, which is available to authorized users. the recent identification of a novel coronavirus, mers-cov, which, as of may th , had infected patients with a total of fatalities, has drawn worldwide attention as a potential cause of a future pandemic .
unlike most coronaviruses circulating in humans that only cause mild respiratory illness , mers-cov possibly represents a second reported coronavirus of severely high virulence after sars-cov, which caused over , infection cases globally in , with more than deaths . the clinical manifestations of mers-cov infection include fever, cough, acute respiratory distress syndrome and, in some cases, accompanying renal failure , , and are very similar to those caused by sars-cov. however, the novel coronavirus diverges from sars-cov in genomic sequence, and is much more closely related to the bat-derived hku and hku coronaviruses , . consistent with phylogenetic analysis, mers-cov does not use the sars-cov receptor, angiotensin converting enzyme (ace ), as its entry receptor ; rather, a recent study showed that it uses human cd for this purpose . cd is the third peptidase to be identified as a functional coronavirus receptor, the others being aminopeptidase n (anpep, also known as apn and cd ) , and ace (ref. ) . the recognition of cd by mers-cov is mediated by virus surface spike (s) protein . as with other coronaviruses, the mers-cov s protein would be cleaved in host cells into s and s subunits (fig. a) . s engages the receptor whereas s , with typical sequence motifs homologous to those identified as the heptad repeats in class i enveloped viruses [ ] [ ] [ ] , should mediate membrane fusion. the exploitation of the virus-receptor interaction and thus of the intervention strategies requires an atomic delineation of the receptor-binding properties of s . on the basis of previous studies, the receptor attachment sites of coronavirus s subunits might locate to either the amino-terminal (such as in murine hepatitis virus ) or the carboxy-terminal (such as in, for example, sars-cov and human coronavirus nl (ref. )) domain. we therefore tested individually the binding of mers-cov s and its n-and c-terminal-domain proteins to cell-surface-expressed cd molecules. 
the receptor-binding capacity was attributed to the c-terminal amino acids - of mers-cov s (fig. b) . we hereafter refer to this domain as rbd. the potent interaction between mers-cov rbd and cd was further demonstrated by surface plasmon resonance assays, in which cd binds to mers-cov rbd with a dissociation constant (k_d) of about . nm (k_on, . m⁻¹ s⁻¹; k_off, . s⁻¹), but does not bind to the rbd of sars-cov (fig. c) . we crystallized mers-cov rbd and solved its structure at a resolution of . Å (supplementary table ). two molecules of essentially the same structure are present in the asymmetric unit. each molecule contains consecutive density-traceable amino acids from v to l . a dali search within the protein data bank (pdb) revealed clear structural homology between mers-cov rbd and sars-cov rbd (pdb code, dd ; z score, . ). we therefore divided the mers-cov rbd structure into two subdomains: a core and an external β-sheet, using the structure of sars-cov rbd as a reference. the core subdomain reveals a five-stranded antiparallel β-sheet (β , β , β , β and β ) in the centre. the connecting helices (four α-helices: α - and two -helices: g and g ) and two small β-strands (β and β ) further decorate the sheet on both sides, together forming a globular fold. three disulphide bonds, connecting c to c , c to c , and c to c , respectively, stabilize the core-domain structure from the interior. at the solvent-exposed side, the rbd termini are clinched adjacent to each other (fig. a, b) . this subdomain fold is very similar to that of the sars-cov rbd core (a root mean squared deviation of . Å for cα pairs). superimposition of the two structures reveals a well-aligned centre sheet and homologous peripheral helices and strands, although several intervening loops are observed to exhibit large conformational variance (fig. c) .
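the kinetic constants reported from the surface plasmon resonance assay are linked by the standard relation k_d = k_off / k_on, and a dissociation constant can in turn be converted into a standard binding free energy through ΔG° = RT ln(k_d). the sketch below is a hedged illustration of these two textbook relations only; the function names are hypothetical, and any numbers passed to them are the caller's own, not the measured values from this study.

```python
import math

# Gas constant in kcal/(mol*K).
GAS_CONSTANT_KCAL = 1.987204e-3


def dissociation_constant(k_on: float, k_off: float) -> float:
    """Equilibrium dissociation constant (M) from the association rate
    k_on (1/(M*s)) and the dissociation rate k_off (1/s)."""
    if k_on <= 0 or k_off <= 0:
        raise ValueError("rate constants must be positive")
    return k_off / k_on


def binding_free_energy(k_d: float, temperature_k: float = 298.15) -> float:
    """Standard binding free energy (kcal/mol) for a dissociation constant
    given in mol/L, relative to a 1 M standard state."""
    if k_d <= 0:
        raise ValueError("dissociation constant must be positive")
    return GAS_CONSTANT_KCAL * temperature_k * math.log(k_d)
```

a tighter complex has a smaller k_d and therefore a more negative free energy, which is why nanomolar binders correspond to values on the order of −10 kcal/mol or lower at room temperature.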
the external subdomain of mers-cov rbd is mainly a β-sheet structure with three large (β , β and β ) and one small (β ) strand arranged in an antiparallel manner. it is anchored to the rbd core through the β / , β / and β / intervening loops, which touch the core subdomain like a clamp at both the top and bottom positions. two small helices (g and g ) and most of the connecting loops in this subdomain locate on the interior side of the sheet, hence exposing a flat exterior sheet-face to the solvent. residues c and c form the fourth disulphide bond, linking the g helix to strand β (fig. a, b) . with no observable structure homology (fig. c) , the external subdomains of mers-cov and sars-cov rbds are topological equivalents, both being present as an 'insertion' between the equivalent core-strands (strands β and β in mers-cov, and β and β in sars-cov) (supplementary fig. ). to elucidate the structural basis of the virus-receptor engagement, we further prepared the rbd-cd complex by in vitro mixture of the two proteins and then purification on a gel filtration column. consistent with the high binding affinity between mers-cov rbd and cd , the complex is easily obtainable and stable (supplementary fig. ). the complex structure was solved at . Å resolution (supplementary table ) with one rbd binding to a single cd molecule in the asymmetric unit. the receptor, as shown in previous reports , , is composed of an eight-bladed β-propeller domain and an α/β hydrolase domain. mers-cov rbd binds to the side-surface of the cd β-propeller, recognizing blades iv and v and a small bulged helix in the blade-linker. as for the viral ligand, the entire receptor binding site locates in the external subdomain and to the solvent-exposed sheet-face, qualifying the subdomain as the receptor binding motif (rbm) (fig. a) . overall, engagement of the receptor does not induce obvious conformational changes in rbm, although small structural variance could be observed for the tip-loops.
the g -α loop in the rbd core, however, unexpectedly exhibits a large conformational difference between the free and the bound structures (supplementary fig. ) . we believe this is due to a crystal contact present in the free rbd structure, which is interrupted in the complex crystal by the engaging receptor. cd is a type ii transmembrane protein. it is present as a homodimer on the cell surface - . the dimerization of the peptidase relies on broad intermolecule contacts contributed by the hydrolase domain and the extended strands in blade iv of the β-propeller , . a lateral binding of mers-cov rbd to cd would therefore not disrupt cd dimerization. accordingly, a similar u-shaped cd dimer could be generated by symmetry operations of the complex structure. the viral ligand locates at the membrane-distal tip of the dimer, corresponding well to a trans interaction between the virus and the receptor (fig. b) . considering that the rbd n and c termini are on the same side distant from cd , it is unlikely that the remaining s domains would contact the receptor molecule. the binding mode revealed by the complex structure is also in good accordance with a previous study showing that the virus-receptor interaction is independent of the peptidase activity of cd (ref. ) . the bound rbd is too far away to interfere with either the substrate/product accessing tunnels or the catalytic centre , (fig. b) . overall, a surface area of . and . Å in cd and mers-cov rbd, respectively, is buried by complex formation (fig. a) . scrutiny of the binding interface reveals a group of hydrophilic residues at the site, forming a polar-contact (h-bond and salt-bridge) network. these interactions are predominantly mediated by the residue side chains (including rbd y with cd r , n with q , k with t , d with r , e with q , and d with k ), although cd l and rbd d are observed to contact rbd r and cd y , respectively, through the main-chain oxygen atom (fig. b) .
in addition, the bulged helix in cd properly positions three hydrophobic residues a , l and i into close proximity with the rbd amino acids y , w and v , forming a hydrophobic centre at the interface (fig. c) . further virus-receptor contacts include v and i of cd packing against p and the apolar carbon atoms of r and e in rbd (fig. d) , and a cd n -linked carbohydrate moiety interacting with rbd amino acids w and e (fig. e) . overall, the virus-receptor engagement is dominated by the polar contacts mediated by the hydrophilic residues, and mutations of those in rbd (six alanine substitutions and one y f mutation of the cd -interacting amino acids) completely abrogated its interaction with cd (supplementary fig. ). the features of these residue interactions are very similar to those mediating the interaction between adenosine deaminase (ada) and cd (ref. ). by a pairwise comparison, we unexpectedly found that all those cd residues identified in the virus-receptor interface are also involved in ada binding, indicating a competition between ada and the virus for cd receptor. as the ada-cd interaction is shown to induce co-stimulatory signals in t cells , this may indicate a possible manipulation of the host immune system by mers-cov through competition for the ada-recognition site. it is also noteworthy that those cd residues involved in rbd binding are highly conserved between human and bat, with only two variations (i t and r q), explaining the capability of mers-cov to use bat cd for cell entry (supplementary fig. ).
[figure legend fragment] dimer observed in the complex crystal. the two-fold axis is shown as an upright arrow. the transmembrane topology of cd is indicated with a modelled lipid-bilayer membrane. in cd , the propeller and side openings indicated as the substrate entrance/exit tunnels are marked with arrows, and the catalytic triad residues are highlighted as spheres. colour selections are the same as in a, and the cd α/β hydrolase domain is shown in orange.
[figure legend fragment] the n and c termini are labelled. to facilitate comparison, the secondary-structure elements of sars-cov rbd (pdb code, dd ) are marked with spiral (helices) and arrow (strands) lines below the sequence. the cysteine residues that form disulphide bonds are labelled as in a, and residue n with a star. c, a structural alignment between mers-cov (magenta for core and cyan for external subdomains) and sars-cov (green) rbds.
coronaviruses can be categorized into three main genera or groups (group (alpha), group (beta) and group (gamma) coronaviruses) . both mers-cov and sars-cov belong to the betacoronavirus genus, but are classified into different lineage subgroups (subgroup b for sars-cov and c for mers-cov) . we noted that the spike sequences are of low identity among different subgroup members. for example, mers-cov and sars-cov s proteins show a sequence identity of less than %. nevertheless, rbds of the two coronaviruses are homologous for the core subdomain. notably, the three interior disulphide bonds in the core are well-aligned for the steric positions in the two rbd structures and well-conserved in sequence among betacoronaviruses. conversely, the external rbm region is highly variable in both length and residue composition (supplementary fig. ). consistently, no structural homology in this subdomain is observed between mers-cov and sars-cov. yet it is this subdomain that engages cellular receptors. we therefore assume that betacoronaviruses probably have a similar core-domain fold in the s protein to present the external amino acids with divergent structures for viral pathogenesis, such as receptor recognition. our work presents the fifth structure of virus s protein-receptor complexes in the coronaviridae family [ ] [ ] [ ] . taking into account both the rbd structure and the binding mode with receptors, mers-cov is related to sars-cov (a single insertion functioning as rbm) but differs from porcine respiratory coronavirus and nl (ref.
) of alphacoronaviruses (multiple discontinuous rbms) ( supplementary fig. ) . nevertheless, related structural topologies can still be observed in rbds of these coronaviruses . we noted that in the rbd-receptor complex structures of both mers-cov and porcine respiratory coronavirus the binding interfaces involve a receptor n-glycan. this might represent another cross-genus similarity in the coronaviridae family, which supports a proposed common evolutionary origin of coronavirus s proteins . it would therefore be interesting to investigate the contribution of the sugar moiety to the virus-receptor interaction for mers-cov in the future. vaccination remains the most useful measure to combat viral infection and transmission. a large number of antibodies show neutralization activity by targeting the rbd and thereby disrupting the virus-receptor engagement. therefore, a properly folded rbd could be an ideal immunogen for vaccination, as demonstrated for sars-cov . a recent report indeed shows the presence of s-specific neutralizing antibodies in mers-cov-infected patients . it may be worth attempting to test the immunization effect of mers-cov rbd in the future. protein expression, purification, crystallization and structure determination. both his-tagged cd and mers-cov rbd proteins were expressed in insect high five cells using the bac-to-bac baculovirus expression system (invitrogen). the recombinant proteins were then purified via nickel-chelated affinity chromatography and gel filtration. crystals were obtained by initial screening with the commercially available kits followed by optimization. the rbd structure was solved by single-wavelength anomalous diffraction and the complex structure by molecular replacement. full methods and any associated references are available in the online version of the paper. supplementary information is available in the online version of the paper. 
isolation of a novel coronavirus from a man with pneumonia in saudi arabia severe respiratory illness caused by a novel coronavirus world health organization. cumulative number of reported probable cases of severe acute respiratory syndrome (sars) dipeptidyl peptidase is a functional receptor for the emerging human coronavirus-emc world health organization. novel coronavirus infection -update coronavirus pathogenesis and the emerging pathogen severe acute respiratory syndrome coronavirus. microbiol genomic characterization of a newly discovered coronavirus associated with acute respiratory distress syndrome in humans sars-like virus in the middle east: a truly bat-related coronavirus causing human diseases human coronavirus emc does not require the sars-coronavirus receptor and maintains broad replicative capability in mammalian cell lines human aminopeptidase n is a receptor for human coronavirus e aminopeptidase n is a major receptor for the entero-pathogenic coronavirus tgev angiotensin-converting enzyme is a functional receptor for the sars coronavirus combating the threat of pandemic influenza: drug discovery approaches coiled coils in both intracellular vesicle and viral membrane fusion following the rule: formation of the -helix bundle of the fusion core from severe acute respiratory syndrome coronavirus spike protein and identification of potent peptide inhibitors crystal structure of mouse coronavirus receptor-binding domain complexed with its murine receptor structure of sars coronavirus spike receptorbinding domain complexed with receptor crystal structure of nl respiratory coronavirus receptor-binding domain complexed with its human receptor dali server: conservation mapping in d crystal structure of the swine-origin a (h n )- influenza a virus hemagglutinin (ha) reveals similar antigenicity to that of the pandemic virus processing of x-ray diffraction data collected in oscillation mode collaborative computing project number . 
the ccp suite: programs for protein crystallography advances in direct methods for protein crystallography pushing the boundaries of molecular replacement with maximum likelihood density modification for macromolecular phase improvement phenix: a comprehensive python-based system for macromolecular structure solution coot: model-building tools for molecular graphics procheck: a program to check the stereochemical quality of protein structures espript: analysis of multiple sequence alignments in postscript acknowledgements this work was supported by the ministry of science and technology of china (most) project (grant no. cb ) and the national natural science foundation of china (nsfc, grant no. ). assistance by the staff at the shanghai synchrotron radiation facility (ssrf) of china and the high energy accelerator research organization (kek) of japan is acknowledged. we thank z. fan and t. zhao for their technical assistance. g.f.g. is a leading principal investigator of the nsfc innovative research group (grant no. ). we thank m. yang from tsinghua university for his help with data collection. author contributions g.f.g. designed and coordinated the study. g.l., y.h., q.w. and y.s. conducted the experiments. j.q. and f.g. collected the data sets and solved the structures. y.l., y.z., w.z., y.y. and j.y. assisted with the cell maintenance and protein preparations. g.l. and g.f.g. wrote the manuscript and j.y., j.b. and b.z. participated in the manuscript editing and discussion.author information the coordinates and related structure factors have been deposited into the protein data bank pdb under accession numbers kqz for the free mers-cov rbd structure and kr for the rbd-cd complex structure. reprints and permissions information is available at www.nature.com/reprints. the authors declare no competing financial interests. readers are welcome to comment on the online version of the paper. correspondence and requests for materials should be addressed to g.f.g. 
(gaof@im.ac.cn). protein expression and purification. the proteins used for crystallization and surface plasmon resonance experiments were prepared with the bac-to-bac baculovirus expression system (invitrogen). the coding sequences for mers-cov rbd (genbank accession number jx , spike residues - ), sars-cov rbd (accession number nc_ , spike residues - ), human cd (accession number np_ , residues - ) and human ace (accession number baj , residues - ) were individually cloned into the pfastbac vector. for each construct, a previously described gp signal peptide sequence was added to the protein n terminus for protein secretion, and a hexa-his tag was added to the c terminus to facilitate further purification processes. transfection and virus amplification were conducted with sf cells, and the recombinant proteins were produced in high five cells. the cell culture was collected h after infection and passed through a -ml histrap hp column (ge healthcare). after removal of most of the impurities, the recovered proteins were then pooled and further purified on a superdex column (ge healthcare). finally, each collected protein was prepared in a buffer consisting of mm tris-hcl (ph . ) and mm nacl and concentrated to about mg ml for further use. to obtain the complex of mers-cov rbd bound to cd , the individual proteins were mixed in vitro at a molar ratio of : and incubated at °c for about h. the complex was then further purified on a superdex column, and concentrated to about mg ml for crystallization experiments. to prepare the fc chimaeric proteins, the fragment encoding mers-cov s (residues - ) or ntd (residues - ) or rbd (adding the s residues - of the signal peptide to its n terminus to facilitate protein secretion) was fused terminally to a fragment coding for the fc domain of mouse igg and ligated into the pcaggs expression vector.
a mutant rbd-fc protein-expressing plasmid was also constructed by site-directed mutagenesis, for which the identified hydrophilic residues involved in cd binding were mutated simultaneously (y f; n a, k a, d a, e a, d a and r a). the expression plasmids were then transfected into hek t cells. the cell culture was collected h after transfection and directly used in the flow cytometric assay. analytical gel filtration. mers-cov rbd, cd and their protein complex were individually prepared and adjusted to the same volume. the samples were then loaded onto a calibrated superdex column (ge healthcare). the chromatographs were recorded and overlaid onto each other. the pooled proteins were analysed on a % sds-page gel and stained with coomassie blue. surface plasmon resonance assay. the biacore experiments were carried out at room temperature ( uc) using a biacore machine with cm chips (ge healthcare). for all the measurements, an hbs-ep buffer consisting of mm hepes, ph . , mm nacl, mm edta and . % (v/v) tween- was used, and all proteins were exchanged to the same buffer in advance via gel filtration. the mers-cov rbd and sars-cov rbd proteins were immobilized on the chip at about response units. gradient concentrations of human cd ( , , , , , , , , and , nm) or human ace ( , , , , , , , and , nm) were then used to flow over the chip surface. after each cycle, the sensor surface was regenerated via a short treatment using mm naoh. the binding kinetics were analysed with the software biaevaluation version . using the : langmuir binding model. flow cytometric assay. for the surface expression of cd , the full-length coding sequence was cloned into the pegfp-c vector which yields a plasmid encoding a recombinant cd protein with an egfp-tag fused to its n terminus. the plasmid was transfected into the cd -negative bhk cells using lipo (invitrogen) according to the manufacturer's instructions. 
the cells were collected h after transfection. for staining, the mock-transfected bhk cells or the cells transfected with the cd -expressing plasmid were suspended in pbs and incubated with the individual fc-fusion protein culture or goat anti-cd igg (r&d systems) at room temperature for h. the cells were then washed and further incubated at room temperature for about . h with anti-mouse or anti-goat secondary igg antibodies (r&d systems). after washing, the cells were analysed by flow cytometry with a bd facscalibur machine. the cells incubated only with the secondary antibodies were used as the negative controls. crystallization. all the crystals were obtained by the vapour-diffusion sitting-drop method with ml protein mixing with ml reservoir solution and then equilibrating against ml reservoir solution at °c. the initial crystallization screenings were carried out using the commercially available kits. the conditions that yielded crystals were then optimized. diffraction-quality crystals of the free rbd protein were finally obtained in a condition consisting of . m ammonium tartrate dibasic, ph . , and % peg , with a protein concentration of mg ml . derivative crystals were obtained by soaking rbd crystals for h in mother liquor containing mm kaucl n h o. the complex crystals were grown in % (v/v) -propanol, . m sodium acetate ph . and % peg with a protein concentration of mg ml . data collection, integration and structure determination. for data collection, all crystals were flash-cooled in liquid nitrogen after a brief soaking in reservoir solution with the addition of % (v/v) glycerol. the native rbd data set was collected at the high energy accelerator research organization (kek) bl a (wavelength, . Å), whereas the diffraction data for the au derivative crystal (wavelength, . Å) and the complex crystal (wavelength, . Å) were collected at the shanghai synchrotron radiation facility (ssrf) bl u. all data were processed with hkl (ref. ).
additional processing was performed with programs from the ccp suite. the structure of rbd was determined by the single-wavelength anomalous diffraction (sad) method. the au sites were first located by shelxd for the au-sad data. the identified positions were then refined and the phases were calculated with the sad experimental phasing module of phaser. the real space constraints were further applied to the electron density map in dm. the initial model was built with autobuild in the phenix package. additional missing residues were added manually in coot. the final model was refined with phenix.refine in phenix with energy minimization, isotropic adp refinement, and bulk solvent modelling. the complex structure was solved by the molecular replacement module of phaser, with the solved rbd structure and the previously reported cd structure (pdb code, bgr) as the search models. the atomic model was completed with coot and refined with phenix.refine. the stereochemical qualities of the final models were assessed with procheck. the ramachandran plot distributions for the residues in the free rbd structure were . , . and . % for the most favoured, additionally and generously allowed regions, respectively. these values were . , . and . % for the rbd-cd complex structure. data collection and refinement statistics are summarized in supplementary table . all structural figures were generated using pymol (http://www.pymol.org). secondary-structure determination. the secondary structure determination was based on the espript algorithm. key: cord- - n tsk authors: roy, susmita title: dynamical asymmetry exposes -ncov prefusion spike date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: n tsk the novel coronavirus ( -ncov) spike protein is a smart molecular machine that instigates the entry of coronavirus to the host cell causing the covid- pandemic.
in this study, a structural-topology based model hamiltonian of c symmetric trimeric spike is developed to explore its complete conformational energy landscape using molecular dynamic simulations. the study finds -ncov to adopt a unique strategy by undertaking a dynamic conformational asymmetry induced by a few unique inter-chain interactions. this results in two prevalent asymmetric structures of spike where one or two spike heads lifted up undergoing a dynamic transition likely to enhance rapid recognition of the host-cell receptor turning on its high-infectivity. the crucial interactions identified in this study are anticipated to potentially affect the efficacy of therapeutic targets. one sentence summary inter-chain-interaction driven rapid symmetry breaking strategy adopted by the prefusion trimeric spike protein likely to make -ncov highly infective. movement that generates the 'up' and 'down' conformations ( , , ) . other betacoronaviruses, like sars-cov, mers-cov and distantly related alphacoronavirus porcine epidemic diarrhea virus (pedv) also have this apparently stochastic rbd movement ( , ) . the combination of rbd up-down rearrangement may lead each s -head of the trimeric prefusion spike protein of coronavirus to adopt different possible conformations: (i) down, (ii) up- down, (iii) up- down, and (iv) up (fig. c) . among them down, up are symmetric conformers and up- down, up- down are asymmetric conformers. single-particle cryo-electron microscopy (cryo-em) determined few such symmetric and asymmetric structures referred to as the receptorbinding inactive state and receptor-binding active state, respectively ( ) . the asymmetric structure where one of the rbds rotates up was thought to be less stable for sars-cov s ( ) . in comparison, the recent cryo-em study found three rbds in up- down conformation as a predominant arrangement in the prefusion state of -ncov s trimer ( ) . 
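the grouping of head arrangements into symmetric and asymmetric conformers described above can be illustrated with a short enumeration. this is a minimal sketch, assuming each of the three s heads of the trimer is simply either 'up' or 'down'; the function name `classify_trimer_states` is hypothetical and is not part of the study's workflow.

```python
from itertools import product

def classify_trimer_states():
    """Group every up/down arrangement of three heads by its number of 'up' heads.

    Assumption: each head is a two-state unit ('up' or 'down'); real spikes
    sample a continuum of hinge angles.
    """
    classes = {}
    for heads in product(("down", "up"), repeat=3):
        n_up = heads.count("up")
        classes.setdefault(n_up, []).append(heads)
    return classes

classes = classify_trimer_states()
for n_up, members in sorted(classes.items()):
    # All-down and all-up keep the threefold symmetry; mixed states break it.
    kind = "symmetric" if n_up in (0, 3) else "asymmetric"
    print(n_up, "up:", len(members), "arrangement(s),", kind)
```

under this two-state assumption, each asymmetric class contains three equivalent arrangements (one per chain), whereas each symmetric class contains only one.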
this arrangement appears legitimate for sars-cov- s in order to explain the higher affinity of up- down for the ace receptor than that of sars-cov s. however, we cannot rule out the possibility of the up- down conformation as a functional state, which may provide even stronger binding with ace considering the fact that ace is a dimeric receptor ( , ). this hypothesis is consistent with a recent crystallographic study demonstrating that cr , a neutralizing antibody isolated from convalescent sars patients, targets the rbd when at least two rbds on the trimeric spike protein are in the up conformation ( ). given all these experimental results, it is high time to understand the molecular mechanism of s -head coordination of trimeric sars-cov- s and to identify important interactions in regulating spike up-down conformations. a schematic of receptor-bound spike protein including the receptor-binding subunit s and the membrane-fusion subunit s of a coronavirus is shown. b. side and top views of the homo-trimeric structure of sars-cov- spike protein with one rbd of the s subunit head rotated in the up conformation. c. rbd up-down movement expected to lead s heads of the trimeric spike protein to attain the following possible conformers: (i) down (ii) up- down (iii) up- down, and (iv) up. these are an analogue demonstration of the spike protein top-view where ntds are represented by colored ovals, rbds are represented by flexible sticks and s domains are represented by filled circles. a major challenge was simulating the gigantic structure of the full-length trimeric spike, as it is associated with a large-scale conformational transition. it is indeed a daunting task to explore the full conformational landscape at an atomic length-scale. to overcome this, a structure-based coarse-grained molecular dynamics simulation approach has been adopted ( ).
the simulation started with a full-length homo-trimeric spike protein structure generated from homology modeling which involves the alignment of a target sequence and a template structure (pdb: vsb) ( , ) . this also helped to build the missing loops. the domain-specific residuerange for the full-length, trimeric sars-cov- s is given in fig. a . the s head coordination of the trimeric spike is programmed by developing a super-symmetric topology-based modeling framework ( fig. b ) (described in the method pipeline in the supplementary material). with this, the molecular machine is ready to swing each of its s head between its 'up' and 'down' conformations (movie s , s ). a number of cryo-em structures captured the 'up' and 'down' conformations of the rbd domain of spike proteins of other coronaviruses including sars-cov- where the s subunit undergoes a hinge-like conformational movement prerequisite for receptor binding (fig. c) ( , , , ) . apart from the hinge-responsive rbd-cleft interaction, in this study, a few inter-chain interactions are found to assist the 'rbd-up' and the 'rbd-down' conformations (shown in fig. d and e, movie s ). these few interactions are identified to impact the breathing of rbd of sars-cov- s. this makes the early referred 'rbd-up/down' conformations slightly different from the 's -head-up/down' conformation for trimeric sars-cov- s as the former is regulated only by intra-chain interactions while the latter is regulated by both intra and inter-chain interactions (fig. s ). after identifying all these unique intra and inter-chain contacts ( , ) extracted from the corresponding 's -head-up' and 's -head-down' conformations, a super-symmetric contact map is generated. this follows the development of a structure-based model hamiltonian (materials and methods in supplementary) which is based on the energy landscape theory of protein folding ( ) ( ) ( ) ( ) ( ) . 
this approach enables the trimeric spike not only to adopt c symmetric ' up' and ' down' states but also to break the symmetry in a thermodynamically governed way ( fig. s -s ) ( , ). residue-residue native contact map identifying unique intra and inter-chain contact-pairs formed by any single monomer in its s -head up and s -head down states. c. within intra-chain contacts, the unique contacts that drive hinge motion leading to rbd-up and rbd-down states are highlighted in the structure, as well as in the contact map. d. inter-chain unique contacts between rbd and ntd domains upholding the s -head-up state. e. inter-chain unique contacts are responsible for connecting the rbd of chaina with the s -stalk of chainb and the s stalk of chainc. to monitor the transition between the 's -head-up' and the 's -head-down' states for each monomer with the trimeric interactions, a large pool of unbiased long-time trajectories was generated, in which multiple occurrences of up and down states for each monomer have been sampled. we employ a reaction coordinate, q, the fraction of native contacts ( , ) corresponding to the inter-chain contacts associated with the 's -head-up' and the 's -head-down' states. a typical trajectory plot of q extracted from the equilibrium simulation of the trimeric prefusion spike clearly shows the hopping between different conformational states as hypothesized earlier (fig. a). furthermore, the dynamic transitions between the two major asymmetric states ( up- down: q s -head-down ≈ . and up-down: q s -head-down ≈ . ) are evident in the q-trajectory. analysis of all the simulations yields the -d free energy landscape of the trimeric spike protein of sars-cov- ( fig b) with all its possible conformations. the conformations corresponding to the minima of the free energy landscape are shown in fig. c .
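the reaction coordinate q used above can be sketched in a few lines. this is a minimal illustration, assuming one bead per residue and a list of native contact pairs with their native distances; the function name `fraction_native_contacts` and the tolerance factor of 1.2 are assumptions made for this example, not values taken from the study.

```python
import math

def fraction_native_contacts(coords, pairs, native_dist, tol=1.2):
    """Return Q, the fraction of native contacts formed in one snapshot.

    coords      : list of (x, y, z) bead positions
    pairs       : list of (i, j) index pairs that are in contact in the native state
    native_dist : native distance of each pair, same order as `pairs`
    tol         : a pair counts as formed if its distance <= tol * native distance
                  (the value 1.2 is an assumed, commonly used choice)
    """
    formed = 0
    for (i, j), r0 in zip(pairs, native_dist):
        if math.dist(coords[i], coords[j]) <= tol * r0:
            formed += 1
    return formed / len(pairs)

# Toy snapshot: bead 2 has moved away, so one of three contacts is broken.
coords = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 5.0, 0.0), (0.0, 0.0, 1.0)]
pairs = [(0, 1), (0, 2), (0, 3)]
native_dist = [1.0, 1.0, 1.0]
q = fraction_native_contacts(coords, pairs, native_dist)
```

evaluating q separately over the inter-chain contacts of the head-up and the head-down contact sets gives the two coordinates of a two-dimensional landscape of the kind described above.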
the temperature dependence of conformational transition indicates that the configurational entropy and enthalpy compensation results in the enhanced population of the asymmetric up- down to up- down conformations ( fig s ) . while the predominant population of the up- down state is consistent with the recent cryo-em data, ( ) (movie s , s ) the other asymmetric structure ( up- down) emerges as a best binding epitope for cr (an antibody collected from convalescent sars patients) according to a recent antibody recognition study of sars-cov- s ( ). .conformational transition of sars-cov- spike protein in its prefused state. a. the fraction of native contact (q) dynamics counting inter-chains contact-pairs formed in the s head-up state and the s -head-down state. b. a two-dimensional free energy landscape of conformational transition as a function of inter-chain contacts supporting s -head-down (x-axis) and s -head-up state (y-axis) explores all possible conformations. c. the representative structure corresponding to each minimum of the free energy landscape is designated as follows: (i) up, (ii) up- down, (iii) up- down, and (iv) down state (as shown in the one-dimension population distribution plot). a. unique inter-chain interactions formed by rbd of one chain with ntd of the adjacent chain stabilizing the s -head-up conformation in sars-cov- s (pdb: vsb). interchain domain closure is analyzed by inter-chain proline-proline distance measurement. the same distance measured for the following spikes: b. sars-cov spike (pdb: x b) and c. mers-cov spike (pdb: x f). d. rbd up-down hinge dynamics triggered by inter-chain rbd-ntd domain interaction. e. in the absence of rbd-ntd inter-chain interaction, the hinge motion of rbd is hindered by populating more 'rbd-down' conformations and allows to sample 'rbd-up' conformation only rarely in a stochastic manner. in this study, sequence and interaction level (fig. s , fig. 
s ) comparison has been made over the cryo-em structures of sars-cov- s (pdb: vsb), sars-cov s (pdb: x b) and mers-cov s (pdb: x f) ( , ). this comparison shows that sars-cov- s has an ntd-rbd domain association where a proline residue of chaina forms a ch-π type interaction with the tyrosine residue ( ) and a hydrophobic interaction with another proline of chainb (fig. a, fig. s ). inter-chain proline-proline distance measurement shows that the corresponding rbd-ntd domains are far away in the case of sars-cov s (fig. b ) and further away in the case of mers-cov s (fig. c ). this measurement involves their respective cryo-em structures. despite the relatively high degree of sequence similarity between the sars-cov- s and the sars-cov s and also with the spike protein from the bat coronavirus ratg , a single histidine residue at the relevant rbd-ntd domain interface is found to be unique in the case of sars-cov- s (fig. s ) ( ). the imidazole ring of histidine is pointing towards the hydrophobic assembly of the aforesaid proline-tyrosine in the juxtaposition of the rbd-s hinge region. such an inter-chain rbd-ntd connection is thus found to impact the rbd hinge interaction by upregulating the rbd-up conformation (fig. d ). in the absence of such inter-chain interaction, the rbd mostly stays in the down conformation, allowing rbd to break the symmetry only rarely in a stochastic manner (fig. e). the absence of inter-chain rbd-ntd connection also appears to impact the sars-cov rbd hinge interaction. here, the opening of the rbd-s cleft is significantly less than that of sars-cov- s in their respective s -head-up state (fig. s ). the assistance from the inter-chain rbd-s -stalk related interfacial contacts is also found to modulate the population dynamics of the rbd-down conformation (fig. s ).
the influence of this inter-chain rbd-s -stalk interaction has also been observed in an earlier cryo-em analysis where two proline mutations at the top of the s stalk (inferring rbd-s inter-chain connection) helped to stabilize the 'up' conformers of sars-cov s ( ). the synergy between internal rbd-hinge interactions and inter-chain interactions allows trimeric sars-cov- s to adopt a dynamical feature distinct from that of other coronavirus spikes. it appears that the rapid symmetry-breaking strategy driven by inter-chain interactions potentiates this spike machine to turn on its high infectivity. the energy landscape framework used in this study indeed helps to unify and compare different spike protein interactions present in other coronaviruses. while developing diagnostics and antiviral therapies is of utmost priority in the current situation, the information derived from the present structure-based model at the microscopic interaction level might provide deep insight for designing effective decoys or antibodies to fight against -ncov infection. movies s to s method pipeline of building a super-symmetric contact map of sars-cov- prefusion spike protein. coarse-grained structure-based simulations have been performed for full-length trimeric sars-cov- spike protein. the structure-based hamiltonians for different simulations were derived after processing the recent cryo-em structure (pdb: vsb) through the swiss model to complete missing loops present in the structure ( , ). this generates a homo-trimeric sars-cov- spike where this initial structure has important components in terms of intra and inter-chain contacts (interaction) leading to an 's -head-up' and an 's -head-down' conformation for each protomer. in this prevalent trimeric variant, only one monomer adopts the 's -head-up' conformation while the other two adopt the 's -head-down' conformation.
few characteristic intra-chain contacts cause the receptor-binding domain to perform a hinge-motion resulting 'rbd-up' and 'rbd- conformations driven by intra c as defined in the pipeline method. contact calculation is performed using the shadow criterion ( ) . interesting components are inter-chain contacts residing at the interface of the dimer. now, two categories of interactive dimeric interfaces are there: asymmetric-dimer interface and symmetricdimer interface. chaina (s -head-up) and the adjacent chainb (s -head-down) represent an asymmetric dimer unit. similarly, chainb (s -head-down) and the adjacent chainc (s -headdown) represent a symmetric dimer unit. at the asymmetric-dimer interface, the rbd-domain of chaina forms a few unique contacts with the ntd domain of the adjacent chainb as shown in has been cycled over all the interfaces making each of interfaces dynamically capable of inducing s -head movement. developing a structure-based hamiltonian of trimeric spike protein simulation: a structure-based hamiltonian of trimeric spike protein for sar-cov is derived using the super-symmetric contact map. in the current structure-based model amino acids are represented by single beads at the location of the c-α atom ( , , , ) . the coarse-grained structurebased model, a well-established model, comprehends a novel way to investigate the mechanisms associated with protein folding and function ( - , , - ) . in the current context of decoding virus entry mechanism, this model successfully characterized class-i viral fusion protein dynamics including conformational rearrangement of a viral surface glycoprotein, influenza hemagglutinin (ha) during its prefusion and postfusion states ( , ) . 
as described in the pipeline method, the complete hamiltonian comprises of two terms: and, intra up down shared the first non-local term of the hamiltonian used in a/b/c intra h represents non-bonded interaction potential in the form of - lennard-jones potential that is used to describe the interactions that stabilize the native contacts ( ) . a native contact is defined for a pair of residues (i and j) present in the native state using shadow criteria and when (i−j)> . Δ ij is defined in such a way that if any i and j residues belong to intra c , Δ ij = turning on - lennard-jones potential; otherwise Δ ij = . for all non-native pairs for which Δ ij = , a repulsive potential with σ = Å is used. all the interaction coefficients used in this potential are given in table s . as described in the method pipeline, inter h will include only the non-local inter-chain contacts residing at the interface of the dimer which comprises of accounting for asymmetric-dimer similar to our early approach, Δ ij is such defined that if any i and j residues belong to inter c , Δ ij = , turning on - lennard-jones potential; otherwise Δ ij = . here, inter to begin every simulation an initial structure is energetically minimized under the structure-based hamiltonian using the steepest descent algorithm. atomic coordinates of the energy minimized structure have been evolved using langevin dynamics with a time step of . r  . we used an underdamped condition for rapid sampling ( ) . for explicit particles, reduced mass of r  and a drag coefficient all temperatures mentioned here are in reduced units. temperature dependence of the conformational transition has been performed over several temperatures. three representative reduced temperature-dependent (t*= . t r , t*= . t r, and t*= . t r ) analyses are shown for clarity in fig. s . population distribution as a function of the fraction of native inter-chain contacts formed in the s -head-down state is monitored over these temperatures. 
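the role of the switch Δ ij in the hamiltonian described above can be sketched as follows. this is a minimal illustration, not the study's implementation: the exponents of the lennard-jones term are not legible in the text, so the 10-12 form commonly used in c-α structure-based models is assumed here, and the names `contact_energy`, `repulsive_energy`, `pair_energy`, `sigma_ij`, `sigma_nn` and `epsilon` are hypothetical.

```python
def contact_energy(r, sigma_ij, epsilon=1.0):
    """Attractive native-contact term (ASSUMED 10-12 form).

    Has its minimum, -epsilon, at r = sigma_ij (the native pair distance).
    """
    x = sigma_ij / r
    return epsilon * (5.0 * x**12 - 6.0 * x**10)

def repulsive_energy(r, sigma_nn, epsilon=1.0):
    """Purely repulsive excluded-volume term for non-native pairs."""
    return epsilon * (sigma_nn / r) ** 12

def pair_energy(r, delta_ij, sigma_ij, sigma_nn, epsilon=1.0):
    """Select the pair term according to the contact-map switch delta_ij.

    delta_ij truthy  -> pair belongs to the contact map, attractive term is on
    delta_ij falsy   -> non-native pair, only the repulsive term acts
    """
    if delta_ij:
        return contact_energy(r, sigma_ij, epsilon)
    return repulsive_energy(r, sigma_nn, epsilon)
```

in this sketch the same function serves both the intra-chain and the inter-chain term; only the contact set that decides the value of the switch differs.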
four states emerge as indicated in fig. b and fig. s . as the temperature increases, the population shifts more towards the s -head-up state. at t*= . t r the up- down state appears as the predominant population in the conformational landscape, which correlates well with the recent cryo-em data ( ). all our simulations were performed at this selected temperature. the rmsd analyses confirm the correct progress of the simulation and the emergence of the correct structure (fig. s ). the shift of the population towards the s -head-up conformations with increasing temperature suggests that the s -head-up states are more dynamic and entropically stable. note that the dynamical transition between up- down and up- down states may tolerate a wide range of temperatures by a population shift mechanism. so far, we have found that it tolerates the temperature range from t*= . t r to t*= . t r . temperature dependence of rbd hinge motion has also been studied (fig. s ). the population distribution as a function of the fraction of native intra-chain hinge-region contacts formed by the rbd at different temperatures has been monitored. a bimodal distribution reflects the population of the 'rbd-up' and 'rbd-down' states for any individual chain in the trimeric spike. as temperature increases, the rbd-up states become more populated. free energy calculation: in a system, if a state "a" described by its reaction coordinate, $x_a$ (which in our case is the fraction of native contacts), is separated from another state "b" described by its reaction coordinate, $x_b$, by a finite barrier, the free energy of transition from a to b can be expressed as $\Delta F_{a \to b} = -k_{B} T \ln \left[ p(x_b) / p(x_a) \right]$, where $p(x_b)$ is the probability of finding the system in state b at the reaction coordinate $x_b$, $k_{B}$ is the boltzmann constant and $T$ is the temperature. the same holds for $p(x_a)$. from a finite set of unbiased simulations of trimeric spike protein, a complete thermodynamic description is obtained.
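the relation between state populations and free energy described above can be sketched numerically. this is a minimal illustration, assuming the standard boltzmann inversion of a sampled probability distribution with energies expressed in units of the thermal energy; the function names `free_energy_profile` and `delta_f`, the bin width and the toy samples are all hypothetical.

```python
import math
from collections import Counter

def free_energy_profile(samples, bin_width, kT=1.0):
    """Histogram sampled reaction-coordinate values and invert to free energy.

    Returns {bin_left_edge: F}, with F = -kT * ln(p / p_max), so the most
    populated bin sits at F = 0.
    """
    counts = Counter(math.floor(x / bin_width) for x in samples)
    total = sum(counts.values())
    p_max = max(counts.values()) / total
    return {
        b * bin_width: -kT * math.log((c / total) / p_max)
        for b, c in sorted(counts.items())
    }

def delta_f(p_a, p_b, kT=1.0):
    """Free energy of transition from state a to state b from their populations."""
    return -kT * math.log(p_b / p_a)

# Toy data: state a (q near 0.25) is visited three times as often as state b.
samples = [0.25, 0.25, 0.25, 0.75]
profile = free_energy_profile(samples, bin_width=0.5)
```

with these toy samples the less populated state lies ln 3 (in units of kT) above the more populated one, which is the same value `delta_f(0.75, 0.25)` returns.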
probability distributions are obtained by sampling the configurational space running molecular dynamics simulation sets. fig. s : inter-chain interaction from the 's -head-up' and the 's -head-down' states of sars-cov- spike. a. inter-chain rbd-ntd domain closure in the s -head-up state. the domain closure is mediated by double hydrogen bonds connecting arg of chaina with asn and cys residue of chainb. b. inter-chain rbd-s domain closure in the s -head-down state. the s stalk connection with rbd is mediated by a proline residue of chaina with the formation of a ch-п type interaction with tyrosine and hydrophobic interaction with another proline of chainb. fig. s : the structural alignment of two chains in the s -head-down state. chainb (orange) and chainc (green) in the s -head-down state extracted from the cryo-em structure (pdb: vsb) of trimeric spike. low rmsd between these two chains suggests that contact information extracted from any of these chains will be equivalent. this supports our contact map generation shown in the method pipeline. fig. s . rms deviation of each chain from their initial state during a typical simulation progress. a. the initial state of chain a in the trimeric spike was in 's -head-up' state and chain b/c was in 's -head-down' state. b. the lower rmsd for chain a corresponds to chain a's head-up state. c. the lower rmsd for chain b corresponds to chain b 's head-down state. d. the lower rmsd for chain c corresponds to chain c's head-down state. the rmsd analyses ensure the correctness of the simulation progress and the emergence of the correct structure. fig. s . temperature dependence of s -head up-down transition and rbd open-close breathing transition. a. population distribution as a function of the of native inter-chain contacts formed in the s -head-down state as shown in fig. s . four states emerge as shown in fig. b . 
as the temperature increases, the population shifts toward the s -head-up conformations, indicating that the s -head-up states are more dynamic and entropically stable. note that the dynamical transition between up- down and up- down states may tolerate a wide range of temperatures through a population shift mechanism. b. population distribution as a function of the fraction of native intra-chain hinge-region contacts formed by the rbd. a bimodal distribution reflects the 'rbd-up' and the 'rbd-down' states for any individual chain in the trimeric spike. as the temperature increases, the rbd-up state becomes more populated. the temperature analysis helps to choose an intermediate temperature that yields the correct population distribution.
fig. s . sequence alignment of the sars-cov- spike (pdb: vsb) with the sars-cov spike (pdb: x b), the mers-cov spike (pdb: x f) and the ratg spike. only the rbd is highlighted in green. the unique histidine residue (highlighted in yellow) of the rbd of sars-cov- is noted. identical residues are denoted by an "*" beneath the consensus position.
fig. s . the opening of the rbd-s cleft in the 's -head-up' state of sars-cov- s differs from that of sars-cov s. the opening is measured by a characteristic distance between a serine and a proline residue at the two edges of the cleft. for sars-cov- s the distance is . nm, while for sars-cov s it is . nm. it appears that the inter-chain rbd-ntd connection significantly influences the sars-cov s rbd hinge motion, with the cleft opening supported by those inter-chain interactions.
fig. s . the free energy landscape in the presence and absence of inter-chain rbd-s contacts. a. in the presence of inter-chain rbd-s contacts, the up- down state is more populated than the up- down state. b. in the absence of inter-chain rbd-s contacts, the population shifts from the up- down state to the up- down state.
movie s : conformational dynamics of full-length trimeric sars-cov- spike protein showing rapid symmetry breaking. movie s :conformational dynamics of full-length trimeric sars-cov- spike protein showing rapid symmetry breaking. in this movie the ntd domains are not shown for better demonstration of the rbd movement. movie s : conformational dynamics of a monomer of the full-length sars-cov- showing rbd hinge motion. and notes: . in microbial evolution and co-adaptation: a tribute to the life and scientific legacies of joshua lederberg: workshop summary plagues and peoples sars-cov- : an emerging coronavirus that causes a global threat evolution of the novel coronavirus from the ongoing wuhan outbreak and modeling of its spike protein for risk of human transmission identifying sars-cov- related coronaviruses in malayan pangolins structure, function, and evolution of coronavirus spike proteins cryo-em structure of the -ncov spike in the prefusion conformation structure, function, and antigenicity of the sars-cov- spike glycoprotein structural basis for the recognition of sars-cov- by full-length human ace cryo-electron microscopy structures of the sars-cov spike glycoprotein reveal a prerequisite conformational state for receptor binding the . -angstrom cryo-electron microscopy structure of the porcine epidemic diarrhea virus spike protein in the prefusion conformation cryo-em structures of mers-cov and sars-cov spike glycoproteins reveal the dynamic receptor binding domains sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor a highly conserved cryptic epitope in the receptor-binding domains of sars-cov- and sars-cov topological and energetic factors: what determines the structural details of the transition state ensemble and "en-route" intermediates for protein folding? 
an investigation for small globular proteins swiss-model: homology modelling of protein structures and complexes the human coronavirus hcov- e s-protein structure and receptor binding smog : a versatile software package for generating structure-based models the shadow map: a general contact definition for capturing the dynamics of biomolecular folding and function. the journal of physical chemistry protein folding funnels: a kinetic approach to the sequence-structure relationship symmetry and the energy landscapes of biomolecules chemical physics of protein folding levinthal's paradox funnels, pathways, and the energy landscape of protein folding: a synthesis the origin of minus-end directionality and mechanochemistry of ncd motors order and disorder control the functional rearrangement of influenza hemagglutinin landscape approaches for determining the ensemble of folding transition states: success and failure hinge on the degree of frustration pi-interactions in proteins the embl-ebi search and sequence analysis tools apis in stabilized coronavirus spikes are resistant to conformational changes induced by receptor recognition or proteolysis role of aaa domain in allosteric communication of dynein motor proteins intersubunit assisted folding of dna binding domains in dimeric catabolite activator protein.
the journal of physical chemistry online service), computational modeling of biological systems : from molecules to pathways chemical physics of protein folding from levinthal to pathways to funnels from structure to function: the convergence of structure based models and co-evolutionary information protein folding mechanisms and the multidimensional folding funnel navigating the folding routes rotation-activated and cooperative zipping characterize class i viral fusion protein dynamics the nature of folded states of globular proteins strain mediated adaptation is key for myosin mechanochemistry: discovering general rules for motor activity key: cord- - w xgt authors: kirchdoerfer, robert n.; wang, nianshuang; pallesen, jesper; wrapp, daniel; turner, hannah l.; cottrell, christopher a.; corbett, kizzmekia s.; graham, barney s.; mclellan, jason s.; ward, andrew b. title: receptor binding and proteolysis do not induce large conformational changes in the sars-cov spike date: - - journal: biorxiv doi: . / sha: doc_id: cord_uid: w xgt severe acute respiratory syndrome coronavirus (sars-cov) emerged in as a highly transmissible pathogenic human betacoronavirus. the viral spike glycoprotein (s) utilizes angiotensin-converting enzyme (ace ) as a host protein receptor and mediates fusion of the viral and host membranes, making s essential to viral entry into host cells and host species tropism. as sars-cov enters host cells, the viral s undergoes two proteolytic cleavages at s /s and s ’ sites necessary for efficient membrane fusion. here, we present a cryo-em analysis of the trimeric sars-cov s interactions with ace and of the trypsin-cleaved s. surprisingly, neither binding to ace nor cleavage by trypsin at the s /s cleavage site impart large conformational changes within s or expose the secondary cleavage site, s ’. these observations suggest that s ’ cleavage does not occur in the s prefusion conformation and that additional triggers may be required. 
severe acute respiratory syndrome coronavirus (sars-cov) emerged in humans in and rapidly spread globally causing , cases and associated deaths in countries through july . sars-cov reappeared in a second smaller outbreak in , but has since disappeared from human circulation. however, closely related coronaviruses, such as wiv , currently circulate in bat reservoirs and are capable of utilizing human receptors to enter cells . the more recent emergence of middle east respiratory syndrome coronavirus (mers-cov) and the likelihood of future zoonotic transmission of novel coronaviruses to humans from animal reservoirs make understanding the coronavirus infection cycle of great importance to human health. coronaviruses are enveloped viruses possessing large, trimeric spike glycoproteins (s) required for the recognition of host receptors for many coronaviruses as well as the fusion of viral and host cell membranes for viral entry into cells . during viral egress from infected host cells, some coronavirus s proteins are cleaved into s and s subunits. the s subunit is responsible for host-receptor binding while the s subunit contains the membrane-fusion machinery.
during viral entry, the s subunit binds host receptors in an interaction thought to expose a secondary cleavage site within s (s ′) adjacent to the fusion peptide for cleavage by host proteases - . this s ′ proteolysis has been hypothesized to facilitate insertion of the fusion peptide into host membranes after the first heptad repeat region (hr ) of the s subunit rearranges into an extended α-helix - . subsequent conformational changes in the second heptad repeat region (hr ) of s form a six-helix bundle with hr , fusing the viral and host membranes and allowing for release of the viral genome into host cells. coronavirus s is also the target of neutralizing antibodies , making an understanding of s structure and conformational transitions pertinent for investigating s antigenic surfaces and designing vaccines. the sars-cov s subunit is composed of two distinct domains: an n-terminal domain (s ntd) and a receptor-binding domain (s rbd), also referred to as the s ctd or domain b. each of these domains has been implicated in binding to host receptors, depending on the coronavirus in question. however, most coronaviruses are not known to utilize both the s ntd and s rbd for viral entry . sars-cov makes use of its s rbd to bind to the human angiotensin-converting enzyme (ace ) as its host receptor , . recent examination using cryo-electron microscopy (cryo-em) has illuminated the prefusion structures of coronavirus spikes . initial examination of hcov-hku s showed that the receptor-binding site on the s rbd was occluded when the rbd was in a 'down' conformation, and it was hypothesized that conformational changes were required to access this site . subsequent studies of the highly pathogenic human coronavirus s proteins of sars-cov , and mers-cov , showed that these viral s rbds do indeed sample an 'up' conformation where the receptor-binding site is accessible.
these structural studies also located the positions of the s /s and s ´ cleavage sites on the prefusion spike. the s /s site lies within a surface exposed loop in the second subdomain of s . however, the s ´ site lies closer to base of the spike and though this region is located on the surface of the spike, cleavage at this site is prevented by surrounding protein elements . to examine the hypothesized conformational transitions induced by proteolysis and receptor binding, we used single-particle cryo-em to determine structures of s in uncleaved, s /s cleaved and ace -bound states. three-dimensional classification of the s rbd positions and corresponding atomic protein models revealed that neither ace -binding nor trypsin cleavage at the s /s boundary induced substantial conformational changes in the cov may use a distinct mechanism of fp membrane insertion. as observed in the previous sars-cov and mers-cov s structures , , the trimeric s adopts two distinct conformations related to each of the s rbd. the 'down' conformation caps the s helices and makes extensive contacts with the s ntd. the 'up' conformation of the s rbd exposes the s rbd receptor-binding site. it has been previously reported that for wild- type sars-cov s, % of the particles contained three 'down' rbd conformations while % contained a single 'up' s rbd conformation . to examine the conformation of the s rbd among our sars-cov s p ectodomains, we used a local masking and -d sorting strategy to more accurately classify the conformations as being either 'down' or 'up' at each of the three s rbd positions within the trimer. this analysis revealed that the majority of the s p proteins were in the single-'up' conformation ( %) with lesser amounts of double-and triple-'up' conformations ( % and % respectively) and with no all-'down' conformation observed. 
the increased propensity to adopt the 'up' s rbd conformation may indicate a difference in the coronavirus s containing the p mutations, however other differences in sample preparation cannot be ruled out. ace and s c-terminal domains to examine the structure of sars-cov s bound to its receptor, ace , we combined sars-cov s p ectodomain with an excess of soluble human ace with subsequent purification by size-exclusion chromatography and immediate cryo-em specimen preparation. initial sorting of particle heterogeneity indicated spikes could be split into ace -bound ( %) and unbound ( %) classes. using a similar masking and -d sorting strategy as above we sorted the unbound s class further into classes with s conformations of one or two 'up' s rbds (fig and supplementary tables - and supplementary fig. - ) . we did not observe an all-'down' class nor a three 'up' s rbd class indicating a low prevalence of these conformations among the unbound spikes. expanding our d sorting strategy, we classified our ace -bound particles at each s rbd position and identified single, double and triple ace -bound s. we were further able to identify s rbd conformations at the non-ace occupied rbd positions to represent each population of s rbd conformations among ace -bound s. as hypothesized by previous structural work - , , the s rbd recognizes ace with an 'up' s rbd conformation. the proportion of total 'up' s rbd conformations within the ace -bound and -unbound classes is nearly identical within this dataset ( % 'up' s rbd), similar to the proportion of total 'up' s rbd in the sars s p ectodomain dataset ( %). this strongly suggests that binding of a single ace receptor does not induce adjacent s rbds to transition from a 'down' to 'up' conformation. hence, ace is more likely to bind to an already 'up' s rbd rather than inducing the conformational changes that are required for the s rbd to become accessible to ace . 
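the bookkeeping behind this comparison can be sketched in a few lines of python. the sketch assumes that the focused classification has already assigned an 'up' or 'down' label to each of the three s rbd positions of every particle. it then reports the share of particles in each trimer class (zero to three 'up' rbds) and the proportion of all rbd positions that are 'up'. the function name and the data layout are hypothetical and are not part of the relion workflow used in the study.

```python
from collections import Counter
from typing import Dict, Iterable, Tuple

# One label per RBD position within a trimer: "up" or "down".
Trimer = Tuple[str, str, str]


def tally_trimers(particles: Iterable[Trimer]) -> Tuple[Dict[int, float], float]:
    """Return (fraction of particles with 0, 1, 2 or 3 'up' RBDs,
    fraction of all RBD positions that are 'up')."""
    counts: Counter = Counter()
    total = 0
    for labels in particles:
        if len(labels) != 3 or any(label not in ("up", "down") for label in labels):
            raise ValueError(f"expected three 'up'/'down' labels, got {labels!r}")
        counts[sum(label == "up" for label in labels)] += 1
        total += 1
    if total == 0:
        raise ValueError("no particles provided")
    class_fractions = {n: counts[n] / total for n in range(4)}
    up_fraction = sum(n * counts[n] for n in range(4)) / (3 * total)
    return class_fractions, up_fraction


if __name__ == "__main__":
    # Toy data set, not values from the study.
    toy = (
        [("up", "down", "down")] * 6
        + [("up", "up", "down")] * 3
        + [("up", "up", "up")] * 1
    )
    fractions, up = tally_trimers(toy)
    print(fractions, up)
```

comparing the second return value between the receptor-bound and unbound particle sets is the kind of check described above: if receptor binding drove neighbouring rbds into the 'up' conformation, the bound set would show a clearly higher proportion.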
it is noteworthy that despite prolonged co-incubation and an excess of ace , we had difficulties in saturating the s rbd with ace in the context of the trimeric s ectodomain. this poor saturation is illustrated by the small proportion of triple-bound ace and the majority of spikes that are unbound by receptor. in contrast, isolated recombinant s rbd easily binds ace and is capable of saturating ace on target cells to block s-mediated entry . our observed sub-stoichiometric ace binding to trimeric spikes is consistent with the difficulty in using soluble ace receptor to neutralize sars-cov s pseudotyped onto vsv . the reduced binding of ace to trimeric spikes is likely due to the incomplete exposure and conformational flexibility of the s rbd. incomplete neutralization with soluble receptor was not encountered for mhv, which binds ceacam a via its s ntd, a domain that does not undergo conformational changes , . similar to recently published mers-cov s structures , the ace -bound rbd adopts a much more extended and rotated conformation compared to the s rbd modeled in previous sars-cov s structures . this difference is likely due to poor density in the hinge regions between the s rbd and subdomain (sd- ) in these previous reconstructions , rather than the presentation of a unique receptor-bound conformation. indeed, the bound ace receptor and s rbd in all reconstructions here show poorer density quality than the less mobile regions of the sars-cov s (fig ) . to improve the density for the ace -bound s rbd, we used focused refinement on this region to overcome the flexibility of these domains relative to the rest of s. this yielded a . Å resolution reconstruction with improved local density quality (fig b and c) . we successfully placed the crystal structure of the sars-cov s rbd bound to ace ( ajf.pdb ) into this density as a rigid body, indicating that the previously determined crystal structure accurately recapitulates the conformation between the ace -bound s rbd in the trimeric spike.
the ace -bound s rbd extends upwards and rotates away from contacts with nearby amino acids. hence, any conformational changes induced by receptor binding to the s rbd are more likely to be caused by the absence of the s rbd contacts in the 'up' conformation, rather than the formation of additional contacts (supplemental figure ) . this model provides a flexible mechanism for how different coronavirus spikes can bind to different protein receptors with their s rbd and facilitate fusion with host cells. moreover, movements of the s rbd to the 'up' fig. ). nearing the end of the time course, additional lower molecular weight bands are observed, which we interpret to be degradation of the s subunit. regardless of which construct was used or whether ace was bound to the s ectodomains, there is no prominent band that corresponds to a s ′ cleavage product (approximately kda). to analyze the cleavage products in detail, we performed cryo-em analysis on the trypsin-cleaved sars-cov s p ectodomain. using all particles and c symmetry yielded a reconstruction at . Å resolution (fig. , supplementary tables and and supplementary fig. ). the short loop containing the s /s cleavage site is disordered in the uncleaved spike reconstruction and remains disordered in the trypsin-cleaved reconstruction. moreover, examination of the structure models indicates no significant differences between the trypsin-cleaved and uncleaved sars-cov s (fig. b) . fine sorting of s rbd positions of the trypsin-cleaved s reveals a very similar distribution of 'up' s rbd conformations available for receptor binding as in the uncleaved samples, although we additionally observe a small proportion of s rbd in the all-'down' conformation (fig. c) . these results indicate that trypsin cleavage at s /s does not impart large conformational changes on the sars-cov s and justify the removal of s /s cleavage sites for the production of more homogeneous material as vaccine immunogens.
this suggests that although cleavage at s /s may remove an obstacle for conformational changes leading to fusion, s /s cleavage alone does not produce significant conformational changes. terminal helix of s hr (fig ) . exposure of this site for cleavage may require remodeling of this penultimate loop or hr beyond the conformation observed in the prefusion state. we hypothesize that additional triggers beyond cleavage at the s /s site or protein-receptor binding are needed to transition the spike from its prefusion state to a yet to be observed intermediate. changes and that the s ′ proteolysis does not occur in the s prefusion state (fig. ) . this grids were loaded onto a titan krios and data was collected using leginon at a total dose of e -/Å . frames were aligned with motioncor (ucsf) implemented in the appion workflow . particles were selected using dog picker . images were assessed and particle picks were masked using em hole punch . the ctf for each image was estimated using gctf . electron microscopy data processing initial particle stacks were cleaned using multiple rounds of d classification in relion . good particles were selected as resembling prefusion coronavirus spikes. for the sars s p and trypsin-treated sars s p, all particles from the clean stacks were used for reconstruction with c symmetry. all datasets were extensively sorted using d classification to examine heterogeneity in the s rbds as described previously . briefly, d masks were defined to encompass the possible heterogeneity at each s rbd position. the density within these masks was then removed from unfiltered, unsharpened reconstructions. we then used relion_project with image subtraction to create a particle stack containing only the signal arising from the masked density. finally, we used focused d classification to identify compositional and conformational states at each s rbd position. 
all d reconstructions were produced with relion and final refinements were performed with a six-pixel soft-edge solvent mask. post- processing was applied to each reconstruction to apply b-factor sharpening and amplitude corrections as well as to calculate local resolution maps. coordinate models were built for several of the high-resolution reconstructions using i .pdb , ajf.pdb and x s.pdb as template models with reference to a recently sars and mers: recent insights into emerging coronaviruses proceedings of the national academy of sciences of the united states of america mechanisms of coronavirus cell entry mediated by the viral spike protein two-step conformational changes in a coronavirus envelope glycoprotein mediated by receptor binding and proteolysis receptor-bound porcine epidemic diarrhea virus spike protein cleaved by trypsin induces membrane fusion proteolytic processing of middle east respiratory syndrome coronavirus spikes expands virus tropism inhibitors of cathepsin l prevent severe acute respiratory syndrome coronavirus entry the coronavirus spike protein is a class i virus fusion protein: structural and functional characterization of the fusion core complex structure of influenza haemagglutinin at the ph of membrane fusion tectonic conformational changes of a coronavirus spike glycoprotein promote membrane fusion sars immunity and vaccination recombination, reservoirs, and the modular spike: mechanisms of coronavirus cross-species transmission angiotensin-converting enzyme is a functional receptor for the sars coronavirus a -amino acid fragment of the sars coronavirus s protein efficiently binds angiotensin-converting enzyme . 
the cryo-electron microscopy structures of the sars-cov spike glycoprotein reveal a prerequisite conformational state for receptor binding pre-fusion structure of a human coronavirus spike protein immunogenicity and structures of a rationally designed prefusion mers- proceedings of the national academy of sciences of the united states cryo-em structure of porcine delta coronavirus spike protein in the pre- fusion state cryo-electron microscopy structure of a coronavirus spike glycoprotein trimer glycan shield and epitope masking of a coronavirus spike protein observed by cryo-electron microscopy glycan shield and fusion activation of a deltacoronavirus spike glycoprotein fine-tuned for enteric infections cryo-em structures of mers-cov and sars-cov spike glycoproteins reveal the dynamic receptor binding domains structure of sars coronavirus spike receptor- binding domain complexed with receptor peptide forms an extended bipartite fusion platform that perturbs membrane order in a calcium-dependent manner vesicular stomatitis virus pseudotyped with severe acute respiratory syndrome coronavirus spike protein n-terminal domain of the murine coronavirus receptor ceacam is responsible for fusogenic activation and conformational changes of the spike protein activation of the sars coronavirus spike protein via sequential proteolytic cleavage at two distinct sites protease-mediated enhancement of severe acute respiratory syndrome coronavirus infection physiological and molecular triggers for sars-cov membrane fusion and entry into host cells host cell proteases: critical determinants of coronavirus tropism and pathogenesis discovery of a rich gene pool of bat sars-related coronaviruses provides new insights into the origin of sars coronavirus automated molecular microscopy: the new leginon system motioncor : anisotropic correction of beam-induced motion for improved cryo-electron microscopy appion: an integrated, database-driven pipeline to facilitate em image processing dog 
picker and tiltpicker: software tools to facilitate particle selection in single particle electron microscopy emhp: an accurate automated hole masking algorithm for single-particle cryo-em image processing real-time ctf determination and correction accelerated cryo-em structure determination with parallelisation using gpus in relion- coot: model-building tools for molecular graphics atomic-accuracy models from . -a cryo-electron microscopy data with density-guided iterative local refinement phenix: a comprehensive python-based system for macromolecular structure solution computational resources for electron microscopy at the scripps research institute are supported by nih grant od processed electron microscopy data. r.n.k and c.a.c. built and refined atomic models we gratefully acknowledge travis nieusma, charles bowman, jean-christophe ducom and bill anderson for microscopy and computational support. we also thank lauren holden for a critical reading of this manuscript. this work was supported by grants from nih/niaid to a.b.w and key: cord- -ds x yym authors: kim, young-seok; son, ahyun; kim, jihoon; kwon, soon bin; kim, myung hee; kim, paul; kim, jieun; byun, young ho; sung, jemin; lee, jinhee; yu, ji eun; park, chan; kim, yeon-sook; cho, nam-hyuk; chang, jun; seong, baik l. title: chaperna-mediated assembly of ferritin-based middle east respiratory syndrome-coronavirus nanoparticles date: - - journal: front immunol doi: . /fimmu. . sha: doc_id: cord_uid: ds x yym the folding of monomeric antigens and their subsequent assembly into higher ordered structures are crucial for robust and effective production of nanoparticle (np) vaccines in a timely and reproducible manner. despite significant advances in in silico design and structure-based assembly, most engineered nps are refractory to soluble expression and fail to assemble as designed, presenting major challenges in the manufacturing process. 
the failure is due to a lack of understanding of the kinetic pathways and enabling technical platforms to ensure successful folding of the monomer antigens into regular assemblages. capitalizing on a novel function of rna as a molecular chaperone (chaperna: chaperone + rna), we provide a robust protein-folding vehicle that may be implemented to np assembly in bacterial hosts. the receptor-binding domain (rbd) of middle east respiratory syndrome-coronavirus (mers-cov) was fused with the rna-interaction domain (rid) and bacterioferritin, and expressed in escherichia coli in a soluble form. site-specific proteolytic removal of the rid prompted the assemblage of monomers into nps, which was confirmed by electron microscopy and dynamic light scattering. the mutations that affected the rna binding to rbd significantly increased the soluble aggregation into amorphous structures, reducing the overall yield of nps of a defined size. this underscored the rna-antigen interactions during np assembly. the sera after mouse immunization effectively interfered with the binding of mers-cov rbd to the cellular receptor hdpp . the results suggest that rna-binding controls the overall kinetic network of the antigen folding pathway in favor of enhanced assemblage of nps into highly regular and immunologically relevant conformations. the concentration of the ion fe( +), salt, and fusion linker also contributed to the assembly in vitro, and the stability of the nps. the kinetic “pace-keeping” role of chaperna in the super molecular assembly of antigen monomers holds promise for the development and delivery of nps and virus-like particles as recombinant vaccines and for serological detection of viral infections. introduction various types of viral vaccines have been developed over the last century with a wide spectrum of efficacy and safety ( , ) . 
the manufacturing of most conventional vaccines (live attenuated, inactivated, or subunit vaccines) invariably requires the culturing of infectious viruses in cell substrates ( ) . despite dedicated efforts, conventional cell culture often fails to produce sufficient amounts of virus for evaluating the immunogenicity, protective efficacy, and safety of viral vaccines. moreover, some emerging viruses cause high mortality rates, without options for treatment or prophylaxis, necessitating their manipulation and manufacture under stringent biosafety conditions ( ) . not surprisingly, alternative technologies that circumvent these limitations are a high priority in the areas of vaccine development and production. nanoparticles (nps), virus-like particles (vlps), and assemblies of multimeric peptides each provide attractive platforms for vaccine design ( ) . virus-like particles and nps structurally resemble infectious virions, but are non-infectious due to the lack of viral genomes. recombinant surface antigens from natural virions are assembled into highly ordered conformations as empty particles devoid of genetic material. antigenic epitopes are presented on the multivalent and highly repetitive outer structure of the nps, which leads to the crosslinking of b-cell receptors and the induction of long-lasting immune responses ( ) ( ) ( ) . by mimicking the morphology of the natural infectious virions, the regularly assembled particles are highly immunogenic and are amenable to diagnostic and prophylactic exploitation. among the simplest targets are the vlps of non-enveloped viruses, such as hepatitis e virus or human papilloma virus, which are composed purely of viral capsid proteins ( ) ( ) ( ) . in contrast to non-enveloped viruses, where virion assembly is exclusive to capsid proteins, enveloped viruses (e.g., coronavirus or flavivirus) require an additional membrane component for assembly into mature virions.
consequently, in enveloped vlps, the assembly of matrix proteins provides a molecular scaffold, and viral antigens are embedded into lipid membranes. different types of glycoproteins may be embedded in the lipid membrane as target antigens for generating immunological responses ( ) . however, this process requires multiple proteins (surface antigens and matrix proteins), and the enveloped vlps are not structurally uniform and are difficult to characterize. a promising alternative is to present the target antigens on the surfaces of self-assembled nps, which, in lieu of lipid membranes, serve as the macromolecular scaffold for the presentation of the antigens of interest. in developing np vaccines, consideration should be given regarding the selection of a robust and faithful system for np assembly that enables the cost-effective development and delivery of vaccines in a timely manner. structure-based approaches in silico and their underlying principles are relatively advanced for np assembly ( ) ( ) ( ) . most of the approaches consider the thermodynamic stability of the final assembled nps, without due recognition for the kinetic complexities controlling regular assemblage over random interactions that lead to misfolded aggregations. therefore, it is not surprising that most engineered nps are refractory to soluble expression, which presents practical challenges in production, both at a laboratory-scale and in commercial manufacturing processes. this problem becomes augmented when expressed in bacterial hosts because of a lack of folding assistance in the bacterial cytoplasm for viral antigens. therefore, due to advantages in assisted folding, posttranslational modifications, and the possibility of generating multiple-component nps and vlps, eukaryotic hosts such as yeast, insects, and mammalian cells have been favored over bacterial hosts ( ) ( ) ( ) . 
however, these systems are significantly more expensive than bacterial systems, are more time-consuming, and the downstream processes are usually more complex. moreover, the purification of vlps from insect cell systems poses a challenge due to similar physicochemical properties between the vlps and the baculoviruses ( , ) . bacterial systems, if available, would provide a cost-effective means to develop and deliver vaccines, as well as sero-diagnostic antigen kits used to diagnose specific infectious diseases. middle east respiratory syndrome (mers) was first reported in saudi arabia in and has caused multiple cases of infection with high mortality in europe and asia ( , ) . mers is caused by mers-coronavirus (mers-cov), which can be transmitted from camels to humans, and from humans to other humans ( , ) . transmission is increasing worldwide in household and community settings, as well as in nosocomial settings, as exemplified in a outbreak in korea ( , ) . neither effective vaccines nor therapeutic interventions are currently available. because of this, assembly of mers-cov antigens into immunologically relevant conformation as nps would be of interest and may be helpful in developing vaccines, sero-diagnostic tools, and therapeutic monoclonal antibodies. in the current study, we present a novel bacterial np of mers-cov antigen using ferritin as a molecular scaffold for self-assembly. ferritin, which is present in most living organisms, has identical subunits that spontaneously self-assemble and form np complexes with internal and external diameters of and nm, respectively ( , ) . previous studies show that ferritins of helicobacter pylori from a human isolate can be used as scaffold for hiv and influenza np vaccines, using eukaryotic host cells such as human embryonic kidney cells (hek f or hek s) ( , ) . 
likewise, bacterioferritin (fr), which self-assembles into nanocages with octahedral symmetry, has also been evaluated as a potential drug delivery system ( ) . however, viral antigens of human pathogens are prone to misfolding into aggregates, which necessitates chemical refolding of the insoluble aggregates in order to regain solubility and to allow regular assembly of the antigen ( , ) . in addition, displaying antigens on the surface of multi-molecularly assembled scaffolds in bacterial hosts remains a daunting challenge. we hypothesized that nps displaying the receptor-binding domain (rbd) of the spike protein from mers-cov could be produced in a bacterial system by harnessing the function of a molecular chaperone. conventionally, protein folding and the prevention of non-functional aggregation have been ascribed to molecular chaperones ( ) ( ) ( ) . recently, it has been shown that rna molecules are able to provide novel functions as molecular chaperones ( ) ( ) ( ) . based on these findings, the concept of chaperna (chaperone + rna) function was established ( ) . in this report, chaperna function was harnessed for the folding and assembly of hybrid ferritin monomers into nps using a bacterial expression system. we also demonstrated that the biophysical properties, including solubility, yield, and stability of mers-cov nps, could be improved by properly controlling the rna-binding affinity, and the concentrations of fe + and salts. the chaperna-based np assembly may prove to be a versatile tool for developing and delivering recombinant vaccines and for serological detection of emerging/re-emerging viruses. the expression vector pge-hrid( ) was constructed from the parental vector pge-lysrs ( ) ( ) . the pge-lysrs( ) vector was enzymatically cut with ndei and kpni. 
the pcr product of hrid, which carries the tev protease cleavage site and a -histidine tag at the c-terminus, was cut using the same restriction enzymes and the digested fragment inserted into the vector to generate pge-hrid( ). fr (genbank accession no. nc_ . ) dna was synthesized by, and purchased from, cosmo genetech (korea). the dna was cleaved with sali and hindiii, and inserted into pge-hrid( ) to generate hrid( )-fr. the receptor binding domain (rbd), n-terminal residues - , of the mers-cov s protein (genbank accession no. afs . ), was generated by gene synthesis, cut with kpni and sali, and inserted into hrid-fr to generate pge-hrid( )-rbd-fr. linker ssg or asg was inserted into the c-terminus of the rbd using overlapping pcr, cleaved with kpni and sali, and ligated into hrid-fr, generating pge-hrid( )-rbd-[ssg]-fr or pge-hrid( )-rbd-[asg]-fr, respectively. the schematic diagrams of each expression vector are illustrated in figure b . the genes of mutant hrid( m) (k a and k a) and hrid( m) (k a, k a, r a, k a, k a, k a, k a, k a, and k a) were generated by gene synthesis, cleaved with ndei and kpni, and inserted into pge-hrid( )-rbd-fr, generating pge-hrid( m)-rbd-fr and pge-hrid( m)-rbd-fr, respectively. the mutation sites and amino acid sequences of the mutants are shown in table s in supplementary material. the resulting expression vectors were transformed into the escherichia coli strain shuffle ® t . the cells were grown in ml of lb medium with ampicillin ( µg/ml) at °c overnight. each type of transformant was inoculated into ml of lb medium with ampicillin, and grown at °c until an optical density (od ) of . - . was reached. protein expression was induced with mm iptg for h. each sample was harvested by centrifugation, and lysed by sonication in lysis buffer ( mm tris-hcl, ph . ; % glycerol; mm -mercaptoethanol; and . % tween- ). 
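the cloning steps above rely on each synthesized fragment carrying its two flanking restriction sites and no additional internal copy of either site. a minimal sketch of such a check is given below, assuming the standard recognition sequences of ndei, kpni, sali, and hindiii; the function name and the toy sequence are hypothetical and are not taken from the constructs described in this study.

```python
# Standard 5'->3' recognition sequences of the enzymes named in the cloning steps.
# All four sites are palindromic, so scanning one strand is sufficient.
SITES = {
    "NdeI": "CATATG",
    "KpnI": "GGTACC",
    "SalI": "GTCGAC",
    "HindIII": "AAGCTT",
}


def find_sites(seq, enzymes):
    """Return {enzyme: [0-based start positions]} for every occurrence of each site."""
    seq = seq.upper()
    hits = {}
    for name in enzymes:
        site = SITES[name]
        positions = []
        start = seq.find(site)
        while start != -1:
            positions.append(start)
            # Step by one base so overlapping occurrences are also reported.
            start = seq.find(site, start + 1)
        hits[name] = positions
    return hits


def is_clean_insert(seq, five_prime, three_prime):
    """True if each flanking site occurs exactly once and the 5' site comes first."""
    hits = find_sites(seq, [five_prime, three_prime])
    five, three = hits[five_prime], hits[three_prime]
    return len(five) == 1 and len(three) == 1 and five[0] < three[0]


# Hypothetical toy fragment: NdeI site + short filler + KpnI site.
toy_fragment = "CATATG" + "ATGGCTAGCTTT" + "GGTACC"
toy_hits = find_sites(toy_fragment, ["NdeI", "KpnI", "SalI", "HindIII"])
toy_ok = is_clean_insert(toy_fragment, "NdeI", "KpnI")
```

a fragment that fails this check would be cut internally during digestion, so the sketch is best read as a pre-synthesis sanity check rather than a description of how the constructs here were designed.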
the soluble fraction of each lysate was purified on a ni-affinity histrap™ hp column using an äkta prime system (ge healthcare) and concentrated with centriprep™ (merck millipore ltd.). the purified proteins were treated with tev protease to remove the fusion partner hrid. the assembled nps were purified by gel filtration on / superose™ increase columns (ge healthcare). to examine the size and structure of the purified nps, microscopic evaluations using tem and cryo-em were performed. for tem analysis, a drop of the nps was placed onto a formvar/carbon-coated tem grid (spl). the grid was negatively stained with % uranyl acetate, dried, and examined using a jem- electron microscope (jeol) at an accelerating voltage of kv. the particle sizes were calculated using camera-megaview iii (soft imaging system-germany) for measuring the nps in random image fields. for cryo-em, the nps were placed onto plasma-treated formvar/carbon copper grid (ems) and negatively stained with % uranyl acetate. the grid was examined at an accelerating voltage of kv with an fei cryotecnai f cryo-em microscope made available through the korean institute of science and technology. the nps were examined and photographed in high resolution. nanoparticle samples ( ml) were placed into a dispo-h cell, and analyzed using a zeta-potential & particle size analyzer (els- ). the size of the nps was measured twice at °c in water as a solvent with the sample accumulation time at s.

figure caption (beginning missing): ( , , and °c ) and the cell lysates were separated into total (t), soluble (s), and insoluble (p) fractions by centrifugation (left panel). the solubility of each protein expressed at °c was measured by a gel densitometer and the data were summarized and shown in the right panel (n = ). statistical significance (**p < . , ***p < . ) was indicated for the samples compared with the control using a two-tailed student's t-test. (d) illustration of mers-cov rbd-fr nps using the chaperna-based hrid fusion partner. the hrid facilitated folding of the aggregation-prone rbd-fr through interaction with rna. the monomer of rbd-fr formed a properly folded trimeric structure after cleavage of hrid with tev protease. eight trimers assembled and formed into mers-cov-like nps. red triangles indicate the rbd trimer on the fr nps.

effect of salt and fe + concentrations on np assembly and stability
cultured cells ( ml) were lysed with lysis buffer in the presence of various concentrations of nacl ( , , , , , , , , and mm) to evaluate the intracellular proteins. all experiments were performed in triplicate. the cell lysates were separated into soluble and insoluble fractions by centrifugation, and the protein stabilities were analyzed by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (sds-page). thus, the proteins from cell lysates ( ml culture) were purified using hispur™ ni-nta resin (thermo fisher scientific) in buffer a depending on nacl concentration ( - mm). to evaluate the effects of fe + on np formation, cells were cultured in lb media with various concentrations of fe + ( , , , and , µm). np formation was examined by size exclusion chromatography (sec), sds-page, tem, and dls at the various concentrations of nacl or fe + . the cells were harvested, sonicated with lysis buffer, and separated into soluble and pellet fractions by centrifugation. target proteins in the soluble fraction were purified using hispur™ ni-nta resin (thermo fisher scientific), following the manufacturer's instruction. t (total lysate), s (soluble fraction), p (pellet fraction), w (wash fraction), and e (elution fraction) were analyzed by sds-page. co-purification of the nucleic acids and proteins in the wash and elution fractions was analyzed on a native agarose gel. the nucleic acids were visualized with ethidium bromide (etbr), and the proteins with coomassie staining. cultured cells ( ml) were harvested using the same method described above. 
the cells were lysed with µl of protein extraction reagent b-per™ ii (thermo scientific) and separated into soluble and pellet fractions by centrifugation at , rpm for min. a µl aliquot of each soluble fraction was further treated with µg/ml of rnase a (intron biotechnology) and incubated at °c for min. the nuclease-treated samples were clarified by centrifugation at , rpm for min and the soluble supernatants and the pelleted precipitates were analyzed on an sds-page gel followed by western blot analysis. to confirm the proper folding of rbd-fr and its variant (rbd-[ssg]-fr), the binding of the purified proteins to the mers-cov receptor hdpp was assessed by elisa. fr only and phosphate-buffered saline (pbs) were used as negative controls. nunc -well microtiter immunoplates (thermo fisher scientific) were coated with ng/well of hdpp proteins (abcam) and incubated at °c overnight. the plates were washed and blocked with µl/well of blocking buffer ( % bsa) for h at room temperature. rbd (ssg linker, wt, m, or m)-fr ( ng/well) were added for h at °c. an anti-penta his antibody ( µl/well; qiagen) was serially diluted ( / to / , ) in tbst [ mm tris-cl (ph . ), . % tween- ], added to the wells, and incubated for h at °c. a secondary goat anti-mouse igg antibody conjugated with hrp in a -µl volume ( : , , sigma-aldrich) was added and incubated for h at °c. the plates were washed three times with tbst at the end of each step. after washing, µl/well of substrate tmb solution (bd biosciences) was added to the wells and the plates were incubated at °c for min in the dark. µl of stop solution ( n h so ) was added to each well to stop the colorimetric reaction, and the absorbance at nm was measured using an elisa reader, fluostar optima (bmg labtech). . the coating antigens were removed, and the wells were blocked with pbst ( % skim milk in pbs and tween- ) for h at °c. after h, the blocking solution was removed. 
twofold serially diluted sera from four patients (cnnh- , , , and ) were added to each well and incubated at °c for h. the antigen-coated wells were incubated with peroxidase-conjugated goat anti-human igg antibody (kpl, seracare life sciences, milford, ma, usa) at °c for h. the primary antibody was removed and , ′, , ′-tetramethylbenzidine (tmb; sigma-aldrich) was added to each well as colorimetric substrate. immediately after treatment of the reactions with stopping solution (sigma-aldrich), the od was read at nm. six-week-old female balb/c mice were immunized with µg/mouse of the rbd-fr, rbd-[ssg]-fr, or rbd protein generated as described above, or with commercially available mers-cov rbd protein (mers-rbd- p; eenzyme) as antigen in a bsl- facility at ylarc. antigens were diluted in pbs. for the first group, an equal volume of mf adjuvant (addavax, cat. no vacadx- ) ( ) was mixed by pipetting. for the other group, equal volumes of antigen and alum adjuvant (thermo fisher scientific) were mixed by pipetting following the manufacturers' protocol. pbs plus adjuvant and fr were used as negative controls. the immunized mice were boosted twice with intramuscular injections on days and . mice were anesthetized on days and for ocular bleeding from the orbital sinus ( figure s in supplementary material). immune sera were processed by centrifugation of the collected blood at , × g for min. the spleen and the bronchoalveolar lavage fluid (balf) were obtained at days after the last immunization from sacrificed mice. balf was taken by washing the airways with ml of pbs. t-cell populations from immunized mice were analyzed by flow cytometric analysis ( , ) . the spleens were taken at days after the last immunization from the sacrificed mice. to obtain single-cell suspensions, the tissues were homogenized and passed through µm cell strainers (spl). after centrifugation, erythrocytes were removed by red blood cell lysing buffer (sigma). 
the cells were washed and resuspended in iscove's modified dulbecco's media containing % fbs. for intracellular cytokine staining, the splenocytes were stimulated with µg/ml rbd protein or phorbol myristate acetate/ionomycin in the presence of ng/ml recombinant human il- (biolegend) and brefeldin a ( : , ; ebioscience) at °c for h. after stimulation, the cells were blocked with rat anti-mouse cd /cd (bd biosciences) and surface stained with anti-cd (fitc, clone - . ; biolegend) and anti-cd (pe/cy , clone gk . ; biolegend) at °c for min. the stained cells were fixed in facs lysing solution (bd biosciences) at room temperature for min, and permeabilized with facs buffer ( . % fbs, . % nan in pbs) containing . % saponin (sigma) at room temperature for min. then, the cells were stained with anti-ifn-γ (pe, clone xmg . ; biolegend) and anti-tnf-α (apc, clone mp -xt ; biolegend) at room temperature for min. all data were collected on a bd lsr fortessa (bd biosciences) and analyzed with flowjo software (tree star inc., ashland, or, usa). competition elisa was performed to determine whether serum from mice immunized with mers-cov antigen [rbd-[ssg]-fr, rbd-fr, rbd, and fr (negative control)] inhibited binding of rbd protein to hdpp receptor ( , ) . ng/well hdpp protein (abcam) was coated on nunc -well microtiter immunoplates (thermo fisher scientific) and incubated overnight at °c. plates were washed and blocked with µl/well of blocking buffer [ % skim milk in pbs and tween- (pbst)] for h at °c. at the same time, sera from mice immunized with rbd, rbd-[ssg]-fr, rbd-fr, and fr were serially diluted ( / to / ) with ng/well rbd protein (mers-rbd- p; eenzyme) in tbst [ mm tris-cl (ph . ), . % tween- ], added to new wells, and incubated for h at °c. µl solution was added to each well at °c and incubated for h. after that, µl of anti- xhis tag antibody conjugated with horseradish peroxidase ( : , , thermo fisher scientific) was added to each well and incubated for h at °c. 
plates were washed three times with tbst, and µl/well of substrate tmb solution (bd biosciences) was incubated at °c for min in the dark. µl of stop solution ( n h so ) was added to each well to stop the color reaction, and the absorbance at nm was measured using an elisa reader, fluostar optima (bmg labtech).

the hrid facilitated the solubility of mers-cov rbd-fr
the spike glycoprotein (s) of mers-cov was used for the generation of mers-cov-like nps. s protein forms trimers, resulting in large spikes on the virus envelope ( ) . it is challenging to express the full-sized s protein (~ kda) in e. coli. thus, the s domain of s protein (~ kda), which carries the receptor-binding ability, was used. our initial attempt to express the s domain, either as s or as an s -fr fusion protein, failed; the expression level and solubility of the protein were below the lower limit of detection by sds-page and western blotting ( figure s in supplementary material). we therefore used the rbd ( - a.a.) of the s protein, which has a pivotal function as illustrated in figure b ( , ) . when expressed alone in e. coli, the rbd is not able to form the trimeric assembly (unpublished observation), due to the lack of the hr domain within the s domain ( ) . to overcome this problem, fr was used as scaffold for the assembly. fr is a spherical np whose subunits form trimers that subsequently result in octahedral structures composed of identical subunits ( ) . we therefore performed computational modeling to evaluate the potential of fr as scaffold for trimer formation of the rbd. possible trimer formation was analyzed using modeler ( , ) and cluspro ( , ) . various linkers, including ssg, asg, and d , were introduced between the rbd and fr with the goal of minimizing steric hindrance between the two domains so as to enhance trimer and np formation. 
in silico analysis showed energy-stable trimeric models of rbd-fr, rbd-[ssg]-fr, and rbd-[asg]-fr, whereas rbd-d -fr failed to form a trimeric structure ( figure a) . the rbd-[ssg]-fr was predicted to be the most stable and well-structured compared with rbd-fr and rbd-[asg]-fr. initial testing of the rbd-fr constructs without hrid fusion showed that none of the constructs were solubly expressed, even under low-temperature culture conditions ( figure c ) ( figure b) . we previously confirmed that by using chaperna, the globular domain of influenza hemagglutinin (ha) is efficiently assembled into a trimeric complex with an immunologically relevant conformation (yang et al., in press) . as shown in figure c , the hrid fusion significantly increased the solubility of both rbd-fr ( . %) and rbd-[ssg]-fr ( . %), indicating that the chaperna platform effectively increased both the solubility and the folding of its fused target proteins. because of the poor expression level and low solubility of the rbd-[asg]-fr construct (figure c) , further experiments were performed using only the rbd-fr and rbd-[ssg]-fr constructs. after purification of the soluble proteins ( figure s in supplementary material), we determined the potential effects of using hrid as a fusion partner for the self-assembly of the nps. as shown in figure s in supplementary material, hrid-rbd-fr failed to form nps. because of this, we performed tev protease cleavage of the hrid. removal of the hrid domain facilitated the self-assembly of the rbd-fr monomers, and also eliminated the immune response against the non-self hrid domain in balb/c mice ( figure s in supplementary material). after hrid cleavage, rbd-fr and rbd-[ssg]-fr were purified using sec (figure a) . as expected, rbd-[ssg]-fr assembled into properly formed nps ( , kda) more efficiently than did rbd-fr nps, which were mainly detected in the void-volume fractions, suggesting they were irregularly assembled soluble aggregates. 
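the sec comparison above contrasts material eluting in the void volume (soluble aggregates) with material eluting at the size expected for assembled nps. one way to express that contrast as a single number is to integrate the chromatogram over two elution windows and report the np share of the combined area. the sketch below assumes a trapezoid-rule integration and hypothetical window boundaries and trace values; it is illustrative only and is not the quantification used in this study.

```python
def trapezoid_area(volumes, absorbances, lo, hi):
    """Integrate absorbance over elution volume for segments lying fully inside [lo, hi]."""
    area = 0.0
    points = list(zip(volumes, absorbances))
    for (v0, a0), (v1, a1) in zip(points, points[1:]):
        if v0 >= lo and v1 <= hi:
            area += 0.5 * (a0 + a1) * (v1 - v0)
    return area


def np_fraction(volumes, absorbances, void_window, np_window):
    """Share of the (void + NP) peak area that elutes in the NP window."""
    void_area = trapezoid_area(volumes, absorbances, *void_window)
    np_area = trapezoid_area(volumes, absorbances, *np_window)
    total = void_area + np_area
    if total <= 0:
        raise ValueError("no signal inside the two integration windows")
    return np_area / total


# Hypothetical trace: elution volume (ml) and absorbance (arbitrary units).
toy_volumes = [7.0, 8.0, 9.0, 10.0, 11.0, 12.0, 13.0]
toy_absorbance = [0.0, 2.0, 0.0, 0.0, 6.0, 0.0, 0.0]

# Hypothetical windows: 7-9 ml for the void volume, 10-12 ml for assembled NPs.
toy_np_share = np_fraction(toy_volumes, toy_absorbance, (7.0, 9.0), (10.0, 12.0))
```

with real data the windows would have to be set from the column calibration, and baseline subtraction would be needed before integration.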
the size of the rbd-[ssg]-fr nps was further confirmed by tem. tem images of the rbd-[ssg]-fr np structures showed hollow, spherical particles that were more compact than the rbd-fr nps. the average diameter of the rbd-[ssg]-fr nps was - nm (figure b) . in contrast, in dls analysis the rbd-fr np without the ssg linker appeared smaller, with an average intensity distribution diameter of . nm, compared with . nm for rbd-[ssg]-fr (figure c) . consistent with the sec analysis, rbd-fr without a fusion partner was mostly produced in a soluble aggregated form. we therefore concluded that protein folding did not occur properly without hrid, and the formation of nps was confirmed by both sec and sds-page analyses. as shown in figure s in supplementary material, the purified nps retained their stability over an extended period of time at various temperatures ( , , and − °c). thus, these results indicate that the ssg linker allowed the rbd-fr to generate properly assembled nps. it should also be noted that the efficiency of protein folding and np formation may be further enhanced through appropriate linker selection. it has been reported that ionic strength plays an important role in the stability and self-assembly of ferritins ( , ) . we examined the effect of salt concentration on the formation and stability of the rbd-[ssg]-fr nps at various concentrations ( - mm). consistent with the previous studies, the stability of the protein was highly affected by the concentration of nacl in the lysis buffer, as shown by sds-page ( figure a ) (n = ). the solubility of the protein significantly decreased as the concentration of nacl increased from to mm, with the solubility being about . -fold lower at mm compared with mm. unlike previous studies, the solubility of the protein was gradually recovered at higher nacl concentrations (> mm); the solubility at mm was . -fold higher than at mm. 
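the dls diameters quoted above are hydrodynamic diameters, which instruments of this kind typically derive from a measured translational diffusion coefficient through the stokes-einstein relation, d_h = k_B·T / (3·π·η·D). the sketch below illustrates that conversion only; the temperature, viscosity, and diffusion coefficient are hypothetical inputs chosen for the example and are not values reported in this study.

```python
import math

BOLTZMANN_J_PER_K = 1.380649e-23  # exact SI value of the Boltzmann constant


def hydrodynamic_diameter_nm(diffusion_m2_per_s, temperature_k, viscosity_pa_s):
    """Stokes-Einstein hydrodynamic diameter (nm) of a sphere diffusing in a fluid."""
    if diffusion_m2_per_s <= 0 or viscosity_pa_s <= 0 or temperature_k <= 0:
        raise ValueError("diffusion coefficient, viscosity, and temperature must be positive")
    diameter_m = (BOLTZMANN_J_PER_K * temperature_k) / (
        3.0 * math.pi * viscosity_pa_s * diffusion_m2_per_s
    )
    return diameter_m * 1e9


# Hypothetical inputs: water near room temperature and an illustrative
# diffusion coefficient; none of these numbers come from the measurements above.
toy_temperature_k = 298.15
toy_viscosity_pa_s = 0.00089
toy_diffusion = 2.0e-11

toy_diameter_nm = hydrodynamic_diameter_nm(toy_diffusion, toy_temperature_k, toy_viscosity_pa_s)
```

because the diameter is inversely proportional to the diffusion coefficient, slowly diffusing soluble aggregates dominate an intensity-weighted distribution, which is one reason dls and tem sizes for the same sample can differ.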
furthermore, the yield of soluble protein per liter of culture increased in a salt concentration-dependent manner (figure b) . to further investigate the effect of salt concentration, the physicochemical and morphological properties of the rbd-[ssg]-fr protein were examined by sec, tem, and dls. in mm nacl, most of the protein was aggregated during the purification process, and the purified protein failed to form spherical structures, but instead, existed predominantly as kda monomers (figures c,d; figure s in supplementary material). in contrast, the protein that was lysed in mm nacl and purified in mm nacl, developed well-structured nps according to tem and dls analyses (figures c,d; figure s in supplementary material). however, based on sec analysis, at high-salt concentrations (> mm), the protein failed to form stable structures with the proteins being eluted predominantly in the void volume, suggesting they were soluble aggregates under the high-salt concentrations ( figure c) . transmission electron microscopy images under the various salt concentrations clearly supported the conclusion, showing that the tendency for aggregation was dependent on the salt concentration ( figure d) . taken together, the results underscored the importance of salt concentration on the solubility of monomers and the quality of multimeric assembly of hybrid nps. ferritin has an intrinsic ability to interact with fe + to form ferritin-iron cores ( ) . thus, it was worth investigating the effect of fe + on the assembly and stability of rbd-[ssg]-fr nps. cells were grown in lb medium with various concentrations of fe + . as shown in figure a , the yield of purified protein was significantly increased from cultures with µm fe + , reflecting a . -fold increase compared with similar cultures with µm fe + . the cell growth and purification yield at , µm fe + were slightly decreased, presumably due to the toxicity of ferric acid. 
np formation under the various concentrations of fe + was analyzed by sec ( figure b) . consistent with the previous results, the proteins were eluted mainly in the fractions expected for the size of assembled nps ( , kda). of note, the ratio between nps and soluble aggregates in the sec analysis showed that np formation was facilitated at high concentrations of fe + (figure b) . the formation of rbd-[ssg]-fr nps at an fe + concentration of , µm was confirmed by tem ( figure c ) and dls ( figure d) . the tem analysis clearly showed that the morphology of the proteins was more compact, and probably highly stable, when assembled at high fe + concentrations ( µm) than at lower concentrations ( µm) ( figure c ). as shown in figure d , the average diameter of nps examined by dls was . nm at high fe + concentration ( - , µm) and . - . nm at lower concentration ( - µm). these results suggest that both fe + and salt concentrations influenced the efficiency and quality of the regular assembly of hybrid ferritin monomers into nps. our previous studies show that an rna-protein interaction is crucial for transducing the chaperone function of rna into the folding of client proteins ( ) . consistent with that, our present study showed that rna facilitated the folding of its interacting proteins. the solubility of hrid(wt)-rbd-fr was . -fold higher than that of rbd-fr without hrid fusion (figure ) , strongly supporting the previous studies. in addition, rbd alone was completely insoluble (figure b ; figure s in supplementary material). it has been shown that the positively charged residues of lysine moieties in hrid contribute to trna binding ( ) . in the current study, the trna binding induced the intrinsically disordered protein (idp) status of hrid to form alpha-helical structures ( figure a) . thus, two rna-binding (table s in supplementary material). the total e. 
coli lysate (t) was fractionated into the soluble fraction (s) and the pellet fraction (p) by centrifugation. as expected, both rbd and rbd-fr without fusion to hrid domain, were refractory to being produced as soluble proteins ( figure b) . interestingly, the solubility of the rna-binding mutants did not decrease, but actually increased to . % for the m mutant and . % for the m mutant compared with wild-type protein at . % (figure b) . considering that hrid is relatively unstructured in the absence of trna binding, the results are consistent with previous reports that the fusion with idps promotes the solubility of target proteins ( ) ( ) ( ) . following purification of wild-type hrid-rbd-fr (hrid(wt)-rbd-fr), electrophoretic mobility shift assays showed that greater amounts of nucleic acids were co-purified with hrid(wt)-rbd-fr protein than with the mutant hrid-rbd-frs ( and m) under non-denaturing conditions ( figure c ). the relative ratio of nucleic acid based on etbr staining and proteins based on coomassie staining in the eluted fraction confirmed the reduced affinity of mutants to nucleic acids. to test if rna had a role in maintaining the stability of the target proteins, the lysates were treated with rnase a to eliminate rna, and the solubility of each protein was analyzed by sds-page and western blotting. the soluble fractions of the lysates (s) were incubated at °c in the presence and absence of rnase a and the samples were further separated into soluble fraction (ss′) and insoluble fraction (sp′) by centrifugation. as shown in the left panel of figure , rnase a treatment completely abolished the effect of rna on protein solubility as compared with the control (rnase a−) or with samples prior to rnase treatment. parallel experiments with the and m mutants showed much less rna co-purified with the proteins, confirming the reduced affinity to nucleic acids and the complete depletion of rna by rnase a treatment (figure , left panel) . 
remarkably, the solubility of hrid(wt)-rbd-fr was greatly reduced by depletion of rna as reflected in the ratio of [ss′]/[sp′] [ . and . for rnase (+) and rnase (−), respectively] by both coomassie staining and western blot analyses (figure , right panel). however, the solubility of the mutants ( and m) was not significantly affected by rnase a treatment, probably due to their lower affinity to rna (figure , right panel) . taken together, the results demonstrate that hrid(wt)-rbd-fr maintained a strong affinity for rna, and that affinity was pivotal for maintaining the solubility of the protein. to further define the rna dependence of solubility of the ferritin hybrids (figure ) , we investigated if the rna binding had a role in the formation of nps. rbd-fr and the various hrid-rbd-fr (wt, , and m) proteins were purified by nickel-affinity chromatography ( figure s in supplementary material) and their physicochemical properties analyzed by sec (figure a) , tem (figure b) , and dls ( figure c) . the soluble yield of rbd-fr (hrid fusion) was approximately . mg/l of culture, representing greater than , -fold higher levels than its hrid (−) counterpart (~ μg/l culture), again confirming the role of hrid as a robust enhancer for solubility and assembly. it was striking to note that the two mutant proteins, despite high solubility (figure b) , were detected at disproportionately higher amounts in the void fractions of sec, indicating that they failed to form nps of a defined size, and existed predominantly as soluble aggregates ( figure a) . however, hrid(wt)-rbd-fr predominantly formed nps of a defined size (~ , kda). it is also interesting to note that there was a slight shift of the rna-binding mutants ( and m) in the elution pattern, suggesting a larger size of nps compared with wild-type nps. 
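the solubility percentages and the [ss′]/[sp′] ratios discussed above are both derived from band intensities measured in the soluble and pellet fractions. the sketch below shows one way such numbers can be computed, assuming solubility is defined as soluble / (soluble + pellet) and that rna dependence is summarized as the ratio of the rnase (−) to the rnase (+) [ss′]/[sp′] value. both definitions and all intensities are assumptions made for illustration, not the authors' stated procedure or data.

```python
def soluble_fraction(soluble_intensity, pellet_intensity):
    """Solubility as soluble / (soluble + pellet) band intensity (assumed definition)."""
    total = soluble_intensity + pellet_intensity
    if total <= 0:
        raise ValueError("band intensities must sum to a positive value")
    return soluble_intensity / total


def ss_sp_ratio(ss_intensity, sp_intensity):
    """Ratio of the re-centrifuged soluble (ss') to insoluble (sp') band intensity."""
    if sp_intensity <= 0:
        raise ValueError("sp' intensity must be positive")
    return ss_intensity / sp_intensity


def rna_dependence(ratio_without_rnase, ratio_with_rnase):
    """Fold drop in ss'/sp' caused by RNase treatment (hypothetical summary index)."""
    if ratio_with_rnase <= 0:
        raise ValueError("ratio with RNase must be positive")
    return ratio_without_rnase / ratio_with_rnase


# Hypothetical densitometry readings in arbitrary units.
toy_minus_rnase = ss_sp_ratio(80.0, 20.0)   # mostly soluble without RNase
toy_plus_rnase = ss_sp_ratio(20.0, 80.0)    # mostly pelleted after RNase
toy_fold_drop = rna_dependence(toy_minus_rnase, toy_plus_rnase)
toy_solubility = soluble_fraction(80.0, 20.0)
```

an index close to 1 would correspond to the behavior described for the rna-binding mutants, whose solubility was not significantly affected by rnase a treatment.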
overall, the ratio between soluble aggregates in the void volume and the nps of defined size clearly showed that rna binding was crucial for assembly of the monomers into nps. as a control, rbd-fr (without hrid fusion) existed predominantly as soluble aggregates ( figure s in supplementary material). consistent with these results, em analysis confirmed well-structured nps by hrid(wt)-rbd-fr, compared to largely aggregated structures by the mutant proteins ( figure b) . even when a multi-molecular structure formed, it was unstable and existed mostly as soluble aggregates. consistently, the intensity distribution diameter of the wild-type protein, as estimated by dls analysis, was nm compared with larger sizes of hrid( m) at . , . nm and hrid( m) at , . nm (figure c ; figure s in supplementary material). it is conceivable that soluble aggregates may shield the exposed -histidine tag, resulting in a decreased binding affinity to nickel resins and elution in earlier fractions compared with wt protein ( figure s in supplementary material). taken together, the data demonstrate that rna binding prevented aggregation into irregular conformations and guided the self-assembly of the hybrid ferritin monomers into nps of a stable structure. the immunological properties of ferritin nps were analyzed by elisa. the hdpp (human dpp ) receptor has been previously identified as the receptor for mers-cov human infection ( ) . therefore, using hdpp as a coating antigen, elisa-binding assays between rbd nps and the receptor were performed (figure ) . fr without rbd fusion failed to bind, and was similar to the pbs negative control. strikingly, the binding ability increased in the same order as the rna-binding ability (hrid(wt) > m > m), with highest absorbance observed in the wt with the ssg linker (hrid(wt)-rbd-[ssg]-fr). 
the results show that the conformation of rbd in the wt nps more closely resembled the protective antigen of mers-cov rbd from cells than did that of the rna-binding mutants and m. again, judicious choice of the linker between the ferritin carrier and the antigen was important for receptor binding, mirroring its importance for np assembly into a stable conformation (figure ). finally, the reactivity of the nps with human patient sera was investigated by elisa using sera from four mers-cov-infected patients (figure ). six different proteins, comprising five recombinant nps (hrid(wt)-rbd-fr, hrid( m)-rbd-fr, hrid( m)-rbd-fr, hrid(wt)-rbd-[ssg]-fr, and fr) and mers-cov rbd protein, were compared by elisa as capture antigens. strong elisa signals were detected for the four rbd-displaying recombinant nps and for mers-cov rbd from cells (positive control). the wt form consistently showed a higher response than the rna-binding mutants (hrid(wt) > m > m), with hrid(wt)-rbd-[ssg]-fr being the best binder among the constructs tested. these results attest to the utility of the e. coli-assembled mers-cov rbd-fr nps as tools for sero-diagnosis of mers-cov infection. taken together, the results confirmed the immunologically relevant conformation of the mers-cov rbd displayed on the hybrid ferritin particles, and the crucial role of rna in controlling the kinetic pathway for the assembly of viral antigen monomers into stable nps.

to evaluate the immunogenicity of ferritin-based nps, balb/c mice (n = ) were immunized with rbd, rbd-fr, and rbd-[ssg]-fr np antigens. the trnas were found to be removed from the hrid protein during the purification process. before immunization, potential rna contamination in the purified proteins was assessed by gel electrophoresis. as shown in figure s in supplementary material, after several purification steps rna was below the detection level, if present at all, in contrast to the proteins purified in the first step.
previously, mf -adjuvanted and alum-adjuvanted mers-cov antigens have been reported to increase the antibody and t-cell responses in mice ( , ). thus, the first group and second group were immunized twice with . µg of antigen containing the equal volume of alum

figure s in supplementary material and size exclusion chromatography was used to explore hdpp receptor-binding affinity to the protein. all data are shown as mean ± sd from triplicate samples. fr alone and phosphate-buffered saline were used as negative controls.

figure s in supplementary material were used as coating antigens. fr alone and infected cell lysates were used as negative and positive controls, respectively. virus-infected sera from four patients were serially diluted from : (twofold dilution). all data are presented as mean ± sd of duplicate samples.

higher than rbd, respectively. the antibody responses elicited by rbd-fr and rbd-[ssg]-fr nps were much stronger than those elicited by rbd in all antibody subtypes tested (igg , igg a, and igg b) (figures b-d). as a test of mucosal immune responses, the rbd-specific iga antibody levels in balf were also analyzed by elisa (figure e). mf -adjuvanted rbd-[ssg]-fr nps gave significantly higher od values than rbd and fr (negative control). these results suggest that rbd-[ssg]-fr nps induce a stronger local mucosal immune response than rbd. in addition, antibody responses of igg, igg (th ), igg a, and igg b (th ) against mf -adjuvanted antigens were higher than those against alum-adjuvanted antigens. in contrast, the pbs and fr control groups failed to induce, or only weakly induced, an antibody response against rbd protein. these results suggest that fr-based nps enhance various antibody responses significantly more than monomeric antigens. the cellular immune responses were investigated in mice immunized with protein (rbd, rbd-fr, rbd-[ssg]-fr) and fr (negative control).
splenocytes of mice (n = ) were harvested week after the last immunization, stimulated with rbd protein, and analyzed for cytokines by flow cytometry. in the rbd-immunized group, ifn-γ and tnf-α-producing cd + t-cell responses were detected at low levels. however, ifn-γ and tnf-α-producing cd + t cells were significantly increased in the rbd-fr and rbd-[ssg]-fr-immunized groups compared with the rbd and fr-immunized groups (figure s in supplementary material). these results demonstrate that vaccination with rbd nps induced antigen-specific cd + t cells that produced ifn-γ and tnf-α upon antigen stimulation.

figure | nanoparticle-immunized mouse serum inhibited the interaction between middle east respiratory syndrome-coronavirus receptor-binding domain (rbd) and the hdpp receptor. competition enzyme-linked immunosorbent assay showed that anti-rbd mouse sera ( : , from mice immunized with rbd-[ssg]-fr, rbd-fr, and rbd) blocked binding between rbd ( µg/ml) and hdpp receptor ( µg/ml). fr-immunized mouse serum ( : ) was used as a negative control. all sera were serially diluted from : (twofold dilution). all data are presented as mean ± sd (n = ) and p-values were obtained using student's two-tailed t-tests (***p < . ).

figure | immune responses in mice (n = ) immunized with receptor-binding domain (rbd) nanoparticles (nps). endpoint titers of igg (a), igg (b), igg a (c), and igg b (d) antibodies binding to middle east respiratory syndrome-coronavirus rbd were determined using mouse serum after two immunizations. rbd-specific antibodies were detected by enzyme-linked immunosorbent assay after immunization with rbd nps, rbd, or fr with adjuvant (alum and mf ). (e) rbd-specific iga antibodies were detected using balf (diluted : ) after immunization with protein and mf . od, optical density. endpoint titers are shown for individual mice. all error bars show mean ± sd (n = ) and all p-values were obtained using student's two-tailed t-tests (**p < . , ***p < . ).
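the endpoint titers in the figure caption above are read from twofold serial dilutions of serum. as a minimal sketch of how such a titer can be derived from od values, the python below takes the endpoint as the highest reciprocal dilution, read from the start of the series, whose od stays above a cutoff. the cutoff rule (mean of negative-control wells plus three standard deviations), the starting dilution of 100, and every od value are illustrative assumptions, not values or methods taken from this study.

```python
from statistics import mean, stdev


def twofold_dilutions(start, n):
    """Reciprocal dilutions of a twofold series: start, 2*start, 4*start, ..."""
    return [start * 2 ** i for i in range(n)]


def cutoff_from_negatives(neg_ods, k=3.0):
    """Assumed cutoff rule: mean of negative-control ODs plus k standard deviations."""
    return mean(neg_ods) + k * stdev(neg_ods)


def endpoint_titer(dilutions, ods, cutoff):
    """Highest reciprocal dilution, read from the start of the series, with OD above cutoff.

    Returns 0 when even the first dilution is at or below the cutoff.
    """
    titer = 0
    for dilution, od in zip(dilutions, ods):
        if od <= cutoff:
            break  # stop at the first well that has fallen to background
        titer = dilution
    return titer


# Hypothetical example data (not from the study).
dilutions = twofold_dilutions(100, 8)  # 100, 200, ..., 12800
sample_ods = [2.10, 1.80, 1.20, 0.70, 0.35, 0.15, 0.08, 0.06]
negative_ods = [0.05, 0.06, 0.07]
cutoff = cutoff_from_negatives(negative_ods)  # roughly 0.09
print(endpoint_titer(dilutions, sample_ods, cutoff))
```

stopping at the first well at or below the cutoff, rather than taking any later well that happens to rise above it, keeps a single noisy well at high dilution from inflating the titer.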
anti-nps serum effectively blocked rbd protein binding to the hdpp receptor

middle east respiratory syndrome-coronavirus infection is mediated by the interaction of rbd with the host receptor hdpp ( , ). as a correlate of protection, a competition elisa was performed to investigate whether antibodies generated by np immunization were able to interfere with rbd binding to hdpp . rbd protein was incubated with mouse serum ( : ), and the binding of the serum-mixed samples to hdpp protein was then measured. as shown in figure , rbd-[ssg]-fr, rbd-fr, and rbd-immunized sera strongly inhibited the binding of rbd to the hdpp receptor ( . , . , and . %, respectively). interestingly, the relative efficiency of interference correlated with that of np assembly (figure ). in contrast, the fr-immunized mouse serum (negative control) failed to inhibit the interaction. taken together, these results demonstrate that immunization with nps greatly stimulates a mers-cov-specific antibody response that effectively interferes with cellular receptor binding, suggesting the potential of the nps as a vaccine. however, protective efficacy should ultimately be tested in a live virus challenge model.

a highly repetitive nanostructure, as a key immunologic feature, provides a design principle for nps that induce potent and long-lasting antibody responses. for vlps of non-enveloped viruses, assembly is achieved purely by capsid proteins. for enveloped viruses, however, additional membrane components and matrix proteins are required to display the target antigens on the surface of assembled vlps. a promising alternative is to present the target antigen on the surface of self-assembled nps, which, in lieu of lipid membranes and matrix proteins, serve as a macromolecular scaffold for the presentation of antigens of interest ( ). ferritins, as a substitute for matrix proteins and membranes, have been used as scaffolds for the regular assembly of target antigens.
however, ferritin-based nps have been produced only in host cells of mammalian or insect origin ( , ) . previously, we showed that influenza ha could be assembled in a soluble, trimeric, and immunologically relevant conformation by exploiting chaperna activity ( ) . the present study is the first report of using rnas as molecular chaperone for supra-molecular structures. here, we present a novel bacterial system for np assembly of hybrid ferritin displayed surface antigens from mers-cov. the nps reacted strongly with sera derived from mers-cov-infected patients (figure ) confirming their utility in sero-diagnosis of infection. moreover, the antisera, generated from immunization of mice, were able to interfere with the binding to the cellular receptor hdpp (figure ), in part of essential protective immune responses. the efficiency of receptor-binding inhibition (figure ) , as well as the ability for inducing the mucosal responses (figure e) , correlated with the regular assembly of nps as examined by dls or em (figure ) , confirming that presentation of antigenic epitopes on a multivalent and highly repetitive structure is indeed important for the quality of immune responses. overall, the quality of nps and consequent immune responses were governed by the rna-mediated assembly of antigens. we hypothesized that chaperna function could be harnessed for presenting target antigens as highly repetitive nanostructures ( figure d) . the hrid is the n-terminal domain of hlysrs and was previously identified as a nucleic acid-binding domain ( figure ) ( ) . in this report, the hrid was exploited as a transducer for chaperna function (tcf) by serving as a docking-tag for cellular rna for the folding/assembly of the hybrid fr containing client antigen proteins [rbd of mers-cov (figure d) ]. the advantage of using hrid as a tcf could be many fold. first, hrid is small ( . 
kda), monomeric, and flexible enough to allow access of a site-specific protease for the removal of hrid (figure s in supplementary material). of note, hrid belongs to the intrinsically disordered proteins (idps) and switches into stable α-helices upon binding trnas. second, the bound rna, owing to its highly negative charge, may prevent uncontrolled intermolecular interactions among monomers that lead to amorphous aggregation. finally, even the naked hrid (in the absence of rna binding), owing to its intrinsically flexible nature, may not pose a physical hindrance to the multiple interactions among monomers, enabling assembly into stable super-structures upon removal of the hrid. thus, the potential "pace-making" function harnessed with the rna molecule allows a regular assembly of monomers into highly repetitive nanostructures. consequently, in the current study, hybrid fr was produced in soluble form, could be purified by one-step affinity chromatography, and, most remarkably, assembled into nps of defined size upon removal of the hrid (figure s in supplementary material). consistent with the principles of design, the loss of rna binding by hrid significantly hampered the regular assembly of the ferritin monomers and increased the amount of non-functional misfolded protein present as soluble aggregates (figure ). thus, the overall yield, as well as the quality of nps, depended on the chaperna function transduced by the hrid, which in turn was mediated by interaction with cellular rnas (likely trnas). the driving and controlling factors for de novo assembly of biomolecules are poorly understood. historically, host factors like groel/s were initially discovered as molecular chaperones supporting viral growth in e. coli and the assembly of viral capsid proteins ( , ). moreover, groel/s also cooperates with rbcx in plant cells for the assembly of multicomponent rubisco, which is the most abundant protein in the biosphere and is responsible for photosynthesis ( ).
therefore, it is intriguing that rna could provide such a robust folding/ assembly of a supra-molecular structure. we recently confirmed that the present strategy could be successfully applied to the assembly of bacterially synthesized monomers of norovirus into vlps composed of monomers (unpublished observation, seong, b.l.). whether rna can substitute for, or collaborate with pre-existing protein-based molecular chaperones remains an exciting avenue for future investigations. it should be noted that the defined versatile functions are being expanded for rna molecules. as an engineered system for harnessing chaperna function, the present report may prove to be the tip of an iceberg for pivotal function of rna molecules as chaperones for the folding and supra-molecular assembly of proteins in living organisms ( , ) . various factors were identified as important for efficient assembly of mers-cov nps. as an extrinsic factor, the binding affinity of hrid to cellular rnas was crucial for the assembly and the quality of the assembled nps (figure ) . as intrinsic factors, the concentration of salts and fe + also influenced the assembly and stability of nps (figures and ) . the ionic strength played an important role in the stability and self-assembly of ferritins, and aggregation increased with increasing concentrations of nacl ( ) . the assembly of the hybrid mers-cov nps revealed an interesting change in salt dependence, with - mm nacl buffer as optimal condition as confirmed by em and dls analyses (figure ) . the change in salt dependence was probably due to the presence of electrostatic interactions among rbd domains ( , ) . the dependence on fe + was not surprising considering that ferritin has an intrinsic ability to interact with fe + to form ferritin-iron cores ( ) . 
based on our experience, to enhance the quality of nps, it is advisable to control fe + concentrations, both during the culturing of the bacterial cells and during the purification of the soluble monomer proteins (figure ) . first, the yield of the purified protein was increased in the presence of µm fe + (figure b) , up to . -fold greater compared with the control conditions lacking fe + . second, the ratio between nps and soluble aggregates in sec showed that nps formation was facilitated at high concentrations of fe + , and resulted in a more compact morphology under em (figures b,c) . thus, both the overall yield and the quality of nps were governed by their intrinsic ability to interact with fe + . finally, our data show that the presence and the nature of the linker between the ferritin and the rbd antigen was also important to the assembly of nps. it is possible that a linker with flexibility and sufficient length would accommodate the steric requirements for assembly of multimeric nps. however, it is difficult to precisely predict the effect of the linker, and therefore it is advisable to screen multiple constructs during the early stages of testing the assembly of nps displaying antigens of interest. in conclusion, the chaperna-based antigen assembly platform holds promise for the development and delivery of np-based vaccines to enhance rbd-specific antibody responses, and the serological detection of emerging viruses. various types of designing principles have advanced the structure-based approaches to np assembly ( , ) . however, most of the in silico methods consider the thermodynamic stability of the final assembled nps, but not necessarily the kinetic pathways leading to their successful folding into regular assemblages. consequently, most nps are refractory to soluble expression and fail to assemble as designed, resulting in significant, and practical challenges in the manufacturing process. 
the chaperna-mediated folding and the "pace-keeping" assembly of monomers into higher-order structures will enable faithful production of np and vlp-based vaccines against emerging and re-emerging viral infections.

this study was carried out in accordance with the recommenda-

figure | elucidation of rna-mediated nanoparticle (np) formation of receptor-binding domain (rbd)-fr. (a) size exclusion chromatography analysis of rbd-fr nps purified from the tev protease-cleaved hrid(wt, , or m)-rbd-fr. the fractions ( - ml) estimated as nps were further analyzed by transmission electron microscopy (b) and dynamic light scattering (c)

exploiting virus-like particles as innovative vaccines against emerging viral infections traditional and new influenza vaccines vaccine manufacturing: challenges and solutions management of accidental exposure to ebola virus in the biosafety level laboratory vaccine delivery using nanoparticles vaccine delivery: a matter of size, geometry, kinetics and molecular patterns virus-like particles as a highly efficient vaccine platform: diversity of targets and production systems and advances in clinical development the influence of antigen organization on b cell responsiveness structure of the hepatitis e virus-like particle suggests mechanisms for virus assembly and receptor binding self-assembly of human papillomavirus type capsids by expression of the l protein alone or by coexpression of the l and l capsid proteins how will hpv vaccines affect cervical cancer?
virus-like particles as universal influenza vaccines comparative protein structure modeling using modeller structure-based design of peptides that self-assemble into regular polyhedral nanoparticles comparative protein structure modeling of genes and genomes construction and characterization of virus-like particles: a review self-assembling protein nanoparticles in the design of vaccines differences in the post-translational modifications of human papillomavirus type b major capsid protein expressed from a baculovirus system compared with a vaccinia virus system isolation of a novel coronavirus from a man with pneumonia in saudi arabia middle east respiratory syndrome coronavirus (mers-cov) serology in major livestock species in an affected region in jordan human infection with mers coronavirus after exposure to infected camels, saudi arabia interhuman transmissibility of middle east respiratory syndrome coronavirus: estimation of pandemic risk an outbreak of middle east respiratory syndrome coronavirus infection in south korea mers outbreak in korea: hospital-to-hospital transmission the composition and the structure of bacterioferritin of escherichia coli self-assembly in the ferritin nano-cage protein superfamily presenting native-like trimeric hiv- antigens with self-assembling nanoparticles hemagglutinin-stem nanoparticles generate heterosubtypic influenza protection ferritin nanocages: great potential as clinically translatable drug delivery vehicles? refolding of recombinant proteins advances in refolding of proteins produced in e. 
coli the role of molecular chaperones in protein folding molecular chaperones in the cytosol: from nascent chain to folded protein the hsp and hsp chaperone machines rna-mediated chaperone type for de novo protein folding m rna is important for the in-cell solubility of its cognate c protein: implications for rna-mediated protein folding rnas as chaperones the folding competence of hiv- tat mediated by interaction with tar rna protein solubility and folding enhancement by interaction with rna modeling of loops in protein structures cluspro: a fully automated algorithm for protein-protein docking the cluspro web server for protein-protein docking universal vaccine against respiratory syncytial virus a and b subtypes optimization of antigen dose for a receptor-binding domain-based subunit vaccine against mers coronavirus a truncated receptor-binding domain of mers-cov spike protein potently inhibits mers-cov infection and induces strong neutralizing antibody responses: implication for developing therapeutics and vaccines dipeptidyl peptidase is a functional receptor for the emerging human coronavirus-emc middle east respiratory syndrome coronavirus (mers-cov) entry inhibitors targeting spike protein molecular basis of binding between novel human coronavirus mers-cov and its receptor cd structure of mers-cov spike receptor-binding domain complexed with human receptor dpp structure-based discovery of middle east respiratory syndrome coronavirus fusion inhibitor stability of a -meric homopolymer: comparative studies of assembly-defective mutants of rhodobacter capsulatus bacterioferritin and the native protein evaluation of comparative protein structure modeling by modeller- salt-dependent aggregation and assembly of e coli-expressed ferritin ferritin protein nanocages use ion channels, catalytic sites, and nucleation channels to manage iron/oxygen chemistry a peptide from the extension of lys-trna synthetase binds to transfer rna and dna sweeping away protein 
aggregation with entropic bristles: intrinsically disordered protein fusions enhance soluble expression intrinsically disordered proteins and intrinsically disordered protein regions intrinsically unstructured proteins and their functions identification of an ideal adjuvant for receptor-binding domain-based subunit vaccines against middle east respiratory syndrome coronavirus protein/peptide-templated biomimetic synthesis of inorganic nanoparticles for biomedical applications self-assembling influenza nanoparticle vaccines elicit broadly neutralizing h n antibodies harnessing an rna-mediated chaperone for the assembly of influenza hemagglutinin in an immunologically relevant conformation host participation in bacteriophage lambda head assembly properties of a mutant of escherichia coli defective in bacteriophage lambda head formation (groe). i. initial characterization coupled chaperone action in folding and assembly of hexadecameric rubisco modeling the influence of salt on the hydrophobic effect and protein fold stability rational design of an epstein-barr virus vaccine targeting the receptor-binding site key: cord- - gi vka authors: singh, praveen kumar; kulsum, umay; rufai, syed beenish; mudliar, s. rashmi; singh, sarman title: mutations in sars-cov- leading to antigenic variations in spike protein: a challenge in vaccine development date: - - journal: j lab physicians doi: . /s- - sha: doc_id: cord_uid: gi vka objectives the spread of severe acute respiratory syndrome coronavirus- (sars-cov- ) virus has been unprecedentedly fast, spreading to more than countries within months with variable severity. one of the major reasons attributed to this variation is genetic mutation. therefore, we aimed to predict the mutations in the spike protein (s) of the sars-cov- genomes available worldwide and analyze its impact on the antigenicity. 
materials and methods several research groups have generated whole genome sequencing data which are available in the public repositories. a total of , spike proteins were extracted from , complete genome and partial spike coding sequences of sars-cov- available in ncbi till may , and subjected to multiple sequence alignment to find the mutations corresponding to the reported single nucleotide polymorphisms (snps) in the genomic study. further, the antigenicity of the predicted mutations inferred, and the epitopes were superimposed on the structure of the spike protein. results the sequence analysis resulted in high snps frequency. the significant variations in the predicted epitopes showing high antigenicity were a v, v f and a s in receptor binding domain (rbd). other mutations observed within rbd exhibiting low antigenicity were t i, a s, r i, g s, v a, h q, a s, a s and k e. the rbd t i, a s, v f, a s, a s and k e are novel mutations reported first time in this study. moreover, a v and d y mutations were observed in the heptad repeat domain and one mutation d h was noted in heptad repeat domain . conclusion s protein is the major target for vaccine development, but several mutations were predicted in the antigenic epitopes of s protein across all genomes available globally. the emergence of various mutations within a short period might result in the conformational changes of the protein structure, which suggests that developing a universal vaccine may be a challenging task. since the rapid outbreak of novel coronavirus ( -ncov, later named sars-cov- or severe acute respiratory syndrome coronavirus ) in wuhan, china, the world health organization on january , , declared the sars-cov- epidemic as a public health emergency of international concern. the enduring pandemic has caused nearly million detected cases of coronavirus disease illness and claimed over , , lives worldwide as of may , according to covid- resource center johns hopkins. 
however, so far, no proven therapeutic or effective vaccine candidate has been found. for developing a drug or vaccine, the protein profiling and/or genomic information of the pathogen is extremely crucial. to understand genetic landscape of sars-cov- virus, scientists have worked tirelessly and the complete genome sequences of virus isolates are published. now, many isolates have been sequenced completely or partially and are available in the database for scientific community. it is found that the genome size of sars-cov- varies from . kb to . kb. specific genetic characteristic in its genome have also been found. the genome consists of four structural proteins including spike protein (s), envelope (e), membrane (m), and nucleocapsid (n) proteins. of the four glycoproteins, the m protein is reported to have role in determining the shape of the virus envelope and stabilizing the nucleocapsids, the n protein is involved in processes related to the viral genome, the viral replication cycle, and the cellular response of host cells to viral infections. the e protein which is the smallest protein in the sars-cov- structure plays the role in the production and maturation of this virus. however, the glycoprotein s is the core transmembrane monomer of approximately kda size with two subunits s and s . this glycoprotein mediates membrane fusion and finally facilitates virus entry (receptor-binding and entry of virion into the target cells). the receptor binding domain (rbd) (residues - ) of the subunit s is known to interact with angiotensin-converting enzyme (ace- ), which provides tight binding to the peptidase domain of ace- . this gives an impression of rbd being an important element of virus-receptor interaction and has an essential role in virus-host range, tropism, and infectivity. 
the rbd sequences of the different sars-cov- strains circulating globally were initially thought to be conserved; however, with the availability of sequencing data, several mutations have been reported. these findings support the argument that mutated strains circulating in a particular geographic setting may correlate with higher mortality rates and transmission patterns, besides other factors. the mutation rate in ribonucleic acid (rna) viruses is extremely high, and can be a million times higher than that of their hosts; this results in virulence modulation and an evolutionary capability for better viral adaptation. genetic characterization of virus mutations can thus offer valuable insights for assessing drug resistance, immune escape, and pathogenesis. owing to its receptor-binding property, the s protein is also thought to be immunogenic and a putative target for developing neutralizing antibodies and vaccines. it is reported that single-point mutations in the conserved amino acid residues of the rbd region completely abolish the capacity of the full-length s protein to induce neutralizing antibodies. thus, virus mutation studies can be crucial for designing new vaccines and antiviral drugs. in this study, we aimed to predict the mutations in the spike protein (s) of the sars-cov- genomes available in the database (whole genome sequences as well as partial coding sequences of the spike protein) and to analyze the effect of each mutation on the antigenicity of the predicted epitopes. this information may be helpful in predicting the transmission and infectivity of the various sars-cov- strains circulating worldwide.

entrez direct (edirect) utilities were used to access the ncbi nucleotide database, with in-house bash scripts to batch download the data.
we used the query phrases "severe acute respiratory syndrome coronavirus " and "spike coding sequences (cds)" with the esearch and efetch utilities, implemented in bash scripts, to download the genome and spike protein datasets available until may , . a total of , complete draft genomic sequences of sars-cov- and additional cds from partial genomes coding for the spike protein (total , cds of spike protein) were downloaded in fasta format from the ncbi database (►fig. ). multiple sequence alignment (msa) of all the , complete genome sequences as well as the , cds was performed using clustalw-mpi with default parameters. snps were identified from the alignment by in-house bash scripts that batch processed the data on a blade server (dell poweredge fc server) with gb ram and a core processor at . ghz. after msa, each genome and spike protein was labeled by location, and clustering was done based on % similarity for ease of visualization and analysis. visualization was performed using jalview . . . . the output snp alignment generated from the msa was used to build a maximum likelihood phylogenetic tree using raxml (randomized axelerated maximum likelihood, raxml . . ). phylogenetic trees were visualized using the interactive tree of life (itol) v with their respective metadata. emboss antigenic software was used to predict the antigenic regions and the epitopes in the unique spike proteins based on antigenic scores using the formula $A_p = f(\mathrm{Ag}) / f(\mathrm{s})$, where f(ag) = antigenic frequency and f(s) = surface frequency; an antigenic score ≥ . is considered potentially antigenic. the data for the different epitopes were analyzed and the epitopes with high antigenicity were superimposed on the structure of the spike protein.

overall, , snps were found in , complete genome datasets. on the basis of similarity these were classified into clusters. however, among the cds of spike proteins ( , complete genomes and partial cds) a total of snps in clusters were found.
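the variant-calling step described above, reading mutations off a multiple sequence alignment, can be sketched in a few lines. the python below is a simplified illustration and not the in-house bash pipeline used in the study: it compares each aligned spike protein row with a reference row, reports substitutions in reference coordinates (for example `D3G`), and tallies how often each one occurs. the function names, the toy six-column alignment, and the choice to skip gaps and ambiguous `X` residues are assumptions made for the example.

```python
from collections import Counter


def call_substitutions(ref_row, query_row, gap="-", unknown="X"):
    """List substitutions between two rows of the same alignment.

    Labels use reference numbering, e.g. 'D3G' means D at reference
    position 3 is replaced by G. Gap and ambiguous columns are skipped.
    """
    if len(ref_row) != len(query_row):
        raise ValueError("aligned rows must have equal length")
    substitutions = []
    ref_pos = 0
    for ref_aa, query_aa in zip(ref_row, query_row):
        if ref_aa != gap:
            ref_pos += 1  # advance only on real reference residues
        if gap in (ref_aa, query_aa) or query_aa == unknown:
            continue  # insertion, deletion, or ambiguous call
        if ref_aa != query_aa:
            substitutions.append(f"{ref_aa}{ref_pos}{query_aa}")
    return substitutions


def substitution_frequencies(ref_row, query_rows):
    """Fraction of query sequences carrying each substitution."""
    counts = Counter()
    for row in query_rows:
        counts.update(call_substitutions(ref_row, row))
    total = len(query_rows)
    return {label: count / total for label, count in counts.items()}


# Toy alignment (hypothetical sequences, not real spike data).
reference = "MKD-LA"
queries = ["MKGTLV", "MKG-LA", "MKD-LA", "MKD-LA"]
frequencies = substitution_frequencies(reference, queries)
print(frequencies)
```

numbering by reference residue rather than by alignment column keeps the labels stable when other sequences introduce insertions into the alignment.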
further snp analysis resulted in identification of snps in a gene stretch of , - , bp in the s gene, encoding the spike (s) protein. the most predominant snp predicted in the gene encoding s protein was a>g in . % of overall genomes under study. in addition to several single mutations in the s gene of all available genomes, we also predicted double mutations such as g>t, c>g, c>a, c>t (corresponding to four amino acid antigenic drift aldp -> sves at position - ) and t>g (l w); t>a (f i) in two different genomes from the united states. list of all the snps in the rbd, antigenic sites, and double mutations among strains is shown in ►table . one deletion ( - deltta) was also found in an indian strain (mt . ). no copy number variants were observed in this virus. the alignment of , spike proteins extracted from , complete genomes and partial cds was performed and after clustering based on % identity, we identified unique sequences with hypervariable sites within these protein sequences. based on the variable sites the phylogeny was inferred showing two major clades a and b with many subclades in the s protein of sars-cov- circulating worldwide (►fig. ). furthermore, the evaluation of the antigenicity of spike protein predicted highest scoring antigenic epitopes (antigenic scores ≥ . ) due to variations in each (at positions l f, l w, f i, s f, d n, f l, l m, l v, d e, d i, a s, v f, a v, and a v). out of these, amino acid changes were noted at positions a v, v f and a s in the rbd with v f and a s being novel. other speculated mutations in the putative epitopes lying within rbd showing less antigenicity were t i, a s, r i, g s, v a, h q, a s, a s, and k e out of which t i, a s, a s, and k e are also novel. in addition, regions outside rbd ( - , , - , , , - , , - , - , , - , , - , - ) also infer high antigenicity (based on predicted antigenic score in the range . - . ) with variations in s protein of different genomes from similar locations as shown in ►fig. . 
the antigenic epitopes are depicted on the structure of the spike protein (►fig. ). two mutations, a v and d y, were observed in the heptad repeat domain (hr ) and one mutation, d h, in heptad repeat domain (hr ). several studies have shown that mutations within the spike protein influence virus-host interaction. among the four proteins, viz., m, s, n, and e, the m protein is known to play a significant role in virus assembly; the e protein is involved in the production and maturation of the virus; the n protein is involved in processes related to the viral replication cycle and the cellular response of host cells to viral infection; and the s protein is the major target for neutralizing antibodies as it mediates fusion and facilitates viral entry into the host. in the present study, we found that although multiple genetic variants were identified within the same country, some unique mutations were found only in particular countries, which suggests that the diversity of s protein mutations might have a significant role in the pathogenicity of this virus in countries with high or low mortality rates, as also proposed by others. we predicted snps in the gene encoding the s protein with a>g and vice versa in sars-cov- genomes submitted from india, t>a and c>t in those from china, and the interchange of all four nucleotides (c>t, t>a, a>g, g>t, c>g, c>a, g>c, t>c, a>t, g>a) in genomes submitted from the united states. the variety seen in the data from the united states is notable; however, it might simply reflect that the largest number of genomes was submitted from the united states. the snp profile revealed that the s protein mutations were predominant at specific positions only. these mutations are expected to make the virus better able to escape the host immune response and might contribute to natural selection and evolution of sars-cov- , as reported by andersen et al. it is important to mention that double mutations in the s protein were found only in strains from the united states and not in genomes from other regions.
these double mutations probably could have helped in the increased virulence of the virus. it has also been noticed that the death toll is comparatively higher in the united states than in other regions included in the study. this might probably indicate that the prevalence of several mutated strains within the provinces would have either reduced or increased its severity. it may also help in understanding the antigenic and immunogenic changes but the correlation of mutation with regional virulence could not be established due to statistical imbalance of the available genomes in the database at the time the study was done. moreover, extensive research is required to correlate the mutations with the severity of the disease and mortality. out of snp clusters, d g was found in ( . %) sars-cov- genomes. the amino acid change in a>g variant (p.d g) involves a change of large acidic residue d (aspartic acid) into small hydrophobic residue g (glycine). this observation is important, as this large difference in both size and charge may help compromise the binding affinity of antibodies against s protein, due to electrostatic interactions in the tertiary structure of protein group. this may hinder the developments of vaccines and might potentiate the virus for antigenic drifts. the effect of deletion variant (figured in one sars-cov- genome from india) on the viral phenotypes needs further investigation. the high frequency of genetic mutations in rna viruses is well known but in the genomes of sars-cov- , we found a series of single amino acid variations. this can affect the virus evolution and emergence of the new strains. the mutations in the rbd found in our study predicted conformational changes in the s domain of spike protein. the mutations in rbd play an important role while designing new drugs, as suggested in a recent study. these mutations might affect the interaction of viral rbd with the host receptor. 
our study revealed mutations of which six were novel mutations (►table ). out of the six novel mutations two were exhibiting high antigenicity while others were in the less antigenic region. the amino acid change observed in the antigenic epitopes were from positively charged to uncharged amino acids (r->i, h->q), and negatively charged to uncharged (d->n, d->g, d->y) amino acids. we also found mutations in negative to positively charged (d->h) amino acids. these replacements might influence the tertiary structure of the proteins and facilitate the increased virulence by escaping host immune response. the sequences in the hr (residues - ) and hr (residues , - , ) regions tend to form dimeric or trimeric helix bundles. as the s protein of coronavirus are homodimers or homotrimers, these hr regions may undergo oligomerization and result in the conformational change of s protein during virus-host cell fusion. these regions show different conformations in different fusion states and are known to be the most conserved among other regions in s protein of sars-cov. , however, a previous study shows variations in hr domain which forms helical bundles with hr to facilitate fusion and entry of virus into the host and hypothesizes that the mutation a v in hr domain along with a v mutation in hr domain confers peptide entry inhibitor resistance in mouse hepatitis coronaviruses. hence the mutation a v in hr domain and d h in hr domain found in our study might be relevant in explaining the pathogenesis of sars-cov- . with the rapid spread of this virus and limitation of specific therapy, studies are being focused on exploring the potential of neutralizing antibodies (as in plasma therapy) against vulnerable epitopes of s protein. 
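the charge-class changes listed above (for example r->i, d->g, d->h) can be tabulated with a short python sketch. it is a simplification under stated assumptions: residues are grouped only by side-chain charge, and histidine is counted as positively charged because the text groups it that way, although its protonation depends on ph. the names `charge_class` and `describe_substitution` are hypothetical.

```python
# Side-chain charge classes (simplified; histidine treated as positive, as in the text).
POSITIVE = set("RKH")
NEGATIVE = set("DE")


def charge_class(residue: str) -> str:
    """Classify a one-letter amino acid code as positive, negative, or uncharged."""
    code = residue.strip().upper()
    if code in POSITIVE:
        return "positive"
    if code in NEGATIVE:
        return "negative"
    return "uncharged"


def describe_substitution(change: str) -> str:
    """Describe a substitution written as 'X->Y' in terms of charge class."""
    old, new = change.split("->")
    return f"{charge_class(old)} -> {charge_class(new)}"


# Substitutions named in the text above.
summary = {c: describe_substitution(c) for c in ["R->I", "H->Q", "D->N", "D->G", "D->Y", "D->H"]}
```

such a table only flags a change of charge class; whether a given replacement actually alters the tertiary structure or antibody binding would need structural modelling or experiment.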
our study predicts
an interactive web-based dashboard to track covid- in real time
a review of sars-cov- and the ongoing clinical trials
genomic characterization of a novel sars-cov-
coronavirus envelope protein: current knowledge
cryo-em structure of the -ncov spike in the prefusion conformation
structure of mouse coronavirus spike protein complexed with receptor reveals mechanism for viral entry
emerging sars-cov- mutation hot spots include a novel rna-dependent-rna polymerase variant
preliminary identification of potential vaccine targets for the covid- coronavirus (sars-cov- ) based on sars-cov immunological studies
real estimates of mortality following covid- infection
single amino acid substitutions in the severe acute respiratory syndrome coronavirus spike glycoprotein determine viral entry and immunogenicity of a major neutralizing domain
clustalw-mpi: clustalw analysis using distributed and parallel computing
jalview version -a multiple sequence alignment editor and analysis workbench
raxml-vi-hpc: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models
interactive tree of life (itol) v : recent updates and new developments
emboss: the european molecular biology open software suite
a semi-empirical method for prediction of antigenic determinants on protein antigens
new hydrophilicity scale derived from high-performance liquid chromatography peptide retention data: correlation of predicted surface residues with antigenicity and x-ray-derived accessible sites
potential therapeutic targeting of coronavirus spike glycoprotein priming
sars-cov- viral spike g mutation exhibits higher case fatality rate
the proximal origin of sars-cov-
why are rna virus mutation rates so damn high?
electrostatic interactions in protein structure, folding, binding, and condensation
exploring the genomic and proteomic variations of sars-cov- spike glycoprotein: a computational biology approach
lineage-specific differences in the amino acid substitution process
interaction between heptad repeat and regions in spike protein of sars-associated coronavirus: implications for virus fusogenic mechanism and identification of fusion inhibitors
tectonic conformational changes of a coronavirus spike glycoprotein promote membrane fusion
mechanisms of viral membrane fusion and its inhibition
coronavirus escape from heptad repeat (hr )-derived peptide entry inhibition as a result of mutations in the hr domain of the spike fusion protein
a highly conserved cryptic epitope in the receptor binding domains of sars-cov- and sars-cov
most vaccine strategies against covid- focus on the predicted epitopes of the sars-cov- spike protein. this protein is also proposed to be the most potent and specific target for drugs and for designing neutralizing antibodies. our findings indicate that vaccine design against sars-cov- could be a challenging task. even though both rna-based and peptide-based vaccines are being developed in more than seven laboratories, our observations may be useful in the efficacy analysis of these vaccine candidates.
none.
conflict of interest p.k.s. and u.k. are research officers in a department of biotechnology funded unrelated project (bt/pr / ner/ / / ).
key: cord- -hoaxv e authors: jeong, gi uk; song, hanra; yoon, gun young; kim, doyoun; kwon, young-chan title: therapeutic strategies against covid- and structural characterization of sars-cov- : a review date: - - journal: front microbiol doi: . /fmicb. . sha: doc_id: cord_uid: hoaxv e
the novel coronavirus, sars-cov- , or -ncov, which originated in wuhan, hubei province, china in december , is a grave threat to public health worldwide.
a total of , , confirmed cases of coronavirus disease (covid- ) and , deaths were reported globally up to may , . however, approved antiviral agents for the treatment of patients with covid- remain unavailable. drug repurposing of approved antivirals against other viruses such as hiv or ebola virus is one of the most practical strategies to develop effective antiviral agents against sars-cov- . a combination of repurposed drugs can improve the efficacy of treatment, and structure-based drug design can be employed to specifically target sars-cov- . this review discusses therapeutic strategies using promising antiviral agents against sars-cov- . in addition, structural characterization of potentially therapeutic viral or host cellular targets associated with covid- have been discussed to refine structure-based drug design strategies. in late december , a newly identified coronavirus strain capable of crossing the species barrier and infecting humans was first reported in wuhan, hubei province, china, and was provisionally termed novel coronavirus zhu et al., ) . this novel virus was later designated as severe acute respiratory syndrome coronavirus (sars-cov- ), owing to its genetic similarity with other coronavirus strains (gorbalenya et al., ) . it is known to cause coronavirus disease , characterized by influenza-like mild or moderate respiratory symptoms including dry cough, fever, headache, and pneumonia, as well as severe lung injury and multi-organ failure, which eventually lead to death huang c. et al., ) . the world health organization (who) officially declared covid- as a pandemic on march , due to the rapid global dissemination of sars-cov- . according to the who, a total of , , confirmed cases of covid- and , deaths were recorded up to may , in over countries. moreover, effective antiviral therapeutic agents or vaccines are not yet available for covid- . 
the repurposing of existing drugs designed for other viruses is the most practical strategy to treat patients with covid- because they have already been tested for their safety. although de novo development of antivirals is a time-, cost-, and effort-intensive endeavor, it is important to generate specific antivirals for sars-cov- that directly target the viral or host proviral factors (cascella et al., ; senanayake, ) . with increasing structural data of key proteins in both sars-cov- and the host, such as the spike glycoprotein (s), the main protease (m pro ), rna-dependent rna polymerase (rdrp), and human angiotensin-converting enzyme (hace ), the structure-based design of new drugs has emerged as the most promising antiviral strategy. in this review, we have summarized the promising therapeutic potential of pre-existing drugs against covid- . in addition, the structural characterization of potentially therapeutic viral or host cellular targets associated with covid- have been discussed to refine structure-based drug design strategies. sars-cov- is an enveloped, positive-sense, single-stranded rna virus and belongs to the genus betacoronavirus, which also includes sars-cov and mers-cov (andersen et al., ; lu et al., ; zhu et al., ) . the genome sequence of sars-cov- is more closely related to that of sars-cov ( % identity) than with that of mers-cov (∼ %) . notably, the s protein of sars-cov- and sars-cov are highly homologous with . % amino acid sequence identity . consequently, sars-cov- and sars-cov are believed to bind to the same host cell entry receptor hace zhou et al., ) instead of human dipeptidyl peptidase (hdpp ), which is used by mers-cov (raj et al., ) . sars-cov- has club-like spikes on its surface and a distinct replication strategy analogous to other coronaviruses. the life cycle and replication of sars-cov- is shown in figure . 
viral infection is initiated by the interaction between the s protein and hace , followed by subsequent endocytosis or membrane fusion. the s protein comprises two subunits: s and s . the s subunit contains the receptor binding domain (rbd) and binds to n-terminal hace , while the s subunit mediates virus-host membrane fusion. s proteins are cleaved by the host cell furin protease and transmembrane serine protease (tmprss ) at the s /s boundary and the s ′ position. proteolytic cleavage at the s /s boundary is thought to promote tmprss -dependent entry into the target cells (belouzard et al., ; hoffmann et al., ; walls et al., ) . after the release of the viral polycistronic rna into the cytoplasm, the replicase gene comprising open reading frames (orfs) a and ab is directly translated into either replicase polyprotein pp a (∼ kda, nsp - ) or pp ab (∼ kda, nsp - ) by a ribosomal− frameshift near the ′ -end of orf a and autoproteolytically cleaved into non-structural proteins (nsp - ) by two orf aencoded protease domains (brierley et al., ; herold et al., ; thiel et al., thiel et al., , harcourt et al., ; prentice et al., ; ziebuhr, ) . furthermore, the main protease m pro (also called cl pro ) and papain-like protease (pl pro ) participate in this extensive proteolytic cleavage. the large pp ab polyprotein has no < conserved cleavage sites that are mediated by m pro , which cleaves at leu-gln↓(ser, ala, gly) (arrow indicates the cleavage site) (ziebuhr et al., ; hegyi and ziebuhr, ) . positive-strand rna viruses usually form a cytoplasmic enzyme complex called replicase-transcriptase complex (rtc) that can mediate the synthesis of the full-length genome (replication) or discontinuous mrnas (transcription) (gorbalenya et al., ; pasternak et al., ; sawicki et al., ) . 
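the ribosomal − frameshift mentioned above can be pictured with a toy python sketch: the ribosome reads triplets in one frame, slips back by one nucleotide, and continues in the new frame, so one nucleotide is read twice. the sequence, the slip position, and the function names `codons` and `read_with_minus1_shift` are hypothetical; the sketch does not model the slippery sequence or rna pseudoknot that drive frameshifting in coronaviruses.

```python
def codons(seq: str, start: int = 0) -> list[str]:
    """Split seq into complete triplets, beginning at index `start`."""
    return [seq[i:i + 3] for i in range(start, len(seq) - 2, 3)]


def read_with_minus1_shift(seq: str, slip_at: int) -> list[str]:
    """Read in frame 0 up to index `slip_at`, then step back one nucleotide and continue.

    `slip_at` must be a codon boundary in frame 0 (a positive multiple of 3).
    """
    if slip_at < 3 or slip_at % 3 != 0:
        raise ValueError("slip_at must be a positive multiple of 3")
    before = codons(seq[:slip_at])      # triplets read before the slip
    after = codons(seq, slip_at - 1)    # triplets read after slipping back by one
    return before + after


toy_rna = "aaacccgggttt"                       # hypothetical sequence
in_frame = codons(toy_rna)                     # reading without a frameshift
shifted = read_with_minus1_shift(toy_rna, 6)   # reading with a -1 slip after two codons
```

comparing `in_frame` with `shifted` shows that every triplet downstream of the slip changes, which is how a single rna can give rise to both the shorter and the longer replicase polyprotein.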
structural and accessory proteins are subsequently translated from these transcripts, and new viruses assemble by budding into the lumen of the endoplasmic reticulum-golgi intermediate compartment (ergic) and are eventually secreted (klumperman et al., ; hogue and machamer, ).
antivirals can be broadly divided into two categories: direct-acting antivirals (daas) and indirect-acting antivirals (iaas). daas directly target specific viral components, such as the viral polymerase, or steps in the viral life cycle without affecting other host cellular processes. the development of daas can facilitate the treatment of patients with covid- . in contrast, iaas target host proviral factors and indirectly inhibit viral infection or replication by impeding the function or interaction of these factors. iaas have an advantage over daas because they are not susceptible to viral mutations, which are frequently found in rna viruses. however, iaas can alter the host cellular system and are not considered safe. therefore, daas targeting viral entry, proteases, and replication can serve as effective antivirals owing to their enhanced safety features. drug repurposing of pre-existing antiviral agents is considered one of the most practical strategies because there is no approved antiviral drug or vaccine available for covid- . furthermore, the de novo development of drugs typically requires over $ billion usd and - years (cascella et al., ; senanayake, ). drug repurposing of several approved antivirals against covid- has progressed into clinical trials (table ). however, there is a potential risk of drug-resistant mutations with the use of daas. a combination of repurposed drugs can reduce the time, cost of treatment, and risk of drug resistance, and increase therapeutic efficacy to facilitate progression into clinical trials (cheng et al., ).
figure | viral life cycle of sars-cov- . interaction between the s protein of sars-cov- and hace initiates sars-cov- infection. following receptor binding, the virus enters the cell by acid-dependent proteolytic cleavage of the s protein by tmprss or other proteases. upon fusion of the viral and host cell membranes, viral genomic rna is released in the cytoplasm. the viral rna initiates translation of co-terminal polyproteins (pp a/ab) by − frameshifting. these polyproteins are subsequently cleaved into nonstructural proteins (nsps) by m pro and pl pro . several nsp proteins interact with nsp (rdrp) to form the replicase-transcriptase complex (rtc), which is responsible for the synthesis of full-length viral genome (replication) and sub-genomic rnas (transcription). the viral structural proteins are expressed and translocated into the endoplasmic reticulum (er). the nucleocapsid (n) protein-encapsidated genomic rna translocates with the structural proteins into the er-golgi intermediate compartment (ergic) for virion assembly. the newly synthesized virions are budded through the cell membrane and exocytosed.
moreover, due to the existence of crystal structures of viral and host cellular proteins associated with sars-cov- , such as s protein, m pro , rdrp, and hace , structure-based drug design can be performed to develop more effective drugs with reduced off-target toxicity (schomburg and rarey, ). the cryo-electron microscopy (cryoem) structure of the extracellular domain of the s protein of sars-cov- revealed a homotrimeric conformation (wrapp et al., ). the binding of the rbd, located in the s subunit, to hace on the host cell surface initiates interaction between the virus and the host cell; therefore, the conformational switching of the rbd is considered an important event for viral entry (shang et al., ). cryoem studies revealed that the rbd in two out of three s proteins binds to the n-terminal domain (ntd) of the neighboring protomer of the s protein.
these inter-molecular interactions result in a down (closed) conformation, wherein the hace interaction interfaces are buried inside the structure. moreover, the rbd in the third s protein forms an up (open) conformation to facilitate binding with the n-terminal region of hace (figure a) (wrapp et al., ). the cryoem study of sars-cov- s showed that a single rbd formed an open conformation in an asymmetric trimer. structural comparison of the s proteins of sars-cov (pdb id crz) and sars-cov- (pdb id vsb) showed that the major structural differences came from the rbd in a closed conformation. although the rbds of s from sars-cov and sars-cov- largely resemble each other, the sars-cov- rbd showed a higher binding affinity toward hace than the sars-cov rbd (shang et al., ). the cryoem structure of full-length hace revealed a homodimeric conformation, with each monomer of hace binding to one rbd of the sars-cov- s protein (figure b). the crystal structure of hace in complex with sars-cov- rbd (pdb id m j and vw ) showed that sars-cov- rbd binds to the n-terminal region of hace via s , q , t , f , d , k , h , e , e , d , y , q , l , l , m , y , q , n , k , d , and r residues of hace and k , v , g , y , y , l , f , y , a , g , e , f , n , y , q , g , q , t , n , g , v , and y residues of sars-cov- rbd (figure c) (shang et al., ; wrapp et al., ). most of these interactions are mediated by α of hace (figure c); moreover, an n-glycosylation chain at n of hace interacts with sars-cov- s protein (shang et al., ). as mentioned earlier, the s /s junction and s ′ site of the s protein are cleaved by furin and tmprss to enable efficient entry of sars-cov- into the host cell (figure a). in addition to trypsin, cathepsin l, and elastase, tmprss is known to activate the s protein and induce virus-cell membrane fusion (matsuyama et al., ). a recent study reported that tmprss is also essential for sars-cov- entry into target cells (matsuyama et al., ).
the overall structure reveals that human ace forms a homodimer (orange and light-yellow) with b at (dark and light gray), which is located in the transmembrane region. the two sars-cov- rbds are shown as dark and light green surfaces. (c) the interaction interface between rbd and ace is shown (pdb id m j). the residues involved in the interaction between sars-cov- rbd and hace are represented with stick models in green and orange, respectively. alpha helix (α ) of hace is also labeled. (d) the overall structure of sars-cov- rbd in complex with its neutralizing antibody cr (pdb id w ). the fab regions of the heavy and light chains are shown in hot pink and pink, respectively. sars-cov- rbd is shown in green. (e) structural comparison of interfaces between sars-cov- rbd and nab or hace . the interaction interfaces with the light chain of cr , heavy chain of cr , and hace are shown in pink, hot pink, and orange, respectively. (f) hinge movement of hace upon binding of the enzyme inhibitor. the apo form (pdb id r ) and inhibitor-bound form (pdb id r l) are superimposed and shown in blue and red, respectively.
accordingly, targeting proteins that participate in sars-cov- entry can be a potential therapeutic strategy. the use of neutralizing antibodies (nabs) against the s protein of sars-cov- is thought to be promising for the treatment of patients with covid- (pinto et al., ). a nab, cr , known to target the sars-cov rbd and prevent lung pathology, can also bind to the sars-cov- rbd (ter meulen et al., ; tian et al., ). the crystal structure of sars-cov- rbd in complex with cr revealed that cr forms a distinct interaction interface with sars-cov- rbd, and does not overlap with the interaction interface between hace and sars-cov- rbd (figures d,e).
although cr binds to sars-cov rbd and sars-cov- rbd with binding affinities (kd) of and nm, respectively, it is unable to neutralize sars-cov- in vitro largely due to its inability to form the interaction interface and its low binding affinity (pinto et al., ; yuan et al., ) . however, continuous efforts are being undertaken to identify potent nabs by collecting plasma from infected individuals, and this has shown significant progress. the p b- f from sars-cov infected patients have overlapping residues, g and y , with higher rbd binding affinity than ace /rbd ( . and . nm respectively) (ju et al., ) . furthermore, the interaction interface of c /rbd overlapped with the ace binding region, and b share similar binding structures with prominent neutralizing effects (barnes et al., ; wu et al., ) . also they showed recent concern of mutation in s (d g) that might increase sars-cov- 's transmission rate and has a rare chance to affect the rbd-binding mab c , because of the distance between the rbd region and d (barnes et al., ) . in addition to identifying nabs targeting sars-cov- 's s protein, a pilot trial to use recombinant soluble human ace in covid- patients has been initiated (clinicaltrial.gov #nct ). however, this trial was recently withdrawn as it was not approved by the center for drug evaluation (cde). because ace can counter the activation of renin-angiotensin-aldosterone system (raas) treatment with ace inhibitors, it can increase ace expression in some patients to compensate for the blocked ace activity (vaduganathan et al., ) . in some animal studies, treatment of raas inhibitor resulted in increased expression of ace in specific tissues (ferrario et al., ; soler et al., ) . in this regard, some researchers hypothesized that treatment of the raas inhibitor might enhance the accessibility of sars-cov- into cells and therefore increase the risk of severity in patients carrying covid- (fang et al., ; watkins, ) . 
however, a recent case-population study showed that there was no correlation between use of raas inhibitors and increased risk of covid- (de abajo et al., ). the ace inhibitor ramipril showed cardiac protective effects without increased expression of ace (burchill et al., ). these contradictory results suggest that clinical validation of raas inhibitors is needed to demonstrate their effectiveness against covid- . the high-resolution x-ray crystal structures of apo-hace and of hace in complex with its enzymatic inhibitor mln- showed that inhibitor binding at the active site of hace can cause a large hinge-bending movement (towler et al., ) (figure f). furthermore, a structure-based drug discovery study showed that an enzymatic hace inhibitor can prevent sars-cov infection (huentelman et al., ). therefore, hace inhibitors can potentially prevent sars-cov- infection. although the structure of human tmprss is not available yet, homology modeling and in silico docking studies have demonstrated the molecular mechanisms of camostat mesylate, nafamostat, and bromhexine hydrochloride in inhibiting tmprss (sonawane et al., ). in this respect, active site-specific inhibitors of tmprss can be used as potential antiviral agents against sars-cov- .
the crystal structure of sars-cov m pro , a cysteine protease, consists of domains - . the catalytic processes of m pro are mediated by the non-canonical cys-his catalytic dyad located between domains i and ii (anand et al., , ). the m pro protein is highly conserved among sars-cov, mers-cov, and sars-cov- , and it shares the common substrate recognition sequence consisting of lq(s,a,g) (ziebuhr et al., ; hegyi and ziebuhr, ; dai et al., ). within this sequence, the gln in p of the substrate is an important common feature required for their catalytic activity.
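the substrate recognition sequence lq(s,a,g) described above can be searched for in a protein sequence with a regular expression. the following python sketch assumes only the minimal motif stated in the text (leu-gln followed by ser, ala, or gly, with cleavage after gln); real m pro specificity also depends on neighbouring positions, so the hits are candidate sites only. the toy polyprotein and the function names `find_cleavage_sites` and `split_at_sites` are hypothetical.

```python
import re

# Leu-Gln, followed (lookahead, not consumed) by Ser, Ala, or Gly.
MPRO_SITE = re.compile(r"LQ(?=[SAG])")


def find_cleavage_sites(protein: str) -> list[int]:
    """Return 1-based positions of the Gln after which cleavage would occur."""
    return [match.end() for match in MPRO_SITE.finditer(protein.upper())]


def split_at_sites(protein: str) -> list[str]:
    """Cut the sequence after each candidate Gln and return the fragments."""
    fragments, previous = [], 0
    for site in find_cleavage_sites(protein):
        fragments.append(protein[previous:site])
        previous = site
    fragments.append(protein[previous:])
    return fragments


toy_polyprotein = "MKLQSAALQGTTLQP"   # hypothetical sequence; the final LQ is followed by P
sites = find_cleavage_sites(toy_polyprotein)
pieces = split_at_sites(toy_polyprotein)
```

the lookahead keeps the residue after gln out of the match, so `match.end()` is exactly the position of the gln and the following residue stays with the downstream fragment.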
human proteases with a similar substrate specificity to that of m pro do not exist; therefore, development of m pro inhibitors is a potential therapeutic strategy for targeting sars-cov- . sars-cov- m pro consists of three domains, analogous to that of m pro from other covs (figure a ) (dai et al., ; jin et al., ; zhang et al., b) . the crystal structure of m pro revealed that it forms homodimers (dimeric protomer) through interactions between domain ii of protomer a and n-terminal residues of protomer b (figure a ) (zhang et al., b) . homodimerization of m pro is required for its enzymatic activity. mutational studies on the dimeric interface, as well as crystal structure analysis, revealed that the interaction between two protomers is required to form the s pocket at the substrate binding site (figure b ) (anand et al., ; lim et al., ; zhang et al., b) . the substrate binding site of sars-cov- consists of s ′ -s -s -s pockets lined with, h , s , m , y , f , l , n , g , c , h , h , m , e , l , h , f , d , q , t , a , and q residues ( figure b ) (dai et al., ; jin et al., ; zhang et al., b) . notably, the s pocket of covs is typically hydrophobic and can accommodate the bulky p fragment (figure b) . several structure-based drug discovery studies have investigated the interaction of inhibitors in the substrate-binding pockets of sars-cov- m pro ( figure c ) (dai et al., ; jin et al., ; zhang et al., b) . a previous study for developing broad spectrum inhibitors targeting cov m pro showed that inhibitors of sars-cov- contain a (s)-γ-lactam ring at p position to mimic glutamine and occupy the s pocket of sars-cov- m pro (zhang et al., a) . a total of structures of sars-cov- m pro in both apo and inhibitor complex forms are available in the protein data bank (pdb) database (https://www.rcsb. org/) until april . zhang et al. ( b) have developed peptidomimetic α-ketoamide inhibitors targeting sars-cov- m pro . 
they also solved the crystal structure of m pro in complex with α-ketoamide b (pdb id y g) and showed the presence of a γ-lactam ring at p position and cyclopropyl at p position (figure d). the biochemical ic values for sars-cov- , sars-cov, and mers-cov m pro were found to be . , . , and . µm, respectively (zhang et al., b). simultaneously, dai et al. ( ) developed inhibitors with an aldehyde warhead that occupy the s site and covalently bond with the catalytic cysteine of sars-cov- m pro (pdb id lze and mok) (dai et al., ) (figure e). these compounds showed high inhibition activity with ic of and nm in vitro and reduced sars-cov- infection with ec of . and . µm in plaque reduction assay (dai et al., ). the crystal structure of sars-cov- m pro in complex with the inhibitor compound n (pdb id bqy), previously designed to inhibit cov m pro , revealed that n occupies the substrate binding pocket and forms a covalent bond with catalytic c of sars-cov- m pro . consistently, the lactam ring at p position of n forms a hydrogen bond with h of sars-cov- m pro (figure f) (yang et al., ; jin et al., ). x , a potential inhibitor of sars-cov- m pro , also occupies the substrate binding pocket; however, it does not form covalent bonds (pdb id w ) (figure g).
figure | structure of sars-cov- viral m pro and its complex with inhibitors. (a) the crystal structure of sars-cov- m pro . m pro is a cysteine protease that consists of three domains and two protomers. protomer b is shown in darker colors than protomer a and each domain is shown in different colors (sky blue, split pea, and violet represent domains , , and , respectively). (b) substrate binding site of sars-cov- m pro . the substrate binding site of m pro is subdivided into s , s ′ , s , and s (shown in bold orange). the inhibitors bind to residues shown as yellow sticks (h , s , m , y , f , l , n , c , h , m , e
in conclusion, m pro of sars-cov- is a key protein that participates in the proteolytic processing of polyproteins and shows no overlapping substrate specificity with any of the known human proteases. several potent inhibitors share common structural features, including covalent bond formation with catalytic cysteine and a lactam ring at p position. because most inhibitors occupy the substrate binding pocket of sars-cov- m pro , targeting this pocket could be an efficient and safe strategy in terms of toxicity.
figure | cryoem structure of rdrp in complex with cofactors (nsp and nsp ), rna template, and remdesivir. (a) surface representation of the cryoem structure of sars-cov- rdrp in complex with its cofactors (two nsp and one nsp ) (pdb id m ). nsp and nsp are shown in gray and pink, respectively. the β-hairpin, niran, interface, thumb, palm, and finger of sars-cov- rdrp are shown in cyan, yellow, green, orange, purple, and blue, respectively. (b) a cartoon representation of the overall structure of sars-cov- rdrp in complex with the rna template and its inhibitor remdesivir (pdb id bv ). the rna template and primer strand are shown in blue and red, respectively. the red arrow indicates the direction of ntp entry. (c) magnified view of remdesivir monophosphate binding region. remdesivir covalently binds to the primer rna strand and interacts with the template rna.
replication of sars-cov- genomic rna is mediated by a multiprotein complex consisting of several non-structural proteins, such as nsp , nsp , nsp , and nsp . the functional core of this multiprotein complex consists of rna-dependent rna polymerase (rdrp, also called nsp ). sars-cov- rdrp plays an important role in the replication and transcription of viral genomic rna (figure ) and its catalytic residues are highly conserved among covs (venkataraman et al., ).
for this reason, the nucleotide analog remdesivir (gs- , gilead) has been used to target the rdrp of mers-cov, sars-cov, and sars-cov- (warren et al., ; holshue et al., ; wang m. et al., ). although the viral rdrp is a core component of viral replication, nsp and nsp are still required for the full transcriptional activity of rdrp (zhai et al., ; venkataraman et al., ; kirchdoerfer and ward, ; gao et al., ). the cryoem structure of nsp revealed an n-terminal β-hairpin (aa - ), an extended nidovirus rdrp-associated nucleotidyl-transferase domain (niran, aa - ), an interface domain (aa - ), and an rdrp domain (aa - ) consisting of finger, palm, and thumb subdomains (gao et al., ; yin et al., ) (figure a). structural studies have demonstrated that nsp can recognize the rna template in a sequence-independent manner, suggesting that the enzymatic activity of rdrp is largely sequence independent. the cryoem structure of sars-cov- rdrp in complex with an rna template and its small molecule inhibitor, remdesivir (figure b), revealed the molecular inhibitory mechanism of remdesivir (yin et al., ). remdesivir monophosphate interacts with the primer strand and uridine of the template strand by base stacking and hydrogen bonding, respectively, at the center of the catalytic active site of rdrp (yin et al., ) (figure c). the covalent incorporation of remdesivir monophosphate into the primer strand blocks the entry of nucleotide triphosphates to the active site and terminates the transcriptional activity of rdrp (yin et al., ) (figure b). other nucleotide analog compounds such as favipiravir, ribavirin, eidd- , and eidd- may exhibit a mechanism of action similar to that of remdesivir, inhibiting rdrp with non-obligate rna chain termination (elfiky, ; sheahan et al., ; wang y. et al., ). although the u.s.
food and drug administration issued an emergency use authorization for remdesivir on may , for the treatment of suspected or laboratory-confirmed covid- in adults and children hospitalized with severe symptoms, the clinical efficacy of remdesivir against sars-cov- is not known yet. moreover, no significant clinical benefits of remdesivir against sars-cov- were observed in a recent randomized, double-blind, placebo-controlled, multicenter clinical trial (clinicaltrials.gov, nct ) . taken together, compounds that target sars-cov- rdrp are largely nucleotide analogs because of their ability to form covalent bonds with the viral template rna and block the catalytic active site of rdrp. zoonotic coronavirus outbreaks such as covid- can not only affect public health but also have a major impact on societies and the global economy. therefore, global cooperation among academic institutions, governments, and pharmaceutical companies is necessary to overcome covid- . despite intensive worldwide efforts undertaken by researchers to contain the spread of sars-cov- , covid- has attained pandemic status. considering that the development of an effective vaccine and new therapeutics are still in the early stages, repurposing fda-approved and well-characterized drugs might be a pragmatic approach. consequently, some of these drugs, such as remdesivir, have been approved for emergency use and some are being tested in clinical trials. in addition, combination treatment might be an approach which could achieve synergistic effects and reduce the risk of drug-resistant mutations. a few studies have shown that some pre-existing drugs are effective for the treatment of patients with covid- . in this review, we described the ongoing therapeutic strategies targeting various components of the sars-cov- life cycle ( table ). 
in addition, we provided structural insights into the mechanism of action of well-characterized drugs targeting the interaction between hace and the spike protein of sars-cov- for viral entry, as well as m pro and rdrp for viral replication. we believe that structural characterization can aid in developing an effective therapeutic strategy not only against covid- but also other viral outbreaks in the future. gj and hs conceived, designed, did the literature review, provided, and wrote the manuscript. gy assisted in the preparation and design. dk and y-ck conceived, designed, assisted in the literature, final review, and co-wrote the manuscript. all authors contributed to the article and approved the submitted version. structure of coronavirus main proteinase reveals combination of a chymotrypsin fold with an extra alpha-helical domain coronavirus main proteinase ( clpro) structure: basis for design of anti-sars drugs the proximal origin of sars-cov- structures of human antibodies bound to sars-cov- spike reveal common epitopes and recurrent features of antibodies activation of the sars coronavirus spike protein via sequential proteolytic cleavage at two distinct sites tmprss -inhibitors play a role in cell entry mechanism of covid- : an insight into camostat and nafamostat characterization of an efficient coronavirus ribosomal frameshifting signal: requirement for an rna pseudoknot combination renin-angiotensin system blockade and angiotensinconverting enzyme in experimental myocardial infarction: implications for future therapeutic directions features, evaluation and treatment coronavirus (covid- ) epidemiological and clinical characteristics of cases of novel coronavirus pneumonia in wuhan, china: a descriptive study improving therapy of severe infections through drug repurposing of synergistic combinations structure-based design of antiviral drug candidates targeting the sars-cov- main protease use of renin-angiotensin-aldosterone system inhibitors and risk of 
covid- requiring admission to hospital: a casepopulation study anti-hcv, nucleotide inhibitors, repurposing against covid- are patients with hypertension and diabetes mellitus at increased risk for covid- infection? effect of angiotensin-converting enzyme inhibition and angiotensin ii receptor blockers on cardiac angiotensin-converting enzyme structure of the rna-dependent rna polymerase from covid- virus the species severe acute respiratory syndromerelated coronavirus: classifying -ncov and naming it sars-cov- identification of severe acute respiratory syndrome coronavirus replicase products and characterization of papain-like protease activity coronavirus puts drug repurposing on the fast track conservation of substrate specificities among coronavirus main proteases nucleotide sequence of the human coronavirus e rna polymerase locus sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor first case of novel coronavirus in the united states clinical features of patients infected with novel coronavirus in wuhan pharmacological therapeutics targeting rna-dependent rna polymerase, proteinase and spike protein: from mechanistic studies to clinical trials for covid- structure-based discovery of a novel angiotensin-converting enzyme inhibitor covid- pandemic; transmembrane protease serine (tmprss ) inhibitors as potential drugs structure of m pro from covid- virus and discovery of its inhibitors human neutralizing antibodies elicited by sars-cov- infection structure of the sars-cov nsp polymerase bound to nsp and nsp co-factors coronavirus m proteins accumulate in the golgi complex beyond the site of virion budding structure of the sars-cov- spike receptor-binding domain bound to the ace receptor dynamically-driven enhancement of the catalytic machinery of the sars c-like protease by the s -t -i /a mutations on the extra domain scutellaria baicalensis extract and baicalein inhibit replication of sars-cov- and its c-like 
protease in vitro genomic characterisation and epidemiology of novel coronavirus: implications for virus origins and receptor binding hiv protease inhibitors: a review of molecular selectivity and toxicity efficient activation of the severe acute respiratory syndrome coronavirus spike protein by the transmembrane protease tmprss enhanced isolation of sars-cov- by tmprss -expressing cells inhibition of sars-cov- infections in engineered human tissues using clinical-grade soluble human ace nidovirus transcription: how to make sense cross-neutralization of sars-cov- by a human monoclonal sars-cov antibody identification and characterization of severe acute respiratory syndrome coronavirus replicase proteins dipeptidyl peptidase is a functional receptor for the emerging human coronavirus-emc pharmacologic treatments for coronavirus disease (covid- ): a review a contemporary view of coronavirus transcription what is the potential of structurebased target prediction methods drug repurposing strategies for covid- structural basis of receptor recognition by sars-cov- an orally bioavailable broad-spectrum antiviral inhibits sars-cov- in human airway epithelial cell cultures and multiple coronaviruses in mice localization of ace in the renal vasculature: amplification by angiotensin ii type receptor blockade using telmisartan homology modeling and docking studies of tmprss with experimentally known inhibitors camostat mesylate, nafamostat and bromhexine hydrochloride to control sars-coronavirus- human monoclonal antibody combination against sars coronavirus: synergy and coverage of escape mutants viral replicase gene products suffice for coronavirus discontinuous transcription mechanisms and enzymes involved in sars coronavirus genome expression potent binding of novel coronavirus spike protein by a sars coronavirusspecific human monoclonal antibody ace x-ray structures reveal a large hinge-bending motion important for inhibitor binding and catalysis 
renin-angiotensin-aldosterone system inhibitors in patients with covid- rna dependent rna polymerases: insights from structure, function and evolution structure, function, and antigenicity of the sars-cov- spike glycoprotein remdesivir and chloroquine effectively inhibit the recently emerged novel coronavirus ( -ncov) in vitro remdesivir in adults with severe covid- : a randomised, doubleblind, placebo-controlled, multicentre trial therapeutic efficacy of the small molecule gs- against ebola virus in rhesus monkeys preventing a covid- pandemic cryo-em structure of the -ncov spike in the prefusion conformation a noncompeting pair of human neutralizing antibodies block covid- virus binding to its receptor ace inhibition of sars-cov- (previously -ncov) infection by a highly potent pan-coronavirus fusion inhibitor targeting its spike protein that harbors a high capacity to mediate membrane fusion evolution of the novel coronavirus from the ongoing wuhan outbreak and modeling of its spike protein for risk of human transmission nelfinavir inhibits replication of severe acute respiratory syndrome coronavirus in vitro structural basis for the recognition of sars-cov- by full-length human ace design of widespectrum inhibitors targeting coronavirus main proteases structural basis for inhibition of the rna-dependent rna polymerase from sars-cov- by remdesivir a highly conserved cryptic epitope in the receptor-binding domains of sars-cov- and sars-cov insights into sars-cov transcription and replication from the structure of the nsp -nsp hexadecamer α-ketoamides as broad-spectrum inhibitors of coronavirus and enterovirus replication: structure-based design, synthesis, and activity assessment crystal structure of sars-cov- main protease provides a basis for design of improved alpha-ketoamide inhibitors a pneumonia outbreak associated with a new coronavirus of probable bat origin a novel coronavirus from patients with pneumonia in china molecular biology of severe acute 
respiratory syndrome coronavirus virus-encoded proteinases and proteolytic processing in the nidovirales coronaviruses-drug discovery and therapeutic options
the authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
copyright © jeong, song, yoon, kim and kwon. this is an open-access article distributed under the terms of the creative commons attribution license (cc by). the use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. no use, distribution or reproduction is permitted which does not comply with these terms.
key: cord- -tj m s authors: ho, mitchell title: perspectives on the development of neutralizing antibodies against sars-cov- date: - - journal: antib ther doi: . /abt/tbaa sha: doc_id: cord_uid: tj m s
sars-cov- gains entry to human cells through its spike (s) protein binding to angiotensin-converting enzyme (ace ). therefore, the receptor binding domain (rbd) of the s protein is the primary target for neutralizing antibodies. selection of broad-neutralizing antibodies against sars-cov- and sars-cov is attractive and might be useful for treating not only covid- but also future sars-related cov infections. broad-neutralizing antibodies, such as d , s , and vhh- , have been reported to target a conserved region in the rbd of the s subunit. the s subunit required for viral membrane fusion might be another target. due to their small size and high stability, single-domain antibodies might have the ability to be administered by an inhaler making them potentially attractive therapeutics for respiratory infections.
a cocktail strategy combining two (or more) antibodies that recognize different parts of the viral surface that interact with human cells might be the most effective. covid- is caused by the new coronavirus sars-cov- (initially called -ncov) [ , ]. as of may , , there are , , confirmed cases and , deaths worldwide with countries affected (https://coronavirus.jhu.edu/map.html). it is widely believed that neutralizing antibodies can be used to treat covid- by reducing sars-cov- infectivity [ ]. in the current issue of antibody therapeutics, two articles related to covid- have been published through fast-track peer review, revision, and production [ , ]. while most publications in our journal have focused on cancer therapies, this is not the first time that antibody therapeutics has published papers relevant to antibody development for viral diseases. in one previous study, a human papillomavirus vaccine using a novel virus-like particle was shown to induce antibody response in
figure . development of neutralizing antibodies for treating covid- . in the receptor binding stage, the s subunit of sars-cov- binds human ace on the host cell surface. antibodies that bind the rbd domain on the s subunit might block the interaction of the rbd and the ace . cross-reactive antibodies (e.g., d , s , and vhh- ) that bind highly conserved epitopes on the rbds of sars-cov and sars-cov- could have broad neutralization activities against viral infection. in the viral fusion stage, after the cleavage of s subunit, the viral fusion peptide (fp) on the s subunit inserts into the host cell membrane, inducing the conformational change of the s subunit, which forms a six-helix bundle ( -hb) with the hr and hr trimers. antibodies (e.g., a against sars-cov) that target the hr domains might block viral fusion. ab, antibody.
the bat coronavirus ratg , indicating that the bat coronavirus might be the origin of the sars-cov- [ ].
furthermore, sars-cov- might be the result of a recombination between bat (ratg ) and pangolin coronaviruses, as particularly indicated in the s protein sequence [ ] . the receptor binding domain (rbd) of the sars-cov- s protein contains several novel residues that might be introduced through recombination with the pangolin coronavirus, indicating a possible critical step in the evolution of the ability of sars-cov- to infect humans [ ] . the structures of sars-cov- s protein trimer [ ] and human ace [ ] have been rapidly solved using modern cryoelectron microscopy (cryo-em). the affinity of the sars-cov- rbd for human ace appears stronger than the sars-cov rbd. the structural analysis of the rbd-ace complex reveals some of the key mutations on the rbd, such as f and n , that form stronger contacts with human ace [ ] . interestingly, these residues can be found in the pangolin coronavirus [ ] . in the review paper from dr zhiqiang an's group in the university of texas health science center at houston, ku et al. summarized current findings on the structures and functions of sars-cov- viral proteins. they describe potential strategies for repurposing drugs for the treatment of covid- and the current development of vaccines, plasma therapies, and neutralizing antibodies [ ] . their review highlights the potential viral targets, screening methods, in vitro and in vivo models, as well as discussing potential antibody-dependent enhancement (ade) and fc engineering for developing neutralizing antibodies for treating covid- patients. in particular, their review describes major screening strategies for the discovery and development of sars-cov- neutralizing antibodies and provides several representative examples using these methods. antibody sources may include memory or plasma b cells from recovered patients, phage, yeast and ribosome libraries, or from mouse, rabbit, monkey, and llama immunizations. 
most antibodies are tested for their ability to block s protein (or rbd) binding to ace and preventing spike-mediated membrane fusion. antibody activity is tested either by using a pseudovirus-based neutralization assay or by a live virus-based neutralization assay. animal models such as human ace transgenic mice are also summarized in their review article. in the research paper from chengdu medical college and ablink biotech co., ltd in china, zeng et al. isolated a human monoclonal antibody (named "rrbd- ") that inhibits the interaction of the rbd of sars-cov- and the ace and neutralizes the pseudovirus infection [ ] . the group used a competitive screening strategy to isolate human antibodies from a phage display library. in the first round of phage panning, they followed the standard procedure by screening phage on immobilized rbd. after the first-round enrichment of rbd binders, in the nd and rd rounds, they immobilized ace and added the mixture of free rbd and a phage pool enriched from the st round. phage that bind rbd at epitopes different from the ace -binding site were captured by the immobilized ace to form a "sandwich" complex. phage that competed with ace for rbd were in the supernatant along with presumably unknown amounts of nonbinders or nonspecific binders. the phage that bind the ace site on the rbd were then isolated by magnetic beads using the histidine tag on the rbd. the key for success using this strategy is the ratio of immobilized ace , free rbd, and phage concentrations in solution. therefore, they used a low concentration of rbd ( μg/ml) close to the ec value of rbd binding to ace and a low concentration × pfu of the phage they enriched from st round of panning on rbd. standard phage panning protocols used about times more phage or about × pfu. in this way, they expected to reduce nonspecific binding of phage. antibody therapeutics, several important questions are raised in the development of neutralizing antibodies for treating covid- . 
could such antibodies be cross-reactive with other sars-related covs (sarsr-covs)? could such cross-reactive antibodies have neutralizing activities for all sarsr-covs? what would be ideal targets or epitopes for cross-neutralizing antibodies? selection of cross-neutralizing antibodies would be useful for treating not only current covid- patients but also future sarsr-cov infections. the rbd of the s protein is the primary target for neutralizing antibodies. many known neutralizing antibodies, including s , m , and r, are specific for sars-cov rbd but fail to bind sars-cov- even at concentrations up to μm [ ]. polyclonal antibodies from mice immunized with a stabilized sars-cov s protein can inhibit sars-cov- entry into target cells. this suggests that immunity against one sarsr-cov can potentially provide protection against related sars-cov [ ]. in contrast, another study showed that polyclonal rabbit anti-sars-cov s antibodies (t ) inhibited entry of sars-cov, but not sars-cov- pseudovirus [ ]. in addition, sera from recovered sars and covid- patients show only modest cross-neutralization, suggesting that recovery from one sarsr-cov infection might not protect against the other. a recent report showed that none of the monoclonal antibodies isolated from sars-cov- infected individuals by single b-cell sorting were cross-reactive with the rbd of sars-cov [ ]. in a research article published in our journal, zeng et al. did not test the cross-reactivity of the rrbd- human antibody for sars-cov rbd [ ]. in the review article, ku et al. discuss two unique cross-reactive antibodies, cr and d , which bind highly conserved epitopes in the rbd [ ]. cr is an antibody that binds both sars-cov and sars-cov- rbds, but it cannot neutralize sars-cov- as it does sars-cov [ ]. more recently, wang et al. identified the d monoclonal antibody that can neutralize both sars-cov and sars-cov- infection [ ].
the d antibody was isolated from transgenic mice immunized sequentially with purified s proteins of different coronaviruses (hcov-oc , sars-cov, and mers-cov). the transgenic mice produce chimeric immunoglobulins with human variable regions and rodent constant regions. four of antibodies specific for the s protein of sars-cov show cross-reactivity with the s subunit of sars-cov- . among them, the d antibody exhibits the cross-neutralizing activity of sars-cov- and sars-cov in cell culture. interestingly, d binds the rbd but does not block the interaction of rbd and ace , indicating that d might bind a highly conserved epitope of the rbd distinct from the ace binding site. previous studies indicate that sars-cov rbd antibodies that block the interaction of the rbd and ace are not cross-reactive with sars-cov- rbd [ ] . cr also binds a highly conserved epitope on the rbd and binds both sars-cov and sars-cov- rbds [ ] , but unlike d , it does not have the cross-neutralizing activity against sars-cov- . the structure complex of d and the rbd (or the s /s protein) would reveal a novel conserved site on the rbd for broad-neutralizing antibodies against sarsr-covs. in addition to d , another human antibody (s ) isolated from memory b cells of a sars survivor infected in neutralizes sars-cov- [ ] . interestingly, s recognizes a glycan-containing epitope on the rbd in both the open and closed s states. the cryo-em structure of the complex of s and sars-cov- s protein indicates that the antibody engages an epitope distinct from the ace binding motif and would not clash with ace for its binding to s protein. the glycan recognition of s implies the importance of the n-glycans in sars-cov- s protein. furthermore, antibody cocktails containing s further improved sars-cov- neutralization and might be useful for preventing or mitigating virus escape mutants. this supports the idea that antibody cocktails could be more effective than single antibody therapy. 
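the reasoning behind antibody cocktails can be illustrated with a small calculation. the sketch below is not taken from the cited studies; it assumes, as an idealization, that escape from each antibody is an independent event (plausible only when the antibodies bind non-overlapping epitopes), and the per-antibody escape probabilities are hypothetical values chosen for illustration. under that assumption, the chance that one variant escapes every antibody in the cocktail is the product of the individual escape probabilities.

```python
from math import prod


def joint_escape_probability(escape_probs):
    """Chance that a single variant escapes every antibody in a cocktail.

    Assumes escape from each antibody is an independent event, which is an
    idealization that requires the antibodies to bind non-overlapping epitopes.
    """
    for p in escape_probs:
        if not 0.0 <= p <= 1.0:
            raise ValueError("each escape probability must lie in [0, 1]")
    return prod(escape_probs)


# Hypothetical per-antibody escape probabilities, for illustration only.
single = joint_escape_probability([1e-4])
pair = joint_escape_probability([1e-4, 1e-4])

print(f"single antibody:       {single:.1e}")
print(f"two-antibody cocktail: {pair:.1e}")
```

if the epitopes overlap, a single mutation may defeat both antibodies, and the product rule would overstate the benefit of the cocktail.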
besides human neutralizing antibodies, a single domain camelid antibody (vhh- ), commonly called "nanobody", was isolated from a llama immunized with sars-cov s and mers-cov s. this nanobody showed reactivity with sars-cov- s protein by binding to a highly conserved epitope on the rbd partially overlapping the cr binding site [ ] . however, unlike cr , the bivalent vhh- -fc fusion protein not only prevents the binding of ace but also has neutralizing activity against sars-cov- pseudovirus. the neutralizing effects of d and vhh- suggest that co-immunizing animals with s proteins from sars-cov- and other coronavirus may produce potent broad-neutralizing antibodies against sars-cov- by targeting the rbd. single domain antibodies have unique binding features [ , ] , so they can bind novel viral conformational epitopes including highly conformational and/or buried sites unreachable by conventional antibodies. besides their unique binding features, other advantages include the construction of multivalent/multispecific molecules and thermostability/chemostability [ ] . for respiratory infection, single domain antibodies might be particularly attractive because they might be administered as an inhaler directly to the site of infection [ ] . both d and s are fully human igg molecules, whereas vhh- is a single domain antibody derived from a llama heavy chain antibody that has not been humanized yet. for therapeutic applications, the camelid-specific sequences in the framework may need to be mutated to their human heavy chain variable domain equivalent [ ] . its suitability for prophylactic and therapeutic treatments remains to be determined. it might be useful to analyze the mutations of sars-cov- as it spreads worldwide, so neutralizing antibodies can be effective for multiple strains of the virus [ ] [ ] [ ] . up to now, most of the mutations found in the rbd of sars-cov- are of low frequency [ ] . 
the g s mutation is located in the binding interface of the rbd with ace ; however, it occurred in early samples and diminished in later samples, indicating that viruses with mutations in this critical functional site might not have an advantage for survival or spread. while the rbd is the focus for the development of neutralizing antibodies against sars-cov- , the function of non-rbd regions is poorly understood. recently, an antibody ( a ) isolated from convalescent covid- patients was shown to bind the n-terminal domain (ntd) of the sars-cov- s protein and to exhibit high neutralization potency against sars-cov- . structural analysis has confirmed that it binds the ntd, not the rbd which directly interacts with ace , demonstrating a new vulnerable epitope in the s subunit as a target of neutralizing antibodies for treating covid- [ ]. antibodies that target the s protein beyond the s subunit have rarely been reported. the s subunit, in particular the heptad repeat (hr) loops including the hr and hr domains, which is required for membrane fusion, might be another target. the a antibody is the only known monoclonal antibody that binds the hr domain on the s subunit of sars-cov [ ] (fig. ). a more recent study showed that sars-cov- had a superior plasma membrane fusion capacity compared with that of sars-cov [ ]. the x-ray crystal structure of the six-helical bundle ( -hb) core of the hr and hr domains in the sars-cov- s protein s subunit has also been solved [ ]. a lipopeptide (ek c ) that disturbs viral -hb formation by binding the hr domain can inhibit the fusion of sars-cov- as well as other human coronaviruses including sars-cov and mers-cov, suggesting that a broad inhibitor targeting the hr region should be tested for the treatment of infection by current and future sarsr-covs. besides widely studied protein targets, glycan targets might be worth exploring as well. the s protein of sars-cov- is highly glycosylated [ ].
isolation of the s neutralizing antibody that recognizes a glycan-containing epitope on the rbd indicates that glycosylation on the s protein would affect the development of neutralizing antibodies targeting sars-cov- [ ] . on the host cell, the primary target is the viral entry protein, ace . it has been proposed that recombinant ace might be used as a potential inhibitor to block the virus entry [ ] . the ace -human igg fc fusion has been engineered. the fusion protein can neutralize pseudoviruses that express the s proteins of sars-cov- or sars-cov in cell culture [ ] . heparan sulfate proteoglycans (hspgs) provide the initial sites for the virus to make primary contact with the cell surface [ ] . my laboratory research has been focused on the biology of hspgs, in particular glypicans, and their roles as targets in cancer therapy [ ] . using one of our human antibodies (hs ) specific for heparan sulfate oligosaccharides with high affinity [ ] [ ] [ ] , we and collaborators previously showed that the hs antibody can block the attachment of pathogenic polyomaviruses on cells [ ] . interestingly, treatment of the cells with heparinase or exogenous heparin prevents the binding of the s protein to host cells and inhibits sars pseudovirus infection [ ] , suggesting that in addition to ace , hspgs might be essential for sars-cov entry into host cells. blocking the hspgs on human cells by therapeutic antibodies is worth investigating as another potential strategy for treating covid- . many antibodies capable of neutralizing specifically either sars-cov or sars-cov- , but not both, have been identified and reported through many methodologies. a very few and very special sars-cov and sars-cov- crossneutralizing antibodies have also been documented, including d , s , and vhh- , among which the former two, which are human monoclonal antibodies, are ace nonblockers, whereas the third one, which is a llama single domain antibody, is an ace blocker. 
competitive phage display panning might be a new way to identify both ace blockers and nonblockers. even though zeng et al. only reported the identification of one antibody, rrbd- , that blocked ace and neutralized sars-cov- [ ], the same phage display competitive panning strategy could also be used to identify antibodies that do not block the interaction between s protein and ace yet still neutralize viral infections. more importantly, with s proteins of both sars-cov and sars-cov- as competitors, this competitive panning strategy could be utilized to identify both ace blocking and nonblocking antibodies against both viruses with functional profiles similar to d , s , and vhh- , respectively. in the last years, human coronaviruses have infected humans and caused three major outbreaks due to sars-cov, mers-cov, and sars-cov- . an urgent and important challenge in modern medicine is whether we could identify a so-called "universal" target or strategy for inhibiting sarsr-cov or even all coronaviruses. the molecular mechanisms of sars-cov- infection are not yet fully understood. more research on sars-cov- biology is urgently needed. current challenges in developing neutralizing antibodies against sars-cov- include mutations in less conserved regions of the s subunit, possible antigenic drift, immunodominant epitopes, ade potentially induced by nonneutralizing antibodies, or increased affinity of viral s protein for ace [ ]. it is important to identify and develop neutralizing antibodies, such as d , s , and vhh- , against highly conserved regions of the rbd or s , to combat not only various strains of sars-cov- but also, more broadly, other sarsr-covs. furthermore, a single monoclonal antibody therapy might not be enough. there might be different strains of the virus that cannot be recognized by the antibody. mutations in the virus can lead to escape variants [ ].
multiple strategies and combination of multiple mechanisms are highly expected as described in mers [ ] and sars [ ] antibody development. a combination of two (or more) antibodies that recognize different parts including both neutralizing and nonneutralizing epitopes (e.g., rbd, ntd, hr, and glycan) of the viral surface that interact with human cells might be the most effective. future therapeutic applications could include cocktail therapy by combining early transmission dynamics in wuhan, china, of novel coronavirus-infected pneumonia clinical course and risk factors for mortality of adult inpatients with covid- in wuhan, china: a retrospective cohort study the race is on for antibodies that stop the new coronavirus antibody therapies for the treatment of covid- isolation of a human monoclonal antibody specific for the receptor binding domain of sars-cov- using a competitive phage biopanning strategy induction of neutralizing antibodies by human papillomavirus vaccine generated in mammalian cells construction and next-generation sequencing analysis of a large phage-displayed vnar single-domain antibody library from six naive nurse sharks renin-angiotensin-aldosterone system inhibitors in patients with covid- receptor recognition by the novel coronavirus from wuhan: an analysis based on decadelong structural studies of sars coronavirus the proximal origin of sars-cov- cryo-em structure of the -ncov spike in the prefusion conformation structural basis for the recognition of sars-cov- by full-length human ace structure, function, and antigenicity of the sars-cov- spike glycoprotein characterization of spike glycoprotein of sars-cov- on virus entry and its immune cross-reactivity with sars-cov human neutralizing antibodies elicited by sars-cov- infection a highly conserved cryptic epitope in the receptor-binding domains of sars-cov- and sars-cov a human monoclonal antibody blocking sars-cov- infection structural and functional analysis of a potent sarbecovirus 
neutralizing antibody structural basis for potent neutralization of betacoronaviruses by single-domain camelid antibodies crystal structure of a shark single-domain antibody v region in complex with lysozyme a cold-blooded view of adaptive immunity ancient species offers contemporary therapeutics: an update on shark vnar single domain antibody sequences, phage libraries and potential clinical applications general strategy to humanize a camelid single-domain antibody and identification of a universal humanized nanobody scaffold phylogenetic network analysis of sars-cov- genomes patient-derived mutations impact pathogenicity of sars-cov- spike mutation pipeline reveals the emergence of a more transmissible form of sars-cov- a potent neutralizing human antibody reveals the n-terminal domain of the spike protein of sars-cov- as a site of vulnerability monoclonal antibodies targeting the hr domain and the region immediately upstream of the hr of the s protein neutralize in vitro infection of severe acute respiratory syndrome coronavirus inhibition of sars-cov- (previously -ncov) infection by a highly potent pan-coronavirus fusion inhibitor targeting its spike protein that harbors a high capacity to mediate membrane fusion therapeutic strategies in an outbreak scenario to treat the novel coronavirus originating in wuhan neutralization of sars-cov- spike pseudotyped virus by recombinant ace -ig inhibition of sars pseudovirus cell entry by lactoferrin binding to heparan sulfate proteoglycans glypicans as cancer therapeutic targets isolation of antibodies to heparan sulfate on glypicans by phage display human monoclonal antibody targeting the heparan sulfate chains of glypican- inhibits hgf-mediated migration and motility of hepatocellular carcinoma cells epitope mapping by a wnt-blocking antibody: evidence of the wnt binding domain in heparan sulfate infectious entry and neutralization of pathogenic jc polyomaviruses human monoclonal antibody combination against sars 
coronavirus: synergy and coverage of escape mutants towards a solution to mers: protective human monoclonal antibodies targeting different domains and functions of the mers-coronavirus spike glycoprotein
antibodies (including multiple single domain antibodies) that target different epitopes via different mechanisms.
i would like to thank chin-hsien tai, yaping sun, bryan fleming, and jessica hong (nci) for reading the manuscript, and alan hoofring and ethan tyler (nih medical arts design section) for making the figure. the content of this publication does not necessarily reflect the views or policies of the department of health and human services nor does mention of trade names, commercial products, or organizations imply endorsement by the u.s. government. the hs human monoclonal antibody is available for licensing, in a wide range of fields of use, from the national cancer institute. if you are interested in obtaining a license, please contact dr. mitchell ho. the author is supported by the intramural research program of nih, nci (z bc , zia bc , and nci ccr antibody engineering program). none declared.
key: cord- -wtmjt hf authors: zha, lisha; zhao, hongxin; mohsen, mona o.; hong, liang; zhou, yuhang; li, zehua; yao, chuankai; guo, lijie; chen, hongquan; liu, xuelan; chang, xinyue; zhang, jie; li, dong; wu, ke; vogel, monique; bachmann, martin f; wang, junfeng title: development of a covid- vaccine based on the receptor binding domain displayed on virus-like particles date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: wtmjt hf
the recently emerging disease covid- is caused by the new sars-cov- virus first detected in the city of wuhan, china. from there it has been rapidly spreading inside and outside china. with initial death rates around %, covid- patients at longer distances from wuhan showed reduced mortality as was previously observed for the sars coronavirus.
however, the new coronavirus spreads more strongly, as it sheds long before onset of symptoms or may be transmitted by people without symptoms. rapid development of a protective vaccine against covid- is therefore of paramount importance. here we demonstrate that recombinantly expressed receptor binding domain (rbd) of the spike protein homologous to sars binds to ace , the viral receptor. highly repetitive display of rbd on immunologically optimized virus-like particles derived from cucumber mosaic virus resulted in a vaccine candidate (rbd-cumvtt) that induced high levels of specific antibodies in mice which were able to block binding of spike protein to ace and potently neutralized the sars-cov- virus in vitro. covid- is caused by a novel coronavirus closely related to viruses causing sars and mers. as with the disease caused by the other two viruses, covid- mainly manifests symptoms in the lung and causes cough and fever . the disease covid- is less severe than sars and mers, which is beneficial per se but leads to easier and wider spread of the virus, in particular due to infected individuals with very few symptoms ("spreaders") and a long incubation time (up to weeks) combined with viral shedding long before disease onset .
a vaccine with rapid onset of protection is therefore in high demand for the control of the pandemic that is currently taking its course. the spike protein of covid- is highly homologous to the spike protein of sars and both viruses share the same receptor, which is angiotensin converting enzyme (ace ) , . the receptor binding domain (rbd) of the sars spike protein binds to ace and is an important target for neutralizing antibodies [ ] [ ] [ ] . by analogy, the rbd of covid- spike protein may be expected to similarly be the target of neutralizing antibodies, blocking the interaction of the virus with its receptor. we have previously shown that antigens displayed on virus-like particles (vlp) induce high levels of antibodies in all species tested, including humans . more recently, we have developed an immunologically optimized vlp platform based on cucumber mosaic virus. these cumvtt vlps (hereafter cumvtt) incorporate a universal t cell epitope derived from tetanus toxin providing pre-existing t cell help. in addition, during the production process these vlps package bacterial rna which is a ligand for toll-like receptor / and serves as potent adjuvants . using antigens displayed on these vlps, it was possible to induce high levels of specific antibodies in mice, rats, cats, dogs and horses and treat diseases such as atopic dermatitis in dogs or insect bite hypersensitivity in horses [ ] [ ] [ ] . to generate a covid- vaccine candidate, we therefore attempted to display the rbd domain on cumvtt (fig. a) . to this end we gene-synthesized the covid- rbd domain and fused it to an fc molecule for better expression. as expected, the protein bound efficiently to the viral receptor ace as determined by sandwich elisa (fig. b) . in a next step, the protein was chemically coupled to the surface of cumvtt using the well established chemical cross-linkers sata and smph (ref. ). 
sds-page and western blotting confirmed efficient coupling of the rbd-fusion molecule to cumvtt, resulting in the vaccine candidate rbd-cumvtt (fig. c,d) . to test immunogenicity of the vaccine candidate, mice were immunized three times (weekly schedule) with the rbd-fusion molecule alone or conjugated to the surface of cumvtt formulated in montanide adjuvants. as shown in fig. a -c, coupling to vlps dramatically increased the immunogenicity of the rbd. as shown by elisa on recombinant rbd, rbd-cumvtt showed strongly increased immunogenicity at all time-points tested (one week after the vaccine injection time-points). to assess the potential for anti-viral activity, we tested whether the induced antibodies were able to block binding of the rbd protein to the viral receptor ace . as shown in fig. , immune sera obtained after two boosts (day ) were able to strongly inhibit rbd binding to ace . the best correlate of protection is viral neutralization. to this end, we generated pseudotyped retroviruses expressing the sars-cov- spike protein and luciferase for quantification of infection (fig. a ). using these viruses, the neutralizing capacity of the sera from immunized mice was assessed on ace -transfected cells (fig. b) , directly demonstrating high anti-viral neutralizing activity of the induced antibodies. hence, the rbd-cumvtt vaccine candidate is able to induce high levels of sars-cov- neutralizing antibodies. furthermore, the cumvtt based vaccine relies on highly efficient expression systems and chemical conjugation technologies, rendering it an attractive candidate for large scale production under cgmp. previous studies with a similar vlp-based conjugate vaccine have demonstrated that high levels of specific antibodies can be mounted within a week , (see also fig. a) , offering the additional possibility to rapidly immunize individuals that have been exposed to infected humans or those that are kept in quarantine.
thus, vaccines based on the sars-cov- rbd domain displayed on vlps may have the potential to critically interfere with global spread of the virus. the sars-cov- receptor-binding domain (rbd) and the n-terminal peptidase domain of human ace were expressed using f cells (invitrogen). the sars-cov- rbd (residues arg -phe ) with an n-terminal il- signal peptide for secretion and a c-terminal fc tag for purification was inserted into pfuse-migg -fc vector (invitrogen). the construct was transformed into bacterial dh α competent cells, and the extracted plasmid was then transfected into f cells at a density of × cells/ml using pei (invitrogen). the cell culture supernatant containing the secreted rbd was harvested h after transfection, concentrated and buffer-exchanged to hbs ( mm hepes, ph . , mm nacl). rbd was captured by protein a resin (ge healthcare) and eluted with gly-hcl buffer ph . . fractions containing rbd were collected and neutralized to ph . with m tris. for elisa coating, ace was cleaved from the fc part using thrombin as described in the manufacturer's manual. the human ace (residues ser -ser ) with an n-terminal il- signal peptide for secretion and a c-terminal ×his tag for purification was inserted into pfuse-vector (invitrogen). the human ace was expressed by essentially the same protocol used for the sars-cov- rbd. ace was captured by ni-nta resin (ge healthcare) and eluted with mm imidazole in hbs buffer. ace was then purified by gel filtration chromatography using the superdex column (ge healthcare) pre-equilibrated with hbs buffer. fractions containing ace were collected. the antibody competitive binding activities of the serum were assayed by elisa. ace ( ug/ml) was incubated in a -well plate overnight at °c. after incubation, the plate was blocked with % bsa for h at °c and then washed five times with pbs containing . % tween . bsa was used as negative control, followed by the addition of a mixture of -fold diluted serum and rbd-mfc ( .
ug/ml) followed by incubation for min with gentle shaking at °c. plates were washed five times with pbs containing . % tween (pbt) followed by µl of horseradish peroxidase/anti-mfc antibody conjugate (diluted : in pbt buffer), incubated min with gentle shaking. plates were washed five times pbt buffer and developed with µl of freshly prepared , ', , '-tetramethylbenzidine (tmb) substrate. reaction was stopped with µl of . m h po and read spectrophotometrically at nm in a microtiter plate reader. the production of cumvtt was described in detail in zeltins et al. briefly, e coli c cells the rbd was conjugated to cumvtt using the cross-linker succinimidyl -(betamaleimidopropionamido) hexanoate (smph) (thermo fisher scientific, -molar excess, minutes, °c). the coupling reactions were performed with . x molar excess of rbd, . x rbd, or equal molar amount of rbd regarding the cumvtt (shaking at °c for hours at rpm on dsg titertek; flow laboratories, irvine, united kingdom). unreacted smph and rbd proteins were removed using amicon-ultra . , k (merck-millipore, burlington, mass). vlp samples were centrifuged for minutes at , rpm for measurement on nd- . coupling efficiency was calculated by densitometry (as previously described for il a-cumvtt vaccine ), with a result of approximately % to %. pseudovirus expressing the sars-cov- spike protein was produced by lentivirus second- the t-ace cells which stably express ace receptors on the cell membrane were prepared by transfection of ace gene into t cells using lentivirus system. pseudoviruses prepared above were added to the t-ace cells ( × cells/well) with μl polybrene ( μg/ml). after h, the infection was monitored using the luciferase assay system (promega). titer was calculated based on serial dilutions of pseudovirus. the mouse serum samples ( μl) were diluted to : , : , : , : and : respectively, and then mixed with an equal volume of pseudovirus stock. 
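the readout of this pseudovirus neutralization assay can be sketched in a few lines of python. this is a minimal illustration under stated assumptions, not the authors' analysis code: it assumes luciferase signals are normalized between a virus-only control and a medium-only control, and it takes the cut-off as a parameter because its value is not recoverable from the text. the function names and all numbers below are hypothetical.

```python
def percent_neutralization(sample_rlu, virus_only_rlu, medium_only_rlu):
    """Percent reduction in luciferase signal relative to the virus-only control."""
    window = virus_only_rlu - medium_only_rlu
    if window <= 0:
        raise ValueError("virus-only control must exceed medium-only background")
    return 100.0 * (1.0 - (sample_rlu - medium_only_rlu) / window)


def neutralizing_titer(readings, virus_only_rlu, medium_only_rlu, cutoff_percent):
    """Return the highest dilution factor whose neutralization exceeds the cut-off.

    `readings` maps a serum dilution factor (e.g. 40 for 1:40) to its
    luciferase signal. Returns None when no dilution passes the cut-off.
    """
    passing = [
        dilution
        for dilution, rlu in readings.items()
        if percent_neutralization(rlu, virus_only_rlu, medium_only_rlu) > cutoff_percent
    ]
    return max(passing) if passing else None


# Hypothetical plate: every value here is invented for illustration only.
readings = {20: 1_500, 40: 3_000, 80: 6_000, 160: 8_000, 320: 9_800}
titer = neutralizing_titer(
    readings, virus_only_rlu=10_000, medium_only_rlu=1_000, cutoff_percent=50.0
)
print(titer)
```

reporting the highest passing dilution is one common convention; interpolating between the two dilutions that bracket the cut-off is an alternative that the text neither confirms nor excludes.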
after incubation at °c for h, the mixture was inoculated on the t-ace cells ( x cells/well). at the same time, pseudovirus+dmem medium was set as a positive control and dmem medium only was set as a negative control. after the cells were incubated for hours, serum neutralization was measured by luciferase activity of infected pseudovirus. a cut-off of > % was used as to determine neutralizing titer. clinical characteristics of coronavirus disease in china characteristics of and important lessons from the coronavirus disease (covid- ) outbreak in china: summary of a cases from the chinese center for disease control and prevention angiotensin-converting enzyme is a functional receptor for the sars coronavirus composition and divergence of coronavirus spike proteins and host ace receptors predict potential intermediate hosts of sars-cov- a -amino acid fragment of the sars coronavirus s protein efficiently binds angiotensin-converting enzyme an efficient method to make human monoclonal antibodies from memory b cells: potent neutralization of sars coronavirus receptor-binding domain of sars-cov spike protein induces highly potent neutralizing antibodies: implication for developing subunit vaccine therapeutic vaccines for chronic diseases: successes and technical challenges incorporation of tetanus-epitope into virus-like particles achieves vaccine responses even in older recipients in models of psoriasis, alzheimer's and cat allergy vaccination against il- for the treatment of atopic dermatitis in dogs treating insect-bite hypersensitivity in horses with active vaccination against il- pseudotyped lentiviral vectors: one vector, many guises. 
hum interaction of viral capsidderived virus-like particles (vlps) with the innate immune system vaccine against peanut allergy based on engineered virus-like particles displaying single major peanut allergens generation of high-titer pseudotyped lentiviral vectors key: cord- - br lwma authors: zeng, hao; wang, dongfang; nie, jingmin; liang, haoyu; gu, jiang; zhao, anne; xu, lixin; lang, chunhui; cui, xiaoping; guo, xiaolan; zhou, changlong; li, haibo; guo, bin; zhang, jinyong; wang, qiang; fang, li; liu, wen; huang, yishan; mao, wei; chen, yaokai; zou, quanming title: the efficacy assessment of convalescent plasma therapy for covid- patients: a multi-center case series date: - - journal: signal transduct target ther doi: . /s - - -x sha: doc_id: cord_uid: br lwma convalescent plasma (cp) transfusion has been indicated as a promising therapy in the treatment for other emerging viral infections. however, the quality control of cp and individual variation in patients in different studies make it rather difficult to evaluate the efficacy and risk of cp therapy for coronavirus disease (covid- ). we aimed to explore the potential efficacy of cp therapy, and to assess the possible factors associated with its efficacy. we enrolled eight critical or severe covid- patients from four centers. each patient was transfused with – ml of cp from seven recovered donors. the primary indicators for clinical efficacy assessment were the changes of clinical symptoms, laboratory parameters, and radiological image after cp transfusion. cp donors had a wide range of antibody levels measured by serology tests which were to some degree correlated with the neutralizing antibody (nab) level. no adverse events were observed during and after cp transfusion. 
following cp transfusion, six out of eight patients showed improved oxygen support status; chest ct indicated varying degrees of absorption of pulmonary lesions in six patients within days; the viral load was decreased to a negative level in five patients who had the previous viremia; other laboratory parameters also tended to improve, including increased lymphocyte counts, decreased c-reactive protein, procalcitonin, and indicators for liver function. the clinical efficacy might be associated with cp transfusion time, transfused dose, and the nab levels of cp. this study indicated that cp might be a potential therapy for severe patients with covid- . in december , the outbreak of coronavirus disease (covid- ) caused by severe acute respiratory syndrome coronavirus (sars-cov- ) emerged in wuhan, china, and has rapidly spread around the world. covid- can manifest on a spectrum of illness from mild disease to severe respiratory failure requiring intensive care unit admission. the world health organization (who) has declared covid- a pandemic on march , . as of may , , it had caused a total of , , cases of infection and resulted in , deaths globally. epidemiology and etiology for covid- are rapidly evolving, giving us a greater understanding of those at risk and elucidating more potential therapy targets. in addition to supportive care, such as oxygen support and extracorporeal membrane oxygenation, several drugs for this disease are still being researched, such as remdesivir, lopinavir/ritonavir, arbidol, and darunavir. , however, up to now, no approved vaccine or specific antiviral agents has been proved to be effective to prevent or treat sars-cov- infection due to the absence of evidence. passive immunity delivered as neutralizing antibodies (nabs) from convalescent plasma (cp) may offer an alternative therapeutic approach for covid- . 
cp therapy has been empirically used in other epidemics, including sars, middle east respiratory syndrome (mers), and influenza a (h n ). - a metaanalysis of reports on sars coronavirus infection and severe influenza revealed a statistically significant reduction of mortality after administration of cp, especially when cp was given early after symptom onset. however, in a case series on influenza a (h n ) virus infection, nonsignificant benefits following the intervention of cp were reported, and no association of cp therapy with an increased survival was observed in patients with ebola virus disease. it is possibly due to the unknown levels of nabs in the infused plasma, which may obscure the effects of cp. in this current pandemic, preliminary studies suggested the effectiveness of cp with no severe adverse events to treat patients infected with sars-cov- . [ ] [ ] [ ] [ ] [ ] the results from a pilot study applying cp transfusion for severe patients showed that administration of cp with nab titers above : led to improvement in clinical symptoms and pulmonary lesions. these findings indicate that cp transfusion may be a promising therapy in the treatment for covid- . nonetheless, due to the limitations of the study design and small sample size, current evidence on the efficacy and safety of cp therapy for covid- is still limited. moreover, the quality control of cp and individual variation in patients in different studies make it rather difficult to evaluate the efficacy and risk of cp therapy. thus, more supporting evidence (such as multi-level assessment of specific antibodies in cp, indications for cp treatment, and selection of transfusion timing) is called for with wider adoption of cp for covid- in multi-centers and regions. 
herein, we performed a retrospective observational study involving eight critical or severe patients with covid- from four designated hospitals in the southwest region of china, aiming to explore the potential efficacy and safety of cp therapy, and to provide more evidence for the quality control of donated plasma and reasonable clinical application of cp transfusion. clinical characteristics of the patients a total of patients ( males and females) with critical or severe covid- were enrolled. the median age was . years (iqr, . - . years). the median time from symptom onset to hospital admission was . days (interquartile interval (iqr), . - . days). the most common symptoms during hospitalization were cough ( / ), shortness of breath ( / ) , and fever ( / ), while patients had fewer manifestations of dyspnoea (two cases), diarrhea (two cases), headache (one case), and fatigue (one case). five patients had coexisting chronic diseases at admission, including type ii diabetes, hypertension, chronic obstructive pulmonary disease (copd), and coronary heart disease (chd). table listed the drug treatments prior to and after cp transfusion. all patients received combination therapy of various antiviral treatment and other supportive care. the most commonly used antivirals were interferon alfa- b ( / ), lopinavir/ritonavir ( / ), and arbidol ( / ). darunavir and hydroxychloroquine sulfate were also administered for three and two patients, respectively. antibiotic or antifungal agents were used when patients had coinfection. five patients were given corticosteroids at the appropriate situation. chest computed tomography (ct) scans demonstrated that all patients presented bilateral multiple ground-glass opacity or partial consolidation at the time of admission, with primary involvement of subpleural lesions. 
characteristics of convalescent plasma donors in total, seven donors ( males and females) from the participating hospitals who had recovered from sars-cov- infection donated - ml of cp ( table ). the median age was . years (iqr, . - . years). these donors donated the cp at the median day of . (iqr, . - . days) from discharge. all donors had mild or moderate disease during their hospital stay and no comorbidities. we measured sars-cov- specific antibodies using four platforms of immunological tests. the sars-cov- specific antibody titers were detected by magnetic chemiluminescence enzyme immunoassays (mclia) which targeted the combination of nucleoprotein (np) and receptor binding domains of spike protein (s-rbd) specific antigens, as well as by enzyme-linked immunosorbent assays (elisa) which determined anti-np and anti-s-rbd specific igg antibodies separately. the igg titers detected by mclia ranged from : to : , and the igm mclia titers were less than or equal to : in six donors, except donor ( : ). the elisa results showed that the anti-s-rbd and anti-np specific igg titers were in a range of : - : and : - : , respectively. we measured the inhibitory activity of receptor binding (rbia) of the cp samples by a receptor-binding assay, finding the % inhibitory titer (it ) values ranging from : to : . importantly, the neutralizing activity of these plasma samples, which offers the most informative assessment of antiviral activity of patient sera against viral infection, was measured by a pseudovirus based neutralization assay. the nabs of the donated plasma also showed variable levels (nab titer (nat ) range, : - : ), and only three cp donors (donor , , and ) had nat values greater than : . the results of correlation analyses as shown in fig. a indicated that there was a positive correlation between igg mclia titer and s-rbd specific igg elisa titer (r = . , p = . ). nat was positively correlated with s-rbd and np specific igg elisa titers, respectively (r = . 
, p = . ; r = . , p = . , respectively). however, the positive association between igg mclia titer and nat did not show statistical significance (r = . , p = . ). notably, the inhibitory titer (it ) was neither related to nat nor correlated with igg titers. comparing the antibody levels of cp collected at different times, we found that cp donated more than days had higher levels of s-rbd igg elisa titer and igg mclia titer than cp collected at less than or equal to days (fig. b) . the detailed information about cp treatment for the patients is shown in table . these patients were administered one or two transfusions of cp. two transfusions were administered with an interval of less than h. abo-compatible and cross-matched cp was administered at the discretion of the attending clinicians and according to plasma availability. patients received cp transfusion between and days following the onset of symptoms, with three of them given within days from symptom onset. five of eight patients received two doses of - ml of cp within h (a total of or ml), while the other three cases only received one dose of ml. clinical response of cp transfusion adverse effects of cp transfusions. no adverse events were observed in the eight patients after cp transfusion. clinical characteristics. as the patients had been treated with antiviral drugs and oxygen support before cp therapy, the body temperature, heart rate, and systolic pressure were normal even prior to cp transfusion, and remained unchanged within days after cp transfusion as indicated in table . each patient's change in the category of oxygen support during hospitalization is shown in fig. . six of eight patients showed an improvement in the category of oxygen support within days from cp treatment. obvious improvement was observed in patients who were receiving high-flow nasal cannula oxygenation (n = ), or noninvasive positive pressure ventilation (nippv, n = ) prior to cp treatment.
it is notable that patient , , and rapidly shifted from high-flow supplemental oxygen or nippv to low-flow supplemental oxygen within h after cp transfusion. pulmonary lesions on chest ct examinations. chest ct scans showed that pulmonary lesions improved to varying degrees in six out of eight patients. a partial resolution of pulmonary lesions was observed in patient , , and on st day, in patient and on rd day, in patient on th day, and in patient on th day after plasma transfusion, respectively. representative chest ct images of patient - are shown in fig. . laboratory results. we monitored the development of the virus-specific igg and igm antibodies by mclia prior to and after cp transfusion in all patients except patient . in of patients, the igg titer increased within days posttransfusion, with patient , , and presenting the most obvious increment (fig. a) . the igm level was lower than the igg level in all patients and fluctuated within a small range after cp transfusion (fig. b) . sars-cov- viral load, estimated by the cycle threshold (ct) value from reverse transcriptase-polymerase chain reaction (rt-pcr), was positive in five patients before cp transfusion (for the other three cases, ct values were not available). the viral load, as indicated by the ct value, decreased to a negative level in patient and on posttransfusion day , patient on day , patient on day , and patient on day (fig. c, d) , which was basically consistent with the improvement of pulmonary lesions indicated by the chest ct scans mentioned above. arterial blood gas analysis showed that the ratio of the partial pressure of arterial oxygen (pao ) to the fraction of inspired oxygen (fio ) (pao /fio ) (median, . ; iqr, . - . ) prior to transfusion increased one day after transfusion (median, . ; iqr, . - . ), and five patients showed a tendency toward improvement of pao /fio in the following days after cp therapy (table and fig. e ).
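the pao /fio comparison reported above reduces to a simple per-patient ratio followed by a group median. the python sketch below illustrates that arithmetic only; the function name and every measurement in it are hypothetical, and fio is assumed to be recorded as a fraction rather than a percentage.

```python
from statistics import median


def pf_ratio(pao2_mmhg, fio2_fraction):
    """Ratio of arterial oxygen partial pressure (mmHg) to inspired oxygen fraction."""
    if not 0.21 <= fio2_fraction <= 1.0:
        raise ValueError("FiO2 must be a fraction between 0.21 and 1.0")
    return pao2_mmhg / fio2_fraction


# Hypothetical (PaO2, FiO2) pairs for three patients; not data from the study.
before = [(60.0, 0.60), (75.0, 0.50), (90.0, 0.60)]
after = [(80.0, 0.40), (90.0, 0.40), (95.0, 0.35)]

median_before = median(pf_ratio(p, f) for p, f in before)
median_after = median(pf_ratio(p, f) for p, f in after)
print(median_before, median_after)
```

a higher ratio after transfusion means that the same arterial oxygenation is reached with less supplemental oxygen, which is why the ratio is tracked alongside the category of oxygen support.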
lymphocytopenia, which is a prominent feature of critically ill patients with covid- , was also observed in this study, with a median lymphocyte count of . (iqr, . - . ) ( table ). within days following plasma transfusion, the lymphocyte counts showed an increase in out of patients (fig. g ). the changes in white blood cell count (fig. f) and neutrophil count were similar, with an overall downward trend, except that patient , , and , who had the complications of bacterial or fungal pneumonia, presented an increase after cp therapy. as for the inflammatory biomarkers, c-reactive protein (crp) and procalcitonin, which were elevated before plasma transfusion, showed a declining trend following cp treatment in out of patients, and in of patients, respectively (fig. h , i). proinflammatory cytokines, including interleukin- (il- ) and tumor necrosis factor-α (tnf-α), increased in of patients (fig. j) , and in of patients ( supplementary fig. s a) , respectively, as compared to the status before cp therapy. other inflammatory cytokines, such as interferon-γ (ifn-γ), il- , il- , and il- a, showed various alterations in each patient after cp treatment (supplementary fig. s b -e). we also observed a tendency toward increased ratios of proinflammatory to anti-inflammatory cytokines (il- /il- , and il- /il- ) in four patients (table ). concerning the parameters indicative of liver function, the alanine aminotransferase (alt), aspartate aminotransferase (ast), and total bilirubin (tbil) tended to decrease after cp therapy, except for an increase of all these indicators in patient , and elevated alt and ast in patient . the coagulation profile of patients was also monitored following cp treatment, indicating that out of patients maintained a normal prothrombin time, while abnormally elevated d-dimer prior to plasma transfusion (median, . ; iqr, . - . ) still increased within days after plasma treatment in of patients ( supplementary fig s f) .
outcome of patients treated with cp. all patients were discharged from the hospital with a median length of stay of . days (iqr, - . days), except for patient , who remained hospitalized for further treatment of underlying diseases as of april , . assessment of possible factors associated with clinical effects. to assess the factors which might affect the clinical effects, we compared the clinical features between patients who received cp transfusion at different times, with different doses, and with different nab titers (table ) . although the results were not statistically analyzed due to limited samples in each group, patients who received cp transfusion before days from symptom onset tended to show a more rapid negative conversion of viral nucleic acid, and shorter hospital stays compared to patients who were transfused after days. concerning the doses of plasma, we found that the viral nucleic acid in patients transfused with ml of cp had a tendency to turn negative faster than that in patients who received ml of cp. compared to patients treated with cp with nat ≤ : , patients receiving cp with nat > : tended to reach an undetectable level of viral rna in less time and to show a higher increment of igg mclia level, as indicated by the igg titer ratio (the titer at day after cp transfusion divided by the value before cp transfusion). however, the hospitalization was longer for patients receiving cp with high nat , mainly because of patient , who remained hospitalized for treatment of severe complications including acute respiratory distress syndrome (ards). this retrospective observational study explored the potential efficacy and safety of cp treatment in patients who were critically or severely ill with covid- . one or two doses of cp with a total of - ml were well tolerated by all patients without any adverse effects.
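the igg titer ratio used in the results (the titer after cp transfusion divided by the titer before cp transfusion) can be sketched as follows. this is an illustrative python example, not the authors' analysis code: titers are assumed to be stored as reciprocal dilutions (for example 160 for 1:160), the function names are hypothetical, and the example values are invented.

```python
import math


def titer_ratio(pre_titer, post_titer):
    """Fold change in antibody titer; titers are reciprocal dilutions (160 means 1:160)."""
    if pre_titer <= 0 or post_titer <= 0:
        raise ValueError("titers must be positive reciprocal dilutions")
    return post_titer / pre_titer


def doubling_steps(pre_titer, post_titer):
    """Number of two-fold dilution steps separating the two titers."""
    return math.log2(titer_ratio(pre_titer, post_titer))


# Hypothetical patient: titer rises from 1:80 before to 1:320 after transfusion.
ratio = titer_ratio(80, 320)
steps = doubling_steps(80, 320)
print(ratio, steps)
```

expressing the change as doubling steps is a convenience for titers obtained from two-fold dilution series; whether the assays here used two-fold series is an assumption.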
improved clinical conditions as indicated by improvement of oxygen support and chest ct imaging were observed in most patients after cp treatment. the viral load as estimated by the ct value also declined to an undetectable level within days post transfusion. it has been suggested that cp, serving as a method of passive immunotherapy , could significantly reduce the mortality of patients with sars infection. , one possible mechanism for the efficacy of cp therapy is that the nabs from cp may lead to the clearance of viraemia. , our results showed that only plasma from donor , , and had relatively high neutralizing activity (nat > : ). this is consistent with a recent finding that the majority of cp donors had relatively modest neutralizing activity and a small proportion of donors had high neutralization activity. this is not surprising since all donors were previously moderate or mild patients, and there is evidence that mild patients frequently had a lower level of sars-cov- specific antibodies than severe patients. assessing the effects of neutralizing activity of cp on clinical efficacy, we found that patients treated with cp with high nat (> : ) had more obvious improvement than patients receiving cp with low nat (≤ : ), including shorter negative conversion time of viral rna, and higher increment of igg level after cp transfusion. in line with other publications, our results indicated that cp with high concentration of nabs may contribute to the clearance of the virus. given that cp donors who recovered from mild infection may not generate adequate protective antibodies, and that the levels of plasma neutralizing activity required to prevent sars-cov- re-infection are currently unknown, more studies are necessary to assess the minimum threshold of nab titers necessary to prevent sars-cov- reinfection.
in addition to the pseudovirus based neutralization test, this study also employed multiple sars-cov- serology tests and a receptor-binding assay. the results demonstrated that cp donors had a wide range of antibody levels measured across multiple platforms. the pseudotyped virus assay, an alternative to the neutralization test which is considered the optimal assay to determine the antiviral activity of antibodies, can measure how effectively donor plasma or serum inhibits virus infection of target cells. however, it is not feasible to implement the neutralization test or pseudotyped virus assay as a measurement of antiviral antibodies for general population investigation. by contrast, serology tests are more convenient and practical. here we examined the correlations between serology test results and the neutralization assay in the cp samples, which is seldom explored in other studies. our results indicated that s-rbd and np specific igg elisa titers had a significant strong correlation with nab level, and igg mclia titer showed a modest correlation with neutralization activity. however, the inhibitory activity of receptor binding of the cp samples had a low degree of association with neutralization activity. these findings suggest that elisa or mclia assays may serve as a surrogate for the pseudovirus neutralization assay to predict the degree of neutralization activity present in recovered patients or vaccine recipients. studies with larger sample sizes are necessary to further explore these alternative serology tests, which could help to refine cp selection as well as inform the immunogenicity of vaccines against sars-cov- . the treatment timing is considered another important factor associated with the effectiveness of cp therapy. viraemia peaks in the first week of infection for most viral illnesses. patients usually develop a primary immune response by days - , which is followed by virus clearance.
fig. sars-cov- specific antibody levels of cp samples measured by serology tests, receptor-binding assay, and pseudovirus based neutralization assay. a the correlations among anti-sars-cov- specific igg and igm titers detected by commercial mclia kits, anti-s-rbd and anti-np specific igg titers determined by in-house elisa assays, inhibition activity measured by a receptor-binding assay, and neutralizing antibody titer measured by a pseudovirus based neutralization assay. b comparisons of antibody levels between cp samples collected before and after days from symptom onset. mclia magnetic chemiluminescence enzyme immunoassay, elisa enzyme-linked immunosorbent assay, rbd receptor binding domain, np nucleoprotein, it inhibitory titer, calculated as the dilution of plasma that inhibits % rbd-fc binding to receptor ace , nat neutralizing antibody titer, calculated as the highest dilution of plasma that resulted in a % reduction of virus infection, gmt geometric mean titer, ci confidence interval.
table . detailed information about patients receiving convalescent plasma treatment.
the efficacy assessment of convalescent plasma therapy for covid- . . . zeng et al.
the largest study, which involved cp treatment of patients with sars in hong kong, found that a better clinical outcome was observed among patients who were given cp before day of illness and among cases who were pcr positive and seronegative for coronavirus at the time of plasma infusion. a recent study on covid- demonstrated that cp therapy could not reduce the mortality rate in critically ill patients with end-stage disease. thus, to obtain the greatest benefit from cp, treatment should be administered early in the course of the disease (e.g., before sars-cov- seroconversion). in our study, only three patients were given cp before days from illness onset, and all patients had developed sars-cov- -specific igg before cp transfusion.
these three patients tended to show more rapid negative conversion of viral rna and shorter hospital stays than the patients who were transfused after days. late administration of cp may explain why the patients with critical illness and complications did not show obvious clinical improvement. specifically, patient , who was given cp transfusion on the th day of infection and had suffered from bacterial pneumonia prior to cp therapy, showed the latest conversion of viral nucleic acid, on posttransfusion day . on the other hand, patient showed a rapid decrease of ct value on posttransfusion day and obvious clinical improvement after receiving cp early after disease onset (day ). our results support the view that cp treatment early in the course of disease may be more effective in potentially critically ill patients with covid- . most patients with severe covid- had substantially elevated levels of proinflammatory cytokines, a condition characterized as cytokine release syndrome. our study also observed abnormally high levels of proinflammatory cytokines (especially il- ) in some patients prior to cp therapy. notably, inflammatory cytokines, including il- and tnf-α, and proinflammatory/anti-inflammatory ratios (il- /il- and il- /il- ) unexpectedly continued to increase within days of cp treatment in almost half of the patients, which is inconsistent with another study on cp treatment for covid- . this is probably because increased systemic cytokine production may drive the pathophysiology of severe covid- , including ards and multiple organ failure, and cp transfusion may be unable to attenuate the inflammatory damage quickly. elevated il- level was found to be a stable indicator of poor outcome in patients with severe covid- with pneumonia and ards.
moreover, lymphopenia has been proven to be an effective and reliable indicator of clinical severity in covid- patients. early administration of cp containing nabs may not only inhibit viral entry and replication but also blunt an early proinflammatory pathogenic endogenous response and restore the immune system. thus, we suggest that cp be given at an early stage to patients at high risk of subsequent deterioration (for instance, those with persistently abnormal inflammatory cytokines and lymphopenia) to maximize efficacy and prevent cytokine storms. in addition, to prevent worsening of the disease outcome, it is beneficial to monitor the abovementioned prognostic biomarkers in patients at high risk of developing ards or multiple organ failure, especially those with chronic diseases such as hypertension, diabetes, and copd. based on our findings, the dose of infused cp might play a role in its therapeutic effect: viral nucleic acid in patients transfused with ml of cp tended to become undetectable faster than in patients who received ml of cp. however, a study of cp therapy in sars patients found no correlation between clinical outcome and the volume of infused plasma. future large-scale studies are needed to investigate the association between the dose of cp transfusion and its clinical efficacy. there are some limitations that should be noted in this study. first, this study was a case series with a small sample size, and the outcome of the cp treated patients was not compared with a control group of patients who did not receive the intervention. second, the patients received other therapies (including antiviral agents, antibiotics or antifungal drugs, and corticosteroids), making it impossible to discriminate the specific contribution of cp to the clinical course or outcomes. moreover, cp was administered - days after admission in this study.
the association between transfusion timing and clinical outcomes should be further clarified. in addition, patients in the current study were given different doses of cp. it is unclear whether the doses and the antibody titers were associated with treatment efficacy. despite these limitations, this study provides more evidence that cp therapy might be a promising option to treat covid- patients, which is also supported by the fda's recent emergency use authorization for cp as a potentially promising covid- treatment. overall, this study not only provides more evidence on the potential efficacy and safety of cp therapy, but also contributes to the quality control of donated plasma and the reasonable clinical application of cp transfusion. in conclusion, our preliminary study indicated that cp might be a potential therapy for patients with severe covid- . we observed improvement of clinical features without the occurrence of serious adverse reactions following cp transfusion. further well-designed randomized clinical trials are needed to evaluate the efficacy and safety of cp transfusion, and to identify the best donation candidates with high virus-specific antibodies and the indications for cp therapy (e.g., optimal transfusion time point, early warning indicators, and transfused dose).
this study was performed from february , , to april , , at four centers: chongqing public health medical center, chongqing three gorges central hospital, yongchuan hospital of chongqing medical university, and affiliated hospital of north
this study was approved by the ethical committee of chongqing public health medical center (approval number, - - -ky). all patients signed a written informed consent before any procedure was carried out. if patients could not make rational decisions, the consents were signed by their family members on behalf of the patients. this study was conducted in accordance with the helsinki declaration as revised .
donors for convalescent plasma transfusion cp was obtained from donors who had recovered from covid- infection. the recovery status was defined as follows: ( ) aged between and years; ( ) at least weeks following symptom onset; ( ) afebrile status for at least days; ( ) significant improvement in respiratory symptoms; ( ) two consecutively negative results of sputum sars-cov- of real-time rt-pcr assay (one-day sampling interval). persons who met all criteria were eligible for plasma donation. written informed consent was obtained from each donor. plasma preparation apheresis was performed using haemonetics mcs + ln - e blood cell separator (haemonetics, boston, ma, usa). convalescence plasma for treatment was collected from donors. a or ml of abo-compatible plasma sample was collected from each donor, and each sample was divided and stored as or ml aliquots at °c without any detergent or heat treatment. the cp was then treated with methylene blue and light treatment for min in the medical plasma virus inactivation cabinet (shanghai blood technology co., ltd, shanghai, china). the plasma samples were tested negative for hepatitis b virus, hepatitis c virus, hiv, syphilis, and blood type irregular antibody. as a routine check with plasma donation, the cp was also confirmed negative for residual sars-cov- by rt-pcr. rt-pcr detection of sars-cov- rna throat swab samples were collected from patients for extracting sars-cov- rna using the rna viral kit (daan, guangdong, china). the real-time rt-pcr assay was performed using commercials kit specific for sars-cov- nucleic acid detection (liferiver, shanghai, china; shengxiang, sansure biotech, hunan, china) approved by the china national medical products administration (approve numbers, for liferiver, and for shengxiang). two target genes, including open reading frame ab (orf ab) and nucleocapsid protein (n), were simultaneously amplified using the real-time rt-pcr assay. 
each transcript provided a ct value, which is the number of cycles required for the fluorescent signal to cross the detection threshold. a higher ct value correlates with a lower viral load. a ct value less than was defined as a positive result, and a ct value of or more was defined as a negative result. all procedures involving clinical specimens and sars-cov- were performed in a biosafety level laboratory.
the collected cp and serum samples from the donors and patients were inactivated at °c for min, stored at − °c before testing, and serially diluted before determination. igg and igm against sars-cov- were tested using mclia kits supplied by bioscience co. (tianjing, china) (approved by the china national medical products administration; approval numbers, (igg) and (igm)), according to the manufacturer's instructions. the mclia for igg or igm detection was developed based on a double-antibody sandwich immunoassay. recombinant antigens containing the nucleoprotein and a peptide from the spike protein of sars-cov- were conjugated with fitc and immobilized on anti-fitc antibody-conjugated magnetic particles. the tests were conducted on an automated magnetic chemiluminescence analyzer (axceed , bioscience, tianjing, china) according to the manufacturer's instructions. the mclia titers of specific igg and igm antibodies were defined as the highest dilution giving a chemiluminescence value of more than or equal to . all tests were performed under strict biosafety conditions.
detection of specific igg levels against sars-cov- s-rbd and np
sars-cov- np and s-rbd specific igg antibodies in plasma were measured separately by in-house elisa. purified np and s-rbd antigens were each coated onto maxisorp elisa plates (corning costar, acton, ma, usa) in . m carbonate buffer (ph . ) at a concentration of . μg/ml overnight at °c. plates were washed times with phosphate-buffered saline (pbs) containing . % vol/vol tween- (pbst) and blocked with % bovine serum albumin for h at °c. the plates were then washed with pbst. the serum samples were diluted -fold in pbs as the initial concentration and then serially diluted -fold up to -fold. the serial dilutions of serum samples were added to the plate wells and incubated, followed by washing and incubation with hrp-conjugated goat anti-human igg secondary antibody (abcam, cambridge, uk). after washes, plates were developed with tetramethylbenzidine substrate (tiangen biotech co., beijing, china) at room temperature in the dark. the absorbance was measured at nm using a microplate reader (molecular devices co., sunnyvale, ca, usa) after adding the stop solution ( m sulphuric acid). all samples were run in duplicate. the titers of np and s-rbd specific igg antibodies were defined as the highest dilution giving an absorbance value of more than . times that of the negative control.
fig. changes of laboratory results before and at day - after convalescent plasma transfusion. a, b sars-cov- specific igg and igm levels, respectively, determined by mclia. c, d cycle threshold (ct) values of orf ab-gene and n-gene, respectively. a ct value of was defined as undetectable. e pao /fio (normal range: - mmhg). f white blood cell count (normal range: . - . ). g lymphocyte count (normal range: . - . ). h c-reactive protein (normal range: < ). i procalcitonin (normal range: < . ). j il- (normal range: - . ).
receptor-binding assay
inhibitory effects of the cp samples on rbd-fc binding to the receptor angiotensin-converting enzyme (ace ) were tested using an elisa-based assay. recombinant soluble human ace (sino biological) was coated at µg/ml onto -well elisa plates (corning costar) in . m carbonate buffer (ph . ) at °c overnight. plates were washed times with . % vol/vol pbst and blocked with . % bovine serum albumin for h at °c.
ng/ml recombinant sars-cov- spike rbd-mfc (sino biological) was mixed at a ratio of : with serially diluted cp or serum samples, or with no sample, and incubated at °c for h; μl of the mixture was then added to the wells. after incubation at °c for min, μl of hrp-conjugated goat anti-mouse igg (zsgb-bio) was added to the wells. after incubation at °c for h, μl of tmb substrate was added to the wells. the reaction was developed at room temperature in the dark for min and terminated with the stop solution ( m sulphuric acid). the absorbance was measured at nm. all samples were run in duplicate. the % inhibitory titer (it ) was defined as the dilution of serum or plasma that inhibits % of rbd-fc binding to receptor ace , calculated using a linear interpolation algorithm.
pseudovirus based neutralization assay
the neutralizing activity of plasma samples was measured by a pseudovirus-based neutralization assay as described previously. in brief, pseudovirus was incubated with serial dilutions of the plasma samples (six dilutions in a -fold step-wise manner) in duplicate for h at °c, together with the virus control and cell control wells in hexaplicate. then, fresh huh- cells (japanese collection of research bioresources [jcrb], ) were added to each well. following h of incubation in a % co environment at °c, the luminescence was measured using a microplate luminometer (perkinelmer, ensight). the nab titers (nat ) were defined as the % inhibitory dilution (id ), calculated as the highest dilution of plasma that resulted in a % reduction of relative light units compared with the virus control.
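the inhibitory titer described above is obtained by linear interpolation between the two dilutions that bracket the target inhibition level. the following is a minimal sketch of one way to do this, not the authors' implementation. the function names, the 50% default target, and the choice to interpolate on log10 of the reciprocal dilution are all assumptions made for illustration; the source only states that a linear interpolation algorithm was used.

```python
import math

def percent_inhibition(signal, control_signal, background=0.0):
    """Percent inhibition of a well relative to the no-plasma control.

    signal and control_signal are absorbance (or luminescence) readings;
    background is an optional blank reading (hypothetical parameter).
    """
    return 100.0 * (1.0 - (signal - background) / (control_signal - background))

def interpolated_titer(dilutions, inhibitions, target=50.0):
    """Reciprocal dilution at which inhibition crosses `target`.

    dilutions: reciprocal dilutions in increasing order (e.g. 10, 100, 1000).
    inhibitions: percent inhibition measured at each dilution.
    Interpolation is linear in log10(dilution), which is an assumption.
    Returns None if the curve never crosses the target.
    """
    points = list(zip(dilutions, inhibitions))
    for (d_lo, i_lo), (d_hi, i_hi) in zip(points, points[1:]):
        if i_lo >= target >= i_hi and i_lo != i_hi:
            frac = (i_lo - target) / (i_lo - i_hi)
            log_d = math.log10(d_lo) + frac * (math.log10(d_hi) - math.log10(d_lo))
            return 10 ** log_d
    return None
```

a plasma whose inhibition never falls through the target within the tested range returns `None`, which a caller would typically report as above or below the dilution range rather than as a number.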
clinical data collection and efficacy assessment
clinical information of the patients before and after cp transfusion was retrieved from the hospital electronic medical records system, including: ( ) basic clinical data: age, sex, days of admission from symptom onset, presenting symptoms, comorbidities, and other treatments; ( ) cp transfusion information: time and dose of cp infusion, complications prior to cp therapy, and adverse effects; ( ) clinical features, laboratory data, and chest ct imaging. adverse events and serious adverse events associated with cp transfusion were assessed by the clinician. the primary indicators for efficacy assessment were the changes in clinical symptoms, laboratory parameters, and radiological images after cp intervention. clinical outcomes included discharge and hospitalization.
statistical analysis
continuous variables were summarized as median and iqr or range. spearman correlation analyses were used to calculate the correlations among log -transformed anti-sars-cov- specific igg and igm mclia titers, anti-s-rbd and anti-np specific igg elisa titers, it , and nat of cp. graphs were plotted using graphpad prism . (graphpad software, san diego, ca, usa). correlation analysis was performed using spss . (spss inc., chicago, il, usa). a two-tailed p value of less than . was considered statistically significant.
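the statistical analysis above relies on spearman rank correlation between titers and on geometric mean titers (gmt). the sketch below shows both calculations in plain python, as an illustration only; the study itself used spss and graphpad prism, and the function names here are hypothetical. because spearman correlation depends only on ranks, it gives the same value for raw and log-transformed titers.

```python
import math

def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def spearman(x, y):
    """Spearman correlation: Pearson correlation of the average ranks."""
    return pearson(average_ranks(x), average_ranks(y))

def geometric_mean_titer(titers):
    """Geometric mean of positive titers, computed on the log scale."""
    return math.exp(sum(math.log(t) for t in titers) / len(titers))
```

this sketch does not compute a p value; a statistics package would be needed for the two-tailed significance test mentioned in the text.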
the authors declare that the data supporting the findings of this study are available within the paper and its supplementary materials. the online version of this article (https://doi.org/ . /s - - -x) contains supplementary material, which is available to authorized users. competing interests: the authors declare no competing interests. open access this article is licensed under a creative commons attribution . international license, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the creative commons license, and indicate if changes were made.
the images or other third party material in this article are included in the article's creative commons license, unless indicated otherwise in a credit line to the material. if material is not included in the article's creative commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. to view a copy of this license, visit http://creativecommons. org/licenses/by/ . /. key: cord- - piio authors: zhou, haixia; zhang, shuyuan; wang, xinquan title: crystallization and structural determination of the receptor-binding domain of mers-cov spike glycoprotein date: - - journal: mers coronavirus doi: . / - - - - _ sha: doc_id: cord_uid: piio three-dimensional structures of the receptor-binding domain (rbd) of mers-cov spike glycoprotein bound to cellular receptor and monoclonal antibodies (mabs) have been determined by x-ray crystallography, providing structural information about receptor recognition and neutralizing mechanisms of mabs at the atomic level. in this chapter, we describe the purification, crystallization, and structure determination of the mers-cov rbd. the first three-dimensional structure of the mers-cov spike glycoprotein receptor-binding domain (rbd), providing the molecular basis of viral attachment to host cells, was determined in the complex with it cellular receptor dipeptidyl peptidase (dpp , also called cd ) by x-ray crystallography [ ] . because of the significance in receptor recognition and specific pathogenesis, rbd became a hot spot in the study of mers-cov. a number of structures of rbd bound by monoclonal antibodies (mabs) have also been determined and deposited in the protein data bank (pdb, http://www.rcsb.org/pdb/) [ ] [ ] [ ] [ ] [ ] [ ] [ ] . our group determined the rbd structures in complex with dpp and the mabs mers- , mers- and mers-gd , respectively [ ] [ ] [ ] ] . 
all the three-dimensional structures of mers-cov rbd have been determined by x-ray crystallography, which is a powerful method for determining molecular structures at atomic resolution. briefly, the ordered and repeated atoms in a single protein crystal diffract the incident x-ray beam into many specific directions. the angles and intensities of these diffracted x-rays are collected and measured in an x-ray diffraction experiment. after obtaining the phases of these diffracted x-rays by heavy-atom derivative, anomalous scattering, or molecular replacement methods, a protein crystallographer calculates the electron density within the protein crystal and builds a structural model based on the density map. for details on the principles and methodology of protein crystallography, please refer to the range of excellent textbooks available. in this chapter, the standard method of protein crystallography is briefly introduced, focusing on crystallization and structural determination of mers-cov rbd using the molecular replacement method. prepare all solutions using ultrapure water (prepared by purifying deionized water to attain a resistivity of mΩ cm at °c) and analytical grade reagents. when dealing with waste, strictly follow all waste disposal regulations. . pfastbac vector containing the mers-cov rbd gene. . lb liquid medium: g tryptone, g yeast extract, g nacl, and l ultrapure water; sterilize by high-pressure steam. . liquid lb selection medium: lb liquid medium, μg/ml kanamycin, μg/ml gentamicin, and μg/ml tetracycline. . bacmid selection lb agar plate: g tryptone, g yeast extract, g agar powder, g nacl, and l ultrapure water; sterilize by high-pressure steam. add μg/ml kanamycin, μg/ml gentamicin, μg/ml tetracycline, μg/ml x-gal, and μg/ml iptg, mix, and pour into sterile plates (see note ). mers-cov rbd can be expressed using the bac-to-bac baculovirus expression system (fig.
), collected and captured using nta sepharose (ge healthcare), and then further purified by gel filtration chromatography using a superdex high performance column (ge healthcare). crystallization trials are set up using the hanging-drop or sitting-drop vapor diffusion method in conjunction with sparse-matrix crystal screening kits. the structure of mers-cov rbd in complex with mers- scfv was determined using the molecular replacement method. when the total volume remaining is reduced to ml, add ml hbs buffer to collect all the liquid in the system in a beaker. then dispense into high-speed centrifuge tubes and centrifuge at × g for h at °c. . the supernatant after centrifugation is loaded onto nickel-nta beads equilibrated with ml of hbs buffer. mers-cov rbd with a his tag is captured by the nickel-nta beads. repeat loading the sample once more. . add wash buffer to the beads to remove nonspecifically bound proteins until the flow-through no longer changes the color of the coomassie brilliant blue g solution. . after elution buffer is added, the target protein dissociates from the beads; collect it in a kda millipore concentrating tube. similarly, detect protein with coomassie brilliant blue g . the concentrating tube containing the protein sample is centrifuged at × g to concentrate the sample to less than . mers-cov rbd is further purified by gel filtration chromatography. the sample is loaded onto the superdex column pre-equilibrated with hbs buffer. fractions containing rbd are collected and the protein's purity is confirmed by sds-page (fig. ) . . dilute the protein to mg/ml, and digest with endoglycosidase f and f (f /f : rbd at the ratio of : ) at °c overnight. the digested protein is concentrated and purified by gel filtration chromatography as above (optional). . after preparing the mers-cov rbd protein and mers- scfv protein (see note ), measure the absorbance of the protein sample at nm (a ).
according to the molecular weight and extinction coefficient, the molar concentration can be calculated. the two proteins were mixed at molar ratio of : , incubated on ice for h, and purified using a superdex column. collect the fractions containing the complex and confirm the protein purity by sds-page. sartorius centrifugal concentrator to concentrate to - mg/ml. after mixing and aspirating, centrifuge at ,  g for min at c. . use ttp labtech's mosquito crystallization setup for automated crystallography. absorb μl protein on the -well μl micro-reservoir strip (fig. a) . then the needles aspirate the protein from the strip onto the swissci plate with nl of protein per well (fig. b) , using the sitting-drop vapor diffusion method by mixing nl reservoir and nl reservoir (fig. c,d) . . seal the plate with tape and gently place it in an c room. . check the sample drops under a microscope at -  magnifications after and days (and if necessary, after and weeks, and , , and months). . a week later, we found crystal growth in the peg/ion, pegrx and jcsg+ kits. specific conditions were as follows (fig. a,b) . the diffraction images should be collected on the bl u beamline (fig. c) . rotate the mounted crystal and the x-ray diffraction patterns should be recorded at per image, and collected for . . the diffraction images in a dataset should be processed with hkl [ ] including auto-index, refinement, integration, and scaling steps. after data processing, the crystal unit cell parameters, crystal space group, miller indexes of reflections, intensities, and error estimates of reflections should be determined and stored in a * .sca file, which provides the dataset applicable to structure determination. . using ccp suite solve the structure as follows: export the * . sca file to a * .mtz file using the program scalpack mtz. use the * .mtz file to calculate the solvent via matthews_-coef. 
run with phaser mr (see note ) with the mers-cov rbd structure (pdb id: l ) and the structures of the variable domain of the heavy and light chains available in the pdb with the highest sequence identities as search models (see note ) [ ] . when the phases are determined, the electron density map can be calculated, from which the molecular model can be constructed. . subsequent model building and refinement were performed using coot and phenix, respectively (see note ) [ , ] . . many validation programs are used to check the structure, until the investigator is satisfied, and then the structure can be deposited in the pdb. . weigh g tryptone, g yeast extract, g agar powder, and g nacl, and add ultrapure water to l. after sterilization by high-pressure steam, wait until the temperature of the medium drops to about c, add the required antibiotics, inducers etc. ( μg/ml kanamycin, μg/ml gentamicin, μg/ml tetracycline, μg/ml x-gal, μg/ml iptg). mix evenly and then pour into sterile plates. when the culture medium has cooled and solidified, store the bacmid selection lb agar plate at c. . endoglycosidase f and f are expressed and purified from e. coli by our laboratory. the endoglycosidase was added into the reaction system according to the mass ratio of : . . the coding sequence of the mers-cov rbd (emc strain, spike residues - ) was ligated into the pfastbac-dual vector (invitrogen) with a n-terminal gp signal peptide to enable the protein secreting outside the cell and a c-terminal his-tag to facilitate further purification processes. . allow the pellet to dissolve for at least min at room temperature. to avoid shearing the dna, pipet only - times to resuspend. store the bacmid at c and use it as soon as possible, usually within week. aliquot the bacmid dna into separate tubes and store at À c (not in a frost-free fridge). avoid multiple freeze/thaw cycles as this decreases the transfection efficiency. . 
characteristics of infected cells: a - % increase in cell diameter can be seen and the size of cell nuclei increases at the early stage. cells release from the plate and appear lysed, showing signs of clearing in the monolayer. . p virus can be stored for years, adding % (v/v) fbs at c, protected from light. . the expression of mers- scfv protein was conducted in f cells transiently transfected with plasmid dna. after h, the supernatant was collected and concentrated. the purified mers- scfv protein was obtained by ni-nta affinity chromatography and superdex size-exclusion chromatography. the purification method is the same as that of rbd protein. . if the crystal structure of the same protein or a similar protein has been solved, the molecular replacement method can be applied. after obtaining the solutions of the rotation and translation functions, initial phases can be calculated from the reference model, after which the electron density can be calculated. . the accuracy of the constructed model is confirmed by the crystallographic r-factor and r-free, which indicate the discrepancy between the calculated and observed amplitudes. the stereo-chemical parameters of the model can also be checked using programs such as molprobity, procheck, or rampage. 
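the last note describes the crystallographic r-factor and r-free as measures of the discrepancy between calculated and observed amplitudes. as a hedged illustration of the standard definition, r = Σ| |f_obs| − |f_calc| | / Σ|f_obs|, the sketch below computes r-work and r-free from plain lists of amplitudes. the function names and the boolean free-set flags are hypothetical, and the calculated amplitudes are assumed to be already scaled to the observed ones; refinement programs such as phenix handle scaling and test-set selection internally.

```python
def r_factor(f_obs, f_calc):
    """Crystallographic R-factor for paired structure-factor amplitudes.

    Assumes f_calc has already been placed on the same scale as f_obs.
    """
    numerator = sum(abs(abs(o) - abs(c)) for o, c in zip(f_obs, f_calc))
    denominator = sum(abs(o) for o in f_obs)
    return numerator / denominator

def r_work_and_free(f_obs, f_calc, free_flags):
    """Return (R-work, R-free).

    free_flags[i] is True for reflections held out of refinement
    (the test set) and False for reflections used in refinement.
    """
    work = [(o, c) for o, c, free in zip(f_obs, f_calc, free_flags) if not free]
    test = [(o, c) for o, c, free in zip(f_obs, f_calc, free_flags) if free]
    r_work = r_factor([o for o, _ in work], [c for _, c in work])
    r_free = r_factor([o for o, _ in test], [c for _, c in test])
    return r_work, r_free
```

an r-free that is much higher than r-work is commonly read as a sign of overfitting, which is why the held-out reflections must never be used during refinement.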
molecular basis of binding between novel human coronavirus mers-cov and its receptor cd structural basis for the neutralization of mers-cov by a human monoclonal antibody mers- structural definition of a unique neutralization epitope on the receptorbinding domain of mers-cov spike glycoprotein ultrapotent human neutralizing antibody repertoires against middle east respiratory syndrome coronavirus from a recovered patient evaluation of candidate vaccine approaches for mers-cov junctional and allelespecific residues are critical for mers-cov neutralization by an exceptionally potent germline-like antibody introduction of neutralizing immunogenicity index to the rational design of mers coronavirus subunit vaccines importance of neutralizing monoclonal antibodies targeting multiple antigenic sites on mers-cov spike to avoid neutralization escape structure of mers-cov spike receptor-binding domain complexed with human receptor dpp processing of x-ray diffraction data collected in oscillation mode phaser crystallographic software coot: modelbuilding tools for molecular graphics phenix: building new software for automated crystallographic structure determination key: cord- - xsypzt authors: nelson-sathi, shijulal; umasankar, pk; sreekumar, e; radhakrishnan nair, r; joseph, iype; nori, sai ravi chandra; philip, jamiema sara; prasad, roshny; navyasree, kv; ramesh, shikha; pillai, heera; ghosh, sanu; santosh kumar, tr; radhakrishna pillai, m. title: mutational landscape and in silico structure models of sars-cov- spike receptor binding domain reveal key molecular determinants for virus-host interaction date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: xsypzt protein-protein interactions between virus and host are crucial for infection. sars-cov- , the causative agent of covid- pandemic is an rna virus prone to mutations. 
formation of a stable binding interface between the spike (s) protein receptor binding domain (rbd) of sars-cov- and angiotensin-converting enzyme (ace ) of host actuates viral entry. yet, how this binding interface evolves as virus acquires mutations during pandemic remains elusive. here, using a high fidelity bioinformatics pipeline, we analysed , sars-cov- genomes across the globe, and identified non-synonymous mutations that cause distinct amino acid substitutions in the rbd. molecular phylogenetic analysis suggested independent emergence of these rbd mutants during pandemic. in silico structure modelling of interfaces induced by mutations on residues which directly engage ace or lie in the near vicinity revealed molecular rearrangements and binding energies unique to each rbd mutant. comparative structure analysis using binding interface from mouse that prevents sars-cov- entry uncovered minimal molecular determinants in rbd necessary for the formation of stable interface. we identified that interfacial interaction involving amino acid residues n and g on either ends of the binding scaffold are indispensable to anchor rbd and are well conserved in all sars-like corona viruses. all other interactions appear to be required to locally remodel binding interface with varying affinities and thus may decide extent of viral transmission and disease outcome. together, our findings propose the modalities and variations in rbd-ace interface formation exploited by sars-cov- for endurance. importance covid- , so far the worst hit pandemic to mankind, started in january and is still prevailing globally. our study identified key molecular arrangements in rbd-ace interface that help virus to tolerate mutations and prevail. in addition, rbd mutations identified in this study can serve as a molecular directory for experimental biologists to perform functional validation experiments. 
the minimal molecular requirements for the formation of the rbd-ace interface predicted using in silico structure models may help to precisely design neutralizing antibodies, vaccines and therapeutics. our study also highlights the significance of understanding the evolution of protein interfaces during a pandemic. cov- and related sars-cov provided initial clues regarding molecular architecture of the interface. rbd comprises a -amino acid-long peptide in the s -region of the s-protein ( table ) (walls et al., ). however, ace binding information is confined to a variable loop region within rbd called the receptor binding motif (rbm). these structures elucidated key interfacial interactions responsible for the higher binding affinity of sars-cov- for ace compared with sars-cov. they also suggested that a few amino acid changes in rbm can remodel the interface, resulting in altered binding affinities and viral transmission. however, all these studies were based on the parental sars-cov- wuhan strain and several questions remain unanswered. what are the mutations acquired on rbd during covid- and what are the interfacial molecular rearrangements induced by these mutations? can we gain valuable insights regarding rbd-ace interface formation by analyzing these mutations? to address these questions we investigated the mutational landscape of sars-cov- rbd. a total of , spike proteins of sars-cov- were directly downloaded on th june from the gisaid database. we removed partial sequences, sequences with more than % unidentified 'x' amino acids and sequences from low quality genomes. further, , spike protein sequences along with the wuhan reference spike protein (yp_ . ) were aligned using mafft (maxiterate , and global pair-ginsi) (katoh et al. ). the alignments were visualized in jalview (waterhouse am et al., ) and the amino acid substitutions at each position were extracted using a custom python script. we ignored substitutions that were present in only one genome, as well as the unidentified amino acid x.
only mutations that were present in at least two independent genomes at a particular position were considered further. these two criteria were used to avoid mutations due to sequencing errors. the mutated amino acids were then tabulated and plotted as a matrix using an r script. for the maximum-likelihood phylogeny reconstruction, we used the sars-cov- genomes containing rbd mutations, and genomes were sampled as representatives for each known subtype with the wuhan refseq strain as root. sequences were aligned using mafft (maxiterate , and global pair-ginsi), and the phylogeny was reconstructed using iq-tree (nguyen et al., ). the best evolutionary model (gtr+f+i+g ) was picked using the modelfinder program (kalyaanamoorthy et al., ). the structural analysis of the mutated spike glycoprotein of sars-cov- rbd domain was done to assess the impact of interface amino acid residue mutations on binding affinity towards the human ace (hace ) receptor. the crystal structure of the sars-cov- rbd-hace receptor complex was downloaded from the protein data bank (pdb id: lzg) and the mutagenesis analysis
to capture changes that affect viral tropism during the pandemic, we searched for non-synonymous mutations in rbd sequences from sars-cov- genomes. using unbiased and stringent filtering criteria, we analyzed , genomes deposited in gisaid until th june, . altogether, non-synonymous mutations in rbd were identified that belong to viral genomes from countries. these mutations were found to substitute amino acid residues, of which residues lie within rbm ( table ). these residues include those that directly engage ace (g , l , a , g , e , f and q ) and those that are in the near binding vicinity (figure a and b). hot spot mutations were also identified that caused recurrent substitutions of amino acid residues in the same position (n , p , q , i , s , v , f , a , p and a ). each rbd mutation was found to be unique to the genome; a combination of mutations was never observed in our analysis.
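the substitution filtering described above (skip the unidentified residue x, keep only substitutions seen in at least two independent genomes) can be sketched as follows. this is not the authors' custom script, which is not shown in the text: the function name, the input layout (pre-aligned strings keyed by genome id) and the toy sequences are assumptions made only for illustration.

```python
from collections import defaultdict


def recurrent_substitutions(reference, genomes, min_genomes=2):
    """Collect substitutions relative to an aligned reference sequence.

    reference: aligned reference protein sequence (may contain '-').
    genomes: dict mapping a genome id to its aligned sequence, each of the
             same length as the reference.
    Returns {(position, ref_residue, new_residue): sorted genome ids} for
    substitutions carried by at least `min_genomes` genomes. Positions are
    1-based and counted on the ungapped reference.
    """
    carriers = defaultdict(set)
    for genome_id, sequence in genomes.items():
        if len(sequence) != len(reference):
            raise ValueError(f"{genome_id} is not aligned to the reference")
        position = 0
        for ref_residue, residue in zip(reference, sequence):
            if ref_residue == "-":
                continue  # insertion relative to the reference: no position
            position += 1
            if residue in ("X", "-") or residue == ref_residue:
                continue  # unidentified residue, deletion, or no change
            carriers[(position, ref_residue, residue)].add(genome_id)
    return {
        key: sorted(ids)
        for key, ids in carriers.items()
        if len(ids) >= min_genomes
    }


if __name__ == "__main__":
    # Toy four-residue "alignment", not real spike sequence data.
    ref = "NGFQ"
    aligned = {"g1": "NGLQ", "g2": "NGLQ", "g3": "KGFQ", "g4": "NGXQ"}
    print(recurrent_substitutions(ref, aligned))
```

in the toy input, the substitution seen in a single genome and the position reported as x are both dropped, which mirrors the two criteria used to guard against sequencing errors.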
overall, rbd mutations accounted for ~ % of the total non-synonymous mutations in s-protein. to see the evolutionary trend in rbd mutations, we compared rbds from sars-cov- , the related sars-cov and the bat coronavirus ratg , a suspected precursor of sars-cov- . sars-cov- rbd is . % identical to sars-cov and . % identical to ratg ( figure a ). we identified several rbd mutations on residues that are unique to sars-cov- (n k, v a/f/i, e d, f s/l, q l and s p) or are conserved in all three viruses ( figure a ). in addition, we observed micro evolutionary reversion mutations in sars-cov- that interchange residues to that in sars-cov or ratg (r k, n d, n k, l r, e v and s g) ( figure a) . interestingly, most of the unique and reversion mutations were located in the rbm region and thus may have implications in viral tropism. we performed phylogenomic analysis to understand the evolutionary pattern of rbd variants during pandemic and observed an unbiased clustering of rbd variants among distinct sars-cov- subtypes likely indicating independent emergence of these mutants (figure c ). rbd is divided into a structured core region comprising five antiparallel β-sheets and a variable random coil region, rbm that directly binds to ace (figure b) . on the contrary, binding information on ace is located across long α-helices. structurally, rbm scaffold resembles a concave arch that makes three contact points with ace α-helix; cluster-i, ii and iii. cluster-i and cluster-iii are on two ends and cluster-ii is towards the middle of the interface (figure a) . we analysed the effect of observed rbm mutations on the molecular interactions at rbd-ace binding interface. it has been shown that differences in ace residues render mouse resistant to infection from sars-like coronaviruses (zhao et al., ) . hence, to gain insights into relevant interactions that can create stable interface in mutants, we also included rbd-mouse ace interface in our analysis ( figure b) . 
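the rbd identity values quoted above for sars-cov- , sars-cov and ratg come from pairwise comparison of aligned sequences. the text does not say which identity definition was used, so the sketch below assumes one common convention (identical residues divided by aligned columns without gaps); the function name and example strings are hypothetical.

```python
def percent_identity(seq_a, seq_b):
    """Percent identity between two aligned sequences of equal length.

    Columns where either sequence has a gap ('-') are skipped, so the
    denominator is the number of residue-to-residue columns. Other
    conventions (for example dividing by the alignment length) give
    different numbers.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for a, b in zip(seq_a, seq_b):
        if a == "-" or b == "-":
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Toy strings, not real rbd sequences.
    print(percent_identity("NGFQ", "NGLQ"))
```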
structure models were created for all mutants based on the information from three recently reported crystal/cryo-em structures of the sars-cov- rbd-ace bound complex (figure a). comparative analysis of structures showed key differences in all three binding clusters of sars-cov- rbd wild type and mutant interfaces with human or mouse ace (figure c, d and table s ). in cluster-i, f of sars-cov- rbm is found buried in the hydrophobic pocket made of human ace residues l , m and y . a mutation of f>l in sars-cov disrupts this pocket and thus weakens the binding affinity, suggesting the importance of this interaction (wan et al., ). in addition, n of sars-cov- rbm forms hydrogen bonds with q and y of human ace . the hydrophobic pocket and n -y interactions were completely abolished in the mouse interface due to natural ace substitutions in l t, m s and y f. however, these interactions were retained in all rbm mutants, suggesting their importance in the pandemic. nevertheless, interactions of a / g of sars-cov- rbd with s of ace in cluster-i, which were present in human and mouse, were disrupted in mutants. in addition, sars-cov- genomes containing a v and g s replacements were identified in our analysis, suggesting these mutations can be well tolerated. an additional hydrogen bond between y -y was seen in mutants lacking the a /g -s interaction. this could possibly be a compensatory mechanism to stabilize cluster-i interactions (figure c, d and table s ). a set of interactions in cluster-iii involving g /y /q of rbm and q of ace were present in human but abolished in mouse and rbm mutants. however, additional interactions to compensate for these were not seen. a hydrogen bond formed between g of rbm and k of human ace appeared significant as this was completely abolished in mouse, owing to the k h substitution, but retained in all rbm mutants.
in addition, other interactions in the same cluster (g -k , y -d and t -y ) were maintained in human, mouse and mutants, likely suggesting their supportive role ( figure c, d and table s ). the varying interface arrangements in mutants were consistent with the binding affinity differences (ΔΔg). compared to wild type, ΔΔg values of mutants ranged within ~ + kcal/mol, with the lowest value close to that of sars-cov ( figure s ). since rbm is a variable loop, mutations on any residue could impact the spatial arrangement of the backbone, leading to altered binding affinities. consistently, we did not find a considerable difference in binding energies between mutations on residues that are involved in ace interaction and those on residues in the near vicinity. in conclusion, we could pinpoint two interfacial interactions that remain unaffected in all mutants analysed. these are interactions mediated through rbd residues n and g and are located in cluster-i and cluster-iii respectively. based on their spatial arrangement, these residues appear critical in directly anchoring the rbm loop onto ace . this may help initiate interface formation that favours viral entry. both n and g are highly conserved in all sars-like corona viruses, further reinforcing this notion. the significant remodelling in cluster-ii interactions indicates they are dispensable for anchoring but might be important for stabilizing the interface. since the rbd-ace interface is a direct determinant of viral infectivity, along with other factors, varying interface architecture and binding affinities in sars-cov- rbm mutants may account for global variations in covid- transmission and outcome. the sars-cov- s-protein is highly immunogenic, so recombinant vaccines and neutralizing antibodies that target the whole s-protein or rbd are currently being considered in clinics. our investigations reveal key molecular determinants and their modalities for rbd-ace interaction.
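to put the binding affinity differences (ΔΔg) discussed above in perspective, a free-energy difference can be converted into a fold change in the dissociation constant through the standard relation ΔG = RT ln(Kd). the sketch below assumes the sign convention ΔΔG = ΔG(mutant) - ΔG(wild type) and a temperature of 298.15 k; neither is stated in the text, and the function names are hypothetical.

```python
import math

# Gas constant in kcal/(mol*K), so that energies are in kcal/mol.
GAS_CONSTANT_KCAL = 1.987204e-3


def delta_delta_g(dg_mutant, dg_wild_type):
    """ΔΔG under the assumed convention: mutant minus wild type (kcal/mol)."""
    return dg_mutant - dg_wild_type


def kd_fold_change(ddg_kcal_per_mol, temperature_k=298.15):
    """Return Kd(mutant) / Kd(wild type) implied by a given ΔΔG.

    From ΔG = RT ln(Kd): Kd_mut / Kd_wt = exp(ΔΔG / RT).
    A value above 1 means weaker binding of the mutant under the assumed
    sign convention; a value below 1 means tighter binding.
    """
    return math.exp(ddg_kcal_per_mol / (GAS_CONSTANT_KCAL * temperature_k))


if __name__ == "__main__":
    # Made-up energies, only to show the call pattern.
    ddg = delta_delta_g(-11.0, -12.0)
    print(ddg, kd_fold_change(ddg))
```

because the relation is exponential, a shift of about 1 kcal/mol already corresponds to a several-fold change in kd at room temperature, so small ΔΔg differences between mutants can still matter for binding.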
this information might be used to design vaccines, synthetic nanobodies or small molecules that could specifically target rbm anchoring residues or their binding pockets.
the pymol molecular graphics system
the spike protein of sars-cov -a target for vaccine and therapeutic development
sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor
modelfinder: fast model selection for accurate phylogenetic estimates
mafft: a novel method for rapid multiple sequence alignment based on fast fourier transform
increasing the precision of comparative models with yasara nova-a self-parameterizing force field
structure of the sars-cov- spike receptor-binding domain bound to the ace receptor
iq-tree: a fast and effective stochastic algorithm for estimating maximum likelihood phylogenies
characterization of spike glycoprotein of sars-cov- on virus entry and its immune cross-reactivity with sars-cov
zdock server: interactive docking prediction of protein-protein complexes and symmetric multimers
swiss-model: an automated protein homology-modeling server
pic: protein interactions calculator
ligplot: a program to generate schematic diagrams of protein-ligand interactions
structure, function, and antigenicity of the sars-cov- spike glycoprotein
receptor recognition by the novel coronavirus from wuhan: an analysis based on decade-long structural studies of sars coronavirus
structural and functional basis of sars-cov- entry by using human ace
enhanced receptor binding of sars-cov- through networks of hydrogen-bonding and hydrophobic interactions
jalview version -a multiple sequence alignment editor and analysis workbench
cryo-em structure of the -ncov spike in the prefusion conformation
prodigy: a web server for predicting the binding affinity of protein-protein complexes
mining of epitopes on spike protein of sars-cov- from covid- patients
broad and differential animal angiotensin-converting enzyme receptor usage by sars-cov-
the authors wish
to acknowledge john b johnson, mahendran kr and sara jones for critical comments. the work was supported by the department of biotechnology, government of india. key: cord- -haci cy authors: kim, ju; yang, ye lin; jang, yong-suk title: human β-defensin is involved in ccr -mediated nod signal transduction, leading to activation of the innate immune response in macrophages date: - - journal: immunobiology doi: . /j.imbio. . . sha: doc_id: cord_uid: haci cy beta-defensins contribute to host innate defense against various pathogens, including viruses, although the details of their roles in innate immune cells are unclear. we previously reported that human β-defensin (hbd ) activates primary innate immunity against viral infection and suggested that it plays a role in the induction of the adaptive immune response. we analyzed the mechanisms by which hbd primes innate antiviral immunity and polarized activation of macrophage-like thp- cells using the receptor-binding domain (rbd) of middle east respiratory syndrome coronavirus (mers-cov) spike protein (s rbd) as a model antigen. the expression of nucleotide-binding oligomerization domain containing (nod ), type i interferons, (ifns), and proinflammatory mediators was enhanced in s rbd-hbd -treated thp- cells. s rbd-hbd treatment also enhanced phosphorylation and activation of receptor-interacting serine/threonine-protein kinase and ifn regulatory factor compared to s rbd alone. finally, hbd -conjugated s rbd interacted with c-c chemokine receptor (ccr ), and nod was involved in hbd -mediated ccr signaling, which was associated with the activation and m polarization of thp- cells. therefore, hbd promotes ccr -mediated nod signaling, which induces production of type i ifns and an inflammatory response, and enhances primary innate immunity leading to an effective adaptive immune response to hbd -conjugated antigen. antimicrobial peptides (amps) are a component of the primary host defense in the mucosa. 
amps are produced mainly by epithelial cells and the cells involved in innate immunity. the functions of amps in defense against various infections are better characterized than their role as innate immune modulators (boyton and openshaw, ) . defensins are small cationic amps and are found in various organisms including mammals, insects, and plants. among them, β-defensins are primarily produced by epithelial cells and leukocytes in vertebrates (stolzenberg et al., ) . immature dendritic cells (dcs) can be activated by β-defensins, and β-defensins also inhibit infection by haemophilus influenzae and viruses (zhao et al., ; lafferty et al., ) . inactivation of human β-defensins (hbds) leads to the recurrent airway infections experienced by patients with cystic fibrosis (smith et al., ) . also, a lack of human β-defensin (hbd ) results in immune dysfunction, such as reduced numbers of b and regulatory t cells, resulting in decreased production of antigen (ag)-specific immunoglobulin a (lugering et al., ; mcdonald et al., ) . therefore, an understanding of the regulatory mechanism by which amps modulate the immune response during the early stage of viral infection is critically needed to deal with various diseases caused by virus infection. virus infection of host cells triggers innate antiviral responses, which are initiated via pattern recognition receptor (prr) signaling pathways (ishii et al., ) . among the prr families, nucleotidebinding oligomerization domain containing (nod ) is an important mediator of the innate immune response to viral infection, and induces expression of type i interferons (ifns) to promote the expression of proinflammatory cytokines and restrict viral replication (wiese et al., ) . furthermore, nod -deficient mice exhibit decreased production of ifns and increased susceptibility to viral infection (sabbah et al., ) . 
presumably, the ability of a virus to counteract innate antiviral immunity during the early stage of infection influences pathogenicity and disease severity (perlman and dandekar, ). type i ifns play a major role in the antiviral innate immune response by upregulating the production of antiviral proteins and the recruitment of immune cells (haller et al., ). production of type i ifn is initiated by ubiquitously expressed cytoplasmic viral sensors in response to detection of viral pathogen-associated molecular patterns such as double-stranded rna (kato et al., ; li and zhong, ). stimulated viral sensors activate downstream signaling pathways, leading to expression of transcription factors including ifn regulatory factor (irf ) and nuclear factor-κb (nf-κb), which drive ifn-β expression (yoneyama et al., ). however, some viruses, including middle east respiratory syndrome coronavirus (mers-cov), inhibit these type i ifn induction pathways. for example, various proteins of mers-cov, including m protein, papain-like protease protein (plpro), and accessory proteins a and b, are antagonists of ifns (shokri et al., ). accordingly, the virulence of mers-cov is linked to its immune evasion mechanisms, such as suppression of ifn production during the early stage of infection, induction of macrophage apoptosis, and inactivation of t cells with downregulation of ag presentation (niemeyer et al., ). macrophages are professional phagocytes capable of internalizing and degrading pathogens and apoptotic cells. macrophages are present in the respiratory mucosa, such as in the lung and various fluid compartments, where the detection of and the response to infection occur. due to their location, macrophages detect viral ags first and promote an antiviral innate immune response as well as an ag-specific adaptive immune response by presenting ags to t-cells (manicassamy et al., ).
https://doi.org/ . /j.imbio. . . received april ; received in revised form may ; accepted may
we previously reported that hbd promotes an antiviral innate immune response in macrophage-like thp- cells and elicits an enhanced ag-specific and virus-neutralizing antibody (ab) response in vivo using the receptor binding domain (rbd) of mers-cov spike protein (s rbd) as a model ag (kim et al., ). moreover, the type i ifn response and the production of primary antiviral molecules such as nod were enhanced by s rbd-hbd treatment of thp- cells (kim et al., ). consequently, we assumed that modulation of nod signaling by hbd would prevent infection by viruses that suppress innate antiviral immunity. in this study, we investigated the mechanism by which hbd enhances the type i ifn immune response in thp- cells by modulating nod signaling pathways using hbd -conjugated s rbd of mers-cov.
macrophage-like thp- (atcc ® tib- ™) and vero e (atcc ® crl- ™) cells were obtained from the american type culture collection (manassas, va, usa). mers-cov ( - -mer-is- ) was provided by the korean center for disease control and prevention (kcdc). all experiments using mers-cov were performed in accordance with the world health organization recommendations in a biosafety level facility in the korea zoonosis research institute at chonbuk national university (iksan, korea). the chemicals and labware were obtained from sigma chemical co. (st. louis, mo, usa) and spl life sciences (pocheon, korea), respectively, unless otherwise specified. production of recombinant mers-cov s rbd with or without hbd at the c-terminus (residues - ) of the s domain was performed as described previously with minor modifications (ma et al., ). briefly, the gene encoding s rbd was synthesized with codon optimization based on the mers-cov s protein sequence (genbank akl . ; genscript, piscataway, nj, usa). the s rbd gene with the hbd gene at its ′ terminus was amplified by polymerase chain reaction (pcr) using forward and reverse primers reported previously (kim et al., ).
the amplified genes were cloned into the pcoldii escherichia coli expression vector (takara bio, shiga, japan). recombinant proteins with an n-terminal his tag were purified using ni-nta superflow (qiagen, valencia, ca, usa) according to the manufacturer's instructions. thp- cells were cultured in rpmi medium (welgene, gyeongsan, korea) containing % fetal bovine serum (fbs; gibco, grand island, ny, usa) at °c in a co incubator. the cells were treated with phorbol- -myristate- -acetate ( μg/ml for × cells) for - days to induce differentiation into macrophages (daigneault et al., ). the cultures were replenished with fresh medium, maintained for days, and incubated with recombinant s rbd or s rbd-hbd ( μg/ml per × cells) with or without inhibitors (rs , for ccr and gsk for nod ). the cells were harvested after or h and subjected to quantitative real-time reverse transcription polymerase chain reaction (rt-pcr) or western blotting. rna was extracted using trizol ® reagent (thermo-fisher scientific, waltham, ma, usa) following the manufacturer's instructions. rna was used to synthesize cdna with an mmlv reverse transcription kit (promega, fitchburg, wi, usa). gene expression levels were measured by quantitative real-time rt-pcr with the quantitect sybr green pcr kit (qiagen, hilden, germany) and an abi system (applied biosystems, foster city, ca, usa) using ng of first-strand cdna under the following conditions: °c for min followed by cycles at °c for s, °c for s, and °c for s. the expression levels were normalized to that of β-actin (hactb) using fast software version . . (applied biosystems). the primer sets used to amplify target genes are listed in table .
table sequences of the primers used for qrt-pcr. primers used to measure the expression levels of genes associated with antiviral innate immune responses and macrophage differentiation. the β-actin gene (hactb) was used as an endogenous control.
cells were washed twice with cold phosphate-buffered saline (pbs), and lysed in a lysis buffer containing % triton x- supplemented with a complete protease inhibitor cocktail (roche applied science, mannheim, germany) and mm dithiothreitol. total cell lysates were prepared by centrifugation at , rpm for min. equal amounts of lysates were resolved by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (sds-page), transferred to an immobilon-p polyvinylidene difluoride membrane (merck millipore, burlington, ma, usa), and immunoblotted using specific abs. primary abs against human p-rip and p-irf were obtained from abcam (cambridge, ma, usa) and abs against nod and iκbα were from invitrogen (grand island, ny, usa) and cell signaling (danvers, ma, usa), respectively. the primary ab against human β-actin was purchased from bioss (woburn, ma, usa). target proteins were detected by enhanced chemiluminescence (thermo-fisher scientific). mers-cov was propagated in vero e cells, which were cultured in dmem medium (welgene) containing % fbs at °c in a co incubator. to assess the viral loads and the expression levels of target genes in mers-cov-infected cells, mers-cov was passaged six times in vero e cells and transferred to thp- cells ( × /well) in a tissue culture plate. after incubation for h, we extracted total rna and performed quantitative real-time rt-pcr using the primers in table to measure the expression levels of mers-cov upe and target genes (kim et al., ). thp- cell monolayers in confocal dishes were fixed with % paraformaldehyde. the cells were permeabilized using buffer containing triton x- ( . %), blocked, and incubated with the recombinant protein and a specific primary ab. primary abs against human c-c chemokine receptor (ccr ) and nod were purchased from invitrogen.
a penta-his ab conjugated to alexa fluor ® (qiagen) and anti-rabbit and -mouse igg conjugated to alexa fluor or (invitrogen), respectively, were used as the secondary abs. finally, washed cells were stained with ′ -diamidino- -phenylindole (dapi), covered with slowfade gold antifade reagent (invitrogen), and observed using a confocal laser scanning microscope (clsm, lsm , carl zeiss, thornwood, ny, usa). statistical analyses were performed using prism (graphpad, san diego, ca, usa). data are means ± standard deviations (sds). the significance of differences was assessed by two-way analysis of variance (anova), and p < . was considered indicative of statistical significance. . . hbd -conjugated ag stimulates the nod signaling procedure, which leads to type i ifn production in macrophage-like thp- cells mers-cov infection inhibits the production of ifn-α/β and the host antiviral immune response . we evaluated the influence of mers-cov infection of thp- cells on the innate immune response (fig. ) . the expression level of the upe gene was assessed by quantitative real-time rt-pcr at h post-infection of thp- cells with , , and plaque-forming units (pfu) of mers-cov (fig. a) . importantly, mers-cov infection dose-dependently decreased the expression of factors critical in antiviral innate immunity, such as nod , ifn-α, and ifn-β ( fig. b-d) . to analyze whether hbd affects immune induction, we determined the expression levels of the ifn-β, nod , and tumor necrosis factor (tnf)-α genes in thp- cells treated with s rbd or s rbd-hbd ( fig. ) . s rbd-hbd significantly (p < . ) enhanced the expression of ifn-β and nod by . -and . -fold, respectively, compared to s rbd alone-treated thp- cells ( fig. a and b) . the expression of tnfα, an nf-κb-dependent proinflammatory cytokine, was increased in s rbd-hbd -treated cells compared with that in s rbd alone-treated cells, albeit not significantly so (fig. c) . 
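the fold changes reported above are expression levels normalized to hactb and expressed relative to a reference condition. the text only states that normalization was done in the instrument software, so the sketch below assumes the widely used 2^-ΔΔCt calculation with 100% amplification efficiency; the function names and ct values are hypothetical and are not data from the study.

```python
import statistics


def relative_expression(ct_target_treated, ct_ref_treated,
                        ct_target_control, ct_ref_control):
    """Fold change by the 2^-ΔΔCt method (assumes 100% PCR efficiency).

    ΔCt = Ct(target gene) - Ct(reference gene, e.g. hactb) per sample;
    ΔΔCt = ΔCt(treated) - ΔCt(control).
    """
    delta_ct_treated = ct_target_treated - ct_ref_treated
    delta_ct_control = ct_target_control - ct_ref_control
    return 2.0 ** -(delta_ct_treated - delta_ct_control)


def mean_sd(values):
    """Return (mean, sample standard deviation) of replicate values."""
    return statistics.mean(values), statistics.stdev(values)


if __name__ == "__main__":
    # Made-up Ct values for two replicate wells of one treated sample.
    folds = [
        relative_expression(24.0, 18.0, 26.0, 18.0),
        relative_expression(25.0, 18.0, 26.0, 18.0),
    ]
    print(folds, mean_sd(folds))
```

a lower ct for the target gene in treated cells, at the same reference ct, gives a fold change above 1; replicate fold changes can then be summarized as mean ± sd, as in the figure legends.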
we next assessed the levels of the nod -associated signal transducing mediators, receptor interacting protein- (rip ) and irf , in s rbd- or s rbd-hbd -treated thp- cells (fig. ). as expected, treatment with s rbd-hbd remarkably upregulated the phosphorylation of rip and irf and the nod protein level. in addition, hbd -conjugated s rbd reduced the accumulation of iκbα, whose degradation is required for nf-κb activation (fig. c). therefore, hbd -conjugated ag treatment induces the production of the type i ifn, ifn-β, and the proinflammatory cytokine, tnf-α, through a nod -associated rip response by activating the transcription factors irf and nf-κb.
. . hbd -conjugated ag-mediated induction of a type i ifn response was mediated by ccr signaling
β-defensin-fused proteins retain their antibacterial and chemotactic activity for c-c chemokine receptor (ccr )-expressing cells, as do monocytes that do not express ccr (röhrl et al., a). in addition, the β-defensin-conjugated proteins induced ccr -specific chemotaxis on ccr -transfected hek cells, human peripheral blood monocytes, and mouse peritoneal exudate cells in a dose-dependent manner (röhrl et al., b). therefore, we determined whether hbd binds to ccr rather than ccr (fig. ) by immunofluorescence assay in thp- cells (fig. a ). ccr and s rbd-hbd co-localized, suggesting a direct interaction, but no such co-localization with ccr was observed for s rbd alone (data not shown). we next evaluated whether the interaction of hbd with ccr mediates intracellular signaling. interestingly, the significantly (p < . ) enhanced ifn-β expression by s rbd-hbd was abrogated by the ccr antagonist, rs , (fig. b). moreover, the nod expression and nod -associated rip and irf expression that were significantly (p < . ) enhanced by s rbd-hbd were abrogated by the ccr antagonist ( fig. c-e), as was tnf-α, whose expression is dependent on nf-κb activation (fig. f).
thus, hbd interacts directly with ccr , which promotes signal transduction, activation of type i ifn, and an inflammatory response. we next focused on endosomal and/or cytoplasmic signal transmediators involved in hbd -mediated activation of primary antiviral responses. in our previous report, a proinflammatory cytokine response, which requires the activation of nf-κb, was induced by hbd (kim et al., ) . additionally, early stimulation of an innate immune response is dependent on toll-like receptor (tlr) and/or nod -triggered nf-κb signaling (tsai et al., ) . we thus determined whether the hbd -mediated enhancement is due to nod -associated signaling because nod expression was enhanced by s rbd-hbd (fig. ) . s rbd-hbd co-localized with nod , suggesting an interaction (fig. a) , but no such co-localization with nod occurred for s rbd without hbd conjugation (data not shown). consequently, we speculated that nod interacts with s rbd-hbd and that nod -mediated signaling is involved in hbd -mediated activation of nf-κb. we evaluated the influence of nod signaling on hbd -mediated type i ifn and proinflammatory responses using the inhibitor of nod signaling, gsk . gsk abrogated the hbd -mediated (p < . ) enhanced expression of ifn-β, rip , and irf ( fig. b-d) . additionally, a nod inhibitor abrogated the enhanced expression of tnf-α in hbd -treated thp- cells (fig. e ). these observations suggest that nod functions as an intracellular signal transmediator in hbd -induced activation of type i ifn production and the inflammatory response. macrophages can differentiate into m (classically activated) or m (alternatively activated) macrophages. classical m -type macrophages are key effector cells for the elimination of pathogens, virus-infected cells, and malignant cells, while m -type macrophages exhibit anti-inflammatory and tissue repair activities (gordon and martinez, ) . 
nf-κb signaling is an intracellular proinflammatory pathway (hoesel and schmid, ) and activates m -type macrophage differentiation (saijo and glass, ) . consequently, we analyzed the expression of marker genes of m and m macrophages in s rbd-hbd -treated thp- cells by quantitative real-time rt-pcr (fig. ) . the expression of cd and cd was significantly (p < . ) enhanced by the hbd conjugate, suggesting differentiation into m -type macrophage cells. by contrast, there was no significant difference in the expression of m marker genes (cd and cd ) between cells treated with s rbd and s rbd-hbd (data not shown). to identify the signaling pathways involved in hbd -mediated activation and polarization of macrophages, we evaluated the effect of a ccr inhibitor, rs , , and nod inhibitor, gsk , on cells treated with s rbd with or without hbd conjugation. expression of the m -type macrophage marker genes, cd and cd , was markedly downregulated by the ccr and nod inhibitors compared to the control ( fig. a and b) . interestingly, expression of cd , a marker of early activation of macrophages, was stably upregulated in both s rbd alone-and s rbd-hbd -treated thp- cells, while cd expression was markedly downregulated by pretreatment with a ccr or nod inhibitor, although the ccr inhibitor did not completely reverse the hbd -mediated enhanced expression of cd (fig. c) . these results demonstrate that ccr -mediated activation of nod signaling pathway by s rbd-hbd is associated with the activation and m polarization of macrophage-like thp- cells. innate immunity is the first line of defense against exogenous and endogenous threats, including pathogen infection and tissue damage. innate immunity not only precedes the ag-specific adaptive immune response but also enables a long-lasting memory response by innate agpresenting cells (apcs), which interact with adaptive immune cells. 
innate immune cells, such as macrophages and mast cells, recruited into infected sites produce a wide range of cytokines that regulate the balance between pro-and anti-inflammatory responses and so maintain immunological homeostasis (gallenga et al., ) . cytokines have pleiotropic effects on the functions of immune cells and immune responses that constitute the host defense against infectious agents. for instance, members of the il- family, which are produced by macrophages and mast cells, are important regulators of the innate immune response and play a role in inflammatory processes (varvara et al., ) . several il- family cytokines are pro-inflammatory, while others, including il- , il- , il- ra, and il- ra, are anti-inflammatory . in addition, il- indirectly participates in t-lymphocyte-mediated immunity by inducing helper type t-cell polarization and the formation of abs by plasma cells by producing il- (gallenga et al., ) . these findings suggest that further studies on modulation of the balance between pro-and anti-inflammatory cytokines and network among innate immune cells, such as m /m macrophage polarization, would facilitate the development of novel therapeutic approaches for immunological disorders. macrophages, dcs, neutrophils, natural killer cells, and innate lymphoid cells play major roles in pathogen recognition through specialized receptors such as prrs (akira et al., ; shim and lee, ; seo et al., ) . viral infection activates danger signals which are transmitted via prrs, including tlrs, nucleotide-binding oligomerization domain-like receptors (nlrs), retinoic acid-inducible gene -like receptors (rlrs), c-type lectin receptors, cytosolic dna sensors, and inflammasome signaling cascades. cross-talk between prrs and to that of hactb were analyzed by quantitative real-time rt-pcr in duplicate. 
the resulting values normalized to that of hactb using the level in the cells treated with pfu of mers-cov as a reference basal expression value are shown as means ± sd. *p < . , **p < . , and ***p < . . activation of these signaling cascades induces an antiviral immune response by upregulating the expression of antiviral cytokines, including type i ifns. however, viruses can evade the antiviral function of ifn by inhibiting ifn production and signal transduction (ferran and skuse, ) . type i ifns, mainly ifn-α and ifn-β, are major effector cytokines in the innate antiviral response (gonzález-navajas et al., ) and their encoding genes are regulated by several transcription factors, including nf-κb and irf (seth et al., ) . upon virus infection, irf is phosphorylated, dimerizes, and enters the nucleus to upregulate the expression of type i ifn, melanoma differentiation-associated protein (mda ), and cytoplasmic retinoic acid-inducible gene i (rig-i), leading to activation of nf-κb and irf (kato et al., ) . the hosts react to infection by mounting a primary response involving inflammation, followed by a pathogen-specific adaptive response. although inflammation is a double-edged response, it is an important mechanism of protective innate immunity against infection by viruses, bacteria, fungi, prions, and parasites. inflammatory monocyte-derived macrophages and innate immune cells are rapidly recruited to inflamed sites, where they remove harmful stimuli and induce t-cell responses by ifn-dependent mechanisms (ginhoux et al., ) . we previously reported that hbd promotes the antiviral innate immune response in thp- cells and the ability of an hbd -conjugated fig. . hbd -conjugated s rbd ag treatment of thp- cells stimulates the expression of genes involved in innate immunity and antiviral responses. 
thp- cells were stimulated with μg/ml recombinant s-rbd or hbd -conjugated s rbd for h and the expression of the indicated genes was analyzed by quantitative real-time rt-pcr in duplicate, with normalization to the expression of the internal control (hactb). expression levels relative to those of pbstreated control cells are shown as means ± sd. **p < . . fig. . hbd -conjugated s rbd ag treatment of thp- cells stimulates the expression and activation of genes related to nod -mediated innate immune signaling. thp- cells were stimulated with μg/ml recombinant s rbd or hbd -conjugated s rbd for the indicated periods. cell lysates were prepared and immunoblotted with the indicated abs. β-actin was used as the loading control. mers-cov ag to elicit a greater ag-specific and mers-cov neutralizing ab response compared to hbd non-conjugated ag in vivo (kim et al., ) . also, immunization with s rbd-hbd prior to viral infection enhanced the humoral and protective immune response to mers-cov infection in human dipeptidyl peptidase (hdpp )-expressing mice, a model of mers-cov infection (data not shown). in addition, the type i ifn response, the expression of primary antiviral molecules including nod , a cytoplasmic viral prr that activates irf , and production of ifn-β were enhanced by hbd treatment of thp- cells. these cells are widely used as an in vitro model for studies of human macrophages involved in the inflammatory response and immunological homeostasis (ginhoux et al., ) . here, we investigated the immunomodulatory ability of hbd and the mechanism by which hbd induces production of type i ifn and an inflammatory response in thp- cells. β-defensins exert regulatory activity in host innate and adaptive immune responses. for example, mouse β-defensin activates immature dcs via tlr , triggering a th response, and human β-defensin activates apcs via tlr and tlr in an nf-κb-dependent manner (funderburg et al., ) . 
additionally, it was suggested that β-defensin is an endogenous ligand for tlr and shares a signal transduction pathway with other tlr ligands. we evaluated the possible interaction between hbd and tlr as well as nod and found that hbd -conjugated ag co-localized with nod ( fig. a) but not with tlr (data not shown). moreover, inhibition of nod signaling in thp- cells abrogated the hbd -mediated enhanced ifn-β expression by suppressing rip and irf signaling as well as tnf-α expression (fig. b-e) . human and mouse β-defensins induce ccr -and ccr dependent chemotaxis (röhrl et al., b) and ccr and ccr recruit fig. . hbd -conjugated s rbd co-localizes with ccr and the interaction between hbd -conjugated ag and ccr induces nod signaling. (a) thp- cells treated with μg/ml hbd -conjugated s rbd were subjected to immunofluorescence assay using monoclonal abs against s rbd and ccr and visualized by clsm. dapi-stained nuclei, blue; s rbd signal, green (alexa fluor -coupled secondary ab); ccr signal, red (alexa fluor -coupled secondary ab). representative fields are shown at × magnification. (b-f) thp- cells were stimulated with μg/ml recombinant s rbd protein with or without hbd at h or h after treatment with or without rs , (ccr antagonist) and their expression, together with that of the internal control gene hactb, was analyzed by qrt-pcr in duplicate. expression levels relative to those of the pbs-treated control are shown as means ± sd. *p < . and **p < . . (for interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article). professional apcs to inflamed tissues and initiate an adaptive immune response (osterholzer et al., ) . ccr is expressed on various types of myeloid cells, including monocytes and neutrophils, which are crucial for innate immunity and phagocytosis (iida et al., ) . 
we found that hbd -conjugated ag co-localized with ccr and contributed to nod -mediated signaling, leading to the activation of type i ifn production and an inflammatory response in thp- cells. consequently, the mechanism underlying hbd -induced ccr -mediated signaling should be investigated further. macrophages help clear infectious cells by internalizing and degrading pathogens. during infection with influenza a, the phagocytic capacity of mouse peritoneal exudate macrophages is enhanced by coculture with virus-infected epithelial cells (fujimoto et al., ) , and decreased phagocytic uptake of opsonized influenza a virus is correlated with decreased cell surface expression of cd and cd , which are highly expressed by classically activated m macrophages. also, expression of cd and cd is decreased in macrophages infected with viruses capable of replicating productively in them (marvin et al., ) . m macrophages exert a proinflammatory effect, present ag, perform phagocytosis, produce tnf-α and il- β, and express cd and cd on their surface. by contrast, m macrophages are responsible for tissue repair and wound healing, produce il- and tgf-β, and express arginase- and cd (gordon and martinez, ) . we report here that hbd induces the expression of m markers by activating ccr mediated nod signaling pathway in thp- cells (fig. ) . although further studies are needed, these results provide insight into the mechanism by which hbd induces antiviral innate and ag-specific adaptive immune responses. there are no competing financial interests in this study. . hbd -conjugated s rbd co-localizes with nod and the interaction between hbd conjugated ag and nod induces signaling involved in innate antiviral immune responses. (a) thp- cells treated with μg/ml hbd conjugated s rbd were subjected to immunofluorescence assay using monoclonal abs against s rbd and nod and visualized by clsm. 
dapi-stained nuclei, blue; s rbd signal, green (alexa fluor -coupled secondary ab); nod signal, red (alexa fluor -coupled secondary ab). representative fields are shown at × magnification. (b-e) thp- cells were stimulated with μg/ml recombinant s rbd protein with or without hbd at h or h after treatment with or without gsk (nod antagonist), and their expression, together with that of the internal control gene hactb, was analyzed by qrt-pcr in duplicate. expression levels relative to those of the pbstreated control are shown as means ± sd. *p < . and **p < . . (for interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article). the bk plus program in the department of bioactive material sciences. confocal laser scanning microscopy was performed using the instruments installed in the center for university-wide research facilities (curf) at chonbuk national university. pathogen recognition and innate immunity pulmonary defences to acute respiratory infection the identification of markers of macrophage differentiation in pma-stimulated thp- cells and monocyte-derived macrophages evasion of host innate immunity by emerging viruses: antagonizing host rig-i pathways virus clearance through apoptosis-dependent phagocytosis of influenza a virus-infected cells by macrophages human β-defensin- activates professional antigenpresenting cells via toll-like receptors and interleukin- family cytokines and mast cells: activation and inhibition new insights into the multidimensional concept of macrophage ontogeny, activation and function immunomodulatory functions of type i interferons alternative activation of macrophages: mechanism and functions the interferon response circuit: induction and suppression by pathogenic viruses the complexity of nf-κb signaling in inflammation and cancer identification of ccr , flotillin, and gp b genes as new g-csf targets during neutrophilic differentiation host innate immune receptors 
and beyond: making sense of microbial infections rig-i-like receptors: cytoplasmic sensors for non-self rna human β-defensin plays a regulatory role in innate antiviral immunity and is capable of potentiating the induction of antigen-specific immunity impact of mold on mast cell-cytokine immune response human beta defensin selectively inhibits hiv- in highly permissive ccr +cd + t cells regulation of cellular antiviral signaling by modifications of ubiquitin and ubiquitin-like molecules absence of ccr inhibits cd + regulatory tcell development and m-cell formation inside peyer's patches searching for an ideal vaccine candidate among different mers coronavirus receptor-binding fragments-the importance of immunofocusing in subunit vaccine design analysis of in vivo dynamics of influenza virus infection in mice using a gfp reporter virus influenza virus overcomes cellular blocks to productively replicate, impacting macrophage function cc chemokine receptor expression by b lymphocytes is essential for the development of isolated lymphoid follicles middle east respiratory syndrome coronavirus accessory protein a is a type i interferon antagonist ccr and ccr , but not endothelial selectins, mediate the accumulation of immature dendritic cells within the lungs of mice in response to particulate antigen immunopathogenesis of coronavirus infections: implications for sars thp- cells were stimulated with μg/ml recombinant s rbd protein with or without hbd at h after treatment with rs , (ccr antagonist) or gsk (nod antagonist), and their expression, together with that of the internal control gene hactb, was analyzed by qrt-pcr in duplicate. 
expression levels relative to those of the pbs-treated control are shown as means ± sd specific binding and chemotactic activity of mbd and its functional orthologue hbd to ccr -expressing cells human beta-defensin and and their mouse orthologs induce chemotaxis through interaction with ccr activation of innate immune antiviral responses by nod microglial cell origin and phenotypes in health and disease dectin- stimulation selectively reinforces lps-driven igg production by mouse b cells identification and characterization of mavs, a mitochondrial antiviral signaling protein that activates nf-κb and irf caspase- independent viral clearance and adaptive immunity against mucosal respiratory syncytial virus infection modulation of the immune response by middle east respiratory syndrome coronavirus cystic fibrosis airway epithelia fail to kill bacteria because of abnormal airway surface fluid epithelial antibiotic induced in states of disease dual roles of nod in tlr -mediated signal transduction and -induced inflammatory gene expression in macrophages stimulated mast cells release inflammatory cytokines: potential suppression and therapeutical aspects protection from influenza a virus infection by modulating nucleotide-binding oligomerization domain containing (nod ) signaling a novel peptide with potent and broad-spectrum antiviral activities against multiple respiratory viruses human cell tropism and innate immune system interactions of human respiratory coronavirus emc compared to those of severe acute respiratory syndrome coronavirus key: cord- - zr b authors: ravichandran, supriya; coyle, elizabeth m.; klenow, laura; tang, juanjie; grubbs, gabrielle; liu, shufeng; wang, tony; golding, hana; khurana, surender title: antibody repertoire induced by sars-cov- spike protein immunogens date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: zr b multiple vaccine candidates against sars-cov- based on viral spike protein are under development. 
however, there is limited information on the quality of antibody response generated following vaccination by these vaccine modalities. to better understand antibody response induced by spike protein-based vaccines, we immunized rabbits with various sars-cov- spike protein antigens: s-ectodomain (s +s ) (aa - ), which lacks the cytoplasmic and transmembrane domains (ct-tm), the s domain (aa - ), the receptor-binding domain (rbd) (aa - ), and the s domain (aa - as control). antibody response was analyzed by elisa, surface plasmon resonance (spr) against different spike proteins in native conformation, and a pseudovirion neutralization assay to measure the quality and function of the antibodies elicited by the different spike antigens. all three antigens (s +s ectodomain, s domain, and rbd) generated strong neutralizing antibodies against sars-cov- . vaccination induced antibody repertoire was analyzed by sars-cov- spike genome fragment phage display libraries (sars-cov- gfpdl), which identified immunodominant epitopes in the s , s -rbd and s domains. furthermore, these analyses demonstrated that surprisingly the rbd immunogen elicited a higher antibody titer with -fold higher affinity antibodies to native spike antigens compared with other spike antigens. these findings may help guide rational vaccine design and facilitate development and evaluation of effective therapeutics and vaccines against covid- disease. one sentence summary sars-cov- spike induced immune response the ongoing pandemic of sars-cov- has resulted in more than million human cases and , deaths as of th april . therefore, development of effective vaccines for commercially available sars-cov- spike protein and subdomains: the spike s +s ectodomain (aa - ), the s domain (aa - ), rbd domain (aa - ), and the s domain (aa - ) as a control, which is devoid of rbd (fig. a, suppl. fig. ). 
these spike proteins were either produced in hek mammalian cells (s and rbd) or insect cells (s +s ectodomain and s domain). the purified s +s ectodomain, the s domain, and the rbd proteins retained functional activity, as demonstrated in an spr assay using human ace protein, the sars-cov- receptor (fig. b). the s +s ectodomain, s domain and rbd (black, blue and red binding curves, respectively) demonstrated high-affinity interaction with human ace . the control s domain protein (green curve), lacking the rbd, did not bind to human ace , demonstrating the specificity of this receptor-binding assay (fig. b). female new zealand white rabbits were immunized twice intramuscularly at a -day interval with g of the purified proteins mixed with emulsigen adjuvant. sera were collected before (pre-vaccination) and after the first and second vaccination and analyzed for binding antibodies in elisa and spr, in a pseudovirion neutralization assay, and by gfpdl analysis. post-vaccination sera were evaluated for igg binding to various spike proteins and domains in elisa (s +s ; black, s ; blue, rbd; red, and s ; green) (fig. c). representative titration curves to spike ectodomain (s +s ) and to the rbd in igg-elisa are shown in suppl. fig. . end-point titers of the serum igg were determined as the reciprocal of the highest dilution providing an optical density (od) twice that of the negative control (fig. c). all four immunogens elicited strong igg binding to the spike ectodomain (s +s ). binding to the individual domains (s , s , and rbd) was specific, in that sera generated by s vaccination bound to s , but not to s or rbd, and vice versa (fig. c). spr allows antibody binding to captured antigens to be followed in real time, including total antibody binding in resonance units (max ru) and affinity kinetics (suppl. fig. ).
in elisa, the antigens directly coated in the wells can be partially denatured increasing the likelihood of presenting epitopes that are not seen on the native form of the protein by the polyclonal serum igg. on the other hand, in our spr, the purified recombinant spike proteins were captured to a ni-nta sensor chip to maintain the native conformation (as determined by ace binding) to allow comparisons of binding to and dissociation from the four proteins. importantly, the protein density captured on the chip surface is low ( ru) and was optimized to measure primarily monovalent interactions, so as to measure the average affinity of antibody binding in the polyclonal serum ( , ). additionally, while elisa measured only igg binding, in spr, all antibody isotypes contributed to antibody binding to the captured spike antigen. in the current study, all rabbit sera contained anti spike antibodies that were at least % igg (data not shown). serial dilutions of post-vaccination serum were analyzed for binding kinetics with different spike proteins (suppl. fig. ). the spike ectodomain (s +s ) generated antibodies that predominantly bound to s +s (black bar), followed by the s protein (blue bar), and -fold lower antibody binding to the rbd and the s domain (red and green bars, respectively) (fig. d ). the s domain antigen induced antibodies that bound with similar titers (max ru values) to the s +s , s and rbd proteins (black, blue and red bars, respectively), and did not show reactivity to the s domain (green bar). however, the antibody reactivity of rabbit anti-s serum to s +s domain was -fold lower than the antibodies in the rabbit anti-s +s serum. rbd immunization generated similar high-titer antibody binding to s +s , s and rbd (black, blue and red bars, respectively), (fig. d) . 
in contrast, the s domain induced antibodies that primarily bound to homologous s antigen (green bars) and only weakly binding to the s +s ectodomain (black bars), and no binding to either s or rbd (fig. d ). antibody off-rate constants, which describe the fraction of antigen-antibody complexes that decay per second, were determined directly from the serum sample interaction with sars- cov- spike ectodomain (s +s ), s , s , and rbd using spr in the dissociation phase only for sensorgrams with max ru in the range of - ru (suppl. fig. ) and calculated using the biorad proteon manager software for the heterogeneous sample model as described before( ). these off rates provide additional important information on the affinity of the antibodies following vaccination with the different spike proteins that are likely to have an impact on the antibody function in vivo, as was observed previously in studies with influenza virus, rsv and ebola virus ( - ). surprisingly, we observed significant differences in the affinities of antibodies elicited by the four spike antigens (fig. e) . specifically, the rbd induced -fold higher affinity antibodies (slower dissociation rates) against s +s (black), s (blue) and rbd (red) proteins, compared with the post-vaccination antibodies generated by other three immunogens (fig. e ). this region may not be highly exposed on the virions or infected cells but is clearly immunogenic in the soluble recombinant spike ectodomain. in addition, the rabbit anti-s +s antibodies bound diverse epitopes spanning the rbd and to a lesser degree to the n-terminal domain (ntd) and the c-terminal region of s , and the n-terminus of s , including the fusion peptide ( fig. b and suppl. table ). the s domain elicited very strong response against the c-terminal region of s protein and a diverse antibody repertoire recognizing the ntd and rbd/rbm regions ( fig. c and suppl. table ). 
the recombinant rbd induced high-titer antibodies that were highly focused to the rbd/rbm (fig. e , and suppl. table ). in contrast, the recombinant s immunogen after two immunizations in rabbits elicited antibodies primarily targeting the c-terminus of the s protein (cd-hr ). . table ). structural depiction of these antigenic sites on the sars-cov- spike (suppl. table ). the other epitopes identified in our study cover less conserved sequences between the two sars-cov viruses that are unique to the sars-cov- spike and were not identified in the in-silico approach by grifoni et al. surprisingly, the s domain doesn't appear to elicit as many neutralizing antibodies as rbd or s . although s contains the fusion peptide, it does not appear to be as immunogenic, compared with s or rbd, in generating binding antibodies to the intact spike (s +s ) ectodomain, as observed in both igg elisa and spr. even though we characterized the purified proteins in various assays, there is a possibility that the structure of the antigens used in the study is different from the corresponding authentic spike protein on the surface of sars-cov- virion particle. one unexpected finding in this study was the higher affinity of antibodies elicited by the rbd compared with the other spike antigens (s +s ectodomain, s and s domains). in earlier anti-spike reactivity of post-immunization rabbit sera. serial dilutions of post-second vaccination rabbit sera were evaluated for binding to various spike proteins and domains (s +s ; black, s ; blue, rbd; red, and s ; green) in elisa. representative titration curves are shown in fig. s . 
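the elisa end-point titer rule described in this study (the reciprocal of the highest serum dilution whose od is still at or above a multiple of the negative-control od) can be sketched in a few lines of python. this is a minimal illustration, not the authors' analysis script: the function name `endpoint_titer`, the `fold` parameter (defaulting to 2.0 to match "twice that of the negative control"), and the example readings are all assumptions made for illustration.

```python
def endpoint_titer(dilutions, od_values, negative_od, fold=2.0):
    """Return the reciprocal end-point titer, or None if no well is positive.

    dilutions   -- reciprocal dilution factors (e.g. 100 means 1:100)
    od_values   -- optical density reading for each dilution
    negative_od -- OD of the negative control (e.g. naive serum average)
    fold        -- cutoff multiplier (assumed 2.0; hypothetical default)
    """
    if len(dilutions) != len(od_values):
        raise ValueError("dilutions and od_values must have the same length")

    cutoff = fold * negative_od
    titer = None
    # Walk the series from the most concentrated to the most dilute well
    # and stop at the first reading that falls below the cutoff.
    for dilution, od in sorted(zip(dilutions, od_values)):
        if od >= cutoff:
            titer = dilution
        else:
            break
    return titer


# Illustrative (made-up) four-fold dilution series.
example_dilutions = [100, 400, 1600, 6400, 25600]
example_ods = [2.1, 1.5, 0.8, 0.25, 0.08]
example_titer = endpoint_titer(example_dilutions, example_ods, negative_od=0.1)
```

stopping at the first sub-cutoff well, rather than taking the maximum positive dilution anywhere in the series, is a conservative design choice: an isolated high reading at a very dilute well is more likely a plate artifact than real binding.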
to spike protein and domains from sars-cov- (s +s ; black, s ; blue, rbd; red, and s ; (e) antibody off-rate constants, which describe the fraction of antigen-antibody complexes that decay per second, were determined directly from the serum sample interaction with sars-cov- spike ectodomain (s +s ), s , s , and rbd using spr in the dissociation phase only for the recombinant sars-cov- proteins were purchased from sino biological (s +s ectodomain; -v b , s ; -v h, rbd; -v h or s ; -v b). recombinant purified proteins used in the study were either produced in hek mammalian cells (s and rbd) or insect cells (s +s ectodomain and s domain). female new zealand white rabbits (charles river labs) were immunized twice intramuscularly at a -day interval with g of purified proteins mixed with emulsigen adjuvant. sera were collected before (pre-vaccination) and after the first and second vaccination and analyzed for binding antibodies in elisa, spr, neutralization assay and gfpdl analysis. hr, plates were washed as before and opd was added for min. absorbance was measured at nm. the end titer cutoff was set at -fold above the average of the absorbance values of the naïve serum samples, and the end titer is reported as the last serum dilution that was above this cutoff. proteon manager software (version . ). all spr experiments were performed twice and the researchers performing the assay were blinded to sample identity. in these optimized spr conditions, the variation for each sample in duplicate spr runs was < %. the maximum resonance unit (max ru) values shown in the figures are the ru signals for the -fold diluted serum samples.
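an off-rate constant of the kind described above (the fraction of antigen-antibody complexes that decay per second) can be estimated from the dissociation phase of a sensorgram. the study used the biorad proteon manager software with a heterogeneous sample model; the python sketch below is not that model. it is a simplified stand-in that assumes a single-exponential decay, ru(t) = ru(0) · exp(−k_off · t), and fits the slope of ln(ru) against time by ordinary least squares. the function name `fit_off_rate` and the synthetic data are hypothetical.

```python
import math


def fit_off_rate(times_s, response_ru):
    """Estimate k_off (1/s) from dissociation-phase data.

    Assumes single-exponential decay, so ln(RU) is linear in time
    and k_off is the negative of the least-squares slope.
    """
    if len(times_s) != len(response_ru) or len(times_s) < 2:
        raise ValueError("need at least two matched time/response points")
    if any(r <= 0 for r in response_ru):
        raise ValueError("responses must be positive to take logarithms")

    log_ru = [math.log(r) for r in response_ru]
    n = len(times_s)
    t_mean = sum(times_s) / n
    y_mean = sum(log_ru) / n

    # Ordinary least-squares slope of ln(RU) versus time.
    sxx = sum((t - t_mean) ** 2 for t in times_s)
    if sxx == 0:
        raise ValueError("time points must not all be identical")
    sxy = sum((t - t_mean) * (y - y_mean) for t, y in zip(times_s, log_ru))
    return -sxy / sxx


# Synthetic dissociation curve with a known (made-up) off-rate.
true_k_off = 0.002  # 1/s
example_times = list(range(0, 300, 10))
example_ru = [100.0 * math.exp(-true_k_off * t) for t in example_times]
example_k_off = fit_off_rate(example_times, example_ru)
```

a slower off-rate (smaller k_off) corresponds to a longer-lived complex, which is how the text interprets higher antibody affinity. polyclonal sera contain mixtures of antibodies, so a single-exponential fit is only a rough summary of the average behavior.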
antibody off-rate constants, which describe the fraction of antigen-antibody complexes that decay per second, are determined directly from the serum/ sample interaction with sars cov- spike ectodomain (s +s ), s , s , and rbd using spr in the dissociation phase only for the sensorgrams with max ru in the range of - ru and calculated using the biorad proteon manager software for the heterogeneous sample model as described before( ). off-rate constants were determined from two independent spr runs. the datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request. a crucial role of angiotensin converting enzyme (ace ) in sars coronavirus-induced lung injury cryo-em structure of the -ncov spike in the prefusion conformation structural basis for the recognition of sars-cov- by full-length human ace structure, function, and antigenicity of the sars-cov- spike glycoprotein sars-cov- vaccines: status report. immunity antigenic fingerprinting of h n avian influenza using convalescent sera and monoclonal antibodies reveals potential vaccine and diagnostic targets vaccines with mf adjuvant expand the antibody repertoire to target protective sites of pandemic avian h n influenza virus human antibody repertoire after vsv-ebola vaccination identifies novel targets and virus-neutralizing igm antibodies antigenic fingerprinting following primary rsv infection in young children identifies novel antigenic sites and reveals unlinked evolution of human antibody repertoires to fusion and attachment glycoproteins as -adjuvanted h n vaccine promotes antibody diversity and affinity maturation, nai titers, cross-clade h n neutralization, but not h n cross-subtype neutralization mf adjuvant enhances diversity and affinity of antibody-mediated immune response to pandemic influenza vaccines high-affinity h head and stalk domain-specific antibody responses to an inactivated influenza h n vaccine after priming with live 
attenuated influenza vaccine longitudinal human antibody repertoire against complete viral proteome from ebola virus survivor reveals protective sites for vaccine design intravenous immunoglobulin for adults with influenza a or b infection (flu-ivig): a double-blind, randomised, placebo-controlled trial antigenic fingerprinting of respiratory syncytial virus (rsv)-a-infected hematopoietic cell transplant recipients reveals importance of mucosal anti-rsv g antibodies in control of rsv infection in humans the covid- vaccine development landscape characterization of the receptor- binding domain (rbd) of novel coronavirus: implication for development of rbd protein as a viral attachment inhibitor and vaccine a sequence homology and bioinformatic approach can predict candidate targets for immune responses to sars-cov- h n-terminal beta sheet promotes oligomerization of h -ha that induces better antibody affinity maturation and enhanced protection against h n and h n viruses compared to inactivated influenza vaccine differential human antibody repertoires following zika infection and the implications for serodiagnostics and disease outcome ) with a single receptor-binding domain (rbd) in the up conformation, wherever available using ucsf chimera software. the rbd region is shaded in red (residues - ) on every structure key: cord- -f lmstyn authors: struck, anna-winona; axmann, marco; pfefferle, susanne; drosten, christian; meyer, bernd title: a hexapeptide of the receptor-binding domain of sars corona virus spike protein blocks viral entry into host cells via the human receptor ace date: - - journal: antiviral res doi: . /j.antiviral. . . sha: doc_id: cord_uid: f lmstyn in vitro infection of vero e cells by sars coronavirus (sars-cov) is blocked by hexapeptide tyr-lys-tyr-arg-tyr-leu. the peptide also inhibits proliferation of coronavirus nl . on human cells both viruses utilize angiotensin-converting enzyme (ace ) as entry receptor. 
the block of viral entry is specific, as the alphavirus sindbis shows no reduction in infectivity. peptide ( )ykyryl( ) is part of the receptor-binding domain (rbd) of the spike protein of sars-cov. peptide libraries were screened by surface plasmon resonance (spr) to identify rbd binding epitopes. ( )ykyryl( ) carries the dominant binding epitope and binds to ace with k(d) = μm. the binding mode was further characterized by saturation transfer difference (std) nmr spectroscopy and molecular dynamics simulations. based on this information, the peptide can be used as a lead structure to design potential entry inhibitors against sars-cov and related viruses. the sars-associated coronavirus (sars-cov) has been identified as the causative agent of severe acute respiratory syndrome (sars), which emerged as an alarming epidemic in the winter of - , resulting in over infected cases with approximately % deaths (ksiazek et al., ; marra et al., ; peiris et al., ; rota et al., ; who, ). sars-cov infects human host cells by an initial interaction of its spike glycoprotein (s) with the receptor on human cells, angiotensin-converting enzyme (ace ) (dimitrov, ; holmes, ; li et al., ). functional characterization of the s protein suggests that the receptor-binding domain (rbd) is located between amino acid residues and (xiao et al., ). flow cytometry indicated that amino acids - are the minimal receptor binding region of the s glycoprotein (babcock et al., ). further studies located the rbd from residues to . the rbd fused to the fc region of human igg (rbd-fc) binds ace with higher affinity (k d ~ nm) than does the full-length s -ig chimera (li et al., b; wong et al., ). the crystal structure of residues - of the s in complex with the receptor ace reveals that a loop within the rbd (residues - ) makes all the contacts to ace and is referred to as the receptor-binding motif (rbm). six tyrosine residues are involved in direct binding to the receptor (li et al., a).
studies of virus adaptation to humans indicate that the spike protein also plays a major role in the species specificity of coronavirus infection. in particular, the introduction of a threonine residue at position and of an asparagine instead of a charged lysine residue at position of the spike protein seems to be responsible for its high affinity to human ace (holmes, ; li et al., b; qu et al., ). yi et al. demonstrated that a full-length spike protein dna vaccine carrying a single amino acid substitution (r a) failed to induce neutralizing antibodies (nabs) and that the same mutation yielded pseudoviruses that were unable to enter human cells (yi et al., ). furthermore, the rbd-fc bearing the same r a mutation shows no affinity to ace and is not capable of blocking s protein-mediated pseudovirus entry (he et al., ). the interaction of sars-cov with its receptor ace is an attractive drug target, as epitopes of the rbd on the spike protein may serve as leads for the design of effective entry inhibitors (du et al., ). another drug target is the spike protein-mediated fusion with the host cell membrane, which is characterized by the presence of two heptad repeat (hr) regions, hr and hr , that are postulated to form a fusion-active conformation similar to those of other typical viral fusion proteins (sainz et al., ; van der hoek et al., ; yuan et al., ). peptides were synthesized on solid phase using an fmoc-protecting group strategy on an fmoc-pal-peg-ps resin (applied biosystems) with o-(benzotriazol- -yl)-n,n,n′,n′-tetramethyluronium tetrafluoroborate (tbtu, iris biotech) as activator. a mos x synthesizer (advanced chemtech) and a liberty microwave synthesizer (cem) were used for peptide syntheses starting with μmol amino groups each. after each coupling step the growing peptide was capped with an acetyl residue using % acetic anhydride in dmf. cysteine residues were substituted by serines to avoid dimerization.
using trifluoroacetic acid (tfa), triisopropylsilan and h o ( : : , v/v), peptides were cleaved off the resin leaving an amide at the c-terminus. the cocktail was applied twice for and min, respectively. preparative rp-hplc was carried out on a biocad e instrument (perseptive biosystems) using a h o/acetonitrile gradient ( . % tfa) on a vp / nucleodur c pyramid l column (macherey & nagel). peptides were characterized by maldi-tof mass spectrometry on a biflex iii instrument (bruker daltonics) in reflector mode using , dihydroxybenzoic acid (dhb) or a-cyano- -hydroxycinnamic acid (cca) as a matrix. peptide rbd- b (ykyryl, y -l ) and related peptides were further characterized by d-and d-nmr spectroscopy (not shown). spr studies were carried out using a biacore or biacore t instrument. for all experiments a temperature of °c, flow rate of ll/min (biacore ) or ll/min (t ), and a tbs running buffer ( mm tris, . m nacl, lm zncl at ph ) were used. the carboxymethylated sensorchip surface of a cm chip (biacore) was activated by nhs/edc followed by immobilization of rhace (r&d systems) in acetate-buffer (ph . , biacore). rhace was obtained in tbs that had to be changed for the immobilization of the enzyme to pbs containing additional lm zncl (ph . ) using a slide-a-lyzer mini unit (pierce biotechnology) with a molecular weight cut-off of at °c for at least h. sensorgrams of rbd- and rbd- were recorded with a chip that had fmol of ace immobilized, that of rbd- on a chip with fmol, those of rbd- b and of the rbd- b related peptide library on a chip with fmol, respectively. carboxyl groups of the activated chip surface that had not reacted with the protein were capped with ethanolamine (biacore). . . saturation transfer difference (std) nmr spectroscopy . . . sample preparation nmr samples were prepared in deuterated tris-buffered saline (d-tbs) containing mm perdeuterotris(hydroxymethyl)aminomethane (tris-d ), . m nacl and lm zncl (ph . ) in deuterium oxide (d o, . %). 
tbs of the commercial rhace (r&d systems) was changed to d-tbs in slide-a-lyzer mini units (pierce biotechnology) with a molecular weight cut-off of twice for at least h at °c. ykyryl was added from mm stock solution in d-tbs with sample volume adding up to ll in a mm shigemi nmr micro tube with c(ace ) = . lm and peptide concentration between . and lm ( - fold excess over ace ). all std spectra were recorded at a temperature of k with a spectral width of ppm on a bruker avance drx mhz spectrometer equipped with a mm inverse triple resonance cryoprobe. selective saturation of the protein was achieved by a train of gauss-shaped pulses of ms length each, truncated at %, and separated by a ms delay leading to a total length of saturation time of s. the on-resonance irradiation of the protein was performed at a chemical shift of À . ppm. off-resonance irradiation was set at ppm. total scan number in the std experiments was . nmr spectra were multiplied by an exponential linebroadening function of . hz prior to fourier transformation. water suppression was achieved by an excitation sculpting pulse sequence. spectra processing was performed using topspin . software (bruker). the sars-cov inhibition assay was performed as described previously (vassilatis et al., ) . in brief, vero cells in -well plates were infected in the biosafety level laboratory (bni hamburg) with sars-cov (frankfurt isolate) at a multiplicity of infection (moi) of . . the inoculum was removed after h and replaced with fresh medium complemented with different concentrations of compound. the virus rna concentration in the supernatant was measured by real-time pcr after days. rna was prepared from ll supernatant using diatomaceous silica (pfaff et al., ) . quantitative real-time reverse transcription-pcr (rt-pcr) was performed with the purified rna according to a published protocol . 
in vitro transcripts of the target region were used in the pcr to generate standard curves for quantification of the virus rna. the md simulations were carried out with the software maestro/desmond on an hp z workstation (one quad-core cpu), using the opls_aa/ force field. the starting structure was placed in a water box with orthorhombic boundary conditions and a salt concentration of mmol/l (spc solvent model, , water molecules, × × Å). md simulations over , ps were performed to equilibrate the system at k. the simulation in equilibrium was performed over ps at k, with the nose-hoover thermostat method and a relaxation time of . ps. the recording interval was . ps. before the md simulations were started, the system was minimized three times over steps each. the period of the md after equilibration with constant potential energy was used for the analysis.

table . synthetic peptide library of sixteen mer peptides comprising rbd residues n -t of the sars-cov spike protein. the peptides rbd- , rbd- and rbd- show binding to ace . columns: residues of the spike protein (s); amino acid sequence.

table . synthetic peptide library of fourteen mer peptides comprising rbd residues n -e and a -s of the sars-cov spike protein. the k d of rbd- b (ykyryl) is μm. the on- and off-rates are . × m⁻¹ s⁻¹ and . s⁻¹, respectively. peptide rbd- c (rylrhg) shows k d = μm, k on = . × m⁻¹ s⁻¹ and k off = . s⁻¹. columns: residues of the spike protein (s); amino acid sequence.

a library of linear peptides was synthesized via solid phase synthesis using the fmoc strategy. it contained sixteen mer rbd peptides (rbd- to rbd- ) which together comprised residues n -t (cf. table ). cysteine residues were substituted by serine to avoid dimerization of the peptides. the compounds were used to identify binding motifs in interaction with the human receptor ace by surface plasmon resonance (spr) binding studies. this method allows the determination of the binding specificity, as well as the association and dissociation rates of ligands interacting with protein receptors. the receptor protein ace was immobilized on the sensor chip and the peptides were passed over the sensor surface. the affinity of the interaction is determined from the level of binding at equilibrium as a function of sample concentration and can also be determined from analysis of the binding kinetics. spr screening of the sixteen mer rbd peptides resulted in positive responses for three peptides, i.e. rbd- (y -r -ykyrylrhgklr), rbd- (s -t -sywplndygfyt) and rbd- (t -v -ttgigyqpyrvv). all other rbd peptides showed no or insignificantly small response signals and were thus considered not to interact with ace . to analyze whether the interaction of the peptides rbd- , rbd- and rbd- is based on a specific binding event, the spr sensorgrams of each compound were recorded at several different concentrations. the signal at equilibrium, measured in response units [ru] and obtained from the sensorgrams, was plotted against the concentration of ligand passed over the sensor surface (cf. fig ). assuming a one-site binding model, the specific saturable binding affinity of the ligand peptides is calculated as k d = ± μm for rbd- , k d = ± μm for rbd- and k d = ± μm for rbd- . the spr sensorgrams were also used to determine kinetic parameters of binding, the association rate constant k on [s⁻¹ m⁻¹] and the dissociation rate constant k off [s⁻¹]. rbd- shows a fivefold higher association rate constant, k on = . × s⁻¹ m⁻¹, compared to rbd- and rbd- with k on = . × s⁻¹ m⁻¹ and . × s⁻¹ m⁻¹, respectively. the dissociation rate of all three peptides is comparable, with k off = . s⁻¹ (rbd- ), . s⁻¹ (rbd- ) and . s⁻¹ (rbd- ), respectively.
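the two routes to a dissociation constant described in this passage can be illustrated numerically. the following is a minimal sketch, not the authors' analysis code: the function names and all numerical values are hypothetical placeholders. it assumes a simple 1:1 interaction, for which the equilibrium response follows r_eq = r_max · c / (k_d + c) and the kinetic dissociation constant is k_d = k_off / k_on. the fit uses the hanes-woolf linearisation (c / r_eq plotted against c), which is a substitute chosen here for brevity; the fitting procedure actually used for the spr data is not stated in the text.

```python
def one_site_response(conc, r_max, k_d):
    """Equilibrium response of a one-site (1:1) binding model."""
    return r_max * conc / (k_d + conc)


def kd_from_rates(k_on, k_off):
    """Kinetic dissociation constant of a 1:1 interaction (k_off / k_on)."""
    return k_off / k_on


def fit_one_site(concs, responses):
    """Estimate (r_max, k_d) by linear regression of c/R against c.

    Hanes-Woolf form: c/R = c/r_max + k_d/r_max,
    so slope = 1/r_max and intercept = k_d/r_max.
    """
    ys = [c / r for c, r in zip(concs, responses)]
    n = len(concs)
    mean_x = sum(concs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in concs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(concs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r_max = 1.0 / slope
    return r_max, intercept * r_max


# Hypothetical, noise-free example data (concentrations in micromolar).
example_concs = [10.0, 20.0, 50.0, 100.0, 200.0]
example_resp = [one_site_response(c, r_max=100.0, k_d=50.0) for c in example_concs]
fitted_r_max, fitted_k_d = fit_one_site(example_concs, example_resp)

# Hypothetical rate constants: k_on in M^-1 s^-1, k_off in s^-1.
kinetic_k_d = kd_from_rates(k_on=1.0e3, k_off=0.05)  # result in mol/l
```

with real sensorgram data a nonlinear least-squares fit is usually preferable, because the linearisation distorts the error weighting; the linear form is used here only to keep the sketch free of external dependencies.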
the dissociation constants k d = k off /k on resulting from analysis of the kinetic data of the peptides are k d = μm (rbd- ), μm (rbd- ) and μm (rbd- ), in excellent agreement with the dissociation constants obtained from thermodynamic data analysis. the on-rates suggest that the bioactive conformation of peptide rbd- is more similar to its solution conformation than is the case for rbd- and rbd- . peptides rbd- , rbd- and rbd- contain % of the residues, namely y , y , y , n , y , t , t , g and y , that were identified by x-ray crystal structure analysis to be in contact with ace (li et al., a). several mutations in the viral protein were necessary to change the host organism from wild animals, like civet cats, to humans, i.e. s t and k n (holmes, ; li et al., b; qu et al., ). these amino acids are included in the peptides rbd- and rbd- .

figure caption (fragment): in all spectra water was suppressed by an excitation sculpting pulse sequence and spectra were acquired in d-tbs at k in a μl (hwang and shaka, ). (b) determination of binding affinity from std nmr titration data. the titration curve of the aromatic protons hd,d of tyr is shown. the k d value was determined from the std nmr titration by using the one-site binding model. the resulting k d value is ± μm. (c) std nmr epitope mapping of rbd- b. the spots indicate the range of relative std% for the protons that were saturated according to their proximity to the human receptor protein ace . strong binding is detected for the aromatic protons of the three tyrosines. the highest degree of saturation of . std% (absolute), obtained for he,e of tyr , was set to % relative std.

in a screening of a library hu could inhibit the binding of rbd to ace and the plaque formation of sars-cov in vero cells with an ec value of . μm . ho et al. showed biological activities of small peptides derived from s protein to inhibit the spike protein and ace interaction.
among others peptide sp- (residues - , fytttgi-gyqpy), which overlaps in sequence with the identified peptides rbd- and rbd- , shows inhibitory activity at a million fold excess relative to ace (ho et al., ) . however, the mer peptide p comprising residues - (palncywplndygfyttsgi) of zheng et al. overlapping with peptide rbd- shows no inhibition in a cytopathic effect (cpe)-based assay . in contrast, as mentioned above, the partially overlapping peptides a -l and sp- (ho et al., ) showed virus inhibition in their respective assays. to further characterize the binding epitope we synthesized a second peptide library of fourteen hexamer peptides comprising residues n -e (rbd- a to rbd- e) and a -s (rbd- a to rbd- d and rbd- a to rbd- e) with an overlap of three amino acids in each sequence (cf. table ). these peptides were used to further locate strongly interacting amino acids by spr and saturation transfer difference (std) nmr spectroscopy. spr screening was performed with ligand concentrations of up to lm in tbs and fmol of immobilized ace . rbd- b (y -l ; ykyryl) showed the highest spr response signal of ru at a concentration of lm. also the flanking peptides rbd- a (n -y ; nynyky) and rbd- c (r -g ; rylrhg) showed significant binding of ru (c = lm) and ru (c = lm), respectively. the spr response of peptides rbd- a (a -w ; alnsyw) and rbd- b (s -n ; sywpln) was comparable to that of rbd- a ( and ru, respectively) (cf. fig. ) . the other hexapeptides gave no significant spr signal. therefore, only peptide rbd- b (y -l ; ykyryl) was further investigated by spr and std nmr experiments. a concentration dependent spr affinity plot was performed with fmol of receptor protein immobilized. fig. shows the concentration dependent binding of rbd- b (ykyryl) by spr resulting in a dissociation constant k d = ± lm which is two-fold higher affinity than that of peptide rbd- . the binding kinetics are k on = .  s À m À and k off = . 
s À and also result in a dissociation constant of k d = lm. the two-fold increase of the binding affinity compared to rbd- results from a two-fold higher on-rate which is probably due to a better defined conformation in solution such that binding can occur more rapidly. the off-rate is the same as found for the dodecapeptide rbd- . std nmr spectroscopy is a well-established method to characterize ligand-protein interactions meyer, , ) . here it was used to determine dissociation constant and binding epitope of the interaction of rbd- b with a soluble construct of the receptor protein ace in deuterated tris-buffered d o. fig. shows the h std nmr spectrum of rbd- b (ykyryl) and receptor ace at a fold excess of the ligand over the protein. dependence of the std amplification factor on the concentration of rbd- b yields the k d value. a one site binding model fits the experimental data well and gives k d values in the low micromolar range: hd,d of the tyr results in k d = ± lm. the binding epitope was determined from std spectra at a fold excess of the ligand. it shows that all tyrosine residues have a close contact to ace . furthermore, the ha atom of arg also has a close proximity to the receptor surface. the ha atoms of tyr and leu interact also strongly with the cellular receptor. the positively charged groups of lysine and arginine side chains also show moderately strong interactions with the surface. the c-terminal leucine seems to be involved in a hydrophobic contact via its side chain. the highest level of saturation of . % std (absolute) is found for the he,e of tyr . the x-ray crystal structure analysis of the complex of rbd (residues - ) with ace shows only contacts of tyr and tyr of the rbd with ace (li et al., a) . however, in the isolated hexapeptide ykyryl, tyr is also binding to ace . according to mutation studies arg of sars-cov spike protein is important for binding affinity (he et al., ) . this fact is confirmed by std nmr analysis. 
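the std quantities used in this passage (absolute std%, the std amplification factor, and relative std% for epitope mapping) follow standard definitions, which can be sketched as below. this is an illustrative sketch only: the function names, proton labels and intensity values are hypothetical and are not taken from the measured spectra. it assumes i_off is the signal intensity in the off-resonance (reference) spectrum and i_on the intensity in the on-resonance (saturated) spectrum.

```python
def std_percent(i_off, i_on):
    """Absolute STD effect in percent: 100 * (I_off - I_on) / I_off."""
    return 100.0 * (i_off - i_on) / i_off


def std_amplification(i_off, i_on, ligand_excess):
    """STD amplification factor: fractional STD effect times ligand excess."""
    return (i_off - i_on) / i_off * ligand_excess


def relative_std(absolute_std):
    """Normalise absolute STD% values so that the largest one equals 100%."""
    top = max(absolute_std.values())
    return {proton: 100.0 * value / top for proton, value in absolute_std.items()}


# Hypothetical absolute STD% values for three protons of a ligand.
example_abs = {"proton_a": 5.0, "proton_b": 2.5, "proton_c": 1.0}
example_rel = relative_std(example_abs)
```

plotting the amplification factor of a single proton against ligand concentration and fitting a one-site model is the usual way a k d value is extracted from such a titration.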
the side chain protons hd,d of arg exhibit . % absolute std, which suggests an ionic interaction of the guanidinium group with the receptor molecule. the x-ray structure of the complex revealed no direct contact between arg of rbd and ace . thus in the free form the hexapeptide adopts a different binding mode and conformation compared to the case when integrated into the rbd. lys also makes a contact via its positively charged group evidenced by the std effect on the he,e protons that receive a saturation of . %. protons of leu methyl groups show a std effect of . % and . % std, respectively, indicating a contact of the methyl groups to the receptor. the biological activity of the lead structure rbd- b (y -l ; ykyryl) was assayed with respect to its ability to inhibit virus replication in cell culture (drosten et al., ) . veroe cells were infected with the sars-cov isolate frankfurt. as described previously, growth of the virus in vero cells is not associated with a cytopathic effect (vassilatis et al., ) . cells were infected with a virus titer of . moi. the inoculum was removed after h and replaced with fresh medium complemented with different concentrations of peptide rbd- b in a one-time dose. two days post infection virus rna concentration in the supernatant was measured by real-time pcr (cf. fig. ). there was no evidence for toxicity of the compound in the concentration range tested with mtt cell proliferation assay on subconfluent cells. after two days a onetime dose of the peptide rbd- b (ykyryl, . mm) reduced the virus rna level compared to the untreated control by a factor of . further, the inhibition of virus proliferation by the peptide is concentration dependent. after two days, relative to an untreated control virus rna is reduced by a factor of at a mm concentration of the peptide. the final amount of virus rna is in fact lower than the initially added amount. 
these data suggest that the hexapeptide indeed blocks the binding site necessary for the initial viral attachment to the human receptor ace and effectively inhibits viral entry into vero cells. thus, the virus is not able to replicate, and viral particles are exposed to degradation. these results are in good agreement with the occupation of the receptor's binding site expected from the peptide's dissociation constant of k d = μm. at a peptide concentration of mm the occupation of binding sites is . %, at . mm it is . % and at mm it is . %. the peptide inhibits infection by the coronavirus specifically. this is shown by the fact that rbd- b does not inhibit infection of vero cells with the alphavirus sindbis, which causes high fever in humans (tesh, ). furthermore, the peptide was also tested in an inhibition assay with another coronavirus, nl , in llc-mk and caco cells. nl coronavirus causes severe colds in humans and also uses ace as a functional receptor (van der hoek et al., ; wu et al., ). inhibitory effects were observed for both cell lines at a concentration of mm. for caco cells inhibition is also observed at a peptide concentration of mm. we have synthesized various hexapeptides closely related to peptide rbd- b (ykyryl), including an alanine scan library, and analyzed by spr the importance of individual amino acids for binding to ace . the alanine scan reveals that the tripeptide motif kyr is essential for the binding (cf. fig. ). the other binding curves show that the positively charged side chains of the amino acids lys and arg may be important for the receptor binding. no binding was observed for the peptides ydyryl and ykydyl, in which the negatively charged aspartate replaces the positively charged lys and arg . changing positively charged residues to uncharged residues (k s, r s) reduces the binding affinity but does not abolish it.
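the receptor occupancies quoted earlier in this passage follow from the one-site binding relation θ = c / (c + k_d). a minimal sketch is given below, assuming that the peptide is in large excess over the receptor so that its free concentration is approximately equal to its total concentration. the function name and the numerical values are hypothetical and do not reproduce the concentrations or the dissociation constant of the study.

```python
def fractional_occupancy(conc, k_d):
    """Fraction of receptor sites occupied at ligand concentration `conc`.

    One-site model: theta = c / (c + K_d). Both arguments must be given
    in the same concentration unit.
    """
    return conc / (conc + k_d)


# Hypothetical values in micromolar, for illustration only.
hypothetical_k_d = 50.0
for hypothetical_conc in (50.0, 500.0, 5000.0):
    theta = fractional_occupancy(hypothetical_conc, hypothetical_k_d)
    print(f"c = {hypothetical_conc:7.1f} uM -> occupancy = {100.0 * theta:5.1f} %")
```

the relation makes clear why concentrations far above k_d are needed for near-complete blocking: at c = k_d only half of the sites are occupied, and each further tenfold increase in concentration removes roughly nine tenths of the remaining free sites.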
these results indicate a clear role of lys and arg in the binding process in agreement with the std nmr data shown above. the binding mode of peptide rbd- b to the human receptor ace was also analyzed by docking of the peptide to the binding site followed by a molecular dynamics (md) simulation that was recorded for ps in equilibrium using the opls_aa/ force field as realized in the program desmond (schrödinger) (jorgensen amino acid sequence , ) . the conformation of peptide rbd- b in the crystal structure of the rbd of the viral spike protein in complex with the receptor ace was used as starting structure (li et al., a) . the protein peptide complex was placed in a water box with , water molecules and a sodium chloride concentration of . mol/l. the peptide shows high dynamics during the md but stays always in contact with the receptor surface. fig. shows the starting and the final conformation of the complex. the side chains of amino acids tyr and tyr interact with ace in the crystal structure. at the end of the md simulation lys , tyr and arg show interactions with the receptor. the binding mode of hexapeptide rbd- b to ace differs from the binding mode of the hexapeptide as part of the full length s protein. the positively charged residue of lys forms ionic interactions with the carboxyl group of glu of ace . the average distance from the lysine e-nitrogen atom and an oxygen atom of the carboxyl group over the course of the md simulation is . ± . Å. the guanidinium group (ng) of arg interacts with the carboxyl group of asp of the receptor with an average distance of . ± . Å. the crystal structure shows the first three carbohydrate units of a high mannose n-glycan on the receptor surface resolved. in the md simulation tyr shows contact to the glycan molecule by interaction of its aromatic residue with the second n-acetylglucosamine and the mannose residue. the average distance of the interaction of the aromatic ring with the n-acetylglucosamine is . ± . 
Å and with the mannose . ± . Å, respectively. during the md simulations the peptide shifted . Å at the c-terminus and . Å at the n-terminus from its initial position to its final pose (fig. ). the results of the md studies support the affinity data obtained from the alanine scan, which indicate that the tripeptide motif kyr is important for the binding to ace . sars is one of the major newly emerging viral diseases. in it caused a major outbreak with about deaths. it is a respiratory disease with about % mortality that is caused by a variant of the common coronaviruses (sars-cov). strict political measures helped to contain the epidemic within a short period of time. so far no specific treatment against sars is available. in search of molecules that can specifically block the attachment of the virus to the human cell, we analyzed binding modes of viral peptides to the human receptor. we identified and characterized the focal point of the viral protein that is used by the virus for its attachment to the human cell. we found a hexapeptide in the receptor-binding domain (rbd) of the s protein of sars-cov that carries a significant portion of the binding affinity of the virus to the human cell. the s protein mediates the attachment of the virus to its functional receptor ace . the attachment of the virus to ace does not interfere with the natural function of the receptor. therefore, blocking the attachment site of the virus in the upper respiratory tract appears feasible as a preventive measure against sars. we could demonstrate that the hexapeptide tyr-lys-tyr-arg-tyr-leu reduces viral infection of epithelial cells, as found in the upper respiratory tract, by a factor of . this peptide was shown to be specific against coronaviruses that attach to the ace receptor. its mode of action is specific, as it does not interfere with infections by viruses that utilize different receptors, like the alphavirus sindbis. combination of several biophysical methods, e.g.
spr, std nmr and molecular dynamics simulations, were used to characterize the specific binding mode of the inhibitory peptide. although there is currently no sars outbreak the need of an antiviral drug, e.g. based on the hexapeptide, is still present. a viral reservoir is present in wild animals like bats and civet cats and a new epidemic is likely someday. amino acids - of the severe acute respiratory syndrome coronavirus spike protein are required for interaction with receptor the secret life of ace as a receptor for the sars virus evaluation of advanced reverse transcription-pcr assays and an alternative pcr target region for detection of severe acute respiratory syndrome-associated coronavirus identification of a novel coronavirus in patients with severe acute respiratory syndrome the spike protein of sars-cov -a target for vaccine and therapeutic development a single amino acid substitution (r a) in the receptorbinding domain of sars coronavirus spike protein disrupts the antigenic structure and binding activity design and biological activities of novel inhibitory peptides for sars-cov spike protein and angiotensin-converting enzyme interaction sars-associated coronavirus structural biology: adaptation of sars coronavirus to humans screening and identification of linear b-cell epitopes and entry-blocking peptide of severe acute respiratory syndrome (sars)-associated coronavirus using synthetic overlapping peptide library water suppression that works -excitation sculpting using arbitrary wave-forms and pulsed-field gradients development and testing of the opls all-atom force field on conformational energetics and properties of organic liquids a novel coronavirus associated with severe acute respiratory syndrome structure of sars coronavirus spike receptor-binding domain complexed with receptor angiotensin-converting enzyme is a functional receptor for the sars coronavirus receptor and viral determinants of sars-coronavirus adaptation to human ace 
characterization of ligand binding by saturation transfer difference nmr spectroscopy group epitope mapping by saturation transfer difference nmr to identify segments of a ligand in direct contact with a protein receptor coronavirus as a possible cause of severe acute respiratory syndrome selective recognition of cyclic rgd peptides of nmr defined conformation by alpha iib beta , alpha v beta , and alpha beta integrins identification of two critical amino acid residues of the severe acute respiratory syndrome coronavirus spike protein for its variation in zoonotic tropism transition via a double substitution strategy inhibition of severe acute respiratory syndrome-associated coronavirus (sars-cov) infectivity by peptides analogous to the viral spike protein arthritides caused by mosquito-borne viruses identification of a new human coronavirus the g protein-coupled receptor repertoires of human and mouse ligplot: a program to generate schematic diagrams of protein-ligand interactions sars-lessons from a new disease a -amino acid fragment of the sars coronavirus s protein efficiently binds angiotensinconverting enzyme crystal structure of nl respiratory coronavirus receptor-binding domain complexed with its human receptor the sars-cov s glycoprotein: expression and functional characterization single amino acid substitutions in the severe acute respiratory syndrome coronavirus spike glycoprotein determine viral entry and immunogenicity of a major neutralizing domain suppression of sars-cov entry by peptides corresponding to heptad regions on spike glycoprotein synthetic peptides outside the spike protein heptad repeat regions as potent inhibitors of sars-associated coronavirus key: cord- -ycuiso g authors: li, wei; drelich, aleksandra; martinez, david r.; gralinski, lisa; chen, chuan; sun, zehua; schäfer, alexandra; leist, sarah r.; liu, xianglei; zhelev, doncho; zhang, liyong; peterson, eric c.; conard, alex; mellors, john w.; tseng, chien-te; baric, ralph s.; 
dimitrov, dimiter s. title: rapid selection of a human monoclonal antibody that potently neutralizes sars-cov- in two animal models date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: ycuiso g effective therapies are urgently needed for the sars-cov- /covid pandemic. we identified panels of fully human monoclonal antibodies (mabs) from eight large phage-displayed fab, scfv and vh libraries by panning against the receptor binding domain (rbd) of the sars-cov- spike (s) glycoprotein. one high affinity mab, igg ab , specifically neutralized replication competent sars-cov- with exceptional potency as measured by two different assays. there was no enhancement of pseudovirus infection in cells expressing fcγ receptors at any concentration. it competed with human angiotensin-converting enzyme (hace ) for binding to rbd suggesting a competitive mechanism of virus neutralization. igg ab potently neutralized mouse ace adapted sars-cov- in wild type balb/c mice and native virus in hace expressing transgenic mice. the ab sequence has relatively low number of somatic mutations indicating that ab -like antibodies could be quickly elicited during natural sars-cov- infection or by rbd-based vaccines. igg ab does not have developability liabilities, and thus has potential for therapy and prophylaxis of sars-cov- infections. the rapid identification (within days) of potent mabs shows the value of large antibody libraries for response to public health threats from emerging microbes. the severe acute respiratory distress coronavirus (sars-cov- ) ( ) has spread worldwide thus requiring safe and effective prevention and therapy. inactivated serum from convalescent patients inhibited sars-cov- replication and decreased symptom severity of newly infected patients ( , ) suggesting that monoclonal antibodies (mabs) could be even more effective. human mabs are typically highly target-specific and relatively non-toxic. 
by using phage display we have previously identified a number of potent fully human mabs (m , m , m . ) against emerging viruses including severe acute respiratory syndrome coronavirus (sars-cov) ( ), middle east respiratory syndrome coronavirus (mers-cov) ( ) and henipaviruses ( , ), respectively, which are also highly effective in animal models of infection ( ) ( ) ( ) ( ); one of them was administered on a compassionate basis to humans exposed to henipaviruses and successfully evaluated in a clinical trial ( ). the size and diversity of phage-displayed libraries are critical for rapid selection of high affinity antibodies without the need for additional affinity maturation. our exceptionally potent antibody against mers-cov, m , was directly selected from a very large (size ~ clones) library from individuals ( ). however, another potent antibody against henipaviruses, m . , was additionally affinity matured from its predecessor, which had been selected from a smaller library (size ~ clones) from individuals ( , ). thus, to generate high affinity and safe mabs we used eight very large (size ~ clones each) naive human antibody libraries in fab, scfv or vh format, constructed using pbmcs from a total of individuals obtained before the sars-cov- outbreak. four of the libraries were based on single human vh domains where cdrs (except cdr , which was mutagenized or grafted) from our other libraries were grafted as previously described ( ). another important factor to consider when selecting effective mabs is the appropriate antigen. similar to sars-cov, sars-cov- uses the spike glycoprotein (s) to enter host cells. the s receptor binding domain (rbd) binds to its receptor, the human angiotensin-converting enzyme (hace ), thus initiating a series of events leading to virus entry into cells ( , ). we have previously characterized the function of the sars-cov s glycoprotein and identified its rbd, which is stable in isolation ( ).
the rbd was then used as an antigen to pan phage-displayed antibody libraries; we identified potent antibodies ( , ) more rapidly, and the antibodies were more potent, than when we used whole s protein or s (unpublished). in addition, the sars-cov rbd-based immunogens are highly immunogenic and elicit neutralizing antibodies which protect against sars-cov infections ( ). thus, to identify sars-cov- mabs, we generated two variants of the sars-cov- rbd (aa - ) (fig. s ) and used them as antigens for panning of our eight libraries. panels of high-affinity binders to rbd in fab, scfv and vh domain formats were identified. there was no preferential use of any antibody vh gene (an example for a panel of binders selected from the scfv library is shown in fig. s a ) and the number of somatic mutations was relatively low (fig. s b , for the same panel of binders as in fig. s a ). for nine of the highest affinity mabs a provisional patent application was filed on march , by the university of pittsburgh. those high affinity mabs can be divided into two groups in terms of their competition with hace . representatives of the two groups are fab ab and vh ab . to further increase their binding through avidity effects and extend their half-life in vivo, they were converted to igg and vh-fc fusion formats, respectively. ab was characterized in more detail because of its potential for prophylaxis and therapy of sars-cov- infection. the fab and igg ab bound strongly to the rbd (fig. a ) and the whole sars-cov- s protein (fig. b), as measured by elisa. the fab ab equilibrium dissociation constant, kd, as measured by biolayer interferometry (blitz), was . nm (fig. c). the igg ab bound with high avidity (kd = pm) to recombinant rbd (fig. d). igg ab bound cell surface associated native s glycoprotein, suggesting that the conformation of its epitope on the rbd in isolation is close to that in the native s protein (fig. , s ).
the binding of igg ab was of higher avidity than that of hace -fc (fig. b) . binding of ab was specific for the sars-cov- rbd; it did not bind to the sars-cov s (fig. a ) nor to cells that do not express sars-cov- s glycoprotein ( fig. a ). ab competed with hace for binding to the rbd ( fig. b and c) indicating possible neutralization of the virus by preventing binding to its receptor. it did not compete with the cr (fig. d and e) , which also binds to sars-cov ( ) and with ab ( fig. f ). igg ab potently neutralized sars-cov- pseudovirus with an ic of ng/ml ( fig a) . it did not enhance pseudovirus infection of fcγria overexpressing t-hace cells at any concentration ( fig b) . it also did not mediate pseudovirus infection of fcγrii expressing k cells ( fig s b) . importantly, igg ab exhibited potent neutralizing activity against authentic sars-cov- in two independent assays -a microneutralization-based assay ( % neutralization at < ng/ml) (fig. c ) and a luciferase reporter gene assay (ic = ng/ml) ( fig. d ). in agreement with the specificity of binding to the sars-cov- s and not to the sars-cov s the igg ab did not neutralize live sars-cov (fig. c ). the igg m ( ) control which is a potent neutralizer of mers-cov, did not exhibit any neutralizing activity against sars-cov- (fig. c ). the vh ab and vh-fc ab bound the rbd with high affinity and avidity (fig. s a .b) but did not compete with hace ( fig. s c ) or neutralize sars-cov- ( fig. d) , indicating that not all antibodies targeting epitopes on the rbd affect virus replication. to evaluate the efficacy of igg ab in vivo we used two animal models. the first one is based on the recently developed mouse ace adapted sars-cov- which has two mutations q t/p y at the ace binding interface on rbd ( ). igg ab protected mice from high titer intranasal sars-cov- challenge ( pfu) of balb/c mice in a dose dependent manner ( fig a) . there was complete neutralization of infectious virus at the highest dose of . 
mg, and a statistically significant reduction by -fold at . mg; there was a trend for reduction at the . mg dose, but it did not reach statistical significance. the igg m which potently neutralizes the mers-cov in vivo was used as an isotype control because it did not have any activity in vitro. these results also suggest that the rbd double mutations q t/p y do not affect igg ab binding. the second model we used is transgenic mice expressing human ace (hace ) ( ). mice were administered μg of igg ab prior to wild type sars-cov- challenge followed by detection of infectious virus in lung tissue days later. replication competent virus was not detected in four of the five mice which were treated with igg ab (fig b). all six control mice and one of the treated mice had more than pfu per lung. these results show clear evidence of a potent preventive effect of igg ab in vivo. the reason for the absence of virus neutralization in one of the mice is unclear but may be due to individual variation in antibody transfer from the peritoneal cavity, where it was administered, to the upper and lower respiratory tract. our previous experiments with transgenic mice expressing human dpp and treated with two different doses of m ( . and mg per mouse) showed a similar lack of protection in one (out of four) mice at the lower dose, but at the higher dose all four mice were protected ( ), similarly to the results obtained with the mouse adapted sars-cov- . the in vivo protection also indicates that igg ab can achieve neutralizing concentrations in the respiratory tract. this is the first report of in vivo activity of a human monoclonal antibody against sars-cov- by using two different mouse models. the results also show some similarity between the two models in terms of evaluation of antibody efficacy. in both models about the same dose of antibody ( . - . mg) reduced the infectious virus in the lungs about -fold.
this result now suggests that testing of antibody efficacy could be performed at a larger scale than testing with the hace transgenic mice due to the availability of wild type mice. it also shows robust neutralizing activity of igg ab in two different models of infection. interestingly, fab ab had only several somatic mutations compared to the closest germline predecessor genes. this implies that ab -like antibodies could be elicited relatively quickly by using rbd-based immunogens especially in some individuals with naïve mature b cells expressing the germline predecessors of ab . this is in contrast to the highly mutated broadly neutralizing hiv- antibodies that require long maturation times, are difficult to elicit and their germline predecessors cannot bind native hiv- envelope glycoproteins ( , ). the rbd of the mers-cov s protein was previously shown to elicit neutralizing antibodies ( , ). for sars-cov- only a few somatic mutations would be sufficient to generate potent neutralizing antibodies against the sars-cov- rbd which is a major difference from the elicitation of broadly neutralizing antibodies against hiv- which requires complex maturation pathways ( , - ). the germline-like nature of the newly identified mab ab also suggests that it has excellent developability properties that could accelerate its development for prophylaxis and therapy of sars-cov- infection ( ). to further assess the developability (drugability) of ab its sequence was analyzed online (opig.stats.ox.ac.uk/webapps/sabdab-sabpred/tap.php); no obvious liabilities were found. in addition, we used dynamic light scattering (dls) and size exclusion chromatography to evaluate its propensity for aggregation. igg ab at a concentration of mg/ml did not aggregate for six days incubation at °c as measured by dls (fig. a) ; there were no high molecular weight species in freshly prepared igg ab also as measured by size exclusion chromatography (sec) (fig. b ). 
igg ab also did not bind to the human cell line t (fig. a) even at a very high concentration ( μm), which is about -fold higher than its kd, indicating absence of nonspecific binding to many membrane-associated human proteins. the igg ab also did not bind to , human membrane-associated proteins as measured by a membrane proteome array (fig. c). the high affinity/avidity and specificity of igg ab along with potent neutralization of virus and good developability properties suggest its potential use for prophylaxis and therapy of sars-cov- infection. because it strongly competes with hace , indicating a certain degree of mimicry, one can speculate that mutations in the rbd may also lead to inefficient entry into cells and infection. in the unlikely case of mutations that decrease the ab binding to rbd but do not affect binding to ace , it can be used in combination with other mabs including those we identified or in bi(multi)specific formats to prevent infection of such sars-cov- isolates. ab could also be used to select appropriate epitopes for vaccine immunogens and for diagnosis of cov-specific igg m antibody was expressed in human mammalian cells as described previously ( ). the ace gene was ordered from origene (rockville, md). the rbd domain (residues - ) and s domain (residues - ) and ace (residues - ) genes were cloned into a plasmid that carries a cmv promoter with an intron, human igg fc region and woodchuck posttranscriptional regulatory element (wpre) to generate the rbd-fc, s -fc and ace -fc expression plasmids. the rbd-avi-his protein with an avi tag followed by a ×his tag at the c-terminus was subcloned similarly. these proteins were expressed with the expi expression system (thermo fisher scientific) and purified with protein a resin (genscript) and by ni-nta resin (thermo fisher scientific). the fab cr antibody gene with a his tag was cloned into pcat plasmid (developed in house) for expression in hb bacteria and purified with ni-nta resin.
protein purity was estimated as > % by sds-page and protein concentration was measured spectrophotometrically (nanovue, ge healthcare).
blitz. antibody affinities and avidities were analyzed by the biolayer interferometry blitz
binding ec was obtained by using the non-linear mode in graphpad prism . igg ab showed higher binding avidity to t-s cells than hace -fc ( . nm vs. . nm for igg ab and hace -fc to achieve % binding, respectively).
followed by pbst washing. for detection, an hrp conjugated anti mouse fc antibody was used.
competition of ab with hace tested by blitz. nm hace -fc was monitored to bind ab saturated sensors (red line), which is compared to its independent binding signal to rbd sensor in the absence of ab (green line
were defined as the sample concentration at which a % reduction in rlu was observed relative to the average of the virus control wells.
a pneumonia outbreak associated with a new coronavirus of probable bat origin
convalescent plasma as a potential therapy for covid- .
the lancet infectious diseases
sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor
structure of severe acute respiratory syndrome coronavirus receptor-binding domain complexed with neutralizing antibody
exceptionally potent neutralization of middle east respiratory syndrome coronavirus by human monoclonal antibodies
potent neutralization of hendra and nipah viruses by human monoclonal antibodies
exceptionally potent cross-reactive neutralization of nipah and hendra viruses by a human monoclonal antibody
potent cross-reactive neutralization of sars coronavirus isolates by human monoclonal antibodies
passive transfer of a germline-like neutralizing human monoclonal antibody protects transgenic mice against lethal middle east respiratory syndrome coronavirus infection
a neutralizing human monoclonal antibody protects against lethal disease in a new ferret model of acute nipah virus infection
a neutralizing human monoclonal antibody protects african green monkeys from hendra virus challenge
safety, tolerability, pharmacokinetics, and immunogenicity of a human monoclonal antibody targeting the g glycoprotein of henipaviruses in healthy adults: a first-in-human, randomised, controlled, phase study
construction of a large naive human phage-displayed fab library through one-step cloning
construction of a large phage-displayed human antibody domain library with a scaffold based on a newly identified highly soluble, stable heavy chain variable domain
structural basis for the recognition of sars-cov- by full-length human ace
an emerging coronavirus causing pneumonia outbreak in wuhan, china: calling for developing therapeutic and prophylactic strategies
the sars-cov s glycoprotein: expression and functional characterization
biotinylated hace -fc ( nm) was incubated with rbd-fc in the presence of different concentrations of vh ab .
after washing, bound hace -fc was detected by using hrp conjugated streptavidin after washing, bound cr was detected by using hrp conjugated anti human fc antibody. ab showed weak competition with cr for binding to sars-cov- rbd. all the elisa experiments were performed in duplicate and the error bars denote ± sd key: cord- -mymndjvd authors: higuchi, yusuke; suzuki, tatsuya; arimori, takao; ikemura, nariko; kirita, yuhei; ohgitani, eriko; mazda, osam; motooka, daisuke; nakamura, shota; matsuura, yoshiharu; matoba, satoaki; okamoto, toru; takagi, junichi; hoshino, atsushi title: high affinity modified ace receptors prevent sars-cov- infection date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: mymndjvd the sars-cov- spike protein binds to the human angiotensin-converting enzyme (ace ) receptor via receptor binding domain (rbd) to enter into the cell. inhibiting this interaction is a main approach to block sars-cov- infection and it is required to have high affinity to rbd independently of viral mutation for effective protection. to this end, we engineered ace to enhance the affinity with directed evolution in human cells. three cycles of random mutation and cell sorting achieved more than -fold higher affinity to rbd than wild-type ace . the extracellular domain of modified ace fused to the fc region of the human immunoglobulin igg had stable structure and neutralized sars-cov- pseudotyped lentivirus and authentic virus with more than -fold lower concentration than wild-type. engineering ace decoy receptors with directed evolution is a promising approach to develop a sars-cov- neutralizing drug that has affinity comparable to monoclonal antibodies yet displaying resistance to escape mutations of virus. coronavirus disease has spread across the world as a tremendous pandemic and presented an unprecedented challenge to human society. 
the causative agent of covid- , sars-cov- is a single-stranded positive-strand rna virus that belongs to lineage b, clade of the betacoronavirus genus - . the virus binds to host cells through its trimeric spike glycoprotein composed of two subunits; s is responsible for receptor binding and s for membrane fusion . angiotensin-converting enzyme (ace ) is lineage b clade specific receptor including sars-cov- . the receptor binding domain (rbd) of s subunit directly binds ace with high affinity, therefore, it is the most important targeting site to inhibit viral infection. in fact, the rbd is the common binding site of effective neutralizing antibodies identified from convalescent patients [ ] [ ] [ ] . rna viruses such as sars-cov- have high mutation rates , which are correlated with high evolvability including the acquisition of anti-viral drug resistance. neutralizing antibodies are one of the promising approaches to combat covid- . accumulating evidence demonstrated that monoclonal antibodies isolated from convalescent covid- patients have high potency in neutralizing viruses. however, mutations in the spike gene can lead to the sars-cov- adaptation to such neutralizing antibodies. in the replicating sars-cov- pseudovirus culture experiment, escape mutation was observed against monoclonal antibody as early as in the first passage and evasion was seen even against the polyclonal convalescent plasma . notably, some mutations identified in in vitro replicating culture experiment are present in natural population according to the database similarly to the anti-rbd antibodies, extracellular domain of ace , soluble ace (sace ), can also be used to neutralize sars-cov- as a decoy receptor. the therapeutic potency was confirmed using human organoid , and now apeiron biologics conducts european phase ii clinical trial of recombinant sace against covid- . 
in addition, fusing sace to the fc region of the human igg has been shown to enhance neutralization capacity as well as to improve the pharmacokinetics to the level of igg in mice . most importantly, sace has a great advantage over antibodies due to the resistance to the escape mutation. the virus with escape mutation from sace should have limited binding affinity to cell surface native ace receptors, leading to a diminished or eliminated virulence. unfortunately, many reports, including our current study, have revealed that the binding affinity of wild-type sace to the sars-cov- spike rbd is much weaker (kd ~ nm) than that of clinical grade antibodies , , [ ] [ ] [ ] . thus, the therapeutic potential of the wild-type sace as a neutralizing agent against sars-cov- is uncertain. here we conducted protein engineering with human cell-based directed evolution to improve the binding affinity of ace to the spike rbd. random mutations were introduced in the protease domain containing the interface to the rbd, then full length ace mutant library was expressed in t cells and incubated with fluorescence-labelled rbd. high binding population was sorted and underwent dna extraction, the bulk of which was further induced with random mutations for the next cycle of selection. three cycles of screening resulted in an identification of mutant ace clones with more than -fold higher binding affinity to the rbd and lower half-maximal inhibitory concentration (ic ) for sars-cov- pseudotyped lentivirus as well as authentic virus. the present protein engineering system generates a virus-neutralizing drug that has high affinity comparable with antibodies and can resolve the issue of drug resistance caused by escape mutation. we engineered ace to bind the rbd of the sars-cov- spike protein with the combination of surface display of mutagenized library and fluorescence-activated cell sorting (facs) to perform the evolution in t human cells. 
the protease domain (pd) of ace is known to harbor the interface with the viral spike protein, located in the top-middle part of the ace ectodomain. in this study, ace residues - and - , referred to as pd and pd , respectively, were mutagenized independently. a synthetic signal sequence and an ha tag were appended, and restriction sites were introduced on both sides of pd and pd by codon optimization (fig. a). we used error-prone pcr to mutagenize the protease domain of ace with an average of about one amino acid mutation per bp, then inserted the fragment into the introduced restriction site by homologous recombination. the reaction sample was transformed into competent cells, generating a library of ~ mutants. the mutant plasmid library was packaged into lentivirus, followed by expression in human t cells at an moi (multiplicity of infection) of less than . to yield no more than one mutant ace per cell. cells were incubated with recombinant rbd of sars-cov- spike protein fused to superfolder gfp (sfgfp; fig. b). we confirmed the level of bound rbd-sfgfp and surface expression levels of ha-tagged ace with alexa fluor in a two-dimensional flow cytometry display. the top . % of cells showing higher binding relative to expression level were harvested from ~ x cells by facs. to exclude structurally unstable mutants, cells with preserved signal of surface ace were gated. genomic dna was extracted from collected cells and mutagenized again to proceed to the next cycle of screening (fig. c). random mutagenesis screening for pd was performed times, and mutated sequences from the top . % population were reconstructed into the backbone plasmid and expressed in t cells individually. one to three hundred clones were validated for the binding capacity to the rbd-sfgfp. as the selection cycle advanced, the two-dimensional distribution of library cells in flow cytometry became broader and higher in rbd-binding signal, and individual clone validation identified several mutants with higher binding capacity (fig.
a) . to evaluate the neutralization activity in the form of sace , we first generated fusion protein of the soluble extracellular domain of mutagenized ace residues - and sfgfp (sace -sfgfp) and used them to compete with the cell-surface wt ace for the rbd binding. to this end, concentration of each mutant sace -sfgfp in the cultured medium from transfected cells was quantitatively standardized with sfgfp signal, serially diluted, preincubated with rbd-sfgfp for min, and then transferred to wild-type ace expressing t cells. after min, the rbd-bound cells were analyzed in flowcytometry. higher neutralization activity against the rbd was confirmed for the mutants that have accumulated mutations (fig. b , table. s ). second mutagenesis based on the top hit of first screening, - mutant, was also performed, but the distribution of the library cells did not expand so much, and we could not isolate clones with significantly higher affinity than the bulk of top . % (fig. s ). we next performed pd mutagenesis in both the bulk of top . % and one of the highest mutants of the rd library, the clone n . again, the binding distribution of the pd library cells was similar to the basal cells, suggesting the inability of this strategy to further increase the affinity to rbd. a recent study reported, via deep mutational scanning, that several specific mutations in pd were enriched in high rbd-binding clones . when we added these mutations in n , it did not improve the capacity of the rbd neutralization further ( fig. s ). to identify essential mutation(s) in the high affinity ace mutants, each mutation was altered to wild-type in mutant n , j and j . mutant n contains mutations of a v, k e, k n, e k, n i, l f, and n h. among them, individual back-mutation of v , n , k , and f to wild-type residues resulted in modest to severe reduction of the rbd-neutralization capacity (fig. 
a) , while multiple back mutations of e , i , and h in various combinations did not alter the high activity of the original n (fig. b), indicating that a v, k n, e k, and l f were the necessary and sufficient components. in the case of mutant j that is composed of k m, e k, q r, s f, l f, and n d, a similar back mutation experiment revealed that k m, e k, and q r were essential (fig. c). simultaneous back mutations of two (f /f and f /d ) but not three (f /f /d ) nonessential residues were tolerated (fig. d), suggesting that l f and n d may exert their positive effect on the activity only when they coexist. the third mutant j has t i, a v, h a, t r, t q, and q h. single and multiple back mutation experiments showed that h a, t q, and q h were essential in securing the high inhibitory activity of j (figs. e, f). these top high affinity mutants exhibited higher rbd-neutralization activity when compared with the same sace scaffold carrying the two high affinity mutation sets reported recently , (fig. g). the first mutant library was also sorted into populations with high and low rbd-sfgfp binding signal (fig. s a). the affinity value of each mutant was defined as the ratio of high to low read counts. then, the impact of each amino acid mutation on rbd binding was analyzed as a semi-deep mutational scanning (dms) (fig. s b). this analysis revealed that some mutations in the top high affinity mutants, such as k n, e k, and q r, had only a very mild impact on their own. a simple combination of high value mutations, a v, k t, q l, l v and t k, referred to here as the dms mutant, and its derivatives showed less neutralization activity than the top three high affinity mutants, indicating that the mutations in the high affinity mutants work in a coordinated manner (fig. h). next, we characterized the binding affinities of the mutant sace s for spike rbd using surface plasmon resonance, where igg -fc fused rbd was immobilized as a ligand and the association and dissociation kinetics of his-tagged sace were determined.
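The semi-deep mutational scanning score described above (read counts from the high and low rbd-binding gates, normalized by total counts and compared as a log ratio, then aggregated per amino acid substitution) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' pipeline: the log base (2), the pseudocount, the use of a mean for aggregation, and the placeholder substitution labels are all assumptions, since the source does not specify them.

```python
import math
from collections import defaultdict


def affinity_values(high_counts, low_counts, pseudocount=0.5):
    """Log2 ratio of normalized high-gate to low-gate read frequencies.

    Keys are mutants, represented as tuples of substitution labels.
    The pseudocount (an assumption) avoids log(0) for mutants seen in one gate only.
    """
    total_high = sum(high_counts.values())
    total_low = sum(low_counts.values())
    scores = {}
    for mutant in set(high_counts) | set(low_counts):
        f_high = (high_counts.get(mutant, 0) + pseudocount) / total_high
        f_low = (low_counts.get(mutant, 0) + pseudocount) / total_low
        scores[mutant] = math.log2(f_high / f_low)
    return scores


def per_substitution_scores(scores):
    """Mean affinity value over all mutants that carry each substitution."""
    grouped = defaultdict(list)
    for mutant, value in scores.items():
        for substitution in mutant:
            grouped[substitution].append(value)
    return {sub: sum(vals) / len(vals) for sub, vals in grouped.items()}


# Hypothetical read counts with placeholder substitution labels.
high = {("subA",): 90, ("subA", "subB"): 10}
low = {("subA",): 10, ("subA", "subB"): 90}
scores = affinity_values(high, low, pseudocount=0.0)
by_substitution = per_substitution_scores(scores)
```

Because each substitution is scored by averaging over every mutant that carries it, a substitution that helps only in combination with others can score near zero on its own, which is consistent with the observation that some mutations in the top mutants had only a mild individual impact.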
the kd value of wild type sace was . nm, whereas those of mutants j and n were determined to be . nm and . nm (fig. a). analytical size exclusion chromatography (sec) showed no signs of protein aggregation in the ace mutant samples, confirming that the apparent high affinity was not caused by the avidity effect and the observed kd values represent genuine : affinity toward rbd (fig. s a). recombinant soluble ace (rsace ) was reported to have a fast clearance rate in human blood with a half-life of hours , . recently, it was demonstrated that rsace fused with an fc fragment shows high stability in mice as well as higher neutralization activity toward both pseudotyped and authentic sars-cov- in cultured cells . we formulated our high affinity mutant sace s as fc fusions (sace -fc) and found that the purified proteins were folded well and devoid of aggregation, showing solution behavior indistinguishable from that of the wild type protein (fig. s b). to evaluate their efficacy in neutralizing sars-cov- infections, affinity-enhanced sace -fc mutants were assayed for viral neutralization against pseudotyped lentivirus and authentic sars-cov- . the ic values of wild-type, j , and n for pseudovirus neutralization in ace -expressing t cells were . , . , and . μg/ml, respectively. in the same way, n neutralized pseudovirus very efficiently with an ic value more than times lower than the wild-type in tmprss -expressing veroe cells (fig. b). most importantly, when the neutralization potential against the authentic sars-cov- in tmprss expressing veroe cells was evaluated, wild-type sace -fc showed no efficacy even at μg/ml, whereas n sace -fc demonstrated a significant neutralizing effect at . μg/ml (fig. c). sars-cov- neutralization is one of the preventative or therapeutic approaches against covid- . monoclonal antibodies have become one of the common drug modalities, especially as therapeutics against autoimmune diseases and cancer.
among virus-neutralizing antibodies, palivizumab is clinically used to prevent hospitalization from respiratory syncytial virus infection in high-risk infants , and a cocktail of monoclonal antibodies has been shown to reduce mortality from ebola virus disease . engineered recombinant decoy receptor drugs have also been developed to neutralize various cytokines including vascular endothelial growth factor, tumor necrosis factor alpha, and ctla- and are approved for orbital vascular diseases and rheumatoid arthritis. recombinant sace or sace -fc fusion protein can neutralize sars-cov- , ; however, its modest binding affinity requires a higher dose than a monoclonal antibody. we developed a screening system based on cycles of random mutation and sorting of the high affinity population in t cells, followed by validation of neutralizing activity in a soluble form. in this screening, additional random mutations were induced in the bulk of sorted mutants, which worked better than mutagenesis of the top mutant. engineering of decoy receptors with improved affinity was previously reported for cancer-related molecules and ace , , . those studies used a yeast display system to perform directed evolution. a large-scale library (~ mutants) was prepared and high affinity mutants were identified by repeated sorting from the initial library. the fast growth rate of yeast is suitable for library screening involving repeated sorting and propagation. we, on the other hand, employed human cells for the display purpose. since post-translational modification can modulate protein binding affinity, human cell-based screening is better suited to understanding the impact of ace variants on viral affinity and to advancing biologics development. repeating mutagenesis after cell sorting without propagation enabled us to conduct screening with a relatively small library (~ mutants) and human cells.
during the validation, we noticed that a high affinity pattern in the flow cytometry assay of full length ace binding rbd-sfgfp did not always correlate with its neutralization activity. thus, it is evident that experimental validation of each mutation at the level of sace protein was important for efficient identification of high affinity mutants. our mutant ace s have affinity comparable to typical anti-spike monoclonal antibodies, but they also offer some advantages over antibodies when considered as a drug candidate. the interface of ace with the rbd is larger than that of antibodies, which potentially increases efficacy. an escape mutation against modified ace is likely to result in lower affinity to the native receptor, making such a virus much less virulent. sars-cov- also enters host cells via endocytosis. sars-cov- infection is mediated not only by tmprss family proteases but also by cathepsin l that is catalytically active at ph . - . . some antibodies are susceptible to impaired affinity at lower ph, leading to lower viral neutralization. high affinity modified ace fused with fc is a promising strategy to neutralize sars-cov- . the time frame for running one cycle of mutagenesis and sorting was just one week in our system, and we succeeded in developing optimized mutants in a couple of months without depending on patient-derived cells or tissues. thus, our system can rapidly generate therapeutic candidates against various viral diseases and may be well suited for fighting against future viral pandemics.
lenti-x t cells were purchased from clontech and cultured at °c with % co in dulbecco's modified eagle's medium (dmem, wako) containing % fetal bovine serum (gibco) and penicillin/streptomycin ( u/ml, invitrogen). veroe /tmprss cells were a gift from the national institutes of biomedical innovation, health and nutrition (japan) and cultured at °c with % co in dmem (wako) containing % fetal bovine serum (gibco) and penicillin/streptomycin ( u/ml, invitrogen).
all the cell lines were routinely tested negative for mycoplasma contamination. for a semi-deep mutational scanning of ace residues - , % of ha positive cells with the highest and % with the lowest gfp fluorescence were also collected in st mutated library (fig. s a) and their genomic dna was extracted by nucleospin tissue (takara). ace residues - plasmid sequence was amplified with primers containing adaptor and barcode sequence to perform deep sequencing on the illumina miseq platform using nt paired-end protocol. data were analyzed as follows; high and low gating read count of each mutant was normalized with total counts and log ratio of high/low was defined as affinity value. then, each amino acid mutation-containing mutant affinity values were aggregated. . the pcdna to ha-ace plasmid was transfected into t cells ( ng dna per ml of culture kinetic binding measurement using biacore (spr) the binding kinetics of sace (wild-type or mutants) to rbd were analyzed by spr using a biacore pseudotyped reporter virus assays were conducted as previously described . a plasmid coding sars-cov- spike was obtained from addgene # , and deletion mutant cΔ (with amino acids deleted from the c terminus) was cloned into pcdna to (invitrogen) to enhance virus titer . spike-pseudovirus with a luciferase reporter gene was prepared by transfecting plasmids (cΔ , pspax , and plenti firefly) into lentix- t cells with lipofectamine (invitrogen). after hours, supernatants were harvested, filtered with a . μm low protein-binding filter (sfca), and frozen at - °c. ace -expressing t cells were seeded at , cells per well in -well plate. pseudovirus and three-fold dilution series of sace -fc protein were incubated for hour, then this mixture was administered to ace -expressing t cells. after hour pre-incubation, medium was vero-tmprss were seeded on well plates ( , cells/well) and incubated for overnight. 
the culture supernatants serially diluted by medium were inoculated and incubated for hours. culture medium was removed, fresh medium containing % methylcellulose ( . ml) was added, and the culture was further incubated for days. the cells were fixed with % paraformaldehyde phosphate buffer solution (nacalai tesque) and plaques were visualized by using a crystal violet. table. s amino acid sequence and rbd neutralization activity value of validated mutants. the value of rbd neutralization activity was calculated as -log concentration of % rbd-sfgfp bound competing relative to n . a pneumonia outbreak associated with a new coronavirus of probable bat origin a new coronavirus associated with human respiratory disease in china functional assessment of cell entry and receptor usage for sars-cov- and other lineage b betacoronaviruses cryo-em structure of the -ncov spike in the prefusion conformation a human neutralizing antibody targets the receptor-binding site of sars-cov- human neutralizing antibodies elicited by sars-cov- infection potent neutralizing antibodies against sars-cov- identified by high-throughput single-cell sequencing of convalescent patients' b cells why are rna virus mutation rates so damn high? antibody cocktail to sars-cov- spike protein prevents rapid mutational escape seen with individual antibodies escape from neutralizing antibodies by sars-cov- spike protein variants engineered ace receptor traps potently neutralize sars-cov- . 
biorxiv neutralization of sars-cov- spike pseudotyped virus by recombinant ace -ig engineering human ace to optimize binding to the spike protein of sars coronavirus structural basis of receptor recognition by sars-cov- structure of the sars-cov- spike receptor-binding domain bound to the ace receptor targeting the degradation of angiotensin ii with recombinant angiotensinconverting enzyme : prevention of angiotensin ii-dependent hypertension pharmacokinetics and pharmacodynamics of recombinant human angiotensinconverting enzyme in healthy human subjects humanized respiratory syncytial virus monoclonal antibody, reduces hospitalization from respiratory syncytial virus infection in high-risk infants. the impact-rsv study group controlled trial of ebola virus disease therapeutics inhibition of sars-cov- infections in engineered human tissues using clinical-grade soluble human ace an engineered axl 'decoy receptor' effectively silences the gas -axl signaling axis antitumor activity of an engineered decoy receptor targeting clcf -cntfr signaling in lung adenocarcinoma genome-wide crispr screen reveals host genes that regulate sars-cov- infection human secretory signal peptide description by hidden markov model and generation of a strong artificial signal peptide for secreted protein expression protocol and reagents for pseudotyping lentiviral particles with sars-cov- spike protein for neutralization assays retroviral vectors pseudotyped with severe acute respiratory syndrome coronavirus s protein we would like to thank sho hashimoto, toshiyuki nishiji, tomohiro hino, and keiko tamura- key: cord- -i e fge authors: huang, kuan-ying a.; tan, tiong kit; chen, ting-hua; huang, chung-guei; harvey, ruth; hussain, saira; chen, cheng-pin; harding, adam; gilbert-jaramillo, javier; liu, xu; knight, michael; schimanski, lisa; shih, shin-ru; lin, yi-chun; cheng, chien-yu; cheng, shu-hsing; huang, yhu-chering; lin, tzou-yien; jan, jia-tsrong; ma, che; james, william; daniels, rodney 
s.; mccauley, john w.; rijal, pramila; townsend, alain r. title: breadth and function of antibody response to acute sars-cov- infection in humans date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: i e fge serological and plasmablast responses and plasmablast-derived igg monoclonal antibodies (mabs) have been analysed in three covid- patients with different clinical severities. potent humoral responses were detected within weeks of onset of illness in all patients and the serological titre was elicited soon after or concomitantly with peripheral plasmablast response. an average of . % and . % of plasmablast-derived mabs were reactive with virus spike glycoprotein or nucleocapsid, respectively. a subset of anti-spike ( of ) and over half of anti-nucleocapsid ( of ) antibodies cross-reacted with other betacoronaviruses tested and harboured extensive somatic mutations, indicative of an expansion of memory b cells upon sars-cov- infection. fourteen of anti-spike mabs, including five anti-rbd, three anti-non-rbd s and six anti-s , neutralised wild-type sars-cov- in independent assays. anti-rbd mabs were further grouped into four cross-inhibiting clusters, of which six antibodies from three separate clusters blocked the binding of rbd to ace and five were neutralising. all ace -blocking anti-rbd antibodies were isolated from two patients with prolonged fever, which is compatible with substantial ace -blocking response in their sera. at last, the identification of non-competing pairs of neutralising antibodies would offer potential templates for the development of prophylactic and therapeutic agents against sars-cov- . in late , a novel coronavirus emerged and was identified as the cause of a cluster of respiratory infection cases in wuhan, china. it spread quickly around the world. 
in march of a pandemic was declared by the world health organization; the virus was formally named severe acute respiratory syndrome coronavirus (sars-cov- ) and the resulting disease was named covid- . as of october , there (table ), suggesting the presence of conserved epitopes on the spike glycoproteins of betacoronaviruses. each of anti-spike glycoprotein mabs was encoded by a unique set of heavy chain vdj and light chain vj rearrangements in the variable domain (supplemental table ). oc virus and three of these also cross-reacted with mers (table ). all five cross-reactive anti-s antibodies had high rates of somatic mutation ( ± ), indicating a memory phenotype, and three of the five were neutralising to a moderate level (half maximal effective concentration, ec , - . nm, table ). the cdr length varied among anti-spike glycoprotein antibodies (supplemental table ). no significant differences were found between anti-s and anti-s or anti-rbd subsets. among anti-s mabs, a significantly longer heavy chain cdr length was found in the cross-reactive group compared to the specific group (cross-reactive ± versus specific ± , p= . , two-tailed mann-whitney test; figure c), indicating that a long cdr may play a role in antigen binding, as has also been found for several broadly reactive human mabs against human immunodeficiency virus and influenza virus ( , ). the binding activities of anti-rbd mabs were further characterised in detail. using flow cytometry and mdck-siat cells transduced to express the rbd, the binding activities of the anti-rbd mabs were shown to vary, with % binding concentration from . to . µg/ml (supplemental figure ). the mabs with strong anti-rbd binding had a relatively long heavy chain cdr length ( % binding concentration < . µg/ml versus > . µg/ml, p= . 
, two-tailed mann-whitney test; supplemental figure ). the anti-spike glycoprotein mabs were systematically examined by plaque reduction neutralisation (prnt) assay for neutralisation of wild-type sars-cov- virus (see methods; summarised in table ). a total of neutralising antibodies distributed between different regions of the spike glycoprotein were identified: of to rbd, of to s (non-rbd), of to s . the ec concentrations, as a measure of potency, ranged from . to ~ nm ( ng/ml to ~ µg/ml). (see methods): inhibition of virus replication was measured by quantitative pcr in the supernatant bathing the infected cells. these results corroborated that anti-rbd fd a, anti-rbd fi a, anti-rbd fd d, anti-rbd ey a and anti-s ew c, as crude culture supernatants, reduced the virus signal from ~ - to ~ , -fold (supplemental figure ). potent neutralising antibodies to the rbd of sars-cov- spike glycoprotein were identified, and we thus analysed the blockade of the ace -rbd interaction by anti-rbd antibodies in two assays (figure , table ). the structure of vhh -fc bound to rbd is known ( ) and its footprint on the rbd does not overlap that of ace , so inhibition is thought to occur by steric hindrance. in the second assay, we employed mdck-siat cells overexpressing full-length human ace as a transmembrane protein. unlabelled antibodies or ace -fc were mixed in excess with biotinylated rbd, and binding of rbd was detected with streptavidin-hrp in elisa (figure b). the results of this assay mostly mirrored those of the first assay and confirmed that in this orientation anti-rbd neutralising antibodies fd a and fd d, when in excess, competed with soluble rbd for binding to ace (figure b). in addition, anti-rbd neutralising antibody ey a competed with rbd for ace binding. the binding pattern of ey a is analogous to that of a previously described antibody, cr (table ) ( ). 
these two antibodies are known to bind to the same region of rbd away from the ace binding site, but they influence the binding kinetics of rbd to ace , presumably through steric effects ( ). the ten anti-rbd mabs were then divided into cross-inhibiting groups as described for human mabs to ebola ( ) by assessing competition of unlabelled antibodies at -fold (or greater) excess over a biotin labelled target antibody by elisa. included as controls were the vhh -fc ( ) and h -h -fc ( ) the ten antibodies formed four cross-inhibiting clusters (table ) , represented by antibodies ey a (cluster , which included cr ), fi a (cluster , which included h -h ), fd a (cluster , which included s ) and fj b (cluster ). the strongest inhibitors of ace -fc binding were in clusters and (tables and ) . neutralising antibodies were detected in clusters , and , with the strongest antibodies fi a and fd a being in clusters and (tables and ) . table ) and did not cross-react strongly with other betacoronaviruses (table ) . fd a exhibits the most potent neutralising activity in the prnt assay and also completely inhibits sars-cov- -induced cytopathic effect (see methods) at . nm. thirteen mabs were defined that bound the s region and three, close to germline in sequence, were neutralising. fj c showed strong neutralisation (ec . nm), whilst fd e (ec nm) and fd e (ec nm) were moderately neutralising (table ) the mabs were evolved from clonal groups defined by their heavy chain vdj and light chain vj rearrangements (supplemental table the presence of pre-existing immune memory to betacoronavirus that cross-react with sars-cov- is supported by the accumulation of somatic mutations in the genes encoding cross-reactive antibodies isolated from covid- patients (figures c and d, supplemental tables and ). 
this situation is reminiscent of re-exposure to immunogenic epitopes shared by closely related viruses leading to induction of broadly cross-reactive antibodies in patients infected with influenza, dengue or zika viruses ( - ). the mabs that bound to the spike glycoprotein were systematically tested for neutralisation (summarised in table ). results established that neutralising epitopes were present on the rbd, s -ntd, s -non ntd/rbd, and s regions of the spike cd neg cd pos cd neg cd hi cd hi igg pos plasmablasts were gated and isolated in chamber as single cells as previously described ( ) . sorted single cells were used to produce human igg mabs as previously described ( ) confluent monolayers of vero e cells in -well plates were incubated with ~ plaque forming units (pfu) of sars cov- (hcov- /england/ / , epi_isl_ ) and antibodies in a -fold dilution series (triplicates) for hours at room temperature. inoculum was then removed, and cells were overlaid with plaque assay overlay. cells were incubated at °c, % co for hours prior to fixation with % paraformaldehyde at °c for minutes. fixed cells were then permeabilised with . % triton-x- and stained with a horseradish peroxidase conjugated-antibody against virus protein for hour at room temperature. tmb substrate was then added to visualise virus plaques as described previously for influenza virus ( ). convalescent serum from covid- patients was used as a control. in brief, this rapid, high-throughput assay determines the concentration of antibody that produces a % reduction in infectious focus-forming units of authentic sars- eagle's medium containing % fbs), two-fold serially diluted mabs in vgm starting at µg/ml were added to each duplicated well. the plates were immediately transferred to a bsl- laboratory and tcid sars-cov- (hcov- /taiwan/ / , epi_isl_ ) in vgm was added. 
the plates were further incubated at °c with % co for three days and the cytopathic morphology of the cells was recorded using an imagexpress nano automated cellular imaging system. competitive binding assays were performed as described previously ( ) two assays were used to determine the blocking of binding of ace to rbd by mabs. rbd was anchored on the plate in the first assay whereas ace was anchored for the second assay. the second ace blocking assay was performed as described previously ( , ) . b non-ntd s pos . . . . . . . ew b b non-ntd s -ve . . . . . . -ve fd d b ntd pos . . . . . . -ve fd c b non-ntd s pos . . . . . . -ve fd d b non-ntd s -ve . . . . . . -ve fd b b non-ntd s -ve . . . . . . -ve fd c b ntd pos . . . . . . -ve fg c a non-ntd s pos . . . . . . -ve fn c c non-ntd s -ve . . . . . . -ve fd e b non-ntd s pos . . . . . . -ve ew b b non-ntd s -ve . . . . . . -ve deployment of convalescent plasma for the prevention and treatment of covid- effect of convalescent plasma therapy on time to clinical improvement in patients with severe and life-threatening use of convalescent plasma therapy in sars patients in hong kong structure, function, and antigenicity of the sars-cov- spike breadth of concomitant immune responses prior to patient recovery: a case report of non-severe covid- neutralizing antibodies in patients with severe acute respiratory syndrome-associated coronavirus infection antibody responses to sars-cov- in patients with covid- serology characteristics of sars-cov- infection since exposure and post symptom onset cross-neutralization of influenza a viruses mediated by a single antibody loop structural insights on the role of antibodies in hiv- vaccine and therapy human monoclonal antibody combination against sars . huo, j. et al. 
neutralization of sars-cov- by destruction of the prefusion spike a highly conserved cryptic epitope in the receptor binding domains of neutralizing nanobodies bind sars-cov- spike rbd and block interaction with ace structural basis for the neutralization of sars-cov- by an antibody from a convalescent patient rugged nanoscaffold to enhance plug-and-display vaccination structural basis for potent neutralization of betacoronaviruses by single-domain camelid antibodies therapeutic monoclonal antibodies for ebola virus infection derived from vaccinated humans cross-neutralization of sars-cov- by a human monoclonal sars cov antibody protective humoral immunity in sars-cov- infected pediatric patients antibody responses to sars-cov- in patients of novel coronavirus disease serologic cross-reactivity of sars-cov- with endemic and seasonal an outbreak of human coronavirus oc infection and serological cross-reactivity with sars coronavirus recovery in tracheal organ cultures of novel viruses from patients with respiratory disease epidemiology of seasonal coronaviruses: establishing the context for the emergence of coronavirus disease human coronavirus oc associated with fatal development of a nucleocapsid-based human coronavirus immunoassay and estimates of individuals exposed to coronavirus in a u.s. 
metropolitan population the dominance of human coronavirus oc and nl infections in infants the human immune response to dengue virus is dominated by highly cross-reactive antibodies endowed with neutralizing and enhancing activity zika virus activates de novo and cross-reactive memory b cell responses in dengue-experienced donors broadly cross-reactive antibodies dominate the human b cell response against pandemic h n influenza virus infection receptor-binding domain of severe convergent antibody responses to sars-cov- in convalescent individuals potent neutralizing antibodies against sars-cov- identified by single-cell sequencing of convalescent patients' b cells potent neutralizing antibodies against multiple epitopes on sars-cov- spike a noncompeting pair of human neutralizing antibodies block covid- virus binding to its receptor ace potent neutralizing antibodies from covid- patients define multiple targets of vulnerability studies in humanized mice and convalescent humans yield a sars cov- antibody cocktail isolation of potent sars-cov- neutralizing antibodies and protection from disease in a small animal model human neutralizing antibodies elicited by sars-cov- infection a human monoclonal antibody blocking sars-cov- infection human monoclonal antibodies block the binding of sars-cov- spike protein to angiotensin converting enzyme receptor receptor-binding domain as a target for developing sars vaccines the sars-cov- receptor-binding domain elicits a potent neutralizing response without antibody-dependent enhancement a vaccine targeting the rbd of the s protein of sars-cov- induces protective immunity structural basis of receptor recognition by sars-cov- structure of the sars-cov- spike receptor-binding domain bound to the ace receptor structural and functional basis of sars-cov- entry by using sars-cov- neutralizing antibody structures inform therapeutic strategies complete mapping of mutations to the sars-cov- spike receptor-binding domain that escape 
antibody recognition antibody cocktail to sars-cov- spike protein prevents rapid mutational escape seen with individual antibodies a neutralizing human antibody binds to the n-terminal domain of the structure-function analysis of neutralizing antibodies to h n influenza from naturally infected humans optimisation of a micro-neutralisation assay and its application in antigenic characterisation of influenza viruses. influenza other respir viruses isolation and rapid sharing of the novel coronavirus from the first patient diagnosed with covid- in australia the data are presented as specificity, number of antibodies, and the percentage of total antibodies isolated from each patient. (b) the binding activity of anti-sars-cov- mabs with spike glycoprotein, rbd and the s subunit in elisa. anti-influenza h mab bs- a and anti-sars rbd cr were included as controls. each experiment was repeated twice. the od values are presented as mean ± standard error of the mean. panels (c) and (d) show numbers of variable domain mutations in mab genes and variation antibodies that strongly cross-react with at least one betacoronavirus (sars or mers or oc ) were defined as cross-reactive mabs. cdr length and mutation numbers are presented as mean ± standard error of the mean (anti-s , specific reactive, n= ; anti-n, specific, n= versus cross-reactive, n= ). the two-tailed test was performed to compare the mutations between two groups d, =day ; ns, non-significant hinge and fc region of human igg and ace -fc were included as controls the rbd was colored in green. the epitopes recognized by ey a, cr and vhh (cluster mab) ( , , ) were colored in magenta. the epitopes recognized by ace and h -h (cluster mab) ( ) were overlapping and colored in blue and light blue. the epitopes recognized by s convalescent sera were analysed in the ace -blocking (ace anchored) assay anti-rbd antibody fd a and anti- influenza h antibody bs a were included as controls. 
data are presented as mean ± standard error of the mean ace -blocking activity of anti-rbd antibody compared to ace -fc (see methods): +, partial; ++ abbreviations: ifa, immunofluorescence; rbd, receptor-binding domain; prnt, plaque reduction neutralisation assay key: cord- -qdmunb l authors: zhao, yongkun; wang, chong; qiu, boning; li, chufang; wang, hualei; jin, hongli; gai, weiwei; zheng, xuexing; wang, tiecheng; sun, weiyang; yan, feihu; gao, yuwei; wang, qian; yan, jinghua; chen, ling; perlman, stanley; zhong, nanshan; zhao, jincun; yang, songtao; xia, xianzhu title: passive immunotherapy for middle east respiratory syndrome coronavirus infection with equine immunoglobulin or immunoglobulin fragments in a mouse model date: - - journal: antiviral res doi: . /j.antiviral. . . sha: doc_id: cord_uid: qdmunb l middle east respiratory syndrome (mers) is a highly lethal pulmonary infection caused by a coronavirus (cov), mers-cov. with the continuing spread of mers-cov, prophylactic and therapeutic treatments are urgently needed. in this study, we prepared purified equine f(ab’)( ) from horses immunized with mers-cov virus-like particles (vlps) expressing mers-cov s, m and e proteins. both igg and f(ab’)( ) efficiently neutralized mers-cov replication in tissue culture. passive transfer of equine immune antibodies significantly reduced virus titers and accelerated virus clearance from the lungs of mers-cov infected mice. our data show that horses immunized with mers-cov vlps can serve as a primary source of protective f(ab’)( ) for potential use in the prophylactic or therapeutic treatment of exposed or infected patients. middle east respiratory syndrome (mers)-cov is an emerging pathogen that causes severe pneumonia in humans in the arabian peninsula and in travelers from this region (assiri et al., a; zaki et al., b; zumla et al., ) . human-to-human spread has been documented (assiri et al., b) . 
while infections of immunocompetent patients generally present with only mild symptoms, the elderly and patients with pre-existing illnesses such as diabetes or renal failure are likely to develop more severe disease (assiri et al., a) . as of september , , cases with deaths ( . % mortality) had been reported to the world health organization, although the actual number of infections could be much larger since mild, asymptomatic or undiagnosed cases are likely to be common (drosten et al., ) . as yet there are neither licensed vaccines nor any prophylactic or therapeutic treatments effective against mers-cov. given the ability of coronaviruses to rapidly adapt to new hosts, a major public health concern is that mers-cov will further adapt to replication in humans, triggering a global severe acute respiratory syndrome (sars)-like pandemic (peiris et al., ; zaki et al., a) . as of now, the most promising treatment is the passive administration of anti-mers-cov neutralizing antibodies. several research groups have developed and produced anti-mers patientderived or humanized monoclonal neutralizing antibodies in vitro that were able to protect mers-cov infected mice (corti et al., ; li et al., ; zhao et al., ) . however, since these antibodies react with a single epitope on the mers-cov spike (s) protein and since coronaviruses are prone to mutate, this approach has raised concerns about possible antibody escape (corti et al., ; sabir et al., ) . recently, we showed that sera from middle east dromedary camels contained high levels of anti-mers-cov neutralizing antibodies. passive immunotherapy with sera from these animals significantly reduced virus loads and accelerated virus clearance from the lungs of mers-cov infected mice . this provides proof of concept that immune animal sera are potentially useful in the treatment of patients with mers (hayden et al., ) . 
passive immunotherapy with animal sera or antibodies has been successfully used to prevent rabies and to neutralize snake venom (both et al., ; gutierrez et al., ). convalescent plasma used to treat patients with sars has been found safe and has demonstrated some efficacy in a study with a small number of patients (mair-jenkins et al., ). however, neutralizing antibody titers in mers patients are generally low and the limited number of mers survivors makes this approach impractical (drosten et al., ). here, we show that immunization of healthy horses with mers-cov virus-like particles (vlps) expressing mers-cov s, m and e proteins induces strong polyclonal neutralizing antibodies against mers-cov. since administration of whole antibodies can induce allergic responses in some humans, we further tested f(ab') fragments prepared by digestion of antibody with pepsin. prophylactic or therapeutic treatment of mers-cov infected mice with either igg or f(ab') significantly decreased the virus load in their lungs. mers-cov vlps were produced and purified as previously described . in brief, armyworm sf cells were infected with a single recombinant baculovirus co-expressing the mers-cov structural protein genes s, m, and e, at a multiplicity of infection (moi) of . . culture supernatants were harvested at h post-infection and centrifuged at g for min to remove cell debris. following centrifugation of the clarified supernatants at , g for h at c, the resulting vlp pellets were resuspended in pbs and loaded onto a – – % discontinuous sucrose gradient. after an additional centrifugation at , g for . h at c, bands between and % sucrose containing mers-cov vlps were collected. four -year-old healthy horses received multi-point intramuscular injections of . , . , , , and mg mers-cov vlps in ml pbs at weeks , , , , and , respectively. freund's complete adjuvant (sigma) was included in the first dose, and incomplete adjuvant in the remaining ones. 
sera were collected from the jugular vein weeks after each injection, and stored at À c before further analysis. mers-cov specific antibodies in the sera were measured by an indirect enzyme-linked immunosorbent assay (elisa) using purified mers-cov receptor-binding domain (rbd) protein (i.e., s protein residues e cloned into the pet- a expression vector and purified by ni-nta affinity chromatograph column). briefly, -well microtitration plates (corning costar, usa) were pre-coated with ml purified rbd antigen diluted in . mol/l carbonate sodium buffer (ph . ) to a final concentration of mg/ ml and incubated at c overnight. after blocking with skimmed milk for h at c, ml twofold serially diluted serum samples were added to the wells, and incubated at c for h. the plates were washed three times with pbs containing . % tween- (pbst), before addition of ml hrp-labeled rabbit antibody against horse igg (bioss, china; : , ) and incubation at c for h. after washing with pbst, ml , , , '-tetramethylbenzidine (tmb) (sigma, usa) as substrate was added to each well and incubated for min. the reaction was stopped with ml m h so . optical densities at nm were measured in an elisa plate reader (bio-rad, usa). horse antiserum was diluted with vol of normal saline ( . % nacl) and a half volume of saturated ammonium sulfate was then added and mixed gently at room temperature for min before centrifugation at g for min. the resulting sediment was redissolved in saline and mixed with a one-third volume of saturated ammonium sulfate. after incubation at ambient temperature for min and centrifugation at g for min, the second sediments were dissolved in normal saline and dialyzed against normal saline to remove any remaining ammonium salt. immunoaffinity resins were prepared by coupling mg rbd protein to . m sodium periodate-activated sepharose b ( g), and then incubating with ml sodium borohydride for min. after reaction with m tris (ph . 
) for min, a purified igg sample was diluted -fold with pbs and incubated with the rbd resin overnight at c with constant rotation. the flowthroughs (anti-rbd depleted) were collected and tested against the rbd protein by elisa to confirm that all rbd-specific igg had bound to the rbd sepharose b. after washing with pbs, the bound antibodies (anti-rbd) were eluted in . m glycine-hcl buffer (ph . ). the eluates were neutralized with m tris buffer (ph . ), and then dialyzed against pbs. all samples were adjusted to the same protein concentration and sterilized by passage through microspin filters ( . µm pore size; millipore). the neutralizing activities of the igg, rbd-specific igg, and flowthroughs were tested. the ph of the horse antiserum was adjusted to . with mol/l hcl. following incubation with pepsin ( iu/ml) at c for . h, the reaction was stopped by adjusting the ph to . with mol/l naoh. the solution was then applied to protein-a and protein-g columns sequentially to remove whole immunoglobulins. the purity of the resulting f(ab') protein was assessed by sodium dodecyl sulfate polyacrylamide gel electrophoresis (sds-page) followed by coomassie blue staining, and the target fraction in the gel was analyzed in a thin layer chromatography scanner (transmission, zigzag scan, dual wavelength, swing width: mm, delta y: . mm) (cs- , shimadzu). specific pathogen-free week-old balb/c mice were purchased from charles river laboratories international and maintained in the animal care facility, university of iowa. briefly, all mice were housed in thoren individually ventilated cages. caging and bedding were autoclaved. irradiated diet was fed. filtered water ( . µm filter) was provided with an edstrom automatic watering system. hepa-filtered cage changing stations were used. all persons entering animal rooms wore autoclaved gowns, gloves, hair bonnets, face masks, and shoe covers. 
serum samples, purified igg or f(ab') were serially diluted in dmem and mixed with an equal volume of mers-cov containing pfu. following incubation at c for h, aliquots were added to cultures of vero cells in well plates and incubated at c in % co for h with gentle rocking every min. plates were then overlaid with . % agarose/dmem/ % calf serum. after further incubation for days, agarose plugs were removed using a small spatula, and the remaining plaques were visualized by staining with . % crystal violet. six-week-old female balb/c mice were lightly anesthetized with isoflurane and transduced intranasally with .  pfu of ad -hdpp in ml dmem as described elsewhere (zhao et al., ) . five days post transduction, mice were infected intranasally with mers-cov (  pfu) in a total volume of ml dmem. mice were monitored daily for morbidity (weight loss) and mortality. all work with mers-cov was conducted in the university of iowa biosafety level (bsl- ) laboratory. separate groups were injected with ml horse antiserum or mg igg or f(ab') intraperitoneally (ip) day before or after intranasal infection with  pfu mers-cov. control mice were given an equal volume of normal horse serum (sigma). to obtain virus titers, lungs were harvested from subgroups of animals at the indicated time points (see results) and homogenized into ml of phosphate buffered saline (pbs), using a manual homogenizer. lung homogenates were aliquoted into micro tubes and kept in À c. virus was titered on vero cells. cells were fixed with % formaldehyde and stained with crystal violet three days post-infection (p.i.). viral titers are expressed as pfu/g tissue for mers-cov (zhao et al., ) . due to the biosafety risk, mers-cov must be handled in a bsl- laboratory, whereas vlps can be rapidly generated under bsl- conditions as an immunogen inducing high antibody titers. 
in addition, horses pose little risk to humans and produce high antibody yields, making these animals an effective source for production of hyperimmune sera (zheng et al., ). rbd-specific igg titers in the sera were all above : , after five immunizations (fig. ) as assessed by elisa. rbd contains the major neutralizing epitopes of the s protein, as shown by the observation that absorption of sars patient convalescent sera with sars-cov rbd removes the majority of neutralizing antibodies (he et al., ). independent research groups have also shown more directly that the mers-cov rbd sequence contains the major antigenic determinants for inducing neutralizing antibodies, and that neutralizing epitopes within mers-cov s are also localized primarily in the rbd region (du et al., ; mou et al., ). here, we have demonstrated that anti-rbd antibodies are major components of the neutralizing antibodies. we found that rbd-specific igg neutralized mers-cov infection with a half-maximal inhibitory concentration of . mg/ml, compared with .  mg/ml for the flowthroughs (fig. ), suggesting that the rbd of the s protein acts as an important neutralization determinant of mers-cov. our results demonstrate that equine antibodies are polyclonal and recognize more antigenic determinants in the mers-cov s protein than single mabs, which could potentially prevent antibody escape. the integrity of igg and f(ab') fragments was evaluated using an sds-page gel (fig. a). the purity of the f(ab') fragments after protein-a/g chromatography was > % as assessed by gel electrophoresis (fig. b). passive transfer of blood products from other humans poses a safety concern, with possible contamination with agents of blood-borne diseases (e.g., hiv, hepatitis). heterologous antibody carries a potential risk of allergic reaction, but generation of f(ab') fragments yields antibodies that are less immunoreactive and safer for use in humans. 
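The half-maximal inhibitory concentrations reported above come from dilution series tested in a plaque reduction assay. As a minimal sketch of that kind of calculation, the Python below converts plaque counts to percent neutralization and interpolates the 50% point on a log-concentration scale. The function names and the example plaque counts are hypothetical, and log-linear interpolation is a simpler stand-in for the nonlinear curve fit that a package such as GraphPad Prism would perform; it is not the authors' analysis.

```python
import math


def percent_neutralization(plaques, control_plaques):
    """Percent reduction in plaque count relative to virus-only control wells."""
    return 100.0 * (1.0 - plaques / control_plaques)


def half_maximal_concentration(concentrations, neutralization):
    """Estimate the concentration that gives 50% neutralization.

    Interpolates linearly in log10(concentration) between the two adjacent
    points that bracket 50%. Concentrations must be positive. Returns None
    when the series never crosses 50%.
    """
    pairs = sorted(zip(concentrations, neutralization))
    for (c_lo, n_lo), (c_hi, n_hi) in zip(pairs, pairs[1:]):
        crosses_50 = (n_lo - 50.0) * (n_hi - 50.0) <= 0
        if crosses_50 and n_lo != n_hi:
            frac = (50.0 - n_lo) / (n_hi - n_lo)
            log_c = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_c
    return None


# Hypothetical two-fold dilution series (arbitrary concentration units)
# with made-up plaque counts; the virus-only control has 80 plaques.
example_concentrations = [1.0, 2.0, 4.0, 8.0, 16.0]
example_plaques = [72, 60, 40, 20, 8]
example_neutralization = [percent_neutralization(p, 80) for p in example_plaques]
print(half_maximal_concentration(example_concentrations, example_neutralization))
```

Interpolating on a log scale matches the way serial dilutions are spaced. A four-parameter logistic fit would use all points rather than only the two that bracket 50%, so the two approaches can give slightly different estimates.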
fig. . robust mers-cov rbd-specific antibody in immunized horse sera. horses (n = ) were injected intramuscularly with mers-cov vlps and boosted every two weeks an additional times. sera were collected weeks after each immunization. rbd-specific antibodies in immunized horse sera were detected using elisa.
fig. . neutralizing activity of the rbd-specific antibodies in igg. the in vitro neutralizing activities of total igg, rbd-specific igg, and flowthroughs were determined in a series of -fold dilutions, and % neutralization was calculated using graphpad prism.
while we successfully generated equine antibodies against mers-cov vlps, their protective effect against authentic mers-cov infection remained untested. using a plaque reduction neutralizing assay, we confirmed that immune sera significantly neutralized mers-cov infection in vitro, with a half-maximal effective dilution of : , (fig. a, b). further, we found that equine igg and f(ab') also neutralized mers-cov infection, with half-maximal effective concentrations (ec ) of . mg/ml and . mg/ml for igg and f(ab') , respectively (fig. c, d). collectively, these results show that equine antibody products exhibit highly potent neutralizing activity against mers-cov. next we asked if adoptive transfer of equine antibodies could protect mice from mers-cov infection prophylactically and therapeutically. using a mouse model we previously generated (zhao et al., ), we injected animals with immune serum (fig. a, b), purified igg (fig. c, d) or f(ab') (fig. e, f) i.p. day before (fig. a, c, e) or after (fig. b, d, f) mers-cov challenge. in both prophylactic and therapeutic settings, passive transfer of equine immune antibodies resulted in a – log reduction of virus titers in the lungs of mers-cov infected mice, and accelerated virus clearance in the serum treated group (fig. a, b). we did not observe any difference in body weight loss or in pathologic changes on the exterior surface of the lungs between treated and untreated mice after infection, since in this model mice develop only mild lung disease. rapid virus replication and inflammatory cell infiltration in the infected lungs are the major parameters to measure (zhao et al., ). since the half-life of f(ab') in vivo is relatively short and mers-cov is cleared within days in this model (zhao et al., ), we did not inject f(ab') antibodies before day − or after day p.i. of note, the purified igg seemed to have lower protective potency than the immune serum in vivo (fig. ). the concentration of igg in serum is > mg/ml. we used ml of immune serum (equal to mg igg) per mouse, which is much more than the purified immune igg we used ( mg/mouse). another possible reason is that we purified the immune igg by saturated ammonium sulfate precipitation, which had to be performed at room temperature. we speculate that some igg was degraded or misfolded under these conditions and was unable to bind the mers-cov spike protein, whereas the immune sera were properly stored at − c and contained high concentrations of bsa and other proteins, which made the antiserum more stable. to date, several anti-mers-cov antibodies have been developed from different origins. each has its own advantages and disadvantages. among monoclonal antibodies, mouse-derived antibodies need to be humanized before human use (li et al., ); a human neutralizing antibody derived from a convalescent mers patient can be produced in large amounts in cho cells (corti et al., ). however, a single-clone antibody raises the concern of viral escape mutants when applied to humans. administration of transchromosomic bovine human immunoglobulins (luke et al., ) or dromedary immune serum resulted in rapid viral clearance in infected mouse lungs. 
the disadvantage of these antibodies is that these animals are not readily available. compared to the antibodies described above, the administration of equine igg-derived f(ab') fragment proved to be a versatile and feasible method (lu et al., ; zhou et al., ) . it provides a useful platform to produce therapeutics against emerging infectious diseases. in summary, by immunizing healthy horses with mers-cov vlps, we have successfully developed the first equine igg-derived f(ab') fragment that neutralizes mers-cov in vitro and in vivo. both prophylactic and therapeutic treatments decreased virus loads and accelerated virus clearance in the lungs of mers-cov-infected mice. therefore, horses immunized with mers-cov vlps can serve as a useful initial source for developing protective f(ab') fragments, for the purpose of preparedness and to serve as a strategic reserve for a potential mers epidemic and other emergent pathogens. the authors declare no competing interests. epidemiological, demographic, and clinical characteristics of cases of middle east respiratory syndrome coronavirus disease from saudi arabia: a descriptive study hospital outbreak of middle east respiratory syndrome coronavirus prophylactic and postexposure efficacy of a potent human monoclonal antibody against mers coronavirus transmission of mers-coronavirus in household contacts clinical features and virological analysis of a case of middle east respiratory syndrome coronavirus infection identification of a receptor-binding domain in the s protein of the novel human coronavirus middle east respiratory syndrome coronavirus as an essential target for vaccine development immunological profile of antivenoms: preclinical analysis of the efficacy of a polyspecific antivenom through antivenomics and neutralization assays towards improving clinical management of middle east respiratory syndrome coronavirus infection identification of a critical neutralization determinant of severe acute respiratory syndrome 
(sars)-associated coronavirus: importance for designing sars vaccines a humanized neutralizing antibody against mers-cov targeting the receptor-binding domain of the spike protein passive immunotherapy for influenza a h n virus infection with equine hyperimmune globulin f(ab') in mice human polyclonal immunoglobulin g from transchromosomic bovines inhibits mers-cov in vivo the effectiveness of convalescent plasma and hyperimmune immunoglobulin for the treatment of severe acute respiratory infections of viral etiology: a systematic review and exploratory meta-analysis the receptor binding domain of the new middle east respiratory syndrome coronavirus maps to a -residue region in the spike protein that efficiently elicits neutralizing antibodies severe acute respiratory syndrome co-circulation of three camel coronavirus species and recombination of mers-covs in saudi arabia mers-cov virus-like particles produced in insect cells induce specific humoural and cellular immunity in rhesus macaques isolation of a novel coronavirus from a man with pneumonia in saudi arabia rapid generation of a mouse model for middle east respiratory syndrome passive immunotherapy with dromedary immune serum in an experimental animal model for middle east respiratory syndrome coronavirus infection treatment with hyperimmune equine immunoglobulin or immunoglobulin fragments completely protects rodents from ebola virus infection inhibition of infection caused by severe acute respiratory syndrome-associated coronavirus by equine neutralizing antibody in aged mice middle east respiratory syndrome ad -hdpp transduced balb/c mice ( wks, female) were injected intraperitoneally with ml horse serum key: cord- -rovyvv authors: wagner, teresa r.; kaiser, philipp d.; gramlich, marius; becker, matthias; traenkle, bjoern; junker, daniel; haering, julia; dulovic, alex; schweizer, helen; nueske, stefan; scholz, armin; zeck, anne;
schenke-layland, katja; nelde, annika; strengert, monika; walz, juliane s.; ruetalo, natalia; schindler, michael; schneiderhan-marra, nicole; rothbauer, ulrich title: neutrobodyplex - nanobodies to monitor a sars-cov- neutralizing immune response date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: rovyvv as the covid- pandemic escalates, the need for effective vaccination programs, diagnosis tools and therapeutic intervention ever increases. neutralizing binding molecules have become important tools for acute treatment of covid- and also provide a unique possibility to monitor the emergence and presence of a neutralizing immune response in infected or vaccinated individuals. here we identified unique nanobodies (nbs) with high binding affinities to the sars-cov- spike receptor domain (rbd). of these, effectively block the rbd:ace interface. via competitive binding analysis and detailed epitope mapping, we grouped all nbs into sets and demonstrated their neutralizing effect. combinations from different sets showed a profound synergistic effect by simultaneously targeting different epitopes within the rbd. finally, we established a competitive multiplex binding assay (“neutrobodyplex”) enabling the detection of neutralizing antibodies in serum of infected patients. overall, our nbs have high potential for prophylactic and therapeutic options and provide a novel approach to screen for a neutralizing immune response in infected or vaccinated individuals, helping to monitor immune status or guide vaccine design. phycoerythrin (pe)-labeled streptavidin after stringent washing. additionally, a non-specific nb (gfp-nb, negative control) and two inhibiting mouse antibodies (positive controls) were analyzed . data obtained by this multiplex binding assay showed that of the analyzed nbs inhibit ace binding to isolated rbd, s domain and homotrimeric spike. ic values calculated for inhibition of ace :rbd interaction ranges between . 
nm for nm and nm for nm (figure ). notably, ic values obtained for the most potent inhibitory nbs nm ( . nm), nm ( . nm) and nm ( . nm) are highly comparable to ic values measured for the mouse iggs (mm : . nm; mm : . nm). additionally, the assay revealed that all nbs except nm show a similarly strong inhibitory effect on ace binding to all tested antigens. nm seems to exclusively inhibit the rbd:ace interaction and does not prevent binding of ace to either the homotrimeric spike or the s domain. after identifying rbd-specific nbs which have an inhibitory effect on ace binding, we investigated the relative location of their epitopes within the rbd. first, we performed epitope binning experiments of nb combinations using biolayer interferometry. after coating sensors with biotinylated rbd, a nb was loaded until binding saturation was reached, followed by a short dissociation step to remove excess nb. a second nb from a different family was then exposed to the rbd-nb complex. using this approach, we identified nbs which recognize overlapping and non-overlapping epitopes on rbd (figure , supplementary figure ). as expected, nbs with only minor differences in their cdr (nm , nm and nm , nb-set ) appear to recognize an identical or highly similar epitope, as they cannot bind simultaneously to rbd. our analysis also revealed that nbs with highly diverse cdr s such as nm , nm , nm and nm could not bind simultaneously, suggesting that these nbs recognize similar or at least overlapping epitopes. as a result, we clustered these diverse nbs in nb-set . overall, we identified five distinct nb-sets, each comprising at least one candidate targeting a different epitope within the rbd compared to any member of a different nb-set (figure ). next, we performed hydrogen-deuterium exchange mass spectrometry (hdx-ms) with the most potent inhibitory nbs selected from the different nb-sets.
this allowed us to more precisely locate their binding sites at the surface of rbd and compare with the rbd:ace interface. both members of nb-set , nm and nm , interacted with the rbd at the back/ lower right site (back view, figure ) . notably, the binding site of nm does not encompass amino acid residues involved in the rbd:ace interface. in contrast, nm (nb-set ) as well as nm (nb-set ) contacted the rbd at amino acid residues overlapping with the rbd:ace binding interface, whereas nm additionally covers parts of the spike- like loop region on one edge of the ace interface at the top front/ lower left side (front view, which did not contact any amino acid residues involved in the rbd:ace interface but rather binds to the opposite site (front view, figure ). comparing the data from epitope binning with the hdx-ms results, provides structural insights into the mechanism by which non- competing pairs of nbs can simultaneously bind the rbd. interestingly, the combination of nm (nb-set ) with nm (nb-set ) shows near complete coverage of the ace interface (figure ) whereas the observed inhibitory effect of nm might be due to steric hindrance. from these findings, we proposed that the combination of nb-set with nb-set might act synergistically on the inhibition of the interaction between rbd and ace . after identification of nbs which inhibit the rbd:ace interaction biochemically, we employed a cell-based viral infection assay to test for their neutralization potency. to this end, human caco- cells were co-incubated with the icsars-cov- -mng strain and serial dilutions of the inhibitory nbs nm , nm , nm and nm . h post-infection neutralization potency was determined via automated fluorescence-microscopy of fixed and nuclear-stained cells (supplementary figure ) . percentage of the infection rate following nb treatment normalized to a non-treated control was plotted and ic values were determined via sigmoidal inhibition curve fits. 
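the normalization step described above (infection rate after nb treatment expressed relative to a non-treated control) can be sketched in a few lines. this is only an illustration under stated assumptions: the function names are hypothetical, the input values are made up, and the ic estimate uses simple log-linear interpolation between the two bracketing dilutions as a stand-in for the sigmoidal inhibition curve fit used in the study.

```python
import math


def percent_infection(treated_rates, untreated_rate):
    """Express each infection rate as a percentage of the non-treated control."""
    if untreated_rate <= 0:
        raise ValueError("untreated control rate must be positive")
    return [100.0 * rate / untreated_rate for rate in treated_rates]


def interpolated_ic50(concentrations, percents):
    """Estimate the 50% crossing by linear interpolation on log10(concentration).

    Simplified stand-in for a sigmoidal fit (assumption, not the study's method).
    Expects concentrations in ascending order, with infection falling as the
    concentration rises. Returns None if the curve never crosses 50%.
    """
    points = list(zip(concentrations, percents))
    for (c_lo, p_lo), (c_hi, p_hi) in zip(points, points[1:]):
        if p_lo >= 50.0 >= p_hi and p_lo != p_hi:
            frac = (p_lo - 50.0) / (p_lo - p_hi)
            log_ic50 = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_ic50
    return None


# Hypothetical example values (not data from the study).
doses = [1.0, 10.0, 100.0, 1000.0]
normalized = percent_infection([0.4, 0.3, 0.1, 0.05], untreated_rate=0.4)
estimate = interpolated_ic50(doses, normalized)
```

interpolation is a quick sanity check on a dilution series; a full sigmoidal fit uses all points and is less sensitive to noise in the two bracketing measurements.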
overall, data obtained from the multiplex binding assay and the viral infection assay were broadly consistent. representatives of nb-set , nm and nm , showed the highest neutralization potency with ic values of ~ nm and ~ nm, followed by nm (~ nm) and nm (~ nm). as expected, nm (nb-set ) was not found to considering that nbs targeting diverse epitopes within the rbd:ace interface are beneficial in both reducing viral infectivity and preventing mutational escape, we next combined the most potent inhibitory and neutralizing candidates derived from nb-set (nm , nm ) and nb-set (nm ) and examined their response in both the multiplex binding assay and the viral infection assay. in the multiplex binding assay, the combination of nm and nm showed an increased effect in competing with ace binding to rbd, illustrated by an ic of . nm, which is -or -fold lower compared to treatment with individual nm or nm , respectively (figure a). notably, the ic measured for the combination of nm and nm did not exceed the ic identified for nm alone, indicating that nm on its own has a very high inhibiting effect (figure a). when we tested both combinations in the viral infection assay, we observed significantly improved effects in both, as illustrated by an ic of ~ nm for the combination nm and nm and ~ . nm for nm and nm (figure b, supplementary figure ). from these findings we conclude that a combinatorial treatment with two nbs targeting different epitopes within the rbd:ace interaction site is beneficial for viral neutralization. context, multiple studies have convincingly shown that neutralizing antibodies preferably bind to the rbd and sterically inhibit viral entry via ace . from this, we can assume that our rbd nbs covering large parts of the rbd:ace interface might be suitable to monitor the emergence and presence of neutralizing antibodies in patients.
to test this hypothesis, we set up a high-throughput competitive binding assay, termed neutrobodyplex, by combining our most potent neutralizing nb combinations with a recently developed, automatable multiplex immunoassay (figure a) . we incubated our previously generated color-coded beads comprising rbd, s domain or homotrimeric spike with serum samples from patients or non- infected individuals, in addition to dilution series of the combinations nm / nm or nm / nm and used this to detect patient-derived iggs bound to the respective antigens. depending on the nb concentration, neutralizing antibodies targeting the rbd:ace interaction site within the serum samples are displaced resulting in a reduction of the detectable signal (figure a) . when analyzing rbd specific iggs from serum samples, we detected a distinct signal reduction in the presence of increasing nb concentrations for all tested samples (figure to further demonstrate that our approach is able to determine the presence of iggs targeting the rbd:ace interaction site in detailed resolution, we highlight here the effect of competing iggs could be observed when measuring binding to rbd, however using the s domain as target antigen distinct differences between both serum samples became visible. while # comprise a substantial fraction of iggs addressing the rbd:ace interface also presented by the s domain, in sample # iggs binding to additional epitopes of the s domain cover the detectable signal reduction derived from displaced iggs (figure for functional analysis we employed a recently developed in vitro multiplex binding assay to monitor the replacement of ace as the natural ligand from binding to rbd, s domain or homotrimeric spike upon addition of rbd-specific nbs. with this assay, we were able to identify inhibiting nbs targeting those spike-derived antigens. 
interestingly, ic values obtained for inhibitory nbs on rbd and homotrimeric spike show a higher correlation compared to ic values obtained for the s domain. based upon detailed epitope mapping, we grouped our nbs in different nb-sets. of those nb-sets, comprise inhibitory nbs which were shown to target different epitopes within the rbd:ace interaction site. we confirmed the neutralizing potency of those nbs in a cell-based viral infection assay using fully intact sars-cov- . through this, we noted that the measurable viral neutralization effect of the individual nbs strongly correlates to the data obtained from the biochemical screen, which demonstrates that the multiplex binding assay as presented is highly relevant and suitable to identify virus neutralizing binders. as a result, we modified our previously described multiplex immunoassay (multicov-ab, ) and developed a novel diagnostic test called neutrobodyplex to monitor the presence and the emergence of neutralizing antibodies in serum samples of sars-cov- infected individuals. using combinations of high affinity nbs covering the rbd:ace interface, we were able to directly and specifically displace iggs present in serum samples from these particular rbd epitopes. according to previous studies, human iggs addressing those epitopes were classified as neutralizing antibodies , , . in our neutrobodyplex, we further demonstrated that such neutralizing antibodies can be detected best using the rbd. larger expression constructs for bacterial expression of nbs, sequences were cloned into the phen vector , thereby adding a c-terminal xhis-tag for imac purification as described previously , . the pcaggs plasmids encoding the stabilized homotrimeric spike protein and the receptor binding domain (rbd) of sars-cov- were kindly provided by f. krammer . 
the cdna encoding the s domain (aa - ) of the sars-cov- spike protein was obtained by pcr amplification using the forward primer s _cov -for ´-ctt ctg gcg tgt gac cgg - ´ and reverse primer s _cov -rev ´ -gtt gcg gcc gct tag tgg tgg tgg with high-confidence identification (q-value ≤ . ) were included to the list. peptides with overlapping mass, retention time and charge in nb and antigen digest, were manually removed. the deuterated samples were recorded in ms mode only and the generated peptide list was imported into hdexaminer v . . (sierra analytics, modesto, ca, usa). deuterium uptake was calculated using the increase of the centroid mass of the deuterated peptides. hdx could be followed for % of the rbd amino acid sequence. the calculated percentage deuterium uptake of each peptide between rbd-nb and rbd-only were compared. any peptide with uptake reduction of % or greater upon nb binding was considered as protected. cell culture caco- (human colorectal adenocarcinoma) cells were cultured at °c with % co in dmem containing % fcs, mm l-glutamine, μg/ml penicillin-streptomycin and % neaa. results from bead-based multiplex ace competition assay are shown for the three sars- cov- spike-derived antigens, rbd, s and homotrimeric spike. ace bound to the respective antigen was detected. for each nb, a dilution series from . µm to . nm is shown in the presence of ng/ml ace . mfi signals were normalized to the maximal signal per antigen as given by the ace -only control. ic values were calculated from a four-parametric sigmoidal model and are displayed for each nb and antigen. data is presented as mean +/- sd of three technical replicates (n = ). 
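the figure legend above mentions two computational steps: normalizing mfi signals to the ace -only control and deriving ic values from a four-parametric sigmoidal model. a minimal sketch of that model follows, assuming the common four-parameter logistic form; the source does not state which software or parameterization was used, so the function names, parameter names and example values here are all hypothetical.

```python
def normalize_to_control(mfi_values, control_mfi):
    """Scale MFI signals so that the receptor-only control equals 1.0."""
    if control_mfi <= 0:
        raise ValueError("control MFI must be positive")
    return [value / control_mfi for value in mfi_values]


def four_pl(conc, bottom, top, ic50, hill):
    """Four-parameter logistic curve for an inhibition assay.

    The signal falls from `top` (no inhibitor) towards `bottom` (saturating
    inhibitor); `ic50` is the concentration giving the midpoint signal.
    """
    return bottom + (top - bottom) / (1.0 + (conc / ic50) ** hill)


def conc_at_signal(signal, bottom, top, ic50, hill):
    """Invert the 4PL curve: concentration at which the curve equals `signal`."""
    if not min(bottom, top) < signal < max(bottom, top):
        raise ValueError("signal must lie strictly between bottom and top")
    ratio = (top - bottom) / (signal - bottom) - 1.0
    return ic50 * ratio ** (1.0 / hill)


# Hypothetical parameters, for illustration only.
midpoint = four_pl(5.0, bottom=0.1, top=1.0, ic50=5.0, hill=1.2)
```

in practice the four parameters would be estimated by nonlinear least squares on the normalized dilution series; the sketch only shows the model being fitted and how an inhibitory concentration is read off from it.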
isolation of potent sars-cov- neutralizing antibodies and protection from disease in a small animal model structural basis for the recognition of sars-cov- by full-length human human neutralizing antibodies elicited by sars-cov- infection characterization of the receptor-binding domain (rbd) of novel coronavirus: implication for development of rbd protein as a viral attachment inhibitor and vaccine nanobodies: natural single-domain antibodies structural basis for potent neutralization of betacoronaviruses by neutralizing nanobodies bind sars-cov- spike rbd and block interaction with ace an ultra-high affinity synthetic nanobody blocks sars-cov- infection by locking spike into an inactive conformation. biorxiv humanized single domain antibodies neutralize sars-cov- by targeting spike receptor binding domain. biorxiv an alpaca nanobody neutralizes sars-cov- by blocking receptor interaction. biorxiv affinity nanobodies block sars-cov- spike receptor binding domain interaction with human angiotensin converting enzyme. biorxiv, fast isolation of sub-nanomolar affinity alpaca nanobody against the spike rbd of sars-cov- by combining bacterial display and a simple single-step density gradient selection. biorxiv multivalent nanobody cocktails for highly efficient sars- a potent neutralizing nanobody against sars-cov- with inhaled delivery potential. biorxiv spike mutation pipeline reveals the emergence of a more transmissible form of sars-cov- . 
biorxiv tracking changes in sars-cov- spike: evidence that d g increases infectivity of the covid- virus potent neutralizing antibodies against sars-cov- identified by high- throughput single-cell sequencing of convalescent patients' b cells a neutralizing human antibody binds to the n-terminal domain of the spike protein of sars-cov- a serological assay to detect sars-cov- seroconversion in humans going beyond clinical routine in sars-cov- antibody testing -a multiplex corona virus antibody test for the evaluation of cross-reactivity to endemic coronavirus antigens. medrxiv quantum dot-conjugated sars-cov- spike pseudo-virions enable tracking of angiotensin converting enzyme binding and endocytosis evaluation of nine commercial sars-cov- immunoassays convergent antibody responses to sars-cov- in convalescent individuals a translational multiplex serology approach to profile the prevalence of anti-sars-cov- antibodies in home-sampled blood. medrxiv a high-throughput neutralizing antibody assay for covid- diagnosis and vaccine evaluation speed up to find the right ones: rapid discovery of functional nanobodies a sars-cov- surrogate virus neutralization test based on antibody- mediated blockage of ace -spike protein-protein interaction selection and identification of single domain antibody fragments from camel heavy- chain antibodies modulation of protein properties in living cells using nanobodies a versatile nanotrap for biochemical and functional studies with fluorescent fusion proteins targeting and tracing antigens in live cells with fluorescent nanobodies sars-cov- seroconversion in humans: a detailed protocol for antigen production, and test setup deuterium exchange mass spectrometry to study protein complexes optimization of feasibility stage for hydrogen/deuterium structure of the sars-cov- spike receptor-binding domain bound to the ace receptor s domain or homotrimeric spike of sars-cov- was incubated with nb combinations (concentrations ranging from . 
µm to . nm for each nb) and serum samples of convalescent sars-cov- patients and healthy donors at a : dilution. as positive control and maximal signal detection per sample, serum only was included and as negative control for nb binding a sars-cov- -unspecific gfp nanobody ( . µm) was used. to compare nb performance, the inhibiting mouse antibody ( -mm ) was added in concentrations of . µm to . nm. bound serum iggs were detected via anti-human-igg-pe as previously and fragments mass tolerance were set to ppm and . da, respectively. no enzyme selectivity was applied, however, identified peptides were manually evaluated to exclude peptides originated through cleavage after arginine, histidine, lysine, proline and the residue key: cord- -aywdmj o authors: song, wenfei; wang, ying; wang, nianshuang; wang, dongli; guo, jianying; fu, lili; shi, xuanling title: identification of residues on human receptor dpp critical for mers-cov binding and entry date: - - journal: virology doi: . /j.virol. . . sha: doc_id: cord_uid: aywdmj o middle east respiratory syndrome coronavirus (mers-cov) infects host cells through binding the receptor binding domain (rbd) on its spike glycoprotein to human receptor dipeptidyl peptidase (hdpp ). here, we report identification of critical residues on hdpp for rbd binding and virus entry through analysis of a panel of hdpp mutants. based on the rbd–hdpp crystal structure we reported, the mutated residues were located at the interface between rbd and hdpp , which potentially changed the polarity, hydrophobic or hydrophilic properties of hdpp , thereby interfering or disrupting their interaction with rbd. using surface plasmon resonance (spr) binding analysis and pseudovirus infection assay, we showed that several residues in hdpp –rbd binding interface were important on hdpp –rbd binding and viral entry. 
these results provide atomic insights into the features of interactions between hdpp and mers-cov rbd, and also provide potential explanation for cellular and species tropism of mers-cov infection. middle east respiratory syndrome (mers), a novel coronavirus which causes severe respiratory illness, was first reported in a patient from saudi arabia in (de groot et al., ) . to date, individual cases as well as small clusters and large outbreaks have been reported in several countries and the mortality rate is estimated at % among laboratory-confirmed cases (organization, ) . phylogenetic analysis demonstrates that the mers coronavirus (mers-cov) is genetically closest to clade c betacoronavirus found in camels and insectivorous bats (ithete et al., ) although the true viral reservoir remains uncertain. the clinical symptoms caused by mers-cov are similar to those caused by severe acute respiratory syndrome coronavirus (sars-cov) although the two viruses use two distinct receptors; mers-cov uses dipeptidyl peptidase (dpp ) while sars-cov uses angiotensin-converting enzyme (ace ). other coronaviruses use other receptors and perhaps this provides partial explanation for their cellular and species tropism. mers-cov can replicate in a range of cell lines derived from human, non-human primate, porcine, and bat (de wit et al., ) . traditional small laboratory animals, such as mice (coleman et al., ) , hamsters (de wit et al., ) , and ferrets (raj et al., ) , were shown to resist mers-cov infection. the finite host range of mers-cov has seriously restricted the development of appropriate animal models to study the pathogenesis of this virus and to assess the efficacy of potential therapeutic strategies. raj et al. ( ) demonstrated that human receptor dpp (hdpp ) domain (residues to ) could confer the susceptibility of ferret dpp to mers-cov infection. zhao et al. 
( ) are the first to describe a method of developing a small-animal model for mers-cov in which an adenovirus expressing hdpp was utilized to transiently transduce mouse airway cells and make mice susceptible to mers-cov infection. recently van doremalen et al. ( ) showed that dpp played an important role in the observed species tropism of mers-cov infection and identified residues in dpp responsible for this restriction. these results indicate that the insusceptibility to infection is primarily determined by the inability of mers-cov binding to dpp of a non-permissive cell line. previous findings have shown that hdpp extracellular domain consists of a variable n-terminal eight-blade β-propeller domain and a conserved c-terminal α/β-hydrolase domain (engel et al., ; rasmussen et al., ) . however, our understanding of critical residues of hdpp on mers-cov interaction and entry is quite limited. we and others have previously characterized rbd-hdpp crystal structure lu et al., ; wang et al., ) . the rbd-hdpp crystal structure showed that the viral rbd recognized blades iv and v of the dpp β-propeller domain. the atomic interaction details of the binding interface revealed that the rbd receptor recognition was predominantly mediated by several amino-acid residue interactions, including rbd residue d with dpp residue k , rbd y with dpp r , rbd residues d and e with dpp residues r and q , rbd l , w and v with dpp l and i . previously, we have generated a panel of mers-cov mutant rbd proteins at the residues d , y , d , e , l , w and v to characterize their impacts on binding activity to hdpp and the entry efficiency into target cells. however, the impacts of the corresponding residues on hdpp have not been well characterized. here, through structure-guided mutagenesis, we identified several key residues in hdpp that were critical for rbd binding measured by both real-time surface plasmon resonance (spr) and pseudovirus entry. 
these residues included k and r on binding patch , and l , i , r and q on binding patch . the mutations of the three positively charged residues k , r and r may interfere with the interaction with the negatively charged residues on the surface of rbd; the mutations of l , i and q may change the hydrophobic or hydrophilic properties of hdpp at the interface with rbd. our previous findings have shown that the binding interface between hdpp and mers-cov rbd is mainly composed of two binding patches, patch and patch (fig. a). the patch interface is characterized by interactions between the c-terminal end of the long linker connecting the rbd β /β strands and the hdpp blade . the contact in patch is critically determined by the polar interactions among a group of hydrophilic amino-acid residues, including rbd e , d , d and y and hdpp k and r . in this patch, dpp residue k interacts with rbd d through a salt bridge (fig. b), while dpp residue r forms a hydrogen bond with rbd residue y (fig. c). patch has a hydrophobic core surrounded by a hydrophilic periphery. in the hydrophobic core, rbd and hdpp contacts are critically dependent on a few 'hot spot' residues including rbd l , w and v , and dpp l and i . the surrounding hydrophilic surface consists of rbd residues d , e and y , and dpp residues h , r and q . among these hydrophilic residues, the salt bridge and hydrogen bond between d and r , and between e and q , contribute to the maintenance of rbd-receptor contact (fig. d). to study the impact of substitutions of the critical hdpp residues described above on the interaction between mers-cov rbd and hdpp , we determined the binding efficiency between these two proteins using spr. first, we constructed a series of hdpp mutants guided by the rbd-hdpp complex crystal structure. the wild-type and mutant hdpp were introduced into a baculovirus expression system.
all wild-type and mutant forms of hdpp were expressed efficiently (data not shown). second, the binding efficiency was measured by spr. as shown in fig. and table , mutations at several hdpp residues, individually or in combination, resulted in a significant attenuation in binding to mers-cov rbd. in patch , mutation of residue k (k a and k e), which presumably disrupted the salt-bridge interaction, completely abrogated the binding between hdpp and rbd, while r a reduced rbd and hdpp binding about fold. in patch , double mutations at l and i (l a + i a and l d + i d) completely eliminated the binding between rbd and hdpp , presumably by disrupting hydrophobic interactions with rbd l , w and v . in contrast, the single-residue substitutions r a and q a in the hydrophilic surface of patch had a negligible effect on binding efficiency. to further study the importance of the critical residues on hdpp for viral entry, we measured the entry efficiency of pseudovirus into cos cells expressing the wild-type and mutant forms of hdpp . the expression levels of the wild-type and mutant hdpp were analyzed by fluorescence-activated cell sorting (facs) using goat anti-hdpp polyclonal antibody. all of the wild-type and mutant hdpp proteins could be expressed on the surface of cos cells with similar expression efficiency (fig. a). forty-eight hours later, these cells were exposed to pseudovirus infection and their entry efficiency was measured by luciferase activity h later. as shown in fig. b, the residue mutations located at patch (k a, k e and r a) and the hydrophobic region of patch (l a + i a and l a + i d) resulted in a significant reduction in viral entry. this is consistent with the binding results described above. in the hydrophilic region of patch , residue substitution r led to partial loss of viral infection ( . %), while the mutation q modestly increased viral infection ( . %).
fig. . the amino-acid residue interaction details at the binding interface. (a) two patches of the binding interface. patch interface is characterized by interactions between the c-terminal end of the long linker connecting the rbd β /β strands (light magenta) and the hdpp blade (cyan). in patch , a gently concaved outer surface in rbd (light magenta) accommodates a linker containing a short α helix between hdpp blades and (cyan). (b) and (c) hydrophilic residues of rbd and hdpp interact through polar contacts in patch . rbd d has a salt-bridge interaction with hdpp residue k (b). dpp residue r forms a hydrogen bond with rbd residue y (c). the polar contacts (salt bridge and hydrogen bond) are drawn as black dashed sticks. (d) hot spot residues in the hydrophobic core and hydrophilic periphery of patch .
in summary, we have identified several key residues in hdpp critical for viral binding and entry into target cells. these residues include positively charged residues of patch (k and r ) and the hydrophobic zone of patch (l and i ). in contrast, the mutations in the hydrophilic zone of patch (r and q ) had little influence on binding and virus entry efficiency. these results show that the positively charged residues at the outer surface of blade and the hydrophobic regions of blade may play an important role in mediating viral binding and entry into the target cells, while the impact of mutations in the hydrophilic region of patch was barely detectable. this is consistent with our earlier findings, where residue mutations at the corresponding negatively charged and hydrophobic core positions on the rbd of mers-cov significantly reduced both binding and viral entry efficiency. sequence analysis of dpp from multiple animal species (fig. ) showed that mers-cov susceptible animals, such as macaque, camel and bat, shared the same sequence with hdpp at blades iv and v.
in contrast, mers-cov resistant animals, such as mouse, rat and ferret, have residues at l , i and r that are all different from hdpp . raj et al. ( ) reported that when these sites of hdpp were changed to the residues of ferret, the binding and viral infection efficiency also decreased. van doremalen et al. ( ) found residues involved in the hdpp -rbd interaction that were important in determining the susceptibility to mers-cov infection, among which were i and r . these results are consistent with our findings and suggest that these residues play an important role in rbd binding and viral entry, and in determining the tropism of mers-cov infection. mers-cov rbd (residues - ) and the extracellular domain of hdpp (residues - ) were expressed using the bac-to-bac baculovirus expression system (invitrogen). in brief, the dna encoding rbd and hdpp were respectively cloned into the pfastbac™ dual vector (invitrogen) incorporating an n-terminal gp signal peptide to facilitate secretion and a c-terminal hexa-histidine tag for purification. the constructed dna was then transformed into bacterial dh bac competent cells, and the recombined bacmid dna was extracted and transfected into sf cells using cellfectin ii reagent (invitrogen). after - days of incubation at k, the low-titer viruses were harvested and then amplified. the amplified high-titer viruses were then used to infect sf cells, and the cell culture supernatant containing target protein was harvested h after infection, concentrated, loaded onto nickel (ni)-charged resin (ge healthcare), eluted with . m imidazole, and further purified using the superdex™ high-performance column (ge healthcare) pre-equilibrated with tris buffer ( mm tris, ph . , mm nacl). fractions containing the purified protein were collected and applied directly to a pre-equilibrated resource™ q column (ge healthcare) and then eluted with a . - m nacl gradient in mm tris buffer (ph . ).
fractions containing protein were finally purified using a superdex™ column pre-equilibrated with hbs ( mm hepes, ph . , mm nacl) and concentrated by centrifugation to mg/ml. mutants of the extracellular domain of hdpp were constructed using a standard pcr-based cloning strategy, and the mutant proteins were expressed and purified in the same way. the spr analyses were carried out using a biacore t instrument (ge healthcare) equipped with a research-grade cm sensor chip. to measure the binding affinity between rbd and wild-type or mutant hdpp , the rbd was immobilized on the sensor chip by the standard amine coupling procedure. the flow cell was left blank to serve as a reference. purified rbd at a concentration of μg/ml in sodium acetate buffer ( mm, ph . ) was immobilized to a density of - response units on the flow cell . for the collection of binding data, hdpp or its mutants in a buffer of mm hepes, ph . , mm nacl, and . % (v/v) tween- were injected over the two flow cells at a series of concentrations at a μl/min flow rate and k. the rbd-hdpp complex was allowed to associate for s and dissociate for s. the surfaces were regenerated with an injection of mm naoh between cycles if needed. the data were analyzed with the biacore t evaluation software by fitting to a : langmuir binding model. mers-cov pseudovirus was generated by co-transfection of the human immunodeficiency virus (hiv) backbone expressing firefly luciferase (pnl r-e-luciferase) and the mers-cov spike glycoprotein expression vector (pcdna . +, invitrogen) into t cells. viral supernatants were harvested h later and normalized by p elisa kit (beijing quantobio biotechnology co., ltd, china) before infecting the target cos cells transiently expressing wild-type or mutant hdpp . the cos cells expressing wild-type or mutant hdpp were incubated with goat anti-hdpp polyclonal antibody (r&d) followed by incubation with fluorescein phycoerythrin (pe)-labeled rabbit anti-goat igg antibody (santa cruz).
the expression levels of wild-type and mutant hdpp were measured by flow cytometry (bd aria ii) and the mean fluorescence intensity (mfi) was analyzed. the cos cells infected by mers-cov pseudovirus were lysed at h post infection, and viral entry efficiency was quantified by comparing the luciferase activity of pseudovirus-infected cos cells expressing wild-type hdpp with that of infected cos cells expressing mutant hdpp . the entry efficiency (%) of pseudovirus was calculated on the basis of luciferase activity: the percentage shown for each mutant hdpp is its luciferase activity relative to that of wild-type hdpp , whose entry efficiency was defined as %. data shown were corrected for the expression of the different hdpp constructs using the mfi. error bars represent standard errors of the means of three independent experiments. student's t-test; *p < . ; **p < . . crystal structure of the receptor-binding domain from newly emerged middle east respiratory syndrome coronavirus wild-type and innate immune-deficient mice are not susceptible to the middle east respiratory syndrome coronavirus middle east respiratory syndrome coronavirus (mers-cov): announcement of the coronavirus study group the middle east respiratory syndrome coronavirus (mers-cov) does not replicate in syrian hamsters the crystal structure of dipeptidyl peptidase iv (cd ) reveals its functional regulation and enzymatic mechanism close relative of human middle east respiratory syndrome coronavirus in bat molecular basis of binding between novel human coronavirus mers-cov and its receptor cd middle east respiratory syndrome coronavirus (mers-cov) -update. 
world health organization adenosine deaminase acts as a natural antagonist for dipeptidyl peptidase -mediated entry of the middle east respiratory syndrome coronavirus crystal structure of human dipeptidyl peptidase iv/cd in complex with a substrate analog host species restriction of middle east respiratory syndrome coronavirus through its receptor, dipeptidyl peptidase structure of mers-cov spike receptor-binding domain complexed with human receptor dpp rapid generation of a mouse model for middle east respiratory syndrome we thank drs linqi zhang and xinquan wang for their kind support and helpful suggestions. this work was supported by national natural science fund , , ministry of science and technology of china ( cb ), the national science and technology major projects ( zx - ). key: cord- -fyvr tc authors: santiago, césar; mudgal, gaurav; reguera, juan; recacha, rosario; albrecht, sébastien; enjuanes, luis; casasnovas, josé m. title: allosteric inhibition of aminopeptidase n functions related to tumor growth and virus infection date: - - journal: sci rep doi: . /srep sha: doc_id: cord_uid: fyvr tc cell surface aminopeptidase n (apn) is a membrane-bound ectoenzyme that hydrolyzes proteins and peptides and regulates numerous cell functions. apn participates in tumor cell expansion and motility, and is a target for cancer therapies. small drugs that bind to the apn active site inhibit catalysis and suppress tumor growth. apn is also a major cell entry receptor for coronavirus, which binds to a region distant from the active site. three crystal structures that we determined of human and pig apn ectodomains defined the dynamic conformation of the protein. these structures offered snapshots of closed, intermediate and open apn, which represent distinct functional states. coronavirus envelope proteins specifically recognized the open apn form, prevented ectodomain progression to the closed form and substrate hydrolysis. 
in addition, drugs that bind the active site inhibited both coronavirus binding to cell surface apn and infection; the drugs probably hindered apn transition to the virus-specific open form. we conclude that allosteric inhibition of apn functions occurs by ligand suppression of ectodomain motions necessary for catalysis and virus cell entry, as validated by locking apn with disulfides. blocking apn dynamics can thus be a valuable approach to development of drugs that target this ectoenzyme. the apn ectodomain structure. as the apn protein is a type ii membrane protein, ectodomain expression required deletion of the n-terminal cytoplasmic and transmembrane domains, and introduction of a secretion signal sequence, as well as a hemagglutinin (ha) tag to allow protein detection and purification ( supplementary fig. s a ). as the n-terminal and middle portions of the hapn and papn ectodomains are heavily glycosylated, we produced them in cho cells (see methods). the purified proteins generated distinct crystal forms under different crystallization conditions (table and methods). in the past we reported a papn ectodomain crystal structure in complex with a cov spike (s) fragment (pdb code f c) , here we show three new structures for apn (table ). in the four structures, the n-terminal ha tag and ~ ectodomain residues were very disordered, indicating a large degree of flexibility in the membrane proximal polypeptide. the ectodomains have a hook-like conformation formed by domain i to iv and contained a zinc ion at the active site in domain ii (fig. ). the exposed convex side of domain iv mediates similar protein dimerization in the distinct crystals. approximately Å of each monomer is buried at the dimer interface ( table ) , indicative of a stable protein-protein interaction. domain iv is the largest apn domain and the most divergent in the m aminopeptidase family. 
in apn, domain iv has seven helix-turn-helix heat repeats and a single arm repeat formed by three alpha helices (α -α ) . the arm repeat is the most variable domain iv region in the hapn and papn structures, and can contact the peptide substrate bound to the active site (see below). although the dimeric assembly of human and pig apn ectodomains was preserved in various crystals, the conformation of each monomer differed among crystal forms, such that the distance between the n-terminal region of the ectodomains that formed the dimer varied from to Å in the structures ( table and supplementary fig. s b) . each crystal captured a single apn conformation, with all the monomers in the same form. these structures identified three distinct apn conformations, based on active site accessibility, which we termed closed, intermediate and open forms (fig. a) . as reported for other m family members , , the observed apn structural diversity indicated ectodomain dynamics in solution and on the cell surface. the active site accessibility at domain ii differed among crystal forms because of interdomain adjustments in the apn. the contacts between domain iv and other domains in the monomers varied among the structures, whereas the domain iv-iv buried surfaces in each monomer at the dimerization interfaces were preserved ( table ) . domain iv contacts with domain i or iii changed markedly less (~ - Å buried surface) than with domain ii ( Å ); domain ii-iv interaction thus mainly stabilized the closed apn conformation. there were no notable differences in the other interdomain contacts in the distinct apn forms (table ) . domains i-ii are distant from domain iv in the open conformation of the apn monomer (fig. a,b) , where the zinc ion at the catalytic site is more accessible to the solvent (fig. c) . the domain i to iii module swings ° toward domain iv, closing the active site (fig. b,c) . the hapn structure adopts an intermediate conformation in the crystals (fig. 
a) ; the distance between the n terminus of each monomer in the dimer is Å, and the angle difference of fig. a and table ). on the cell surface, the domain i to iii module must swing over domain iv, which is fixed by dimerization (fig. b, supplementary video s ). the module movement must be facilitated by the flexible ~ -residue polypeptide that links domain i to the transmembrane domain ( supplementary fig. s a ), although polypeptide length probably limits the interdomain movement shown here with apn ( °), which is less pronounced than that reported for erap- ( °, determined as in table ). the type of interdomain movement also differs between erap- and the apn. the domain iii-iv module moves together relative to domain i-ii in erap- , whereas domain i to iii swings over domain iv in the apn. in addition, the erap- hinge region is at the domain iii n terminus, whereas that of apn is in the domain iv n terminus. domains i, ii and iii can pivot at the beginning of the first (α ) or third (α ) domain iv helix, which are perpendicular to the swing angle (fig. b) . these differences in apn motion compared to other aminopeptidases are probably related to dimer formation, which is not observed in other m family members. the domain ii buried surface increases due to its interaction with domain iv when the conformation changes from open to closed (table ) , thus reducing accessibility of the active site cavity (fig. c) . m aminopeptidase dynamics is thought necessary for catalysis, and the closed the met residues of the papneh were replaced by seleno-met (see methods). highest-resolution shell is in parentheses. favored, allowed and outlier residues (%) in the ramachandran plot, as well as number of ectodomains in the asymmetric unit (asu) are shown. statistics for the papn-rbd crystal structure discussed here have been reported earlier (pdb code f c) . structure representations in supplementary fig. b . (fig. a) . 
in the closed papn, the side chain of a phenylalanine (phe ) at domain iv was placed at about . Å from the hydrolyzable peptide bond, whose carbonyl group is coordinated to the zinc ion. the phenylalanine was located in the loop that connects α and α in the single domain iv arm repeat of human and pig apn (fig. a) ; it penetrated the active site groove in the closed conformation and locked the peptide, ready for hydrolysis. domain iv residues that precede phe in the α -α loop contacted domain ii in the closed papn. a similar loop conformation is seen in a closed hapn structure (pdb code fys) . the phenylalanine side chain in closed apn probably hinders peptide release or translocation for further processing after p hydrolysis. it is likely that binding of the p ′ residue to the zinc ion required removal of the phenylalanine plug by domain ii displacement away from domain iv. the phenylalanine adopted a distinct conformation in the intermediate and open apn conformations (fig. a) . domain ii movement was accompanied by a conformational change of the α -α loop in domain iv (supplementary video s ), which became more solvent-exposed; the phenylalanine side chain faced into domain iv in the intermediate and open conformations and the peptide plug was removed from the active site. these changes would facilitate release of the n-terminal residue after hydrolysis. the small interdomain movement of the intermediate apn structure would be sufficient for peptide processing (fig. a) . we previously described in detail the cov spike rbd-papn binding interface . the porcine cov spike rbd binds to a papn region that is distant from the catalytic site ( supplementary fig. s b) . a critical cov receptor-binding motif, which bears an exposed tryptophan, penetrates a narrow cavity formed by domain ii and iv (fig. b) . 
the tryptophan aromatic side chain stacks onto papn domain iv residues his -pro , and is trapped by domain iv residues asn -pro on one side and domain ii residues gln -ser on the other (fig. b) . the main chain of domain ii residues is in close contact ( . Å) with the tryptophan side chain, and its imino nitrogen forms a hydrogen bond with the domain iv asn main chain carbonyl. domain iv-based superposition of the open papn with bound rbd and that of closed papn showed a shift in the domain ii main chain region that contacts the rbd; this region collides (< . Å) with the cov tryptophan (fig. b) . closing of the ectodomain would hinder penetration of the viral tryptophan between the papn domain ii and iv. cov binding to apn would lock the protein in its open conformation (fig. b) , preventing the ectodomain movement probably necessary for peptide hydrolysis (fig. a) . we analyzed the catalytic activity of soluble human and pig apn ectodomains in the presence of porcine cov s fragments bearing the rbd (fig. ) . the soluble s proteins specifically inhibited papn-mediated catalysis, measured as the hydrolysis of the l-pna substrate, and had no effect on hapn activity. the tgev (transmissible gastroenteritis coronavirus) spike does not bind hapn because it lacks the n-linked glycan recognized by porcine cov in papn , . the isolated rbd was sufficient to inhibit papn catalysis (fig. a) ; inhibition was dependent on rbd concentration. a high rbd:papn ratio was needed to achieve maximum inhibition ( - %; fig. b ), which decreased slowly after min (fig. c) drugs that bind the catalytic site inhibit cov binding to apn. non-hydrolyzable drugs that bind the apn catalytic site inhibit catalysis and prevent angiogenesis and tumor growth , , , . they appear to restrict ectodomain conformational changes, as shown by reduction in the number of some apn conformation-specific fig. 
s b) , with residues that contact the rbd in sticks with carbons in yellow (domain ii) and green (domain iv). the same residues are shown for the superposed closed structure (carbons in grey). the rbd motif that penetrates the papn cavity is shown with a grey surface and with residues as sticks (carbons in cyan or in magenta for trp). scientific reports | : | doi: . /srep mab epitopes . on the cell surface, active site epitopes recognized by the my mab decrease in the presence of actinonin, which indicates apn closure. crystal structures of m aminopeptidases in complex with these drugs show preferential adoption of a closed state , , , . drug binding would thus not only compete with substrates for active site binding, but might also restrict the aminopeptidase dynamics needed for peptide processing. the structure of the papn-rbd complex indicates that porcine cov would be specific for the open conformation (fig. b) . restriction of apn ectodomain opening by active site-binding drugs would thus have an allosteric effect on cov binding. to test this hypothesis, we studied tgev rbd binding to cell surface papn in the presence of drugs that bind to the active site (fig. ) . in flow cytometry, we determined the binding of an rbd-fc fusion protein to cells that expressed papn or an active site mutant (papn-hh/aa), alone or with various drugs (fig. a,b) . we analyzed the effect of the natural apn inhibitors actinonin and bestatin ; both reduced rbd-fc binding to cell surface papn (fig. a, left) . we then evaluated four synthetic amino-benzosuberone (abs) derivatives that bind with high affinity and selectivity to apn (supplementary fig. s a) ; all four abs molecules prevented rbd binding to papn and its effectiveness increased with apn-binding affinity (fig. b, left) . the bulkier abs and abs compounds, which contain a phenyl group and bind with the highest affinity to apn, more efficiently blocked binding of the tgev rbd to papn on the cell surface. 
the inhibitory molecules bind to the apn active site , , which is distant from the apn region recognized by cov (supplementary fig. s b) . to further determine whether the inhibitory effect was linked to drug binding to the papn active site, we analyzed rbd binding to the papn-hh/aa mutant, which lacks the two histidines (h and h ) that coordinate the zinc ion ( supplementary fig. s ) . staining for the rbd-fc protein was similar in cells expressing the mutant, alone or with the drugs (fig. a right and b right) , which showed that compound binding to the papn active site was necessary to prevent rbd binding to a distant site. in addition, inhibition of rbd binding to papn was drug concentration-dependent (fig. c) , and the amount of compound needed to reach % inhibition (ic ) decreased with compound affinity for apn (~ μ m for bestatin (ki ~ μ m), ~ μ m for actinonin (ki ~ μ m), ~ . μ m for abs (ki ~ . nm) ). these results show that drugs that bind to the cov cell entry and infection . we nonetheless found that active site-binding molecules hindered cov s protein binding and might inhibit virus infection. studies with low affinity binding drugs such as bestatin show no reduction in tgev infection . virus particles have high receptor-binding avidity, and these drugs might not have sufficient affinity to maintain most apn molecules closed. the selective compounds abs - have high affinity for apn and, at - μ m concentrations, inhibit capillary tube formation in cell cultures, with no cytotoxicity . in our cultures, we observed no toxicity at abs concentrations < μ m (not shown). we therefore analyzed the tgev-mediated cytopathic effect for each of the four abs molecules and actinonin at a μ m concentration and monitored inhibition of virus infection with abs ( log) and abs ( log) (fig. a) ; at the same concentration, the lower-affinity abs and abs compounds or actinonin did not inhibit. 
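The concentration-dependent inhibition of rbd binding described above is summarized by an ic value for each compound. As a rough sketch of how such a value can be read from a dose-response series, the function below interpolates linearly on the logarithm of concentration between the two measured points that bracket the target inhibition. The 50% default is an assumption (the conventional half-maximal threshold; the percentage is not legible in the text), the function name is hypothetical, and this is not claimed to be the procedure used for the reported values.

```python
import math


def interpolate_ic(concentrations, inhibitions, target=50.0):
    """Concentration giving `target` % inhibition, or None if not bracketed.

    concentrations : positive drug concentrations (any consistent unit)
    inhibitions    : % inhibition measured at each concentration
    Interpolation is linear in log10(concentration) between the two
    neighbouring points whose inhibition values bracket the target.
    """
    points = sorted(zip(concentrations, inhibitions))
    for (c_lo, i_lo), (c_hi, i_hi) in zip(points, points[1:]):
        if i_lo <= target <= i_hi and i_hi > i_lo:
            frac = (target - i_lo) / (i_hi - i_lo)
            log_c = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10.0 ** log_c
    return None  # the measured range never crosses the target inhibition
```

A full four-parameter logistic fit would use all points and give confidence limits; the interpolation is only the simplest estimate consistent with the measured series.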
abs has a bromo substituent that is predicted to interact with the phenylalanine that plugs the substrate in the closed conformation; this interaction likely helped maintain the closed ectodomain and efficiently prevented virus binding. tgev is a representative, extensively studied animal cov that uses papn for cell entry. to further determine whether abs inhibition of virus infection was linked to cell entry, we analyzed tgev replication at h post-infection and found that virus entry decreased with the abs concentration (fig. b). abs addition after virus adsorption at °c did not inhibit virus growth (not shown), which indicates that it prevented virus binding to cells. in addition, we studied the effect of abs concentration on tgev cell infection, and we observed that the tgev cytopathic effect was reduced and cell survival increased at higher abs concentrations (fig. c). abs compounds are selective for apn molecules and designed to inhibit apn catalytic activity and tumor growth; here we show that they also prevent cov cell infections. to further analyze the importance of apn dynamics, we engineered disulfide bonds to bridge domains ii and iv and restrict ectodomain motion. we replaced the papn domain ii ser and domain iv ser and/or ser with cysteine to lock the ectodomain in the closed form with interdomain disulfide bridges. the ser main chain cα is ~ and ~ Å, respectively, from ser and ser in the closed form, but ser moves ~ Å away in the open form (supplementary fig. s ). disulfide bond formation between papn cys and cys or cys should thus prevent ectodomain motion. we expressed the papn-cysteine mutants (c -c ) on the t cell surface and compared their catalytic and cov binding activity with that of the wild type papn (fig. ). the papn cysteine mutants showed reduced catalytic activity (fig. a) and tgev rbd binding (fig. b) relative to the wild type protein in t transfectants that express similar protein amounts.
the higher activity of the papn c than the c or c mutants suggested that the cys -cys disulfide bond was more labile than the cys -cys bond, probably because cys is in a flexible loop (supplementary fig. s ). treatment with a reducing agent restored catalysis and rbd binding in the cysteine mutants and did not affect wild type papn binding activity (fig. ) . reducing the disulfide bonds fully restored rbd binding, but catalysis was partially recovered in the papn c and c mutants. substrate hydrolysis is proposed to close the ectodomain (see above), which would facilitate rebuilding the disulfides. locking the closed form and the phenylalanine in the domain iv arm repeat inside the active site probably impeded substrate processing (fig. a) . the papn cysteine mutants bound markedly less rbd than the wild type protein, which confirmed that cov s protein binding to the closed papn was sterically hindered (fig. b) , and that cov recognized the open form. overall, these results validate the functional relevance of the apn ectodomain conformations and its motion. structural dynamics is an intrinsic property of aminopeptidases. the apn crystal structures reported here indicate the dynamic conformation of its ectodomain, and functional studies show its relevance in catalysis and virus infection. distinct ectodomain regions mediate these functions, but agents that bind to one region prevent activities linked to the other. these allosteric effects with ligands are probably caused by restrictions in apn conformational dynamics, as confirmed with disulfide bond mutants. they demonstrated that preventing ectodomain motion and locking apn forms inhibits its functions. apn ectodomain movement is less pronounced and differs from that reported for other m aminopeptidases. these differences could be due to the apn dimeric conformation and its linkage to the cell surface. 
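To make the notion of an interdomain swing concrete, the sketch below shows one simple way to quantify such a movement: after two conformations have been superposed on the fixed domain, the angle is measured between the lines joining a hinge point to the centroid of the moving module in each conformation. This is an illustrative assumption only, not necessarily how the angles reported in the table were determined; the function names and the choice of hinge point are hypothetical.

```python
import math


def centroid(points):
    """Mean position of a list of (x, y, z) coordinates."""
    n = len(points)
    return tuple(sum(p[i] for p in points) / n for i in range(3))


def swing_angle_deg(hinge, module_a, module_b):
    """Angle (degrees) swept by a module's centroid about a fixed hinge.

    hinge              : (x, y, z) of the pivot, e.g. an atom in the hinge region
    module_a, module_b : coordinates of the moving module in the two
                         conformations, already superposed on the fixed domain
    """
    def unit_vector(tip):
        v = tuple(tip[i] - hinge[i] for i in range(3))
        norm = math.sqrt(sum(c * c for c in v))
        return tuple(c / norm for c in v)

    u = unit_vector(centroid(module_a))
    w = unit_vector(centroid(module_b))
    # Clamp to guard against rounding slightly outside [-1, 1].
    cos_theta = max(-1.0, min(1.0, sum(a * b for a, b in zip(u, w))))
    return math.degrees(math.acos(cos_theta))
```

A rigid-body fit of the whole module (rotation matrix plus rotation axis) would be more rigorous; the centroid-based angle is only the shortest way to express the idea.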
dimerization only engages the domain iv region, and we found that the dimer is conserved in all apn structures, closed, intermediate and open. apn domain iv thus does not move as described for erap- or f , , proteins that do not form dimers. the fixed conformation of the apn dimer determines that the domain i to iii module swings over domain iv (supplementary video s ), with the hinge at the domain iv n-terminal region. the length of this movement is less marked in apn ( °) than in erap- ( °), although the two proteins have very similar closed conformations. displacement of the apn domains i, ii and iii must be limited by the length of the flexible polypeptide that links domain i to the transmembrane region, whose movement is restricted by membrane fluidity. the extent of apn movement nonetheless appears to be sufficient for release of the hydrolyzed peptide n-terminal residue, which is not plugged by domain iv in the open or in the intermediate apn conformations (fig. a) . it is not clear how each monomer in the dimer moves, whether their movement is random or synchronized in the same or inverse directions. experiments with hapn antibodies and those shown here with the tgev rbd (fig. ) suggest that ~ % of the molecules adopt different forms; these data imply that each apn monomer maintains a distinct conformation (supplementary video s ). the crystal structures reported here provide snapshots of apn dynamic conformation, and also guided experiments that demonstrate its role in virus entry into cells and catalysis. the switch between a proteolytic active (closed) and an inactive (open) conformation has been proposed for several m aminopeptidases , , , . this dynamics is thought to be important for peptide hydrolysis and release from the aminopeptidase active site. the region that joins α and α in the domain iv arm repeat penetrates the active site groove in closed pig and human apn structures reported here and elsewhere (fig. 
a), and a conserved phenylalanine in this region locks the substrate coordinated to the zinc ion, permitting hydrolysis. further peptide processing likely requires removal of the phenylalanine lock by opening the apn ectodomain, which facilitates n-terminal residue release and peptide translocation, both sterically hindered in the closed conformation (fig. a). the inherent flexibility in the domain iv arm repeat, which we show here is linked to interdomain arrangements, might also enable substrate processing, and indicates how ectodomain movements participate in peptide hydrolysis. local changes in a conserved tyrosine (tyr in papn) at the active site of m aminopeptidases are also suggested to be important. among apn forms, the absence of conformational switches in active site residues at domain ii (supplementary fig. s ) nonetheless indicates that tyrosine movement is not linked to interdomain motion. disulfide bonds that lock the apn closed conformation or drugs that prevent opening of the ectodomain inhibited cov protein binding and cell infection, whereas porcine cov s proteins probably hinder apn transition to the closed form and peptide hydrolysis. our results verify the critical role of apn dynamics in cov infection and catalysis, and demonstrate that the open apn structure is inactive in peptide hydrolysis. anti-apn antibodies that inhibit apn activity and reduce tumor growth likely block ectodomain movements. the allosteric inhibition of apn functions shown here using viral proteins and drugs is likely to be due to suppression of apn transient conformational states, as shown for other enzymes. blocking apn movement prevents its functions, and suggests a new approach for the development of drugs that target this protein. small molecules or conformation-specific antibody inhibitors of ectodomain motions can bind to the active site or interact with distant sites, as shown here with cov spike fragments.
high affinity drugs designed to inhibit catalysis and tumor growth prevent cov infections, which indicates that targeting apn ectodomain dynamics can be a valuable approach to block apn functions related to cancer progression and virus infections. . catalysis was determined at min as absorbance at od nm (see fig. and methods). relative rbd-fc binding to transfected cells determined from mean fluorescence intensity computed by flow cytometry as in fig. c . domain ii and iv residues replaced by cysteine are indicated at bottom (see supplementary fig. s ). mean ± sd (n ≥ ). . a papn protein with met residues replaced by seleno-met (se-met papn) was produced using methionine-and glutamine-free dmem (invitrogen) supplemented with % dialyzed fetal calf serum (fcs) and l-seleno-methionine (both from sigma). apn proteins secreted to culture supernatants were purified by affinity chromatography with anti-ha ac mab (roche), followed by size exclusion chromatography in hepes-saline buffer ( mm hepes, mm nacl) ph . . preparation of most soluble cov s proteins used here has been described , . s h and s h proteins were derived from the hol porcine cov s glycoprotein and bear the papn-binding domain. soluble tgev rbd was derived from the s glycoprotein of the tgev sc strain (genbank acc. n° aj ). it contains s residues to , an n-terminal ha peptide, and a flag sequence (monovalent rbd variant) or human igg fc (bivalent rbd-fc variant) at the c-terminal end. cov s proteins were produced in cho-lec or t cells and purified as described . analysis of apn catalysis. apn catalytic activity was determined using leucine p-nitroanilide (l-pna) (sigma) in a standard spectrophotometric assay in -well plates with soluble proteins or transfected cells. to study cov protein inhibition of apn catalysis, soluble apn ectodomains ( μ g/ml; ~ nm) were added to duplicate wells, alone or with soluble cov s protein variants, followed by the l-pna substrate ( mm) in μ l final volume ( °c). 
plates were incubated at room temperature and od at nm was measured at different times. background od of wells without apn was subtracted to determine specific catalytic activity. similar procedure was used with t cells ( × ) expressing papn hr after transfection. od of well with mock-transfected cells were taken as background. cell samples expressing various amounts of papn at the membrane were used to normalize the activity of the papn cysteine mutants. relative activity of the mutant to wild type was determined as the ratio of the papn mutant to the wild type od from samples with the same protein expression, as monitored by flow cytometry (see below). cov protein binding to apn. stably transfected bhk -papn, cho-papn and cho-papn mutant cells or transiently transfected t cells were used. the papn contained the ha peptide at the c terminus to monitor cell surface expression in cho and t cells. in the papn-hh/aa mutant, the two active site histidines were replaced with alanines, whereas in the papn cysteine mutants, the domain ii ser and the domain iv ser and/or ser were substituted by cysteine. we analyzed the effect on rbd binding to papn of two natural inhibitors of apn enzyme activity, bestatin and actinonin (sigma) , as well as four synthetic abs compounds . bestatin and actinonin were dissolved at mm in pbs, whereas abs compounds were used at mm in dmso. in wild type or mutant papn-expressing cells, we used flow cytometry to monitor tgev rbd binding to cell surface papn, essentially as reported , . cells were washed three times with cold pbs and resuspended ( cells/ml) in pbs supplemented with . % heat-inactivated fcs and . % bovine serum albumin (bsa; binding buffer); μ l of cell suspension were added to -well plates (nunc), cells were sedimented and resuspended in μ l of - μ g/ml rbd-fc solution alone or with inhibitors at indicated concentrations ( min, °c). an unrelated fc fusion protein was used as control. 
cells were washed and incubated with anti-human igg fluorescein isothiocyanate (fitc)-labeled secondary antibody ( min, °c). the mean fluorescence intensity was determined in a beckman coulter epics xl ; background cell staining with the fc protein was subtracted to determine the specific rbd-fc binding to cell surface papn. in parallel, the amount of cell surface papn expression was determined by flow cytometry with the anti-ha ac mab (roche) and an anti-mouse fitc-labeled secondary antibody (invitrogen). the binding activity of the papn cysteine mutants was normalized by the cell surface protein amounts as explained above for the catalytic activity. by qrt-pcr (quantitative reverse transcription polymerase chain reaction). stably transfected bhk -papn or bhk cells ( × cells/well) in dmem (dulbecco's modified eagle's medium) with % fcs were plated in -well plates ( h). plates were transferred to °c, medium was removed and μl binding buffer alone or with apn-binding drugs or rbd protein was added to wells; after min, the solution was replaced with μl virus inoculum at a multiplicity of infection (m.o.i.) of , alone or with inhibitors in binding buffer. after virus adsorption at °c, cells were washed three times with binding buffer, and incubated in dmem with % fcs ( h, °c, % co ). cells were detached and lysed with μl tri reagent (sigma) for rna extraction, and cdna was generated from μg rna using the high capacity cdna reverse transcription kit (applied biosystems). real-time pcr reactions ( μl) were performed in triplicate using μl cdna sample, μl of x hot firepol evagreen qpcr mix plus (rox) (solis biodyne) and . μl of specific primers for mouse β-actin or for the tgev s gene, in a real time pcr system (applied biosystems) using a standard protocol. data were analyzed with software using the comparative ct method (ΔΔct).
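As a rough illustration of the comparative ct calculation named above, the sketch below computes ΔΔct from triplicate ct values, with the tgev s gene as target and β-actin as reference, and converts it to a relative level. It assumes the textbook form of the method, in which template doubles at every cycle (about 100% amplification efficiency); the function names are hypothetical and the analysis software used in the study may apply efficiency corrections not shown here.

```python
from statistics import mean


def delta_delta_ct(target_treated, ref_treated, target_control, ref_control):
    """ΔΔCt from replicate Ct values.

    target_* : Ct values of the gene of interest (here, TGEV S)
    ref_*    : Ct values of the reference gene (here, β-actin)
    *_treated: sample with inhibitor; *_control: sample without inhibitor
    """
    d_ct_treated = mean(target_treated) - mean(ref_treated)
    d_ct_control = mean(target_control) - mean(ref_control)
    return d_ct_treated - d_ct_control


def relative_level(dd_ct):
    """Target level in the treated sample relative to the control (2^-ΔΔCt)."""
    return 2.0 ** (-dd_ct)
```

For example, a target that needs three more cycles than the control after normalization to the reference gives a ΔΔct of 3 and a relative level of one eighth, that is, an eight-fold reduction in viral rna.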
tgev s expression relative to β -actin was determined and the ratio of values alone and with inhibitor used as relative cell entry. infection or cytopathic effect of tgev was inhibited in porcine st cells. one day after seeding ( . × cells/well) in -well plates, cells were transferred to °c and pre-incubated with μ l binding buffer alone or with inhibitors, in duplicate. solutions were replaced with μ l of serial -fold dilutions of virus inoculum with inhibitors or with dmso (≤ . %) as control. after incubation ( h, °c), cells were washed three times with dmem with % fcs and incubated alone or with inhibitors for two days at °c. to determine cell survival after infection, medium was removed, cells were formalin-fixed, stained with crystal violet and viability determined by optical density (od) at nm. ratios from wells with and without virus were determined to calculate cell survival (see supplementary fig. s b ). crystallization and diffraction data collection. the endoglycosidase h-treated ( h, °c) papn (papneh) ectodomain was crystallized by the sitting drop technique with a crystallization solution of % polyethylene glycol (peg)- and % peg- (ph ~ ) and a mg/ml protein sample. alternatively, native glycosylated papn ectodomain crystals were prepared with a crystallization solution of % peg- and mm sodium acetate ph . . the hapn ectodomain ( mg/ml) was crystallized with a solution of % peg- , mm imidazole-hcl ph . . crystals were frozen in crystallization solutions containing % ethylene glycol for diffraction data collection at the european synchrotron radiation facility (esrf; id and id ) and swiss light source (sls; pxii) beamlines. diffraction data were processed with xds and scaled with scala programs . for statistical data, see table . structure determination. the structure of se-met papneh protein was solved by a combination of molecular replacement (mr) and single-wavelength anomalous dispersion (sad) methods. 
the crystals contained two molecules in the asymmetric unit (table ). a partial structure was obtained by mr using the phaser program and domains i to iii of the tricorn-interacting factor f (pdb code z w), which share ~ % residue identity with papn. the phaser llg value for the best mr solution was , whereas rfz values were . and . , and tfz values were . and . . we then used the mrsad protocol in the auto-rickshaw server to determine the complete papneh structure, starting from the partial mr structure and using se-met papneh crystal diffraction data collected at the selenium peak wavelength. the final structure included the two papn molecules of the asymmetric unit, which were adjusted manually and refined with phenix.refine using data extending to . Å resolution (for statistics, see table ). the papneh structure comprises residues to and the zinc atoms coordinated in the enzyme active site. the other apn ectodomain structures (table ) were determined by the mr method using the papneh structure as search model. two ensembles, one including domains i, ii and iii and the other the isolated domain iv, were used for mr structure determination with phaser. structures were refined with phenix.refine (statistics in table ). in all structures, the engineered tags and - residues of the n-terminal ectodomains were highly disordered and are not included in the final models. electron density maps of active site residues and of n-linked glycans are shown in supplementary figs s and s , respectively. structure representations were prepared with pymol (pymol.org).
we thank the esrf for provision of synchrotron radiation facilities through bag-madrid projects, as well as the swiss-sls facility, s. rodríguez for technical support, and c. mark for editorial assistance. gm was a recipient of a la caixa fellowship. jr was supported by the juan de la cierva program and rr by nih grant p ai - a . the work was supported by grants from the spanish ministry of science (bfu - and bio - -r to jmc). key: cord- - y n authors: xu, cong; wang, yanxing; liu, caixuan; zhang, chao; han, wenyu; hong, xiaoyu; wang, yifan; hong, qin; wang, shutian; zhao, qiaoyu; wang, yalei; yang, yong; chen, kaijian; zheng, wei; kong, liangliang; wang, fangfang; zuo, qinyu; huang, zhong; cong, yao title: conformational dynamics of sars-cov- trimeric spike glycoprotein in complex with receptor ace revealed by cryo-em date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: y n the recent outbreaks of severe acute respiratory syndrome coronavirus (sars-cov- ) and its rapid international spread pose a global health emergency. the trimeric spike (s) glycoprotein interacts with its receptor human ace to mediate viral entry into host-cells. here we present cryo-em structures of an uncharacterized tightly closed sars-cov- s-trimer and the ace -bound-s-trimer at . -Å and . -Å-resolution, respectively. the tightly closed s-trimer with inactivated fusion peptide may represent the ground prefusion state. ace binding to the up receptor-binding domain (rbd) within s-trimer triggers continuous swing-motions of ace -rbd, resulting in conformational dynamics of s subunits.
notably, sars-cov- s-trimer appears much more sensitive to ace -receptor than sars-cov s-trimer in terms of receptor-triggered transformation from the closed prefusion state to the fusion-prone open state, potentially contributing to the superior infectivity of sars-cov- . we defined the rbd t -t loop and residue y as viral determinants for specific recognition of sars-cov- rbd by ace , and provided a structural basis for the enhanced infectivity induced by the spike d g mutation. our findings offer a thorough picture of the mechanism of ace -induced conformational transitions of s-trimer from the ground prefusion state towards the postfusion state, thereby providing important information for the development of vaccines and therapeutics aimed at blocking receptor binding. coronaviruses are a family of large, enveloped, positive-stranded rna viruses that cause upper respiratory, gastrointestinal and central nervous system diseases in humans and other animals (song et al., ; walls et al., ). in the past few decades, newly evolved coronaviruses have posed a global threat to public health, including the outbreaks of the severe acute respiratory syndrome coronavirus (sars-cov) in - and the middle east respiratory syndrome coronavirus (mers-cov) in , which caused thousands of infections, with mortality rates of about % and . %, respectively (rabaan et al., ). the recent coronavirus disease pandemic is caused by a novel coronavirus named severe acute respiratory syndrome coronavirus (sars-cov- ). on june , , there had been , , laboratory-confirmed sars-cov- infections globally, leading to , deaths. to date, there are no approved therapeutics or vaccines against sars-cov- and other human-infecting coronaviruses. as in other coronaviruses, the spike (s) glycoprotein of sars-cov- is a membrane-fusion machine that mediates receptor recognition and viral entry into cells and is the primary target of the humoral immune response during infection (rabaan et al., ; tang et al., ).
the s protein is a homotrimeric class i fusion protein that forms large protrusions from the virus surface and undergoes a substantial structural rearrangement to fuse the viral membrane with the host-cell membrane once binds to a host-cell receptor (bosch et al., ; li, ) . the s protein ectodomain consists of a receptor-binding subunit s and a membrane-fusion subunit s (tang et al., ; walls et al., ; wrapp et al., ) . two major domains in coronavirus s have been identified, including an n-terminal domain (ntd), and a c-terminal domain (ctd) also called receptor binding domain (rbd). following the rbd, s also contains two sub-domains (sd and sd ) . the s contains a variety of motifs, starting with the fusion peptide (fp). the fp describes a short segment, conserved across the viral family and composed of mostly hydrophobic residues, which inserts in the hostcell membrane to trigger the fusion event (epand, ; tang et al., ) . recent cryoelectron microscopy (cryo-em) studies on the stabilized ectodomain of sars-cov- s protein revealed a closed state of s trimer with three rbd domains in "down" conformation (walls et al., ) , as well as an open state with one rbd in the "up" conformation, corresponding to the receptor-accessible state (walls et al., ; wrapp et al., ) . unlike in mers-cov s protein (pallesen et al., ) , the two or three rbd "up" conformation has not been detected for sars-cov- s trimer. sars-cov- s and sars-cov s share % amino acid sequence identity, yet, they bind the same host-cell receptor-human angiotensin-converting enzyme (ace ) (hoffmann et al., ; wang et al., ; zhou et al., ) . it is usually considered that the transition process towards the postfusion conformation is triggered when the s subunit binds to a hostcell receptor; receptor binding destabilizes the prefusion trimer, resulting in shedding of the s subunit and transition of the s subunit to a stable postfusion conformation (walls et al., b) . 
the available crystal structures of the rbd domain of sars-cov- interacting with the extracellular peptidase domain (pd) of ace , together with the cryo-em structure of rbd domain associated with the full length ace provided important information on the rbd-ace interaction interface, revealing that the residues s to q , known as the receptorbinding motif (rbm), within rbd directly interact with ace (lan et al., ; wang et al., ; yan et al., ) . however, a complete picture of ace associating with the sars-cov- trimeric s protein is still missing, and it remains elusive on how ace binding induces sars-cov- s trimer conformational destabilization to facilitate transitions towards the postfusion state. here, we present cryo-em structures of sars-cov- s trimer in a tightly closed state, and the s trimer in complex with the receptor ace (termed sars-cov- s-ace ) at . Å and . Å resolution, respectively, in addition to a s trimer structure in the unliganded open state. the tightly closed ground prefusion state with originally dominant population may indicate a conformational masking mechanism of immune evasion for sars-cov- spike. our data suggested there is one rbd in the "up" conformation and is trapped with ace in the s-ace complex; ace can greatly shift the conformational landscape of s trimer, and trigger continuous swing motions of ace -rbd in the context of the s trimer resulting in conformational dynamics in s subunits. we demonstrated the rbm t -t loop and residue y as viral determinants for specific recognition of sars-cov- rbd by ace . our findings provide a blueprint for the understanding of the mechanisms of ace -induced conformational dynamics and resulted conformational transitions of the s trimer towards postfusion state, which may benefit anti-sars-cov- drug and vaccine development. prefusion stabilized ectodomain trimer of sars-cov- s glycoprotein was produced from hek f cells using the strategy also adopted in other studies (fig. 
s a ) (kirchdoerfer et al., ; miroshnikov et al., ; pallesen et al., ; tortorici et al., ; walls et al., a; walls et al., ; walls et al., ; walls et al., ; wrapp et al., ) , and was subjected to cryo-em single-particle analysis ( fig. s a-b ). our initial reconstruction suggested a preferred orientation problem associated with the s trimer (highly preferred "side" orientation but lacking tilted top views, fig. s c ), which is also the case for the influenza hemagglutinin (ha) trimer (but highly preferred "top" orientation) (tan et al., ) . to overcome this problem, we adopted the recently developed tilt stage strategy in data collection with additional data collected at º and º tilt angles (tan et al., ) . this allowed us to obtain a cryo-em structure of sars-cov- s trimer in a closed state at . Å resolution (with imposed c symmetry, termed s-closed) (figs. a, and s -s , movie ). excitingly, after overcoming the preferred orientation problem, our s-closed map very well resolved the peripheral edge of the ntd domain ( fig. a-c) , which was less well resolved in the recent reports (walls et al., ; wrapp et al., ) . this enabled us to build a more complete model of the sars-cov- s trimer containing the previously missing loop regions (including q -p , k -f , y -n , q -n , r -s , and s -a , fig. b, s g) ; additionally, the s -c loop in the rbm subdomain was also captured in our structure ( fig. d) . interestingly, compared with the recent closed state sars-cov- s trimer structure (walls et al., ) , our map represents an uncharacterized tightly closed conformation. for instance, the upper portion of s subunit especially ntd and rbd depicts an anti-clockwise rotation of . º and . º, respectively (fig. e ). accompanying this rotation, there is a slight inward tilt leading the peripheral edge of ntd exhibiting a . Å inward movement for ca of t (fig. s g ). these motions can be propagated to the central helix (ch) of s subunit, generating a clockwise rotation of . º (fig. 
e ). the clockwise rotation of this central portion, combined with the opposite anti-clockwise rotation of the outer portion, twists the complex into a more compact conformation. indeed, the average interaction interface between protomers increased from ~ , . Å in their structure to , . Å in our structure (fig. f ). taken together, our map represents a tightly closed state of the sars-cov- s trimer, not captured before. furthermore, when comparing our sars-cov- s-closed structure with the closed state sars-cov s trimer cryo-em structure (gui et al., ), there is an anti-clockwise rotation of . º and . º in ntd and rbd, respectively, and a clockwise rotation of . º in the ch region from their structure to our s-closed structure, associated with an rbd inward shift towards the central axis (rmsd of . Å, fig. s h ). collectively, our s-closed structure appears more compact than that of the sars-cov s trimer ( , . Å vs. , . Å in interaction interface, fig. f ). altogether, our study revealed a tightly closed conformation of the sars-cov- s trimer, not observed in the homologous sars-cov s either, extending the detected conformational space of the sars-cov- spike protein. the tightly closed state with stably packed fusion peptide may represent the ground prefusion state of sars-cov- s trimer the hydrophobic fusion peptide, immediately after the s ' cleavage site and essential for host-cell membrane fusion, is highly conserved among sars-cov- , sars-cov, and mers-cov s proteins (tang et al., ). still, the majority of fp is missing in the available sars-cov- s trimer structures. thus, how it folds, where it is located within the s trimer of the virus and how it can be activated remain unclear. here, our s-closed map enabled us to capture the entire fp of sars-cov- including the previously undetected l -q fragment, which is located on the flank surface of the s trimer, surrounded by hr of s subunit from the same protomer, and sd /sd of s subunit from the clockwise neighboring protomer (fig. g-h).
the fp fragment is well ordered, forming two small helices (y -g , l -f ) and connecting loops (fig. g-h). this observation further substantiates the notion that our s-closed structure with inactivated fp most likely represents the ground prefusion state. further interaction analysis revealed that sd and hr can form hydrogen bonds/salt bridges with the fp fragment, and sd plays a key role in this interaction, being involved in predicted hydrogen bonds/salt bridges (table s ). notably, among the sd -fp interactions, d from sd contributes to the formation of hydrogen bonds/salt bridges, mainly through its sidechain atoms, with k , y and k of fp, suggesting d may be essential in the interaction with and stabilization of fp (fig. i and table s ). this could be related to the recent reports suggesting that the d g mutation of sars-cov- s enhanced viral infectivity (more in discussion) (korber et al., ). interestingly, it appears that before being activated, fp could serve as a linkage that wraps around the neighboring protomers in their s /s interface and simultaneously connects s with s , thereby coordinately locking the s trimer in the tightly closed ground prefusion state ( fig. g-h). moreover, in this dataset the dominant population of the particles (~ %) is in the tightly closed state; although we performed multiple rounds of d classification, we eventually found only a minor population ( %) of the particles in the open state (fig. s ). our observations indicate that the open state sars-cov- s might be intrinsically dynamic and only exist transiently to expose the rbd domain. interestingly, the dominant population of the sars-cov- s trimer is in the ground prefusion state with inactivated fp and all the rbd domains buried, which may result in "conformational masking" preventing antibody binding and neutralization, similar to that described for hiv- envelope (env) (kwong et al., ; munro et al., ).
the population distribution of closed and open state of sars-cov- s varies among different studies (walls et al., ; wrapp et al., ) , which is reminiscent of observations made with sars-cov s and mers-cov s trimers. this observed variation could be potentially due to subtle difference in chemical condition used by different research groups (gui et al., ; kirchdoerfer et al., ; pallesen et al., ; song et al., ; walls et al., ; yuan et al., ) . to gain a thorough picture on how the receptor ace binding induces conformational dynamics of the sars-cov- s trimer and triggers transition towards the postfusion state, we determine the cryo-em structure of sars-cov- s trimer in complex with human ace pd domain to . Å resolution (termed sars-cov- s-ace , figs. a, s a-e, and s ). further focused-refinements improved the resolution of the s trimer portion of the map to . Å, and the connectivity in the ace -rbd portion of the map, respectively (fig. s e, s ) . we then built a pseudo atomic model of the complex with combined map information (fig. b ). to the best of our knowledge, the structure of sars-cov- s-ace complex has not been reported before. in this dataset we additionally captured an unliganded s trimer in the open state with one rbd up (resolved to . Å resolution, termed s-open), but did not detect the closed state . we should mention that our bio-layer interferometry (bli) assay revealed a relatively rapid disassociation kinetics between ace and the s trimer (koff = . x - s - , fig. s e ). we thus determined the complex structure in the presence of trace amount of cross linker glutaraldehyde (methods). additionally, we also determined the s-ace complex structure without cross linker at . Å resolution, and the two maps are in comparable conformation, suggesting that addition of cross linker did not change the conformation of the complex (fig. s g ). we then used the s-ace map at . Å resolution for detailed structural analysis. 
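to give a feel for why a rapid dissociation rate motivates cross-linking, the sketch below relates a dissociation rate constant to the half-life of a complex. it assumes simple first-order (1:1) dissociation with no rebinding, which is an idealization; the function names and the koff value are hypothetical and are not the measured value from the bli assay.

```python
import math


def fraction_bound(koff_per_s, t_s):
    """Fraction of complex still bound after t_s seconds (first-order decay, no rebinding)."""
    return math.exp(-koff_per_s * t_s)


def half_life_s(koff_per_s):
    """Time in seconds for half of the complex to dissociate: ln(2) / koff."""
    return math.log(2) / koff_per_s


# hypothetical koff, for illustration only (not the value reported in this study)
koff = 1e-2  # per second
print(f"half-life: {half_life_s(koff):.1f} s")
print(f"fraction bound after 5 min: {fraction_bound(koff, 300):.3f}")
```

under this idealized model, a larger koff shortens the time window in which intact complex is available for grid preparation.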
to inspect the conformational changes from the closed state to the unliganded open state, we first overlaid our s-open and s-closed structures. in the s-open structure, the only up rbd domain from protomer (termed rbd- ) shows a . º upwards/outwards rotation, resulting in an exposed rbm region accessible for ace binding (fig. c ). this rbd- rotation can be propagated to the underlying sd , inducing a downwards movement of sd (fig. c). we also noticed a considerable clockwise rotation of . º, . º, and . º in ntd for protomer , , and , respectively, and anti-clockwise rotations in the ch of the corresponding s subunit, greatly untwisting the s trimer from the tightly closed state (fig. d ). associated with this s untwisting, there is a downwards/outwards movement of ntds on the scale of ~ Å (fig. d, right panel). these combined untwisting motions could weaken the original interactions between protomers, facilitating the transient raising of the rbd. moreover, our local resolution analysis on the s-open map also suggested that other than rbd- , the consecutive rbd- also exhibits considerable dynamics (fig. s d ). our sars-cov- s-ace structure revealed that the s trimer binds with one ace through the only up rbd domain, while the other two rbds remain in the down conformation ( fig. a-b), suggesting ace binding to sars-cov- strictly requires the up conformation of rbd. unlike the observations made with sars-cov and mers-cov s trimers, we did not detect s trimer with two rbd domains up with bound ace (kirchdoerfer et al., ; pallesen et al., ). though our s-ace and s-open structures generally resemble each other especially in the s region, there are noticeable differences in the s region. specifically, after ace binding, the up rbd- from the s-open state can be pushed to tilt downwards slightly, with the angle to the horizontal plane of the s trimer reduced from . º to . º in the ace bound state (fig. e ).
this ace binding induced motion of rbd- could be propagated to the neighboring rbd- and the consecutive rbd- (rmsd: . Å, fig. f ), collectively disturbing the allosteric network of the fusion machinery. indeed, the neighboring protomer interaction interface was reduced from the original ~ . Å in the s-closed state to . ~ . Å in the ace bound state (fig. g ). altogether, these s subunits untwisting and rbd- tilting motions could destabilize the prefusion state of s trimer, prepared for the subsequent conformational transitions towards the postfusion state. interestingly, our s-ace structure showed that the core region of the up rbd- and the rbm t -f loop of the neighboring rbd- could form aromatic interactions with the involvement of y /f from rbd- and f /y from rbd- ( fig. h ), potentially enhancing interactions between neighboring s subunits, thus beneficial for subsequent simultaneous release of s subunits. this interaction was not detected in the counterpart of the homologous sars-cov s-ace structure, likely due to longer distance between the adjacent "up" and "down" rbds in that structure (kirchdoerfer et al., ; song et al., ) . noteworthy, the originally stably packed fp from protomer surrounded by sd /sd of the neighboring protomer in the s-closed structure is now mostly missing in the s-ace structure, which is also the case in the s-open structure. this is mostly caused by the s trimer untwisting-motion induced downwards shift of sd (fig. c , i). indeed, the b /b strands within sd shift downwards for up to . Å; consequently, the c and t from b and the connecting loop could clash with the y and l of the originally packed a helix of fp ( fig. i ), potentially resulting in destabilization and activation of the fp motif from protomer . 
since the untwisting/downwards-shift motions of s subunits are allosterically coordinated within the s trimer in its opening process, the density corresponding to fps in protomer and are also missing, indicating a coordinated activation mechanism of fp, which may be one of the key elements prepared for the subsequent fusion of s trimer. according to our sars-cov- s-ace cryo-em structure, the overall ace -rbd interaction interface is comparable to that of the crystal structures of the rbd domain of sars-cov- s interacting with the ace pd domain (fig. a ) (lan et al., ; wang et al., ) , i.e. our structure revealed residues of rbd are in contact with residues of ace with a distance cut-off of Å (table s ). sequence alignment demonstrated that the rbm t -f loop is the most diversified region between sars-cov- and sars-cov s proteins (fig. s ). in line with this, structural comparison revealed that the conformation of the rbm t -f loop in our sars-cov- s-ace structure is very distinct from that in the sars-cov rbd-ace crystal structure ( fig. b ) (li et al., ) . noteworthy, the rbm t -f loop can originally be resolved in our s-closed structure, but is mostly missing in our s-open structure, indicating the t -f loop may be activated in the open state. in our s-ace structure, a portion of this loop forms contact with the n-terminal helix of ace (fig. a ), for instance, a within this loop could interact with s /t of ace (table s ), suggesting that the rbm t -f loop may play an important role in receptor recognition. moreover, the s-ace structure indicated that the q -y region located in the other edge of rbm could also form close contact with ace , i.e. y could form hydrogen bonds/contacts with to further define the subdomains/residues critical for rbd binding to ace , we designed and produced three sars-cov- rbd mutant proteins, each of which had a single subdomain substituted with the counterpart of sars-cov. 
these rbd mutants were termed rbd-(core), rbd-(rbm-r ) and rbd-(rbm-r ), which harbored r to n of the core region, l to k , and t to t of the rbm from sars-cov, respectively (figs. c and s ). results from ace -binding enzyme linked immunosorbent assay (elisa) showed that the binding activity of the three rbd mutants towards anti-rbd polyclonal antisera and the crossreactive monoclonal antibody a was comparable to that of the wildtype sars-cov- rbd protein ( fig. c) , indicating that the mutations did not significantly affect the overall conformation of the rbds. the mutants rbd-(core) and rbd-(rbm-r ) bound ace as efficiently as the wildtype rbd; in contrast, rbd-(rbm-r ) completely lost ace -binding ( fig. c ). these results pinpoint the rbm-r region (residues -teiyqagst- ) as the critical viral determinant for specific recognition of sars-cov- rbd by the ace receptor. additionally, we constructed three single-point mutants of sars-cov- rbd protein, rbd (q a), rbd (v a), and rbd (y a). our elisa ace -binding assay showed that the mutation y a was sufficient to completely abolish the binding of ace , while the other two mutations did not show such effect (fig. d) , demonstrating that the residue y of sars-cov- rbd is a key amino acid required for ace receptor binding. our sars-cov- s-ace map showed well defined density for the s trimer region, but relatively lower local resolution in the associated ace -rbd region (fig. s c) , suggesting considerable conformational heterogeneity of ace -rbd as well as relative dynamics between ace -rbd and the remaining part of s trimer with respect to each other. this is in line with the report showing that in sars-cov s trimer, the associated ace -rbd is relatively dynamic, showing three major conformational states with the angle of ace -rbd to the surface of s trimer at ~ º, º, and º, respectively (song et al., ) . 
to better delineate the conformational space of the ace engaged sars-cov- s trimer, we performed multi-body refinement in relion . (fernandez-leiro and scheres, ) . principal component analysis of the movement revealed that approximately % of the movement of the complex is described by the first three eigenvectors representing swing motions in distinct directions relative to the s trimer (fig. a) . eigenvector describes a swing motion of ace -rbd towards rbd- direction with the angular range of . º, eigenvector corresponds to the swing motion of ace -rbd towards the original location of rbd- with the angular range of . º, and eigenvector describes the swing motion of ace -rbd along the ntd- to ntd- direction with the angular range of . º (fig. b) . histograms of the amplitudes along the three eigenvectors are unimodal, indicative of continuous motions (fig. c ). as the dynamic motions in the complex are formed by linear combination of all eigenvectors, these data suggested that ace -rbd processes on top of the s trimer in a noncorrelated manner. moreover, multi-body analysis on the non-cross-linked sars-cov- s-ace data showed similar swing motions (fig. s h) , indicating the presence of cross linker did not disturb the mode of ace -rbd motions within the s trimer. additionally, compared with the homologous sars-cov s-ace complex, which shows discrete movements of ace -rbd in one direction (similar to our eigenvector direction) (song et al., ) , ace binding to sars-cov- s induces more complex combined continuous swing motions of ace -rbd within the complex. putting together, our observations suggest that ace receptor binding to sars-cov- s triggers considerable conformational dynamics in s subunits that could destabilize the prefusion s trimer. indeed, the b-factor distribution of our s-ace complex demonstrated that ace binding induces strikingly enhanced dynamics in the s region including rbd and ntd domains (fig. 
d) , facilitating the release of the associated ace -s component and transitions of the s subunit towards a stable postfusion conformation. indeed, we found a notable drop in the interaction surface between s and s subunits from the s-closed state ( . Å ) to the s-ace state ( . Å ). it has been suggested that the large number of n-linked glycans covering the surface of the spike protein of sars-cov and mers-cov could pose challenge to antigen recognition, thus may help the virus evade immune surveillance yuan et al., ) . similar to sars-cov s, sars-cov- s also comprises n-linked glycosylation, with glycans in the s subunit and the other in the s subunit ( fig. a ) (walls et al., ; watanabe et al., ) . in our s-closed structure, we resolved the density for n-linked glycans per protomer (fig. a-b and s i ), including two undetected glycans at site n and n located in the ntd (fig. b) , while the three glycans located in the flexible c-terminal region are missing as in other studies (walls et al., ; wrapp et al., ) . similar to mers-cov and sars-cov s trimers (walls et al., b; walls et al., ; yang et al., ) , sars-cov- s trimer also forms a glycan hole at proximity of the s /s cleavage site and the fusion peptide (near the s ' cleavage site, fig. b) . although there is an extra glycan at n site near the s /s cleavage site in sars-cov- s, the hole region is still more sparsely glycosylated than the rest of the protomer. this glycan hole might be important for permitting the access of activating host proteases and for allowing membrane fusion to take place without obstruction (walls et al., b; walls et al., ; yang et al., ) . moreover, after ace binding, our s-ace structure revealed that the density corresponding to glycan at n site is weaker in protomer , while the other resolved glycans in the s-closed state can also be visualized in the s-ace structure (fig. c ). the outbreak of covid- caused by sars-cov- virus has become pandemic. 
several structures of sars-cov- spike rbd domain bound to ace have been reported (lan et al., ; wang et al., ; yan et al., ) . however, the complete architecture of sars-cov- trimeric s in complex with ace remains unavailable, leading to an incomplete understanding of the nature of this interaction and of the resulted conformational transitions of the s trimer towards postfusion and virus entry. in the present study, we determined an uncharacterized tightly closed state of sars-cov- s trimer revealing the stably packed fusion peptide, most likely representing a previously undetected ground prefusion state of s trimer. the tightly closed s trimer with originally dominant population may indicate a conformational masking mechanism of immune evasion for sars-cov- spike. importantly, we captured the complete architecture of sars-cov- s trimer in complex with ace . we found the presence of ace could dramatically shift the conformational landscape of the s trimer, and after engagement the continuous swing motions of ace -rbd in the context of the s trimer could generate considerable conformational dynamics in s subunits resulting in a significant decrease in s /s interface area. furthermore, our structural data combined with biochemical analysis revealed that the rbm t -t loop and residue y play vital roles in the binding of sars-cov- rbd to ace receptor. our findings depict a new role of fp in stabilizing s trimer and the mechanism of fp activation, expand the detected conformational space of the s trimer, and provide structural basis on the sars-cov- spike d g mutation induced enhanced infectivity. based on the data, we put forward a mechanism of ace binding-induced conformational transitions of sars-cov- s trimer from the tightly closed ground prefusion state transforming towards the postfusion state (fig. ) . 
in the receptor-free sars-cov- s, the majority of the s trimers are in the tightly closed ground prefusion state with inactivated fp, and only a minor population of the particles is in the transient open state with one rbd up representing the fusion-prone state, forming a dynamic balance between the two states (step ). however, the presence of ace and subsequent trapping of the rbd (discussed later) could overcome the energy barrier, break the balance and shift the conformational landscape towards the open state with an untwisting/downwards-shift motion of the s subunits, leading to unpacked/activated fps, weakened interactions among the protomers, and an up rbd. in step , once the receptor ace grasps the up rbd, the rbd is trapped in the up conformation, and the associated ace -rbd shows combined continuous swing motions on the topmost surface of the s trimer. these motions and dynamics could disturb the allosteric network and release the constraints imposed on the fusion machinery, favoring the release of the ace -s component and thereby allowing the s trimers to refold and fuse the viral and host membranes (step ). the dominantly populated conformation ( %) of the unliganded sars-cov- s trimer is the tightly closed state (more compact than that of the sars-cov s trimer) with all the rbd domains buried, resulting in conformational masking that prevents antibody binding and neutralization at sites of receptor binding. the conformational masking mechanism of neutralization escape suggested here for sars-cov- could affect all antibodies that bind to the receptor binding site, similar to that described for hiv- env (kwong et al., ; munro et al., ). in contrast, for the mers-cov or sars-cov s trimer, the closed state is less populated ( . % and . %, respectively), indicating that the conformational masking mechanism may be less effective for these two viruses (gui et al., ; pallesen et al., ).
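The "dynamic balance" between the closed and open states described above can be illustrated with a simple two-state equilibrium. The following is a minimal sketch, assuming a Boltzmann two-state model that the study itself does not state; the function names, the temperature default, and the example populations are hypothetical placeholders rather than values from this work. Note that relative populations give only the free-energy difference between the two states, not the height of the energy barrier separating them.

```python
import math

R_KCAL_PER_MOL_K = 1.987204e-3  # gas constant in kcal/(mol*K)


def free_energy_difference(p_closed, p_open, temperature_k=298.15):
    """Return dG(open - closed) in kcal/mol for a two-state equilibrium."""
    if p_closed <= 0 or p_open <= 0:
        raise ValueError("both populations must be positive")
    k_eq = p_open / p_closed  # equilibrium constant for closed -> open
    return -R_KCAL_PER_MOL_K * temperature_k * math.log(k_eq)


def populations_from_dg(dg_kcal, temperature_k=298.15):
    """Invert the relation: return (p_closed, p_open) for a given dG."""
    k_eq = math.exp(-dg_kcal / (R_KCAL_PER_MOL_K * temperature_k))
    p_open = k_eq / (1.0 + k_eq)
    return 1.0 - p_open, p_open


if __name__ == "__main__":
    # Hypothetical populations, NOT values taken from this study.
    dg = free_energy_difference(p_closed=0.9, p_open=0.1)
    print(f"dG(open - closed) = {dg:.2f} kcal/mol")
```

In this picture, receptor binding that traps the up RBD corresponds to a lower effective free energy of the open state, which shifts the populations without requiring the receptor to trigger the opening event itself.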
interestingly, our findings also suggest that unliganded s trimer proteins of sars-cov- are inherently competent to transiently display a conformation with one rbd up, ready for binding to the receptor ace ; ace facilitates the capture of pre-existing s trimer open conformations that are spontaneously sampled in the unliganded spike, rather than triggering a trimer opening event. therefore, the spontaneously sampled s trimer conformations may serve a functional role in infectivity. intriguingly, our data also suggest that the sars-cov- s trimer is very sensitive to (gui et al., ; song et al., ) . this demonstrates that the sars-cov- s trimer is much more sensitive to the ace receptor than sars-cov s in terms of receptor-triggered transformation from the closed prefusion state to the fusion-prone open state, which might have contributed to the observed superior infectivity of sars-cov- as compared to that of sars-cov. notably, the sars-cov- spike d g mutation has raised urgent concern; the mutated genotype g began spreading in early february, and it had reached a frequency of ~ % in early june according to the gisaid public repository (daniloski et al., ; korber et al., ). moreover, it has been reported that the d g mutation promotes the infectivity of sars-cov- and enhances viral transmissibility in multiple human cell types (daniloski et al., ; hu et al., ; zhang et al., ). however, the structural basis of the d g enhanced infectivity is not yet fully understood. here, our s-closed structure in the ground prefusion state showed that d is heavily involved in the interaction with fp through its side chain atoms (fig. i, table s ). this interaction could contribute greatly to the linkage between neighboring protomers as well as between s and s subunits.
however, the mutation of d to g, which lacks a side chain, could eliminate most of the hydrogen bonds/salt bridges that d originally forms with fp, and hence greatly reduce its interaction with fp, potentially leading to a coordinated unpacking/activation of fps. therefore, the d g mutation could ( ) reduce the constraints between neighboring protomers as well as s /s interactions within the s trimer, and ( ) lower the energy barrier for the conformational transformation from the closed prefusion state to the fusion-prone open state, making the sars-cov- s trimer even more sensitive to ace binding. collectively, these factors may contribute to the enhanced infectivity and viral transmissibility of the g strain. in summary, our data revealed the unliganded sars-cov- s trimer to be intrinsically transforming between two distinct pre-fusion conformations, whose relative occupancies could be dramatically remodeled by the receptor ace . these findings support a dynamics-based mechanism of immune evasion and ligand recognition (munro et al., ). thus, our study delineates the properties of the sars-cov- spike glycoproteins that simultaneously allow the retention of function and the evasion of the humoral immune response. we also showed that the substantial conformational dynamics of s subunits induced by ace binding could trigger the transition of the spike protein towards the postfusion state in preparation for viral entry and infection. collectively, our findings suggest that stabilization of the tightly closed ground prefusion state of the s trimer with inactivated fps might be a general and effective means of inhibiting sars-cov- entry, and an understanding of the properties of the sars-cov- s trimer that permit neutralization resistance will guide attempts to create vaccines as well as therapeutics that target receptor binding.
we are grateful to the staffs of the ncpss electron microscopy facility, database and cryo-em maps have been deposited in the electron microscopy data bank, https://www.ebi.ac.uk/pdbe/emdb/ (accession nos. ***), and the associated models have been deposited in the protein data bank, www.rcsb.org (accession nos. **, **, and **). the authors declare that they have no conflict of interest. to express sars-cov- s glycoprotein ectodomain, the mammalian codon-optimized gene coding sars-cov- (wuhan-hu- strain, genbank id: mn . ) s glycoprotein ectodomain (residues m -q ) with proline substitutions at k and v , a "gsas" substitution at the furin cleavage site (r -r ) was cloned into vector pcdna . +. a cterminal t fibritin trimerization motif, a tev protease cleavage site, a flag tag and a his tag were cloned downstream of the sars-cov- s glycoprotein ectodomain (fig. s a) before bli experiments, sars-cov- s trimer protein was biotinylated using the ez-link™ sulfo-nhs-lc-lc-biotin kit (thermo fisher) and then purified using zeba™ spin desalting column (thermo fisher), according to manufacturer's protocols. to determine binding affinity of ace , bli assay was carried out using an octet red instrument (pall fortébio, usa). briefly, biotinylated sars-cov- s trimer protein was loaded onto streptavidin (sa) biosensors (pall fortébio). s-trimer-bound biosensors were dipped into wells containing varying concentrations of ace protein and the interactions were monitored over a -sec association period. finally, the sensors were switched to dissociation buffer ( . m pbs supplemented with . % tween and . % bovine serum albumin) for a -sec dissociation phase. data was analyzed using octet data analysis software version . (pall fortébio). the purified sars-cov- s glycoprotein ectodomain and human ace pd domain were mixed at a molar ratio of : and were incubated on ice for hours. 
the mixture was purified by gel filtration chromatography using a superose increase / gl column (ge healthcare) pre-equilibrated with mm tris-hcl ph . , mm nacl, % glycerol. for the cross-linked complex, the buffer of the purified sars-cov- s glycoprotein ectodomain and human ace pd domain was exchanged to mm hepes ph . , mm nacl; then sars-cov- s and human ace were mixed at a molar ratio of : . after incubation on ice for hours, the complex was cross-linked by . % glutaraldehyde, which is commonly used in cryo-em studies of fragile macromolecular complexes (kastner et al., ; patel et al., ). the glutaraldehyde was neutralized by adding mm tris-hcl ph . after incubation on ice for hour. the mixture was run over a superose increase / gl column (ge healthcare) in mm tris-hcl ph . , mm nacl, % glycerol. the complex peak fractions were concentrated and assessed by sds-page and negative-staining electron microscopy. for the negative-staining (ns) sample, a volume of µl of the sars-cov- s-ace sample was placed on a plasma-cleaned copper grid for one minute. excess sample on the grid was blotted off using filter paper, and a volume of µl of . % uf (sigma-aldrich) was added to wash the grid. after blotting, another volume of µl of . % uf was placed on the grid again for one minute to stain. grids were visualized under a tecnai g spirit kv transmission electron microscope (thermo fisher scientific), and micrographs were taken using an eagle camera with a nominal magnification of , ×, yielding a pixel size of . Å. , particles were autopicked in eman (bell et al., ). after d classification, we selected good averages with , particles for initial model building, which was performed in relion . (zivanov et al., ). to prepare the cryo-em sample of the sars-cov- s trimer, a . -µl aliquot of this sample was applied to a plasma-cleaned holey carbon grid (r / , mesh; quantifoil) or graphene oxide-lacey carbon grid ( mesh, emr).
the grid was blotted with vitrobot mark iv (thermo fisher scientific) and then plunged into liquid ethane cooled by liquid nitrogen. to prepare the cryo-em sample of s-ace complex with or without cross linking, we used graphene oxide-lacey carbon grid ( mesh, emr), and adopted the same vitrification procedure as for the s trimer. cryo-em movies of the samples were collected on a titan krios electron microscope (thermo fisher scientific) operated at an accelerating voltage of kv with a nominal magnification of , x (table s ). the movies were recorded on a k summit direct electron detector (gatan) operated in the super-resolution mode (yielding a pixel size of . Å after times binning), under low-dose condition in an automatic manner using serialem (mastronarde, ) . each frame was exposed for . s and the total accumulation time was . s, leading to a total accumulated dose of e -/Å on the specimen. to solve the problem of preferred orientation associated with sars-cov- s trimer, we additionally collected tilt datasets with the stage tilt at ° or °, while the other conditions remained the same. single particle analysis was mainly executed in relion . (fernandez-leiro and scheres, ) . all images were aligned and summed using motioncor software (zheng et al., ) . after ctf parameter determination using ctffind (rohou and grigorieff, ) , particle auto-picking, manual particle checking, and reference-free d classification, particles with s trimer features were maintained for further processing. for receptor-free s trimer sample, , particles were picked from non-tilt micrographs, and , remained after d classification (fig. s ) . these particles went through d auto-refine using available sars-cov- s trimer cryo-em map (emdb: ) lowpass filtered to Å resolution as initial model (walls et al., ) . these particles were refined into a closed state map of s trimer with imposed c symmetry. we then re-extracted the particles using the refinement coordinates to re-center it. 
after ctf refinement and polishing, these particles were refined with c symmetry again. notably, the euler angle distribution of the map suggested that the dataset lacked tilted top views (fig. s c, left panel). indeed, when we refined the dataset without imposing -fold symmetry, the top view of the map appeared distorted, indicating a preferred orientation problem associated with the sample. to overcome the preferred orientation problem, we additionally collected tilt data, and boxed out , particles from º tilt micrographs and , particles from º tilt micrographs. after d classification, , particles remained. we then used goctf software to determine the defocus for each of the tilt particles, and these particles were re-extracted with corrected defocus (su, ). after combining the tilt with non-tilt particles, we refined the dataset without imposing symmetry, then performed two rounds of d and d classifications to further clean up the dataset, and obtained a dataset of , particles, of which , particles were from the tilt data. we then carried out heterogeneous refinement in cryosparc (punjani et al., ), and obtained a closed state map from , particles and an open state reconstruction with , particles (fig. s ). after ctf refinement and bayesian polishing, the closed state map was refined to . Å resolution with c symmetry, while the open state map was at . Å resolution and could hardly be improved further, indicating an intrinsic dynamic nature of the open state. the overall resolution was determined based on the gold-standard criterion using an fsc of . (scheres and chen, ). for the sars-cov- s-ace cross-linked dataset, , particles were picked from the original micrographs, and , particles remained after d classification (fig. s ). these particles were refined with an initial model built from our negative staining data. we then re-extracted the particles to re-center them.
these particles went through a d- d classification step resulting in a further cleaned-up dataset of , particles. we refined these particles into a map of the ace -bound s trimer complex. we then used this map as the initial model to refine the originally picked , particles for one round to re-extract and re-center the particles. after d classification, , particles remained. after rounds of the d- d cleaning step, , particles were left for further structure determination. after heterogeneous refinement in cryosparc, class resembled an ace -free open state of the s trimer, and classes - adopted the s-ace engaged conformation. for class , after further d classification, we refined the , cleaned-up particles into an s-open map at . Å resolution using non-uniform refinement in cryosparc. among the other four classes with bound ace , we sorted out good particles for classes - by d classification and combined them with class , which exhibited good structural details, resulting in a dataset of , particles. after refinement, bayesian polishing, and ctf refinement, we reconstructed a . Å resolution sars-cov- s-ace map. the s trimer portion without the up rbd was rather stable and could be locally refined to . Å using local refinement in cryosparc with the non-uniform refinement option chosen. the ace associated with the up rbd was subtracted and refined in relion to obtain a . Å map with better connectivity. multi-body refinement in relion . was applied to analyze the motion of the complex. for the sars-cov- s-ace dataset without cross-linking, we followed a similar classification and clean-up strategy and obtained , particles. through heterogeneous refinement and d classification in cryosparc, we reconstructed a . Å resolution sars-cov- s-ace map from , particles using non-uniform refinement, and an unliganded open state map of . Å resolution from , particles, with populations of . % and . %, respectively. multi-body refinement was also applied to analyze the mobility of the complex.
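The resolutions quoted in the processing steps above are read off a Fourier shell correlation (FSC) curve computed between two independently refined half-maps. The sketch below shows only that final read-out step under stated assumptions: the function name is hypothetical, the default threshold of 0.143 is the value conventionally used with the gold-standard procedure (the threshold printed in this text is elided), and the example curve is invented for illustration. It is not the implementation used by relion or cryosparc.

```python
def resolution_at_threshold(spatial_freqs, fsc_values, threshold=0.143):
    """Return the resolution (in Å) at which the FSC first drops below threshold.

    spatial_freqs: shell frequencies in 1/Å, in increasing order.
    fsc_values:    FSC of each shell, same length as spatial_freqs.
    Returns None if the curve never crosses the threshold.
    """
    if len(spatial_freqs) != len(fsc_values):
        raise ValueError("inputs must have the same length")
    for i in range(1, len(fsc_values)):
        prev_fsc, cur_fsc = fsc_values[i - 1], fsc_values[i]
        if prev_fsc >= threshold > cur_fsc:
            # Linear interpolation between the two shells around the crossing.
            frac = (prev_fsc - threshold) / (prev_fsc - cur_fsc)
            freq = spatial_freqs[i - 1] + frac * (spatial_freqs[i] - spatial_freqs[i - 1])
            return 1.0 / freq if freq > 0 else None
    return None


if __name__ == "__main__":
    # Invented FSC curve for illustration only.
    freqs = [0.05, 0.10, 0.15, 0.20, 0.25, 0.30, 0.35]
    fsc = [1.00, 0.98, 0.90, 0.70, 0.40, 0.20, 0.05]
    print(resolution_at_threshold(freqs, fsc))
```

A curve that stays above the threshold out to the highest sampled shell returns None here, which in practice signals that the reported resolution is limited by the sampling rather than by the data.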
to build the pseudo atomic model for our sars-cov- s-closed structure, we used the available atomic model of sars-cov- s (pdb: vxx) as initial model (walls et al., ) . we first refined the model against our map using phenix.real_space_refine module in phenix (adams et al., ) . for the missing loop regions in s subunit, we either built the homology model based on sars-cov s structure (pdb: crw) (kirchdoerfer et al., ) through swiss-model webserver (waterhouse et al., ) , or built the loop manually according to the density in coot (emsley and cowtan, ) . for the fp region, we first built the homology model by modeller tool within chimera by using mers-cov s structure (pdb: nb ) as template (pettersen et al., ; sali, ; walls et al., ) , then used rosetta to refine this region against the density map (dimaio et al., ) . eventually, we used phenix.real_space_refine again for the protomer and s-trimer model refinement against the map. for the sars-cov- s-ace structure, we used the sars-cov- rbd-ace crystal structure (pdb: m j) as initial model for the ace and the associated up rbd portion, and our s-closed model as initial model for the remaining portion. these models were firstly refined against the corresponding focused map using rosetta and phenix (dimaio et al., ) , then combined together in coot. we then refined the combined model against our . Å resolution sars-cov- s-ace map using rosetta and phenix. for the s-open structure, we used the model of sars-cov- s-ace as initial model with ace removed, and refined against the map using rosetta. we used phenix.molprobility to evaluate the models, and calculated b-factors by atom displacement refinement function in phenix.real_space_refine. we used ucsf chimera and chimerax for figure generation (goddard et al., ; pettersen et al., ) , and also for rotation, translation, rmsd, and vdw contact measurement. interaction surface analysis was conducted by pisa server (krissinel and henrick, ) . 
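The contact measurements mentioned above amount to listing residue pairs across an interface that have at least one pair of atoms within a distance cutoff. The following is a minimal sketch of that idea, assuming a plain all-against-all distance search; the `Atom` record, the function name, the residue labels, and the 4.0 Å cutoff in the example are hypothetical, and this is not the algorithm implemented in chimera or the pisa server.

```python
import math
from typing import NamedTuple


class Atom(NamedTuple):
    chain: str    # chain identifier, e.g. "A" for one binding partner
    residue: str  # residue label (hypothetical naming scheme)
    x: float
    y: float
    z: float


def interface_contacts(atoms, chain_a, chain_b, cutoff):
    """Return sorted residue pairs (chain_a, chain_b) with any atoms within cutoff (Å)."""
    group_a = [a for a in atoms if a.chain == chain_a]
    group_b = [b for b in atoms if b.chain == chain_b]
    contacts = set()
    for a in group_a:
        for b in group_b:
            if math.dist((a.x, a.y, a.z), (b.x, b.y, b.z)) <= cutoff:
                contacts.add((a.residue, b.residue))
    return sorted(contacts)


if __name__ == "__main__":
    # Made-up coordinates for illustration only.
    atoms = [
        Atom("A", "RES1", 0.0, 0.0, 0.0),
        Atom("B", "RES2", 3.0, 0.0, 0.0),
        Atom("B", "RES3", 10.0, 0.0, 0.0),
    ]
    print(interface_contacts(atoms, "A", "B", cutoff=4.0))
```

The quadratic search is adequate for a single interface; for whole-complex surveys a spatial grid or k-d tree would be the usual replacement.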
to uncover the amino acids important for ace receptor recognition, the ace ectodomain (residues q to s ) gene, with an n-terminal il signal peptide and tagged with human igg fc and a his tag at the c-terminus, was cloned into the pcdna . vector. the codon-optimized rbd (residues v to g ) gene fragment, with an n-terminal il signal peptide and tagged with a his tag at the c-terminus, was cloned into the pcdna . vector. three sars-cov- rbd mutants were constructed. for mutant rbd (core), amino acids r to n of the core region in the sars-cov- rbd were substituted by the corresponding region of sars-cov strain tor (genbank id: aap . ). for mutants rbd (rbm-r ) and rbd (rbm-r ), residues l to k and residues t to t of the rbm region in the sars-cov- rbd were mutated into the corresponding regions of sars-cov strain tor , respectively. for the single point mutations rbd (q a), rbd (v a), and rbd (y a), rbd residues q , v , and y were substituted by ala, respectively. all mutant plasmids were constructed using the mutexpress™ ii fast mutagenesis kit v (vazyme, china) according to the manufacturer's instructions. the proteins were generated using the hek f expression system and purified as described above. anti-rbd polyclonal antibody and monoclonal antibody (mab) a were prepared by immunizing balb/c mice with recombinant sars-cov- rbd fused with a c-terminal mouse igg fc tag (sino biological inc, beijing, china) using previously described protocols (qu et al., ). the purified rbd mutants were tested by elisa for reactivity with the receptor ace . briefly, elisa plates were coated with ng/well of the purified rbd mutants in pbs at °c for hours and then blocked with % milk in pbs-tween (pbst). next, the plates were incubated with ng/well of ace -hfc fusion protein, µl/well of culture supernatant of hybridoma a , or µl/well of mouse anti-rbd sera (diluted at / ) at °c for h.
after washing, the corresponding secondary antibodies, horseradish peroxidase (hrp)conjugated anti-human igg (abcam, usa) or hrp-conjugated anti-mouse igg (sigma, usa), were added and incubated at °c for h. after washing color development, absorbance at nm was determined. cryo-em data processing procedure for sars-cov- s trimer in the presence of ace . amino acid sequence alignment of sars-cov- s to sars-cov s. the secondary structure elements were defined based on an espript (robert and gouet, ) algorithm and are labeled based on our sars-cov- s-closed structure. the rbd domain is labeled in green frames, and the subdomains of rbm are also labeled. contacting residues at the sars-cov- rbd-ace interface (distance cutoff of Å) y h l k f t , d , k y t a s , t g s , q f l , m , y n q , y y t , f , k f k q k , h s h y h g k q y t y , d , r n k g k , g y k , g , a , r phenix: a comprehensive python-based system for macromolecular structure solution high resolution single particle refinement in eman the coronavirus spike protein is a class i virus fusion protein: structural and functional characterization of the fusion core complex the d g mutation in sars-cov- spike increases transduction of multiple human cell types atomic-accuracy models from . 
-a cryo-electron microscopy data with density-guided iterative local refinement coot: model-building tools for molecular graphics fusion peptides and the mechanism of viral fusion a pipeline approach to single-particle processing in relion ucsf chimerax: meeting modern challenges in visualization and analysis cryo-electron microscopy structures of the sars-cov spike glycoprotein reveal a prerequisite conformational state for receptor binding sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor the d g mutation of sars-cov- spike protein enhances viral infectivity and decreases neutralization sensitivity to individual convalescent sera grafix: sample preparation for single-particle electron cryomicroscopy stabilized coronavirus spikes are resistant to conformational changes induced by receptor recognition or proteolysis spike mutation pipeline reveals the emergence of a more transmissible form of sars-cov- inference of macromolecular assemblies from crystalline state hiv- evades antibody-mediated neutralization through conformational masking of receptor-binding sites structure of the sars-cov- spike receptor-binding domain bound to the ace receptor structure, function, and evolution of coronavirus spike proteins structure of sars coronavirus spike receptor-binding domain complexed with receptor automated electron microscope tomography using robust prediction of specimen movements engineering trimeric fibrous proteins based on bacteriophage t adhesins conformational dynamics of single hiv- envelope trimers on the surface of native virions immunogenicity and structures of a rationally designed prefusion mers-cov spike antigen structure of human tfiid and mechanism of tbp loading onto promoter dna ucsf chimera--a visualization system for exploratory research and analysis cryosparc: algorithms for rapid unsupervised cryo-em structure determination a new class of broadly neutralizing antibodies that target the glycan 
loop of zika virus envelope protein sars-cov- , sars-cov, and mers-cov: a comparative overview deciphering key features in protein structures with the new endscript server ctffind : fast and accurate defocus estimation from electron micrographs comparative protein modeling by satisfaction of spatial restraints prevention of overfitting in cryo-em structure determination cryo-em structure of the sars coronavirus spike glycoprotein in complex with its host cell receptor ace goctf: geometrically optimized ctf determination for single-particle cryo-em addressing preferred specimen orientation in single-particle cryo-em through tilting coronavirus membrane fusion mechanism offers a potential target for antiviral development structural basis for human coronavirus attachment to sialic acid receptors crucial steps in the structure determination of a coronavirus spike glycoprotein using cryo-electron microscopy function, and antigenicity of the sars-cov- spike glycoprotein cryo-electron microscopy structure of a coronavirus spike glycoprotein trimer tectonic conformational changes of a coronavirus spike glycoprotein promote membrane fusion unexpected receptor functional mimicry elucidates activation of coronavirus fusion structural and functional basis of sars-cov- entry by using human ace site-specific analysis of the sars-cov- glycan shield swiss-model: homology modelling of protein structures and complexes cryo-em structure of the -ncov spike in the prefusion conformation structural basis for the recognition of sars-cov- by full-length human ace two mutations were critical for bat-to-human transmission of middle east respiratory syndrome coronavirus cryo-em structures of mers-cov and sars-cov spike glycoproteins reveal the dynamic receptor binding domains the d g mutation in the sars-cov- spike protein reduces s shedding and increases infectivity motioncor : anisotropic correction of beam-induced motion for improved cryo-electron microscopy a pneumonia outbreak 
associated with a new coronavirus of probable bat origin new tools for automated high-resolution cryo-em structure determination in relion- key: cord- -c arb s authors: jiang, shibo; he, yuxian; liu, shuwen title: sars vaccine development date: - - journal: emerg infect dis doi: . /eid . sha: doc_id: cord_uid: c arb s developing effective and safe vaccines is urgently needed to prevent infection by severe acute respiratory syndrome (sars)–associated coronavirus (sars-cov). the inactivated sars-cov vaccine may be the first one available for clinical use because it is easy to generate; however, safety is the main concern. the spike (s) protein of sars-cov is the major inducer of neutralizing antibodies, and the receptor-binding domain (rbd) in the s subunit of s protein contains multiple conformational neutralizing epitopes. this suggests that recombinant proteins containing rbd and vectors encoding the rbd sequence can be used to develop safe and effective sars vaccines. safe vaccines is urgently needed to prevent a new sars epidemic and for biodefense preparedness. currently, major classes of sars vaccines are under development: ) inactivated sars-cov (figure ), ) full-length s protein (figure a) , and ) those based on fragments containing neutralizing epitopes ( figure b ). sars-cov expresses several structural proteins, including nucleocapsid, membrane, envelope, and spike (s) proteins ( ) . all may serve as antigens to induce neutralizing antibodies and protective responses. in general, prior to identification of the protein that contains the major neutralizing epitopes, the inactivated virus may be used as the first-generation vaccine because it is easy to generate whole killed virus particles. however, once the neutralizing epitopes are identified, the inactivated virus vaccine should be replaced by vaccines based on fragments containing neutralizing epitopes since they are safer and more effective. 
several reports have shown that sars-cov inactivated with formaldehyde, uv light, and β-propiolactone can induce virus-neutralizing antibodies in immunized animals ( ) ( ) ( ) ( ) , and the first inactivated sars-cov vaccine is being tested in clinical trials in china. however, safety of the inactivated vaccine is a serious concern; production workers are at risk for infection during handling of concentrated live sars-cov, incomplete virus inactivation may cause sars outbreaks among the vaccinated populations, and some viral proteins may induce harmful immune or inflammatory responses, even causing sars-like diseases ( , ) . the s protein of sars-cov, a type i transmembrane glycoprotein, is responsible for virus binding, fusion, and entry and is a major inducer of neutralizing antibodies ( , ) . s protein consists of a signal peptide (sp: amino acids [aa] - ) and domains: an extracellular domain (aa - ), a transmembrane domain (aa - ), and an intracellular domain (aa - ). its extracellular domain consists of subunits, s and s ( ) , although the cleavage site between these subunits has not been clearly defined. the s subunit is responsible for virus binding to the receptor, angiotensin-converting enzyme (ace ) ( , ) . a fragment located in the middle region of the s subunit (aa - ) is the receptor-binding domain (rbd) for ace ( ) ( ) ( ) . sars-cov may also bind to cells through the alternative receptors dc-sign or l-sign ( , ) , but the binding sites for these alternative receptors have not been defined. the s subunit, which contains a putative fusion peptide and heptad repeats (hr and hr ), is responsible for fusion between the viral and target cell membranes. infection by sars-cov is initiated by binding of the rbd in the viral s protein s subunit to ace on target cells.
this forms a fusogenic core between the hr and hr regions in the s domain that brings the viral and target cell membranes into close proximity, which results in virus fusion and entry ( ) ( ) ( ) . this scenario indicates that the s protein may be used as a vaccine to induce antibodies for blocking virus binding and fusion. several recombinant vector-based vaccines expressing sars-cov s protein have been assessed in preclinical studies. yang et al. ( ) reported that a candidate dna vaccine encoding the full-length s protein induced neutralizing antibodies (neutralizing titers ranging from : to : ) and protected mice from sars-cov challenge. using dna vaccines encoding the full-length and segments of s protein to immunize rabbits, wang et al. have produced higher titers of neutralizing antibodies and demonstrated that major and minor neutralizing epitopes are located in the s and s subunits, respectively ( ) . other groups also found neutralizing epitopes in the s subunit ( , ) . bisht et al. ( ) have shown that intranasal or intramuscular inoculations of mice with highly attenuated modified vaccinia virus ankara (mva) vaccines encoding full-length sars-cov s protein also produce neutralizing antibodies with mean neutralizing titers of : . bukreyev et al. ( ) reported that mucosal immunization of african green monkeys with an attenuated parainfluenza virus expressing s protein resulted in production of neutralizing antibodies and protected animals from infection by challenge with sars-cov. these data suggest that the s protein can induce neutralizing antibodies and protective responses in immunized animals. using convalescent-phase sera from sars patients and a set of peptides spanning the entire sequence of the sars-cov s protein, we have identified linear immunodominant sites (ids) in the s protein ( figure a ). 
ids i, ii, iii, and v reacted with > % of the convalescent-phase sera from sars patients, while ids iv was reactive with > % of sars sera, suggesting that ids iv is the major immunodominant epitope on the s protein ( ) . synthetic peptides corresponding to ids could induce high titers of s protein-specific antibodies, but none of these antibodies possesses neutralizing activity. these findings suggest that the ids in s protein may not induce neutralizing antibodies. whether these antibodies enhance infection by heterologous sars-cov strains or mediate harmful immune responses is unclear. the s protein of fipv expressed by recombinant vaccinia can cause antibody-dependent enhancement of disease if vaccinated animals are subsequently infected with wild-type virus ( ) . our previous studies on hiv- showed that antibodies against some immunodominant epitopes in the hiv- envelope glyco- rbd, a fragment (≈ aa residues) in the middle of s subunit of s protein ( figure b ), is responsible for virus binding to the receptor on target cells. we have demonstrated that the antisera from sars patients and from animals immunized with inactivated sars-cov reacted strongly with rbd ( , ) . absorption of antibodies by rbd from these antisera results in the removal of most of the neutralizing antibodies, and rbd-specific antibodies isolated from these antisera have potent neutralizing activity ( , ) . we have also shown that rabbits and mice immunized with rbd produced high titers of neutralizing antibodies against sars-cov with % neutralizing titers at a > : , serum dilution ( ) . the immunized mice were protected from sars-cov challenge (unpub. data). the antibodies purified from the antisera against sars-cov significantly inhibited rbd binding to ace ( , ( ) ( ) ( ) . using spleen cells from mice immunized with rbd, we have generated a panel of monoclonal antibodies (mabs) that recognize different conformational epitopes on rbd and possess potent neutralizing activity ( ) . 
our result is in agreement with the report by van den brink et al. ( ) , who identified human neutralizing anti-s mabs from antibody phage display libraries by using inactivated sars-cov as the target. these researchers also found that all of these mabs specifically bound to rbd and blocked interaction between rbd and ace . these findings suggest that rbd contains the major neutralizing epitopes in the s protein and is an ideal sars vaccine candidate because rbd contains the receptor-binding site, which is critical for virus attachment to the target cell for infection ( , ( ) ( ) ( ) . antibodies specific for rbd are expected to block binding of virus to the target cell. rbd induces higher titers of neutralizing antibodies than those vaccines expressing the full-length s protein ( , , , , , ) . rbd sequences among the late sars-cov strains are highly conserved. when the early and late sars-cov strains are compared, only to aa residues are variable among the residues in rbd and most of the isolates vary by only residue ( ). van den brink et al. ( ) showed that human mab (cr ) specific for rbd of sars-cov strain fm can effectively bind to most rbds of the early and late sars-cov strains. these data suggest that antibodies directed against rbd of a sars-cov isolate may neutralize infection by a broad spectrum of sars-cov strains. therefore, recombinant proteins containing rbd or vectors encoding rbd may be used as vaccines for preventing infection by sars-cov with distinct genotypes. an ideal sars vaccine should ) elicit highly potent neutralizing antibody responses against a broad spectrum of viral strains; ) induce protection against infection and transmission; and ) be safe by not inducing any infectionenhancing antibodies or harmful immune or inflammatory responses. currently, an inactivated sars-cov vaccine is in clinical trials in china. safety is the major concern for this type of vaccine ( ) . the s protein is the major inducer of neutralizing antibodies. 
recombinant vector-based vaccines expressing full-length s protein of the late sars-cov are under development. these vaccines can induce potent neutralizing and protective responses in immunized animals but may induce antibodies that enhance infection by early human sars-cov and animal sars-cov-like viruses ( ) . recent studies have demonstrated that recombinant rbd consists of multiple conformational neutralizing epitopes that induce highly potent neutralizing antibodies against sars-cov ( , , ( ) ( ) ( ) ( ) . unlike fulllength s protein, rbd does not contain immunodominant sites that induce nonneutralizing antibodies. rbd sequences are relatively conserved. thus, recombinant rbd or vectors encoding rbd may be used as safe and efficacious vaccines for preventing infection by sars-cov with distinct genotypes. dr. jiang is associate member and head of the viral immunology laboratory, lindsley f. kimball research institute, new york blood center. his primary research interests include development of vaccines and therapeutic agents against sars-cov and hiv. 
severe acute respiratory syndrome unique and conserved features of genome and proteome of sars-coronavirus, an early split-off from the coronavirus group lineage isolation and characterization of viruses related to the sars coronavirus from animals in southern china molecular evolution of the sars coronavirus during the course of the sars epidemic in china molecular epidemiology of the novel coronavirus that causes severe acute respiratory syndrome evasion of antibody neutralization in emerging severe acute respiratory syndrome coronaviruses effectiveness of precautions against droplets and contact in prevention of nosocomial transmission of severe acute respiratory syndrome (sars) immunogenicity of sars inactivated vaccine in balb/c mice inactivated sars-cov vaccine elicits high titers of spike protein-specific antibodies that block receptor binding and virus entry epitope mapping and biological function analysis of antibodies produced by immunization of mice with an inactivated chinese isolate of severe acute respiratory syndrome-associated coronavirus (sars-cov) intranasal immunization with inactivated sars-cov (sars-associated coronavirus) induced local and serum antibodies in mice caution urged on sars vaccines glycan arrays lead to the discovery of autoimmunogenic activity of sars-cov sars-associated coronavirus angiotensin-converting enzyme is a functional receptor for the sars coronavirus a model of the ace structure and function as a sars-cov receptor a -amino-acid fragment of the sars coronavirus s protein efficiently binds angiotensin-converting enzyme the sars-cov s glycoprotein: expression and functional characterization the secret life of ace as a receptor for the sars virus ph-dependent entry of severe acute respiratory syndrome coronavirus is mediated by the spike glycoprotein and enhanced by dendritic cell transfer through dc-sign cd l (l-sign) is a receptor for severe acute respiratory syndrome coronavirus interaction between the heptad repeat and 
regions in spike protein of sarsassociated coronavirus: implication for virus fusogenic mechanism and identification of fusion inhibitors structural characterization of the sars-coronavirus spike s fusion protein core crystal structure of sars-cov spike protein fusion core a dna vaccine induces sars coronavirus neutralization and protective immunity in mice identification of two neutralizing regions on the severe acute respiratory syndrome coronavirus spike glycoprotein produced from the mammalian expression system amino acids to in the s region of severe acute respiratory syndrome coronavirus s protein induce neutralizing antibodies: implications for the development of vaccines and antiviral agents b-cell responses in patients who have recovered from severe acute respiratory syndrome target a dominant site in the s domain of the surface spike glycoprotein severe acute respiratory syndrome coronavirus spike protein expressed by attenuated vaccinia virus protectively immunizes mice mucosal immunisation of african green monkeys (cercopithecus aethiops) with an attenuated parainfluenza virus expressing the sars coronavirus spike protein for the prevention of sars identification of immunodominant sites on the spike protein of severe acute respiratory syndrome (sars) coronavirus: implication for developing sars diagnostics and vaccines identification of antigenic sites mediating antibody-dependent enhancement of feline infectious peritonitis virus infectivity enhancement of human immunodeficiency virus type- (hiv- ) infection by antisera to peptides from the envelope glycoproteins gp /gp immunization with modified vaccinia virus ankara-based recombinant vaccine against severe acute respiratory syndrome is associated with enhanced hepatitis in ferrets identification of a critical neutralization determinant of severe acute respiratory syndrome (sars)-associated coronavirus: importance for designing sars vaccines recombinant modified vaccinia virus ankara expressing the 
spike glycoprotein of severe acute respiratory syndrome coronavirus induces protective neutralizing antibodies primarily targeting the receptor binding region receptor-binding domain of sars-cov spike protein induces highly potent neutralizing antibodies: implication for developing subunit vaccine receptor-binding domain of sars coronavirus spike protein contains multiple conformationdependent epitopes that induce highly potent neutralizing antibodies molecular and biological characterization of human monoclonal antibodies binding to the spike and nucleocapsid proteins of severe acute respiratory syndrome coronavirus key: cord- - wfb gt authors: ghorbani, mahdi; brooks, bernard r.; klauda, jeffery b. title: critical sequence hot-spots for binding of ncov- to ace as evaluated by molecular simulations date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: wfb gt the novel coronavirus (ncov- ) outbreak has put the world on edge, causing millions of cases and hundreds of thousands of deaths all around the world, as of june , let alone the societal and economic impacts of the crisis. the spike protein of ncov- resides on the virion’s surface mediating coronavirus entry into host cells by binding its receptor binding domain (rbd) to the host cell surface receptor protein, angiotensin converter enzyme (ace ). our goal is to provide a detailed structural mechanism of how ncov- recognizes and establishes contacts with ace and its difference with an earlier coronavirus sars-cov in via extensive molecular dynamics (md) simulations. numerous mutations have been identified in the rbd of ncov- strains isolated from humans in different parts of the world. in this study, we investigated the effect of these mutations as well as other ala-scanning mutations on the stability of rbd/ace complex. it is found that most of the naturally-occurring mutations to the rbd either strengthen or have the same binding affinity to ace as the wild-type ncov- . 
this may have implications for high human-to-human transmission of coronavirus in regions where these mutations have been found as well as any vaccine design endeavors since these mutations could act as antibody escape mutants. furthermore, in-silico ala-scanning and long-timescale md simulations, highlight the crucial role of the residues at the interface of rbd and ace that may be used as potential pharmacophores for any drug development endeavors. from an evolutional perspective, this study also identifies how the virus has evolved from its predecessor sars-cov and how it could further evolve to become more infectious. the novel coronavirus (ncov- ) outbreak emerging from china has become a global pandemic and a major threat for human public health. according to world health organization (who) as of june th , there has been about million confirmed cases and approaching , deaths due to coronavirus in the world. [ ] [ ] much of the human population including the united states of america were under lockdown or official stay-at-home orders to minimize the continued spread of the virus. coronaviruses are a family of single-stranded enveloped rna viruses. phylogenetic analysis of coronavirus genome has shown that ncov- belongs to the beta-coronavirus family, which also includes mers-cov, sars-cov and bat-sars-related coronaviruses. [ ] [ ] it is worth mentioning that sars-cov, which was widespread in caused more than , cases and about deaths and mers-cov (middle east respiratory syndrome coronavirus) in also spread in more than countries, causing about , cases and more than deaths. (www.who.int/health-topics/coronavirus). in all coronaviruses, a homotrimeric spike glycoprotein on the virion's envelope mediates coronavirus entry into host cells through a mechanism of receptor binding followed by fusion of viral and host membranes. , coronavirus spike protein contains two functional subunits s and s . 
the s subunit is responsible for binding to host cell receptor, and the s subunit is responsible for fusion of viral and host cell membranes. , the spike protein in ncov- exists in a meta-stable pre-fusion conformation that undergoes a substantial conformational rearrangement to fuse the viral membrane with the host cell membrane. , ncov- is closely related to bat coronavirus ratg with about . % sequence similarity in the spike protein gene. the sequence similarity of ncov- and sars-cov is less than % in the spike genome. s subunit in the spike protein includes a receptor binding domain (rbd) that recognizes and binds to the host cells receptor. the rbd of ncov- shares . % sequence identity to sars-cov rbd and the root mean squared deviation (rmsd) for the structure between the two proteins is . which shows the high structural similarity. , , experimental binding affinity measurements using surface plasmon resonance (spr) have shown that ncov- fold higher affinity than sars-cov binding to ace . based on the sequence similarity between rbd of ncov- and sars-cov and also the tight binding between rbd of ncov- and ace , it is most probable that ncov- uses this receptor on human cells to gain entry into the body. , , , the spike protein and specifically the rbd domain in coronaviruses have been a major target for therapeutic antibodies. however, no monoclonal antibodies targeted to rbd have been able to bind efficiently and neutralize ncov- . , the core of ncov- rbd is a - the sequence alignment between sars-cov in human, sars civet, bat ratg coronavirus and ncov- in the rbm is shown in figure . there is a % sequence similarity between the rbm of ncov- and sars-cov. rbm mutations played an important role in the sars epidemic in . , two mutations in the rbm of sars- from sars-civet were observed from strains of these viruses. these two mutations were k n and s t. these two residues are close to the virus binding hotspots in ace including hotspot- and hotspot- . 
hotspot- centers on the salt-bridge between k -e and hotspot- is centered on the salt-bridge between k -e on ace . residues k and s in sars-civet are in close proximity with these hotspots and mutations at these residues caused sars to bind ace with significantly higher affinity than sars-civet and played a major role in civet-to-human and human-to-human transmission of sars coronavirus in . , [ ] [ ] [ ] numerous mutations in the interface of sars-cov rbd and ace from different strains of sars isolated from humans in have been identified and the effect of these mutations on binding ace have been investigated by surface plasmon resonance. , two identified rbd mutations (y f and l f) increased the binding affinity of sars-cov to ace and two mutations (n k, t s) decreased the binding affinity. it was demonstrated that these mutations were viral adaptations to either human or civet ace . , a pseudotyped viral infection assay of the interaction between different spike proteins and ace confirmed the correlation between high affinity mutants and their high infection. further investigation of rbd residues in binding of sars-cov and ace was performed through ala-scanning mutagenesis, which resulted in identification of residues that reduce binding affinity to ace upon mutation to alanine. these residues are k , r , d , d , i , n , f , q , y and r . rbd mutations have also been identified in mers-cov, which affected their affinity to receptor (dpp ) on human cells. it is not known whether these mutations are linked to the severity of coronavirus in these regions. the focus of this article is to elucidate the differences between the interface of sars-cov and ncov- with ace to understand with atomic resolution the interaction mechanism and hotspot residues at the rbd/ace interface using long-timescale molecular dynamics (md) simulation. 
an alanine-scanning mutagenesis in the rbm of ncov- helped to identify the key residues in the interaction, which could be used as potential pharmacophores for future drug development. furthermore, we performed molecular simulations on the seven most common mutations found from surveillance of rbd mutations: n k, t i, v a, g s, s p, v f and a v. from an evolutionary perspective, this study shows the residues at which the virus might further evolve to become even more dangerous to human health. ncov- shares % sequence similarity with sars- spike protein, % sequence identity for rbd and % for the rbm. bat coronavirus ratg seems to be the closest relative of ncov- , sharing about % sequence identity in the spike protein. the mutations selected are listed in table s along with their location in rbd. the crystal structures of ncov- in complex with hace (pdb id: m j) and of sars-cov in complex with human ace (pdb id: acj) were obtained from rcsb (www.rcsb.org). the steps of energy minimization were performed using the steepest descent algorithm. in all steps the lincs algorithm was used to constrain all bonds containing hydrogen atoms, and an integration time step of fs was used. equilibration of all systems was performed in three steps. in the first step, , steps of simulation were performed using a velocity-rescaling thermostat to maintain the temperature at k with a . ps coupling constant in the nvt ensemble, under periodic boundary conditions and with harmonic restraints on the backbone and side-chain atoms of the complex. the velocity-rescaling thermostat was used in all other steps of simulation. in the next step, we performed , steps in the isothermal-isobaric (npt) ensemble at a temperature of k and a pressure of bar using a berendsen barostat. this was done while decreasing the force constant of the restraint on the backbone and side-chain atoms of the complex from to and finally to .
the berendsen barostat was used only for the equilibration step because it is useful for rapidly correcting the density. in the next step the restraints were removed, and the systems were subjected to , , steps of md simulation in the npt ensemble. in the production run, harmonic restraints were removed and all the systems were simulated in the npt ensemble, with the pressure maintained at bar using the parrinello-rahman barostat with a compressibility of and a coupling constant of . ps. it is important to note that the berendsen barostat was used only for the equilibration steps, as it has been shown that this barostat can cause unrealistic temperature gradients. the production runs lasted ns for the sars-cov and ncov- complexes and ns for all the mutants, using a fs timestep and the particle-mesh ewald (pme) method for long-range electrostatic interactions, with the gromacs . package. all mutant systems were constructed as described before, and each complex was simulated for ns of production run. the principal components were used to calculate and plot the approximate free energy landscape (afel). we refer to the free energy landscape produced by this approach as approximate because the ensemble with respect to the first few pcs (the lowest-frequency quasiharmonic modes) is not close to convergence; the analysis can nevertheless provide valuable information and insight. the g_sham, g_covar and g_anaeig tools in gromacs were used to obtain the principal components and the afel. in each afel the deep valleys represent the most stable conformations, separated by some intermediate states. the dynamic cross-correlation maps (dccm) were obtained using the md_task package to identify the correlated motions of rbd residues. in dccm the cross-correlation matrix by setting a value of and for solvent and solute dielectric constants. the non-polar free energy is estimated from the solvent-accessible surface area (sasa) of the solute using equation .
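The approximate free energy landscape described above can be illustrated with a minimal sketch. It assumes the usual Boltzmann-inversion form, G = -kT ln(P / P_max), applied to a two-dimensional histogram of the projections onto the first two principal components. The function name, the bin count and the default temperature of 300 K are illustrative assumptions, not values taken from the study, and this is not the g_sham implementation.

```python
import math
from collections import Counter

K_B = 0.0083144626  # Boltzmann constant in kJ/(mol*K)


def approximate_free_energy_landscape(pc1, pc2, temperature=300.0, bins=20):
    """Boltzmann-invert a 2D histogram of PC projections.

    Returns a dict mapping populated (i, j) histogram cells to
    G = -kT * ln(count / max_count) in kJ/mol, so the most populated
    cell has G = 0. Unpopulated cells are simply absent from the dict.
    The temperature default is an assumption for illustration only.
    """
    if len(pc1) != len(pc2) or len(pc1) == 0:
        raise ValueError("pc1 and pc2 must be non-empty and of equal length")

    def bin_index(value, low, high):
        # Map a value onto 0..bins-1; the maximum falls in the last bin.
        if high == low:
            return 0
        return min(int((value - low) / (high - low) * bins), bins - 1)

    low1, high1 = min(pc1), max(pc1)
    low2, high2 = min(pc2), max(pc2)
    counts = Counter(
        (bin_index(x, low1, high1), bin_index(y, low2, high2))
        for x, y in zip(pc1, pc2)
    )
    max_count = max(counts.values())
    kt = K_B * temperature
    return {cell: -kt * math.log(n / max_count) for cell, n in counts.items()}
```

In such a map the deepest valleys (values near zero) correspond to the most frequently sampled conformations. Because the histogram comes from a finite trajectory, poorly sampled regions carry large statistical uncertainty, which is why the landscape is called approximate.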
to compute the rmsd of the systems, the rotational and translational movements were removed by first fitting the cα atoms of the rbd to the crystal structure and then computing the rmsd with respect to the cα atoms of the rbd in each system. in most of the variants, the rmsd is stable during the ns simulation. however, a few mutations show some rmsd variance. in mutation y a, the rmsd increases from the first two eigenvectors were used to calculate and plot the afel as a function of the first two principal components using the last ns of the simulation for the mutant systems. afels for a few mutants are shown in figure and the rest are shown in figure s . the binding energetics between ace and the rbd of sars-cov, ncov- and all its mutant complexes were investigated by the mmpbsa method. for ncov- . the binding free energy for ncov- and sars-cov was decomposed into per-residue contributions to find the residues that contribute strongly to binding and complex formation (figure ). most of the investigated residues in the rbm of ncov- had a favorable contribution to the total binding energy. binding free energy decomposition to its individual components for all mutants is represented in contributes the most to this low binding energies for these mutants. the contributions of rbm residues to binding with ace for ncov- were mapped onto the rbd structure and are shown in figure b . natural mutants exhibited similar or higher binding affinities compared to wild-type ncov- . importantly, mutation t i, which is one of the most frequently occurring mutations in england based on the gisaid database table contribution of interface residues to structure in rbd of ncov- . the rbd domain is purple and the ace is yellow. the rbd in contact with ace is rendered in a surface format, with more red being a favorable contribution to binding (more negative) and blue an unfavorable contribution (positive). in this work, we performed md simulations to unveil the detailed molecular mechanism .
d in sars-cov is located in a region of high negative charge from residues e , e and d on ace . electrostatic repulsion between d on sars-cov and the acidic residues on ace is the reason for the strongly unfavorable contribution of this residue to the binding of sars-cov to ace . mutation to s at this location removes this unfavorable contribution. to our knowledge, this is the first detailed molecular simulation study on the effect of mutations on the binding of ncov- to ace . previous computational studies have found that ncov- binds to ace with a total binding affinity about stronger than that of sars-cov, which is in fair agreement with the results here. the critical role of interface residues and residues has been computationally investigated here and in other articles, and the results of all the studies indicate the importance of these residues for the stability of the complex and for finding hotspot residues for the interaction with the receptor ace . , [ ] [ ] [ ] it is interesting to note the role of shown there is a correlation between higher binding affinity to the receptor and higher infection rate by coronavirus. [ ] [ ] [ ] [ ] high binding affinity for some mutants such as t i could be the reason for the higher human-to-human transmission rate in regions where these mutations are found. it is also concerning that mutations at other residues such as g a, g a, y a and y a increase the binding affinity considerably; these should be monitored. mutations of the ncov- rbd that do not change the binding affinity or complex stability could have implications for antibody design, since they could act as antibody escape mutants. escape from monoclonal antibodies was observed for mutations of sars-cov in , and such escape mutations should be considered in any antibody design endeavors against ncov- .
in conclusion, this study unraveled key molecular traits underlying the higher affinity of ncov- for ace compared to sars-cov and unveiled critical residues for the interaction higher affinity than wild-type. other occurring mutations, n k and e a, are found to increase the electrostatic interaction of rbd with ace . it is also concerning that some of the alanine substitutions at residues g , g and y substantially increased the binding affinity, which may lead to a stronger rbd attachment to ace and influence the virulence of infection. on the other hand, most mutations are found not to impact the binding affinity of rbd with ace in ncov- , which could have implications for vaccine design endeavors, as these mutations could act as antibody escape mutants. receptor recognition is the first line of attack for coronavirus, and this study gives novel insights into key structural features of interface residues for the advancement of effective therapeutic strategies to stop the coronavirus pandemic.
the authors would like to dedicate this article to the doctors and nurses who sacrificed their time, health and even their lives to fight covid- , particularly those in iran and the united states. jbk would also like to dedicate this work to family friend joe kaplan (silver spring, md) who passed away due to covid- on april , .
a novel coronavirus from patients with pneumonia in china a pneumonia outbreak associated with a new coronavirus of probable bat origin structure, function, and evolution of coronavirus spike proteins structural and functional basis of sars-cov- entry by using human ace receptor recognition by the novel coronavirus from wuhan: an analysis based on decade-long structural studies of sars coronavirus cryo-em structure of the -ncov spike in the prefusion conformation.
science the coronavirus spike protein is a class i virus fusion protein: structural and functional characterization of the fusion core complex structural insights into the middle east respiratory syndrome coronavirus a protein and its dsrna binding mechanism structure, function, and antigenicity of the sars-cov- spike glycoprotein potent binding of novel coronavirus spike protein by a sars coronavirus-specific human monoclonal antibody receptor recognition mechanisms of coronaviruses: a decade of structural studies receptor and viral determinants of sars-coronavirus adaptation to human ace structural analysis of major species barriers between humans and palm civets for severe acute respiratory syndrome coronavirus infections receptor recognition and cross-species infections of sars coronavirus mechanisms of host receptor adaptation by severe acute respiratory syndrome coronavirus structural analysis of major species barriers between humans and palm civets for severe acute respiratory syndrome coronavirus infections the sars coronavirus s glycoprotein receptor binding domain: fine mapping and functional characterization development and characterization of a severe acute respiratory syndrome-associated coronavirus-neutralizing human monoclonal antibody that provides effective immunoprophylaxis in mice neutralizing human monoclonal antibodies to severe acute respiratory syndrome coronavirus: target, mechanism of action, and therapeutic potential human monoclonal antibody combination against sars coronavirus: synergy and coverage of escape mutants vaccine efficacy in senescent mice challenged with recombinant sars-cov bearing epidemic and zoonotic spike variants structural basis for potent cross-neutralizing human monoclonal antibody protection against lethal human and zoonotic severe acute respiratory syndrome coronavirus challenge escape from human monoclonal antibody neutralization affects in vitro and in vivo fitness of severe acute respiratory syndrome coronavirus 
disease and diplomacy: gisaid's innovative contribution to global health structural basis for receptor recognition by the novel coronavirus from wuhan mcsm-ppi : predicting the effects of mutations on protein the sars coronavirus s glycoprotein receptor binding domain: fine mapping and functional characterization structure of the sars-cov- spike receptor-binding domain bound to the ace receptor the protein data bank gromacs: high performance molecular simulations through multi-level parallelism from laptops to supercomputers improved side-chain torsion potentials for the amber ff sb protein force field optimization of the additive charmm all-atom protein force field targeting chem. phys journal of computational chemistry polymorphic transitions in single crystals: a new molecular dynamics method constant pressure molecular dynamics simulation: the langevin piston method articles you may be interested in harmonic analysis of large systems. i. methodology principal component analysis: a method for determining the essential dynamics of proteins gromacs: high performance molecular simulations through multi-level parallelism from laptops to supercomputers a software suite for analyzing molecular dynamics trajectories assessing the performance of the g_mmpbsa tools to simulate the inhibition of oseltamivir to influenza virus neuraminidase by molecular mechanics poisson-boltzmann surface area methods recent developments and applications of the mmpbsa method virtual screening using molecular simulations insight into the interactive residues between two domains of human somatic angiotensin-converting enzyme and angiotensin ii by mm-pbsa calculation and steered molecular dynamics simulation enhanced receptor binding of sars-cov- through networks of hydrogen-bonding and hydrophobic interactions is the rigidity of sars-cov- spike receptor-binding motif the hallmark for its enhanced infectivity? 
insights from all-atom simulations characterization of the receptor-binding domain (rbd) of novel coronavirus: implication for development of rbd protein as a viral attachment inhibitor and vaccine structural and functional basis of sars-cov- entry by using human ace phylogenetic network analysis of sars-cov- genomes is the rigidity of sars-cov- spike receptor-binding motif the hallmark for its enhanced infectivity? an answer from all-atoms simulations the sars-cov- exerts a distinctive strategy for interacting with the ace human receptor computational simulations reveal the binding dynamics between human ace and the receptor binding domain of sars-cov- spike protein receptor recognition by the novel coronavirus from wuhan: an analysis based on decade-long structural studies of sars coronavirus receptor and viral determinants of sars-coronavirus adaptation to human ace efficient replication of severe acute respiratory syndrome coronavirus in mouse cells is limited by murine angiotensin-converting enzyme sars-cov- spike glycoprotein receptor binding domain is subject to negative selection with predicted positive selection mutations key: cord- - lt h authors: jarvis, matthew c.; lam, ham ching; zhang, yan; wang, leyi; hesse, richard a.; hause, ben m.; vlasova, anastasia; wang, qiuhong; zhang, jianqiang; nelson, martha i.; murtaugh, michael p.; marthaler, douglas title: genomic and evolutionary inferences between american and global strains of porcine epidemic diarrhea virus date: - - journal: prev vet med doi: . /j.prevetmed. . . sha: doc_id: cord_uid: lt h porcine epidemic diarrhea virus (pedv) has caused severe economic losses both recently in the united states (us) and historically throughout europe and asia. traditionally, analysis of the spike gene has been used to determine phylogenetic relationships between pedv strains. 
we determined the complete genomes of pedv field samples from us swine and analyzed the data in conjunction with complete genome sequences available from genbank (n = ) to determine the most variable genomic areas. our results indicate high levels of variation within the orf and spike regions while the c-terminal domains of structural genes were highly conserved. analysis of the receptor binding domains in the spike gene revealed a limited number of amino acid substitutions in us strains compared to asian strains. phylogenetic analysis of the complete genome sequence data revealed high rates of recombination, resulting in differing evolutionary patterns in phylogenies inferred for the spike region versus whole genomes. these finding suggest that significant genetic events outside of the spike region have contributed to the evolution of pedv. porcine epidemic diarrhea virus (pedv) causes diarrhea, vomiting, and dehydration, leading to high mortality (up to %) in suckling piglets. pedv was first discovered in the united kingdom in , and later was found in belgium, hungary, france, italy, and the czech republic (chasey and cartwright, ; fan et al., ) . in , pedv was first reported in china, and proceeded to spread throughout asia (cui, ; song and park, ) . in late , a "variant" pedv strain with increased pathogenesis compared to the pedv is a single-stranded, positive sense rna virus belonging to the family coronaviridae, genus alphacoronavirus. the pedv genome is approximately kb in length and roughly two-thirds of the genome consists of open reading frame (orf) , which encodes non-structural proteins (nsps) (lai et al., ) . these nsps play important roles in viral replication, post-translational processing, and immune evasion (lai et al., ) . the virus produces various structural proteins, including spike, membrane, and nucleocapsid (lai et al., ) . 
the spike protein is crucial to cell attachment and infection, and the envelope is an integral membrane protein, aiding in membrane fusion while the nucleocapsid protein is necessary for genomic packaging (hagemeijer and de haan, ) . in addition, the pedv genome includes orf , located between the spike and membrane genes, that encodes an ion channel protein possibly associated with pedv pathogenesis (park et al., ; wang et al., ) . researchers have explored various regions of the coronavirus (cov) genome to link specific areas with virulence and host cell attachment. for example, the spike gene codes for a viral attachment protein that can be divided into the s ( - aa) and s ( - aa) regions (song and park, ) . comparative analysis of transmissible gastroenteritis virus (tgev), porcine respiratory coronavirus (prcv), and murine hepatitis virus (mhv) revealed two main antigenic sites in the s region: the n-terminal domain (ntd) and the c-terminal receptor binding domain (rbd) (li et al., ) . while both domains can influence virus infectivity, such as in tgev, one domain tends to be central to a cov's tropism: the ntd is important for mhv tropism, and the rbd is central to pedv infectivity and virulence (reguera et al., ) . the ntd can bind to various sialic acids on the host cell surface (reguera et al., ) . the rbd contains residues that bind to the porcine aminopeptidase-n (papn), the host receptor utilized by tgev and pedv (delmas et al., ) . since the last large-scale north american pedv outbreak ended in the spring of , the complete genomes of pedv strains from the us were sequenced and analyzed to further understand the origin and phylogenetic relationships among the american and global pedv strains. in-depth nucleotide and amino acid analysis was conducted to identify genes of high diversity. bayesian analysis was performed to understand the evolution of pedv and the emergence of different clades within us strains. 
in addition, the rbd was modeled to visualize the differences between american and asian strains to better understand how changes in the rbd might affect vaccine efficacy and development. samples were routinely submitted to the university of minnesota veterinary diagnostic laboratory (umvdl) for pathogen detection. between january and december samples were screened for pedv by real time rt-pcr . samples for complete genome sequencing were selected based on the criteria of a high viral concentration from the rt-pcr results and geographical diversity within the us. a total of samples, including fecal (n = ), intestinal homogenate (n = ), fecal swab (n = ), oral fluid (n = ), feedback (n = ), and environmental (n = ) samples were selected for complete genome sequencing using next generation sequencing (ngs) techniques as previously described (genbank numbers kr -kr , kr -kr ) (marthaler et al., ; marthaler et al., ) . whole genomic pedv sequences obtained using ngs techniques were also generously supplied from iowa state university (n = , genbank numbers km -km ) and the ohio department of agriculture (n = , genbank numbers kp -kp ), using previously described methods chen et al., ) . using the complete pedv genome sequences from this study (n = ) and the available pedv sequences from genbank (n = ), two nucleotide alignments were created and analyzed to determine the phylogenetic relationships between american and global pedv sequences: the concatenation of all orfs (orf , s, orf , envelope, membrane, and nucleocapsid), and a s alignment. vaccine and cell-passaged strains were excluded from the analysis (table s ). nucleotide and amino acid entropy analyses were performed using the matlab software (matlab v . and statistics toolbox v . , the mathworks, inc., natick, ma, usa). threshold values were determined using previously published methods (shannon, ; litwin and jores, ) . 
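The per-position entropy analysis described above can be sketched as follows. This is a minimal illustration of Shannon entropy computed column by column over an alignment, not the MATLAB code used in the study. The function names, the choice of log base 2, the handling of gap characters and the threshold passed by the caller are all assumptions made for the example.

```python
import math
from collections import Counter


def column_entropy(column, ignore="-"):
    """Shannon entropy (in bits) of one alignment column.

    Gap characters listed in `ignore` are skipped; skipping gaps and
    using log base 2 are assumptions, not details from the source.
    """
    symbols = [s for s in column if s not in ignore]
    if not symbols:
        return 0.0
    total = len(symbols)
    counts = Counter(symbols)
    # sum of p * log2(1/p); a fully conserved column gives exactly 0.0
    return sum((n / total) * math.log2(total / n) for n in counts.values())


def alignment_entropy(sequences):
    """Entropy for every column of equal-length aligned sequences."""
    if len({len(seq) for seq in sequences}) != 1:
        raise ValueError("all aligned sequences must have the same length")
    return [column_entropy(column) for column in zip(*sequences)]


def variable_positions(entropies, threshold):
    """0-based indices of columns whose entropy exceeds the threshold."""
    return [i for i, h in enumerate(entropies) if h > threshold]
```

For example, calling `alignment_entropy(["ACGT", "ACGA", "ACTA", "AC-A"])` gives zero entropy for the two fully conserved leading columns and positive entropy for the last two, so `variable_positions` with a suitable threshold would flag only those. The same functions apply unchanged to amino acid alignments.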
recombination analysis was performed using the recombination detection program (rdp) v , which uses multiple detection algorithms, including the rdp method, geneconv, and maxchi, to check for the presence of recombinant sequences in the sequence dataset (martin et al., ). window size was set to bp. breakpoints, the presence of major/minor donor sequences, and confidence intervals were used to determine which regions required excision from the alignment, and whether entire sequences needed to be removed from the analysis due to multiple recombination events within the sequence. recombinant sequences were removed only prior to the bayesian analysis, but remained in the alignments for all entropy analysis and molecular modeling. a bayesian markov chain monte carlo (mcmc) approach implemented in beast v . . was used to infer a time-scaled phylogeny, with a relaxed molecular clock, a bayesian skyline population (bsp) prior, a general time-reversible nucleotide substitution model, and gamma-distributed among-site rate variation (drummond et al., , , ; drummond and rambaut, ; minin et al., ; drummond and suchard, ). the mcmc chain was run for million generations, with sub-sampling every , iterations. a maximum clade credibility (mcc) tree was summarized in treeannotator (v. . . ) after discarding the initial % of the chains as burn-in. key nodes were identified using figtree (v. . . ) to determine time to most recent common ancestor (tmrca). the putative papn receptor-binding residues were analyzed to determine residue trends between classical and pandemic strains (reguera et al., ). the c-terminal rbd within the s region of the spike gene was modeled using the open-source modeling server swiss-model provided by the swiss institute of bioinformatics (biasini et al., ). the predicted tertiary structure of the pedv papn rbd was modeled using prcv as a template since a pedv template was not available.
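two steps of the bayesian workflow above, discarding the initial part of the mcmc chain and summarizing a parameter by its highest posterior density (hpd) interval, can be sketched in python as follows. this is an illustrative sketch, not what beast or treeannotator run internally; the function names, the burn-in fraction, and the interval mass are hypothetical placeholders.

```python
import math


def discard_burn_in(samples, burn_in_fraction):
    """Drop the first `burn_in_fraction` of an MCMC trace (fraction is a placeholder)."""
    if not 0 <= burn_in_fraction < 1:
        raise ValueError("burn_in_fraction must be in [0, 1)")
    start = int(len(samples) * burn_in_fraction)
    return samples[start:]


def hpd_interval(samples, mass=0.95):
    """Narrowest interval that contains `mass` of the sampled values."""
    if not samples:
        raise ValueError("no samples")
    ordered = sorted(samples)
    n = len(ordered)
    # Number of samples the interval must cover (small tolerance guards float rounding).
    k = min(n, max(1, math.ceil(mass * n - 1e-9)))
    # Slide a window of k consecutive ordered samples and keep the narrowest one.
    best = min(range(n - k + 1), key=lambda i: ordered[i + k - 1] - ordered[i])
    return ordered[best], ordered[best + k - 1]


# Toy trace, not real output: one extreme value sits in the upper tail.
toy_trace = [0, 1, 2, 3, 4, 5, 6, 7, 8, 100]
toy_hpd = hpd_interval(toy_trace, mass=0.9)
toy_kept = discard_burn_in(list(range(10)), burn_in_fraction=0.1)
```

unlike an equal-tailed interval, the hpd interval shifts toward the denser part of the posterior, which is why the single outlier in the toy trace is excluded.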
spike monomer and trimer models were developed using a theoretical sars-cov model as a template (bernini et al., ). illustrations were created using the open-source java-based molecular viewer jmol (herraez, ) and the python-based molecular viewer pymol (the pymol molecular graphics system, version . . schrödinger, llc.). the pedv nucleotide sequences ranged from , to , bases in length. two pedv genomes from our study had insertions or deletions. ohio had a -nt insertion between positions , and , in the spike gene, while minnesota had a -nt deletion from positions , to , in the utr compared to the original us strain, usa/colorado/ . entropy analysis was conducted with whole nucleotide and amino acid sequences, containing the concatenated orfs excluding and utrs. entropy values greater than . and . were considered highly variable for the nucleotide and amino acid alignments, respectively, based on the level of diversity in the dataset and previously determined entropy values (litwin and jores, ). within the nucleotide alignment, of the pedv regions lacked positions with entropy levels above . (nsp , nsp -nsp , nsp , nsp , nsp , s , orf , envelope, membrane, and nucleocapsid) while regions had entropy levels above . (nsp , nsp , nsp , nsp , and s ) (fig. ). the nsp and were the most divergent regions containing and diverse nucleotide positions, respectively (table ). interestingly, the nsp gene contained diverse nucleotide positions, which were absent in the amino acid sequence (fig. a). conversely, high amino acid diversity was observed in the nsp , nsp , nsp , and s genes, which was absent in the nucleotide alignment (fig. b). higher entropy levels were present in the nsp , nsp , and s regions in both the nucleotide and amino acid alignments. overall, the orf a entropy levels were higher compared to orf b in the amino acid analysis.
of the structural genes, the s gene had the highest entropy levels compared to the envelope, membrane, and nucleocapsid genes. recombination was detected in main areas of the concatenated full genome, including the nsp , nsp , nsp - , s domain, and nucleocapsid gene (fig. a). in these areas, recombination was present in the majority of the sequences, so the entire region was excised from the alignment prior to bayesian analysis. in addition, sequences ( from asia, from the americas) were omitted from the bayesian analysis due to evidence of widespread recombination throughout the genome (table s ). for example, the pandemic sequence minnesota contained a recombinant region with the characteristic s-indel deletions and insertions in the s domain, indicating that a recombination event occurred between an s-indel strain and a non s-indel pathogenic strain in the us (fig. b). a maximum clade credibility (mcc) phylogeny was inferred for both the concatenated genomic sequences excluding the recombinant regions ( , nt) and the spike s gene ( nt). the analysis was run independently twice until convergence was reached, with high agreement between the two runs. in the concatenated alignment tree, the classical and pandemic asian strains were positioned as basal to the us strains, consistent with an asian origin for the us outbreak (fig. ). importantly, the concatenated alignment tree suggests that the us epidemic may have resulted from two independent pedv introductions into the us, giving rise to a minor and a major clade of viruses. the minor clade contained the american and european s-indels, and a small subclade of non s-indel sequences from ohio, including ohio / , pc a/ , and oh / . the major clade of us pedv strains was supported by high posterior probability ( %) and appears to have diverged further into two highly supported sublineages ( % and % posterior probability).
the phylogeny is consistent with multiple incursions of the major clade of us pedv viruses into mexico, canada, and south korea. the minor clade includes sequences from late to early that are localized to the midwestern and eastern us regions. the estimated tmrca of the minor clade of us pedv strains is july - , and the estimated tmrca of the major clade of us pedv strains is september -august . the estimated evolutionary rate for the complete genome (excluding recombinant regions and sequences) is . × − substitutions/site/year ( . × − - . × − , % highest posterior density (hpd)). the rate estimate for the us strains is slightly, but not significantly, higher: . × − substitutions/site/year ( . × − - . × − , % hpd). the spike tree illustrates the evolutionary relationship between the classical strains and the s-indels, which suggests a classical origin for the s-indel genotype (fig. ). the pathogenic strains form a highly diverged major clade (fig. b), which branches into large american clades. in addition, the bayesian analysis of the spike gene might suggest separate introductions of pedv into the us. the evolutionary rate for the s gene is . × − substitutions/site/year ( . × − - . × − , % hpd). considering the high entropy levels in the spike gene and the evolutionary rate determined from the s bayesian spike tree, the rbd within the s was further examined. the s-indel and classical pedv strains shared similar amino acid substitutions, specifically in the ntd of the s region (fig. s ). furthermore, the pandemic pedv strains from china had an increased number of substitutions within the s domain when compared to the american strains due to the longer circulation time in china. compared to the attenuated vaccine strain dr , of ( %) american strains and of ( %) asian strains had at least one amino acid substitution in the papn rbd (table ). the majority of the american strains (n = ) did not present any amino acid differences in the papn rbd.
in this region, positions in the american strains had amino acid differences compared to the vaccine strain dr (fig. a) . the most common substitutions were in the fourth region of the papn rbd at positions e d (n = ) and g d (n = ), which were substituted with aspartic acid. more substitutions (n = ) occurred in the papn rbd of the asian strains (table ) , with the most common substitutions at position h l (n = ). the rbd regions were three-dimensionally modeled to illustrate the and amino acid positions at which substitutions occurred in the north american and asian strains, respectively. the modeling of the spike protein suggests that the papn rbd residues cluster around the inner pore created by the trimer molecule, while the ntd is oriented around the outer surface of the s domain ( fig. b through f). genomic analysis depends critically on complete sequence data to conduct accurate research on phylogeny, evolution, and gene regulation. in the past, it was more economically and time effective to sequence smaller pieces of a genome and develop evolutionary conclusions from these relatively small genomic pieces. however, without full genomic sequences, it is impossible to compare variations within a genome to determine selective pressures on specific genes or regions. because of ngs technology, tools like site-specific entropy analysis can be used to examine variability throughout the genome of many pedv sequences. sun and collaborators reported four regions of diversity within the pedv genome, including v in the nsp and nsp , v in the s , v in the s and orf , and v in the nucleocapsid (sun et al., ) . while our results support the nucleotide variance in the nsp , nsp , and spike genes, high levels of diversity were not present in the s , orf , and the nucleocapsid. this could be due to the omission of pedv isolates in our analysis, as well as the comparatively large number of new us sequences in our dataset. 
our variance results may more accurately represent variance within american pedv strains, and underrepresent variance within chinese strains. the diversity in the s region is understandable since it is under strong immunological pressure while the s region is more conserved throughout covs (aydin et al., ). the functions of nsp and nsp remain relatively ambiguous. despite being involved in viral growth and propagation, nsp is dispensable for viral replication because cov strains can replicate in the absence of the nsp (graham et al., ). the function of nsp may be related to innate immune evasion since it encodes proteases that facilitate proteasome degradation, changes in intracellular destination, signaling, protein interactivity, and host type i interferon (ifn) antagonist activities (xing et al., ). due to the multifaceted nature of nsp , other nsp regions could produce proteins with novel effects not yet understood that mediate the virulence of cov species. acquired nucleotide differences throughout the nsp and nsp regions could contribute to the evasion of host immunity. thus, future research should focus on the functionality and importance of all pedv genes to further understand cov pathogenesis. recombination plays a pivotal role in the evolution of covs by creating new strains with altered virulence. the minnesota strain originated from a recombination event between an s-indel and a us pandemic strain, which has been associated with altered pathogenesis. while recombination may occur more often during an epidemic, recombination events occurred in most of the asian strains. recombination events can affect the phylogenetic analysis because different regions of the genome may have different evolutionary histories (spade et al., ). our recombination analysis resulted in a significant portion of the complete genome being removed prior to more detailed phylogenetic analysis.
at this time, the beast program cannot accommodate genetic data that includes recombined regions. our analysis supported an asian origin for the us outbreak, although the inference is biased by the lack of background sequences from other regions. although over genomes were from the us, interpretation of the evolutionary and spatial history of this data is limited by the lack of genomic pedv data from other regions, including europe and asia. us strains had a higher evolutionary rate compared to the global strains, but the bayesian skyline plot did not show any significant increase in evolutionary rate, possibly due to the lack of temporal sampling in the us dataset. the evolutionary rate for the spike gene was higher compared to the rest of the genome, reflecting greater selective pressure. the overall evolutionary rate of pedv ( . × − substitutions/site/year) is similar to that of tgev and wild animal covs ( . × − substitutions/site/year, . × − - . × − % hpd), but lower than that of sars-cov ( . × − nucleotide substitutions/site/year), except during the time of the us pedv epidemic ( . × − substitutions/site/year) (song et al., ; vijaykrishna et al., ). surprisingly, pandemic strains were positioned within the s-indel clade. possibly, a pandemic and an s-indel strain were introduced into the americas, and a recombination event occurred in the ntd that removed the characteristic insertions and deletions of an s-indel strain, as indicated in the minnesota sequence. the relationship between the us minor clade and the recent pedv strains from europe is less clear. while these viral populations are closely related (posterior probability of %), the direction of transmission is unclear at this time. additional sequences from europe might help to resolve the origins of these recent european pedv cases.
(e) a monomer model of the pedv spike protein, with the c-terminal rbd represented in green, dark blue represents the s region, light blue represents the s region, and yellow represents the n-terminal rbd. (f) a theoretical tertiary structure model of the pedv spike protein. blue represents the s region, with the specific n- and c-terminal rbds highlighted in yellow and green, respectively. the papn-rbd is shown in violet. (for interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.) examining the spike gene can reveal interesting conclusions about the papn rbd. while the ntd spans a larger region of the s domain, it has not been directly linked to pedv tropism and functionality, as in tgev and mhv (reguera et al., ). the korean attenuated vaccine strain and the us strains share similar residues in the papn rbd, with numerous differences compared to the older classical strains, suggesting this vaccine may protect against the american strains. however, developing a consistently and longitudinally efficacious vaccine may prove challenging, considering the high evolutionary rate of the s region. failures in the development of an efficacious vaccine have been reported, further supporting the difficulty in generating vaccines for pedv. despite some uncertainties in vaccine efficacy, a recent study demonstrated that prior exposure of sows to the s-indel strain provided a level of protective immunity when their piglets were challenged with the more virulent original us pedv strain, which is probably due to conservation within the c-terminal region of the viral genome (goede et al., ). while the exact functionality of all the genes of pedv and other covs is unknown, adding the complete genomes of diverse strains to the global database promotes better understanding of evolutionary and phylogenetic relationships.
multiple regions within the genome are variable, and recombination is common between pedv strains. despite excising a large portion of the genome prior to analysis, the bayesian trees illustrate two distinct entries of pedv into the us and characterize the evolution of pedv compared to other covs. modeling of the papn rbd region has revealed that asian strains have increasing diversity compared to previously developed vaccines, and the variability in both the american and asian strains needs to be considered for future vaccine development. as the us swine industry recovers from the pedv epidemic of - , research is maturing to understand the regions of diversity, evolution, and the rbd of pedv to prevent future outbreaks and foster vaccine development. influence of hydrophobic and electrostatic residues on sars-coronavirus s protein stability: insights into mechanisms of general viral fusion and inhibitor design prediction of quaternary assembly of sars coronavirus peplomer swiss-model: modelling protein tertiary and quaternary structure using evolutionary information virus-like particles associated with porcine epidemic diarrhoea isolation and characterization of porcine epidemic diarrhea viruses associated with the disease outbreak among swine in the united states studies on the detection of porcine epidemic diarrhea virus by immunofluorescent techniques aminopeptidase n is a major receptor for the entero-pathogenic coronavirus tgev relaxed phylogenetics and dating with confidence estimating mutation parameters, population history and genealogy simultaneously from temporally spaced sequence data beast: bayesian evolutionary analysis by sampling trees bayesian coalescent inference of past population dynamics from molecular sequences bayesian random local clocks, or one rate to rule them all bayesian phylogenetics with beauti and the beast . 
scientific opinion on porcine epidemic diarrhoea and emerginf pig deltacoronavirus complete genome sequence of a novel porcine epidemic diarrhea virus in south china previous infection of sows with a mild strain of porcine epidemic diarrhea virus confers protection against infection with a severe strain the nsp proteins of mouse hepatitis virus and sars coronavirus are dispensable for viral replication comparison of porcine epidemic diarrhea viruses from germany and the united states biomolecules in the computer: jmol to the rescue nidovirales: coronoviridae and ateriviridae outbreak-related porcine epidemic diarrhea virus strains similar to us strains, south korea porcine aminopeptidase n is a functional receptor for the pedv coronavirus phylogenetic analysis of porcine epidemic diarrhea virus (pedv) field strains in central china based on the orf gene and the main neutralization epitopes in theoretical and experimental insights into immunology complete genome sequence of strain sdcv/usa/illinois / , a porcine deltacoronavirus from the united states complete genome sequence of porcine epidemic diarrhea virus strain usa/colorado/ from the united states rdp : detection and analysis of recombination patterns in virus genomes outbreak of porcine epidemic diarrhea virus in portugal smooth skyride through a rough skyline: bayesian coalescent-based inference of population dynamics complete genome sequence of the porcine epidemic diarrhea virus variant tottori /jpn/ the first case of porcine epidemic diarrhea in canada cell culture isolation and sequence analysis of genetically diverse us porcine epidemic diarrhea virus strains including a novel strain with a large deletion in the spike gene structural bases of coronavirus attachment to host aminopeptidase n and its inhibition by neutralizing antibodies the mathematical theory of communication the bell system porcine epidemic diarrhoea virus: a comprehensive review of molecular epidemiology, diagnosis, and vaccines 
cross-host evolution of severe acute respiratory syndrome coronavirus in palm civet and human geometric ergodicity of a hybrid sampler for bayesian inference of phylogenetic branch lengths genomic and epidemiological characteristics provide new insights into the phylogeographical and spatiotemporal spread of porcine epidemic diarrhea virus in asia outbreak of porcine epidemic diarrhea in suckling piglets evolutionary insights into the ecology of coronaviruses distinct characteristics and complex evolution of pedv strains new variant of porcine epidemic diarrhea virus the papain-like protease of porcine epidemic diarrhea virus negatively regulates type i interferon pathway by acting as a viral deubiquitinase this study was supported partially by the rapid agricultural response fund, established by the minnesota legislature and administered by the university of minnesota agricultural experiment station, and by boehringer ingelheim vetmedica, inc. the authors thank the faculty and personnel at the umvdl for their technical services. supplementary data associated with this article can be found, in the online version, at http://dx.doi.org/ . /j.prevetmed. . . . key: cord- -zw usmh authors: walter, justin d.; hutter, cedric a.j.; garaeva, alisa a.; scherer, melanie; zimmermann, iwan; wyss, marianne; rheinberger, jan; ruedin, yelena; earp, jennifer c.; egloff, pascal; sorgenfrei, michèle; hürlimann, lea m.; gonda, imre; meier, gianmarco; remm, sille; thavarasah, sujani; zimmer, gert; slotboom, dirk j.; paulino, cristina; plattet, philippe; seeger, markus a. title: highly potent bispecific sybodies neutralize sars-cov- date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: zw usmh the covid- pandemic has resulted in a global crisis. here, we report the generation of synthetic nanobodies, known as sybodies, against the receptor-binding domain (rbd) of sars-cov- spike protein.
we identified a sybody pair (sb# and sb# ) that can bind simultaneously to the rbd and block ace binding, thereby neutralizing pseudotyped and live sars-cov- viruses. cryo-em analyses of the spike protein in complex with both sybodies revealed symmetrical and asymmetrical conformational states. in the symmetric complex, each of the three rbds was bound by both sybodies and adopted the up conformation. the asymmetric conformation, with three sb# and two sb# bound, contained one down rbd, one up-out rbd and one up rbd. bispecific fusions of the sybodies increased the neutralization potency -fold, as compared to the single binders. our work demonstrates that linking two binders that recognize spatially-discrete binding sites results in highly potent sars-cov- inhibitors for potential therapeutic applications. the ongoing pandemic arising from the emergence of severe acute respiratory syndrome coronavirus (sars-cov- ) in demands urgent development of effective antiviral therapeutics. several factors contribute to the adverse nature of sars-cov- from a global health perspective, including the absence of herd immunity [ ] , high transmissibility [ , ] , the prospect of asymptomatic carriers [ ] , and a high rate of clinically severe outcomes [ ] . despite intense development efforts, a vaccine against sars-cov- remains unavailable [ , ] , making alternative intervention strategies paramount. in addition to offering relief for patients suffering from the resulting covid- disease, therapeutics may also reduce the viral transmission rate by being administered to asymptomatic individuals subsequent to probable exposure [ ] . finally, given that sars-cov- represents the third global coronavirus outbreak in the past years [ , ] , development of rapid therapeutic strategies during the current crisis could offer greater preparedness for future pandemics.
akin to all coronaviruses, the viral envelope of sars-cov- harbors protruding, club-like, multidomain, homotrimeric spike proteins that provide the machinery enabling entry into human cells [ ] [ ] [ ] . the spike ectodomain is segregated into two regions, termed s and s . the outer s subunit of sars-cov- is responsible for host recognition via interaction between its c-terminal receptor-binding domain (rbd) and human angiotensin converting enzyme (ace ), present on the exterior surface of airway cells [ , ] . while there is no known host-recognition role for the s n-terminal domain (ntd) of sars-cov- , it is notable that s ntds of other coronaviruses have been shown to bind host surface glycans [ , ] . in contrast to the spike subunit s , the s subunit contains the membrane fusion apparatus, and also mediates trimerization of the ectodomain [ ] [ ] [ ] . prior to host recognition, spike proteins exist in a metastable pre-fusion state, wherein the s subunits lay atop the s region and their rbds oscillate between up and down conformations that are, respectively, capable and incapable of receptor binding [ , , ] . upon processing at the s /s and s ' cleavage sites by host proteases as well as engagement to the receptor, the s subunit undergoes dramatic conformational changes from the pre-fusion to the post-fusion state. such structural rearrangements are associated with fusion of the viral envelope with host membranes, thereby allowing release of the rna genome into the cytoplasm of the host cell [ , ] . coronavirus spike proteins are highly immunogenic [ ] , and several experimental approaches have sought to target this molecule for the purpose of virus neutralization [ ] . 
the high specificity, potency, and modular nature of antibody-based antiviral therapeutics have shown exceptional promise [ ] [ ] [ ] , and the isolated, purified rbd has been a popular target for the development of antibodies directed against the spike proteins of pathogenic coronaviruses [ ] [ ] [ ] [ ] . however, binders of the isolated rbd may not effectively engage the aforementioned pre-fusion conformation of the spike protein, which could account for the poor neutralization ability of recently described single-domain antibodies that were raised against the rbd of sars-cov- spike protein [ ] . therefore, to more easily identify molecules with qualities befitting a drug-like candidate, it would be advantageous to validate rbd-specific binders in the context of the full, stabilized, pre-fusion spike assembly [ , ] . single domain antibodies based on the variable vhh domain of heavy-chain-only antibodies of camelids, generally known as nanobodies, have demonstrated great potential in several studies [ ] . nanobodies are small, stable, and inexpensive to produce in large amounts in bacteria and yeast [ ] , yet they bind targets in a similar affinity range as conventional antibodies. due to their minimal size, they are particularly suited to reach hidden epitopes such as crevices of target proteins [ ] . we recently designed three libraries of synthetic nanobodies, termed sybodies, based on elucidated structures of nanobody-target complexes (fig. a) [ , ] . sybodies can be selected against any target protein within twelve working days, which is considerably faster than the generation of natural nanobodies, which requires repeated immunization over a period of two months prior to binder selection by phage display [ ] .
a considerable advantage of our platform is that the selection of sybodies is carried out under defined conditions; in the case of coronavirus spike proteins, this offers the opportunity to generate binders recognizing the metastable pre-fusion conformation [ , ] . finally, due to the feasibility of inhaled therapeutic nanobody formulations [ ] , virus-neutralizing sybodies could offer a convenient, fast and direct means of prophylaxis. here, we identified a series of sybodies that bind to two non-overlapping epitopes at the rbd of sars-cov- . when fused to generate a bispecific binder format, the sybodies potently neutralize viral entry of both pseudotyped and live viruses. cryo-em analyses confirmed simultaneous binding of two sybodies and revealed a novel asymmetric spike conformation with one up rbd, one up-out rbd and one down rbd. sybodies were selected using two rbd constructs fused to additional domains (fc of mouse igg and vyfp, respectively). our "target swap" selection approach (fig. s ) resulted in two enriched pools for each of the three sybody libraries (concave, loop and convex, fig. a ). an off-rate selection step was performed using the pre-enriched purified sybody pool after phage display round as competitor (see materials and methods). after two rounds of phage display, strong enrichment by factors ranging from to was determined by qpcr (table s ). elisa screening was performed using rbd-vyfp (rbd), commercially acquired spike ectodomain containing wild-type s and s (ecd), and maltose binding protein (mbp) as negative control. elisa analysis revealed very high hit rates for the rbd and the ecd, ranging from % to % and % to %, respectively (fig. s , table s ). at a later stage, we also performed elisas using engineered pre-fusion-stabilized spike ectodomain, containing two stabilizing proline mutations (s- p) [ ] (fig. s ).
while most elisa signals for the ecd and s- p were highly similar, we found around sybodies with stronger binding to ecd than to s- p, which can be explained by the fact that the s- p forms a stable trimer, whereas the ecd lacked stabilizing proline mutations as well as the c-terminal foldon trimerization motif and therefore may be predominantly dissociated into monomers with increased internal epitope accessibility. in addition, the ecd might partially or completely adopt a post-fusion state, whereas s- p is expected to be stabilized in the trimeric pre-fusion state [ , ] . elisa-positive sybodies were sequenced ( for each of the selection reactions numbered from sb# - , see also fig. s ). sequencing results of out of sybody clones were unambiguous. out of these clones, were found to be unique and belonged to the concave ( ), loop ( ) and convex ( ) sybody libraries (fig. s , fig. s , table s ). there were no duplicate binders identified in both selection variants, indicating that the two separate selection streams gave rise to completely different sybody populations. two other research groups also used our sybody libraries to generate binders against the sars-cov- rbd [ , ] . interestingly, there is no sequence overlap amongst binder hits in these three independent sybody generation campaigns. this demonstrates that the sybody libraries are highly diverse and suggests that identical binders must be the result of over-enrichment, likely occurring towards the end of the binder selection process (i.e., during phage display). although the high sybody sequence diversity was not unexpected due to the very large size of the sybody libraries, this unique and autonomous multi-institute sybody selection campaign clearly demonstrates that it is possible to get access to an enormous variety of binders via independent selection experiments. the selected unique sybodies were individually expressed in e. 
coli and purified via ni-nta affinity chromatography and size exclusion chromatography. ultimately, sybodies revealed appropriate biochemical features with respect to solubility, yield, and monodispersity, in order to proceed with further characterization. for an in vitro kinetic analysis of sybody interactions with the viral spike, we employed grating-coupled interferometry (gci) [ ] to probe sybody binding to immobilized rbd-vyfp. first, the purified sybodies were subjected to an off-rate screen, which revealed six sybodies (sb# , sb# , sb# , sb# , sb# , and sb# ) with strong binding signals and comparatively slow off-rates. binding constants were then determined by measuring on-and off-rates over a range of sybody concentrations, revealing affinities for rbd within a range of - nm using a langmuir : model for data fitting (fig. s a ). next, we evaluated the ability of the purified sybodies to compete with ace binding by elisa. to this end, binding of purified rbd to immobilized hace was measured in the presence or absence of an excess of each purified sybody ( fig. a) . nearly all sybodies were found to inhibit rbd-hace interaction. the signal decrease relative to unchallenged rbd was modest for most sybodies, with an average signal reduction of about %. however, five sybodies (sb# , sb# , sb# , sb# , and sb# ) reduced rbd-attributable elisa signal to near-background levels, implying that these binders were able to almost entirely abolish the interaction between rbd and hace . notably, these five hace -inhibiting sybodies were among the six aforementioned highest affinity rbd binders. we sought to determine if our set of sybodies recognized separate epitopes on the rbd surface. 
elisa experiments demonstrated that incubation of sb# with s- p only slightly diminished the ability of the spike to bind to immobilized sb# , whereas pre-incubation with sb# , sb# , sb# , sb# , or sb# almost completely prevented the interaction of the spike protein with immobilized sb# (fig. s ). this suggested that sb# and sb# can bind simultaneously to the spike. therefore, we characterized sb# and sb# in more detail and performed gci measurements with the rbd (as a repetition of the initial experiments), as well as s- p and an even further stabilized version of the spike protein containing six prolines (hexapro [ ] ), termed here s- p (fig. b, fig. s b ). in contrast to the data generated using rbd, for which the langmuir : model was used to fit the data, the experimental data for s- p and s- p could only be fitted adequately using a heterogeneous ligand model, which accounts for a high and a low affinity binding site. as our cryo-em analysis revealed binding of three sb# molecules and two sb# molecules to a highly asymmetric spike trimer (see below), the heterogeneous ligand model could be justified. in the case of sb# , the higher binding affinities (kd ) for s- p and s- p ( nm and nm, respectively) were found to be similar to the one determined for the rbd ( nm). in contrast, kd of sb# was more than -fold stronger for s- p and s- p ( nm and nm, respectively) than for rbd ( nm) (fig. b, fig. s b ). to investigate if both sybodies can also bind simultaneously in the context of the trimeric full-length spike protein, we used gci to monitor binding events of the sybodies injected either alone or in combination (fig. c) . when we analyzed the sybodies against coated rbd, the maximal binding signals for sb# ( pg/mm ) and sb# ( pg/mm ) were approximately additive when both sybodies were co-injected ( pg/mm ), clearly showing that both sybodies can bind simultaneously.
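the two fitting models named above can be written down compactly. the following python sketch shows the standard one-to-one langmuir association curve and a heterogeneous ligand curve built as a sum of independent one-to-one components; it is not the fitting routine of the gci instrument software, and the function names and all rate constants in the example are hypothetical.

```python
import math


def dissociation_constant(k_on, k_off):
    """K_D = k_off / k_on for a 1:1 interaction (k_on in 1/(M*s), k_off in 1/s)."""
    return k_off / k_on


def langmuir_association(t, conc, k_on, k_off, r_max):
    """Binding signal at time t (s) while analyte at `conc` (M) is injected."""
    k_obs = k_on * conc + k_off                      # observed association rate
    r_eq = r_max * conc / (conc + k_off / k_on)      # equilibrium plateau
    return r_eq * (1.0 - math.exp(-k_obs * t))


def heterogeneous_ligand_association(t, conc, sites):
    """Sum of independent 1:1 components; `sites` holds (k_on, k_off, r_max) tuples."""
    return sum(langmuir_association(t, conc, *site) for site in sites)


# Hypothetical constants, not values from the study.
toy_kd = dissociation_constant(k_on=1e5, k_off=1e-3)          # 10 nM
toy_plateau = langmuir_association(1e6, toy_kd, 1e5, 1e-3, r_max=100.0)
toy_two_sites = heterogeneous_ligand_association(
    1e6, toy_kd, [(1e5, 1e-3, 100.0), (1e5, 1e-3, 100.0)]
)
```

at an analyte concentration equal to the dissociation constant, the one-to-one plateau is half of the maximal signal, which is a convenient sanity check when reading sensorgrams; a curve that needs two components with different constants to fit is what motivates the heterogeneous ligand model.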
interestingly, when the same analysis was performed using s- p and s- p, the binding signals of the co-injections ( pg/mm for s- p and pg/mm for s- p) were clearly greater than the sum of the binding signals of sb# and sb# when injected individually ( pg/mm and pg/mm for s- p and pg/mm and pg/mm for s- p). this suggests cooperative binding of the two sybodies to the full-length spike protein, but not of the isolated rbd. to investigate interference of sb# and sb# with ace binding in detail, we performed an ace competition experiment using gci. to this end, s- p was coated on a gci chip and sb# ( nm), sb# ( nm) and the non-randomized convex sybody control (sb# , nm) were injected alone or together with ace ( nm) to monitor binding (fig. b) . indeed, sb# did not bind when injected alone and consequently did not disturb ace binding when co-injected. conversely, both sb# and sb# were found to dominate over ace in the association phase during co-injection, and the resulting curves are highly similar to what was observed when these two sybodies were injected alone. this experiment unequivocally demonstrates a strong competition of ace binding by the two sybodies using s- p as target. ace competition by sb# to this extent was surprising in view of the initial ace elisa competition experiment ( fig. a) . however, the seeming discrepancy can be explained by our observation that the affinity of sb# for s- p (used in the gci experiment) is more than times stronger than for the isolated rbd (used in the elisa experiment). to determine the inhibitory activity of the identified sybodies, we conducted in vitro neutralization experiments. towards this aim, we employed engineered vesicular stomatitis viruses (vsv) that were pseudotyped with sars-cov- spikes [ ] . interestingly, only the high affinity sybodies (sb# and sb# ), which also efficiently blocked receptor binding, exhibited potent neutralizing activity with ic values of . µg/ml ( nm) and . µg/ml ( nm), respectively (fig. 
a , table ). in contrast, sb# and sb# inhibited pseudotyped vsvs only to a limited extent. in agreement with the high affinity of sb# for soluble spike and its ability to compete with ace in the context of s- p as determined by gci, the ic values were similar to those observed for sb# ( . µg/ml, nm). since sb# and sb# can bind simultaneously to the rbd and the full-length spike protein, we mixed sb# and sb# together to investigate potential additive or synergistic neutralizing activity of these two independent sybodies. indeed, consistent with the binding assays, the simultaneous presence of both sybodies resulted in improved neutralization profiles with ic values reaching . µg/ml ( nm) (fig. a , table ). note that no neutralization of the pseudotype virus was observed in a control experiment using a nanobody directed to mcherry at the highest concentration ( µg/ml), thus validating the specificity of the identified sybodies. in addition to the individual sybodies, we also explored potential avidity effects of sybodies genetically fused to human igg fc domains. the respective sybody-fc constructs (sb# -fc, sb# -fc, sb# -fc, sb# -fc and sb# -fc) exhibited vsv pseudotype ic values in the range of . to . µg/ml ( nm to nm) and were therefore clearly improved over the respective values of the sybodies alone, which ranged from . to µg/ml ( nm to nm) ( table ). this suggests that the bivalent arrangement of the fc fusion constructs resulted in a discernible avidity effect. it is interesting to note that for some sybodies the gain of neutralization potency was much higher (e.g. for sb# , the ic values for single sybody versus fc-fused sybodies were nm versus nm), whereas for others it was only modest (e.g. for sb# , the respective values were nm versus nm). this indicates that the avidity effect strongly depends on the binding epitope. next, the neutralizing activity of the various sybodies was assessed with live sars-cov- (strain münchen- . 
/ / ) [ ] employing a % neutralization dose (nd ) assay (table ) . sybodies which exhibited the least potent neutralization activities in the pseudotyped vsv assays (sb# , sb# and sb# ), did not block sars-cov- infection. in sharp contrast, sb# and sb# successfully inhibited sars-cov- cell entry, with nd values of . and . µg/ml, respectively. with the exception of sb# , the overall neutralization data obtained with live sars-cov- virus corroborated the findings obtained with the pseudotyped vsv system, although the sybodies were less potent against live sars-cov- . the binding and neutralization data, as well as the structural data presented below, highlighted that sb# and sb# are (i) the most potent neutralizing sybodies; (ii) bind to non-overlapping epitopes on the rbd surface; and (iii) exhibit synergistic virus neutralizing effects. these findings provided the basis to investigate whether fusing both sybodies would further improve the neutralization potency. towards this aim, we engineered three constructs consisting of sb# and sb# fused via a flexible linker (ggggs) of various length (repetitions of x, x or x) (fig. a ). the resulting bi-specific sybodies were accordingly designated gs , gs and gs , respectively. the binding kinetics of these three bispecific sybodies were then analyzed by gci using coated s- p (fig. b) , and binding affinities were found to range between pm to pm (using a langmuir : fitting model). this pronounced improvement of the affinity of the bispecific sybodies over the individual binders indicated that the two sybodies of the fused construct bind simultaneously to the spike protein, thereby resulting in a strong avidity effect. in agreement with the improved affinity, all three engineered bispecific constructs displayed highly potent neutralizing activities against both pseudotyped virus and live sars-cov- (ic values of gs : . µg/ml ( nm), gs : . µg/ml ( . nm) and gs : . µg/ml ( . nm) (fig. c , table ). 
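because potencies are reported both as mass concentrations (µg/ml) and as molar concentrations (nm), the conversion between the two is sketched here. the molecular weight in the example is a placeholder chosen for illustration, not the weight of any construct in this study, and the function names are assumptions.

```python
def ug_per_ml_to_nm(conc_ug_per_ml, mw_kda):
    """Convert a mass concentration (ug/ml) to a molar concentration (nM).

    1 ug/ml equals 1e-3 g/l; dividing by the molecular weight in g/mol
    (mw_kda * 1000) gives mol/l, and multiplying by 1e9 gives nM.
    The factors collapse to conc / mw_kda * 1000.
    """
    if mw_kda <= 0:
        raise ValueError("molecular weight must be positive")
    return conc_ug_per_ml / mw_kda * 1000.0


def nm_to_ug_per_ml(conc_nm, mw_kda):
    """Inverse conversion: nM to ug/ml for a protein of mw_kda kilodaltons."""
    if mw_kda <= 0:
        raise ValueError("molecular weight must be positive")
    return conc_nm * mw_kda / 1000.0


# Placeholder example: 1 ug/ml of a hypothetical 15 kDa binder.
print(round(ug_per_ml_to_nm(1.0, 15.0), 2))
```

the same mass concentration corresponds to a lower molar concentration for a larger construct, which is worth keeping in mind when comparing single binders with fused or fc-linked formats.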
for live sars-cov- virus, nd values of gs : . µg/ml ( nm), gs : . µg/ml ( nm) and gs : . µg/ml ( nm) were determined (table ) . collectively, these data show that fusing sb# and sb# via flexible linkers results in bispecific sybodies with dramatically improved neutralization activity (by a factor of about times compared to the single binders). to gain structural insights into how sb# and sb# recognize the rbd, we performed single particle cryo-em analysis of the spike protein in complex with the sybodies. to generate complexes, sybodies (alone or in combination) were mixed with spike protein at a molar ratio of . : (sybody:spike monomer), prior to a final purification step using size-exclusion chromatography. in total, three cryo-em datasets were collected, allowing a glimpse of the spike protein either simultaneously bound to both sybodies, or associated to sb# or sb# alone ( fig. s - , table s ). the highest resolution was obtained for the spike protein in complex with both sybodies (fig. s ). in contrast, the structures with the individual sybodies were determined based on fewer particles and mainly served to unambiguously assign the binding epitopes of sb# (fig. s ) and sb# (fig. s ). although the global resolution of the spike protein in complex with both sybodies is around Å, the local resolution of the rbds with bound sybodies was only in the range of - Å, presumably due to conformational flexibility (fig. s ) . therefore, we did not build full models of the sybodies and provide details only on their interaction surface with the rbds. however, the cryo-em density is good enough to describe the general epitope location and the distinct conformations adopted by the rbds. for better assessment and visualization, we fitted homology models of the respective sybodies into the densities ( fig. s -s ). the sybody homology models were based on pdb: k k [ ] in case of the concave sb# and pdb: m [ ] for the convex sb# . 
analysis of the spike/sb# /sb# particles after d classification revealed that the spike protein adopts two distinct conformations (fig. s ). the first conformation ( % of particles) has a three-fold symmetry, with three rbds in the up conformation ( up) and two sybodies bound to each of the rbds, confirming that sb# and sb# bind simultaneously (fig. a, fig. s c , f and s a). according to the spike structure obtained with sb# alone (detailed analysis below, fig. s and s ), sb# binds to the top of the rbd. its binding epitope consists of two regions (residues - and - ) and thereby strongly overlaps with the ace binding site (fig. b ). in contrast, sb# binds to the side of the rbd ( fig. s and s d-e) and recognizes a conserved epitope [ ] clearly distinct from the ace interaction site, which includes residues - and - and is buried if the rbd is in its down conformation. although the binding epitope of sb# is clearly distinct from the one of ace , there would be a steric clash between the sb# backside loops and ace , if ace docks to the rbd (fig. b ). this accounts for sb# 's ability to compete with ace as evident from gci analyses (fig. b ). the second resolved conformation ( % of particles) of the spike/sb# /sb# complex is asymmetric with the rbds in three distinct states, and was obtained at a global resolution of . Å (fig. c, fig. s c , g and s b). in this case, three sb# and two sb# were bound. the first rbd was in the up conformation, having sb# and sb# bound in an analogous fashion as in the symmetric up structure. the second rbd adopted a down state with only sb# bound. this conformation of sb# bound rbd appears to act as a wedge, pushing the third rbd outward and away from the three-fold symmetry axis (fig. d ). the third rbd was in an up-out conformation with sb# and sb# bound. however, the density for sb# was very weak, indicating either a very high flexibility or a substoichiometric occupancy. 
we refer to this novel asymmetric spike conformation as a up/ up-out/ down state (fig. c ). virtually the same asymmetric up/ up-out/ down spike conformation was observed for the spike/sb# complex, reinforcing our interpretation that wedging by sb# is responsible for the outward movement of the second up-rbd (fig. s ). however, according to our analysis, comprising only a limited number of images (fig. s d ), sb# alone was unable to induce the up conformation, suggesting that adoption of the up state requires the synergistic action of both sybodies to populate this symmetric conformation. finally, analysis of the spike/sb# complex dataset revealed two distinct populations ( figure s and s ). the most abundant class showed an up down conformation without sybody bound, which is identical to the one obtained for the spike protein alone [ , ] . the second structure featured two rbds in an up conformation with bound sb# . density for the third rbd was very weak, presumably due to high intrinsic flexibility, hindering the interpretation of its exact position and conformation. we therefore refer to this conformation as an up/ flexible state. structural comparisons revealed that sb# cannot access its epitope in the context of the up down conformation, due to steric clashes with the neighboring rbd (fig. s b ). in order to bind, at least two rbds need to be in the up conformation. in summary, both sybodies stabilized the up conformation of the rbds. notably, without sybodies, s- p predominantly assumes an equilibrium between the down and the up down conformation [ , ] . upon addition of sb# , the conformational equilibrium was shifted towards an asymmetric up/ up-out/ down state, whereas addition of sb# favored an asymmetric state with rbds adopting a up/ flexible conformation. when added together, the sybodies appear to synergistically act to stabilize two states: a predominant up state, as well as the asymmetric up/ up-out/ down state. 
in this work, we have demonstrated the ability of our rapid in vitro selection platform to generate sybodies directed to the sars-cov- rbd. the biochemical characterization of these sybodies led to the identification of a high-affinity subset of binders, which were further analyzed in depth using structural, biochemical and functional methods. thereby, we found a pair of sybodies, sb# and sb# , which bind simultaneously to the rbd. both sybodies were found to compete with ace binding, albeit likely through different mechanisms. while the binding epitope of sb# directly overlaps with the one of ace , this is not the case for sb# , which interferes with ace through a steric clash at the sybody backside (fig. b ). in agreement with their similar affinities for the s- p spike protein, sb# and sb# exhibited similar neutralization efficiencies in the range of . - . µg/ml ( nm). we noted a moderate synergistic effect in the virus neutralization test when both individual sybodies were mixed together, resulting in an improved ic of . µg/ml ( nm). this synergy can be explained by the concerted action of the sybodies to compete with ace docking via epitope blockage and steric clashing. cryo-em analyses revealed distinct binding epitopes for the two sybodies sb# and sb# . the s- p spike protein we used for cryo-em was shown to predominantly adopt the down and up/ down conformations [ , ] , whereas the s- p/sb# /sb# complex adopts either a novel up/ up-out/ down or a up conformation. the structures further revealed that sb# can only bind to the up-rbd. the inability of sb# (and to some degree also sb# ) to bind to the down-rbd resulted in conformational selection of spike protein with at least two up rbds, thereby shifting the conformational equilibrium of the spike. 
it is interesting to note that the binding epitope of sb# is highly conserved between sars-cov- and sars-cov- , because it constitutes an interaction interface that, upon binder engagement, stabilizes the rbd in the down conformation. the same conserved epitope is also recognized by the human antibodies cr (isolated from a sars-cov- infected patient and showing cross-specificity against sars-cov- ) and ey a (vice versa) [ , ] (fig. ) . hence, the binding epitope of sb# is less likely to be remodelled due to drug-induced selection pressures, thereby limiting the evolution of sars-cov- escape mutants if sb# were to be used as a therapeutic antiviral drug. despite sharing a similar epitope on the rbd, cr and ey a do not display an obvious direct steric clash with ace and in contrast to sb# do not compete directly with ace binding (fig. ). since cr and ey a have strong neutralizing capacity, inhibition mechanisms in addition to ace blockage could exist, which may also apply for sb# and sb# . however, for the ey a antibody it has been proposed that surface glycans on ace may interact with ey a and at least partially account for its neutralizing effect [ ] . akin to the cr and ey a antibodies, our sybodies share the ability to stabilize spike conformations with -or -up rbds. thereby, the spike protein may be destabilized, resulting in the premature and unproductive transitions to the irreversible post-fusion state. this mechanism was dubbed "receptor mimicry" in a study on a neutralizing antibody s , which only bound to up-rbds and thereby triggered fusogenic conformational changes of sars-cov- spike [ ] . however, since we obtained well-resolved cryo-em structures with sb# and sb# bound to the spike after incubating the complex for more than hours, we consider the mechanism of receptor mimicry less plausible in our case. 
yet, it is important to note that recent investigations of non-engineered sars-cov- spike protein extracted from membranes by detergents revealed unique structural features not found in the stabilized pre-fusion spike, including a stronger compaction of the spike trimer and the predominance of the -down rbd conformation [ ] . further, the study highlighted a high propensity of the native sars-cov- spike to spontaneously transit to the post-fusion state without interacting with ace . therefore, it is still possible that the sybodies (and in particular sb# ) accelerate this spontaneous spike inactivation process in the context of live sars-cov- virus, without affecting the pre-fusion stabilized soluble spike protein used for cryo-em analyses. the recent months have brought about a large number of publications on neutralizing antibodies [ ] [ ] [ ] [ ] , nanobodies [ , , , ] and other binder scaffolds [ ] . for the smaller scaffolds, in particular in case of nanobodies, fusion of binders via flexible linkers emerged as a promising strategy to improve neutralization efficiencies by exploiting avidity effects in the context of the trimeric spike protein. however, strategies to exploit genetically fused nanobodies so far included only identical binders recognizing the same epitope on the rbds [ ] . a crucial issue regarding development of reliable therapeutics against enveloped rna viruses such as sars-cov- is their ability to rapidly develop resistance mutations. recently, the emergence of resistance against monoclonal antibodies targeting the sars-cov- spike-rbd was investigated in vitro [ ] . while drug-resistant viruses indeed emerged rapidly when such antibodies with overlapping epitopes were administered either individually or in combination, escape mutants were not generated when treated with cocktails of non-competing antibodies. 
because the neutralizing sybody pair (sb# /sb# ) identified in this study was found to simultaneously bind to two spatially-distinct epitopes on the spike-rbd (of which one is highly conserved among sarbecoviruses [ ] ), we anticipate that our rationally engineered single-format bispecific constructs, which displayed highly potent neutralization profiles, may also exhibit high resistance barriers. although monoclonal antibodies (mabs) hold great promise in modern medicine, their manufacture remains tedious, time-consuming and expensive. in addition, the administration of mabs must be performed by medical professionals at hospitals, which further hampers their fast and global availability. conversely, single domain antibodies and their derivative multi-component formats can be produced easily, quickly, and inexpensively in bacteria, yeast, or mammalian cell culture. furthermore, the biophysical properties of single domain antibodies make them feasible for development in an inhalable formulation, thereby not only enabling direct delivery to nasal and lung tissues (two key sites of sars-cov- replication), but also offering the potential of self-administration. overall, we present a robust platform to generate highly potent multi-specific biomolecules against coronaviruses. in particular, the rapid selection of sybodies and their swift biophysical, structural and functional characterization, provide a foundation for the accelerated reaction to potential future pandemics. finally, our recently described flycode technology can be utilized for deeper interrogation of sybody selection pools, in order to facilitate discovery of exceptional sybodies possessing very slow off-rates or recognizing rare epitopes [ ] . a gene encoding sars-cov- residues pro -gly (rbd, genbank accession qhd . ), downstream from a modified n-terminal human serum albumin secretion signal [ ] , was chemically synthesized (geneuniversal). 
this gene was subcloned using fx technology [ ] into a custom mammalian expression vector [ ] , appending a c-terminal c protease cleavage site, myc tag, venus yfp [ ] , and streptavidin-binding peptide [ ] onto the open reading frame (rbd-vyfp). - ml of suspension-adapted expi cells (thermo) were transiently transfected using expifectamine according to the manufacturer's protocol (thermo), and expression was continued for - days in a humidified environment at °c, % co . cells were pelleted ( g, min), and culture supernatant was filtered ( . µm mesh size) before being passed three times over a gravity column containing nhs-agarose beads covalently coupled to the anti-gfp nanobody k k [ ] , at a resin:culture ratio of ml resin per ml expression culture. resin was washed with column-volumes of rbd buffer (phosphate-buffered saline, ph . , supplemented with additional . m nacl), and rbd-vyfp was eluted with . m glycine, ph . , via sequential . ml fractions, without prolonged incubation of resin with the acidic elution buffer. fractionation tubes were pre-filled with / vol m tris, ph . ( µl), such that elution fractions were immediately ph-neutralized. fractions containing rbd-vyfp were pooled, concentrated, and stored at °c. purity was estimated to be > %, based on sds-page (not shown). yield of rbd-vyfp was approximately - μg per ml expression culture. a second purified rbd construct, consisting of sars-cov- residues arg -phe fused to a murine igg fc domain (rbd-fc) expressed in hek cells, was purchased from sino biological (catalogue number: -v h, µg were ordered). purified full-length spike ectodomain (ecd) comprising s and s (residues val -pro ) with a c-terminal his-tag and expressed in baculovirus-insect cells was purchased from sino biological (catalogue number: -v b , µg were ordered). 
the prefusion ectodomain of the sars-cov spike protein containing two stabilizing proline mutations (s- p) (residues - ) [ ] , was transiently transfected into x suspension-adapted expicho cells (thermo fisher) using mg plasmid dna and mg of pei max (polysciences) per l procho medium (lonza) in a l erlenmeyer flask (corning) in an incubator shaker (kühner). one hour post-transfection, dimethyl sulfoxide (dmso; applichem) was added to % (v/v). incubation with agitation was continued at °c for days. l of filtered ( . µm) cell culture supernatant was clarified. then, a ml gravity flow strep-tactin®xt superflow® column (iba lifescience) was rinsed with ml buffer w ( mm tris, ph . , mm nacl, mm edta) using gravity flow. the supernatant was added to the column, which was then rinsed with ml of buffer w (all with gravity flow). finally, six elution steps were performed by adding each time . ml of buffer bxt ( mm biotin in buffer w) to the resin. all purification steps were performed at °c. to remove amines, all proteins were first extensively dialyzed against rbd buffer. proteins were concentrated to µm using amicon ultra concentrator units with a molecular weight cutoff of - kda. subsequently, the proteins were chemically biotinylated for min at °c using nhs-biotin (thermo fisher, # ) added at a -fold molar excess over target protein. immediately after, the three samples were dialyzed against tbs ph . . during these processes (first dialysis/concentration/biotinylation/second dialysis), %, %, % and % of the rbd-vyfp, rbd-fc, ecd and s- p respectively were lost due to adsorption to the concentrator filter or due to aggregation. biotinylated rbd-vyfp, rbd-fc and ecd were diluted to µm in tbs ph . , % glycerol and stored in small aliquots at - °c. biotinylated s- p was stored at °c in tbs ph . . sybody selections with the three sybody libraries concave, loop and convex were carried out as previously detailed [ ] . 
in short, one round of ribosome display was followed by two rounds of phage display. binders were selected against two different constructs of the sars-cov- rbd: an rbd-vyfp fusion and an rbd-fc fusion. mbp was used as background control to determine the enrichment score by qpcr [ ] . in order to avoid enrichment of binders against the fusion proteins (yfp and fc), we switched the two targets after ribosome display (fig. s ). for the off-rate selections we did not use non-biotinylated target proteins as described [ ] because we did not have the required amounts of purified target protein. instead, we employed a pool competition approach. after the first round of phage display, all three libraries of selected sybodies, for both target-swap selection schemes, were subcloned into the psb_init vector (giving approximately clones) and expressed in e. coli mc cells. the resulting three expressed pools were subsequently combined, giving one sybody pool for each selection scheme. these two final pools were purified by ni-nta affinity chromatography, followed by buffer exchange of the main peak fractions using a desalting pd column in tbs ph . to remove imidazole. the pools were eluted with . ml of tbs ph . . these two purified pools were used for the off-rate selection in the second round of phage display at concentrations of approximately µm for selection variant (competing for binding to rbd-fc) and µm for selection variant (competing for binding to rbd-vyfp). the volume used for off-rate selection was µl, with . % bsa and . % tween- added to pools immediately prior to the competition experiment. off-rate selections were performed for minutes. elisas were performed as described in detail [ ] . single clones were analyzed for each library of each selection scheme. 
since the rbd-fc construct was incompatible with our elisa format due to the inclusion of protein a to capture an α-myc antibody, elisa was performed only for the rbd-vyfp ( nm) and the ecd ( nm) and later on with the s- p ( nm). of note, the three targets were analyzed in three separate elisas. as negative control to assess background binding of sybodies, we used biotinylated mbp ( nm). positive elisa hits were sequenced (microsynth, switzerland). the unique sybodies were expressed and purified as described [ ] . in short, all sybodies were expressed overnight in e. coli mc cells in ml cultures. the next day the sybodies were extracted from the periplasm and purified by ni-nta affinity chromatography (batch binding) followed by size-exclusion chromatography (sec) using a sepax srt- c sec column equilibrated in tbs, ph . , containing . % (v/v) tween- (detergent was added for subsequent kinetic measurements). six out of the binders (sb# , sb# , sb# , sb# , sb# , sb# ) were excluded from further analysis due to suboptimal behavior during sec analysis (i.e. aggregation or excessive column matrix interaction). to generate the bispecific sybodies (sb# -sb# fusion with variable glycine/serine linkers), sb# was amplified from psb-init_sb# (addgene # ) using the forward primer atatatgctcttcaagtcaggttc and the reverse primer tatatagctcttcaagaaccgccaccgccgctaccgccaccacctgcgctcacagtcac, encoding x a ggggs motif, followed by a sapi cloning site. sb# was amplified from psb-init_sb# (addgene # ) using forward primers (atatatgctcttcttctcaagtccagctggtgg), (atatatgctcttcttctggtggtggcggtagcggcggtggcggtagtcaagtccagctggtgg) or (atatatgctcttcttctggtggtggcggtagcggcggtggcggttctggtggtggcggtagcggcggtggcggtagtcaagtccagctggtgg) each combined with the reverse primer tatatagctcttcctgcagaaac. the forward primers start with a sapi site (compatible overhang to sb# reverse primer), followed by none, x or x repeats of the ggggs motif. 
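as a quick consistency check on the linker-encoding forward primers, the stretch between the cloning-site portion (atatatgctcttcttct) and the start of the sybody coding sequence (caagtccagctggtgg) can be translated codon by codon. the sketch below is illustrative only: where the linker segment starts and ends is an assumption made here, the helper names are hypothetical, and the codon table covers only glycine and serine codons.

```python
# Minimal codon table restricted to glycine and serine, which is all a
# (GGGGS)n linker needs; any other codon is reported as an error.
CODON_TABLE = {
    "GGT": "G", "GGC": "G", "GGA": "G", "GGG": "G",
    "AGC": "S", "AGT": "S", "TCT": "S", "TCC": "S", "TCA": "S", "TCG": "S",
}


def translate_linker(dna):
    """Translate a linker-coding DNA segment (reading frame starts at base 1)."""
    dna = dna.replace(" ", "").upper()
    if len(dna) % 3 != 0:
        raise ValueError("segment length is not a multiple of three")
    residues = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        if codon not in CODON_TABLE:
            raise ValueError("codon %s is not a glycine or serine codon" % codon)
        residues.append(CODON_TABLE[codon])
    return "".join(residues)


def count_ggggs_repeats(protein):
    """Return n if the sequence is exactly (GGGGS)n, otherwise raise."""
    motif = "GGGGS"
    n, remainder = divmod(len(protein), len(motif))
    if remainder or protein != motif * n:
        raise ValueError("sequence is not a pure GGGGS repeat")
    return n


# Linker segments copied from the second and third forward primers,
# split into 15-nucleotide chunks (one GGGGS unit each) for readability.
second_primer_linker = "ggtggtggcggtagc" "ggcggtggcggtagt"
third_primer_linker = ("ggtggtggcggtagc" "ggcggtggcggttct"
                       "ggtggtggcggtagc" "ggcggtggcggtagt")

for segment in (second_primer_linker, third_primer_linker):
    peptide = translate_linker(segment)
    print(peptide, count_ggggs_repeats(peptide))
```

the first forward primer has no bases between the two flanking sequences, so it contributes no linker repeat of its own.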
the pcr product of sb# was cloned in frame with each of the three pcr products of sb# into psb-init using fx-cloning [ ] , thereby resulting in three fusion constructs with linkers containing x, x or x ggggs motifs as flexible linkers between the sybodies (called gs , gs and gs , respectively). the three bispecific fusion constructs gs , gs and gs were expressed and purified the same way as single sybodies [ ] . the high affinity sybodies were cloned and produced as human igg fc-fusions by absolute antibody, where they are commercially available. purified recombinant hace protein (mybiosource, cat# mbs ) was diluted to nm in phosphate-buffered saline (pbs), ph . , and μl aliquots were incubated overnight on nunc maxisorp -well elisa plates (thermofisher # - - ) at °c. elisa plates were washed three times with μl tbs containing . % (v/v) tween- (tbst). plates were blocked with μl of . % (w/v) bsa in tbs for h at room temperature. μl samples of biotinylated rbd-vyfp ( nm) mixed with individual purified sybodies ( nm) were prepared in tbs containing . % (w/v) bsa and . % (v/v) tween- (tbs-bsa-t) and incubated for . h at room temperature. these μl rbd-sybody mixtures were transferred to the plate and incubated for minutes at room temperature. μl of streptavidin-peroxidase (merck, cat#s ) diluted : in tbs-bsa-t was incubated on the plate for h. finally, to detect bound biotinylated rbd-vyfp, μl of development reagent containing , ′, , ′-tetramethylbenzidine (tmb), prepared as previously described [ ] , was added, color development was quenched after - min via addition of μl . m sulfuric acid, and absorbance at nm was measured. background-subtracted absorbance values were normalized to the signal corresponding to rbd-vyfp in the absence of added sybodies. purified sybodies carrying a c-terminal myc-his tag (sb_init expression vector) were diluted to nm in µl pbs ph . and directly coated on nunc maxisorp -well plates (thermofisher # - - ) at °c overnight. 
the plates were washed once with µl tbs ph . per well followed by blocking with µl tbs ph . containing . % (w/v) bsa per well. in parallel, chemically biotinylated prefusion spike protein (s- p) at a concentration of nm was incubated with nm sybodies for h at room temperature in tbs-bsa-t. the plates were washed three times with µl tbs-t per well. then, µl of the s- p-sybody mixtures were added to the corresponding wells and incubated for min, followed by washing three times with µl tbs-t per well. µl streptavidin-peroxidase polymer (merck, cat#s ) diluted : in tbs-bsa-t was added to each well and incubated for min, followed by washing three times with µl tbs-t per well. finally, to detect s- p bound to the immobilized sybodies, µl elisa developing buffer (prepared as described previously [ ] ) was added to each well, incubated for h (due to low signal) and absorbance was measured at nm. as a negative control, tbs-bsa-t devoid of protein was added to the corresponding wells instead of a s- p-sybody mixture. kinetic characterization of sybodies binding onto sars-cov- spike proteins was performed using gci on the wavesystem (creoptix ag, switzerland), a label-free biosensor. for the off-rate screening, biotinylated rbd-vyfp and ecd were captured onto a streptavidin pcp-sta wavechip (polycarboxylate quasi-planar surface; creoptix ag) to a density of - pg/mm . sybodies were first analyzed by an off-rate screen performed at a concentration of nm (data not shown) to identify binders with sufficiently high affinities. the six sybodies sb# , sb# , sb# , sb# , sb# , and sb# were then injected at increasing concentrations ranging from . nm to μm (three-fold serial dilution, concentrations) in mm tris ph . , mm nacl supplemented with . % tween- (tbs-t buffer). sybodies were injected for s at a flow rate of μl/min per channel and dissociation was set to s to allow the return to baseline. 
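titrations of this kind are commonly interpreted with a langmuir 1:1 interaction model, in which the equilibrium dissociation constant is the ratio of the off-rate to the on-rate. the sketch below shows the textbook form of that model together with a two-site sum as one simple reading of a heterogeneous ligand model. it is not the fitting routine of the instrument software, and all function names and rate constants are hypothetical.

```python
import math


def kd_from_rates(kon, koff):
    """Equilibrium dissociation constant KD (M) from kon (1/(M*s)) and koff (1/s)."""
    return koff / kon


def langmuir_response(t, conc, kon, koff, rmax, t_assoc):
    """Sensor response of a 1:1 interaction at time t (s).

    Analyte at concentration conc (M) is injected from t = 0 to t = t_assoc,
    after which only buffer flows and the complex dissociates.
    """
    kobs = kon * conc + koff                   # observed association rate
    r_eq = rmax * conc / (conc + koff / kon)   # plateau at this concentration
    if t <= t_assoc:
        return r_eq * (1.0 - math.exp(-kobs * t))
    r_end = r_eq * (1.0 - math.exp(-kobs * t_assoc))
    return r_end * math.exp(-koff * (t - t_assoc))


def heterogeneous_ligand_response(t, conc, sites, t_assoc):
    """Sum of independent 1:1 sites, each given as a (kon, koff, rmax) tuple."""
    return sum(langmuir_response(t, conc, kon, koff, rmax, t_assoc)
               for kon, koff, rmax in sites)


# Hypothetical rate constants: kon = 1e5 1/(M*s), koff = 1e-3 1/s.
kd = kd_from_rates(1e5, 1e-3)
print(kd)
# At an analyte concentration equal to KD the plateau is half of rmax.
print(langmuir_response(1e5, kd, 1e5, 1e-3, 100.0, t_assoc=1e5))
```

in the two-site form, a high-affinity and a low-affinity site each contribute their own on-rate, off-rate and maximal signal, which is why such data cannot be described by a single pair of rate constants.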
in order to determine the binding kinetics of sb# and sb# against intact spike proteins, the ligands rbd-vyfp, s- p and s- p were captured onto a pcp-sta wavechip (creoptix ag) to a density of pg/mm , pg/mm and pg/mm respectively. sb# and sb# were injected at concentrations ranging from . nm to nm or . nm to nm, respectively ( -fold serial dilution, concentrations) in tbs-t buffer. sybodies were injected for s at a flow rate of μl/min and dissociation was set to s. in order to investigate if sb# and sb# bind simultaneously to the rbd, s- p and s- p, both binders were either injected alone at a concentration of nm or mixed together at the same individual concentrations at a flow rate of μl/min for s in tbs-t buffer. to measure binding kinetics of the three bispecific fusion constructs, gs , gs and gs , s- p was captured as described above to a density of pg/mm and increasing concentrations of the bispecific fusion constructs ranging from nm to nm ( -fold serial dilution, concentrations) in tbs-t buffer at a flow rate of μl/min. because of the slow off-rates, we performed a regeneration protocol by injecting mm glycine ph for s after every binder injection. for ace competition experiments, s- p was captured as described above. then sb# , sb# or sb# (non-randomized convex sybody control) were either injected individually or premixed with ace in tbs-t buffer. sybody concentrations were at nm and ace concentration was at nm. all sensorgrams were recorded at °c and the data analyzed on the wavecontrol (creoptix ag). data were double-referenced by subtracting the signals from blank injections and from the reference channel. a langmuir : model was used for data fitting with the exception of the sb# and sb# binding kinetics for the s- p and the s- p spike, which were fitted with a heterogeneous ligand model as mentioned in the main text. pseudovirus neutralization assays have been previously described [ , , ] . 
briefly, propagation-defective, spike protein-pseudotyped vesicular stomatitis virus (vsv) was produced by transfecting hek- t cells with sars-cov- sdel (sars- s carrying an aa cytoplasmic tail truncation) as described previously [ ] . the cells were further inoculated with glycoprotein g trans-complemented vsv vector (vsv*g(luc)) encoding enhanced green fluorescent protein (egfp) and firefly luciferase reporter genes but lacking the glycoprotein g gene [ ]. after h incubation at °c, the inoculum was removed and the cells were washed once with medium and subsequently incubated for h in medium containing : of an anti-vsv-g mab i (atcc, crl- tm ). pseudotyped particles were then harvested and cleared by centrifugation. for the sars-cov- pseudotype neutralization experiments, pseudovirus was incubated for min at °c with different dilutions of purified sybodies, sybody fusions or sybody-fc fusions. subsequently, s protein-pseudotyped vsv*g(luc) was added to vero e cells grown in -well plates ( ' cells/well). at h post infection, luminescence (firefly luciferase activity) was measured using the one-glo luciferase assay system (promega) and cytation cell imaging multi-mode reader (biotek).
the serial dilutions of control sera and samples were prepared in quadruplicates in -well cell culture plates using dmem cell culture medium ( µl/well). to each well, µl of dmem containing tissue culture infectious dose % (tcid ) of sars-cov- (sars-cov- /münchen- . / / ) were added and incubated for min at °c. subsequently, µl of vero e cell suspension ( , cells/ml in dmem with % fbs) were added to each well and incubated for h at °c. the cells were fixed for h at room temperature with % buffered formalin solution containing % crystal violet (merck, darmstadt, germany). finally, the microtiter plates were rinsed with deionized water and immune serum-mediated protection from cytopathic effect was visually assessed.
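Quadruplicate endpoint data of this kind (wells scored as protected or not across a serial dilution) are commonly reduced to a 50% neutralizing dose with the Spearman-Kärber estimator. The following is a minimal sketch under stated assumptions: the function name, the starting dilution and the dilution factor are hypothetical, and the series is assumed to run from full protection to no protection.

```python
import math


def spearman_karber_nd50(first_reciprocal_dilution, dilution_factor,
                         protected, wells_per_dilution):
    """Reciprocal dilution protecting 50% of wells (Spearman-Kaerber).

    protected: number of protected wells at each dilution step, ordered from
    the most concentrated sample to the most dilute one.
    """
    if protected[0] != wells_per_dilution or protected[-1] != 0:
        raise ValueError("series must span 100% to 0% protection")
    step = math.log10(dilution_factor)
    start = math.log10(first_reciprocal_dilution)
    # Sum of the protected fractions over all dilution steps.
    total_fraction = sum(n / wells_per_dilution for n in protected)
    log_nd50 = start - step / 2.0 + step * total_fraction
    return 10 ** log_nd50
```

For example, with hypothetical two-fold dilutions starting at 1:10 and four wells per step, a pattern of 4, 4, 0, 0 protected wells places the estimate at the geometric midpoint between 1:20 and 1:40.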
neutralization dose % (nd ) values were calculated according to the spearman and kärber method.
freshly purified s- p was incubated with a . -fold molar excess of sb# alone or with sb# and sb# and subjected to size exclusion chromatography to remove excess sybody. the sample of s- p with sb# was prepared in an analogous way. the protein complexes were concentrated to . - mg ml - using an amicon ultra- . ml concentrating device (merck) with a kda filter cut-off. . μl of the sample was applied onto holey-carbon cryo-em grids (au r . / . , mesh, quantifoil) that had been glow discharged at - ma for s; the grids were then blotted for - s and plunge frozen into a liquid ethane/propane mixture with a vitrobot mark iv (thermo fisher) at °c and % humidity. samples were stored in liquid nitrogen until further use. screening of the grid for areas with the best ice properties was done with the help of a home-written script to calculate the ice thickness (manuscript in preparation). cryo-em data in selected grid regions were collected in-house on a -kev talos arctica microscope (thermo fisher scientific) with a post-column energy filter (gatan) in zero-loss mode, with a -ev slit and a μm objective aperture. images were acquired automatically with serialem on a k summit detector (gatan) in counting mode at × , magnification ( . Å pixel size) and a defocus range from − . to − . μm. during an exposure time of s, frames were recorded with a total exposure of about electrons/Å . on-the-fly data quality was monitored using focus [ ] . for the s- p/sb# / sb# complex dataset, in total , micrographs were recorded. beam-induced motion was corrected with motioncor _ . . [ ] and the ctf parameters were estimated with ctffind . . [ ] . recorded micrographs were manually checked in focus ( . . ), and micrographs that were outside the defocus range (< . and > μm), contaminated with ice or aggregates, or had a low-resolution estimation of the ctf fit (> Å) were discarded.
, particles were picked from the remaining , micrographs by cryolo . . [ ] , and imported in cryosparc v . . [ ] for d classification with a box size of pixels. after d classification, , particles were imported into relion- . . [ ] and subjected to a d classification without imposed symmetry, where an ab-initio generated map from cryosparc low-pass filtered to Å was used as reference. two classes resembling spike protein, revealed two distinct conformations. one class shows a symmetrical state with all rbds in an up conformation ( up) and both sybodies bound to each rbd ( , particles, %). in the asymmetrical class ( , particles, %) the rbds adopt one up, one up-out and one down conformation ( up/ up-out/ down), where both sybodies are bound to rbds up and up-out state, while only sb# is bound to the down rbd. the up class was further refined with c symmetry imposed. the final refinement, where a mask was included in the last iteration, provided a map at . Å resolution. six rounds of per-particle ctf refinement with beamtilt estimation and re-extraction of particles with a box size of pixels improved resolution further to . Å. the particles were then imported into cryosparc, where non-uniform refinement improved the resolution to Å. the asymmetrical up/ up-out/ down was refined in an analogous manner with no symmetry imposed, resulting in a map at . Å resolution. six rounds of per-particle ctf refinement with beamtilt estimation improved resolution to . Å. a final round of non-uniform refinement in cryosparc yielded a map at . Å resolution. local resolution estimations were determined in cryosparc. all resolutions were estimated using the . cut-off criterion [ ] with gold-standard fourier shell correlation (fsc) between two independently refined half-maps [ ] . the directional resolution anisotropy of density maps was quantitatively evaluated using the dfsc web interface (https:// dfsc.salk.edu) [ ] . 
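The resolution estimate from a Fourier shell correlation curve can be illustrated as a threshold crossing. The sketch below assumes the curve is already available as paired lists of spatial frequency (1/Å) and FSC value; the function name is hypothetical, and the default threshold of 0.143 is the conventional gold-standard value rather than a number legible in the text above.

```python
def fsc_resolution(frequencies, fsc, threshold=0.143):
    """Resolution (in Angstrom) at which the FSC first drops below threshold.

    frequencies: spatial frequencies in 1/Angstrom, ascending.
    fsc: correlation between the two half-maps in each shell.
    Returns None if the curve never crosses the threshold.
    """
    for i in range(1, len(fsc)):
        if fsc[i] < threshold <= fsc[i - 1]:
            f0, f1 = frequencies[i - 1], frequencies[i]
            c0, c1 = fsc[i - 1], fsc[i]
            # Linear interpolation between the two shells around the crossing.
            f_cross = f0 + (c0 - threshold) * (f1 - f0) / (c0 - c1)
            return 1.0 / f_cross
    return None
```

Reconstruction packages apply additional steps (masking, phase randomization) before reading off this value, so the sketch only shows the final read-out.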
a similar approach was performed for the image processing of the s- p/sb# complex. in short, , micrographs were recorded, and , used for image processing after selection. , particles were autopicked via cryolo and subjected to d classification in cryosparc. , selected particles were used for subsequent d classification in relion- . . , where the symmetrical up map, described above, was used as initial reference. the best class comprising , particles ( %) represented an asymmetrical up/ up-out/ down conformation with sb# bound to each rbd. several rounds of refinement and ctf refinement yielded a map of . Å resolution. for the dataset of the s- p/sb# complex, in total , images were recorded, with , used for further image processing. , particles were autopicked via cryolo and subjected to d classification in cryosparc. , selected particles were imported into relion- . . and used for subsequent d classification, where the symmetrical up map, described above, was used as initial reference. two distinct classes of spike protein were found. one class ( , particles, %) revealed a state in which two rbds adopt an up conformation with sb# bound, whereby the density for the third rbd was poorly resolved representing an undefined state. several rounds of refinement and ctf refinement yielded a map of . Å resolution. two other classes, comprising , particles ( %) and , particles ( %), were identical. they show a up/ down configuration without sb# bound to any of the rbds. both classes were processed separately, whereby the class with over k particles yielded the best resolution of . Å and was used for further interpretation. a final non-uniform refinement in cryosparc further improved resolution down to . Å.
the plasmids encoding for the six highest affinity binders are available through addgene (addgene # -# ). purified sb-fc constructs can be commercially obtained from absolute antibody.
the three-dimensional cryo-em density maps are available for the reviewers upon request. all cryo-em data will be deposited in the electron microscopy data bank and include the cryo-em maps, both half-maps, the unmasked and unsharpened refined maps and the mask used for final fsc calculation. raw cryo-em data will be deposited in the electron microscopy public image archive (empiar).
inhibition of the rbd-ace interaction by sybodies. (a) elisa inhibition screen. individual purified sybodies ( nm, sybody number shown on x-axis) were incubated with biotinylated rbd-vyfp ( nm) and the mixtures were exposed to immobilized ace . bound rbd-vyfp was detected with streptavidin-peroxidase/tmb. each column indicates background-subtracted absorbance at nm, normalized to the signal corresponding to rbd-vyfp in the absence of sybody (dashed red line). (b) competition of sybodies and ace for spike binding investigated by gci. s- p was immobilized on the gci chip and sb# ( nm), sb# ( nm) and non-randomized control sybody sb# ( nm) were injected alone or premixed with ace ( nm).
neutralization of viral entry using pseudotyped vsvs. (a) relative infectivity in response to increasing sybody concentrations was determined. the black curve shows data when a mixture of sb# and sb# was added. (b) same assay as in (a) with sybodies fused to human fc to generate bivalency. error bars represent standard deviations of three biological replicates.
sybody selection strategy against sars-cov- rbds. a total of six independent selection reactions were carried out, including a target swap between ribosome display and phage display rounds. enriched sybodies of phage display round of all three libraries were expressed and purified as a pool and used to perform an off-rate selection in phage display round . for each of the six independent selection reactions, clones were picked at random and analyzed by elisa.
microtiter plate wells were coated with individual sybodies, incubated with biotinylated constructs (receptor-binding domain, rbd; spike ectodomain, ecd; pre-fusion spike, s- p; maltose-binding protein, mbp), and then detected with streptavidin-peroxidase/tmb. a non-randomized sybody was used as negative control (wells h and h , respectively). sybodies that were sequenced are marked with the respective sybody name (sb_# - ). please note that identical sybodies that were found - times are marked with the same sybody name (e.g. sb_# ). loop .
phylogenetic tree of rbd sybodies. a radial tree was generated in clc . . .
figure s kinetic characterization of sybodies by gci. (a) rbd-vyfp and ecd were immobilized as indicated and the six top sybodies were injected at increasing concentrations ranging from . nm to μm. data were fitted using a langmuir : model. (b) in-depth affinity characterization of sb# and sb# . rbd-vyfp and s- p were immobilized as indicated and sb# and sb# were injected at concentrations ranging from . nm to nm for sb# and . nm to nm for sb# . for rbd, data were fitted using a langmuir : model. for s- p, the data were fitted with the heterogeneous ligand model, because the : model was clearly not appropriate to describe the experimental data. corresponding data for s- p are shown in main fig. c .
simultaneous binding of sb# and sb# . competition elisa experiment in which sb# was coated on the elisa plate and rbd binding was assessed in the absence or presence of tag-less sybodies as indicated on the x-axis. to determine the background signal, buffer devoid of protein was added.
herd immunity -estimating the level required to halt the covid- epidemics in affected countries the reproductive number of covid- is higher compared to sars coronavirus estimation of the reproductive number of novel coronavirus (covid- ) and the probable outbreak size on the diamond princess cruise ship: a data-driven analysis presumed asymptomatic carrier transmission of covid- estimating clinical severity of covid- from the transmission dynamics in wuhan, china preliminary identification of potential vaccine targets for the covid- coronavirus (sars-cov- ) based on sars-cov immunological studies. viruses the sars-cov- vaccine pipeline: an overview use of antiviral drugs to reduce covid- transmission a novel coronavirus outbreak of global health concern a sars-like cluster of circulating bat coronaviruses shows potential for human emergence structure, function, and evolution of coronavirus spike proteins cryo-em structure of the -ncov spike in the prefusion conformation structure, function, and antigenicity of the sars-cov- spike glycoprotein sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor human coronaviruses oc and hku bind to -o-acetylated sialic acids via a conserved receptor-binding site in spike protein domain a pre-fusion structure of a human coronavirus spike protein cryo-em structures of mers-cov and sars-cov spike glycoproteins reveal the dynamic receptor binding domains viral membrane fusion unexpected receptor functional mimicry elucidates activation of coronavirus fusion identification of immunodominant sites on the spike protein of severe acute respiratory syndrome (sars) coronavirus: implication for developing sars diagnostics and vaccines the spike protein of sars-cov--a target for vaccine and therapeutic development development and characterisation of neutralising monoclonal antibody to the sars-coronavirus human monoclonal antibody combination against sars coronavirus: synergy and coverage of 
escape mutants potent neutralization of mers-cov by human neutralizing monoclonal antibodies to the viral spike glycoprotein cross-neutralization of human and palm civet severe acute respiratory syndrome coronaviruses by antibodies targeting the receptor-binding domain of spike protein receptor-binding domain of severe acute respiratory syndrome coronavirus spike protein contains multiple conformation-dependent epitopes that induce highly potent neutralizing antibodies a novel nanobody targeting middle east respiratory syndrome coronavirus (mers-cov) receptor-binding domain has potent cross-neutralizing activity and protective efficacy against mers-cov structure of severe acute respiratory syndrome coronavirus receptorbinding domain complexed with neutralizing antibody fully human single-domain antibodies against sars-cov- . biorxiv immunogenicity and structures of a rationally designed prefusion mers-cov spike antigen nanobodies: natural single-domain antibodies a general protocol for the generation of nanobodies for structural biology structure of a nanobody-stabilized active state of the beta( ) adrenoceptor. nature synthetic single domain antibodies for the conformational trapping of membrane proteins. elife generation of synthetic nanobodies against delicate proteins nanobodies® as inhaled biotherapeutics for lung diseases selection, biophysical and structural analysis of synthetic nanobodies that effectively neutralize sars-cov- . biorxiv potent synthetic nanobodies against sars-cov- and molecular basis for neutralization single beam grating coupled interferometry: high resolution miniaturized label-free sensor for plate based parallel screening structure-based design of prefusion-stabilized sars-cov- spikes rapid quantification of sars-cov- -neutralizing antibodies using propagation-defective vesicular stomatitis virus pseudotypes. 
vaccines (basel) virological assessment of hospitalized patients with covid- modulation of protein properties in living cells using nanobodies structural basis for the neutralization of sars-cov- by an antibody from a convalescent patient neutralization of sars-cov- by destruction of the prefusion spike distinct conformational states of sars-cov- spike protein. science structures of human antibodies bound to sars-cov- spike reveal common epitopes and recurrent features of antibodies ultrapotent human antibodies protect against sars-cov- challenge via multiple mechanisms structural basis of a shared antibody response to sars-cov- . science antibody cocktail to sars-cov- spike protein prevents rapid mutational escape seen with individual antibodies neutralizing nanobodies bind sars-cov- spike rbd and block interaction with ace an alpaca nanobody neutralizes sars-cov- by blocking receptor interaction de novo design of picomolar sars-cov- miniprotein inhibitors an ultra-high affinity synthetic nanobody blocks sars-cov- infection by locking spike into an inactive conformation engineered peptide barcodes for in-depth analyses of binding protein libraries a highly efficient modified human serum albumin signal peptide to secrete proteins in cells derived from different mammalian species a versatile and efficient high-throughput cloning tool for structural biology x-ray structure of a calcium-activated tmem lipid scramblase a variant of yellow fluorescent protein with fast and efficient maturation for cell-biological applications one-step purification of recombinant proteins using a nanomolar-affinity streptavidin-binding peptide, the sbp-tag. 
protein expression and purification structural basis for potent neutralization of betacoronaviruses by single-domain camelid antibodies a human monoclonal antibody blocking sars-cov- infection a vesicular stomatitis virus replicon-based bioassay for the rapid and sensitive determination of multi-species type i interferon focus: the interface between data collection and data processing in cryo-em motioncor : anisotropic correction of beam-induced motion for improved cryo-electron microscopy ctffind : fast and accurate defocus estimation from electron micrographs sphire-cryolo is a fast and accurate fully automated particle picker for cryo-em cryosparc: algorithms for rapid unsupervised cryo-em structure determination new tools for automated high-resolution cryo-em structure determination in relion- . elife optimal determination of particle orientation, absolute hand, and contrast loss in single-particle electron cryomicroscopy prevention of overfitting in cryo-em structure determination addressing preferred specimen orientation in single-particle cryo-em through tilting we thank rony nehmé and andré heuer (creoptix ag, wädeswil, switzerland) for the acquisition, fitting and interpretation of a first set of gci measurements using the wavesystem. we thank florence projer, david hacker and kelvin lau (protein production and structure core facility, epfl, switzerland) for the production of the pre-fusion spike protein. we are grateful to jason mclellan (the university of texas at austin, u.s.) for having provided the pre-fusion-stabilized soluble spike expression vectors for s- p and s- p. we thank michael fiebig (absolute antibody) for providing us with purified sb-fc. we thank raimund dutzler and marta sawicka (university of zurich) for freezing cryo-em grids. michiel punter (university of groningen) is acknowledged for it help. key: cord- -onfabcfv authors: klingler, j.; weiss, s.; itri, v.; liu, x.; oguntuyo, k. 
y.; stevens, c.; ikegame, s.; hung, c.-t.; enyindah-asonye, g.; amanat, f.; baine, i.; arinsburg, s.; bandres, j. c.; kojic, e. m.; stoever, j.; jurczyszak, d.; bermudez-gonzalez, m.; simon, v.; liu, s.; lee, b.; krammer, f.; zolla-pazner, s.; hioe, c. e. title: role of igm and iga antibodies to the neutralization of sars-cov- date: - - journal: medrxiv : the preprint server for health sciences doi: . / . . . sha: doc_id: cord_uid: onfabcfv sars-cov- has infected millions of people and is on a trajectory to kill more than one million globally. virus entry depends on the receptor-binding domain (rbd) of the spike protein. although previous studies demonstrated anti-spike and -rbd antibodies as essential for protection and convalescent plasma as a promising therapeutic option, little is known about the immunoglobulin (ig) isotypes capable of blocking virus entry. here, we studied spike- and rbd-specific ig isotypes in plasma/sera from two acutely infected and convalescent individuals. spike- and rbd-specific igm, igg , and iga antibodies were produced by all or nearly all subjects at varying levels and detected at - days post-disease onset. igg , igg , igg , and iga were also present but at much lower levels. all samples also displayed neutralizing activity. igm, igg, and iga were capable of mediating neutralization, but neutralization titers correlated better with binding levels of igm and iga than igg. in december , the first patients with coronavirus disease , caused by severe acute respiratory syndrome coronavirus (sars-cov- ) were identified in the city of wuhan, hubei province, china . since then, the epidemic has rapidly spread to most regions of the world, infecting millions of people . effective therapeutics and vaccines against sars-cov- are urgently needed. 
to this end, more information about the ig isotypes present in the plasma of covid- convalescent individuals and their antiviral activities is needed, as convalescent plasma transfusion showed promising results in patients with severe to life-threatening covid- - . the data would also inform vaccine development, as more than vaccine candidates are in different stages of preclinical development, and many are now in phase and clinical trials . although using different strategies , many vaccines are based on one of the three membrane-anchored proteins present on the virus envelope surface: the sars-cov- spike protein , , which contains the receptor-binding domain (rbd) required for binding to and entry into the cells [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] . these vaccines aim to protect by inducing neutralizing antibodies (abs) capable of blocking the viral infection. however, which of the immunoglobulin (ig) isotypes are protective is not yet clear. monomeric igg constitutes approximately % of the abs found in serum and exists as four subtypes: igg (~ % of igg), igg (~ % of igg), igg (~ % of igg) and igg (~ % of igg) , . igm abs represent % of total serum abs and are the first to arise in response to new antigens , . although igm abs do not undergo extensive somatic hypermutation to increase their affinity as do igg and iga abs, their higher valency due to the oligomerization of igm enhances their avidity and potency against pathogens , , . iga abs exist as two subtypes: iga and iga , and represent % of total serum abs . they are dimeric in the mucosa, but in the circulation, these two iga subtypes are monomeric. sars-cov- spike-, rbd- and nucleocapsid-specific serum/plasma abs of igm, igg, and iga . cc-by-nc-nd . international license it is made available under a is the author/funder, who has granted medrxiv a license to display the preprint in perpetuity. the copyright holder for this preprint this version posted august , . . https://doi.org/ . / . . .
doi: medrxiv preprint isotypes are found in most covid- patients - , with ab neutralizing activities reported among the convalescent patients , , , . however, the neutralizing titers appear to vary greatly , , , and they correlate with ab binding levels against rbd, spike, and/or nucleocapsid, and also with age, duration of symptoms, and symptom severity , , . several rbd-specific monoclonal abs of igg isotype with potent antiviral activities have been generated from individuals with high neutralization titers and these confer protection in animal models , , , . moreover, a monoclonal ab of iga isotype capable of recognizing both the sars-cov- and sars-cov- spike proteins, and blocking ace receptor binding was recently described . however, no data are available regarding the neutralizing capacity of plasma igm and iga abs from covid- patients. studies on other respiratory viruses such as influenza show that, in addition to igg, iga can also mediate virus neutralization, and their relative contribution depends on the physiologic compartment in which they are found, with iga contributing mostly to the protection of the upper respiratory tract while igg protects the lower respiratory tract - . of note, an anti-hemagglutinin monoclonal iga has been demonstrated to mediate more potent antiviral activities against influenza when compared to a monoclonal igg against the same epitope . interestingly, an igm ab with potent antiviral activities targeting the receptor binding site of influenza b has also been described . in addition, mucosal respiratory syncytial virus (rsv)-specific iga neutralizing abs are a better correlate of protection than serum rsv-specific igg neutralizing abs . in the case of sars-cov- , high titers of mucosal iga in the lungs are correlated with reduced pathology upon viral challenge in animal models . whether iga in the blood and the respiratory tract mucosa offers protection against sars-cov- infection remains an open question.
moreover, few data are available concerning the contribution of igm to neutralization and protection against viruses, including sars-cov- . we have recently published a luminex assay detecting total ig against spike and rbd . based on this assay, we studied here the ig isotype profiles against spike and rbd in the plasma and serum from acutely infected or convalescent individuals using a luminex assay that detects antigen-specific igm, igg - , iga and iga . using a pseudovirus assay , we also measured neutralizing activities in plasma and serum samples and in ig isotype fractions to determine the neutralizing capacity of igm, iga, and igg. the data demonstrate a high prevalence of spike- and rbd-specific igm and iga, similar to that of igg , in plasma/serum from covid- patients and their significant contributions to virus-neutralizing activities. this is the first evidence that purified plasma igg, igm, and iga contribute to sars-cov- neutralization.
a total of serum (p# - ) and plasma (tf# - ) specimens from covid- -convalescent individuals were tested. sera from three uninfected individuals banked as part of an ongoing longitudinal study prior to the covid- outbreak (n# - ), and an additional ten plasma samples from covid-negative contemporaneous blood bank donors (n# - ) were included for comparison. the specimens were initially titrated for total ig against spike and rbd (fig. ) . all covid- positive specimens exhibited titration curves of total ig abs against spike, while none of the negative controls did. similar results were observed with rbd, except that one covid- -negative sample had low titrating levels of rbd-specific ig. overall the background mfi values were higher for rbd than spike.
the areas under the curves (aucs) correlated highly with the : dilution mfi (p < . ; supplementary fig. ) ; consequently, all samples were tested for isotyping at this dilution. to assess the reproducibility of the assay, the samples were tested in at least two separate experiments run on different days, and a strong correlation was observed between the mfis from these independent experiments (supplementary fig. ). to evaluate the presence of spike-specific and rbd-specific total ig, igm, igg , igg , igg , igg , iga and iga , the specificity of the secondary abs used to detect the different isotypes was first validated with luminex beads coated with myeloma proteins of known ig isotypes (igg , igg , igg , igg , iga , iga , and igm) (supplementary fig. ) . while all convalescent individuals had anti-spike and anti-rbd total ig (fig. ) , the ig levels were highly variable, with mfi values ranging from , to , . all convalescent individuals displayed igm abs against spike at varying levels, although only % ( / ) were positive for anti-rbd igm, when evaluated using cut-off values calculated as mean + standard deviation (sd) of the covid-negative samples. the lower percentage of igm abs specific for rbd might be due to the high background observed for igm against rbd with covid-negative specimens (fig. b,c) . an igg response was also detected against both spike and rbd in % of the convalescent subjects, with mfi values that ranged from . to , , and the responses against spike and rbd were highly correlated for every isotype (supplementary fig. ) .
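The cut-off rule described above (mean of the negative samples plus a multiple of their standard deviation) can be sketched as follows. The multiplier is left as a parameter because its value is not legible in the text; the function names, the use of the sample standard deviation and the example MFI values are assumptions for illustration.

```python
import statistics


def positivity_cutoff(negative_mfi, n_sd):
    """Cut-off = mean + n_sd * sample SD of the negative-control MFI values."""
    return statistics.mean(negative_mfi) + n_sd * statistics.stdev(negative_mfi)


def fraction_positive(sample_mfi, cutoff):
    """Fraction of samples whose MFI lies strictly above the cut-off."""
    return sum(1 for value in sample_mfi if value > cutoff) / len(sample_mfi)
```

A high and variable background in the negative samples raises the cut-off directly, which is one way the lower positivity rate for anti-rbd igm noted above can arise.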
in contrast, igg , igg , and igg abs against spike and rbd were detected in only a small fraction of the subjects, and the levels were very low (mfi values < ) (fig. ) . surprisingly, almost all the individuals produced iga abs against spike ( %) and rbd ( %) while % exhibited iga against spike, and % exhibited iga against rbd (fig. ) . overall, these data demonstrate that igm, igg , and iga abs were all strongly induced against spike and rbd in all or almost all covid- convalescent individuals (fig. ) . the levels, however, were highly variable among individuals. while not reaching statistical significance, a general trend was observed toward higher levels of total ig, igm, igg , and iga in women compared to men (supplementary fig. ). in fig. , comparing levels of total ig with the different ig isotypes showed a highly significant correlation with igg for abs specific for both spike and rbd, indicating that igg is the major isotype induced by sars-cov- infection. no other isotype showed a significant correlation with total ig abs for anti-spike abs, although there was a significant correlation between total ig and iga for anti-rbd abs. moreover, none of the igm, igg , or iga isotypes correlated with one another (fig. c) .
iga is induced early after disease onset in covid- patients. since almost all convalescent covid- patients displayed iga ab responses to spike and rbd antigens, we sought to evaluate the kinetics of iga versus other ig isotype induction at early time points. we tested longitudinal samples collected from two patients (p# and p# ) from days to post-onset of symptoms. the earliest samples from both patients were positive for iga abs against spike (fig. a) and rbd (fig. b) , and the levels increased over time. like iga ab, total ig, igm, and igg abs were also detectable as early as - days post symptom onset, and their levels increased over time. in contrast, iga ab levels were near or below background on days - and remained unchanged over two weeks post-onset. igg abs also remained low or near background, whereas igg and igg abs increased slightly to above background after - days.
neutralizing activity is detected in all convalescent covid- individuals. we subsequently tested the ability of samples from convalescent subjects to neutralize a vsv∆g pseudovirus bearing the sars-cov- spike protein (cov pp). the results show that all specimens from covid- -convalescent individuals were able to neutralize the virus at levels above % (fig. a) . for of specimens, neutralization reached more than % (fig. a) . interestingly, one sample demonstrated highly potent neutralization with a reciprocal ic titer > , , and neutralization was still % at the highest dilution tested. in contrast, one sample had a low neutralization titer (reciprocal ic titer = ) and reached a neutralization plateau of only ~ %. in comparison, none of the samples from covid- -negative individuals reached % neutralization (fig. b) , while the srbd positive control demonstrated potent neutralization with an ic of . µg/ml ( fig. c) , similar to that recently reported .
because isotype levels and neutralization titers varied tremendously among convalescent covid- individuals ( fig.
) , we investigated if neutralizing activities correlated with a particular ab isotype. igm abs specific for spike and rbd displayed the strongest correlation with neutralizing abs (p < . and p = . , respectively), while igg did not show a significant correlation (fig. a) . neutralization reciprocal ic titers weakly correlated with rbd-specific total ig levels but did not correlate with spike-specific total ig levels (fig. a) . correlation with ic titers yielded similar results (data not shown). iga levels also correlated with reciprocal ic titers (p = . and p = . for spike and rbd, respectively; fig. a ). although there were significant correlations between anti-spike and -rbd abs of the igg and igg s isotypes, most of the neutralization values were below the cut-off and for the few igg and igg responders, the levels were near background (fig. b) . neutralizing activities are mediated by igm, igg, and iga fractions. the data above show the strongest correlation of neutralizing activity with igm. to ask directly to what extent abs of different isotypes mediate neutralization, we evaluated the neutralization activities of igm, igg, and iga fractions purified from plasma from five covid- -convalescent individuals (rp# - ). the enrichment of igm, igg , and iga abs reactive with spike and rbd was validated using the isotyping method used above (supplementary fig. and not shown). these igm, igg and iga fractions were then evaluated for neutralizing activity along with the original plasma (fig. ) . the rp# - plasma neutralizing reciprocal ic titers ranged from to (fig. a,b) . the purified igg, iga and igm fractions all displayed neutralization of more than %, while the negative control ig fractions (rn# ) did not (fig. c,d) . the ic values for igm and igg were similar, and both were significantly better than iga (fig c,d) . 
our study demonstrates the presence of igg , iga and igm within - days after the onset of symptoms. neutralizing activity correlated most strongly with igm, followed by iga , but did not correlate with igg or other igg isotypes. while correlations are important, the neutralizing activities of the different isotypes were also tested directly, and these experiments showed that neutralizing activity was displayed most potently by igm and igg and less strongly by iga. these data indicate the protective potential of all three major ig isotypes and suggest that induction of each of these isotypes by vaccination may offer optimal protection against infection. these data also carry important implications for the use of hyperimmune globulin as treatment and prophylactic modalities. several sars-cov- vaccine candidates tested in animal models and humans were shown to induce igg responses against spike and rbd as well as virus neutralizing activities, but in many of these studies, the induction of other ig isotypes was not evaluated. dna vaccines expressing full-length and truncated spike proteins were able to curtail virus infection in the respiratory tract by varying degrees. virus reduction correlated with levels of neutralization and also fc-mediated effector functions such as antibody-dependent complement deposition (adcd). interestingly, these dna vaccines elicited spike- and rbd-specific igg , igg , igg , iga, and igm abs, and similar to our findings, neutralization correlated most strongly with igm.
adenovirus serotype vaccine vectors encoding seven different sars-cov- spike variants showed high protection, and virus reduction correlated best with neutralizing ab titers together with igm binding levels, fcγrii-binding, and adcd responses. we noted that while all covid- convalescent individuals exhibited plasma/serum neutralization activities, reaching % neutralization, and of specimens attained % neutralization, neutralization levels were highly variable with reciprocal ic and ic titers ranging over three orders of magnitude. similarly, the levels of spike- and rbd-binding total ig and ig isotypes also varied greatly. a trend to higher levels of total ig and each ig isotype was seen in female compared to male subjects, as reported in another study. moreover, except for tf# (a male elite neutralizer), the median neutralizing reciprocal ic titer was higher in females than males, although the difference did not reach significance (data not shown). gender differences in ab induction have been observed following vaccination against influenza in humans and mice and were shown to result from the impact of sex steroids. whether and to what extent this contributes to gender differences seen in clinical outcomes of covid- remains to be investigated. other studies have shown that the ab levels were associated with multiple factors, including time from disease onset and disease severity. however, clinical data are not available for the subjects studied here, limiting our analysis only to neutralization and ig isotypes. one remarkable finding from our study is that neutralization levels correlated with binding levels of igm and iga , but not igg .
this is consistent with our data showing the neutralization activities mediated by purified igm and iga fractions from covid- patients. nonetheless, purified igg fractions from convalescent plasma also exhibited potent neutralization. because the fractions contained all four igg subtypes, it remains unknown which igg subtypes contribute to neutralization, although igg , the most abundant igg isotype in the blood, did not correlate with neutralization titers, and igg and igg , present at low levels, displayed weak correlations. the lack of correlation between igg binding titers and neutralization activities may be explained by the fact that the dominant igg responses may target sites not critical for blocking virus entry. indeed, among mabs isolated from six covid- convalescent patients, > % of rbd-specific mabs did not display neutralizing activities, and another study demonstrated that mabs that bind to non-rbd epitopes on the spike protein had poor neutralization potencies. nonetheless, the absence of a correlation between igg binding titers and neutralizing activities reported above requires more study, because the data remain controversial: other studies have demonstrated correlations between neutralization titers and sars-cov- specific igg levels. in addition to neutralization, non-neutralizing ab activities have been implicated in protection from virus infection through potent fc-mediated functions such as antibody-dependent cellular cytotoxicity (adcc), antibody-dependent cellular phagocytosis (adcp), and complement-mediated lysis; this is true for hiv, influenza, marburg, and ebola viruses. these fc activities were not evaluated in our study, and their contribution to protection against infection and disease progression in humans is yet unclear. interestingly, a recent study demonstrated enrichment of spike-specific igm and iga abs and spike-specific phagocytic and adcd activity in plasma of individuals who recovered from infection, while nucleocapsid-specific igm and iga responses and nucleocapsid-specific adcd activity were features enriched in deceased patients. defining the full potential of abs against sars-cov- that includes neutralizing, non-neutralizing and enhancing activities is vital for developing the most effective vaccines and determining the optimal convalescent ab treatment against covid- .
when we examined plasma specimens collected within - days after covid- onset, we detected igg and iga against spike and rbd, as well as igm. this is consistent with published reports showing that % of covid- -infected individuals developed igg within days after symptom onset and that seroconversion for igg and igm occurred simultaneously or sequentially. iga was also found early after infection ( - days after symptom onset) in another study. these studies suggest that measuring total ig, rather than igg, would provide a better outcome for diagnosis of early disease. indeed, we found no correlation between the levels of different isotypes examined in our study. this lack of correlation may result from their non-synchronous, sequential induction (igm first, then igg, and finally iga), but the presence of iga early during acute infection also suggests the potential contribution of natural iga, which, similar to natural igm, arises spontaneously from innate b cells to provide the initial humoral responses before the induction, maturation, and class-switching of adaptive classical b cells.
in summary, this study demonstrates that spike- and rbd-specific igm, igg , and iga abs were present in serum or plasma of all or almost all analyzed covid- convalescent subjects and were detected at extremely early stages of infection. the plasma of convalescent individuals also displayed neutralization activities that were mediated by igm, igg, and iga , although neutralization titers correlated more strongly with levels of igm and iga than igg. the contribution of igm and iga abs to the neutralizing activities against sars-cov- demonstrates their importance for protective immunity against this virus.
recombinant proteins. the recombinant spike and rbd proteins were produced as previously described in expi f cells (thermofisher) by transfections of purified dna using an expifectamine transfection kit (thermofisher). the soluble version of the spike protein included the protein ectodomain (amino acids - ), a c-terminal thrombin cleavage site, a t foldon trimerization domain, and a hexahistidine tag. the protein sequence was also modified to remove the polybasic cleavage site (rrar to a) and to introduce two stabilizing mutations (k p and v p, wild type numbering). the rbd (amino acids - ) included the signal peptide (amino acids - ) and a hexahistidine tag. supernatants from transfected cells were harvested on day three post-transfection by centrifugation of the culture at g for min. the supernatant was then incubated with ml ni-nta agarose (qiagen) for one to two hours at room temperature. next, gravity-flow columns were used to collect the ni-nta agarose, and the protein was eluted.
each protein was concentrated in amicon centrifugal units … obtained from study participants enrolled in irb-approved protocols at icahn school of medicine at mount sinai and the james j. peters va medical center. study participants provided written consent at enrollment and agreed to sample banking and future research use of their banked biospecimen. samples from these protocols included sera from seven participants with documented sars-cov- infection: specimens from p# (d , d , and d after symptom onset), p# (d , and d after symptom onset), and rp# - . in addition, sera collected from healthy donors prior to the spread of sars-cov- in the u.s.a. were used.
ig fractionation. iga was first isolated from plasma by mixing : diluted plasma with peptide m agarose beads ( µl/ ml plasma, invivogen #gel-pdm) for . hours at room temperature. beads were then collected on a column and washed with pbs until protein reading ( nm) by nanodrop reached background. iga was eluted from beads with a ph . elution buffer (thermo scientific # ) and neutralized with ph tris buffer. the pass-through plasma sample was collected for igg enrichment using protein g agarose beads (invivogen #gel-agg) using the same protocol as above and subsequently for igm isolation using a hitrap igm column (ge healthcare # - - ) according to the manufacturer's instructions. an additional purification step was performed using protein a plus mini-spin columns to separate igg from igm. protein concentrations were determined with nanodrop prior to use in luminex and neutralization experiments.
luminex binding ab assay. the sars-cov- antigens used in this assay were a soluble recombinant trimerized form of the spike protein and a recombinant rbd protein.
antigens were coupled as previously described, with minor changes. each antigen was covalently coupled individually to a uniquely labeled fluorochrome carboxylated xmap bead set at . μg protein/million beads using a two-step carbodiimide reaction with the xmap ab coupling (abc) kit following the manufacturer's instructions (luminex, austin, tx). the coupled beads were pelleted, resuspended at x beads/ml in storage buffer (pbs, . % bovine serum albumin (bsa), . % tween- , and . % sodium azide, ph . ), and stored at - °c. three to five million beads per batch were prepared in a . ml conical tube. before each experiment, the beads needed for a single run ( , beads/well x number of wells) were pelleted and resuspended in assay buffer (pbs, . % bsa, . % tween- ) to deliver , beads in a volume of μl/well. sera/plasma samples were serially titrated ( : to : final dilution) or diluted in assay buffer to : (for a final dilution of : ). the samples were then added as μl/well to the wells containing the beads and incubated at room temperature for hour at rpm.
after two washes in assay buffer, μl/well of biotinylated antibodies specific for total ig, igg , igg , igg , igg , iga , iga , or igm was added and incubated for min at room temperature on a plate shaker; these antibodies were rabbit biotinylated-anti-human total ig (abcam, catalog #ab ) at μg/ml, mouse biotinylated-anti-human igg fc (invitrogen #mh ) at μg/ml, mouse biotinylated-anti-human igg fc (southern biotech # - ) at μg/ml, mouse biotinylated-anti-human igg hinge (southern biotech # - ) at μg/ml, mouse biotinylated-anti-human igg fc (southern biotech # - ) at μg/ml, mouse biotinylated-anti-human iga fc (southern biotech # - ) at μg/ml, mouse biotinylated-anti-human iga fc (southern biotech # - ) at μg/ml or goat biotinylated-anti-human igm (southern biotech # - ) at μg/ml. after two washes, μl/well of streptavidin-phycoerythrin (pe) at μg/ml was added (biolegend # ) followed by a min incubation at room temperature on a plate shaker. after two additional washes, μl of assay buffer/well was added and the plate was put on a shaker to resuspend the beads. the plate was read with a luminex flexmap d instrument. specimens were tested in duplicate, and the results were recorded as mean fluorescent intensity (mfi).
cov pp production and titration. the sars-cov- pseudovirus (cov pp) was produced as previously described. briefly, t cells were transfected to overexpress sars-cov- glycoproteins. for background entry with particles lacking a viral surface glycoprotein, pcagg empty vector was … r.l.u.) / virus control r.l.u.) * ).
the inhibitory concentration % (ic ) and % (ic ) were respectively defined as the reciprocal sample dilution or purified ig fraction concentration achieving % and % neutralization. statistical analysis. the statistical significance was determined by a two-tailed mann-whitney test and correlations analyzed with a spearman rank-order correlation test using graphpad prism . the raw data that support the findings of this study are available from the corresponding author upon request.
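the spearman rank-order correlation named in the statistical analysis can be illustrated with a short sketch. this is not the authors' analysis, which was run in graphpad prism; it is a minimal pure-python version, assuming average ranks for ties and computing only the coefficient (no p value). the function names and the example numbers are hypothetical.

```python
import math


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def spearman_rho(x, y):
    """Spearman's rho: the Pearson correlation of the rank-transformed data."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical binding levels (MFI) and reciprocal neutralization titers.
mfi = [120.0, 450.0, 300.0, 980.0]
titer = [40.0, 200.0, 350.0, 1600.0]
rho = spearman_rho(mfi, titer)
```

because the coefficient depends only on ranks, it is unaffected by the wide spread of titers across individuals; a significance test would still be needed before drawing conclusions.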
neutralization of cov pp by samples from (a) covid- -convalescent individuals and (b) covid- -negative individuals, compared to (c) a recombinant soluble rbd (srbd) control. each plasma or sera specimen was tested at -fold dilutions from : to : , , and srbd was tested at -fold dilutions from to . µg/ml. the data are shown as mean percentage of neutralization + sd of triplicate. the extrapolated titration curves were generated using a nonlinear regression model in graphpad prism (inhibitor versus response, variable slope (four parameters), least squares regression). the dotted horizontal lines highlight % neutralization. axis labels: tf# specimens, covid- -positive subjects.
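the four-parameter, variable-slope model mentioned in the legend can be sketched as follows. this is a generic four-parameter logistic written for illustration, not necessarily the exact parameterization used by graphpad prism, and it does not perform the least squares fit itself. the parameter names (`bottom`, `top`, `ic50`, `hill`) and the example values are assumptions.

```python
def four_pl(conc, bottom, top, ic50, hill):
    """Percent neutralization at a given concentration under a generic
    four-parameter logistic (assumed form, rising from bottom to top)."""
    return bottom + (top - bottom) / (1.0 + (ic50 / conc) ** hill)


def conc_for_response(target, bottom, top, ic50, hill):
    """Invert the curve: concentration at which neutralization equals target.

    Raises ValueError when the target lies outside the fitted plateaus,
    for example a 90% target on a curve that plateaus below 90%.
    """
    if not bottom < target < top:
        raise ValueError("target response is outside the curve's plateaus")
    ratio = (top - bottom) / (target - bottom) - 1.0
    return ic50 / ratio ** (1.0 / hill)


# Hypothetical fitted parameters for a purified Ig fraction (ug/ml).
params = dict(bottom=0.0, top=100.0, ic50=2.0, hill=1.0)
half_max = four_pl(2.0, **params)            # response at the IC50
ic90 = conc_for_response(90.0, **params)     # concentration giving 90%
```

the explicit plateau check matters for specimens whose neutralization levels off early: for such a curve a higher inhibitory concentration simply does not exist within the model, and the function reports that instead of extrapolating.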
phase / study to describe the safety and immunogenicity of a covid rna vaccine candidate (bnt b ) in adults to years of age: interim report
dna vaccine protection against sars-cov- in rhesus macaques
structural basis for the recognition of sars-cov- by full-length human ace
the novel coronavirus ( -ncov) uses the sars-coronavirus receptor ace and the cellular protease tmprss for entry into target cells
sars-cov- invades host cells via a novel route: cd -spike protein
neuropilin- facilitates sars-cov- cell entry and provides a possible pathway into the central nervous system
a pneumonia outbreak associated with a new coronavirus of probable bat origin
genomic characterisation and epidemiology of novel coronavirus: implications for virus origins and receptor binding
sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor
plasma were tested at -fold dilutions from : to : , or : to : , . data are shown as the mean percentage of neutralization. the dotted horizontal lines highlight % neutralization. (b) neutralization reciprocal ic and ic titers of rp# - plasma. (c) neutralization of cov pp by purified igm, igg, and iga fractions from covid- -infected individual plasma (rp# - ) compared to a control ig fraction. the fractions were tested at -fold dilutions from to . µg/ml. data are shown as the mean percentage of neutralization. the dotted horizontal lines highlight % neutralization. (d) ic of purified igm, igg, and iga fractions from rp# - plasma. the statistical significance was determined by a two
we thank all the donors for their contribution to research and dr. arthur nadas (nyu school of medicine) for statistical consultation.
the authors declare no competing interests.
fig. . summary of relative ig isotype levels and neutralization titers.
key: cord- -s tkbh r authors: procko, erik title: deep mutagenesis in the study of covid- : a technical overview for the proteomics community date: - - journal: expert review of proteomics doi: . / . . sha: doc_id: cord_uid: s tkbh r introduction the spike (s) of sars coronavirus (sars-cov- ) engages angiotensin-converting enzyme (ace ) on a host cell to trigger viral-cell membrane fusion and infection. the extracellular region of ace can be administered as a soluble decoy to compete for binding sites on the receptor-binding domain (rbd) of s, but it has only moderate affinity and efficacy. the rbd, which is targeted by neutralizing antibodies, may also change and adapt through mutation as sars-cov- becomes endemic, posing challenges for therapeutic and vaccine development. areas covered deep mutagenesis is a big data approach to characterizing sequence variants. a deep mutational scan of ace expressed on human cells identified mutations that increase s affinity and guided the engineering of a potent and broad soluble receptor decoy. a deep mutational scan of the rbd displayed on the surface of yeast has revealed residues tolerant of mutational changes that may act as a source for drug resistance and antigenic drift. expert opinion deep mutagenesis requires a selection of diverse sequence variants; an in vitro evolution experiment that is tracked with next-generation sequencing. the choice of expression system, diversity of the variant library and selection strategy have important consequences for data quality and interpretation. investigations of protein mutations have classically been approached by precision targeting, in which a small number of mutations are deliberately introduced and tested individually. this requires preconceived ideas or hypotheses on which residues and what changes to those residues might be relevant. 
when the important residues in a protein sequence are unknown, screens and selections can be used instead, in which a library of random mutations is in some way sorted to enrich for a small number of mutants with the intended phenotype. both experiments are limited in the scale of information they provide. deep mutagenesis or deep mutational scanning take advantage of next-generation sequencing to bring experimental protein mutagenesis to the realm of big data [ ] . a screen or selection of a diverse library of variants is tracked by next-generation sequencing to observe how the population's genetic makeup changes. mutations with enhanced function are enriched, while deleterious mutations are depleted; the enrichment ratio comparing frequencies in the selected population with the naive library thus acts as a proxy for relative phenotype. now, the relative effects of thousands of mutations can be assessed simultaneously in a single experiment and a comprehensive mutational landscape can be calculated from experimental data. deep mutagenesis has been developed by multiple groups over the past decade [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] and has proven especially invaluable to meet three goals: assisting protein engineering, understanding mutational tolerance within a protein sequence, and predicting which mutations might be associated with adverse disease outcomes, especially in the context of cancer or drug resistance. two recent and prominent studies of sars coronavirus (sars-cov- ) have used deep mutagenesis to address each of these problems [ , ] . this special report summarizes the two studies with a focus on experimental details and caveats that will be unfamiliar to those outside the deep mutational scanning community. two deep mutagenesis studies have determined how thousands of mutations within the sars-cov- spike or the virus' human receptor affect their binding. 
the data have proven invaluable for engineering high affinity decoy receptors that are under preclinical development as a covid- therapy, and have revealed the scope of mutational tolerance within the spike that may have bearing on genetic drift as the virus becomes endemic and changes over time. while these two studies focused on expression and binding between the viral spike and its receptor, the underlying selection strategies used in deep mutational scans are increasingly tied to more complex phenotypes, such as selections for structural stability based on protease-sensitivity [ ] , using competing ligands to engineer specificity into proteins including viral receptors [ ] [ ] [ ] , and selections based on catalytic or biological activity [ ] [ ] [ ] [ ] . undoubtedly there are more questions related to sars-cov- biology and the biochemistry of its while much attention has been given to isolating monoclonal antibodies with tight affinity for the sars-cov- spike (s) glycoprotein [ ] [ ] [ ] [ ] [ ] [ ] [ ] , an alternative is to use the entry receptor as a soluble decoy to neutralize infection. s is a class i viral fusion protein that is proteolytically processed into two subunits, s and s , that are non-covalently associated and decorate the coronavirus envelope [ ] [ ] [ ] . s recognizes angiotensin-converting enzyme (ace ) on host cells to initiate attachment and fusion of the viral and plasma membranes [ ] [ ] [ ] [ ] [ ] [ ] . soluble ace (sace ) blocks receptor-binding sites on s [ , , [ ] [ ] [ ] [ ] and while escape mutations in s rapidly emerge in tissue culture in the presence of monoclonal antibodies [ ] , in principle the virus has limited mechanisms to escape a soluble decoy receptor without simultaneously losing affinity for the natural receptor. the decoy receptor might also have a virucidal effect by inducing conformational changes and s shedding, such that virus particles are inactivated even if sace dissociates. 
however, monoclonal antibodies have superior affinity and neutralization efficacy. to improve the therapeutic potential of decoy receptors, my group used deep mutagenesis to find mutations in ace that enhance affinity [ ] . a library of over , single amino acid substitutions in ace was constructed, focused on diversification of residues at the structurally defined interface with the receptor-binding domain (rbd) of s [ , ] and also within the ace catalytic cleft. the library was expressed in a human cell line, with a c-myc epitope tag fused to the extracellular n-terminus of ace for detection of surface expressed protein. other than the presence of the epitope tag, ace expressed in this experimental selection system matches native ace in the human body. the culture expressing the ace library was then selected by fluorescence activated cell sorting (facs) to collect cells expressing ace variants with tight affinity for fluorescently labeled rbd from s of sars-cov- ( figure a ). for the artificial selection to be successful, cells must express a single protein variant from a single sequence variant, thereby providing a tight physical link between the phenotype of ace expressed at the plasma membrane and a single sequence within the cell. getting human cells in culture to acquire and express a single coding variant is no trivial feat, as transfection methods typically introduce many plasmid copies. different methods to solve this technical challenge have included the use of episomal plasmids that randomly partition to daughter cells during division until progeny harbor a single coding variant over time [ ] , the use of engineered integration sites in the genome [ , , ] , or the use of viral vectors at low multiplicities-of-infection [ , ] . my group used carrier dna to sufficiently dilute the ace plasmid library such that each cell typically acquired no more than a single coding variant [ ] . 
figure. (a) a library of ace variants was expressed in human cells. full-length ace (tan) was tagged with a c-myc epitope at its extracellular n-terminus for detection of surface expression with a fluorescent antibody (red). sars-cov- rbd (pale green) genetically fused with superfolder green fluorescent protein (sfgfp; dark green) was incubated with the cell culture. facs was used to collect fluorescent cells expressing ace with bound rbd-sfgfp. (b) the isolated rbd of sars-cov- (pale green) was fused at its n-terminus to aga p (blue) and at its c-terminus to a c-myc epitope tag. a saturation mutagenesis library of the rbd was expressed on the yeast surface. following induction of rbd expression, the yeast were incubated with dimeric, biotinylated sace (tan). bound ace was detected with fluorescent streptavidin (purple) and surface expressed rbd was detected with a fluorescent antibody (red).
• in deep mutagenesis, the relative phenotypes of thousands of mutations in a protein sequence are determined in a single experiment.
• the experimental mutational landscape of ace for binding the rbd of sars-cov- provides a blueprint for engineering high affinity decoy receptors.
• a deep mutational scan of the sars-cov- rbd reveals considerable opportunity for genetic drift without loss of receptor affinity.
• different expression systems for selecting ace or spike variants have inherent advantages and disadvantages.
• there are opportunities for deep mutagenesis to provide biochemical insights on other sars-cov- proteins.
an episomal plasmid is used for the library so that extrachromosomal replication within the cell enhances expression of the protein under investigation. (the carrier dna, itself a modified episomal plasmid, further assists in this process [ ] .) the disadvantage to this simple solution for linking a single genotype to phenotype is that the coding sequence is so diluted with carrier dna, most cells in the culture do not express ace and facs time is wasted on sorting a large number of negative cells. this has important consequences on the data, as time spent sampling negative cells is time not spent sampling cells expressing the protein under investigation, and consequently variants in the library may be under-sampled giving poor data accuracy. undersampling becomes exceptionally concerning as the library size increases, and for this reason the library was limited to single amino acid substitutions at just positions in ace . following facs selection of the human culture to enrich a cell population with high binding activity for sars-cov- protein s, rna transcripts were isolated and illumina sequenced. an enrichment ratio is calculated for each mutation by dividing its frequency in the sorted cell transcripts by its frequency in the naive plasmid library [ ] . illumina sequencing did not cover the full length of ace and instead the cdna was sequenced as a series of fragments that together provided full coverage of the diversified regions. one assumes during analysis that there are no additional mutations outside a sequenced fragment, a reasonable assumption when a mutation is found because the library was constructed to have only one amino acid substitution per plasmid. however, the assumption breaks down when no mutations are observed in the sequenced fragment, as one cannot know whether there was a mutation elsewhere outside the sequenced region. as a consequence, the wild type sequence is not directly observed and is instead only estimated. there are strategies using the introduction and analysis of silent mutations that can resolve this issue [ ] . overall, there was close agreement between the mutation enrichment ratios from two independent replicates of the facs experiments, indicating that the ace library was well sampled and there was high confidence in the data [ ] .
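the enrichment ratio described above can be sketched in a few lines. the example below divides each variant's frequency in the sorted population by its frequency in the naive library and reports the result on a log2 scale. the function name, the placeholder variant labels, the read counts and the optional pseudocount are all illustrative assumptions, not the analysis pipeline used in the study.

```python
import math


def log2_enrichment(sorted_counts, naive_counts, pseudocount=0.5):
    """log2 of (frequency after sorting) / (frequency in the naive library).

    A small pseudocount (an assumption of this sketch) is added to both read
    counts so that variants absent from the sorted population do not produce
    log(0).
    """
    total_sorted = sum(sorted_counts.values())
    total_naive = sum(naive_counts.values())
    ratios = {}
    for variant, n_naive in naive_counts.items():
        if n_naive == 0:
            continue  # variant never observed in the naive library: undefined
        n_sorted = sorted_counts.get(variant, 0)
        if n_sorted + pseudocount <= 0:
            continue  # no reads and no pseudocount: ratio is undefined
        f_sorted = (n_sorted + pseudocount) / total_sorted
        f_naive = (n_naive + pseudocount) / total_naive
        ratios[variant] = math.log2(f_sorted / f_naive)
    return ratios


# Hypothetical read counts for three placeholder variants.
naive = {"variant_1": 100, "variant_2": 100, "variant_3": 100}
sorted_pop = {"variant_1": 400, "variant_2": 100, "variant_3": 0}
for name, value in log2_enrichment(sorted_pop, naive).items():
    print(name, round(value, 2))
```

positive values indicate enrichment in the sorted gate and negative values indicate depletion; as noted in the text, these values are relative and do not map directly onto a biophysical parameter.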
the enrichment ratios calculated for each variant in the sorted ace library provide a mutational landscape that defines the relative phenotypes of thousands of ace mutations for binding to sars-cov- s [ ] . the data in this experiment are qualitative and it is unclear how a log enrichment ratio of, say, − or + translates to an exact change in a biophysical parameter such as k d . furthermore, mutations can impact not only binding affinity for the rbd of s but also ace surface expression. to filter out the contribution of mutations to expression, two populations of cells were collected by facs. in addition to collecting cells that express ace and tightly bind rbd, cells were simultaneously collected in the same experiment that express ace but have weak rbd binding. ace mutants that were not expressed at the cell surface would be depleted from both sorted populations, which was apparent from tracking the depletion of nonsense mutations. in this way, information was collected on how ace mutations impact expression and rbd binding from a single facs experiment. the deep mutational scan of ace revealed that mutations can indeed be found to enhance binding toward sars-cov- rbd (figure ) , suitable for engineering high affinity soluble decoy receptors [ ] . mutations were found at the binding interface where they enhance specific atomic contacts, and were also found distally in the second shell and beyond where they may impact ace conformation, folding and dynamics. a soluble ace variant that combines three mutations, called sace .v . , was found to be highly expressed, is a stable monodisperse dimer, binds sars-cov- s with picomolar affinity and potently neutralizes infection of a susceptible cell line by authentic virus. its properties rival affinity-matured monoclonal antibodies under commercial development for therapy and prophylaxis. despite only affinity toward sars-cov- being considered during the engineering process, sace . v . 
also potently neutralizes authentic sars-cov- , and we speculate that it will have broad activity against betacoronaviruses that use ace as an entry receptor. in unpublished work that has yet to be peer reviewed, we have found sace . v . broadly and tightly binds bat coronaviruses that may be a source for future pandemics, supporting the concept of receptor-based decoys as antiviral biologics with exceptional breadth. as determined by yeast display, the effects of mutations in the rbd of sars-cov- protein s on receptor affinity are plotted in the heat map at left, with dark green indicating the mutations are deleterious and pale colors indicating the mutations are neutral. the effects of mutations in human cell-expressed ace on binding to soluble rbd are plotted in the heat map at right, with depleted mutations in orange, neutral mutations in white and enriched mutations in blue. positional scores are mapped to the atomic structure of rbd-bound ace (pdb m ) at center. conserved ace residues for rbd binding are orange, while ace residues that are hot spots for mutations with increased affinity are blue. rbd residues conserved for ace binding are green. most rbd mutations in this region of the interface are deleterious, whereas numerous mutations were found in ace that increased affinity. in starr et al, deep mutagenesis was applied to the sars-cov - spike to assess mutational tolerance for expression and ace interactions [ ] . instead of investigating the entire trimeric s protein expressed on a cellular or viral membrane, the isolated rbd was fused to the yeast mating factor aga p and displayed on the yeast surface [ ] (figure b) . this is an artificial display platform that removes the rbd from its native context. 
n-glycosylation in yeast is also of high-mannose type and lacks the complex, terminally sialylated glycans produced by human cells [ ] , which can be important when binding interactions are glycan-dependent as is seen for some antibodies targeting viral spikes [ ] . however, this display platform harnesses yeast genetics to confer tremendous advantages for in vitro selection and evolution. using yeast display, large diverse libraries can be readily sorted by facs to provide high-quality data. separate selections were completed at a range of different sace concentrations to simulate a titration experiment, from which the data could be converted to quantitative changes in apparent k d on the yeast surface ( figure ). as a surrogate for how rbd mutations may impact expression of the viral spike, the effects of mutations on rbd surface display were also assessed in a standalone facs selection. quality control pathways for protein secretion in yeast can be forgiving of misfolded protein sequences [ ] and there are residues of the rbd that would ordinarily be buried in the context of the full s protein; it therefore remains to be seen how closely the yeast display data will correlate with equivalent experiments in more physiologically relevant expression systems. nonetheless, the predicted effects by yeast display of some mutations were validated using full length s expressed in human cells and packaged in pseudovirus [ ] . the library encoding nearly , single amino acid substitutions in the sars-cov- rbd was pacbio sequenced, providing long reads that match untranslated nucleotide barcodes to a specific protein variant. following facs-based selection, only the barcodes are read to determine how favorable sequence variants are enriched or deleterious sequence variants are depleted.
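the barcode strategy can be illustrated with a minimal sketch. it assumes a lookup table from barcode to protein variant (as would be built from the long reads) and a list of barcodes read after selection; the function name, barcode sequences and variant labels are hypothetical placeholders, and real pipelines would also handle sequencing errors in the barcodes.

```python
from collections import Counter, defaultdict


def aggregate_barcode_counts(barcode_reads, barcode_to_variant):
    """Sum post-selection barcode reads per protein variant.

    Returns (total reads per variant, per-barcode counts for each variant,
    number of reads whose barcode is not in the lookup table).
    """
    per_variant = defaultdict(Counter)
    unmatched = 0
    for barcode in barcode_reads:
        variant = barcode_to_variant.get(barcode)
        if variant is None:
            unmatched += 1  # barcode never linked to a variant by long reads
            continue
        per_variant[variant][barcode] += 1
    totals = {v: sum(c.values()) for v, c in per_variant.items()}
    return totals, dict(per_variant), unmatched


# Hypothetical lookup: two independent barcodes tag the same variant.
lookup = {"AAAC": "variant_1", "GGTT": "variant_1", "CCTA": "variant_2"}
reads = ["AAAC", "GGTT", "AAAC", "CCTA", "TTTT"]
totals, by_barcode, unmatched = aggregate_barcode_counts(reads, lookup)
print(totals, unmatched)
```

keeping the per-barcode counts makes it possible to check that independent barcodes for the same variant behave consistently, which is the internal quality check that multiple barcodes per variant allow.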
this resolves issues with illumina sequencing failing to cover the full cdna length, and because multiple barcodes are associated with any given protein variant, there are additional internal checks for data quality and consistency. despite the limitations of a yeast display platform, the deep mutational scan of the isolated rbd provides a high quality and useful data set from which several important conclusions were drawn. first, the ace binding surface of sars-cov- rbd tolerates surprisingly high sequence diversity, even though it is a critical site for function [ ] . high diversity is also seen in the ace -binding sites of s proteins from sars-related bat coronaviruses, but this matches corresponding diversity in ace from ecologically diverse bat species [ ] and does not necessarily mean that the rbd tolerates mutations for binding ace from a single species. the deep mutational scan addresses this uncertainty and is further supported by evidence showing that diverse rbd sequences from bat coronaviruses are all competent for binding human ace with varying affinities [ ] . second, mutations were found in the rbd that enhance binding to ace , yet there does not appear to be positive selective pressure for these variants in the human population [ ] . sars-cov- affinity for ace is therefore 'good enough,' with no additional fitness benefit for higher affinity. it is worth noting that classical sars-cov- is also a highly infectious and virulent pathogen, despite having weaker ace affinity [ , ] . the rapid spread of sars-cov - probably has more to do with asymptomatic and presymptomatic transmission than enhanced receptor binding. third, mutations were found within the epitopes for monoclonal antibodies but maintain high ace binding, and it is likely that sars-cov- can easily mutate to escape neutralization without losing infectivity [ ] . 
this agrees with selection experiments of pseudovirus expressing sars-cov- s variants, in which escape mutants in the viral spike rapidly emerge to neutralizing antibodies in a single passage [ ] . this has profound implications for antibody therapy, where the standard has become combinations of noncompeting monoclonals in a cocktail to prevent rapid resistance. it is currently unknown whether an engineered soluble decoy receptor, such as sace .v . , will similarly be susceptible to the emergence of viral spike variants that can discriminate between the engineered decoy and the native receptor. we hypothesize that engineered decoys will be broadly active against sars-cov- variants and this remains an active area of investigation.
papers of special note have been highlighted as either of interest (•) or of considerable interest
deep mutational scanning: a new style of protein science
high-resolution mapping of protein sequence-function relationships
a fundamental protein property, thermodynamic stability, revealed solely from large-scale measurements of protein function
deep mutational scanning of an antibody against epidermal growth factor receptor using mammalian cell display and massively parallel pyrosequencing
affinity and cross-reactivity engineering of ctla -ig to modulate t cell costimulation
an engineered switch in t cell receptor specificity leads to an unusual but functional binding geometry
computational design of a protein-based enzyme inhibitor
experimental estimation of the effects of all amino-acid mutations to hiv's envelope protein on viral replication in cell culture
a platform for functional assessment of large variant libraries in mammalian cells
multiplex assessment of protein variant abundance by massively parallel sequencing
mapping interaction sites on human chemokine receptors by deep mutational scanning
    this study established a simple and effective method for linking phenotype to a single genotype in transfected human cells. this technical accomplishment is necessary for selection and deep mutational scanning in human cells
a comprehensive biophysical description of pairwise epistasis throughout an entire protein domain
high-throughput profiling of influenza a virus hemagglutinin gene at single-nucleotide resolution
deep mutational scanning of sars-cov- receptor binding domain reveals constraints on folding and ace binding
    the isolated rbd of sars-cov- was displayed on yeast and deep mutationally scanned to understand the mutational landscape for yeast surface expression (a surrogate for folding) and ace binding. the data reveal substantial sequence diversity is tolerated on the rbd surface
engineering human ace to optimize binding to the spike protein of sars coronavirus
    deep mutationally scanned ace expressed on a human cell membrane to identify substitutions that enhance binding to s of sars-cov- . this guided the engineering of high affinity and potently neutralizing decoy receptors
global analysis of protein folding using massively parallel design, synthesis, and testing
a computationally designed inhibitor of an epstein-barr viral bcl- protein induces apoptosis in infected cells
computationally designed high specificity inhibitors delineate the roles of bcl family proteins in cancer
engineered receptors for human cytomegalovirus that are orthogonal to normal human biology
single-mutation fitness landscapes for an enzyme on multiple substrates reveal specificity is globally encoded
a comprehensive, high-resolution map of a gene's fitness landscape
comprehensive sequence-flux mapping of a levoglucosan utilization pathway in e. coli
molecular determinants of chaperone interactions on mhc-i for folding and antigen repertoire selection
cross-neutralization of sars-cov- by a human monoclonal sars-cov antibody
studies in humanized mice and convalescent humans yield a sars-cov- antibody cocktail
potent neutralizing antibodies from covid- patients define multiple targets of vulnerability
broad neutralization of sars-related viruses by human monoclonal antibodies
a human monoclonal antibody blocking sars-cov- infection
a noncompeting pair of human neutralizing antibodies block covid- virus binding to its receptor ace
isolation of potent sars-cov- neutralizing antibodies and protection from disease in a small animal model
structure, function, and antigenicity of the sars-cov- spike glycoprotein
structural insights into coronavirus entry
sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor
a pneumonia outbreak associated with a new coronavirus of probable bat origin
receptor recognition by the novel coronavirus from wuhan: an analysis based on decade-long structural studies of sars
cryo-em structure of the -ncov spike in the prefusion conformation
angiotensin-converting enzyme is a functional receptor for the sars coronavirus
    this study reports the original discovery of ace as the entry receptor for classical sars-cov-
functional assessment of cell entry and receptor usage for sars-cov- and other lineage b betacoronaviruses
    * ace is shown to be a shared entry receptor for a clade of sars-associated betacoronaviruses, including diverse strains from bats and human virus sars-cov-
susceptibility to sars coronavirus s protein-driven infection correlates with expression of angiotensin converting enzyme and infection can be blocked by soluble receptor
neutralization of sars-cov- spike pseudotyped virus by recombinant ace -ig
inhibition of sars-cov- infections in engineered human tissues using clinical-grade soluble human ace
novel ace -igg fusions with improved in vitro and in vivo activity against sars-cov
antibody cocktail to sars-cov- spike protein prevents rapid mutational escape seen with individual antibodies
    sars-cov- spike is able to rapidly acquire mutations to escape neutralizing monoclonal antibodies in tissue culture, necessitating the combination of multiple non-competing monoclonals in a cocktail to prevent resistance
structural basis for the recognition of the sars-cov- by full-length human ace
structure of sars coronavirus spike receptor-binding domain complexed with receptor
an improved platform for functional assessment of large protein libraries in mammalian cells
mammalian cell surface display for monoclonal antibody-based facs selection of viral envelope proteins
hiv vaccine design to target germline precursors of glycan-dependent broadly neutralizing antibodies
structure-based design of native-like hiv- envelope trimers to silence non-neutralizing epitopes and eliminate cd binding
structural architecture of a dimeric class c gpcr based on co-trafficking of sweet taste receptor subunits
enrich: software for analysis of protein function by enrichment and depletion of variants
hiv- broadly neutralizing antibody precursor b cells revealed by germline-targeting immunogen
isolating and engineering human antibodies using yeast surface display
the humanization of n-glycosylation pathways in yeast
glycan-dependent neutralizing antibodies are frequently elicited in individuals chronically infected with hiv- clade b or c. aids research and human retroviruses
exceptional diversity and selection pressure on sars-cov and sars-cov- host receptor in bats compared to other mammals
structural basis of receptor recognition by sars-cov-
key: cord- -sqz yc b authors: huo, jiandong; zhao, yuguang; ren, jingshan; zhou, daming; duyvesteyn, helen me; ginn, helen m; carrique, loic; malinauskas, tomas; ruza, reinis r; shah, pranav nm; tan, tiong kit; rijal, pramila; coombes, naomi; bewley, kevin; radecke, julika; paterson, neil g; supasa, piyasa; mongkolsapaya, juthathip; screaton, gavin r; carroll, miles; townsend, alain; fry, elizabeth e; owens, raymond j; stuart, david i title: neutralization of sars-cov- by destruction of the prefusion spike date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: sqz yc b
there are as yet no licenced therapeutics for the covid- pandemic. the causal coronavirus (sars-cov- ) binds host cells via a trimeric spike whose receptor binding domain (rbd) recognizes angiotensin-converting enzyme (ace ), initiating conformational changes that drive membrane fusion. we find that monoclonal antibody cr binds the rbd tightly, neutralising sars-cov- and report the crystal structure at . Å of the fab/rbd complex. some crystals are suitable for screening for entry-blocking inhibitors. the highly conserved, structure-stabilising, cr epitope is inaccessible in the prefusion spike, suggesting that cr binding would facilitate conversion to the fusion-incompetent post-fusion state. cryo-em analysis confirms that incubation of spike with cr fab leads to destruction of the prefusion trimer. presentation of this cryptic epitope in an rbd-based vaccine might advantageously focus immune responses. binders at this epitope may be useful therapeutically, possibly in synergy with an antibody blocking receptor attachment.
highlights
cr neutralises sars-cov-
neutralisation is by destroying the prefusion spike conformation
this antibody may have therapeutic potential alone or with one blocking receptor attachment
incursion of animal (usually bat)-derived coronaviruses into the human population has caused several outbreaks of severe disease, starting with severe acute respiratory syndrome (sars) in (menachery et al., ) . in late a highly infectious illness, with cold-like symptoms progressing to pneumonia and acute respiratory failure, resulting in an estimated % overall death rate (baud et al., ) , with higher mortality among the elderly and immunocompromised populations, was identified and confirmed as a pandemic by the who on th march . the etiological agent is a novel coronavirus (sars-cov- ) belonging to lineage b betacoronavirus and sharing % sequence identity with bat coronaviruses (lu et al., a) . the heavily glycosylated trimeric surface spike protein mediates viral entry into the host cell. it is a large type i transmembrane glycoprotein (the ectodomain alone comprises over residues) (wrapp et al., ) . it is made as a single polypeptide and then cleaved by host proteases to yield an n-terminal s region and the c-terminal s region. spike exists initially in a pre-fusion state where the domains of s cloak the upper portion of the spike with the relatively small (~ kda) s rbd nestled at the tip. the rbd is predominantly in a 'down' state where the receptor binding site is inaccessible, however it appears that it stochastically flips up with a hinge-like motion transiently presenting the ace receptor binding site (roy, ; song et al., ; walls et al., ; wrapp et al., ) . ace acts as a functional receptor for both sars-cov and sars-cov- , binding to the latter with a to -fold higher affinity (k d of ~ nm), possibly contributing to its ease of transmission (song et al., ; wrapp et al., ) . there is % sequence identity between the rbds of sars-cov and sars-cov- ( figure s ).
when ace locks on it holds the rbd 'up', destabilising the s cloak and possibly favouring conversion to a postfusion form where the s subunit, through massive conformational changes, propels its fusion domain upwards to engage with the host membrane, casting off s in the process (song et al., ; wrapp et al., ) . structural studies of the rbd in complex with ace (lan et al., ; wang et al., b; yan et al., ) show that it is recognized by the extracellular peptidase domain (pd) of ace through mainly polar interactions. the s protein is an attractive candidate for both vaccine development and immunotherapy. potent nanomolar affinity neutralising human monoclonal antibodies against the sars-cov rbd have been identified that attach at the ace receptor binding site (including m , cr and r (ter meulen et al., ; sui et al., ; zhu et al., ) ). for example r binds with nanomolar affinity, prevents binding to ace and the formation of syncytia in vitro, and inhibits viral replication in vivo (sui et al., ) . however, despite the two viruses sharing the same ace receptor these ace blocking antibodies do not bind sars-cov- rbd (wrapp et al., ) . in contrast cr , a sars-cov-specific monoclonal selected from a single chain fv phage display library constructed from lymphocytes of a convalescent sars patient and reconstructed into igg format (ter meulen et al., ) , has been reported to cross-react strongly, binding to the rbd of sars-cov- with a k d of . nm (tian et al., ) , whilst not competing with the binding of ace (ter meulen et al., ) . furthermore, although sars-cov escape mutations could be readily generated for ace blocking cr , no escape mutations could be generated for cr , preventing mapping of its epitope (ter meulen et al., ) . furthermore a natural mutation of sars-cov- has now been detected at residue (y n) (gisaid (shu and mccauley, ) : accession id: epi_isl_ wienecke-baldacchino et al., ), which forms part of the ace binding epitope.
finally, cr and cr act synergistically to neutralise sars-cov with extreme potency (ter meulen et al., ) . whilst this work was being prepared for publication a paper reporting that cr does not neutralise sars-cov- and describing the structure of the complex with the rbd at . Å resolution was published (yuan et al., ) . here we extend the structure analysis to significantly higher resolution and, using a different neutralisation assay, show that cr does neutralise sars-cov- , but via a mechanism that would not be detected by the method of yuan et al (yuan et al., ) . we use cryo-em analysis of the interaction of cr with the full spike ectodomain to confirm this mechanism. taken together these observations suggest that the cr epitope should be a major target for therapeutic antibodies. to understand how cr works we first investigated the interaction of cr fab with isolated recombinant sars-cov- rbd, both alone and in the presence of ace . surface plasmon resonance (spr) measurements (methods and figure s ) confirmed that cr binding to rbd is strong (although weaker than the binding reported to sars-cov (ter meulen et al., ) ), with a slight variation according to whether cr or rbd is used as the analyte (k d = nm and nm respectively, derived from the kinetic data in table s ). an independent measure using bio-layer interferometry (bli) with rbd as analyte gave a k d of nm (methods and figure s ). these values are quite similar to those reported by tian et al. (tian et al., ) ( . nm), whereas weaker binding (k d ~ nm) was reported recently by yuan et al. (yuan et al., ) . using spr to perform a competition assay revealed that the binding of ace to the rbd is perturbed by the presence of cr ( figure s ). the presence of ace slows the binding of cr to rbd and accelerates the dissociation. similarly, the release of ace from rbd is accelerated by the presence of cr . these observations are suggestive of an allosteric effect between ace and cr . 
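the relationship between the kinetic rate constants and the k d values quoted above can be made explicit. for a simple 1:1 binding model, k d is the ratio of the dissociation rate constant to the association rate constant. the sketch below uses hypothetical rate constants chosen only for illustration; they are not the values in the supplementary tables, and the function names are assumptions of this example.

```python
def dissociation_constant_nM(k_on_per_M_per_s, k_off_per_s):
    """K_D in nM for a 1:1 interaction: K_D = k_off / k_on (converted from M to nM)."""
    if k_on_per_M_per_s <= 0 or k_off_per_s < 0:
        raise ValueError("k_on must be positive and k_off non-negative")
    return k_off_per_s / k_on_per_M_per_s * 1e9


def fraction_bound(analyte_nM, kd_nM):
    """Equilibrium occupancy of the immobilised partner at a given analyte concentration."""
    return analyte_nM / (analyte_nM + kd_nM)


# Hypothetical rate constants, for illustration only.
kd = dissociation_constant_nM(k_on_per_M_per_s=1e5, k_off_per_s=1e-3)
print(f"K_D = {kd:.1f} nM")
print(f"occupancy at 10x K_D = {fraction_bound(10 * kd, kd):.2f}")
```

in this picture, a competitor that slows association or accelerates dissociation, as described for ace and cr in the competition assay, raises the apparent k d.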
a plaque reduction neutralisation test using sars-cov- virus and cr showed an nd of : for a starting concentration of mg/ml (calculated according to grist (grist, ) ), superior to that of mers convalescent serum (nd of : ) used as a nibsc international standard positive control (see methods and table s ). this corresponds to % neutralisation at ~ nm (~ . ug/ml). this is similar to the neutralising concentration ( % neutralisation at ug/ml) reported by ter meulen et al. (ter meulen et al., ) for sars-cov, however, as discussed below, it is in apparent disagreement with the result reported recently by yuan et al. (yuan et al., ) . we determined the crystal structure of the sars-cov- rbd-cr fab complex (see methods and table s ) to investigate the relationship between the binding epitopes of ace and cr . crystals grew rapidly and consistently. two crystal forms grew in the same drop. the solvent content of the crystal form solved first was unusually high (ca %) with the ace binding site exposed to large continuous solvent channels within the crystal lattice ( figure s ). these crystals therefore offer a promising vehicle for crystallographic screening to identify potential therapeutics that could act to block virus attachment. the current analysis of this crystal form is at . Å resolution and so, to avoid overfitting, refinement used a novel real-space refinement algorithm to optimise the phases (vagabond, hmg unpublished, see methods). this, together with the favourable observation to parameter ratio resulting from the exceptionally high solvent content, meant that the map was of very high quality, allowing reliable structural interpretation ( figure s , methods). full interpretation of the detailed interactions between cr and the rbd was enabled by the second crystal form which diffracted to high resolution, . Å, and the structure of which was refined to give an r-work/r-free of . / . and good stereochemistry (methods, table s , figure s ). 
the high-resolution structure is shown in figure a . there are two complexes in the crystal asymmetric unit with residues - in one rbd, - and - in the other rbd well defined, whilst residues - of the cr heavy chains are disordered. the rbd has a very similar structure to that seen in the complex of sars-cov- rbd with ace , rmsd for ca atoms of . Å (pdb, m j (lan et al., ) ), and an rmsd of . Å compared to the sars cov rbd (pdb, ajf (li et al., ) ). only minor conformational changes are introduced by binding to cr , at residues - . the rbd was deglycosylated (methods) to leave a single saccharide unit at each of the n-linked glycosylation sites clearly seen at n and n ( figure s ). cr attaches to the rbd surface orthogonal to the ace receptor binding site. there is no overlap between the epitopes and indeed both the fab and ace ectodomain can bind without clashing ( figure d ) (tian et al., ) . such independence of the ace binding site has been reported recently for another sars-cov- neutralising antibody, d . the fab complex interface buries Å of surface area ( and Å by the heavy and light chains respectively, figure a and figure s ), somewhat more than the rbd-ace interface which covers Å (pdb m j (lan et al., ) ). typical of a fab complex, the interaction is mediated by the antibody cdr loops, which fit well into the rather sculpted surface of the rbd (figure b , c). the heavy chain cdr , and make contacts to residues from α , β and α (residues - ), while two of the light chain cdrs ( and ) interact mainly with residues from the β -α loop, α ( - ) and the α -β loop ( - ) (figures , s , s ). a total of residues from the heavy chain and from the light chain cement the interaction with residues from the rbd. for the heavy chain these potentially form h-bonds and salt bridges, the latter from d and e (cdr ) to k of the rbd. whilst the light chain interface comprises h-bonds and a single salt bridge between e (cdr ) and k of the rbd. 
the binding is consolidated by a number of hydrophobic interactions ( figure s b ). of the residues involved in the interaction are conserved between sars-cov and sars-cov- ( figure b and figure s ). the cr epitope is much more conserved than that of the receptor blocking anti-sars-cov antibody r for which only of the interacting residues are conserved (hwang et al., ) , in-line with the lack of cross reactivity observed for the latter. the reason for the conservation of the cr epitope becomes clear in the context of the complete pre-fusion s structure (pdb ids: vsb (wrapp et al., ) , vxx, vyb (walls et al., ) ) where the epitope is inaccessible ( figure ). when the rbd is in the 'down' configuration the cr epitope is packed tightly against another rbd of the trimer and the n-terminal domain (ntd) of the neighbouring protomer. in the structure of the pre-fusion form of trimeric spike the majority of rbds are 'down', although presumably stochastically one may be 'up' (walls et al., ; wrapp et al., ) . the structure of a sars-cov complex with ace ectodomain shows that this 'up' configuration is competent to bind receptor, and that there are a family of 'up' orientations with significantly different hinge angles (song et al., ) . however, the cr epitope remains largely inaccessible even in the 'up' configuration. modelling the rotation of the rbd required to enable fab interaction in the context of the spike trimer, showed a rotation corresponding to a > ° further declination from the central vertical axis was required, beyond that observed previously (walls et al., ; wrapp et al., ) (figure i ), although this might be partly mitigated by more complex movements of the rbd and if more than one rbd is in the 'up' configuration this requirement would be relaxed somewhat. 
since locking the up state by receptor blocking antibodies is thought to destabilise the pre-fusion state (walls et al., ) binding of cr presumably introduces further destabilisation, leading to a premature conversion to the post-fusion state, inactivating the virus. cr and ace blocking antibodies can bind independently but both induce an 'up' conformation, presumably explaining the observed synergy between binding at the two sites (ter meulen et al., ) . to test if cr binding destabilises the prefusion state of spike, the ectodomain construct described previously (wrapp et al., ) was used to produce glycosylated protein in hek cells (methods). cryo-em screening showed that the protein was in the trimeric prefusion conformation. spike was then mixed with an excess of cr fab and incubated at room temperature, with aliquots being taken at minutes and hours. aliquots were immediately applied to cryo-em grids and frozen (methods). for the minutes incubation, collection of a substantial amount of data allowed unbiased particle picking and d classification which revealed two major structural classes with a similar number in each, (i) the prefusion conformation, and (ii) a radically different conformation (methods , table s and figure s ). detailed analysis of the prefusion conformation led to a structure at a nominal resolution of . Å (fsc = . ), based on a broad distribution of orientations, that revealed the same predominant rbd pattern (one 'up' and two 'down') previously seen (wrapp et al., ) with no evidence of cr binding (figure a , figure s ). analysis of the other major particle class revealed strong preferential orientation of the particles on the grid ( figure s a ). despite this a reconstruction with a nominal resolution of . Å within the plane of the grid, and perhaps Å resolution in the perpendicular direction ( figure s b ), could be produced which allowed the unambiguous fitting of the cr -rbd complex (figure b ). 
note that in addition there is less well defined density attached to the rbd, in a suitable position to correspond to the spike n-terminal domain (wrapp et al., ) . these structures are no longer trimeric, rather two complexes associate to form an approximately symmetric dimer (however, application of this symmetry in the reconstruction process did not improve the resolution). the interactions responsible for dimerisation involve the ace binding site on the rbd and the elbow of the fab, however the interaction does not occur in our low-resolution crystal form and is therefore probably extremely weak and not biologically significant. since conversion to the post-fusion conformation leads to dissociation of s (which includes the n-terminal domain and rbd) these results confirm that cr destabilises the prefusion spike conformation. further evidence of this is provided by analysis of data collected after h incubation. by this point there were no intact trimers remaining and a heterogeneous range of oligomeric assemblies had appeared, which we were not able to interpret in detail but which are consistent with the lateral assembly of fab/rbd complexes ( figure s ). note that the relatively slow kinetics will not be representative of events in vivo, where the conversion might be accelerated by the elevated temperature and the absence of the mutations which were added to this construct to stabilise the prefusion state (kirchdoerfer et al., ; pallesen et al., ; wrapp et al., ) . until now the only documented mechanism of neutralisation of coronaviruses has been through blocking receptor attachment. in the case of sars-cov this is achieved by presentation of the rbd of the spike in an 'up' conformation. although not yet confirmed for sars-cov- it is very likely that a similar mechanism can apply.
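a rough half-life for loss of the prefusion trimer can be estimated from the fraction of particles that remain trimeric at a given incubation time. the sketch below assumes simple first-order decay, which is an assumption of this example rather than something established by the cryo-em data; the function names and the time points and fractions used are hypothetical.

```python
import math


def half_life(fraction_remaining, elapsed_time):
    """Half-life under first-order decay, f(t) = exp(-k t), so t_half = t ln2 / ln(1/f)."""
    if not 0.0 < fraction_remaining < 1.0:
        raise ValueError("fraction_remaining must lie strictly between 0 and 1")
    return elapsed_time * math.log(2) / math.log(1.0 / fraction_remaining)


def fraction_remaining_at(time, t_half):
    """Fraction of intact trimer expected after a given time for a known half-life."""
    return 0.5 ** (time / t_half)


# Hypothetical observation: about half of the particles are still prefusion
# trimers at the first time point (time in arbitrary units).
t_half = half_life(fraction_remaining=0.5, elapsed_time=1.0)
print(f"estimated half-life: {t_half:.2f} time units")
print(f"expected intact fraction after 5 half-lives: {fraction_remaining_at(5 * t_half, t_half):.3f}")
```

with roughly equal numbers of prefusion and converted particles at the first time point, the half-life is close to that incubation time, and few intact trimers would be expected after several such intervals.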
here we define a second class of neutralisers, that bind a highly conserved epitope ( figure s ) and can therefore act against both sars-cov and sars-cov- (cr was first identified as a neutralising antibody against sars-cov (ter meulen et al., ) ). we find that binding of cr to the isolated rbd is tight (~ nm) and the crystal structure of the complex reveals the atomic detail of the interaction. despite the spatial separation of the cr and ace epitopes we find an allosteric effect between the two binding events. the role of the cr epitope in stabilising the prefusion spike trimer explains why it has, to date, proved impossible to generate mutations that escape binding of the antibody (ter meulen et al., ) . whilst in our assay cr neutralises sars-cov- , a recent paper (yuan et al., ) reported an alternative assay that did not detect neutralisation. the difference is likely due to their removal of the antibody/virus mix after adsorption to the indicator cells, before incubating to allow cytopathic effect (cpe) to develop. this would be in-line with the distinction previously seen between neutralisation tests for influenza virus by antibodies which bind the stem of hemagglutinin and therefore do not block receptor binding (thomson et al., ) . these antibodies did not appear to be neutralising when tested with the standard who neutralisation assay, in which a similar protocol is used to that adopted by yuan et al, in which the inoculum of virus/antibody is washed out before development of cpe. neutralisation was observed, however, when the antibodies were left in the assay during incubation to produce cpe. by analogy we would expect antibodies to the rbd that block attachment to ace to behave in a similar way to antibodies against the globular head of ha, whilst antibodies such as cr , that neutralise by an alternative mechanism to blocking receptor attachment, may need to be present throughout the incubation period with the indicator cells to reveal neutralisation. 
this agrees with our observation that, in the absence of ace , the cr fab destroys the prefusion-stabilised trimer (t / ~ h at room temperature as measured by cryo-em). with monoclonal antibodies now recognised as potential antivirals (lu et al., b; salazar et al., ) our results suggest that cr may be of immediate utility, since the mechanism of neutralisation will be unusually resistant to virus escape. in contrast antibodies which compete with ace (whose epitope on sars-cov- is reported to have already shown mutation at residue (gisaid: accession id: epi_isl_ wienecke-baldacchino et al., (shu and mccauley, ) ), are likely to be susceptible to escape. furthermore, with knowledge of the detailed structure of the epitope presented here a higher affinity version of cr might be engineered. alternatively, since the same mechanism of neutralisation is likely to be used by other antibodies, a more potent monoclonal antibody targeting the same epitope might be found (for instance by screening for competition with cr ). additionally, since this epitope is sterically and functionally independent of the well-established receptor-blocking neutralising antibody epitope there is considerable scope for therapeutic synergy between antibodies targeting the two epitopes (indeed this type of to further validate the spr results the k d of fab cr for rbd was also measured by bio-layer interferometry. kinetic assays were performed on an octet red e (fortebio) at ℃ with a shake speed of rpm. fab cr was immobilized onto amine reactive nd generation (ar g) biosensors (fortebio) and serially diluted rbd ( , , , and nm) was used as analyte. pbs (ph . ) was used as the assay buffer. recorded data were analysed using the data analysis software ht v . (fortebio), with a global : fitting model. neutralising virus titres were measured in serum samples that had been heat-inactivated at °c for minutes. 
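the global 1:1 fitting model mentioned above for the bio-layer interferometry data can be illustrated with a minimal sketch. it is not the fortebio analysis software: the function names and the rate constants in the example are hypothetical, and the code only evaluates the standard 1:1 (langmuir) association and dissociation curves in which one pair of rate constants is shared across all analyte concentrations.

```python
import math


def association(t, conc, ka, kd, rmax):
    """Response at time t (s) during association for a 1:1 binding model.

    conc: analyte concentration (M); ka: association rate (1/(M*s));
    kd: dissociation rate (1/s); rmax: response at saturation.
    """
    k_obs = ka * conc + kd                   # observed rate constant
    r_eq = rmax * conc / (conc + kd / ka)    # plateau response, KD = kd / ka
    return r_eq * (1.0 - math.exp(-k_obs * t))


def dissociation(t, r0, kd):
    """Response t seconds after the start of dissociation, starting from r0."""
    return r0 * math.exp(-kd * t)


def equilibrium_constant(ka, kd):
    """Equilibrium dissociation constant KD (M)."""
    return kd / ka


if __name__ == "__main__":
    # Hypothetical rate constants, chosen only to illustrate the curve shapes.
    ka, kd, rmax = 1.0e5, 1.0e-3, 100.0
    for conc in (5e-9, 1e-8, 2e-8):
        print(conc, round(association(300.0, conc, ka, kd, rmax), 2))
```

in a global fit, ka, kd and rmax would be optimised simultaneously against the curves recorded at every analyte concentration, rather than fitted curve by curve.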
sars-cov- (strain victoria/ / at cell passage (caly et al., )) was diluted to a concentration of . e+ pfu/ml ( pfu/ µl) and mixed : in % fcs/mem containing mm hepes buffer with doubling serum dilutions from : to : in a -well v-bottomed plate. the plate was incubated at °c in a humidified box for hour to allow the antibody in the serum samples to neutralise the virus. cr (ph . ) at a starting concentration of mg/ml was diluted in . the dilutions were then made -fold up to . the neutralised virus was transferred into the wells of a twice dpbs-washed plaque assay -well plate that had been seeded with vero/hslam the previous day at . e+ cells per well in % fcs/mem. neutralised virus was allowed to adsorb at °c for a further hour, and overlaid with plaque assay overlay media ( x mem/ . % cmc/ % fcs final). after days of incubation at °c in a humidified box, the plates were fixed, stained and plaques counted. dilutions and controls were performed in duplicate. median neutralising titres (nd ) were determined using the spearman-kärber formula (kärber, ) relative to virus only control wells.
purified and deglycosylated rbd and cr fab were concentrated to . mg/ml and mg/ml respectively, and then mixed in an approximate molar ratio of : . crystallization screen experiments were carried out using the nanolitre sitting-drop vapour diffusion method in -well plates as previously described (walter et al., , transmission). data were indexed, integrated and scaled with the automated data processing program xia -dials (winter, ; winter et al., ). the data set of ° was collected from a single frozen crystal to . Å resolution with -fold redundancy. the crystal belongs to space group p with unit cell dimensions a = b = . Å and c = . Å.
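the spearman-kärber estimate of the median neutralising titre used in the plaque assay above can be sketched as follows. this is a minimal illustration, not the authors' script: the function name, the clipping of plaque reduction to the range 0 to 1, and the example counts are assumptions, and the estimator requires that the dilution series runs from complete neutralisation to none.

```python
import math


def spearman_karber_nd50(reciprocal_dilutions, plaque_counts, control_count):
    """Median neutralising titre (reciprocal dilution) by Spearman-Karber.

    reciprocal_dilutions: ascending values, e.g. 20 for a 1:20 dilution.
    plaque_counts: mean plaque count at each dilution.
    control_count: mean plaque count in the virus-only control wells.
    """
    if len(reciprocal_dilutions) != len(plaque_counts) or len(plaque_counts) < 2:
        raise ValueError("need matching series with at least two dilutions")
    x = [math.log10(d) for d in reciprocal_dilutions]
    # Fraction of plaques neutralised relative to the virus-only control.
    p = [min(1.0, max(0.0, 1.0 - c / control_count)) for c in plaque_counts]
    if p[0] < 1.0 or p[-1] > 0.0:
        raise ValueError("series must span complete to zero neutralisation")
    # Weight the midpoint of each log-dilution interval by the drop in
    # neutralised fraction across that interval.
    log_nd50 = sum(
        (p[i] - p[i + 1]) * (x[i] + x[i + 1]) / 2.0 for i in range(len(x) - 1)
    )
    return 10.0 ** log_nd50


if __name__ == "__main__":
    # Hypothetical doubling dilution series and plaque counts.
    print(spearman_karber_nd50([20, 40, 80, 160, 320], [0, 0, 25, 50, 50], 50))
```

for equally spaced log dilutions this interval form reduces to the familiar expression log nd50 = x1 - d/2 + d·Σp, where x1 is the log of the first reciprocal dilution and d the log dilution step.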
the structure was determined by molecular replacement with phaser (mccoy et al., ) using search models of human germline antibody fabs - /o (pdb id, kmt (teplyakov et al., ) ) heavy chain and ighv - /igk - (pdb id, i d (teplyakov et al., ) ) light chain, and rbd of sars-cov- rbd/ace complex (pdb id, m j (lan et al., ) ). there is one rbd/cr complex in the crystal asymmetric unit, resulting in a crystal solvent content of ~ %. during optimization of the crystallization conditions, a second crystal form was found to grow in the same condition with similar morphology. a data set of ° rotation with data extending to . Å was collected on beamline i of diamond from one of these crystals (exposure time . s per . ° frame, beam size × μ m and % beam transmission). the crystal also belongs to space group p but with significantly different unit cell dimensions (a = b = . Å and c = . Å). there were two rbd/cr complexes in the asymmetric unit and a solvent content of ~ %. the initial structure was determined using the lower resolution data from the first crystal form. data were excluded at a resolution below Å as these fell under the beamstop shadow. one cycle of refmac (murshudov et al., ) was used to refine atomic coordinates after manual correction in coot (emsley and cowtan, ) figure s ). the final refined structure had an r work of . (r free , . ) for all data to . Å resolution. this structure was later used to determine the structure of the second crystal form, which has been refined with phenix (liebschner et al., ) to r work = . and r free = . for all data to . Å resolution. this refined model revealed the presence of one extra residue at each heavy chain n-terminus and extra residues at the n-terminus of one rbd from the signal peptide. there is well ordered density for a single glycan at each of the glycosylation sites at n and n in one rbd, and only one at n in the second rbd. data collection and structure refinement statistics are given in table s . 
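the solvent contents quoted above for the two crystal forms are the kind of figure usually obtained from the matthews coefficient. the sketch below assumes that standard approach (the text does not state how the values were computed); the function names and the example cell are hypothetical, and the constant 1.23 assumes a typical protein partial specific volume of about 0.74 cm³/g.

```python
import math


def unit_cell_volume(a, b, c, alpha=90.0, beta=90.0, gamma=90.0):
    """Volume (cubic angstroms) of a general unit cell; angles in degrees."""
    ca, cb, cg = (math.cos(math.radians(v)) for v in (alpha, beta, gamma))
    return a * b * c * math.sqrt(
        1.0 - ca * ca - cb * cb - cg * cg + 2.0 * ca * cb * cg
    )


def matthews_coefficient(volume, asu_per_cell, mw_da, copies_per_asu=1):
    """Matthews coefficient Vm (cubic angstroms per dalton).

    asu_per_cell: number of asymmetric units in the cell (set by the space group).
    mw_da: molecular weight of one complex in daltons.
    """
    return volume / (asu_per_cell * copies_per_asu * mw_da)


def solvent_fraction(vm):
    """Approximate solvent fraction of the crystal from Vm."""
    return 1.0 - 1.23 / vm


if __name__ == "__main__":
    # Hypothetical cell and molecular weight, for illustration only.
    v = unit_cell_volume(100.0, 100.0, 100.0)
    vm = matthews_coefficient(v, asu_per_cell=4, mw_da=50000.0)
    print(round(vm, 2), round(solvent_fraction(vm), 3))
```

doubling copies_per_asu for the same cell halves vm, which is why a second crystal form with two complexes per asymmetric unit and a different cell can have a markedly different solvent content.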
structural comparisons used shp (stuart et al., ) , residues forming the rbd/fab interface were identified with pisa (krissinel and henrick, ) , figures were prepared with pymol (the pymol molecular graphics system, version . r pre, schrödinger, llc). purified spike protein was buffer exchanged into mm tris ph . , mm nacl, . % nan buffer using a desalting column (zeba, thermo fisher μ m and at a nominal magnification of x , , corresponding to a calibrated pixel size of . Å/pixel, see table s . cryo-em data processing for both the minute and h incubation datasets, motion correction and alignment of x binned super-resolution movies was performed using relion . . ctf-estimation with gctf (v . ) (zhang, ) and non-template-driven particle picking was then performed within cryosparc v . . -live followed by multiple rounds of d classification (punjani et al., ) . for the minutes dataset. d class averages for structure-a and structure-b were then used separately for template-driven classification before further rounds of d and d classification with c symmetry. both structures were then sharpened in cryosparc. data processing and refinement statistics are given in table s . an initial model for the spike (structure-a) was generated using pdb id, vyb (walls et al., ) and rigid body fitted into the final map using coot (emsley and cowtan, ) . the model was further refined in real space with phenix (liebschner et al., ) which resulted in a correlation coefficient of . . two copies of rbd-cr were fitted into structure-b in the same manner. because of the strongly anisotropic resolution the overall correlation coefficient vs the model was lower ( . ). for the h incubation dataset, particles were extracted with a larger box size ( pixels as compared to pixels), and, following multiple rounds of d classification, d class averages from 'blob-picked' particles showing signs of complete 'flower-like' structures were selected for ab initio reconstruction. 
for the h data no detailed fitting was attempted.
t/e (red, negative; blue, positive).
real estimates of mortality following covid- infection
isolation and rapid sharing of the novel coronavirus (sar-cov- ) from the first patient diagnosed with covid- in australia
coot: model-building tools for molecular graphics
diagnostic methods in clinical virology. x
structural basis of neutralization by a human anti-severe acute respiratory syndrome spike protein antibody, r
beitrag zur kollektiven behandlung pharmakologischer reihenversuche
stabilized coronavirus spikes are resistant to conformational changes induced by receptor recognition or proteolysis
inference of macromolecular assemblies from crystalline state
structure of the sars-cov- spike receptor-binding domain bound to the ace
structural biology: structure of sars coronavirus spike receptor-binding domain complexed with receptor
macromolecular structure determination using x-rays, neutrons and electrons: recent developments in phenix
genomic characterisation and epidemiology of novel coronavirus: implications for virus origins and receptor binding
development of therapeutic antibodies for the treatment of diseases
phaser crystallographic software
a sars-like cluster of circulating bat coronaviruses shows potential for human emergence
human monoclonal antibody combination against sars coronavirus: synergy and coverage of escape mutants
refmac for the refinement of macromolecular crystal structures
a pipeline for the production of antibody fragments for structural studies using transient expression in hek t cells
the production of glycoproteins by transient expression in mammalian cells
hek cells: an alternative to e. coli for the production of secreted and intracellular mammalian proteins
immunogenicity and structures of a rationally designed prefusion mers-cov spike antigen
immunopathogenesis of coronavirus infections: implications for sars
cryosparc: algorithms for rapid unsupervised cryo-em structure determination
dynamical asymmetry exposes -ncov prefusion spike
antibody therapies for the prevention and treatment of viral infections
gisaid: global initiative on sharing all influenza data - from vision to reality
cryo-em structure of the sars coronavirus spike glycoprotein in complex with its host cell receptor ace
crystal structure of cat muscle pyruvate kinase at a resolution of . Å
potent neutralization of severe acute respiratory syndrome (sars) coronavirus by a human mab to s protein that blocks receptor association
antibody modeling assessment ii
structural diversity in a human antibody germline library
pandemic h n influenza infection and vaccination in humans induces cross-protective antibodies that
potent binding of novel coronavirus spike protein by a sars coronavirus-specific human monoclonal antibody
immunization with sars coronavirus vaccines leads to pulmonary immunopathology on challenge with the sars virus
unexpected receptor functional mimicry elucidates activation of coronavirus fusion
function, and antigenicity of the sars-cov- spike glycoprotein
a procedure for setting up high-throughput nanolitre crystallization experiments. i. protocol design and validation
a procedure for setting up high-throughput nanolitre crystallization experiments. crystallization workflow for initial screening, automated storage, imaging and optimization
molecular mechanism for antibody-dependent enhancement of coronavirus entry
a human monoclonal antibody blocking sars-cov- infection
structural and functional basis of sars-cov- entry by using human ace
xia : an expert system for macromolecular crystallography data reduction
dials: implementation and evaluation of a new integration package
cryo-em structure of the -ncov spike in the prefusion conformation
structural basis for the recognition of the sars-cov- by full-length human ace
a highly conserved cryptic epitope in the receptor-binding domains of sars-cov- and sars-cov
gctf: real-time ctf determination and correction
potent cross-reactive neutralization of sars coronavirus isolates by human monoclonal antibodies
rbds in the down conformation (generated by superposing our rbd structure on the prefusion trimer of ref (wrapp et al., )). the viral membrane would be at the bottom of the picture. all of s and s are shown in yellow, apart from the rbd, which is shown in grey, with the cr epitope coloured green. a, a cut-away of the trimer showing, in red, the dipeptide (residues - ) which has been mutated to pp to confer stability on the pre-fusion state. note the proximity to the cr epitope. c, showing a top view of the molecule (also used for panels d-f). one of the rbds has been drawn in light grey in the down configuration and hinged up in dark grey, using the motion about the hinge axis observed for several coronavirus spikes, but extending the motion sufficiently to allow cr to bind. the pp motif is shown in red and the glycosylated residue n in magenta. panels d-f show the trimer viewed from above: d, all rbds down; e, one rbd up; f, one rbd rotated (as in c) to allow access to cr . panels g-i are equivalent structures to d-f, but are viewed from the side. in e bound ace is shown and in f cr .
key: cord- -qg jwbes authors: vadlamani, b. s.; uppal, t.; verma, s.
c.; misra, m. title: functionalized tio nanotube-based electrochemical biosensor for rapid detection of sars-cov- date: - - journal: nan doi: . / . . . sha: doc_id: cord_uid: qg jwbes
the coronavirus disease (covid- ) is a newly emerging viral disease caused by severe acute respiratory syndrome coronavirus (sars-cov- ). the rapid increase in the number of covid- cases worldwide led the who to declare a pandemic within a few months of the first case of infection. due to the lack of a prophylactic measure to control the virus infection and spread, early diagnosis and quarantining of infected as well as asymptomatic individuals are necessary for the containment of this pandemic. however, the current methods for sars-cov- diagnosis are expensive and time consuming, although some promising and inexpensive technologies are becoming available for emergency use. in this work, we report the synthesis of a cheap yet highly sensitive cobalt-functionalized tio nanotubes (co-tnts)-based electrochemical biosensor and its efficacy for rapid detection of the spike glycoprotein of sars-cov- , using s-rbd protein as the reference material. a simple, low-cost, one-step electrochemical anodization route was used to synthesize tnts, followed by an incipient wetting method for cobalt functionalization of the tnt platform, which is connected to a potentiostat for data collection. the sensor specifically detected the s-rbd protein of sars-cov- even at very low concentration (range of nm to nm). additionally, our sensor showed a linear response to viral protein concentration. in summary, our co-tnt sensor is highly effective in detecting sars-cov- s-rbd protein in approximately seconds, and can be explored for developing point-of-care diagnostics for rapid detection of sars-cov- in nasal secretions or saliva samples. sars-cov- is currently a global pandemic on a scale that has not been experienced since the influenza pandemic.
one of the reasons why this pandemic virus has spread so quickly is that many individuals infected with sars-cov- remain asymptomatic and unknowingly transmit the virus before they develop symptoms. therefore, uniform surveillance and quarantining of infected as well as asymptomatic individuals could provide an effective measure to contain the spread of sars-cov- . however, the current methods for sars-cov- diagnosis are expensive and time consuming, although some inexpensive technologies are receiving approval for emergency use. our manuscript reports the synthesis of a cheap yet highly sensitive cobalt-functionalized tio nanotubes (co-tnts)-based electrochemical biosensor for rapid detection of the spike glycoprotein of sars-cov- . our sensor is synthesized through a one-step electrochemical anodization route, followed by an incipient wetting method for cobalt functionalization of the tnt platform. the readout of this sensor is an electrochemical signal collected through a potentiostat, which can be adapted for use with smartphone applications and for the development of point-of-care diagnostics for covid- .
the current outbreak of novel coronavirus (ncov- or sars-cov- ) was first detected in wuhan, china in december , but quickly spread to other parts of china as well as to the entire world, causing a pandemic [ ] . according to the who, as of th august , around , , people had been infected, and , people had died due to sars-cov- infection [ ] . sars-cov- infection causes a variety of symptoms including fever, cough and respiratory distress, which are collectively called coronavirus disease or covid- [ ] . the transmission of sars-cov- primarily occurs from person to person through close contact or via small droplets produced during coughing, sneezing, and talking [ ] [ ] .
the incubation period for sars-cov- is around - days, with no noticeable symptoms; however, viral transmission from an infected person to a non-infected person is still possible during this asymptomatic period [ ] . under the current scenario, with no vaccines on the market, global lockdown regulations are in place in order to minimize the viral spread. additionally, the pandemic has had a severe socio-economic impact on the world economy and raised fears of a global recession [ ] . currently, the real-time reverse-transcriptase polymerase chain reaction (rt-pcr) technique is the most common and reliable laboratory testing method for qualitative/quantitative sars-cov- detection [ ] [ ] , followed by the serum virus neutralization assay (svna) for the determination of antibody neutralization [ ] and enzyme-linked immunoassays (elisa) for the detection of antibody against sars-cov- [ ] . however, the major limitations of these laboratory-based diagnostic tests are their invasive nature, which often requires trained personnel for nasopharyngeal sample collection, along with the requirement for highly sophisticated machines, cross-reactivity with other viruses, and the longer duration of testing. in order to contain the viral spread, surveillance of even asymptomatic individuals is needed, which is feasible only after the development of a simple, portable and rapid point-of-use sensor for the detection of sars-cov- .
the copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medrxiv a license to display the preprint in perpetuity. it is made available under a cc-by-nc-nd . international license. this version posted september , . . https://doi.org/ . / . . . doi: medrxiv preprint
sars-cov- has a positive-sense, single-stranded rna (~ k bp) genome with orfs that encode structural, replication and non-structural proteins [ ] .
similar to its genetic cousin, human sars-cov, sars-cov- consists of four structural proteins, viz. spike (s), envelope (e), membrane (m), and nucleocapsid (n). coronaviruses are named for the crown-like spike glycoprotein, s (composed of two subunits: the s subunit and s subunit), on the surface/envelope [ ] . the s subunit of the s protein contains a receptor binding domain (rbd) that has a high binding affinity towards the host angiotensin-converting enzyme ii (ace ) receptor present on human cells, and the s subunit mediates virus-host cell fusion and entry [ ] . importantly, the s protein is highly immunogenic and induces an immune response that produces neutralizing antibodies as well as t-cell responses in sars-cov- infected individuals [ ] . functionally, binding of the s-rbd to the hace receptor is crucial for the entry of sars-cov- into human cells. intriguingly, sars-cov- s-rbd shares only % sequence identity with sars-cov s-rbd, which has been evaluated for vaccines and therapeutic drug development [ ] . hence, the s-rbd of sars-cov- is an excellent target for diagnostic and therapeutic interventions. electrochemical biosensors are advantageous for sensing biomolecules because of their ability to detect biomarkers with accuracy, specificity and high sensitivity [ ] . electrochemical biosensors have been successfully used in medical diagnostics for the detection of viruses such as middle east respiratory syndrome coronavirus (mers-cov) [ ] , human enterovirus (ev ) [ ] , human influenza a virus h n [ ] , and avian influenza virus (aiv) h n [ ] . lahyquah et al. [ ] used an array of carbon electrodes modified with gold nanoparticles for the detection of mers-cov. very recently, a biosensor using gold nanoparticle decorated fto glass immobilized
with ncovid- monoclonal antibody was reported for the detection of sars-cov- [ ] . the functionality of the electrochemical biosensor can be further improved by nanostructuring the electrode, as this increases the electrochemical reaction rate through a higher electrode surface area to volume ratio, thereby increasing the ratio of electrode surface area to analyte fluid volume. in the work by chin et al. on the encephalitis virus, it was found that nanostructuring of carbon electrodes with carbon nanoparticles increased the current response by % due to enhanced electron charge-transfer kinetics [ ] . similarly, we have reported that co functionalized tio nanotubes (co-tnts) with a higher surface-to-volume ratio can detect the biomarkers associated with tuberculosis [ ] [ ] . the proposed sensing mechanism involves the formation of a complex between co and the biomarker at a specific bias voltage, due to the reduction of co ions and oxidation of the biomarker. similarly, we hypothesized that the s-rbd of sars-cov- can be detected through complexing of functionalized nanoparticles with the s-rbd protein; a schematic of viral detection directly from a patient sample is shown in figure . in the current work, we have determined the potential of co-functionalized tio nanotubes (co-tnts) for the electrochemical detection of the s-rbd protein of sars-cov- . tnts were synthesized by a simple, cost-effective, one-step electrochemical anodization route, and co functionalization was carried out using the incipient wetting method. our data showed that cobalt functionalized tnts could selectively detect the s-rbd protein of sars-cov- using the amperometry electrochemical technique in ~ secs.
tnts were synthesized by electrochemical anodization of ti sheet. a ti coupon of size . cm x . cm, with a tab mm in width, was cut out of g grade ti sheet (thickness . mm). one side of the coupon was polished with grit polishing paper for min to remove any surface metal oxide layer. the coupon was ultrasonicated in a : solution of ethanol and acetone for min. the unpolished side was masked with kapton tape to avoid any exposure to electrolyte during anodization. the electrochemical anodization was performed in a standard two-electrode configuration, using ti foil as the working electrode and platinum foil as the counter electrode with a cm gap between them. the anodization was carried out using an electrolyte of composition . ml (ch oh) , ml di h o, and . g nh f in a teflon beaker. the electrolyte was maintained at a subzero temperature and continuously stirred using a magnetic stirrer at a speed of rpm. the anodization was carried out by maintaining a constant voltage of v across the two electrodes for min. after anodization, the sample was rinsed in di h o and baked in an oven at °c for hrs. the kapton tape was removed from the sample after baking, and the sample was annealed in a tube furnace at °c for h in a continuous flow of oxygen. the annealed tnts obtained from the furnace were functionalized with cobalt using an incipient wetting method, i.e. a wet ion exchange process. the same side of the sample that was masked earlier was again masked with kapton tape. the sample was ultrasonicated in a solution containing . g of cocl . h o in ml ethanol for min. the sample was baked in an oven at °c for hrs to obtain cobalt functionalized tnts.
the morphology of the tnts and co-tnts was examined using dual beam scanning electron microscopy (sem, thermofisher scientific). the cobalt content in the co-tnts sample was analyzed using the eds detector attached to the sem. the sem micrographs were analyzed using imagej software.
the electrochemical sensing of s-rbd protein was carried out using a custom-built co-tnt packaged printed circuit board setup. the sensor response was measured using a gamry reference + potentiostat attached to the printed circuit board. the schematic of the whole sensing setup along with the detection methodology is shown in figure . the sensor response at various s-rbd protein concentrations was determined using the amperometry technique, at a bias voltage of - . v. the bias voltage was determined by conducting cyclic voltammetry experiments in the voltage window - v to + v. all the experiments were carried out at room temperature.
the scanning electron microscopy (sem) micrographs of the tnts, prepared by electrochemical anodization, are shown in figure a . the inset shows the side view of the tnts (figure a) . the outer diameter and wall thickness of the tnts were ~ nm and ~ nm, respectively. the average length of the tnts was found to be ~ . µm. in our earlier work, tnts synthesized under similar conditions were found to show predominantly the crystalline anatase phase [ ] .
the surface morphology of the co-tnts examined under sem is shown in figure b . the sem micrograph reveals the presence of precipitates on top of the tnt surface. eds analysis confirmed the uniform distribution of co on top of the tnts, and the co content was found to be ~ wt %. we have previously shown that co exists in the co + state in the form of co(oh) on co-tnts [ ]. therefore, the tnts can be visualized as having a very large surface area, uniformly decorated with co + ions.
the s-rbd protein, the biomarker for sars-cov- detection, was characterized via sds-page (under denaturing conditions). the rbd domain of the spike glycoprotein comprises amino acids - , which is a ~ kda protein with potential n-glycosylation sites. as shown in figure a -b, the sds-page gel of his -tagged s-rbd protein was either stained with simplyblue safestain (figure a ) or transferred to a nitrocellulose membrane and detected with µg/ml of mouse anti-his monoclonal antibody (figure b) , followed by incubation with infrared-dye-tagged secondary ir-dye antibody and scanning with an odyssey infrared scanner. specific bands were detected for sars-cov- s-rbd protein at approximately kda and kda, representative of the monomeric and dimeric forms of s-rbd protein, respectively (fig. b ). we detected the s-rbd at a slightly higher molecular weight (~ kda), possibly because of post-translational modifications including glycosylation. the ability of co-tnt to sense the s-rbd protein of sars-cov- was determined by performing an amperometry experiment at a bias voltage of - . v. the amperometry curves obtained at various concentrations of protein are shown in figure .
the sensor was exposed to protein sec after the beginning of the experiment (marked by an arrow). the sensor response current increased sharply and rapidly once the sensor was exposed to the protein. at a protein concentration of nm (nano molar), the peak sensor current output was found to be ~ . µa (nano ampere). the peak current decreased to ~ . µa at a protein concentration of nm and further decreased to ~ . µa at a protein concentration of nm. the sensor detection time was ~ sec over the concentration range of nm to nm. it is hypothesized that the rapid increase in sensor response current could be attributed to the electrochemically triggered unfolding of the protein that exposes its interior [ ] [ ] [ ] and subsequent complex formation between co and the protein [ ] [ ] [ ] . the average sensor response time, which is defined as the time taken to reach the peak current, was found to be ~ sec. this is very short compared to our earlier studies on the sensor for colorectal cancer, where a sensor response time of ~ sec was documented [ ] . the shorter sensor response time indicates faster reaction kinetics between co-tnt and the protein molecules. the sensor response (sr) was calculated at various protein concentrations from two currents: $I_{\max,\text{protein}}$, the maximum current obtained when the sensor is exposed to sars-cov- s-rbd protein, and $I_{\max,\text{baseline}}$, the maximum current obtained when the sensor is not exposed to the protein. the value of $I_{\max,\text{baseline}}$ was found to be ~ pa (figure ). the sensor responses measured at different protein concentrations are shown in figure .
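the quantities defined above can be extracted from an amperometry trace along the following lines. this is a sketch under stated assumptions: the expression for the sensor response is not recoverable from the text, so the code assumes the common baseline-normalised form (peak current with protein minus baseline peak, divided by the baseline peak); the function names and the example trace are hypothetical.

```python
def peak_current(currents):
    """Current with the largest magnitude in a trace (sign preserved)."""
    return max(currents, key=abs)


def response_time(times, currents, t_exposure):
    """Time from protein exposure to the peak current after exposure."""
    after = [(t, i) for t, i in zip(times, currents) if t >= t_exposure]
    if not after:
        raise ValueError("no samples recorded after exposure")
    t_peak, _ = max(after, key=lambda sample: abs(sample[1]))
    return t_peak - t_exposure


def sensor_response(i_max_protein, i_max_baseline):
    """Baseline-normalised sensor response (ASSUMED form, see lead-in)."""
    if i_max_baseline == 0:
        raise ValueError("baseline current must be non-zero")
    return (i_max_protein - i_max_baseline) / i_max_baseline


if __name__ == "__main__":
    # Hypothetical trace in arbitrary units; protein is added at t = 20 s.
    times = [0, 10, 20, 30, 40]
    currents = [1.0, 1.0, 1.0, 4.0, 2.0]
    i_peak = peak_current(currents)
    print(response_time(times, currents, 20), sensor_response(i_peak, 1.0))
```

normalising by the baseline makes responses comparable between sensors whose blank currents differ, which is one reason this form is often preferred over the raw peak current.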
the sensor response was found to increase with an increase in the concentration of protein. moreover, the sensor response exhibited excellent linearity over the concentration range to nm with a correlation coefficient of r = . . the regressed linear calibration curve for sensor response was obtained as follows: sr = ( . ± . ) log(c) + ( . ± . ); r = . where sr is the sensor response, and c is the concentration of protein in nm. using statistical analysis [ ] , the limit of detection for measurements made using the sensor was determined to be . nm. the limit of detection can be further improved by the use of (i) co-tnt synthesized by the in-situ anodization technique and (ii) co-tnts of even greater length. previously, we found that co-tnt synthesized by in-situ anodization had higher sensor sensitivity than co-tnt synthesized by the incipient wetting route for the detection of tuberculosis biomarkers [ ] . a higher sensor sensitivity corresponds to a better limit of detection and sensitivity of quantitation. the increased sensitivity was attributed to the presence of co(oh) precipitate sites in direct contact with the parent tio , which makes direct conduction possible. the sensor sensitivity can also be improved by using longer co-tnts, as a higher surface area results in a higher reaction rate; a higher sensor response current can thereby be obtained even at lower protein concentrations. in this study, we developed a co-metal functionalized tnt as a sensing material for electrochemical detection of sars-cov- infection through the detection of the receptor binding domain (rbd) of spike glycoprotein.
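the log-linear calibration described above can be sketched in a few lines of python. this is a minimal illustration rather than the authors' analysis: the helper name `fit_log_calibration`, the base-10 logarithm, and the three calibration points are assumptions made for the example, since the fitted coefficients are not recoverable from the text.

```python
import math


def fit_log_calibration(concentrations_nm, responses):
    """Least-squares fit of SR = slope * log10(C) + intercept.

    Returns (slope, intercept, r_squared). Base-10 log is an assumption.
    """
    if len(concentrations_nm) != len(responses) or len(responses) < 2:
        raise ValueError("need at least two paired measurements")
    xs = [math.log10(c) for c in concentrations_nm]  # requires C > 0
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(responses) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("concentrations must not all be equal")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, responses))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # Coefficient of determination of the fitted line.
    ss_res = sum((y - (slope * x + intercept)) ** 2 for x, y in zip(xs, responses))
    ss_tot = sum((y - mean_y) ** 2 for y in responses)
    r_squared = 1.0 - ss_res / ss_tot if ss_tot > 0 else float("nan")
    return slope, intercept, r_squared


# Hypothetical calibration points (concentration in nM, dimensionless SR).
slope, intercept, r2 = fit_log_calibration([1, 10, 100], [2.0, 5.0, 8.0])
print(f"SR = {slope:.2f} * log10(C) + {intercept:.2f}, R^2 = {r2:.3f}")
```

a limit of detection would then be derived from the fitted line and the noise of blank measurements; that step is left out here because the statistical procedure cited in the text is not specified.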
we confirmed the biosensor's potential for clinical application by analyzing the rbd of the spike glycoprotein on our sensor. amperometry electrochemical studies indicated that the sensor could detect the protein in the concentration range nm to nm. the relationship between sensor response and protein concentration was found to be linear, with a limit of detection as low as ~ . nm. importantly, our sensor detected sars-cov- s-rbd protein in a very short time (~ sec), supporting its use in developing a rapid diagnostic assay. thus, our report demonstrates the development of a simple, inexpensive, rapid and non-invasive diagnostic platform that has the potential to detect sars-cov- in clinical specimens including nasal swabs, nasopharyngeal swabs, or saliva. moreover, the developed approach has the potential for diagnosis of other respiratory viral diseases by identifying appropriate metallic elements to functionalize tnts. scv: conceptualization, methodology, project administration, funding, writing - review and editing. the environmental and biological safety committee of the university of nevada, reno, approved methods and techniques used in this study.
novel coronavirus. world heal organ coronavirus disease (covid- ) the species severe acute respiratory syndrome-related coronavirus: classifying -ncov and naming it sars-cov- cluster of sars among medical students exposed to single patient identification of severe acute respiratory syndrome in canada international journal of infectious diseases the sars-cov- outbreak : what we know the socioeconomic implications of the coronavirus pandemic (covid- ): a review molecular diagnosis of a novel coronavirus ( -ncov ) causing an outbreak of pneumonia positive rt-pcr test results in patients recovered from covid- a sars-cov- surrogate virus neutralization test ( svnt ) based on antibody-mediated blockage of ace -spike ( rbd ) protein-protein interaction diagnostic performance of seven rapid igg / igm antibody tests and the euroimmun iga / igg elisa in covid- patients coronavirus infections and immune responses structural and functional properties of sars-cov- spike protein : potential antivirus drug development for covid- characterization of the receptor-binding domain (rbd) of novel coronavirus: implication for development of rbd protein as a viral attachment inhibitor and vaccine.
cellular and molecular immunology immune-mediated approaches against covid- identification of sars-cov rbd-targeting monoclonal antibodies with cross-reactive or neutralizing activity against sars-cov- electrochemical biosensors for pathogen detection an electrochemical immunosensor for the corona virus associated with the middle east respiratory syndrome using an array of gold nanoparticle-modified carbon electrodes biosensors and bioelectronics a colorimetric and electrochemical immunosensor for point-of-care detection of enterovirus biosensors and bioelectronics electrochemical detection of in fl uenza virus h n based on both immunomagnetic extraction and gold catalysis using an immobilization-free screen printed carbon microelectrode biosensors and bioelectronics an impedance immunosensor based on low-cost microelectrodes and speci fi c monoclonal antibodies for rapid detection of avian in fl uenza virus h n in chicken swabs ecovsens-ultrasensitive novel in-house built printed circuit board based electrochemical device for rapid detection of ncovid- carbon nanoparticle modified screen printed carbon electrode as a disposable electrochemical immunosensor strip for the detection of japanese encephalitis virus titania nanotube array sensor for electrochemical detection of four predominate tuberculosis volatile biomarkers anodic functionalization of titania nanotube arrays for the electrochemical detection of tuberculosis biomarker vapors anodic functionalization of titania nanotube arrays for the electrochemical detection of tuberculosis biomarker vapors analysis of redox activity of proteins on the carbon screen printed electrodes chemical-induced unfolding of cofactor-free protein monitored by electrochemistry biochemistry dominant protein electrochemistry: application in medicine. 
a review detection of food decay products using functionalized one-dimensional titania nanotubular arrays detection of four distinct volatile indicators of colorectal cancer using functionalized titania determination of the lower limit of article pdf first page preview authors declare no conflict key: cord- -csgmc authors: george, parakkal jovvian; tai, wanbo; du, lanying; lustigman, sara title: the potency of an anti-mers coronavirus subunit vaccine depends on a unique combinatorial adjuvant formulation date: - - journal: vaccines (basel) doi: . /vaccines sha: doc_id: cord_uid: csgmc vaccination is one of the most successful strategies to prevent human infectious diseases. combinatorial adjuvants have gained increasing interest as they can stimulate multiple immune pathways and enhance the vaccine efficacy of subunit vaccines. we investigated the adjuvanticity of aluminum (alum) in combination with rasp- , a protein adjuvant, using the middle east respiratory syndrome coronavirus mers-cov receptor-binding-domain (rbd) vaccine antigen. a highly enhanced anti-mers-cov neutralizing antibody response was induced when mice were immunized with rasp- and the alum-adjuvanted rbd vaccine in two separate injection sites as compared to mice immunized with rbd + rasp- + alum formulated into a single inoculum. the antibodies produced also significantly inhibited the binding of rbd to its cell-associated receptor. moreover, immunization with rasp- co-administered with the alum-adjuvanted rbd vaccine in separate sites resulted in an enhanced frequency of tfh and gc b cells within the draining lymph nodes, both of which were positively associated with the titers of the neutralizing antibody response related to anti-mers-cov protective immunity. 
our findings not only indicate that this unique combinatorial adjuvanted rbd vaccine regimen improved the immunogenicity of rbd, but also point to the importance of utilizing combinatorial adjuvants for the induction of synergistic protective immune responses. vaccination is one of the most successful strategy to prevent infectious diseases in the human population, including those caused by emerging viruses [ ] . among various vaccine types, such as inactivated virus, live attenuated virus, and viral vector-based vaccines, subunit vaccines using proteins or peptides are believed to be much safer as they do not contain any live virus components and/or cause undesirable severe side effects [ ] . however, unlike attenuated vaccines (composed of a virus or bacterium that replicates within the host) or inactivated vaccines (composed of either heat or chemically-inactivated parts of the pathogen), subunit vaccines (that are derived from known pathogen target antigens) are generally much less immunogenic, which can be improved with the addition of appropriate adjuvant(s) to the vaccine [ ] . adjuvants play an important role in enhancing the potency of subunit vaccines by improving humoral and/or cellular immune responses to the various subunit protein vaccines, decreasing the antigen dosages, and/or reducing immunization regimens [ , ] . aluminum salts (hereinafter alum) c bl/ mice were immunized intramuscularly (i.m.) twice, three weeks apart, with mers-rbd-fd ( µg; hereinafter rbd) formulated with or without alum (alhydrogel ® µg) (invivogen, san diego, ca, usa), rasp- ( µg), or with distinct adjuvant combinations. inoculums were prepared in a final volume of µl per mouse and µl was injected in the caudal thigh muscle of each hind leg in the appropriate site as outlined in table . control mice were injected with pbs in . % sds, the buffer solution of rasp- and is referred to as naive mice hereafter. 
complete adsorption of the rbd and rasp- proteins by alum was confirmed by sds-gel electrophoresis of the unbound protein samples after absorption with alum for min on a rotator at rt. sera samples were collected days post- nd immunization for analyses of anti-mers-cov neutralizing antibody titers, inhibition of mers-cov rbd-dpp receptor binding, and mers-cov rbd-specific antibody responses using mers-cov s as a target. the draining lymph nodes from each hind leg per mouse were also recovered at day post- nd immunization for analyses of the various cell profiles within. the neutralizing activity of sera from the immunized mice against mers-cov infection in vitro was carried out using our established pseudovirus neutralization assay [ , ] . briefly, t cells were co-transfected with a plasmid encoding the s protein of mers-cov (strain emc ) and a plasmid encoding env-defective, luciferase-expressing hiv- genome (pnl - .luc.re). the supernatants containing mers-cov s expressing pseudovirus collected h post-transfection were incubated with serially diluted mouse sera at • c for h. the virus-serum mixtures were then added into huh- cells expressing the mers-cov receptor dpp . the cells were refed with fresh medium h later, and after h, lysed using cell lysis buffer (promega, madison, wi, usa) before the supernatants were transferred into -well luminometer plates. after addition of luciferase substrate (promega, madison, wi, usa), the plates were measured for relative luciferase activity using infinite pro luminator (tecan, männedorf, switzerland). neutralizing activity was calculated using the calcusyn computer program [ ] and is expressed as % pseudovirus neutralizing antibody titers (nt ). sera from immunized mice were tested for their ability to inhibit the binding of the recombinant mers-rbd-fc protein to the cell-associated hdpp receptor in huh- cells using flow cytometry analysis [ ] . 
briefly, cells were incubated with the mers-rbd-fc protein ( µg/ml) in the presence or absence of diluted mouse sera ( : ) for min at room temperature. after three washes and staining with fitc-labeled goat anti-human igg fc secondary antibody ( : , thermo fisher scientific, waltham, ma, usa) for min at room temperature, the cells were measured for fluorescence in a flow cytometer (bd lsrfortessa system). mean fluorescence intensity (mfi) values of the fitc channel from cells incubated with mers-rbd-fc protein in the absence of diluted sera were treated as percentage binding. inhibition of binding was calculated as the percentage of reduced binding to hdpp receptor in the presence of diluted sera from the different immunization groups versus the maximum binding observed in the absence of sera. elisa to measure mers-cov rbd-specific antibody responses in immunized mouse sera was performed with mers-cov s as the target antigen and as previously described with some modifications [ , ] . the mers-cov s subunit of the mers-cov s protein contains the rbd region of the virus. briefly, -well elisa plates were coated with mers-cov s ( µg/ml) overnight at • c, blocked with % fat-free milk in pbs containing tween- (pbst) at • c for h, and then washed with pbst times. the plates were subsequently incubated at • c for h with serially diluted mouse sera, and horseradish peroxidase (hrp)-conjugated anti-mouse igg ( : ), igg ( : ), or igg c ( : ) antibodies (thermo fisher scientific, waltham, ma, usa). the substrate , , , -tetramethylbenzidine (sigma-aldrich, st. louis, mo, usa) was added to the plates after additional washes, and the reaction was stopped by the addition of n h so . absorbance at nm was measured using an elisa plate reader (tecan, männedorf, switzerland). endpoint titers were calculated as the reciprocal of the highest dilution of sera giving an optical density greater than the mean ± times the standard deviation of sera from naïve mice. 
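the two read-outs defined above (percentage inhibition of receptor binding from mfi values, and elisa endpoint titers against a naive-serum cutoff) can be sketched as follows. this is an illustrative sketch, not the authors' analysis code: the function names, the example values, and the multiplier `k` are hypothetical (the multiplier is not recoverable from the text), and the cutoff is assumed to be the naive mean plus `k` standard deviations.

```python
from statistics import mean, stdev


def percent_inhibition(mfi_with_serum, mfi_without_serum):
    """Reduction in binding relative to the no-serum control, in percent.

    The no-serum MFI is treated as maximum (100%) binding.
    """
    if mfi_without_serum <= 0:
        raise ValueError("no-serum MFI must be positive")
    return 100.0 * (1.0 - mfi_with_serum / mfi_without_serum)


def endpoint_titer(dilution_factors, od_values, naive_od_values, k):
    """Reciprocal of the highest dilution whose OD exceeds the cutoff.

    dilution_factors: reciprocal dilutions, e.g. 100 for a 1:100 dilution.
    Cutoff (assumed form): mean + k * SD of naive-serum ODs.
    Returns None if no dilution exceeds the cutoff.
    """
    cutoff = mean(naive_od_values) + k * stdev(naive_od_values)
    positive = [d for d, od in zip(dilution_factors, od_values) if od > cutoff]
    return max(positive) if positive else None


# Hypothetical values for illustration only.
print(percent_inhibition(25.0, 100.0))
print(endpoint_titer([100, 400, 1600, 6400], [1.2, 0.6, 0.2, 0.1],
                     [0.10, 0.12, 0.08], k=2))
```

taking the maximum positive dilution assumes the titration curve decreases monotonically; a noisy curve that dips below the cutoff and rises again would need an explicit rule.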
draining lymph nodes from each hind leg per mouse were harvested at day post- nd immunization. the lymph nodes were dissociated into single cell suspensions using a syringe plunger, then passed through a µm cell strainer and resuspended in complete rpmi media containing % fetal bovine serum (r ). subsequently, . × cells were washed and resuspended in fresh r media in a -well cell culture plate for flow cytometry staining. . × cells were stained with a labeled antibodies: cd -af , cd b-pe-cy , cd c-bv , ly c-percp, cd -apc, ccr -pe, cd -bv , cd -bv , ly g-pe-cy , and b -bv . while × cells were stained with a cocktail of the following fluorescently labeled antibodies: cd -af , cd -pe-cy , cxcr -bv , pd- -bv , b -bv and gl- -af (all from biolegend, san diego, ca, usa) , and cd -bv (bd biosciences, dublin, ireland), in a brilliant violet cell stain buffer (bd biosciences, dublin, ireland) for min in the dark at room temperature. cells were then washed, resuspended in cell stain buffer (biolegend, san diego, ca, usa), and the number of stained cells was acquired using bd lsrfortessa cell analyzer (bd biosciences, dublin, ireland). the data were analyzed using flowjo software (tree star, ashland, or, usa). cd + cd c -ly c + cells were identified as monocytes, cd + cd c -ly c + cd + cells were identified as activated monocytes, cd + cd c -ly c + ccr + cells were identified as migratory monocytes, cd + cd + cells were identified as cd + t cells, cd + cd + cxcr + pd- + cells were identified as tfh cells, cd + b + cells were identified as b cells and cd + b + cd + gl- + cells were identified as gc b cells. one-way anova test with tukey's multiple comparison was used for statistical analysis using graphpad prism v (graphpad, san diego, ca, usa). spearman correlation was performed to determine the association of the fold increase in the frequency of tfh and gc b cells with neutralizing antibody titers using graphpad prism v (graphpad, san diego, ca, usa). p < . : *, p < . 
: **, p < . : ***, p < . : ****. nd: not detectable. to investigate whether rasp- in combination with alum enhances the humoral immune responses induced by the mers-rbd-fd (hereinafter rbd) vaccine, we immunized c bl/ mice twice, three weeks apart, using a formulation where rasp- and the rbd vaccine proteins were completely adsorbed to alum and then administered as a single inoculum (table ; group ). this adjuvanted vaccine was compared to that in which rasp- was co-administered with the alum-adjuvanted rbd vaccine as two inoculums and in two separate sites of the caudal thigh muscle (table ; group ). rbd formulated with either rasp- or alum alone, rbd alone, and pbs alone were included as controls. table . immunization of mice using different combinations and/or formulations of the vaccines and the site of injection: c bl/ mice were immunized intramuscularly (i.m.) with mers-rbd-fd (rbd) formulated with or without alum and/or rasp- alone or together in different combinations and/or formulations. mice were immunized twice, weeks apart according to the various g -g experimental groups either at the front (a: µl of inoculum) and/or the back (b: µl of inoculum) of the caudal thigh muscle in each hind leg. immunization of mice with rasp- and the alum-adjuvanted rbd vaccine in separate sites (g , figure ) significantly increased neutralizing antibody titers against mers-cov infection in vitro, yielding the highest titers of all groups, nt = , . this was approximately four-fold higher than in mice that were vaccinated with rbd + rasp- + alum in a single inoculum (g − nt = ; figure ), and ~ -, ~ -, and -fold higher when compared to the rasp- -adjuvanted rbd vaccine, the alum-adjuvanted rbd vaccine, and rbd only, respectively (g − nt = , g − nt = , g − nt = , respectively; figure ). it appears that this unique combination of rasp- and alum adjuvants promoted synergy in the functional humoral response produced vs.
the vaccines that utilized the other formulations and/or regimens, including the rasp- -adjuvanted rbd vaccine. (table and x-axis legend). sera samples were collected days post- nd immunization and analyzed for neutralization of the pseudotyped mers-cov. the data represent the mean and standard error (sem) of nt titers from at least two independent experiments with to mice per group. "+" indicates the presence and "−" indicates the absence of the protein or adjuvants in the formulation. statistics were performed using one-way anova with tukey's multiple comparison. p < . : ***, p < . : ****, nd: not detectable. when the total igg response to the mers-rbd antigen was studied using the mers-cov s protein as the target protein, we found that although the rbd-specific total igg antibody titers were ~ times higher in mice that were vaccinated by co-administering rasp- and the alum-adjuvanted rbd vaccine in separate sites (g - , end point titer), they were not significantly different from those elicited by immunization with rbd + rasp- + alum administered in a single inoculum (g - , end point titer), or with the alum-adjuvanted rbd vaccine (g - , end point titer; figure s a ). nevertheless, the co-administration of rasp- and the alum-adjuvanted rbd vaccine in separate sites significantly increased the rbd-specific total igg antibody titers by ~ -fold compared to the rasp- -adjuvanted mers-rbd vaccine or ~ -fold compared to the rbd vaccine alone (g - , , g - and g - end point titers respectively; figure s a ), clearly showing that immunization using the unique rasp- and alum combinatorial adjuvants had a beneficial effect in comparison to the rasp- (~ fold) or the alum (~ fold) adjuvanted vaccines.
to elucidate the igg subtypes induced in the different immunization groups, we also analyzed the rbd-specific igg and igg c antibody titers. we observed that the highest titer of igg antibodies was induced when the rbd vaccine was formulated with alum alone or with rasp- + alum in one inoculum (g - , and g - , end point titers respectively; figure s b ), with the titers being significantly higher when the vaccine was with both adjuvants and in one inoculum. the rbd-specific igg titers were significantly decreased when rasp- was used as an adjuvant in aqueous formulations; when the rasp- and the alum-adjuvanted rbd were co-administered in separate sites (~ -fold decrease; g - , end point titers) or with the rasp- -adjuvanted rbd vaccine (~ -fold decrease; g - end point titers) when it was compared to the administration of rbd + rasp- + alum in a single inoculum (g - , end point titers; figure s b ). rasp- is known as an igg (th )-biased adjuvant [ ] . notably, igg c responses were only elevated when rasp- was also administered as an adjuvant (g - , g - , and g - end point titers), with the combinatorial adjuvanted vaccines performing similarly and best (g and g ; figure s c ). these data suggest that when rasp- is adsorbed to alum in a vaccine formulation (g ), the two adjuvants work in synergy not only to elicit a stronger igg response than the alum-adjuvanted vaccine formulation, but also for inducing the igg c antibody response as compared to alum-adjuvanted vaccine formulation alone (~ -fold increase), suggesting that the combination of rasp- and alum in a vaccine with the rbd antigen works in synergy to elicit a more balanced igg -igg c antibody response (igg /igg c ratio of in g vs. in g ; figure s d ). the reduced igg /igg c ratio is more pronounced in a vaccine formulation where rasp- is not adsorbed to alum but co-administered separately (igg /igg c ratio of in g ; figure s d ). 
sera samples from day post- nd immunization were also tested for their ability to inhibit the binding of mers-rbd-fc protein to the hdpp receptor-expressing huh- cells by flow cytometry. similarly to the enhanced induction of neutralizing antibodies, immunization with rasp- and the alum-adjuvanted rbd vaccine in separate injection sites also increased the ability of the generated antibodies to inhibit the binding of the rbd protein to its receptor by~ -fold (g - %; figure ) as compared to rbd + rasp- + alum adjuvanted vaccine in a single inoculum or with alum-adjuvanted rbd vaccine (g - % and g - %; figure ), and by > -fold as compared to the rasp- -adjuvanted rbd vaccine (g - %; figure ). when group g was compared to groups g and g , a synergy of the combinatorial adjuvants, though when rasp- is not adsorbed to alum, was evident. however, when rasp- was adsorbed to alum (g ), the functionality of the antibodies elicited by the vaccine is much reduced. the inhibitory activity in group g was reduced by~ %, albeit not significantly, as compared to g . (table and x-axis legend). sera samples were collected on day post- nd immunization and assayed for inhibition of the binding of mers-cov rbd-fc to huh- cells expressing mers-cov receptor dpp . the data represents the mean and standard error (sem) of percentage inhibition of binding from at least two independent experiments with to mice per group. "+" indicates the presence and "−" indicates the absence of the protein or adjuvants in the formulation. statistics was performed using one-way anova with tukey's multiple comparison. p < . : *, p < . : ****. nd: not detectable. the draining lymph nodes (ln) were harvested from each leg days post- nd immunization and analyzed for the number of monocytes as well as their activation and migratory status. migration of innate cells from the site of injection to the lns are required to initiate an effective adaptive immune response [ ] . 
the analyses demonstrated that there was no significant difference in the total number of monocytes (cd + cd c -ly c + ; figures s a and s a ) recruited into the ln per mouse between the various immunization groups. however, the number of activated monocytes (c + cd c -ly c + cd + ) in the ln were significantly higher in mice where rasp- and the alum-adjuvanted rbd vaccine were co-administered in separate sites than in mice immunized with rbd + rasp- + alum in a single inoculum, or with alum-adjuvanted rbd vaccine (g vs. g and g , respectively; figure b) . notably, the number of activated monocytes in the ln of mice that received only the rasp- -adjuvanted rbd vaccine (g ) were similar to those present in mice where rasp- was administered without being adsorbed to alum (g ). it appears that alum does not add to the activation of the monocytes in any of the formulations tested (g , g or g ; figure b ). nevertheless, a synergistic effect was observed when rasp- was added to the rbd vaccine with or without alum with respect to ccr + (migratory) monocyte subset. the number of migratory monocytes (cd + cd c -ly c + ccr + ) in the ln were significantly higher in mice where rasp- and the alum-adjuvanted rbd vaccine were co-administered in separate sites, rbd + rasp- + alum was administered in a single inoculum or the rasp- -adjuvanted rbd vaccine vs. the alum-adjuvanted rbd vaccine (g , g and g vs. g ; figure c ). the number of migratory monocytes in the ln of mice that received the alum-adjuvanted rbd vaccine were actually similar to those in the control group that received only rbd (g vs. g ). as observed with the number of activated monocytes, migratory monocytes in the lns of mice that received rasp- and the alum-adjuvanted rbd vaccine co-administered in separate sites were significantly higher in mice than in mice received rbd + rasp- + alum administered in a single inoculum (g vs g ; figure c ). 
to investigate the possible contribution of tfh and gc b cells to the robust functional antibody responses induced in mice administered with rasp- and the alum-adjuvanted rbd vaccine in separate injection sites, ln from days post- nd immunization were analyzed for the tfh ( figure a ) and gc b ( figure a ) cell frequencies. although the frequency of total cd + t cells within the ln was not significantly different between the various immunization groups (figure s b ), the frequency of the tfh (cd + cxcr + pd- + ) cells within the ln of mice administered with rasp- and the alum-adjuvanted rbd vaccine in separate sites (g ) was . -fold higher than in mice administered with the rbd + rasp- + alum vaccine in a single inoculum, and . -and . -fold higher versus rasp- -and alum-adjuvanted rbd vaccines (g vs. g and g , respectively; figure b) .
importantly, the fold increase in the tfh cells was positively and significantly associated with the neutralizing antibody titers against the pseudotyped mers-cov in two immunization groups, namely mice that were immunized with rasp- and the alum-adjuvanted rbd vaccine in separate sites (g ; r = . , p = . ) and mice immunized with rbd + rasp- + alum vaccine in a single inoculum (g ; r = . , p = . ) ( figure c) . notably, the frequency of b cells (b + ) in the ln was also significantly higher in mice administered with rasp- and the alum-adjuvanted rbd vaccine in separate sites than in mice administered with rbd + rasp- + alum in a single inoculum, or with the alum-adjuvanted rbd vaccine or with the rbd vaccine alone (g - % vs. g - %, g - %, and g - % respectively; figure b ). additionally, the frequency of b cells in mice immunized with rasp- -adjuvanted rbd was also significantly higher as compared to mice immunized with rbd + rasp- + alum in a single inoculum (g - % vs. g - % respectively; figure b ). formulating the vaccine with rasp- in an aqueous formulation (not adsorbed to alum) might have been critical for the increased number of b cells in these two vaccine formulations (g and g ). when the frequency of gc b cells (b + cd + gl- + ) in the draining ln ( figure c ) were analyzed, it appeared that immunization of mice with rasp- and the alum-adjuvanted rbd vaccine in separate sites (g ) induced~ -fold increase in the frequency of the gc b cells versus immunization with rbd + rasp- + alum in a single inoculum and the alum-adjuvanted rbd vaccine (g and g ), and~ -fold increase in gc b cells was induced versus immunization with the rasp- -adjuvanted rbd vaccine (g ; figure c ). interestingly, the alum-adjuvanted vaccines (g and g ) had > -fold increase in gc b cells in the ln compared to rasp- -adjuvanted rbd vaccine (g ; figure c ). 
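The associations between Tfh or GC B cell fold increases and neutralizing antibody titers reported in this section are described in the figure legend as Spearman correlations. As a minimal sketch of how such a coefficient is computed (this is not the authors' analysis code), the Python example below ranks two paired series, using average ranks for ties, and takes the Pearson correlation of the ranks. The per-mouse values are hypothetical placeholders, since individual data points are not given in the text, and no p value is computed.

```python
import math


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their rank positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the value at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant series")
    return sxy / math.sqrt(sxx * syy)


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the average ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical per-mouse values (illustrative only, not data from the study).
tfh_fold_increase = [1.2, 2.5, 1.9, 3.1, 2.2]
neutralizing_titer = [400, 1600, 800, 3200, 1600]
rho = spearman(tfh_fold_increase, neutralizing_titer)
print(f"Spearman r = {rho:.3f}")
```

Because the coefficient depends only on ranks, it is insensitive to the wide dynamic range of titers, which is one reason a rank-based measure is a common choice for this kind of comparison.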
importantly, only when rasp- and the alum-adjuvanted rbd vaccine were co-administered in separate sites (g ) was the fold increase in the gc b cells positively and significantly associated with the neutralizing antibody titers against the pseudotyped mers-cov (r = . , p = . ). no significant association was observed when the rbd + rasp- + alum vaccine was administered (g ) as a single inoculum (r = − . , p = . ; figure d).

"−" indicates the absence of the protein or adjuvants in the formulation. statistical analysis was performed using one-way anova with tukey's multiple comparison test. p < . : *, p < . : ***. spearman correlation was performed to determine the association of tfh cells with neutralizing antibody titers.

adjuvants are essential components in both prophylactic and therapeutic vaccines since they enhance antigen-specific protective immune responses [ ]. however, choosing an appropriate adjuvant that can be employed safely and that enhances vaccine efficacy remains difficult and must first be optimized experimentally [ ]. besides a handful of adjuvants such as cpg, poly i:c, mpla and mf , aluminum-based (alum) adjuvants are used in most adjuvanted vaccines for humans globally even today, since their introduction years ago [ ]. although beneficial effects of alum as an adjuvant were observed with the dtap, hepb, and hepa vaccines, a biased th -type immune response, absence of strong cellular responses, and the induction of adverse reactions were some of the limitations found with the various alum-adjuvanted vaccines [ ]. therefore, the utilization of a combination of adjuvants in vaccines that can improve the safety and efficacy of vaccines against emerging pathogens is being actively pursued by the research community [ ].
the use of combinatorial adjuvant system is beneficial since they can be tailored to target varied pattern-recognition receptors (prrs) with each being able to enhance antigen-specific responses (cellular and humoral) in a complementary or synergistic outcome [ ] . for instance, intranasal vaccination with emulsified fine particles like pelc in combination with ld-indolicidin enhanced protective influenza-specific serological immunity in mice [ ] . mpl and cpg combination adjuvants promoted homologous and heterosubtypic cross protection when used with the inactivated split influenza virus vaccine [ ] . the co-administration of alum and a tlr- adjuvant enhanced memory b cell response to lymphocytic choriomeningitis virus (lcmv) antigen [ ] . alum in combination with mpla-ha-adjuvanted hbsag increased both the magnitude and the persistence of hbsag-specific immune responses against hepatitis b virus infection [ ] . the aim of the present study was to explore the synergistic potential of combining the o. volvulus-derived protein adjuvant, rasp- with alum as a novel combinatorial adjuvant system using mers-rbd-fd as the model vaccine antigen. we have previously shown that rasp- enhances the immune response when co-administered in an aqueous formulation with several bystander vaccine antigens [ ] [ ] [ ] . moreover, we have also reported that rasp- -adjuvanted trivalent influenza vaccine (iiv ) elicits a balanced igg /igg c response to iiv and protects mice following h n virus challenge, potentially via myd -independent tlr signaling [ , ] . in this study, we have shown that mice immunized with rbd + rasp- + alum in a single inoculum elicited neutralizing antibody titers against pseudotyped mers-cov that were not significantly different from mice that received either rasp- -adjuvanted rbd vaccine or alum-adjuvanted rbd vaccine alone ( figure ) . 
notably, when rasp- and the alum-adjuvanted rbd vaccine were co-administered in separate sites, the vaccine increased the neutralizing antibody titers by ~ -fold as compared to the combinatorial adjuvant system administered in a single inoculum (figure ). we also observed that mice that received two immunizations, at three-week intervals, of the combinatorial adjuvant system where rasp- and the alum-adjuvanted rbd vaccine were co-administered separately elicited neutralizing antibody titers against pseudotyped mers-cov infection that were similar to, or slightly lower than, those elicited in mice that received three immunizations of the montanide isa -adjuvanted, or two immunizations of the mf -adjuvanted, mers cov-rbd-fc or mers-rbd-fd vaccines [ ]. another noteworthy observation is that the heightened neutralizing antibody titers induced by the unique experimental combinatorial adjuvant system were achieved using µg of the mers-rbd-fd vaccine protein, compared to µg of mers-rbd-fc or mers-rbd-fd proteins used in previous studies [ ]. this suggests that the rasp- in this unique combinatorial adjuvant system also enabled rbd dose sparing with two immunizations. we have previously reported that rasp- also facilitates iiv antigen dose sparing, up to a - or -fold decrease, and that with a single immunization of the rasp- -adjuvanted iiv , mice were still protected from a lethal h n influenza virus challenge [ ]. importantly, the antibodies elicited by the different combinatorial rbd-adjuvanted vaccines were also functional in their ability to inhibit the binding of mers-rbd to the human dpp receptor. mice that received rasp- and the alum-adjuvanted rbd vaccine separately inhibited the binding by ~ -fold more as compared to mice administered with rbd + rasp- + alum in a single inoculum (figure ). although mice that received alum-adjuvanted rbd vaccine significantly inhibited binding ( % ± . ) compared to rasp- -adjuvanted rbd vaccine, they were not significantly different compared to the combinatorial adjuvant system administered in a single inoculum (figure ). the inhibition of binding, however, was only enhanced in the combinatorial adjuvant system where rasp- and alum-adjuvanted rbd vaccine are co-administered separately. these data collectively suggest that adsorption of rasp- to alum in a combinatorial adjuvant system does not enhance the functional antibody responses elicited by alum-adjuvanted rbd vaccine, whereas rasp- that is not adsorbed to alum in a combinatorial adjuvant system was able to enhance the functional antibody responses.

an important concern raised when anti-viral vaccines are developed, especially with the ongoing covid- crisis, is that some vaccine approaches may induce unwanted adverse side effects due to antibody dependent enhancement (ade) and thus more severe pathology [ ]. ade is generally correlated with neutralizing antibody titers, and studies have also shown that a high neutralizing antibody titer may eliminate the potential induction of ade [ , ]. in our study, we show that immunization of mice with rasp- and the alum-adjuvanted mers-cov rbd vaccine in separate sites induced nt neutralizing antibody titers greater than : , against pseudotyped mers-cov infection. we expect that such high-titer neutralizing antibodies may prevent mers-cov infection in vivo without causing any adverse effects. however, this will have to be proven experimentally in the future. in this study, we also observed that immunization with the alum-adjuvanted rbd vaccine elicits an rbd-specific igg -biased response, while the rasp- -adjuvanted rbd vaccine elicits a balanced rbd-specific igg -igg c response (figure s b,c).
notably, in the combinatorial adjuvant system where rasp- and the alum-adjuvanted rbd vaccine are co-administered separately the balanced igg /igg c response ( figure s d ) was preserved, while mice that received the combinatorial adjuvant system in a single inoculum elicited an igg -biased response ( figure s b,d) . these results suggest that the presence of rasp- in an aqueous formulation shifts the dominant igg response elicited by the alum-adjuvanted rbd vaccine to a balanced igg -igg c response and this was more pronounced when rasp- was not adsorbed to alum. interestingly, the administration of the combinatorial adjuvant system showed differences in the ly c + activated monocyte but not in cd c + activated dc subsets ( figure s ). this may likely be due to the presence of rasp- in the vaccine, since we have previously shown that intra-muscular injection rasp- alone or the rasp- -adjuvanted trivalent influenza (iiv ) vaccine elicited an increased recruitment of monocytes than dcs at the site of injection ( h after injection) as compared to pbs control group or iiv alone [ ] . in the present study, the administration of the combinatorial adjuvant system where rasp- was completely adsorbed to alum (single inoculum immunization group) significantly reduced the number of cd + (activated) monocytes and ccr + (migratory) monocytes to the draining ln compared to the administration of the combinatorial adjuvant system where rasp- was not adsorbed to alum ( figure b,c) . interestingly, the number of cd + monocytes in the ln was similar whether rasp- + alum + rbd were administered as a single inoculum or as a co-administered vaccine in two separate sites. 
however, both of these vaccine formulations as well as the rasp- adjuvanted-mers-rbd vaccine resulted in significantly higher number of recruited cd + monocytes than the alum-adjuvanted rbd vaccine, suggesting that the presence of alum did not significantly alter the number of cd + monocytes recruited by the combinatorial rasp- and alum adjuvanted-mers-rbd vaccines ( figure s b ). moreover, there was a -fold increase in the number of the activated monocytes and migratory monocytes recruited to the draining ln in mice that received the rasp- -adjuvanted rbd vaccine when compared to the alum-adjuvanted rbd vaccine ( figure b ,c). the number of migratory monocytes doubled in the draining ln of mice immunized with the combinatorial adjuvant system where rbd + rasp- + alum was administered in a single inoculum as compared to the alum-adjuvanted rbd vaccine alone. the number of migratory monocytes further increased in mice that received the combinatorial adjuvant system where rasp- and the alum-adjuvanted rbd vaccine were co-administered separately ( figure c ). there were no significant differences observed in the number of activated and migratory dcs across all the immunization groups. collectively, these data suggest that the rasp- in the combinatorial adjuvant system may play a significant role in the enhanced recruitment of monocyte subsets. this is supported with the data where the administration of the rasp- -adjuvanted rbd vaccine also significantly increased the number of activated (cd + ) and migratory (ccr + ) monocytes in the draining ln compared to alum-adjuvanted rbd vaccine ( figure b,c) . also, rasp- and alum may work in synergy to improve the number of migratory monocytes in the draining ln compared to what alum could do alone. 
one of the important events in the generation of an adaptive cellular response is the effective migration of innate cells to the lymph nodes to encounter naïve t cells, a process in which ccr , a chemokine receptor, is known to play a dominant role [ ] . in addition, the absence of ccr has been shown to affect the magnitude of protective responses against viral infections in mouse models [ , ] . therefore, we suggest that rasp- , when not adsorbed to alum in a vaccine formulation, may improve the effective recruitment of innate cells that lead to the induction of effector adaptive cellular responses. tfh cells can determine humoral immunity that is also derived from gc b cells, and therefore both of these cell types have become an important aspect for rational designs of more effective vaccines, in particular those depending on functional antibodies for their efficacy [ , ] . to better understand what contributed to the improved elicitation of functional anti-mers-cov neutralizing antibodies, the frequencies of tfh (cd + cxcr + pd- + ) cells and gc b (b + cd + gl- + ) cells in the draining ln of immunized mice were analyzed. a two-fold increase in both tfh and gc b frequencies were induced when rasp- and the alum-adjuvanted rbd vaccine were co-administered in separate sites as compared to the combinatorial adjuvant system where rbd + rasp- + alum were administered in a single inoculum (figures b and c ). while no significant difference was observed in the fold increase of the frequency of gc b cells in the ln of mice that were immunized with rasp- and the alum-adjuvanted rbd vaccine co-administered separately as compared to the alum-adjuvanted rbd vaccine, a six-fold increase was observed when this was compared to rasp- -adjuvanted rbd vaccine alone ( figure c ). 
these data suggest that the complete adsorption of rasp- to alum diminished not only the ability to induce migratory monocytes, but also the development of cells that are important for mounting an effective humoral response. importantly, we found a significant and positive correlation between the neutralizing antibody titers in sera of mice vaccinated with rasp- and the alum-adjuvanted rbd vaccine separately and the fold increase in the frequency of tfh and gc b cells recruited in the draining ln (figures c and d). interestingly, the fold increase in the frequency of tfh cells was also significantly and positively associated with the titers of neutralizing antibodies in mice that were immunized with the combinatorial adjuvant system administered in a single inoculum (rbd + rasp- + alum; figure b), suggesting that rasp- and alum may work in synergy.

our study demonstrates that a unique combination of the helminth-derived protein adjuvant rasp- with alum, using mers-rbd-fd as the model vaccine antigen, enhanced the protective immune responses to mers-cov, although the adjuvants had to be co-administered separately (so that rasp- was not adsorbed to alum). also, for the first time, we were able to determine that the tfh and gc b cells in the lns of mice immunized with the combinatorial adjuvanted mers-rbd vaccine were significantly and positively associated with neutralizing antibodies, the essential functional protective immune response to mers-cov. further studies will be necessary, however, to elucidate the precise underlying mechanisms of this unique adjuvant combination of rasp- and alum. in our study, it appeared that the adsorption of rasp- to alum reduced the immunopotentiating activities of either rasp- or alum.
as the potency of rasp- is highest when it is in an aqueous formulation, a better understanding of the targets of multiple immune pathways that are induced may also help us utilize the rasp- protein adjuvant in combination with other prr agonists that can be used in aqueous formulations as adjuvants in novel combinatorial formulations. such combinatorial adjuvants may be more advantageous with subunit vaccine models that generally are known to induce suboptimal protective immune responses alone and/or induce vaccine enhanced disease (ved) when used with the alum adjuvant [ ] .

the following are available online at http://www.mdpi.com/ - x/ / / /s , figure s : induction of mers-cov-rbd specific igg subtypes in sera of immunized mice, figure s : representative flow cytometry plot determining the gating strategy of the immune cells recruited to the draining lymph nodes (lns) of immunized mice, figure s : number of monocyte and dc subsets recruited into the lymph nodes (lns) of immunized mice, figure s : frequency of cd + t cells in the lymph nodes (lns) of immunized mice.

funding: this research was funded by nih grants u ai and r ai .

this article is an open access article distributed under the terms and conditions of the creative commons attribution (cc by) license

we gratefully acknowledge kathy tang, head of the lars facility at nybc for providing animal and veterinary care. the authors also thank mihaela barbu-stevanovic, head of the flowcytometry core facility at nybc. the authors also acknowledge maria elena bottazzi and bin zhan from baylor college of medicine, texas children's hospital center for vaccine development, houston, texas for the production of the recombinant ov-asp- protein.

the authors declare no conflict of interest. the funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

key: cord- -x s l pp authors: yang, jinsung; petitjean, simon j. l.; koehler, melanie; zhang, qingrong; dumitru, andra c.; chen, wenzhang; derclaye, sylvie; vincent, stéphane p.; soumillion, patrice; alsteens, david title: molecular interaction and inhibition of sars-cov- binding to the ace receptor date: - - journal: nat commun doi: . /s - - - sha: doc_id: cord_uid: x s l pp

study of the interactions established between the viral glycoproteins and their host receptors is of critical importance for a better understanding of virus entry into cells. the novel coronavirus sars-cov- entry into host cells is mediated by its spike glycoprotein (s-glycoprotein), and the angiotensin-converting enzyme (ace ) has been identified as a cellular receptor. here, we use atomic force microscopy to investigate the mechanisms by which the s-glycoprotein binds to the ace receptor.
we demonstrate, both on model surfaces and on living cells, that the receptor binding domain (rbd) serves as the binding interface within the s-glycoprotein with the ace receptor and extract the kinetic and thermodynamic properties of this binding pocket. altogether, these results provide a picture of the established interaction on living cells. finally, we test several binding inhibitor peptides targeting the virus early attachment stages, offering new perspectives in the treatment of the sars-cov- infection.

in december , a novel coronavirus (cov) was determined to be responsible for an outbreak of potentially fatal atypical pneumonia, ultimately defined as coronavirus disease- , in wuhan, china. this novel cov, termed severe acute respiratory syndrome (sars)-cov- , was found to share similarities with the sars-cov that was responsible for the sars pandemic that occurred in . the resulting outbreak of covid- has emerged as a severe pandemic. the genome of sars-cov- shares about % identity with that of sars-cov and is about % identical to the bat coronavirus batcov ratg (ref. ). cov entry into host cells is mediated by its transmembrane spike (s) glycoprotein that forms homotrimers protruding from the viral surface (fig. a). the s glycoprotein comprises two functional subunits responsible either for binding to the host cell receptor (s subunit including the receptor-binding domain (rbd)) or for fusion of the viral and cellular membranes (s subunit). recent studies claimed that the angiotensin-converting enzyme (ace ), previously identified as the cellular receptor for sars-cov, also acts as a receptor of the new coronavirus (sars-cov- ) (fig. b). in the case of sars-cov, the s glycoprotein on the virion surface mediates receptor recognition (fig. c) and membrane fusion.
recently, the high-resolution cryo-electron microscopy structure obtained on the full-length human ace in the presence of the rbd of the s glycoprotein of sars-cov- suggests simultaneous binding of two s-glycoprotein trimers to an ace dimer. the s subunit is further cleaved by host proteases located immediately upstream of the fusion peptide, leading to the activation of the glycoprotein that undergoes extensive irreversible conformational changes facilitating the membrane fusion process. altogether, the information obtained so far highlights the fact that cov entry into susceptible cells is a complex process that requires the concerted action of receptor binding and proteolytic activation of the s glycoprotein at the host cell surface to finally promote virus-cell membrane fusion. however, so far, direct evidence about the dynamics of the binding of the s -to the ace receptor at the single-molecule level is missing. here, we analyze the biophysical properties of the sars-cov- s-glycoprotein binding, on model surfaces and on living cells, to ace receptors using force-distance (fd) curve-based atomic force microscopy (fd-curve-based afm) (fig. c). we extract the kinetics and thermodynamics of the interactions established in vitro, and compare the binding properties of both the s subunit and rbd. next, we test short ace -derived peptides targeting the viral s glycoprotein as potent binding inhibitor peptides and observe a significant reduction in the binding properties.

fig. probing sars-cov- binding to the ace host receptor. a schematic of a sars-cov- particle, an enveloped ssrna virus expressing at its surface the spike glycoprotein (s) that mediates the binding to host cells. b structural studies have previously obtained a complex between the receptor-binding domain (rbd, a subunit of the s glycoprotein) and the angiotensin-converting enzyme (ace ) receptor. c schematic of probing sars-cov- binding using atomic force microscopy (afm). the initial attachment of sars-cov- to cells involves specific binding between the viral s glycoprotein and the cellular receptor, ace . the interactions are monitored by afm on model surfaces, where the ace receptor is attached to a surface and the s subunit or the rbd onto the afm tip, and on living cells expressing or not fluorescently labeled ace .

results

s subunit specifically binds to purified ace receptors. as sars-cov- binding to ace receptors is thought to play a key role in the first binding step at the cellular membrane, we used fd-curve-based afm to evaluate at the single-molecule level the binding strength of the interaction established between the glycosylated s subunit and ace receptors on model surfaces (fig. a). to mimic cell-surface receptors in vitro, ace receptors were covalently immobilized onto gold surfaces coated with oh- and cooh-terminated alkanethiols using carbodiimide conjugation (see methods). these model surfaces were imaged by afm, and the thickness of the grafted layer was validated by a scratching experiment, revealing a deposited layer of . ± . nm (mean ± s.d., n = ) (see methods and supplementary fig. ). to study the interaction between the s subunit and the immobilized ace receptors, we covalently grafted either the purified full s subunit or rbd only to the free end of a long polyethylene glycol (peg) spacer attached to the afm tip [ ] [ ] [ ]. to investigate the properties of the binding complex, force-distance (fd) curves were recorded by repeatedly approaching and withdrawing the s subunit or rbd-functionalized tip from the ace model surface (fig. a, b). specific adhesion events were observed on - % of the retraction fd curves at rupture distances > nm, which corresponds to the extension of the peg linker (fig. c and supplementary fig. ), and is in line with studies carried out for other virus-cell-surface receptor systems.
to confirm the specificity of these interactions, we conducted additional independent control experiments using (i) an afm tip only functionalized with the peg linker or (ii) toward oh-/coohterminated alkanethiol surfaces missing the receptor. the binding frequency observed during those control experiments is significantly lower, thereby confirming the specificity of the s subunit/rbd-ace complexes under our experimental conditions (fig. c) . exploring the dynamics of s subunit-ace interaction. single-molecule force-probing techniques, such as fd-based afm, measure the strength of a bond under an externally applied force, enabling to get insights into the binding free-energy landscape. according to the bell-evans model , , an external force stressing a bond reduces the activation-energy barrier toward dissociation and, hence, reduces the lifetime of the ligandreceptor pair (fig. d) . the model also predicts that far-fromequilibrium, the binding strength of the ligand-receptor bond is proportional to the logarithm of the loading rate (lr), which describes the force applied on the bond over time. to investigate the kinetics of the probed complex, fd curves were recorded at various retraction rates and contact times ( fig. e-h) . dynamic force spectroscopy (dfs) plots were obtained for both s subunit (fig. e) and rbd (fig. f) x u = . ± . nm rbd fig. probing s-glycoprotein binding to the ace host receptor on model surface. a binding of s-glycoprotein subunit (s or rbd) is probed on an ace -coated surface. b retraction part of four force-distance curves showing either nonadhesive or specific adhesive curves. c box plot of specific binding probabilities (bp) measured by afm between the functionalized tip (s , rbd, or peg) and the grafted surface (ace or oh-/ cooh-terminated alkanethiol (bare surface)). one data point belongs to the bp from one map acquired at µm/s retraction speed. 
the square in the box indicates mean, the colored box indicates the th and th percentiles, and the whiskers indicate the highest and the lowest values of the results. the line in the box indicates median. n = (s , rbd), (peg), and (s , rbd vs. bare surface) maps examined over (s , rbd), (peg), and (s , rbd vs. bare surface) independent experiments. d bell-evans model describing a virus-receptor bond as a two-state model. the bound state is separated from the unbound state by a single energy barrier located at distance x u . k off and k on represent the dissociation and association rate, respectively. e, f dynamic force spectroscopy (dfs) plot showing the distribution of the rupture forces as a function of their loading rate (lr) measured either between the s subunit and the ace receptor (n = data points) (e) or between the rbd and the ace receptor (n = data points) (f). the error bar indicates s.d. of the mean value for a single interaction ( - pn). the solid line represents the fit of the data with the bell-evans fit. experiments were reproduced at least four times with independent tips and samples. g, h the bp is plotted as a function of the contact time for s subunit and rbd on ace model surfaces, and data points were fitted using a least-squares fit of a monoexponential growth. one data point belongs to the bp from one map acquired at µm/s retraction speed for the different contact times. experiments were reproduced three times with independent tips and samples. p values were determined by two-sample t test in origin. the error bar indicates s.d. of the mean value. source data are provided as a source data file. virus-receptor bonds , , , , . to determine whether single-or multiple-bond rupture between s /rbd and ace is taking place, bond strengths (every single gray data point in fig. 
e , f) were analyzed through distinct discrete ranges of lrs, plotted as force histograms and further fitted with multipeak gaussian distribution, as established previously , (supplementary figs. and ) . using this distribution, we are able to determine the most probable unbinding force of each force peak (maximum of rupture force distribution; black dots plotted over mean lr of this range in fig. e , f), and can determine if single or multiple interactions were taking place. the presence of multiple parallel unbinding events is first observed in the distribution of rupture forces with the presence of multiple gaussian fits. the histograms show that most probably only single interactions were taking place; thus, the bell-evans model was used to fit the data enabling to interpret the binding complex as a simple two-state model, in which the bound state is separated from the unbound state by a single energy barrier (fig. d) . from the slope of the fit, we estimated the length scale of the energy barrier (x u ). we obtained very close values, x u = . ± . nm and . ± . nm for both the s subunit and rbd, showing that we are probing similar bonds (fig. e , f). the kinetic off-rate (k off ) or dissociation rate is obtained from the intercept of the fit (at lr = ) yielding k off values of . ± . s − and . ± . s − for s subunit and rbd, respectively. these values are in good agreement with reported values obtained by surface plasmon resonance for the s glycoprotein (k off = . s − ) and the rbd subunit (k off = . s − ) binding to ace receptors . assuming that the receptor-bond complex can be approximated by a pseudo-first-order kinetics, we also estimated the kinetic on-rate (k on ) from our single-molecule force spectroscopy experiments (fig. g, h) . 
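The Bell-Evans analysis described above (force proportional to the logarithm of the loading rate, x u from the slope and k off from the intercept) can be sketched as follows. This is a minimal illustration, not the authors' Origin workflow: the function names, the assumed thermal energy of 4.11 pN·nm (about 298 K) and the synthetic input values are all assumptions made for the example.

```python
import numpy as np

KBT_PN_NM = 4.11  # assumed thermal energy at ~298 K, in pN*nm


def bell_evans_force(lr, x_u, k_off, kbt=KBT_PN_NM):
    """Most probable rupture force (pN) at loading rate lr (pN/s).

    x_u is the distance to the energy barrier (nm), k_off the off-rate (1/s).
    """
    lr = np.asarray(lr, dtype=float)
    return (kbt / x_u) * np.log(lr * x_u / (k_off * kbt))


def fit_bell_evans(lr, force, kbt=KBT_PN_NM):
    """Least-squares line of force against ln(lr); returns (x_u, k_off)."""
    slope, intercept = np.polyfit(np.log(lr), force, 1)
    x_u = kbt / slope                            # slope = kBT / x_u
    k_off = np.exp(-intercept / slope) / slope   # from the intercept term
    return x_u, k_off


# Synthetic, noise-free data with arbitrary illustrative parameters
# (hypothetical values, not measurements from this study).
loading_rates = np.logspace(2, 5, 20)            # pN/s
forces = bell_evans_force(loading_rates, x_u=0.8, k_off=0.01)
x_u_fit, k_off_fit = fit_bell_evans(loading_rates, forces)
```

On real data the most probable force of each loading-rate bin (the maximum of the fitted gaussian peak) would replace the synthetic `forces` array, and a weighted or nonlinear fit may be preferable when the bins have unequal scatter.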
this association rate is extracted from the binding probability (bp) measured at various contact times, and depends on the effective concentration described as the number of binding partners (ligand + receptor) within an effective volume v eff accessible under free-equilibrium interaction. v eff can be approximated by a half-sphere with a radius including the linker, the viral glycoprotein (s subunit or rbd) and the ace receptor. for both the s subunit and rbd, we observed that the binding frequency increased exponentially with contact time, and we extracted an interaction time of~ . ms, leading to a k on of . × m − s − and . × m − s − , respectively. finally, the dissociation constant k d is calculated as the ratio between the k off and the k on , yielding values around nm for both complexes. this value corresponds to a highaffinity interaction, confirming the specificity of the complexes established by sars-cov- with the ace cell-surface receptor, which in turn results in a long lifetime of the virus attachment to the cell surface. other interaction studies between sars-cov ( % sequence homology to sars-cov- ) and ace reported specific, high-affinity association values also in the nm range . for comparison, a variety of examples for low-as well as highaffinity interactions between other virus-receptor pairs are summarized in dimitrov et al. and include influenza a-sa (mm) or hiv- -cd (nm) interactions. for single-molecule interactions, the bond lifetime τ can be directly related to the inverse kinetic off-rate (τ = k off − ), resulting here in a τ of ms for the s subunit and ms for the rbd, respectively. of course, at the virion level, the overall bond lifetime will increase with the multivalence of the interaction. by definition, highaffinity interaction has a long lifetime as the dissociation constant k d is defined as the ratio between k off and k on . 
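The chain of estimates in this paragraph (interaction time from the contact-time dependence of the bp, k on from the effective concentration in a half-sphere, k d as k off / k on, and bond lifetime as 1 / k off) can be written out as a short sketch. The function names, the default of one binding partner and the numerical inputs are assumptions chosen for illustration; they are not values reported in the study.

```python
import numpy as np

N_A = 6.02214076e23  # Avogadro constant, 1/mol


def binding_probability(t, a, t0, tau):
    """Monoexponential growth of the bp with contact time t (s).

    a is the maximum bp, t0 the lag time and tau the interaction time.
    """
    dt = np.clip(np.asarray(t, dtype=float) - t0, 0.0, None)
    return a * (1.0 - np.exp(-dt / tau))


def effective_concentration(r_eff_nm, n_b=1):
    """Molar concentration of n_b partners inside a half-sphere of radius r_eff."""
    r_dm = r_eff_nm * 1e-8                            # 1 nm = 1e-8 dm, so dm^3 = litres
    v_eff_litre = 0.5 * (4.0 / 3.0) * np.pi * r_dm**3
    return n_b / (N_A * v_eff_litre)


def kinetics(tau_s, k_off, r_eff_nm, n_b=1):
    """Return (k_on in 1/(M*s), k_d in M, bond lifetime in s)."""
    c_eff = effective_concentration(r_eff_nm, n_b)
    k_on = 1.0 / (tau_s * c_eff)   # pseudo-first-order approximation
    k_d = k_off / k_on
    lifetime = 1.0 / k_off
    return k_on, k_d, lifetime


# Hypothetical inputs: tau = 0.25 s, k_off = 0.01 1/s, r_eff = 10 nm.
k_on, k_d, lifetime = kinetics(tau_s=0.25, k_off=0.01, r_eff_nm=10.0)
```

In practice tau would come from fitting `binding_probability` to the measured bp at each hold time, and r_eff would be set from the combined length of the linker, the viral glycoprotein and the receptor.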
for high-affinity interactions, the k d is in the nm range, leading to k off « k on and therefore maintaining the interaction in its bond state for very long times, making the development of anti-binding molecules targeting this interaction more difficult. finally, we also used optical biolayer interferometry (bli) to confirm the kinetic parameters characterizing this interaction, and obtained very close affinities in the same nm range as afm experiments ( supplementary fig. ). taken together, our in vitro experiments confirm that sars-cov- binding to the ace receptors is mediated by the rbd-ace interface as our experimental conditions did not highlight any significant difference between s subunit and rbd binding. validation of the interaction on living cells. next, we wanted to investigate whether the interaction probed on isolated receptors is also established in physiologically relevant condition. to this end, we performed binding assays on living a cells (human adenocarcinoma alveolar basal epithelial cells). while this cell line is widely used as a type ii pulmonary epithelial cell model, it has been shown recently that those cells are incompatible with sars-cov- infection . interestingly, ace expression positively correlated with the differentiation state of epithelia. although undifferentiated cells (cultured at low confluency) only express little ace , overexpression of ace in undifferentiated a cells facilitated virus entry . we transiently transfected ace -egfp in a cells (a -ace ) and probed s -subunit binding to those cells as well as to a cells (serving as internal control) ( fig. a and supplementary fig. ). confocal images showed ace -egfp receptors homogeneously distributed in small domains at the surface of a cells (fig. b ). guided by fluorescence (fig. c) , we chose areas in which both cell types, i.e., transfected (a -ace , green fluorescence) and nontransfected (a , no fluorescence) cells, were in proximity to one another. 
having both a cell types in one image area served as a direct control to evaluate whether interactions measured by the functionalized tip were indeed due to specific binding to fluorescent ace -egfp receptors, and to evaluate the extent of other types of interactions (fig. c-e) . in such area, we simultaneously recorded a height image (fig. d ) and the corresponding adhesion map (fig. e) , which were reconstructed from fd curves recorded for each topographic pixel. the retraction part of fd curves showed specific adhesion events mainly on a -ace cells, with a significantly higher bp (fig. f ), as exemplified with the presented adhesion map that shows . % of adhesive pixels on the a -ace cell versus . % on the control cells ( fig. e and supplementary fig. ). specific binding forces (and corresponding lr) were extracted from force vs. time curves recorded on a -ace cells (fig. g) and overlaid on the dfs plot obtained on purified ace receptors (fig. h ). to explore a wide range of lr, we probed the interaction at various frequencies and amplitudes (see methods). we observed a very good alignment between the data obtained on purified receptors and on living cells confirming the physiological relevance of our results obtained on model surfaces. s subunit binding to the cell involves other receptors. our fdbased afm experiments performed on living cells put in evidence that the s subunit interacts even on control cells with a frequency ≈ % although the expression level of ace should be very low as the cells are not differentiated. nevertheless, some evidence pointed out that human cov s glycoproteins possess sialic acid (sa)-binding sites and in particular to -o-acetylsialogycans , and that integrins could also be a receptor for the sars-cov- (ref. ), which possesses a rgd motif close to the ace -binding site. 
to evaluate whether these other receptors could be involved during the early binding steps to the cell surface, we performed additional experiments in which we injected -o-acetyl-sialoglycans to block interaction with cell-surface sa, or added cyclo-rgd (crgd) to compete with the interactions with integrins. after sa injection, the binding frequency was reduced to ~ % on a cells and to ~ % on ace -transfected cells (fig. f and supplementary fig. ). for integrins, injection of crgd reduced the binding frequency by only ~ - % on both cell types, which is in good agreement with the fact that integrins are mostly expressed on the bottom of the cell. altogether, these data obtained on cells by afm represent to date the best evidence that the s -ace complex is established in physiologically relevant conditions, and they underline the complex situation in which multiple cell-surface receptors account for the whole interaction.

inhibition of s -subunit binding using ace -derived peptides. human recombinant soluble ace (hrsace ) is currently being considered for treatment of covid- (refs. , ). however, ace is involved in many key cellular processes, such as blood-pressure regulation and other cardiovascular functions. therefore, hrsace treatment could lead to dysregulation of those vital processes and subsequently cause deleterious side effects for treated patients. to avoid any interference with ace homeostasis, we wanted to test whether small ace -derived peptides can also interfere with sars-cov- binding by blocking binding sites on the s glycoprotein. to this end, we synthesized four different peptides (sequences provided in supplementary fig. ), which were selected to mimic the regions of ace that interact with the s subunit as determined by the crystal structure, and we tested their binding inhibition properties using our single-molecule force spectroscopy approach (fig. a, b).
we first measured the bp between the s subunit and the ace in the absence of peptide ( µm), with a contact time of ms, as reference, and then injected our ace-derived peptides at three different concentrations ( , , and µm). for the four peptides, we observed a progressive reduction of the bp as a function of the concentration confirming a specific inhibition. in addition, for each peptide, we noticed a reduction of > % of the probed interactions already for the - µm concentration, suggesting a % inhibitory concentration (ic ) in the µm range. the peptide shows the highest inhibition of the s -ace complex formation with a measured reduction in the bp of~ %. the peptide shows a similar inhibition potential (~ %), suggesting that the additional amino acids do not influence the overall affinity of the peptide for the s subunit, as also confirmed by molecular dynamics (md) simulations showing that although the peptide - is longer, less h bonds are established between the peptide and the rbd domain ( supplementary fig. ). overall, these results are in good agreement with the structural insights because these peptides are derived from the n-terminal helix of the ace and therefore form with the rbd interface an important network of hydrophilic interactions (including nine hydrogen bonds and a salt bridge). within the ace -rbd complex, the [ − ] fragment is also part of a "hot binding spot" that results in our test by a good score with a reduction of % of the initial specific bp. finally, the [ - -g- - ] peptide was also synthetized and tested based on the fact that in the crystal structure, the distance between s and l is close enough to be filled by a single amino acid. a glycine residue was added between the two fragments because the two ace fragments have opposite directionality, and glycine has a high propensity to form reverse turns. nevertheless, under our experimental conditions, we did not notice any strong improvement in the binding inhibition. 
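The inhibition figures quoted above are relative reductions of the bp against the peptide-free reference map. A minimal sketch of that arithmetic is given below, assuming the bp is the percentage of force curves in a map that show a specific binding event; the function names and the example numbers are hypothetical and are not data from the study.

```python
def bp_percent(binding_flags):
    """Binding probability of one map: percentage of curves with a binding event."""
    flags = list(binding_flags)
    if not flags:
        raise ValueError("at least one force curve is required")
    return 100.0 * sum(1 for f in flags if f) / len(flags)


def bp_reduction_percent(bp_reference, bp_with_peptide):
    """Relative reduction of the bp (in %) compared with the peptide-free map."""
    if bp_reference <= 0:
        raise ValueError("reference bp must be positive")
    return 100.0 * (bp_reference - bp_with_peptide) / bp_reference


# Hypothetical example: a reference map with bp = 20 % and a map recorded
# after peptide injection with bp = 5 % correspond to a 75 % reduction.
reduction = bp_reduction_percent(20.0, 5.0)
```

Repeating this for each peptide concentration gives the dose-response points from which an inhibitory concentration could be estimated.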
altogether, our in vitro assays at the single-molecule level provide direct evidence that ace derived peptides are strong candidates to potentially inhibit sars-cov- binding to ace receptors (fig. c) . finally, we tested whether the [ − ]-binding inhibition peptide could also prevent s -subunit binding in the cellular context (fig. ) . the interaction between the s subunit and the confluent layer of a coculture of a and a -ace cells was probed before and after addition of the peptide at µm. before injection, cells overexpressing the ace receptors (a -ace ) show higher bp ( . ± . % vs. . ± . %, for a and a -ace , respectively) (mean ± s.d., n = ) (fig. a-d) , in good agreement with our previous observation (fig. f) . after injection of the [ − ] ace -derived peptide, we observed a significant decrease of the bp on both cell types (fig. e, f) . in particular, the bp on a -ace cells significantly drops (~ %), reaching a level close to the one of the control cells. taking into account that undifferentiated a cells express little ace and are poorly infected by cov , this result supports the biological relevance of our ace -derived peptide acting as potential inhibitor capable of efficiently blocking sars-cov- binding. in conclusion, we investigated the interaction established between the sars-cov- s glycoprotein and the ace receptor using single-molecule force spectroscopy. we demonstrated a specific binding mechanism between the s subunit and the ace receptor. by comparing the binding of the s subunit and the rbd toward the ace receptor, our experiment evidenced that both domains interact with the same kinetic and thermodynamic properties toward the ace receptor, highlighting that sars-cov- binding to ace is dominated by the rbd/ace interface. 
our measurements show that under our physiologically relevant conditions, the rbd binds the ace receptor with an intrinsic high affinity (~ nm), which could even be further stabilized at the whole-virus level, thanks to possible multivalent bonds between the s-glycoprotein trimer and ace dimer. based on the available crystal structures of the molecular complex, we examined how several ace -derived peptide fragments could interfere with the s -ace complex formation. while all tested peptides show binding inhibition properties, peptides mimicking the n-terminal helix of the ace receptor show the best results. both and peptides exhibit an anti-binding activity with ic in the µm range, resulting in a > % decrease in the bp observed by afm on purified receptor and > % on living cells. on the cellular model, we observed that the bp drops to the level of the control cells (undifferentiated a cells) that are poorly infected by cov . therefore, those peptides appear as strong therapeutic candidates against the sars-cov- infection. cell culture and transfection. a cells (atcc® ccl- ) were grown in ham's f- nutrient mix with % fetal bovine serum, penicillin ( u ml − ), and streptomycin ( µg ml − ) (gibco) at °c in a humidified atmosphere with % co . pcdna . (+) ace -egfp was transfected using lipofectamine ltx (invitrogen) according to the manufacturer's protocol. in brief, μg of fig. probing s-glycoprotein binding to the ace host receptor on living cells. a binding of s-glycoprotein subunit (s ) is probed on a and a -ace cells. b confocal microscopy (z stack) of a -ace -egfp (green) cell transduced with plasma membrane bfp (blue). c overlay of egfp and dic images of a mixed culture of a and a -ace -egfp cells. d, e force-distance (fd)-based afm topography image (d) and the corresponding adhesion map (e) in the specified area in (c). the frequency of adhesion events is indicated. 
f box plot of the binding probability between s and a cells (gray) or a -ace cells (green) without and after injection of cyclic rgd (crgd, checked boxes) or sialic acid (sa, dashed boxes), respectively. the square in the box indicates mean, the colored box indicates the th and th percentiles, and the whiskers indicate the highest and the lowest values of the results. the line in the box indicates median. g force versus time curves showing either a nonadhesive curve (bottom) or specific adhesive curves acquired at different lrs (lr -lr ). h dfs plot showing the distribution or the rupture forces measured either between the s subunit and the ace on model surfaces (black dots, extracted from fig. e) , and between the s subunit and ace -overexpressing a cells acquired at three different lrs (blue and red dots) (n = ). blue dots belong to a data set acquired in fast-force volume mode, with a retraction velocity of µm s − (lr ). red dots belong to data sets acquired in peak force tapping mode with . khz peak force frequency and -nm amplitude (lr ) or at . khz and nm (lr ), respectively. the error bar indicates s.d. of the mean value. histograms of force distribution on a -ace cells for lr -lr are shown on the side. for experiments without injection of crgd or sa, data are representative of at least n = cells from n = independent experiments. the data for blocking experiments with crgd or sa were acquired for at least n = cells from n = independent experiments. p values were determined by twosample t test in origin. source data are provided as a source data file. pcdna . (+) ace -egfp was transfected to a cells ( -mm plate) using μl of lipofectamine ltx and μl of plus reagent (invitrogen). functionalization of afm tips. pfqnm-lc and msct-d cantilevers (bruker) were used to probe the interaction between s subunit (genscript, #u fc ) or rbd protein (genscript, #u fc ) and ace protein (sino biological, -c h). 
nhs-peg -ph-aldehyde linkers were used to functionalize afm tips as previously described . briefly, the cantilevers were immersed in chloroform for min and further cleaned in a uv radiation and ozone (uv-o) cleaner (jetlight), and immersed overnight in an ethanolamine solution ( . g of ethanolamine in . ml of dmso). they were washed with dmso and ethanol three times, respectively. ethanolamine-coated cantilevers were immersed in nhs-peg -ph-aldehyde solution ( . mg of it was diluted in . ml of chloroform and μl of triethylamine) and finally washed times with chloroform and dried with nitrogen. for afm tips functionalized with s -subunit protein, µl of s -subunit protein solution ( . mg/ml) was put onto the cantilevers placed on parafilm (bemis na) and µl of fresh nacnbh solution ( wt% vol- in . m naoh(aq)) was mixed in the protein solution. the cantilevers were incubated in the solution for h on ice. then, µl of m ethanolamine solution was carefully added to the protein solution and incubated min to quench the reaction and finally washed three times with pbs. for afm tips derivatized with the rbd protein, µl of a µm trisnitrilotriacetic amine trifluoroacetate (toronto research chemicals, canada) (tris-nta) solution was put onto them placed on parafilm, and µl of fresh nacnbh solution was mixed in the protein solution. they were incubated in the solution for h on ice. then, µl of m ethanolamine solution in the protein solution was added and incubated for min. the mixture of µl of rbd solution ( . mg ml − ) and . µl of mm nicl were put onto them and they were incubated for h. after incubation, they were washed in pbs solution three times. preparation of ace -coated model surfaces. ace protein (sino biological, -c h) was immobilized using nhs-edc chemistry. 
gold-coated surfaces were first rinsed with ethanol, dried with a gentle stream of nitrogen gas, cleaned for min by uv and ozone treatment (jetlight), and incubated overnight in an alkanethiol solution ( % -mercapto- -undecanol mm (sigma aldrich) and % -mercaptohexadecanoic acid mm (sigma aldrich) in ethanol). the chemically activated samples were rinsed with ethanol, dried with nitrogen gas, and immersed for min in the solution of mg of chemically activated dimethylaminopropyl carbodiimide (sigma aldrich) and mg of n-hydroxysuccinimide in ml of milliq water. finally, the surfaces were rinsed with milliq water, incubated with ace protein ( . µg µl − in pbs) on parafilm (bemis na), and washed in pbs. fd-based afm on model surfaces. fd-based afm on model surfaces was performed in pbs at room temperature using functionalized msct-d probes (bruker, nominal spring constant of . n/m and actual spring constants calculated using thermal tune) . a bioscope resolve afm (bruker) operated in the force-volume (contact) mode (nanoscope software v . ) was used. areas of × µm were scanned, ramp size set to nm, and set point force of pn, with a resolution of × pixels and a line frequency of hz. dfs analysis (using a constant approach speed of µm/s and variable retraction speeds of . , . , , , , and µm/s) and kinetic on-rate estimation (measuring the bp for different hold times of , , , , , , and ms) were performed. regarding dfs experiments, data including lrs and disruption forces were extracted using nanoscope analysis (v . , bruker). origin software (originlab) was used to display the results in dfs plots to fit histograms of rupture force distributions for distinct lr ranges, and to apply various force spectroscopy models, as described , . for kinetic on-rate analysis, the bp (fraction of curves showing binding events) was determined at a certain hold time (t) (the time the tip is in contact with the surface). those data were fitted and k d calculated as described previously . 
in brief, the relationship between interaction time (τ) and bp is described by the following equation:

$$\mathrm{BP} = A\left(1 - \exp\left(-\frac{t - t_{0}}{\tau}\right)\right)$$

where $A$ is the maximum bp and $t_{0}$ the lag time. origin software is used to fit the data and extract τ. in the next step, k on was calculated by the following equation, with $r_{\mathrm{eff}}$ the radius of the sphere, $n_{b}$ the number of binding partners, and $N_{\mathrm{A}}$ the avogadro constant:

$$k_{\mathrm{on}} = \frac{\tfrac{1}{2}\,V_{\mathrm{eff}}\,N_{\mathrm{A}}}{n_{b}\,\tau}, \qquad V_{\mathrm{eff}} = \frac{4}{3}\pi r_{\mathrm{eff}}^{3}$$

the effective volume $V_{\mathrm{eff}}$ represents the volume in which the interaction can take place. the factor of one half results in a half-sphere, since only half of the s molecules can interact with their corresponding receptor on the substrate.

anti-binding effects of ace -derived peptides on s -subunit binding. a efficiency of blocking peptides is evaluated by measuring the binding probability of the interaction between the s subunit and ace receptor on model surface before and after incubation of the functionalized afm tip with the four different peptides at increasing concentration ( - µm). b histograms, with the corresponding data points overlaid in dark gray, showing the binding probability without peptide ( µm) and upon incubation with , , or µm of ace -derived peptides ( , , [ - -

peptides and competition-binding assays. to assess the influence of peptides on the s -subunit-ace interaction, binding probabilities were measured before and after tip incubation with , , and µm of peptide. briefly, a first map was recorded as described above (i.e., force-volume mode, µm/s approach and retraction speed, ramp size of nm, an applied force of pn, resolution of × pixels, line frequency of hz, and hold time of ms), then the peptide at the appropriate concentration was injected, and a new map was recorded. all four peptides, including [ - -g- - ], were synthesized by genscript (hong kong). those peptides are designed according to the sequence of the ace receptor in complex with the rbd domain of the s glycoprotein.

fd-based afm and fluorescence microscopy on living cells.
an afm (bioscope resolve, bruker) coupled to a confocal microscope (zeiss lsm- ) was used to acquire correlative images. the afm was equipped with a -µm piezoelectric scanner. the afm and the microscope were equipped with a cell-culture chamber allowing maintaining the temperature ( ± °c). to keep cells alive, the humidified ( ± % relative humidity) synthetic air ( % n and % o ) was supplemented with % co and filled continuously around the cell plate allowing to diffuse into cell-culture media . fluorescence images were recorded using a waterimmersion lens (× , na . , zeiss c-apochromat). pfqnm-lc cantilevers (bruker) were used to record afm images (~ μm ) at imaging forces of~ pn. the cantilevers were oscillated either at . -khz peak force frequency with a nm amplitude, . khz with a -nm amplitude in the peakforce tapping mode, or at μm s − retraction speed in fast-force-volume mode. the sample was scanned using pixels per line ( lines) and a frequency of . hz. to study the involvement of other receptors, cells were treated with either mm of -oacetyl-sialogycans or μm of crgd. afm images and fd curves were analyzed using nanoscope analysis software, origin, gwyddion, and imagej. optical images were analyzed using zen software (zeiss). the fluorescence intensity was measured with zen software (zeiss). the same size of the area was taken on a -ace and a cells. the average intensity of the area was calculated with zen software. the statistical analysis was performed with prism (graphpad). plasma membrane staining. plasma membrane-cfp bacmam . (invitrogen) was used to check the co-localization of ace protein and plasma membrane according to the manufacturer's protocol. in brief, μl of plasma membrane-cfp bacmam . per , cells was added on the cell-culture dish h ( °c) before imaging. z-stack image was recorded by confocal lsm- (zeiss) using a waterimmersion lens (× , na . , zeiss c-apochromat) and -and -nm laser line. affinity measurements using bli. 
affinity between the s subunit or rbd and ace was also investigated by bli, using a blitz® device equipped with aminereactive second-generation (ar g) biosensors (pall fortebio). after hydrating the biosensor for min and performing an initial baseline ( min), the biosensor surface was chemically activated ( min) by a freshly prepared mm edc and mm nhs (in milliq water) solution. then, ace ( . µg µl − in acetate buffer, ph ) was loaded onto the biosensor during min and the reaction quenched with ethanolamine m (ph ). after another baseline step ( min in pbs), binding of s subunit or rbd ( . mg ml − ) was measured for min. finally, the dissociation step ( min) was performed in pbs. data processing and analysis were run using a routine provided by graphpad prism. md simulation between ace peptides and s glycoprotein. the pdb (code: m j) was used to perform a md simulation between ace -derived peptides and the sars-cov- spike protein complex. md simulations were performed utilizing the gromacs package , and carried out using the amber sb-ildn force fields in tip p water . the simulation system consisted of a peptide, a protein, and water (about , molecules) in a cubic box that extended nm from the protein. appropriate amounts of sodium/chlorine ions were added in the system. for starting the simulation, the environment had to be developed as follows. the steepest descent algorithm was performed either up to , steps or by kj mol − nm − . then, the environment of the system changed at k (nvt ensemble) and subsequently at k and bar (npt ensemble). after developing the environment, the particle mesh ewald method was used to calculate the longrange electrostatic interactions. short-range dispersion interactions were described by a lennard-jones potential with the cutoff of nm. after reaching the equilibrium of temperature and pressure, mds were conducted for ns at k and bar. the lincs algorithm was applied to constrain the covalent bonds with hydrogen atoms. 
the time step of the simulations was set to fs. interactions beyond Å were regarded as non-bonded. to determine whether a hydrogen bond exists between a peptide and a protein in the md models, a geometrical criterion was adopted, in which the formation of a hydrogen bond was defined by both atom distance and bond orientation. for a donor d, hydrogen h, and acceptor a in a d-h ··· a configuration, a hydrogen bond was counted when the distance between donor d and acceptor a was shorter than . Å and the angle h-d ··· a was smaller than . °. the hydrogen bonds were counted over - ns of the simulations.

open access this article is licensed under a creative commons attribution . international license, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the creative commons license, and indicate if changes were made. the images or other third party material in this article are included in the article's creative commons license, unless indicated otherwise in a credit line to the material. if material is not included in the article's creative commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. to view a copy of this license, visit http://creativecommons.org/licenses/by/ . /.
references

a pneumonia outbreak associated with a new coronavirus of probable bat origin
structural insights into coronavirus entry
structural basis for the recognition of sars-cov- by full-length human ace
coronavirus spike proteins in viral entry and pathogenesis
proteolytic activation of the sars-coronavirus spike protein: cutting enzymes at the cutting edge of antiviral research
host cell proteases: critical determinants of coronavirus tropism and pathogenesis
imaging g protein-coupled receptors while quantifying their ligand-binding free-energy landscape
glycan-mediated enhancement of reovirus receptor binding
combining confocal and atomic force microscopy to quantify single-virus binding to mammalian cell surfaces
multivalent binding of herpesvirus to living cells is tightly regulated during infection
multiple receptors involved in human rhinovirus attachment to live cells
monitoring early fusion dynamics of human immunodeficiency virus type at single-molecule resolution
models for the specific adhesion of cells to cells
sensitive force technique to probe molecular adhesion and structural linkages at biological interfaces
dynamic strength of molecular adhesion bonds
nanomechanical mapping of first binding steps of a virus to animal cells
influenza virus binds its host cell using multiple dynamic interactions
cryo-em structure of the -ncov spike in the prefusion conformation
structural basis of receptor recognition by sars-cov-
angiotensin-converting enzyme is a functional receptor for the sars coronavirus
virus entry: molecular mechanisms and biomedical applications
isolation and characterization of sars-cov- from the first us covid- patient
ace receptor expression and severe acute respiratory syndrome coronavirus infection depend on differentiation of human airway epithelia
structural basis for human coronavirus attachment to sialic acid receptors
a potential role for integrins in host cell entry by sars-cov-
mechanical forces guiding staphylococcus aureus cellular invasion
angiotensin-converting enzyme (ace ) as a sars-cov- receptor: molecular mechanisms and potential therapeutic target
inhibition of sars-cov- infections in engineered human tissues using clinical-grade soluble human ace
structure of the sars-cov- spike receptor-binding domain bound to the ace receptor
linking of sensor molecules with amino groups to amino-functionalized afm tips
calculation of thermal noise in atomic force microscopy
determination of the kinetic on- and off-rate of single virus-cell interactions
gromacs: a message-passing parallel molecular dynamics implementation
: a package for molecular simulation and trajectory analysis
improved side-chain torsion potentials for the amber ff sb protein force field
comparison of simple potential functions for simulating liquid water
a smooth particle mesh ewald method
lincs: a parallel linear constraint solver for molecular simulation
synthesis of a new series of sialylated homo- and heterovalent glycoclusters by using orthogonal ligations
synthesis of -o-acyl- and -o-acetyl-sialic acids
modeller: generation and refinement of homology-based protein structure models

synthesis of -o-acetyl- -α-o-propargyl-sc. -α-o-propargyl sc was synthesized by the protocol described by dashkan et al. this molecule was selectively acetylated at the -position following the procedure of ogura et al.

reporting summary. further information on research design is available in the nature research reporting summary linked to this article.

the source data underlying figs. c, e-h, f, h, b, c, a and supplementary figs. , , are provided as a source data file. all other relevant data are available from the corresponding authors upon reasonable request. source data are provided with this paper.

received: july ; accepted: august ;

this work was supported by the université catholique de louvain, the foundation louvain, and the fonds national de la recherche scientifique (frs-fnrs).
this project received funding from the european research council under the european union's horizon research and innovation program (grant agreement no. ) and from the fnrs-welbio (grant # cr- s- ). the funders had no role in study design, data collection and analysis, decision to publish, or preparation of the paper. s.p., a.c.d., and d.a. are research fellow, postdoctoral researcher, and research associate at the fnrs, respectively. q.z., w.c., and s.p.v. are grateful to china scholarship council. j.y., s.j.l.p., m.k., a.c.d., and d.a. conceived the project, planned the experiments, and analyzed the data. j.y., s.j.l.p., and s.d. conducted the afm experiments. q.z. performed md simulation and structure predictions. s.p., m.k., and p.s. conducted and analyzed the blitz experiments. w.c. and s.p.v. conceived and synthesized the sa derivative. all authors wrote the paper. the authors declare no competing interests. supplementary information is available for this paper at https://doi.org/ . /s - - - . correspondence and requests for materials should be addressed to d.a. peer review information: nature communications thanks the anonymous reviewers for their contribution to the peer review of this work. peer reviewer reports are available.
key: cord- -ybdkp authors: bruni, margherita; cecatiello, valentina; diaz-basabe, angelica; lattanzi, georgia; mileti, erika; monzani, silvia; pirovano, laura; rizzelli, francesca; visintin, clara; bonizzi, giuseppina; giani, marco; lavitrano, marialuisa; faravelli, silvia; forneris, federico; caprioli, flavio; pelicci, pier giuseppe; natoli, gioacchino; pasqualato, sebastiano; mapelli, marina; facciotti, federica title: persistence of anti-sars-cov- antibodies in non-hospitalized covid- convalescent health care workers date: - - journal: j clin med doi: . /jcm sha: doc_id: cord_uid: ybdkp although antibody response to sars-cov- can be detected early during the infection, several outstanding questions remain to be addressed regarding the magnitude and persistence of antibody titer against different viral proteins and their correlation with the strength of the immune response. an elisa assay has been developed by expressing and purifying the recombinant sars-cov- spike receptor binding domain (rbd), soluble ectodomain (spike), and full length nucleocapsid protein (n). sera from healthcare workers affected by non-severe covid- were longitudinally collected over four weeks, and compared to sera from patients hospitalized in intensive care units (icu) and sars-cov- -negative subjects for the presence of igm, igg and iga antibodies as well as soluble pro-inflammatory mediators in the sera. non-hospitalized subjects showed lower antibody titers and blood pro-inflammatory cytokine profiles as compared to patients in intensive care units (icu), irrespective of the antibodies tested. noteworthy, in non-severe covid- infections, antibody titers against rbd and spike, but not against the n protein, as well as pro-inflammatory cytokines decreased within a month after viral clearance. 
thus, rapid decline in antibody titers and in pro-inflammatory cytokines may be a common feature of non-severe sars-cov- infection, suggesting that antibody-mediated protection against re-infection with sars-cov- is of short duration. these results suggest caution in using serological testing to estimate the prevalence of sars-cov- infection in the general population. the coronavirus disease- (covid- ) is a respiratory illness caused by the severe acute respiratory syndrome coronavirus (sars-cov- ), a novel beta-coronavirus firstly described in wuhan city, china, on december [ ] . sars-cov- spreading has been declared pandemic in mid-march by who [ ] . at present, the virus has infected more than million people worldwide with an associated case mortality rate of % to %, depending on the country [ ] . covid- is associated with a broad range of mild-to-severe symptoms, potentially leading to hospitalization in intensive care units (icu) for the most severe cases. the respiratory tract is initially involved with possible development of severe interstitial pneumonia [ , ] , albeit the gastrointestinal tract can also significantly participate in disease pathogenesis as a consequence of the expression of the ace receptor, that mediates sars-cov- viral entry [ ] , on both alveolar and enteric epithelial cells [ ] . infected subjects manifest a complex clinical pattern appearing as early as two days post exposure and lasting several weeks [ ] . infection with sars-cov- induces a prompt activation of the immune system, finalized to the clearance of infected cells [ ] . innate and adaptive immune cells accumulate at the site of infection, where production of cytokines and inflammatory mediators may result in patient recovery or, in case of ineffective viral clearance, in hyperactivation of the immune system and development of severe complications, such as acute respiratory distress syndrome ards [ , ] . 
overexpression of pro-inflammatory cytokines (i.e., il- beta, il- , il- , il- , tnfα etc.) and impairment of humoral immunity have been described in patients with the most severe form of the disease [ ] . antibodies against sars-cov- proteins are produced as a consequence of the activation of the humoral arm of the immune system. virus-specific igm antibodies are secreted as first class of immunoglobulins, followed by the more specific igg [ ] . among the latter, those specific for the viral spike receptor binding domain (rbd) when expressed at higher titer manifest direct neutralizing activity towards the viral entry into cells, as they prevent effective engagement of surface ace receptors by the spike protein [ , ] . the iga response against sars-cov- has been shown to be rapid and persistent [ , ] and is associated with mucosal responses, including respiratory and gastrointestinal responses. serological testing is a valuable tool to monitor viral spreading throughout the population [ ] . furthermore, serological assays allow the identification of past infection in individuals with viral rna levels undetectable by rt-pcr for epidemiological purposes [ ] . various commercial and in-house assays that utilize distinct viral antigens and detect different antibody classes are currently available. however, sars-cov- serological tests available on the market do not always allow systematic simultaneous detection of a wide antibody spectrum for several antigens in a reliable manner, and this may hamper a proper population testing for clinical or epidemiological purposes [ ] . conversely, serological enzyme-linked immunosorbent assays (elisa) to detect immunoglobulins raised against the viral spike soluble ectodomain (spike) and its highly immunogenic receptor binding domain (rbd), or against the nucleocapsid protein (n), provide promising results in terms of accuracy and reproducibility [ ] . 
recently, these elisa assays have been used to show that neutralizing antibodies (nab) against different viral antigens may decline after - days post symptoms onset, and that the magnitude of nab response may be associated with disease severity in covid- patients [ ] . in order to measure the presence and variation of antibody responses against different viral proteins, we set up and validated an in-house direct elisa assay based on three distinct sars-cov- viral antigens, i.e., eukaryotically-expressed rbd and spike and bacterially-expressed nucleocapsid protein. using this assay, we simultaneously measured igm, igg and iga anti-viral antibodies titers in the sera of covid- patients, as well as levels of pro-inflammatory cytokines. in addition, we longitudinally collected the sera of convalescent healthcare workers who tested positive for sars-cov- by nasopharyngeal (nf) swabs, and were symptomatic but not hospitalized. our data show that humoral immune responses against sars-cov- correlated with disease severity in terms of both antibody titers, persistence over time and serum levels of pro-inflammatory cytokines. notably, % of covid- mildly symptomatic patients halved their anti-rbd igg titers after weeks from viral negativization, thus confirming the short lifespan of humoral immune responses against sars-cov- . health care workers of two different covid hospitals in milan (n = ) with documented covid- infection (by nf swab), not hospitalized but with manifested covid- symptoms (supplementary table s ) were monitored for seroconversion by igm, igg and iga serum levels at two time points after viral clearance between april and june . the study has been conducted in accordance with the standards of good clinical practice, with the ethical principles deriving from the helsinki declaration and the current legislation on observational studies. clearance from the ethical committee has been obtained (ieo ). 
additional study populations were icu hospitalized severe covid- patients (n = ) and (n = ) covid- negative subjects whose sera were collected between april and june . in total, pre-covid subjects enrolled in ieo studies between and were used to calculate the roc curves for the assays. the exclusion criterion was, for all subjects involved in the study, the inability to provide informed consent. the inclusion criteria were, for those not hospitalized with covid- , (i) being health care workers (medical doctors, practitioners, post-graduate students, nurses), potentially exposed to sars-cov- between february and june , (ii) documented sars-cov- infection by nf swab, (iii) not being hospitalized for covid- ; for those hospitalized with covid- : (i) documented sars-cov- infection by nf swab, (ii) being admitted in the icu between february and june for covid- ; for negative controls: (i) sera being collected before . the recombinant spike sars-cov- glycoprotein receptor binding domain (rbd) and the soluble full-length trimeric ectodomain have been produced in mammalian hek f cells as glycosylated proteins by transient transfection with pcaggs vectors generated in prof. krammer's laboratory [ ] . the constructs were synthesized using the genomic sequence of the isolated virus, wuhan-hi- released in january , and contain codons optimized for expression in mammalian cells. briefly, hek f cells were seeded at a final concentration of . million/ml in freestyle medium (thermo fisher scientific, milano, italy), incubated at • c, % co at rpm o/n in an eppendorf new brunswick s i incubator. the day after hek f cells were transfected using µg of dna per × cells and a dna: pei max ratio of : in optimem medium. four hours post-transfection, the medium was supplemented with peptone primatone rl (merck) to a final concentration of . % w/v. cells were then incubated for days, checking cell viability daily if needed (a mortality higher than % is indicative of a toxic protein). 
for protein purification, the culture supernatant was transferred to conical centrifuge tubes, cleared by centrifugation at × g for min and filtered with . µm stericup filters. the filtered medium was supplemented with a : volume of mm nah po ph . , mm nacl and loaded on a hisprep fast flow / column (ge-healthcare) equilibrated in mm nah po , mm nacl. his-tagged protein was eluted with step gradients of - - - mm imidazole. peak fractions were pooled, dialyzed overnight against pbs and concentrated to . mg/ml (spike soluble) or . mg/ml (rbd) in kda-mwco amicon filter units. retrieved proteins were quantified, flash frozen in liquid nitrogen in aliquots and stored at − °c. the his-tagged sars-cov- full length n-protein plasmid (kind gift of david d. ho, md, columbia university, new york, ny, usa) was transformed into e. coli bl plyss cells. protein expression was induced with . mm iptg and carried out at °c overnight. cells were harvested by centrifugation and resuspended in lysis buffer ( mm tris-hcl ph . , mm nacl, mm dtt, % glycerol, mm imidazole, with calbiochem protease inhibitor cocktail iii). all following steps were carried out at °c or using ice-cold buffers. cells were lysed by sonication; the lysate was cleared by centrifugation at , × g for min, then pei (ph . , final concentration . %) was added dropwise under stirring, and the lysate was further cleared by centrifugation at , × g for min. next, ml ni-nta beads per liter of culture, pre-equilibrated in lysis buffer, were added to the cleared lysate and protein binding was continued for h with gentle agitation at °c. beads were washed with at least column volumes of mm tris-hcl ph . , mm nacl, mm dtt, % glycerol, mm imidazole and his-tagged protein was eluted with column volumes of mm tris-hcl ph . , mm nacl, mm dtt, % glycerol, mm imidazole. the eluted fractions containing protein were diluted with heparin buffer ( mm tris-hcl ph . , mm dtt) to reach a final nacl concentration of .
m and were subsequently loaded onto a hi-trap heparin hp column (ge healthcare) equilibrated in mm tris-hcl ph, mm nacl, mm dtt (buffer a). a linear gradient reaching % buffer b ( mm tris-hcl ph, m nacl, mm dtt) in column volumes was applied and fractions containing his-tagged n-protein were pooled, concentrated and loaded onto a superdex / size exclusion chromatography column. fractions containing n-protein were pooled. a l culture yielded . ml of . mg/ml pure n-protein, which was flash frozen in liquid nitrogen in aliquots and stored at − °c. the elisa assay to detect immunoglobulins (ig) uses fragments of the sars-cov- spike glycoprotein (s-protein) and the nucleocapsid (n) as antigens, based on the protocol published in [ , ] . after binding of the proteins to a nunc maxisorp elisa plate and blocking of nonspecific binding with pbs-bsa %, the patients' sera to be analyzed were applied to the plate at a final dilution of : to allow antibody binding, which was revealed with secondary anti-human igg (bd, clone g - ), igm (merck, polyclonal code a ) or iga (biolegend, poly ) antibodies conjugated to hrp. samples were read on a glomax reader at nm. this elisa test is not intended for commercial use and is currently under evaluation at italy's ministry of health (aut.min.rich. . . ) for emergency use approval. the assay has been validated with a cohort of n = covid- subjects (severe, moderate and mild disease) and n = subjects collected in the pre-covid era (between and ). roc curves were computed to determine the sensitivity and specificity of the assay (supplementary figure s ). quantification of soluble biomarkers was performed in sera of patients collected immediately after virus clearance ( consecutive negative nf swabs) and one month after virus clearance using a luminex immunoassay (human cytokine/chemokine/gf procartaplex plex, thermo fisher) with map technology according to the manufacturer's protocol.
samples were acquired on a luminex sd and analyzed with xponent software . . the sera of healthy subjects (n = ) collected between april and june as well as icu covid- patients (n = ) were used as control groups. the categorical variables were described as absolute frequency and percentage. the continuous variables with normal distribution were described as mean ± standard deviation (sd), whereas the continuous variables without normal distribution were given as median and range. normality of continuous variables was checked with the d'agostino-pearson omnibus normality test. the mann-whitney test or student's t-test for continuous variables, and the chi-square or fisher's exact tests for categorical variables, were used to associate clinical variables with the result of the sars-cov- serological test (positive or negative). two-tailed p values lower than . were considered statistically significant. graphpad prism software was used for all statistical analyses. to evaluate the antibody response of individuals infected by sars-cov- , elisa assays were developed in-house by producing and purifying recombinant rbd, spike and nucleocapsid proteins of the sars-cov- virus following the protocols published in [ ] ( figure a) . the performance of these elisa assays was assessed for the different viral antigens and classes of antibodies by determining roc curves using (i) a cohort of sera from covid- patients collected between april and june who tested positive by nasopharyngeal swab, and (ii) pre-covid- sera, collected between and (supplementary table s and figure s ). anti-sars-cov- igg showed the highest specificity and sensitivity, irrespective of the antigen used (supplementary figure s a,b) . anti-rbd igg showed a specificity and sensitivity of % and %, respectively, while the assay performed with the spike ectodomain reached values of . % and % and the one with the n protein values of % and % (supplementary table s and figure s ).
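as an illustration of how sensitivity and specificity follow from a roc analysis of od readings, the sketch below scans candidate od cutoffs and keeps the one maximizing youden's j (sensitivity + specificity − 1). this is a minimal sketch under stated assumptions: the paper computed its roc curves in graphpad prism and does not state how cutoffs were chosen, so the youden criterion, the function names and the od values are all hypothetical.

```python
from typing import Sequence, Tuple


def sens_spec(pos: Sequence[float], neg: Sequence[float], cutoff: float) -> Tuple[float, float]:
    """Sensitivity and specificity when a serum is called positive if OD >= cutoff.

    pos: OD readings of swab-confirmed COVID-19 sera (hypothetical data).
    neg: OD readings of pre-COVID sera (hypothetical data).
    """
    true_pos = sum(1 for od in pos if od >= cutoff)
    true_neg = sum(1 for od in neg if od < cutoff)
    return true_pos / len(pos), true_neg / len(neg)


def best_cutoff(pos: Sequence[float], neg: Sequence[float]) -> float:
    """Return the observed OD value that maximizes Youden's J.

    Assumption: Youden's J is used only for illustration; the source does not
    say which criterion defined the positivity threshold.
    """
    candidates = sorted(set(pos) | set(neg))
    return max(candidates, key=lambda c: sum(sens_spec(pos, neg, c)) - 1.0)


if __name__ == "__main__":
    positives = [0.9, 1.2, 0.25, 1.5]        # made-up OD values
    negatives = [0.1, 0.2, 0.3, 0.15, 0.12]  # made-up OD values
    cutoff = best_cutoff(positives, negatives)
    sensitivity, specificity = sens_spec(positives, negatives, cutoff)
    print(f"cutoff={cutoff} sensitivity={sensitivity:.2f} specificity={specificity:.2f}")
```

each point of a roc curve is one such (1 − specificity, sensitivity) pair, so sweeping the cutoff over all observed od values traces the whole curve.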
these performances are in line with those published for both in-house and commercial assays approved for emergency use by the fda [ , ] . the performance of iga detection was high for the rbd assay ( . % specificity and % sensitivity), while it was slightly lower for the n protein ( % and %) and for the spike ( % and %). the performance of the igm assay was comparatively lower for all the viral proteins tested (supplementary figure s a,b) . the validated elisa assays were then used to systematically test the antibody titers of different classes of sars-cov- specific antibodies in sera from the following groups of patients: (i) severe covid- patients admitted to icus; (ii) health care workers from two hospitals in milan, exposed to the virus between february and march and confirmed positive to sars-cov- rna by rt-qpcr on nasopharyngeal swabs. fifty-eight sars-cov- -negative subjects collected between april and june were used as negative controls (supplementary table s ). sera of the health care workers were collected in the convalescence phase of the disease after two consecutively negative nasopharyngeal swab tests. time between the first detection of the virus and the first negative swab ranged from to days from onset of symptoms to disappearance of viral rna (supplementary table s ). these subjects all manifested clinical symptoms strongly related to sars-cov- infection, including fever, ageusia, anosmia, fatigue, myalgia, diarrhea, coryza and cough [ ] . two of them manifested a more severe disease course with episodes of dyspnea. none of the patients required hospitalization and they all recovered from the disease (supplementary table s ). non-hospitalized covid- subjects manifested a lower antibody titer as compared to severe icu patients for all the tested antibody classes and viral antigens ( figure b-d) . 
this finding is in accordance with what has been published for asymptomatic [ ] and paucisymptomatic [ ] patients, whose antibody titers were detected using commercial elisa or chemiluminescence assays against either the spike or the n-protein. when comparing the presence of the different classes of antibodies, all the covid- positive subjects tested positive for igg antibodies against all the viral antigens tested ( figure e ). interestingly, a few of them were igm negative, or had an antibody concentration close to the detection limit, in the spike and rbd assays but not in the n protein assay. the observation that all of them instead showed n-specific igm antibodies may reflect a genuine persistence of anti-n protein igm or be the consequence of a lower specificity of the n assay, possibly due to the high conservation of the n proteins among beta-coronaviruses other than sars-cov- [ ] . interestingly, % of the non-hospitalized covid- patients did not develop rbd-specific iga, and only out of developed n-specific iga antibodies, a percentage that was instead above % for the hospitalized ones ( figure e ). since severe covid- is associated with a strong release of pro-inflammatory cytokines [ ] , the sera from covid- patients were analyzed for the presence of pro- and anti-inflammatory cytokines, chemokines and growth factors by multidimensional analysis (figure , supplementary figures s and s and supplementary table s ). icu patients, whose sera were collected in the acute phase of the disease, showed a sustained production of pro-inflammatory mediators, among which il- , il- a, il- p , il- beta, il- , il- and il- , all associated with the "cytokine storm" observed in very severe covid- patients, were the most abundantly detected (figure a ). in contrast, even in the early convalescent phase, those cytokines were undetectable in the sera of non-hospitalized covid- patients (figure a) .
interestingly, pro-inflammatory cytokines such as ifn-gamma, tnf, il- , il- , il- and ip- /cxcl- were detected both in the sera of severe icu hospitalized and of non-hospitalized covid- patients ( figure b ). of note, chemokines involved in the recruitment of both monocytes and t cells to inflamed tissues, like mcp /ccl , rantes/ccl , mip alpha/ccl and eotaxin/ccl ( figure c ), were present at comparable concentrations in severe icu hospitalized and in non-hospitalized patients, indicating active recruitment of immune cell populations also in milder forms of covid- . in order to evaluate the kinetics of antibody titers in convalescent non-hospitalized covid- patients, serum ig levels were measured at different time points, i.e., two days (t ) and one month (t ) after the first negative nf swab ( figure a) . interestingly, within a month after negativization of the viral rna, rbd- and spike-specific antibody titers halved in the sera of the vast majority of convalescent covid- patients ( figure b ,c). when tested against the rbd, / , / and / patients showed a decrease in antibody titer ranging from % to % in their virus-specific igm, igg and iga antibody classes ( figure b,e) . similarly, / , / and / patients showed a decrease of at least % in their spike igm, igg and iga antibody titers ( figure c ,e). in both cases antibody titers were still above the od detection threshold. in contrast, antibodies against the viral nucleocapsid protein did not show a significant decrease at the second time point of evaluation ( figure d,e) . interestingly, similarly to the antibody titers, the presence of pro-inflammatory mediators in the sera of convalescent patients also decreased over time and became almost undetectable one month after a negative pcr for viral rna, a finding that mirrors the successful control of the infection and the consequent switching off of the immune response ( figure f, supplementary figure s ).
during the last months many key aspects of the immune response to sars-cov- have been elucidated. however, given the complexity and diversity of the clinical manifestations of covid- disease, several outstanding questions remain to be addressed. here we show that humoral immune responses against sars-cov- correlated with disease severity in terms of antibody titers, their persistence over time and serum levels of pro-inflammatory mediators. moreover, we showed that the vast majority of covid- mildly symptomatic patients analyzed in the study halved their anti-rbd antibody titers weeks after viral negativization, thus confirming the short lifespan of humoral immune responses against sars-cov- . the humoral immune response against sars-cov- proteins leads to the production of antibodies against portions of the viral proteins [ ] [ ] [ ] . in this sense, serological tests, based on the search for specific anti-sars-cov- antibodies, represent a useful tool for identifying patients who contracted the infection and, consequently, comparing the clinical course and eventual complications between the general population and populations at risk, such as health care workers [ ] . importantly, measurable variations in the humoral response might account for a re-activation of the immune system as a consequence of viral re-exposure, both in healthcare workers and in the general population. serological monitoring of antibody levels can thus provide information on the actual circulation of the virus, which can be used by decision makers to adapt safety and restriction measures according to the real presence of the virus within the population. nonetheless, the specificity and sensitivity of the different assays vary greatly among kits, depending on the technique implemented (elisa, clia, lateral flow) and the antigens used (spike ectodomain, s -s of the spike, spike rbd, nucleocapsid).
thus, only highly sensitive tests can detect with high accuracy whether people, including mildly symptomatic or asymptomatic subjects, have specific anti-sars-cov- antibodies in their blood. the test utilized in this study is a robust elisa assay imported from the laboratory of prof. krammer at mount sinai, which has been approved for emergency use by the fda [ , ] . we reproduced its excellent performance in our lab, which allowed us to detect a broad range of antibody levels, spanning from those measured in the blood of severe hospitalized patients to those of non-hospitalized mild covid- + individuals. the elisa assay has been validated with a cohort of more than positive and negative subjects, giving rise to extremely high performance values. specificity and sensitivity of the elisa assays were high for anti-rbd igg and iga ( - %) and slightly lower for igm and the spike and n proteins ( - %). these performances are in line with those published for both in-house and commercial assays [ , ] . for this reason, this test is also currently being evaluated by italy's istituto superiore di sanita' (iss) for emergency use approval. one additional key strength of this assay as compared to other types of serological assays is its flexibility, i.e., the possibility to simultaneously assess different classes of antibodies against a broad panel of sars-cov- antigens within the same assay. thus, this elisa assay gave us a comprehensive understanding of the magnitude and persistence of antibody titers against different viral proteins and their correlation with the strength of the immune response, as measured by the serum levels of pro-inflammatory mediators. the presence of a few false positives among the covid-negative population tested with the viral nucleocapsid protein, as compared to the rbd, might be a consequence of the mistaken detection of anti-n antibodies previously raised against common cold coronaviruses, which cross-react with the sars-cov- nucleocapsid [ ] .
the nucleocapsid protein is the most conserved protein among different coronaviruses. it is possible to speculate that antibodies produced against previous common cold coronaviruses (and cross-reacting with the sars-cov- antigens) might still be present in the sera at high levels, and therefore be detectable. as a consequence, when analyzed longitudinally, we observed that only the antibodies specific to sars-cov- decline, while those that are nonspecific and possibly react to previous coronaviruses remain detectable at the same levels over time. a similar observation was recently published by a large longitudinal study [ ] . moreover, a recent paper evaluated the persistence of anti-n specific antibodies raised against four different common cold coronaviruses in a cohort of hiv+ individuals followed longitudinally for more than years [ ] . the study confirmed that n-specific antibodies undergo fluctuations in their detection levels as a consequence of seasonal re-infections, with kinetics of - months. interestingly, the authors reported that out of patients ( % of the individuals enrolled in the study) showed cross-reactive antibodies against the viral n-proteins of the four viruses, and in one of them these cross-reactive antibodies persisted over the years. the duration of circulating igg antibodies is still unclear and might depend on several factors, including the type and extent of the immune response elicited upon the encounter with the virus [ ] . in this study, non-hospitalized subjects showed lower antibody titers and blood pro-inflammatory cytokine profiles compared to patients in intensive care units (icu), irrespective of the antibodies tested. this finding is in accordance with what has been published for asymptomatic [ ] and paucisymptomatic [ ] patients, whose antibody titers were detected using commercial elisa or chemiluminescence assays against either the spike or the n-protein. anti-rbd iga antibodies showed kinetics similar to those of igg.
the iga response against sars-cov- has been reported to be rapid and persistent [ , ] and possibly associated with the mucosal immune response in the gut and lungs. notably, iga production has been associated with disease severity, suggesting that iga production might occur locally at the mucosal sites, possibly correlating with the viral load, the duration of the viral exposure and the virus entry route [ , ] . consistently, a recent communication [ ] confirmed that the highest levels of igg and iga antibodies against the spike s domain, encompassing the n-terminal half of the protein with the rbd, were associated with severe disease [ , ] . severe hospitalized covid- patients overexpressed pro-inflammatory cytokines (i.e., il- beta, il- , il- , il- , tnfα). in one of the very first reports of the clinical course of covid- patients, as early as march , a serum increase in interleukin (il)- , il- , gmcsf, ip- , mcp , mip -α, and tnf-α was associated with disease severity [ ] . elevated il- levels were detected in hospitalized patients and have been associated with icu admission, respiratory failure, and poor prognosis in several studies [ , , ] . presently, conflicting results regarding il- b and il- have been reported [ ] [ ] [ ] . the elevation of pro-inflammatory cytokines, although widely described in covid- patients, does not presently seem to have prognostic value, because these cytokines do not always differentiate moderate cases from severe cases [ ] . although levels of il- at first assessment might predict respiratory failure [ ] , other publications with longitudinal analyses demonstrated that il- increases fairly late during the course of the disease, consequently compromising its prognostic value at earlier stages [ ] .
moreover, serum concentrations of kl- , a molecule elevated in serum of patients with interstitial lung diseases (ilds), such as idiopathic pulmonary fibrosis and hypersensitivity pneumonitis, was recently proposed to be capable of differentiating between severe and mild covid- patients, being mainly produced by damaged or regenerating alveolar type ii pneumocytes [ , ] . conversely, ip- , mcp- , and il- ra were capable of differentiating between severe and mild covid- patients [ ] . interestingly, mip alpha, il and eotaxin, similarly to the results published by long et al. [ ] , were expressed to a greater extent by healthy subjects compared to covid- patients. human mip alpha and eotaxin were reported to be potent inhibitors of m-tropic hiv- infection, and were therefore considered as potential hiv- inhibitors [ ] . a similar protective mechanism of action might be envisaged in sars-cov- infection. we also observed that during non-severe covid- infections, pro-inflammatory cytokines are produced and correlate with the severity of the disease. similarly to anti-sars-cov- antibodies, pro-inflammatory mediators also decreased within a month after viral clearance, as expected upon the resolution of the disease. overall, we suggest that the decline in antibody titer and pro-inflammatory cytokines is a common characteristic of sars-cov- infection. this study therefore has important implications for the use of serological testing for the monitoring of infection outbreaks against re-infection with sars-cov- . our results indicate that the detection of antibodies with serological assays for epidemiological and monitoring purposes in non-hospitalized seroconverted covid- + subjects, who most likely represent the majority of people who encountered the virus, is only highly reliable within a limited window of time after viral clearance. 
supplementary materials: the following are available online at http://www.mdpi.com/ - / / / /s , figure s : roc curves, figure s : cytokine levels in sera of covid- patients, figure s : sera growth factors concentration, figure s : not significant longitudinal variation of serum cytokines and chemokines in non-hospitalized covid- patients table s : patients' clinical characteristics table s : covid- non-hospitalized patients clinical symptoms table s . luminex analytes. funding: this research was funded by a generous contribution from giuseppe caprotti and the fondazione guido venosta and partially supported by the italian ministry of health with ricerca corrente and × funds; we thank the enthusiastic support of francesco niutta and nicolo' fontana-rava. the funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results. a new coronavirus associated with human respiratory disease in china johns hopkins coronavirus resource center clinical characteristics of hospitalized patients with novel coronavirus-infected pneumonia in clinical features of patients infected with novel coronavirus in angiotensin-converting enzyme (ace ) as a sars-cov- receptor: molecular mechanisms and potential therapeutic target evidence for gastrointestinal infection of sars-cov- the trinity of covid- : immunity, inflammation and intervention risk factors associated with acute respiratory distress syndrome and death in patients with coronavirus disease antibody responses to sars-cov- in patients with covid- a serological assay to detect sars-cov- seroconversion in humans analysis of a sars-cov- -infected individual reveals development of potent neutralizing antibodies with limited somatic mutation iga-ab response to spike glycoprotein of sars-cov- in patients with covid- : a longitudinal study spectrum of innate and adaptive immune response to sars cov infection across 
asymptomatic, mild and severe cases meta-analysis of diagnostic performance of serological tests for sars-cov- antibodies up to temporal profiles of viral load in posterior oropharyngeal saliva samples and serum antibody responses during infection by sars-cov- : an observational cohort study antibody tests in detecting sars-cov- infection: a meta-analysis longitudinal evaluation and decline of antibody responses in sars-cov infection sars-cov- seroconversion in humans: a detailed protocol for a serological assay, antigen production, and test setup medicaldevices/emergency-situations-medical-devices/eua-authorized-serology-test-performance (accessed on covid- in vitro diagnostic devices and test methods database clinical and immunological assessment of asymptomatic sars-cov- infections crystal structure of sars-cov- nucleocapsid protein rna binding domain reveals potential unique drug targeting sites humoral immune response to sars-cov- in iceland seasonal coronavirus protective immunity is short-lasting distinct early iga profile may determine severity of covid- symptoms: an immunological case series detectable serum sars-cov- viral load (rnaaemia) is closely correlated with drastically elevated interleukin (il- ) level in critically ill covid- patients the role of interleukin- in monitoring severe case of coronavirus disease virologic and clinical characteristics for prognosis of severe covid- : a retrospective observational study in wuhan correlation analysis between disease severity and inflammation-related parameters in patients with covid- pneumonia immune cell profiling of covid- patients in the recovery stage by single-cell sequencing plasma ip- and mcp- levels are highly associated with disease severity and predict the progression of covid- elevated levels of interleukin- and crp predict the need for mechanical ventilation in covid- clinical course and risk factors for mortality of adult inpatients with covid- in wuhan, china: a retrospective cohort study 
serum kl- concentrations as a novel biomarker of severe covid- peripheral lymphocyte subset monitoring in covid patients: a prospective italian real-life case series identification of rantes, mip- α, and mip- β as the major hiv-suppressive factors produced by cd +t cells the authors declare no conflict of interest. key: cord- -xo ruswo authors: new, r.r.c.; moore, b.d.; butcher, w.; mahood, r.; lever, m.s.; smither, s.; o'brien, l.; weller, s.a.; bayliss, m.; gibson, l.c.d.; macleod, c.; bogus, m.; harvey, r.; almond, n.; williamson, e.d. title: antibody-mediated protection against mers-cov in the murine model() date: - - journal: vaccine doi: . /j.vaccine. . . sha: doc_id: cord_uid: xo ruswo murine antisera with neutralising activity for the coronavirus causative of middle east respiratory syndrome (mers) were induced by immunisation of balb/c mice with the receptor binding domain (rbd) of the viral spike protein. the murine antisera induced were fully-neutralising in vitro for two separate clinical strains of the mers coronavirus (mers-cov). to test the neutralising capacity of these antisera in vivo, susceptibility to mers-cov was induced in naive recipient balb/c mice by the administration of an adenovirus vector expressing the human dpp receptor (ad -hdpp ) for mers-cov, prior to the passive transfer of the rbd-specific murine antisera to the transduced mice. subsequent challenge of the recipient transduced mice by the intra-nasal route with a clinical isolate of the mers-cov resulted in a significantly reduced viral load in their lungs, compared with transduced mice receiving a negative control antibody. the murine antisera used were derived from mice which had been primed sub-cutaneously with a recombinant fusion of rbd with a human igg fc tag (rbd-fc), adsorbed to calcium phosphate microcrystals and then boosted by the oral route with the same fusion protein in reverse micelles. 
the data gained indicate that this dual-route vaccination with novel formulations of the rbd-fc, induced systemic and mucosal anti-viral immunity with demonstrated in vitro and in vivo neutralisation capacity for clinical strains of mers-cov. the middle east respiratory disease syndrome (mers) first emerged in in saudi arabia [ , ] . since then, there have been an estimated laboratory-confirmed cases with deaths, reported from a total of countries in the eastern mediterranean region and from countries elsewhere [ ] . saudi arabia, however, remains the main focus of infection and a disease outbreak in south korea involving cases was traced back to an index case who had travelled from saudi arabia. whilst the incidence of mers cases in saudi arabia peaked in , there are still a significant number of cases reported from the country and in the period september -may , there were cases including deaths with a case fatality rate of . % [ ] . mers coronavirus (mers-cov) is a member of the betacoronavirus genus [ ] and as for other betacoronaviruses, bats may provide a natural reservoir for the virus [ , ] , but high levels of antibodies to mers-cov in dromedary camels [ ] suggest that the dromedary camel is the principal source for animal-to-human transmission of mers-cov [ ] . however, evidence of human-to-human transmission comes from the reporting of outbreaks in countries remote from saudi arabia such as the uk, europe, usa, and china where small outbreaks have also occurred [ ] . mers-cov is an enveloped, positive-sense, single-stranded rna virus [ ] . the virus possesses an envelope-anchored trimeric spike protein which binds to the human receptor dipeptidyl peptidase (dpp or cd ) and gains host cell entry by the fusion of viral and host membranes [ ] . the spike protein comprises an s sub-unit and a membrane fusion s sub-unit. 
in the coronaviruses, the s sub-units are further divided into n-terminal and c-terminal subdomains and for mers-cov, it is the c-terminal sub-domain that comprises the receptor-binding domain (rbd) [ ] . the rbd also incorporates a receptor-binding motif at its c-terminal and the crystal structures of mers-cov rbd [ ] and of the rbd bound to the extracellular domain of human dpp have been reported [ ] . the rbds of the coronaviruses represent vaccine and therapeutic targets and the rbd of mers-cov as a vaccine antigen has been demonstrated to induce neutralising antibody [ ] and to protect mice transduced with a viral vector expressing hdpp , or nonhuman primates from viral challenge [ ] [ ] [ ] [ ] [ ] . there are significant ongoing efforts to develop vaccines for mers-cov infection, predominantly involving live attenuated viral vectors such as adenovirus, modified vaccinia ankara or measles [ ] to induce anti-viral immunity and some of these vaccines are already in clinical trials. here, we were interested to determine the relative importance of inducing systemic and/or mucosal immunity in vaccination to protect against mers-cov, an infection predominantly of the respiratory tract and lungs. to this end, we have used a dual route immunisation regimen in balb/c mice to induce both systemic and mucosal immunity, to generate rbd-specific murine antisera. initially, we immunised balb/c mice sub-cutaneously (s.c.) with rbd-fc in the mf adjuvant to induce rbd-specific igg. subsequently, we have immunised further groups of balb/c mice by s.c. priming and per oral (p.o.) boosting with the rbd-fc, to induce both systemic igg and mucosal iga responses. 
to do this, we have used novel formulations of rbd-fc coated onto microcrystals formed from histidine or glutamine and also incorporating calcium phosphate for sub-cutaneous priming [ ] , whilst the formulation for oral boosting comprised rbd-fc in reverse micelles dispersed in a self-emulsifying oil phase, which has been optimised from previous formulations [ , ] . the advantages of these formulations are that they are very stable under extremes of temperature [ ] . furthermore, on translation to the clinic, only one injected priming dose would be required, followed by a p.o. booster dose; the latter could be self-administered in capsule form. we have compared the relative abilities of the two sources of antiserum to neutralise clinical isolates of mers-cov in vitro. to do this, we have used two clinical strains of mers-cov (erasmus medical center or emc and london - :), each of which were derived from severely-ill individuals who had contracted the virus in the middle east in . subsequent sequencing of the polymerase gene from these isolates indicated them to be newly-emerged members of the betacoronavirus genus with a close sequence homology and phylogenetic relationship to the bat coronaviruses hku and hku- [ , ] . mice are not naturally susceptible to mers-cov infection, but susceptibility can be induced by the administration of an adenovirus vector which induces expression of the human receptor (hdpp /cd ) for the virus in vivo for a limited time, providing a non-lethal murine model of the disease [ ] . we have used this transduced mouse model to test the capacity of the antiserum derived from the dual route immunisation to neutralise mers-cov in vivo, by passive transfer prior to challenge with the emc strain and we have demonstrated a significant reduction in viral load in lung tissue in transduced mice. the rbd was synthesised and expressed according to methods adapted from du et al. [ ] . 
in brief, a single dna fragment containing an in-frame fusion of the coding sequences for the human il signal peptide, the rbd and human igg -fc was synthesised. this was transferred into the plasmid pef-dest (invitrogen) so that the target sequence was expressed as a secreted protein with a c-terminal human igg fc tag. this construct was transfected by cationic transfection into human embryonic kidney (hek) cells in suspension (fshek) or adherent hek cells stably expressing the sv large t antigen ( ft), using serum-free media and incubated for - days. small scale purifications of rbd-fc were performed using protein a chromatography. for this, medium from the transfected cells was treated with ammonium sulphate to precipitate the protein, prior to dialysis and resuspension in buffers for binding to protein a beads. the beads were washed and the protein was eluted with buffer containing m urea. protein concentration was determined by uv absorbance spectroscopy and purity was estimated by sds-page with coomassie staining and subsequent optical densitometry using a syngene g:box imaging system. the rbd-fc was incorporated onto glutamine calcium phosphate (cap) microcrystals for s.c. immunisation, using methodology adapted from [ ] . briefly, aqueous mixtures of rbd-fc with sodium orthophosphate and glutamine were precipitated as cap protein-coated microcrystals (cap-pcmc), by addition to a fold excess of isopropanol containing dissolved calcium chloride. the resultant suspension contained self-assembled microcrystals comprising a glutamine core with the rbd-fc protein embedded in a thin surface layer of cap (now termed rbd-fc-pcmc). the pcmc were isolated by vacuum filtration and dried to a powder. protein content and integrity were determined by elisa and sds-page. for oral dosing, the rbd-fc was incorporated in mineral oil with added excipients using methodology adapted from [ , ] . 
the oral formulation comprised rbd-fc with the mucosal adjuvant cholera toxin b sub-unit (ctb), retinoic acid (ra), vitamins d and e, trehalose dibehenate (tdb), a synthetic analog of the mycobacterial trehalose dimycolate [ ] , and imiquimod, a tlr / agonist [ ] . specific pathogen-free female balb/c mice ( - weeks of age) were obtained from a commercial breeder and used throughout this study. on receipt, mice were randomised for allocation to cages and given free access to food, water and environmental enrichment. mice were fully acclimatised to the animal housing facility for at least five days prior to any procedure. all animal procedures were performed in accordance with uk legislation as stated in the uk animals (scientific procedures) act . the institutional animal care and use committee approved the relevant project licence. naïve mice were randomised for allocation to a treatment group (typically per group) and immunised in one of two regimens: either a s.c. priming dose followed by two s.c. doses, given at and days after the prime, or a s.c. priming dose followed by an oral or s.c. booster dose days after the prime (table ) . for s.c. immunisation, mice received . µg of rbd-fc-pcmc in . ml pbs injection volume, whereas for all per oral (p.o.) dosing, mice received µg of rbd-fc in a total volume of . ml mineral oil (mo), by oral gavage. where rbd-fc was administered s.c. in the conventional adjuvants mf or alhydrogel, mf (novartis, us) was used in a : ratio by volume with rbd-fc in pbs, whilst alhydrogel (brenntag biosector, denmark) was used in a : ratio with rbd-fc by volume in pbs. at selected intervals after dosing, mice were blood-sampled from the tail vein for assay of specific antibody titre. 
at the end of the immunisation schedule, individual mice were terminally anaesthetised for collection of blood by cardiac puncture, then culled prior to removal of small and large intestines for collection of faecal pellets for extraction of iga. titres of rbd-fc-specific antibody in serum samples were determined by elisa. in brief, test sera were bound to microtitre plates pre-coated with rbd-fc and antibody binding was detected with an hrpo-labelled secondary antibody to mouse igg, igg , igg a or iga (bio-rad). a standard curve for calibration, comprising the relevant murine ig isotype (sigma) captured with an anti-fab reagent, was included on each plate. plates were developed by the addition of , -azino-bis( -ethylbenzothiazoline- -sulfonic acid) diammonium salt (abts) substrate (sigma) and optical density (od) was read at nm (multiskan plate reader). for assay of antibody in faecal samples, faecal pellets were extracted in supplemented pbs as described previously [ ] . in brief, ml of cold pbs was prepared and supplemented with tablet of complete mini protease inhibitor cocktail (sigma) and µl tween. to . g faecal pellets, ml of supplemented pbs was added and left at room temperature for min. samples were vortexed for approximately s, incubated on ice for a further min. and then centrifuged ( , g, min.). supernatants were retained and stored at − °c pending assay. the faecal extracts were assayed for specific igg and iga content, by elisa, as for serum samples. antibody concentrations in all samples were determined from the relevant standard curves using ascent software with four-parameter logistic curve-fitting and reported in ng/ml or µg/ml serum or faecal extract, as appropriate. to determine if the antibody induced by immunisation with rbd-fc was neutralising for mers-cov in vitro, plaque assays were performed. for this, two strains of mers-cov were used: london - (genbank accession number kc . 
) [ ] and erasmus medical center (emc; genbank accession number jx ) ( ) . the london - strain was obtained from the national collection of pathogenic viruses, phe porton, salisbury, uk and the emc strain was kindly provided by the erasmus university medical center rotterdam, the netherlands. both strains were prepared in serum-free media (gibco) at a multiplicity of infection (moi) of . , equivalent to plaque-forming units (pfu). the murine antiserum for testing was prepared at a dilution range from undiluted to : in pbs. virus was incubated overnight ( °c) with murine antiserum, negative control antibody (nibsc, uk) or media, prior to infection of a confluent monolayer of vero e cells (ecacc, salisbury uk) with µl of the mixture. the neutralising ability of the murine antiserum was tested in duplicate or triplicate at each dilution. after incubation ( h, °c), an overlay comprising a : dilution of carboxymethyl cellulose with serum-free media was added to the cells and incubation continued for a further days ( °c) prior to fixing ( . % formaldehyde) and staining ( . % crystal violet) with enumeration of the number of plaques per ml. mice are not naturally susceptible to infection with mers-cov, since they lack the human dpp receptor. to induce transient susceptibility in balb/c mice, we used an ad construct (oxford genetics) to express the human dpp receptor (ad hdpp ), as previously described ( ) . mice were administered the ad hdpp construct ( . × pfu in µl) by the intra-nasal (i.n.) route under light sedation with inhalational isoflurane and then monitored by serial blood sampling for serum levels of hdpp /cd by elisa (thermoscientific). at peak levels of expression of hdpp (days - ), mice were lightly sedated as before and challenged by the i.n. route with mers-cov (emc strain) at pfu in µl per mouse. mice were weighed prior to challenge and on each subsequent day to monitor changes in body weight during infection. 
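the plaque assay read-out described above can be summarised with a short calculation. the following python sketch is illustrative only: it assumes that neutralisation is expressed as the percentage reduction in mean plaque count relative to the virus-only (media) control, which is a common convention but is not stated in the text. the function names and all plaque counts are hypothetical.

```python
def titre_pfu_per_ml(plaque_count, dilution_factor, inoculum_ml):
    """Back-calculate the titre (pfu/ml) of the undiluted sample from one well."""
    if dilution_factor <= 0 or inoculum_ml <= 0:
        raise ValueError("dilution_factor and inoculum_ml must be positive")
    return plaque_count * dilution_factor / inoculum_ml


def percent_reduction(plaques_with_serum, plaques_virus_only):
    """Percentage plaque reduction relative to the virus-only control wells."""
    control_mean = sum(plaques_virus_only) / len(plaques_virus_only)
    if control_mean <= 0:
        raise ValueError("virus-only control wells must contain plaques")
    test_mean = sum(plaques_with_serum) / len(plaques_with_serum)
    return 100.0 * (1.0 - test_mean / control_mean)


if __name__ == "__main__":
    # Hypothetical replicate plaque counts, not data from the study.
    virus_only = [48, 52]
    # Keys are reciprocal serum dilutions (1 = undiluted).
    serum_wells = {1: [0, 0], 10: [0, 0], 100: [10, 15], 1000: [45, 50]}
    for reciprocal_dilution, counts in serum_wells.items():
        reduction = percent_reduction(counts, virus_only)
        print(f"serum 1:{reciprocal_dilution}: {reduction:.1f}% plaque reduction")
```

under this convention, a serum that leaves no plaques at a given dilution scores 100% and would correspond to what the text calls fully neutralising.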
to test the in vivo neutralising capacity of murine antiserum raised to the rbd-fc construct, naïve mice (n = per treatment group) were passively immunised by the i.p. route at h prior to i.n. challenge with mers-cov (emc strain), as described above. the murine antiserum, pooled from mice that had been primed with rbd-fc pcmc and boosted orally (regimen , treatment group ), was diluted : in pbs and delivered in a total volume of µl per mouse. a further group of mice received a purified polyclonal human igg at a single dose level ( µg/mouse in µl i.p.), which had been raised to inactivated mers-cov. control mice received a non-specific human igg at a single dose level ( µg/mouse in µl, i.p.). both sets of human igg (specific and non-specific) were raised in a bovine transchromosomal model and purified prior to use. a further group of negative control mice was included, which received pbs in place of either the ad dpp construct or the mers-cov-specific antibody, and were also challenged i.n. with mers-cov (emc strain) at pfu/mouse. to determine the protection afforded by the passive immunisation, pairs of mice from each treatment group were culled on days - after challenge and their lungs were removed, weighed and then rapidly frozen (− °c) prior to the determination of viral load. pairs of lungs from each of mice per treatment group were individually thawed and homogenised in serum-free media ( ml). rna was extracted from µl of each homogenate using the qiaamp viral rna kit (qiagen), following the manufacturer's instructions. real-time pcr was conducted on duplicate µl the amount of virus in tested samples was determined in duplicate using the standard curve and reported as pfu/g lung tissue. all data were analysed using graphpad prism software v. and expressed as mean ± s.e.m. statistical comparisons were made using one-way anova or unpaired t-test. 
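the conversion from a real-time pcr read-out to pfu/g lung tissue via a standard curve can be sketched as follows. this is a minimal illustration under stated assumptions, not the authors' analysis: it assumes a linear relationship between threshold cycle (ct) and log10 pfu/ml for the standards, and that the interpolated value is scaled by homogenate volume and lung weight. all names and numbers are hypothetical.

```python
def fit_standard_curve(log10_pfu, ct_values):
    """Least-squares fit of Ct = slope * log10(pfu/ml) + intercept."""
    n = len(ct_values)
    mean_x = sum(log10_pfu) / n
    mean_y = sum(ct_values) / n
    sxx = sum((x - mean_x) ** 2 for x in log10_pfu)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(log10_pfu, ct_values))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def ct_to_pfu_per_ml(ct, slope, intercept):
    """Invert the standard curve to interpolate pfu/ml from a Ct value."""
    return 10 ** ((ct - intercept) / slope)


def pfu_per_gram(ct_replicates, slope, intercept, homogenate_ml, lung_g):
    """Average replicate wells, then normalise to the weight of lung tissue."""
    estimates = [ct_to_pfu_per_ml(ct, slope, intercept) for ct in ct_replicates]
    mean_pfu_per_ml = sum(estimates) / len(estimates)
    return mean_pfu_per_ml * homogenate_ml / lung_g


if __name__ == "__main__":
    # Hypothetical ten-fold dilution series of a virus stock of known titre.
    standards_log10 = [1.0, 2.0, 3.0, 4.0, 5.0]
    standards_ct = [35.0, 31.7, 28.4, 25.1, 21.8]
    slope, intercept = fit_standard_curve(standards_log10, standards_ct)
    # Hypothetical duplicate wells from one lung homogenate.
    load = pfu_per_gram([28.2, 28.6], slope, intercept, homogenate_ml=1.0, lung_g=0.2)
    print(f"slope={slope:.2f}, intercept={intercept:.2f}, load={load:.0f} pfu/g")
```

weighing the lungs before freezing, as described above, is what makes the final normalisation to pfu/g possible.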
the rbd-fc protein was expressed in both adherent ft and suspension human embryo kidney (hek) cells, but with greater expression in adherent cells (fig. ) . purification of protein from adherent cells with protein a was very effective, yielding protein which was > % pure, with molecular weight of approximately kda (fig. a) . the use of m urea for elution was optimum, as it was sufficient to solubilise the protein without denaturing it, yielding rbd-fc in optimum yield ( . mg/ml) and predominantly in a dimeric form (fig. b) . this method of protein purification was therefore selected for forward use. rbd-fc, formulated for either sub-cutaneous (s.c.) or per oral (p.o.) immunisation, was tested for immunogenicity and the formulations optimised in an iterative approach. initially, a s.c. dosing regimen was used in which rbd-fc was formulated in either alhydrogel or mf to deliver . lg of protein on each of three occasions at , and days. mice were monitored for days after the final boost and igg titre determined ( fig. a) . at day , the total igg titres achieved with rbd-fc in alhydrogel or mf did not differ significantly. to determine if the presentation of rbd-fc in either alhydrogel or mf influenced the ability to develop virus-neutralising antibody, antisera were selected from mice in each immunisation group and tested in a plaque assay for neutralisation of both the emc and london - strains of mers-cov ( fig. b and c) . all four sera gave some neutralisation of viral activity, although at a : dilution, sera and were most potent, against both viral strains. sera and were derived from the treatment group immunised with mf adjuvanted rbd-fc, whereas sera and were derived from alhydrogel-adjuvanted rbd-fc ( fig. a) . based on this pilot data, we subsequently used mf as the conventional adjuvant for rbd-fc, to compare with some novel formulations. 
having demonstrated proof-of-principle that the rbd-fc, when delivered in mf , can induce a high titre of antibody with neutralising activity, we next investigated how to tailor an rbd-fc vaccine optimally to induce both systemic and mucosal immunity, with the aim also of reducing to a -dose immunisation regimen and increasing functional antibody. for this, we selected novel formulations in which rbd-fc protein was presented as rbd-fc-pcmc for s.c. priming and incorporated into mineral oil (mo) for p.o. boosting. we compared the serum igg response achieved from this -dose dual route immunisation with that induced to rbd-fc delivered in mf in a -dose s.c. regimen (fig. ) . at month after the booster dose, at day , there was no significant difference in the serum igg titres achieved, so that the -dose dual-route immunisation with rbd-fc-pcmc for s.c. priming and incorporated in mo with excipients for p.o. boosting, was just as immunogenic as the -dose s.c. immunisation with rbd-fc in mf (fig. a) . at day , the serum response to rbd-fc in the dual-route regimen was predominantly igg biased, whereas s.c. dosing with rbd-fc in the presence of mf induced both igg and igg a (fig. b) . since dual route immunisation effectively induced serum igg to rbd-fc, it was of interest to determine whether it could also effectively induce mucosal immunity. in this study, the rbd-fc-specific iga response was determined in serum and in faecal pellet extracts from individual animals on day . in this case, the rbd-specific iga responses of mice immunised in the -dose dual route regimen were compared with those of mice immunised by the oral route twice, and with mice immunised by the s.c. route in mf twice, on exactly the same days ( , ) (fig. ) . this comparison showed that s.c. immunisation in mf did not induce serum iga. however, s.c. priming with rbd-fc-pcmc with p.o. 
boosting effectively induced rbd-fc-specific iga and was not inferior to oral priming and boosting in this effect in either serum (fig. a ) or faecal extracts (fig. b ). however, mice primed and boosted orally did not develop rbd-specific systemic igg (data not shown). additionally, day sera from mice in all treatment groups were tested for their ability to neutralise either strain of mers-cov in vitro (table ) . from this it can be seen that sera from out of mice in the dual route regimen were fully neutralising in vitro for both strains (emc and london - ), when tested at : dilution ( fig. a and b) . in order to test whether the in vitro neutralising activity translated into viral neutralisation in vivo, sera from these mice (highlighted in table ) were pooled in equal aliquots at : dilution to enable a subsequent passive transfer study. in order to design the passive transfer study, it was necessary to define the duration of expression of cd in murine lungs in vivo, following induction with the ad hdpp construct. mice dosed with ad hdpp i.n. at t were culled in pairs and lung homogenates prepared and assayed for cd expression. cd in lung tissue was expressed in a time-dependent manner, with levels peaking at day and declining to day (fig. a) , setting a sufficient window to use the model for the determination of the protection against viral challenge afforded by the passive transfer of mers-specific antibody. to determine the protection against infection afforded by the passive transfer of murine antiserum raised in the dual route immunisation regimen, susceptibility to mers-cov was induced at t with i.n. administration of ad hdpp to groups of mice. passive transfer by the i.p. route of the pooled serum sample derived from the mice highlighted in table , which had previously been shown to be neutralising in vitro (table ), was conducted days later and mice were challenged after a further h with mers-cov emc . 
additional groups of mice, which had been transduced with ad hdpp , were passively immunised with a mers-cov-specific human igg and a non-specific human igg. at - days after challenge, pairs of mice were culled for the determination of viral load in lungs, which was determined to peak at days p.i. (data not shown). at days p.i., the pooled murine antiserum significantly reduced viral titres in lungs, to the same extent as the specific human igg, and in contrast to the negative control human igg, demonstrating significant in vivo neutralising activity (fig. b ). fig. . a: expression of cd was induced in lung tissue by the administration of ad hdpp ( . × pfu) to mice by the i.n. route at t . subsequently, mice were culled in pairs on the days shown and their lungs assayed for the expression of cd . the plot shows the time-course of cd expression from to days post-induction. all data points were normalised for background values from control mice. b: content of mers-cov (emc strain) in murine lungs (pfu/g tissue) determined by rt-pcr at day post-infection (equivalent to day after passive transfer with murine antisera to rbd-fc which had previously been shown to neutralise the emc strain in vitro). mice received either a mers-cov-specific human igg ( µg) or non-specific human igg ( µg) in µl/mouse i.p.; or murine antisera to rbd-fc, which had been pooled from murine donors and which was delivered at : dilution ( µl/mouse i.p.). negative control mice received pbs in place of ad hdpp or antiserum. all mice were challenged with mers-cov emc i.n. at pfu/mouse. statistical significance was determined at the p < . level by one-way anova and unpaired t-test. no significant differences in body weight were detected between treatment groups challenged with mers-cov, which was attributed to the short time period of the study. 
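the summary statistics and tests named in the methods and in the figure legend (mean ± s.e.m., unpaired t-test, one-way anova) can be illustrated with a small standard-library sketch. the study used graphpad prism; the code below is only an assumed, simplified equivalent that computes the test statistics and degrees of freedom. it assumes equal variances for the t-test, and obtaining p-values would additionally require the t and f distributions (for example from a statistics package). all values are hypothetical.

```python
import math
from statistics import mean, variance


def mean_sem(values):
    """Return (mean, standard error of the mean) for one treatment group."""
    return mean(values), math.sqrt(variance(values) / len(values))


def unpaired_t(group_a, group_b):
    """Student's unpaired t statistic (pooled variance) and degrees of freedom."""
    na, nb = len(group_a), len(group_b)
    pooled = ((na - 1) * variance(group_a) + (nb - 1) * variance(group_b)) / (na + nb - 2)
    t_stat = (mean(group_a) - mean(group_b)) / math.sqrt(pooled * (1 / na + 1 / nb))
    return t_stat, na + nb - 2


def one_way_anova_f(groups):
    """F statistic with between-group and within-group degrees of freedom."""
    all_values = [v for g in groups for v in g]
    grand_mean = mean(all_values)
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    ss_within = sum((v - mean(g)) ** 2 for g in groups for v in g)
    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    return (ss_between / df_between) / (ss_within / df_within), df_between, df_within


if __name__ == "__main__":
    # Hypothetical log10 pfu/g lung values for three treatment groups.
    antiserum = [3.1, 3.4, 2.9, 3.2]
    specific_igg = [3.0, 3.3, 3.1, 2.8]
    control_igg = [5.2, 5.6, 5.1, 5.4]
    for name, grp in [("antiserum", antiserum), ("specific igg", specific_igg),
                      ("control igg", control_igg)]:
        m, sem = mean_sem(grp)
        print(f"{name}: {m:.2f} ± {sem:.2f}")
    print("t, df:", unpaired_t(antiserum, control_igg))
    print("F, df1, df2:", one_way_anova_f([antiserum, specific_igg, control_igg]))
```

with only two groups the anova f statistic equals the square of the pooled-variance t statistic, which is a convenient check on either calculation.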
mers is a serious endemic respiratory infectious disease for which there is no licensed vaccine, although there are several vaccines currently in clinical trial, including adenovirus-vectored delivery of the spike protein and sub-units [ ] , dna vaccines and nanoparticle-delivered sub-unit approaches [ ] . we were interested in determining the advantage of a vaccine which could induce mucosal as well as specific systemic immunity to the key target, the rbd protein, in order to achieve optimum protective efficacy. vaccination to induce effective immunity at mucosal surfaces should prime the immune system to respond rapidly to invading pathogens such as mers-cov. previous studies have used adenovirus delivery of the mers spike protein with intra-muscular (i.m.) or intra-gastric (i.g.) delivery to induce neutralising systemic igg, but not iga; further, whilst i.m. delivery also induced specific t-cell immunity, i.g. delivery did not [ ] . others have shown that intra-nasal delivery of a live attenuated adenovirus-vectored subunit vaccine does induce specific mucosal as well as systemic immunity, although translation of this approach to the clinic may raise safety issues [ ] . here, we have relied on novel formulations of a sub-unit protein to enable dual route vaccination (parenteral and oral) to induce mucosal as well as systemic immunity. in this study, we have achieved the expression and purification of a recombinant rbd-fc protein in milligram quantities. we have also demonstrated that when formulated as a sub-unit vaccine, the construct induced murine antibody which effectively neutralised two different clinical strains of mers-cov in vitro. additionally, we have shown that these murine antisera, when passively transferred into naïve mice transduced to express the hdpp /cd receptor, conferred protection against viral challenge in the recipients, with significantly reduced viral loads in the lung tissue of the recipient mice. 
whilst the use of conventional adjuvants such as mf or alhydrogel to formulate the rbd-fc protein resulted in high titres of specific igg in serum, the mf formulation did not induce specific iga in serum. in order to promote both systemic and mucosal immunity to rbd-fc, we have formulated this protein for injected priming and p.o. boosting, entailing the optimisation of the cap pcmc and of the reverse micelles in oil emulsion, respectively. this has enabled the achievement of a vaccination regimen comprising only two doses and rapidly inducing rbd-fc-specific systemic and mucosal immunity. whilst the pcmc formulation of rbd-fc was as effective as rbd in mf in inducing a primary igg response, we have shown that an oral formulation of rbd-fc in mineral oil with selected immunostimulants was as effective as mf when used as a booster immunisation. additionally, we have shown that non-invasive oral priming and boosting is as effective at inducing a specific mucosal response, measured as specific iga in serum and faeces, as is injected priming with rbd-fc in the pcmc formulation together with oral boosting, leading to the exciting concept of a potential orally-dosed sub-unit vaccine for mers. whilst both alhydrogel and the combination of pcmc and oral formulations are th -polarising, as evidenced by the predominantly igg titres raised to rbd-fc, the influence of mf on the response to rbd-fc was a mixed th- /th effect, with a significant induction of specific igg and igg a. to counter a viral infection, it would be expected that a th response would be most appropriate. however, the fact that neutralising antibody to rbd-fc was raised under either th or th -polarising influences, suggests that either isotype can be protective and primes the immune system sufficiently and would allow for cross-presentation to occur on subsequent exposure to the virus [ ] . 
in this study, we have not examined the induction of a cell-mediated memory response to the rbd-fc protein, although this will play a significant role in protection against the virus. currently, we are presenting the rbd protein in our formulations with an fc tag, derived from human igg and useful in purifying the protein. the fc tag may contribute additional adjuvantising activity by engaging antigen-presenting cells in the vaccinee [ ] and it may aid mucosal immunity since the fc receptor, an mhc transmembrane protein, is also expressed at mucosal surfaces, e.g. in the respiratory tract [ ]. vaccination of the zoonotic host, the dromedary camel, may also effectively curb outbreaks of mers in endemic regions and limit the risk of viral recombination [ ], and significant progress with an orthopox-vectored vaccine for mers has recently been reported [ ]. the potential use of a sub-unit vaccine for mers in camel vaccination could be aided by varying the sequence of the rbd protein [ ] and substituting the human fc tag with an alternative tag recognised by the camel, to design approaches tailored for animal vaccination, bearing in mind that a single dose vaccine would be ideal in this context. however, future work in our laboratory will also address the value of retaining or removing the fc tag from the rbd protein for clinical or veterinary iterations of the vaccine. in this study we have determined vaccine efficacy by demonstrating the in vitro and in vivo neutralising ability of murine antisera raised in the dual route two-dose regimen against two virulent clinical strains of mers-cov which have greater than % genome sequence homology. in future work, it will be worthwhile to test the efficacy of this approach against other clinical isolates of mers-cov. this is the first report of a dual route dosing regimen applied to a sub-unit vaccine for mers-cov. 
future development of this approach would require the direct testing of efficacy in the immunised transduced mouse model. as well as giving a direct readout of vaccine efficacy, this will enable the identification of the immune correlates of protection, ready for transitioning this candidate vaccine into more extensive pre-clinical testing and clinical development. the authors declare that all the data supporting the findings of this study are available within the paper. ksa mers-cov investigation team. hospital outbreak of middle east respiratory syndrome coronavirus genomic characterization of a newly discovered coronavirus associated with acute respiratory distress syndrome in humans mers-cov origin and animal reservoir novel betacoronavirus in dromedaries of the middle east middle east respiratory syndrome coronavirus (mers-cov): animal to human interaction spiking the mers-coronavirus receptor crystal structure of the receptor-binding domain from newly emerged middle east respiratory syndrome coronavirus structure of mers-cov spike receptor-binding domain complexed with human receptor dpp a truncated receptor-binding domain of mers-cov spike protein potently inhibits mers-cov infection and induces strong neutralizing antibody responses: implication for developing therapeutics and vaccines receptor-binding domain of mers-cov with optimal immunogen dosage and immunization interval protects human transduced mice from mers-cov infection evaluation of candidate vaccine approaches for mers-cov introduction of neutralising immunogenncity index to the rational design of mers coronavirus sub-unit vaccines recombinant receptor binding domain protein induces partial protective immunity in rhesus macaques against middle east respiratory syndrome coronavirus challenge mers-cov spike protein: targets for vaccines and therapeutics toward developing a preventive mers-cov vaccine-report from a workshop organized by the saudi arabia ministry of health and the international 
vaccine institute protein coated microcrystals formulated with model antigens and modified with calcium phosphate exhibit enhanced phagocytosis and immunogenicity a new oil-based antigen delivery formulation for both oral and parenteral vaccination reverse micelle-encapsulated recombinant baculovirus as an oral vaccine against h n infection in mice dual route vaccination for plague with emergency use applications genomic characterization of a newly discovered coronavirus associated with acute respiratory distress syndrome in humans severe respiratory illness caused by a novel coronavirus rapid generation of a mouse model for middle east respiratory syndrome real-time reverse transcription-pcr assay panel for middle east respiratory syndrome coronavirus cutting edge: mincle is essential for recognition and adjuvanticity of the mycobacterial cord factor and its synthetic analog trehalose-dibehenate the tlr agonists imiquimod and gardiquimod improve dc-based immunotherapy for melanoma in mice chadox and mva based vaccine candidates against mers-cov elicit neutralising antibodies and cellular immune responses in mice novel chimeric virus-like particles vaccine displaying mers-cov receptor-binding domain induce specific humoral and cellular immune response in mice systemic and mucosal immunity in mice elicited by a single immunization with human adenovirus type or vector-based vaccines carrying the spike protein of middle east respiratory syndrome coronavirus a tetravalent dengue vaccine based on a complex adenovirus vector provides significant protection in rhesus monkeys against all four serotypes of dengue virus intracellular recycling and cross-presentation by mhc class i molecules fc-fusion proteins: new developments and future perspectives fc-fusion proteins and fcrn: structural insights for longer-lasting and more effective therapeutics co-circulation of three camel coronavirus species and recombination of mers-covs in saudi arabia an orthopoxvirus-based vaccine 
reduces virus excretion after mers-cov infection in dromedary camels recombinant receptor-binding domains of multiple middle east respiratory syndrome coronaviruses (mers-covs) induce cross-neutralizing antibodies against divergent human and camel mers-covs and antibody escape mutants the authors acknowledge with thanks the expert technical assistance of lin eastaugh, louise thompsett and vicky roberts. this work was supported by sbri awards and from innovate uk to rrcn and na, respectively and on independent research commissioned from na and funded by the nihr policy research programme, [ ]. the views expressed in the publication are those of the author(s) and not necessarily those of the nhs, the nihr, the department of health, 'arms' length bodies or other government departments. the authors declare no conflict of interests. supplementary data to this article can be found online at https://doi.org/ . /j.vaccine. . . . key: cord- -kifqgskc authors: lupala, cecylia s.; li, xuanxuan; lei, jian; chen, hong; qi, jianxun; liu, haiguang; su, xiao-dong title: computational simulations reveal the binding dynamics between human ace and the receptor binding domain of sars-cov- spike protein date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: kifqgskc a novel coronavirus (the sars-cov- ) has been identified in january as the causal pathogen for covid- pneumonia, an outbreak started near the end of in wuhan, china. the sars-cov- was found to be closely related to the sars-cov, based on the genomic analysis. the angiotensin converting enzyme protein (ace ) utilized by the sars-cov as a receptor was found to facilitate the infection of sars-cov- as well, initiated by the binding of the spike protein to the human ace . using homology modeling and molecular dynamics (md) simulation methods, we report here the detailed structure of the ace in complex with the receptor binding domain (rbd) of the sars-cov- spike protein. 
the predicted model is highly consistent with the experimentally determined complex structures. plausible binding modes between human ace and the rbd were revealed from all-atom md simulations. the simulation data further revealed critical residues at the complex interface and provided more details about the interactions between the sars-cov- rbd and human ace . two mutants mimicking rat ace were modeled to study the mutation effects on rbd binding to ace . the simulations showed that mutating the n-terminal helix or the k of the human ace alters the binding modes of the cov -rbd to the ace . the outbreak of a new type of severe pneumonia, covid- , which started in december , has spread world-wide, causing over , fatalities and infecting more than , individuals globally. although the earlier infected cases were mainly found in china before march, , particularly in hubei province, confirmed covid- cases had been reported in more than countries or territories by the end of march, , and are still increasing rapidly. one urgent need in coping with this global crisis is to develop or discover drugs that can treat the diseases caused by the novel coronavirus, the sars-cov- (also known as -ncov) . according to the genome comparative studies, the sars-cov- belongs to the genus beta-coronavirus, with nucleotide sequence identity of about % compared to the closest bat coronavirus ratg , about % compared to two other bat sars-like viruses (bat-sl-covzc & bat-sl-covzxc ), and % compared to the sars-cov , . furthermore, the sars-cov- spike protein has a protein sequence identity of % for the receptor binding domain (rbd) with the sars-cov rbd (denoted as sars-rbd in the following). the sars-cov and sars-cov- both utilize the human angiotensin converting enzyme protein (ace ) to initiate the spike protein binding and facilitate the fusion to host cells - . the -residue rbd of the sars-cov spike protein has been found to be sufficient to bind the human ace . 
based on this fact, the rbd of sars-cov- becomes a critical protein target for drug development to treat the covid- . when this study was started, neither the crystal structure of the sars-cov- spike protein nor the rbd segment were determined, so the homology modeling approach was applied to construct the model of the sars-cov- spike rbd in complex with the human ace binding domain (denoted as cov -rbd/ace in the following). similar approach has been applied to predict the complex structure and estimate the binding energies . because of the high sequence similarity between cov -rbd and sars-rbd, the predicted structure was found to be highly consistent with the resolved crystal structures (see another crystal structure at http://nmdc.cn/ncov entry:nmdcs , pdbid: lzg). these structures laid the foundation for the dynamics investigation of the cov -rbd/ace complex using computational simulation method. the predicted cov -rbd/ace model was subjected to all-atom molecular dynamics (md) simulations to study the binding interactions. although the crystal structure and the predicted model of the cov -rbd/ace complex provide important information about the binding interactions at the molecular interfaces, md simulations can extend the knowledge to a dynamics regime in a fully solvated environment. the importance of the ace residues was investigated by simulating the complexes with ace mutants, in which partial dissociation from the ace was observed within ns simulations. the control simulations of the sars-rbd/ace complexes allowed the detailed comparison in receptor binding for the two different types of viruses. the results showed that the wild type cov -rbd/ace complex is stable in ns simulations, especially in the well-defined binding interface. on the other hand, the mutations on the helix- or k of the ace can alter the binding, revealing new binding poses with reduced contacts compared to those in the crystal structures. 
the analysis of the interaction energy showed that the binding is enhanced by adjusting conformations to form more favorable interactions as the simulation progressed, consistent with the increased hydrogen bonding patterns. furthermore, the analysis also showed that sars-rbd and cov -rbd have comparable binding affinities to the ace , with the former slightly stronger than the latter. the dynamic information obtained by this study shall be useful in understanding sars-cov- host interaction and for designing inhibitors to block cov -rbd binding. the computer model of the sars-cov- spike rbd in complex with human ace the spike rbd of sars-cov- (genbank: mn ) comprises cys -gly residues according to the sequence homology analysis with sars-cov spike rbd. the predicted three-dimensional structure model of these residues was obtained with the swiss-model server . this predicted sars-cov- rbd model was subsequently superimposed into the x-ray structure of sars-cov rbd in complex with human ace (pdb code: ajf, chain d ). finally, the computer model of sars-cov- rbd with human ace (cov -rbd/ace ) was obtained for further simulations and analysis. based on the analysis of the predicted model, sequence alignment, and literature survey, two other systems containing mutations in the human ace were prepared and subject to md simulation studies. the mutant construct is based on the fact that rat ace markedly diminishes interactions with sars spike protein , and it was proposed that the rat ace likely has reduced binding affinity to the cov -rbd . to investigate the roles of critical residues on the ace , we created two mutants of the human ace (see table ): ( ) mutant mut_h , with the ace n-terminal (residue - ) mutated to the residues of rat ace ; and ( ) mutant k h, in which the highly conserved k was mutated to histidine (the corresponding amino acid in rat and mouse ace proteins). 
to focus on the impact of these two binding sites, the rest of the ace was kept the same as in human ace . the predicted model of the cov -rbd/ace complex was used as the starting model for md simulations. the spike protein rbd domain is composed of residues ( - ), while the ace protein contains residues from the n-terminal domain. the simulation parameterization and equilibration were prepared for complex structures including the mutant systems, using the charmm-gui webserver . each system was solvated in tip p water, with sodium chloride ions added to neutralize the system at a salt concentration of mm. each system was composed of about , atoms that were parametrized with the charmm force field . after energy minimization using the steepest descent algorithm, each system was equilibrated at human body temperature ( . k), which was maintained by the nose-hoover scheme. three independent trajectories, starting from random velocities drawn from maxwell distributions, were simulated for both the cov -rbd/ace and sars-rbd/ace complex systems in their wild types. in all simulations, a time step of . fs was used and the pme (particle mesh ewald) method was applied for long-range electrostatic interactions. the van der waals interactions were evaluated within a distance cutoff of . Å. bonds involving hydrogen atoms were constrained using the lincs algorithm . the human ace mutants in complex with cov -rbd were constructed as described previously. each mutant complex model was simulated in two independent trajectories. furthermore, as the crystal structure of the cov -rbd/ace complex became available, two additional simulations were carried out using the crystal structure as the starting model to cross-validate the simulation results based on the homology model. each trajectory was propagated to ns by following the same protocol as the wild type cov -rbd/ace complex simulations. 
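the independent trajectories described above differ only in their initial velocities. as a minimal sketch (not the gromacs implementation used in the study), the following python code draws maxwell-distributed velocities for a given random seed. the function names, the carbon-like masses and the temperature of 310.15 k are illustrative assumptions; the units follow the common md convention of g/mol, nm/ps and kj/mol.

```python
import numpy as np

K_B = 0.0083144626  # Boltzmann constant in kJ/(mol*K)

def maxwell_velocities(masses, temperature, seed):
    """Draw velocities (nm/ps) for atoms with the given masses (g/mol).

    Hypothetical helper for illustration only.
    """
    rng = np.random.default_rng(seed)
    masses = np.asarray(masses, dtype=float)
    # Each Cartesian component is Gaussian with variance k_B*T/m.
    sigma = np.sqrt(K_B * temperature / masses)
    v = rng.normal(size=(masses.size, 3)) * sigma[:, None]
    # Remove the centre-of-mass velocity so the system does not drift.
    v -= (masses[:, None] * v).sum(axis=0) / masses.sum()
    return v

def kinetic_temperature(masses, v):
    """Instantaneous temperature (K) from the kinetic energy."""
    masses = np.asarray(masses, dtype=float)
    kinetic = 0.5 * (masses[:, None] * v ** 2).sum()
    dof = 3 * masses.size - 3  # three degrees of freedom removed with the drift
    return 2.0 * kinetic / (dof * K_B)

if __name__ == "__main__":
    masses = np.full(20000, 12.011)  # assumed: 20000 carbon-like atoms
    for seed in (1, 2, 3):           # one seed per independent trajectory
        v = maxwell_velocities(masses, 310.15, seed)
        print(seed, round(kinetic_temperature(masses, v), 1))
```

using a different seed per trajectory gives statistically independent starting points from the same equilibrated coordinates, which is the purpose of repeating the simulation.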
analyses were carried out with tools in gromacs (such as rmsd, rmsf, energy, and pairdist) to examine the system properties, such as the overall stability, local residue and general structure fluctuations through the simulations. the g_mmpbsa program was applied to extract the molecular mechanics energy emm (lennard jones and electrostatic interactions) between ace and the rbd of spike proteins. vmd and chimera were applied to analyze the hydrogen bonds, molecular binding interface and water distributions, and for visualization and rendering of model images , . the homology structure of the cov -rbd/ace was compared to the sars-rbd/ace crystal structure and the newly resolved crystal structure of the cov -rbd/ace . the results indicated that the homology model is accurate, especially at the binding interface. the md simulations further refined the side chain orientations to improve the model quality. the simulation data revealed the stable binding between the cov -rbd and the ace , in spite of the conformational changes of the ace . the relative movement between the cov -rbd and the ace was mainly exhibited as a swinging motion pivoted at the binding interface. simulations also revealed the roles of water molecules in the binding of the rbd to the ace receptor. the md simulations of complexes with ace mutants suggested that mutations in the ace helix- and at the k can alter the binding modes and binding affinity. the predicted cov -rbd/ace complex structure is highly similar to the sars-rbd/ace , as shown in figure . the rbd domain has an rmsd of . Å for the aligned residues ( . Å for all residue pairs), indicating that the homology model of the cov -rbd is in good agreement with the sars-rbd. for ace residues near the binding interface (within . Å of the rbd), the rmsd is smaller than . Å compared to the sars-rbd/ace complex. the superposed structures revealed that the rbd/ace interfaces are almost identical in the two complexes (figure c ). 
in a retrospective comparison, the predicted complex structure was superposed to the newly resolved crystal models (see figure d for a detailed comparison at the interface). the results indicated that the homology model is very accurate, especially for the binding interface. the residues near the cov -rbd/ace interface (defined as the combined set of ace residues within . Å of rbd and the rbd residues within . Å of the ace ) exhibited a difference of . Å rmsd, which is comparable to the difference between the two independently reported crystal models (an rmsd of . Å for the same comparison). the rmsd is about . Å for residues in an extended region within . Å of the binding interface. the rbd domain of the spike protein showed an overall rmsd value of less than . Å, and the ace domain an rmsd of about . Å, between the predicted model and the crystal structures. in three simulations of the cov -rbd/ace systems, the binding interface was highly stable, exhibiting very small conformational changes, especially for the interfacing residues of the ace protein. the rmsd for the residues at the rbd binding interface is . Å (+/- . Å) on average. side chain atom positions were refined to form more favorable interactions (figure d). one outstanding example is the k side-chain, which pointed in the wrong orientation in the predicted structure and was quickly refined to the correct orientation, consistent with the crystal structure (right panel of figure d ). 
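the rmsd values quoted above depend on first superposing the two structures. as an illustration of that comparison (a sketch, not the gromacs or chimera routine the authors ran), the python function below computes the rmsd of two matched coordinate sets after optimal rigid-body superposition with the kabsch algorithm. the function name and the assumption that atoms are already paired one-to-one are ours.

```python
import numpy as np

def kabsch_rmsd(P, Q):
    """RMSD between two (N, 3) coordinate sets after optimal superposition.

    Row i of P is assumed to correspond to row i of Q (e.g. matched CA atoms).
    """
    P = np.asarray(P, dtype=float)
    Q = np.asarray(Q, dtype=float)
    # Remove the translation by centring both sets on their centroids.
    Pc = P - P.mean(axis=0)
    Qc = Q - Q.mean(axis=0)
    # The SVD of the covariance matrix gives the optimal rotation.
    U, _, Vt = np.linalg.svd(Pc.T @ Qc)
    # Guard against an improper rotation (a reflection).
    d = np.sign(np.linalg.det(Vt.T @ U.T))
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    diff = (R @ Pc.T).T - Qc
    return float(np.sqrt((diff ** 2).sum() / len(P)))

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    model = rng.normal(size=(50, 3))
    # A slightly perturbed copy stands in for the crystal structure.
    crystal = model + rng.normal(scale=0.1, size=model.shape)
    print(kabsch_rmsd(model, crystal))
```

restricting the rows of both arrays to interface residues before calling the function gives an interface rmsd of the kind reported in the text.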
in terms of collective conformational changes, the cov -rbd/ace complex showed two interesting movements: ( ) the loop (l ) between β and β (residues between s and g in particular) of cov -rbd was found to expand its interactions with the n-terminal helix (the helix- ) of the ace (figure a), while it pointed away from the helix- in the predicted and the crystal structures (figure d, left); ( ) a tilting movement of the rbd relative to the ace was observed, which can be depicted as a swinging motion with the binding interface as the pivot (see figure for an illustration). in both the predicted and the crystal structures of the cov -rbd/ace complex, the l does not form close contacts with the ace . the analysis of the crystal packing revealed that this loop participated in the interaction with another asymmetric unit (see supplemental information). interestingly, the simulation data suggested that the l could move towards the ace and form contacts with the helix- . this can potentially enhance the binding, as reflected in the change of interaction energies. in the crystal structure, the c and c of the rbd are cross-linked via a disulfide bond that reduces the flexibility of the l region, limiting its access to the ace . on the other hand, it has been reported that the binding of sars-rbd to ace is insensitive to the redox states of the cysteines to a large extent . based on the simulation results, we hypothesize that the reduced form of c and c can also exist during the virus invasion of host cells, and the reduced cysteines can potentially enhance the binding to ace . in the other two simulation trajectories, we found that the l remained in conformations similar to that in the crystal structure and the cysteines (c and c ) were close enough for disulfide bond formation. 
by examining the binding interface of cov -rbd and the ace , we found that polar and charged residues account for a large fraction; therefore, electrostatic interactions play critical roles in complex formation. based on the distances between the two proteins, the key residues at the binding interface were identified and summarized in table for the three representative models (see figure ). the majority of these residues are conserved across the three models, except that model# (figure a) has additional contacts to the ace from residues in the l region (highlighted with green color in table ). as shown in figure , the l remained in the starting position for the other two representative models (figure b,c). the same simulations were carried out for the sars-rbd/ace complex, serving as a comparative system. interestingly, the sars-rbd counterpart of the l in cov -rbd did not form close contacts with the ace in three simulations. it is worthwhile to mention that the sequence identity between cov -rbd and sars-rbd is low in this loop region, suggesting the loop region might be partially responsible for the difference in the receptor binding. the hydrogen bonds between the cov -rbd and ace were extracted using vmd. based on the statistics of three simulation trajectories, the cov -rbd/ace complex has . hydrogen bonds between the subunits on average with stringent criteria. in comparison, the sars-rbd/ace has . hydrogen bonds on average (see supplementary information ). the statistics of hydrogen bonds suggest a slightly weaker binding between the cov -rbd and the ace , relative to the sars-rbd/ace complex. it is also worth pointing out the important roles of water molecules at the interface of the cov -rbd/ace complex. at any instant, there are approximately water molecules at the binding interface, simultaneously located within . Å of both the cov -rbd and the ace (figure ). 
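the interfacial-water criterion above (a water molecule lying within a cutoff of both proteins at the same time) can be sketched in a few lines of python. this is an illustrative brute-force version operating on plain coordinate arrays for a single frame; the function name is hypothetical, the cutoff is left as a parameter because its value is not legible in the text, and the authors' own analysis was done with vmd.

```python
import numpy as np

def interfacial_waters(water_o, rbd_xyz, ace_xyz, cutoff):
    """Indices of water oxygens within `cutoff` of BOTH protein atom sets.

    All coordinates are (N, 3) arrays in the same length unit as `cutoff`.
    The full distance matrix is built, which is fine for a sketch; a
    neighbour search would be preferable for a fully solvated system.
    """
    water_o = np.asarray(water_o, dtype=float)

    def near(protein_xyz):
        protein_xyz = np.asarray(protein_xyz, dtype=float)
        d = np.linalg.norm(water_o[:, None, :] - protein_xyz[None, :, :], axis=-1)
        return d.min(axis=1) <= cutoff

    return np.flatnonzero(near(rbd_xyz) & near(ace_xyz))

if __name__ == "__main__":
    # Toy frame: one atom per protein and three waters on a line.
    rbd = np.array([[0.0, 0.0, 0.0]])
    ace = np.array([[4.0, 0.0, 0.0]])
    waters = np.array([[2.0, 0.0, 0.0], [-2.5, 0.0, 0.0], [10.0, 0.0, 0.0]])
    print(interfacial_waters(waters, rbd, ace, cutoff=3.0))
```

applying the function to every frame and tracking how long each water index stays in the returned set would give a residence-time estimate of the kind discussed in the text.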
these water molecules can function as bridges by forming hydrogen bonds with the residues from the rbd or the ace . the dwelling time of water molecules at the interface can be a few nanoseconds, as revealed by the md simulations. this result is also consistent with the crystal structure, which has water molecules at the interface (figure c ). these discoveries emphasize the role of the water molecules, which deserves detailed quantification to understand the interactions between the rbd and the ace . it has been demonstrated that the ace proteins from several mammalian species possess high sequence similarities, yet their binding to the sars-rbd differs significantly. in particular, the binding of sars-rbd to the rat ace is much weaker, as discovered in experiments . inspired by this information, two mutants of the cov -rbd/ace were constructed: ( ) ace -mut-h by mutating the n-terminal helix- to that of the rat ace ; ( ) ace -k h by mutating k to histidine (the amino acid in wild type rat ace ). two ns md simulations were carried out for each mutant system. the simulations showed that the mutations in ace -mut-h reduced the interaction between the cov -rbd and the helix- , and the ace -k h showed weaker binding between the cov -rbd and the β-hairpin centered at the h . using clustering analysis, the representative structures were identified from each simulation trajectory (figure ). although the overall topology is very similar to the wild type complex structure, there are pronounced differences. for the ace -mut-h system, the cov -rbd tilted further away from the ace helix- in one simulation (figure a); and the cov -rbd lost its contact with helix- (g to n ) in another simulation for the ace -k h (figure c ). in the wild type ace , the k is a hydrogen bond donor, and its mutant h cannot form the hydrogen bond with the cov -rbd as in the wild type cov -rbd/ace complex. the number of contacting residue pairs was significantly reduced in the ace -k h mutant system. 
this is in line with the report that k is more critical than the other residues, as its hydrophobic neighborhood gives this positively charged residue high selectivity for the rbd , . the physical interactions between the rbd and the ace were quantified for the simulated structures. we considered the molecular mechanics energy emm , which is composed of the van der waals and the electrostatic interactions. furthermore, the number of residue contacts (nc) between the rbd and the ace was extracted from simulated structures. both the emm and nc indicate that the rbd interactions with the ace are comparable for cov and sars spike proteins (figure ). we would like to point out that the energy emm is the physical interaction between the rbd and the ace , rather than the binding energy, which requires accurately incorporating solvation energy and entropy. furthermore, the standard deviations of emm are . kj/mol and . kj/mol for the two complexes. therefore, we infer that the binding affinities are comparable for cov -rbd/ace and sars-rbd/ace . the simulations started from the predicted and crystal models yielded very similar results (purple triangles). this is in line with a recent study, in which the authors showed similar binding affinity to human ace for both sars-cov- and sars-cov spike proteins . they found the association rate constants kon to be the same at . x m- s- , while the sars-cov spike protein showed a faster dissociation, with the rate constant koff to be . x - s- , about . times larger than that of the sars-cov- spike protein (koff = . x - s- ). similar kon values and equilibrium dissociation constants kd in the nanomolar range were reported in other studies for sars-cov- spike protein (or rbd) binding to human ace , . 
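the link between the rate constants and the equilibrium dissociation constant mentioned above can be written out explicitly. the relation below assumes simple one-to-one binding, which is our assumption rather than a statement from the cited study; the superscripts sars and cov2 are labels introduced here for the two spike proteins.

```latex
K_D = \frac{k_{\mathrm{off}}}{k_{\mathrm{on}}},
\qquad
\frac{K_D^{\mathrm{SARS}}}{K_D^{\mathrm{CoV2}}}
  = \frac{k_{\mathrm{off}}^{\mathrm{SARS}} / k_{\mathrm{on}}}
         {k_{\mathrm{off}}^{\mathrm{CoV2}} / k_{\mathrm{on}}}
  = \frac{k_{\mathrm{off}}^{\mathrm{SARS}}}{k_{\mathrm{off}}^{\mathrm{CoV2}}}
```

when the two kon values are equal, as reported, the ratio of the kd values reduces to the ratio of the koff values, so a modest difference in koff implies an equally modest difference in affinity.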
more interestingly, the mutation impacts were reflected in the emm and nc analysis: the ace -mut_h is likely to reduce the binding to the ace due to the tilting movement of cov -rbd, making it further from the ace helix- (the blue triangle symbol at lower right, see figure a for the representative structure). in the other simulation trajectory for the cov -rbd/ace -mut_h complex (blue triangle at the left upper corner), the largest nc was observed among all simulations. for simulations of the complex with ace -k h mutants (green diamonds), the number of contacts were both reduced compared to the wild type system. in one simulation, the contacts between the cov -rbd and the helix- of the ace were completely lost (see figure c) , consistent with the less favorable interactions reflected on an increase of emm. for the sars-rbd interaction with the ace -mut_h , both simulations revealed fewer contacts compared to the wild type sars-rbd/ace complex (purple stars in figure ). the homology modeling of the cov -rbd/ace complex yielded highly consistent models compared to the crystal structures. all-atom molecular dynamics simulations were carried out to study the dynamic interactions of cov -rbd with human ace , the results were compared to the sars-rbd/ace system. the human ace mutants were also constructed to mimic the rat ace to investigate the roles of critical residues, and possible binding modes in other mammals. it is observed that md simulations improved the structure at the binding interface and strengthened the interactions between the subunits. the structure of the complex interface is highly stable for all simulations of cov -rbd/ace complex in the wild type. the loop region between β and β can potentially form more contacts with the ace as observed in one simulation trajectory. 
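the number of residue contacts (nc) used in the analysis above can be illustrated with a short python sketch. the exact contact criterion is not given in the text, so the definition here (two residues are in contact if any pair of their atoms lies within a distance cutoff) and the function name are assumptions made for illustration.

```python
import numpy as np

def count_residue_contacts(xyz_a, res_a, xyz_b, res_b, cutoff):
    """Count residue pairs (one residue from each protein) in contact.

    xyz_a, xyz_b: (N, 3) atom coordinates of the two proteins.
    res_a, res_b: residue identifier of each atom, same length as the coordinates.
    A pair counts once if ANY of its atom pairs is within `cutoff`.
    """
    xyz_a = np.asarray(xyz_a, dtype=float)
    xyz_b = np.asarray(xyz_b, dtype=float)
    res_a = np.asarray(res_a)
    res_b = np.asarray(res_b)
    # Brute-force atom-atom distance matrix; adequate for an interface subset.
    d = np.linalg.norm(xyz_a[:, None, :] - xyz_b[None, :, :], axis=-1)
    ia, ib = np.nonzero(d <= cutoff)
    # Collapse atom pairs to unique residue pairs.
    return len(set(zip(res_a[ia].tolist(), res_b[ib].tolist())))

if __name__ == "__main__":
    # Toy example: residue 1 (two atoms) touches residue 100; residue 2 does not.
    rbd_xyz = [[0.0, 0.0, 0.0], [0.5, 0.0, 0.0], [10.0, 0.0, 0.0]]
    rbd_res = [1, 1, 2]
    ace_xyz = [[1.0, 0.0, 0.0], [30.0, 0.0, 0.0]]
    ace_res = [100, 101]
    print(count_residue_contacts(rbd_xyz, rbd_res, ace_xyz, ace_res, cutoff=2.0))
```

counting unique residue pairs rather than atom pairs keeps large side chains from dominating the statistic, which makes values comparable between the wild type and mutant complexes.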
the simulation results also reveal that the interactions between cov -rbd and the ace are mediated by water molecules at the interfaces, stressing the necessity of accounting for explicit water molecules when quantifying the binding affinity. the interactions between the rbd and the ace were quantified by physical interaction energies (molecular mechanics energy) and the number of contacting residues. the detailed comparison results suggest that the cov -rbd and the sars-rbd bind to human ace with comparable affinity. in the comparison between the sars-rbd/ace and the cov -rbd/ace complexes, the former forms fewer contacts than the latter (figure ), yet exhibits stronger interactions. the decomposition of the emm into the van der waals and the electrostatic interactions revealed that the major difference is attributed to the electrostatic interactions. furthermore, we compared the major contacting residues and found that the sars-rbd has two charged residues (r and d ) and the cov -rbd has only one charged residue (k ) at the complex interface. the polar and hydrophobic residues are comparable in the two rbds. this is consistent with the statistics of hydrogen bonds at the complex interfaces. this study was started with a structure predicted using the homology modeling method, which was later found to be highly consistent with the crystal structure, demonstrating the potential of structure prediction and dynamics simulation in revealing molecular details before the availability of high resolution experimental information. furthermore, the interactions between cov -rbd and the ace mutants mimicking rat ace protein were investigated. the results provide valuable information at the atomic level for the reduced binding affinity. the recent report on the sars-cov- infection to a dog
mut_h : t l, q k, k e, t s, d n, h q, f s. remark: l , n , q , and s are conserved between rat and mouse.
k h: k h. remark: h is conserved between rat and mouse.
table . 
contact residues between the cov -rbd and the ace . green color denotes new interaction not observed in crystal structure.
model# t f d k h e e d y q m y n k g d r k y l f q a g s t g f n y q y g q t n g y s q t f d k h e d y q l m y n k g d r k g y y l f a f n y q y g q t n g y q t f d k h e e d y q m y n k g d r k g y y l f a f n y q y g q t n g y
table s . contact residues at the sars-cov-rbd/ace interface
traj traj traj ace cov ace cov ace cov s q t k h e d y q l l l l m y q e n k g d r r y y y l d l n y n g y t t g i y q t d k h e d y q l l l l m y q e n k g d r r y y y l p d g l n y n g y t t g i y q t k h e d y q l l l l m y q e n k g d r r y y l p d g p l n y l n g y t t g i y m t k k y e m t k k y e
fig. panels a, b, c: cov -rbd/ace , sars-rbd/ace , superposed. fig. panels a, b, c.
a novel coronavirus from patients with pneumonia in china a new coronavirus associated with human respiratory disease in china evolution of the novel coronavirus from the ongoing wuhan outbreak and modeling of its spike protein for risk of human transmission genomic characterisation and epidemiology of novel coronavirus: implications for virus origins and receptor binding angiotensin-converting enzyme is a functional receptor for the sars coronavirus a -amino acid fragment of the sars coronavirus s protein efficiently binds angiotensin-converting enzyme structure of sars coronavirus spike receptor-binding domain complexed with receptor trilogy of ace : a peptidase in the renin-angiotensin system, a sars receptor, and a partner for amino acid transporters the pathogenicity of novel coronavirus in hace transgenic mice crystal structure of the -ncov spike receptor-binding domain bound with the ace receptor swiss-model: homology modelling of protein structures and complexes efficient replication of severe acute respiratory syndrome coronavirus in mouse cells is limited by murine angiotensin-converting enzyme receptor recognition by novel coronavirus from wuhan: an analysis based on decade-long 
structural studies of sars simulations using the charmm additive force field optimization of the additive charmm all-atom protein force field targeting improved sampling of the backbone φ, ψ and side-chain χ and χ dihedral angles a unified formulation of the constant temperature molecular dynamics methods canonical dynamics: equilibrium phase-space distributions polymorphic transitions in single crystals: a new molecular dynamics method gromacs: high performance molecular simulations through multi-level parallelism from laptops to supercomputers particle mesh ewald: an n·log(n) method for ewald sums in large systems lincs: a linear constraint solver for molecular simulations g-mmpbsa -a gromacs tool for highthroughput mm-pbsa calculations vmd: visual molecular dynamics ucsf chimera -a visualization system for exploratory research and analysis receptor and viral determinants of sars-coronavirus adaptation to human ace significant redox insensitivity of the functions of the sars-cov spike glycoprotein: comparison with hiv envelope structural analysis of major species barriers between humans and palm civets for severe acute respiratory syndrome coronavirus infections receptor recognition mechanisms of coronaviruses: a decade of structural studies structure, function, and antigenicity of the sars-cov- spike glycoprotein cryo-em structure of the -ncov spike in the prefusion conformation. science ( -. ) coronavirus: hong kong confirms a second dog is infected the authors declare no competing interests. 
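the contact-residue counts used above to compare the two complexes can be illustrated with a minimal sketch. it assumes a simple distance criterion (any atom pair within 4.0 angstroms, a hypothetical cutoff; the criterion used in the study is not stated in this section), and the function name, residue labels, and coordinates are illustrative only.

```python
import math

def contacting_residues(rbd_atoms, ace_atoms, cutoff=4.0):
    """Return (rbd_residues, ace_residues) that have at least one atom pair
    within `cutoff` angstroms. Each atom is a tuple (residue_label, x, y, z).
    The cutoff value is an assumption, not taken from the study."""
    rbd_hits, ace_hits = set(), set()
    for res_a, *xyz_a in rbd_atoms:
        for res_b, *xyz_b in ace_atoms:
            if math.dist(xyz_a, xyz_b) <= cutoff:
                rbd_hits.add(res_a)
                ace_hits.add(res_b)
    return rbd_hits, ace_hits

# toy coordinates (hypothetical, one atom per residue)
rbd = [("K1", 0.0, 0.0, 0.0), ("Y2", 10.0, 0.0, 0.0)]
ace = [("D1", 3.0, 0.0, 0.0), ("H2", 30.0, 0.0, 0.0)]
rbd_contacts, ace_contacts = contacting_residues(rbd, ace)
print(sorted(rbd_contacts), sorted(ace_contacts))
```

in a real trajectory analysis the same test would be applied frame by frame, and a residue would typically be reported only if it is in contact for a sizeable fraction of frames.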
key: cord- -es qhz authors: rogers, thomas f.; zhao, fangzhu; huang, deli; beutler, nathan; burns, alison; he, wan-ting; limbo, oliver; smith, chloe; song, ge; woehl, jordan; yang, linlin; abbott, robert k.; callaghan, sean; garcia, elijah; hurtado, jonathan; parren, mara; peng, linghang; ramirez, sydney; ricketts, james; ricciardi, michael j.; rawlings, stephen a.; wu, nicholas c.; yuan, meng; smith, davey m.; nemazee, david; teijaro, john r.; voss, james e.; wilson, ian a.; andrabi, raiees; briney, bryan; landais, elise; sok, devin; jardine, joseph g.; burton, dennis r. title: isolation of potent sars-cov- neutralizing antibodies and protection from disease in a small animal model date: - - journal: science doi: . /science.abc sha: doc_id: cord_uid: es qhz countermeasures to prevent and treat covid- are a global health priority. we enrolled a cohort of sars-cov- -recovered participants, developed neutralization assays to interrogate antibody responses, adapted our high-throughput antibody generation pipeline to rapidly screen over antibodies, and established an animal model to test protection. we isolated potent neutralizing antibodies (nabs) to two epitopes on the receptor binding domain (rbd) and to distinct non-rbd epitopes on the spike (s) protein. we showed that passive transfer of a nab provides protection against disease in high-dose sars-cov- challenge in syrian hamsters, as revealed by maintained weight and low lung viral titers in treated animals. the study suggests a role for nabs in prophylaxis, and potentially therapy, of covid- . the nabs define protective epitopes to guide vaccine design. the novel coronavirus disease (covid- ) has had devastating global health consequences and there is currently no cure and no licensed vaccine. 
neutralizing antibodies (nabs) to the causative agent of the disease, severe acute respiratory syndrome coronavirus- (sars-cov- ), represent potential prophylactic and therapeutic options and could help guide vaccine design. indeed, a nab to another respiratory virus, respiratory syncytial virus (rsv), is in widespread clinical use prophylactically to protect vulnerable infants ( ). furthermore, nabs prevent death from the emerging ebola virus in macaques, even when given relatively late in infection, and thus have been proposed for use in humans in outbreaks ( , ). generally, nabs with outstanding potency ("super-antibodies") ( ) can be isolated by deeply mining antibody responses of a sampling of infected donors. outstanding potency together with engineering to extend antibody halflife from weeks to many months brings down the effective costs of abs and suggests more opportunities for prophylactic intervention. at the same time, outstanding potency can permit anti-viral therapeutic efficacy that is not observed for less potent antibodies ( ). here, we present the isolation of highly potent nabs to sars-cov- and demonstrate their in vivo protective efficacy in a small animal model, suggesting their potential utility as a medical countermeasure. to interrogate the antibody response against sars-cov- and discover nabs, we adapted our pipeline to rapidly isolate and characterize monoclonal antibodies (mabs) from convalescent donors (fig. ) . briefly, a cohort of previously swabpositive sars-cov- donors was recruited for peripheral blood mononuclear cell (pbmc) and plasma collection. in parallel, we developed both live replicating and pseudovirus neutralization assays using a hela-ace (angiotensin-converting enzyme- ) cell line that gave robust and reproducible virus titers. convalescent serum responses were evaluated for neutralization activity against sars-cov- and sars-cov- , and eight donors were selected for mab discovery. 
single antigen-specific memory b cells were sorted, and their corresponding variable genes were recovered and cloned using a high-throughput production system that enabled antibody expression and characterization in under two weeks. promising mabs were advanced for further biophysical characterization and in vivo testing. two platforms were established to evaluate plasma neutralization activity against sars-cov- , one using replicationcompetent virus and another using pseudovirus (psv). vero-e cells were first used as target cells for neutralization assays, but this system was relatively insensitive at detecting replicating virus compared to a hela cell line that stably expressed the cell surface ace receptor ( fig. s a ). the hela-ace target cells gave reproducible titers and were used for the remainder of the study. in certain critical instances, hela-ace and vero cells were compared. the live replicating virus assay used the washington strain of sars-cov- , usa-wa / (bei resources nr- ) and was optimized to a -well format to measure plaque formation. in parallel, a psv assay was established for both sars-cov- and sars-cov- using murine leukemia virus (mlv)-based psv ( ). the assay used single cycle infectious viral particles bearing a firefly luciferase reporter for high-throughput screening. unlike mlv-psv, which buds at the plasma membrane, coronaviruses assemble in the er-golgi intermediate compartment, so the c terminus of the sars-cov- spike protein (s protein) contains an er retrieval signal ( ). the alignment of sars-cov- and sars-cov- s proteins showed that this er retrieval signal is conserved in sars-cov- (fig. s b). to prepare high titers of infectious mlv-cov- and sars-cov- psv particles, various truncations of sars-cov- and sars-cov- s protein were expressed in which the er retrieval signal was removed to improve exocytosis of the virus. 
pseudovirion versions carrying sars-cov -sΔ and sars-cov -sΔ s protein efficiently transduced ace -expressing target cells, but not control hela or a cells (fig. s c). control vsv-g pseudotyped virions showed a similar transduction efficiency in all target cells. luciferase expression in transduced cells proved to be proportional to viral titer over a wide range ( fig. s d ). in parallel to the development of neutralization assays, a cohort was established in san diego, california, of donors who had previously been infected with sars-cov- ( fig. a, fig. s a , and table s ). the cohort was % female and the average age was years. infection was determined by a positive sars-cov- pcr test from a nasopharyngeal swab. all donors also had symptoms consistent with covid- , and disease severity ranged from mild to severe, including intubation in one case, although all donors recovered. donor plasma were tested for binding to recombinant sars-cov- and sars-cov- s and receptor binding domain (rbd) proteins, for binding to cell surface expressed spikes and for neutralization in both live replicating virus and pseudovirus assays (fig. , b to d, and fig. s b ; donors cc , cc and cc that are further pursued below are highlighted). binding titers to sars-cov- s protein varied considerably, reaching ec s at serum dilutions of around , with titers against the rbd about an order of magnitude less. titers against sars-cov- s protein were notably less than for sars-cov- s protein and titers against sars-cov- rbd were only detected in a small number of donors. neutralizing titers in the psv assay varied over a wide range for sars-cov- ( fig. d and fig. s a ) and were low or undetectable against sars-cov- . importantly, rbd binding and psv neutralization were well correlated (fig. e) . there was also a positive correlation between cell surface spike binding and live replicating virus neutralization ( fig. s c ). 
the titers in the psv assay and the replicating virus assay were largely similar (figs. s and s ). in most later measurements, the psv assay was preferred owing to its higher throughput.

cryopreserved pbmcs from eight donors were stained for memory b cell markers (cd +/igg+) and both avi-tag biotinylated rbd and sars-cov- s antigen baits before single-cell sorting. s+ and s+/rbd+ memory b cells were present at an average frequency of . % and . %, respectively, across the eight donors (fig. s a). in total, antigen-positive (ag+) memory b cells were sorted to rescue native heavy and light chain pairs for mab production and validation (fig. s b). a total of antibodies were cloned and expressed, which represents, on average, a % pcr recovery of paired variable genes and > % estimated recovery of fully functional cloned genes (fig. s c). the bulk-transformed ligation products for both the heavy chain and light chain were transfected and tested for binding to rbd and s protein, and for neutralization in the sars-cov- pseudovirus assay using hela-ace target cells (fig. s ). the majority of transfected pairs resulted in igg expression ( %). of these, % showed binding only to s protein, while . % bound to both s and rbd proteins and . % bound only to rbd. the supernatants were also screened for binding to an unrelated hiv antigen (bg sosip) to eliminate non-specific or polyreactive supernatants. the supernatants were next evaluated for neutralization activity using sars-cov- and sars-cov- pseudoviruses. strikingly, only a small proportion of the binding antibodies showed neutralization activity, and that activity was equally distributed between rbd+/s+ and s+ only binders despite a much larger number of s+ only binding supernatants, as exemplified by the three donors cc , cc and cc (fig. a). these data indicate that viral infection generates a strong response against the non-rbd regions of s protein, but only a small proportion of that response is neutralizing.
in contrast, there are fewer rbd binding antibodies but a larger proportion of these neutralize sars-cov- pseudovirus. antibodies that tested positive for neutralization in the high-throughput screening were sequence confirmed and advanced for expression at large scale for additional characterization. a total of antibodies were prioritized for in depth characterization from the donors, cc , cc and cc . within that subset, we identified distinct lineages, with containing a single member (table s ) . vh and vh -gene families were notably prominent in these abs and there was a diversity of cdr lengths (fig. , b and c). there was one prominent example of a clonally expanded lineage, with recovered clonal members that averaged . % and . % mutations from germline at the nucleotide level in the heavy chain and light chain, respectively (fig. d ). the remaining clones were relatively unmutated, averaging just above % mutation at the nucleotide level suggesting that these antibodies were primed by the ongoing covid infection and likely not recalled from a previous endemic human coronavirus (hcov) exposure. all antibodies that were expressed at scale were evaluated in standard elisa-based polyreactivity assays with solubilized cho membrane preparations, ssdna and insulin ( , ) , and none were polyreactive ( fig. s ). the antibody hits that were identified in the high-throughput screening were next evaluated for epitope specificity by biolayer interferometry (bli) using s and rbd proteins as capture antigens. the antigens were captured on anti-his biosensors before addition of saturating concentrations ( μg/ml) of antibodies that were then followed by competing antibodies at a lower concentration ( μg/ml). accordingly, only antibodies that bind to a non-competing site would be detected in the assay. 
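the competition logic of the binning assay can be sketched as follows. this is a simplified illustration, not the authors' analysis: it assumes each antibody pair has already been scored as competing or non-competing (the bli response threshold is not given in this section) and groups antibodies into bins as connected components of the competition graph. the antibody names and the function name are hypothetical.

```python
def epitope_bins(antibodies, competing_pairs):
    """Group antibodies into bins: two antibodies share a bin if they are
    linked by a chain of pairwise competition (union-find on the graph)."""
    parent = {ab: ab for ab in antibodies}

    def find(ab):
        # walk to the representative of ab's group, compressing the path
        while parent[ab] != ab:
            parent[ab] = parent[parent[ab]]
            ab = parent[ab]
        return ab

    for a, b in competing_pairs:
        parent[find(a)] = find(b)

    groups = {}
    for ab in antibodies:
        groups.setdefault(find(ab), []).append(ab)
    return sorted(sorted(members) for members in groups.values())

# hypothetical antibodies and competition calls
antibodies = ["mab1", "mab2", "mab3", "mab4"]
competing_pairs = [("mab1", "mab2"), ("mab3", "mab4")]
bins = epitope_bins(antibodies, competing_pairs)
print(bins)
```

one consequence of this simplification is that an antibody competing with members of two otherwise separate groups would merge them into a single bin, so such cases need to be inspected by hand.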
among the antibodies evaluated, the results reveal three epitope bins for rbd (designated as rbd-a, rbd-b, and rbd-c) and three epitope bins for the s protein (designated as s-a, s-b, and s-c) ( fig. a and fig. s ). interestingly, the mab cc . appears to compete with antibodies targeting two different epitopes, rbd-b and s-a ( fig. s ), which might indicate that this mab targets an epitope spanning rbd-b and s-a. to evaluate epitope specificities further, we next assessed binding of the antibodies to extended rbd-constructs with subdomains (sd) and , including the independently folding rbd-sd and rbd-sd - , and the n-terminal domain (ntd) ( fig. b and fig. s , a and b). none of the antibodies showed binding to the ntd. cc . binds to all the other constructs, which supports the epitope binning data described in fig. a . the other antibodies grouped in the s-a epitope bin that compete with cc . show either no binding to rbd or rbd-sd constructs (cc . and cc . ) or do show binding to rbd-sd and rbd-sd - but not rbd (cc . ). these data suggest two competing epitopes within the s-a epitope bin; one that is confined to the non-rbd region of s protein and the other that includes some element of rbd-sd - . this interpretation will require further investigation by structural studies. we next evaluated the mabs for neutralization activity against sars-cov- and sars-cov- pseudoviruses. the neutralization ic potencies of these antibodies are shown in fig. c and their associated maximum plateaus of neutralization (mpns) are shown in fig. d . a comparison of neutralization potencies between pseudovirus ( fig. s c ) and live replicating virus ( fig. s d ) is also included. notably, the most potent neutralizing antibodies were those directed to rbd-a epitope including two antibodies, cc . and cc . , that neutralize sars-cov- pseudovirus with an ic of ng/ml and ng/ml, respectively (fig. c ). 
in comparison, antibodies directed to rbd-b tended to have higher ic s and many plateau below % neutralization. despite this trend, cc . is directed against rbd-b and showed complete neutralization of sars-cov- with an ic of ng/ml and also neutralized sars-cov- with an ic of ng/ml. this was the only antibody that showed potent neutralization of both pseudoviruses. the antibodies that do not bind to rbd and are directed to non-rbd epitopes on s protein all show poor neutralization potencies and mpns well below %. to evaluate whether the rbd-a epitope might span the ace binding site, we next performed cell surface competition experiments. briefly, antibodies were premixed with biotinylated s (fig. e ) or rbd (fig. f) proteins at a molar ratio of : of antibodies to target antigen. the mixture was then incubated with the hela-ace cell line and the percent competition against ace receptor was recorded by comparing percent binding of the target antigen with and without antibody present ( fig. s e ). the antibodies targeting the rbd-a epitope compete best against the ace receptor and the neutralization ic correlates well with the percent competition for ace receptor binding for both s protein (fig. e ) and for rbd (fig. f ). we also assessed the affinity of all rbd-specific antibodies to soluble rbd by surface plasmon resonance (spr) and found a poor correlation between affinity and neutralization potency ( fig. g and fig. s ). however, the correlation is higher when limited to antibodies targeting the rbd-a epitope. the lack of a correlation between rbd binding and neutralization for mabs contrasts with the strong correlation described earlier for serum rbd binding and neutralization. overall, the data highlight epitope rbd-a as the preferred target for eliciting neutralizing antibodies and that corresponding increases in affinity of mabs to rbd-a will likely result in corresponding increases in neutralization potency. 
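the three quantities used in this comparison (ic50, maximum plateau of neutralization, and percent competition for receptor binding) can be illustrated with a minimal sketch. it assumes percent neutralization is derived from luciferase signal relative to a virus-only control and that the ic50 is read off by log-linear interpolation between the two bracketing concentrations; the curve-fitting procedure actually used is not described in this section, and all function names and values below are hypothetical.

```python
import math

def percent_neutralization(rlu, rlu_virus_only, rlu_background=0.0):
    """Percent reduction of reporter signal relative to the virus-only control."""
    return 100.0 * (1.0 - (rlu - rlu_background) / (rlu_virus_only - rlu_background))

def interpolate_ic50(concentrations, neutralization):
    """Concentration giving 50% neutralization, by log-linear interpolation.
    `concentrations` must be ascending; returns None if 50% is never crossed."""
    points = list(zip(concentrations, neutralization))
    for (c_lo, n_lo), (c_hi, n_hi) in zip(points, points[1:]):
        if n_lo < 50.0 <= n_hi:
            frac = (50.0 - n_lo) / (n_hi - n_lo)
            log_c = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10.0 ** log_c
    return None

def max_plateau(neutralization):
    """Maximum plateau of neutralization, taken here as the highest observed value."""
    return max(neutralization)

def percent_competition(binding_with_antibody, binding_without_antibody):
    """Percent loss of antigen binding to receptor-expressing cells caused by the antibody."""
    return 100.0 * (1.0 - binding_with_antibody / binding_without_antibody)

# hypothetical titration: antibody concentration (ng/ml) and luciferase signal
conc = [1.0, 10.0, 100.0, 1000.0]
rlu = [900.0, 750.0, 250.0, 100.0]
neut = [percent_neutralization(r, rlu_virus_only=1000.0) for r in rlu]
ic50 = interpolate_ic50(conc, neut)
mpn = max_plateau(neut)
competition = percent_competition(20.0, 80.0)
print(neut, ic50, mpn, competition)
```

an antibody whose curve never reaches 50% returns `None` for the ic50, which matches the situation described above for antibodies that plateau at low neutralization.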
sars-cov- has shown some propensity for mutation as it has circulated worldwide, as evidenced, for example, by the emergence of the d g variant ( ). we investigated the activity of nabs against viral variants that have been reported. the sera studied above neutralized all the variants (fig. s a). all nabs neutralized the d g variant. however, one variant with a mutation in the ace binding site (g s) did show effectively complete resistance to one of the nabs, and another variant (v f) showed a -fold higher ic than the wa- strain (fig. s b).

to investigate the relationship between in vitro neutralization and protection in vivo against sars-cov- , we selected two mabs for passive transfer/challenge experiments in a syrian hamster animal model based on a summary of the nab data (table s and fig. s ). the experimental design for the passive transfer study is shown in fig. a. in the first experiment, we tested nab cc . , which targets the rbd-a epitope and has an in vitro ic neutralization of . μg/ml against pseudovirus, and in the second we tested nab cc . , which targets the s-b epitope with an ic neutralization of μg/ml. in both experiments an unrelated antibody to dengue virus, den , was used as a control. the anti-sars-cov- nabs were delivered at different concentrations to evaluate dose-dependent protection, starting at mg/animal (average of . mg/kg) at the highest dose and μg/animal at the lowest dose. the den control antibody was delivered at a single dose of mg/animal. sera were collected from each animal hours post ip infusion of the antibody, and all animals were subsequently challenged with a dose of x pfu of sars-cov- (usa-wa / ) by intranasal administration hours post antibody infusion (fig. a). syrian hamsters typically clear virus within one week after sars-cov- infection ( ). accordingly, the hamsters were weighed as a measure of disease due to infection. lung tissues were collected to measure viral load on day . a data summary is presented in fig. b and fig.
s a for animals that received cc . , which targets the rbd-a epitope. the control animals that received den lost on average . % of body weight at days post virus challenge. in comparison, the animals that received the neutralizing rbd-a antibody at a dose of mg (average of . mg/kg) or μg (average of . mg/kg) exhibited no weight loss. however, animals that received a dose of μg (average of . mg/kg) had an average % loss of body weight, while animals that received a dose of μg/ml ( . mg/kg) and μg/ml ( . mg/kg) lost . % and . % of body weight, respectively. we note these animals showed a trend for greater weight loss than control animals but this did not achieve statistical significance (table s ) . given concerns about antibody-mediated enhanced disease in sars-cov- infection, this observation merits further attention using larger animal group sizes. the weight loss data are further corroborated by quantification of lung viral load measured by real-time pcr (fig. c ) and showed a moderate correlation to weight loss. the data indicate comparable viral loads between the three higher doses ( mg, μg, and μg) of nabs. in contrast, equivalent viral loads were observed between the control group receiving den and the low dose groups receiving μg and μg of nab. in contrast to the nab to rbd-a, the less potent and incompletely neutralizing antibody to the s-b epitope showed no evidence of protection at any concentration compared to the control animals ( fig. s b) . to determine the antibody serum concentrations that may be required for protection against disease from sars-cov- infection, we also measured the antibody serum concentrations just prior to intranasal virus challenge (fig. d) . the data highlight that an antibody serum concentration of approximately μg/ml of nab ( x psv neutralization ic ) enables full protection and a serum concentration of μg/ml ( x psv neutralization ic ) is adequate for % reduced disease as measured by weight loss. 
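the quantities reported in this passage (dose per body weight, percent weight change, and serum concentration expressed as a multiple of the in vitro ic50) are simple ratios, sketched below. the function names and all numerical values are hypothetical and are not the values from the study.

```python
def dose_mg_per_kg(dose_mg, body_weight_g):
    """Convert a per-animal dose to mg per kg of body weight."""
    return dose_mg / (body_weight_g / 1000.0)

def percent_weight_change(weight_start_g, weight_end_g):
    """Percent change in body weight relative to the starting weight
    (negative values mean weight loss)."""
    return 100.0 * (weight_end_g - weight_start_g) / weight_start_g

def fold_over_ic50(serum_conc_ug_per_ml, ic50_ug_per_ml):
    """Serum antibody concentration expressed as a multiple of the in vitro IC50."""
    return serum_conc_ug_per_ml / ic50_ug_per_ml

# hypothetical example values
dose = dose_mg_per_kg(dose_mg=2.0, body_weight_g=125.0)
change = percent_weight_change(weight_start_g=100.0, weight_end_g=90.0)
fold = fold_over_ic50(serum_conc_ug_per_ml=20.0, ic50_ug_per_ml=0.05)
print(dose, change, fold)
```

expressing serum concentration as a fold over the ic50 is what allows in vivo protection thresholds to be compared between antibodies of different potency.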
the effective antibody concentration required at the site of infection to protect from disease remains to be determined. sterilizing immunity at serum concentrations that represent a large multiplier of the in vitro neutralizing ic is observed for many viruses ( ) . using a high-throughput rapid system for antibody discovery, we isolated more than mabs from convalescent donors by memory b cell selection using sars-cov- s or rbd recombinant proteins. about half of the mabs isolated could be expressed and also bind effectively to either s and/or rbd proteins. only a small fraction of these abs was neutralizing, highlighting the value of deep mining of responses to access the most potent abs ( ). a range of nabs were isolated to different sites on the s protein. the most potent abs, reaching single digit ng/ml ic s in psv assays, are targeted to a site that, judged by competition studies, overlaps the ace binding site. only one of the abs, directed to rbd-b, neutralized sars-cov- psv, as may be anticipated given the differences in ace contact residues between the two viruses (fig. s ) and given that the selections were performed with sars-cov- target proteins. abs directed to the rbd but not competitive with soluble ace , (although they may be competitive in terms of an array of membrane-bound ace molecules interacting with an array of spike proteins on a virion), are generally less potent neutralizers and tend to show incomplete neutralization, plateauing at around or less than % neutralization. the one exception is the cross-reactive rbd-b antibody above. similar lower potency and incomplete neutralization are observed for abs to the s protein that are not reactive with recombinant rbd. the cause(s) of these incomplete neutralization phenomena is unclear but presumably originates in some spike protein heterogeneity, either glycan, cleavage or conformationally based. 
in any case, the rbd-a nabs that directly compete with ace are clearly the most preferred for prophylactic and therapeutic applications, and as reagents to define nab epitopes for vaccine design. we note that, even for a small sampling of naturally occurring viral variants, two were identified that showed notable resistance to individual potent nabs to the wa- strain and neutralization resistance will need to be considered in planning for clinical applications of nabs. cocktails of nabs may be required. in terms of nabs as passive reagents, the efficacy of a potent anti-rbd nab in vivo in syrian hamsters is promising in view of the positive attributes of this animal model ( ) and suggests that human studies are merited. nevertheless, as for any animal model, there are many limitations, including, in the context of antibody protection, differences in effector cells and fc receptors between humans and hamsters. the failure of the non-rbd s-protein nab to protect in the animal model is consistent with its lower potency and, likely most importantly, its inability to fully neutralize challenge virus. in the context of human studies, improved potency of protective nabs by enhancing binding affinity to the rbd epitope identified, improved half-life and reduced fc receptor binding to minimize potential antibody dependent enhancement (ade) effects, should they be identified as concerning, are all antibody engineering goals to be considered. as observed for heterologous b cell responses against different serotypes of flavivirus infection, there is a possibility, but no current experimental evidence, that subtherapeutic vaccine serum responses or subtherapeutic nab titers could potentially exacerbate future coronavirus infection disease burden by expanding the viral replication and/or cell tropism of the virus. 
if ade is found for sars-cov- and operates at sub-neutralizing concentrations of neutralizing antibodies as it can for dengue virus ( ) then it would be important, from a vaccine standpoint, to carefully define the full range of nab epitopes on the s protein as we have begun here. from a passive antibody standpoint, it would be important to maintain high nab concentrations or appropriately engineer nabs. the nabs described have remarkably little shm, typically one or two mutations in the vh gene and one or two in the vl gene. such low shm may be associated with the isolation of the nabs relatively soon after infection, and perhaps before affinity-maturation has progressed. low shm has also been described for potent nabs to ebola virus, respiratory syncytial virus (rsv), middle east respiratory syndrome coronavirus (mers-cov) and yellow fever virus ( ) ( ) ( ) ( ) and may indicate that the human naïve repertoire is often sufficiently diverse to respond effectively to many pathogens with little mutation. of course, nab efficacy and titer may increase over time as described for other viruses and it will be interesting to see if even more potent nabs to sars-cov- evolve in our donors in the future. what do our results suggest for sars-cov- vaccine design? in the first instance, the results suggest a focus on the rbd and indeed strong nab responses have been described by immunizing mice with a multivalent presentation of rbd ( ) . the strong preponderance of non-neutralizing antibodies and very few nabs to s protein that we isolated could arise for a number of reasons including: (i) the recombinant s protein that we used to select b cells is a poor representation of the native spike on virions. 
in other words, there may be many nabs to s but we failed to isolate them because of the selecting antigen; (ii) the recombinant s protein that we used is close to native but non-neutralizing antibodies bind to sites on s that do not interfere with viral entry; (iii) the s protein in natural infection disassembles readily, generating a strong ab response to "viral debris" that is non-neutralizing because the antibodies recognize protein surfaces that are not exposed on the native spike. importantly, the availability of both neutralizing and non-neutralizing antibodies generated in this study will facilitate evaluation of s protein immunogens for presentation of neutralizing and non-neutralizing epitopes and promote effective vaccine design. the design of an immunogen that improves on the quality of nabs elicited by natural infection may well emerge as an important goal of vaccine efforts ( ). in summary, we describe the very rapid generation of neutralizing antibodies to a newly emerged pathogen. the antibodies can find clinical application and will aid in vaccine design.

(e) correlation between psv sars-cov- neutralization and rbd subunit elisa binding area-under-the-curve (auc). auc was computed using simpson's rule. the % confidence interval of the regression line is shown in light grey and was estimated by performing , bootstrap re-samplings. r and p values of the regression are also indicated. cc participants from whom mabs were isolated are specifically highlighted in dark blue (cc ), pine green (cc ) and hot pink (cc ).
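the two numerical steps named in the caption, an auc by simpson's rule and a bootstrap confidence interval for the regression, can be sketched as follows. this is an illustration under stated assumptions: the titration points are taken to be equally spaced (for example in log dilution), the interval is computed for the regression slope only rather than for the whole fitted line, and the number of resamples, the 95% level, the seed, and all data values are hypothetical.

```python
import random

def simpson_auc(y, dx=1.0):
    """Composite Simpson's rule for an odd number of equally spaced samples."""
    n = len(y)
    if n < 3 or n % 2 == 0:
        raise ValueError("need an odd number (>= 3) of equally spaced points")
    return dx / 3.0 * (y[0] + y[-1] + 4.0 * sum(y[1:-1:2]) + 2.0 * sum(y[2:-1:2]))

def ols_fit(x, y):
    """Ordinary least-squares slope and intercept for paired data."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def bootstrap_slope_ci(x, y, n_boot=1000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for the slope, resampling (x, y) pairs."""
    rng = random.Random(seed)
    pairs = list(zip(x, y))
    slopes = []
    while len(slopes) < n_boot:
        sample = [rng.choice(pairs) for _ in pairs]
        xs, ys = zip(*sample)
        if len(set(xs)) < 2:
            continue  # skip degenerate resamples with a single x value
        slopes.append(ols_fit(xs, ys)[0])
    slopes.sort()
    low = slopes[int((alpha / 2.0) * n_boot)]
    high = slopes[int((1.0 - alpha / 2.0) * n_boot) - 1]
    return low, high

# hypothetical elisa signal at equally spaced dilution steps
signal = [0.0, 1.0, 4.0, 9.0, 16.0]
auc = simpson_auc(signal)

# hypothetical paired donor data: binding auc versus neutralization titer
binding_auc = [1.0, 2.0, 3.0, 4.0, 5.0]
neut_titer = [3.0, 5.0, 7.0, 9.0, 11.0]
slope, intercept = ols_fit(binding_auc, neut_titer)
ci_low, ci_high = bootstrap_slope_ci(binding_auc, neut_titer, n_boot=200)
print(auc, slope, intercept, ci_low, ci_high)
```

a confidence band for the whole line, as drawn in the figure, would be obtained the same way by evaluating each resampled fit on a grid of x values and taking percentiles at each grid point.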
biophysical properties of the clinical-stage antibody landscape
minimally mutated hiv- broadly neutralizing antibodies to guide reductionist vaccine design
severe acute respiratory syndrome coronavirus infection of golden syrian hamsters
the antiviral activity of antibodies in vitro and in vivo
simulation of the clinical and pathological manifestations of coronavirus disease (covid- ) in golden syrian hamster model: implications for disease pathogenesis and transmissibility
dengue antibody-dependent enhancement: knowns and unknowns
longitudinal analysis of the human b cell response to ebola virus infection
junctional and allele-specific residues are critical for mers-cov neutralization by an exceptionally potent germline-like antibody
infants infected with respiratory syncytial virus generate potent neutralizing antibodies that lack somatic hypermutation
longitudinal dynamics of the human b cell response to the yellow fever d vaccine
the sars-cov- receptor-binding domain elicits a potent neutralizing response without antibody-dependent enhancement
rational vaccine design in the time of covid-
pre-fusion structure of a human coronavirus spike protein
cryo-em structure of the -ncov spike in the prefusion conformation
rational design of envelope identifies broadly neutralizing human monoclonal antibodies to hiv-
recombinant hiv envelope trimer selects for quaternary-dependent antibodies targeting the trimer apex
efficient generation of monoclonal antibodies from single human b cells by single cell rt-pcr and expression vector cloning
r (ai , ai to d.n.) and k (ai to r.k.a.) awards, the iavi neutralizing antibody center, the bill and melinda gates foundation (opp to i.a.w. and d.r.b.), (opp to j.e.v.)
and (opp / inv- to ds and drb) key: cord- - e u aza authors: tian, xiaolong; li, cheng; huang, ailing; xia, shuai; lu, sicong; shi, zhengli; lu, lu; jiang, shibo; yang, zhenlin; wu, yanling; ying, tianlei title: potent binding of novel coronavirus spike protein by a sars coronavirus-specific human monoclonal antibody date: - - journal: emerg microbes infect doi: . / . . sha: doc_id: cord_uid: e u aza the newly identified novel coronavirus ( -ncov) has caused more than , laboratory-confirmed human infections, including deaths, posing a serious threat to human health. currently, however, there is no specific antiviral treatment or vaccine. considering the relatively high identity of receptor-binding domain (rbd) in -ncov and sars-cov, it is urgent to assess the cross-reactivity of anti-sars cov antibodies with -ncov spike protein, which could have important implications for rapid development of vaccines and therapeutic antibodies against -ncov. here, we report for the first time that a sars-cov-specific human monoclonal antibody, cr , could bind potently with -ncov rbd (kd of . nm). the epitope of cr does not overlap with the ace binding site within -ncov rbd. these results suggest that cr may have the potential to be developed as candidate therapeutics, alone or in combination with other neutralizing antibodies, for the prevention and treatment of -ncov infections. interestingly, some of the most potent sars-cov-specific neutralizing antibodies (e.g. m , cr ) that target the ace binding site of sars-cov failed to bind -ncov spike protein, implying that the difference in the rbd of sars-cov and -ncov has a critical impact for the cross-reactivity of neutralizing antibodies, and that it is still necessary to develop novel monoclonal antibodies that could bind specifically to -ncov rbd. very recently, a novel coronavirus which was temporarily named " novel coronavirus ( -ncov)" emerged in wuhan, china [ ] . 
as of february , -ncov has resulted in a total of , laboratory-confirmed human infections in china, including deaths, and exported cases in countries outside of china (https://www.who.int/emergencies/ diseases/novel-coronavirus- /situation-reports). currently, there is no vaccine or effective antiviral treatment against -ncov infection. based on the phylogenetic analysis (gisaid accession no. epi_isl_ ) [ ] , -ncov belongs to lineage b betacoronavirus and shares high sequence identity with that of bat or human severe acute respiratory syndrome coronavirus-related coronavirus (sarsr-cov) and bat sars-like coronavirus (sl-cov) (figure (a) ). in previous studies, a number of potent monoclonal antibodies against sars coronavirus (sars-cov) have been identified [ ] [ ] [ ] [ ] [ ] . these antibodies target the spike protein (s) of sars-cov and sl-covs, which is a type i transmembrane glycoprotein and mediates the entrance to human respiratory epithelial cells by interacting with cell surface receptor angiotensin-converting enzyme (ace ) [ ] . more specifically, the amino acid length (n -v ) receptor binding domain (rbd) within the s protein is the critical target for neutralizing antibodies [ ] . some of the antibodies recognize different epitopes on rbd; e.g. the sars-cov neutralizing antibodies cr and cr bound noncompetitively to the sars-cov rbd and neutralized the virus in a synergistic fashion [ ] . we predicted the conformation of -ncov rbd as well as its complex structures with several neutralizing antibodies, and found that the modelling results support the interactions between -ncov rbd and certain sars-cov antibodies (figure (b) ). this could be due to the relatively high identity ( %) of rbd in -ncov and sars-cov (figure (c) ). for instance, residues in rbd of sars-cov that make polar interactions with a neutralizing antibody m as indicated by the complex crystal structure [ ] are invariably conserved in -ncov rbd (figure (d) ). 
in the structure of sars-cov-rbd-m , r in rbd formed a salt bridge with d of m -vl. concordantly, the electrostatic interaction was also observed in the model of -ncov-rbd-m , formed by r (rbd) and d (m -vl). this analysis suggests that some sars-cov-specific monoclonal antibodies may be effective in neutralizing -ncov. in contrast, the interactions between antibody f g [ ] or r [ ] and the rbd in -ncov decreased significantly due to the lack of salt bridges formed by r -d in sars-cov-rbd-f g or d -r in sars-cov-rbd- r, respectively. furthermore, because most of the r-binding residues on the rbd of sars-cov are not conserved on the rbd of -ncov (figure (c)), it is unlikely that the antibody r could effectively recognize -ncov. therefore, it is urgent to experimentally determine the cross-reactivity of anti-sars-cov antibodies with -ncov spike protein, which could have important implications for rapid development of vaccines and therapeutic antibodies against -ncov. in this study, we first expressed and purified -ncov rbd protein. we also predicted the conformations of -ncov rbd and its complex with the putative receptor, human ace . comparison of the complex of ace [ ] with sars-cov rbd and the homology model of ace with -ncov rbd revealed similar binding modes (data not shown). in both complexes, the β -β loop and the β -β loop form extensive contacts with the receptor, including at least seven pairs of hydrogen bonds. notably, r on the fourth α helix in sars-cov rbd builds a salt bridge with e and a hydrogen bond with q on ace . however, the arginine (r in sars-cov rbd) to asparagine (n ) mutation in -ncov rbd abolished the strong polar interactions, which may induce a decrease in the binding affinity between rbd and the receptor. interestingly, a lysine (k in -ncov rbd) replacement of valine (v in sars-cov rbd) on β formed an extra salt bridge with d on ace , which may recover the binding ability.
these data indicate that the rbd in s protein of -ncov may bind to ace with a similar affinity as sars-cov rbd does. indeed, we measured the binding of -ncov rbd to human ace by the biolayer interferometry binding (bli) assay, and found that -ncov rbd bound potently to ace . the calculated affinity (k d ) of -ncov rbd with human ace was . nm (figure (f) ), which is comparable to that of sars-cov spike protein with human ace ( nm) [ ] . these results indicate that ace could be the potential receptor for the new coronavirus, and that the expressed -ncov rbd protein is functional [ ] . next, we expressed and purified several representative sars-cov-specific antibodies which have been reported to target rbd and possess potent neutralizing activities, including m [ ] , cr [ ] , cr [ ] , as well as a mers-cov-specific human monoclonal antibody m developed by our laboratory [ ] , and measured their binding ability to -ncov rbd by elisa (figure (e)) . surprisingly, we found that most of these antibodies did not show evident binding to -ncov rbd. to confirm this result, we further measured the binding kinetics using bli. an irrelevant anti-cd antibody was used as a negative control. similarly, the antibody m , which was predicted to bind -ncov rbd (figure (d) ), only showed slight binding at the highest measured concentration ( . µm). further studies are needed to solve the high-resolution structure of -ncov rbd and understand why it could not be recognized by these antibodies. notably, one sars-cov-specific antibody, cr , was found to bind potently with -ncov rbd as determined by elisa and bli (figure (e,f) ). it followed a fast-on (k on of . × ms − ) and slow-off (k off of . × − s − ) binding kinetics, resulting in a k d of . nm (figure (f) ). this antibody was isolated from blood of a convalescent sars patient and did not compete with the antibody cr for binding to recombinant s protein [ ] . 
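the relationship between the kinetic constants and the affinity reported above can be sketched in a few lines. this is only an illustration of the standard 1:1 binding relation k d = k off / k on; the function name and the rate constants below are hypothetical placeholders, not the values measured in this study.

```python
def dissociation_constant(k_on: float, k_off: float) -> float:
    """Return the equilibrium dissociation constant K_D in molar units.

    k_on  : association rate constant, in 1/(M*s)
    k_off : dissociation rate constant, in 1/s
    Assumes simple 1:1 binding, for which K_D = k_off / k_on.
    """
    if k_on <= 0:
        raise ValueError("k_on must be positive")
    if k_off < 0:
        raise ValueError("k_off must be non-negative")
    return k_off / k_on


# Hypothetical rate constants chosen only for illustration
# (NOT the values reported for any antibody in the text).
k_on_example = 1.0e5    # 1/(M*s), a "fast-on" association rate
k_off_example = 1.0e-3  # 1/s, a "slow-off" dissociation rate

k_d_molar = dissociation_constant(k_on_example, k_off_example)
k_d_nanomolar = k_d_molar * 1e9  # convert M to nM

print(f"K_D = {k_d_nanomolar:.1f} nM")
```

a faster association rate or a slower dissociation rate both lower k d, which is why a fast-on, slow-off antibody is described as binding potently.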
to further elucidate the binding epitopes of cr , we measured the competition of cr and human ace for the binding to -ncov rbd. the streptavidin biosensors labelled with biotinylated -ncov rbd were saturated with human ace in solution, followed by the addition of the test antibodies in the presence of ace . as shown in figure (g), the antibody cr did not show any competition with ace for the binding to -ncov rbd. these results suggest that cr , distinct from the other two sars-cov antibodies, recognizes an epitope that does not overlap with the ace binding site of -ncov rbd. the rbd of -ncov differs largely from that of sars-cov at the c-terminal residues (figure (c)). our results imply that such a difference did not result in drastic changes in the capability to engage the ace receptor, but had a critical impact on the cross-reactivity of neutralizing antibodies. some of the most potent sars-cov-specific neutralizing antibodies (e.g. m , cr ) that target the receptor binding site of sars-cov failed to bind -ncov spike protein, indicating that it is necessary to develop novel monoclonal antibodies that could bind specifically to -ncov rbd. interestingly, it was reported that the antibody cr completely neutralized both the wild-type sars-cov and the cr escape viruses at a concentration of . μg/ml, and that no escape variants could be generated with cr [ ]. furthermore, the mixture of cr and cr neutralized sars-cov in a synergistic fashion by recognizing different epitopes on rbd [ ]. these results suggest that cr has the potential to be developed as a candidate therapeutic, alone or in combination with other neutralizing antibodies, for the prevention and treatment of -ncov infections. we expect more cross-reactive antibodies against -ncov and sars-cov or other coronaviruses to be identified soon, facilitating the development of effective antiviral therapeutics and vaccines. no potential conflict of interest was reported by the author(s).
genomic characterization of the novel human-pathogenic coronavirus isolated from a patient with atypical pneumonia after visiting wuhan
a pneumonia outbreak associated with a new coronavirus of probable bat origin
potent cross-reactive neutralization of sars coronavirus isolates by human monoclonal antibodies
human monoclonal antibody as prophylaxis for sars coronavirus infection in ferrets. the lancet
human monoclonal antibody combination against sars coronavirus: synergy and coverage of escape mutants
potent neutralization of severe acute respiratory syndrome (sars) coronavirus by a human mab to s protein that blocks receptor association
an efficient method to make human monoclonal antibodies from memory b cells: potent neutralization of sars coronavirus
coronavirus spike proteins in viral entry and pathogenesis
a -amino acid fragment of the sars coronavirus s protein efficiently binds angiotensin-converting enzyme
structure of severe acute respiratory syndrome coronavirus receptor-binding domain complexed with neutralizing antibody
structural insights into immune recognition of the severe acute respiratory syndrome coronavirus s protein receptor binding domain
structural basis of neutralization by a human anti-severe acute respiratory syndrome spike protein antibody, r
structure of sars coronavirus spike receptor-binding domain complexed with receptor
unexpected receptor functional mimicry elucidates activation of coronavirus fusion
junctional and allele-specific residues are critical for mers-cov neutralization by an exceptionally potent germline-like antibody
key: cord- -amfv z y authors: nguyen-contant, phuong; embong, a. karim; kanagaiah, preshetha; chaves, francisco a.; yang, hongmei; branche, angela r.; topham, david j.; sangster, mark y. title: s protein-reactive igg and memory b cell production after human sars-cov- infection includes broad reactivity to the s subunit date: - - journal: biorxiv doi: . / . . . 
sha: doc_id: cord_uid: amfv z y the high susceptibility of humans to sars-cov- infection, the cause of covid- , reflects the novelty of the virus and limited preexisting b cell immunity. igg against the sars-cov- spike (s) protein, which carries the novel receptor binding domain (rbd), is absent or at low levels in unexposed individuals. to better understand the b cell response to sars-cov- infection, we asked whether virus-reactive memory b cells (mbcs) were present in unexposed subjects and whether mbc generation accompanied virus-specific igg production in infected subjects. we analyzed sera and pbmcs from non-sars-cov- -exposed healthy donors and covid- convalescent subjects. serum igg levels specific for sars-cov- proteins (s, including the rbd and s subunit, and nucleocapsid [n]) and non-sars-cov- proteins were related to measurements of circulating igg mbcs. anti-rbd igg was absent in unexposed subjects. most unexposed subjects had anti-s igg and a minority had anti-n igg, but igg mbcs with these specificities were not detected, perhaps reflecting low frequencies. convalescent subjects had high levels of igg against the rbd, s , and n, together with large populations of rbd- and s -reactive igg mbcs. notably, igg titers against the s protein of the human coronavirus oc in convalescent subjects were higher than in unexposed subjects and correlated strongly with anti-s titers. our findings indicate cross-reactive b cell responses against the s subunit that might enhance broad coronavirus protection. importantly, our demonstration of mbc induction by sars-cov- infection suggests that a durable form of b cell immunity is maintained even if circulating antibody levels wane. importance recent rapid worldwide spread of sars-cov- has established a pandemic of potentially serious disease in the highly susceptible human population. 
key questions are whether humans have preexisting immune memory that provides some protection against sars-cov- and whether sars-cov- infection generates lasting immune protection against reinfection. our analysis focused on pre- and post-infection igg and igg memory b cells (mbcs) reactive to sars-cov- proteins. most importantly, we demonstrate that infection generates both igg and igg mbcs against the novel receptor binding domain and the conserved s subunit of the sars-cov- spike protein. thus, even if antibody levels wane, long-lived mbcs remain to mediate rapid antibody production. our study also suggests that sars-cov- infection strengthens preexisting broad coronavirus protection through s -reactive antibody and mbc formation.
the betacoronavirus sars-cov- , the causative agent of a respiratory disease termed covid- , emerged in china in late and rapidly spread worldwide ( ). a pandemic was declared in march and global deaths from covid- now exceed , . the rapid increase in cases in many countries has challenged healthcare systems and shutdowns and quarantine measures introduced to slow virus spread have caused major disruptions to society and economies ( ). sars-cov- infection produces a wide spectrum of outcomes. a proportion of infections, likely more than %, remain asymptomatic. most clinical cases develop mild to moderate respiratory symptoms, but up to % progress to a more severe disease with extensive pneumonia ( , ). when sars-cov- emerged and began to spread, the severity of the threat was primarily attributed to the novelty of the virus to the human immune system and, consequently, a lack of preexisting immune memory to quickly clear virus and limit disease progression. four types of common cold coronavirus are endemic in humans, the alphacoronaviruses e and nl and the betacoronaviruses oc and hku . however, limited relatedness between key structural proteins of these human coronaviruses (hcovs) and those of sars-cov- suggested that significant cross-reactive immunity was unlikely ( , ). 
initial studies of non-sars-cov- -exposed individuals found negligible levels of igg against the sars-cov- spike (s) protein, the viral attachment protein that binds the receptor angiotensin converting enzyme (ace ) on host cells to initiate infection ( ). more recently, however, studies have provided evidence of sars-cov- -reactive b and t cell memory in unexposed subjects that could confer some protection against sars-cov- or modulate disease pathogenesis. sera from non-sars-cov- -exposed individuals have been screened for igg binding to the s and s subunits of the sars-cov- s protein. the membrane-distal s subunit contains the receptor binding domain (rbd) for receptor recognition, and the membrane-proximal s , which has higher homology among coronaviruses than does s ( , ), mediates membrane fusion to release viral rna into the host cell. in two large cohorts of unexposed subjects, approximately % had igg that bound s , but not s or the rbd. approximately % of subjects had igg against the sars-cov- nucleocapsid (n) protein, which is highly conserved among coronaviruses ( , ) . although n is an internal viral protein and not a target of neutralizing antibodies (abs) , coronavirus infections typically elicit strong anti-n ab production ( ). the idea that circulating hcovs elicit igg that cross-reacts with sars-cov- is supported by the finding that sars-cov- infection increases igg titers against the s proteins of multiple hcovs ( ). in t cell studies, cd + t cells in up to % of non-sars-cov- -exposed donors responded to epitopes in s and non-s proteins of sars- ) . notably, s-reactive cd + t cells in unexposed subjects were mostly reactive to the conserved s subunit, consistent with cross-reactivity to circulating hcovs ( ). sars-cov- -reactive cd + t cells were also detected in unexposed donors, but the response was less marked than for cd + t cells ( ). are also likely to be present in non-sars-cov- -exposed individuals. 
indeed, mbcs might be more important than preexisting cross-reactive abs as a source of protection against sars-cov- . igg mbcs are more broadly reactive than abs generated against the same antigen, they persist after circulating ab levels wane, and they are readily activated to generate strong ab responses or seed germinal centers for additional rounds of affinity maturation ( ). concurrent early production of virus-specific igm and igg in the response to sars-cov- infection suggests a response mediated by igg mbcs as well as naïve b cells ( , ( ) ( ) ( ) . this picture is supported by to extend our understanding of the b cell response to sars-cov- infection, the current study compared ab and mbc immunity to sars-cov- in unexposed individuals and individuals in the convalescent phase of infection. in particular, we were interested in the presence of sars- cov- -reactive mbcs in unexposed subjects that could confer some protection against sars- cov- , and formation of mbcs by sars-cov- infection to provide durable protection against igg mbcs reactive to the novel rbd and the conserved s subunit of the s protein. mbcs are thus likely to be available to mediate rapid protective ab responses if circulating ab levels wane and reinfection occurs. our study also draws attention to preexisting sars-cov- - cross-reactive b cell memory to the s subunit in sars-cov- -naïve subjects. we speculate that the strong response to s after sars-cov- infection reflects preexisting s -reactive mbc activation and strengthens broad coronavirus protection. convalescent subjects sampled - weeks after symptom onset. reactivity was measured against the s (including the rbd and s subunit) and n proteins of sars-cov- and the s proteins of the human alphacoronavirus e and betacoronavirus oc . the h influenza virus hemagglutinin and tetanus toxoid (ttd) were included as control antigens that humans are commonly exposed to through infection and vaccination. 
serum igg levels were measured by elisa. approximately one-third of non-sars-cov- -exposed subjects in the healthy donor cohort had low levels of serum igg against the s and n proteins of sars-cov- , likely reflecting cross-reactivity with seasonal hcovs ( figure a ). notably, % of unexposed subjects had igg against the highly conserved s subunit of the s protein. it is possible that inherent features of the bulky s reagent used in our analysis reduced binding by anti-s abs. igg that bound the highly novel rbd was not detected in unexposed subjects. all non-sars-cov- -exposed subjects had igg against s proteins of the hcovs e and oc , indicating previous infection, and against the control proteins h and ttd ( figures c- f). response to the s subunit. levels of igg against s, rbd, s and n were markedly higher in convalescent subjects than unexposed subjects, indicating strong induction of these abs by sars- cov- infection ( figure a) . in a small number of convalescent subjects, high anti-s igg titers were associated with low levels of anti-n igg. indeed, more than % of convalescent subjects had anti-n igg levels within the range in unexposed subjects, questioning the reliability of using anti-n igg measurement to identify previous sars-cov- infection. notably, serum igg titers against s were consistently higher than against the rbd in convalescent subjects, perhaps reflecting the novelty of the rbd and a response dependent on naive b cell activation ( figure b) . interestingly, titers of igg were higher against the s protein of the hcov oc in convalescent subjects than in unexposed subjects, but this was not the case for the s protein of hcov e (or for the control proteins h and ttd) ( figures c- f ). the cov- infection ( figure g ). the particularly strong correlation between igg titers against oc s and the sars-cov- s suggests a cross-reactive response to the s subunit. 
since the healthy donor samples in our analysis were collected - years before the emergence of sars-cov- , we considered the possibility that a recently circulating hcov could have been responsible for the higher anti-oc s igg titers in the convalescent subjects. to exclude this possibility, we measured anti-oc s igg titers in sera collected from healthcare workers in . the healthcare workers cared for hospitalized sars-cov- patients, but all were negative for igg against sars-cov- s and rbd, consistent with the effectiveness of personal protective equipment and appropriate work practices. oc s-reactive igg levels in healthcare worker sera were similar to those in non-sars-cov- -exposed healthy donor sera and significantly lower than those in sera from convalescent subjects ( figure c ). taken together, our results indicate that sars-cov- infection generates a strong igg response that cross-reacts with the s of human betacoronaviruses. reactivity to the rbd and s subunit. pbmcs from non-sars-cov- -exposed subjects and convalescent subjects were analyzed for mbcs reactive to sars-cov- proteins. circulating proportion of unexposed subjects suggested that igg mbcs with the same specificity had also been formed. however, these mbcs were not detected, possibly because of very low frequencies in the circulation. in contrast, igg mbcs reactive to the s proteins of the hcovs oc and e and the control proteins h and ttd were detected in nearly % or more of non-sars-cov- - exposed subjects, consistent with the higher levels of serum igg against these antigens ( figure e - h) . as expected, sars-cov- rbd-reactive mbcs were not detected in unexposed subjects. in marked contrast to non-sars-cov- -exposed subjects, the vast majority of convalescent subjects had circulating igg mbcs reactive to the sars-cov- s, rbd, and s , indicating strong induction by sars-cov- infection of mbcs reactive to novel and conserved regions of the s protein ( figure a) . 
notably, numbers of igg mbcs reactive to the s protein of the hcov oc were higher in convalescent subjects than in unexposed subjects ( figure e generates igg mbcs reactive to the sars-cov- s that cross-react with the s of human betacoronaviruses. interestingly, only a small proportion of the convalescent subjects generated detectable n-reactive igg mbcs, even though most subjects produced high levels of anti-n igg in serum (figures c, d) . it is unclear whether this reflects a real difference between s-and n- reactive mbc formation or an effect of the sampling time. overall, we demonstrate that sars- cov- infection induces strong s-reactive mbc formation that would be expected to provide lasting protection against reinfection and potentially broad protection against betacoronaviruses. our goals in this study were to investigate sars-cov- -reactive b cell memory in unexposed subjects that could provide some protection against sars-cov- infection, and the generation of b cell memory by sars-cov- infection that could provide lasting protection against re-infection. in particular, we were interested in igg mbcs, which respond to cognate antigens with rapid, vigorous, and high-affinity ab production. importantly, mbcs are long-lived cells that continue to provide strong protection when circulating ab levels wane. our approach was to analyze circulating igg as well as igg mbcs from the sars-cov- -naïve and sars- cov- -convalescent subject groups. 
our key findings are as follows: (i) the presence of igg reactive to the s subunit of sars-cov- in most unexposed subjects, likely reflecting cross- reactivity to hcovs, (ii) markedly increased levels of igg against the sars-cov- s and n proteins, including reactivity to the rbd and s subunit of s, in convalescent subjects, (iii) increased igg binding to the s protein of the oc hcov, but not e hcov, in convalescent subjects, reflecting greater cross-reactivity between s subunits of betacoronaviruses, (iv) strong formation of igg mbcs reactive with the rbd and s subunit of the sars-cov- s protein in convalescent subjects, and (v) formation of igg mbcs reactive with the s protein of oc , but not e, in convalescent subjects, consistent with s subunit cross-reactivity between approximately one-third of our cohort of non-sars-cov- -exposed subjects had low levels of igg against the sars-cov- s and n proteins. the anti-n igg likely reflects infection with hcovs, which have low level ( - %) homology with the sars-cov- n protein ( ). however, a protective function for anti-n abs has not been established ( ). notably, % of unexposed subjects had igg against the s subunit, reflecting homology with hcovs, but none had igg against the highly novel sars-cov- rbd ( , , ) . abs that target the s subunit have been shown to have virus neutralizing activity, raising the possibility that preexisting anti-s igg confers some protection against sars-cov- ( ). the processes that generate anti-s igg are also likely to generate s -reactive igg mbcs and these might provide more significant protection than low levels of anti-s abs. however, s -reactive mbcs (or s-reactive and n-reactive mbcs) were not detected in non-sars-cov- -exposed subjects. taken together with the identification of s-reactive mbcs in unexposed healthy donors ( ), it is likely that s -reactive mbcs were below the limit of detection in our assays. 
most mbcs are resident in lymphoid tissues, except for mbcs against frequently seen immunogenic antigens (for example, the influenza h or ttd in this study), and are at very low frequencies in circulation in steady state ( , ) . anti-rbd, -s, and -n igg levels were markedly higher in the convalescent subjects than in non-sars-cov- -exposed subjects, indicating strong induction by sars-cov- infection. perhaps notably, the majority of convalescent subjects had higher igg titers against the s than against the rbd. this is particularly surprising because of the accessibility of the rbd to b cells and the expected immunodominance over the s subunit ( , ). our demonstration of strong anti-s igg production is consistent with the activation of a preexisting population of igg mbcs against the conserved s subunit in the absence of mbcs reactive to the novel rbd. however, we cannot exclude inherent differences in the stability or antigenicity of rbd and s reagents as an explanation. in convalescent subjects, igg levels against the s protein of hcov oc (but not e) were significantly higher than in non-sars-cov- -exposed subjects and correlated strongly with anti-s igg levels. these findings support stronger b cell cross-reactivity between the s subunits of sars-cov- and human betacoronaviruses than alphacoronaviruses ( ). importantly, we demonstrate that sars-cov- infection generates rbd-reactive and s - reactive igg mbcs. recently, long et al. ( ) found that levels of sars-cov- -reactive abs, including neutralizing abs, start to decrease within - weeks of infection, especially when the infection is asymptomatic. since mbc populations are maintained for many years, perhaps decades, our findings indicate that mbcs generated by sars-cov- infection will be available to rapidly generate protective abs if waning ab levels allow re-infection to occur ( ). 
notably, three convalescent subjects in our analysis had undetectable rbd-reactive igg, but nevertheless had rbd-reactive igg mbcs. this might reflect mbc production by germinal centers that remained active after recovery from infection ( ). the proportion of subjects with mbcs reactive to the hcovs oc and e was greater for the convalescent group than the unexposed group, likely reflecting the increase in s -reactive mbcs in the convalescent group and cross-reactivity with hcovs. s -reactive mbc expansion by sars-cov- infection could enhance protection against a broad range of coronaviruses ( ). n-reactive mbc formation in convalescent subjects was less than expected given the large number of subjects with high titers of n-reactive igg, but additional sampling times are required to confirm this observation. in conclusion, our analysis investigated ab and mbc immunity to sars-cov- in unexposed subjects and individuals soon after recovery from sars-cov- infection. findings emphasized the novelty of the sars-cov- s protein rbd in unexposed subjects. however, igg reactive to the s was widespread in unexposed subjects and likely resulted from exposure to hcovs. although our approach was unable to directly identify s -reactive mbcs in the unexposed subjects, we suggest that these cells are present and strongly contribute s -reactive igg early in the response to sars-cov- infection. the igg response in sars-cov- convalescent subjects was also strong against the rbd and, less consistently, against the n protein. importantly, sars-cov- convalescent subjects had generated rbd-reactive and s -reactive igg mbcs. the may, and consisted of pcr-confirmed patients and non-pcr-confirmed subjects who were contacts of confirmed cases or displayed covid- -like symptoms. the convalescent subjects were sampled - weeks after symptom onset. 
symptoms reported (percent of subjects) were fever ( %) cough ( %), sore throat ( %), stuffy/runny nose ( %), difficulty breathing ( %), fatigue ( %), headache ( %), body aches ( %), nausea/vomiting ( %), and diarrhea/loose stool ( %). (isolate wuhan-hu- ) were expressed in-house in hek cells using pcaggs plasmid constructs kindly provided by florian krammer (icahn school of medicine at mount sinai) ( ). baculovirus-expressed s subdomain and hek cell-expressed n protein were obtained from sino biological (chesterbrook, pa) and raybiotech (peachtree corners, ga), respectively. baculovirus-expressed s proteins from seasonal hcovs oc and e were obtained from sino biological. in-house hek cell-expressed hemagglutinin from egg-derived h n mabtech stockholm, sweden) and p-nitrophenyl phosphate substrate (thermo fisher) were subsequently added to detect bound antigen-specific abs. absorbance was read at nm after color development. a weight-based concentration method was used to quantify antigen-specific ab levels in test samples as described previously ( , ) . sera from healthy donors and convalescent subjects with high titers for test antigens were used to establish human serum standards. the cutoff for assay positivity was set at approximately x the mean od value for negative wells. statistical analyses. the medians with (q , q ) were summarized by subject group and compared by the wilcoxon rank-sum test. spearman correlation analysis together with corresponding robust regression models was used to assess monotonic associations among ab responses. multiple test adjustment was not applied for this explorative study and thus a p value < . was considered significant for all analyses. statistical analyses were performed using software sas . (sas institute inc, cary, nc). cov- -exposed and covid- convalescent subjects. sera were collected from ( proteins in non-sars-cov- -exposed and covid- convalescent subjects. 
pbmcs for mbc analysis were collected from (i) healthy donors sampled from - (hd) and (ii) covid- convalescent subjects sampled - weeks after symptom onset (conv). pbmcs were stimulated in vitro to induce mbc differentiation into ab-secreting cells. antigen-specific a pneumonia outbreak associated with a new coronavirus of probable bat origin sars-cov- vaccines: status report clinical features of patients infected with novel coronavirus in wuhan clinical and immunological assessment of asymptomatic sars-cov- infections genome composition and divergence of the novel coronavirus ( -ncov) originating in china phylogenetic analysis and structural modeling of sars-cov- spike protein reveals an evolutionary distinct and proteolytically sensitive activation loop a serological assay to detect sars-cov- seroconversion in humans reimer presence of sars-cov- reactive t cells in covid- patients and healthy donors infectious diseases (except hiv/aids) pre-existing and de novo humoral immunity to sars-cov- in humans characterization of a novel coronavirus associated with severe acute respiratory syndrome antibody response of patients with severe acute respiratory syndrome (sars) targets the viral nucleocapsid virological assessment of hospitalized patients with covid- targets of t cell responses to sars-cov- coronavirus in humans with covid- disease and unexposed individuals b cell responses: cell interaction dynamics and decisions antibody responses to sars-cov- in patients with covid- kinetics of sars-cov- specific igm and igg responses in covid- patients covid- serology at population scale: sars-cov- -specific antibody responses in saliva. 
infectious diseases (except hiv/aids) deep sequencing of b cell receptor repertoires from covid- patients reveals strong convergent immune signatures broad neutralization of sars-related viruses by human monoclonal antibodies convergent antibody responses to sars-cov- in convalescent individuals contributions of the structural proteins of severe acute respiratory syndrome coronavirus to protective immunity human monoclonal antibodies against highly conserved hr and hr domains of the sars-cov spike protein are more broadly neutralizing the transcription factor t-bet resolves memory b cell subsets with distinct tissue distributions and antibody specificities in mice and humans broad dispersion and lung localization of virus- specific memory b cells induced by influenza pneumonia a sequence approach can predict candidate targets for immune responses to sars-cov- the receptor binding domain of the viral spike protein is an immunodominant and highly specific target of antibodies in sars-cov- patients cutting edge: long-term b cell memory in humans after smallpox vaccination role of memory b cells in hemagglutinin- specific antibody production following human influenza a virus infection broad hemagglutinin-specific memory b cell expansion by seasonal influenza virus infection reflects early-life imprinting and adaptation to the infecting virus assignment of weight-based antibody units to a human antipneumococcal standard reference serum, lot -s individual hd and conv subjects in order of ascending titers against s. the assigned cutoff for positivity is shown by the shaded bar. (b) proportions of serum igg against the sars-cov c) serum igg concentrations against the s protein of the hcov oc in conv, hd, and hcw subjects. (d-f) serum igg concentrations against the s protein of the hcov e (d), the influenza virus h hemagglutinin (e), and ttd (f) in conv and hd subjects. 
(g) correlation between serum igg concentrations against the s subunit of sars-cov- and the s protein of the hcov oc ; ns [not significant]) for comparisons of serum igg concentrations between subject groups was determined by the wilcoxon rank-sum test. correlations were tested by spearman correlation analysis with corresponding robust regression models. quantitation of mbc-derived ab (igg)-secreting cells (mascs) or mbc-derived polyclonal abs (mpabs) provided a measure of the abundance of specific igg mbcs. (a) igg mbcs reactive to the sars-cov- spike (s), receptor binding domain (rbd), and nucleocapsid (n) in conv subjects. mbc numbers were determined by enumeration of igg mascs by elispot assay after in vitro mbc stimulation. the assigned cutoff for positivity is shown by the shaded bar. (b) mbcs reactive to the influenza virus h hemagglutinin and ttd in conv subjects. mbc numbers were determined by enumeration of igg mascs. (c) proportions of igg mbcs reactive to the sars-cov- rbd, s , and n for individual conv subjects. (d) comparison of serum igg concentrations (upper panels) and igg mbc numbers cov- s (left-hand side) and n (right-hand side) proteins. serum igg was measured by elisa. dilution curves are shown for individual conv subjects; curves for subjects are shown in different colors to identify particular response patterns.
key: cord- -y n ykt authors: garcia-beltran, w. f.; lam, e. c.; astudillo, m. g.; yang, d.; miller, t. e.; feldman, j.; hauser, b. m.; caradonna, t. m.; clayton, k. l.; nitido, a. d.; murali, m. r.; alter, g.; charles, r. c.; dighe, a.; branda, j. a.; lennerz, j. k.; lingwood, d.; schmidt, a. g.; iafrate, a. j.; balazs, a. b. title: covid- neutralizing antibodies predict disease severity and survival date: - - journal: medrxiv : the preprint server for health sciences doi: . / . . . 
sha: doc_id: cord_uid: y n ykt covid- exhibits variable symptom severity ranging from asymptomatic to life-threatening, yet the relationship between severity and the humoral immune response is poorly understood. we examined antibody responses in covid- patients and found that severe cases resulting in intubation or death exhibited increased inflammatory markers, lymphopenia, and high anti-rbd antibody levels. while anti-rbd igg levels generally correlated with neutralization titer, quantitation of neutralization potency revealed that high potency was a predictor of survival. in addition to neutralization of wild-type sars-cov- , patient sera were also able to neutralize the recently emerged sars-cov- mutant d g, suggesting protection from reinfection by this strain. however, sars-cov- sera was unable to cross-neutralize a highly-homologous pre-emergent bat coronavirus, wiv -cov, that has not yet crossed the species barrier. these results highlight the importance of neutralizing humoral immunity on disease progression and the need to develop broadly protective interventions to prevent future coronavirus pandemics. coronavirus infectious disease of , caused by infection with severe acute respiratory syndrome coronavirus (sars-cov- ), exhibits significant variability in the severity of presentation. the impact of this variability on the development of protective immune responses and the role of antibodies in disease progression is unclear. there is currently no standard treatment regimen for either mild or severe cases of covid- , and there is limited understanding of the impact that current investigational therapies have on immune responses against sars-cov- . non-human primates (nhp) that have been exposed to sars-cov- have been found to develop potent antibody responses and are largely immune to reinfection (chandrashekar et al. , ; deng et al. , ) . 
similarly, animal models testing candidate vaccine approaches have demonstrated that protection against sars-cov- challenge is positively correlated with the development of high titers of neutralizing antibodies (mercado et al. , ; yu et al. , ) . importantly, passive transfer of convalescent sera has been shown to prevent infection in otherwise naive animals, highlighting the crucial role of antibodies in mediating protection against viral infection (hassan et al. , ; rogers et al. , ) . in contrast, the role of antibodies on the clearance of established sars-cov- infection and clinical outcomes is less clear. ordinarily, infections with viruses require cell-mediated immunity for viral clearance. antibodies mediate functions such as antibody-dependent cellular cytotoxicity (adcc) and phagocytosis (adcp) via innate immune cells such as nk cells and macrophages. yet, the need for antibodies in the clearance of sars-cov- infection has been challenged by two recent cases of patients with x-linked agammaglobulinemia who acquired and survived sars-cov- infection without requiring oxygen or intensive care (soresina et al. , ) . some studies even propose the possibility of a pathogenic role of antibodies in primary infection via antibody dependent enhancement (ade) and augmentation of inflammation (liu et al. , ) , although it is believed that this is insufficient to explain the prevalence of severe cases of sars-cov- infection (arvin et al. , ) . as such, a beneficial, neutral, or harmful role of antibodies in active coronavirus infection remains controversial. despite numerous clinical studies presently in progress, no broadly effective standard-of-care treatment has yet emerged for covid- . remdesivir, a nucleotide analog active against sars-cov- , has shown modest benefit in severe covid- cases by improving time to recovery (beigel et al. , ; . hydroxychloroquine was initially tested in patients based on in vitro studies z. chen et al. 
, ) , but subsequent meta-analyses and randomized controlled trials have demonstrated no benefit in preventing or treating covid- (boulware et al. , ; tang et al. , ; ullah et al. , ) . morbidity and mortality due to covid- are largely a consequence of adult respiratory distress syndrome (ards) caused by a combination of both hyperinflammatory and hypercoagulable states (domingo et al. , ) . among experimental treatments currently being evaluated, dexamethasone and other corticosteroids that result in immunosuppression have been shown to reduce disease severity (siemieniuk et al. , ) and improve survival (horby et al. , ) . given the involvement of immune dysregulation in the pathology of infection, the consequence of current interventions on the development of humoral immunity is not known. recent studies have demonstrated the emergence of sars-cov- variants containing amino acid substitutions in the viral spike protein targeted by antibodies, raising concerns for potential resistance to neutralization. one mutation, d g, has rapidly become the predominant transmitted variant by outcompeting wildtype infections (korber et al. , ) . while it has been suggested that this mutant results in a more fit virus, the serological consequences of this change are unclear. additionally, recent studies in bats have described a novel coronavirus (wiv -cov) with high homology to sars-cov- that uses the same ace receptor for cell entry (menachery et al. , ) . it has been postulated that this virus may present a similar pandemic risk if it were to spread from bats to humans. however, the consequence of prior sars-cov- seroconversion on neutralization of related coronaviruses like wiv -cov has not been described. in this study, we characterized humoral immune responses and clinical outcomes in sars-cov- -infected patients of varying severity who received a range of treatments, as well as , pre-pandemic individuals. 
our covid- patient cohort contained a wide range of outcomes, including non-hospitalized, hospitalized, intubated, and deceased individuals. we assessed inflammatory markers, il- levels, lymphocyte counts, and demographic variables such as age and sex. a quantitative elisa that measures igg, igm, and iga antibodies to the receptor binding domain (rbd) of sars-cov- and a high-throughput neutralization assay using lentiviral vectors pseudotyped with sars-cov- and wiv -cov were developed to assess neutralization potency and cross-neutralizing responses. remarkably, we find that anti-rbd antibody levels, neutralization titer, and neutralization potency index predicted disease severity and survival, yet lacked cross-neutralizing activity to pre-emergent wiv -cov. taken together, our results highlight the impact of an effective humoral immune response on covid- , as quantified by a neutralization potency index, and describe the a cross-sectional cohort of covid- cases confirmed by sars-cov- nasopharyngeal pcr was studied and followed for at least months. the cohort was divided into the following five groups based on disease severity, outcomes, and pre-existing health status: ( i ) non-hospitalized, which were never admitted to the hospital due to ( ii ) hospitalized, which were admitted for at least one day but were never intubated and were eventually discharged, ( iii ) intubated, which were intubated for at least one day but were subsequently extubated and discharged; ( iv ) deceased, which passed away due to and ( v ) immunosuppressed, which were includes some non-hospitalized, hospitalized, and intubated patients, but none deceased) ( supplementary table ). when compared to non-hospitalized individuals, all cases of covid- resulting in hospital admission were significantly older in age (median age versus , p < . ) and there was a significant enrichment for males in severe cases resulting in intubation and/or death ( % versus % males, p = . 
) ( figure a ), consistent with prior studies (meng et al. , ; n. chen et al. , ) . laboratory data showed that clinical severity correlated with markers of inflammation, namely, peak serum levels of c-reactive protein ( figure b ), ferritin ( figure s a ), d-dimer ( figure s b ), lactate dehydrogenase ( figure s c ), and il- ( figure c ), as well as lymphopenia ( figure d ), as has been previously shown wynants et al. , ; x. chen et al. , ; zhou et al. , ) . interestingly, covid- severity was also associated with peak serum levels of troponin-t ( figure s d ), a marker of myocardial damage and/or ischemia that may reflect cardiac injury, as has been previously described (tersalvi et al. , ) . altogether, our cohort contained a wide range of clinical presentations of sars-cov- infection with our analyses confirming previously described associations. its specificity to sars-cov- as well as its ease of production and stability (stadlbauer et al. , ) . full-length spike has more regions of homology to other coronaviruses that may cause greater false positivity, as has been shown between sars-cov, sars-cov- , mers-cov, and common cold covs (chan et al. , ; ju et al. , ) . in addition, studies have shown that rbd is the main target of coronavirus neutralizing antibodies (he et al. , ) . we determined the sensitivity and specificity of this assay by assessing anti-rbd antibody levels in a cohort of sars-cov- -infected patient serum samples collected between to days after symptom onset ( n = ) in order to maximize seropositivity for igg, igm, and iga. we also assessed , pre-pandemic serum samples composed of a large unbiased cohort ( n = , ) and selected cohorts of individuals ( n = ) with positive serology results for cytomegalovirus, varicella-zoster virus, hepatitis b virus, hepatitis c virus, hiv, syphilis, toxoplasma, and/or rheumatoid factor ( figure b ). 
anti-rbd igg, igm, and iga levels were measured for each sample by interpolation onto the standard curve, and a receiver operating characteristic (roc) analysis was used to determine optimal cut-offs that distinguished sars-cov- -infected patients from pre-pandemic controls ( figure c ). cut-offs of . u/ml for anti-rbd igg achieved % sensitivity, . u/ml for anti-rbd igm achieved % sensitivity, and . u/ml for anti-rbd iga achieved % sensitivity, with > . % specificity for all three. to assess the cross-reactivity of anti-rbd igg in sera of sars-cov- seropositive individuals, we modified our elisa to detect igg antibodies against the rbd of sars-cov and mers-cov. interestingly, no cross-reactivity was seen to sars-cov rbd despite % homology, nor to mers-cov, which has only % homology ( figure s c and s d ). additional experiments measuring igg antibodies against the rbd of two common cold coronaviruses-nl , which has % homology to sars-cov- rbd, and hku , which has . % homology ( figure s c )-showed a seroprevalence of > % ( figure s e ), as has been shown in previously published studies (gorse et al. , ) , with no correlation between igg antibody levels to nl or hku and those to sars-cov- ( figure s e ). these data show that anti-rbd igg antibodies induced during sars-cov- infection do not cross-react to recognize the rbd of other pandemic coronaviruses. in addition, anti-rbd igg antibodies to common cold coronaviruses appear not to provide detectable pre-existing reactivity to sars-cov- rbd, nor do they correlate with anti-rbd igg levels in covid- patients. overall, these data suggest that natural infection with coronavirus results in anti-rbd antibodies with limited cross-reactivity. previous studies have demonstrated the potential to pseudotype retroviral vectors with sars-cov spike proteins (moore et al. , ) . however, pseudoviruses bearing sars-cov- spike produced by these methods yield low titers (nie et al. , ) , hampering large-scale testing of neutralization. 
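the roc-based cut-off selection described earlier in this section (anti-rbd levels in infected patients versus pre-pandemic controls) can be illustrated with a minimal python sketch. the text does not state which optimality criterion was applied, so the rule used here (highest sensitivity among cut-offs that meet a specificity floor) is an assumption, the function names `roc_points` and `choose_cutoff` are hypothetical, and the antibody levels are synthetic.

```python
def roc_points(positives, negatives):
    """Return (cutoff, sensitivity, specificity) for every candidate cutoff.

    A sample is called seropositive when its level is >= cutoff.
    """
    candidates = sorted(set(positives) | set(negatives))
    points = []
    for cutoff in candidates:
        true_pos = sum(1 for x in positives if x >= cutoff)
        true_neg = sum(1 for x in negatives if x < cutoff)
        points.append((cutoff, true_pos / len(positives), true_neg / len(negatives)))
    return points


def choose_cutoff(positives, negatives, min_specificity):
    """Assumed rule: highest sensitivity among cutoffs meeting min_specificity."""
    eligible = [p for p in roc_points(positives, negatives) if p[2] >= min_specificity]
    if not eligible:
        raise ValueError("no cutoff reaches the requested specificity")
    # Prefer higher sensitivity; break ties with the higher (stricter) cutoff.
    return max(eligible, key=lambda p: (p[1], p[0]))


# Synthetic, illustrative levels in arbitrary units; not data from the study.
infected = [0.4, 2.1, 3.5, 8.0, 12.0, 30.0]
pre_pandemic = [0.1, 0.2, 0.2, 0.3, 0.5, 0.9]

cutoff, sensitivity, specificity = choose_cutoff(infected, pre_pandemic, min_specificity=1.0)
print(cutoff, sensitivity, specificity)
```

fixing the specificity floor first and then maximizing sensitivity mirrors how the cut-offs are reported above (a sensitivity per isotype at a common specificity), but other criteria, such as youden's index, would be equally plausible readings of "optimal".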
recently, a forward genetics approach identified an efficiently replicating vesicular stomatitis virus (vsv) variant encoding sars-cov- spike containing a truncated form lacking the c-terminal amino acids (case et al. , ) . interestingly, previous studies also showed a role of the cytoplasmic tail of sars-cov in altering surface expression and fusogenic potential (corver et al. , ) . to determine whether analogous truncations might improve sars-cov- pseudovirus production, we examined the cell-surface expression of truncated forms of sars-cov- spike and found that removal of amino acids from the c-terminus (Δ ) resulted in significantly greater cell-surface expression and higher titers of pseudovirus ( figure s f -h ). this truncation removed a putative er-retention signal (lontok, corse and machamer, ; mcbride, li and machamer, ; ujike et al. , ) while retaining cysteine-rich domains that are highly conserved among coronaviruses. using these spike modifications, we developed a cov pseudovirus neutralization assay compatible with high-throughput liquid handling instrumentation in -well plate format using our previously published lentiviral vector system expressing both luminescent and fluorescent marker transgenes ( figure d ) (crawford et al. , ) . to validate our assay, the potency of a neutralizing monoclonal antibody, b , and a non-neutralizing monoclonal antibody, cr , both of which target sars-cov- rbd with known ic values, was determined. this yielded ic values of~ μg/ml for b and undetectable (> μg/ml) for cr , which were in agreement with previous reports (tian et al. , ; wu et al. , ) ( figure e and s i ). in addition, we found that luciferase activity was directly proportional to the number of infected (i.e. zsgreen+) cells, providing flexibility in assay readout ( figure s j ). 
to determine the performance of our assay on human sera, we measured the neutralization potency of human sera from , pre-pandemic individuals and covid- patient samples > days after symptoms onset, with a dilution range of : to : , . the dilution titer that achieved % neutralization (nt ) was calculated for each specimen and roc analysis was performed, revealing that an nt threshold of : achieves a sensitivity of % and specificity of > . % in identifying covid- patients ( figure g -h ). overall, we found median titers of : in covid- patients, with potency ranging from < : to > : , . comparatively, titers of : for yellow fever vaccination, : for rubella vaccination, . cc-by-nd . international license it is made available under a is the author/funder, who has granted medrxiv a license to display the preprint in perpetuity. the copyright holder for this preprint this version posted october , . . and : for influenza vaccination are considered to indicate protective immunity (hannoun, megas and piercy, ; plotkin, ) . altogether, we established a highly accurate high-throughput sars-cov- pseudovirus neutralization assay that can accurately quantify the neutralization potency of humoral immune responses directed to sars-cov- spike protein. as a separate note for investigators using pseudovirus neutralization assays, we excluded pre-pandemic individuals taking antiretroviral therapy for human immunodeficiency virus infection or pre-exposure prophylaxis ( n = in the original cohort of , ) after finding that potent inhibition of pseudovirus infection occurred in a majority of these individuals ( figure s k ). we believe this was due to antiretroviral compounds in the patients' sera inhibiting transduction with our lentivirus-based vector system, thus generating an artifact. of note, non-documented antiretroviral use may explain a proportion of the false positives observed in the remaining specimens ( n = out of , ). 
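the titer calculation described above (the dilution at which a specimen reaches the target level of neutralization) can be sketched in python. the text does not specify the curve-fitting method, so this sketch assumes simple linear interpolation in log10(dilution) between the two bracketing points rather than a nonlinear dose-response fit; `percent_neutralization` and `nt50` are hypothetical helper names, and the dilution series is synthetic.

```python
import math


def percent_neutralization(rlu, virus_only_rlu, background_rlu=0.0):
    """Convert a luciferase reading into percent neutralization relative to virus-only wells."""
    signal = (rlu - background_rlu) / (virus_only_rlu - background_rlu)
    return 100.0 * (1.0 - signal)


def nt50(dilutions, neutralization, target=50.0):
    """Estimate the reciprocal dilution giving `target` percent neutralization.

    `dilutions` are reciprocal dilutions (e.g. 30 for 1:30) in increasing order;
    `neutralization` holds the matching percent-neutralization values.
    Returns None when the curve never crosses the target in the tested range.
    """
    pairs = list(zip(dilutions, neutralization))
    for (d_lo, n_lo), (d_hi, n_hi) in zip(pairs, pairs[1:]):
        if n_lo >= target > n_hi:
            # Assumed method: interpolate linearly in log10(dilution).
            frac = (n_lo - target) / (n_lo - n_hi)
            log_d = math.log10(d_lo) + frac * (math.log10(d_hi) - math.log10(d_lo))
            return 10.0 ** log_d
    return None


# Synthetic three-fold dilution series; illustrative only, not from the study.
dilutions = [10, 30, 90, 270, 810]
neutralization = [98.0, 90.0, 70.0, 30.0, 5.0]
titer = nt50(dilutions, neutralization)
print(titer)
```

returning `None` for specimens that never cross the target keeps out-of-range sera (reported above as below or above the tested dilution range) distinct from interpolated titers.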
we proceeded to analyze antibody responses in our cohort of covid- patients as well as a negative control cohort of healthy blood donors, and found that in contrast to the typical kinetics of antibody responses in viral infections (i.e. igm before class-switched igg and iga), anti-rbd igg antibodies appear almost simultaneously with anti-rbd igm antibodies after symptom onset, and only a subset of individuals generate anti-rbd iga antibodies concomitantly ( figure a -c ). interestingly, the development and quantity of anti-rbd igg antibodies appeared to be increased and sustained in the time frame analyzed (up to days), while anti-rbd igm and iga antibodies waned after ~ days. neutralization titers closely resembled anti-rbd igg levels and were similarly sustained over time ( figure d ). overall, seropositivity at > days after symptom onset was % for anti-rbd igg, % for anti-rbd igm, % for anti-rbd iga, and % overall for any antibody. neutralization was detected in % of samples > days after symptom onset. to assess the humoral immune response among the pre-defined cohorts of varying disease severity, we focused on patients for whom samples were collected between and days after symptom onset ( n = ). this time frame was chosen to prevent biases resulting from time of sampling post-infection ( figure s a ), which is known to have a significant impact on the magnitude of antibody responses. we found that severely ill patients who were intubated or passed away due to covid- had the highest anti-rbd igg and iga levels, but no significant differences were seen for igm ( figure e -g ). these individuals also had the highest neutralization titers ( figure h ). 
in contrast, individuals who were not hospitalized had the lowest anti-rbd igg and iga levels and neutralization titers. unsurprisingly, immunosuppressed individuals-none of whom passed away-had significantly blunted igg, iga, and neutralizing responses. upon analyzing anti-rbd antibody seropositivity and neutralization titer, we found igg seropositivity was an excellent predictor of neutralization with a sensitivity of % and specificity of % ( figure i ). when seropositivity for any anti-rbd antibody was present, neutralization could be predicted with a sensitivity of % and specificity of %. anti-rbd igg levels correlated the most with neutralization ( r = . ) ( figure j ). although anti-rbd igg levels correlated with neutralization by regression analysis, there was variability that appeared to segregate by our pre-defined severity cohorts ( figure j ). to better visualize this, we plotted the residual of each neutralization titer relative to the titer predicted by the regression ( figure k ). this revealed that samples from severely ill patients were biased towards lower-than-predicted neutralization titers, suggesting that they harbored higher levels of non-neutralizing anti-rbd igg antibodies that did not contribute to neutralization. consequently, we calculated a neutralization potency index (nt /igg) for each patient, and found that intubated or subsequently deceased patients had a significantly lower index ( figure l ), with all deceased patients having an index < . accordingly, when patients were classified as having neutralization potency indices that were 'high' (≥ ) or 'low' (< ), there was a significant risk of death in the days following sample collection in the 'low' index group ( % -day survival, n = ) and there were no deaths in the 'high' index group ( % -day survival, n = ) (p = . ; figure m ). 
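the residual analysis and the neutralization potency index described above can be sketched in python as follows. several details are assumptions rather than the authors' stated method: the regression is taken to be ordinary least squares on log10-transformed values, residuals are reported as observed minus predicted (so a negative value means a lower-than-predicted titer), and the `threshold` argument is a placeholder because the cut-off separating 'high' from 'low' is not legible in the text. all function names and sample values are hypothetical.

```python
import math


def potency_index(titer, igg):
    """Neutralization potency index: neutralization titer divided by anti-RBD IgG level."""
    return titer / igg


def classify(index, threshold):
    """Label an index 'high' (>= threshold) or 'low' (< threshold)."""
    return "high" if index >= threshold else "low"


def log_log_residuals(igg_levels, titers):
    """Fit log10(titer) = a + b * log10(igg) by least squares.

    Returns observed minus predicted log10 titers (sign convention assumed).
    """
    xs = [math.log10(v) for v in igg_levels]
    ys = [math.log10(v) for v in titers]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return [y - (intercept + slope * x) for x, y in zip(xs, ys)]


# Synthetic values, illustrative only; the last sample has high IgG but a modest titer.
igg_levels = [5.0, 20.0, 80.0, 320.0]
titers = [50.0, 150.0, 900.0, 400.0]

residuals = log_log_residuals(igg_levels, titers)
indices = [potency_index(t, g) for t, g in zip(titers, igg_levels)]
labels = [classify(i, threshold=2.0) for i in indices]  # threshold is a placeholder
print(residuals, indices, labels)
```

in this synthetic example the sample with the highest igg level falls below the regression line and also receives the lowest index, which is the pattern the text reports for severely ill patients.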
of note, this finding was true across our entire cohort of covid- patients (including non-hospitalized and immunosuppressed individuals) for which both anti-rbd igg. in addition, neutralization potency index did not correlate with days after symptom onset and remained predictive of survival when using a cox proportional hazards model that accounted for age, sex, hospitalization status, intubation status, and days between symptom onset and sample collection ( p = . ). these results suggest that neutralization potency index may help risk-stratify patients irrespective of where they are in their disease course. altogether, severity of sars-cov- infection significantly correlates with higher anti-rbd antibody levels, but sub-optimal neutralization potency is a significant predictor of mortality. to explore the influence of pre-existing medical conditions and covid- therapies on humoral immune responses to sars-cov- , we performed multivariate analysis of all available demographic, clinical, laboratory, and experimental data ( figure s ). with the exception of immunosuppressed individuals, who had significantly decreased antibody and neutralizing responses, our cohort was not large enough to conclusively detect the effects of particular pre-existing medical conditions on the overall humoral immune response. however, a principal component analysis (pca) that included demographic data, pre-existing medical conditions, laboratory data, treatments received, anti-rbd antibody levels and neutralization titers but not clinical outcomes demonstrated clustering of patients by the severity cohorts ( figure a ). 
principal components were mainly influenced by inflammatory markers, anti-rbd antibody levels, and neutralization titers, but a contribution from pre-existing medical conditions such as hypertension and diabetes was observed ( figure b ). to assess the effect of different treatments on the humoral immune response, we performed an analysis limited to samples collected from patients who had initiated treatment and were in the hospital for at least days ( n = ). covid- -directed treatment regimens included azithromycin (an antibiotic with anti-inflammatory properties), remdesivir, hydroxychloroquine, corticosteroids, and tocilizumab (an anti-il- receptor antibody). of note, the tocilizumab-treated cohort included individuals known to receive tocilizumab for compassionate use and patients enrolled in a blinded clinical trial with : tocilizumab-to-placebo randomization (i.e. some patients might have received placebo). azithromycin, remdesivir, and hydroxychloroquine-for which there was concern of attenuating antibody responses (de miranda santos and costa, ) -did not significantly affect anti-rbd antibody levels or neutralization titers in our cohort ( figure c ). however, we found that use of corticosteroids and tocilizumab significantly decreased anti-rbd igg concentration, and in the case of corticosteroids, neutralization titer ( figure c ). corticosteroids are a general immunosuppressant known to decrease antibody production, whereas il- signaling is important in several aspects of antibody responses (kopf et al. , ) . interestingly, tocilizumab-treated patients had a significant increase in the neutralization potency index stemming from the larger effect on anti-rbd igg as compared to neutralization ( figure c ). 
this result raises new questions regarding the role of il- signaling in the production of non-neutralizing versus neutralizing antibodies and how these might become de-coupled, although a selection bias cannot be excluded. altogether, immunomodulatory therapies, some of which have shown clinical efficacy or are actively being studied, influence humoral immune responses in sars-cov- -infected patients. given the importance of humoral immunity in preventing most viral infections, the recent emergence of a mutation in the sars-cov- spike protein (d g) has raised concerns for the potential for convalescent patients to become re-infected. studies have demonstrated that this variant may possess greater replicative fitness and an altered conformation of the spike protein that may render pre-existing immunity less effective (korber et al. , ) . to determine the impact of this variant on the neutralization potency of sera from patients previously infected with sars-cov- , we introduced the d g mutation into the sars-cov- Δ spike ( figure a ). when characterizing this new construct, we found that both surface expression and infectivity were further increased relative to that of the d sars-cov- Δ spike ( figure b and s a,c,d,f ), in line with previous studies (korber et al. , ) . we tested this new pseudovirus, normalizing for infectious units per well, against the same panel of patient samples and found an increase in neutralizing titers that was very small but statistically significant ( figure c -d ), an effect that was seen in a prior study (korber et al. , ) . this indicates that individuals that have been infected with either d wild-type or g mutant sars-cov- will have cross-neutralization to the opposite strain, both of which are circulating in boston, massachusetts (lemieux et al. , ) and were likely represented in our study cohort. 
the emergence of sars-cov, mers-cov, and now sars-cov- within the last two decades has demonstrated the ability of zoonotic coronaviruses to cross the species barrier and pose pandemic threats. this has prompted microbiologists and epidemiologists to seek out and characterize zoonotic coronaviruses that have the potential to cross into humans. recent studies in bats have identified a novel coronavirus, wuhan institute of virology coronavirus (wiv -cov), which, like sars-cov- and sars-cov, has a spike that uses ace receptor for cell entry and bears high sequence homology to both sars-cov ( %) and sars-cov- ( %). we generated wiv -cov pseudovirus using an analogous spike truncation (Δ ) ( figure a ), which resulted in high expression of wiv -cov spike on producer cells as well as infectivity and titer ( figure e and s b,c,e,f ). these results suggest that this c-terminal truncation can serve as a general approach for modifying coronavirus spike proteins for efficient pseudovirus production. interestingly, wiv -cov spike could be detected at the cell surface by the sars-cov and -cov- -specific monoclonal antibody cr ( figure s b ), a finding that, to the best of our knowledge, has not been previously described. using wiv -cov pseudovirus, we found that sera from sars-cov- -infected individuals showed a lack of cross-neutralization except for relatively low-level neutralization in a few individuals with very high sars-cov- neutralization titers ( figure f -g ). this indicates that humoral immunity raised against one coronavirus is generally insufficient to generate cross-neutralizing immunity to even highly related coronavirus strains. 
traditionally, cellular immunity is responsible for clearing an established viral infection, while humoral immune responses play a more critical role in preventing future infection. here we found that severely ill covid- patients had the highest levels of anti-rbd antibodies, which other studies have similarly described (shrock et al. , ) . to further characterize this antibody response, we measured neutralization titers and developed a neutralization potency index derived from our quantitative readouts (nt /igg) to assess the quality of anti-rbd antibodies irrespective of the quantity produced. remarkably, neutralization potency was significantly diminished in severely ill patients, and survival analysis demonstrated that an index of ≥ was predictive of % -day survival, whereas < was associated with % -day survival in our limited cohort of covid- patients. thus, this neutralization potency index may be a useful metric for physicians seeking to risk-stratify covid- patients. despite the clear correlation between covid- severity and development of humoral immunity, the cause-effect relationship between these two is unclear. one possibility is that severe disease caused by hyperinflammation and/or uncontrolled viral replication induces overproduction of antibodies that serve as a 'biomarker' of severity. this is supported by our finding that the most severely affected patients had the highest levels of inflammatory markers and cytokines, which can drive antibody production. in support of this possibility, a recent study suggests a pathogenic role of immune activation and exuberant antibody production from extrafollicular b cells in critically ill patients (woodruff et al. , ) . 
indeed, of all the covid- treatment regimens being used and tested, dampening of the immune response with corticosteroids has proven to have one of the greatest benefits in improving outcomes and survival (siemieniuk et al. , ) , and we find that corticosteroids decrease both anti-rbd igg levels and neutralization titers. however, another possibility is that high levels of antibodies with low neutralization potency worsen disease severity, possibly via ade. this is supported by our finding of decreased neutralization potency in severely ill patients, and raises concerns over the use of convalescent plasma as a treatment strategy. one exception, however, may be in immunosuppressed individuals, who generally have sub-optimal antibody levels and neutralization titers. further studies in animal models of covid- testing passive transfer of low-potency index sera may help resolve this controversy. a multitude of vaccines are presently being evaluated for sars-cov- prevention, including inactivated virus (gao et al. , ) , spike antigen (jackson et al. , ; keech et al. , ) , and rbd antigen (dai et al. , ; mulligan et al. , ) . each will likely result in humoral immunity with different ratios of neutralizing and non-neutralizing antibodies. given our results, it will be important to assess the potency index of each candidate to determine those with maximal potential. interestingly, one study showed that vaccination of mice with rbd generated potently neutralizing antibodies without antibody-dependent enhancement. this was postulated to be due to the lack of immunodominant non-neutralizing epitopes present on the remainder of the spike protein (quinlan et al. , ) . 
the diverse and atypical kinetics of antibody production, in particular the early rise of igg and in some cases iga, suggest a possible contribution from class-switched (igg+ or iga+) memory b cells early in the humoral immune response rather than solely from the naive (igm+) b cell pool, as has been recently postulated (song et al. , ) . regardless, our results support a role for anti-rbd igm and iga in contributing to sars-cov- neutralization, despite their transient nature in serum. anti-rbd igg responses and neutralization, on the other hand, were sustained in the time frame analyzed (~ days), but several studies have emerged that question the longevity of these responses, which has yet to be determined. it is tempting to speculate that severely afflicted individuals may have more enduring immunity than mild cases. the differences in humoral response induction may stem from a combination of factors, including host permissiveness to viral replication and a rapid response from innate immune effector cells and cytotoxic t cells, some of which have been postulated to arise from cross-reactive memory cells to other coronaviruses (grifoni et al. , ) .
although the mutation rate of coronaviruses is very low when compared to other viruses such as influenza or hiv, certain mutations in the spike protein of sars-cov- have emerged in the setting of the rapidly spreading pandemic. we found that one such mutation, d g, which has now spread and become dominant worldwide, does not affect the neutralizing ability of patient sera, reducing concerns for re-infection. still, prior coronavirus pandemics (e.g. sars-cov, mers-cov, and now sars-cov- ) have occurred due to zoonotic coronaviruses crossing the species barrier, indicating an ongoing threat of future pandemics even in the face of effective vaccines to current viruses.
one pre-emergent bat coronavirus, wiv -cov, is highly homologous to sars-cov and sars-cov- and can infect ace -expressing human cells (menachery et al. , ) . our data demonstrate that sera from sars-cov- infected patients exhibit very limited cross-neutralization of wiv -cov, except for rare individuals with relatively low-level neutralization of wiv -cov, suggesting that generation of broadly neutralizing antibodies is indeed possible, as has been previously described (wec et al. , ) . in summary, the development of potently neutralizing humoral immunity against sars-cov- appears to increase survival, and may protect against re-infection with other circulating strains of sars-cov- . however, it is generally unlikely to provide protection against subsequent coronavirus pandemics. as such, future efforts should focus on the development of broadly active therapies and prevention modalities that generate potently neutralizing antibodies with activity across different coronavirus strains.
to quantitatively detect igg, igm, and iga antibodies to sars-cov- receptor binding domain (rbd), we developed an indirect elisa using an anti-sars-cov and -cov- monoclonal antibody (cr ) with igg , igm, and iga isotypes (kindly provided by galit tween- detergent served as an inactivation agent to render samples non-infectious, as has been previously described for other enveloped viruses (mayo and beckwith, ) . a seven-point standard curve was created using each of the standards (i.e. cr -igg , cr -igm, cr -iga ) starting at μg/ml by performing : serial dilutions with dilution buffer. samples and standards were added to corresponding wells and incubated for h at °c, followed by washing. human antibody isotypes were detected with specific antibodies (bethyl) diluted as indicated: anti-human igg-hrp ( : , ), anti-human igm-hrp ( : , ), and anti-human iga-hrp ( : , ). these were added to each plate and incubated for min at room temperature. after washing, tmb substrate (inova) was added to each well and incubated for min (for igg), min (for igm), and min (for iga), before stopping with m h so . buffer compositions, reagent concentrations and incubation times and temperatures were optimized in separate experiments for each analyte to maximize signal-to-noise ratio. optical density (o.d.) was measured at nm with subtraction of the o.d. at nm as a reference wavelength on a spectramax abs microplate reader. anti-rbd antibody levels were calculated by interpolating onto the standard curve and correcting for sample dilution; one unit per ml (u/ml) was defined as the equivalent reactivity seen by μg/ml of cr .
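the quantification step described above (reference-wavelength subtraction, interpolation onto the standard curve, correction for sample dilution) can be sketched as follows. this is a minimal illustration, not the authors' analysis: the curve model is not stated in the text, so straight-line interpolation between bracketing standards on a log-concentration scale is an assumption, the function names are hypothetical, and the standard values in the example are made up and assumed to be already expressed in u/ml.

```python
import math


def corrected_od(od_signal, od_reference):
    """Subtract the reference-wavelength reading from the signal reading."""
    return od_signal - od_reference


def interpolate_standard_curve(od, standards):
    """Read a concentration off a standard curve.

    standards: list of (concentration, corrected_od) pairs.
    Assumption: linear interpolation of log(concentration) against o.d.
    between the two bracketing standards. Returns None when the o.d.
    falls outside the range covered by the standards.
    """
    points = sorted(standards, key=lambda p: p[1])
    for (c_lo, od_lo), (c_hi, od_hi) in zip(points, points[1:]):
        if od_lo <= od <= od_hi:
            if od_hi == od_lo:
                return c_lo
            frac = (od - od_lo) / (od_hi - od_lo)
            log_c = math.log(c_lo) + frac * (math.log(c_hi) - math.log(c_lo))
            return math.exp(log_c)
    return None


def antibody_level(od_signal, od_reference, standards, sample_dilution):
    """Interpolated concentration multiplied by the sample dilution factor."""
    conc = interpolate_standard_curve(corrected_od(od_signal, od_reference), standards)
    if conc is None:
        return None
    return conc * sample_dilution


# Hypothetical three-point excerpt of a standard curve (u/ml, corrected o.d.).
example_standards = [(1.0, 2.0), (0.1, 1.0), (0.01, 0.2)]
level = antibody_level(1.62, 0.12, example_standards, sample_dilution=100)
```

samples whose signal falls outside the standards return `None` here rather than an extrapolated value; in practice such samples would be re-run at a different dilution.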
to compare the neutralizing activity of patient sera against coronaviruses, we produced lentiviral particles, pseudotyped with different spike proteins, by transient transfection of t cells and titered the viral supernatants by flow cytometry on t-ace cells (moore et al. ). virus production was also quantified by p elisa on viral supernatants using the hiv- p ca antigen capture assay (leidos biomedical research, inc). to increase throughput and consistency, assays and readouts were performed on a fluent automated workstation (tecan) using -well plates (greiner). following an initial -fold dilution, the liquid handler performed serial three-fold dilutions (ranging from : to : , ) of each patient serum and/or purified antibody in μl followed by addition of μl of pseudovirus containing infectious units and incubation for h at room temperature. finally, , t-ace (moore et al. , ) cells in μl cell media containing μg/ml polybrene were added to each well and incubated at °c for - h. cells were lysed using a previously described assay buffer (siebring-van olst et al., ) and luciferase expression was quantified using a spectramax l luminometer (molecular devices). percent neutralization was determined by subtracting background luminescence measured in cell control wells (cells only) from sample wells and dividing by virus control wells (virus and cells only). of note, repeated serum neutralization measurements in independent assays using , , infectious units of pseudovirus per well generated similar results (data not shown), indicating that the nt is not significantly influenced by pseudovirus titers. data were analyzed using graphpad prism and nt values were calculated by taking the inverse of the % inhibitory concentration value for all samples with a neutralization value of % or higher at the highest concentration of serum or antibody.
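the readout arithmetic above can be sketched as below. this is an illustration under stated assumptions rather than the authors' pipeline: the text fits curves in graphpad prism, whereas this sketch uses log-linear interpolation between the two dilutions that bracket the threshold; it also assumes the virus control is background-subtracted and that neutralization is reported as 100 × (1 − normalized signal). the function names and the numbers in the example, including the threshold, are hypothetical.

```python
import math


def percent_neutralization(sample_rlu, cell_control_rlu, virus_control_rlu):
    """Percent neutralization from raw luminescence readings.

    Background (cells only) is subtracted from the sample and, by assumption,
    from the virus control; the ratio is the fraction of infection remaining.
    """
    remaining = (sample_rlu - cell_control_rlu) / (virus_control_rlu - cell_control_rlu)
    return 100.0 * (1.0 - remaining)


def neutralization_titer(reciprocal_dilutions, neutralization, threshold):
    """Reciprocal dilution at which neutralization crosses `threshold`.

    reciprocal_dilutions: increasing values, most concentrated serum first.
    neutralization: percent neutralization at each dilution.
    Returns None when the most concentrated well does not reach the
    threshold, mirroring the inclusion rule described in the text.
    """
    if neutralization[0] < threshold:
        return None
    for i in range(len(reciprocal_dilutions) - 1):
        upper, lower = neutralization[i], neutralization[i + 1]
        if upper >= threshold > lower:
            frac = (upper - threshold) / (upper - lower)
            d0, d1 = reciprocal_dilutions[i], reciprocal_dilutions[i + 1]
            return math.exp(math.log(d0) + frac * (math.log(d1) - math.log(d0)))
    # Still above threshold at the last dilution: report it as a lower bound.
    return reciprocal_dilutions[-1]


# Hypothetical three-point dilution series with an illustrative threshold.
titer = neutralization_titer([10, 100, 1000], [90.0, 70.0, 30.0], threshold=50.0)
```

interpolation is cruder than a fitted dose-response curve but makes the definition of the titer explicit: it is the inverse of the serum concentration at which the chosen level of inhibition is reached.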
to quantify the pseudotyped lentiviral supernatants in terms of infectious units, we plated , of either t or t-ace cells in ml in a -well plate format (corning). h later, ten-fold serial dilutions of lentiviral transfection supernatant were made in μl, which was then used to replace μl of media on the plated cells. cells were then incubated with lentivirus supernatant for h at °c and then harvested with trypsin-edta (corning), resuspended in pbs supplemented with % fbs (pbs+), and measured on a stratedigm s exi flow cytometer. samples were gated for zsgreen expression.
to compare the relative surface expression of pseudovirus spike protein, we plated , t cells per well in ml in a -well plate. h later, we transfected each well with a lentiviral helper vector coding for different spike proteins. the cells were incubated for h at °c and harvested into pbs containing % fetal bovine serum (sigma) (called pbs+). cells transfected with each vector were divided into aliquots, stained with either pbs+, cr sars-cov antibody ( μg/ml in pbs+), or b sars-cov- antibody ( μg/ml in pbs+) for minutes at room temperature. cells were then washed with ml pbs+, spun at , x g, and stained with anti-human igg af polyclonal antibody (invitrogen) at μg/ml in pbs+ for minutes at rt. cells were washed with ml of pbs+, spun at , x g, resuspended in μl of pbs+ and measured on a stratedigm s exi flow cytometer.
- hours after neutralization assay setup, each well in a serum dilution series within a -well plate was imaged using a fitc filter to detect cellular zsgreen expression. images were acquired using a x air objective on a zeiss lsm instrument. acquired images were analyzed using imagej to produce overlays.
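the titering step at the start of this section reports pseudovirus in infectious units, but the text does not spell out the arithmetic. the sketch below uses the conventional flow-cytometry titer formula (fraction of reporter-positive cells × cells at transduction × dilution factor ÷ inoculum volume), which is an assumption here and not taken from the paper. the second function follows the figure legends, which define infectivity as infectious units divided by the quantity of capsid antigen in the supernatant. all names and example values are hypothetical.

```python
def infectious_units_per_ml(fraction_positive, cells_at_transduction,
                            dilution_factor, inoculum_ml):
    """Conventional titer estimate from the fraction of zsgreen-positive cells.

    Assumption: each positive cell reflects a single transduction event,
    which only holds when the positive fraction is low.
    """
    if not 0.0 <= fraction_positive <= 1.0:
        raise ValueError("fraction_positive must be between 0 and 1")
    return fraction_positive * cells_at_transduction * dilution_factor / inoculum_ml


def infectivity(iu_per_ml, capsid_antigen_per_ml):
    """Infectious units per unit of capsid antigen in the same supernatant."""
    return iu_per_ml / capsid_antigen_per_ml


# Hypothetical well: 10% positive cells, 100,000 cells, 1:100 dilution, 0.5 ml.
titer = infectious_units_per_ml(0.10, 100_000, 100, 0.5)
```

normalizing by capsid antigen separates how well a spike construct mediates entry from how many particles were produced.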
use of patient samples for the development and validation of sars-cov- diagnostic tests was approved by partners institutional review board (protocol p ). serum samples from patients diagnosed with covid- (confirmed by at least one sars-cov- pcr-positive nasopharyngeal swab at massachusetts general hospital) were collected over the course of several weeks, resulting in a partially longitudinal, cross-sectional cohort consisting of serum samples, with a prospective follow-up period of at least months to assess clinical course and outcomes by manual chart review by at least two physicians. for each patient, the following information was obtained: age, sex, sars-cov- pcr results, date of symptom onset, hospitalization and discharge dates, intubation and extubation dates, and deceased date. date of symptom onset was defined as the earliest date that at least one of the following covid- -related symptoms was reported as developing acutely and new from baseline: fever, chills, loss of smell or taste, body aches, rhinorrhea, nasal congestion, sore throat, cough, shortness of breath. if the date of symptom onset could not be determined with confidence, this information was excluded from the analysis.
patients were assessed for the presence or absence of the following pre-existing medical conditions: lung disease (e.g. asthma, copd), heart disease (e.g. coronary artery disease, heart failure), vascular disease (e.g. peripheral vascular disease), hypertension, diabetes, obesity (bmi > ), kidney disease, autoimmune disorder, solid organ cancer, chemotherapy for solid organ cancer, hematologic cancer, chemotherapy or immunotherapy for hematologic cancer, history of organ transplant, history of hematopoietic stem cell transplant, and pre-existing use of corticosteroids or other immunosuppressive medications.
based on this information, the cohort was divided into the following groups according to severity of disease and underlying health status: (i) non-hospitalized, consisting of individuals that were never admitted to the hospital and were sent home to quarantine; (ii) hospitalized, which included individuals that were hospitalized for at least one night but were never intubated and were eventually discharged; (iii) intubated, comprising hospitalized individuals that were intubated for at least one day but survived and were eventually discharged; (iv) deceased, for which we had obtained a specimen before they eventually passed away in the hospital; and (v) immunosuppressed, which consisted of people that were on immunosuppressive medication (including high-dose corticosteroid) and/or were afflicted by a clinically significant hematologic malignancy before being diagnosed with covid- .
laboratory data throughout admission were analyzed, and the maximum documented serum levels of ferritin, c-reactive protein, d-dimer, lactate dehydrogenase, troponin-t, and il- were recorded for each patient, as well as the lowest absolute lymphocyte count documented (lymphocyte count nadir). in addition, use of the following treatments was documented: corticosteroids, hydroxychloroquine, azithromycin, atorvastatin, remdesivir, lopinavir/ritonavir, tocilizumab (part of treatment versus placebo trial, currently blinded), and anakinra. all information obtained from medical records was verified by at least two physicians.
pre-pandemic serum samples ( n = , ) were obtained from the clinical laboratories at massachusetts general hospital.
these samples comprised an unbiased cohort of individuals being tested for measles, mumps, and rubella titers ( n = ), as well as a selected subset of individuals with positive serology results for cytomegalovirus ( n = ), varicella-zoster virus ( n = ), hepatitis b virus ( n = ), hepatitis c virus ( n = ), hiv ( n = ), syphilis ( n = ), toxoplasma ( n = ), and rheumatoid factor ( n = ).
statistical and data analyses were performed using graphpad prism . proportional hazards models performed by both jmp pro and r confirmed these findings after accounting for additional variables. when using r, the cox proportional hazards model was performed using the coxph function from the survival package v . - (https://cran.rproject.org/package=survival) in r v . . (r core team ).
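the roc analyses reported for each assay choose a seropositivity cut-off that reaches a target specificity on pre-pandemic samples and then report the sensitivity on covid- samples. a minimal sketch of that selection follows; it is not the authors' code (the text uses graphpad prism), the function names are hypothetical, the values are made up, and treating a sample as positive when its value is at or above the cut-off is an assumption.

```python
def sensitivity_specificity(positives, negatives, cutoff):
    """Sensitivity and specificity when values >= cutoff are called positive."""
    true_pos = sum(1 for v in positives if v >= cutoff)
    true_neg = sum(1 for v in negatives if v < cutoff)
    return true_pos / len(positives), true_neg / len(negatives)


def lowest_cutoff_for_specificity(positives, negatives, min_specificity):
    """Smallest observed value that reaches the requested specificity.

    Specificity can only rise as the cut-off rises while sensitivity can only
    fall, so the first qualifying cut-off also has the best sensitivity among
    those that qualify. Returns None if no observed value qualifies.
    """
    for cutoff in sorted(set(positives) | set(negatives)):
        sens, spec = sensitivity_specificity(positives, negatives, cutoff)
        if spec >= min_specificity:
            return cutoff, sens, spec
    return None


# Hypothetical antibody levels (u/ml) for known negatives and known positives.
pre_pandemic = [0.1, 0.2, 0.3, 0.9]
confirmed = [0.5, 1.2, 2.0, 3.0]
result = lowest_cutoff_for_specificity(confirmed, pre_pandemic, min_specificity=1.0)
```

demanding very high specificity pushes the cut-off above the highest pre-pandemic value, which is why sensitivity in early samples can be modest even when the area under the curve is high.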
clinical laboratory-defined cut-offs of the upper limit of normal are indicated with a dotted line. for each parameter, a non-parametric anova was performed; statistical significance is indicated as follows: **** p < . , *** p < . , ** p < . , and * p < . . ( h ) pseudovirus titers of the indicated spike constructs were quantified. ( i ) lack of neutralizing ability of cr mab was confirmed in pseudovirus neutralization
a multi-variate analysis of all available data including age, sex, language, hospital course and events, pre-existing medical conditions, treatments received, clinical laboratory data, and antibody and neutralization data was performed, with pearson coefficients ( r ) ranging from - (red) to (white) to + (blue). the presence of an 'x' indicates that there were insufficient data to correlate the variables in question. the following abbreviations were used: daso, days after symptom onset; dapp, days after pcr positivity; dpp, days pcr positive (total number of days between first pcr positive result and last pcr positive result that was followed by one negative result); dhos, days hospitalized; hsct, hematopoietic stem cell transplant; crp, c-reactive protein; ldh, lactate dehydrogenase; ck, creatine kinase; anti-rbd, anti-receptor binding domain; anti-nc ab, anti-nucleocapsid antibody (as measured by the commercially available roche sars-cov- total antibody chemiluminescent assay); sc , sars-cov- .
figure s . characterization of cov spike expression vectors. ( a ) surface level expression of sars-cov- spike protein following transfection of t cells. several constructs of spike were tested: codon-optimized full-length spike from sars-cov- , a truncated version with amino acids deleted from the cytoplasmic tail (Δ ), and a truncated version that also includes a d g mutation. expression was measured via flow cytometry by staining with b antibody at a concentration of μg/ml followed by staining with an anti-human igg antibody conjugated to af at μg/ml. ( b ) surface level expression of full-length and truncated (Δ ) wiv -cov spike proteins was also measured following transfection of t cells via flow cytometry.
expression was measured via flow cytometry by staining with cr antibody at a concentration of μg/ml followed by staining with an anti-human igg antibody conjugated to af at μg/ml. ( f ) transduction with -fold serial dilutions and subsequent assessment of zsgreen expression by flow cytometry was performed to calculate pseudovirus titer (u/ml) for each construct indicated.
(a) a cross-sectional cohort of covid- patients (n = ) was divided into groups of varying clinical severity, i.e., non-hospitalized (n = ), hospitalized (n = ), intubated (n = ), deceased (n = ), and immunosuppressed (n = ) and analyzed for their age and sex. median age was years in patients who were never hospitalized (n = ; includes from immunosuppressed group) and years in all patients who were admitted to the hospital (n = ), with statistical significance of p < . by t test. fisher's exact test on the number of males who were intubated or deceased (n = males out of total; includes from immunosuppressed group who were intubated) versus not (n = males out of total) demonstrated a significant enrichment with p = . . (b-d) peak levels of c-reactive protein and il- as well as lymphocyte count nadir are presented in violin plots when data were available. in c, none of the non-hospitalized patients had serum il- levels measured (n.a., not assessed). for b and c, clinical laboratory-defined cut-offs of the upper limit of normal are indicated with a dotted line; for d, the dotted line represents the lower limit of normal. for each parameter, a non-parametric anova was performed; statistical significance is indicated with the following notations: **** p < . , *** p < . , ** p < . , and * p < . .
values from samples and calculate units/ml (u/ml), with u/ml defined as the equivalent reactivity caused by μg/ml of the corresponding cr monoclonal antibody. (b) anti-rbd igg, igm, and iga antibodies were measured in both pre-pandemic samples (n = , ) and covid- patient samples (n = ). dotted lines indicate the threshold of seropositivity that achieves > . % specificity on roc analyses. (c) roc analyses for each assay were done to assess how seropositivity predicted covid- status. area under the curve (auc) was . for igg, . for igm, and . for iga. cut-offs of . u/ml for igg achieved a sensitivity of %, . u/ml for igm achieved %, and . u/ml for iga achieved %, with > . % specificity for all three. (d) a schematic of the high-throughput sars-cov- pseudovirus neutralization assay is shown. (e) validation of the neutralization assay using a recently discovered neutralizing monoclonal antibody, b , was performed and showed an ic of μg/ml. (f) neutralization titers that achieved % neutralization (nt ) were calculated for pre-pandemic samples (n = , , individuals on antiretroviral therapy excluded) and covid- patient samples (n = ). (g) an roc analysis demonstrated an auc of . , with an nt cut-off of achieving sensitivity of % and specificity of > . %.
(a-c) anti-rbd igg, igm, and iga levels were plotted over days after symptom onset for confirmed covid- cases for which date of symptom onset was known (n = patients, n = samples total).
healthy blood donors (n = ) are included as a negative control within the gray region. the dotted lines indicate the cut-offs for anti-rbd igg, igm, and iga seropositivity. (d) titers that achieve % neutralization (nt ) were plotted over days after symptom onset for each patient sample. (e-h) patient samples were selected for collection between and days after symptom onset (earliest time point for each patient), and for each cohort of healthy blood donors, non-hospitalized, hospitalized, intubated, deceased, and immunosuppressed patients, anti-rbd igg, igm, iga, and neutralization (nt ) were plotted. non-parametric multivariate anova was performed for each (excluding healthy blood donors); statistical significance is indicated as follows: **** p < . , *** p < . , ** p < . , and * p < . . (i-j) roc and log-log regression analyses were performed on igg versus neutralization. for j, the severity cohort is indicated as follows: healthy (white), non-hospitalized (green), hospitalized (yellow), intubated (red), deceased (gray), and immunosuppressed (blue). for j, pearson correlations were performed and r and p values are indicated. (k) a residual plot for neutralization titer was generated from the log-log correlation. the gray ellipse indicates a cluster of samples from intubated (red) and deceased (gray) patients. (l) neutralization potency index (nt /igg) was calculated for all patients (at earliest time point) and plotted by cohort. a nonparametric multivariate anova was performed without correction for multiple comparisons; unadjusted p values are indicated as follows: ** p < . , * p < . . (m) survival analysis of covid- patients classified as having a high (≥ ) (n = ) or low (< ) (n = ) neutralization potency index (nt /igg) was performed using the kaplan-meier method and revealed significantly increased risk of death in low neutralization potency individuals (p = . ).
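panels l and m rest on two small computations: the neutralization potency index (neutralization titer divided by anti-rbd igg level) and a kaplan-meier survival estimate for patients grouped by that index. the sketch below illustrates both under stated assumptions. it is not the authors' analysis (which used graphpad prism, jmp pro and r), it omits the significance test, and the function names, the cut-off argument and all example values are hypothetical.

```python
def potency_index(neutralization_titer, igg_units_per_ml):
    """Neutralization titer per unit of anti-rbd igg."""
    return neutralization_titer / igg_units_per_ml


def split_by_index(indices, cutoff):
    """Positions of patients with a high (>= cutoff) and low (< cutoff) index."""
    high = [i for i, v in enumerate(indices) if v >= cutoff]
    low = [i for i, v in enumerate(indices) if v < cutoff]
    return high, low


def kaplan_meier(times, events):
    """Kaplan-Meier estimate of the survival function.

    times: follow-up time for each patient (e.g. days).
    events: True if death was observed at that time, False if censored.
    Returns (time, survival probability) at each time with at least one death.
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        leaving = 0
        while i < len(data) and data[i][0] == t:
            deaths += 1 if data[i][1] else 0
            leaving += 1
            i += 1
        if deaths:
            survival *= 1.0 - deaths / at_risk
            curve.append((t, survival))
        at_risk -= leaving
    return curve


# Hypothetical follow-up data for one group of five patients.
curve = kaplan_meier([5, 10, 10, 20, 30], [True, False, True, True, False])
```

because the index divides out the amount of antibody, two patients with the same titer can fall on opposite sides of the cut-off, which is the point of using it as a measure of antibody quality rather than quantity.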
(c) sub-analyses on covid- patients that were in the hospital for at least days to (n = ) were performed on the last collected specimen to show the effect of azithromycin (n = treated), remdesivir (n = treated), hydroxychloroquine (n = treated), corticosteroids (n = treated), and tocilizumab (n = treated as part of a trial with : randomization to placebo) on anti-rbd igg levels (upper panel), neutralization titer (middle panel), and neutralization potency index (nt /igg) (lower panel). a t test was performed for each comparison; * indicates unadjusted p < . .
sars-cov- -infected patient sera cross-neutralize both wild-type and d g mutant sars-cov- spike but not the highly homologous pre-emergent bat coronavirus wiv -cov. (a) a schematic of the sars-cov- and wiv -cov spike proteins, including full-length, truncated (Δ ), and mutant (d g) forms is shown. full-length wiv -cov spike has . % sequence homology to sars-cov- spike and has the same putative er retention signal (errs) as sars-cov- . (b) expression of full-length, Δ , and Δ d g sars-cov- spike constructs in t cells in comparison to empty vector (neg. ctrl) was measured by flow cytometry (upper panel).
infectivity of lentivirus, which was defined as the infectious units divided by the quantity of p in lentiviral supernatant, was also measured and compared to vsv-g-pseudotyped lentivirus (lower panel). (c-d) cross-neutralization of serum samples from covid- patients that were non-hospitalized (green, n = ), hospitalized (yellow, n = ), intubated (red, n = ), deceased (gray, n = ), or immunosuppressed (blue, n = ) and healthy blood donors (n = ) was measured for wild-type versus d g mutant sars-cov- Δ spike pseudovirus. for c, pearson correlations were performed and r and p values are indicated; for d, paired, non-parametric t test was performed; *** indicates p < . . (e) similar to b, expression and infectivity of full length and Δ wiv -cov spike were measured. (f-g) similar to c-d, cross-neutralization of serum samples from covid- patients was measured for wild-type sars-cov- versus wiv -cov pseudovirus. for f, pearson correlations were performed and r and p values are indicated; for g, paired, non-parametric t test was performed; **** indicates p < . .
a perspective on potential antibody-dependent enhancement of sars-cov-
remdesivir for the treatment of covid- -preliminary report
a randomized trial of hydroxychloroquine as postexposure prophylaxis for covid-
replication-competent vesicular stomatitis virus vaccine vector protects against sars-cov- -mediated pathogenesis in mice
sars-cov- infection protects against rechallenge in rhesus macaques
serological responses in patients with severe acute respiratory syndrome coronavirus infection and cross-reactivity with human coronaviruses e, oc , and nl
epidemiological and clinical characteristics of cases of novel coronavirus pneumonia in wuhan, china: a descriptive study
detectable serum sars-cov- viral load (rnaaemia) is closely correlated with drastically elevated interleukin (il- ) level in critically ill covid- patients, clinical infectious diseases: an official publication of the infectious diseases society of america
efficacy of hydroxychloroquine in patients with covid- : results of a randomized clinical trial, medrxiv
mutagenesis of the transmembrane domain of the sars coronavirus spike glycoprotein: refinement of the requirements for sars coronavirus cell entry
protocol and reagents for pseudotyping lentiviral particles with sars-cov- spike protein for neutralization assays
a universal design of betacoronavirus vaccines against covid- , mers, and sars
primary exposure to sars-cov- protects against reinfection in rhesus macaques
sars-cov- infection (covid- ), ebiomedicine
development of an inactivated vaccine candidate for sars-cov-
prevalence of antibodies to four human coronaviruses is lower in nasal secretions than in serum
targets of t cell responses to sars-cov- coronavirus in humans with covid- disease and unexposed individuals
immunogenicity and protective efficacy of influenza vaccination
a sars-cov- infection model in mice demonstrates protection by neutralizing antibodies
identification of a critical neutralization determinant of severe acute respiratory syndrome (sars)-associated coronavirus: importance for designing sars vaccines
dexamethasone in hospitalized patients with covid- -preliminary report
dynamics and significance of the antibody response to sars-cov- infection, medrxiv : the preprint server for health sciences
an mrna vaccine against sars-cov- -preliminary report
human neutralizing antibodies elicited by sars-cov- infection
phase - trial of a sars-cov- recombinant spike protein nanoparticle vaccine
interleukin influences germinal center development and antibody production via a contribution of c complement component
tracking changes in sars-cov- spike: evidence that d g increases infectivity of the covid- virus
phylogenetic analysis of sars-cov- in the boston area highlights the role of recurrent importation and superspreading events, medrxiv : the preprint server for health sciences
anti-spike igg causes severe acute lung injury by skewing macrophage responses during acute sars-cov infection
intracellular targeting signals contribute to localization of coronavirus spike proteins near the virus assembly site
inactivation of west nile virus during serologic testing and transport
the cytoplasmic tail of the severe acute respiratory syndrome coronavirus spike protein contains a novel endoplasmic reticulum retrieval signal that binds copi and promotes interaction with membrane protein
sars-like wiv -cov poised for human emergence
sex-specific clinical characteristics and prognosis of coronavirus disease- infection in wuhan, china: a retrospective study of severe patients
single-shot ad vaccine protects against sars-cov- in rhesus macaques
human monoclonal antibody combination against sars coronavirus: synergy and coverage of escape mutants
impact of hydroxychloroquine on antibody responses to the sars-cov- coronavirus, frontiers in immunology
retroviruses pseudotyped with the severe acute respiratory syndrome coronavirus spike protein efficiently infect cells expressing angiotensin-converting enzyme
phase / study of covid- rna vaccine bnt b in adults
establishment and validation of a pseudovirus neutralization assay for sars-cov- , emerging microbes & infections
correlates of protection induced by vaccination, clinical and vaccine immunology: cvi
the sars-cov- receptor-binding domain elicits a potent neutralizing response without antibody-dependent enhancement
isolation of potent sars-cov- neutralizing antibodies and protection from disease in a small animal model
sars-cov- -specific elisa development
viral epitope profiling of covid- patients reveals cross-reactivity and correlates of severity
drug treatments for covid- : living systematic review and network meta-analysis
cross-reactive serum and memory b cell responses to spike protein in sars-cov- and endemic coronavirus infection, biorxiv : the preprint server for biology
two x-linked agammaglobulinemia patients develop pneumonia as covid- manifestation but recover, pediatric allergy and immunology: official publication of the european society of pediatric allergy and immunology
sars-cov- seroconversion in humans: a detailed protocol for a serological assay, antigen production, and test setup
hydroxychloroquine in patients with mainly mild to moderate coronavirus disease : open label, randomised controlled trial
elevated troponin in patients with coronavirus disease : possible mechanisms
potent binding of novel coronavirus spike protein by a sars coronavirus-specific human monoclonal antibody, emerging microbes & infections
the contribution of the cytoplasmic retrieval signal of severe acute respiratory syndrome coronavirus to intracellular accumulation of s proteins and incorporation of s protein into virus-like particles
safety and efficacy of hydroxychloroquine in covid- : a systematic review and meta-analysis
c-reactive protein levels in the early stage of covid-
remdesivir and chloroquine effectively inhibit the recently emerged novel coronavirus ( -ncov) in vitro
broad neutralization of sars-related viruses by human monoclonal antibodies extrafollicular b cell responses correlate with neutralizing antibodies and morbidity in covid- ' a noncompeting pair of human neutralizing antibodies block covid- virus binding to its receptor ace dna vaccine protection against sars-cov- in rhesus macaques a new predictor of disease severity in patients with covid- in wuhan, china', medrxiv key: cord- -nvphu fm authors: thomson, emma c.; rosen, laura e.; shepherd, james g.; spreafico, roberto; da silva filipe, ana; wojcechowskyj, jason a.; davis, chris; piccoli, luca; pascall, david j.; dillen, josh; lytras, spyros; czudnochowski, nadine; shah, rajiv; meury, marcel; jesudason, natasha; de marco, anna; li, kathy; bassi, jessica; o’toole, aine; pinto, dora; colquhoun, rachel m.; culap, katja; jackson, ben; zatta, fabrizia; rambaut, andrew; jaconi, stefano; sreenu, vattipally b.; nix, jay; jarrett, ruth f.; beltramello, martina; nomikou, kyriaki; pizzuto, matteo; tong, lily; cameroni, elisabetta; johnson, natasha; wickenhagen, arthur; ceschi, alessandro; mair, daniel; ferrari, paolo; smollett, katherine; sallusto, federica; carmichael, stephen; garzoni, christian; nichols, jenna; galli, massimo; hughes, joseph; riva, agostino; ho, antonia; semple, malcolm g.; openshaw, peter j.m.; baillie, j. kenneth; rihn, suzannah j.; lycett, samantha j.; virgin, herbert w.; telenti, amalio; corti, davide; robertson, david l.; snell, gyorgy title: the circulating sars-cov- spike variant n k maintains fitness while evading antibody-mediated immunity date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: nvphu fm sars-cov- can mutate to evade immunity, with consequences for the efficacy of emerging vaccines and antibody therapeutics. 
herein we demonstrate that the immunodominant sars-cov- spike (s) receptor binding motif (rbm) is the most divergent region of s, and provide epidemiological, clinical, and molecular characterization of a prevalent rbm variant, n k. we demonstrate that n k s protein has enhanced binding affinity to the hace receptor, and that n k virus results in clinical outcomes and in vitro replication fitness similar to those of wild-type. we observed that the n k mutation resulted in immune escape from a panel of neutralizing monoclonal antibodies, including one in clinical trials, as well as from polyclonal sera from a sizeable fraction of persons recovered from infection. immune evasion mutations that maintain virulence and fitness such as n k can emerge within sars-cov- s, highlighting the need for ongoing molecular surveillance to guide development and usage of vaccines and therapeutics. sars-cov- , the cause of covid- , emerged in late and expanded globally, resulting in over million confirmed cases as of october . molecular epidemiology studies across the world have generated over , viral genomic sequences, which have been shared with unprecedented speed via the gisaid initiative (https://www.gisaid.org/). these data are essential for monitoring virus spread (meredith et al., ) and evolution. of particular interest is the evolution of the sars-cov- surface protein, spike (s), which is responsible for viral entry via its interaction with the human angiotensin-converting enzyme (hace ) receptor on host cells. the s protein is the target of neutralizing antibodies generated by infection or vaccination (folegatti et al., ; jackson et al., ; keech et al., ) as well as monoclonal antibody (mab) drugs currently in clinical trials (hansen et al., ; jones et al., ; pinto et al., ). a sars-cov- s variant, d g, is now dominant in most places around the globe (callaway, ). 
studies in vitro indicate that this variant may have greater infectivity while molecular epidemiology indicates that it spreads efficiently and likely maintains virulence (hu et al., ; korber et al., ; volz et al., ; . amino acid is outside the receptor binding domain (rbd) of s, the domain targeted by % of neutralizing antibody activity in serum of sars-cov- survivors (piccoli et al., ) . initial studies suggest that d g actually exhibits increased sensitivity to neutralizing antibodies, likely due to its effects on the molecular dynamics of the spike protein (hou et al., ; yurkovetskiy et al., ) . therefore, this dominant variant is unlikely to escape antibody-mediated immunity. the low numbers of novel mutations reaching high frequency in sequenced sars-cov- isolates may relate to the moderate intrinsic error rate of the replication machinery of sars-cov- (li et al., c; robson et al., ) and to this new human coronavirus requiring no significant adaption to humans (maclean et al., ) . nevertheless, the increasing number of infected individuals and the large reservoir of individuals susceptible to infection increases the likelihood that novel variants that impact vaccine and therapeutic development will emerge and spread. moreover, the full impact of immune selection, which can drive variant selection, likely has not yet had a dominant influence on the pandemic, since herd immunity has not yet been attained. as population immunity increases and vaccines are deployed at scale this might change. the potential for circulating viral variants to derail promising vaccine or antibody-based prophylactics or treatments, even in the absence of selective pressure from the drug or vaccine, is demonstrated by the failure of a phase iii clinical trial of a mab targeting the respiratory syncytial virus (simoes et al., ) , and the need for new influenza vaccines on a yearly basis. 
it is therefore critical to understand whether and how sars-cov- may evolve to evade antibody-dependent immunity. here, we examined the immunodominant sars-cov- receptor binding motif (rbm), the primary target of the neutralizing ab response within the rbd (piccoli et al., ) and found it to be less conserved than the rbd or the entire spike protein in circulating viruses. to understand the implications of this structural plasticity for immune evasion, we defined the clinical and epidemiological impact, the molecular features, and the immune response to an rbm variant, n k. this variant has arisen independently twice, in both cases forming lineages of more than sequences. as of october , it has been observed in countries and is the second most commonly observed rbd variant worldwide. we find that the n k mutation is associated with a similar clinical spectrum of disease and slightly higher viral loads in vivo compared with isolates with the wild-type n residue, and that it results in immune escape from polyclonal sera from a proportion of recovered individuals and a panel of neutralizing mabs. n k provides a sentinel example of immune escape, indicating that rbm variants must be evaluated when considering vaccines and the therapeutic or prophylactic use of mabs. long term control of the pandemic will require systematic monitoring of immune escape variants and selection of strategies that address the variants circulating in targeted populations. competing pressures influence the evolution of the spike rbm. first, the rbm mediates viral entry (shang et al., ; walls et al., ; wrapp et al., b) and therefore it must maintain sufficient affinity to engage the entry receptor hace . second, it is a major target of neutralizing antibodies (robbiani et al., ; rogers et al., ; wec et al., ) and could be a primary location for the emergence of immune escape mutations. 
we set out to understand these competing pressures by evaluating the landscape of rbm sequence divergence observed in circulating sars-cov- variants and in other viruses of the sarbecovirus lineage. we used published x-ray structures of sars-cov and sars-cov- rbd:hace complexes (lan et al., ; li et al., ) to define the rbm residues using a Å distance cutoff (figures a-c) . we evaluated ~ , sars-cov- genomic sequences deposited in gisaid as of october , and observed a high number of variants occurring in the rbm (figure a) . to understand how the divergence of the rbm compares to the divergence of the entire rbd and the whole spike protein, we divided the spike protein into three non-overlapping regions: the rbm, the rbd outside of the rbm, and the full s protein outside of the rbd. we counted individual variants occurring at least ten times, and quantified substitutions of different amino acids at the same position as separate variants. we found that the rbm is the least conserved region of s ( figure b) . to understand this result further, we evaluated a published deep mutational scanning (dms) data set of the rbd and compared it to sequences of circulating viruses. the dms data defines the effect of each possible single amino acid change on both expression of the rbd and its capacity to bind hace . for each position in the rbm, we compared the dms results for all amino acid substitutions at that position versus only substitutions that have been observed in circulating sars-cov- isolates ( figure c) . a subset of residues shows the largest loss of hace binding upon mutation (top ~ / of rbm residues in figure c ) and, as would be expected, few natural variants of these residues have been observed to be circulating to date. surprisingly, these conserved residues each contribute weakly to the rbd:hace total interaction energy (the sum of pair-wise interaction energies for all residues at the binding interface in the x-ray structure; "binding energy" in figure c ). 
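The region-level comparison described above (distinct variants per region, with different substitutions at one position counted separately, then scaled by region length) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' pipeline: the function name `variant_density`, the input layout, and the toy positions are hypothetical, and only the ten-sequence threshold is taken from the text.

```python
from collections import Counter


def variant_density(observations, regions, min_count=10):
    """Count distinct variants per region, normalized by region length.

    observations: iterable of (position, alt_amino_acid) tuples, one entry
        per sequence carrying that substitution (hypothetical input layout).
    regions: dict mapping a region name to a set of residue positions;
        the regions are assumed to be non-overlapping.
    min_count: minimum number of supporting sequences for a variant to
        be called present (the text uses ten).
    """
    counts = Counter(observations)
    density = {}
    for name, positions in regions.items():
        # Different amino acids at the same position count as separate variants.
        n_variants = sum(
            1
            for (pos, _aa), n in counts.items()
            if pos in positions and n >= min_count
        )
        density[name] = n_variants / len(positions)
    return density


# Toy data only; positions and counts are invented for illustration.
toy_regions = {
    "rbm": {3, 4},
    "rbd_outside_rbm": {1, 2, 5, 6},
    "s_outside_rbd": set(range(7, 17)),
}
toy_observations = (
    [(3, "K")] * 12 + [(3, "R")] * 10 + [(4, "N")] * 3
    + [(1, "A")] * 15 + [(9, "G")] * 11
)
print(variant_density(toy_observations, toy_regions))
```

Normalizing by region length matters because the three regions differ greatly in size, so raw counts alone would favor the largest region.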
for the majority of the rbm (bottom ~ / of rbm residues in figure c), variation in circulating virus sequences confirms the tolerance to mutation predicted by the dms data. notably, several rbm residues forming the strongest interactions with the receptor, e.g. k and e , are not highly conserved despite their predicted importance. these results suggest that the rbm has a degree of structural plasticity whereby it is able to accommodate mutations without disrupting hace binding. evolutionary analysis of sarbecoviruses provides further support for rbm plasticity (li et al., b; rambaut et al., ). the sars-cov rbm is highly divergent from the sars-cov- rbm (figure s a-b) while maintaining hace binding affinity. additionally, there are many sequence changes in the rbm across a panel of related coronaviruses from animal isolates (figure s a-b, table s ). to determine the ability of members of the sarbecovirus lineage to bind hace , we produced nine recombinant rbd proteins corresponding to seven animal isolates, sars-cov- , and sars-cov and evaluated their binding to recombinant hace (figure s c). we found that three of the rbds from animal isolates showed strong affinity for hace : gd pangolin, which has a highly similar rbm to sars-cov- , and gx pangolin and bat cov wiv , which have highly divergent rbms (figure s a-b). this further supports the conclusion that the rbm is structurally plastic, while retaining binding with hace as a receptor. given this plasticity, we next considered whether an rbm variant can lead to immune evasion while retaining virulence. the two most commonly observed circulating rbd variants as of october contain mutations in the rbm (s n and n k). we first identified the n k variant in march , circulating in scotland from lineage b. on the background of d g (da silva filipe et al., ). 
using phylogenetic analysis, we determined this variant represented a single lineage (figure a) that increased in frequency to sequences by june , (~ % of the available scottish viral genome sequences for this time period). numbers of n k and all other isolates decreased in scotland concurrent with control of the pandemic by initiation of stringent public health measures and this lineage has not been detected in scotland after june. however, the n k variant has been observed in > sequences in a second lineage in europe, first sampled in romania on may , , then norway on june , and is now circulating in countries, as well as arising independently in the u.s. (figure a-c). as of oct , , all n k variants arose from a c-to-a transversion in the third codon position, though these counts are heavily influenced by sampling frequency which varies widely between countries. as scotland has a high sampling frequency for its population size (~ . m), it is possible to calculate a growth rate (volz and frost, ) based on a comparison of the scottish lineages. we find that the growth rate is similar to what has already been shown for the d g background with no evidence for a faster rate of growth than n lineages (figure s a). in addition to its frequency and spread, the n k variant stood out from other circulating rbm variants as having a plausible mechanism for maintenance of viral fitness. the equivalent position to n k in the sars-cov rbm is also a positively-charged amino acid (r ), which forms a salt bridge with hace (li et al., ). we therefore hypothesized that the n k sars-cov- variant may form this additional salt bridge at the rbd-hace interface (rbd n k:hace e ). structural modeling indicated that this salt bridge could form without disrupting the binding interface, including the two original salt bridges (rbd k :hace d and rbd e :hace k ) (figure a-c). 
a salt bridge is the strongest type of non-covalent bond and the n k mutation could plausibly increase the number of salt bridges at the binding interface from two to three, presenting the hypothesis that the n k variant may have enhanced binding for hace . to test this hypothesis, we used surface plasmon resonance (spr) to evaluate binding of recombinant n k s or rbd protein to recombinant hace . we also evaluated the n r and k v variants, each of which is found in sars-cov at these positions. across multiple assay formats, we found that the n k and n r variants exhibited a ~ -fold enhanced binding affinity for hace as compared to the original n variant (termed herein wt) ( figure d ). the magnitude of this enhancement was paralleled by a ~ -fold loss of binding affinity for the k v variant relative to wt. lastly, we also tested the effect of the n k/r and k v mutations in combination. these double variants form the same number of salt bridges at the hace binding interface as compared to wt, but one is at rbd position rather than ; we found they had an hace affinity similar to the wt ( figure d ). these data indicate that acquisition of the n k mutation enhances binding affinity, which could have implications in vivo in the context of natural infection. also, the enhanced affinity could plausibly compensate for other mutations that would otherwise be detrimental (e.g. k v), further highlighting the plasticity of the rbm. the enhanced hace affinity of the n k variant, its geographical emergence as independent lineages as well as its prevalence among circulating viral isolates is consistent with maintained viral fitness. we set out to directly examine fitness by evaluating clinical data and outcomes of virus carrying the n k mutation versus wt n , as well as by direct in vitro viral growth and competition. we used qpcr to evaluate viral load (as measured by cycle threshold, ct) in , scottish patients whose viral isolates had been sequenced (figures a-b ). 
viral isolates were either n k/d g (n= ), n /d g (n= ) or ancestral (n /d ) (n= ). our analysis found strong evidence that the n k/d g genotype was associated with marginally lower cycle threshold (ct) than the n /d g genotype (mean ct value difference between n k/d g and n /d g: - . , % ci: - . , - . ) ( figure b ). as ct measurements were carried out in multiple sites, a sub-analysis of viral load using rna standards was carried out with available samples and showed a near-complete correlation with ct ( figure b ). d g has previously been associated with higher viral loads/lower ct values than d (korber et al., ) but we did not detect this difference in this statistical analysis due to the intercept of the model being imprecisely estimated (table s ) . clinical outcomes were also obtained for a subset of these patients (n= , ), who were scored for severity of disease based on oxygen requirement: . no respiratory support, : supplemental oxygen, : invasive or non-invasive ventilation or high flow nasal cannulae, : death (figures c and s b ). genotype counts for this analysis were n k/d g (n= ), d g (with n ) (n= ) or ancestral (n /d ) (n= ). analysis based on our ordinal scale indicated that the n k/d g viral genotype was associated with similar clinical outcomes compared to d g or ancestral genotypes (posterior mean: . , % ci: - . , . ) ( table s ). all other results from the severity analysis were qualitatively similar to a previous analysis of the d g mutation (volz et al., ) . these clinical data indicate that the n k virus is not attenuated. we next tested growth of two representative sars-cov- isolates, gla (wt n ) and gla (n k), both with the d g background (table s ) . culture was carried out for hours in vero e -ace cells either with or without tmprss expression. there was no significant difference between the growth of these strains after inoculation at multiplicities of infection (mois) of . and . . 
the n k strain replicated slightly faster early after inoculation ( figure d ). these data indicate that the n k mutation does not exhibit dominant negative effects on viral growth, and most likely supports normal replication. to further assess fitness for replication in cultured cells, we carried out a cross-competition assay using inoculation of cells at a matched moi followed by quantitation of n and n k by metagenomic ngs over time (figure e ). the n k strain demonstrated similar fitness as the wt n strain, with a possible fitness advantage for n k in cells expressing tmprss . taken together with the clinical outcomes, these results indicate that the n k mutation results in viral fitness that is similar or possibly slightly improved compared to the wild-type n . having established that virus carrying the n k mutation is fit, we sought to understand whether this mutation evades antibody-mediated immunity by evaluating recognition of the n k variant by monoclonal antibodies and by polyclonal immune serum from recovered individuals, including donors who were infected by the sars-cov- n k variant. . % of the tested sera showed a greater than -fold reduction in binding to n k rbd as compared to wt rbd (figures a-b and s ) . in some individuals the rbd response was diminished to low titers of < : by the n k mutation. thus, the response to the rbd is significantly influenced by the n k mutant within the immunodominant rbm domain (piccoli et al., ) in a significant portion of persons potentially immune to wt sars-cov- . the majority of sera demonstrating loss of binding were those that had overall lower responses to wt rbd, indicating lower ab titers. the sera from the six individuals known to have recovered from infection with sars-cov- n k virus showed no change in binding levels to wt rbd as compared to n k rbd (figures a-b and s ) . 
this may reflect a true variant-specific response or that differential binding could not be measured due to the limited number of samples analyzed. to understand our results at the level of individual antibodies, we evaluated a panel of mabs isolated from individuals recovered from sars-cov- infection early in the pandemic (likely with n wt virus) (piccoli et al., ; tortorici et al., ), as well as clinical-stage mabs regn , regn , ly-cov , and s (the parent of vir- ) (hansen et al., ; chen et al., ; pinto et al., ). . % of these mabs demonstrated a > -fold reduction of rbd binding in response to the n k mutation (figures c-d and s ). for comparison, we also evaluated the k v mutation, which eliminates one salt bridge at the rbm:hace interface, and the n k/k v double mutation. a similar percentage ( . % for k v vs . % for n k) of mabs lost > -fold binding to these variants, including several ( . %) which were not sensitive to either single mutant but were sensitive to the double mutant (figures c-d). the reduced binding of mabs to these rbd mutants was also confirmed by bio-layer interferometry (bli) analysis (figures e and s a). to define the potential biological importance of these mutations for evasion of antibody-mediated neutralization, we tested mabs against pseudoviruses expressing s variants n k, k v or n k/k v (figures f-h and s b). neutralization of pseudoviruses containing these mutations was significantly diminished for certain mabs, including some that are in clinical development. as predicted by its non-rbm epitope, s was capable of neutralizing each of these variants. sensitivity of some neutralizing mabs to mutations at these positions has also been reported in other studies (greaney et al., ; li et al., a; weisblum et al., ) but combinations of mutations have not typically been evaluated. 
overall, our results demonstrate that mutations compatible with viral fitness can result in immune evasion from both monoclonal and polyclonal antibody responses. the evolution of the sars-cov- rbm, a critical epitope for vaccine response and therapeutic mabs, will depend on the fitness of rbm variants. the findings herein describe an example of a naturally-occurring rbm variant which can evade antibody-mediated immunity while maintaining fitness. fitness of this variant, n k, was demonstrated by repeated emergence by convergent evolution, spread to multiple countries and significant representation in the sars-cov- sequence databases, the fact that the n k rbd retains a high affinity interaction with the hace receptor, efficient viral replication in cultured cells, and no disease attenuation in a large cohort of infected individuals. the fitness of n k is consistent with our findings that the rbm is the most divergent region of s. this divergence indicates an ability of sars-cov- to accommodate mutations at the rbm while retaining the functional requirement of hace binding, and is likely to be linked to immune pressure from neutralizing ab responses. there is precedent for the most immunogenic region of a viral surface protein to be the fastest mutating despite harboring the receptor binding site; for example, the immunogenic globular head domain of the influenza virus hemagglutinin surface protein, which contains the sialic acid receptor binding site, evolves faster than the stalk region (doud et al., ; kirkpatrick et al., ) . the ability to accommodate mutations in the rbm indicates a high likelihood that immune-evading sars-cov- variants compatible with fitness will continue to emerge, with implications for reinfection, vaccines, and both monoclonal and polyclonal antibody therapeutics. in our profile of immune escape from the n k variant, we observed resistance to a mab currently being evaluated in clinical trials as part of a two-mab cocktail. 
the promise of using cocktails of mabs is that they should significantly lower the likelihood of drug-induced selection of resistant viruses . however, if circulating viral strains already carry resistant mutations to one antibody in the cocktail, this could reduce the cocktail to a monotherapy. additionally, considering the high level of plasticity of the rbm demonstrated in the present study, there could be many combinations of rbm mutations compatible with viral fitness while leading to immune escape. this is supported by our result that n k can compensate for a mutation (k v) that otherwise decreases receptor binding affinity ( figure d ). this particular combination of mutations is plausibly compatible with fitness as it parallels sars-cov rbm:hace interactions (salt bridge at sars-cov rbd position r and no salt bridge at v , figure a) . notably, several mabs which were not sensitive to these mutations individually were sensitive to them in combination, including the two-mab cocktail ( figure c-h) . we propose two approaches that will be critical for minimizing the impact of mab escape mutations. one is to develop mabs with epitopes that are highly resistant to viral escape. this may include epitopes outside of the rbm and/or epitopes that are crossreactive across sars-cov and sars-cov- , indicating conserved epitopes with a low tolerance for mutation wec et al., ; wrapp et al., a) . a comparison of epitopes of rbm-targeting mabs with the most conserved regions of the rbm ( figure c ) may also identify rbm mabs with a higher barrier to escape. the second approach is to screen patients, likely at the population level, for the presence of potential resistance variants prior to drug administration. the availability of multiple different mab therapeutics in the clinic could provide the opportunity to tailor the choice of therapeutic to local circulating variants. 
in general, given that access to therapeutic monoclonal antibodies via clinical trials and emergency use authorization is expanding, and as more people develop immune responses to the wildtype virus, monitoring the evolution of sars-cov- will be increasingly critical. although sars-cov- is evolving slowly and at present should be controllable by a single vaccine (dearlove et al., ) , variation accumulating in the rbm could put this at risk, especially for individuals with a moderate ab response to vaccination or infection. while we only report on evasion of antibody-mediated immunity here, it would be surprising to us if similar changes are not observed to evade t cell immunity and innate immunity. wec, a.z., wrapp, d., herbert, a.s., maurer, d.p., haslwanter, d., sakharkar, m., jangra, r.k., dieterle, m.e., lilov, a., huang, d., et al. ( ) . broad neutralization of sars-related viruses by human monoclonal antibodies. science , - . weisblum, y., schmidt, f., zhang, f., dasilva, j., poston, d., lorenzi, j.c.c., muecksch, f., rutkowska, m., hoffmann, h.-h., michailidis, e., et al. ( ) . escape from neutralizing antibodies by sars-cov- spike protein variants. ( ) . a new coronavirus associated with human respiratory disease in china. nature , - . yurkovetskiy, l., wang, x., pascal, k.e., tomkins-tinch, c., nyalile, t.p., wang, y., baum, a., diehl, w.e., dauphin, a., carbone, c., et al. ( ) . structural and functional analysis of the d g sars-cov- spike protein variant. cell. zhang, l., jackson, c.b., mou, h., ojha, a., rangarajan, e.s., izard, t., farzan, m., and choe, h. ( ) . the d g mutation in the sars-cov- spike protein reduces s shedding and increases infectivity. https://wwwbiorxivorg/content/ / v . 
samples from sars-cov- infected individuals were obtained from the ticino healthcare workers cohort (switzerland), described previously (piccoli et al., ) , and under study protocols approved by the local institutional review board (canton ticino ethics committee, switzerland). all donors provided written informed consent for the use of blood and blood components (such as pbmcs, sera or plasma). in the ticino region of switzerland and during the time period of collection (february-march ) no n k sars-cov- isolates were reported. samples from six n k variant infected individuals were obtained from the isaric c consortium (https://isaric c.net/). ethical approval was given by the south central-oxford c research ethics committee in england (reference /sc/ ), and by the scotland a research ethics committee (reference /ss/ ). the study was registered at https://www.isrctn.com/isrctn . residual nucleic acid extracts derived from the nose-throat swabs of sars-cov- positive individuals whose diagnostic samples were submitted to the west of scotland specialist virology centre between rd march and th june were sequenced as part of the cog-uk consortium under study protocols approved by the relevant national biorepositories ( /ws/ nhs and /s / ) (consortiumcontact@cogconsortium.uk, ) . rbm residues were determined based on the rbd:ace complex crystal structures ajf for sars-cov (li et al., ) and m j for sars-cov- (lan et al., ) . the ajf structure was obtained from the pdb-redo server (pdb-redo.eu) and was subsequently prepared in the molecular modeling software moe (v . , https://www.chemcomp.com) using the structure preparation, protonation and energy minimization steps with default settings. rbd residues within . a distance of any ace atoms (determined using moe) were determined for each of the two copies of the complex in the asymmetric unit, and then were combined to obtain the rbm. 
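The contact-based definition above (any RBD residue with an atom within a distance cutoff of any receptor atom, taken over both copies of the complex and then combined) can be illustrated with a short sketch. This is not the MOE workflow used by the authors: the function names, the input layout, and the toy coordinates are hypothetical, and the cutoff is left as a parameter because its value is not legible in the text.

```python
from math import dist


def contact_residues(rbd_atoms, receptor_atoms, cutoff):
    """Return RBD residue numbers with any atom within `cutoff` of the receptor.

    rbd_atoms: list of (residue_number, (x, y, z)) tuples.
    receptor_atoms: list of (x, y, z) tuples.
    cutoff: distance threshold, in the same units as the coordinates.
    """
    return {
        res
        for res, xyz in rbd_atoms
        if any(dist(xyz, r) <= cutoff for r in receptor_atoms)
    }


def interface_from_copies(copies, cutoff):
    """Union of contact residues over several copies of the complex."""
    residues = set()
    for rbd_atoms, receptor_atoms in copies:
        residues |= contact_residues(rbd_atoms, receptor_atoms, cutoff)
    return residues


# Toy coordinates only (two copies of a made-up complex).
copy_a = (
    [(10, (0.0, 0.0, 0.0)), (11, (5.0, 0.0, 0.0)), (12, (20.0, 0.0, 0.0))],
    [(0.0, 0.0, 3.0), (30.0, 0.0, 0.0)],
)
copy_b = (
    [(11, (5.0, 0.0, 0.0)), (12, (20.0, 0.0, 0.0))],
    [(5.0, 0.0, 2.0)],
)
print(sorted(interface_from_copies([copy_a, copy_b], cutoff=4.0)))
```

Taking the union over copies means a residue is included if it makes contact in either copy, which matches the "combined" wording above; an intersection would be the stricter alternative.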
m j was obtained from the coronavirus structural task force server (https://github.com/thornlab/coronavirus_structural_task_force) and was further refined (using refmac v . . ), manually fitted (using coot v . ) and prepared (using moe, as described above) in multiple iterative cycles. the final structure was analyzed for rbd-ace contact residues with a . a cutoff to obtain the rbm (using moe). the final list of rbm residues (figure c ) was arrived at by combining the sars-cov and sars-cov- results. using moe, the pairwise binding energy between each residue in sars-cov- rbd and each residue in ace , and the total binding energy for all interactions, was determined at cutoff distances . a, . a, . a, . a, . a, . a, . a, . a and . a. the percentage of the total binding energy for each interacting rbd residue was calculated for each distance cutoff and was then averaged over all cutoffs. the resulting values are shown in green in figure c . differential accumulation of amino acid variants in the rbm, rbd or spike protein was computed taking into account only the presence or absence of a variant at any residue. each variant called present counts one. a variant is called present if there are at least x number of supporting sequences deposited in gisaid, where x varies from to . the number of variants is then normalized to the size of the domain (number of residues). dms data was retrieved from . variant-level dms scores were aggregated to residue-level by taking the minimum (most disruptive variant) or the average score across all variants of a residue, except for the reference residue and the stop codon. alternatively, minimum and average scores are computed only across variants that have been observed as naturally occurring. 
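the residue-level aggregation of dms scores described above can be sketched as follows. this is an illustrative sketch, not the authors' code: the function name, the dictionary layout, the `*` symbol for the stop codon and the toy scores are all assumptions. it drops the reference residue and the stop codon, optionally restricts to naturally observed variants, and returns the minimum (most disruptive) and average score.

```python
from statistics import mean

STOP = "*"  # assumed symbol for the stop codon in the score table

def aggregate_dms(scores, reference, observed=None):
    """Collapse variant-level DMS scores for one residue to min and mean.

    scores: dict mapping mutant amino acid -> DMS score (hypothetical layout).
    reference: the reference amino acid at this residue (excluded).
    observed: optional set of amino acids seen in natural sequences; when
        given, only those variants are aggregated.
    Returns None when no variant survives the filters.
    """
    kept = [
        score
        for aa, score in scores.items()
        if aa != reference
        and aa != STOP
        and (observed is None or aa in observed)
    ]
    if not kept:
        return None
    return {"min": min(kept), "mean": mean(kept)}

# Toy scores for a single residue; values are made up for illustration.
toy_scores = {"N": 0.0, "K": -0.1, "Y": -2.0, "*": -5.0}
print(aggregate_dms(toy_scores, reference="N"))
print(aggregate_dms(toy_scores, reference="N", observed={"K"}))
```

returning `None` when no natural variant exists mirrors the grey cells used in the heatmap for residues without observed variants.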
data were represented as a heatmap annotated with: frequency of non-reference amino acids in deposited gisaid sequences (n ≈ , , at least sequences were required to call a variant as present), in log scale; number of countries in which a variant was observed; and percentage of total binding energy computed from an x-ray crystal structure (cf. structural analysis methods section). prefusion-stabilized sars-cov- spike protein variants (residues - , containing the p and furin cleavage site mutations, with a mu-phosphatase signal sequence and a c-terminal avi- xhis-epea-tag) in a pd -v vector (atum bio) were expressed in expi f cells at °c and % co according to the manufacturer's instructions (thermo fisher scientific). cell culture supernatant was collected after four days and purified over a ml c-tag affinity matrix (thermo fisher scientific). elution fractions were concentrated and injected on a superose increase / gl column with x pbs ph . as running buffer. sars-cov- rbd variants (residues - with a c-terminal thrombin-cleavage site-twinstrep- xhis-tag, and n-terminal signal sequence) were expressed in expi f cells at °c and % co in a humidified incubator. transfection was performed using expifectamine reagent (thermo fisher scientific). cell culture supernatant was collected three days after transfection and supplemented with x pbs to a final concentration of . x pbs ( . mm nacl, . mm kcl and . mm phosphates), or . x for rbd n r. sars-cov- rbds were purified using or ml histalon superflow cartridges (takara bio) and subsequently buffer exchanged into cytiva x hbs-n buffer or pbs. rbds from other sarbecoviruses were expressed in expi f cells at °c and % co . cells were transfected using pei max. cell culture supernatant was collected seven days after transfection. proteins were purified using a ml strep-tactin xt superflow high capacity cartridge followed by buffer exchange to pbs using hiprep / desalting columns.
for s binding measurements, recombinant ace (residues - from uniprot q byf with a c-terminal thrombin cleavage site-twinstrep- xhis-ggg-tag, and n-terminal signal sequence) was expressed in expi cells at °c and % co in a humidified incubator. transfection was performed using expifectamine reagent (thermo fisher scientific). cell culture supernatant was collected seven days after transfection, supplemented with buffer to a final concentration of mm tris-hcl ph . , mm nacl, and then incubated with biolock solution for one hour. after filtration through a . µm filter, ace was purified using a ml streptrap hp column (cytiva) followed by isolation of the monomeric ace by size exclusion chromatography using a superdex increase / gl column pre-equilibrated in pbs (gibco - ). for binding measurements with surface-captured rbd, recombinant ace (residues - from uniprot q byf with a c-terminal avitag- xhis-ggg-tag, and n-terminal signal sequence) was expressed in hek .sus using standard methods (atum bio). protein was purified via ni sepharose resin followed by isolation of the monomeric ace by size exclusion chromatography using a superdex increase / gl column pre-equilibrated with pbs. for binding measurements with surface-captured ace , recombinant ace (residues - with a c-terminal gs-igg a-mm-fc tag, and n-terminal signal sequence) was expressed in a stably transfected cho-k gs knock-down cell line (atum bio). protein was purified via protein a and buffer exchanged into pbs. spr binding measurements were performed using a biacore t instrument. s protein was surface captured via anti-avitag pab covalently immobilized on a cm chip, rbd protein was surface captured via streptactin xt covalently immobilized on a cm chip, and ace -mfc was surface captured via covalent immobilization of the cytiva mouse antibody capture kit on a c chip. running buffer was cytiva hbs-ep+ (ph . ) and all measurements were performed at °c.
all experiments were performed as single-cycle kinetics, with a -fold dilution series of monomeric ace starting from nm, each concentration injected for sec, or a -fold dilution series of rbd starting from nm, each concentration injected for sec. all data were double reference-subtracted and fit to a binding model using biacore evaluation software. for one representative replicate, capture levels were normalized to wt for visualization. binding data with ace as analyte were fit to a : binding model. binding data with rbd as analyte were fit to a heterogeneous ligand binding model, due to an artifactual kinetic phase with very slow dissociation that arises when rbd is an analyte; of the two kds returned by the fit, the lower-affinity one is reported as the kd of the rbd-ace interaction (the two kds are separated by at least two orders of magnitude for all fits). the measured kd for ace binding to s is likely influenced by conformational dynamics of the rbds in the context of the prefusion s trimer. reported kds are an average of - replicates measured on at least two separate days, with error given as sem. a national sequencing collaboration formed at the start of the epidemic in the uk, the cog-uk consortium (consortiumcontact@cogconsortium.uk, ), has facilitated the tracking of sars-cov- sequences across scotland since the start of the outbreak in february ( , sequences by oct , ) and real-time monitoring of genetic changes in the spike gene that might be associated with changes in virulence or transmissibility. sequencing was carried out using an amplicon-based protocol in real-time at a rate of up to genomes per week. % of samples were selected as surveillance samples, representing scottish health boards proportionately based on population size, while % were selected to allow intervention in local issues such as nosocomial infection in hospitals and nursing homes.
a gradual increase in the prevalence of the n k polymorphism was noted during april . the polymorphism was particularly common in the greater glasgow & clyde nhs health board region but also spread to adjacent scottish health boards. sequencing libraries were prepared according to the artic ncov- protocol described in detail at https://artic.network/ncov- . briefly, pcr amplicons were generated using the ncov- primalseq sequencing primers using - cycles of amplification. generated amplicons were used to prepare either oxford nanopore or illumina sequencing libraries. oxford nanopore libraries were prepared as described in the link above and sequenced in a flow cell r . . (oxford nanopore technologies, part number flo-min d), using minknow version . . . raw fast files were basecalled using guppy version . . in high accuracy mode with a minimum quality score of . reads were size filtered, demultiplexed and trimmed with porechop (https://github.com/rrwick/porechop), and mapped against reference strain wuhan-hu- (mn ). variants were called using nanopolish . . and accepted if they had a log-likelihood score of greater than and minimum read coverage of . for illumina sequencing, amplicons were used to prepare libraries using the kapa hyperprep kit (kapa biosystems, part number kk ) and further processed as described in the competition assay sequencing method. sequencing was carried out on illumina's miseq system (illumina, part number sy- - ) using a miseq reagent v cycle kit (illumina, part number ms- - ). reads were trimmed with trim_galore (http://www.bioinformatics.babraham.ac.uk/projects/trim_galore/) and mapped with bwa (li and durbin, ) to the wuhan-hu- (mn ) reference sequence, followed by primer trimming and consensus calling with ivar (grubaugh et al., ) and a minimum read coverage of . uk sequences were obtained from the cog-uk consortium, https://www.cogconsortium.uk.
global sequences were obtained from the gisaid initiative, https://www.gisaid.org, on oct . the sequences were mapped using minimap and padded against the wuhan/wh / reference. the sequences were downsampled with weights that normalise sequence count per epiweek, maximise the number of countries and lineages represented, and enrich for sequences with the n k mutation. a maximum-likelihood phylogenetic tree was constructed using iq-tree with the following parameters: -czb -blmin . -m hky --runs and all other parameters set to default. the tree was visualised with custom python code using the baltic library, https://github.com/evogytis/baltic. for the phylodynamic analysis, scottish "introduction" lineages were identified (lycett et al., , in prep and see http://sars .cvr.gla.ac.uk/risefallscotcovid), and the skygrowth package in r was used to estimate the effective population size over time, and the growth rate of the lineage within scotland (volz and frost, ). clinical samples submitted to the west of scotland specialist virology centre for sars-cov- diagnostic rt-pcr testing were selected for sequencing as part of the covid- genomics uk consortium (cog-uk) project, resulting in whole genome sequences originating from the nhs greater glasgow and clyde health board region. sequences were linked to electronic patient records, and basic metadata including sample date, age, sex, admission to hospital and mortality at days post diagnosis were extracted. the electronic patient records of a subset of patients underwent full casenote review and clinical severity was recorded based on a -level ordinal scale: . no requirement for respiratory support, . treatment with supplemental oxygen via facemask or low-flow nasal cannulae, . intubation and ventilation, non-invasive ventilation or oxygen delivery by high flow nasal cannulae devices, . death within the days following diagnosis.
we modified the who ordinal scale to these points as described previously (volz et al., ) to avoid using hospitalisation as a criterion of severity because ) many patients in nursing homes had severe infection but were not admitted to hospital, and ) early in the outbreak, all cases were hospitalised irrespective of the severity of their infection. these data had previously been analysed to test for an effect of the d g mutation on the severity of disease (volz et al., ) ; we extend that analysis here using the same methodology to test for an effect of the n k mutation. additionally, we perform a new analysis using a model with the same structure to test for an effect of both the d g mutation and the d g/n k mutation combination on the viral load of infected patients, as measured by cycle threshold value. in both cases we cannot estimate the marginal effect of the n k mutation, as we only have the mutation on the g genetic background, so the individual effect of n k cannot be separated from any potential epistatic interactions between the mutations. briefly, the structure of the model used previously (volz et al., ) and in the present study is a phylogenetic generalised additive model with mutation being the primary predictor of interest. the model controls for biological sex, age and the number of days since the first reported case in the dataset, with the latter two being included as penalised splines with a maximum of knots. if the patient was part of a cluster of cases, this was included as a random effect, with individuals not part of clusters being assigned their own levels. correlations driven by the rest of the genome are controlled for by a phylogenetic random effect using a correlation matrix generated under a brownian motion assumption from a phylogeny estimated in iq-tree v. . . (minh et al., ) using a hky + Γ model, masking the positions recommended by de maio et al. 
as of / / (https://virological.org/t/issues-with-sars-cov- -sequencing-data/ / ), rooted on the first sequenced sars-cov- genome (wu et al., ). the priors for the severity model were those used in the previous analysis of this data. the priors for the model of the viral load were a student-t (mean = , scale = , degrees of freedom = ) prior on the model intercept, a gaussian (mean = , standard deviation = ) prior over the fixed effects, and an exponential (lambda = . ) prior over the random effect, penalised spline and residual standard deviations. there are two key structural differences between the model used previously (volz et al., ) and the model used here. firstly, mutation is a three-level rather than two-level factor (d /n , d g/n and d g/n k) with the ancestral d /n being the reference level. secondly, as we are now interested in two mutations, we estimated the phylogeny used to control for the effect of the rest of the genome excluding both the nucleotide position underlying the d g mutation and the nucleotide position underlying the n k mutation (in addition to the sites from de maio et al. mentioned above). the severity model used a cumulative error structure while the model on the ct values used a gaussian error structure. in both cases, the models were estimated in brms v. . . (bürkner, ). the presented models had no divergent transitions, rhat values less than . , and appropriate bulk and tail effective sample sizes for all parameters. shortest probability intervals were calculated using the r package spin v. . (liu et al., ). analysis code is available at https://github.com/dpascall/sars-cov- -mutationanalysis. all samples were tested in duplicate using the -ncov_n rt-qpcr assay (https://www.fda.gov/media/ /download). ready-mixed primers and probe were obtained from idt (leuven, belgium). pcr was carried out using neb luna universal probe one-step reaction mix and enzyme mix (new england biolabs, herts, uk), primers and probe at nm and .
nm, respectively, and µl of rna sample in a final volume of µl. no template negative controls were included after every seventh sample. six ten-fold dilutions of sars-cov- rna standards were tested in duplicate in each assay; standards were calibrated using a plasmid containing the n sequence that had been quantified using droplet digital pcr. thermal cycling was performed on an applied biosystems™ fast pcr instrument running sds software v . (thermofisher scientific) under the following conditions: °c for minutes and °c for minute followed by cycles of °c for s and °c for minute. assays were repeated if the reaction efficiency was < % or the r value of the standard curve was ≤ . . where possible, testing of samples was repeated if the %cv of the duplicates was < %. veroe -ace cells (veroe cells induced to overexpress ace ) either with or without tmprss overexpression (rhin et al., under review) were seeded in a well plate and inoculated with an moi of . with either the gla (n /d g) or gla (n k/d g) virus isolates for hr before washing the cells three times in pbs and replacing with % dmem. ul of media was removed at each timepoint, rna was extracted, and the presence of sars-cov- determined using -ncov-n assays (idt) with an neb luna universal probe one-step rt-qpcr kit. a standard curve was used to determine the copy number present per ml of cell culture media. ul of the fresh media was also tested for the presence of virus, which was undetectable in all wells. three t flasks were seeded with veroe -ace or veroe -ace -tmprss and inoculated with either single viruses or both gla and gla virus strains at an moi of . for hr. the flasks were washed three times with pbs, with ul of the final wash being retained to determine the presence of free virus, before adding ml of fresh % dmem. at , , and hrs, ul of media was removed, which was replaced with ul fresh media. 
ul was used for rna extraction and ngs analysis of the frequencies of the specific positions within the spike protein. the single virus inoculations showed no alterations in the frequency of the amino acid positions, and the final wash showed no free virus in the supernatant. we used an unbiased metagenomic ngs sequencing pipeline to quantify variation across the whole viral genome on the illumina nextseq platform. briefly, extracted nucleic acid was incubated with dnasei (thermo fisher, part number am ) followed by cdna synthesis using superscript iii (thermo scientific, part number ) and nebnext ultra ii non-directional rna second strand synthesis module (new england biolabs, part number e l). samples were further processed using the kapa ltp library preparation kit for illumina platforms (kapa biosystems, part number kk ) and indexed with the nebnext multiplex oligos for illumina unique dual index primer pairs (new england biolabs, part number e s). libraries were sequenced on illumina's nextseq system (illumina, part number sy- - ), generating million pairs of reads per sample. human mabs were isolated from plasma cells or memory b cells of sars-cov or sars-cov- immune donors, as previously described (corti et al., ; pinto et al., ; tortorici et al., ). ly-cov mab was obtained from eli lilly and company. regn and regn mabs were produced recombinantly based on published sequences (hansen et al., ). a total of human monoclonal antibodies or human sera were tested for binding to rbd wt and mutants. spectraplate- plates with high protein binding treatment (custom made from perkin elmer) were coated overnight at °c with . µg/ml (for mabs) or µg/ml (for sera) sars-cov- rbd wt, n k, k v or n k/k v in phosphate-buffered saline (pbs), ph . . plates were subsequently blocked with blocker casein % supplemented with . % tween (sigma-aldrich) for h at room temperature.
the coated plates were incubated with serial dilutions of the monoclonal antibodies or of the sera for h at room temperature. the plates were then washed with pbs containing . % tween- (pbs-t), and alkaline phosphatase-goat anti-human igg (southern biotech) was added and incubated for h at room temperature. after washing steps with pbs-t, p-nitrophenyl phosphate (pnpp, sigma-aldrich) substrate was added and incubated for min at room temperature. the absorbance at nm was measured by a microplate reader (biotek). fitting was performed using a -parameter logistic ( pl) model, yielding dose-response curves from which the area under the curve (auc) between and ng/ml was computed. the auc captures, in a single metric, shifts of interest in two parameters of the pl model: ec and upper asymptote. bli binding measurement was performed on a selection of human monoclonal antibodies tested by elisa. antibodies were diluted to . µg/ml in kinetic buffer (pbs supplemented with . % bsa) and immobilized on protein a biosensors of an octet red system (fortébio). antibody-coated biosensors were incubated for min with a solution containing µg/ml of sars-cov- rbd wt, n k, k v or n k/k v in kinetic buffer. a dissociation step was then performed by incubating the biosensors for min in kinetic buffer. change in molecules bound to the biosensors caused a shift in the interference pattern that was recorded in real time and plotted using graphpad prism software. replication-defective vsv pseudoviruses (takada et al., ) expressing sars-cov- spike protein were generated as previously described (riblett et al., ) with some modifications. plasmids encoding sars-cov- spike variants were generated by site-directed mutagenesis of the wild-type plasmid, pcdna . (+)-spike-d (giroglou et al., ).
lenti-x™ t cells (takara, ) were seeded in -cm dishes at a density of e cells/cm and the following day transfected with µg of spike expression plasmid with transit-lenti (mirus, ) according to the manufacturer's instructions. one day post-transfection, cells were infected with vsv-luc (vsv-g) (kerafast, eh -pm) for h, rinsed three times with pbs, then incubated for an additional h in complete media at °c. the cell supernatant was clarified by centrifugation, filtered ( . µm), aliquoted, and frozen at - °c. vero e cells (atcc crl- ) were seeded into clear bottom white well plates (costar, ) at a density of e cells per well. the next day, mabs were serially diluted in pre-warmed complete media, mixed at a : ratio with pseudovirus and incubated for h at °c in round bottom polypropylene plates. media from cells was aspirated and µl of virus-mab complexes were added to cells and then incubated for h at °c. an additional µl of prewarmed complete media was then added on top of complexes and cells incubated for an additional - h. conditions were tested in duplicate wells on each plate and at least six wells per plate contained uninfected, untreated cells (mock) and infected, untreated cells ('no mab control'). virus-mab-containing media was then aspirated from cells and ml of a : dilution of bio-glo (promega, g ) in pbs was added to cells. plates were incubated for mins at room temperature and then were analyzed on the envision plate reader (perkinelmer). relative light units (rlus) for infected wells were subtracted by the average of rlu values for the mock wells (background subtraction) and then normalized to the average of background subtracted "no mab control" rlu values within each plate. percent neutralization was calculated by subtracting from the normalized mab infection condition. data were analyzed and visualized with prism (version . . ). ic curves were calculated from the interpolated value from the log(inhibitor) vs. 
response -variable slope (four parameters) nonlinear regression with an upper constraint of < . each neutralization infection was conducted on three independent days.

dms score is the binding or expression fold change over wt on a log scale. aggregated dms data is shown for each residue by taking the minimum (most disruptive variant) or the average score across all possible variants of a residue, except for the reference residue and the stop codon ('mutagenesis' columns). alternatively, minimum and average scores are computed only across variants that have naturally occurred ('observed variants' columns). when no natural variants have been observed, cells are grey. the heatmap is annotated with frequency of non-reference amino acids in deposited sequences (at least sequences were required to call a variant), in log scale; number of countries in which a variant was observed; and percentage of total binding energy between rbd and hace computed from an x-ray crystal structure. data were sorted on the leftmost dms column.

(h) correlation of elisa-binding fold change and neutralization fold change for each variant relative to wt (where a smaller elisa auc and therefore a smaller ratio represents loss of binding, and a larger ic and therefore a larger ratio represents loss of neutralization).

table s . details of the sarbecovirus sequences used for figure s . the top sequences shaded in gray were used for the similarity plot and all sequences were used for the entropy plot.

parameter estimates on the link scale from the model estimating the impact of the n k mutation on the ct value of patients infected with sars-cov- in scotland. credible intervals represent the % shortest posterior density intervals. the difference between d g/n and d g/n k was estimated by direct subtraction of the hamiltonian monte carlo samples of the d g/n k estimate from the d g/n estimate.
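the draw-wise subtraction described above, together with a shortest interval on the result, can be sketched as below. this is a hedged illustration only: the function and variable names are hypothetical, the toy draws are made up, and the interval routine is the plain empirical shortest interval rather than the more refined estimator implemented in the spin package used in the study. the probability mass is left as a parameter.

```python
import math

def posterior_difference(samples_single, samples_double):
    """Draw-by-draw difference: single-mutant estimate minus double-mutant estimate.

    Both inputs must come from the same posterior draws, in the same order.
    """
    if len(samples_single) != len(samples_double):
        raise ValueError("sample vectors must have the same length")
    return [a - b for a, b in zip(samples_single, samples_double)]

def shortest_interval(samples, mass):
    """Narrowest window of sorted draws that contains a fraction `mass` of them."""
    ordered = sorted(samples)
    n = len(ordered)
    k = max(1, math.ceil(mass * n))  # number of draws the window must hold
    # Slide a window of k consecutive sorted draws and keep the narrowest one.
    start = min(range(n - k + 1), key=lambda i: ordered[i + k - 1] - ordered[i])
    return ordered[start], ordered[start + k - 1]

# Toy draws standing in for posterior samples of the two coefficients.
single = [1.0, 2.0, 3.0, 4.0]
double = [0.5, 0.5, 1.0, 3.0]
diff = posterior_difference(single, double)
print(diff, shortest_interval(diff, 0.5))
```

subtracting draw by draw, rather than subtracting two summary estimates, preserves the posterior correlation between the two coefficients in the resulting interval.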
ct value did not appear strongly correlated with biological sex or age after controlling for the other factors. patients infected with related viral genomes had correlated ct values at testing potentially implying that there are other undescribed mutations in the genome that are affecting the viral load. parameter estimates on the link scale from the model estimating the impact of the n k mutation on the severity of infection of patients infected with sars-cov- in scotland. credible intervals represent % the shortest posterior density intervals. thresholds correspond to the positions of the boundaries between the different severity classes. amino acid change gene mutation gla c t nsp p l c t s d g a g e v a a t t c gla c t nsp p l c t nsp v a t c s n k c a s d g a g orf v f g t table s nucleotide differences between gla and gla . snps determined by cov-glue on consensus sequences relative to wuhan-hu- (nc_ . ). antibody cocktail to sars-cov- spike protein prevents rapid mutational escape seen with individual antibodies evolutionary origins of the sars-cov- sarbecovirus lineage responsible for the covid- pandemic advanced bayesian multilevel modeling with the r package brms an integrated national scale sars-cov- genomic surveillance network sars-cov- neutralizing antibody ly-cov in outpatients with covid- a neutralizing antibody selected from plasma cells that binds to group and group influenza a hemagglutinins genomic epidemiology of sars-cov- spread in scotland highlights the role of european travel in covid- emergence a sars-cov- vaccine candidate would likely match all currently circulating variants how single mutations affect viral escape from broad and narrow antibodies to h influenza hemagglutinin safety and immunogenicity of the chadox ncov- vaccine against sars-cov- : a preliminary report of a phase / , single-blind, randomised controlled trial retroviral vectors pseudotyped with severe acute respiratory syndrome coronavirus s protein complete mapping of 
mutations to the sars-cov- spike receptor-binding domain that escape antibody recognition an amplicon-based sequencing framework for accurately measuring intrahost virus diversity using primalseq and ivar sars-cov- d g variant exhibits enhanced replication ex vivo and earlier transmission in vivo d g mutation of sars-cov- spike protein enhances viral infectivity an mrna vaccine against sars-cov- -preliminary report neutralizing antibodies against sars-cov- and other human coronaviruses ly-cov , a rapidly isolated potent neutralizing antibody, provides protection in a non-human primate model of sars-cov- infection phase - trial of a sars-cov- recombinant spike protein nanoparticle vaccine the influenza virus hemagglutinin head evolves faster than the stalk domain tracking changes in sars-cov- spike: evidence that d g increases infectivity of the covid- virus structure of the sars-cov- spike receptor-binding domain bound to the ace receptor structure of sars coronavirus spike receptorbinding domain complexed with receptor fast and accurate short read alignment with burrows-wheeler transform the impact of mutations in sars-cov- spike on viral infectivity and antigenicity emergence of sars-cov- through recombination and strong purifying selection transmission dynamics and evolutionary history of -ncov simulation-efficient shortest probability intervals natural selection in the evolution of sars-cov- in bats, not humans rapid implementation of sars-cov- sequencing to investigate cases of health-care associated covid- : a prospective genomic surveillance study iq-tree : new models and efficient methods for phylogenetic inference in the genomic era refmac for the refinement of macromolecular crystal structures mapping neutralizing and immunodominant sites on the sars-cov- spike receptor-binding domain by structure-guided high-resolution serology cross-neutralization of sars-cov- by a human monoclonal sars-cov antibody a dynamic nomenclature proposal for sars-cov- lineages 
to assist genomic epidemiology a haploid genetic screen identifies heparan sulfate proteoglycans supporting rift valley fever virus infection convergent antibody responses to sars-cov- in convalescent individuals coronavirus rna proofreading: molecular basis and therapeutic targeting isolation of potent sars-cov- neutralizing antibodies and protection from disease in a small animal model structural basis of receptor recognition by sars-cov- suptavumab for the prevention of medically attended respiratory syncytial virus infection in preterm infants deep mutational scanning of sars-cov- receptor binding domain reveals constraints on folding and ace binding a system for functional analysis of ebola virus glycoprotein ultrapotent human antibodies protect against sars-cov- challenge via multiple mechanisms scalable relaxed clock phylogenetic dating evaluating the effects of sars-cov- spike mutation d g on transmissibility structure, function, and antigenicity of the sars-cov- spike glycoprotein top -pairwise similarity to sars-cov- (sliding window size of amino acids) for seven related sarbecoviruses (see figure key) across the rbd region of the spike protein. bottom -site-specific entropy plot across the rbd protein alignment of sars-cov- and related viruses (data s ). entropy for each position l (h(l)) was calculated using shannon's entropy formula with a natural log as sites constituting the rbm are annotated in blue the x-axis refers to absolute positions in the sars-cov- spike protein sequence. rightbox plot of site-specific entropy values for the rbm sites (blue) and remaining non-rbm rbd sites (gray) sequence alignment (left) and identity for rbm and rbd (right) to sars-cov- of the rbd sequences showing binding to hace . rbm residues indicated by blue boxes. (c) binding of hace to human, pangolin and bat sarbecovirus rbds by bli. bat cov ratg we thank all scottish nhs virology laboratories who provided samples for sequencing and scott arkison for hpc maintenance. 
we thank chiara silacci-fregni from humabs biomed, sandra jovic, blanca fernandez rodriguez, federico mele, from the institute for research in biomedicine in bellinzona and tatiana terrot from ente ospedaliero cantonale in lugano for help in collecting serum samples. we thank cindy ng for help with protein production. we thank julia di iulio for help with analyzing gisaid sequences. we gratefully acknowledge the authors, originating and submitting laboratories of the sequences from gisaid, https://www.gisaid.org, on which much of this research is based. the isaric who ccp-uk study protocol is available at https://isaric c.net/protocols; study registry https://www.isrctn.com/isrctn . this work uses data provided by patients and collected by the nhs as part of their care and support #datasaveslives. we are grateful to the frontline nhs clinical and research staff and volunteer medical students who collected the data in challenging circumstances; and the generosity of the participants and their families for their individual contributions in these difficult times. we also acknowledge the support of jeremy j farrar, nahoko shindo, devika dixit, nipunie rajapakse, lyndsey

key: cord- -f qcabcx authors: han, yanxiao; král, petr title: computational design of ace -based peptide inhibitors of sars-cov- date: - - journal: acs nano doi: . /acsnano. c sha: doc_id: cord_uid: f qcabcx

peptide inhibitors against the sars-cov- coronavirus, currently causing a worldwide pandemic, are designed and simulated. the inhibitors are mostly formed by two sequential self-supporting α-helices (bundle) extracted from the protease domain (pd) of angiotensin-converting enzyme (ace ), which bind to the sars-cov- receptor binding domains. molecular dynamics simulations revealed that the α-helical peptides maintain their secondary structure and provide a highly specific and stable binding (blocking) to sars-cov- .
to provide a multivalent binding to the sars-cov- receptors, many such peptides could be attached to the surfaces of nanoparticle carriers. the proposed peptide inhibitors could provide simple and efficient therapeutics against the covid- disease.

severe acute respiratory syndrome coronavirus (sars-cov- ), previously known as novel coronavirus ( -ncov), is causing a pandemic of coronavirus disease. sars-cov- shares about % of its genome identity with sars-cov, which emerged in − . sars-cov- is highly contagious in humans and has rapidly caused an unprecedented pandemic, with a large number of fatalities worldwide. the sars-cov- virion, − nm in diameter, contains four structural proteins, known as the s (spike), e (envelope), m (membrane), and n (nucleocapsid) proteins. the s protein, imaged at the atomic level using cryo-electron microscopy, is responsible for the host attachment and fusion of the viral and host-cell membranes. this process is triggered when the s subunit of s protein binds to a host-cell receptor. to engage a host-cell receptor, the receptor-binding domain (rbd) of s undergoes transient hinge-like conformational motions (receptor-accessible or receptor-inaccessible states). the angiotensin-converting enzyme (ace ) is the host cellular receptor with a higher affinity to sars-cov- than to sars-cov. in the recognition of rbd, the protease domain (pd) of ace mainly engages the α -helix with a minor contribution from the α -helix and the linker of the β and β -sheets. in addition to a hectic search for vaccines against covid- , there is a very fast ongoing search for therapeutics acting on sars-cov- . depending on the activity, the therapies can be divided into several main categories: (1) preventing the viral rna synthesis and replication, (2) blocking the virus from binding to human cell receptors, (3) restoring the host's innate immunity, and (4) blocking the host's specific receptors or enzymes.
despite many experimental and computational studies currently exploring all of these categories, to date, there is no confirmed effective treatment specifically available for covid- . computational approaches have been used to search potential therapeutics against sars-cov- protease (category ). analogous screening of potential drugs against the s protein of sars-cov- (category ) provided small molecular compounds with a high binding affinity. unfortunately, most of these compounds do not attach to the binding interface of the rbd−ace complex. hesperidin was predicted to lie on the surface of rbd, but it did not cover the whole interface. in the early attempts of sars-cov blocking, short peptide inhibitors were studied and amino acid mutations were introduced into the s protein of sars-cov. however, the proposed peptide was too short ( residues) to maintain secondary structure, so it was unable to block the whole sars-cov binding surface. broad-spectrum antiviral nanoparticles and cyclodextrins were designed, simulated, and implemented in blocking of other viruses. they are category or inhibitors, but their applicability to sars-cov- is unknown. proteins or rigid peptides with specific (multivalent) binding domains and conformations matching rbd could be promising therapeutics for covid- . overall, protein therapies show a high specificity, small interference with biological processes, good tolerance in human organisms, and faster fda approval times. in this work, we design and simulate several peptide inhibitors against sars-cov- , which include components from the virus-binding domains of ace , based on the recently released crystal structure (pdb code: m ). the inhibitors, which have relatively low molecular weights, are structurally stable, conformationally match the s protein, and are highly specific to sars-cov- . this study could provide guidance for antigen recognition and structure-based designs of antibodies with high affinities.
the proposed small peptides could be used as inhaled therapeutics for topical lung delivery, providing an efficient way to combat covid- . preparation of inhibitors. in the crystal structure of ace and rbd of sars-cov- (pdb: m ), we first analyzed the interacting amino acids at the ace and rbd interface. in total, residues from ace interact with rbd: residues (q), (t), (d), (k), (h), (e), (e), (d), (y), and (q) are in α , one residue (residue m) comes from α , residues (k), (g), (d), and (r) come from the linker between β and β . therefore, the amino acids can be labeled as critical amino acids and α , α , β , and β as critical binding components. because most of the interacting residues are from α , we picked as inhibitor the α -helix alone. in particular, the − residues, shown in figure a , were selected. realizing that α (alone) might not even be stable, we next picked as inhibitor both α -and α -helices (residues to ) and the residues to (residues between β and β shown in orange in figure b ). this selection included all interacting residues from the crystal structure m . as the two α-helices are closely joined on one side (figure b) , they stabilize each other. to connect the two helices (red) with the β-sheets with residues to (orange), as shown in figure b , residues (leu) and (leu) were linked together by a side chain with a carbon−carbon bond, as shown in figure b . we have also designed other inhibitors that are closer to the ace protein, whose parts are connected by peptide bonds, and which contain all residues that initially bind to rbd in the m crystal structure. figure c (detail in figure e ) shows inhibitor , where residues to (orange) include the two β-sheets and a random coil (residues to ), whereas residues to (red) include the two α-helices with another random coil (residues to ). the two sequences are joined together by a peptide bond between residues and , and the two pieces of random coils were moved close to each other. 
finally, figure d (detail in figure f ) shows inhibitor , where two sequences including residues to (red) and residues to (orange) were selected. an extra peptide bond was made between residue and residue by adjusting the position of the corresponding sequences. the sequences of all inhibitors are shown in table s . to examine how these potential inhibitors bind to rbd of sars-cov- , we prepared these systems in the initial position known from the crystal structure (pdb: m ) and simulated them in physiological solution (methods), as shown in figure a −d. as a control, the pd of ace (residues to ) and rbd of sars-cov- were also simulated (figure e).

binding conformations. in figure a , ns long simulations showed that the helical structure of inhibitor deforms from the left side (loose end unfolding), although it still binds to the rbd of sars-cov- . in figure b −d, − ns long simulations revealed that inhibitors − bind in a stable way to the rbd of sars-cov- , without α losing its structure. due to different linkages among the critical binding components, the overall conformations of inhibitors − vary. specifically, the α -helix, which mostly contributes to the complementary sequence and conformational matching to rbd, is maintained in inhibitors − with different degrees of bending. the β-sheets in the structures of inhibitors and are also preserved. overall, the critical binding components in inhibitors − bind to rbd in a manner very similar to that of the crystal structure. the simulated stable conformations of inhibitors , , and correspond to their energy minima of folding, which would drive the folding process toward the stable direction.

energies. to further quantify the binding of these inhibitors to rbd, we calculated the rmsd for the critical amino acids in each inhibitor and for the whole inhibitors. figure f shows the average rmsd at the end of our simulations (see also figure s ).
inhibitor has larger rmsd for the critical amino acids compared to that of the control and the largest fluctuations for both the critical amino acids and the overall rmsd ( figure s a,b) . this can be attributed to unfolding of α , shown in figure a . a highly promising inhibitor has a rmsd of the critical amino acids and the overall rmsd similar to those in the control (lowest). inhibitor has a rmsd of the critical amino acids and the overall rmsd higher than that of the control and inhibitors and . however, figure s b shows that inhibitor has a very smooth overall rmsd at later times. this may be due to a poor adaptation of their added connections at early times. inhibitor shows slightly bigger fluctuation for the overall rmsd but steady rmsd ( figure s a ) for the critical amino acids at later times, which indicates fluctuation shown in the overall structure comes from nonessential connection parts. the interaction energies have van der waals (vdw) and electrostatic components, calculated by the namd energy plugin. the total energies are shown in figure g and figure s (detail). the residues which contribute to the interaction energies between inhibitors and sars-cov- are selected with a cutoff of Å. the selections are updated in every frame. inhibitors and show interaction energies similar to those of the control, with inhibitor having slightly stronger binding than the control; however, inhibitor shows an interaction energy slightly lower than that of the control. the larger interaction energy in inhibitor might be due to nonspecific interactions caused by the deformed helix. the lower interaction energy in inhibitor could be attributed to the total number of residues, which is less than those of inhibitor and . in summary, using classical molecular dynamics simulations, we have shown that peptide inhibitors extracted from ace provide highly promising trails for sars-cov- blocking. 
the single α -helix used in inhibitor is less stable, whereas the α , -helices used in inhibitors − support each other and retain their bent shape, which provides a conformational matching to the rbd of sars-cov- and full coverage of the rbd surface. precise conformational matching between the designed peptides and the virus provides room for improving the binding affinity, which should be considered in future inhibitor design protocols. suitable inhibitors should have a selective binding with lower rmsd for critical amino acids and relatively high binding energies. the binding affinity could be further enhanced by a multivalent binding of multiple peptides attached to surfaces of nanoparticles, dendrimers, and clusters. in analogy to nanoparticle-based inhibitors, we could attach to the α helix a sulphonated ligand mimicking a heparan sulfate, which can attach to positively charged residues at the bottom of rbd. these inhibitors could be used as inhaled therapeutics, preventing the virus activation in lungs.

the inhibitors and rbd of the virus were simulated with namd using the charmm protein force field. the particle mesh ewald (pme) method was used for the evaluation of long-range coulombic interactions. the time step was set to fs. the simulations were performed in the npt ensemble (p = bar and t = k), using the langevin dynamics with a damping constant of ps$^{-1}$. after steps of minimization, ions and water molecules were equilibrated for ns around proteins, which were restrained using harmonic forces with a spring constant of kcal/(mol Å$^2$). the last frames of restrained equilibration were used to start simulations of free inhibitors and partially constrained pd of ace (two residues on the bottom). the simulations lasted for − ns due to different atom numbers in different systems and different computer power used.

calculation of rmsd.
the time-dependent rmsd for the critical amino acids and the whole inhibitors (figure s ) were calculated from

$$\mathrm{RMSD}(t_j) = \sqrt{\frac{1}{N_\alpha} \sum_{\alpha=1}^{N_\alpha} \left| \vec{r}_\alpha(t_j) - \vec{r}_\alpha(t_0) \right|^2}$$

where $N_\alpha$ is the number of atoms whose positions are being compared, $\vec{r}_\alpha(t_j)$ is the position of atom $\alpha$ at time $t_j$, and $\vec{r}_\alpha(t_0)$ is the initial coordinate. the selection of coordinates contains all of the atoms in the inhibitors or critical amino acids, excluding hydrogens. the time-dependent rmsd was averaged over the last ns of simulation time, which corresponds to the last frames of each trajectory, as shown in figure f . the standard deviations are shown by the error bars.

calculation of binding energy. the interacting residues from inhibitors and rbd of sars-cov- were first selected with a Å cutoff distance. the electrostatic and vdw energy contributions between the interacting residues are calculated by the namd energy plugin. the electrostatic contribution is given by

$$V_{\mathrm{elec}} = \frac{1}{4\pi\varepsilon_0} \frac{q_i q_j}{\varepsilon \left| \vec{r}_i - \vec{r}_j \right|}$$

where $|\vec{r}_i - \vec{r}_j|$ is the distance between the two charges, $q_i$ and $q_j$; $\varepsilon_0$ is the vacuum permittivity and $\varepsilon$ is the dielectric constant of the solvent which is set to . to increase the efficiency of the simulations, pairwise interaction calculations are not performed beyond a cutoff distance. long-range electrostatic interactions are calculated by the pme method. the lennard-jones (lj) 12−6 potential energies are used to describe the vdw interactions and close distance atomic repulsions:

$$V_{\mathrm{LJ}} = \varepsilon_{ij} \left[ \left( \frac{\sigma_{ij}}{r_{ij}} \right)^{12} - 2 \left( \frac{\sigma_{ij}}{r_{ij}} \right)^{6} \right]$$

where $\varepsilon_{ij}$ is the maximum stabilization energy for the ith and the jth atoms, $\sigma_{ij}$ is the distance between ith and jth atoms at the minimum of the potential, and $r_{ij}$ is the actual distance between the two atoms. the lj parameters between different atom types are calculated using a mixing rule, such as $\sigma_{ij} = (\sigma_{ii} + \sigma_{jj})/2$ and $\varepsilon_{ij} = \sqrt{\varepsilon_{ii} \varepsilon_{jj}}$ (lorentz−berthelot rules). the time evolution of the interaction energy is shown in figure s , and the time-averaged interaction energy over the last ns ( frames) is shown in figure g , with standard deviation shown by the error bar.
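the rmsd and interaction-energy definitions described above can be sketched in a few lines of python. this is a minimal illustration under stated assumptions, not the namd energy plugin used in the study: the function names are hypothetical, contacts are selected per atom pair rather than per residue, the cutoff and dielectric constant are left as arguments because their values are not recoverable from the text, and the coulomb conversion constant of 332.0636 kcal Å/(mol e²) is assumed so that charges in units of e and distances in Å give energies in kcal/mol.

```python
import math

# Assumed conversion constant (kcal*Angstrom/(mol*e^2)); not stated in the text.
COULOMB = 332.0636


def rmsd(frame, reference):
    """RMSD between two equally long lists of (x, y, z) heavy-atom positions."""
    if len(frame) != len(reference) or not frame:
        raise ValueError("frames must be non-empty and of equal length")
    total = sum(math.dist(a, b) ** 2 for a, b in zip(frame, reference))
    return math.sqrt(total / len(frame))


def contacts(group_a, group_b, cutoff):
    """Index pairs (i, j) whose atoms lie within `cutoff` of each other.

    Simplification: the text selects interacting residues, re-evaluated in
    every frame; here the same idea is shown for individual atoms.
    """
    return [
        (i, j)
        for i, a in enumerate(group_a)
        for j, b in enumerate(group_b)
        if math.dist(a, b) <= cutoff
    ]


def lorentz_berthelot(sigma_ii, sigma_jj, eps_ii, eps_jj):
    """Mixing rule: arithmetic mean for sigma, geometric mean for epsilon."""
    return (sigma_ii + sigma_jj) / 2.0, math.sqrt(eps_ii * eps_jj)


def lj_energy(r, sigma_ij, eps_ij):
    """12-6 LJ energy with sigma_ij defined as the distance at the minimum."""
    x = (sigma_ij / r) ** 6
    return eps_ij * (x * x - 2.0 * x)


def coulomb_energy(r, q_i, q_j, dielectric=1.0):
    """Pairwise electrostatic energy; `dielectric` is an assumed default."""
    return COULOMB * q_i * q_j / (dielectric * r)
```

averaging `rmsd` over the final frames of a trajectory, and summing `lj_energy` and `coulomb_energy` over the pairs returned by `contacts` in each frame, gives quantities analogous to the time-averaged values described in the text.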
the supporting information is available free of charge at https://pubs.acs.org/doi/ . /acsnano. c . sequences of inhibitors, rmsd for the critical amino acids in each inhibitor and for the whole inhibitors, and interaction energies between the contact residues of inhibitors (or ace ) and sarscov- (pdf) the species severe acute respiratory syndrome-related coronavirus: classifying -ncov and naming it sars-cov- epidemiological and clinical characteristics of cases novel coronavirus pneumonia in wuhan, china: a descriptive study clinical features of patients infected with novel coronavirus in wuhan cryo-em structure of the -ncov spike in the prefusion conformation structure, function, and evolution of coronavirus spike proteins the coronavirus spike protein is a class i virus fusion protein: structural and functional characterization of the fusion core complex receptor recognition by the novel coronavirus from wuhan: an analysis based on decade-long structural studies of sars coronavirus structural basis for the recognition of the sars-cov- by full-length human ace analysis of therapeutic targets for sars-cov- and discovery of potential drugs by computational methods fast identification of possible drug treatment of coronavirus disease- (covid- ) through computational drug repurposing study molecular modeling and chemical modification for finding peptide inhibitor against severe acute respiratory syndrome coronavirus main proteinase computational simulation of interactions between sars coronavirus spike mutants and host species-specific receptors broad-spectrum non-toxic antiviral nanoparticles with a virucidal inhibition mechanism computational studies of micellar and nanoparticle nanomedicines protein therapeutics: a summary and pharmacological classification designing inhaled protein therapeutics for topical lung delivery: what are the next steps? 
scalable molecular dynamics with namd all-atom empirical potential for molecular modeling and dynamics studies of proteins particle mesh ewald: an n·log(n) method for ewald sums in large systems

the authors declare no competing financial interest. we would like to thank lela vuković (utep) for useful discussions. y.h. acknowledges the support from the dean's scholar fellowship (uic).

key: cord- - aotsb g authors: dong, jianbo; huang, betty; wang, bo; titong, allison; gallolu kankanamalage, sachith; jia, zhejun; wright, meredith; parthasarathy, pannaga; liu, yue title: development of humanized tri-specific nanobodies with potent neutralization for sars-cov- date: - - journal: sci rep doi: . /s - - -y sha: doc_id: cord_uid: aotsb g

sars-cov- is a newly emergent coronavirus, which has adversely impacted human health and has led to the covid- pandemic. there is an unmet need to develop therapies against sars-cov- due to its severity and lack of treatment options. a promising approach to combat covid- is through the neutralization of sars-cov- by therapeutic antibodies. previously, we described a strategy to rapidly identify and generate llama nanobodies (vhh) from naïve and synthetic humanized vhh phage libraries that specifically bind the s sars-cov- spike protein, and block the interaction with the human ace receptor. in this study we used computer-aided design to construct multi-specific vhh antibodies fused to human igg fc domains based on the epitope predictions for leading vhhs. the resulting tri-specific vhh-fc antibodies show more potent s binding, s /ace blocking, and sars-cov- pseudovirus neutralization than the bi-specific vhh-fcs or combination of individual monoclonal vhh-fcs. furthermore, protein stability analysis of the vhh-fcs shows favorable developability features, which enable them to be quickly and successfully developed into therapeutics against covid- .
sars-cov- is a coronavirus that causes the human disease covid- , which is contagious and can rapidly spread to cause mild to severe infection, including death [cdc (https://www.cdc.gov/coronavirus/types.html)]. the spread of this newly emergent virus has reached a pandemic level with a significant public impact on the world, leading to more than million infections and more than . million deaths worldwide [world health organization (who) (https://www.who.int/emergencies/diseases/novel-coronavirus- )]. in addition to threatening human health, covid- has also caused a significant socio-economic impact around the world [united nations (https://www.undp.org/content/undp/en/home/coronavirus/socio-economic-impact-of-covid- .html)]. although there are relatively successful diagnostic methods to detect the sars-cov- infection in humans, there are currently no successful therapies that can interfere with virus replication. the small antiviral molecule remdesivir (gilead), which inhibits the rna-dependent rna polymerase of sars-cov- , decreases the recovery time in patients with covid- , but it most likely cannot completely stop or prevent sars-cov- infections in humans. another small antiviral molecule, grl- , shows promise in interfering with the sars-cov- replication by inhibiting the papain-like protease; however, it is yet to be tested in clinical trials. moreover, there are no fda-approved vaccines to prevent sars-cov- infections in humans, although several groups are currently pursuing such vaccines [who (https://www.who.int/publications/m/item/draft-landscape-of-covid- -candidate-vaccines)]. therefore, rapid development of therapeutics and preventative strategies has become an essential and urgent need to fight the covid- pandemic. the trimeric spike (s) proteins that protrude through the envelope of the sars-cov- virion mediate virus entry into the host cells by interacting with the human ace receptor.
therefore, a major aim of anti-sars-cov- neutralizing antibodies in development is to block the interaction of sars-cov- s protein with ace . in particular, two popular strategies have been employed to discover and develop monoclonal igg antibodies that can recognize sars-cov- s protein mainly by binding to its receptor binding domain (rbd). the first commonly used method is to clone the antibody v genes from the b cells of surviving covid- patients who have mounted a natural immune response against sars-cov- . this strategy has yielded a number of neutralizing monoclonal antibodies; however, it is important to note that the patients' antibody repertoire condition and the timing of blood sample collection play a critical role in its success. the other well-recognized and classic approach for antibody generation is to immunize humanized mice. additionally, new sars-cov-

identification of vhhs binding to different epitopes of sars-cov- s protein rbd. recently, we reported the identification of llama vhhs that bind to the sars-cov- s protein rbd. briefly, we used two llama vhh libraries (one naïve library and another humanized synthetic library derived from the naïve library) to screen for vhhs that bind to the sars-cov- s protein in vitro. we identified a total of s protein binders, from the naïve and from the humanized libraries. out of the s protein binders, vhhs blocked the interaction between sars-cov- s rbd and ace receptor, with s/ace blockers identified from the naïve library and identified from the humanized library (data not shown). furthermore, we observed that the pairwise addition of some of the vhhs caused synergistic effects on sars-cov- s/ace blocking. we hypothesized that this synergistic effect is caused by binding of the vhhs to different epitopes within the s rbd. to test this, we performed epitope binning assays by biolayer interferometry (fig. a-c) or elisa (fig. d) on a selected number of candidates.
in the initial epitope binning assay (fig. a-c) , we used an s rbd sensor to capture a-fc, b-fc, or f-fc separately, followed by the incubation with our lead candidates b-fc, f-fc or a-fc. the vhhs were fused to human igg fc domains to render the fc effector functions against sars-cov- . this analysis showed that with the a-fc-loaded probe, the addition of f-fc further increased the signal compared to the a-fc control, while the addition of b-fc decreased the signal compared to the control (fig. a) . this indicates that f-fc does not compete with the a-fc site and it is likely that they bind to different s rbd epitopes. in contrast, b-fc competed with a-fc, indicating that they compete for binding to the same s rbd epitope (fig. a) . similarly, with the b-fc-loaded probe (fig. b) , f-fc increased the signal compared to the b-fc control. this shows that f-fc does not compete with b-fc. interestingly, a-fc also increased its signal compared to b-fc control. this suggests that although having a common binding region, a binds to a wider epitope than b (fig. b) . with f-fc-loaded probe, both a-fc and b-fc showed an increase of the signal compared to the f-fc control. this further shows that f-fc does not compete with either b-fc or a-fc, and likely bind to a different epitope (fig. c) . these results confirm our hypothesis and show that s/ace blocking vhhs bind to at least two separate unique epitopes within the s rbd. next, we performed an elisa-based epitope binning assay to assess five additional vhhs ( c, f, a, f, and g ) unfused to fc, but previously assessed to block the sars-cov- s/ace interaction . the assessment of more vhhs would allow us to categorize several of our other vhhs into binding groups, which will aid in multi-specific antibody design and construction. 
in this elisa, wells were coated with sars-cov- s and incubated with bi-specific vhh-fc b- a (based on previous data, b and a likely bind the same epitope) or monoclonal vhh-fc f-fc (based on previous data, this binds a different epitope than b or a) premixed with the vhh candidates. the resulting relative fluorescence signals obtained for each sample were calculated to reflect the percent difference from c, f, a, f, g , and controls ( f-fc and b- a-fc) signals, when the vhhs are combined with b- a-fc or f-fc (fig. d). the results show that vhh-fcs c, f, f, as well as the b- a-fc control, have almost % difference from b- a-fc, which strongly suggests that they compete for the same epitope (highlighted in red). however, g (highlighted in light red) may partially compete with b- a-fc, whereas a likely does not compete for the same epitope (highlighted in green). additionally, these results show that a and the f-fc control may compete with f-fc (fig. d), while other vhhs, including the b- a-fc control, resulted in a lower percent difference. we also performed additional epitope binning assays using biolayer interferometry to assess the competition of the vhh-fcs c, g , and a to bind to s rbd. the vhh-fcs f and f poorly bound to the biolayer interferometry probes used for this assay and were excluded from analysis. this approach confirmed the results that we obtained by elisa and showed that c and g likely belong to group , and a belongs to group in terms of the binding competition (supplementary fig. ). interestingly, g -fc shows competition with both b-fc and a-fc when it is loaded onto the probe first (data not shown). in contrast, reversal of the loading order further increased its signal compared to both b-fc control and fig. ). taken together, we could categorize vhh blockers of s/ace interaction into major groups based on their epitope binding; group consists of vhhs, whereas group consists of vhhs (fig. e).

elucidation of epitopes on s rbd that bind to vhh-fcs.
in an effort to elucidate the structural basis of the newly discovered epitope binding groups, we computationally generated structural models for b, f, and a vhhs and docked them with sars-cov- s rbd structure exported from pdb m j using schrodinger bioluminate software [ ] [ ] [ ] . for context, fig. a shows the sars-cov- s protein with the ace binding residues in red font. this approach generated an array of poses of each s rbd/vhh complex structure, which allowed us to further analyze the interfaces of those poses with a good piper cluster size and led us to identify five regions in the rbd which may interact with vhh b, a, and f, respectively (fig. a,b) [ ] [ ] [ ] . next, we generated different s rbd deletion mutants to validate the computationally mapped epitopes in-vitro to select the best docking model for molecular analysis. interestingly, these s rbd deletion regions have been shown to mediate the s rbd/ace interaction in recently published literature [ ] [ ] [ ] [ ] (table ) . we tested wild-type and all the s deletion mutants for their ability to bind to a tri-specific vhh-fc to check whether the proteins are folded and expressed on the cells. the results show that they are indeed expressed and folded as they all bind to the tri-specific vhh-fc, although the level of expression and/or folding might be different across the mutants based on the strength of the binding signals. the wild-type s rbd and the deletion (del ) shows stronger binding, whereas the deletions (del ), (del ), (del ) and (del ) show weaker binding to the tri-specific vhh-fc ( supplementary fig. ). then we assessed the binding profiles of the s rbd wild-type and the deletion mutants with selected vhh-fcs from group and group , as well as ace (fig. c,d) . the binding of vhh-fc candidates from both group and group , as well as ace to s rbd are affected following the removal of del . 
it is possible that this result is due to a conformational change or decrease of s protein expression following its deletion because, based on the crystal structure of the rbd/ace complex (pdb m j), the deleted domain is not part of the s rbd/ace interface. the del mutant, which is adjacent to a computationally-predicted epitope domain in region , does not prevent the binding of both group and group vhh-fcs to s rbd. in addition, it does not prevent the binding of ace to s rbd. the del , , and mutants all decreased binding of both group and group vhh-fcs to s rbd. however, these regions are more critical for group than group for their binding. in addition, these regions are critical for ace to bind to s rbd.

(d) epitope binning of vhhs was assessed using an elisa method. briefly, the sars-cov- s protein was incubated with b- a-fc or f-fc, and binding competition was performed with the vhhs followed by the detection of the biotinylation. the experiment was performed in duplicates, and the average percent difference from the competing pairs relative to the vhh-fc alone signal is indicated in the table. the vhh associated percentages highlighted in red are likely high vhh competitors, in light red are partial competitors, and in green are likely non-competitors. (e) the two groups of vhhs categorized based on the binding to epitopes on s rbd.

figure . elucidation of epitopes on s rbd that bind to vhh-fcs. (a) ace binding residues on sars-cov- s rbd were determined by schrodinger bioluminate based on the protein-protein interactions of protein data bank (pdb) m j. (schrödinger release - : bioluminate, schrödinger, llc, new york, ny, . https://www.schrodinger.com/products/bioluminate. requires permission to be used). the residues in red are predicted ace interactors. the deletions generated on sars-cov- s rbd are shown with the boxed regions. (b) deletion map schematics of the s rbd deletion mutants. (c) the binding of vhh-fcs and ace to expi cells expressing sars-cov- s wild type (wt) or mutant proteins (del -del ) was assessed by flow cytometry following fitc-conjugated secondary antibody treatment. an isotype control antibody and facs buffer were used as negative controls. the experiment was performed at least three times, which yielded similar trends in results. a representative image of a single experiment is shown here. the graph was generated by the prism (graphpad) software (prism version . . . https://www.graphpad.com/scientific-software/prism/. requires permission to be used). (d) in the experiment shown in (c), the binding percentages relative to the s wt for each vhh-fc were calculated in the context of each deletion mutant. the group vhh-fcs and the values with binding differences that contributed to their categorization into that group are shown in red. those values for group vhh-fcs are shown in blue. (e) docking models between sars-cov- s rbd and the lead vhhs generated with schrodinger bioluminate software. (schrödinger release - : bioluminate, schrödinger, llc, new york, ny, . https://www.schrodinger.com/products/bioluminate. requires permission to be used).

taken together, the binding epitopes for group are more associated with del , and regions which are located at the interface of s rbd/ace , while at least part of the epitopes for group are shifted farther away from the s rbd/ace interface relative to the epitopes for group vhhs (fig. c,d). based on the binding and epitope binning data, we constructed d docking models that predicted the interactions between sars-cov- s rbd, ace and lead vhh-fcs (fig. e). these models show that predicted binding epitopes for group vhhs b and a are located at the s rbd/ace interface.
in contrast, the epitope for group vhh f is located away from the s rbd/ace interface (fig. e). interestingly, there are binding variations seen within group . the binding of a to del , del , del and del decreased more than that of b. this shows that epitopes for a and b are not the same even though they compete with each other and were initially characterized to be within the same binding group (fig. c,d). taken together, our analysis confirms that there are two major binding groups (group and group ) and we show the likely binding regions on the sars-cov- s protein for each vhh.

tri-specific vhh-fcs show potent s rbd binding and s/ace blocking activity. next, we tested whether the combination of individual vhhs binding to different s rbd epitopes into bi-specific antibody molecules would yield synergistic effects in sars-cov- binding and s/ace blocking. as expected, the resulting bi-specific vhh-fc b- f showed superior binding to s rbd and s/ace blocking compared to individual component vhh-fcs. since sars-cov- s proteins form trimers, we started to study whether tri-specific antibodies with two binding units from group and another binding unit from group or vice versa would have better binding and blocking function than the bi-specific antibody. here, we focused only on tri-specific antibodies, as any larger multi-specific molecules will likely affect developability with fc fusion proteins. we selected the vhhs from both group and with the most favorable binding, functional and developability features, and constructed tri-specific vhh-fcs with the computer-aided antibody design using the software bioluminate (schrodinger) that enabled their effective construction and optimization. then, we tested the tri-specific, bi-specific and mono-specific vhh-fcs for sars-cov- s protein binding and s/ace blocking in vitro (fig. a,d).
as expected, the multi-specific antibodies showed higher binding affinities to sars-cov- s protein rbd in-vitro, with the tri-specific vhh-fcs f- b- a (kd ~ . nm) and b- f- a (kd ~ . nm) showing more potent binding than bi-specific vhh-fc b- f ( fig. a -c,e). the binding affinities for tri-specific vhh-fcs were higher than that of individual component vhh-fcs b, f and a used in combination, and the binding affinity for b- f-fc was higher than that of individual component vhh-fcs b, and f used in combination (fig. a ). in addition, f- b- a and b- f- a showed potent blocking of the sars-cov- s/ace interaction, with ic values of . nm and . nm, and full inhibition around nm for both, respectively, that were far superior to using individual component vhh-fcs as combinations (ic of . nm and full inhibition around nm). in addition, f- b- a and b- f- a were more potent than bi-specific vhh-fc b- f in blocking sars-cov- s/ace interaction (fig. d) . interestingly, the tri-specific vhh-fc a- b- f had lower s/ace blocking ability showing the physical arrangement and/or binding orientation of the vhhs in a multi-specific antibody is important for its binding and blocking (fig. d) . taken together, this data indicates that the tri-specific vhh-fcs have a higher synergistic potency in both binding and blocking the s or s /ace interaction than bi-specific or mono-specific antibodies. tri-specific vhh-fcs have favorable developability features. during the computer-aided design process, we incorporated several development-enhancing features in the structures of our vhh-fcs. therefore, we analyzed the physico-chemical properties, using dls and dsf/sls methods, of our lead bi-and tri-specific antibodies to determine whether they possess favorable characteristics for large-scale manufacturing that is essential for the commercial development of the antibodies (fig. e) . 
our data revealed that the lead tri-specific vhh-fc f- b- a has lower aggregation potential based on the dls method and is thermostable based on the dsf/sls method (fig. e). we tested the multi-specific vhh-fcs for their ability to target sars-cov- in cell-based functional assays. first, we analyzed the virus-neutralizing ability of our antibodies using a pseudovirus that expresses the sars-cov- s protein. the tri-specific vhh-fcs f- b- a, b- f- a, and the mono-specific combinations of vhhs ( b-fc + f-fc + a-fc) prevented the infection of human cells by the pseudoviruses (fig. a ). in accordance with (fig. a) . the pseudovirus data presented here confirm the synergistic effect of the tri-specific antibodies and, most importantly, suggest that they are likely effective in preventing sars-cov- infection. as our vhh-fcs contain the fc domain of human igg , we expected that they would be able to trigger fc-dependent functions to eliminate the viruses from the body. to test this, we used a cell line that transiently expresses the sars-cov- s protein. then, we assessed the ability of our multi-specific vhh-fcs to promote antibody-dependent cellular cytotoxicity (adcc), which is an fc-dependent function of antibodies. in addition to our lead tri-specific vhh-fc antibody f- b- a, we also tested a- f- a-fc, another tri-specific antibody we constructed with similar s binding and s/ace blocking potency (supplementary fig. ). as expected, the vhh-fcs were able to induce adcc in the cells (fig. b). this suggests that these vhh-fcs could bind to immune cells through their fc domain and elicit fc-dependent functions, thereby allowing multiple mechanisms of action against sars-cov- , including binding sars-cov-s and blocking s /ace interactions.

docking model for sars-cov- s rbd with tri-specific vhh-fc f- b- a was generated by bioluminate (schrödinger release - : bioluminate, schrödinger, llc, new york, ny, . https://www.schrodinger.com/products/bioluminate.
requires permission to be used). in the software, the sars-cov- rbd spike protein trimer (pdb x a) was split into three monomeric forms (chain a, b and c). then, the b/ f/rbd model structure was aligned with chain a of pdb x a to create group and the a/rbd model structure was aligned with chain b of pdb x a to create group . then, group , group and chain c were merged to generate the final structure. the s rbd/vhh docking structure is represented with a surface structure (a) and ribbon structure (b). the enlarged s rbd/vhh docking structure is shown on the right (c).

in this study, we developed and characterized llama-derived multi-specific nanobodies, and the resulting data strongly suggest that they will be effective against sars-cov- , the virus that causes covid- . the covid- pandemic has caused widespread health and social issues around the globe, and requires therapeutics that can effectively stop and prevent sars-cov- infection. several monoclonal antibodies against sars-cov- have been proposed and are being tested as anti-viral therapies, either as individual agents or as combination therapies; however, to our knowledge, this is the first study that introduces and demonstrates the efficacy of multi-specific antibodies against sars-cov- [ ] [ ] [ ] [ ] [ ] [ ] , . to successfully design and construct multi-specific vhh binders, epitope information for each individual vhh clone is necessary. here, instead of obtaining crystal structures for each antigen/antibody complex, we used a different method for epitope identification. we initially performed epitope binning with biolayer interferometry and categorized s/ace blocking vhhs into groups. the vhhs within each group competed with one another, but there was no competition with vhhs from the other group, strongly suggesting that group and group vhhs are two separate binding groups.
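the binning rule described above can be sketched in code. the following is a minimal illustration, not the authors' analysis pipeline: it assumes the biolayer interferometry readout has already been reduced to a list of competing pairs, and it treats any chain of competing clones as one group. the clone names and the `bin_epitopes` helper are hypothetical.

```python
from collections import defaultdict


def bin_epitopes(clones, competing_pairs):
    """Group clones into bins; clones linked by a chain of competition share a bin."""
    neighbors = defaultdict(set)
    for a, b in competing_pairs:
        neighbors[a].add(b)
        neighbors[b].add(a)

    bins, seen = [], set()
    for clone in clones:
        if clone in seen:
            continue
        # Depth-first walk over the competition graph starting at this clone.
        stack, members = [clone], set()
        while stack:
            current = stack.pop()
            if current in members:
                continue
            members.add(current)
            stack.extend(neighbors[current] - members)
        seen |= members
        bins.append(sorted(members))
    return bins


# Hypothetical clones and competition calls, for illustration only.
clones = ["vhh_a", "vhh_b", "vhh_c", "vhh_d", "vhh_e"]
competing_pairs = [("vhh_a", "vhh_b"), ("vhh_c", "vhh_d")]
bins = bin_epitopes(clones, competing_pairs)
print(bins)
```

grouping by chained competition is a simplification: clones that fall into the same bin can still differ in their exact epitopes, so a bin is only a starting point for the docking and deletion-mutant analysis.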
then, we computationally constructed vhh models and docked them separately to an s rbd structure obtained from a publicly-available crystal structure of sars-cov- s rbd/ ace , and utilized docking structures with higher pose cluster size to predict possible epitopes for the individual vhhs. to validate the involvement of predicted epitopes in vhh/s rbd binding, we compared the binding ability of each vhh to wild type s rbd and five deletion mutants with each predicted epitope deleted. as shown in fig. , both group and group vhhs likely bind to the regions del , del , and del which overlap with the ace binding interface of s rbd, however, at least part of the epitope for group is likely located more outwards of this region and has relatively less overlap with the ace binding interface of s rbd. currently, a number of structures of s rbd/antibody complexes have been published. the analysis of these structures show that there are likely main "hot" antibody binding regions in s rbd: one likely in the n-terminal region (del ) , , and the other likely in the ace binding interface (del , del , del ) , , . our selected vhh binders in tri-specific antibodies possibly cover both of these regions (fig. ). based on this information, we were able to define the lead tri-specific vhh-fc format, including the linker length and the order of the vhh binders. the tri-specific antibodies are advantageous as therapeutic agents because they simultaneously bind multiple epitopes within the s protein rbd that increase their antigen-binding affinity and avidity (fig. ) . the vhhs b and f that comprises the bi-specific antibody bind to two different epitopes in the s protein rbd . in our tri-specific antibody design, we incorporated the vhh a that has an almost identical epitope as b. 
these vhhs could bind in different orientations to the same or similar epitopes, or to a corresponding epitope in another s protein in the trimer, increasing the binding and blocking potency of the tri-specific vhh-fc. in fact, this phenomenon has been previously shown for other multi-specific antibodies. for example, the cd targeting t cell engager antibody cd -tcb (roche) with two cd binding domains ( : molecular format) has increased potency compared to other cd -binding bi-specific antibodies in clinical development . in agreement with this hypothesis, the resulting tri-specific vhh-fcs showed very potent characteristics in terms of the s binding and s/ace blocking efficacy, which are among the best in currently published anti-sars-cov- therapeutic antibodies ( table ) . because of these characteristics, the tri-specific vhh-fcs could be used at low concentrations for therapeutic applications that would potentially lower their toxicity in humans. in addition, the strong binding of the antibodies to the virions would minimize the risk of antibody-dependent enhancement (ade) that is caused by sub-optimal antigen-antibody interactions and promotes enhanced viral infections , . the multi-specific targeting approach also minimizes the loss of antibody binding to viral antigens due to the mutations of the viruses. the rna viruses are known to mutate, and in this sense coronaviruses could lose the binding to antibodies relatively easily due to structural changes in the viral components , . however, the vhh multi-specific antibodies would still bind to the mutated virus since the other vhhs in the tri-specific antibody would bind the unmutated epitopes of the virus. another advantage of the vhh multi-specific platform is the ability to target multiple viruses. 
for example, it is possible to adjoin vhhs that bind to other coronaviruses such as sars-cov and mers-cov, and construct pan-coronavirus tri-specific vhh-fcs that would be effective in preventing and treating a broad spectrum of coronaviruses. our multi-specific antibody design connects human igg fc domain to bi-or tri-specific vhhs. having the fc domain in the antibody structure confers fc-dependent cytotoxic functions such as adcc, complementdependent cytotoxicity (cdc) and antibody-dependent cellular phagocytosis (adcp) [ ] [ ] [ ] [ ] [ ] . these additional fc-dependent functions, in addition to blocking virus entry and possible virus aggregation, would equip the vhh-fcs with multiple mechanisms of action, making them more potent in neutralizing the coronaviruses. indeed, our lead tri-specific vhh-fc f- b- a show potent neutralization of sars-cov- pseudovirus infection in human cells. one of the questions in the field of antibody therapeutics is whether the effect of using multi-specific single molecule is better than using a combination of monoclonal antibodies that collectively target the same epitopes or not . here, we show that multi-specific antibodies are more effective in blocking host-virus interactions than a combination of monoclonal antibodies. our tri-specific vhh-fc f- b- a was much more potent in blocking the sars-cov- s/ace interaction than using vhh-fcs f, b and a individually as a combination. it is likely that physically combining the vhhs increases overall association constants (k on values) and decreases overall dissociation constants (k off values), producing lower binding constants, thus increasing antibody affinity towards antigens. it also increases the avidity of the antibodies, making them more effective in neutralizing viruses. one of the hallmarks of a successful therapeutic antibody is its developability features , . 
especially during pandemics such as covid- , when rapid production of antibodies in high quantities is essential, the developability and manufacturability of the antibodies play even more crucial roles. our design has the advantage of using llama vhh nanobodies that have high stability , . indeed, the biochemical and biophysical characteristics of the multi-specific vhh-fcs show that they can be purified in high quantity, have better aggregation resistance, and have favorable thermostability. in addition, our antibodies have high developability because the multi-specific design combines the individual vhhs into single molecules instead of combinations, making their manufacturing easier. an alternative strategy for increasing the developability of the anti-sars-cov- multi-specific antibodies would be to combine vhhs without the addition of the igg fc domain to construct tetra-specific vhhs. these molecules would have the added advantage of increased affinity and avidity towards sars-cov- s protein compared to bi- and tri-specific vhh-fcs, despite lacking the fc effector functions. these tetra-specific antibodies would be ideally suited as prophylactic antibodies to prevent sars-cov- infection in humans because their llama vhh-only structure would have increased thermostability, easier combination capability, and the possibility of easy large-scale manufacturing using cost-effective expression systems such as yeast . one of the key features of our therapeutic antibodies is the use of computer-aided design, which greatly reduces their development time and enhances their optimization efficiency. for instance, from the inception of this project, it was possible to produce, optimize and test our lead tri-specific vhh-fcs in less than months.
this shows that this strategy is powerful for producing novel therapeutic antibodies for time-sensitive unmet needs, and can be utilized for future outbreaks that would require rapid development of antibody therapeutics. cell lines and transfections. the cell lines used in this study were cultured in media as stated below. epitope binning (competition) assays. the initial assay was performed using gator system (probe life). after pre-wetting the sars-cov- s rbd sensors in q buffer (probe life), the sensor captured - µg/ ml of the first monoclonal vhh-fc for about s, then the loaded sensor captured µg/ml of the second monoclonal vhh-fc, either b, f, or a, which was quantified over time by gator. the follow-up assay for vhh-fcs b- a and f was performed by elisa. a -well plate was coated to a final concentration of µg/ml of sars-cov- s protein and placed overnight at ºc. to test the binding with vhhs c, f, a, f and g , the following method was used. b- a-fc or f-fc at µg/ml were premixed with each competing c-myc tagged vhhs from periplasmic supernatant at a : ratio. after another one hour of incubation, a biotinylated anti-c-myc antibody ( e ) was added and the samples were incubated for another one hour. then, streptavidin-hrp was added followed by the treatment with amplex red (thermo fisher scientific) and % h o containing development solution. the emitted signal for each sample was detected by using a fluorescence plate reader (spectramax gemini xps). to test the binding with vhhs b- a-fc and f-fc, the following method was used. b- a-fc or f-fc at µg/ml were premixed with competing biotinylated b- a-fc or f-fc at a : ratio. after one hour of incubation, the biotin-streptavidin detection system as described above was used to analyze their competition. 
the percent difference between the competing pairs and the vhh-fc alone signal was calculated using the following formula: % difference from vhh-fc signal = (1 − ((signal of competing pair − no antibody signal)/(signal of vhh-fc alone − no antibody signal))) × 100.

cell binding assay. binding of vhh-fcs to sars-cov- s expressing cells was assessed by flow cytometry. briefly, cells were harvested and resuspended in pbs with % fetal bovine serum (fbs) and plated in v-bottom -well plates at × cells/well density. they were incubated for h at room temperature with µg/ml of the indicated vhh-fcs, control antibodies or recombinant biotinylated human ace , also dissolved in pbs with % fbs. then, the cells were washed twice with the same buffer, and incubated with fitc-conjugated goat anti-human igg (jackson immunoresearch) at : dilution or pe-conjugated streptavidin (thermo fisher, for the detection of biotinylated ace ) at : dilution for min at room temperature. cells were washed again following the secondary antibody treatment. then they were analyzed on a facscalibur cytometer (bd biosciences). cell populations were visualized as forward vs side scatter and gated to exclude dead cells. cells treated with no antibodies were used to establish background fluorescence. the resulting facs data were analyzed with the software flowjo (bd biosciences) and the graphs were generated with the software prism (graphpad).

in-vitro s protein binding assay. the -well elisa plates (greiner bio-one) were directly coated with sars-cov- s protein (acro biosystems) diluted in pbs at µg/ml and incubated overnight at °c. then, the plates were washed with pbs containing . % tween (pbst) and blocked with % bsa in pbs at room temperature for one hour. the plates were washed again with pbst and incubated with the test antibodies at room temperature for one hour. the antibodies were used at : serial dilutions.
the plates were washed with pbst, anti-human-fc antibodies conjugated to horseradish peroxidase (hrp) (jackson immunoresearch) were added at : dilution in pbst, and the plates were incubated at room temperature for h. after another pbst wash, the plates were treated with elisa development buffer solution containing amplex red and % h o . the emitted binding signal for each sample was detected by using a fluorescence plate reader. the blocking was measured in relative fluorescence units (rfu) and the % inhibition was calculated as follows: % inhibition = (1 − (mean experimental value/mean of no antibody control)) × 100.

s/ace blocking assay. the -well elisa plates (greiner bio-one) were coated with sars-cov- s protein (acro biosystems) and incubated overnight as stated previously. then, the plates were washed with pbst and blocked with % bsa in pbs at room temperature for one hour. the plates were washed again with pbst and incubated with the test antibodies at room temperature for min. the antibodies were used at : serial dilutions. then, recombinant biotinylated-ace (acro biosystems) was directly added to the plates at . µg/ µl and incubated at room temperature for another min. the plates were washed with pbst followed by the addition of streptavidin conjugated to hrp at : dilution in pbst. the plates were incubated at room temperature for another min. then, they were washed with pbst and treated with elisa development buffer. the emitted binding signal for each sample was detected by using a fluorescence plate reader.

analysis of physical characteristics of vhh-fcs. the purified vhh-fcs were analyzed by the uncle system (unchained labs) for their thermostability using differential scanning fluorimetry (dsf) and static light scattering (sls), and for their aggregation potential using dynamic light scattering (dls) assays. the dls was measured at °c and the data were analyzed using uncle analysis software.
for dsf/dls assays, a temperature ramp of °c/min was performed with monitoring from to °c. sls was measured by uncle at nm and nm. tm and tagg were analyzed and calculated by the uncle analysis software.

with genscript biotech (piscataway, nj). briefly, pseudovirus expressing luciferase and containing sars-cov- s as the envelope glycoprotein in a lentiviral vector was produced in hek t cells, and the virus titer was determined by elisa. hek cells expressing ace receptor and transmembrane serine protease (tmprss ) were used as the target cells, and were seeded in -well plates. then, pseudovirus and serial dilutions of the antibodies were mixed with the target cells. the cells were incubated for h at °c and an amount of µl of the cell suspension was transferred to an assay plate. it was mixed with luciferase detection reagents from the bio-glo™ luciferase assay system (promega) and incubated for - min at room temperature. then, the luminescence was measured by a plate reader. the background rlu was subtracted from the rlu of the experimental samples. the values for % inhibition were derived from rlu as follows: % inhibition = (1 − (mean of experimental value − mean of cells treated only with buffer)/(mean of cells treated only with sars-cov- )) × 100.

antibody-dependent cellular cytotoxicity (adcc) assay. target expi cells expressing s protein ( sprot) were washed with rpmi media containing % horse serum and ng/ml il- , and plated in -well plates at × cells/well density. then, they were mixed with antibodies at a final concentration of µg/ml. then, nk- -cd cells expressing gfp were added to wells at × cells/well density (effector: target- : ) and the plates were incubated overnight at °c, % co . then, the cells were washed twice and resuspended in dpbs with % fbs. they were assessed by flow cytometry using a facscalibur cytometer (bd biosciences).
sprot and gfp-nk- -cd cells were each used as a reference to set up overall target cell gating and to establish the gfp positive nk- -cd populations, allowing differentiation between the nk- -cd effector cells and sprot target cells. the gfp negative sprot cell percentage was evaluated for all samples. then, cell death percentage was calculated as follows: % cell death = (1 − (antibody treated cell percentage/average of isotype control percentage)) × 100.

statistical analysis. the four-parameter non-linear regression analysis from prism software version . . was used for all binding and blocking curves, and also provided the ic values for the blocking assays. all error bars represented in the data are based on standard deviation, unless otherwise specified.

the data generated and/or analyzed during this study are available from the corresponding author on reasonable request.

severe acute respiratory syndrome coronavirus -specific antibody responses in coronavirus disease patients remdesivir for the treatment of covid- -preliminary report papain-like protease regulates sars-cov- viral spread and innate immunity a pneumonia outbreak associated with a new coronavirus of probable bat origin the severe acute respiratory syndrome angiotensin-converting enzyme is a functional receptor for the sars coronavirus dipeptidyl peptidase is a functional receptor for the emerging human coronavirus-emc difference in receptor usage between severe acute respiratory syndrome (sars) coronavirus and sars-like coronavirus of bat origin functional assessment of cell entry and receptor usage for sars-cov- and other lineage b betacoronaviruses a human neutralizing antibody targets the receptor binding site of sars-cov- human neutralizing antibodies elicited by sars-cov- infection cross-neutralization of sars-cov- by a human monoclonal sars-cov antibody potent neutralizing antibodies against sars-cov- identified by high-throughput single-cell sequencing of convalescent patients' b cells
structural basis for potent neutralization of betacoronaviruses by single-domain camelid antibodies a human monoclonal antibody blocking sars-cov- infection neutralizing nanobodies bind sars-cov- spike rbd and block interaction with ace single-domain antibodies and their formatting to combat viral infections. antibodies (basel) , single-domain antibody fragments with high conformational stability development of multi-specific humanized llama antibodies blocking sars-cov- /ace interaction with high affinity and avidity antibody structure determination using a combination of homology modeling, energy-based refinement, and loop prediction structure-based approach to the prediction of disulfide bonds in proteins applying physics-based scoring to calculate free energies of binding for single amino acid mutations in protein-protein complexes decoys as the reference state) potentials for proteinprotein docking an fft-based protein docking program with pairwise potentials the th meeting on the critical assessment of predicted interaction (capri) held at the mare nostrum a noncompeting pair of human neutralizing antibodies block covid- virus binding to its receptor ace the molecular biology of coronaviruses architecture of the sars coronavirus prefusion spike cryo-em structure of the -ncov spike in the prefusion conformation isolation of a human monoclonal antibody specific for the receptor binding domain of sars-cov- using a competitive phage biopanning strategy cd -tcb with obinutuzumab pretreatment as next-generation treatment of hematologic malignancies enhanced infection of liver sinusoidal endothelial cells in a mouse model of antibody-induced severe dengue disease the potential danger of suboptimal antibody responses in covid- rapid evolution of rna genomes complexities of viral mutation rates neutralization of virus infectivity by antibodies: old problems in new perspectives nk-mediated antibody-dependent cell-mediated cytotoxicity in solid tumors: biological 
evidence and clinical perspectives complement in monoclonal antibody therapy of cancer antibody-dependent cellular phagocytosis in antiviral immune responses fc-mediated antibody effector functions during respiratory syncytial virus infection and disease engineering multi-specific antibodies against hiv- developability assessment during the selection of novel therapeutic antibodies structure, heterogeneity and developability assessment of therapeutic antibodies

we thank all ab studio inc. and ab therapeutics inc. members for their valuable support and input in this project.

all authors of this study are employees of ab studio inc. yue liu also serves as the chief executive officer of ab therapeutics inc.

supplementary information is available for this paper at https://doi.org/ . /s - - -y.

correspondence and requests for materials should be addressed to j.d.

reprints and permissions information is available at www.nature.com/reprints.

publisher's note springer nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

license, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the creative commons licence, and indicate if changes were made. the images or other third party material in this article are included in the article's creative commons licence, unless indicated otherwise in a credit line to the material. if material is not included in the article's creative commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. to view a copy of this licence, visit http://creativecommons.org/licenses/by/ . /.
key: cord- - y yxcp authors: lim, hocheol; baek, ayoung; kim, jongwan; kim, min sung; liu, jiaxin; nam, ky-youb; yoon, jeonghyeok; no, kyoung tai title: hot spot profiles of sars-cov- and human ace receptor protein protein interaction obtained by density functional tight binding fragment molecular orbital method date: - - journal: sci rep doi: . /s - - - sha: doc_id: cord_uid: y yxcp the prevalence of a novel β-coronavirus (sars-cov- ) was declared as a public health emergency of international concern on january and a global pandemic on march by who. the spike glycoprotein of sars-cov- is regarded as a key target for the development of vaccines and therapeutic antibodies. in order to develop anti-viral therapeutics for sars-cov- , it is crucial to find amino acid pairs that strongly attract each other at the interface of the spike glycoprotein and the human angiotensin-converting enzyme (hace ) complex. in order to find hot spot residues, the strongly attracting amino acid pairs at the protein–protein interaction (ppi) interface, we introduce a reliable inter-residue interaction energy calculation method, fmo-dftb /d/pcm/ d-spies. in addition to the sars-cov- spike glycoprotein/hace complex, the hot spot residues of sars-cov- spike glycoprotein/hace complex, sars-cov- spike glycoprotein/antibody complex, and hcov-nl spike glycoprotein/hace complex were obtained using the same fmo method. following this, a d-spies-based interaction map was constructed with hot spot residues for the hace /sars-cov- spike glycoprotein, hace /hcov-nl spike glycoprotein, and hace /sars-cov- spike glycoprotein complexes. finally, the three d-spies-based interaction maps were combined and analyzed to find the consensus hot spots among the three complexes. as a result of the analysis, two hot spots were identified between hace and the three spike proteins. in particular, e , k , g , and d of the hace receptor strongly interact with the spike proteins of coronaviruses. 
the d-spies-based map would provide valuable information to develop anti-viral therapeutics that inhibit ppis between the spike protein of sars-cov- and hace . domain (rbd) of s undergoes hinge-like conformational changes that transiently conceal or reveal the determinants of receptor binding . sars-cov- could possibly use angiotensin-converting enzyme (hace ), the same receptor as sars-cov and hcov-nl . since the essential function of the s protein is to penetrate host cells, it is considered as the optimal target for the prevention of cell infection. for this reason, s protein-targeted antibody-mediated neutralization has been considered as a suitable treatment for sars-cov diseases. therefore, the hot spot analysis on the interface between the rbd domain of the s subunit and the hace receptor would provide crucial information for antibody engineering and for small-molecular drug development. to investigate protein-protein interactions (ppis) between hace and the rbd domain of s subunit at the molecular level, an ab initio quantum mechanical (qm) method was introduced. this method was used to obtain the most accurate information on the ppis through analysis of the wave function obtained from the qm calculation, especially of the fragment molecular orbital (fmo) approximation method. even with the fmo method, the calculations in a biomolecular system need a huge amount of computer resources. in order to obtain results within a reasonable computation time while maintaining a certain degree of accuracy of ab initio mo, we introduced the density functional tight-binding (dftb) method, which is an efficient parameterized qm method and is expected to exhibit reasonable accuracy at a remarkably reduced computational cost . the fmo method is one of various linear-scaling methods to reduce the huge computational cost of qm calculations by the fragmentation of target molecules. the energies of fragment and their pairs are computed in the embedding electrostatic potential . 
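as a rough illustration of the fragment-based expansion described above, the two-body fmo energy is commonly written as the sum of the fragment (monomer) energies plus a correction for every fragment pair, e ≈ Σ_i e_i + Σ_{i<j} (e_ij − e_i − e_j). the python sketch below evaluates this expansion for made-up numbers. the fragment labels, the energy values and the function name are hypothetical, and the embedding electrostatic potential is assumed to be already folded into each input energy.

```python
from itertools import combinations


def two_body_total_energy(monomer_energy, dimer_energy):
    """Sum monomer energies, then add (E_IJ - E_I - E_J) for every fragment pair."""
    total = sum(monomer_energy.values())
    for i, j in combinations(sorted(monomer_energy), 2):
        pair_correction = dimer_energy[(i, j)] - monomer_energy[i] - monomer_energy[j]
        total += pair_correction
    return total


# Hypothetical fragment and pair energies in arbitrary units, for illustration only.
monomer_energy = {"frag1": -10.0, "frag2": -20.0, "frag3": -30.0}
dimer_energy = {
    ("frag1", "frag2"): -30.5,   # pair stabilized by 0.5 relative to the separate fragments
    ("frag1", "frag3"): -40.25,  # pair stabilized by 0.25
    ("frag2", "frag3"): -50.0,   # no extra interaction
}
total = two_body_total_energy(monomer_energy, dimer_energy)
print(total)
```

the cost saving comes from the fact that only fragments and fragment pairs are ever computed, never the whole system at once; the pair corrections are also the quantities that an interaction analysis inspects residue by residue.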
recently, the fmo method has been combined with dftb, and the polarizable continuum model (pcm) was introduced to consider the effect of a solvent on a model system . pair interaction energies (pies) among the fragments of the model system from the fmo-dftb/pcm method correlate well with pies from ab initio dft fmo/pcm and with møller-plesset perturbation theory (mp ) fmo/pcm . in our earlier work, we investigated ppis between programmed cell death and its ligand pd-l using fmo-mp /pcm, and the results efficiently explained the experimental site-directed mutagenesis data . in this work, to find common hot spot amino acids on the interfaces between the rbd domain and hace of the three complexes, rbd-sars-cov- /hace (twelve experimental structures), rbd-sars-cov- /hace (four experimental structures), and rbd-hcov-nl /hace (one experimental structure), we performed fmo-dftb /d/pcm calculations. to visualize the interaction energy and the distance of the interacting amino acid pairs, the fmo/ d-spies analysis tool was introduced. to narrow down the hot spot region, we also performed the same calculation with rbd-sars-cov- /antibody complexes (five experimental structures). based on the fmo/ d-spies results, we constructed d-spies-based interaction maps of the hace and rbd domains from sars-cov- , hcov-nl , and sars-cov- . in order to validate the ppi predictability of the fmo-dftb /d/pcm results, we compared them with the site-directed mutagenesis results. consequently, we summarized the fmo-dftb /d/pcm/ d-spies results as interaction maps and found the hot spot regions in rbd-sars-cov- and hace at a qm level.

all experimental structures calculated in this work are summarized in table . all missing side chains were filled using prime implemented in the maestro program . hydrogen atoms were added to the crystal structures at ph . and their positions were optimized with the propka function implemented in the maestro program .
water molecules in the crystal structures were included in the fmo calculations to explore their roles in ppis. in the fmo calculations, all n-acetylglucosamine (glcnac) residues in the rbd domains and hace were included, only the rbd domains of the three coronaviruses were included, and only the fv domains of antibodies that bind to rbd domains were included. all fmo calculations were performed with the version feb , gamess . the two-body fmo method was applied to all calculations in this work using the fmo /dftb method; the latter is a recent extension of the self-consistent-charge density functional tight-binding method, derived via a third-order expansion of the dft method . dftb calculations were performed using the ob parameter set , , and the uff-type dispersion correction (dftb /d) , . because the exchange-repulsion term is not computed in dftb, the ex terms are all zero (see supplementary table s -s ) . the polarizable continuum model (pcm), an implicit solvation model, was employed together with the explicit water molecules present in the x-ray crystal and cryo-em structures. all input files were prepared in compliance with the hybrid orbital projection (hop) scheme fragmentation . each residue and water molecule was defined as one fragment. two cysteine residues forming a disulfide bridge were defined as one fragment, and a glcnac together with the asparagine residue to which it is covalently bonded was defined as one fragment. all d-spies results were generated with the reported protocol . in the protocol, we selected only pies within a specific distance ( . Å) between two fragments, which reflects the distance used for the approximation of the electrostatic potential in the fmo method . we considered an interaction with a pie more stable than − . kcal/mol to be significant on the basis of previous reports , , . 
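The selection step of the protocol described above (keep only fragment pairs within a distance cutoff whose PIE is more stable than an energy threshold) can be sketched as follows. This is a minimal illustration, not the authors' tool: the `PairInteraction` record, the function name, and the fragment labels are hypothetical, and both cutoffs are left as parameters because their values are not legible in this text.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class PairInteraction:
    """One fragment pair from an FMO run (hypothetical record layout)."""
    fragment_a: str   # label of the first fragment, e.g. a receptor residue
    fragment_b: str   # label of the second fragment, e.g. an RBD residue
    distance: float   # closest inter-fragment distance, in angstrom
    pie: float        # pair interaction energy, in kcal/mol (negative = attractive)


def select_significant_pairs(
    pairs: Iterable[PairInteraction],
    max_distance: float,
    pie_threshold: float,
) -> List[PairInteraction]:
    """Keep pairs within max_distance whose PIE is more stable than pie_threshold.

    "More stable" means more negative, so the comparison is pie < pie_threshold.
    """
    return [
        p for p in pairs
        if p.distance <= max_distance and p.pie < pie_threshold
    ]
```

Both cutoffs should be taken from the protocol itself; any values passed in while experimenting are placeholders.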
to investigate ppis between hace and the rbd domains of the three coronaviruses, we collected the experimental structures summarized in table and performed fmo-dftb /d/pcm calculations for all of them. because mutations cause structural rearrangements (summarized in table s ), we collected all available structures and considered them together. subsequently, we performed hot spot analysis using the fmo/ d-spies tool. hot spot region between hace and rbd-sars-cov- . in order to investigate the hot spot region in the complex of the sars-cov- rbd and the hace receptor, we performed fmo calculations on rbd-sars-cov- /hace complexes (supplementary table s -s ). we summarized the fmo results in fig. . when comparing the amino acid pairs of this study with the mutagenesis experimental results from two papers, it was confirmed that of the amino acid pairs correlated with the experimental results. changes in binding affinity caused by mutation can be explained by comparing the structural changes (i.e. changes in the amino acid pairs that contribute to the increase or decrease of the binding affinity) of the mutated proteins with those of the wild-type proteins. qu et al. reported that the n k/t s mutation on rbd-sars-cov- lowers the binding affinity . one complex (pdb id: d h) has the t s mutation in rbd-sars-cov- . t in wt rbd-sars-cov- has attractive interactions with amino acids, y , g , n , g , f , and r , whereas s in the mutated complex has attractive interactions with only amino acids, n , g , and r . wu et al. reported that the k t mutation on hace increases the binding affinity , because the k in wt hace (pdb id: ajf) interacts only with y of rbd-sars-cov- , whereas t in the mutated hace (pdb id: d g) interacts with two amino acids, y and y . the common hot spot region in rbd-sars-cov- against hace and sars-cov- antibodies. 
in order to narrow down the hot spot regions between hace and rbd-sars-cov- , we performed fmo calculations on four rbd-sars-cov- /antibody complexes (supplementary table s -s ). we summarized the fmo results in fig. . when comparing the amino acid pairs of this study with the previously reported results, it was confirmed that of the amino acid pairs are correlated: r /y , t /w , n /r , y /d , p /d , n /d , d /r , d /s , d /n , d /r , y /r , y /y , t /y , t /y , t /d , g /a , and y /d . in the rbd-sars-cov- /m (pdb id: dd ) complex, the fmo results detected amino acid pairs, which are summarized in supplementary table s . the amino acid pairs that contributed to the stability of the complexes are well correlated with the published site-directed mutagenesis study, in which the t mutation does not significantly affect the neutralizing activity of the antibody . the fmo results suggest that the t s mutation would change only minor van der waals interactions between t and hc y . in the rbd-sars-cov- /s (pdb id: nb , nb ) complex, the fmo results detected amino acid pairs, which are summarized in supplementary table s -s . the s binds to rbd-sars-cov- in two different states. the fmo results of state are detailed in supplementary table s , and those of state are given in supplementary table s -s . in the rbd-sars-cov- / the interactions between four antibodies ( r, m , s , and f g ) and the rbd domain from sars-cov- are shown on the right-hand side with color bars. the main hot spot region is colored in light red, the secondary hot spot region in hace is colored in light blue, and all interactions shown in this map have attractive pie values more stable than − . kcal/mol, whose magnitudes are ignored. in order to find common hot spot amino acids in rbd-sars-cov- against hace and sars-cov- antibodies, we illustrated the fmo results with a d-spies-based map (see fig. ). 
all four antibodies ( r, m , s , and f g ) and hace have two common amino acids, r and t , in rbd-sars-cov- . three of the four antibodies and hace have four common amino acids, t , g , i , and y , in rbd-sars-cov- . two of the four antibodies and hace have two common amino acids, f and q , in rbd-sars-cov- . only s and hace share four common amino acids, d , n , y , and y , in rbd-sars-cov- . only r and hace share two common amino acids, q and y , in rbd-sars-cov- . the other interactions between antibodies and rbd-sars-cov- are not shared with the interactions between hace and rbd-sars-cov- . considering the possibility of mutation prediction in viruses by the fmo methods , , sars-cov- could evolve to elude antibody neutralization by altering the rbd residues whose interactions with an antibody are not shared with the hace receptor. according to the map, there are two hot spot regions between hace and rbd-sars-cov- (see fig. ). the main hot spot region on hace consists of d , y , k , d , and several other residues. its counterpart on rbd-sars-cov- comprises r , t , t , i , y , and so on. the secondary hot spot region on the hace receptor consists of d , k , and several other residues. its counterpart on rbd-sars-cov- comprises y , d , n , n , and so on. we found that sars-cov- antibodies focus on the main hot spot to block the formation of amino acid pairs between hace and rbd-sars-cov- . although the rbd of hcov-nl does not share structural homology with the rbds of sars-cov- and sars-cov- , the three viruses recognize the same hace receptor to invade host cells. in order to investigate the hot spot region between hcov-nl and hace , we performed fmo calculations on the hcov-nl /hace complex (pdb id: kbh). the fmo results in which amino acid pairs were detected are summarized in supplementary table s . the fmo results were in agreement with the six amino acid pairs (hace /rbd-hcov-nl ) previously reported by wu et al. : d /s , h /g , h /s , e /y , m /h , and g /g . 
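The bookkeeping behind the shared-residue comparison above (how many antibodies contact each RBD residue that the receptor also contacts, and which antibody contacts are not shared with the receptor) reduces to set operations. The sketch below is illustrative only; the function names and the placeholder residue labels are hypothetical, and the real contact sets would come from the significant PIE pairs.

```python
from typing import Dict, Iterable, Set


def shared_residue_counts(
    receptor_contacts: Iterable[str],
    antibody_contacts: Dict[str, Set[str]],
) -> Dict[str, int]:
    """For each RBD residue contacted by the receptor, count the antibodies
    that contact the same residue."""
    return {
        residue: sum(residue in contacts for contacts in antibody_contacts.values())
        for residue in receptor_contacts
    }


def unshared_antibody_contacts(
    receptor_contacts: Iterable[str],
    antibody_contacts: Dict[str, Set[str]],
) -> Dict[str, Set[str]]:
    """For each antibody, list the RBD residues it contacts that the receptor
    does not contact."""
    receptor = set(receptor_contacts)
    return {name: contacts - receptor for name, contacts in antibody_contacts.items()}


if __name__ == "__main__":
    # Placeholder labels, not residues from the paper.
    receptor = {"res_a", "res_b", "res_c"}
    antibodies = {"ab1": {"res_a", "res_x"}, "ab2": {"res_a", "res_b"}}
    print(shared_residue_counts(receptor, antibodies))
    print(unshared_antibody_contacts(receptor, antibodies))
```

Residues with a high shared count correspond to the common hot spot, while the unshared sets are the positions where a mutation could affect antibody binding without directly touching a receptor contact.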
in order to find amino acids in hot spot regions in the ppi interface between sars-cov- and hace , we performed fmo calculations on four sars-cov- /hace complexes (supplementary table s to investigate the common hot spot region on hace against rbds from the three viruses, and vice versa, we illustrated the fmo results in fig. . in the three viruses, all rbds have common interactions with d , k , e , k , g , and d in hace . sars-cov- and sars-cov- have common interactions with the s , q , f , e , a , d , y , q , y , e , n , and r in hace . only sars-cov- and hace share interactions with e , a , f , t , q , g , and f , whereas only nl -cov and hace share interactions with n , m , and f . the common interactions between sars-cov and nl -cov were h and r in hace . we created a d-spies based interaction map to find the hot spot regions from the ppi information between hace and rbd-sars-cov- (see figs. and ) . when comparing the interacting residues between hace and rbd of the three viruses, there are two hot spot regions consisting of shallow grooves on the hace receptor. the main hot spot is formed by e , k , g and d . the secondary hot spot consists of d and k . according to the map, the main hot spot is expected to be the most important hot spot between hace and rbd-sars-cov- . we observed that the secondary hot spot on hace has interactions with k , l , e , p , and q in rbd-sars-cov- , whereas the main hot spot has interactions with r , f , q , t , n , g , y , and q in rbd-sars-cov- . the results from the common hot spot region in sars-cov- antibodies supported the results that the main hot spot region was important for the ppi between rbd-sars-cov- and its antibodies. in the results of sars-cov- and its antibody (b ) summarized in supplementary table s , the antibody had interactions with r , q , n , g , and y of rbd-sars-cov- , which are the counterpart of the main hot spot. 
the hot spot regions suggested in this work can be used to develop antibodies and antiviral agents. even though the fmo method was successfully applied to evaluate ppis, analysis of biomolecular systems still requires huge computational costs. here, we combined parameterized quantum chemical approaches (fmo-dftb /d/pcm) and the d-scattered pair interaction energies ( d-spies) protocol to analyze ppis in the sars-cov- /hace complex. the fmo-dftb /d/pcm/ d-spies results also showed a qualitative correlation with site-directed mutagenesis results, as did the fmo-mp /pcm/ d-spies results in our earlier work . the reliable inter-residue interaction energy calculation method, fmo-dftb /d/pcm/ d-spies, would be a powerful tool for drug discovery and protein engineering in the future. furthermore, the quantum-mechanical-level hot spot analysis results will provide new directions for antibody engineering and small-molecule development. the d-spies-based map would provide valuable information for the discovery of anti-viral therapeutics that inhibit ppis between the spike protein of sars-cov- and hace . 
a familial cluster of pneumonia associated with the novel coronavirus indicating person-to-person transmission: a study of a family cluster clinical features of patients infected with novel coronavirus in wuhan cryo-em structure of the -ncov spike in the prefusion conformation the origin, transmission and clinical therapies on coronavirus disease (covid- ) outbreak-an update on the status structure, function, and evolution of coronavirus spike proteins the coronavirus spike protein is a class i virus fusion protein: structural and functional characterization of the fusion core complex dftb : extension of the self-consistent-charge density-functional tight-binding method (scc-dftb) fragment molecular orbital method: an approximate computational method for large molecules the fragment molecular orbital method combined with density-functional tight-binding and the polarizable continuum model investigation of protein-protein interactions and hot spot region between pd- and pd-l by fragment molecular orbital method on the role of the crystal environment in determining protein side-chain conformations propka : consistent treatment of internal and surface residues in empirical p k a predictions gamess as a free quantum-mechanical platform for drug research parametrization and benchmark of dftb for organic molecules parameterization of dftb / ob for sulfur and phosphorus for chemical and biological applications an efficient a posteriori treatment for dispersion interaction in density-functional-based tight binding uff, a full periodic table force field for molecular mechanics and molecular dynamics simulations fragment molecular orbital method: application to polypeptides fragment molecular orbital method: use of approximate electrostatic potential the fragment molecular orbital method reveals new insight into the chemical nature of gpcr-ligand interactions exploring chemistry with the fragment molecular orbital method a virus-binding hot spot on human 
angiotensin-converting enzyme is critical for binding of two different coronaviruses receptor and viral determinants of sars-coronavirus adaptation to human ace identification of two critical amino acid residues of the severe acute respiratory syndrome coronavirus spike protein for its variation in zoonotic tropism transition via a double substitution strategy mechanisms of host receptor adaptation by severe acute respiratory syndrome coronavirus structural basis of neutralization by a human anti-severe acute respiratory syndrome spike protein antibody, r structure of severe acute respiratory syndrome coronavirus receptor-binding domain complexed with neutralizing antibody possibility of mutation prediction of influenza hemagglutinin by combination of hemadsorption experiment and quantum chemical calculation for antibody binding prediction of probable mutations in influenza virus hemagglutinin protein based on large-scale ab initio fragment molecular orbital calculations crystal structure of nl respiratory coronavirus receptor-binding domain complexed with its human receptor h.l. a.b., and j.k. contributed equally to this work. m.k. and j.l. supported this work by collecting mutation data. all authors contributed to writing the manuscript and approved the final version of the manuscript. the authors declare no competing interests. supplementary information is available for this paper at https ://doi.org/ . /s - - - .correspondence and requests for materials should be addressed to k.t.n.reprints and permissions information is available at www.nature.com/reprints.publisher's note springer nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.open access this article is licensed under a creative commons attribution . 
international license, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the creative commons licence, and indicate if changes were made. the images or other third party material in this article are included in the article's creative commons licence, unless indicated otherwise in a credit line to the material. if material is not included in the article's creative commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. to view a copy of this licence, visit http://creat iveco mmons .org/licen ses/by/ . /. key: cord- - dfiwo authors: paris, kristina a.; santiago, ulises; camacho, carlos j. title: loss of ph switch unique to sars-cov supports unfamiliar virus pathology date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: dfiwo cell surface receptor engagement is a critical aspect of viral infection. this paper compares the dynamics of virus-receptor interactions for sars-cov (cov ) and cov . at low (endosomal) ph, the binding free energy landscape of cov and cov interactions with the angiotensin-converting enzyme (ace ) receptor is almost the same. however, at neutral ph the landscape is different due to the loss of a ph-switch (his lys) in the receptor binding domain (rbd) of cov relative to cov . namely, cov stabilizes a transition state above the bound state. in situations where small external strains are applied by, say, shear flow in the respiratory system, the off rate of the viral particle is enhanced. as a result, cov virions are expected to detach from cell surfaces in time scales that are much faster than the time needed for other receptors to reach out and stabilize virus attachment. 
on the other hand, the loss of this ph-switch, which sequence alignments show is unique to cov , eliminates the transition state and allows the virus to stay bound to the ace receptor for time scales compatible with the recruitment of additional ace receptors diffusing in the cell membrane. this has important implications for viral infection and its pathology. cov does not trigger high infectivity in the nasal area because it either rapidly drifts down the respiratory tract or is exhaled. by contrast, this novel mutation in cov should not only retain the infection in the nasal cavity until ace -rich cells are sufficiently depleted, but also require fewer particles for infection. this mechanism explains observed longer incubation times, extended period of viral shedding, and higher rate of transmission. these considerations governing viral entry suggest that number of ace -rich cells in human nasal mucosa, which should be significantly smaller for children (and females relative to males), should also correlate with onset of viral load that could be a determinant of higher virus susceptibility. critical implications for the development of new vaccines to combat current and future pandemics that, like sars-cov , export evolutionarily successful strains via higher transmission rates by viral retention in nasal epithelium are also discussed. although accurate assessments are still evolving, reports from the world health organization indicate that infection with sars-cov- (cov ) is significantly different relative to infection with previous respiratory viruses. the biggest distinctions are the longer incubation period and increased viral shedding (both likely responsible for higher rates of transmission), strong correlation of infected-fatality rates (ifr) with age and comorbidities, higher ifr for males relative to females, and minimal impact in children. 
as much as % of deaths from cov relate to cardiovascular complications (akhmerov and marban, ) , while cellular and animal models have also revealed inappropriate inflammatory responses along with high chemokine production (blanco-melo et al., ) . using thermodynamic, kinetic and molecular modeling, we explored potential reasons for these cov -unique aspects and aimed to identify determinants of its complex pathology. while it stands to reason that some of the answers may be found in the complex genomic changes triggered by the virus in cells, tissues, and organs, longer incubation and infectivity time scales also suggest that differences could have a biophysical origin. work on sars-cov (cov ) has already determined that the virus enters cells via receptormediated endocytosis in a ph-dependent manner (wang et al., ) that is characterized by cotranslocation of the viral spike glycoprotein and its specific functional receptor, the angiotensinconverting enzyme (ace ), from the cell surface to early endosomes. key steps that control the fate of the virus in the early and late endosome are driven in part by lowering the ph from . -to- . and from . -to- . (bui et al., ) , respectively; exposure to low ph triggers a spike catalyzed fusion between the viral and endosomal membranes followed by viral genome release. the cell machinery is then hijacked to replicate and assemble new virus particles that eventually exit the cell, primarily through budding. broadly speaking, this is the same ph-dependent endocytic path followed by the influenza virus (qin et al., ; yamauchi, ) , and likely all other coronaviruses (including mers-cov). while infections by cov and cov are mediated by the ace receptor, mers-cov (mers) gains entry to cells through dpp (raj et al., ) . figure shows structures for the receptorbinding domains (rbd) in complex with their receptors for all three viruses. 
surprisingly and likely significantly, while the rbd of cov (li et al., ) and mers (wang et al., ) have one histidine on opposite ends of their binding interface, cov (wang et al., ) does not have his residues in this domain. as we have been able to determine to date, all known strains of cov have mutated away their last his residue that is still present in the rbd of cov /mers and related zoonotic viruses (see below). because cell surface receptor engagement is a critical aspect of viral infection and life cycle, and sensing ph is relevant for both viral replication and regulation of histidine protonation, we set to decipher the mechanistic role of the remaining his residue that distinguishes the rbds of cov and cov . results ph-switch. at low (endosomal) ph, the binding free energy landscape of cov and cov interactions with their ace receptor is almost the same. this is important because low ph is critical for the activation of the spike fusion in late endosomes (martin and helenius, ) . in particular, his in ace (fig. ) , located at the core of the rbd binding interface, should play a key role in this process. indeed, we predict based on the co-crystal structures that his + should readily form a hydrogen bond network that stabilizes the rbd/ace complex in both cov and cov (fig. ) . thus, loss of the ph-switch in cov has no impact on the low ph bound conformation with ace . on the other hand, at neutral ph the landscape is different due to the loss of the ph-switch (his lys) in the rbd of the spike protein of cov relative to cov . we studied this loss by performing three independent unconstrained molecular dynamics simulations (mds) of cov pdb dd (prabakaran et al., ) and cov from the receptor (apo) in pdb lzg at both physiological and low ph ~ . conditions (see methods in supplementary information). in practice, lowering the ph protonates the his residue from neutral to positively charged (his + ) by the addition of an extra hydrogen. 
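How strongly lowering the pH shifts a histidine toward its positively charged form can be estimated with the Henderson-Hasselbalch relation. The sketch below is a generic illustration and not part of the simulations described above; the function name is hypothetical, and the pKa must be supplied by the user (a value near 6 is typical for a free histidine side chain, but the pKa of a specific histidine inside a protein interface can differ substantially).

```python
def protonated_fraction(ph: float, pka: float) -> float:
    """Fraction of a titratable group in its protonated form at a given pH.

    Henderson-Hasselbalch: [HA+]/([HA+] + [A]) = 1 / (1 + 10**(pH - pKa)).
    """
    return 1.0 / (1.0 + 10.0 ** (ph - pka))


if __name__ == "__main__":
    assumed_pka = 6.0  # assumption: typical free-histidine side-chain value
    for ph in (7.4, 6.0, 5.0):
        print(f"pH {ph}: protonated fraction {protonated_fraction(ph, assumed_pka):.3f}")
```

When the pH equals the pKa the group is half protonated, and each pH unit below the pKa raises the ratio of protonated to neutral species tenfold.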
of note, although not studied here, the ph-switch in the rbd of mers has previously been observed in ph-dependent crystal structures (zhang et al., ) . mds revealed two distinct regions in the binding interface of cov : (a) a loop motif (f spdgkpctppalncy ) that for his and his + shows mostly not-bound-like and bound-like conformations, respectively; and (b) the rest of the binding interface, which consistently adopts bound-like conformations that are independent of ph (fig. a-c) . figure a -b shows the dominant clusters observed in the his and his + simulations of cov , as well as the ph-independent simulation of cov . figure c -d shows detailed analyses of the corresponding root-mean-square deviations (rmsds) of these two regions relative to the co-crystals as a function of time (additional mds are shown in fig. s ). the plots include the equilibration time (between - ns) in order to emphasize that the distinct trajectories were not biased by different initial conformations. remarkably, deletion of the ph switch in cov , which replaces the last histidine, his , in cov with lys in cov , generates a motif, which includes t eiyqag , that yields almost exclusively bound-like conformations (fig. d) ; i.e., cov is always ready to bind ace . it is interesting to note that the effect of both lys in cov and his + in cov (fig. c -d) is to stabilize the bound-like state. implications of these findings for the binding free energy landscape are sketched in fig. . namely, at low (late endosomal) ph, the landscapes of cov and cov interactions with the ace receptor are very similar. however, at neutral ph the landscape is different due to the loss of the ph switch in the rbd of cov relative to cov . specifically, the not-bound-like conformations of the ph-dependent loop in cov stabilize a higher free energy transition state, whereas the persistent bound-like behavior of cov yields a much tighter bond. dynamics of virus-receptor interactions for cov and cov . 
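The RMSD traces discussed above compare each simulation frame of a region with the same atoms in the co-crystal structure. A minimal sketch of the underlying quantity is given below. It assumes the two coordinate sets are already superimposed and list the same atoms in the same order; trajectory analysis packages normally handle the alignment step, which is omitted here, and the function name is hypothetical.

```python
import math
from typing import Sequence, Tuple

Coordinate = Tuple[float, float, float]


def rmsd(coords_a: Sequence[Coordinate], coords_b: Sequence[Coordinate]) -> float:
    """Root-mean-square deviation between two pre-aligned coordinate sets.

    No superposition is performed: both inputs must already share a frame of
    reference and contain the same atoms in the same order.
    """
    if len(coords_a) != len(coords_b) or len(coords_a) == 0:
        raise ValueError("coordinate sets must be non-empty and of equal length")
    squared = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b)
    )
    return math.sqrt(squared / len(coords_a))
```

Evaluating this per frame, separately for the loop atoms and for the rest of the interface, gives the two kinds of traces described in the text.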
what are the implications of this transition state found in cov but not in cov at neutral ph? to answer this question, we need to consider that viral particles in the respiratory system are under small external strains from, e.g., shear flow in the respiratory airways, when engaging cell surface receptors. the cell mechanics of these interactions can be described by applying the reaction-limited kinetics of membrane-to-surface adhesion and detachment first envisioned by dembo et al. (dembo et al., ) . specifically, one can write the free energy of the bound state (bs), $\Delta G_{bs}$, under a tensile force as a function of the cell-cell gap width $L$, a constant binding free energy $\Delta G_{bs}^{0}$, and a "spring" energy such that $$\Delta G_{bs}(L) = \Delta G_{bs}^{0} + \tfrac{1}{2}\kappa_{bs}\,(L-\lambda_{bs})^{2},$$ where $\lambda_{bs}$ is the equilibrium length for the bonded state. a similar equation holds for the free energy of the transition state (ts), $\Delta G_{ts}$, at the same cell-cell gap: $$\Delta G_{ts}(L) = \Delta G_{ts}^{0} + \tfrac{1}{2}\kappa_{ts}\,(L-\lambda_{ts})^{2}.$$ this treatment assumes equilibrium between bonded and de-bonded states, so there must be a very slow "ramp" rate for the force of pulling or pushing. if these conditions apply, the equilibrium constant for bond formation can be written as $$K_{eq}(L) = K_{eq}^{0}\exp\left[-\frac{\kappa_{bs}\,(L-\lambda_{bs})^{2}}{2k_{B}T}\right],$$ where $k_{B}$ is the boltzmann constant and $T$ the temperature. note that $K_{eq}(L) = e^{-\Delta G_{bs}(L)/k_{B}T} = k_{on}/k_{off}$. according to arrhenius theory, the de-bonding rate constant at a given gap width can then be written as $$k_{off}(L) = k_{off}^{0}\exp\left[\frac{\kappa_{bs}\,(L-\lambda_{bs})^{2} - \kappa_{ts}\,(L-\lambda_{ts})^{2}}{2k_{B}T}\right].$$ the mechanical or structural basis of the rbd/receptor interactions in fig. s can be characterized as a "door-knob" type junction across the gap, as opposed to a gripping or fish-hook bound state. the knob interactions with the receptor entail two characteristic patches, a large bound-like domain and the smaller switching loop (fig. ) , which one could model with spring constants $\kappa_{bl}$ and $\kappa_{loop}$, respectively (see red and yellow surfaces in fig. s ). then, the elastic constants of the bound and transition states can be written as $\kappa_{bs} \approx \kappa_{bl} + \kappa_{loop}$ and $\kappa_{ts} \approx \kappa_{bl}$, respectively, with the equilibrium rest lengths being essentially the same, i.e., $\lambda_{bs} = \lambda_{ts} = \lambda$. 
thus, virus detachment to the transition state corresponds to an ideal case of the theory, where the only allowed change between the bonded and transition state is in the spring constants, such that $$k_{off}(L) = k_{off}^{0}\exp\left[\frac{(\kappa_{bs}-\kappa_{ts})\,(L-\lambda)^{2}}{2k_{B}T}\right] = k_{off}^{0}\exp\left[\frac{\kappa_{loop}\,(L-\lambda)^{2}}{2k_{B}T}\right].$$ it is clear that the spring approximation should only apply for small deformations. in general, the springs across the gap undergo a "twisting" motion around their long axis to reach the transition state from the bonded state. the motion will increase "tightness" of the spring if $(\kappa_{bs} - \kappa_{ts}) < 0$, which defines a catch-bond. here, however, cov always loosens tightness, $(\kappa_{bs} - \kappa_{ts}) = \kappa_{loop} > 0$, corresponding to a typical slip-bond whose lifetime is shortened by tensile forces acting on the bond. free energy landscapes and estimates of bond detachment for cov /cov and ace . we use the fastcontact server (champ and camacho, ) to compute the electrostatic $\Delta G_{elec}^{0}$ and desolvation $\Delta G_{desol}^{0}$ binding free energies of the bound and transition states using co-crystal structures and chimeras that incorporate changes triggered by low ph structures. entropies coupled to the unbound state could be somewhat higher for cov relative to cov due to the larger conformational entropy associated with the switching loop in fig. . other error bars are correlated since interactions are very similar, such that the $\Delta\Delta G'$ values have an error bar of ± kcal/mol. absolute free energies need to account for size-dependent configurational and vibrational entropy changes upon binding, which for high affinity protein-protein complexes have been estimated to be anywhere between -to- kcal/mol. however, for flat and rather superficial complexes such as those here (fig. s ) , the entropy loss could be much lower. finally, the pre-exponential factor $k_{off}^{0}$ (eq. ) in the absence of a transition state and at equilibrium is exactly $k_{on}$ at m concentration, which for diffusion-limited association can be approximated by $k_{on}$ ~ \ s - (camacho et al., ) . (fig. a-c) . experimental equilibrium binding free energy is from (walls et al., ) . 
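The slip-bond behaviour described above can be made concrete with a short numerical sketch of the Dembo-type detachment rate, in which the off rate grows exponentially with the squared bond extension whenever the bound-state spring is stiffer than the transition-state spring. The function name, the unit choices (kcal/mol and angstrom), and the default temperature are assumptions made for illustration; no parameter values from the paper are implied.

```python
import math

# Boltzmann constant expressed per mole (the gas constant), in kcal/(mol*K).
K_B_KCAL_PER_MOL_K = 0.0019872041


def off_rate(
    gap: float,
    rest_length: float,
    kappa_bs: float,
    kappa_ts: float,
    k_off0: float,
    temperature: float = 310.0,
) -> float:
    """Detachment rate for a spring-like bond with equal rest lengths.

    gap, rest_length : gap width and equilibrium length, in angstrom
    kappa_bs, kappa_ts : spring constants of the bound and transition states,
                         in kcal/(mol*angstrom**2)
    k_off0 : unstressed off rate (any rate unit; returned in the same unit)
    temperature : absolute temperature in kelvin (assumed body temperature)
    """
    thermal_energy = K_B_KCAL_PER_MOL_K * temperature
    strain_energy = 0.5 * (kappa_bs - kappa_ts) * (gap - rest_length) ** 2
    return k_off0 * math.exp(strain_energy / thermal_energy)
```

With `kappa_bs - kappa_ts` positive the function reproduces a slip bond (stretching shortens the bond lifetime); with a negative difference it reproduces a catch bond.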
free energy estimates of bound complexes are fully consistent with experimental data (walls et al., ) ; alternative measurements have suggested a fold weaker binding (shang et al., ) . these differences will only re-scale $k_{off}^{0}$ by a factor of but will not be significant to our conclusions. the key observation is that cov stabilizes a transition state by about . kcal/mol above the bound state. as a result, small external strains applied by, say, shear flow in the respiratory system enhance the off rate of the viral particle as shown in eq. . thus, cov virions are expected to detach from cell surfaces on faster time scales. these binding free energy estimates are depicted in the landscapes in fig. . only at physiological ph should the landscape of cov display a ph-dependent transition state. other bonds are expected to break in an all-or-none type of transition. optimal dwelling times and endocytosis. in principle, the ph-switch in cov could provide a natural mechanism to optimize virus internalization. namely, cov is expected to "bounce around" cell surfaces many times before cell entry. if the density of receptors is high enough, a "stick-and-slip" approach could be an efficient mechanism to find clusters of receptors randomly distributed on the cell surface. on the other hand, if only a small number of cell surface receptors are available, then receptor diffusion will be the limiting step to accrue the critical number of receptors needed for endocytosis, and longer rbd/ace dwelling times will be required. of note, tighter binding to ace would also make it easier for a smaller number of cov particles to establish an initial foothold in the respiratory system compared to weak binding, where particles could be exhaled. broad estimates of "high" concentration, e.g., in the range of -to- , receptors, yield an average separation between receptors ~ . − . µm (see fig. b ) that is larger than the diameter of the virus ~ . µm . 
thus, after attachment of the first spike to its receptor, recruitment of a second receptor to stabilized virus attachment will be limited by other receptors circulating in the cell membrane. lateral protein diffusion in cell membranes is length-scale dependent, varying between ~ . (kusumi et al., ) for - nm and > nm, respectively. thus, diffusion time scales to bring two receptors into close proximity for the above length scales are ~ − . it is noteworthy that the number of surface receptors in cells have an upper limit of about , , which in the respiratory airways could limit infectivity to dwelling times of about cee &~ v (or ~ \ v ) based on ~ . µ (fig. ) . viral infection and its pathology. the lifetime of cov rbd/ace bonds at physiological ph (~ s) is marginally short-lived for efficiently triggering endocytosis, even at high ace receptor concentrations. as a result, cov virions are expected to detach from cell surfaces in time scales that are much faster than the time needed for other receptors to reach out and stabilize virus attachment. and for human nasal goblet cells, it will be significantly worse since, after each bounce, particles will be biased by gravity to either diffuse down the respiratory tract or be exhaled, where they will not find significant amounts of ace receptors until reaching lung alveolar epithelial cells (hamming et al., ) . on the other hand, deletion of the ph switch allows cov to have rbd/ace bonds with dwelling times of about ~ s, commensurate with the diffusion time scales needed to recruit enough ace receptors to trigger endocytosis. this mechanism implies that, for the most part, cov will not co-localize in the nasal cavity. this prediction is consistent with cov being mainly a lower respiratory tract disease, causing complications that include acute respiratory distress (ding et al., ; hamming et al., ) . 
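The order-of-magnitude reasoning above (how far apart receptors sit on a cell surface, and how long lateral diffusion needs to bring a second receptor to a bound virion) can be sketched with two textbook relations: a mean spacing of sqrt(area / number) for receptors spread evenly over a surface, and the two-dimensional diffusion relation <r^2> = 4 D t. The function names are hypothetical and no values from the paper are assumed; surface area, receptor count, and diffusion coefficient must be supplied by the user in consistent units.

```python
import math


def mean_receptor_separation(surface_area: float, n_receptors: int) -> float:
    """Average spacing between receptors spread evenly over a surface.

    Each receptor is assigned an area of surface_area / n_receptors, and the
    spacing is the square root of that area (same length unit as the input).
    """
    if surface_area <= 0 or n_receptors <= 0:
        raise ValueError("surface_area and n_receptors must be positive")
    return math.sqrt(surface_area / n_receptors)


def diffusion_time_2d(distance: float, diffusion_coefficient: float) -> float:
    """Time for lateral (2D) diffusion to cover a distance, from <r^2> = 4 D t.

    distance and diffusion_coefficient must use consistent units, for example
    micrometres and square micrometres per second, giving seconds.
    """
    if distance < 0 or diffusion_coefficient <= 0:
        raise ValueError("distance must be non-negative and D positive")
    return distance ** 2 / (4.0 * diffusion_coefficient)
```

Comparing the resulting diffusion time with the bond lifetime indicates whether a single RBD/receptor bond lasts long enough for a second receptor to arrive.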
viral replication in human mucous gland cells will release viruses back into the same area where they can infect new cells until the supply of ace receptors is depleted below the critical threshold needed for binding and internalization. this process will trap viral particles in the upper respiratory tract, naturally leading to longer incubation times. similarly, accumulation of viral particles in the nasal mucosa will lead to extended periods of viral shedding. of note, since viral transit to the lower respiratory tract will be significantly slower for cov relative to cov , this period of higher infectivity rates could be for the most part mediated by asymptomatic individuals. based on our findings, incubation times should correlate with the number of ace -rich cells in the nasal area. it is important to note that children do not have well developed sinuses until adolescence (henson et al., ) . thus, large areas for viral replication will not be available in children, resulting in shorter incubation times due to the faster diffusion down to the lower respiratory tract. something similar could apply to females who have smaller nasal cavities relative to males (samolinski et al., ) . shorter times in the nasal cavity would lead to a lower viral load in the upper airways and could explain the lower transmission and milder symptoms that are observed in children, as well as the lower ifr in adult women relative to men. our proposed mechanism is also consistent with reported loss of sense of smell (anosmia) that may occur by day of a cov infection (speth et al., ) , as cells in the nasal cavities support olfactory mucous membranes needed for the perception of smell. proximity to the brain also suggest that cov infections could impact the brain in ways that other sars viruses cannot. 
moreover, cardiovascular and immunological complications triggered by cov could also be explained on the basis of long-term insult of endothelial cells by viral sequestration of the ace receptor (gurley and coffman, ). ph-switch across species. further support for the observation that cov is unique among coronaviruses is shown in table , which compares sequence alignments of ph domains in the rbds of cov , cov , mers, and other closely related zoonotic viruses. cov , and related coronaviruses in one species of pangolin and some bats do not share the ph-switch present in cov ; instead, they share the lys stabilization motif. however, these zoonotic viruses still have ph-switches that co-localize next to the ph-switch in the rbd of the mers structure (fig. ). interestingly, different bat-infecting strains show putative ph-switches that are closer in both sequence and structure to either mers or pangolin-associated coronavirus. while we have not yet found the species or strain where the loss of the ph-switch first occurred, these relationships point to the possible zoonotic origin of cov as well as evolutionary pressures to preserve the ph-switch. it is noteworthy that dpp , the receptor of the mers rbd, is not found in nasal epithelial cells (meyerholz et al., ). outlook. this newly discovered difference in protein sequence in the receptor binding domain of the spike glycoprotein and its impact on receptor binding reveals a mechanism that allows sars-cov internalization to take advantage of the high expression of ace in the nasal epithelium, resulting in increased retention times in the upper respiratory tract and augmented infectivity. this mechanism reconciles observed epidemiological traits and pathologies specific to sars-cov and explains differences with those associated with sars-cov, which, due to its stick-and-slip ph-switch, is unable to efficiently undergo endocytosis in the nasal cavity. sars-cov also has a higher infection fatality rate than sars-cov .
while the evolutionary advantage of higher infectivity by sars-cov in the nasal area is clear, this property comes at the expense of an important regulatory mechanism that would have allowed this virus to more readily move in other organs and tissues. in fact, the life-cycle of sars-cov is significantly slower than that of sars-cov because cov is essentially immobilized at its initial cell receptor contact. thus, it seems unlikely that the diffusion-limited recruitment of ace receptors affecting the virus in the respiratory airways would also be the limiting step in tissues. after internalization, the virus is encapsulated in a vesicle supported by rbd/ace complexes. the actual final number of complexes in each vesicle should vary above a given threshold, though not much is known about the details of this process. contrary to sars-cov, cov complexes would be expected to have greater difficulty slipping and breaking. it is not difficult to imagine that, for vesicles compressed by an excess of receptors, the fusion with the early endosome might be hindered, hosting a population of viruses that could stay latent or activate at much later times. this simple mechanism could underlie the still anecdotal evidence for infection recurrence (chen et al., ), as well as extremely long-term viral shedding. collectively, our studies provide insight pertinent to the molecular basis of viral infectivity and, at the same time, validate this form of thermodynamic and molecular modeling as an approach to probe the evolution of the next sars-mediated pandemic. from a therapeutic perspective, our findings linking viral pathology with long-term viral infection/retention in nasal epithelium of the upper respiratory tract suggest that vaccine development should not just concentrate on fighting systemic infection through induction of igg responses, but should instead aim to elicit high titers of secretory iga antibodies capable of neutralizing the virus in the nasal mucosa.
therefore, intranasal delivery of a vaccine with strong iga producing potential is a logical approach to consider as the next step in countering the current and future pandemics that, like sars-cov , export evolutionarily successful strains via higher transmission rates. surface representation of the co-crystals reveal two characteristic lobes with flat and mostly superficial contacts. yellow surface corresponds to ph-switch loop and red surface indicates remaining of binding interface. covid- and the heart imbalanced host response to sars-cov- drives development of covid- effect of m protein and low ph on nuclear transport of influenza virus ribonucleoproteins kinetics of desolvation-mediated protein-protein binding fastcontact: a free energy scoring tool for protein-protein complex structures recurrence of positive sars-cov- rna in covid- : a case report the reaction-limited kinetics of membrane-to-surface adhesion and detachment the clinical pathology of severe acute respiratory syndrome (sars): a report from china angiotensin-converting enzyme gene targeting studies in mice: mixed messages tissue distribution of ace protein, the functional receptor for sars coronavirus. 
a first step in understanding sars pathogenesis anatomy, head and neck, nose sinuses paradigm shift of the plasma membrane concept from the twodimensional continuum fluid to the partitioned fluid: high-speed single-molecule tracking of membrane molecules structure of sars coronavirus spike receptor-binding domain complexed with receptor transport of incoming influenza virus nucleocapsids into the nucleus dipeptidyl peptidase distribution in the human respiratory tract: implications for the middle east respiratory syndrome structure of severe acute respiratory syndrome coronavirus receptor-binding domain complexed with neutralizing antibody real-time dissection of dynamic uncoating of individual influenza viruses dipeptidyl peptidase is a functional receptor for the emerging human coronavirus-emc changes in nasal cavity dimensions in children and adults by gender and age structural basis of receptor recognition by sars-cov- single-particle tracking of immunoglobulin e receptors (fcepsilonri) in micron-sized clusters and receptor patches otolaryngol head neck surg function, and antigenicity of the sars-cov- spike glycoprotein sars coronavirus entry into host cells through a novel clathrin-and caveolae-independent endocytic pathway structure of mers-cov spike receptor-binding domain complexed with human receptor dpp structural and functional basis of sars-cov- entry by using human ace quantum dots crack the influenza uncoating puzzle structural definition of a unique neutralization epitope on the receptor-binding domain of mers-cov spike glycoprotein the protein data bank the pymol molecular graphics system, version . r pre, schrödinger, llc routine microsecond molecular dynamics simulations with amber on gpus. . generalized born routine microsecond molecular dynamics simulations with amber on gpus. . explicit solvent particle mesh ewald simmerling. 
ff sb: improving the accuracy of protein side chain and backbone parameters from ff sb development and testing of a general amber force field software for processing and analysis of molecular dynamics trajectory data vmd -visual molecular dynamics acknowledgements. this work was supported by nih gm , ns and protonated state of h + is predicted to form stable h-bond network with d in ace , and y and y in (a) cov and (b) cov , respectively. h-bond network is based on rotamers already observed in co-crystals of the unprotonated forms: (c) pdb ajf for cov , and, (d) pdb lzg for cov (see also pdb m j). unprotonated co-crystal structures of cov assigned h with a ndh making a bond with backbone oxygen of d , which is already making a bond in the a-helix.our mds indicate that even in the unprotonated form the rotamer should be rotated o having nd and ne more readily interacting with d and y (as shown in panel d). figure . role of ph switch in cov relative to cov . (a) and (b) show the same co-crystals as in fig. superimposed with centroid of largest . Å rmsd cluster of conformations of phswitching loop from mds shown in (c) for cov and (d) for cov . also indicated is the size of the corresponding cluster relative to simulation time. (c) root-mean-squared-deviation (rmsd) of amino acid loop as a function of time that switches between not-bound-like (~ . Å) to bound-like (~ . Å) relative to co-crystal (pdb ajf), for his and his + , respectively; (d) same analysis for cov homologous loop shows most conformations under Å rmsd relative to pdb lzg. binding interface, not including loop, stays in a bound-like conformation for % of the simulation time for both (e) cov and (f) cov . atomic coordinates for starting structures were acquired from the protein data bank [ ] : dd was used for cov rbd and lzg was used for the cov rbd. 
the rbd from dd (bound to neutralizing antibody) was chosen instead of that in pdb id ajf (bound to ace ) as a starting structure as it includes otherwise missing portions of the domain. modification of his to different tautomeric or protonation states was done with pymol's mutagenesis wizard [ ] . molecular dynamics simulations (mds) were carried out with pmemd.cuda from amber [ ] [ ] [ ] using amber ff sb force field [ ] and generalized amber force field (gaff) [ ] . we used tleap binary (part of amber ) for solvating the structures in a cubed tip p water box with a Å distance from structure surface to the box edges, and closeness parameter of . Å. the system was neutralized and solvated. simulations were carried out after minimizing the system, gradually heating the system from k to k over ps, and equilibrating the system for ns at npt. ns of production was then carried out using npt at k with the langevin thermostat, a non-bonded interaction cut off of Å, time step of fs, and the shake algorithm to constrain all bonds involving hydrogens. clustering was completed using cpptraj [ ] and h-bond and rmsd calculations were done with vmd [ ] . all figures were drawn using pymol [ ] and gnuplot. key: cord- -qbjrlog authors: okba, nisreen m. a.; widjaja, ivy; van dieren, brenda; aebischer, andrea; van amerongen, geert; de waal, leon; stittelaar, koert j.; schipper, debby; martina, byron; van den brand, judith m. a.; beer, martin; bosch, berend-jan; haagmans, bart l. title: particulate multivalent presentation of the receptor binding domain induces protective immune responses against mers-cov date: - - journal: emerging microbes & infections doi: . / . . sha: doc_id: cord_uid: qbjrlog middle east respiratory syndrome coronavirus (mers-cov) is a who priority pathogen for which vaccines are urgently needed. 
using an immune-focusing approach, we created self-assembling particles multivalently displaying critical regions of the mers-cov spike protein ─fusion peptide, heptad repeat , and receptor binding domain (rbd) ─ and tested their immunogenicity and protective capacity in rabbits. using a “plug-and-display” spytag/spycatcher system, we coupled rbd to lumazine synthase (ls) particles producing multimeric rbd-presenting particles (rbd-ls). rbd-ls vaccination induced antibody responses of high magnitude and quality (avidity, mers-cov neutralizing capacity, and mucosal immunity) with cross-clade neutralization. the antibody responses were associated with blocking viral replication and upper and lower respiratory tract protection against mers-cov infection in rabbits. this arrayed multivalent presentation of the viral rbd using the antigen-spytag/ls-spycatcher is a promising mers-cov vaccine candidate and this platform may be applied for the rapid development of vaccines against other emerging viruses such as sars-cov- . emerging zoonotic viruses, such as severe acute respiratory syndrome coronavirus (sars-cov) and middle east respiratory syndrome coronavirus (mers-cov) have been able to cross the species barrier posing a threat to the human population. mers-cov causes severe respiratory disease and fatalities in humans [ , ] , and the virus is continuously introduced into the human population through infected dromedary camels, the viral reservoir with resulting outbreaks [ ] . the wide geographical distribution of this viral reservoir, the high case-fatality rate in humans ( %), and the lack of treatment and licensed vaccines, make the virus a threat to the human population. this has put mers-cov on the recent who list of diseases having an epidemic or even pandemic potential for which countermeasures are lacking and are urgently needed [ ] . vaccination is potentially one of the most effective ways to prevent the ongoing mers-cov outbreaks. 
several mers-cov vaccine candidates have been developed using different platforms including inactivated, live-attenuated, and subunit vaccines [ ]. compared to other vaccine production platforms, recombinant subunit proteins have a higher safety profile, are relatively faster and easier to produce, and can be scaled-up in a more cost-effective manner; nonetheless, they tend to induce lower levels of protective immunity [ ]. the use of self-assembling multimeric protein scaffold particles (mpsp) to present antigens in a multivalent virus-mimicking manner (size, repetitiveness, and geometry) has been shown to enhance vaccine-induced immune responses [ ] [ ] [ ] [ ] [ ], and to offer advantages over other multimeric antigen presentation platforms (reviewed in [ ] ). both lumazine synthase (ls) and i - (i ) can self-assemble into -meric particles, which can be expressed in e. coli and have been used as scaffolds for development of multimeric vaccines with improved immune responses compared to monomeric forms [ ] [ ] [ ]. an ls-based hiv vaccine (eod-gt ) has recently advanced to a phase i human clinical trial (nct ). linking of antigens to these mpsp can be achieved through several mechanisms, e.g. genetic fusion or the spytag-spycatcher (st/sc) system [ ]. while the former requires the antigen and scaffold to be produced in the same expression system, the latter allows each to be expressed in its suitable system harnessing a rapid post-translational "plug-and-play" assembly. this is advantageous, allowing scaffold-sc to be produced at scalable levels in e. coli and spytagged glycosylated antigens such as viral surface proteins to be produced in their optimal system, such as mammalian or insect cells. the antigen-st can then be multivalently displayed on the surface of the sc-scaffolds through the spontaneous formation of a stable isopeptide bond.
this can be a platform for rapid vaccine manufacturing in case of epidemics or pandemics, to create optimized vaccines at reduced costs and also with reduced development times. the mers-cov spike (s) protein is the main target for subunit vaccine development [ ]. it assembles as a homotrimer and consists of an n-terminal head (s subunit) and a c-terminal stalk (s subunit). the s subunit mediates virus attachment and entry through its n-terminal s a domain and its c-terminal receptor binding domain (rbd), respectively [ , ]. the s a domain binds sialic acids, a viral attachment factor, while the rbd binds to the viral receptor, dipeptidyl peptidase (dpp ). following attachment and entry, the s subunit mediates viral fusion to the host cell through its fusion machinery, comprising the fusion peptide (fp) and the two heptad repeats, hr and hr [ ]. mers-cov neutralizing antibodies (abs) mainly recognize epitopes in the rbd of the spike head s subunit and, to a lower extent, epitopes in the sialic acid binding domain and the fusion-mediating more conserved s stalk (s ). nonetheless, antibodies directed against the sialic acid binding s a domain or the more conserved s subunit, although subdominant, may protect against mers-cov [ , ]. immune focusing can enhance immune responses to subdominant regions [ ]. in the current study, using ls and i self-assembling particles, we evaluated whether immune focusing and multivalent presentation can induce immune responses to the more sequence-conserved s regions: fp and hr . furthermore, using a spytag/spycatcher system and ls particles, we tested whether immune focusing with/without multivalent presentation of the viral rbd can lead to enhanced protection against a mers-cov challenge in rabbits. expression constructs were cloned using standard pcr methods. the gene encoding the , -dimethyl- -ribityllumazine synthase (ls; genbank accession no. wp_ . ) of a.
aeolicus was synthesized using human-preferred codons obtained from genscript usa, inc, as described previously [ ]. the cysteine at position and asparagine at position of ls were mutated to alanine and glutamine, respectively. the gene encoding i - (i ; pdb kp , amino acid residues - ) derived from thermotoga maritima was synthesized using human-preferred codons obtained from genscript usa. the gene fragments encoding the Δn spycatcher (sc; uniprot accession no. afd . ; amino acid residues - ; [ ] ) and spytag (st; uniprot accession no. wp_ . ; amino acid residues - ) based on the cna b-type domain-containing protein of streptococcus pyogenes were synthesized using human-preferred codons obtained from genscript usa, inc. the ls and i gene constructs were cloned into the pgex- t bacterial expression vector (sigma aldrich). to generate the hr -ls expression vector, the hr region (amino acid residues - ) encoding sequence of the mers-cov s gene (accession no. nc_ ) was ligated in-frame with an n-terminal sequence encoding a cd signal sequence and streptag purification tag, and with a c-terminal sequence encoding the ls via a linker, and subsequently cloned into the pcaggs mammalian expression vector. to generate the i -hr expression vector, the heptad repeat encoding region (hr , amino acid residues - ) of the mers-cov s gene was ligated in-frame with an n-terminal sequence encoding the i - and a c-terminal streptag purification tag interspaced with a linker, and subsequently cloned into the pgex- t bacterial expression vector (sigma aldrich). to generate the fp-i and fp-ls expression vectors, the fusion peptide (fp; amino acid residues - ) encoding sequence of the mers-cov s gene was ligated in-frame with an n-terminal sequence encoding the i - or ls, and a c-terminal streptag purification tag and subsequently cloned into the pgex- t bacterial expression vector (sigma aldrich).
to generate the rbd-st expression vector, the mers-rbd (amino acid residues - ) encoding sequence of the mers-cov s gene was ligated in-frame with an n-terminal sequence encoding a cd signal sequence and with a c-terminal sequence encoding the st followed by a double streptag, and subsequently cloned into the pcaggs mammalian expression vector. to generate the ls-sc expression vector, the codon optimized sc sequence equipped with an n-terminal flag-tag (dykddddk) was cloned to the n-terminus of the ls sequence in the pet b bacterial expression vector (novagen). all protein sequences are provided in supplementary figures s and s . mammalian expression of the hr -ls and rbd-st constructs was done as described previously [ ]. in short, expression plasmids were polyethylenimine (pei)-transfected into % confluent hek- t cells for h, after which transfections were removed and medium was replaced with sfm ii-based expression medium (gibco life technologies) and incubated at °c in % co . tissue culture supernatants were harvested - d post transfection, and expressed proteins were purified using streptactin sepharose beads (iba) according to the manufacturer's instructions. bl cells (novagen) were transformed with pgex- t expression vectors and grown in × yeast-tryptone medium to log phase (od ∼ . ) and subsequently induced by adding iptg (isopropyl-β-d-thiogalactopyranoside) (gibco brl) to a final concentration of mm. two hours later, the cells were pelleted, resuspended in / volume of mm tris (ph . )- mm edta- mm phenylmethylsulfonyl fluoride, and sonicated on ice (five times, min each). the cell homogenates were centrifuged at , × g for min at °c. proteins were purified from the cell lysate supernatant using streptactin sepharose beads (iba) according to the manufacturer's instructions. all purified proteins were analyzed on a % sds/page gel under reducing conditions and stained with gelcodeblue stain reagent (thermo scientific).
purified proteins were stored at °c until further use. expression of the flag-ls-sc was performed as described above with the following modifications: ) cells were treated with mg/ml lysozyme in lysis buffer ( mm tris-hcl, mm nacl, % triton x- ) for h at room temperature prior to sonication on ice. ) purification was performed using anti-flag® m affinity gel (sigma aldrich) as recommended by the manufacturer. purified proteins were dialyzed against x tbs buffer ( mm tris-hcl, mm nacl, ph . ) and stored at − °c until further use. rabbit immunizations and challenge were carried out at viroclinics bioscience b.v. under permit no. avd -wp , using bsl- containment facilities. female new zealand white rabbits (envigo, venray, the netherlands) of weeks of age were assigned to six groups (i-vi) of five animals each. immunizations were performed intramuscularly with either i) hr -ls, ii) fp-ls, iii) ls, at day and boosted with either i) hr -i , ii) fp-i , iii) i on day or iv) pbs, v) rbd + ls, vi) rbd-ls on days and . each animal received each time µg of antigen adjuvanted with adjuplex ( %; sigma-aldrich, zwijndrecht, the netherlands) in a total volume of µl. three weeks after the last vaccination (day of the study), all animals were challenged intranasally under anesthesia with mers-cov ( % tissue culture infectious dose (tcid ) mers-cov emc strain (accession no. nc_ ) in a volume of ml divided over both nostrils). the animals were euthanized on day postchallenge (day of the study). serum samples were collected on days , , and . nasal swabs were collected on day (pre-challenge) and on days through post-challenge. following euthanasia, lungs were examined for gross pathology and lung tissue samples were collected for virus detection, and in % formalin for histopathology and immunohistochemistry.
antigen-binding and anti-ls (scaffold) antibodies produced after vaccination were tested in the sera collected at different time points as well as in pre-challenge nasal swabs using elisa. costar high-binding -well elisa plates were coated overnight at °c with µg/ml of either recombinant ls, mers-cov s or s proteins in pbs. the plates were washed with pbs and blocked for hr using %bsa/ . %tween- /pbs. following blocking, diluted samples ( : or serially diluted) were added and further incubated for hr. the plates were then washed and probed with an hrp-labeled goat anti-rabbit ig ( : , dako) secondary antibody. tmb was used for signal development and the absorbance of each sample was measured at nm (od ). antibody avidity was assessed using an ammonium thiocyanate (nh scn)-displacement elisa. this was carried out as described above using serum dilutions containing the same level of s absorbance units added in triplicate. following serum incubation and washing, nh scn ( - m) was added to the wells for min. the plates were then washed and further developed as described above. the concentration of nh scn resulting in a % reduction in signal was taken as the avidity index (ic ). to confirm the antigenicity of the rbd-ls particles, we tested their binding to well-characterized monoclonal antibodies binding conformational rbd epitopes [ ]. human monoclonal antibodies . g , . f , . g , . e , . e targeting the receptor binding domain of the mers-cov spike protein were produced and purified as described earlier [ ]. nunc maxisorp plates (thermo scientific) were coated with the rbd-ls antigen at ng/well at °c overnight. plates were washed three times with pbs containing . % tween- and blocked with % protifar in pbs containing . % tween- at room temperature for h. four-fold serial dilutions of mabs starting at µg/ml (diluted in blocking buffer) were added and plates were incubated for h at room temperature.
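the avidity index defined above (the nh scn concentration that reduces the elisa signal by a set fraction) can be read off a displacement curve by interpolation. the sketch below is one simple way to do this and is not taken from the study: the function name, the linear interpolation between neighbouring concentrations, and the default reduction of 0.5 are all assumptions, since the cut-off percentage is not recoverable from this text.

```python
def avidity_index(concentrations, signals, reduction=0.5):
    """Chaotrope concentration at which the signal drops by `reduction`.

    concentrations : ascending chaotrope concentrations; the first entry is
                     the no-chaotrope reference (normally 0)
    signals        : absorbance readings at those concentrations
    reduction      : fractional signal loss defining the index (assumed 0.5)

    Returns the linearly interpolated concentration, or None if the signal
    never falls far enough within the tested range.
    """
    if len(concentrations) != len(signals) or len(signals) < 2:
        raise ValueError("need matching lists with at least two points")
    target = signals[0] * (1.0 - reduction)
    for i in range(1, len(signals)):
        if signals[i] <= target:
            c0, c1 = concentrations[i - 1], concentrations[i]
            s0, s1 = signals[i - 1], signals[i]
            if s0 == s1:  # flat segment already at the target level
                return c1
            # Interpolate between the two points bracketing the target signal.
            return c0 + (s0 - target) * (c1 - c0) / (s0 - s1)
    return None


if __name__ == "__main__":
    # Hypothetical displacement curve for one serum sample.
    print(avidity_index([0, 1, 2, 3], [2.0, 1.6, 0.8, 0.4]))
```

a higher index means that more chaotrope is needed to displace the bound antibodies, so per-animal indices can be compared directly between vaccine groups.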
plates were washed three times and incubated with hrp-conjugated goat anti-human secondary antibody (itk southern biotech) diluted : in blocking buffer for one hour at room temperature. hrp activity was measured at nm using tetramethylbenzidine substrate (biofx) and an elisa plate reader (el- , biotek). the presence of mers-cov neutralizing antibodies in the sera and nasal swabs of vaccinated animals was tested using a plaque reduction neutralization assay (prnt). heat-inactivated two-fold serially diluted samples (starting : ) were mixed : with pfu of mers-cov (emc/ ) and incubated for one hour. the mix was then overlaid on huh- cells in -well plates. following one hour of incubation, the mix was removed and the cells were incubated for hr. the cells were then fixed, permeabilized and stained using a mouse anti-mers-cov n protein monoclonal antibody (sino biological) followed by an hrp-labelled goat anti-mouse igg (southernbiotech). the signal was developed using a precipitate forming peroxidase substrate (true blue, kpl). the immunospot® image analyzer (ctl europe gmbh) was used to count the number of infected cells per well. the neutralization titre of each serum sample was determined as the reciprocal of the highest dilution resulting in a ≥ % (prnt ) or ≥ % (prnt ) reduction in the number of infected cells. a titre of ≥ was considered to be positive. to evaluate the protective efficacy of vaccination against mers-cov challenge, nasal swabs and homogenized lung tissues were tested for the presence of mers-cov rna using rt-qpcr and for the presence of infectious virus by virus titration. the presence of viral rna in nasal swabs and lung tissues was tested using upe rt-qpcr as previously described [ ]. rna was extracted from samples using magnapure lc total nucleic acid isolation kit (roche). rna amplification and quantification were carried out using a real-time pcr system (applied biosystems). samples with a c t value < were considered positive.
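the prnt titre rule stated above (the reciprocal of the highest dilution that still gives the required reduction in infected cells) can be written as a short function. this is an illustrative sketch only: the function name, the use of a virus-only control well as the reference count, and all numbers in the usage note are assumptions, because the cut-off percentages and starting dilution are not recoverable from this text.

```python
def prnt_titre(reciprocal_dilutions, infected_counts, control_count, reduction):
    """Reciprocal of the highest dilution meeting the reduction cut-off.

    reciprocal_dilutions : dilution factors tested, e.g. 20 for a 1:20 dilution
    infected_counts      : infected-cell counts at those dilutions
    control_count        : infected-cell count without serum (assumed reference)
    reduction            : required fractional reduction, e.g. 0.9 for 90 %

    Returns None when no tested dilution reaches the cut-off.
    """
    if control_count <= 0:
        raise ValueError("control_count must be positive")
    if len(reciprocal_dilutions) != len(infected_counts):
        raise ValueError("dilutions and counts must have the same length")
    qualifying = [
        dilution
        for dilution, count in zip(reciprocal_dilutions, infected_counts)
        if (control_count - count) / control_count >= reduction
    ]
    # The highest dilution corresponds to the largest reciprocal factor.
    return max(qualifying) if qualifying else None


if __name__ == "__main__":
    # Hypothetical two-fold series and counts for a single serum sample.
    dilutions = [20, 40, 80, 160]
    counts = [2, 8, 30, 70]
    print(prnt_titre(dilutions, counts, control_count=100, reduction=0.9))
    print(prnt_titre(dilutions, counts, control_count=100, reduction=0.5))
```

a stricter cut-off can only lower or preserve the titre for the same sample, which is why titres obtained at the two cut-offs are reported separately.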
dilutions of rna extracted from a mers-cov stock of known titre were used to generate a standard curve in order to calculate the tcid equivalent of rna detected in samples. concentrations of viral rna in lung tissue are expressed as tcid equivalents per gram tissue (tcid eq/g), and in the nasal swabs as tcid eq/ml. the presence of mers-cov infectious viral particles in respiratory tract samples (nasal swabs and lung tissue homogenates) was detected by titration on vero cells as described previously [ ]. briefly, -fold serially diluted samples (starting undiluted) were overlaid on vero cells and the plates were incubated for five days at °c and the cytopathic effect was recorded. infectious virus titres in lung tissue are expressed as tcid per gram tissue (tcid /g), and infectious virus titres in nasal swabs are expressed as tcid /ml. lung tissue samples were collected in formalin and embedded in paraffin for pathological analysis. hematoxylin-eosin staining was carried out for histopathological analysis. the presence of mers-cov nucleoprotein was detected by immunohistochemistry as previously published [ ]. statistical analyses were performed using prism (graphpad software inc, usa). data were compared using mann-whitney u test or student's t-test. p-values < . were considered significant. all data are available within the article and its supplementary information or available from the authors on request. particulate multivalent antigen display can enhance immunogenicity through different mechanisms, allowing for induction of immune responses against otherwise weakly immunogenic antigens [ , ]. we sought to design antigens capable of inducing strong immune responses against critical parts of the viral entry and fusion machinery within the mers-cov spike protein through immune focusing and multivalent presentation on self-assembling particles (figure ).
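the standard-curve step described in the rt-qpcr methods above (rna dilutions of a stock of known titre used to convert sample c t values into tcid equivalents) is commonly implemented as a linear fit of c t against log10 titre. the sketch below assumes that approach; the function names and the example values are hypothetical and are not data from the study.

```python
def fit_standard_curve(log10_titres, ct_values):
    """Least-squares fit of Ct = slope * log10(titre) + intercept."""
    n = len(log10_titres)
    if n < 2 or n != len(ct_values):
        raise ValueError("need at least two matching standard points")
    mean_x = sum(log10_titres) / n
    mean_y = sum(ct_values) / n
    sxx = sum((x - mean_x) ** 2 for x in log10_titres)
    if sxx == 0:
        raise ValueError("standards must span more than one titre")
    sxy = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(log10_titres, ct_values))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def ct_to_titre_equivalent(ct, slope, intercept):
    """Invert the standard curve: titre equivalent for a sample Ct value."""
    return 10 ** ((ct - intercept) / slope)


if __name__ == "__main__":
    # Hypothetical ten-fold dilution series of a stock of known titre.
    slope, intercept = fit_standard_curve([1, 2, 3, 4], [35.0, 31.7, 28.4, 25.1])
    print(slope, intercept)
    print(ct_to_titre_equivalent(30.0, slope, intercept))
```

the value returned is per reaction input; expressing it per gram of tissue or per ml of swab medium additionally requires the sample mass or volume and the extraction and elution volumes, which are not modelled here.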
within the s subunit, the rbd is the main target for the induction of neutralizing antibodies and has been used to develop several vaccine candidates for mers-cov [ , ] . indeed, the immunogenicity of rbd can be enhanced by its presentation on ferritin nanoparticles [ ] . likewise, the fusion peptide (fp) and the hr , which show a high degree of sequence conservation among covs relative to the rbd, play crucial roles in the cov spike-mediated fusion machinery, and can be targets for cov protective antibodies [ ] [ ] [ ] [ ] [ ] . genetic fusion was chosen for fp and hr , due to their small size, whereas the st/sc system was used for rbd display on particles to ensure correct folding of the protein. two -meric hyperstable self-assembling particles with icosahedral symmetry were used for multivalent display of mers-cov domains. the lumazine synthase (ls) particle, an icosahedron with a diameter of nm (pmid: ) and the i - (i ) particle, a dodecahedron with a diameter of nm (pmid: ). the n-and c-termini of both scaffolds are surface exposed, providing a platform to multivalently present (antigenic) domains. two functional segments of the s subunit of the mers-cov spike protein were genetically fused to these nanoparticles; the fusion peptide containing region (amino acid residues - ) and the hr containing region (amino acid residues - ) ( figure b, supplementary figure s ). chimeric nanoparticles were purified after expression in eukaryotic (mammalian) or prokaryotic systems (figure ). in addition, we used the spytag/spycatcher system to multivalently display the mers-cov rbd on ls nanoparticle via covalent bonding [ ] . for this purpose, the spycatcher (sc) was genetically fused to ls and expressed and purified from e. coli. the spytag (st) was genetically fused to the mers-rbd (amino acid residues - ) and expressed and purified from hek- t cells ( figure c ). rbd-st was incubated with ls-sc in different molar ratios to assess the optimal coupling of both components. 
a : molar ratio of rbd-st and ls-sc allowed the optimal coupling of all of the provided rbd-st antigens to the sc-ls particles ( figure d ). the resulting conjugation products were used for immunization. in order to assess the effect of the particle-based multivalent antigen display on immunogenicity, a mixture of non-coupled rbd-st and ls (without sc) was taken along for immunization in the same molar ratio. all particulate preparations displaying mers-s antigenic domains (genetically fused or sc/st coupled) were analyzed by sds-page ( figure e, supplementary table s ), confirming their molecular integrity. we further confirmed the antigenicity of the rbd-ls particles by testing their capacity to bind monoclonal antibodies directed against conformational epitopes on the rbd [ ] using elisa. all antibodies bound to rbd-ls in a dose-dependent manner ( figure s ), indicating that the rbd is correctly folded and confirming its antigenicity. we then evaluated the immunogenicity of the multimeric spike antigens using six groups of rabbits (n = per group), which were intramuscularly immunized twice at a -week interval (figure a ). the ls/i and pbs immunized groups served as controls. after the first immunization, we detected antibody responses against the corresponding s subunit (s or s ) in the vaccinated rabbits, while the control groups remained negative ( figure b -e). endpoint antibody titres for the vaccinated groups are shown as geometric mean titres (gmt) in supplementary table s . the antibody responses were further boosted after the second immunization in all groups, while no responses were detected in the control groups, confirming the immunogenicity of the tested antigens in rabbits. anti-s antibody responses were detected in the hr and fp vaccinated groups with weak to no mers-cov neutralizing capacity ( figure b,c).
only hr vaccination induced low levels of mers-cov neutralizing antibodies (prnt titres: - ) in / rabbits; all had mers-cov neutralizing antibodies at a % cut-off (data not shown). likewise, both the monomeric rbd (rbd + ls) and the multimeric rbd-ls were immunogenic and elicited high s -specific antibody titres which were further boosted after the second immunization. the rbd-ls-induced s antibody titres were significantly higher than those induced by the monomeric rbd following the prime as well as the booster vaccination (p = . and p = . , respectively, by mann-whitney u test) ( figure d ). multimeric rbd-ls vaccination elicited higher mers-cov neutralizing antibodies, a main correlate of protection, than the monomeric rbd + ls when tested for live virus neutralization using prnt assay (p = . and p = . , post-prime and boost, respectively, by mann-whitney u test) ( figure e ). the vaccine-induced antibodies were able to neutralize clade a (emc/ strain; figure e ) as well as the more recently circulating clade b (qatar / strain; supplementary figure s ) viruses. the spike protein of the latter strain differs from the clade a emc/ strain in two positions: t s and q r. following a single immunization, binding antibody titres were four-fold higher and neutralizing antibodies were eleven-fold higher in the coupled multimeric rbd-ls group than in the uncoupled monomeric rbd + ls (supplementary table s ). three weeks after the boost, binding antibody responses were seven-fold higher (p = . , mann-whitney u test) and neutralizing antibodies were ten-fold higher (p = . , mann-whitney u test) in the coupled rbd-ls group than in the uncoupled rbd + ls ( figure d , e supplementary table s ). additionally, we tested for vaccine-induced mucosal immunity in the respiratory tract of vaccinated rabbits pre-challenge (day ) using elisa. mers-cov specific antibodies were only detected in the nasal swabs of the groups vaccinated with conjugated or non-conjugated rbd (figure f,g).
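the fold differences quoted above compare group titres that are summarized as geometric mean titres (gmt). a minimal sketch of that arithmetic follows; the function names and the per-animal titres in the usage note are hypothetical and are not the study's data.

```python
import math


def geometric_mean_titre(titres):
    """Geometric mean of positive titres, computed as the mean in log space."""
    if not titres or any(t <= 0 for t in titres):
        raise ValueError("titres must be a non-empty list of positive values")
    return math.exp(sum(math.log(t) for t in titres) / len(titres))


def fold_difference(group_a, group_b):
    """Ratio of the geometric mean titres of two groups (a over b)."""
    return geometric_mean_titre(group_a) / geometric_mean_titre(group_b)


if __name__ == "__main__":
    # Hypothetical endpoint titres for five animals per group.
    coupled = [6400, 12800, 6400, 25600, 12800]
    uncoupled = [1600, 3200, 1600, 3200, 1600]
    print(geometric_mean_titre(coupled))
    print(fold_difference(coupled, uncoupled))
```

the geometric mean is preferred over the arithmetic mean for titres because serial dilutions are spaced multiplicatively, so a single high responder does not dominate the group summary.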
antibody responses detected in the rbd-ls vaccinated group were higher than those in the rbd + ls vaccinated group (p = . , student's t-test). this demonstrates that rbd-ls induces improved local mucosal immune responses compared to the monomeric rbd. thus, vaccination with the newly produced rbd-ls mers-cov mpsp vaccine induces a robust immune response. the avidity of mers-cov spike-specific antibodies in the monomeric versus the multimeric rbd vaccinated groups was analyzed at days ( weeks after prime) and ( weeks after boost) using an ammonium thiocyanate (nh scn)-displacement elisa [ ] . the avidity index ic was determined for each vaccinated rabbit and compared between the two groups. the avidity of the s -specific antibody responses was higher following rbd-ls vaccination compared to the monomeric rbd + ls vaccination (p < . , student's t-test) (figure ), indicating that a multimeric rbd-ls vaccine can induce antibody responses of both higher quantity and quality (figures d,e and ). in addition to evaluating anti-s (antigen) responses, we also tested for the induction of ls-specific (scaffold) antibodies. antibody responses were elicited against the ls-particle in all groups except the pbs group, indicating that the particle was accessible and not sterically hidden by antigens displayed on its surface, even when rbd was displayed on its surface using the spytag:spycatcher linkage (figure ). despite that, antigen-specific responses were not adversely affected by the presence of these anti-scaffold antibodies, as demonstrated by the booster effect after the second immunization (figure d,e). nonetheless, we tested whether a heterologous scaffold boost could help in minimizing such anti-scaffold responses using an ls/i prime-boost scheme. using this approach, we found no significant increase in anti-scaffold antibody responses compared to the homologous prime-boost scheme (figure c ).
this indicates that a heterologous scaffold prime-boost approach could be advantageous for limiting unnecessary anti-scaffold responses. to evaluate the protective efficacy of the immune responses induced by the different mers-cov spike mpsp vaccines, rabbits were challenged intranasally with tcid of mers-cov (strain hcov-emc/ ) and nasal swabs were collected up to days post inoculation (pi) (figure a ). on day pi, the animals were euthanized, and lung tissue samples were collected. consistent with earlier reports [ , ] , none of the rabbits in the control group developed any clinical signs of infection upon mers-cov inoculation, and titration of infectious virus from lung tissues and nasal swabs was variable. thus, to evaluate protection, we tested for mers-cov rna by qrt-pcr, for mers-cov infectious virus by virus titration, and for mers-cov antigen (n protein) in lung tissues by immunohistochemistry (ihc). except for the rbd-ls vaccinated group, viral rna was detected in all vaccinated groups from day through day post-challenge at levels similar to control groups (figures and ). viral rna titres were significantly reduced in the nasal swabs of the rbd-ls vaccinated groups as early as day post-challenge and were undetectable by day , in line with the absence of detectable infectious virus particles (figure ). viral rna was also reduced in the lungs of rbd-ls-vaccinated rabbits (figure ). consistently, ihc revealed no viral antigen in the lungs of the rbd-ls vaccinated rabbits (figure c ), and antigen was also not detected in the rbd + ls vaccinated rabbits.
figure legend: (a, b) the percentage of serum antibodies bound following the addition of different concentrations of scn was used to determine (c) the avidity index (ic ). the difference in serum avidity between both groups was tested for statistical significance using a student's t-test, with asterisks indicating the level of significance. ***p ≤ . , ****p ≤ . . error bars indicate mean ± s.e.m.
overall, in contrast to the monomeric form, the antigen-focused multimeric rbd-ls vaccine was able to block mers-cov replication significantly in the nose and lungs of the infected rabbits. the efficacy of rbd-ls immunization in protecting against a mers-cov challenge makes it a potential vaccine candidate. however, for production at industrial scale, unnecessary sequences (e.g. tags) need to be removed, and preparations have to be further structurally and biochemically characterized. recombinant subunit proteins provide advantages regarding safety, costs, and speed of vaccine production, making them very attractive platforms for the development of vaccines for emerging viruses. multivalent antigen display allows for virus-mimicking presentation of antigens and has been shown to induce antibodies of high avidity and magnitude [ , , , , ] , with non-viral self-assembling mpsp providing advantages over other multimeric antigen presentation platforms [ , ] . among the mers-cov vaccine candidates developed so far, the latter approach has been used to design two candidates, both based on the receptor-binding domain [ , ] , the main target for mers-cov protective antibodies [ ] . one used self-assembling ferritin nanoparticles [ ] and the second used canine parvovirus (cpv) vp structural protein forming virus-like particles [ ] as scaffolds. both vaccine candidates were able to induce humoral and cellular immune responses in mice; nonetheless, neither has been tested for its protective capacity in a viral-challenge animal model. in our study, using an immune-focusing approach to target protective epitopes and domains along with multivalent presentation on self-assembling ls particles using a spontaneous covalent linker (spytag/spycatcher), we report for the first time the in vivo protective capacity of a multimeric mers-cov rbd particle vaccine.
we used self-assembling ls and i particles to generate chimeric multimeric protein scaffold particles displaying critical domains in the mers-cov spike protein and evaluated their immunogenicity and protective efficacy in rabbits. multimeric fp and hr vaccinations induced high levels of anti-s antibodies, but with low to undetectable virus-neutralizing capacities, and could not protect rabbits against virus challenge. meanwhile, multimeric rbd-ls vaccination was highly immunogenic and induced robust antibody responses of high magnitude, avidity and neutralizing capacity. following a live virus challenge, it protected the upper and lower respiratory tract of rabbits, as detected by a decrease in viral rna titres, with an ). despite producing strong antibody responses, the monomeric rbd failed to protect rabbits against mers-cov following an intranasal challenge. the presence of ls did not seem to influence the outcome, as it was included in the formulation of the monomeric form (rbd + ls), indicating that the coupling and the multimeric presentation are responsible for the enhanced response seen with the multimeric rbd-ls vaccine. the "plug-and-display" spytag/spycatcher system [ ] used to generate these multimeric rbd-ls particles allows for rapid and robust production of vaccines in a cost-effective manner. this enables the development of vaccines in a timely manner, which is crucial to prevent global public health consequences of evolving, emerging and re-emerging viruses. the efficacy of rbd-ls immunization in protecting against a mers-cov challenge makes it a potential vaccine candidate for further development. nonetheless, in case of production at an industrial scale, unnecessary sequences (e.g. tags) need to be removed, and preparations have to be further structurally and biochemically characterized.
when using scaffolds as antigen carriers, anti-scaffold antibody responses need to be considered to avoid their potential to compromise the targeted antigen-induced responses or to induce potential auto-antibodies against human antigens. antibody responses were induced against the ls protein scaffold used in this study. however, antigen-specific responses were boosted following the second immunization and were not adversely affected by the presence of these anti-scaffold antibodies (figure ), similar to other reports [ ] . since the sequence of the ls protein does not show any similarity to any human sequences, it is unlikely that they will induce unwanted auto-(antihuman) antibodies. an ls-based vaccine for hiv, in a current phase clinical trial (nct ), can provide further evidence for the safety of this platform. nonetheless, we developed a heterologous scaffold prime-boost using ls and i which can help in reducing anti-scaffold responses. a challenge facing mers-cov vaccine development is the limited number of appropriate animal models for testing protection against clinical virus isolates. rabbits provide some advantages as an animal model for mers-cov. by having the mers-cov receptor dpp expressed in both the upper and lower respiratory tract epithelium [ ] , the rabbits can be naturally infected. this allows the evaluation of both upper and lower respiratory tract mers-cov infection and in turn protection using natural field virus isolates rather than adapted strains. however, the animals are not able to develop severe infection such as that seen in severe human cases [ ] . nonetheless, severe infection, thus far, has not been established consistently in any of the other animal models without genetic modification and/or virus adaptation, except for marmosets [ ] . in addition to the aforementioned, rabbits are readily available and easier to handle compared to other species that can be naturally infected such as non-human primates. 
following the addition of mers-cov as a priority pathogen in the who r&d blueprint for action to prevent epidemics, a target product profile was developed which called for three types of mers-cov vaccines [ ] . these include one for camels to prevent virus shedding and transmission, and two for humans: a two-dose vaccine for long-term protection of those at continuous high risk such as camel handlers and health-care workers, and a single-dose vaccine for rapid onset of immune responses to protect those at acute risk in outbreak settings. the rbd-ls can be used to develop the two-dose vaccine required to protect the high-risk populations, and can be further optimized using the heterologous scaffold prime/boost scheme developed in this study. nonetheless, evaluating the longevity of the induced immune responses is warranted. following the prime, rbd-ls vaccination induced antibody responses of high avidity and mers-cov neutralizing capacity. owing to the robust immune responses induced after one dose, the rbd-ls can be a candidate for developing a rapid single-dose vaccine for mers-cov, which is required for reactive use in outbreak situations [ ] . additionally, this vaccine candidate was able to block mers-cov replication in the upper respiratory tract of infected rabbit, thus it could potentially be of use as a dromedary vaccine to block mers-cov transmission. however, both approaches need to be further validated. 
isolation of a novel coronavirus from a man with pneumonia in saudi arabia middle east respiratory syndrome coronavirus (mers-cov) middle east respiratory syndrome coronavirus in dromedary camels: an outbreak investigation list of blueprint priority diseases middle east respiratory syndrome coronavirus vaccines: current status and novel approaches influenza vaccines: from whole virus preparations to recombinant protein technology nanoparticle vaccines adopting virus-like features for enhanced immune potentiation plug-and-display: decoration of virus-like particles via isopeptide bonds for modular immunization. sci rep a sweeter approach to vaccine design innate immune recognition of glycans targets hiv nanoparticle immunogens to germinal centers induction of potent neutralizing antibody responses by a designed protein nanoparticle vaccine for respiratory syncytial virus selfassembling protein nanoparticles in the design of vaccines rational hiv immunogen design to target specific germline b cell receptors design of a hyperstable -subunit protein dodecahedron engineering a rugged nanoscaffold to enhance plugand-display vaccination new routes and opportunities for modular construction of particulate vaccines: stick, click, and glue identification of sialic acid-binding function for the middle east respiratory syndrome coronavirus spike glycoprotein the receptor binding domain of the new middle east respiratory syndrome coronavirus maps to a -residue region in the spike protein that efficiently elicits neutralizing antibodies structure-based discovery of middle east respiratory syndrome coronavirus fusion inhibitor towards a solution to mers: protective human monoclonal antibodies targeting different domains and functions of the mers-coronavirus spike glycoprotein. 
emerg microbes infect importance of neutralizing monoclonal antibodies targeting multiple antigenic sites on the middle east respiratory syndrome coronavirus spike glycoprotein to avoid neutralization escape application of built-in adjuvants for epitope-based vaccines structural analysis and optimization of the covalent association between spycatcher and a peptide tag lack of middle east respiratory syndrome coronavirus transmission in rabbits. viruses vaccine delivery: a matter of size, geometry, kinetics and molecular patterns advances in mers-cov vaccines and therapeutics based on the receptor-binding domain. viruses chaperna-mediated assembly of ferritin-based middle east respiratory syndrome-coronavirus nanoparticles identification of an immunodominant linear neutralization domain on the s portion of the murine coronavirus spike glycoprotein and evidence that it forms part of complex tridimensional structure analysis of murine coronavirus surface glycoprotein functions by using monoclonal antibodies human monoclonal antibodies against highly conserved hr and hr domains of the sars-cov spike protein are more broadly neutralizing characterization of neutralizing monoclonal antibodies recognizing a -residues epitope on the spike protein hr region of severe acute respiratory syndrome coronavirus (sars-cov) monoclonal antibodies targeting the hr domain and the region immediately upstream of the hr of the s protein neutralize in vitro infection of severe acute respiratory syndrome coronavirus antibody avidity determination by elisa using thiocyanate elution asymptomatic middle east respiratory syndrome coronavirus infection in rabbits enhanced inflammation in new zealand white rabbits when mers-cov reinfection occurs in the absence of neutralizing antibody nanoassembly routes stimulate conflicting antibody quantity and quality for transmission-blocking malaria vaccines. 
sci rep novel chimeric viruslike particles vaccine displaying mers-cov receptorbinding domain induce specific humoral and cellular immune response in mice self-assembling influenza nanoparticle vaccines elicit broadly neutralizing h n antibodies infection with mers-cov causes lethal pneumonia in the common marmoset a roadmap for mers-cov research and product development: report from a world health organization consultation we thank the technical staff of the preclinical department of viroclinics biosciences b.v. for their excellent technical support. key: cord- -co essuw authors: johnson, marina; wagstaffe, helen r.; gilmour, kimberly c.; mai, annabelle lea; lewis, joanna; hunt, adam; sirr, jake; bengt, christopher; grandjean, louis; goldblatt, david title: evaluation of a novel multiplexed assay for determining igg levels and functional activity to sars-cov- date: - - journal: j clin virol doi: . /j.jcv. . sha: doc_id: cord_uid: co essuw background: the emergence of sars-cov- has led to the development of serological assays that could aid in an understanding of the burden of covid- disease. many available tests lack rigorous evaluation and therefore results may be misleading. objectives: the aim of this study was to assess the performance of a novel multiplexed immunoassay for the simultaneous detection of antibodies against sars-cov- trimeric spike (s), spike receptor binding domain (rbd), spike n terminal domain and nucleocapsid antigen and a novel pseudo-neutralisation assay. methods: a multiplexed solid-phase chemiluminescence assay (meso scale discovery) was evaluated for the simultaneous detection of igg binding to four sars-cov- antigens and the quantification of antibody-induced ace- binding inhibition (pseudo-neutralisation assay). sensitivity was evaluated with a total of covid- serum samples ( confirmed pcr positive and anti-nucleocapsid igg positive) from individuals with mild symptomatic or asymptomatic disease. 
specificity was evaluated with control serum samples collected from adults prior to december . results: the specificity and sensitivity of the binding igg assay was highest for s protein with a specificity of . % and sensitivity of . % for samples taken days and . % for samples taken days following the onset of symptoms. igg concentration to s and rbd correlated strongly with percentage inhibition measured by the pseudo-neutralisation assay. conclusion: excellent sensitivity for igg detection was obtained over days since onset of symptoms for three sars-cov- antigens (s, rbd and n) in this multiplexed assay which can also measure antibody functionality. severe acute respiratory syndrome-related coronavirus- (sars-cov- ) was first recognised in january and rapidly spread world-wide ( ) . tests designed to measure antibodies to sars-cov- antigens were rapidly developed and are important for diagnostics and seroprevalence studies. the latter could help inform disease burden estimates, studies of transmission dynamics and modelling of the epidemic. antibody tests are particularly important in the context of mild or asymptomatic disease where a swab reverse transcriptase polymerase chain reaction (rt-pcr) test may be negative. for this reason, an understanding of the sensitivity and specificity of the tests being used is critical. the trimeric spike (s) protein of sars-cov- is present on the viral surface and in most cases is cleaved by host proteases into the s and s subunits, responsible for receptor recognition and membrane fusion respectively. s uses a region of the molecule, known as the receptor binding domain (rbd) to bind to host ace- receptor and thereby gain entry to the cell ( ) . specific immunoglobulin-g (igg) and igm antibody responses to sars-cov- s, n and rbd of the spike protein develop between - days following disease-onset ( ) . 
despite a rapid increase in the number and availability of sars-cov- serologic assays, most have undergone minimal external evaluation and validation ( ) . a recent large scale spanish seroprevalence study used a point of care igg test with a stated sensitivity of . % but on verification found it to have a sensitivity of either . %, . %, . % or % depending on the sample sets used for evaluation ( ) . all assays currently suffer from the absence of a defined standard serum so results are reported as positive or negative or as optical density readouts complicating the comparison between assays and studies and for many binding assays the relationship between antibody concentration and function is unclear. we have evaluated a novel assay designed to simultaneously measure igg to four sars-cov- antigens; full-length trimeric s, rbd and ntd of spike as well as n protein. the assay, based on meso scale discovery (msd) technology, utilises a -well based solid-phase antigen printed plate and an electrochemiluminescent detection system. in addition this assay can measure the ability of serum to inhibit the interaction between spike protein components and soluble ace- , also called a pseudo-neutralisation assay ( ) . to evaluate the sensitivity and specificity of the msd assay, we were able to utilise a relatively large number of samples obtained from sars-cov- rt-pcr positive health care workers or patients as well as antibody positive health care staff enrolling in a large sars-cov- cohort study. samples were screened for igg to sars-cov- n protein using a commercially available kit (epitope diagnostics inc, san diego, usa) as previously described ( ) . to measure igg antibodies, plates were blocked with msd blocker a following which reference standard, controls and samples diluted : in diluent buffer were added. 
after incubation, detection antibody was added (msd sulfo-tag™ anti-human igg antibody), then msd gold™ read buffer b was added and plates were read using a meso® sector s reader. plates were blocked and washed as above; assay calibrator (covid- neutralising antibody; monoclonal antibody against s protein; µg/ml), control sera and test sera samples diluted in in assay diluent were added to the plates. following incubation of the plates, an . µg/ml solution of msd sulfo-tag™ conjugated ace- was added, after which plates were read as above. percentage inhibition was calculated relative to the assay calibrator (maximum % inhibition). statistical analysis was performed using msd discovery workbench and graphpad prism version . (graphpad, san diego, ca). antibody concentration in arbitrary units (au) was interpolated from the ecl signal of the internal standard sample using a -parameter logistic curve fit. roc curves showing the sensitivity and specificity (plotted as %-specificity %), calculated using each value in the data as a cut-off, were plotted for each antigen. a cut-off antibody concentration was chosen based on the lowest value leading to a positive likelihood ratio (lr) of > , in order to maximise sensitivity while providing strong evidence to rule in infection ( ) . for s antigen binding, all lrs were above , therefore the llod was used as the cut-off for this antigen. comparisons between groups were performed by kruskal-wallis one-way anova with dunn's correction for multiple comparisons. correlation analysis was performed using spearman correlation. p values of < . were considered as significant. latent class models with two classes were fitted with the binary antibody responses as outcome variables, using the polca package in the r statistical environment. the code used for the latent class analysis is available on request.
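the cut-off rule described above (the lowest antibody concentration whose positive likelihood ratio exceeds a chosen threshold) can be sketched in python as follows. this is a minimal illustration under stated assumptions, not the authors' analysis code: the function names, the toy concentrations and the threshold argument `min_lr` are hypothetical and introduced here only for the example, and samples at or above the cut-off are assumed to be called positive.

```python
from typing import Optional, Sequence


def positive_likelihood_ratio(
    cases: Sequence[float], controls: Sequence[float], cutoff: float
) -> float:
    """LR+ = sensitivity / (1 - specificity), calling values >= cutoff positive."""
    sensitivity = sum(v >= cutoff for v in cases) / len(cases)
    false_positive_rate = sum(v >= cutoff for v in controls) / len(controls)
    if false_positive_rate == 0:
        # No control is called positive, so the ratio is unbounded.
        return float("inf")
    return sensitivity / false_positive_rate


def lowest_cutoff_meeting_lr(
    cases: Sequence[float], controls: Sequence[float], min_lr: float
) -> Optional[float]:
    """Scan observed values in ascending order; return the first whose LR+ > min_lr."""
    for cutoff in sorted(set(cases) | set(controls)):
        if positive_likelihood_ratio(cases, controls, cutoff) > min_lr:
            return cutoff
    return None  # no observed value reaches the requested LR+


# Hypothetical antibody concentrations (arbitrary units), for illustration only.
toy_cases = [5, 8, 9, 12, 15]
toy_controls = [1, 2, 3, 4, 6]
# min_lr values below are placeholders, not the study's threshold.
cutoff_lenient = lowest_cutoff_meeting_lr(toy_cases, toy_controls, min_lr=4)
cutoff_strict = lowest_cutoff_meeting_lr(toy_cases, toy_controls, min_lr=10)
```

scanning from the lowest value upwards mirrors the stated goal of maximising sensitivity: the first cut-off that clears the likelihood ratio threshold is also the one that keeps the most cases above it.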
the lower limit of detection (llod) was assigned as % of the standard value in au, and upper limit of detection (ulod) was assigned for ntd and rbd only as the s and n antigen did not reach an upper limit (table ) . for statistical purposes, ulod was assigned the highest calculated concentration plus % and llod as . %. the mean coefficient of variation (cv) between duplicates was < % for all except ntd ( . %, data not shown). the mean intra-assay cv was . % and inter-assay variation < % across all sars-cov- antigens except ntd ( . %) on one of four samples (supplementary the specificity for s, rbd and n assays are shown in table table : assay specificity calculated for each sars-cov- antigen from the control cohort. table . sensitivity and specificity was calculated for groups - d, > d, > d and > d since the onset of symptom the s antigen was the most sensitive of the three, with a sensitivity of . % and . % > days and > days respectively (table ) . rocs were plotted to visualise the trade-off between sensitivity and specificity for s and rbd neutralisation. cut-offs (lr> ) were . % for s and . % for rbd (shown by the dotted line on figure a -b). sensitivity and specificity for s were . % and . % respectively but lower for rbd ( . % and . % respectively). in the covid- cohort there were some igg positive sera that did not demonstrate neutralisation (below cut-off, n= for s and for rbd). these sera were predominantly those taken soon after the onset of symptoms; between - days, over days and over days. using a carefully defined cohort of known sars-cov- exposed individuals and relevant controls we were able to show the sensitivity and specificity of the assay for the four antigens of interest. 
comparing the performance of s and rbd assays in a recently published systematic review and metanalysis of the diagnostic accuracy of serological tests for covid- ( ) the s assay we evaluated had superior sensitivity to all of the assays included in the review while rbd performance was superior to most. the reason for this could be related to the technical aspects of the assay itself including the integrity of the antigen used and the sensitivity of the detection platform but also the use of a well-defined cohort of individuals with known exposure to sars-cov- . only the n terminal domain of the spike protein did not perform well in this assay with poor sensitivity due to the overlap in antibody titres between the covid- cohort and controls. the assay format permitted the measurement of antibody against spike protein derived from sars- , mers and two seasonal coronaviruses, but the results of antibody binding to these antigens could not be assessed in the same way as for the sars-cov- antigens due to the absence of defined negative and positive serum sets. an advantage of this assay is its ability to measure antibody induced inhibition of ace- receptor-spike interaction thought to be the major mechanism by which sars viruses, including sars-cov- attach to host cell surfaces ( , ) . in the covid- cohort, there was a good correlation between anti-s and anti-rbd igg and function although a few sera bound antigen but did not neutralize. these were dominated by sera taken soon after infection and as recently described, could be non-neutralising and targeting epitopes outside the rbd ( ). few of the control cohort sera had any pseudo-neutralisation activity despite pre-existing igg to seasonal coronavirus spike proteins suggesting season coronavirus exposure is unlikely to modify interaction with sars-cov- . other cross reactive immunological mechanisms (eg t cells) cannot be ruled out and may explain the varied clinical response following exposure to sars-cov- ( ) . 
this pseudo-neutralisation assay has been shown to correlate well with neutralisation assays using live sars-cov- (msd, personal communication). in summary, the msd multiplexed coronavirus panel assay evaluated in this study is highly reproducible, specific and sensitive for the detection of anti-sars-cov- antibody over days since the onset of covid- symptoms. the assay can be adapted to measure antibody function, which correlated well with spike protein antibody concentration. funding: this research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors. none
world health organisation. . coronavirus disease (covid- ) situation report -
relationship between anti-spike protein antibody titers and sars-cov- in vitro virus neutralization in convalescent plasma
structural proteins in severe acute respiratory syndrome coronavirus-
temporal profiles of viral load in posterior oropharyngeal saliva samples and serum antibody responses during infection by sars-cov- : an observational cohort study
serodiagnostics for severe acute respiratory syndrome-related coronavirus- : a narrative review
prevalence of sars-cov- in spain (ene-covid): a nationwide, population-based seroepidemiological study
a sars-cov- serological assay to determine the presence of blocking antibodies that compete for human ace binding
comparison of four new commercial serologic assays for determination of sars-cov- igg
diagnostic tests : likelihood ratios
diagnostic accuracy of serological tests for covid- : systematic review and meta-analysis
angiotensin-converting enzyme : a functional receptor for sars coronavirus
structural and functional basis of sars-cov- entry by using human ace
characterization of neutralizing antibodies from a sars-cov- infected individual
transmission, diagnosis, and treatment of coronavirus disease (covid- ): a review
the study team would like to thank meso scale discovery for the donation of the plates and reagents that
allowed us to complete the work, the costars study team at great ormond street children's hospital, staff in the great ormond street children's hospital clinical immunology laboratory for additional support and the nihr ucl great ormond street
key: cord- -w ytp q authors: lokman, syed mohammad; rasheduzzaman, m.d.; salauddin, asma; barua, rocktim; tanzina, afsana yeasmin; rumi, meheadi hasan; hossain, m.d. imran; siddiki, a.m.a.m. zonaed; mannan, adnan; hasan, m.d. mahbub title: exploring the genomic and proteomic variations of sars-cov- spike glycoprotein: a computational biology approach date: - - journal: infect genet evol doi: . /j.meegid. . sha: doc_id: cord_uid: w ytp q the newly identified sars-cov- has now been reported from around countries with more than a million confirmed human cases including more than , deaths. the genomes of sars-cov- strains isolated from different parts of the world are now available and the unique features of constituent genes and proteins need to be explored to understand the biology of the virus. spike glycoprotein is one of the major targets to be explored because of its role during the entry of coronaviruses into host cells. we analyzed whole-genome sequences and spike protein sequences of sars-cov- using multiple sequence alignment. in this study, unique variations have been identified among the genomes of sars-cov- including nonsynonymous mutations and one deletion in the spike (s) protein. among the variations detected, variations were located at the n-terminal domain and variations at the receptor-binding domain (rbd) which might alter the interaction of s protein with the host receptor angiotensin converting enzyme- (ace ). besides, amino acid insertions were identified in the spike protein of sars-cov- in comparison with that of sars-cov. phylogenetic analyses of spike protein revealed that bat coronaviruses have a close evolutionary relationship with circulating sars-cov- .
the genetic variation analysis data presented in this study can help a better understanding of sars-cov- pathogenesis. based on results reported herein, potential inhibitors against s protein can be designed by considering these variations and their impact on protein structure. wuhan, hubei province of china in december . the death toll rose to more than , among , , confirmed cases around the globe (until april , ) [ ] . the virus causing covid- is named as severe acute respiratory syndrome coronavirus (sars-cov- ). based on the phylogenetic studies, the sars-cov- is categorized as a member of the genus betacoronavirus, the same lineage that includes sars coronavirus (sars-cov) [ ] that caused sars (severe acute respiratory syndrome) in china during [ ] . recent studies showed that sars-cov- has a close relationship with bat sars-like covs [ , ] [ ] ]. interestingly, s glycoprotein is characterized as the critical determinant for viral entry into host cells which consists of two functional subunits namely s and s . the s subunit recognizes and binds to the host receptor through the receptor-binding domain (rbd) whereas s is responsible for fusion with the host cell membrane [ [ ] , [ ] , [ ] ]. mers-cov uses dipeptidyl peptidase- (dpp ) as entry receptor [ ] whereas sars-cov and sars-cov- utilize ace- (angiotensin converting enzyme- ) [ ] , abundantly available in lung alveolar epithelial cells and enterocytes, suggesting s glycoprotein as a potential drug target to halt the entry of sars-with remarkable properties like glutamine-rich aa long exclusive molecular signature (dsqqtvgqqdgsednqtttiqtivevqpqlemeltpvvqtie) in position - of polyprotein ab (pp ab) [ ] , diversified receptor-binding domain (rbd), unique furin cleavage site (prrar↓sv) at s /s boundary in s glycoprotein which could play roles in viral pathogenesis, diagnosis and treatment [ ] . to date, few genomic variations of sars-cov- are reported [ [ ] , [ ] ]. 
there is growing evidence that spike protein, a amino acid long glycoprotein having multiple domains, possibly plays a major role in sars-cov- pathogenesis. viral entry to the host cell is initiated by the receptor-binding domain (rbd) of s head. upon receptor-binding, proteolytic cleavage occurs at s /s cleavage site and two heptad repeats (hr) of s stalk form a six-helix bundle structure triggering the release of the fusion peptide. as it comes into close proximity to the transmembrane anchor (tm), the tm domain facilitates membrane destabilization required for fusion between virus-host membranes [ [ ] , [ ] ]. insights into the sequence variations of s glycoprotein among available genomes are key to understanding the biology of sars-cov- infection, developing antiviral treatments and vaccines. in this study, we have analyzed genomic sequences of sars-cov- to identify mutations between the available genomes followed by the amino acid variations in the glycoprotein s to foresee their impact on the viral entry to host cell from structural biology viewpoint. analysis. the ncbi reference sequence of sars-cov- s glycoprotein, accession number yp_ was used as the canonical sequence for the analyses of spike protein variants. variant analyses of sars-cov- genomes were performed in the genome detective coronavirus typing tool version . which is specially designed for this virus the dataset was then aligned with muscle [ ] . entropy (h(x)) plot of nucleotide variations in sars-cov- genome was constructed using bioedit [ ] . mega x (version . . ) was used to construct the msas and the phylogenetic tree using pairwise alignment and neighborjoining methods in clustalw [ , ] . tree structure was validated by running the analysis on bootstraps [ ] replications dataset and the evolutionary distances were calculated using the poisson correction method [ ] . 
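the poisson correction mentioned above converts the observed proportion p of differing amino acid sites between two aligned sequences into an estimated number of substitutions per site, d = -ln(1 - p). the short python sketch below illustrates that formula only; it is not the mega implementation, the function names and example peptides are hypothetical, and skipping any position where either sequence has a gap is an assumption made here because the gap treatment is not stated in the text.

```python
import math


def p_distance(seq_a: str, seq_b: str, gap: str = "-") -> float:
    """Proportion of differing sites between two aligned sequences.

    Assumption: positions where either sequence has a gap are skipped.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differing = 0
    for a, b in zip(seq_a, seq_b):
        if a == gap or b == gap:
            continue
        compared += 1
        if a != b:
            differing += 1
    if compared == 0:
        raise ValueError("no comparable (ungapped) positions")
    return differing / compared


def poisson_corrected_distance(seq_a: str, seq_b: str) -> float:
    """Poisson correction: d = -ln(1 - p), in substitutions per site."""
    p = p_distance(seq_a, seq_b)
    if p >= 1.0:
        return float("inf")  # every compared site differs; distance is undefined
    return -math.log(1.0 - p)


# Hypothetical aligned peptides, for illustration only.
d_example = poisson_corrected_distance("ACDEFGHIKL", "ACDEFGHIKV")
```

because -ln(1 - p) grows faster than p, the correction matters most for divergent pairs; for nearly identical spike sequences the corrected and uncorrected distances are almost the same.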
variant sequences of sars-cov- were modeled in swiss-model [ ] using the cryo-em spike protein structure of sars-cov- (pdb id vsb; [ ] ) as a template. the overall quality of the models was assessed in the rampage server [ ] by generating ramachandran plots (supplementary table ). pymol and biovia discovery studio were used for structure visualization and superposition [ , ].
results
multiple sequence alignment of the available genomes of sars-cov- was performed and variations were found throughout the , bp long sars-cov- genome with in total variations in utr region, synonymous variations that cause no amino acid alteration, non-synonymous variations causing change in amino acid residue, indels, and variations in non-coding region (supplementary table ). among the variations, variations ( synonymous, non-synonymous mutations and one deletion) were observed in the region of orf s that encodes s glycoprotein, which is responsible for viral fusion and entry into the host cell [ ]. notably, most of the sars-cov- genome sequences were deposited from the usa ( ) and china ( ) (supplementary fig. ). positional variability of the sars-cov- genome was calculated from the msa of sars-cov- whole genomes as a measure of entropy value (h(x)) [ ]. excluding ′ and ′ utr, ten hotspots of hypervariable positions were identified, of which seven were located at orf ab ( c>t, c>t, c>t, c>t, c>t, a>g, c>t) and one each at orf s ( a>g), orf a ( g>t), and orf ( t>c). the variability at positions and was found to be the highest among the hotspots (fig. ). the phylogenetic analysis of a total of sequences ( unique sars-cov- and different coronavirus s glycoprotein sequences) was performed. the evolutionary distances showed that all the sars-cov- spike proteins cluster in the same node of the phylogenetic tree, confirming that the sequences are similar to refseq yp_ (fig. ).
bat coronaviruses have a close evolutionary relationship, as different strains were found in the nearest outgroups and clades (bat coronavirus bm - , bat hp-beta coronavirus, bat coronavirus hku ), suggesting that coronavirus has a vast geographical spread and that bat is the most prevalent host (fig. ). in other clades, the clusters spanned different hosts, which may reflect the evolutionary changes of the surface glycoprotein due to cross-species transmission. the reporting of viral hosts from different locations at different times is indicative of possible recombination. the s glycoprotein sequences of sars-cov- were retrieved from the ncbi virus variation resource repository and aligned using clustalw. the position of sars-cov- spike protein domains was determined by aligning with the sars-cov spike protein (fig. ) [ , ]. from the sequence identity matrix, unique variants among unique sars-cov- spike glycoprotein sequences were identified to have substitutions and a deletion (fig. a and supplementary table ). sequences were found identical with the sars-cov- s protein reference sequence (yp_ ) while sequences were identical with the same variation of d g (supplementary table respectively due to substitution of amino acid that differs in charge. the remaining variants were mutated with amino acids that are similar in charge (fig. a). the sars-cov- spike protein variants were superposed with the cryo-electron microscopic structure of sars-cov- spike protein [ ] . fig. ) . the s subunit of spike protein, especially the heptad repeat region , fusion peptide domain, transmembrane domain, and cytoplasmic tail, was found to be highly conserved in the sars-cov and the sars-cov- variants, while the s subunit was more diverse, specifically the n-terminal domain (ntd) and receptor-binding domain (rbd).
the spatial distribution of s protein sequences having different variation over time reveals that most of the variants ( out of s glycoprotein sequences) were reported from the us, followed by out of sequences (including y deletion) and out of sequences from india and china, respectively (fig. ). a single variant was found in the only available sequence in the repository from each of sweden, australia, south korea and peru. interestingly, all reported variants are unique to a single country except d g, which was found in the us and peru (fig. ). moreover, we have also analyzed sequences from brazil, italy, nepal, pakistan, spain, taiwan and vietnam, but no variation in the s glycoprotein sequence was found when compared to refseq yp_ . covid is one of the most contagious pandemics the world has ever had, with , , confirmed cases to date (april , ), and the cases have increased as much as times in less than a month [ ]. phylogenetic analysis showed that sars-cov- is a unique coronavirus presumably related to bat coronavirus (bm - , hp-betacoronavirus). during this study, we [ ] , [ ] , [ ] ]. likewise, a number of studies targeting sars-cov- spike protein have been undertaken for therapeutic measures [ ], but the unique structural and functional details of sars-cov- spike protein are still under scrutiny. we also found a variant (r i) at the receptor binding domain (rbd) that mutated from a positively charged arginine residue to a neutral and smaller isoleucine residue (fig. i). this change might alter the interaction of the viral rbd with the host receptor because the r residue of sars-cov- is known to interact with the ace receptor for viral entry [ ]. similarly, alterations of the rbd (g s, v a, h q, and a s) could also affect the interaction of sars-cov- spike protein with other molecules, which requires further investigation.
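the distinction drawn above between substitutions that change side-chain charge (such as arginine to isoleucine) and those that do not can be expressed as a small lookup. the sketch below is only illustrative: it assumes the usual classification at roughly neutral ph (arg and lys positive, asp and glu negative, histidine treated as neutral), and the function names are hypothetical rather than taken from the study.

```python
# Side-chain charge at roughly neutral pH (assumption: histidine counted as neutral).
POSITIVE = set("RK")
NEGATIVE = set("DE")
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def charge_class(residue):
    """Return 'positive', 'negative' or 'neutral' for a one-letter amino acid code."""
    aa = residue.upper()
    if aa not in STANDARD:
        raise ValueError(f"not a standard amino acid code: {residue!r}")
    if aa in POSITIVE:
        return "positive"
    if aa in NEGATIVE:
        return "negative"
    return "neutral"


def classify_substitution(ref, alt):
    """Describe whether replacing `ref` by `alt` changes the side-chain charge class."""
    before, after = charge_class(ref), charge_class(alt)
    return {"ref": ref.upper(), "alt": alt.upper(),
            "from": before, "to": after,
            "charge_changed": before != after}


if __name__ == "__main__":
    # Residue pairs mentioned in the text; positions are omitted on purpose.
    for ref, alt in [("R", "I"), ("G", "S"), ("V", "A"), ("H", "Q"), ("A", "S")]:
        print(classify_substitution(ref, alt))
```

with this classification, only the arginine to isoleucine change among the rbd substitutions listed above alters the charge class.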
qia and qis variants were found to have an alteration of alanine to valine (a v), and aspartic acid to tyrosine (d y) respectively in the alpha helix of the hr domain. previous reports have indicated that the hr domain plays a significant role in viral fusion and entry by forming helical bundles with hr , and that mutations including alanine substitution by valine (a v) in the hr region are predominantly responsible for conferring resistance to mouse hepatitis coronaviruses against hr -derived peptide entry inhibitors [ ]. we hypothesize that the corresponding mutation (a v) found in sars-cov- might also have a role in the emergence of drug-resistant virus strains. also, the mutation (d h) found in the heptad repeat (hr) of sars-cov- could play a vital role in viral pathogenesis. moreover, we found that variants including one deletion out of were located within s , especially within the ntd and rbd regions of glycoprotein s (fig. a), the regions responsible for the preliminary interaction with the host cell receptor ace . this indicates that the ntd and rbd are very prone to mutations. however, the ntd and rbd portions harbour potential epitopes that might serve as peptide vaccine candidates against sars-cov- , as reported in different studies [ ] [ ] [ ]. the reason behind choosing the sequences from the s protein domains ntd and rbd is that they are situated on the outer surface of the virus and could be more accessible to the immune system (fig. c ). so the variations reported herein within the outer domains of s glycoprotein could help to design effective epitope-based vaccines or antivirals. the sars-cov- s protein contains an additional furin protease cleavage site, prrars, in the s /s domain, which is conserved among all sequences as revealed during this study (supplementary fig. ). this unique signature is thought to make sars-cov- more virulent than sars-cov and is regarded as a novel feature of the viral pathogenesis [ ].
according to previous reports, the more efficiently host cell proteases can process the coronavirus s protein, the more viral tropism is accelerated, as also reported for influenza virus [ ] , [ ] , [ ] , [ ]. apart from that, this could also allow viruses to escape antiviral therapies targeting the transmembrane protease tmprss (clinicaltrials.gov, nct ), which is a well reported protease that cleaves at s /s of s glycoprotein [ ]. comparative analyses between sars-cov and sars-cov- spike glycoprotein showed % similarity between them where the most diverse region was
coronavirus disease (covid- ) situation reports severe acute respiratory syndrome-related coronavirus--the species and its viruses, a statement of the coronavirus study group lim, others, a novel coronavirus associated with severe acute respiratory syndrome bats are natural reservoirs of sars-like coronaviruses, science ( -. ) huang, others, a pneumonia outbreak associated with a new coronavirus of probable bat origin pei, others, a new coronavirus associated with human respiratory disease in china genome composition and divergence of the novel coronavirus ( -ncov) originating in china cryo-em structure of the -ncov spike in the prefusion conformation, science ( -. ) structure, function, and antigenicity of the sars-cov- spike glycoprotein structure analysis of the receptor binding of -ncov fouchier, others, dipeptidyl peptidase is a functional receptor for the emerging human coronavirus-emc greenough, others, angiotensin-converting enzyme is a functional receptor for the sars coronavirus functional assessment of cell entry and receptor usage for sars-cov- and other lineage b betacoronaviruses a.
nitsche, others, sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor an exclusive amino acid signature in pp ab protein provides insights into the evolutive history of the novel human-pathogenic coronavirus (sars-cov ) the spike glycoprotein of the new coronavirus -ncov contains a furin-like cleavage site absent in cov of the same clade genomic variance of the -ncov coronavirus genomic characterisation and epidemiology of novel coronavirus: implications for virus origins and receptor binding the coronavirus spike protein is a class i virus fusion protein: structural and functional characterization of the fusion core complex interaction between heptad repeat and regions in spike protein of sars-associated coronavirus: implications for virus fusogenic mechanism and identification of fusion inhibitors muscle: multiple sequence alignment with improved accuracy and speed bioedit: a user-friendly biological sequence alignment editor and analysis program for windows / /nt mega x: molecular evolutionary genetics analysis across computing platforms the neighbor-joining method: a new method for reconstructing phylogenetic trees bootstrap confidence levels for phylogenetic trees evolutionary divergence and convergence in proteins swiss-model: homology modelling of protein structures and complexes structure validation by calpha geometry: phi, psi and cbeta deviation pymol: an open-source molecular graphics tool receptor recognition mechanisms of coronaviruses: a decade of structural studies a parvovirus b synthetic genome: sequence features and functional competence cryo-em structures of mers-cov and sars-cov spike glycoproteins reveal the dynamic receptor binding domains cryo-electron microscopy structures of the sars-cov spike glycoprotein reveal a prerequisite conformational state for receptor binding long-term protection from sars coronavirus infection conferred by a single immunization with an attenuated vsv-based 
vaccine human monoclonal antibodies against highly conserved hr and hr domains of the sars-cov spike protein are more broadly neutralizing a truncated receptor-binding domain of mers-cov spike protein potently inhibits mers-cov infection and induces strong neutralizing antibody responses: implication for developing therapeutics and vaccines fusion mechanism of -ncov and fusion inhibitors targeting hr domain in spike protein role of changes in sars-cov- in the interaction with the human ace receptor: an in silico analysis coronavirus escape from heptad repeat (hr )-derived peptide entry inhibition as a result of mutations in the hr domain of the spike fusion protein development of epitope-based peptide vaccine against novel coronavirus (sars-cov- ): immunoinformatics approach in silico identification of novel b cell and t cell epitopes of wuhan coronavirus ( -ncov) for effective multi epitope-based peptide vaccine production epitope-based chimeric peptide vaccine design against s, m and e proteins of sars-cov- etiologic agent of global pandemic covid- : an in silico approach host cell proteases controlling virus pathogenicity role of hemagglutinin cleavage for the pathogenicity of influenza virus host cell proteases: critical determinants of coronavirus tropism and pathogenesis coronaviruses: an overview of their replication and pathogenesis receptor for mouse hepatitis virus is a member of the carcinoembryonic antigen family of glycoproteins laude, others, aminopeptidase n is a major receptor for the enteropathogenic coronavirus tgev positional organization of major structural protein-encoding genes in orange color (s = spike protein, e = envelope protein, m = membrane protein, n = nucleocapsid protein) and accessory protein orfs in blue colors. b. 
variability within sars-cov- genomic sequences represented by entropy (h(x)) value across genomic location key: cord- -wf qxplf authors: gomez, santiago a.; rojas-valencia, natalia; gomez, sara; egidi, franco; cappelli, chiara; restrepo, albeiro title: binding of sars–cov– to cell receptors: a tale of molecular evolution date: - - journal: chembiochem doi: . /cbic. sha: doc_id: cord_uid: wf qxplf the magnified infectious power of the sars–cov– virus compared to its precursor sars–cov is intimately linked to an enhanced ability in the mutated virus to find available hydrogen bond sites in the host cells. this characteristic is acquired during virus evolution because of the selective pressure exerted at the molecular level. we pinpoint the specific residue (in the virus) to residue (in the cell) contacts during the initial recognition and binding and show that the virus· · · cell interaction is mainly due to an extensive network of hydrogen bonds and to a large surface of non–covalent interactions. in addition to the formal quantum characterization of bonding interactions, computation of absorption spectra for the specific virus· · · cell interacting residues yields significant shifts of ∆λ max = and nm in the wavelength for maximum absorption in the complex with respect to the isolated host and virus, respectively. at the time of the writing of this manuscript, the situation regarding the global pandemic produced by the spread of the sars-cov- virus (over . million confirmed cases, over deaths, with no end in sight), with dire consequences in all aspects of life, from social interactions, to the overwhelming of health and economic systems, is changing fast. because this is a critical problem, just as the rate of virus transmission on the early stages of dissemination, the number of scientific papers on the subject (mostly preprints) increases exponentially. sars-cov- is an enveloped virus of the coronaviridae family with a single-stranded rna genome. 
[ ] figure highlights the most important structural features of the virus: besides the nucleocapsid (n) proteins, the only proteins in direct contact with the genetic material (the n-rna core is embedded in a lipid environment), there are membrane (m), envelope (e), and spike (s) proteins. it is the spike proteins which lead to the now familiar external morphology of the virus, but more importantly, s proteins are responsible for the interactions with receptors in the host membrane (epithelial cells in humans). these s· · · receptor contacts initiate the infectious cycle of the virus. [ ] each spike consists of a trimer of s proteins. individual s proteins have been divided into two clear s , s sections, [ ] with s containing the n-terminal domain (ntd), and the receptor binding domain (rbd), the domain ultimately responsible for the interactions with the coupling factors present in cell membranes. [ ] coupling factors include a variety of proteins, carbohydrates, or other types of biomolecules expressed on the surface of the cell membrane and in charge of signaling and transport, among other functions. viruses take advantage of these molecules during the infection process. it seems well established that initial virus↔host recognition and binding are driven by s , and that further changes in the conformation of the s section mediate the fusion of the viral envelope to the host cell membrane. we also show a cell with internal organelles and with a few enzymes that act as virus receptors. the most commonly invoked culprit (with plenty of experimental evidence [ ] ) for the reception of sars-cov- is the angiotensin converting enzyme (ace ). this receptor is the subject of intensive studies aiming at finding effective therapies, and is the central focus of the ongoing race to find a vaccine.
in this work, we are interested in two crucial aspects of the initial virus· · · cell interaction problem: to pinpoint the specific residue to residue binding sites between the structurally known spike proteins of the virus [ ] and the structurally known ace receptor in cell membranes, [ ] and to understand, from a fundamental, quantum perspective, the molecular factors driving the virus· · · cell binding. we expect this knowledge to considerably better our understanding of the problem and to hopefully contribute to a rational design of drugs and vaccines to fight the virus. see the computational methods section for details of our calculations. our data shows that the rbd(s)· · · ace complex reached well defined persistent equilibrium states long before the ns of the molecular dynamics (md) simulation time are consumed in each of the three replicas. this stability is especially encouraging in the interaction region as clearly shown in the highlighted areas of the bottom panels in figure . we obtained an interaction energy ∆g int = − . kcal/mol, which is in excellent agreement with calculations reported in closely related systems. [ ] . virus· · · cell contacts in all cases, only hydrogen bonds (hbs) were found as responsible for explicit virus· · · cell pair-wise interactions. naturally, this does not mean that other weak, long range cumulative for the later steps of the simulations. table lists all individual binary contacts in the form of hydrogen bonds between residues in the rbd(s)· · · receptor complex found in our md simulations with an arbitrary threshold average of % occupancy on the triplicate runs. notice that this procedure intends to extract a representative sample from the simulations, thus, there are considerably more contacts not explicitly shown because they have lower occupancies or, because while having high occupancies, are not persistent in the replicas. 
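the selection of persistent hydrogen bonds described above (an occupancy threshold applied across the triplicate runs) can be sketched as a simple filter. this is a hedged illustration and not the authors' vmd workflow: it assumes each contact is summarised as one list of per-frame presence flags per replica, it reads "persistent" as meeting the threshold in every replica, and the threshold is left as a parameter because its value is not recoverable from the text. contact labels in the example are placeholders.

```python
def occupancy_pct(presence):
    """Percentage of frames in which a contact is present (iterable of truthy flags)."""
    frames = list(presence)
    if not frames:
        raise ValueError("no frames supplied")
    return 100.0 * sum(bool(f) for f in frames) / len(frames)


def persistent_contacts(presence_by_contact, threshold_pct):
    """Keep contacts whose occupancy meets the threshold in every replica.

    presence_by_contact maps a contact label to a list of replicas, each replica
    being a per-frame list of booleans. Returns {label: mean occupancy in percent}.
    """
    kept = {}
    for contact, replicas in presence_by_contact.items():
        per_replica = [occupancy_pct(r) for r in replicas]
        if per_replica and all(o >= threshold_pct for o in per_replica):
            kept[contact] = sum(per_replica) / len(per_replica)
    return kept


if __name__ == "__main__":
    # Toy data: 10 frames per replica, three replicas per contact (placeholder labels).
    toy = {
        "contact_1": [[True] * 8 + [False] * 2, [True] * 9 + [False], [True] * 7 + [False] * 3],
        "contact_2": [[True] * 10, [False] * 10, [True] * 10],  # high on average, not persistent
    }
    print(persistent_contacts(toy, threshold_pct=60.0))
```

the second toy contact shows why an average alone is not enough: it has a high mean occupancy but vanishes in one replica, so it is dropped.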
these hb contacts, whose bonding interactions are dissected below, are responsible for the attachment of the virus to epithelial cells in humans, initiating the infection process.
number of fragment to fragment hydrogen bonds in the rbd(s)· · · ace complex averaged over three independent replicas as the md progresses.
a summary of the quantum descriptors for the virus· · · cell interactions is listed in table ; the corresponding pictures are shown in figures , . without exception, despite being weak organic acids, residue to residue hydrogen bonds are stronger than the archetypal hb in the water dimer; this is seen in the larger binding energies, smaller distances, orbital interaction energies, e d→a of comparable magnitudes, larger bond indices, and in the properties of the bond critical points. the lys →asp is an exceptional case because it corresponds to bridge in a previous work. [ ] all hydrogen bonds are well characterized long range interactions. this is clearly seen in (i) there is never a formal σ orbital between the fragments, on the contrary, orbital in other words, in the nbo picture (figure ), all explicit virus· · · cell contacts are stabilized by charge transfer from one lone pair in an oxygen atom in the donor residue to an antibonding orbital in the acceptor residue.
table : properties of virus· · · cell hydrogen bonds. persistent hydrogen bonds in the rbd(s)· · · receptor complex exceeding the % occupancy threshold during the entire ns md simulation of each replica. the arrows state the directionality of the donor → acceptor interaction in the corresponding hydrogen bond according to the classical electrostatic x δ− − h δ+ → y δ− description. all occupancies averaged over the three md replicas. wbi are the wiberg bond indices. [ ] the archetypal hydrogen bond in the water dimer is included for comparison purposes. see the specialized literature [ ] [ ] [ ] [ ] [ ] [ ] for the formalism on how nbo and qtaim descriptors are related to bonding.
(ii) all properties of the bond critical points support the same picture: small ρ (r c ), small bond orders, virial ratios smaller than , and positive bond degree parameters. again, is the exception, with all the calculated descriptors indicating a highly ionic contact. the non covalent interactions (nci) calculation for the interaction region uncovers a large discontinuous non-covalent wall separating rbd(s) from the receptor. therefore, we characterize the virus· · · cell binding as due to a large number of non-covalent contacts between the two proteins, enhanced by the water molecules, acting in conjunction with the specific residue to residue hydrogen bonds. the ace receptor (in blue). the snapshot was randomly extracted from the late stages of one of the three md replicas. [ ] all persistent virus· · · cell hydrogen bonds listed in table are explicitly highlighted on the right frames. the non covalent [ , ] virus· · · cell interaction surface is explicitly shown in green, including the water molecules. notice that both fragments contain glycosylated glycoproteins and that all fine structure is accounted for during the calculations, however, the glycans are not shown in this picture for clarity. bottom: nbo donor→acceptor interactions responsible for the persistent hydrogen bonds. we concentrate on simulating the spectra of the aminoacids involved to investigate whether measurable changes in their spectral response occur upon binding, while residues that are not involved in the interaction can safely be viewed as changing very little as the two structures connect. for this purpose we propose a qm/mm approach where the aminoacid pairs involved in the binding are treated quantum mechanically by means of density functional theory (dft), while the rest of the protein environment is modelled classically, through the use of the amber force field. 
[ ] in this way, the electronic structure of the qm portion is influenced by its environment by means of an electrostatic embedding paradigm, [ ] where fixed charges are assigned to the mm atoms and directly affect the qm density and computed electronic excitations. figure shows that in all six cases, binding events set off drastic changes in the spectral response of the system, explicitly seen in red-shifted absorptions. the red-shift of the absorption bands is clearly visible in the convoluted spectrum and therefore provides unequivocal evidence of virus· · · cell bonding. the results listed here are quite encouraging and constitute an initial step that will hopefully motivate the design of experimental protocols to detect virus infection. however, it is clear that a number of details need to be worked out before practical applications can be devised. in particular, the potential interference of signals arising from functional groups in the same region of λ max , and the ability of the dimer model to accurately mimic physiological environments, should be addressed. the most serious problems faced when developing vaccines and therapies, and is particularly true for sars-cov- . [ ] we argue that this view of evolution as driven by environment induced molecular responses at the virus/biomolecules scales helps explain many aspects of evolution that are difficult to rationalize otherwise, namely, (i) in most cases evolution is a highly localized process (ii) because of the large number of mutation possibilities, which occur no matter how small the individual probabilities, an increase in entropy of the universe is the ultimate factor driving evolution (iii) evolution is a deterministic process driven by cumulative random changes (iv) in that sense, life itself is a deterministic process that only requires a large increase in the entropy of the universe (in other words, a long time), such that it will emerge in local environments capable of sustaining it. 
see for example early arguments by schrödinger [ ] stating that the apparent macroscopic stability is due to the microscopic chaos resulting from random events, and invoking a net entropy gain by the universe due to the continual energy transformation despite the heavy entropy investment in maintaining highly organized living organisms. more recently, england has discussed the statistical physics of self-replication. [ ] in the context of this work, the previous discussion of molecular and virus evolution leads us to hypothesize that one of the key factors in the molecular evolution problem faced by the precursor sars-cov virus on its way to mutating into sars-cov- was solved by favoring those changes in rbd(s) that lead to an improved ability to locate available sites for hydrogen bonding in the host cell, an ability that is further enhanced by the slightly basic ph ≈ . found in physiological environments. these improved hydrogen bonding capabilities may be achieved in a number of ways, for example, by incorporating aminoacids with more acidic protons, or by incorporating larger aminoacids whose hydrogen bonding regions are simply closer to the receptor, among others. we support the hypothesis that the need for improved hydrogen bonding acts as the selective pressure in virus mutation with the following evidence: . table shows that fragment to fragment contacts are all in the form of hydrogen bonds. for the specific case of the sars-cov- virus, in five of the six identified contacts, including the most persistent hbs, the residues in rbd(s) act as donors to the corresponding hydrogen bond . besides a small sheet and a small helix (figure ), there is no secondary structure in rbd(s); thus, the receptor binding domain of the spike protein has a high structural flexibility which allows the virus to probe for available hydrogen bonding sites in the receptor, which in contrast has well defined secondary and tertiary structures in the interaction region . 
we obtained from genbank the aminoacid sequences for the precursor sars-cov (id afr . ) and for the mutated sars-cov- (id qhd . ) viruses. [ ] we compare below only the aminoacids in the rbd(s) and highlight in red the receptor binding motif (rbm). we also underline aminoacid substitution in the mu- proteins. [ ] [ ] [ ] [ ] [ ] here, we take a pragmatic approach to determine relative acidities between the precursor sars-cov and the mutated sars-cov viruses. we took averages of the experimental isoelectric points [ ] (iep) for the aminoacids involved in the mutation, that is, we calculated the average iep of the replaced aminoacids in rbd(s) and found . and . for the precursor and for the mutated viruses, respectively. we also calculated the same averages for the interaction motifs only and obtained . and . over the mutations. thus the mutated virus is collectively considerably more acidic in the interaction region, which improves its ability to donate protons to hydrogen bonds.
the most important contributions from this work may be summarized as follows: . we pinpoint the specific residue (in the virus) to residue (in the cell) interactions during the initial virus· · · cell binding . we characterize the virus· · · cell molecular attachment as the result of a large number
the starting point of our calculations was the complex between the receptor binding domain of the s protein, rbd(s), and the ace receptor. cartesian coordinates for the rbd(s)· · · ace complex were taken from the protein data bank (pdb id lzg [ ] ) and then treated with charmm-gui, the graphical user interface of charmm [ , ], to include missing hydrogen atoms at ph = . , to ensure that all glycans are included, and to construct the force field. the entire system was enclosed by a truncated octahedral box such that the smallest atom· · · wall distance was set to Å, then the available volume in the box was filled with tip p [ ] water molecules.
nacl was added until a physiological . m concentration was attained and, finally, counterions were added to restore charge neutrality. this procedure led to a system comprising a total of atoms, with water molecules, atoms in the receptor, and atoms in rbd(s). the system was subjected to a steepest descent energy minimization in order to correct for potential inconsistencies in atom coordinates that may arise during the procedure of randomly filling the available space with water, nacl, and counterions. once the system was minimized, we ran triplicate all-atom md simulations under the conditions summarized in table and described next. first, there were three equilibration steps lasting a total of . nanoseconds (ns) with femtosecond (fs) time intervals, during which the structural constraints were progressively relaxed until finally being totally lifted. these structural constraints were imposed by harmonic constants that prevent deformation of the backbone (k bb ), side chain (k sc ), and dihedrals (k d ). then, the system underwent a production step lasting ns with time intervals of fs. for all md runs, the lennard-jones potential was softened starting at . nm until eventually vanishing at . nm. also, the cutoff radius for electrostatic interactions was set to . nm. all these simulations were conducted using the charmm m force field [ , ] as implemented in gromacs . [ ] at . k and bar. the rbd(s)· · · ace interaction energy (∆g int ) was estimated via the mm/pbsa method [ ] as implemented in gromacs. [ ] dielectric constants were set to solute = , solvent = at the simulation temperature. in short, ∆g int = ∆e virus···cell + ∆g p + ∆g np − t ∆s. here, ∆e virus···cell is the gas phase energy of the rbd(s)· · · ace complex, ∆g p and ∆g np are the solvation energies due to the polar and non-polar interactions, respectively, and t ∆s is the entropy contribution.
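the decomposition just stated can be written out as a short bookkeeping sketch. it only combines already computed terms according to ∆g int = ∆e virus···cell + ∆g p + ∆g np − t ∆s and averages over sampled frames; it does not compute any of the terms and is not the gromacs mm/pbsa tool. the class and field names are hypothetical, and the numbers in the example are placeholders, not values from this work.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MMPBSATerms:
    """Energy terms for one sampled frame, all in the same unit (e.g. kcal/mol)."""
    delta_e_gas: float        # gas-phase interaction energy of the complex
    delta_g_polar: float      # polar solvation term (Poisson-Boltzmann)
    delta_g_nonpolar: float   # non-polar solvation term (surface-area based)
    t_delta_s: float          # entropy contribution T*dS

    def delta_g_int(self):
        """dG_int = dE + dG_p + dG_np - T*dS."""
        return (self.delta_e_gas + self.delta_g_polar
                + self.delta_g_nonpolar - self.t_delta_s)


def mean_delta_g_int(frames):
    """Average dG_int over a non-empty sequence of per-frame terms."""
    frames = list(frames)
    if not frames:
        raise ValueError("at least one frame is required")
    return sum(f.delta_g_int() for f in frames) / len(frames)


if __name__ == "__main__":
    # Placeholder numbers chosen only to exercise the arithmetic.
    sampled = [
        MMPBSATerms(-100.0, 60.0, -10.0, -20.0),
        MMPBSATerms(-90.0, 50.0, -10.0, -20.0),
    ]
    print(mean_delta_g_int(sampled))
```

note the sign convention: because the entropy term enters as − t ∆s, a negative t ∆s (loss of entropy on binding) makes ∆g int less favourable.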
more precisely, ∆g p was computed under the poisson-boltzmann model, ∆g np was estimated using the solvent accessible surface area, and the entropy term was obtained from the model of duan and coworkers. [ ] to finally estimate ∆g int , we took points in the - ns interval of one of the md replicas. it has been recently shown [ ] that randomly chosen configurations from late stages of md simulations are adequate sources to obtain deep insight into interfragment bonding. accordingly, aiming at understanding the fundamental forces driving the attachment of rbd(s) to host cells, virus· · · cell bonding interactions were dissected following these steps: . persistent residue (in the virus) to residue (in the host cell) contacts during the ns of the md simulations were identified using the vmd program [ ] with a cutoff radius of . Å . one frame was randomly chosen from the late stages of one md run . we extracted all extended interacting pairs in the chosen frame, kept them in the configurations they had in the interacting system (this is more accurate to understand the virus· · · cell bonding interactions than reoptimizing the isolated pairs), and (a) computed accurate interaction energies using highly correlated domain based local pair-natural orbital coupled-cluster (dlpno-ccsd(t)/aug-cc-pvdz) single point energy calculations [ , ] on the dimers and in the monomers. the orca suite of programs, version . . . , was used to this end [ ] (b) dissected the intermolecular interactions using the tools provided by the natural bond orbitals (nbo [ ] [ ] [ ] as implemented in nbo . [ ] ) and by the quantum theory of atoms in molecules (qtaim [ ] [ ] [ ] as implemented in aimall [ ] ) (c) calculated qm/mm absorption spectra for the monomers and for the dimers. all td-dft calculations were carried out using the b lyp/aug-cc-pvdz model chemistry [ ] [ ] [ ] [ ] (tests using the dispersion corrected b lyp-d , ωb xd, functionals yielded essentially identical results). 
qm/mm electrostatic embedding was exploited, [ ] in which only the extended dimers were considered as the quantum region, and the rest of the system as the mm region, which was modelled by the amber force field [ ] and by assigning to atom types the same charges used in the md runs. a large number of excited states is needed to guarantee that both the intensities and shapes of the absorption spectra are accurately reproduced. therefore, in this work the first excited states were computed at the td-dft/qm-mm level in each case. vertical excitations were shifted by - . ev to account for the systematic error due to the choice of functional. this value was chosen in order to match the experimental absorption maximum for tyrosine. [ , ] all spectra were then convoluted with gaussian lineshapes with a full width at half maximum (fwhm) of . ev. all qm/mm calculations were carried out with gaussian [ ] . we isolated the interaction region by including everything within a . Å radius from the last atom at the end of each amino acid ( atoms in total) and calculated the interfragment non-covalent interaction (nci as implemented in nciplot [ ] ) surface using the promolecular densities approximation. [ , ]

atoms in molecules: a quantum theory discovering chemistry with natural bond orbitals proceedings of the national academy of sciences what is life? the physical aspect of the living cell electrophoresis crc handbook of chemistry and physics wires computational molecular science aimall (version . . ) gaussian revision b.

internal support from universidad de antioquia via "estrategia para la sostenibilidad" is acknowledged. partial funding for this project from h -msca-itn- european training network "computational spectroscopy in natural sciences and engineering" (co-sine), grant number is also acknowledged. n.r. thanks colciencias for her doctoral scholarship.

cartesian coordinates for all dimer pairs listed in table and for the fragment taken for the nci surfaces.
a figure with the minimum interacting distances used to determine the interaction regions is included as well. a video of one of the trajectories is also provided. key: cord- -fwn wds authors: juno, j. a.; tan, h.-x.; lee, w. s.; reynaldi, a.; kelly, h. g.; wragg, k.; esterbauer, r.; kent, h. e.; batten, c. j.; mordant, f. l.; gherardin, n. a.; pymm, p.; dietrich, m. h.; scott, n. e.; tham, w.-h.; godfrey, d. i.; subbarao, k.; davenport, m. p.; kent, s. j.; wheatley, a. k. title: immunogenic profile of sars-cov- spike in individuals recovered from covid- date: - - journal: nan doi: . / . . . sha: doc_id: cord_uid: fwn wds the rapid global spread of sars-cov- and resultant mortality and social disruption have highlighted the need to better understand coronavirus immunity to expedite vaccine development efforts. multiple candidate vaccines, designed to elicit protective neutralising antibodies targeting the viral spike glycoprotein, are rapidly advancing to clinical trial. however, the immunogenic properties of the spike protein in humans are unresolved. to address this, we undertook an in-depth characterisation of humoral and cellular immunity against sars-cov- spike in humans following mild to moderate sars-cov- infection. we find serological antibody responses against spike are routinely elicited by infection and correlate with plasma neutralising activity and capacity to block ace /rbd interaction. expanded populations of spike-specific memory b cells and circulating t follicular helper cells (ctfh) were detected within convalescent donors, while responses to the receptor binding domain (rbd) constitute a minor fraction. using regression analysis, we find high plasma neutralisation activity was associated with increased spike-specific antibody, but notably also with the relative distribution of spike-specific ctfh subsets. 
thus both qualitative and quantitative features of b and t cell immunity to spike constitute informative biomarkers of the protective potential of novel sars-cov- vaccines.

the rapid global spread of sars-cov- has highlighted the intrinsic vulnerability of humans to emerging zoonotic infections and spurred frantic efforts to expedite vaccine and antiviral drug development, manufacture and deployment. in contrast to historical pandemics, such as the "spanish" h n influenza, modern recombinant technology enables a rapid scientific response, with multiple vaccines under development, almost exclusively aimed at eliciting antibodies to the viral "spike" protein. the spike (s) protein of beta-coronaviruses is expressed as a single protein, with proteolytic cleavage yielding s and s subunits . s localises on the virion surface and mediates both recognition of cellular receptors and membrane fusion. in the case of sars-cov- , a receptor binding domain (rbd) within s directly interacts with high affinity with the peptidase domain of angiotensin-converting enzyme (ace ) - . the s subunit of s mediates membrane fusion. the s/ace interaction mediates viral entry and provides an attractive target for vaccine-elicited humoral immunity , with antibodies potentially capable of either (i) directly blocking binding of ace by s, (ii) blocking conformational changes in s critical for membrane fusion, (iii) eliminating infected cells through antibody effector mechanisms such as antibody-dependent cellular cytotoxicity (adcc), or (iv) driving accelerated clearance of free virus. the dominant targets for human antibody against the sars-cov- s are unclear. some human mabs originally characterised against sars-cov s cross-react with sars-cov- . for example, cr binds a cryptic epitope on the rbd , , while s , derived from the memory b cells of a sars-cov recovered subject, blocks ace engagement by sars-cov- s .
a recent report of monoclonal antibodies recovered from sars-cov- convalescent donors revealed multiple non-overlapping epitopes on the rbd, with different capacities for mediating neutralisation . few neutralising epitopes localised outside the rbd have been characterised to date, with preliminary reports of neutralising epitopes within the n- immunogens in humans are poorly resolved. here we undertook an in-depth characterisation of humoral and cellular immunity against spike in humans who recovered from mild to moderate sars-cov- infection. we find antibody responses to both s and the rbd are consistently elicited following sars-cov- infection, the magnitude of which correlates with both plasma neutralising activity and inhibition of rbd/ace binding. s-specific b cells comprise a significant proportion of the circulating memory b cell pool following infection, with rbd-specific b cells constituting a minor subpopulation in most subjects. assessment of the circulating t follicular helper (ctfh) population reveals that s-specific ctfh cells are also readily detected in convalescent subjects, while t cell responses toward the rbd are significantly lower in frequency. finally, we find the development of comparatively high plasma neutralisation activity is associated not only with the magnitude of anti-s immune responses, but also with the phenotype of circulating tfh populations, suggesting these features may serve as attractive biomarkers for candidate s-based vaccines entering the clinic.

all rights reserved. no reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medrxiv a license to display the preprint in perpetuity. the copyright holder for this preprint this version posted may , . . https://doi.org/ . /
serological responses to spike antigens following sars-cov- infection

we recruited a cross-sectional cohort (n= ) of australian adults recovered from mild-moderate sars-cov- infection and isolated plasma and pbmc samples at a median of (iqr: - ) days post-positive pcr test. the cohort had a median age of (iqr: - ) and was % female ( of ). subjects reported mild to moderate upper and lower respiratory tract symptoms with only ( %) requiring hospitalisation, and none requiring mechanical ventilation (table s ) . a control cohort of healthy adults was recruited prior to widespread infection in australia (table s ) . as we had an interest in the degree to which baseline cross-reactive coronavirus immunity affected sars-cov- responses, we pre-screened the uninfected subjects for serological reactivity against the beta coronavirus hcov- hku (hku ) ( figure s ), selecting individuals with the highest and lowest plasma titres as controls for the study. serological profiles are presented stratified across the cohort based on neutralisation activity for each subject. antibodies binding the sars-cov- spike ( figure a ) or the rbd ( figure b) were consistently observed in all infected individuals by elisa, with minimal reactivity in the controls. titres of s- and rbd-specific antibody were highly correlated ( figure s ). consistent with previous reports , low-level antibody responses cross-recognising the sars-cov rbd were observed in most of our sars-cov- infected cohort ( figure c ). antibody responses to the human coronavirus strain hku were prevalent at moderate to high levels across the cohort, in line with previous reports of widespread seropositivity to s proteins of human coronaviruses in adults , ( figure d ). the capacity of immune plasma to block interaction between recombinant ace and rbd was assessed by elisa, with modest levels of inhibition detected in most subjects, and selected subjects exhibiting potent inhibitory activity ( figure e ).
virus neutralising activity in the plasma was similarly assessed using a microneutralisation assay with live sars-cov- infection of vero cells as previously described for . neutralising antibody titres ranged from to (iqr: - ) ( figure f ). in summary, antibody responses against both s and the rbd are consistently elicited in sars-cov- infected individuals, the endpoint titres of which correlate significantly with neutralising activity (r= . and r= . respectively) and ace binding inhibition (r= . and r= . respectively) in the plasma ( figure s ) .

b cell responses to s antigens following sars-cov- infection

we next examined the frequency and specificity of class-switched b cells in convalescent subjects using sars-cov- spike or rbd proteins as flow cytometric probes. clear antigen-specific populations of cd + igd - b cells (gating in figure s ) binding spike, spike and rbd or rbd alone could be resolved in our cohort of subjects recovered from sars-cov- infection, with minimal background staining in uninfected controls (figure a ; figure s ). frequencies of spike + rbd -, spike + rbd + and spike - rbd + b cells as a proportion of the cd + igd - population were a median . % (iqr . - . ), . % (iqr . - . ) and . % (iqr . - . ), respectively ( figure b ). the very low frequencies of spike - rbd + b cells likely constitute a mix of background staining and b cells that recognise rbd epitopes occluded in recombinant s or intact virus. immunoglobulin isotypes were determined for spike + rbd - and spike + rbd + populations using igm and igg surface staining, with igm - igg - class-switched b cells previously established to be almost exclusively iga + .
in our cohort sampled a median of days after symptom onset, the majority of spike + rbd - class-switched b cells were igg + (median . %; iqr . - . ), with smaller proportions displaying igm + ( . %; iqr . - . ) and iga + (igm - igg -) ( . %; iqr . - . ) ( figure c ). isotype distribution of spike + rbd + b cells was more variable due to low event counts, with median frequencies of . % . ) igg, . % (iqr - . ) igm and . % iga ). the activation phenotype of antigen-specific b cells was assessed using cd and cd surface staining ( figure s ). most spike + rbd - (median . %; iqr . - . ) or spike + rbd + ( . %; iqr . - ) class-switched b cells displayed a resting memory phenotype (cd + cd + ), also consistent with the median duration of infection. however, a significant proportion of activated memory b cells (cd -cd + ) was still evident for both spike + rbd - ( . %; iqr . - . ) and spike + rbd + b cell populations ( . %; , with only low proportions of cd -cd and cd + cd phenotypes observed. overall, sars-cov- infection efficiently elicits both s- and rbd-specific b cells in most subjects after recovery, which constitute a significant proportion of the memory b cell pool and are mostly igg + and of a resting memory phenotype. the rbd of sars-cov and sars-cov- share significant homology, but with marked diversity within the ace binding motif despite shared recognition of this cellular receptor , .
we examined whether differential staining with sars-cov and sars-cov- probes would allow more precise identification of b cells recognising the unique ace binding site of sars-cov- , to understand why some individuals had notable rbd-specific antibody titres but with limited neutralisation activity or rbd/ace binding inhibition. pbmcs from a subset of covid+ subjects (n= ) were stained with sars-cov- spike, sars-cov- rbd and sars-cov rbd probes as before ( figure d ). both sars-cov- rbd-specific and sars-cov/sars-cov- cross-reactive igg+ b cells could be resolved in most subjects across sars-cov- convalescent and uninfected donors ( figure s ).

antigen specificity of the ctfh population was determined using an activation induced marker (aim) assay in response to stimulation with sars-cov- spike or rbd proteins ( figure a ). overall, recovered subjects exhibited robust ctfh responses to the sars-cov- spike protein, with a median of . % spike-specific ctfh cells (iqr . - . ; figure b ). in contrast to the full spike, rbd-specific ctfh responses were significantly lower (p< . ), with a median of only . % of ctfh cells exhibiting rbd specificity (iqr . - . ). consistent with the high frequency of hku seropositivity among the convalescent cohort ( figure d ), ctfh responses to hku spike were detected among . % of donors (median . % of ctfh cells, iqr . - . ).
the frequency of hku -specific ctfh was generally higher among the convalescent cohort than the uninfected controls ( figure b comparison to seb-stimulated cells from a subset of donors confirmed that in vitro tcr stimulation does not preferentially activate or upregulate expression of ccr among the ctfh population ( figure d ). analysis of spike-specific non-ctfh cd memory (cd + cd + cd ra -cxcr -) cells revealed similar patterns of antigen reactivity to the ctfh compartment; namely, strong recognition of sars-cov- and hku spike proteins (median . % and . % of cd memory cells, respectively) and lower frequencies of rbd-specific t cells (median . % of cd memory cells) ( figure s ).

predictors of plasma neutralisation activity

the development of serological neutralisation activity will be a critical endpoint for upcoming sars-cov- vaccine trials. a co-correlation matrix of subject characteristics and immunological parameters was generated ( figure a ). this analysis highlighted broad co-correlation of many immune parameters related to s immunogenicity, namely antibody titres and the circulating frequencies of s-specific b and t cell populations. principal component analysis (pca) on immunological variables revealed clustering of the cohort into subjects with stronger and weaker plasma neutralisation activity ( figure b ). using a multiple regression approach, we identified titres of s-specific antibody and the proportion of s-specific ctfh with a th -like phenotype (ccr + cxcr -) as the two most significant predictive factors related to neutralisation activity ( figure c ).
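the multiple regression step can be illustrated with a minimal sketch. the text does not specify the exact model or software, so the example below assumes ordinary least squares with an intercept, solved through the normal equations in plain python; the function name and the toy predictor values are hypothetical and are not study data.

```python
def fit_linear_model(predictors, response):
    """Ordinary least squares with an intercept (assumed model form).

    `predictors` is a list of rows (one row per subject), `response` is
    the matching list of outcomes. Returns [intercept, coef_1, coef_2, ...].
    """
    design = [[1.0] + [float(v) for v in row] for row in predictors]
    k = len(design[0])
    # Build the augmented normal equations (X^T X | X^T y).
    system = [
        [sum(r[i] * r[j] for r in design) for j in range(k)]
        + [sum(r[i] * y for r, y in zip(design, response))]
        for i in range(k)
    ]
    # Gaussian elimination with partial pivoting.
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(system[r][col]))
        if abs(system[pivot][col]) < 1e-12:
            raise ValueError("predictors are collinear; model is not identifiable")
        system[col], system[pivot] = system[pivot], system[col]
        for row in range(col + 1, k):
            factor = system[row][col] / system[col][col]
            for c in range(col, k + 1):
                system[row][c] -= factor * system[col][c]
    # Back substitution.
    coefs = [0.0] * k
    for i in reversed(range(k)):
        tail = sum(system[i][j] * coefs[j] for j in range(i + 1, k))
        coefs[i] = (system[i][k] - tail) / system[i][i]
    return coefs


if __name__ == "__main__":
    # Toy data: columns stand in for two hypothetical predictors,
    # e.g. a log antibody titre and a ctfh subset proportion.
    x = [(1, 0), (0, 1), (1, 1), (2, 1), (3, 5)]
    y = [1 + 2 * a - 3 * b for a, b in x]
    print(fit_linear_model(x, y))
```

in practice the predictors would usually be standardised first so that coefficient magnitudes are comparable, and significance would be assessed with a statistics package rather than by hand.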
efficient elicitation of potent antibodies capable of neutralising viral entry is likely to be a critical feature of effective vaccines against sars-cov- . in the current study, we observed that neutralisation activity in the plasma of convalescent subjects ranged from potent to negligible, despite near universal detection of antibodies binding s and/or rbd, suggesting that qualitative aspects of the humoral immune response may be a critical consideration for vaccine development. direct assessment of key immunological events within the respiratory tract and draining lymphoid tissues is challenging in humans; however, assessing b and t cell immunity in more readily sampled blood can be informative. spike-specific class-switched b cells were expanded in nearly all infected subjects, with a predominantly igg + and resting memory phenotype consistent with the sampling time several weeks after the resolution of infection. b cells binding the rbd, which contains the ace interaction site, were markedly less frequent than s-specific b cells, and not detected at all in many subjects. combinatorial b cell staining with both sars-cov and sars-cov- probes enabled focused assessment of the uniquely variant epitope on the sars-cov- rbd that facilitates high affinity recognition of ace . a minority of sars-cov- rbd-specific b cells also recognise the sars-cov rbd, a finding consistent with the relative infrequency of sars-cov or mers-cov cross-reactive antibodies recovered from convalescent patients to date , .
we find the frequency of igd -igg + b cells that bound s and sars-cov- rbd, but not cells binding sars-cov rbd, tracked with serological rbd/ace binding inhibition but not with overall neutralising activity. overall, our data suggest that in some subjects, precise antibody recognition and blockade of the rbd ace -binding site is the principal pathway to generating neutralising antibody. however, the disconnect seen in many subjects between plasma neutralising titres and rbd-specific antibody, b and t cell responses, strongly suggests sufficient non-rbd epitope targets exist to constitute an alternative pathway to comparable virus neutralisation outcomes. to sars-cov- rbd were observed, which may reflect limited cd t cell epitopes given the small size of rbd. this has implications for rbd-based vaccine strategies, (ctfh ), consistent with responses to other neo-antigens such as ebola glycoprotein vaccines . however, despite this predominance, the relative proportion of s-specific ctfh (ccr + cxcr -) was negatively correlated with virus neutralisation activity. in contrast, increased frequencies of both ctfh (ccr -cxcr + ) and ctfh (ccr -cxcr -) were observed in subjects with the highest plasma neutralising activity. expansion of ctfh is well characterized following seasonal influenza immunisation, where peak frequencies in the blood correlate with both plasmablast expansion and subsequent serum neutralising antibody titres , , . similarly, bias toward cxcr + phenotypes is reported for antigen-specific ctfh in many chronic infections , . the functional significance of cxcr + ctfh during sars-cov- infection is currently unclear, but may reflect differences in lymph node tfh activity or egress from the gc.
the impact of widespread pre-existing immunity to human coronaviruses ( e, nl , hku , oc ) upon the responses to sars-cov- infection is an open question. here we found serum antibody against hku was widely prevalent, consistent with the high seroprevalence rates in adults reported previously , . however, we see no evidence of hku-specific immunity modulating binding or neutralising titres against sars-cov- antigens. our data suggest cd t cell responses to hku may be boosted following sars-cov- infection, possibly via recognition of conserved epitopes within the s domain . the predominantly ccr + phenotype of sars-cov- and hku -specific ctfh may reflect a coronavirus-specific tfh response, but further epitope mapping is required to deconvolute the contribution of hku memory responses or recently boosted sars-cov- cross-reactivity. there is understandably considerable scientific interest in predicting the biogenesis of protective immunity against sars-cov- , of which neutralising antibodies against s are likely to be consequential. although the current study is limited by cohort size, we find that concomitant factors demarking robust humoral immunity, namely increased

the study protocols were approved by the university of melbourne human research ethics committee (# ) and all associated procedures were carried out in

subjects who had recovered from covid and healthy controls were recruited through contacts with the investigators and invited to provide a blood sample.
subject characteristics of sars-cov- convalescent subjects are collated in table s .

a set of proteins was generated for serological and flow cytometric assays. the ectodomain of sars-cov- (isolate whu ; residues - ) or hcov-hku s protein (isolate n ; residues - ) were synthesised with furin cleavage site removed and p / stabilisation mutations , a c-terminal t trimerisation domain, avitag and his-tag, expressed in expi cells and purified by ni-nta affinity and size-exclusion chromatography using a superose / column (ge healthcare) ( figure s ). sars-cov s was biotinylated using bir-a (avidity). the sars-cov- rbd with a c-terminal his-tag (residues - ; kindly provided by florian krammer) was similarly expressed and purified. sars-cov rbd (residues n -p ) with a c-terminal avitag and his-tag, was expressed in expi cells and purified by ni-nta, biotinylated using bir-a (avidity) and purified by size-exclusion chromatography using a s- superdex (ge healthcare). the human (residues - ) and mouse (residues - ) ace ectodomain with c-terminal his-tag (kindly provided by merlin thomas) were expressed in expi cells and purified using ni-nta and size-exclusion chromatography ( figure s ). antigenicity of coronaviral proteins was assessed by binding to immune sera, anti-rbd mabs cr and b, or human and mouse ace ( figure s ). the glycosylation profile of recombinant s proteins ( figure s ) was assessed using mass spectrometry as previously described by sp protein clean up and trypsin in-solution digestion.
purified peptides were desalted then separated using a two-column chromatography set up comprising a pepmap c mm × μm trap and a pepmap c mm × μm analytical column on dionex ultimate uplc (thermofisher). samples were concentrated onto the trap column at μl/min with buffer a ( % acetonitrile, . % formic acid) for min and infused into a q-exactive™ plus mass spectrometer (thermofisher) at nl/min via the analytical column. min gradients were used altering the buffer composition from % buffer b ( % acetonitrile, . % formic acid) to % b over min, then from % b to % b over min, then from % b to % b over min, the composition was held at % b for min, and then dropped to % b over min and held at % b for another min. the q-exactive™ plus mass spectrometer was operated in a data-dependent mode automatically switching between the acquisition of a single orbitrap ms scan ( , resolution, agc of × ) followed by data-dependent hcd ms events tolerance of ± ppm was allowed for hcd ms scans. searches were performed

washed and developed using tmb substrate (sigma), stopped using sulphuric acid and read at nm. endpoint titres were calculated as the reciprocal serum dilution giving signal x background using a fitted curve ( parameter log regression).

an elisa was performed to measure the ability of plasma antibodies to block interaction between recombinant human ace and rbd proteins. -well maxisorp plates (thermo fisher) were coated overnight at °c with µg/ml of recombinant rbd protein in carbonate-bicarbonate coating buffer (sigma). after blocking with
sars-cov- isolate cov/australia/vic / was passaged in vero cells and stored at - °c. plasma was heat-inactivated at °c for min. plasma was serially diluted : to : before addition of tcid of sars-cov- in mem/ . % bsa and incubation at room temperature for hour. residual virus infectivity in the plasma/virus mixtures was assessed in quadruplicate wells of vero cells incubated in serum-free media containing µg/ml tpck trypsin at °c/ % co ; viral cytopathic effect was read on day . the neutralising antibody titre was calculated using the reed/muench method as previously described , .

probes for delineating sars-cov- s-specific b cells within cryopreserved human pbmc were generated by sequential addition of streptavidin-pe (thermofisher) to trimeric s protein biotinylated using recombinant bir-a (avidity). biotinylated sars-cov rbd was similarly conjugated to streptavidin-bv (bd). sars-cov- rbd protein was directly labelled to apc using an apc conjugation lightning-link kit (abcam). cells were stained with aqua viability dye (thermofisher). monoclonal antibodies for surface staining included: cd -ecd (j - ) (beckman

acknowledgements we thank the generous participation of the trial subjects for providing samples. the sars-cov- rbd expression plasmids were kindly provided by florian krammer, mt sinai school of medicine, ny, usa. the human and mouse ace expression
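the reed/muench endpoint calculation named in the microneutralisation method can be sketched as follows. this is a minimal illustration under stated assumptions: the function name, the dilution series and the well counts are hypothetical, and only the use of quadruplicate wells comes from the method description.

```python
import math


def reed_muench_titre(reciprocal_dilutions, protected_wells, wells_per_dilution=4):
    """Estimate the 50% neutralisation titre by the Reed-Muench method.

    `reciprocal_dilutions` must be in ascending order (most concentrated
    plasma first); `protected_wells` gives the number of wells without
    cytopathic effect at each dilution. Returns the reciprocal dilution
    at which 50% of wells are estimated to be protected.
    """
    n = len(reciprocal_dilutions)
    unprotected = [wells_per_dilution - p for p in protected_wells]
    # A well protected at a high dilution is assumed protected at all
    # lower dilutions, so protected counts accumulate from the dilute end.
    cum_protected = [sum(protected_wells[i:]) for i in range(n)]
    # Unprotected counts accumulate from the concentrated end.
    cum_unprotected = [sum(unprotected[: i + 1]) for i in range(n)]
    percent = [
        100.0 * cp / (cp + cu) for cp, cu in zip(cum_protected, cum_unprotected)
    ]
    for i in range(n - 1):
        if percent[i] >= 50.0 > percent[i + 1]:
            # Proportionate distance between the two bracketing dilutions.
            distance = (percent[i] - 50.0) / (percent[i] - percent[i + 1])
            step = math.log10(reciprocal_dilutions[i + 1] / reciprocal_dilutions[i])
            return 10 ** (math.log10(reciprocal_dilutions[i]) + distance * step)
    raise ValueError("the 50% endpoint is not bracketed by the dilution series")


if __name__ == "__main__":
    # Illustrative two-fold series read in quadruplicate wells.
    print(reed_muench_titre([10, 20, 40, 80], [4, 3, 1, 0]))
```

the interpolation is done on the log of the dilution, which is why the step between neighbouring dilutions enters as a log10 ratio.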
blood are clonally convergent but divergent from non-tfh cd (+) cells. cell reports , - .e ( ).

glycoproteomics. journal of proteome research , - ( ) .

caly, l., et al. isolation and rapid sharing of the novel coronavirus (sars-cov- ) from the first patient diagnosed with covid- in australia. med j aust ( ).

figure . specificity of ctfh responses to coronavirus spike proteins (a) representative staining of cd and cd co-expression on ctfh (cd + cd + cd ra -cxcr + ) cells following stimulation with µg/ml bsa (negative control), sars-cov- spike, sars-cov- rbd or hcov hku protein, or seb (positive control). (b) antigen-specific ctfh (n= sars-cov- negative, n= sars-cov- positive donors) frequencies were calculated as the proportion of cd + cd + ctfh cells in each stimulation condition after background subtraction using the negative control. (c) representative expression of ccr and cxcr on bulk ctfh, sars-cov- spike-specific, hcov hku spike-specific or seb-responsive (cd + ox- + ) ctfh.
(d) quantification of ccr + cxcr -, ccr + cxcr + , ccr -cxcr + or ccr -cxcr -ctfh populations among sars-cov- positive donors (n= ).

lymphocytes were identified by fsc-a vs ssc-a gating, followed by doublet exclusion (fsc-a vs fsc-h), and gating on live cd + b cells. class-switched b cells were identified as igd -, and surface isotype resolved by staining for igm or igg, with the double negative population (igm -/igg -) previously established as predominantly iga. binding to sars-cov- spike (s) and/or sars-cov- rbd probes was assessed for each population.

figure s . representative staining of s- and rbd-specific igd -igg + b cells in uninfected subjects (left panels) and subjects after recovery from sars-cov- infection (middle and right panels). cd + igd -igg + b cells were identified using the gating strategy shown in figure s . binding to sars-cov- spike (s) and/or sars-cov- rbd probes was assessed.

figure s . memory b cell phenotypes in subjects after sars-cov- infection (a) representative memory b cell phenotypes identified by cd and cd co-stain of probe + cd + igd - cells (blue) overlaid on cd + igd - cells (grey) and (b) the corresponding frequencies of the four populations in subjects previously infected with sars-cov- (resting memory -cd + cd + ; activated memory -cd -cd + ; naïve/cd lo memory -cd + cd -; atypical b cells -cd -cd -); n.d -not detected due to absent probe + cells.

figure s . gating strategy for resolving spike + cd + igd -igg + b cells specific for sars-cov- and sars-cov rbd (a) lymphocytes were identified by fsc-a vs ssc-a gating, followed by doublet exclusion (fsc-a vs fsc-h), and gating on live cd + b cells. igd -igg + b cells were gated and assessed for binding to sars-cov- spike. cross-reactive specificities versus those unique to sars-cov- were discriminated by co-staining with sars-cov- and sars-cov rbd probes. (b) representative staining shown for subjects with prior sars-cov- infection.

figure s .
gating strategy for ctfh and memory cd + t cell subsets. lymphocytes were identified by fsc-a vs ssc-a gating, followed by doublet exclusion (fsc-a vs fsc-h gate), and exclusion of dead or cd + cells. t cells were identified as cd + cd -. following exclusion of gamma delta t cells by vd /vd tcr staining, cd + cd -t cells were identified. memory cd + t cells were defined as cd ra -cxcr -, while ctfh cells were defined as cd ra -cxcr + . ctfh cells were further characterized by pd- and ccr /cxcr expression. ( . %) severe -no. (%) ( . %) * subjects had a compatible illness and history of exposure but did not have a positive nasal swab ** illness severity was classified as: mild: prominent upper respiratory tract symptoms and not hospitalised. moderate: prominent lower respiratory tract symptoms and not hospitalised. severe: prominent lower respiratory tract symptoms and requiring hospital care. key: cord- -wyx ib s authors: sinegubova, maria v.; orlova, nadezhda a.; kovnir, sergey v.; dayanova, lutsia k.; vorobiev, ivan i title: high-level expression of the monomeric sars-cov- s protein rbd - in stably transfected cho cells by the eef a -based plasmid vector date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: wyx ib s the spike (s) protein is one of the three proteins forming the coronaviruses’ viral envelope.
the s protein of the severe acute respiratory syndrome coronavirus (sars-cov- ) has a spatial structure similar to the s proteins of other mammalian coronaviruses, except for a unique receptor-binding domain (rbd), which is a significant inducer of host immune response. recombinant sars-cov- rbd is widely used as a highly specific minimal antigen for serological tests. correct exposure of antigenic determinants has a significant impact on the accuracy of such tests – the antigen has to be correctly folded, contain no potentially antigenic non-vertebrate glycans, and, preferably, should have a glycosylation pattern similar to the native s protein. based on the previously developed p . vector, containing the regulatory sequences of the eukaryotic translation elongation factor alpha gene (eef a ) from chinese hamster, we created two expression constructs encoding sars-cov- rbd with c-terminal c-myc and polyhistidine tags. rbdv contained a native viral signal peptide, rbdv – human tpa signal peptide. we transfected a cho dg cell line, selected stably transfected cells, and performed a few rounds of methotrexate-driven amplification of the genetic cassette in the genome. for the rbdv variant, a high-yield clonal producer cell line was obtained. we developed a simple purification scheme that consistently yielded up to mg of rbd protein per liter of the simple shake flask cell culture. purified proteins were analyzed by polyacrylamide gel electrophoresis in reducing and non-reducing conditions and gel filtration; for rbdv protein, the monomeric form content exceeded % for several series. deglycosylation with pngase f and mass spectrometry confirmed the presence of n-glycosylation. the antigen produced by the described technique is suitable for serological tests and similar applications. 
humanity faces an unprecedented challenge: the severe acute respiratory syndrome coronavirus (sars-cov- ), which causes the severe respiratory illness coronavirus disease (covid- ), has spread as a pandemic. countries were sent into lockdown; people could not make informed decisions about the possibility of social contacts; the need for diagnostic tests is very high. existing tests for sars-cov are reviewed in [ ] . at the beginning of the pandemic, pcr testing methods dominated, since such test systems can be developed urgently, soon after the emergence of a new virus in the population. the disadvantages of pcr tests include high sensitivity to contamination, dependence on correct sampling, and a high proportion of false-positive signals. unlike pcr diagnostics, serological testing gives positive results long after the event of infection, for at least several months. this testing method makes it possible to reliably determine whether a person has been infected with sars-cov- , even in the absence of disease symptoms. serological tests are needed both in express format and as elisa-based screening tests. they are also needed to detect convalescent plasma of therapeutic interest and to assess the effectiveness of emerging vaccines. for serological testing to have greater predictive value, the epitopes targeted by neutralizing antibodies should be mapped, as was done for sars-cov [ ] , and convalescent or postvaccinal sera should be tested on a mass scale for the presence of neutralizing antibodies, for example, with a surrogate virus neutralization test based on antibody-mediated blockage of ace -spike protein-protein interaction [ ] or another test that can be carried out on a relatively large scale. the use of highly specific and high-affinity viral antigens is already a big step towards improving diagnostic accuracy. the immunodominant antigen of sars-cov is the rbd domain of the spike protein [ ] .
another antigen widely used for diagnostics -the nucleocapsid (n) protein -combines high sensitivity and low specificity; therefore, it needs accurate antigen mutagenesis to remove highly conserved areas without compromising affinity. cases are described for sars-cov when the results of testing with n-protein were clarified using two subunits of spike protein [ ] . the coronaviruses' spike (s) protein forms large coronal-like protrusions on the virions surface, hence the name of the family coronaviridae. the s protein plays a crucial role in receptor recognition, cell membrane fusion, internalization of viruses, and their exit from the endosomes. it is described in detail in the review [ ] . it consists of s and s subunits and, in the case of the sars-cov- virus, has amino acids [ ] . the s protein is co-translationally incorporated into the rough endoplasmic reticulum (er) and is glycosylated by n-linked glycans. glycosylation is essential for proper folding and transport of the s protein. the s protein trimer is transported from the er. interacting with the m and e proteins s protein trimer is transported to the virus's assembly site. s protein is required for cell entry but not necessary for virus assembly [ ] . during their intracellular processing, s proteins of many types of coronaviruses, including sars-cov- and mers-cov, but not sars-cov, undergo partial proteolytic degradation at the furin signal protease recognition site with the formation of two subunits s and s . apparently, most of the s protein copies on the membrane of sars-cov- viral particles are trimers of s subunits that are incapable of interacting with the receptor. the full-length s protein trimer on the viral particle's surface also undergoes complex conformational rearrangements during the formation of the rbd-receptor complex and the virus's penetration into the cell. the s protein homotrimer binds to the ace dimer, detailed study of this interaction is available here [ ] . 
as part of the trimer, the spike protein's monomer "moves its head" -the s subunit can form the open or closed conformation; that is, it can have a raised fragment or a lowered rbd domain, this can influence the affinity of antibodies targeted to it. the s protein of sars-cov- amino acid sequence is variable, with more than and relatively frequent s protein amino acid variations. a glycan shield is formed by n-linked glycans on the s protein surface, which is likely to help viral immune escape. in a comparative study of genome-wide sequencing data of natural isolates of sars-cov- [ ] for the detected variants of the s protein, all potential nglycosylation sites within the s protein's ectodomain were completely conserved, which confirms the importance of each of these sites for maintaining the integrity of the s protein oligosaccharide envelope. it should be noted that not all of the potential n-glycosylation sites are occupied, for s and s subunits, obtained from transiently transfected hek cells [ ] n-glycosylation events were experimentally confirmed only for out of sites, also at least one o-glycosylation site was experimentally found inside the rbd-domain area of the s subunit with the mucin-like structures. non-vertebrate cells may be used to produce the s protein or its fragments; in this case, n-glycans are present mostly in the form of bulky high mannose or paucimannose structures, possibly blocking the interaction of antibodies with the folded s protein [ ] . computational modeling of the glycan shield, performed for the hek -derived s protein, revealed that in the case of human cells, around % of the protein's surface is effectively shielded from igg antibodies [ ] . the use of full-length s protein for practical serological testing is nearly impossible due to its insolubility, caused by the presence of transmembrane domain. 
an artificial trimer of its ectodomain has been successfully used as an antigen in serological tests; however, such complex protein cannot be obtained in large quantities in mammalian cells, apparently due to the limitation on the folding of the trimerized abundantly glycosylated protein and subsequent difficulties in its isolation and purification. it is generally believed that the sars-cov- s protein receptor-binding domain is a minimal proteinaceous antigen, adequately resembling the immunogenicity of the whole spike protein. this domain contains only two occupied n-glycosylation sites [ ] and - occupied o-glycosylation sites. it does not contribute to the trimer formation, and its surface is mostly unshielded. isolated rbd's of the s proteins of beta-coronaviruses were produced in various expression systems. bacterial expression of the rbd from mers-cov produced no soluble target protein, refolding attempts also were unsuccessful [ ] . budding yeasts pichia pastoris were the suitable host for the secretion of mers-cov rbd with at least two (from three) n-linked glycosylation sites present. similar data were obtained for the rbd from sars-cov virus -removal of all n-glycosylation sites resulted in the sharp drop of protein secretion rate in the p. pastoris yeast, in the case of full rbd domain (residues - ), secretion of the unglycosylated target protein was stopped completely [ ] . it may be proposed that the addition of n-glycans in these sites is needed for correct folding of the rbd in the er of eukaryotic cells. the sars-cov- s protein rbd, expressed in e.coli, also was detected only as inclusion bodies and was found to be unreactive even on blotting [ ] . hyperglycosylated yeast-derived sars-cov- rbd was obtained in reasonable quantities ( mg/l in bioreactor culture) by the p. pastoris expression system and successfully used for mice immunization [ ] . 
unfortunately, yeast-derived glycosylated proteins contain immunogenic glycans and cannot be used for immune assays with human antibodies. similarly, sars-cov- rbd may be produced in the nicotiana benthamiana plant, resulting in non-vertebrate n-glycans addition, potentially reactive with human antibodies [ ] . most early preprints and peer-reviewed articles describing the sars-cov- s protein and its rbd domain production methods were focused on transient transfection of hek cells [ ] [ ] and purification of small protein lots in a very short time. for example, d. stadlbauer [ ] reports more than mg/l target protein titer in transiently transfected hek- cells. simultaneously, the scalability of transiently transfected cell lines cultivation is still questionable, and gram quantities of rbd, needed for large scale in vitro diagnostic activity, may be produced only by stably transfected cell lines. previously we have developed the plasmid vector p . , containing large fragments of non-coding dna from the eef a gene of the chinese hamster and fragment of the epstein-barr virus long terminal repeat concatemer [ ] and employed it for unusually high-level expression of various proteins in cho cells, including blood clotting factors viii [ ] , ix [ ] , and heterodimeric follicle-stimulating hormone [ ] . cho cells were successfully used for transient sars-cov rbd expression at mg/l secretion level [ ] . we have proposed that sars-cov- rbd, suitable for in vitro diagnostics use, may be expressed in large quantities by stably transfected cho cells, bearing the eef a -based plasmid. p . -tr -rbdv construction. the rbd coding sequence was synthesized according to [ ] . the dna fragment encoding the rbdv orf with kozak consensus sequence and c-terminal c-myc and xhis tags were obtained by pcr using primers ad-cov-absf and ad-rbd-myc hnher (listed in table the resulting ptm vector was sequenced as described above, available from addgene, plasmid # . ptm-rbdv construction. 
rbd orf was amplified using adaptor primers ad-sfr -nhef and ad-sfr -xmar restricted by nhei and xmai (sibenzyme, novosibirsk, russia) and cloned into ptm vector, restricted by nhei and asigi (sibenzyme, novosibirsk, russia). the resulting construct was sequenced using sq- ch -f and sq-mych-r primers. ptm-rbdv is available from addgene, plasmid # plasmids for cell transfections were purified by the plasmid midiprep kit (evrogen, moscow, russia) and concentrated by ethanol precipitation in sterile conditions. the transgene copy number in the cho genome was determined by the quantitative real-time-pcr (qpcr) as described in [ , ] . serial dilutions of p . -egfp [ ] or pgem-rab plasmids were used for calibration curves generation. the weight of one cho haploid genome was taken as pg, according to [ ] . genomic chinese hamster ovary dg- cells (thermo fischer scientific) were cultured in the procho medium (lonza, switzerland), supplemented by mm glutamine, mm alanyl-glutamine and hypoxanthinethymidine supplement (ht) (paneco, moscow, russia). cells were grown as a suspension culture in sterile ml erlenmeyer flasks with vented caps, routinely passaged to days with centrifugation ( g, min) and seeding density - * cells/ml. the - µg of each plasmid were precipitated by the addition of % ethanol and m sodium acetate, washed with % ethanol, dried, and resuspended in µl of sterile r-buffer, neon transfection kit (thermo seeding cell culture was grown in ml erlenmeyer shake flasks with ml of lonza procho medium, supplemented with mm glutamine, mm alanyl-glutamine and - µm mtx until cell concentration exceeds - . mln cells/ml. cell suspension was transferred to four ml erlenmeyer flasks, each containing ml of culture medium, and grown to the same cell density. the entire cell suspension was transferred to a single l erlenmeyer flask with l culture medium, final seeding density - * cells/ml. 
cells were cultured for three days, on the fourth day of culture, daily glucose measurements were started. glucose concentration in the cell supernatant was measured by the accutrend plus system (roche, switzerland); if glucose level was below mm, it was added up to mm as the sterile % solution. the culture in l flask was grown for to days until the cell viability, measured by trypan blue exclusion, dropped below %. the clonal cell line was obtained by the limiting dilution method from the cell population, cultured in µm mtx. methotrexate was omitted in the culture medium for two d passage before cloning. cells were additionally split by : dilution hours before the cloning procedure. cells were diluted in excell-cho (merck, germany) culture medium supplemented with mm glutamine, mm alanyl-glutamine, ht and % of untransfected cho dg conditioned medium resulting in seeding density . cell/well, and the suspension was seeded into -well plates ( μl/well). plates were left undisturbed for days at °c, % co atmosphere. wells with single colonies were screened by microscopy; well grown colonies were detached by pipetting and transferred to the wells of -well plate, containing ml of the excell-cho, supplemented as described above and grown for days undisturbed. product titer was measured by elisa, as described below, wells with highest rbdv titer were used for further cultivation. best-producing clonal cell lines were transferred to ml erlenmeyer flasks with the procho culture medium supplemented with mm glutamine, mm alanyl-glutamine and µm mtx and after days in suspension culture, the best producing clone was determined by measuring the product titer and cell concentration. sds-page was performed with the . % acrylamide in the separating gel, in reducing conditions, if not stated otherwise, with the pageruler prestained marker, µl/lane (thermofisher scientific). 
gels were stained by the colloidal coomassie blue according to [ ] , scanned by the conventional flatbed scanner in the transparent mode as -bit grayscale images and analyzed by the totallab tl gel densitometry software (nonlinear dynamics, uk). sds-page was performed as described above, protein transfer, blocking, hybridization and color development were done according to [ ] using nitrocellulose transfer membrane (gvs group, bologna, italy) and towbin buffer with methanol. primary anti-c-myc antibody (sci store, moscow, russia #psm - ) was used at the : dilution, anti-mouse-hrp conjugate (abcam, cambridge, uk, ab ) was used at : dilution; membrane was developed by the dab-metal substrate and scanned by the flatbed scanner in the reflection mode. multimeric forms of the rbd were quantified by size exclusion chromatography, utilizing waters extracts were vacuum-dried and redissolved in the . % trifluoroacetic acid (tfa), % acn solution. prepared solutions were mixed at : ratio with % α-cyano- -hydroxycinnamic acid (merck) solution in % acn, . % tfa on the target plate. solutions of intact and deglycosylated proteins were passed through the ziptip c microcolumns (millipore), washed and eluted according to manufacturer protocol. one and a half µl of protein solutions were mixed on the target plate with . µl of the % , -dihydroxybenzoic acid (merck) solution in % acn, . % tfa. mass spectra were obtained by the maldi-tof mass spectrometer ultraflextreme peptides identification was performed by the gpmaw . software (lighthouse data, denmark) and by the mascot server (matrix science, boston, usa). glycopeptides mass assignment was performed by the glycomod online software tool [ ] . sandwich elisa with anti-s protein antibodies was performed using a prototype of the sars-cov- antigen detection kit (xema co., ltd., moscow, russia, a generous gift of dr. yuri lebedin). 
pre-covid- normal human plasma sample (renam, moscow, russia) was used for preparation of the sars-cov- negative serum sample. serum samples of five patients with the pcr-confirmed sars-cov- infection were pooled for testing and one serum sample with the borderline igg titer level was tested separately. the blood sampling protocol conformed to the local hospital human ethics committee guidelines. antibody capture elisa with human serum samples was performed according to [ ] at the ng per well antigens load. antigens were applied on elisa -well plates (corning, usa) overnight at + oc, in pbs, the t-test was performed using the graphpad quickcalcs web site: https://www.graphpad.com/quickcalcs/ttest .cfm (accessed november ). the native n-terminal signal peptide of sars-cov- s protein (amino acid sequence mfvflvllplvssq) was fused to the rbd sequence ( - , according to yp_ . ) and joined with a c-terminal c-myc epitope (eqkliseedl), short linker sequence, and hexahistidine tag. n-terminal part of the rbdv gene was constructed according to [ ] , utilizing the optimized codon usage gene structure. c-terminal tags were not optimized for codon usage frequencies. the resulting synthetic gene was cloned into the p . -tr vector plasmid, a shortened derivative of the p . plasmid [ ] , and used for transfection of dhfr-deficient cho dg cells. the resulting expression plasmid p . -tr -rbdv [genbank: mw ] is shown on fig a. the stably transfected cell population was obtained by selection in the presence of nm of dhfr inhibitor methotrexate, rbd titer . mg/l was detected for -days culture (fig. s ). one-step target gene amplification was performed by increasing the mtx concentration tenfold and maintaining the cell culture for days until cell viability restored to more than %; the resulting polyclonal cell population could secrete up to , mg/l rbd in the -days culture. 
the target protein was purified by a single imac step, utilizing the ida-based resin chelating sepharose fast flow (cytiva), ni + ions, and step elution by increasing imidazole concentrations (fig b, fig c) . the resulting protein production method was found to be sub-optimal due to an unexpectedly low secretion rate, signs of cellular toxicity of the target gene ( - h cell duplication time, maximal cell density in shake flask of . mln cells/ml; fig s ), and an unacceptable level of contaminant proteins co-eluting with the rbdv . at the same time, the rbdv protein was stable in the culture medium during the extended batch cultivation of cells for at least days (fig d) , making long-term fed-batch cultivation a viable option for its production in large quantities. we hypothesized that the target protein secretion rate and its purity after one-step purification could be significantly improved by four simultaneous changes: a shift of the rbd domain boundaries, exchange of the sars-cov- s protein native signal peptide for the signal peptide of a more abundantly expressed protein, two-step genome amplification, and a switch from the ida-based resin to an nta-based one (fig e) . human tissue plasminogen activator signal peptide (htpa sp, amino acid sequence mdamkrglccvlllcgavfvsas) is commonly used for heterologous protein expression in mammalian cells. it was successfully used for the expression of sars-cov s protein in the form of a dna vaccine [ ] and of the envelope viral protein gp [ ] . in the case of the mers-cov s protein rbd -fc fusion protein, various heterologous signal peptides modulate the target protein secretion rate by a factor of two [ ] . corrected boundaries of the sars-cov- rbd were determined according to the cryo-em data [pdb id: vxx] [ ] obtained for the trimeric sars-cov- s protein ectodomain.
initially used - coordinates, described in the [ ] include one unpaired cys residue originated from the n-terminal part of the next domain sd (structural domain ), so we excluded lys from the n-terminus of the mature rbd protein, aiming at the maximization of signal peptide processing, and removed c-terminal aminoacids c vnf , which form the structure of the sd domain. both linker areas surrounding the folded rbd domain core remain present in the rbdv protein ( - , according to yp_ . ). additionally, we redesigned c-terminal tags by introducing the pro residue immediately upstream of the c-myc tag, adding the short linker sequence sagg between the c-myc tag and polyhistidine tag, and extending the polyhistidine tag up to residues. we expected this structure to expose the c-myc tag properly on the protein globule's surface and move the decahistidine tag away from possible masking negatively charged protein surface areas. we constructed an expression vector ptm [genbank: mw ], where consensus kozak sequence, htpa sp and c-myc and -histidine tags are coded in the polylinker. rbd coding fragment was cloned in-frame, resulting ptm-rbdv expression plasmid [genbank: mw ] is shown on fig. a . cho dg cells were transfected by the ptm-rbdv plasmid, stably transfected cell population was established at the nm mtx selection pressure. target protein titer was similar to the previous plasmid design - . mg/l for -days culture, but after one step of the mtx-driven genome amplification, it increased eleven-fold to . mg/l at µm mtx ( fig b) and then increased by a factor of . after second amplification step at µm mtx, resulting titer was . mg/l for -days culture (fig a) . a steady increase of the target protein titer was detected for the extended batch cultivation of polyclonal cell population obtained at µm mtx, peaking at mg/l at days of cultivation in the l shake flask (fig d, e) . 
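the redesigned c-terminal tag layout described above (pro residue, c-myc epitope, sagg linker, decahistidine tag) and the htpa signal peptide can be written out as a short python sketch. this is an illustration only, not the authors' cloning workflow: the function names are hypothetical, the rbd fragment is passed in as a parameter because its sequence is not reproduced here, and the sketch assumes that no extra residues sit between the rbd fragment and the pro residue. the cysteine counter mirrors the reasoning in the text that an odd number of cys residues leaves one of them unpaired.

```python
"""Sketch of an RBDv2-style precursor layout: signal peptide + RBD + C-terminal tags."""

# Sequences quoted in the text, written as upper-case one-letter codes.
HTPA_SIGNAL_PEPTIDE = "MDAMKRGLCCVLLLCGAVFVSAS"  # human tPA signal peptide
C_MYC_TAG = "EQKLISEEDL"                          # c-myc epitope
LINKER = "SAGG"                                   # linker between c-myc and His tag
HIS_TAG_LENGTH = 10                               # decahistidine tag


def build_c_terminal_tag(his_length: int = HIS_TAG_LENGTH) -> str:
    """Return Pro + c-myc epitope + SAGG linker + polyhistidine tag."""
    return "P" + C_MYC_TAG + LINKER + "H" * his_length


def assemble_precursor(rbd_core: str,
                       signal_peptide: str = HTPA_SIGNAL_PEPTIDE) -> str:
    """Concatenate signal peptide, RBD fragment and C-terminal tags.

    Assumption: nothing is inserted between the RBD fragment and the
    Pro residue that precedes the c-myc tag.
    """
    return signal_peptide + rbd_core.upper() + build_c_terminal_tag()


def count_cysteines(sequence: str) -> int:
    """Count Cys residues; an odd count leaves one Cys without a partner."""
    return sequence.upper().count("C")


if __name__ == "__main__":
    # Made-up stand-in for the RBD fragment, NOT the real sequence.
    placeholder_rbd = "ACDEFGHIKC"
    mature = placeholder_rbd + build_c_terminal_tag()
    print("C-terminal tag:", build_c_terminal_tag())
    print("precursor length:", len(assemble_precursor(placeholder_rbd)))
    print("even Cys count in mature protein:", count_cysteines(mature) % 2 == 0)
```

with a real rbd fragment in place of the placeholder, the same cysteine check could be run on both sets of domain boundaries to compare them; the signal peptide is left out of that check because it is removed during secretion.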
a similar ratio of product titer increase after multi-step mtx-driven genome amplification was described for the mers-cov rbd: a -fold increase after steps of consecutive increments of mtx concentration, with an overall amplification period of days [ ] . vector plasmid ptm, used in this study, allowed a much more rapid amplification course: a -fold titer increase in two steps, days total. all cell populations secreting rbd proteins were analyzed by quantitative pcr, and the increased productivity of populations adapted to higher concentrations of mtx was found to correspond to higher copy numbers of the target gene (fig c) . higher cell productivity in the case of the rbdv protein was not due to higher target gene copy numbers than in the case of rbdv . the cell culture medium pro cho (lonza), utilized in this study, contains unknown components that block the interaction of the his-tagged rbd protein with the ni-nta chromatography resin. clarified conditioned medium, used for protein purification, was therefore concentrated approximately tenfold by tangential flow ultrafiltration on the kda mwco cassettes and completely desalted by diafiltration with diafiltration volumes of the mm imidazole-hcl, ph . solution. rbdv and rbdv proteins were purified by imac utilizing ni-nta agarose (thermo fisher scientific, usa) in the same conditions. desalted conditioned medium was applied onto the column in the presence of mm imidazole; the column was washed; elution was performed by the mm imidazole solution; a further column strip by the mm edta-na solution revealed no detectable target protein rbdv in the eluate (fig c) . purified proteins were desalted by another round of ultrafiltration/diafiltration on centrifugal concentrators with kda mwco membranes; the diafiltration solution was pbs; the final concentration was - mg/ml. purified proteins were flash-frozen in liquid nitrogen and stored frozen in aliquots.
overall protein yield for rbdv was %, mg of purified rbdv were obtained from l shake flask culture. the apparent molecular weight of intact rbdv was determined as . kda, deglycosylated rbdv - . kda, theoretical molecular weight - da. rbdv molecular weight was determined as . kda for the intact protein, deglycosylated protein - . kda, theoretical molecular mass - da (fig a) . both protein variants possess two distinct forms of intramolecular disulfide bonds sets, visible as two closely adjacent bands in non-reducing conditions and complete absence of such band pattern in reducing conditions. previously it was reported that sars-cov- rbd - , expressed transiently in hek- cells, tends to form a covalent dimer, around % from the total, visible as the kda band on the denaturing gel in nonreducing conditions [ ] . we confirmed this observation; in the case of stably transfected cho cells, covalent dimerization was also % according to gel densitometry data. at the same time, it should be noted that the rbdv protein, redesigned explicitly for mitigation of this unwanted dimerization and containing an even number of cys residues, still forms % of the covalent dimer. purified rbdv was tested by size exclusion chromatography. the major monomer form's apparent molecular weight was determined as . kda (fig s ) , admixtures peaks apparent molecular masses corresponded well to rbd dimer, tetramer, and two high molecular mass oligomers accounting for % of all peak areas (fig b) . mass-spectrometry analysis of rbdv and rbdv revealed that both proteins' molecular masses diminished ( fig s , s ). this long peptide was completely absent in both spectra of de-glycosylated proteins (table s -s ) . a more detailed analysis of this area of the rbd protein may be of some interest for the s protein structure-function investigation but is out of scope for the present study. 
purified rbd variants were used as antigens for microplate coating and subsequent direct elisa with pooled sera obtained from patients with an rt-pcr-confirmed covid- diagnosis, a weakly positive serum sample from an rt-pcr-confirmed covid- patient, and a serum sample obtained from a healthy volunteer before december (fig e) . both rbd variants perform equally: all serum samples produce highly similar od readings for all dilutions tested with both antigens. here we describe a method of generating stably transfected cho cell lines that secrete large quantities of monomeric sars-cov- rbd suitable for serological assays. at present, serological assays for detection of seroconversion upon sars-cov- infection are mostly based on two viral antigens: nucleoprotein (np) and s protein or fragments of the s protein, including the rbd. there are various reports on the specificity and sensitivity of assays based on these two antigens. in some cases, the sensitivity of clinically approved np-based assays was challenged by direct re-testing of np-negative serum samples by rbd-based assays [ ] . other studies question the specificity of np-based elisa tests, demonstrating a significant level of false-positive results for the full-length sars-cov- np [ ] . it may be proposed that testing of serum samples with both sars-cov- antigens will produce the most accurate results, as was done, for example, in the south-east england population study [ ] ; this conclusion was made in a microarray study of a limited number of patient serum samples [ ] . it is not yet clear which part of the s protein is the optimal antigen for serological assays; microarray analysis revealed that the s fragment generates more false-positive results than the s or rbd antigen variants [ ] in the case of igg detection, while the rbd protein generated much lower signals on covid- patient serum samples than the s or s +s antigens.
in another microarray study it was found that igg response toward the rbd domain in the convalescent plasma samples correlates well with the response toward full-length soluble s protein [ ] . in the conventional elisa test format, rbd demonstrated nearly % specificity and sensitivity on a limited number of sars-cov- patients and control serum samples [ ] . as of . . , at least various immunoassays for sars-cov- antibodies were authorized for in vitro diagnostic use in the eu [ ], many of them use rbd as the antigen. a simple elisa screening test with the -well microplate will consume around µg of the rbd antigen for test samples, so even one million tests will require mg of the purified rbd protein, making the antigen supply a critical step in the production of such tests. method of the generation of highly productive stably transfected cho cell line, secreting the rbd protein, may be important for ivd test manufacturers in securing the sources of rbd antigen with highly predictable properties. although the rbd fragment of the s protein from sars-cov- is not the most popular antigen variant in the current efforts of anti-sars-cov- vaccine development [ ] , it may be considered as the viable candidate for a simple subunit vaccine. it demonstrated the significant protective immune response development in rodents, without signs of ade effect [ ] and some rbd-based protein subunit vaccine have advanced to phase ii clinical trials. cultured cho cells are the reliable source of rbd protein for this kind of vaccines; at the productivity level achieved in our study, only m of cell culture supernatant will provide enough antigen material for mln of typical µg/vial vaccine doses. С, d -protein sequence coverage by tryptic peptides, maldi-tof analysis. glycosylated peptides found are not pictured, signal peptides are yellow, detected tryptic peptides -violet, experimentally obtained masses, [m+h]+, are stated in the boxes. 
e: immunoreactivity of rbdv and rbdv by elisa with pooled serum samples from pcr-positive patients, (+)pooled; a single serum sample from a pcr-positive patient, (+); and pre-covid- pooled sera, (-). all serum samples were analyzed in duplicate; data are means.
supporting figure s . cell growth and viability dynamics of initial selection and mtx-driven target gene amplification.
supporting figure s . cell growth curve for the extended batch cultivation of rbdv and rbdv producing cell populations, um mtx selection pressure.
supporting figure s . size exclusion chromatography trace of molecular mass calibrators and molecular mass calibration curve.
supporting figure s . maldi-tof spectra traces of intact proteins in glycosylated and deglycosylated forms.
supporting figure s . maldi-tof spectra traces of tryptic peptide mixtures from intact and deglycosylated rbdv .
supporting figure s . maldi-tof spectra traces of tryptic peptide mixtures from intact and deglycosylated rbdv .
supporting table s . peptides mass list of the rbdv intact protein, in-gel digestion, reduced protein.
supporting table s . peptides mass list of the rbdv intact protein, in-gel digestion, reduced protein.
supporting molecular and immunological diagnostic tests of covid- : current status and challenges.
iscience antigenic and immunogenic characterization of recombinant baculovirus-expressed severe acute respiratory syndrome coronavirus spike protein: implication for vaccine design a sars-cov- surrogate virus neutralization test based on antibody-mediated blockage of ace -spike protein-protein interaction the receptor binding domain of the viral spike protein is an immunodominant and highly specific target of antibodies in sars-cov- patients false-positive results in a recombinant severe acute respiratory syndromeassociated coronavirus (sars-cov) nucleocapsid-based western blot assay were rectified by the use of two subunits (s and s ) of spike for detection of antibody to sars-cov structure, function, and evolution of coronavirus spike proteins structural and functional properties of sars-cov- spike protein: potential antivirus drug development for covid- fenner and white's medical virology molecular interaction and inhibition of sars-cov- binding to the ace receptor variations in sars-cov- spike protein cell epitopes and glycosylation profiles during global transmission course of covid- . front immunol deducing the n-and o-glycosylation profile of the spike protein of novel coronavirus sars-cov- site-specific n-glycosylation characterization of recombinant sars-cov- spike proteins analysis of the sars-cov- spike protein glycan shield reveals implications for immune recognition engineering a stable cho cell line for the expression of a mers-coronavirus vaccine antigen. vaccine yeast-expressed recombinant protein of the receptor-binding domain in sars-cov spike protein with deglycosylated forms as a sars vaccine candidate recombinant sars-cov- spike proteins for sero-surveillance and epitope mapping. 
biorxiv structural and functional comparison of sars-cov- -spike receptor binding domain produced in pichia pastoris and mammalian cells rapid production of sars-cov- receptor binding domain (rbd) and spike specific monoclonal antibody cr in nicotiana benthamiana purification of recombinant sars-cov- spike, its receptor binding domain, and cr mab for serological assay sars-cov- seroconversion in humans: a detailed protocol for a serological assay, antigen production, and test setup improved elongation factor- alpha-based vectors for stable high-level expression of heterologous proteins in chinese hamster ovary cells stable high-level expression of factor viii in chinese hamster ovary cells in improved elongation factor- alpha-based system a highly productive cho cell line secreting human blood clotting factor ix high-level expression of biologically active human follicle stimulating hormone in the chinese hamster ovary cell line by a pair of tricistronic and monocistronic vectors a -mer cho-expressing receptor-binding domain of sars-cov s protein induces potent immune responses and protective immunity a serological assay to detect sars-cov- seroconversion in humans eukaryotic genome size databases highly sensitive and fast protein detection with coomassie brilliant blue in sodium dodecyl sulfate-polyacrylamide gel electrophoresis antibodies : a laboratory manual glycomod--a software tool for determining glycosylation compositions from mass spectrometric data identification of two neutralizing regions on the severe acute respiratory syndrome coronavirus spike glycoprotein produced from the mammalian expression system extracellular matrix proteins mediate hiv- gp interactions with alpha beta structure, function, and antigenicity of the sars-cov- spike glycoprotein testing for responses to the wrong sars-cov- antigen? 
whole nucleocapsid protein of severe acute respiratory syndrome coronavirus may cause false-positive results in serological assays estimates of the rate of infection and asymptomatic covid- disease in a population sample from se england analysis of sars-cov- antibodies in covid- convalescent blood using a ): p. . . database. foundation for innovative new diagnostics. sars-cov- diagnostic pipeline a systematic review of sars-cov- vaccine candidates the sars-cov- receptor-binding domain elicits a potent neutralizing response without antibody-dependent enhancement
we thank mr. arthur isaev (genetico, moscow, russia) and dr. alexander ivanov (institute of molecular biology russian academy of sciences, moscow, russia) for valuable comments and early access to the sars-cov- s protein sequence data, and dr. yuri lebedin, eugenia kostrikina and xema co., ltd., for providing anti-rbd mabs and conjugates. the measurements were carried out on the equipment of the shared-access equipment centre "industrial biotechnology" of the research center of biotechnology of the russian academy of sciences. dna sequencing was carried out in the inter-institutional center for collective use "genome" imb ras, organized with the support of the russian foundation of basic research. the authors would like to acknowledge all the doctors who diagnose and treat patients during the covid- pandemic.
primers for rbdv cloning, restriction sites are underlined
ad-cov-absf: aacctcgaggccgccaccatgttcatgccttctt
ad-rbd-myc hnher: gctagcctaatggtgatggtgatgatgaccggtatgcatattcagatcctcttctgagatgagtttttgttcgaagttcacgcatttgtt
primers for ptm construction, sticky ends of annealed pairs are underlined
ctagtgatggtgatggtgatggtgatggtgatgaccgcctgcagacagatcctcttcgctgatcagtttttgttcaccggta
primers for rbdv cloning, restriction sites are underlined
ad-sfr -nhef: gctagcgtgcagcccaccgaatcc
ad-sfr -xmar: cccgggtttgttcttcacgagattggt
sequencing primers
sq- ch -f: gccgctgcttcctgtgac
iresa rev: aggtttccgggccctcacattg
sq-mych-r: gatgaccgcctgcagac
key: cord- -s fp z q authors: chan, kui k.; tan, timothy j.c.; narayanan, krishna k.; procko, erik title: an engineered decoy receptor for sars-cov- broadly binds protein s sequence variants date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: s fp z q the spike s of sars-cov- recognizes ace on the host cell membrane to initiate entry. soluble decoy receptors, in which the ace ectodomain is engineered to block s with high affinity, potently neutralize infection and, due to close similarity with the natural receptor, hold out the promise of being broadly active against virus variants without opportunity for escape. here, we directly test this hypothesis. we find an engineered decoy receptor, sace .v . , tightly binds s of sars-associated viruses from humans and bats, despite the ace -binding surface being a region of high diversity. saturation mutagenesis of the receptor-binding domain followed by in vitro selection, with wild type ace and the engineered decoy competing for binding sites, failed to find s mutants that discriminate in favor of the wild type receptor. we conclude that resistance to engineered decoys will be rare and that decoys may be active against future outbreaks of sars-associated betacoronaviruses.
zoonotic coronaviruses have crossed over from animal reservoirs multiple times in the past two decades, and it is almost certain that wild animals will continue to be a source of devastating outbreaks. unlike ubiquitous human coronaviruses responsible for common respiratory illnesses, these zoonotic coronaviruses with pandemic potential cause serious and complex diseases, in part due to their tissue tropisms driven by receptor usage. severe acute respiratory syndrome coronaviruses (sars-cov- ) and (sars-cov- ) engage angiotensin-converting enzyme (ace ) for cell attachment and entry ( - ). ace is a protease responsible for regulating blood volume and pressure that is expressed on the surface of cells in the lung, heart and gastrointestinal tract, among other tissues ( , ) . the ongoing spread of sars-cov- and the disease it causes, covid- , has taken a crippling toll on global healthcare systems and economies, and effective treatments and vaccines are urgently needed. as sars-cov- becomes endemic in the human population, it has the potential to mutate and undergo genetic drift and recombination. to what extent this will occur as increasing numbers of people are infected and mount counter immune responses is unknown, but already a variant in the viral spike protein s (d g) has rapidly emerged from multiple independent events and affects s protein stability and dynamics ( , ). another s variant (d y) became prevalent in portugal, possibly due to a founder effect ( ). coronaviruses have moderate to high mutation rates. for example, − substitutions per year per site occur in hcov-nl ( ), an alphacoronavirus that also binds ace , albeit via a smaller interface that is only partially shared with the rbds of sars-associated betacoronaviruses ( ). additionally, large changes in coronavirus genomes have frequently occurred in nature from recombination events, especially in bats where co-infection levels can be high ( , ) .
recombination of mers-covs has also been documented in camels ( ). this will all have profound implications for the current pandemic's trajectory, the potential for future coronavirus pandemics, and whether drug resistance in sars-cov- becomes prevalent. the viral spike is a vulnerable target for neutralizing monoclonal antibodies that are progressing through clinical trials, yet in tissue culture escape mutations in the spike rapidly emerge to all antibodies tested ( ). deep mutagenesis of the isolated receptor-binding domain (rbd) by yeast surface display has easily identified mutations in s that retain high expression and ace affinity, yet are no longer bound by monoclonal antibodies and confer resistance ( ) . this has motivated the development of cocktails of non-competing monoclonals ( , ) , inspired by lessons learned from the treatment of hiv- and ebola, to limit the possibilities for the virus to escape. notably, drug maker eli lilly has a monoclonal monotherapy (ly-cov ) in advanced trials (nct ) where the emergence of resistant virus variants has occurred; the trial has been updated to include an arm with a second monoclonal (ly- cov ). however, even the use of monoclonal cocktails does not address future coronavirus spill overs from wild animals that may be antigenically distinct. indeed, large screening efforts were required to find antibodies from recovered sars-cov- patients that cross-react with sars-cov- ( ), indicating antibodies have confined capacity for interacting with variable epitopes on the spike surface, and are unlikely to be broad and pan-specific for all sars-related viruses. an alternative protein-based antiviral to monoclonal antibodies is to use soluble ace (sace ) as a decoy to compete for receptor-binding sites on the viral spike ( , ( ) ( ) ( ) ( ) of diverse sars-associated betacoronaviruses that use ace for entry. 
we further fail to find mutations within the rbd, which directly contacts ace and is where possible escape mutations will most likely reside, that redirect specificity towards the wild type receptor. we conclude that resistance to an engineered decoy receptor will be rare, and sace .v . targets common attributes for affinity to s in sars-associated viruses. the affinities of the decoy receptor sace .v . were determined for purified rbds from the s proteins of five coronaviruses from rhinolophus bat species (isolates lyra , rs , rs , rs and rsshc ) and two human coronaviruses, sars-cov- and sars-cov- . these viruses fall within a common clade of betacoronaviruses that use ace as an entry receptor ( ). they share close sequence identity within the rbd core while variation is highest within the functional ace binding site (figures and s ) , possibly due to a co-evolutionary 'arms race' with polymorphic ace sequences in ecologically diverse bat species ( ). affinity was measured by biolayer interferometry (bli), with sace (a.a. s -g ) fused at the c-terminus with the fc moiety of human igg immobilized to the sensor surface and monomeric his-tagged rbd ( figure s ) used as the soluble analyte. this arrangement excludes avidity effects, which otherwise cause artificially tight (picomolar) apparent affinities whenever dimeric sace in solution is bound to immobilized rbd decorating an interaction surface. wild type sace bound all the rbds with affinities ranging from nm for sars-cov- to nm for lyra , with median affinity nm ( sars-cov- to . nm for isolate rs , with median affinity less than nm ( table ). the approximate -fold affinity increase of the engineered decoy applies universally to coronaviruses in the test panel and the molecular basis for affinity enhancement must therefore be grounded in common attributes of rbd/ace recognition. the rbd of sars-cov- (pdb m ) is colored by diversity between sars-associated cov strains (blue, conserved; red, variable). 
a deep mutational scan of the rbd in the context of full-length s reveals residues in the ace binding site are mutationally tolerant
to explore potential sequence diversity in s of sars-cov- that may act as a 'reservoir' for drug resistance, the mutational tolerance of the rbd was evaluated by deep mutagenesis ( ). saturation mutagenesis was focused on the rbd (a.a. c -l ) of full-length s tagged at the extracellular n-terminus with a c-myc epitope for detection of surface expression. the spike library, encompassing , single amino acid substitutions, was transfected in human expi f cells under conditions where cells typically acquire no more than a single sequence variant ( , ). the culture was incubated with wild type, his-tagged, dimeric sace at a sub-saturating concentration ( . nm). bound sace - h and surface-expressed s were stained with fluorescent antibodies for flow cytometry analysis ( figure a ). compared to cells expressing wild type s, the library was poorly expressed, indicating many mutations are deleterious for folding and expression. a cell population expressing s variants that bind ace with decreased affinity was clearly discernible ( figure b ). after gating for c-myc-positive cells expressing s, cells with high and low levels of bound sace were collected by fluorescence-activated cell sorting (facs), called the ace -high and ace -low populations, respectively ( figure c ). both the expression and sace binding signals decreased over minutes to hours during sorting, possibly due to shedding of the s subunit. cells were therefore collected and pooled from three separate facs experiments for a combined hours sort time. averaging the log enrichment ratios for each of the possible amino acids at a residue position. by adding conservation scores for both the ace -high and ace -low sorts we derive a score for surface expression, which shows that the hydrophobic rbd core is tightly conserved for folding and trafficking of the viral spike ( figure a ).
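the per-position scoring described above can be sketched in a few lines. this is a minimal illustration, not the authors' pipeline: it assumes log enrichment ratios are already available as nested dictionaries keyed by residue position and amino acid, and the function names `conservation_scores` and `expression_scores` are hypothetical.

```python
from statistics import mean


def conservation_scores(log_ratios):
    """Average the log enrichment ratios over all substitutions at each position.

    log_ratios: {position: {amino_acid: log enrichment ratio}} (assumed layout).
    Strongly negative means indicate positions where most substitutions are depleted.
    """
    return {pos: mean(by_aa.values()) for pos, by_aa in log_ratios.items() if by_aa}


def expression_scores(high_sort, low_sort):
    """Add the conservation scores of the two sorted populations per position.

    A variant must reach the cell surface to appear in either gate, so the sum
    is used here as a proxy for surface expression, as the text describes.
    """
    high = conservation_scores(high_sort)
    low = conservation_scores(low_sort)
    # Only positions observed in both sorts can be scored.
    return {pos: high[pos] + low[pos] for pos in high.keys() & low.keys()}
```

with toy inputs, a buried position whose substitutions are depleted in both gates receives a strongly negative expression score, while a surface position stays near zero.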
by comparison, residues on the exposed rbd surface are mutationally permissive for s surface expression. this matches the mutational tolerance of proteins generally. for tight ace binding (i.e. s variants in the ace -high population), conservation increases for rbd residues at the ace interface, yet mutational tolerance remains high ( figure c ). the sequence diversity observed among natural betacoronaviruses, which display high diversity at the ace binding site, is therefore replicated in the deep mutational scan, which predicts the sars-cov- spike tolerates substantial genetic diversity at the receptor-binding site for function. from this accessible sequence diversity sars-cov- might feasibly mutate to acquire resistance to monoclonal antibodies or engineered decoy receptors targeting the ace -binding site. binding site (e.g. v , y and c ) is free to mutate for yeast surface display, but its sequence is constrained in our experiments; this region of the rbd is buried by connecting structural elements to the global fold of an s subunit in the closed-down conformation (this is the dominant conformation for s subunits and is inaccessible to receptor binding) ( , , , ). we used targeted mutagenesis to individually test alanine substitutions to all the cysteines in the rbd ( figure s ). we found all cysteine- to-alanine mutations severely diminish s surface expression in expi f cells, including c a and c a on the rbd 'backside' that were neutral in the yeast display scan ( ). these differences demonstrate that there are tighter sequence constraints on the rbd in the context of a full spike expressed at a human cell membrane, yet overall we consider the two data sets to closely agree. for binding to dimeric sace , we note that interface residues were more tightly conserved in the starr et al data set (figure d ), possibly a consequence of three differences between the deep mutagenesis experiments. 
first, our selections for ace binding of s variants at the plasma membrane appear to primarily reflect mutational effects on surface expression, which is almost certainly more stringent in human cells. yeast permit many poorly folded proteins to leak to the cell surface ( ). second, the yeast selections were conducted at multiple sace concentrations from which apparent k d changes were computed ( ); the starr et al data in this regard are very comprehensive. due to the long sort times required for our human cell libraries where only a small fraction of cells express spike, we sorted at a single sace concentration that cannot accurately capture a range of different binding affinities quantitatively. third, dimeric sace may geometrically complement trimeric s densely packed on a human cell membrane, such that avidity masks the effects of affinity-reducing mutations. nonetheless, there is overall agreement that ace binding often persists following mutations to the rbd surface, and our data simply suggest mutational tolerance may be even greater than that already observed by starr et al.
having shown that the ace -binding site of sars-cov- protein s tolerates many mutations, we asked whether mutations might therefore be found that confer resistance to the engineered decoy sace .v . . resistance mutations are anticipated to lose affinity for sace .v . while maintaining binding to the wild type receptor, and are most likely to reside in the rbd where physical contacts are made. similar reasoning formed the foundation of a deep mutagenesis-based selection of the isolated rbd by yeast surface display to find escape mutations to monoclonal antibodies, and the results were predictive of escape mutations in pseudovirus growth selections ( ). to address whether escape mutations from the engineered decoy might be found in the rbd, we repurposed the s protein library for a specificity selection.
cells expressing the library, encoding all possible substitutions in the rbd, were co-incubated with wild type sace fused to the fc region of igg and his-tagged sace .v . at concentrations where both proteins bind competitively ( ). it was immediately apparent from flow cytometry of the expi f culture expressing the s library that there were cells expressing s variants shifted towards preferential binding to sace .v . , but no significant population with preferential binding to the wild type receptor (figures a and b ). cells expressing s variants that might preferentially bind sace (wt)-igg or sace .v . were gated and collected by facs (figure c ), followed by deep sequencing of s transcripts to determine enrichment ratios. there was close agreement between two independent replicate experiments ( figures d- g ). most rbd mutations were depleted following sorting, consistent with deleterious effects on s folding and expression. soluble ace .v . has three mutations from wild type ace : t y buried within the rbd interface, and l t and n y at the interface periphery ( figure a) . a substantial number of mutations in the rbd of s were selectively enriched for preferential binding to sace .v . ( figure b , upper-left quadrant). while sace .v . -specificity mutations could be found immediately adjacent to the sites of engineered mutations in ace (in particular mutations to s-f adjacent to ace -l and s-t adjacent to ace -n ), major hot spots for sace .v . -specificity mutations were also mapped to rbd loop - , contacting the region where the ace -α helix packs against a β-hairpin motif ( figure a ). by comparison, there were no hot spots in the rbd for sace (wt)-specificity mutations. indeed, only a small number of mutations were selectively enriched for preferential binding to wild type receptor ( figure b ), and the abundance of these putative wild type-specific mutations barely rose above the expected level of noise in the deep mutagenesis data. 
in this competition assay, s binding to wild type sace is therefore more sensitive to rbd mutations than s binding to engineered sace .v . . to determine whether the potential wild type ace -specific mutations found by deep mutagenesis are real as opposed to false predictions due to data noise, we tested mutants of s selectively enriched in the wild type-specific gate by targeted mutagenesis (blue data points in figure b ). only minor shifts towards binding wild type sace were observed ( figure s ). two s mutants were investigated further in sace titration experiments, n w and n y, which both retained high receptor binding and displayed small shifts towards wild type sace in the competition experiment. n of s is located in the - loop and its substitution to large aromatic side chains might alter the loop conformation to cause steric strain with nearby ace mutation n y in sace .v . . after titrating the concentrations of his-tagged sace (wt) and sace .v . and measuring bound protein to s-expressing cells by flow cytometry, it was found s-n w and s-n y do show enhanced specificity for wild type sace , but the effect is small and sace .v . remains the stronger binder ( figure c ); these mutations therefore will not confer resistance in the virus to the engineered decoy. by comparison, multiple independent escape mutations are readily found in s of sars-cov- that diminish the efficacy of monoclonal antibodies by many orders of magnitude ( , ) . finally, representative mutations to s predicted from the deep mutational scan to increase specificity towards sace .v . (purple data points in figure b ) were cloned and were found to have large shifts towards preferential sace .v . binding in the competition assay ( figure s ). these s mutations were y k/q/s, l g/r/y and g k. none of the mutated sites is in direct contact with an engineered residue on sace .v . 
and the molecular bases for specificity changes are therefore ambiguous, but we speculate may involve local conformational perturbations. validation by targeted mutagenesis therefore confirms that the selection can successfully find mutations in s with altered specificity. the inability to find mutations in the rbd that impart high specificity for the wild type receptor means such mutations are rare or may not even exist, at least within the receptor-binding domain where direct physical contacts with receptors occur. we cannot exclude mutations elsewhere having long-range conformational effects. engineered, soluble decoy receptors therefore live up to their promise as broad therapeutic candidates against which a virus cannot easily escape. the allure of soluble decoy receptors is that the virus cannot easily mutate to escape neutralization. mutations that reduce affinity of the soluble decoy will likely also decrease affinity for the wild type receptor on host cells, thereby coming at the cost of diminished infectivity and virulence. however, this hypothesis has not been rigorously tested, and since engineered decoy receptors differ from their wild type counterparts, even if by just a small number of mutations, it is possible a virus may evolve to discriminate between the two. here, we show that an engineered decoy receptor for sars-cov- broadly binds with low nanomolar k d to the spikes of sars-associated betacoronaviruses that use ace for entry, despite high sequence diversity within the ace -binding site. mutations in s of sars-cov- that confer high specificity for wild type ace were not found in a comprehensive screen of all substitutions within the rbd. the engineered decoy receptor is therefore broad against zoonotic ace - utilizing coronaviruses that may spill over from animal reservoirs in the future and against variants of sars-cov- that may arise as the current covid- pandemic rages on. 
we argue it is unlikely that decoy receptors will need to be combined in cocktail formulations, as is required for monoclonal antibodies or designed miniprotein binders to prevent the rapid emergence of resistance ( , ), which facilitates manufacture and distribution. our findings give insight into how a potential therapeutic can achieve breadth with a low chance of virus resistance for a family of highly infectious and deadly viruses. physiology to exert unacceptable toxicity. for example, the entry receptor for human cytomegalovirus is a growth factor receptor, and growth factor interactions had to be knocked out to make a virus-specific decoy suitable for in vivo administration ( ). however, ace in this regard is different, and its endogenous activity (the catalytic conversion of vasoconstrictive and inflammatory peptides of the renin-angiotensin system) may be of direct benefit for addressing covid- symptoms. during infection, ace activity is dysregulated and the renin-angiotensin system becomes imbalanced, possibly driving aspects of acute respiratory distress syndrome (ards) ( - ). administration of recombinant sace converts angiotensin (ang) i and ii to the protective peptides ang-( - ) and ang-( - ), respectively, with potential benefits for the pulmonary and cardiovascular systems that include decreased lung elastance, increased blood oxygenation, reduced hypertension and diminished inflammation ( , , ( ) ( ) ( ) ( ) provide no more than a single coding variant per cell ( , ). expi f cells at × / ml were transfected with a mixture of ng coding plasmid (i.e. library dna) with . µg pcep -Δcmv carrier plasmid (described in ( ) fitc fluorescence for bound sace (wt)- h were collected ( figure c ). collection tubes were coated overnight with fetal bovine serum prior to sorting and contained expi expression medium. collected cell pellets were frozen at - °c and were pooled across separate sort experiments prior to extraction of total rna.
the competition selection was performed similarly, with the exception that cells expressing the s library were incubated for minutes in a mixture of nm sace .v . - h and nm sace (wt)-igg . after washing twice, bound proteins were stained for minutes with anti-human igg-apc (clone hp , / dilution; biolegend) and anti-his-fitc (chicken polyclonal, / dilution; immunology consultants laboratory). cells were washed twice and sorted. after gating for the main population of viable cells as described above, the % of cells with the highest fitc-relative-to-apc and highest apc-relative-to-fitc signals were collected ( figure c ). total rna was extracted from the collected cells using a genejet rna purification kit (thermo scientific). first strand cdna was synthesized with accuscript (agilent) primed with a gene-specific oligonucleotide. the region of s scanned by saturation mutagenesis was pcr amplified as overlapping fragments that together span the full rbd sequence. following a second round of pcr, primers added adapters for annealing to the illumina flow cell and sequencing primers, together with barcodes for experiment identification. the pcr products were sequenced on an illumina novaseq using a × nt paired end protocol. data were analyzed using enrich ( ), where the frequencies of s variants in the transcripts of the sorted populations were compared to their frequencies in the naive plasmid library. log enrichment ratios for all the individual mutations were calculated and normalized by subtracting the log enrichment ratio for the wild type sequence across the same pcr-amplified fragment. a pneumonia outbreak associated with a new coronavirus of probable bat origin. 
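the enrichment calculation described above can be sketched as follows. this is a minimal illustration under stated assumptions and is not the enrich software itself: the logarithm base is taken to be 2 (the base is not legible in the text), read counts are assumed to be plain dictionaries per pcr fragment, and the function name, the `pseudocount` parameter and the variant labels are hypothetical.

```python
import math


def log2_enrichment(sorted_counts, naive_counts, wild_type="WT", pseudocount=0.5):
    """Wild-type-normalized log2 enrichment ratio for each variant.

    sorted_counts / naive_counts: {variant: read count} for one PCR fragment,
    from the sorted population and the naive plasmid library respectively.
    The pseudocount (an assumption, not from the source) avoids log(0) for
    variants absent from the sorted population.
    """
    sorted_total = sum(sorted_counts.values())
    naive_total = sum(naive_counts.values())

    def raw(variant):
        # Frequency in the sorted cells relative to frequency in the naive library.
        f_sorted = (sorted_counts.get(variant, 0) + pseudocount) / sorted_total
        f_naive = (naive_counts.get(variant, 0) + pseudocount) / naive_total
        return math.log2(f_sorted / f_naive)

    wt = raw(wild_type)
    # Subtracting the wild-type ratio puts every fragment on a common scale.
    # Variants missing from the naive library are skipped.
    return {v: raw(v) - wt for v in naive_counts if v != wild_type}
```

after normalization, a score of zero means a variant behaved like wild type in the sort, positive values indicate enrichment, and negative values indicate depletion.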
cytometer (bd biosciences) and data were processed with fcs express (de novo software). quantification of myc-s surface expression is detailed in figure s . part supported by nih award r ai to e.p. the university of illinois has filed a provisional patent for engineered decoy receptors and e.p. and k.k.c. are co-founders of orthogonal biologics, inc.
key: cord- -k uk b authors: bouwman, kim m.; tomris, ilhan; turner, hannah l.; van der woude, roosmarijn; bosman, gerlof p.; rockx, barry; herfst, sander; haagmans, bart l.; ward, andrew b.; boons, geert-jan; de vries, robert p. title: multimerization- and glycosylation-dependent receptor binding of sars-cov- spike proteins date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: k uk b receptor binding studies using recombinant sars-cov proteins have been hampered due to challenges in approaches creating spike protein or domains thereof, that recapitulate receptor binding properties of native viruses. we hypothesized that trimeric rbd proteins would be suitable candidates to study receptor binding properties of sars-cov- and - . here we created monomeric and trimeric fluorescent rbd proteins, derived from adherent hek t, as well as in gnti mutant cells, to analyze the effect of complex vs high mannose glycosylation on receptor binding. the results demonstrate that trimeric fully glycosylated proteins are superior in receptor binding compared to monomeric and immaturely glycosylated variants. although differences in binding to commonly used cell lines were minimal between the different rbd preparations, substantial differences were observed when respiratory tissues of experimental animals were stained. the rbd trimers demonstrated distinct ace expression profiles in bronchiolar ducts and confirmed the higher binding affinity of sars-cov- over sars-cov- . our results show that fully glycosylated trimeric rbd proteins are attractive to analyze receptor binding and explore ace expression profiles in tissues. afford additional means for fluorescent-based experiments [ ] , and thus are attractive to be fused to rbd proteins. the resulting proteins were analyzed for binding to cell culture cells and paraffin-embedded tissues of various hosts including susceptible and non-susceptible animals. 
the results demonstrate that fully glycosylated trimeric sars-cov- rbd proteins reveal the differences in ace expression between cell cultures and tissue sections. these trimeric rbd proteins bind ace efficiently in a species-dependent manner and can be used to profile ace tissue expression. finally, we without the gcn trimerization domain, fused to either sfgfp or morange (fig a). the monomeric and trimeric rbds were efficiently expressed in both hek t and gnti cells (data not shown), with an increased yield up to -to -fold when fused to a c-terminal sfgfp (fig b). expression yields of the morange fusions were comparable to those of the sfgfp fusions (data not shown). to illustrate the expression yields of sars spike proteins or domains thereof, we measured the fluorescence in the cell culture supernatant (fig c). the wild-type full-length ectodomains were difficult to express even with the addition of an sfgfp or morange fusion (fig b). to increase yields for the full-length ectodomain, we introduced the p and additional hexapro mutations [ ], and analyzed the fluorescence in cell culture supernatants five days post-transfection after incubation at or °c. although we did not observe a large increase in yields, we were able to purify sufficient protein to compare full-length ectodomain trimers with monomeric and trimeric rbd and ntd proteins.

spike rbd domains in frame with a c-terminal gcn and fluorescent reporter protein display multimeric features on gel and maintain antigenicity

after purification, all rbd proteins were analyzed on gel under non-reducing and reducing conditions (fig a). without reducing agent, monomeric rbd proteins revealed dimeric fractions which could be reduced to a single monomeric form. the ntd trimers were reduced under non-reducing conditions, thus solely by sds. the trimeric rbd variants, on the other hand, revealed dimers and trimers that could be reduced.
in addition, the ntd of prototypical γ-coronavirus ibv-m and influenza a virus ha pr were included as control proteins. finally, we determined the extent of n-glycosylation maturation on purified proteins expressed in either gnti or t by subjecting the monomeric and trimeric proteins to pngasef and endoh treatment (fig s ). next, we examined the antigenicity of the sars-cov- and - proteins using serum collected from macaques days post-infection with sars-cov- [ ]. both sars-cov- rbd monomers and trimers derived from t cells were efficiently recognized, indicating proper folding (fig b). as expected, sars-cov- rbd proteins were poorly recognized, and the negative controls m ntd and pr ha displayed baseline binding identical to pre-infection serum. the ntd trimers were likewise not recognized by the serum (not shown), indicating that the majority of antibodies in naïve animals after infection are directed against the sars-cov rbd [ ]. similar results were obtained using gnti-derived proteins, with the rbd trimer being less efficiently recognized by the macaque serum than its monomeric counterpart. this is in line with recent observations that insect cell-derived proteins are less well bound by serum. to determine whether the fluorescent rbd trimers are indeed structured in a trimeric manner, we subjected these proteins to negative stain single-particle em. the em data revealed that the rbd proteins form stable trimers that resemble known spike structures (fig. c). initially, , individual particles were picked, placed into a stack, and submitted to reference-free two-dimensional ( d) classification. from the initial d classes, particles that did not resemble rbd were removed, resulting in final particle stacks of , particles, which were then subjected to relion d classification. all resultant classes demonstrated evident and distinct trimeric rbd, gcn , and three sfgfp protein structures that could be identified in the em images.
from the em images, we generated a model in which we took the crystal structures of sfgfp, the gcn trimerization domain (pdb: o h), and the sars-cov- rbd (pdb: xm ) to demonstrate the likely structure of our rbd trimer (fig d) . to determine the biological activity of our rbd proteins we stained a and vero cells that are reported to support sars-cov replication, with the latter being more susceptible [ ] . however, a cells were bound by all our rbd proteins with a slight increase in intensity from monomeric gnti derived rbd proteins to trimeric t derived rbds (fig ) . trimeric t rbd binding was efficiently blocked using μm recombinant ace whereas nm ace pre- incubation was not sufficient to prevent binding of fully glycosylated trimeric rbd proteins to cells completely. sars-cov- rbd proteins bound slightly more intensely to a cells compared to the same sars-cov- rbd proteins. a similar pattern was observed for vero-e cells, however, the fully glycosylated sars-cov- rbd trimer bound markedly stronger compared to the other rbd preparations ( fig s a) . importantly, the full-length ectodomain also bound efficiently to a cells ( fig s b) . we did not observe any binding of the trimeric ntd domains to a cells ( fig s b) . mdck cells, derived from canine kidney, served as negative controls, to which we indeed did not observe any binding with any of the indicated proteins ( fig s c) . applied. in all cases, sars-cov- displayed a higher avidity compared to sars-cov- . a similar trend of binding intensities was observed for monomeric, trimeric, and different n-glycosylated sars-cov-rbd proteins fused to morange ( fig s a) . again specific binding was seen to the epithelium of terminal bronchioles and, to a much lower extent, to alveoli and endothelium. 
the results were confirmed using a horseradish peroxidase readout with a hematoxylin counterstain (fig s b). the output of this readout is enzyme driven and purely qualitative; nevertheless, we observed similar differences in staining intensities. here, very minimal staining using the sars-cov ntd domains was observed (fig s b), which we did not detect using a fluorescent readout (fig s c). to determine if the binding was ace dependent, we pre-incubated trimeric rbd proteins with recombinant ace . while μm was sufficient to block binding to cell culture cells (fig c), μm was needed to prevent all detectable binding to ferret lung tissue. to confirm our observations of different binding on tissues, we quantified the intensities of the ace antibody and sars-cov- and - rbd proteins, except for the monomeric gnti derived proteins, as these were almost at background (fig d). as expected, a noteworthy trend of increasing binding strength was observed from sars-cov gnti derived monomers to sars-cov- fully glycosylated rbd trimers. interestingly, multimerization appears to be more important than glycosylation status for strong ace interaction on tissue. viroscience, erasmus university, the netherlands, respectively. tissue sections were rehydrated in a series of alcohol from %, % to %, and lastly in distilled water. tissue slides were boiled in citrate buffer ph . for min at kw in a microwave for antigen retrieval and washed in pbs-t three times. endogenous peroxidase activity was blocked with % hydrogen peroxide for min. tissues were subsequently incubated with % bsa in pbs-t overnight at °c.
the next day, the purified viral spike proteins ( μg/ml) were human monoclonal antibodies block the binding of sars-cov- spike protein to angiotensin converting enzyme receptor structural and functional basis of sars-cov- entry by using human ace functional assessment of cell entry and receptor usage for sars-cov- and other lineage b betacoronaviruses structure, function, and antigenicity of the sars-cov- spike glycoprotein epub / / a serological assay to detect sars-cov- seroconversion in humans a highly conserved cryptic epitope in the receptor-binding domains of sars-cov- and sars-cov potent neutralizing antibodies from covid- patients define multiple targets of vulnerability conformational dynamics of sars-cov- trimeric spike glycoprotein in complex with receptor ace revealed by cryo-em closing coronavirus spike glycoproteins by structure-guided design structures, conformations and distributions of sars-cov- spike protein trimers on intact virions stabilizing the closed sars-cov- spike trimer structure-based design of prefusion-stabilized sars- fluorescent trimeric hemagglutinins reveal multivalent receptor binding properties structure-based design of prefusion cov- spikes. science. . epub / / comparative pathogenesis of covid- , mers, and sars in a nonhuman primate model structural basis of a shared antibody response to sars-cov- severe acute respiratory syndrome coronavirus from patient with infection and rapid transmission of sars-cov- in ferrets okba nma, et al. 
sars-cov- is transmitted via contact and via the air between ferrets sars-cov- infection in farmed minks, the netherlands syrian hamsters as a small animal model for sars-cov- infection and countermeasure development severe acute respiratory syndrome coronavirus infection of golden syrian pathogenesis and transmission of sars-cov- in golden hamsters deducing the n-and o-glycosylation profile of the spike protein of novel coronavirus sars-cov- site-specific glycan analysis of the sars-cov- spike epub / / site- specific n-glycosylation characterization of recombinant sars-cov- spike glycans on the sars-cov- spike control the receptor binding domain conformation sars-cov- receptor ace and tmprss are primarily expressed in bronchial transient secretory cells the protein expression profile of ace in human tissues the sars-cov- spike protein has a broad tropism for mammalian ace proteins cell entry of sars-cov- conferred by angiotensin-converting enzyme (ace ) of different species the protein expression profile of ace in human tissues tissue distribution of ace protein, the functional receptor for sars coronavirus. a first step in understanding sars pathogenesis de haan ca. the influenza a virus hemagglutinin glycosylation state affects receptor- binding specificity improving the photostability of bright monomeric orange and red fluorescent proteins three amino acid changes in avian coronavirus spike protein leginon: a system for fully automated acquisition of electron micrographs a day appion: an integrated, database-driven pipeline to facilitate em image processing dog picker and tiltpicker: software tools to facilitate particle selection in single particle electron microscopy μg of protein was subjected without or with pngase f or endoh for hr and subjected to sds-page and western blot analyzes supplemental figure . 
binding of rbd proteins to cell lines (a) protein binding of rbd proteins observed on vero e cells proteins were applied μg/ml and where indicated pre-incubated with spike proteins were detected using anti-strep and goat-anti-mouse antibodies binding of full-length sars-cov- ectodomain, ibv-m , antibodies only, and ntd spike proteins to a cells. proteins were applied μg/ml and detected using anti-strep and goat-anti-mouse antibodies c) non-binding of rbd trimers to mdck cells binding of rbd proteins to tissues (a) binding of rbd proteins fused to morange to ferret lung tissues proteins were applied μg/ml and detected using anti-strep and goat-anti- mouse antibodies binding of rbd proteins fused to sfgfp proteins to ferret lung tissues, using hrp as a readout identical experiment to (a) but using an hrp readout using anti-strep and goat- anti-mouse antibodies control staining on ferret lung tissues using hrp as readout proteins were applied μg/ml and detected using anti-strep and goat-anti-mouse antibodies scalebar is μm lack of ntd binding to ferret lung tissue using fluorescence proteins were applied μg/ml and detected using anti-strep and goat-anti- mouse antibodies control stainings to syrian hamster tissues, antibodies only and m proteins were applied μg/ml and where indicated pre-incubated with recombinant ace protein key: cord- -z rwznmv authors: li, qianqian; wu, jiajing; nie, jianhui; zhang, li; hao, huan; liu, shuo; zhao, chenyan; zhang, qi; liu, huan; nie, lingling; qin, haiyang; wang, meng; lu, qiong; li, xiaoyu; sun, qiyu; liu, junkai; zhang, linqi; li, xuguang; huang, weijin; wang, youchun title: the impact of mutations in sars-cov- spike on viral infectivity and antigenicity date: - - journal: cell doi: . /j.cell. . . sha: doc_id: cord_uid: z rwznmv summary the spike protein of sars-cov- has been undergoing mutations and is highly glycosylated. it is critically important to investigate the biological significance of these mutations. 
here we investigated variants and glycosylation site modifications for the infectivity and reactivity to a panel of neutralizing antibodies and sera from convalescent patients. d g, along with several variants containing both d g and another amino acid change, were significantly more infectious. most variants with amino acid change at receptor binding domain were less infectious but variants including a v, l r, v a and f l became resistant to some neutralizing antibodies. moreover, the majority of glycosylation deletions were less infectious whilst deletion of both n and n glycosylation drastically reduced infectivity, revealing the importance of glycosylation for viral infectivity. interestingly, n q was markedly resistant to neutralizing antibodies, whereas n q became more sensitive. these findings could be of value in the development of vaccine and therapeutic antibodies.
the covid- pandemic is a tremendous threat globally. as of july , , countries have reported covid- cases, with more than million confirmed cases and approximately , deaths (https://www.who.int/emergencies/diseases/novel-coronavirus- /situation-reports/). the causative agent of covid- , sars-cov- , causes a lower respiratory tract infection that can progress to severe acute respiratory syndrome and even multiple organ failure (lv et al., a; yang et al., ). sars-cov- is a single-stranded positive-strand rna virus whose genome encodes four structural proteins: spike (s), small protein (e), matrix (m) and nucleocapsid (n) (chan et al., ). the s protein is a type i fusion protein that forms trimers on the surface of the virion. it is composed of two subunits, with s responsible for receptor binding and s for membrane fusion.

single mutants were also constructed to compare with the double mutants with d g. group c comprises mutants at the putative glycosylation sites ( sites). this group includes both variants (n k, n h and t a) and investigational mutants that we made for the analyses of the effects of glycosylation. specifically, all sites (n to q) were made in the lab to generate individual mutants; we also made a combination by deleting the two glycosylation sites in rbd. in total, we have generated pseudotyped viruses, i.e., variants and glycosylation mutants (figure ). these viruses were prepared as described previously (nie et al., ) (see star methods). to determine the infectivity of these variants and mutants, we first infected cell lines with pseudotyped viruses with either sars-cov- s protein or vsvg protein (see star methods). as expected, the two types of pseudotyped viruses differed in infection efficiency across the cell lines (figure ).
while almost all cell lines were generally susceptible to infection by vsv g pseudotyped virus, sars-cov- pseudotyped virus could efficiently infect certain cell lines including three human cell lines ( t-hace , t and huh- ) and three non-human primate cell lines (vero, veroe and llc-mk ). as such, we selected these four out of the six cell lines in subsequent experiments, including t-hace , huh- , vero and llc-mk . we first tested the infectivity of pseudotyped viruses ( natural variants and glycosylation mutants) in t-hace cells, where a difference by -fold in rlu compared with the reference wuhan- strain (genbank: mn ) was deemed as being significant ( figure s ). of all pseudotyped viruses, were determined as low-infectivity ( natural mutants and glycosylation mutants), with rlu reading decreased by to folds ( figure a) . among them, were located in the rbd region. variant v i and investigational glycosylation mutant (n q +n q) were deemed as no-infectivity as demonstrated by over -fold decrease in rlu values compared with the reference strain. both of them were located in rbd. it is worth noting that double glycosylation deletions at n and n resulted in a drastic reduction in viral infectivity ( -fold), whereas single deletion at each site caused modest reduction in viral infectivity, with the infectivity of n q reduced by only -fold and n q by -fold. moreover, the non-natural double glycosylation mutations in rbd (n q and n q) resulted in significantly reduced infectivity, suggesting that the two glycosylation sites in the rbd region may participate in the binding of the receptor or maintain the conformation of the rbd region. the remaining variants were tested further with other three cell lines for infectivity suggesting that the enhanced infectivity was more likely ascribed to d g itself. 
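as a minimal sketch of the fold-change comparison described above, the python snippet below classifies a pseudotyped virus by the ratio of its rlu reading to that of the reference strain. the function name, the example readings, and the default cutoff of 4 are illustrative assumptions, not values taken from this study; the cutoff should be replaced with the one stated in the methods.

```python
def classify_infectivity(rlu_variant, rlu_reference, threshold=4.0):
    """Classify a variant by its RLU fold change relative to the reference.

    threshold is a hypothetical cutoff (assumption, not from the source):
    a ratio >= threshold counts as increased infectivity, a ratio
    <= 1/threshold as decreased infectivity, anything else as unchanged.
    """
    if rlu_variant <= 0 or rlu_reference <= 0:
        raise ValueError("RLU readings must be positive")
    fold = rlu_variant / rlu_reference
    if fold >= threshold:
        return fold, "increased"
    if fold <= 1.0 / threshold:
        return fold, "decreased"
    return fold, "unchanged"


if __name__ == "__main__":
    # Hypothetical readings for illustration only.
    print(classify_infectivity(1000.0, 100.0))
    print(classify_infectivity(100.0, 1000.0))
    print(classify_infectivity(150.0, 100.0))
```

using a symmetric cutoff on the ratio treats an n-fold gain and an n-fold loss equally, which matches the way the text reports both increases and decreases relative to the reference strain.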
having identified the variants with altered infectivity, we next set out to investigate the antigenicity of the infectious mutants using neutralizing monoclonal antibodies (mabs) (see star methods). some changes in the rbd region demonstrated altered sensitivity to neutralizing mabs (figure and figure s ). specifically, a v reduced the sensitivity to mabs , , cb , p c- f , b and ca , while f l reduced the sensitivity to mabs x , - , h and p b- f . moreover, v a became resistant to mabs x and p b- f , and l r to mabs x and p b- f . finally, y h reduced the sensitivity to mab h , n k to mab h s , a v to mab b , d g+i v to mab x and d g+a s to mab h by more than times. in addition, some changes in the rbd region, including v f, q e, q e, i f, i t, y h and a v, were observed to be more susceptible to neutralization mediated by mabs. we next determined how infectious glycosylation mutants reacted to the same panel of mabs. mutant n q actually became more sensitive to mab p b- f , whereas n q reduced the neutralization sensitivity to a different set of mabs including , , cb , p c- f , h s , b , ab and h . these results confirmed that these two glycosylation sites are important for receptor binding. these mabs have proven to be valuable in our analyses of the amino acid changes. as shown in figure , five mabs, i.e., , , cb , p c- f and b , were unable to effectively neutralize both a v and n q. neither x nor p b- f was effective in neutralizing l r, v a and f l, whilst p b- f was more effective in neutralizing n q. in addition, mab h was incapable of neutralizing n q, y h and d g+a s, while mabs h and - were found not to neutralize f l. furthermore, h s was unable to neutralize n k and n q. finally, we determined the sensitivity of the strains with amino acid changes to ten covid- convalescent sera (see star methods).
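the antigenicity comparisons in this section rest on the ratio of ec values between each variant or mutant and the reference strain. the python sketch below shows one way such ratios could be tabulated and flagged. the strain and antibody labels, the numeric values, and the cutoff of 4 are hypothetical placeholders, and whether a ratio above 1 means more or less sensitive depends on whether ec is expressed as a concentration (mabs) or as a dilution (sera), which this sketch deliberately does not decide.

```python
def ec_ratios(ec_by_strain, reference="reference"):
    """Return {strain: {antibody: ec_variant / ec_reference}}.

    ec_by_strain maps a strain label to a dict of antibody -> ec value.
    Antibodies missing for a strain are skipped rather than guessed.
    """
    ref = ec_by_strain[reference]
    ratios = {}
    for strain, values in ec_by_strain.items():
        if strain == reference:
            continue
        ratios[strain] = {
            ab: values[ab] / ref[ab] for ab in ref if ab in values
        }
    return ratios


def flag_changes(ratios, threshold=4.0):
    """List (strain, antibody, ratio) entries beyond a fold-change cutoff.

    threshold is an illustrative assumption; an entry is flagged when the
    ratio is >= threshold or <= 1/threshold.
    """
    flagged = []
    for strain in sorted(ratios):
        for ab in sorted(ratios[strain]):
            r = ratios[strain][ab]
            if r >= threshold or r <= 1.0 / threshold:
                flagged.append((strain, ab, r))
    return flagged


if __name__ == "__main__":
    # Hypothetical ec values for illustration only.
    data = {
        "reference": {"mab_1": 0.5, "mab_2": 0.25},
        "variant_x": {"mab_1": 4.0, "mab_2": 0.25},
    }
    print(flag_changes(ec_ratios(data)))
```

keeping the ratio calculation separate from the flagging step makes it straightforward to apply a different cutoff to mabs and to sera.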
none of the variants and mutants demonstrated significantly altered sensitivity to all convalescent sera, i.e., the ec values were not altered by more than -fold, irrespective of an increase or decrease, when compared with the reference strain ( figure a and figure s ). however, the neutralization sensitivity of both f l and h p to three of ten patient sera were found to have decreased by more than times, while six variants and mutants (n h, n q, n q, n d, n q and n q) became over -fold sensitive to one or two of the ten tested sera. notably, five out of the six were glycan deletion mutants. as shown in figure b , when the data of individual convalescent sera were pooled together to analyze the sensitivity of all variants, no marked difference was observed (> fold). however, modest differences between some variants and reference strain (within -fold) were observed in their reactivity to grouped convalescent sera. these differences were statistically significant (p< . ). it is worth mentioning that some variants including f l, v f, i f, i t and v l ( figure b ) were even more sensitive to the convalescent sera compared with reference strain, whereas more variants were found to be resistant to the convalescent sera. these variants include single amino acid change such as y del, q e, n k, g v, k n, i v, a v, t i, v i, f l and a v, as well as the double amino acid changes including d g + q l, d g +i v, d g +a v, d g +a s and d g +m i. similar to natural variants, although the magnitude of some glycosylation deletions in sensitivity to the sera is less than -fold, the differences between mutants and the reference strain (wuhan- ) were found to be still several-folds and statistically significant, i.e., glycosylation mutants n q and n q significantly increased the sensitivity to convalescent sera ( and ambiguous sequences, we narrowed down to variants. 
moreover, as glycosylation of viral protein is well documented to affect viral replication and immune response and sars-cov- s protein is heavily glycosylated, we also made substitutional mutations at all putative glycosylation sites. in total, we made pseudotyped viruses, allowing us to characterize them using the established method (nie et al., ) (see star methods). table summarizes the characteristics of variants and investigational mutants. of all variants, d g is of particular note. this variant has been shown to accumulate rapidly since its emergence and has been linked to more clinical presentations (korber et al., ). at the beginning of this study (may , ), it accounted for . % of all circulating strains, but by july , it had reached . %. this dominant strain could effectively infect the four cell lines tested, being -fold more infectious than the original wuhan- strain (figure ). another important finding is that natural variants capable of affecting the reactivity to neutralizing mabs were almost all located in the rbd region (except a v and d g+a v), as all antibodies used in this study were targeting the rbd ( with decreased sensitivity to neutralization by p b- f mab; both l r and f l remain sensitive to p c- f , suggesting that this mab is not derived from the same clone as p b- f . moreover, both mutants displayed decreased sensitivity to another neutralizing mab, x , by -fold compared with the reference strain (figure ). while we identified multiple variants with decreased sensitivity to neutralizing mabs, we need to look at how frequent these variants are in the field. v a in rbd is one of the two variants with a mutation frequency of over . %. it showed decreased reactivity to the two mabs (p b- f and x ) (figure a and b) (ju et al., ). another rbd variant, a v, sits in the binding epitope of rbd. it is significantly resistant to several neutralizing mabs including p c- f , ca , and cb .
it is noteworthy that cb mab targets the receptor binding epitope (figure c and d) (shi et al., ). specifically, y was buried in the epitope targeted by mab h (figure e and f) (lv et al., b). indeed, the y h variant was found to be resistant to this mab. it is worth mentioning that d g+i v has shown increased infectivity and more resistance to neutralizing antibodies (table ), but only one sequence (originating from canada) was reported in gisaid. moreover, some variants, including n k, l r, a v, v a, f l and y h, do have decreased sensitivity to neutralizing mabs. however, only v a exceeded . % in frequency at the beginning of the study, all of which were found in the us, with sequences reported as of may , , and up to july , . variants containing n k showed a significant increase in circulation, i.e., with case reported as of may , (all in uk) to by july , ( in uk, in romania). in addition, only one sequence from france containing y h had been deposited in gisaid as of may , while four sequences had been reported as of july , , of which two originated from the netherlands, one from sweden, and one from france. only one or two isolates were reported for other variants, which have not been observed to have increased during the time frame we have been monitoring. nevertheless, as rna viruses mutate all the time and some variants may only appear during certain periods of time, while others could emerge in an unpredictable fashion, continued analyses of the circulating strains in terms of the mutation frequency and temporal pattern are warranted. our results suggest that the mabs used in this study could be divided into seven groups, as they appear to differ in their inhibitory effects on the variants. as such, it would be interesting to formulate a therapeutic regimen comprised of at least two mabs. for example, a combination of p c- f and x should be effective in inhibiting all variants in this study.
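the reasoning behind a two-mab regimen can be made concrete with a small search: a combination covers the panel if no variant escapes every antibody in it. the python sketch below enumerates such combinations from a table of escape variants per antibody. the antibody and variant labels are hypothetical placeholders rather than the reagents used in this study, and a binary escape/no-escape call is an assumed simplification of the graded neutralization data.

```python
from itertools import combinations


def covering_combinations(escape, size=2):
    """Return antibody combinations for which no variant escapes every member.

    escape maps an antibody label to the set of variants that resist it.
    A combination qualifies when the intersection of those sets is empty,
    i.e. each variant is still neutralized by at least one antibody.
    """
    result = []
    for combo in combinations(sorted(escape), size):
        shared = set.intersection(*(set(escape[m]) for m in combo))
        if not shared:
            result.append(combo)
    return result


if __name__ == "__main__":
    # Hypothetical escape table for illustration only.
    escape = {
        "mab_a": {"variant_1", "variant_2"},
        "mab_b": {"variant_3"},
        "mab_c": {"variant_2", "variant_3"},
    }
    print(covering_combinations(escape))
```

an exhaustive search is adequate for a panel of this size; for much larger panels a greedy or integer-programming approach would be a more practical choice.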
it would be of interest to test more neutralizing antibodies that target epitopes outside the rbd. with regard to the glycosylation mutants analyzed in this study, n q increased the sensitivity to mab p b- f whilst n q displayed resistance to neutralizing mabs such as ca , cb , and others. although neither of them is found in circulation, the reactivity of these two mutants to neutralizing mabs is still worth noting. as n and n are located near the rbd region (watanabe et al., ), these mutants may affect some epitopes targeted by neutralizing mabs. specifically, the n glycosylation site is involved in the binding of mab to the rbd region of s protein (cao et al., ). it is likely that the sugar chain can mask the epitope targeted by the antibody. this type of glycan shield has been observed in other viruses such as hiv- . the use of sera from convalescent patients in neutralization assays largely confirmed the results obtained with the well characterized neutralizing mabs. it is understood that the magnitude of altered reactivity is slightly smaller with human sera than with mabs, given that polyclonal antibodies from convalescent patients are directed against multiple epitopes on the full-length s protein; as a result, these polyclonal antibodies could complement one another. however, the differences in their reactivity to the human antibodies were found to be several-fold in most cases and were all determined to be statistically significant. notably, some rbd variants such as a v and f l have been confirmed to have decreased sensitivity to both human sera and multiple neutralizing mabs. a v reduced the sensitivity to mabs out of the mabs used in this study, while f l reduced the sensitivity to neutralization by mabs.
it is possible that antibodies in convalescent sera are able to neutralize these critical epitopes targeted by these mabs that are known to disrupt the binding of the s protein to hace receptor (ju et serial dilutions of mab preparations were pre-incubated with the pseudotyped viruses at °c for one hour before they were added to huh- cells. luciferase activity was measured hours later to calculate ec of each antibody. the ratio of ec between the variant or mutant strains and the reference strain (wuhan- ) was calculated and analyzed to generate heatmap using hem i (deng et al., ) . the data were the results from - replicates. the red and blue boxes indicate the increase or decrease of the neutralization activity as shown in the scale bar. see also figure s . serial dilutions of mab preparations were pre-incubated with the virus at °c for one hour before they were added to huh- cells. luciferase activity was measured hours later to calculate ec of each antibody. the y-axis represents the ratio of ec between the variant/mutant strain and the reference strain (wuhan- ). the data were the results from - replicates. the horizontal dashed lines indicate the threshold of -fold difference. the significant changes were marked with colored symbols, blue for decreased, red for increased. related to further information and requests for resources and reagents should be directed to and will be fulfilled by the lead contact, dr. youchun wang (wangyc@nifdc.org.cn). all the unique reagents generated in this study are available from the lead contact with a completed materials transfer agreement. this study did not generate any unique datasets or code. primers. following site-directed mutagenesis pcr, the template chain was digested using dpni restriction endonuclease (neb, usa). afterwards, the pcr product was directly used to transform e. coli dh α competent cells; single clones were selected and then sequenced. 
the primers designed for the specific mutation sites are listed in table s , and the frequency of different variants in the epidemic population is listed in table s . highlights: over mutations were selected for analysis of their infectivity and antigenicity; the dominant d g, alone and combined with other mutations, is more infectious; ablation of both n and n glycosylation at the rbd drastically reduced infectivity; ten mutations such as n q, l r, a v, v a were markedly resistant to some mabs. eighty natural variants and twenty-six glycosylation spike mutants of sars-cov- were analyzed in terms of infectivity and antigenicity using a high-throughput pseudovirus assay in conjunction with neutralizing antibodies. (figure panels: mutation labels for the reference strain, the natural variants, the d g combination variants and the n q glycosylation mutants; plotted values not recoverable.) sars-cov- viral spike g mutation exhibits higher case fatality rate potent
neutralizing antibodies against sars-cov- identified by high-throughput single-cell sequencing of convalescent patients' b cells genomic characterization of the novel human-pathogenic coronavirus isolated from a patient with atypical pneumonia after visiting wuhan mutated covid- , may foretells mankind in a great risk in the future hemi: a toolkit for illustrating heatmaps ebola virus glycoprotein with increased infectivity dominated the - epidemic the hiv glycan shield as a target for broadly neutralizing antibodies the spike protein of sars-cov--a target for vaccine and therapeutic development why are rna virus mutation rates so damn high? the highly conserved glycan at asparagine of hiv- gp is indispensable for viral entry identification of immunodominant sites on the spike protein of severe acute respiratory syndrome (sars) coronavirus: implication for developing sars diagnostics and vaccines airborne transmission of influenza a/h n virus between ferrets n-linked glycans and k residue on hemagglutinin synergize to elicit broadly reactive h n influenza virus coronavirus spike protein and tropism changes human neutralizing antibodies elicited by sars-cov- infection crystal structure of a fully glycosylated hiv- gp core reveals a stabilizing role for the glycan at asn tracking changes in sars-cov- spike: evidence that d g increases infectivity of the covid- virus structural, glycosylation and antigenic variation between novel coronavirus ( -ncov) and sars coronavirus quasispecies theory and the behavior of rna viruses functional assessment of cell entry and receptor usage for sars-cov- and other lineage b betacoronaviruses structure, function, and evolution of coronavirus spike proteins removal of a single n-linked glycan in human immunodeficiency virus type gp results in an enhanced ability to induce neutralizing antibody responses coronavirus disease (covid- ): a scoping review structural basis for neutralization of sars-cov- and sars-cov by a potent therapeutic 
antibody establishment and validation of a pseudovirus neutralization assay for antigenic drift of influenza a(h n ) virus hemagglutinin influenza a(h n ) virus evolution: which genetic mutations are antigenically important? a virus that has gone viral: amino acid mutation in s protein of indian isolate of coronavirus covid- might impact receptor binding, and thus, infectivity emerging genetic diversity among clinical isolates of sars-cov- : lessons for today a human neutralizing antibody targets the receptor binding site of sars-cov- a single mutation in chikungunya virus affects vector specificity and epidemic potential human adaptation of ebola virus during the west african outbreak two n-linked glycosylation sites in the v and c regions of human immunodeficiency virus type crf _ae envelope glycoprotein gp regulate viral neutralization susceptibility to the human monoclonal antibody specific for the cd binding domain emergence of genomic diversity and recurrent mutations in sars-cov- emerging wuhan (covid- ) coronavirus: glycan shield and structure prediction of spike glycoprotein and its interaction with human cd function, and antigenicity of the sars-cov- spike glycoprotein structural and functional basis of sars-cov- entry by using human ace a systematic study of the n-glycosylation sites of hiv- envelope protein on infectivity and antibody-mediated neutralization n glycosylation site on v loop of a mutant gp regulates the sensitivity of hiv- to neutralizing monoclonal antibodies vrc / site-specific glycan analysis of the sars-cov- spike cryo-em structure of the -ncov spike in the prefusion conformation a noncompeting pair of human neutralizing antibodies block covid- virus binding to its receptor ace clinical course and outcomes of critically ill patients with sars-cov- pneumonia in china: a single-centered, retrospective, observational study characterization of a filovirus (mengla virus) from rousettus bats in china role of stem glycans attached to 
haemagglutinin in the biological characteristics of h n avian influenza virus pseudotyped viruses incorporated with spike protein from either sars-cov- , variants or mutants were constructed using a procedure described by us recently (nie et al., ). on day before transfection, t cells were prepared and adjusted to a concentration of - × cell/ml, ml of which were transferred into a t cell culture flask and incubated overnight at c in an incubator conditioned with % co . the cells generally reach - % confluence after overnight incubation. thirty micrograms of dna plasmid expressing the spike protein was transfected according to the user's instruction manual. the transfected cells were subsequently infected with g*∆g-vsv (vsv g pseudotyped virus) at a concentration of . × tcid /ml. these cells were incubated at °c for - hours in the presence of % co . afterwards, the cell supernatant was discarded, and the cells were rinsed gently with pbs + % fbs. next, ml of fresh complete dmem was added to the flask and the cells were cultured for h. twenty-four hours post infection, culture supernatants containing sars-cov- pseudotyped viruses were harvested, filtered ( . -µm pore size, millipore, cat#slhp rb) and stored at − °c in -ml aliquots until use. the % tissue culture infectious dose (tcid ) of sars-cov- pseudovirus was determined using a single-use aliquot from the pseudovirus bank to avoid inconsistencies resulting from repeated freeze-thaw cycles. for titration of the pseudotyped virus, a -fold initial dilution with six replicates was made in -well culture plates followed by serial -fold dilutions. the last column was used as the cell control without pseudotyped virus. subsequently, the -well plates were seeded with huh- cells adjusted to × cells/ml. after h incubation at °c in a humidified atmosphere with % co , the supernatant was gently aspirated and discarded to leave µl in each well; next, µl of luciferase substrate (perkinelmer, cat# ) was added to each well.
after -min incubation at room temperature in the dark, µl of lysate was transferred to white -well plates for the detection of luminescence using a luminometer (perkinelmer, ensight). a well was scored positive when its relative luminescence unit (rlu) value was ten-fold higher than that of the negative control (cells only). the % tissue culture infectious dose (tcid ) was calculated using the reed-muench method (nie et al., ). before quantification, all the pseudotyped viruses were purified through a % sucrose cushion by ultra-centrifugation at , × g for h (nie et al., ) resources table. using quantitative rt-pcr, we normalized the pseudotyped virus particles to the same amount. after normalization, µl of the pseudotyped virus with -fold dilution was added to wells in a -well cell culture plate. after the cells were trypsin-digested, × / µl cells were added to each well in the -well plates. the plates were then incubated at °c in a humidified atmosphere with % co . after incubation for hours, chemiluminescence detection was performed as described for the titration of pseudotyped viruses. each group contained - replicates. the virus neutralization assay was conducted as described previously (nie et al., ). briefly, µl serial dilutions of human sera or monoclonal antibody preparations were added into -well plates. after that, µl of pseudovirus at a concentration of tcid /ml was added into the plates, followed by incubation at °c for hour. afterwards, huh- cells were added into the plates ( × cells/ µl cells per well), followed by incubation at °c in a humidified atmosphere with % co . chemiluminescence detection was performed after hours of incubation. the reed-muench method was used to calculate the virus neutralization titer. the results are based on - replicates unless specified. to validate the test operation process, the coefficient of variation (cv) of the control replicates was required to be within % across six wells, as was the cv for the duplicate sample wells.
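the titration steps described above (scoring wells against the cells-only control, then applying the reed-muench method) can be sketched as follows. this is a minimal illustration and not the authors' script: the function names, the example well counts, and the inoculum volume parameter are assumptions introduced here, and the ten-fold positivity rule is the only threshold taken from the text.

```python
def score_wells(rlu_values, negative_rlu, fold=10.0):
    """Score a well positive when its RLU exceeds `fold` times the
    cells-only control (ten-fold, as stated in the text)."""
    return [rlu > fold * negative_rlu for rlu in rlu_values]


def reed_muench_log10_endpoint(log10_dilutions, positives, totals):
    """Return the log10 dilution at which 50% of wells are positive.

    `log10_dilutions` must be ordered from most concentrated to most
    dilute (for example [-1, -2, -3, ...]); `positives` and `totals`
    are the positive and total well counts at each dilution.
    """
    negatives = [t - p for p, t in zip(positives, totals)]
    n = len(positives)
    # Cumulative positives pool this dilution with all more dilute rows.
    cum_pos = [sum(positives[i:]) for i in range(n)]
    # Cumulative negatives pool this dilution with all more concentrated rows.
    cum_neg = [sum(negatives[: i + 1]) for i in range(n)]
    pct = [100.0 * cp / (cp + cn) for cp, cn in zip(cum_pos, cum_neg)]
    for i in range(n - 1):
        if pct[i] >= 50.0 > pct[i + 1]:
            # Proportionate distance between the two bracketing dilutions.
            distance = (pct[i] - 50.0) / (pct[i] - pct[i + 1])
            step = log10_dilutions[i] - log10_dilutions[i + 1]
            return log10_dilutions[i] - distance * step
    raise ValueError("50% endpoint is not bracketed by the tested dilutions")


def tcid50_per_ml(log10_endpoint, inoculum_ml):
    """Convert the endpoint dilution into TCID50 per ml.
    `inoculum_ml` is a hypothetical parameter; the source volume is elided."""
    return 10 ** (-log10_endpoint) / inoculum_ml


if __name__ == "__main__":
    # Illustrative counts only: six replicate wells per ten-fold dilution.
    endpoint = reed_muench_log10_endpoint(
        [-1, -2, -3, -4, -5], [6, 6, 4, 1, 0], [6, 6, 6, 6, 6]
    )
    print(endpoint, tcid50_per_ml(endpoint, 0.1))
```

the same interpolation applies to the neutralization titer, with the fraction of wells protected taking the place of the fraction of wells infected.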
graphpad prism was used for plotting and statistical analysis; the values were expressed as mean ±sem. one-way anova and holm-sidak's multiple comparisons test was used to analyze the differences between groups. a p-value of less than . was considered to be significant. * p< . , ** p< . , *** p< . , **** p< . , ns represents no significant difference. key: cord- - ti r eh authors: bruni, m.; cecatiello, v.; diaz-basabe, a.; lattanzi, g.; mileti, e.; monzani, s.; pirovano, l.; rizzelli, f.; visintin, c.; bonizzi, g.; giani, m.; lavitrano, m.; faravelli, s.; forneris, f.; caprioli, f.; pelicci, p. g.; natoli, g.; pasqualato, s.; mapelli, m.; facciotti, f. title: persistence of anti-sars-cov- antibodies in non-hospitalized covid- convalescent health care workers date: - - journal: nan doi: . / . . . sha: doc_id: cord_uid: ti r eh background. coronavirus disease- (covid- ) is a respiratory illness caused by the severe acute respiratory syndrome coronavirus (sars-cov- ), a novel beta-coronavirus. although antibody response to sars-cov- can be detected early during the infection, several outstanding questions remain to be addressed regarding magnitude and persistence of antibody titer against different viral proteins and their correlation with the strength of the immune response, as measured by serum levels of pro-inflammatory mediators. methods. an elisa assay has been developed by expressing and purifying the recombinant sars-cov- spike receptor binding domain (rbd), soluble ectodomain (spike), and full length nucleocapsid protein (n protein). sera from healthcare workers affected by non-severe covid- were longitudinally collected over four weeks, and compared to sera from patients hospitalized in intensive care units (icu) and sars-cov- -negative subjects for the presence of igm, igg and iga antibodies as well as soluble pro-inflammatory mediators in the sera. results. 
specificity and sensitivity of the elisa assays were high for anti-rbd igg and iga ( - %) and slightly lower for igm and the spike and n proteins ( - %). the elisa allowed quantification of igm, igg and iga antibody responses against all the viral antigens tested and showed a correlation between magnitude of the antibody response and disease severity. non-hospitalized subjects showed lower antibody titers and blood pro-inflammatory cytokine profiles as compared to patients in intensive care units (icu), irrespective of the antibodies tested. noteworthy, in non-severe covid- infections, antibody titers against rbd and spike, but not against the n protein, as well as pro-inflammatory cytokines decreased within a month after viral clearance. conclusions. rapid decline in antibody titers and in pro-inflammatory cytokines may be a common feature of non-severe sars-cov- infection, suggesting that antibody-mediated protection against re-infection with sars-cov- is of short duration. these results suggest caution in use serological testing to estimate the prevalence of sars-cov- infection in the general population. syndrome coronavirus (sars-cov- ), a novel beta-coronavirus firstly described in wuhan city, china, on december [ ] . sars-cov- spreading has been declared pandemic in mid-march by who [ ] . at present the virus has infected more than million people worldwide with an associated case fatality rate of to %, depending on the country [ ] . covid- is associated with a broad range of mild-to-severe symptoms, potentially leading to hospitalization in intensive care units (icu) for the most severe cases. the respiratory tract is initially involved with possible development of severe interstitial pneumonia [ , ] , albeit the gastrointestinal tract can also significantly participate in disease pathogenesis as a consequence of the expression of the ace receptor, that mediates sars-cov- viral entry [ ] , on both alveolar and enteric epithelial cells [ ] . 
infected subjects manifest a complex clinical pattern appearing as early as two days post exposure and lasting several weeks [ ]. infection with sars-cov- induces a prompt activation of the immune system, aimed at clearing infected cells [ ]. innate and adaptive immune cells accumulate at the site of infection, where production of cytokines and inflammatory mediators may result in patient recovery or, in case of ineffective viral clearance, in hyperactivation of the immune system and development of severe complications, such as acute respiratory distress syndrome (ards) [ , ]. overexpression of pro-inflammatory cytokines (e.g. il- beta, il- , il- , il- , tnfα) and impairment of humoral immunity have been described in patients with the most severe form of disease [ ]. antibodies against sars-cov- proteins are produced as a consequence of the activation of the humoral arm of the immune system. virus-specific igm antibodies are secreted as the first class of immunoglobulins, followed by the more specific igg [ ]. among the latter, those specific for the viral spike receptor binding domain (rbd), when present at higher titer, show direct neutralizing activity against viral entry into cells, as they prevent effective engagement of surface ace receptors by the spike protein [ , ]. the iga response against sars-cov- has been shown to be rapid and persistent [ , ] and is associated with mucosal responses, including the respiratory and gastrointestinal ones. serological testing is a valuable tool to monitor viral spread throughout the population [ ]. furthermore, serological assays allow the identification, for epidemiological purposes, of past infection in individuals with viral rna levels undetectable by rt-pcr [ ]. various commercial and in-house assays that utilize distinct viral antigens and detect different antibody classes are currently available.
however, sars-cov- serological tests available on the market do not always allow systematic simultaneous detection of a wide antibody spectrum for several antigens in a reliable manner, and this may hamper a proper population testing for clinical or epidemiological purposes [ ] . conversely, serological enzyme-linked immunosorbent assay (elisa) to detect immunoglobulins raised against the viral spike soluble ectodomain (spike) and its highly immunogenic receptor binding domain (rbd), or against the nucleocapsid protein (n), are providing promising results in terms of accuracy and reproducibility [ ] . recently, these elisa assays have been used to show that neutralizing antibodies (nab) against different viral antigens may decline after - days post symptoms onset, and that the magnitude of nab response may be associated with disease severity in covid- patients [ ] . in order to measure the presence and evolution of antibody responses against different viral proteins, we set up and validated an in-house direct elisa assay based on three distinct sars-cov- viral antigens, i.e. eukaryotically-expressed rbd and spike and bacterially-expressed nucleocapsid protein. using this assay, we simultaneously measured igm, igg and iga anti-viral antibodies titers in the sera of covid- patients, as well as levels of pro-inflammatory cytokines. in addition, we longitudinally collected the sera of convalescent healthcare workers who tested positive for sars-cov- by nasopharyngeal (nf) swabs, and were symptomatic but not hospitalized. our data show that humoral immune responses against sars-cov- correlated with disease severity in terms of both antibody titers, persistence over time and serum levels of pro-inflammatory cytokines. notably, % of covid- mildly symptomatic patients halved their anti-rbd igg titers after weeks from viral negativization, thus confirming the short lifespan of humoral immune responses against sars-cov- . 
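the statement that a share of mildly symptomatic subjects halved their anti-rbd igg titers amounts to a simple paired comparison between two time points. the sketch below shows one way such a fraction could be computed; the function names and the titer values are hypothetical and are not taken from the study.

```python
def fold_decrease(titer_first, titer_second):
    """Fold decrease of a titer between two time points (first / second)."""
    if titer_second <= 0:
        raise ValueError("second titer must be positive")
    return titer_first / titer_second


def fraction_with_decline(paired_titers, min_fold=2.0):
    """Fraction of subjects whose titer fell by at least `min_fold`.
    A value of 2.0 corresponds to a halved titer."""
    folds = [fold_decrease(first, second) for first, second in paired_titers]
    return sum(f >= min_fold for f in folds) / len(folds)


if __name__ == "__main__":
    # Hypothetical (first, second) titers for four subjects.
    example = [(8.0, 4.0), (10.0, 9.0), (6.0, 2.0), (5.0, 5.0)]
    print(fraction_with_decline(example))
```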
to evaluate the antibody response of individuals infected by sars-cov- , elisa assays were developed in-house by producing and purifying recombinant rbd, spike and nucleocapsid proteins of the sars-cov- virus following the protocols published in [ ] ( figure a ). the performances of these elisa assays were assessed for the different viral antigens and classes of antibodies by determining roc curves using i) a cohort of sera from covid- patients collected between april and june and tested positive for nasopharyngeal swabs, and ii) pre-covid- sera, collected between and (supplementary table and figure s ). anti-sars-cov igg showed the highest specificity and sensitivity irrespective of the antigen used (supplementary figures a,b) . anti-rbd igg showed specificity and sensitivity of % and %, respectively, while the assay performed with the spike ectodomain reached values of . % and % and the one with the n protein values of % and % (supplementary table and figure s ). these performances are in line with those published for both in-house and commercial assays approved for emergency use by the fda [ , ]. performance of iga detection was high for the rbd assay ( . % specificity and % sensitivity), while it was slightly lower for the n protein ( % and %) and for the spike ( % and %). the performance of the igm assay was comparatively lower for all the viral proteins tested (supplementary figures a,b) . the validated elisa assays were then used to systematically test the antibody titers of different classes of sars-cov- specific antibodies in sera from the following groups of patients: i) severe covid- patients admitted to icus; ii) health care workers from two hospitals in milan, exposed to the virus between february and march and confirmed positive to sars-cov- rna by rt-qpcr on nasopharyngeal swabs. sars-cov- -negative subjects collected between april and june were used as negative controls (supplementary table ). 
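the specificity and sensitivity figures reported above come from comparing elisa readouts of swab-positive sera with pre-covid sera at a given cutoff. the sketch below illustrates that calculation under stated assumptions: the readout values are invented, and choosing the cutoff by youden's index is an assumption made here, since the text does not say how the cutoff on the roc curve was selected.

```python
def sensitivity_specificity(positive_values, negative_values, cutoff):
    """Sensitivity and specificity when readouts >= cutoff are called positive."""
    true_pos = sum(v >= cutoff for v in positive_values)
    true_neg = sum(v < cutoff for v in negative_values)
    return true_pos / len(positive_values), true_neg / len(negative_values)


def roc_points(positive_values, negative_values):
    """One (cutoff, sensitivity, specificity) triple per observed readout."""
    cutoffs = sorted(set(positive_values) | set(negative_values))
    return [
        (c, *sensitivity_specificity(positive_values, negative_values, c))
        for c in cutoffs
    ]


def best_cutoff_youden(positive_values, negative_values):
    """Cutoff maximizing sensitivity + specificity - 1 (Youden's index).
    This selection rule is an assumption, not the authors' stated method."""
    return max(
        roc_points(positive_values, negative_values),
        key=lambda point: point[1] + point[2] - 1.0,
    )


if __name__ == "__main__":
    # Hypothetical optical-density readouts.
    covid_sera = [0.9, 1.2, 0.8, 1.5]
    pre_covid_sera = [0.1, 0.2, 0.5, 0.3]
    print(best_cutoff_youden(covid_sera, pre_covid_sera))
```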
sera of the health care workers were collected in the convalescence phase of the disease after two consecutive negative nasopharyngeal swab tests. time between the first detection of the virus and the first negative swab ranged from to days from onset of symptoms to disappearance of viral rna (supplementary table ). these subjects all manifested clinical symptoms strongly related to sars-cov- infection, including fever, ageusia, anosmia, fatigue, myalgia, diarrhea, coryza and cough [ ]. two of them manifested a more severe disease course with episodes of dyspnea. none of the patients required hospitalization and they all recovered from the disease (supplementary table ). non-hospitalized covid- subjects manifested a lower antibody titer as compared to severe icu patients for all the tested antibody classes and viral antigens (figure b-d). this finding is in accordance with what has been published for asymptomatic [ ] and paucisymptomatic [ ] patients whose antibody titers were detected using commercial elisa or chemiluminescence assays against either the spike or the n protein. when comparing the presence of the different classes of antibodies, all the covid- positive subjects tested positive for igg antibodies against all the viral antigens tested (figure e). interestingly, a few of them were igm negative or had an antibody concentration close to the detection limit in the spike and rbd assays, as compared to the n protein assay. the observation that all of them instead showed n-specific igm antibodies may reflect a genuine persistence of anti-n protein igm or a lower specificity of the n assay, possibly due to the high conservation of the n proteins among beta-coronaviruses other than sars-cov [ ]. interestingly, % of the non-hospitalized covid- patients did not develop rbd-specific iga, and only out of developed n-specific iga antibodies, a percentage that was instead above % for the hospitalized ones (figure e).
notably, iga production has been associated with disease severity, suggesting that iga production might occur locally at the mucosal sites, possibly correlating with the viral load, the duration of the viral exposure and the virus entry route [ , ] . consistently, a recent communication [ ] confirmed that the highest levels of igg and iga antibodies against the spike s domain, encompassing the n-terminal half of the protein with the rbd, were associated with severe disease [ , ] since severe covid- is associated with a strong release of pro-inflammatory cytokines [ , ] table ). icu patients, whose sera were collected in the acute phase of the disease, showed a sustained production of pro-inflammatory mediators, among which il- , il- a, il- p , il- b, il- , il- and il- , all associated with the "cytokine storm" observed in very severe covid- patients, were the most abundantly detected (figure a ). on the contrary, even in the early convalescent phase, those cytokines were undetectable in the sera of non-hospitalized it is tempting to speculate that as a consequence of the higher conservation of the n protein compared to the spike protein across different coronavirus species [ ] , antibodies produced against previous common cold coronaviruses (and cross-reacting with the sars-cov antigens) might still be present in the sera at high levels, and therefore be detectable at the same titers, while the antibodies specific to sars-cov- decline. interestingly, similarly to the antibody titers, the presence of proinflammatory mediators in the sera of convalescent patients also decreased over time and became almost undetectable one month after a negative pcr for viral rna, a finding that mirrors the successful control of the infection and the consequent switch off of the immune response ( figure f, supplementary figure ) . overall, we suggest that the decline in antibody titer and pro-inflammatory cytokines is a common characteristic of sars-cov- infection. 
this study has therefore important implications for the use of serological testing for the monitoring of infection outbreaks against re-infection with sars-cov- . our results indicate that the detection of antibodies with serological assays for epidemiological and monitoring purposes in non-hospitalized seroconverted covid + subjects, who most likely represent the majority of people who encountered the virus, is highly reliable only within a limited window of time after viral clearance. human subjects. infection (by nf swab), not hospitalized but with manifested covid- symptoms (supplementary table ) were monitored for seroconversion by igm, igg and iga serum levels at two time points after viral clearance between april and june . the study has been conducted in accordance with the standards of good clinical practice, with the ethical principles deriving from the helsinki declaration and the current legislation on observational studies. clearance from the ethical committee has been obtained (ieo ). additional study populations were icu hospitalized severe covid- patients (n= ) and (n= ) covid- negative subjects whose sera were collected between april and june . pre-covid subjects enrolled in ieo studies between and were used to calculate the roc curves for the assays. the recombinant spike sars-cov glycoprotein receptor binding domain (rbd) and the soluble fulllength trimeric ectodomain have been produced in mammalian hek f cells as glycosylated proteins by transient transfection with pcaggs vectors generated in prof. krammer's laboratory [ ] . the constructs were synthesized using the genomic sequence of the isolated virus, wuhan-hi- released in january , and contain codons optimized for expression in mammalian cells. briefly, hek f cells were seeded at a final concentration of . million/ml in freestyle medium (thermo fisher scientific), incubated at °c, % co at rpm o/n in an eppendorf new brunswick s i incubator. 
the day after, hek f cells were transfected using µg of dna per x cells and a dna:pei max ratio of : in optimem medium. h post-transfection, the medium was supplemented with peptone primatone rl (merck) to a final concentration of . % w/v. cells were then incubated for days, checking cell viability daily if needed (a mortality higher than % is indicative of a toxic protein). the elisa assay to detect immunoglobulins (ig) uses fragments of the sars-cov spike glycoprotein (s-protein) and the nucleocapsid (n protein) as antigens based on the protocol published in [ , ]. after binding of the proteins (rbd and n proteins) to a nunc maxisorp elisa plate, and blocking of nonspecific binding with pbs-bsa %, patients' sera to be analyzed were applied to the plate to allow for emergency use approval. the assay has been validated with a cohort of n= covid- + subjects (severe, moderate and mild disease) and n= subjects collected in the pre-covid era (between and ). roc curves were used to determine sensitivity and specificity of the assay (supplementary figure ). multiplexing analysis of sera cytokines. quantification of soluble biomarkers was performed in sera of patients collected immediately after virus clearance ( consecutive negative nf swabs) and one month post virus clearance using a luminex immunoassay (human cytokine/chemokine/gf procartaplex plex, thermo fisher) with map technology according to the manufacturer's protocol. samples were acquired on a luminex sd and analyzed with xponent software . . the sera of healthy subjects (n= ) collected between april and june as well as icu covid- + patients (n= ) were used as control groups. the categorical variables were described as absolute frequency and percentage. the continuous variables with normal distribution were described as mean ± standard deviation (sd), whereas the continuous variables without normal distribution were given as median and range.
normality of continuous variables was checked with d'agostino-pearson omnibus normality test. the mann-whitney test or student's t-test for continuous variables, and the chi-square or fisher's exact tests for categorical variables, were used to associate clinical variables with the result of sars-cov- serological test (positive or negative). the p values lower than . , two-tailed, will be considered statistically significant. prism software was used for all statistical analyses. cytokines not significantly different between icu (dark blue symbols) and non-hospitalized (blue symbols) covid+ patients (c) chemokines levels in sera of patients (icu, dark blue, not hospitalized blue symbols, healthy subjects light blue). p < . (*), p < . (**) p < . (***), p < . (****) were regarded as statistically significant. ns, not significant (e) cumulative fold decrease between t and t antibody titers in elisa assays against the rbd (squares), the spike ectodomain (circles) and the n (triangles) sars-cov viral proteins. (f) longitudinal variation of serum cytokines and chemokines in non-hospitalized covid- + patients. statistical significance was calculated using kruskal-wallis nonparametric test for multiple comparisons. p < . (*), p < . (**) p < . (***) were regarded as statistically significant. ns, not significant. figure : cytokine levels in sera of covid + patients. cytokines not significantly different between hospitalized (dark blue) and non-hospitalized (blue) covid+ patients. statistical significance was calculated using kruskal-wallis nonparametric test for multiple comparisons. p < . (*) were regarded as statistically significant. supplementary figure : (a) growth factors present in icu hospitalized (dark blue) but not in non-hospitalized (blue) covid+ patients (b) growth factors not significantly different between icu hospitalized (dark blue) and non-hospitalized (blue) covid+ patients. 
statistical significance was calculated using kruskal-wallis nonparametric test for multiple comparisons. p < . (*), p < . (**) were regarded as statistically significant. a new coronavirus associated with human respiratory disease in china clinical characteristics of hospitalized patients with novel coronavirus-infected pneumonia in clinical features of patients infected with novel coronavirus in wuhan angiotensin-converting enzyme (ace ) as a sars-cov- receptor: molecular mechanisms and potential therapeutic target evidence for gastrointestinal infection of sars-cov- the trinity of covid- : immunity, inflammation and intervention risk factors associated with acute respiratory distress syndrome and death in patients with coronavirus disease antibody responses to sars-cov- in patients with covid- a serological assay to detect sars-cov- seroconversion in humans analysis of a sars-cov- -infected individual reveals development of potent neutralizing antibodies with limited somatic mutation iga-ab response to spike glycoprotein of sars-cov- in patients with covid- : a longitudinal study piano mortari e, terreri s, spectrum of innate and adaptive immune response to sars-cov- infection across asymptomatic, mild and severe cases-a longitudinal study meta-analysis of diagnostic performance of serological tests for sars-cov- antibodies up to april and public health implications temporal profiles of viral load in posterior oropharyngeal saliva samples and serum antibody responses during infection by sars-cov- : an observational cohort study antibody tests in detecting sars-cov- infection: a meta-analysis. 
diagnostics (basel) longitudinal evaluation and decline of antibody responses in sars-cov infection sars-cov- seroconversion in humans: a detailed protocol for a serological assay, antigen production, and test setup clinical and immunological assessment of asymptomatic sars-cov- infections crystal structure of sars-cov- nucleocapsid protein rna binding domain reveals potential unique drug targeting sites distinct early iga profile may determine severity of covid- symptoms: an immunological case series. medrxiv phenotype and kinetics of sars-cov- -specific t cells in covid- patients with acute respiratory distress syndrome key: cord- -sju hev authors: hu, yiwen; buehler, markus j. title: comparative analysis of nanomechanical features of coronavirus spike proteins and correlation with lethality and infection rate date: - - journal: matter doi: . /j.matt. . . sha: doc_id: cord_uid: sju hev the novel coronavirus disease, covid- , has spread rapidly around the world. its causative virus, sars-cov- , enters human cells through the physical interaction between the receptor-binding domain (rbd) of its spike protein and the human cell receptor ace . here, we provide a novel way of understanding coronavirus spike proteins, connecting their nanomechanical features, specifically their vibrational spectrum and quantitative measures of mobility, with virus lethality and infection rate. the key result of our work is that both the overall flexibility of the upward rbd and the mobility ratio of rbds in different conformations are significant factors that show a positive scaling with virus lethality and an inverse correlation with the infection rate. our analysis shows that epidemiological virus properties can be linked directly to pure nanomechanical, vibrational aspects, offering an alternative way of screening new viruses and mutations, and potentially exploring novel ways to prevent infections from occurring.
the novel coronavirus disease, covid- , has spread rapidly around the world [ ] [ ] [ ] [ ] [ ] . its causative virus, sars-cov- , enters human cells through the interaction between the receptor-binding domain (rbd) of its spike protein and the cell receptor ace [ ] [ ] [ ] . due to the significant role that the coronavirus spike protein plays in receptor recognition, viral fusion and cell entry, it is a promising target for drug and vaccine development. here, we provide a novel way of understanding coronavirus spike proteins by connecting their nanomechanical features, especially their vibrational patterns, with virus lethality and infection rate . in a broader context, the mechanics of proteins has long been a subject of interest, and this study shows how it can be a useful tool to help us understand complex disease etiology by connecting nanoscopic physical features with epidemiological data [ ] [ ] [ ] [ ] [ ] [ ] [ ] . to provide a comparative analysis, specifically of how nano-level features relate to macroscopic epidemiological observables, we focus on different coronavirus types within the same family of pathogens. over the past decades, several types of coronaviruses have emerged. the virus types hcov-nl and hcov-hku are often reported to cause lower respiratory tract infections, while hcov-oc and hcov- e are usually associated with comparatively mild symptoms similar to the common cold , , . the ones that threaten public health more seriously are three highly pathogenic human coronaviruses, namely sars-cov, mers-cov and sars-cov- . sars-cov was first reported in china in november , then quickly spread globally, resulting in over , infections with about deaths . mers-cov was first identified in saudi arabia in june , featuring limited transmission with a case fatality rate as high as % , . sars-cov- was first reported in china in december , , . 
it can easily transmit from human to human, resulting in more than million global cases as of october , . the spike protein of the coronavirus plays an essential role in receptor recognition, viral fusion and cell entry , , . cell entry is a complex mechano-chemical process: during entry into the host cell, the spike protein first binds to a cell receptor through the receptor-binding domain (rbd) and then begins the fusion process. it is believed that the rbds of different coronaviruses recognize different cell receptors , . sars-cov, sars-cov- and hcov-nl recognize angiotensin-converting enzyme (ace ) as their receptor in the human body, while mers-cov recognizes dipeptidyl peptidase (dpp ) as its receptor , , , . in order to successfully bind to the receptor, the spike protein of a specific coronavirus must maintain a receptor-accessible state with at least one rbd in upward conformation. otherwise, steric clashes would hinder the binding process . in experimental work, this specific type of receptor-accessible state has been captured for mers-cov, sars-cov and sars-cov- , . while the structure of the coronavirus spike protein is well studied, much less attention has been paid to the connection between the mechanical features of the virus and its lethality and infection rate. the structural basis of sars-cov viral infectivity has been explored to some extent, pointing out that the trp-rich region of the s protein is essential . it has also been observed that cleavage of the spike protein of sars-cov is associated with viral infectivity . however, there has been no lateral comparative study of this type of connection across similar coronaviruses. it remains an open question which mechanical and structural properties relate to the mortality and infection rate of the virus. 
if successful, this approach may provide an alternative or complementary way to screen viruses or mutations against large-scale epidemiological data, provide additional mechanistic insights into disease etiology, and offer potential targets for therapies or preventive measures. in protein science, normal mode analysis (nma) has long been one of the most comprehensive yet efficient methods to calculate vibrational normal modes and analyze protein flexibility, which provides the rationale for its use in this study . another reason behind the broad use of nma is that the low-frequency modes elucidated by nma can often describe the real-world motions of a protein, and often bear important functional significance . in nma, the atoms are modeled as point masses connected by springs that represent the interatomic force fields. after constructing the hessian matrix from the second-order partial derivatives of the potential energy function, the normal modes and corresponding frequencies can be obtained directly by diagonalizing the matrix and computing its eigenvalues (for further details, see experimental procedures). sars-cov- enters human cells through the interaction between the receptor-binding domain (rbd) of its spike protein and the cell receptor ace , as depicted in the schematic shown in figure . our study hence focuses on the spike protein, which is essential for the infection to take place. the spike protein of coronavirus is composed of an amino (n)-terminal s subunit and a carboxyl (c)-terminal s subunit. the s subunit, which consists of an n-terminal domain (ntd) and three c-terminal domains (ctd), is responsible for recognizing and binding to the host cell receptor. it has been reported that for the betacoronavirus that utilizes ctd of its s subunit as rbd, there is a prerequisite conformational state for receptor binding where at least one rbd should be upward , . 
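the nma procedure described above (point masses, springs, hessian, diagonalization) can be illustrated on a toy system. the sketch below is not the bio d elastic network model used in this study; it assumes unit masses, a single spring constant and a distance cutoff (a simple anisotropic network model), and the names `anm_hessian`, `normal_modes` and `fluctuations` are hypothetical. the test structure is a unit cube, chosen only because it is small and rigid.

```python
import numpy as np


def anm_hessian(coords, cutoff=1.5, k_spring=1.0):
    """Build the 3N x 3N Hessian of a spring network (unit masses assumed)."""
    n = len(coords)
    hess = np.zeros((3 * n, 3 * n))
    for i in range(n):
        for j in range(i + 1, n):
            d = coords[j] - coords[i]
            r2 = d @ d
            if r2 > cutoff ** 2:
                continue  # no spring between nodes farther apart than the cutoff
            # Off-diagonal 3x3 block: second derivatives of the pair potential.
            block = -k_spring * np.outer(d, d) / r2
            hess[3 * i:3 * i + 3, 3 * j:3 * j + 3] = block
            hess[3 * j:3 * j + 3, 3 * i:3 * i + 3] = block
            # Diagonal blocks are minus the sum of the off-diagonal blocks.
            hess[3 * i:3 * i + 3, 3 * i:3 * i + 3] -= block
            hess[3 * j:3 * j + 3, 3 * j:3 * j + 3] -= block
    return hess


def normal_modes(hess, tol=1e-8):
    """Diagonalize the Hessian and drop the zero-frequency rigid-body modes."""
    evals, evecs = np.linalg.eigh(hess)
    keep = evals > tol
    return evals[keep], evecs[:, keep]


def fluctuations(evals, evecs):
    """Per-node fluctuation (arbitrary units): sum over modes of |v|^2 / eigenvalue."""
    n = evecs.shape[0] // 3
    weighted = evecs ** 2 / evals  # each column (mode) is divided by its eigenvalue
    return weighted.sum(axis=1).reshape(n, 3).sum(axis=1)


# Toy structure: the 8 corners of a unit cube, with springs along edges and
# face diagonals (both shorter than the 1.5 cutoff).
cube = np.array([[x, y, z] for x in (0.0, 1.0) for y in (0.0, 1.0) for z in (0.0, 1.0)])
hessian = anm_hessian(cube)
eigenvalues, modes = normal_modes(hessian)
profile = fluctuations(eigenvalues, modes)
```

in a real analysis the fluctuations would also be scaled by temperature and the spring constants would come from the chosen force field; both are left out of this sketch.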
in this paper, we refer to this receptor-accessible state as "open state" and to the receptor-inaccessible state as "closed state". as outlined in figure , we conduct a normal mode analysis (nma) . our open-state analysis is limited to the three highly pathogenic beta-coronaviruses, mers-cov, sars-cov and sars-cov- . for closed-state exploration, since there have been no reports of a closed conformational state of the mers-cov spike protein, we choose the hcov-nl spike protein because it shares the same relevant cell receptor ace with the sars-cov and sars-cov- spike proteins. figure shows that the lowest-frequency normal modes of mers-cov, sars-cov and sars-cov- spike proteins are all associated, to different extents, with a swing motion of the upward receptor-binding domain (rbd). this type of global movement corresponds well with the molecular motions directly reported in experiments , . considering the required receptor-accessible state for receptor binding, the swing of the rbd is of functional significance since it is the likely way by which a spike protein changes from closed state to open state to facilitate the binding of target receptors. from a lateral aspect, this shared type of lowest-frequency normal mode movement indicates the structural similarity of mers-cov, sars-cov and sars-cov- spike proteins. according to our analysis, while the sequence identity of sars-cov s and sars-cov- s is as high as . %, mers-cov s has only about % sequence identity with sars-cov s and sars-cov- s. this indicates that a small portion of the whole beta-coronavirus sequence largely determines the general structure topology and thus the overall global motion (for details, see supplementary material, figure s ). figure s depicts a sequence alignment of mers-cov s, sars-cov s and sars-cov- s, where identical residues are denoted by *. 
the analysis reported in reveals that the percentage identity in ntd and rbd, the major parts participating in the lowest normal mode movement, is about %. compared with the % sequence identity of the whole spike protein, this partial percentage identity is lower and supports the concept that likely only a few sequence pieces ultimately determine the shared global movement of the coronavirus spike protein in open state. we further note that the sequence similarity in the s subunit could play an important role, as it may contribute to the comparatively higher rigidity of the s subunit. while mers-cov, sars-cov and sars-cov- spike proteins share the same type of lowest-frequency normal mode movement, their fluctuation profiles differ dramatically, as illustrated by figure . generally speaking, the s subunit of the coronavirus spike protein is much more flexible than its s subunit. among these beta-coronaviruses, the active rbd of the mers-cov spike shows very high mobility, and only the sars-cov- spike protein and its d g mutant exhibit a comparatively flexible downward rbd. interestingly, when we focus on fluctuations over the upward rbd, the figure reveals that for all these spike proteins the fluctuations first slowly build up to their maximum and then decrease sharply, which suggests that the large flexibility of the upward rbd is based on structural details common to these beta-coronaviruses. notably, panel (d) of figure shows that the d g mutation decreases the general flexibility of the upward rbd and significantly enhances the mobility of a limited region in one downward rbd in the spike protein. it also shows that the d g mutation results in a slight general decrease in flexibility in the ntds of all three chains and, importantly, no noticeable differences in other areas. 
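the sequence identity comparison discussed above (whole spike versus the ntd and rbd regions) can be sketched as follows. this is a minimal illustration, not the alignment tool used in the study: it assumes two already-aligned sequences with "-" marking gaps, counts identity over all aligned columns (other conventions exist), and uses made-up toy sequences. the function name `percent_identity` is hypothetical.

```python
def percent_identity(seq_a, seq_b):
    """Percent of aligned columns in which both sequences carry the same residue."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns where both sequences have a gap.
    columns = [(a, b) for a, b in zip(seq_a, seq_b) if not (a == "-" and b == "-")]
    if not columns:
        return 0.0
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


# Toy aligned sequences (illustrative only, not real spike protein data).
aligned_a = "ACDE-G"
aligned_b = "ACDQFG"

whole = percent_identity(aligned_a, aligned_b)
# Identity restricted to a region of interest, given as alignment columns.
region = percent_identity(aligned_a[0:3], aligned_b[0:3])
```

restricting the calculation to a column range is how a per-domain identity (for example over the ntd or rbd columns of an alignment) could be obtained from the same function.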
using the fluctuation profile data, two significant mechanical factors are identified: ( ) the overall flexibility of the upward rbd and ( ) the mobility ratio of rbds in different conformations. figure provides a correlation diagram for mers-cov, sars-cov and sars-cov- spike proteins, where the overall flexibility of the upward rbd is evaluated by the average fluctuation of the open-state rbd and the mobility ratio is quantified as the ratio of maximum fluctuations over upward and downward rbds. the data show that both factors correlate positively with case fatality rate and inversely with virus infectivity. the smaller the mobility ratio, the larger the mobility of the downward rbd relative to the upward one, which could indicate a larger possibility of a second standing rbd. this would make it even easier for the spike protein to bind to the host cell receptor and thereby increase virus infectivity. on the other hand, if the flexibility of the downward rbd is not large enough to generate a conformational change, then as the mobility ratio decreases it becomes more difficult for the receptor to bind with the right rbd, since the downward one is quite active. this may provide an explanation for the positive correlation between mobility ratio and virus lethality that we see in the epidemiological data. one possible reason for the positive relationship between overall flexibility of the upward rbd and the mortality rate could be that the flexible upward rbd is more active when binding to the receptor, and may hence benefit the subsequent membrane fusion process. even though there is limited empirical data available at this point, there is an intrinsic negative relationship between the mortality rate and infection rate of mers-cov, sars-cov and sars-cov- , which could help explain why the infectivity is inversely correlated with the general flexibility of the upward rbd. 
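the two mechanical factors defined above can be computed from a per-residue fluctuation profile roughly as follows. this is a sketch under assumptions: the residue ranges for the upward and downward rbds are supplied by the caller, the downward range may cover more than one downward rbd (the maximum is then taken across all of them), and the function name `rbd_factors` and the toy numbers are hypothetical.

```python
import numpy as np


def rbd_factors(fluct, up_idx, down_idx):
    """Return (overall flexibility of upward RBD, mobility ratio).

    overall flexibility: mean fluctuation over the upward RBD residues.
    mobility ratio: max fluctuation over the upward RBD divided by the
    max fluctuation over the downward RBD residues.
    """
    fluct = np.asarray(fluct, dtype=float)
    up = fluct[up_idx]
    down = fluct[down_idx]
    overall_flexibility = up.mean()
    mobility_ratio = up.max() / down.max()
    return overall_flexibility, mobility_ratio


# Toy fluctuation profile (arbitrary units); indices 2..4 stand in for the
# upward RBD and indices 5..7 for a downward RBD.
toy_profile = [0.1, 0.2, 1.0, 3.0, 2.0, 0.5, 1.0, 0.5]
flexibility, ratio = rbd_factors(toy_profile, slice(2, 5), slice(5, 8))
```

a smaller ratio in this sketch means the downward rbd is comparatively more mobile, which is the regime discussed in the text above.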
many other factors sit between the nanomechanical and epidemiological levels, such as binding affinities and dysregulation of type i interferon responses , and their influence on the different epidemiological characteristics of coronaviruses has not been fully explained. our analysis nevertheless points out a direct correlation between nanomechanical features and the lethality and infection rate of coronaviruses. the goal is to improve our understanding of the direct relationship between the nanoscale and the epidemiological level, without considering the intermediate relationships. as the results show, this perspective provides useful insights into the mechanics of disease relationships ( figure s ). figure shows different flexibility variations in sars-cov- , sars-cov and hcov-nl spike proteins, which share the same human receptor ace . among them, there is a sharp increase and large variation in flexibility in the rbd of sars-cov- s, while sars-cov s appears to have more flexible structural regions. hcov-nl , the only one in this comparative study that is classified as an alpha-coronavirus and causes only mild symptoms, has a different s subunit topology, bringing more flexibility to the ntd rather than the ctd , where its rbd is situated. thus, our suggestion about the importance of the general flexibility of the upward rbd in open-state analysis needs to be expanded. for these three coronaviruses in closed states, the overall flexibility of the rbd in a single protomer is positively associated with disease severity, which decreases in the order sars-cov, sars-cov- and hcov-nl . panels (e) to (g) in figure provide detailed structures of the rbds of the above viruses complexed with their shared receptor, human ace . 
the ctd in the s subunit of coronavirus often contains a core structure, which is composed of several antiparallel β-sheets and short connecting α-helices, and one or more extended loops. these extended loops, referred to as the receptor binding motif (rbm), are located at the edge of the core structure and are usually responsible for the interactions with the receptor if the virus uses ctd as its rbd. beta-coronaviruses such as sars-cov and sars-cov- have a unique long-extended loop as their rbm, accounting for the most flexible region of their rbd in both open and closed states. as for hcov-nl , there are three separated short rbms, which are quite restricted and unable to generate large mobility. these insights may also explain its lower-affinity interaction with ace , at least to some extent. we report an analysis linking key nanomechanical vibrational features of various coronavirus spike proteins with epidemiological data. as shown in figure , structural similarity and a major movement associated with the swing of the upward rbd are seen throughout this family of viruses, representing a sort of universal feature of this class of pathogens. the molecular modeling results show that the general motion corresponds with experimental observation and has functional importance. we find that the active rbd of mers-cov shows very high mobility, whereas only sars-cov- and its d g variant show a comparatively flexible downward rbd. the more recently occurring d g mutation decreases the general flexibility of the upward rbd and largely enhances the mobility of a small region in one downward rbd in the spike protein. as shown in figure , the key result of this study, the general fluctuation profiles of the upward rbd and the associated fluctuation ratio have a positive correlation with case fatality rate and an inverse relationship with virus infectivity. 
our results offer two different explanations for the effects of the flexible downward rbd: ( ) the possibility of a second standing rbd if the mobility is large enough, and ( ) that it could indicate difficulty for the receptor to bind with the right rbd if no conformational change happens. we hypothesize that there may be a threshold between these two effects, which could be studied in future research. these insights suggest possible applications; for example, a search for inhibitors that bind to the downward rbd may provide a viable strategy. we find a sharp increase and large variation in flexibility in sars-cov- s, whereas more flexible structural regions are present in the closed-state sars-cov s. the long-extended loop is unique to beta-coronaviruses, and also accounts for the most flexible region in open states. hcov-nl s has three separated short rbms, which are unable to generate large mobility. this may also explain its lower-affinity interaction with ace . as for possible applications, we may target the rbm to identify new inhibitors that lock the closed s-protein conformation. this raises the question of whether we can utilize the significant flexibility difference for potential inhibitor screening for the development of novel treatment methods, or to design future experiments for gain-of-function research , . future work may address additional influences of temperature dependence (to explore whether seasonal variations of temperature and other environmental factors can be linked to nanoscopic phenomena). other aspects may include a more detailed analysis of intermediate steps in the mechanistic hierarchical progression as schematically outlined in figure s , including aspects of the strong age-dependence of covid- disease progression. please contact prof. markus buehler via mbuehler@mit.edu. no new materials were generated in this work. all data are available upon request to the lead contact author. 
to assess the molecular mechanics from an atom-by-atom perspective, we conduct normal mode analyses (nma) of coronavirus spike proteins in the receptor-accessible state with one upward rbd as well as in the receptor-inaccessible state where all three rbds are in downward position. we access the desired three-dimensional protein structures from the protein data bank and prepare the atomistic models with visual molecular dynamics (vmd) , . before normal mode analysis, , steps of conjugate gradient energy minimization are performed using nanoscale molecular dynamics (namd) in order to relax the protein structure . no further md simulation is performed since the protein structure from the protein data bank is already experimentally equilibrated. a coarse-grained elastic network model (enm) available in the bio d package in r is employed to analyze the normal modes of coronavirus spike protein structures [ ] [ ] [ ] . this model uses n, ca, c atoms to represent the protein backbone and selects to significant side chains based on their size and distance to ca atoms, an approach shown to have accuracy comparable to all-atom enm. here, the atomic displacements are scaled for temperature k. for the sars-cov- d g mutant, we implement the mutation in the open-state spike protein (pdb id: vsb) and carry out , steps of energy minimization and md simulation for ns with namd. we then compute the average root mean square deviation (rmsd) over the last ns equilibrium period and pick out frames with the nearest rmsd from the trajectory file so that we can conduct normal mode analysis on them. the fluctuation profile of the d g mutant is calculated as the average of the normal mode fluctuations of these configurations. we notice that some local unfolding occurs at the end of each chain, which induces abnormally high fluctuations in the fluctuation profile. since these local events are far from the rbd, we do not consider them in the analysis and set the fluctuations of the terminal - residues to zero. 
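the post-processing steps described above for the mutant (picking frames whose rmsd is nearest to the equilibrium average, averaging their fluctuation profiles, and zeroing the chain termini) can be sketched as follows. this is an illustration under assumptions, not the authors' script: the rmsd values and profiles are toy numbers, the number of frames and of zeroed terminal residues are left as parameters because the source values are not recoverable here, and the names `nearest_to_mean` and `averaged_profile` are hypothetical.

```python
import numpy as np


def nearest_to_mean(rmsd, n_frames):
    """Indices of the n_frames frames whose RMSD is closest to the mean RMSD."""
    rmsd = np.asarray(rmsd, dtype=float)
    order = np.argsort(np.abs(rmsd - rmsd.mean()), kind="stable")
    return np.sort(order[:n_frames])


def averaged_profile(profiles, chain_lengths, n_tail):
    """Average per-frame fluctuation profiles, then zero each chain's C-terminal tail."""
    mean = np.mean(np.asarray(profiles, dtype=float), axis=0)
    end = 0
    for length in chain_lengths:
        end += length
        mean[end - n_tail:end] = 0.0  # last n_tail residues of this chain
    return mean


# Toy data: RMSD per trajectory frame over the equilibrium window.
toy_rmsd = [1.0, 2.0, 3.0, 4.0, 10.0]
selected = nearest_to_mean(toy_rmsd, n_frames=2)

# Toy fluctuation profiles for two selected frames; two chains of 2 residues each.
toy_profiles = [[1.0, 2.0, 3.0, 4.0], [3.0, 4.0, 5.0, 6.0]]
mutant_profile = averaged_profile(toy_profiles, chain_lengths=[2, 2], n_tail=1)
```

in practice each selected frame would first go through its own normal mode analysis to produce the per-frame profile that is averaged here.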
for consistency, the same approach is used for the fluctuation difference between the sars-cov- spike protein and the d g mutant. the global confirmed case numbers and the case fatality rate presented in figure are updated as of sep , . according to the analysis based on the gisaid sars-cov- sequence database, by aug , , % of the global sequence database were identified with the d g mutation ( sequences counted in total) , . based on this estimate we use % of the covid- global case number as the infection number of sars-cov- with g and the remaining % as the case number of the original sars-cov- virus type. since there has been little evidence assessing an association between the d g mutation and disease severity, the same case fatality rate is applied to the original sars-cov- spike protein and its d g variant. the two factors depicted in figure
the causative virus of covid- , sars-cov- , enters human cells through the physical interaction between the receptor-binding domain (rbd) of its spike protein and the human cell receptor ace . while the structure of the coronavirus spike protein is well studied, it remains unclear how the mechanical features of the virus affect its epidemiological characteristics. here, we report that both the overall flexibility of the upward rbd and the mobility ratio of rbds in different conformations represent significant factors that show a positive scaling with virus lethality and an inverse correlation with the infection rate. our analysis shows that epidemiological virus properties can potentially be linked directly to pure nanomechanical, vibrational aspects, offering an alternative way of screening new viruses and mutations, and perhaps even novel ways to prevent infections from occurring. 
• this work provides a novel way of understanding coronavirus pathology using mechanics
• reports major movement associated with the swing of upward rbd for open-state viruses
• links key nanomechanical vibrational features directly to the epidemiological data
• provides possibility of screening new viruses or mutations from a mechanical aspect
we provide a novel way of understanding coronavirus spike proteins, connecting their nanomechanical features, specifically their vibrational spectrum and quantitative measures of mobility, with virus lethality and infection rate. our study shows that the nanomechanics of proteins, captured in their continuous motions, can be a useful tool to help us understand complex disease etiology by connecting nanoscopic physical features with epidemiological data. potential applications include developing mechanical ways of screening new viruses and mutations, and exploring novel treatment methods.
crystal structure of nl respiratory coronavirus receptor-binding domain complexed with its human receptor sartorius products receptor recognition by the novel coronavirus from wuhan: an analysis based on decade-long structural studies of sars coronavirus cryo-electron microscopy structures of the sars-cov spike glycoprotein reveal a prerequisite conformational state for receptor binding human coronavirus nl , a new respiratory virus cryo-em structure of the -ncov spike in the prefusion conformation structural biology: structure of sars coronavirus spike receptor-binding domain complexed with receptor structure of the sars-cov- spike receptor-binding domain bound to the ace receptor coiled-coil intermediate filament stutter instability and molecular unfolding deformation and failure of protein materials in physiologically extreme conditions and disease triangular core as a universal strategy for stiff nanostructures in biology and biologically inspired materials nanomechanical sonification of the -ncov coronavirus spike protein through a 
materiomusical approach molecular model of human tropoelastin and implications of associated mutations nanomechanics of functional and pathological amyloid materials sustained release silk fibroin discs: antibody and protein delivery for hiv prevention coronavirus hku and other coronavirus infections in hong kong epidemiology and clinical presentations of the four human coronaviruses e, hku , nl , and oc detected over years using a novel multiplex real-time pcr method sars and mers: recent insights into emerging coronaviruses isolation of a novel coronavirus from a man with pneumonia in saudi arabia mers in south korea and china: a potential outbreak threat? a familial cluster of pneumonia associated with the novel coronavirus indicating person-toperson transmission: a study of a family cluster clinical features of patients infected with novel coronavirus in wuhan covid- map -johns hopkins coronavirus resource center estimates of the severity of coronavirus disease : a model-based analysis pre-fusion structure of a human coronavirus spike protein structural basis for the recognition of the sars-cov- by full-length human ace structural basis of receptor recognition by sars-cov- molecular basis of binding between novel human coronavirus mers-cov and its receptor cd cryo-em structures of mers-cov and sars-cov spike glycoproteins reveal the dynamic receptor binding domains importance of sars-cov spike protein trprich region in viral infectivity cleavage of spike protein of sars coronavirus by protease factor xa is associated with viral infectivity building-block approach for determining low-frequency normal modes of macromolecules global dynamics of proteins: bridging between structure and function analysis of the vibrational and sound spectrum of over , protein structures and application in sonification dynamut: predicting the impact of mutations on protein conformation, flexibility and stability the embl-ebi search and sequence analysis tools apis in cell entry 
mechanisms of sars-cov- dysregulation of type i interferon responses in covid- might sars-cov- have arisen via serial passage through an animal host or cell culture?: a potential explanation for much of the novel coronavirus' distinctive genome ethical and philosophical considerations for gain-of-function policy: the importance of alternate experiments the protein data bank the protein data bank namd: a parallel, object-oriented molecular dynamics program scalable molecular dynamics with namd harmonicity in slow protein dynamics a new approach for determining lowfrequency normal modes in macromolecules building-block approach for determining low-frequency normal modes of macromolecules tracking changes in sars-cov- spike: evidence that d g increases infectivity of the covid- virus covid- and cardiovascular disease: from basic mechanisms to clinical perspectives we acknowledge support from the mit-ibm ai lab, onr (n and n ), afosr (fate muri fa - - - ), nih (u eb ), as well as aro (w nf ). the authors declare no competing interests. key: cord- -d p u authors: abe, kento t.; li, zhijie; samson, reuben; samavarchi-tehrani, payman; valcourt, emelissa j.; wood, heidi; budylowski, patrick; dupuis, alan p.; girardin, roxie c.; rathod, bhavisha; wang, jenny h.; barrios-rodiles, miriam; colwill, karen; mcgeer, allison j.; mubareka, samira; gommerman, jennifer l.; durocher, yves; ostrowski, mario; mcdonough, kathleen a.; drebot, michael a.; drews, steven j.; rini, james m.; gingras, anne-claude title: a simple protein-based surrogate neutralization assay for sars-cov- date: - - journal: jci insight doi: . /jci.insight. sha: doc_id: cord_uid: d p u most of the patients infected with severe acute respiratory syndrome coronavirus (sars-cov- ) mount a humoral immune response to the virus within a few weeks of infection, but the duration of this response and how it correlates with clinical outcomes has not been completely characterized. 
of particular importance is the identification of immune correlates of infection that would support public health decision-making on treatment approaches, vaccination strategies, and convalescent plasma therapy. while elisa-based assays to detect and quantitate antibodies to sars-cov- in patient samples have been developed, the detection of neutralizing antibodies typically requires more demanding cell-based viral assays. here, we present a safe and efficient protein-based assay for the detection of serum and plasma antibodies that block the interaction of the sars-cov- spike protein receptor binding domain (rbd) with its receptor, angiotensin-converting enzyme (ace ). the assay serves as a surrogate neutralization assay and is performed on the same platform and in parallel with an elisa for the detection of antibodies against the rbd, enabling a direct comparison. the results obtained with our assay correlate with those of viral-based assays, a plaque reduction neutralization test (prnt) that uses live sars-cov- virus and a spike pseudotyped viral vector-based assay. the coronavirus s-protein (spike) is responsible for both receptor binding and fusion of the virus and host cell membranes. within the spike protein, the receptor binding domain (rbd) mediates the interaction with the host cell receptor, and sequence/structural variation in the rbd is responsible for the receptor binding specificity shown by those coronaviruses that use host proteins as receptors ( ) . 
sars-cov- , like sars-cov, uses the cell surface carboxypeptidase angiotensin-converting enzyme (ace ) as a receptor for viral entry ( figure a) . the use of a common receptor is consistent with the fact that the viruses share a high degree of sequence similarity and that their rbds are ~ % identical, though the rbd of sars-cov- binds ace with higher affinity than does that of sars-cov ( ) . the spike proteins of both viruses are also both primed by the host protease, tmprss , but unlike sars-cov- , the spike protein of sars-cov does not contain a furin recognition motif that can be cleaved during viral biogenesis ( , ) . the coronavirus spike protein is also a major target of the host immune system, and antibodies directed against it play a central role in host-mediated neutralization ( ) . among neutralizing antibodies, those that block the interaction between viruses and their receptors represent the most common route to neutralization ( ) . for this reason, both the spike protein and the rbd form the basis for most of the sars-cov- vaccines currently in development. 
the detection and study of neutralizing antibody activity following natural infection (or vaccination) can, therefore, support research aimed at the development of novel therapeutics and vaccine candidates. it can also aid in the identification of acceptable donors for convalescent plasma therapy ( ) and, more generally, help establish immune correlates of infection. for sars-cov- , viral neutralization assays are performed using either live virus ( ) or viral vectors pseudotyped with the spike protein ( ) . however, these cell culture-based assays are challenging to implement and time-consuming to run, factors that limit scalability. the conventional plaque reduction neutralization test (prnt) that uses live sars-cov- virus is further complicated by the need for containment level (cl- ) and a specialized laboratory setup. although the pseudotyped viral vector-based assays do not require biosafety level (bsl- ) containment ( ) , they are complicated multistep procedures ( ) . by contrast, antigen-specific antibodies in patient samples can be easily detected and quantitated by elisa ( ) . sars-cov- elisas are performed by immobilizing a recombinantly produced viral antigen (such as the spike trimer or rbd) ( figure b and supplemental figures and ; supplemental material available online with this article; https://doi.org/ . /jci.insight. ds ) (see methods) onto multiwell plastic plates that are then incubated with diluted patient serum or plasma samples. the detection of antibodies that bind to the antigen involves a second incubation with enzyme-conjugated antihuman antibodies, where the enzyme is often horseradish peroxidase (hrp). this enables the detection of a color change when an hrp substrate such as , ′, , ′-tetramethylbenzidine (tmb) is used. in direct binding assays of this type ( figure c ), the presence of patient antibodies against the viral antigen leads to a dose-dependent increase in the signal observed. 
elisa-based profiling has been developed by multiple groups and has been used to measure the kinetics of the antibody response in patient cohorts following sars-cov- infection. in several recent studies, including ours, this has revealed the relative stability in the igg response to the spike and rbd over several months, along with a more transient igm and iga response that wanes as patients convalesce ( ) ( ) ( ) ( ) ( ) . however, the levels of neutralizing antibodies are not typically measured in large cohorts over time (with a few notable exceptions, as seen in refs. , ) , as current assays have relatively low throughput. the relative lack of neutralizing antibody data represents a significant gap in our understanding of the immune response to sars-cov- . here, we describe a modified elisa-type assay that serves as a surrogate neutralization assay. it measures the presence of antibodies capable of blocking the rbd-ace interaction, and like the direct binding elisa, it is easily scaled to allow for the analysis of large patient cohorts over time. we show that the results obtained by this assay correlate with those of both the sars-cov- prnt and a spike pseudotyped viral vector neutralization assay in a cohort of convalescent patients and on purified antibodies. we aimed to develop a simple protein-based assay to monitor the ability of antibodies, present in the serum or plasma of patients, to block the interaction between the rbd and the host receptor ace . to do so, we opted for an elisa-type assay, since such assays are already widely used to detect antibodies that recognize sars-cov- antigens such as the spike trimer and its rbd. as with the standard direct elisa, the antigen (here, the rbd or the spike trimer) is first immobilized on multi-well plates and then incubated with patient plasma or serum ( figure b) . 
however, because we were interested in detecting functional antibodies that can prevent the interaction between the rbd (or spike) and ace , we replaced the hrp-conjugated secondary antibody used in the direct elisa by a detection method involving human ace . in our assay, recombinantly expressed soluble ace bearing a biotinylated c-terminal avitag is added to the antigen-bound plate after the plate has been incubated with the patient plasma or serum (see methods). bound ace is then detected by the addition of streptavidin-poly hrp and its colorimetric substrate tmb. the presence of patient antibodies that can block the rbd-ace interaction leads to a dose-dependent decrease in the signal observed, and as such, we refer to it as a surrogate neutralization elisa (snelisa; figure d ). we explored different versions of the assay: the configuration described above and one involving immobilized ace and soluble biotinylated rbd, a configuration similar to that previously reported ( ) . the assay with immobilized rbd and soluble biotinylated ace was more sensitive than its counterpart (supplemental figure ). moreover, with the rbd immobilized, the same overall protocol and colorimetric detection can be used for both the direct binding elisa and the snelisa, thereby facilitating a direct comparison. although the snelisa worked well with either the rbd or the spike ectodomain trimer immobilized ( figure e and supplemental figure a ), we focused on the rbd, since it is easier to produce and provides a simple one-to-one binding interaction with ace . using a small test set (supplemental figure b) , we first showed that the serum/plasma from positive but not negative control patients inhibited the interaction between ace and the immobilized rbd ( figure f ). the technical reproducibility of the assay was within %- % coefficient of variation (cv). the total time required to perform the assay (once the plates are coated with the antigen) is . 
- hours, and the assay can be performed using the same equipment and biosafety protocols as a standard elisa. using both the surrogate neutralization and direct binding (with a dilution series) elisas, we then profiled a set of serum samples acquired at the canadian blood services as part of a screen for convalescent plasma therapy donors ( table ). with reference to the direct binding results, the snelisa showed that samples with high levels of igg against the rbd were typically the most potent at blocking the rbd-ace interaction (e.g., cbs , which is included as a positive control). conversely, samples lacking detectable rbd-binding antibodies were not able to block the interaction. to more systematically evaluate the relationship between the rbd-binding antibody levels and the ability to block the rbd-ace interaction (as determined by the snelisa), we calculated the auc for both assays and plotted the rbd-binding auc versus the snelisa auc ( figure c and supplemental figure ). the plot showed a clear correlation (r = . ), with the sera containing the highest rbd-binding antibody levels being the most effective at blocking the rbd-ace interaction ( figure c ; compare figure b with figure a ; supplemental table ). nevertheless, there are samples with similar rbd-binding antibody concentrations that differ in their ability to block the rbd-ace interaction (figure d and supplemental figure ). differences in antibody isotype, affinities, and abundance, as well as the rbd epitopes bound, are all factors that could explain these outliers. while it is reasonable to expect that antibodies that block the rbd-ace interaction would be neutralizing, we validated this using cell-based viral infectivity and entry assays. fifty-seven of the samples analyzed by the snelisa were analyzed by prnt, the gold standard in the field. 
prnt is defined as the concentration of patient serum or plasma capable of reducing the formation of viral plaques by %; prnt is the concentration that reduces plaque formation by %. as shown in figure a , most of the samples displaying high values in the direct binding and snelisas were also positive by prnt (and those with low titers were negatives). both elisas also gave an overall agreement with the prnt titers (see supplemental figure , with a coefficient of determination of . ). we also adapted and optimized a spike-pseudotyped lentiviral-based entry assay ( ) , and we reprofiled the neutralization potential of a subset of samples. there was also a high correlation (r = . ) between the snelisa results and the titers obtained with this spike-pseudotyped lentiviral-based entry assay ( figure b and supplemental figure ). taken together, these results indicate that our snelisa is a good surrogate neutralization assay, particularly for distinguishing between samples with high versus low neutralization activity. as such, the assay should be of value in the selection of candidate donors for convalescent plasma therapy and for monitoring immune correlates of patient outcomes. future work will focus on providing a better understanding of the outliers observed across all assays. indeed, rare but potent neutralizing antibodies in patient samples with low pseudovirus neutralization titers have recently been reported ( ) . to assess whether our snelisa might also be of value for screening the neutralization potential of monoclonal antibodies, we tested it using a number of neutralizing and nonneutralizing monoclonal antibodies and compared the results with the results obtained with the pseudotyped lentiviral-based entry assay or cytopathic effect-reduction neutralization assay with sars-cov- . 
the llama vhh monoclonal antibody (expressed as a human fc fusion), previously shown to neutralize in a sars-cov- spike pseudotyped entry assay ( ) , blocked the rbd-ace interaction in our snelisa and viral entry in our spike pseudotyped lentivirus assay; similar results were obtained for the active motif - antibody, which was isolated from a convalescent patient and was shown to be neutralizing ( ) (figure , c and d, and supplemental figure ). in contrast, other antibodies, such as an igg derived from the monoclonal anti-sars cr or a commercial antibody (hc ) from genscript, had a much more moderate effect in the snelisa (supplemental figure ) , and the genscript antibody had no effect in the cytopathic effect-reduction neutralization assay (supplemental figure ). the active motif - antibody was previously shown to be incapable of neutralizing live sars-cov- virus ( , ) . in our assays, it efficiently bound the rbd in the direct elisa but did not block the rbd-ace interaction in the snelisa. the same antibody partially prevented entry in our lentivirus entry assay. taken together, these observations suggest that our snelisa is a good complement to more complex cell-based assays for the discovery and screening of neutralizing monoclonal antibodies. in summary, we have developed a simple and safe snelisa for sars-cov- . it can be readily incorporated into existing testing platforms and may be of particular value in the selection of donors for convalescent plasma therapy and as a means of monitoring the immune response to vaccination. given that neutralizing antibody titres have recently been shown to wane fairly rapidly in some ( ) ( ) ( ) but not all ( , ) studies, the assay may also be useful for broad serosurveillance, especially as it should be more scalable than the approaches requiring viral infection assays. when coupled with epidemiological studies, it might also be used to assess the risk of infection/reinfection. 
we also note that the optimized conditions used here for the direct rbd-binding elisa are similar to those reported in ref. using rbd-expression constructs that have been widely distributed. their rbd can be obtained from bei resources ( ), and we found that it generates similar results when used with our biotinylated ace in the snelisa (supplemental figure ). this should further facilitate the broad implementation of our assay across multiple laboratories. there are limitations to the assay, however, that need to be acknowledged. first, the snelisa is limited to the detection of neutralizing antibodies that function by blocking the interaction between the rbd and ace . while by no means dominant, examples of antibodies that neutralize by other mechanisms are beginning to emerge ( ) ( ) ( ) . the snelisa, in conjunction with a neutralization assay, could be used to identify further such examples. as with those identified in this work, the outliers (e.g., those with high viral neutralization titers but low snelisa levels) provide a starting point for further work aimed at understanding the mechanisms of antibody-mediated neutralization. another limitation of our approach is that the current assay cannot directly map the epitopes targeted by the various antibodies. undoubtedly, the antibodies detected by the snelisa bind to different sites on the rbd, a suggestion supported by the structures of neutralizing antibody fragment antigen binding (fabs) in complex with the sars-cov- rbd. in one example, different neutralizing antibodies that bind to different epitopes on the rbd were found to synergistically mediate viral neutralization ( ) . while, in the current study, we simply wanted to provide evidence of antibodies that could block the rbd-ace interaction, the snelisa could be adapted to provide information on the site of antibody binding. 
as recently shown, a series of structure-guided point mutants in the rbd could be used to infer where on the rbd the antibodies are binding ( ) . this type of approach would likely be more important in the characterization of monoclonal antibodies, such as those presented in figure c , and would set the stage for in-depth biophysical and structural studies. while the direct binding elisa described here employed an anti-igg secondary antibody (the predominant isotype in convalescent serum), we note that the snelisa measures the ability of any antibody isotype (or even antibodies from different species or any other molecule) to block the rbd-ace interaction. in this regard, it is similar to a viral-based neutralization assay. while we have not performed a detailed analysis, we did show that single-point direct binding elisas performed for igm, and to a lesser extent iga, are also correlated with the results obtained by the snelisa (supplemental figure ) . the safety and simplicity of the snelisa should make it a valuable addition to the arsenal of assays for monitoring the immune response to sars-cov- infection. for all elisas, inactivation of potential infectious viruses in plasma or serum was performed by incubation with triton x- to a final concentration of % for hour before use ( ) . for the pseudotyped lentiviral assays, the serum was heat inactivated for hour at °c ( ). the expression plasmid generated is a derivative of those previously reported in our piggybac transposon-based mammalian cell expression system ( ) . two versions of the plasmid were constructed: one contains the cmv promoter (pb-cmv) and the other the tre promoter (pb-tre). the vectors are otherwise identical and can be used to generate stable cell lines for constitutive or inducible protein expression. 
the protein cloning region contains several optional elements separated by restriction sites as follows: an n-terminal human cystatin-s secretion signal, the protein of interest, a foldon trimerization motif ( ), a xhis purification tag, and an avitag biotinylation motif ( ) (supplemental figure ) . a woodchuck hepatitis virus posttranscriptional regulatory element (wpre) follows the orf to facilitate nuclear export of the mrna. a pair of piggybac transposon terminal repeats flank the expression cassette and an attenuated puromycin resistance marker (bioshop canada inc., pur ), thereby allowing for the generation of stable cell lines using the piggybac transposase. the human codon optimized cdna of the sars-cov- spike protein (mc_ ) was purchased from genscript. the human ace cdna was derived from mgc clone . to stabilize the soluble spike ectodomain trimer, regions of the spike protein were mutated. residues - (rrar) were mutated to ssas to remove the furin cleavage site, and residues - (kv) were each mutated to a proline residue to stabilize the prefusion form as previously described ( ) . the soluble spike protein ectodomain construct includes residues - (mfvf...qyik), followed by the foldon trimerization motif, a xhis tag, and an avitag. both the sars-cov- rbd and the human ace constructs are preceded by the human cystatin-s secretion signal and followed by the xhis and avitag. the rbd and ace constructs contain residues - (rfpn...cgpk) and - (stie...pyad), respectively. the cdna of the human cr fab was synthesized by genscript based on its previously reported sequence ( ). the light chain and heavy chains were individually cloned into the pb-tre expression plasmid. for fab production, a xhis tag was added to the c-terminal end of the fab heavy chain. an igg form was generated by fusing the human igg fc coding sequence to the c-terminal end of the fab heavy chain. 
freestyle -f suspension cells (thermo fisher scientific, r ) were grown in shaker flasks ( rpm) in freestyle expression medium (thermo fisher scientific, ) in a humidified °c incubator filled with % (v/v) co . the cell density and viability were monitored by manual counting using a hemocytometer and trypan blue staining. for transfection, cells of > % viability were counted and seeded at a density of approximately × cells/ml into ml freestyle medium supplemented with μg/ml aprotinin (bioshop canada inc., apr ). the pb-cmv plasmid dna ( μg) and fectin ( μl; thermo fisher scientific, ) were each added to separate tubes containing ml of opti-mem medium (thermo fisher scientific, ). the solutions were then mixed and incubated for minutes before being added to the ml cell culture. two days after transfection, the ml culture was expanded into three l shaker flasks each containing ml of culture medium. protein expression was continued for an additional days. the stable cells were scaled up in l shaker flasks containing ml freestyle medium without supplements. when the cell densities reached approximately × cells/ml, μg/ml doxycycline (milliporesigma, d ), and μg/ml aprotinin were added to initiate protein expression. during the expression phase, ml of the medium was removed, and fresh medium added every other day. the harvested expression medium was centrifuged at , g for minutes at °c to remove the cells and debris. for the xhis tagged proteins, the clarified media were passed through an ni-nta column (qiagen, ). for the spike ectodomain, ml of ni-nta resin was used for each liter of medium. for the rbd, ace , and cr fab, ml of ni-nta resin was used for each liter of medium. the ni-nta resin was washed with column volumes of phosphate buffered saline (pbs), followed by - column volumes of pbs containing mm imidazole. the protein was eluted with pbs containing mm imidazole (bio basic, ib ) and . % (v/v) protease inhibitor cocktail (milliporesigma, p- ). 
for the cr antibody, the harvested medium was incubated with rprotein a sepharose ff resin (ge healthcare, ). the resin was then washed with column volumes of pbs, and the antibody was eluted with mm glycine, ph . , containing mm nacl. the acid-eluted antibody was immediately neutralized by the addition of / volume of m tris, ph . . protease inhibitor cocktail was also added to a final concentration of . % (v/v). the approximate purified yields of the various proteins are as follows: rbd, mg/l; spike trimer, mg/l; ace , mg/l; cr fab, mg/l; and cr igg, mg/l. the protein samples were stored in % glycerol at - °c. shortly before use, the glycerol stocks were further purified using size-exclusion chromatography. for the rbd, ace , and cr fab/igg, a superdex increase (ge healthcare, ) column was used. for the spike ectodomain, a superose increase (ge healthcare, ) column was used (supplemental figure ). each biotinylation reaction contained μm biotin, μm atp (milliporesigma, cat # a ), μm mgcl , μg/ml bira (produced from e. coli; a gift from walid houry, university of toronto), . % (v/v) protease inhibitor cocktail, and no more than μm of the protein-avitag substrate. the mixture was incubated at °c for hours, followed by size-exclusion chromatography to remove unreacted biotin (bioshop canada inc., bio ). for the rbd, the degree of biotinylation was assessed using a band-shift assay. a total of μg of the biotinylated rbd was heated to °c for seconds in sds-page loading buffer (containing % sds, mm dtt); after cooling, μl of a mg/ml streptavidin solution was added. the mixture was then analyzed by sds-page to assess the formation of the rbd-streptavidin complex (supplemental figure ) . the llama single domain antibody vhh sequence (pdb entry waq_ ) was obtained from wrapp et al. ( ) . 
a cdna encoding vhh fused to an adcc-attenuated human igg fc domain (hfc x , from patent us a ) was codon optimized for expression in cho cells, synthesized by genscript, and cloned into the ptt plasmid ( ) . the ptt -vhh hfc x plasmid was transiently expressed in cho e cells ( ) using pei max transfection reagent (polysciences) and a slightly modified protocol as described previously ( ) . the cell culture was harvested at day after transfection, centrifuged minutes at g at room temperature, and filter sterilized using a . μm membrane vacuum filter (express plus, milliporesigma). filtered supernatant was loaded on a ml mabselect sure column (ge healthcare) equilibrated in pbs. the column was washed with pbs, and the antibody eluted with mm citrate buffer ph . . the fractions containing the antibody were pooled, and elution buffer was exchanged for pbs using nap- columns (ge healthcare). purified vhh h-fc x in pbs was quantified by absorbance at nm using a nanodrop spectrophotometer (thermo fisher scientific) and the calculated extinction coefficient of the protein. overall volumetric yield after protein a purification was mg/l. the purified protein was analyzed by analytical size-exclusion ultra high-performance liquid chromatography coupled to a mals detector and eluted as a major (> % integrated area) symmetrical peak of kda with less than % aggregates (not shown). an alternative source for rbd was bei resources nr- (contributors f. krammer, f. amanat, s. strohmeier; icahn school of medicine, mount sinai, new york, usa; lot ). commercial antibodies tested also included a human igg chimeric antibody from genscript (sars-cov- spike s antibody, hc ; genscript, a ) and sars-cov- spike antibodies from active motif (am , ; am , ). manual single-point elisas in -well format. 
for the manual single-point elisas in -well format, concentrations and incubation times were optimized to maximize the separation between anti-rbd levels in convalescent plasma or serum from that of pre-covid-era banked serum while maintaining the required levels of antigens as low as possible. a total of μl of serum or plasma was used for the detection of antibodies on -well plates coated with ng/well of recombinant purified rbd. single-point elisas are expressed as ratios to a positive control convalescent plasma sample. multipoint elisas. for the multipoint elisas, the rbd amount was fixed to ng/well to match the design of the snelisa, and -fold serial dilutions of the serum or plasma sample from μl to . μl were employed. both cases. in both cases, the rbd antigen (diluted to μg/ml in pbs) was first adsorbed to -well clear immulon hbx plates (thermo fisher scientific, ) in pbs overnight at °c and then washed times with μl pbs plus . % tween- (pbs-t; milliporesigma). plates were blocked for hour at room temperature with μl % blocker blotto (thermo fisher scientific, ) and washed times with μl pbs-t. in the single-point elisas, plate blocking was performed with % w/v milk powder (bioshop canada inc., alb . , lot h ) in pbs for - hours. patient samples (pretreated with % final triton x- for viral inactivation) diluted in pbs-t containing % w/v milk powder ( : for the single-point elisa) were then added to the plates and incubated for hours at room temperature ( μl total volume); technical duplicates were performed unless otherwise indicated. a chimeric human anti-spike antibody (sars-cov- spike s antibody, hc ; genscript, a ) was added to a set of wells on each plate as a serial dilution ( : , - : , or - . ng per well in steps) to enable cross-plate comparisons. positive (convalescent plasma from a single patient) and negative controls (pre-covid-era banked serum) were also added to each plate, at μl. wells were washed times with μl pbs-t. 
goat anti-human anti-igg (goat anti-human igg fcy hrp, jackson immunoresearch, - - ) at a : , dilution ( . ng/well) in % blotto was added and incubated for hour. wells were washed times with μl pbs-t, and μl of -step ultra tmb-elisa substrate solution (thermo fisher scientific, ) was added for minutes at room temperature. the reaction was quenched with μl stop solution containing . n sulfuric acid (thermo fisher scientific, n ). the plates were read in a spectrophotometer (biotek instruments inc., cytation ) at nm. for all elisa-based assays, raw od values had blank values subtracted before analysis. for the single-point direct binding assay, the average cv across cbs samples is . % (mean) and . % (median) (supplemental table ). for single-point assays, all data were normalized to the positive serum control (single point) on each plate and expressed as a ratio to this control. for the multipoint dose responses, blank-adjusted reads were used. variations. variations to this protocol included the following. (a) replacement of the rbd on the plate by the bei resources nr- . the assay was set up identically to and in parallel with our in-house produced rbd (supplemental figure ). (b) replacement of rbd ( ng) on plate by the spike trimer purified above ( ng) (supplemental figure ) . (c) performing the single-point elisas using an automated platform with chemiluminescent detection for anti-igg, -iga, and -igm, exactly as described in ( ) (supplemental figure ) . our final optimized snelisa used ng immobilized recombinant rbd on -well immulon hbx plates incubated overnight at °c ( μg/ml). all volumes added to the well were μl, unless specified otherwise. plates were washed times with μl pbs-t and blocked for - . hours at room temperature with μl % bsa (bioshop canada inc., ski . , lot h ). after washing as above, a -step, -fold serial dilution series of patient serum or plasma ( . - μl of sample) was incubated for hour. 
the wells were washed as above and incubated with ng biotinylated recombinant ace for hour. after washing as above, the wells were incubated with ng streptavidin-peroxidase polymer (milliporesigma, s ). the resultant signal was developed and quantified with tmb in an identical manner to the direct elisas. due to day-to-day variation in signal, all od values are normalized to the od of the well where no patient serum/antibody was added for each sample. all values are expressed in this ratio space. variations. variations of this protocol included using a different source of rbd (bei resources, nr- ) and using spike trimer as shown above ( ng/well) ( figure c and supplemental figure ). another variation of the assay was to bind nonbiotinylated ace to the plate ( ng) and to use biotinylated rbd ( ng) for detection (supplemental figure ). neutralization assays on the canadian blood services samples used in figure were performed by independent laboratories, the nml of the public health agency of canada, and the wadsworth center, new york state department of health. the cytopathic effect-reduction neutralization assay on the recombinant genscript antibody was performed in toronto. for the prnt assay at nml, sars-cov- (canada/on_on-vido- - / , epi_isl_ ) stocks were titrated ( ) for use in a prnt adapted from a previously described method for sars-cov ( ) . briefly, serological specimens were diluted -fold from : to : in dmem supplemented with % fbs and incubated with pfu of sars-cov- at °c and % co for hour. the sera-virus mixtures were added to -well plates containing vero e cells at % confluence, followed by incubation at °c and % co for hour. after adsorption, a liquid overlay composed of . % carboxymethylcellulose diluted in mem, supplemented with % fbs, l-glutamine, nonessential amino acids, and sodium bicarbonate, was added to each well; the plates were incubated at °c and % co for hours. 
the liquid overlay was removed, and the cells were fixed with % neutral-buffered formalin for hour at room temperature. the monolayers were stained with . % crystal violet for minutes and washed with % ethanol. plaques were enumerated and compared with controls. the highest serum dilution resulting in % and % reduction in plaques compared with controls were defined as the prnt and prnt endpoint titers, respectively. prnt titers ≥ : and prnt titers ≥ : were considered positive. for the prnt assay at wadsworth, the assay for the detection of sars-cov- neutralizing antibodies was a modified version of previously described methods ( ) ( ) ( ) . patient sera and sars-cov- (usa/wa- / , bei resources, nr- ) were diluted in vero e cell culture maintenance medium (emem, % heat-inactivated fbs, u/ml penicillin g, u/ml streptomycin). patient samples were serially diluted : - : and mixed with an equal volume of virus containing pfus. virus and serum mixtures were incubated at °c and % co for hour. following the initial incubation, . ml of each dilution was plated in a single well of a -well plate containing confluent monolayers of vero e cells (atcc, crl- ) and allowed to adsorb for hour at °c and % co . following adsorption, cell cultures were overlaid with . % agar in cell culture medium and returned to the incubator. at days after infection, a second overlay containing . % neutral red was added. monolayers were inspected for days, and plaques were counted. antibody titers were reported as the inverse of the serum dilution resulting in % (prnt ) and % (prnt ) reduction in plaques as compared with the virus inoculum control. for the cytopathic effect-reduction neutralization assay in toronto, μl of . × veroe cells/ml were seeded into a -well flat-bottom plate to adhere overnight. all plasma and serum samples were heat inactivated at °c for minutes. 
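The endpoint-titer definition given above for the PRNT (the highest serum dilution that still reduces plaque counts by a stated percentage relative to controls) can be sketched in a few lines of Python. This is only an illustration: the function name, the dictionary layout, the plaque counts, and the two reduction thresholds used in the usage note are hypothetical and are not taken from the study.

```python
def prnt_endpoint_titer(plaque_counts, control_count, reduction):
    """Return the highest reciprocal dilution meeting the plaque-reduction threshold.

    plaque_counts: dict mapping reciprocal serum dilution -> plaques counted
                   (hypothetical layout, chosen for this sketch).
    control_count: plaques counted in the virus-only control.
    reduction:     required fractional reduction, e.g. 0.5 for a 50% reduction.
    Returns None when no tested dilution reaches the threshold.
    """
    qualifying = [
        dilution
        for dilution, count in plaque_counts.items()
        if 1.0 - count / control_count >= reduction
    ]
    return max(qualifying) if qualifying else None


# Hypothetical plaque counts for one serum sample (not data from the study).
example_counts = {20: 0, 40: 5, 80: 30, 160: 60, 320: 95}
titer_at_half = prnt_endpoint_titer(example_counts, control_count=100, reduction=0.5)
titer_at_ninety = prnt_endpoint_titer(example_counts, control_count=100, reduction=0.9)
```

With these illustrative counts, the sample would be reported with a reciprocal titer of 80 at the lower threshold and 40 at the stricter one; positivity cut-offs would then be applied to those titers as described in the text.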
in a separate -well plate, the serum, plasma, or antibody ( μg/ml) samples were serially diluted -fold times in serum-free dmem starting from a dilution of : to : in a volume of μl. to all wells, μl of sars-cov- sb clone was added, ensuring that each well had a dose of tissue culture infectious dose (tcid). for the cell control, μl of serum-free dmem was added. for the virus control, μl of sars-cov- sb clone was added with a dose of tcid and topped off with μl of serum-free dmem. the plate was incubated for hour at °c, % co with shaking every minutes. after incubation, all the media from the veroe culture were removed, and the full μl of serum/sars-cov- coculture was layered on the cells. the plate was again incubated for hour at °c, % co , with shaking every minutes. after the incubation, the inoculum was removed, and μl of dmem containing % fbs was added. the plate was incubated for days and cytopathic effect was tracked. the assay was established using constructs previously described ( ) (constructs obtained through a gift from jesse bloom and katharine crawford, fred hutchinson cancer research center, seattle, washington, usa, and now available through bei resources) and optimized in-house. major changes to the reported protocol included: (a) use of a second-generation pspax (addgene, ) lentivirus packaging system instead of the third-generation system used by the bloom lab, (b) production of spike pseudotyped virus-like particles (vlps) at °c, (c) a neutralization assay plate layout that increases throughput, (d) adjustments to the luciferase protocol to minimize variability in readings, and (e) use of a cell line that coexpresses ace and tmprss . to generate this cell line, entry vectors for ace and tmprss coding sequences were cloned into plenti cmv puro dest (addgene, ) and plenti cmv hygro dest (addgene, ), respectively. the resulting transfer vectors were used to generate lentivirus via the second-generation pspax and vsv-g (addgene, ). 
hek t cells were transduced with ace lentivirus at an moi < and selected with puromycin ( μg/ml) to generate a stable population. these cells were subsequently transduced with tmprss lentivirus and selected with hygromycin ( μg/ml) in a similar fashion. for vlp generation, hek t cells were transiently cotransfected in a -well-plate format containing ml growth medium ( % fbs, % penicillin/streptomycin [pen/strep] in dmem) with . μg pspax , . μg phage-cmv-luc -ires-zsgreen-w (bei, nr- ; a gift from jesse bloom and katharine crawford; lentiviral backbone plasmid that uses a cmv promoter to express luciferase followed by an ires and zsgreen), and . μg hdm-idtspike-fixk (bei, nr- ; a gift from jesse bloom and katharine crawford; expresses a codon-optimized wuhan-hu- spike under a cmv promoter; genbank, nc_ ) using μl jetprime (polyplus-transfection sa, - ) in μl jetprime buffer. after hours of transfection, the medium was replaced by ml of dmem containing % heat-inactivated fbs and % pen/strep, and the cells were incubated for hours at °c and % co ; they were then transferred to °c and % co for an additional hours. at hours after transfection, the supernatant was collected, spun at g for minutes at room temperature, filtered through a . μm filter, and frozen at - °c. the virus titers were evaluated using hek t-ace /tmprss cells at , cells per well on a poly-l-lysine-coated ( - μg/ml) -well plate using hi media ( % heat-inactivated fbs, % pen/strep), along with a virus dilution resulting in > relative luciferase units (rlu) over control (~ : virus stock dilution). for the neutralization assay, . -fold serial dilutions of the serum samples were incubated with diluted virus at a : ratio for hour at °c before being transferred to plated hek -ace /tmprss cells and incubated for an additional hours at °c and % co . after hours, cells were lysed, and bright-glo luciferase reagent (promega, e ) was added for minutes before reading with a perkinelmer envision instrument. 
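Luciferase readouts from a serial dilution series such as the one above are typically summarized as a half-maximal inhibitory dilution by fitting a "log(inhibitor) versus normalized response, variable slope" curve. The Python sketch below shows that model, Y = 100 / (1 + 10^((logIC50 - X) * HillSlope)), together with a coarse grid-search fit. It is an illustration under stated assumptions, not the fitting routine used in the study (which relied on GraphPad Prism): the grid search stands in for Prism's nonlinear regression, the responses are assumed to be already expressed as a percentage of a no-serum control, and the function names and search ranges are hypothetical.

```python
def variable_slope(log_x, log_ic50, hill_slope):
    """Normalized response (0-100%) for the variable-slope inhibition model.

    A negative hill_slope gives a curve that falls as log_x increases.
    """
    return 100.0 / (1.0 + 10.0 ** ((log_ic50 - log_x) * hill_slope))


def fit_variable_slope(log_x, response, n_grid=200):
    """Coarse grid-search least-squares fit (a stand-in for nonlinear regression).

    log_x:    log10 of the inhibitor concentration or serum dilution.
    response: normalized responses in percent (assumed normalization).
    Returns (log_ic50, hill_slope) with the smallest sum of squared errors.
    """
    lo, hi = min(log_x), max(log_x)
    best = None
    for i in range(n_grid + 1):
        log_ic50 = lo + (hi - lo) * i / n_grid
        for j in range(1, 101):
            hill_slope = -0.05 * j  # hypothetical search range: -0.05 to -5.0
            sse = sum(
                (variable_slope(x, log_ic50, hill_slope) - y) ** 2
                for x, y in zip(log_x, response)
            )
            if best is None or sse < best[0]:
                best = (sse, log_ic50, hill_slope)
    return best[1], best[2]


# Synthetic, noise-free curve generated from known parameters (not study data).
demo_log_x = [-4.0 + 0.5 * k for k in range(9)]
demo_response = [variable_slope(x, -2.0, -1.5) for x in demo_log_x]
fitted_log_ic50, fitted_slope = fit_variable_slope(demo_log_x, demo_response)
```

The half-maximal dilution or concentration is then `10 ** fitted_log_ic50`. Leaving the slope free, rather than fixing it at the standard value, lets steep and shallow curves be described by the same model.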
auc values were tabulated for both the direct binding elisa and the snelisa using r version . . and the r package pracma. for the snelisa, the ratios (normalized values) were used in the auc calculations. to identify outliers, we calculated the distance of each point from the regression line using total least squares and labeled points with distances > . . for the lentiviral pseudotyping assays, the % inhibitory concentration or dilution (ic or id ) was calculated with nonlinear regression (log[inhibitor] versus normalized response - variable slope) using graphpad prism (graphpad software inc.). the "variable slope" option is a parameter selected in graphpad prism for nonlinear regression that does not assume a standard slope of - . with each dose-response curve but, instead, determines the slope of the curve based on the data generated. for the extended direct binding dilution series, titres were calculated by taking the dilution of serum that produced % of the maximum response in the elisa as determined by the nonlinear regression line (sigmoidal, pl, x is log[dilution]) using graphpad prism . the assay reproducibility was estimated across experiments by comparing the auc values for those samples profiled across different batches. cbs (n = ) cv for displacement was . % and for direct binding was . %; cbs (n = ) cv for displacement was . % and for binding was . %; cbs (n = ) cv for displacement was . % and for binding was . %. when applicable, graphical data from experiments with or more replicates are presented as mean ± sem. all samples were collected after research ethics board (reb) review. the elisas were performed at the lunenfeld-tanenbaum research institute with mount sinai hospital (msh; toronto, ontario, canada) reb approval (study no. - -e). external samples were transferred through material transfer agreements. all research was performed in accordance with relevant guidelines and regulations. all participants provided informed consent.
the samples were deidentified before transfer to the assay laboratory. kta and acg designed the snelisa and the direct rbd antibody assay. zl and jmr designed the protein expression, biotinylation, and purification procedures. rs and pst optimized the lentiviral pseudotyping assay. ejv, hw, mad, apd, rcg, kam, pb, and mo developed, performed and analyzed the prnt and/or cytopathic effect-reduction neutralization assays. kta and br performed direct elisa experiments. yd designed the vhh hfc x construct, expression, and purification procedures. jhw and mbr implemented the automated direct binding elisa. kc helped coordinate the project. sjd provided samples, coordinated neutralization testing, and integrated prnt and snelisa data. kta and acg analyzed the snelisa data. jlg, ajm, sm, mo, and sjd contributed essential patient samples. kta, jmr, and acg wrote the manuscript with input from all authors. the order of authors for the co-first author was determined by the contribution of kta in the overall study design, as well as data analysis and manuscript preparation. 
coronavirus spike protein and tropism changes structure, function, and antigenicity of the sars-cov- spike glycoprotein sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor neutralizing antibodies against sars-cov- and other human coronaviruses broadly neutralizing antiviral antibodies deployment of convalescent plasma for the prevention and treatment of covid- two detailed plaque assay protocols for the quantification of infectious sars-cov- protocol and reagents for pseudotyping lentiviral particles with sars-cov- spike protein for neutralization assays pseudotype neutralization assays: from laboratory bench to data analysis a serological assay to detect sars-cov- seroconversion in humans evidence for sustained mucosal and systemic antibody responses to sars-cov- antigens in covid- patients dynamics of neutralizing antibody titers in the months after sars-cov- infection longitudinal evaluation and decline of antibody responses in sars-cov- infection neutralizing and binding antibody kinetics of covid- patients during hospital and convalescent phases sars-cov- infection induces robust, neutralizing antibody responses that are stable for at least months a sars-cov- surrogate virus neutralization test based on antibody-mediated blockage of ace -spike protein-protein interaction convergent antibody responses to sars-cov- in convalescent individuals structural basis for potent neutralization of betacoronaviruses by single-domain camelid antibodies clinical and immunological assessment of asymptomatic sars-cov- infections potent neutralizing antibodies from covid- patients define multiple targets of vulnerability a neutralizing human antibody binds to the n-terminal domain of the spike protein of sars-cov- a human monoclonal antibody blocking sars-cov- infection a noncompeting pair of human neutralizing antibodies block covid- virus binding to its receptor ace . 
science human-igg-neutralizing monoclonal antibodies block the sars-cov- infection evaluation of inactivation methods for severe acute respiratory syndrome coronavirus in noncellular blood products simple piggybac transposon-based mammalian cell expression system for inducible protein production structure of bacteriophage t fibritin: a segmented coiled coil and the role of the c-terminal domain site-specific biotinylation of purified proteins using bira immunogenicity and structures of a rationally designed prefusion mers-cov spike antigen human monoclonal antibody combination against sars coronavirus: synergy and coverage of escape mutants chromosomal transposition of piggybac in mouse embryonic stem cells purification and characterization of a recombinant g-protein-coupled receptor, saccharomyces cerevisiae ste p, transiently expressed in hek ebna cells rapid protein production from stable cho cell pools using plasmid vector and the cumate gene-switch optimization of a high-cell-density polyethylenimine transfection method for rapid protein production in cho-ebna cells assays for the assessment of neutralizing antibody activities against severe acute respiratory syndrome (sars) associated coronavirus (scv) a plaque reduction test for dengue virus neutralizing antibodies serum dilution neutralization test for california group virus identification and serology antigenic relationships between flaviviruses as determined by cross-neutralization tests with polyclonal antisera we thank janet mcmanus at canadian blood services for her technical and logistical expertise and the wadsworth center media and tissue culture core. we thank joan wither for the lupus patient samples, and jesse bloom and katharine crawford for sharing protocols and reagents for the lentiviral s pseudotyping assay. 
key: cord- -uxn olw authors: lu, maolin; uchil, pradeep d.; li, wenwei; zheng, desheng; terry, daniel s.; gorman, jason; shi, wei; zhang, baoshan; zhou, tongqing; ding, shilei; gasser, romain; prévost, jérémie; beaudoin-bussières, guillaume; anand, sai priya; laumaea, annemarie; grover, jonathan r.; liu, lihong; ho, david d.; mascola, john r.; finzi, andrés; kwong, peter d.; blanchard, scott c.; mothes, walther title: real-time conformational dynamics of sars-cov- spikes on virus particles date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: uxn olw sars-cov- spike (s) mediates entry into cells and is critical for vaccine development against covid- . structural studies have revealed distinct conformations of s, but real-time information that connects these structures, is lacking. here we apply single-molecule förster resonance energy transfer (smfret) imaging to observe conformational dynamics of s on virus particles. virus-associated s dynamically samples at least four distinct conformational states. in response to hace , s opens sequentially into the hace -bound s conformation through at least one on-path intermediate. conformational preferences of convalescent plasma and antibodies suggest mechanisms of neutralization involving either competition with hace for binding to rbd or allosteric interference with conformational changes required for entry. our findings inform on mechanisms of s recognition and conformations for immunogen design. against the virus( - ). s is synthesized as a precursor, processed into s and s by furin proteases, and activated for fusion when human angiotensin-converting enzyme (hace ) engages the receptor-binding domain (rbd) and when the n-terminus of s is proteolytically processed ( ) ( ) ( ) . 
structures of soluble ectodomains and native virus particles have revealed distinct conformations of s, including a closed trimer with all rbds oriented downward, trimers with one or two rbds up, and hace -stabilized conformations with up to three rbds oriented up ( ). real-time information that connects these structures, however, has been lacking. smfret is well suited to inform on the conformational dynamics of proteins, reporting domain movements in the millisecond-to-second range, and has previously been applied to study hiv- , influenza a, and ebola spike glycoproteins via measurements of the distance-dependent energy transfer from an excited donor to a nearby acceptor fluorophore in real time ( - ). to probe the dynamics of sars-cov- spikes, we used available high-resolution structures of the sars-cov- s trimer to identify sites for fluorophore pair labeling that have the potential to inform on the distance changes expected to accompany conformational changes between the rbd-down and receptor hace -induced rbd-up trimer structures ( , ) (fig. s ). accordingly, we engineered a and q labeling peptides before and after the receptor-binding motif (rbm) to allow site-specific introduction of donor and acceptor fluorophores at these positions (fig. , a, b, and fig. s ). we optimized retroviral and lentiviral pseudoviral particles carrying the sars-cov- s protein (fig. s ) to test the impact of these peptides on infectivity, and found that they were well tolerated, both individually and in combination (fig. s d ). to measure conformational dynamics of the sars-cov- s trimer on the surface of virus particles, we established two types of particles: lentiviral pseudoparticles carrying s, and coronavirus-like particles generated by expression of s, membrane (m), envelope (e), and nucleocapsid (n) proteins (s-men) ( , ) (fig. , a and b ). s-men particles co-express coronavirus surface proteins m and e.
particle quality and the presence of the corona-like s proteins on both particle surfaces were confirmed by cryo-electron microscopy (fig. , c and d). for smfret, lentivirus particles and s-men particles were generated (see materials and methods) by transfecting hek t cells with an excess of plasmid encoding wild-type s, doped with trace amounts of plasmid expressing labeling peptide-carrying s, to ensure the production of virus particles that contain, on average, only a single engineered s protein. as for analogous investigations of the hiv- envelope protein ( , ), donor (cy b( s)) and acceptor (ld ) fluorophores were enzymatically conjugated to the engineered s proteins presented on the virus particle surface in situ (see materials and methods). a biotinylated lipid was then incorporated into the virus particle membrane to allow immobilization of the particles within passivated, streptavidin-coated microfluidic devices for imaging by total internal reflection microscopy (fig. a). donor fluorophores on single, immobilized virus particles were excited by a single-frequency nm laser, and fluorescence emission from both donor and acceptor fluorophores was recorded at hz (fig. a). from the recorded movies, we computationally extracted hundreds of smfret traces exhibiting anti-correlated donor and acceptor fluorescence intensities, the telltale signature of conformational changes within the s protein on individual virus particles. analyses of smfret data from ligand-free s protein on lentiviral particles revealed that the sars-cov- s protein is dynamic, sampling at least four distinct conformational states. to identify the receptor-bound conformation of the sars-cov- s protein by smfret, we measured the conformational consequences of soluble, monomeric hace binding. addition of the monomeric hace receptor to surface-immobilized virus particles led to increased occupancy of the low-(~ . ) fret s protein conformation (fig.
e) , which was observed at both the single-molecule and population level (fig. f ). similar hace receptor impacts on the sars-cov- s protein were observed in both lentiviral particle and s-men coronavirus-like particle contexts (fig. , e to g). dimeric hace , a more potent ligand (fig. s a )( ), stabilized the low-(~ . ) fret s protein conformation more efficiently (fig. s , b and c), suggesting that the observed low-fret state likely represents the receptor-bound state in which all three rbd domains are oriented upwards (rbd-up conformation). a unique strength of single-molecule imaging is its capacity to reveal directly both the structural and kinetic features that define biological function ( , ) . to extract such information for the sars-cov- s protein, we employed hidden markov modeling (hmm)( ) to idealize individual smfret traces. these data allowed quantitative analyses of the thousands of discrete hace -binding -could be achieved spontaneously. as expected, the binding of the hace receptor modified the dynamic s protein conformational landscape towards the rbd-up conformation (~ . fret), rendering it the most populated ( fig. , b, c, f, g). this change resulted from an increase in the transition rate from the rbd-down conformation (~ . fret) towards the intermediate-(~ . ) fret state and subsequently the rbd-up (~ . fret) conformation, which was also modestly stabilized. the energy barriers for reverse transitions towards the rbd-down conformation (~ . fret), were also elevated, explaining receptor-bound conformation accumulation over time (figs. s ). these analyses lead to a qualitative model for hace activation of the sars-cov- s protein from the ground state to the receptor-bound state through at least one intermediate conformation (fig. h ). the summary of relative state occupancies, transition rates among conformations and errors are listed in tables s and s , respectively. 
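the link between state occupancies, transition rates, and a relative free energy model can be sketched in a few lines of python. the sketch assumes boltzmann-weighted occupancies, so that the free energy of a state relative to a reference state is the negative natural logarithm of their occupancy ratio, in units of k_b t. it also assumes that the two rate constants of a double-exponential dwell-time fit are averaged with their amplitudes as weights. the occupancies, amplitudes, rates, state labels, and function names are hypothetical and are not values from this study.

```python
import math


def relative_free_energies(occupancies, reference):
    """Free energy of each state relative to `reference`, in units of k_B*T."""
    p_ref = occupancies[reference]
    return {state: -math.log(p / p_ref) for state, p in occupancies.items()}


def weighted_average_rate(a1, k1, a2, k2):
    """Amplitude-weighted mean of the two rates from a double-exponential fit."""
    return (a1 * k1 + a2 * k2) / (a1 + a2)


# Hypothetical occupancies for four FRET-defined states (they sum to 1).
occupancies = {"state_1": 0.10, "state_2": 0.50, "state_3": 0.25, "state_4": 0.15}
delta_g = relative_free_energies(occupancies, reference="state_2")

# Hypothetical fit: 75% of dwells decay at 4.0 s^-1 and 25% at 0.8 s^-1.
mean_rate = weighted_average_rate(0.75, 4.0, 0.25, 0.8)
```

under these assumptions the most populated state sits at zero and every less populated state has a positive relative free energy, which gives only the rough scaling between states and says nothing about barrier heights.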
in most cell types, the serine protease tmprss is required for ph-independent sars-cov- entry ( , , ) . in vitro, the effect of tmprss is mimicked by the serine protease trypsin, which has similar cleavage specificity ( , ) . smfret analysis of trypsin-treated s protein on lentiviral particles in the absence of receptor revealed a clear shift towards activation (fig. , a, b). after trypsin treatment, the addition of hace receptor was more effective at stabilizing the s protein in the rbd-up (~ . fret) conformation (fig. , c and d, fig. s ). to further validate this finding, we measured the effects of trypsin pre-treatment in virus-cell and cell-cell fusion assays using split nanoluc system consisting of lgbit and hibit (fig. , e and f, fig. s ). here, membrane fusion restored luciferase activity between lentiviral particles carrying the s protein as well as a vpr-hibit fusion protein with cells expressing the lgbit counterpart fused to a ph domain. this assay revealed fusion to be strictly dependent on the hace receptor and to be stimulated by trypsin treatment (fig. , e and f). nearly identical results were observed for cell-cell fusion between donor cells expressing s and target cells expressing hace ( fig. s ) , confirming the activating role of trypsin treatment. we next explored the suitability of the smfret assay to characterize the conformational consequences of antibody binding to the sars-cov- s protein. multiple studies on antibodies generated from covid- patients have shown that one type of antibody often dominates immune responses( - ). this prompted us to screen plasma from convalescent patients with neutralizing activity that can bind to the s protein on lentiviral particles( ) using a modified virus-capture assay (vca)( ). cross-reactive cr ( ), one of the very first reported antibodies from sars-cov- that also bind to sars-cov- spike rbd domain, served as a good indicator of rbd binding (fig. a) . 
we identified two plasma samples (s and s ) able to specifically bind the rbd, recognize s expressed at the cell surface, and neutralize viral particles (fig. , a to c, and fig. s ). smfret analysis of antibody-bound s revealed that both cr and plasma from patient s stabilized s in the rbd-up (~ . fret) conformation, in a similar fashion to receptor hace (fig. , d and e). these data point to the presence of rbd-directed antibodies in patient s . by contrast, smfret indicated that plasma from patient s contained an activity that stabilized the rbd-down (~ . fret) conformation (fig. f ). plasma s antagonized hace binding, but rbd competition did not affect its recognition of s, suggesting that its neutralization activity does not solely rely on blocking the receptor interface. we then assessed the conformational preference of four rbd-directed antibodies: the potently neutralizing antibodies h , - and - , and the neutralizing nanobody vhh , each of which binds rbd in a different way ( - ). antibody h and nanobody vhh stabilized the s protein in an rbd-up (~ . fret) conformation similar to hace , cr , and s , whereas antibody - shifted the conformational landscape towards the rbd-down (~ . fret) conformation, similar to s (fig. , g to j). the very potent neutralizing antibody - ( ), meanwhile, showed a partial shift to the rbd-up (~ . fret) conformation (fig. s ). the absence or presence of hace did not appear to affect the rbd-up stabilization evidenced for antibodies cr , s , vhh , or h (fig. s ). however, plasma s , and to a lesser extent antibody - , reduced the hace -dependent stabilization of the rbd-up (~ . fret) conformation, suggesting that they may interfere with hace receptor binding via an allosteric mechanism.
these findings indicate that sars-cov- neutralization can be achieved in two ways: ) by antibodies that conformationally mimic hace and directly compete with hace receptor binding, or ) by antibodies that allosterically stabilize the s protein in its rbd-down conformation. the strength of the presented smfret approach lies in its capacity to examine the dynamic properties of the s protein in real time, including: ) the distinct conformational states that it spontaneously transits under physiological conditions; ) the impact of sequence alterations on s protein dynamics; and ) the responses of the s protein to cognate hace receptor and antibody recognition. the present analyses of dynamic s protein molecules provide three lines of evidence indicating that the intermediate-(~ . ) fret state observed represents the rbd-down, ground-state conformation of the s protein, in which all three rbds are oriented towards the viral particle membrane. first, in line with previous electron microscopy (em) investigations ( ), the rbd-down state is the most populated. in further agreement with recent em studies, both the disulfide bridge (s c, d c) ( , ) and antibody - stabilized the s protein in a conformation with all three rbds oriented down. while our smfret observations highlight considerable conformational flexibility in these contexts compared to em of soluble trimers, we attribute these distinctions to a tendency of our analysis approach to over-emphasize dynamic features, while em may over-emphasize static conformations rigidified by cryogenic temperatures that may be more readily identified and classified ( ). multiple lines of evidence also facilitated assignment of the rbd-up (~ . fret) conformation of the s protein, with all three rbds oriented away from the virus particle membrane.
for instance, this conformation was stabilized by soluble monomeric hace receptor, and even further stabilized in the presence of soluble, dimeric hace receptor as well as rbd-targeting antibodies, such as cr , that are known to access their epitopes when the s protein is in an activated, rbd-up conformation ( - ). the structure of the on-path (~ . fret) intermediate observed during s opening is likely similar to the all-down ground state; cryo-em structures of soluble sars-cov- s trimers ( ) that engage one or two hace receptor molecules ( ) reveal that the distance between the two labeling sites increases in the ligand-free protomers adjacent to a protomer engaged to hace (fig. s a ). the structure of the additional, highly compacted s conformation (~ . fret), which is also depopulated by activating ligands, remains unknown. these smfret analyses are in global agreement with the conformational states observed at the single-particle em and cryoet level ( , , , - , - , , ). the observed fret changes are also in good agreement with the expected increase in the distance between the labeling peptide insertion sites that carry the fluorophores in the rbd-down and rbd-up conformations of the s trimer. the capacity to examine the conformational preferences of rbd-directed antibodies to the s protein enabled us to identify conformational signatures of antibodies in patient plasma. this approach identified patients with antibody activities that either mimicked ace (indicating anti-rbd activity) or stabilized the ground state of s, thereby interfering with
data and materials availability: all data are available in the main text or the supplementary materials. the data that support the findings of this study are available from the corresponding authors upon reasonable request. the full source code of spartan, which was used for analysis of smfret data, is publicly available (http://www.scottcblanchardlab.com/software).
some small customized matlab scripts are available upon reasonable request. a full-length wild-type pcmv -sars-cov- spike (s +s )-long (termed pcmv-s; codon-optimized; sino biological, cat # vg -ut) plasmid was used as a template to generate tagged pcmv-s. the translated amino acid sequence of pcmv-s is identical to
each pair of inserted tags did not compromise s-dependent lentivirus infectivity.
infectivity measurements
the infectivity of lentivirus particles carrying sars-cov- spike proteins was
cryo-electron tomography
nm gold tracer was added to the concentrated s-decorated hiv- lentivirus and s-men particles at a : ratio, and µl of the mixture was placed onto freshly glow-discharged holey carbon grids for min. grids were blotted with filter paper and rapidly frozen in liquid ethane using a homemade gravity-driven plunger apparatus. cryo-grids were imaged on a cryo-transmission electron microscope (titan krios, thermo fisher scientific) operated at kv, using a gatan k direct electron detector in counting mode with a ev energy slit. tomographic tilt series between − ° and + ° were collected using serialem ( ) in a dose-symmetric scheme ( ) with increments of °. the nominal magnification was , x, giving a pixel size of . Å on the specimen. the raw images were collected from single-axis tilt series with an accumulative dose of ~ e− per Å . the defocus was - µm, and frames were saved for each tilt angle. frames were motion-corrected using motioncorr ( ) to generate drift-corrected stack files, which were aligned using gold fiducial markers in imod/etomo ( ). tomograms were reconstructed by weighted back projection, and tomographic slices were visualized with imod.
virus particles were labeled through site-specific enzymatic labeling, as previously
the sars-cov- rbd elisa assay used was recently described ( , )
introduction of fluorophores (cy b, green; ld , red) was guided by conformational changes in s induced by binding of the cellular receptor human angiotensin-converting enzyme (hace ) from the "rbd-down" to the "rbd-up" conformation (fig. s ). rbd, receptor-binding domain; ntd, n-terminal domain. structures were adapted from rcsb protein data bank accessions vsb ('down' s /s protomer: s , light cyan; s , dark blue) and vyb/ m j ('up' protomer s /s engaged with hace : hace , magenta).
table s . (h) relative free energy model of conformational landscapes of sars-cov- spikes in response to hace binding. the differences in free energies between states were roughly scaled based upon the relative state occupancies of each state.
(fig. b). fret histograms represent mean ± s.e.m., determined from three randomly assigned populations of fret traces. for state occupancies, see table s .
fret histograms represent mean ± s.e.m., determined from three randomly assigned populations of all fret traces. for evaluated state occupancies, see table s . table s .
table s . relative state occupancy and fitting parameters in each of four fret-defined conformational states of sars-cov- spike protein on the surface of virus particles. the fret efficiency histograms were fitted to the sum of four gaussian distributions (µ, the mean or expectation of the gaussian distribution; σ, s.d. of the gaussian distribution), one for each conformational state. parameters were based upon the observation of original fret efficiency data and were further determined using hidden markov modelling. relative conformational state occupancies of sars-cov- spike protein on viral particles are presented as mean ± s.e.m., determined from three independent measurements. r-squared values were evaluated to indicate the goodness of fit. ligand-free .
% +/- % % +/- % % +/- % % +/- % + hace . % +/- % % +/- % % +/- % % +/- %
table s . transition rates between observed conformational states of sars-cov- spike on virus particles. the survival probability plots (figs. s and s ) were derived from distributions of dwell times for each state-to-state transition determined through hidden markov modeling (hmm). the plots were then fitted by double-exponential distributions, $y(t) = A_1 e^{-k_1 t} + A_2 e^{-k_2 t}$, where $y(t)$ is the probability and $t$ is the dwell time. the presented rates are the weighted averages of the two rates derived from the double-exponential decays. rates are presented in the table as weighted average +/- % confidence intervals.
development of an inactivated vaccine candidate for sars-cov- an mrna vaccine against sars-cov- -preliminary report dna vaccine protection against sars-cov- in rhesus macaques single-shot ad vaccine protects against sars-cov- in rhesus macaques sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor structure, function, and antigenicity of the sars-cov sars-cov- and bat ratg spike glycoprotein structures inform on virus evolution and furin-cleavage effects distinct conformational states of sars-cov- spike protein controlling the sars-cov- spike glycoprotein conformation structure-based design of prefusion-stabilized sars-cov- spikes closing coronavirus spike glycoproteins by structure-guided design. biorxiv structural basis of receptor recognition by sars-cov- structural and functional basis of sars-cov- entry by using human ace structure of the sars-cov- spike receptor-binding domain bound to the ace receptor molecular architecture of the sars-cov- virus. biorxiv structures and distributions of sars-cov- spike proteins on intact virions
key: cord- -qvc fb c authors: chen, long; liu, bo; sun, peng; wang, wenjun; luo, shiqiang; zhang, wenyuan; yang, yuanfan; wang, zihao; lin, jian; chen, peng r.
title: severe acute respiratory syndrome coronavirus‐ spike protein nanogel as a pro‐antigen strategy with enhanced protective immune responses date: - - journal: small doi: . /smll. sha: doc_id: cord_uid: qvc fb c prevention and intervention methods are urgently needed to curb the global pandemic of coronavirus disease‐ caused by severe acute respiratory syndrome coronavirus‐ (sars‐cov‐ ). herein, a general pro‐antigen strategy for subunit vaccine development based on the reversibly formulated receptor binding domain of sars‐cov‐ spike protein (s‐rbd) is reported. since the poor lymph node targeting and uptake of s‐rbd by antigen‐presenting cells prevent effective immune responses, s‐rbd protein is formulated into a reversible nanogel (s‐rbd‐ng), which serves as a pro‐antigen with enhanced lymph node targeting and dendritic cell and macrophage accumulation. synchronized release of s‐rbd monomers from the internalized s‐rbd‐ng pro‐antigen triggers more potent immune responses in vivo. in addition, by optimizing the adjuvant used, the potency of s‐rbd‐ng is further improved, which may provide a generally applicable, safer, and more effective strategy for subunit vaccine development against sars‐cov‐ as well as other viruses. the outbreak of severe acute respiratory syndrome coronavirus- (sars-cov- ) infection has caused a pandemic of coronavirus disease- (covid- ) , posing a great threat to human life globally. [ ] till mid-june of , more than million individuals were tested positive for covid- , with a death toll over worldwide. [ ] early efforts have focused on finding small-molecule drugs such as favipiravir, chloroquine, enzyme (hace ). [ ] the s-rbd of sars-cov- has been used as a candidate subunit vaccine to prevent virus entry into cells. [ ] when formulated with adjuvants, sars-cov- rbd can elicit protective immune responses. 
[ ] since sars-cov- s-rbd possesses % sequence similarity to sars-cov- rbd with an even higher binding affinity to human ace , [ ] it should also be a suitable antigen for subunit vaccine development. nevertheless, the poor pharmacokinetics and low immunogenicity greatly hindered the use of s-rbd for subunit vaccine development. [ ] a critical reason for the low immunogenicity of s-rbd lies in its poor targeting ability to lymph nodes, which is crucial for antigen uptake and processing by dendritic cells (dcs) and macrophages. [ ] as nano particle formulations have been shown to enhance cell permeability and potency of anti-cancer drugs, [ ] we envisioned that formulating s-rbd into redox-responsive nanogels may serve as a pro-antigen with improved lymph node targeting and dc and macrophage accumulation, which can lead to synchronized release of internalized s-rbd monomers with enhanced protective immune responses. herein, we report a pro-antigen strategy based on the reversibly formulated spike protein nanogel (s-rbd-ng) for subunit vaccine development against sars-cov- (scheme ). we started by overexpressing and purifying sars-cov- s-rbd from yeast cells and then verified its integrity via sds-page and western blotting analysis ( figure s , supporting information). since sars-cov- s-rbd contains two n-linked glycosylation sites, the observed mass from lc-ms analysis was heterogeneous but higher than the calculated molecular weight without n-glycosylation ( figure s , supporting information). we further used enzyme-linked immunosorbent assay (elisa) to prove that recombinant human ace can bind to s-rbd expressed in yeast ( figure s , supporting information). the affinity was measured to be ≈ . nm ( figure s , supporting information), which was consistent with the reported value. 
[ ] furthermore, the interaction could be competed by an s-rbd-targeting sars-cov- neutralizing nanobody, which indicated a similar binding pattern for the yeast-expressed s-rbd (figure s , supporting information). [ ] we then used amine-reactive, redox-responsive, reversible chemical crosslinkers to generate protein nanogels (figure a). two crosslinkers with different spacer groups were synthesized (figure b), each containing an internal disulfide bond that is reduced upon uptake by antigen-presenting cells (apcs) to disassemble the ngs back into protein monomers. notably, whereas reduction of crosslinker (cl ) would generate a thiol group on the protein, crosslinker (cl ) would undergo a rearrangement to regenerate the native amine on the protein upon reduction (figure c). s-rbd protein was treated with both crosslinkers at different equivalents, and sds-page analysis showed that ngs formed by cl were generated with slightly higher efficiency than those formed by cl and that all ngs could be reduced to monomers (figure d). the crosslinked bands were further collected and subjected to dynamic light scattering (dls) and transmission electron microscopy (figure e,f). in contrast to the native s-rbd, which had a diameter of ≈ nm as measured by dls (figure s , supporting information), the average diameter increased to ≈ nm upon crosslinking, which confirmed the formation of s-rbd ngs. as the uptake of antigens by apcs (dcs and macrophages) is critical for antigen processing and cross-presentation, we first examined the internalization of s-rbd-ngs by these cells. confocal microscopy was used to quantify the uptake by dc . or raw . cells after incubation with ngs formulated using different s-rbd/crosslinker ratios (figure , figures s and s , supporting information).
compared to the s-rbd monomer, enhanced cellular uptake of s-rbd-ngs prepared with both cl and cl was observed, and quantitative analysis of the imaging data showed an approximately fourfold enhancement of uptake with equivalents of crosslinkers ( figure b ,d). since reduction of cl can regenerate native proteins, we used cl at a : ratio with s-rbd to produce ngs for the following study. scheme . the design of reversibly formulated sars-cov- s-rbd protein nanogel (s-rbd-ng) as a pro-antigen strategy for subunit vaccine development for covid- . s-rbd was formulated with redox-responsive crosslinkers as a pro-antigen with enhanced lymph node targeting and antigen presenting cell (apc) accumulation. synchronized regeneration of s-rbd monomers from the internalized s-rbd-ng pro-antigen triggered more potent immune responses to neutralize sars-cov- . next, we tested whether s-rbd-ng could improve lymph node targeting ability in vivo. cy -labeled s-rbd or s-rbd-ng were administered to c bl/ n mice via intramuscular injection, and inguinal lymph nodes were collected after h ( figure a) . ex vivo imaging showed significantly higher accumulation of s-rbd-ng in lymph nodes compared to s-rbd monomers ( figure b ). further quantitative analysis showed an ≈ . -fold enhanced accumulation of s-rbd-ng compared to s-rbd alone ( figure c ). dcs and macrophages in the inguinal lymph nodes were further analyzed by flow cytometry, and enhanced uptake of s-rbd-ng was observed in these cells ( figure d and figure s , supporting information). the mechanism underlying the enhanced uptake of s-rbd-ng by lymph nodes and apcs remains unclear. 
since the immune system has evolved the ability to capture and process nanosized virus-like particles, [ ] the nanoparticles could be filtered and accumulate in the lymphoid organs (e.g., liver, spleen, and lymph nodes), followed by rapid uptake and phagocytosis by apcs, the major cell types responsible for capturing these nanoparticles in a size-dependent manner. [ ] in addition, the change in surface charge may play an important role in lymph node uptake. [ ] encouraged by the lymph node targeting and dc/macrophage uptake results, we next examined the immunogenicity of s-rbd-ng in vivo. c bl/ n mice were immunized intramuscularly with pbs, s-rbd ( µg per mouse), or s-rbd-ng ( µg per mouse) in the presence or absence of an aluminum hydroxide adjuvant ( µg per mouse), one of the most commonly used adjuvants for vaccine development. mice were further boosted with the same dosage on days and , and sera were collected one week after each immunization (figure a). s-rbd-specific serum igg was detected using elisa, and the titers were calculated. one week after the first immunization, the igg titers were still below our detection limit (lower than the lowest dilution factor , data not shown) for all groups. after the second round of immunization, s-rbd-specific serum igg titers increased to ≈ for s-rbd-ng-treated groups, both in the presence and absence of aluminum hydroxide adjuvant (figure b,c). notably, after the third round of immunization, the titers for the s-rbd-ng-treated group reached ≈ , while the s-rbd monomer-treated group had a titer less than . quantitative comparison showed that s-rbd-ng induced . - and . -fold higher titers than the s-rbd monomer in the absence and presence of aluminum hydroxide adjuvant, respectively. taken together, these results showed that s-rbd-ng possessed higher immunogenicity than s-rbd, and that disassembly of this internalized pro-antigen elicited more potent and rapid immune responses. 
next, we examined whether the toll-like receptor / agonist pam csk , another frequently used adjuvant in vaccine development, could further boost the immunogenicity of s-rbd-ng. [ ] indeed, coadministration of s-rbd-ng with pam csk stimulated similar but more potent immune responses ( figure f,g) . the s-rbd-specific igg titer reached ≈ after the third round of immunization. this suggested that the vaccination titer of s-rbd ngs could be further improved by optimizing the adjuvant used. since blocking the interaction between spike protein and ace is crucial for preventing sars-cov- 's entry into host cells, we investigated whether the sera from the immunized mice were able to inhibit this interaction. a competitive elisa strategy was employed in which the sera were used to compete with hace for binding to the immobilized s-rbd. indeed, the sera from s-rbd-ng-immunized groups (either in the absence or presence of adjuvant) efficiently blocked the s-rbd-hace interaction ( figure s , supporting information), which was consistent with the aforementioned titer measurement. therefore, s-rbd-ng induced the development of specific antibodies that could target and block the interaction between s-rbd and hace . finally, we used the pseudovirus to further test the utility of s-rbd-ng as a pro-antigen for subunit vaccine development for sars-cov- neutralization. the sars-cov- pseudovirus contains the spike protein shell and harbors a luciferase gene as a reporter (termed spike-pv-luc). the cos cell line stably expressing hace (cos -hace ) was generated to mimic human cells. expression of hace was first validated by immunofluorescence ( figure s , supporting information). the neutralization activity of the sera from different immunized mice was assessed by monitoring the transduction efficiency of spike-pv-luc. indeed, the inhibition of spike-pv-luc transduction by immunized sera was consistent with the titer measured ( figure ) . 
in particular, sera from mice immunized with s-rbd-ng and pam csk almost completely inhibited pseudovirus entry under both dilution factors. sera from s-rbd-ng-immunized mice inhibited pseudovirus transduction in a dilution-dependent manner, both in the presence and absence of the aluminum hydroxide adjuvant. in contrast, no inhibition was observed using sera from pbs- or s-rbd-immunized mice. interestingly, when the sera were used at a higher concentration ( -fold dilution), the pseudovirus transduction was enhanced. this may be due to antibody-dependent enhancement, [ ] which indicated that antibodies from sars-cov- infected blood may facilitate virus entry. to further confirm the results, another spike pseudovirus harboring the gfp gene as the reporter (termed spike-pv-gfp) was prepared. inhibition of spike-pv-gfp transduction into cos -hace cells by immunized sera was observed by confocal microscopy imaging (figure b). the sera from mice immunized with s-rbd-ng were found to neutralize the sars-cov- pseudovirus. to show that our pro-antigen strategy can be generally applicable to other viruses, we formulated the recombinant s subunit of sars-cov- as ngs, and the uptake by raw . cells was validated. indeed, the intracellular uptake of the resulting sars-cov-s -ng was greatly enhanced (figure s , supporting information). since many other viruses, such as ebola virus, also depend on envelope-attached glycoproteins for entering host cells, we envision that our pro-antigen strategy may be extended to these viruses for subunit vaccine development. in conclusion, we developed a generally applicable pro-antigen strategy by employing the reversibly formulated s-rbd-ng to enhance the immunogenicity of sars-cov- spike proteins. s-rbd-ng showed improved lymph node targeting and accumulation in apcs, where it can be rapidly converted into s-rbd monomers after internalization, leading to more potent immune responses during in vivo immunization. 
these results demonstrated the advantages of s-rbd-ng over s-rbd monomer for future subunit vaccine development. notably, s-rbd-ng alone was able to induce rapid and potent immune responses, which offers the possibility of developing subunit vaccine without the use of adjuvants. the immunized sera were further shown to block the interaction between the spike protein and hace , which is crucial for virus entry into host cells. finally, in the pseudovirus neutralization assay, sera from s-rbd-ng-immunized groups effectively neutralized the pseudovirus in a concentration-dependent manner. the s-rbd-ngbased pro-antigen strategy within the lymph node niche can elicit more rapid and potent immune responses and may serve as a potential subunit vaccine candidate against sars-cov- . supporting information is available from the wiley online library or from the author. figure . neutralization of sars-cov- spike pseudovirus using immunized mouse sera. a) transduction inhibition of the spike-pv-luc by different sera. spike-pv-luc was pre-incubated with sera from different groups at : or : dilution and then added to cos -hace . transduction efficiency was assessed by luciferase reporter. data are presented as mean ± sem. n = or . b) transduction inhibition of spike-pv-gfp by different sera. spike-pv-gfp was pre-incubated with sera from different groups at : dilution and then added to cos -hace . transduction efficiency was assessed by confocal microscopy imaging. scale bar: µm. 
key: cord- -k svq n authors: pollet, jeroen; chen, wen-hsiang; versteeg, leroy; keegan, brian; zhan, bin; wei, junfei; liu, zhuyun; lee, jungsoon; kundu, rahki; adhikari, rakesh; poveda, cristina; mondragon, maria-jose villar; de araujo leao, ana carolina; rivera, joanne altieri; gillespie, portia m.; strych, ulrich; hotez, peter j.; bottazzi, maria elena title: sars-cov- rbd -n c : a yeast-expressed sars-cov- recombinant receptor-binding domain candidate vaccine stimulates virus neutralizing antibodies and t-cell immunity in mice date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: k svq n there is an urgent need for an accessible and low-cost covid- vaccine suitable for low- and middle-income countries. here we report on the development of a sars-cov- receptor-binding domain (rbd) protein, expressed at high levels in yeast (pichia pastoris), as a suitable vaccine candidate against covid- . after introducing two modifications into the wild-type rbd gene to reduce yeast-derived hyperglycosylation and improve stability during protein expression, we show that the recombinant protein, rbd -n c , is equivalent to the wild-type rbd recombinant protein (rbd -wt) in an in vitro ace- binding assay. immunogenicity studies of rbd -n c and rbd -wt proteins formulated with alhydrogel® were conducted in mice, and, after two doses, both the rbd -wt and rbd -n c vaccines induced high levels of binding igg antibodies. using a sars-cov- pseudovirus, we further showed that sera obtained after a two-dose immunization schedule of the vaccines were sufficient to elicit strong neutralizing antibody titers in the : , to : , range, for both antigens tested. the vaccines induced ifn-γ, il- , and il- secretion, among other cytokines. 
overall, these data suggest that the rbd -n c recombinant protein, produced in yeast, is suitable for further evaluation as a human covid- vaccine, in particular, in an alhydrogel® containing formulation and possibly in combination with other immunostimulants. introduction the number of coronavirus disease (covid- ) cases globally is readily approaching the - million-person mark, with over . million deaths. in response to the pandemic, an international enterprise to develop effective and safe vaccines is underway. there are many ways to categorize the more than potential covid- vaccine candidates , but one approach is to divide them as those employing new technologies for production, but that have not yet been licensed for use, versus terms of production, scale-up, potential efficacy and safety, and delivery. we have previously reported on recombinant protein-based coronavirus vaccine candidates, formulated with alhydrogel ® to prevent severe acute respiratory syndrome (sars) - and middle east respiratory syndrome (mers) . in both cases, the receptor-binding domain (rbd) of the sars or mers spike proteins was used as the target vaccine antigen. in a mouse model, the sars- cov rbd -n /alhydrogel ® vaccine induced high titers of virus-neutralizing antibodies and protective immunity against a mouse-adapted sars-cov virus challenge. it was also found to minimize or prevent eosinophilic immune enhancement compared to the full spike protein . the rbd of sars-cov- has likewise attracted interest from several groups now entering clinical trials with rbd-based vaccines , - . our approach was to apply the lessons learned from the development of the sars-cov vaccine candidate and accelerate the covid- vaccine induction temperature was set to °c and the ph to . and, the methanol feed rate was between - ml/l/hr. the fermentation supernatant (fs) was filtered ( . m pes filter) and stored at - °c before purification. 
a hexahistidine-tagged sars-cov- rbd -wt was purified from fermentation supernatant (fs) by immobilized metal affinity chromatography followed by size exclusion chromatography (sec). the fs was concentrated and buffer exchanged to buffer a ( mm tris-hcl ph . and . m nacl) using a pellicon cassette with a kda mwco membrane. to evaluate the size of rbd -wt and rbd -n c , μg of these two proteins were loaded onto a - % tris-glycine gel under non-reduced and reduced conditions. these two proteins were also treated with pngase-f (neb, ipswich, ma, usa) under the reduced condition to remove n-glycans and loaded on the gel to assess the impact of the glycans on the protein size. gels were stained using coomassie blue and analyzed using a bio-rad g densitometer with image alhydrogel ® formulations were centrifuged at , x g for min, and the supernatant was removed. the protein in the supernatant fraction and the pellet fraction was quantified using a micro bca assay (thermofisher, waltham, ma, usa). for the ace- binding study, the alhydrogel ® -rbd vaccine formulations were blocked overnight with . % bsa. after hace- -fc (lakepharma, san carlos, ca, usa) was added, the samples were incubated for hours at rt. after incubation, the alhydrogel ® was spun down at , x g for washed once with µl pbst using a biotek ts plate washer and diluted mouse serum samples were added to the plate in duplicate, µl/well. as negative controls, pooled naïve mouse serum ( : diluted) and blanks ( . % bsa pbst) were added as well. plates were incubated for hours at room temperature before being washed four times with pbst. subsequently, : , diluted goat anti-mouse igg hrp antibody ( µl/well) was added in . % bsa in pbst. plates were incubated hour at room temperature before being washed five times with pbst, followed by the addition of µl/well tmb substrate. plates were incubated for min at room temperature while protected from light. 
after incubation, the reaction was stopped by adding µl/well m hcl. the absorbance at a wavelength of nm was measured using a biotek epoch spectrophotometer. duplicate values of raw data from the od were averaged. the titer cutoff value was calculated using the following formula: titer cutoff = x average of negative control + x standard deviation of the negative control. for each sample, the titer was determined as the lowest dilution of each mouse sample with an average od value above the titer cutoff. when a serum sample did not show any signal at all and a titer could not be calculated, an arbitrary baseline titer value of was assigned to that sample (baseline). sample/ rlu of negative control) x . serum from vaccinated mice was also characterized by the ic -value, defined as the serum dilution at which the virus infection was reduced to % compared with the negative control (virus + cells). when a serum sample did not neutralize % of the virus when added at a : dilution, the ic titer could not be calculated and an arbitrary baseline titer value of was assigned to that sample (baseline). as a control, human convalescent sera for sars- for the re-stimulation assays, splenocyte suspensions were diluted to x live cells/ml in a -ml deep-well dilution plate and µl of each sample was seeded in two -well tissue-culture-treated plates. splenocytes were re-stimulated with µg/ml rbd -wt, ng/ml pma + µg/ml ionomycin, or medium only (unstimulated). for the flow cytometry plate, the pma/i was not added until the next day. µl ( x concentration) of each stimulant was mixed with the µl splenocyte suspension in the designated wells. after all the wells were prepared, the plates were incubated at °c % co . one plate was used for the cytokine release assay, while the other plate was used for flow cytometry. for flow cytometry, another plate was prepared with splenocytes, which would later be used as fluorescence minus one controls (fmos). 
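The ELISA titer rule described above (a cutoff derived from the negative-control wells, then the most dilute serum step whose averaged duplicate OD still exceeds that cutoff) can be sketched as follows. This is only an illustration under stated assumptions: the two multipliers in the cutoff formula and the baseline titer are not legible in the text, so they are left as required parameters; the function names are hypothetical; the sample standard deviation is assumed; and "lowest dilution" is read here as the highest dilution factor that remains above the cutoff.

```python
from statistics import mean, stdev


def titer_cutoff(negative_ods, mean_factor, sd_factor):
    """Cutoff = mean_factor * mean(negatives) + sd_factor * SD(negatives).

    mean_factor and sd_factor are the multipliers of the formula in the text;
    their values are not given here, so the caller must supply them.
    The sample standard deviation (n - 1) is an assumption.
    """
    return mean_factor * mean(negative_ods) + sd_factor * stdev(negative_ods)


def endpoint_titer(duplicate_ods_by_dilution, cutoff, baseline):
    """Return the highest dilution factor whose averaged duplicate OD exceeds the cutoff.

    duplicate_ods_by_dilution maps a dilution factor (e.g. 200 for 1:200)
    to the pair of duplicate OD readings at that dilution.
    If no dilution is above the cutoff, the arbitrary baseline value is returned.
    """
    positive = [
        dilution
        for dilution, ods in duplicate_ods_by_dilution.items()
        if mean(ods) > cutoff
    ]
    return max(positive) if positive else baseline


if __name__ == "__main__":
    # All numbers below are made up for illustration only.
    negatives = [0.05, 0.07, 0.06]
    cutoff = titer_cutoff(negatives, mean_factor=3, sd_factor=3)
    serum = {
        200: (1.80, 1.90),
        1600: (0.90, 1.00),
        12800: (0.30, 0.26),
        102400: (0.08, 0.10),
    }
    print(round(cutoff, 3), endpoint_titer(serum, cutoff, baseline=1))
```

Keeping the multipliers and the baseline as explicit arguments avoids baking in values that the text does not state.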
after hours in the incubator, splenocytes were briefly mixed by pipetting. then plates were centrifuged for min at x g at rt. without disturbing the pellet, µl supernatant was transferred to two skirted pcr plates and frozen at - °c until use. for the in vitro cytokine release assay, splenocytes were seeded in a -well culture plate at x live cells in µl crpmi. splenocytes were then (re-)stimulated with either µg/ml rbd -wt protein, µg/ml rbd -n c protein, pma/ionomycin (positive control), or nothing (negative control) for hours at °c % co . after incubation, -well plates were centrifuged to pellet the splenocytes and the supernatant was transferred to a new -well plate. the supernatant was stored at - °c until assayed. a milliplex mouse th luminex kit (md millipore) with analytes il- β, il- , il- , il- , il- , il- (p ), il- , il- a, il- , ifn-γ, and tnf-α was used to quantify the cytokines secreted in the supernatant by the re-stimulated splenocytes. a protocol based on the manufacturer's recommendations was used, adjusted to use less sample and kit materials. the readout was performed using a magpix luminex instrument. raw data were analyzed using bio-plex manager software, and further analysis was done with excel and prism. surface staining and intracellular cytokine staining followed by flow cytometry were performed to measure the amount of activated (cd =) cd + and cd + t cells producing ifn-γ, il- , tnf-α, and il- upon re-stimulation with s rbd wt. five hours before the end of the -hour re-stimulation incubation, brefeldin a was added to block cytokine secretion. pma/i was also added to designated wells as a positive control. after the incubation, splenocytes were stained for the relevant markers. a viability dye and an fc block were also used to exclude dead cells from the analysis and to minimize non-specific staining, respectively. 
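The pseudovirus readout described in the methods (luciferase signal of a serum-treated sample relative to the virus-plus-cells negative control, then the serum dilution at which infection falls to the stated fraction) can be illustrated with a short sketch. Several things here are assumptions rather than the authors' procedure: the function names are hypothetical, relative infectivity is expressed as a percentage, the threshold and baseline are left as parameters because their values are not legible in the text, and simple log-linear interpolation between neighboring dilutions stands in for whatever curve fitting was actually used.

```python
import math


def percent_infectivity(rlu_sample, rlu_negative_control):
    """Infectivity of a serum-treated sample relative to virus + cells, in percent."""
    return rlu_sample / rlu_negative_control * 100.0


def ic_titer(infectivity_by_dilution, threshold, baseline):
    """Estimate the dilution factor at which infectivity crosses `threshold` percent.

    infectivity_by_dilution maps dilution factor -> percent infectivity.
    If the most concentrated serum already fails to reach the threshold,
    the arbitrary baseline titer is returned.
    """
    points = sorted(infectivity_by_dilution.items())  # ascending dilution factor
    if points[0][1] > threshold:
        return baseline
    for (d_lo, y_lo), (d_hi, y_hi) in zip(points, points[1:]):
        if y_lo <= threshold < y_hi:
            # Interpolate linearly in infectivity, logarithmically in dilution.
            frac = (threshold - y_lo) / (y_hi - y_lo)
            log_d = math.log10(d_lo) + frac * (math.log10(d_hi) - math.log10(d_lo))
            return 10 ** log_d
    # Still neutralizing at the highest dilution tested: report that dilution.
    return points[-1][0]


if __name__ == "__main__":
    # Made-up values for illustration only.
    curve = {10: 5.0, 100: 20.0, 1000: 80.0, 10000: 100.0}
    print(round(ic_titer(curve, threshold=50.0, baseline=10), 1))
```

A four-parameter logistic fit would normally give a smoother estimate; the interpolation is used here only to keep the example free of external dependencies.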
results here we report on the expression of a modified, recombinant rbd of the sars-cov- spike protein using the yeast (p. pastoris) expression system. the candidate antigen selection, modifications, and production processes were based on eight years of process development, manufacture, and preclinical prior experience with a sars-cov recombinant protein-based receptor-binding domain (rbd) - . the rbds of the sars-cov- and sars-cov share significant amino acid sequence similarity (> % identity, > % homology) and both use the human angiotensin-converting enzyme (ace ) receptor for cell entry , . process development using the same procedures and strategies used for the production, scale-up, and manufacture of the sars-cov recombinant protein allowed for a rapid acceleration in the development of a scalable and reproducible production process for the sars-cov- rbd -n c protein, suitable for its technological transfer to a manufacturer. we found that the modifications used to minimize yeast-derived hyperglycosylation and optimize the yield, purity, and stability of the sars-cov rbd -n protein were also relevant to the sars-cov- rbd expression and production process. the modified sars-cov- antigen, rbd -n c , when formulated on alhydrogel ® , was shown to induce virus-neutralizing antibodies in mice, equivalent to those levels elicited by the wild-type (rbd -wt) recombinant protein counterpart. the wild-type sars-cov- rbd amino acid sequence comprises residues - of the spike (s) protein (genbank: qhd . ) of the wuhan-hu- isolate (genbank: mn . ) (figure ) . in the rbd- -wt construct, the gene fragment was expressed in p. pastoris. after fermentation at the l scale, the hexahistidine-tagged protein was purified by immobilized metal affinity chromatography, followed by size-exclusion chromatography. 
we observed glycosylation and aggregation during these initial expression and purification studies, and therefore, similar to our previous strategy , we generated a modified construct, the rbd -n c , by deleting the n residue and mutating the c residue to alanine. the additional mutation of c to a was done because we observed that in the wild-type sequence nine cysteine residues likely would form four disulfide bonds. therefore, the c residue was likely available for intermolecular cross-linking, leading to aggregation. as a result, in the rbd -n c construct, and based on the modifications, the pichia-derived hyperglycosylation, as well as aggregation via intermolecular disulfide bridging, were greatly reduced. we note that the deleted and mutated residues are structurally far from the immunogenic epitopes and specifically the receptor-binding motif (rbm) of the rbd (figure ) when mixing µg of either rbd -wt or rbd -n c proteins to µg of alhydrogel ® , we observed that > % of the proteins bind to alhydrogel ® after min of incubation. only when the alhydrogel ® was reduced to less than µg (alhydrogel ® /rbd ratio < ), the alhydrogel ® surface was saturated, and protein started to be detected in the supernatant (figure a) . it is known that unbound protein may impact the immunogenicity of the vaccine formulation, therefore we proceeded to only evaluate formulations with alhydrogel ® /rbd ratios higher than . figure b shows that hace- -fc, a recombinant version of the human receptor used by the virus to enter the host cells, can bind with the rbd proteins that are adsorbed on the surface of the alhydrogel ® . this demonstrates that bound rbd proteins are structurally and possibly functionally active and that after adsorption the protein does not undergo any significant conformational changes that could result in the loss of possible key epitopes around the receptor-binding motif (rbm). 
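The adsorption result above rests on a simple mass balance: protein recovered in the pellet is bound to alhydrogel, protein left in the supernatant is unbound. A minimal sketch of that calculation and of the alhydrogel/rbd mass ratio follows; the function names and all numeric inputs are hypothetical and are not taken from the study.

```python
def percent_adsorbed(pellet_ug, supernatant_ug):
    """Share of total recovered protein found in the alhydrogel pellet, in percent."""
    total = pellet_ug + supernatant_ug
    if total <= 0:
        raise ValueError("total recovered protein must be positive")
    return 100.0 * pellet_ug / total


def adjuvant_to_antigen_ratio(alhydrogel_ug, rbd_ug):
    """Mass ratio of alhydrogel to rbd protein in a formulation."""
    if rbd_ug <= 0:
        raise ValueError("rbd mass must be positive")
    return alhydrogel_ug / rbd_ug


if __name__ == "__main__":
    # Illustrative masses in micrograms, as would come from a protein assay
    # of the pellet and supernatant fractions.
    print(percent_adsorbed(pellet_ug=24.0, supernatant_ug=1.0))
    print(adjuvant_to_antigen_ratio(alhydrogel_ug=500.0, rbd_ug=25.0))
```

Protein appearing in the supernatant as the ratio drops is the sign that the adjuvant surface is saturated.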
we saw no statistical differences between the binding of hace- -fc to rbd -wt (red, figure b ) or rbd -n c (green, figure b ) proteins, based on an unpaired t-test (p= . ). likewise, we saw no relation between the amount of alhydrogel ® to which the rbd was bound and the interaction with hace- -fc, indicating that the surface density of the rbd proteins on the alhydrogel ® plays no role in the presentation of ace binding sites. alhydrogel ® , produced a lower igg response, albeit slightly higher than the negative control that had been immunized with g alhydrogel ® alone (figure b, supplemental table ) . importantly, based on a mann-whitney test, we determined that there was no statistical difference between the groups vaccinated with the modified and the wild-type version of the rbd protein (p= . ). the average neutralizing antibody titers observed on day (ic range: . x to . x , supplemental (figure c) . on day , days after receiving the boost vaccination, half of the mice in each group (n= ), those with the highest igg titers, were sacrificed to determine the total igg, the igg subtypes, and the neutralizing antibody titers. as we observed on day , all animals that had received the vaccine produced strong antibody titers, with the groups receiving > g alhydrogel ® eliciting a higher titer than those that received only g of alhydrogel ® , albeit no statistical significance was detected (figures b) . for all animals, as typical for vaccine formulations containing aluminum, the igg a:igg titer ratio was < . (supplemental figure ) . in the pseudovirus neutralization assay for the day samples (figure c) , all vaccines containing > g alhydrogel ® elicited ic titers that, on average, were several-fold higher than on day (ic range: . x to . x , supplemental table ). there again was no difference between the rbd -wt and rbd-n c vaccines. on day , all remaining animals were sacrificed. 
in contrast to the animals studied on days and , these animals had received a second boost vaccination. all vaccinated mice, including those immunized with the protein adsorbed to g alhydrogel ® , mounted a robust immune response and achieved high average igg titers. the total igg titers in the mice sacrificed on day had increased after the third vaccination, compared to the titers seen on day . likewise, we observed a corresponding increase in the average ic values (ic range: . x to . x , supplemental table ) for all animals, including those immunized with the protein adsorbed to g alhydrogel ® . interestingly, for this time point, the cohort receiving g rbd -n c with g alhydrogel ® appeared to show higher neutralizing antibody titers than the corresponding for all samples, we employed flow cytometry to quantify intracellular cytokines in cd + and cd + cells after restimulation (figure a). on day , high percentages of cd + -il- and, to a slightly lesser extent, cd + -tnf producing cells were detected. conversely, as expected for an alhydrogel ® -adjuvanted vaccine, low levels of il- producing cd + cells were seen. in a cytokine release assay, strong ifn-γ, il- , and il- secretion was observed independent of whether the animals had received two or three immunizations, whereas low amounts of secreted th -typical cytokines such as il- or il- were seen (figure b). cytokine concentrations of non-stimulated controls were subtracted from re-stimulated samples. discussion here we report on a yeast-expressed sars-cov- rbd -n c protein and its potential as a vaccine candidate antigen for preventing covid- . building on extensive prior experience developing vaccines against sars-cov and mers-cov - , we initially selected and compared the sars-cov- rbd -wt and the sars-cov- rbd -n c proteins for their potential to induce high titers of virus-neutralizing antibodies, t-cell responses, and protective immunity. 
previously we observed that the sars-cov rbd -n antigen, formulated with alhydrogel ® , elicited high levels of neutralizing antibodies without evidence of eosinophilic immune enhancement. that rbd-based vaccine was even superior to the full-length spike protein in inducing specific antibodies and fully protected mice from sars-cov infection while preventing eosinophilic pulmonary infiltrates upon challenge . in this work, using the sars-cov- rbd protein analog, we observed that, just as in the case of the sars-cov rbd antigen, the deletion of the n-terminal asparagine residue reduced hyperglycosylation, thus allowing for easier purification of the antigen obtained from the yeast expression system. moreover, mutagenesis of a free cysteine residue further improved protein production by reducing aggregation. based on the predicted structure of the rbd, no impact on the functionality of the rbd -n c antigen was expected, and using an ace- in vitro binding assay we indeed showed similarity to the rbd -wt antigen. in addition, we showed that, in mice, the modified rbd -n c antigen triggered an immune response equivalent to that of the rbd -wt protein when both proteins were adjuvanted with alhydrogel ® . similar to our previous findings with the sars-cov rbd antigen , we show that the rbd -n c protein, when formulated with alhydrogel ® , elicits a robust neutralizing antibody response with ic values up to . x in mice, as well as an expected t-cell immunological profile. some of the titers of virus-neutralizing antibodies exceed the titer, . x , measured in-house with human convalescent serum research reagent for sars-cov- (nibsc / , national institute for biological standards and control, uk). 
in a mouse virus challenge model for the sars cov rbd recombinant protein vaccine, we found that alhydrogel ® formulations induced high levels of protective immunity but did not stimulate eosinophilic immune enhancement, suggesting that alhydrogel ® may even reduce immune the selection of the p. pastoris expression platform for the production of the rbd antigen was motivated by the intent to develop a low-cost production process that could easily be transferred to manufacturers in lmics. currently, there are several types of covid- vaccine candidates in advanced clinical trials , - . the focus of some of the initiatives behind these vaccines is to provide vaccines for the developed world that might struggle to be successful without advanced infrastructure. being able to match the existing experience in lmics with the production of other biologics in yeast increases the probability of successful technology transfer . for example, currently, the recombinant hepatitis b vaccine is produced in yeast by several members of the development country vaccine manufacturers network (dcvmn), and we foresee that, given the existing infrastructure and expertise, those facilities could be repurposed to produce a yeast-produced covid- vaccine . recently, the research cell bank and production process for the rbd -n c antigen was technologically transferred to a vaccine manufacturer in india and produced under cgmp conditions with the intent to enter into clinical development. in addition, preclinical studies using the rbd -wt and rbd -n c antigens are ongoing to further optimize and evaluate other novel formulations, including a challenge study in a non-human primate model. 
the authors declare that baylor college of medicine recently licensed the rbd -n c technology to an indian manufacturer for further development. the research conducted in this paper was performed in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. key: cord- - rr j tk authors: du, lanying; tai, wanbo; yang, yang; zhao, guangyu; zhu, qing; sun, shihui; liu, chang; tao, xinrong; tseng, chien-te k.; perlman, stanley; jiang, shibo; zhou, yusen; li, fang title: introduction of neutralizing immunogenicity index to the rational design of mers coronavirus subunit vaccines date: - - journal: nat commun doi: . /ncomms sha: doc_id: cord_uid: rr j tk viral subunit vaccines often contain immunodominant non-neutralizing epitopes that divert host immune responses. these epitopes should be eliminated in vaccine design, but there is no reliable method for evaluating an epitope's capacity to elicit neutralizing immune responses. here we introduce a new concept ‘neutralizing immunogenicity index' (nii) to evaluate an epitope's neutralizing immunogenicity. to determine the nii, we mask the epitope with a glycan probe and then assess the epitope's contribution to the vaccine's overall neutralizing immunogenicity. 
as proof-of-concept, we measure the nii for different epitopes on an immunogen comprising the receptor-binding domain from mers coronavirus (mers-cov). further, we design a variant form of this vaccine by masking an epitope that has a negative nii score. this engineered vaccine demonstrates significantly enhanced efficacy in protecting transgenic mice from lethal mers-cov challenge. our study may guide the rational design of highly effective subunit vaccines to combat mers-cov and other life-threatening viruses. a major goal of viral subunit vaccine development is to rationally design immunogens that can elicit strong neutralizing immune responses in hosts. the receptor-binding domains (rbds) of virus surface spike proteins are the prime candidates for subunit vaccine design because they contain epitopes that can trigger strong immune responses. in addition, viral rbds play essential roles in viral infection cycles by binding to their host receptor for viral attachment. thus, part of the host immune responses elicited by viral rbds can target the receptor-binding region and thereby neutralize viral entry into host cells. however, two problems potentially hinder the development of viral rbds as subunit vaccines. first, viruses can evade the host immune responses elicited by their own spikes or rbd-based vaccines. one immune evasion mechanism of viruses is to use immunodominant non-neutralizing epitopes on their rbds to divert host immune responses, which has been thoroughly illustrated in the case of the hiv receptor-binding subunit gp . second, when taken out of the context of the full-length spike proteins, recombinant viral rbd vaccines expose large areas of previously buried surfaces that likely contain immunodominant non-neutralizing epitopes.
whether an outcome of viral evolution or vaccine design, these immunodominant non-neutralizing epitopes on viral rbds can outcompete other epitopes in triggering host immune responses, so that the resulting immune responses target these non-neutralizing epitopes while neglecting neutralizing epitopes on viral rbds. rational design of viral subunit vaccines aims to focus the immune responses on neutralizing epitopes through masking or deletion of immunodominant non-neutralizing epitopes. a critical gap in subunit vaccine design is the lack of an effective way to evaluate an epitope's neutralizing immunogenicity (that is, its capacity to elicit neutralizing immune responses). there have been extensive efforts to predict epitopes' immunogenicity based on the physical and chemical properties of the epitopes. however, these methods are not designed to predict epitopes' 'neutralizing' immunogenicity, which holds the key for subunit vaccine design. although some experimental methods are available to measure the neutralizing immunogenicity of linear epitopes by taking linear peptides out of the context of proteins, these methods do not work for conformational epitopes, which are prevalent on rbd-based viral vaccines. finding a way to measure the neutralizing immunogenicity of different conformational epitopes on viral rbds will tremendously help rational design of viral subunit vaccines. rbd-based coronavirus vaccines have been extensively pursued due to the threat that coronaviruses pose to human health. coronaviruses are enveloped and positive-stranded rna viruses. in - , sars coronavirus (sars-cov) infected over , people with a ~ % fatality rate. since , mers coronavirus (mers-cov) has infected about people with a ~ % fatality rate. the rbds from sars-cov and mers-cov both contain a core structure and a receptor-binding motif (rbm).
their core structures are highly similar, but their rbms are markedly different [ ] [ ] [ ] [ ] , leading to different receptor specificity: sars-cov recognizes angiotensin-converting enzyme (ace ), whereas mers-cov recognizes dipeptidyl peptidase (dpp ) , , . both sars-cov and mers-cov rbds are capable of eliciting strong neutralizing antibody responses , [ ] [ ] [ ] [ ] . on the one hand, because of the enriched neutralizing epitopes in their rbm and their high-yield expression as recombinant proteins, coronavirus rbds are promising subunit vaccine candidates. moreover, because of their relatively simple structures compared with the intact spike proteins, coronavirus rbds provide an excellent model system for structure-based subunit vaccine design. on the other hand, recently determined cryo-em structures of coronavirus spike proteins revealed that whereas the rbm of coronavirus rbds is accessible, large surface areas of the rbd core structure are buried in the full-length spike proteins (supplementary fig. ) , . thus, when these previously buried areas on the surface of the rbd core become exposed in recombinant rbd vaccines, they likely contain immunodominant non-neutralizing epitopes that divert host immune responses. therefore, coronavirus rbds both hold promises and present challenges for vaccine development. it is critical to evaluate the neutralizing immunogenicity of different epitopes on coronavirus rbds, such that immunodominant neutralizing and non-neutralizing epitopes can be preserved and eliminated, respectively. in this study we introduce a novel concept 'neutralizing immunogenicity index' (nii) to evaluate the neutralizing immunogenicity of different epitopes on viral subunit vaccines. as proof-of-concept, we used nii as a tool to identify epitopes with different neutralizing immunogenicity on a mers-cov-rbdbased vaccine. 
furthermore, we successfully applied this tool and significantly enhanced the efficacy of the mers-cov rbd vaccine in protecting human-dpp -transgenic mice from lethal mers-cov challenge. our study fills in a critical gap in subunit vaccine design, and can facilitate rational design of subunit vaccines against mers-cov and other life-threatening viruses. to evaluate the neutralizing immunogenicity of a specific epitope on viral rbd vaccines, we can either delete or mask the epitope and then measure the corresponding changes in the vaccine's capacity to elicit neutralizing immune responses. alanine scanning of vaccine-surface residues likely leads to changes in the vaccine's overall immunogenicity that are too subtle to be measurable using currently available experimental methods, while deletion of a whole epitope may disturb the tertiary structure of the viral rbd. instead, in this study we chose to mask the epitope of interest using a host-cell-derived glycan probe. this approach is effective and convenient because the glycan probe can impose steric interference for the access of antibodies and immune cells to the epitope, and also because the glycan probe is unlikely to interfere with the folding and solubility of the rbd. to place the glycan probe on an epitope, we introduced the n-linked glycosylation motif, asparagine-x-threonine (where x is any amino acid other than proline) , onto different epitopes on viral rbd vaccines using site-directed mutagenesis. as proof-of-concept, we chose to study several epitopes on the mers-cov rbd vaccine. the fc-tagged rbd fragment containing residues from to was selected in this study because we previously showed that this fragment is a stable and effective vaccine candidate . 
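the motif rule described above (asparagine, then any residue except proline, then threonine) can be sketched in a few lines of python. this is only an illustration under stated assumptions: the function names `find_sequons` and `design_glycan_probe` and the toy sequence are hypothetical and do not come from the study, and the sketch checks the sequence motif only, not whether a site would actually be glycosylated in cells.

```python
import re

# n-x-t motif as described in the text: asparagine, any residue except
# proline, then threonine. the lookahead lets overlapping motifs be found.
SEQUON = re.compile(r"(?=N[^P]T)")


def find_sequons(seq):
    """Return 0-based indices of the asparagine in every N-x-T motif (x != P)."""
    return [m.start() for m in SEQUON.finditer(seq.upper())]


def design_glycan_probe(seq, pos):
    """List the point mutations needed to place an N-x-T motif at 0-based pos.

    Mutations are written in the usual 1-based style, e.g. 'R7N'.
    Hypothetical helper for illustration only.
    """
    seq = seq.upper()
    if pos < 0 or pos + 2 >= len(seq):
        raise ValueError("motif would fall outside the sequence")
    if seq[pos + 1] == "P":
        raise ValueError("proline at the x position is not allowed in the motif")
    mutations = []
    for offset, wanted in ((0, "N"), (2, "T")):
        current = seq[pos + offset]
        if current != wanted:
            mutations.append(f"{current}{pos + offset + 1}{wanted}")
    return mutations


if __name__ == "__main__":
    toy = "GSVAKTRLEQ"  # made-up sequence, not the mers-cov rbd
    print(find_sequons(toy))            # no motif present yet
    print(design_glycan_probe(toy, 3))  # a threonine already sits two residues downstream
    print(design_glycan_probe(toy, 6))  # no downstream threonine, so two changes are needed
```

in the toy sequence, the first design call needs a single mutation because a threonine already occupies the third motif position, while the second needs a double mutation; this mirrors the distinction between single and double mutants drawn in the text.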
four distinct epitopes on this mers-cov rbd fragment were selected based on their location on the rbd surface and their possible functional role in receptor binding: (i) arg (located on a protruding loop and in the receptor-binding motif (rbm) region); (ii) ala (located on a β-strand and in the rbm region); (iii) val (located on a β-strand and in the core region); (iv) thr (located on a protruding loop and in the core region) (fig. a,b). on the basis of the three-dimensional protrusion index map (supplementary fig. ), the epitopes containing arg and thr both have a high protrusion index, whereas the epitopes containing ala and val both have a low protrusion index. we introduced a glycan probe onto each of the above four epitopes on mers-cov rbd. to this end, we introduced single mutations v n, t n and a n to pair with the already existent thr , thr and thr , respectively, to generate three n-linked glycosylation sites. we also introduced double mutations r n/e t to generate the fourth n-linked glycosylation site. each of these glycosylation sites was located in an individual mers-cov rbd fragment. we expressed and purified each of the four mutant rbds in mammalian cells (supplementary fig. a). to test whether each of the above four epitopes on mers-cov rbd was actually glycosylated, we performed both sds gel electrophoresis and mass spectrometry. compared with the wild type rbd, each of the mutant rbds exhibited a slower electrophoretic mobility on the gel, consistent with additional glycosylation (supplementary fig. a). mass spectrometry revealed that the molecular weights of the mutant rbds were ~ - kda larger than that of the wild type rbd, which was also consistent with an introduced glycan probe in each of the mutant rbds (supplementary fig. b-f). for each of the purified mutant rbd samples, there was no visible presence of unglycosylated rbd on the sds gel or the mass spectrometry spectrum (supplementary fig. ).
thus, each of the four epitopes on mers-cov rbd had been successfully glycosylated. to understand the correlation between the epitopes' role in receptor binding and their potential to be recognized by immune responses, we examined whether these engineered glycan probes on mers-cov rbd interfered with receptor binding. to this end, we used two alternative approaches. one approach was an alphascreen assay, which analysed the interaction between recombinant rbds and recombinant human dpp in solution (fig. c) , and the other approach was fluorescence-activated cell sorting (facs), which examined the interaction between recombinant rbds and human dpp expressed on the huh- cell-surface (fig. d) . the results from both assays revealed that the glycan probe located at residue reduced the binding of the rbd to dpp , the glycan probe located at residue reduced the binding of the rbd to dpp even more, and the ones located at residues and had no impact on dpp binding. structural analysis of the rbd/dpp interactions suggests that a glycan probe located at residue would have serious steric clash with dpp binding, whereas a glycan probe located at residue would have partial steric interference with dpp binding (fig. b) . glycan probes located at residues and would be too far away from the receptor-binding region to have any impact on dpp binding. hence, both the biochemical and structural analyses similarly elucidated the role of each of the glycan probes in the binding of the rbd to dpp . to understand the epitopes' potential to interact with neutralizing monoclonal antibodies (mabs), we analysed how the engineered glycan probes interfered with the binding of the rbd to different neutralizing mabs. we had access to four humanized mabs (hms- , m -fab, m -fab, and m -fab). all of these mabs were previously shown to be highly potent in neutralizing mers-cov infection of human cells [ ] [ ] [ ] [ ] . 
elisa between each of the rbds and each of the mabs demonstrated that the glycan probe located at residue abolished the binding of the rbd to hms- (fig. a) , reduced the binding of the rbd to m -fab and m -fab (fig. b,c) , and had no significant impact on the binding of the rbd to m -fab (fig. d ). in contrast, the glycan probes located at the other three residues, , and , did not interfere with the binding of the rbd to any of the mabs. the binding sites on the rbd for each of the mabs were previously characterized through mutagenesis and/or structural studies [ ] [ ] [ ] [ ] . three of the four mabs, hms- , m -fab and m -fab, bind at or near the epitope containing arg , whereas all of the mabs bind away from the epitopes containing ala , val , and thr ( fig. e) . overall, among the four selected epitopes, the epitope containing arg played the most important role in the binding of neutralizing mabs, and consequently the glycan probe covering this epitope interfered most with the binding of neutralizing mabs. this study thus far has characterized the structural features, receptor binding, and neutralizing mab binding for four selected rbd epitopes using a glycan probe strategy. each of the glycan probes introduced to one of the rbd epitopes only interfered with the binding of dpp or mabs that interact with this specific epitope, but had no impact on the binding of dpp or mabs to distant epitopes. this observation suggests that each of the glycan probes only shielded the epitope where the glycan probe was attached to, but did not affect the structures of other antigenic sites. it is consistent with findings obtained in studies on another viral spike protein, respiratory syncytial (rsv) virus f protein . to evaluate how the glycan probes altered the neutralizing immunogenicity (that is, the capacity to induce neutralizing immune responses) of mers-cov rbds, we immunized balb/c mice with each of the four rbds containing one of the glycan probes. 
the immunization schedule is shown in supplementary fig. a. sera were collected from mice immunized with each of the rbds, and tested for mers-cov-neutralizing antibodies. compared to the wild type rbd vaccine, the rbds containing a glycan probe at residues and induced significantly higher and lower neutralizing antibody titres, respectively, in mouse sera, whereas the rbds containing a glycan probe at residues and failed to induce significant changes in neutralizing antibody titres in mouse sera (fig. a; supplementary table ). thus, masking the epitope containing arg led to reduced neutralizing antibody titres in the immunized mice, demonstrating that this epitope made a positive contribution to the vaccine's overall neutralizing immunogenicity. based on the same rationale, the epitope containing thr made a negative contribution and the epitopes containing val and ala made insignificant contributions to the vaccine's overall neutralizing immunogenicity. the experiments were further repeated twice and similar results were obtained. these results provided a qualitative evaluation of the neutralizing immunogenicity for each of these epitopes. how can we quantitatively evaluate the epitopes' neutralizing immunogenicity? here we introduce a novel concept, nii, to describe an epitope's neutralizing immunogenicity. nii is defined as the contribution of an epitope to the vaccine's overall neutralizing immunogenicity. it can be determined by masking the epitope with a glycan probe and then measuring the relative change of the vaccine's overall capacity to elicit neutralizing antibody titres (fig. b). based on this definition, we calculated the nii for each of the four epitopes on the rbd (fig. c; supplementary table ). the epitope containing thr had an nii of − . .
the negative sign of the nii suggests a negative contribution from this epitope to the vaccine's overall neutralizing immunogenicity, and the value of the nii implies that masking this epitope using a glycan probe increased the vaccine's overall neutralizing immunogenicity by three fold. conversely, the epitope containing arg had an nii of . , suggesting that this epitope made a positive contribution to the vaccine's overall neutralizing immunogenicity and that masking this epitope using a glycan probe reduced the vaccine's overall neutralizing immunogenicity to % of that of the wild type vaccine. therefore, the nii can serve as an effective tool to quantitatively evaluate the neutralizing immunogenicity of any epitope on the mers-cov rbd vaccine. to investigate why masking a negative epitope led to enhanced neutralizing immunogenicity of the mers-cov rbd vaccine, we performed a competition assay between neutralizing mabs and mutant-rbd-induced mouse serum for the binding of wild type mers-cov rbd. more specifically, elisa was carried out between a neutralizing mab and mers-cov rbd in the presence of mouse serum induced by the -glycosylated mers-cov rbd (fig. a,b). as a comparison, the mouse serum induced by the wild type mers-cov rbd was also included. two different mabs were used in the competition binding assay: hms- , which binds to the rbm epitope containing arg , and m -fab, which binds to the rbm epitope surrounding glu -asp . the results showed that the serum induced by the -glycosylated rbd inhibited the mab-rbd binding significantly better than the serum induced by the wild type rbd, revealing enhanced neutralizing capability of the mouse serum due to the glycosylation at the position. moreover, the mouse serum induced by the -glycosylated rbd demonstrated enhanced binding for at least two separate neutralizing epitopes on the rbm, one surrounding arg and the other glu -asp .
thus, masking an epitope on the rbd core structure with a high negative nii refocuses the host immune response on neutralizing epitopes on the rbm, leading to enhanced neutralizing immunogenicity of the rbd vaccine. rational design of rbd vaccine with enhanced efficacy. to prove that highly effective mers-cov rbd vaccines can be rationally designed based on epitopes' neutralizing immunogenicity, we investigated the efficacy of two engineered mers-cov rbd vaccines using virus challenge studies. these engineered rbd vaccines have a negative epitope (that is, the epitope containing thr and with an nii of − . ) and a positive epitope (that is, the epitope containing arg and with an nii of . ) masked, respectively, by a glycan probe. we chose to mask the epitopes rather than deleting them or mutating all of their residues to alanines because introducing a glycan is more convenient in practice and less disruptive to the immunogen's tertiary structure. the wild type rbd vaccine was used as a control. the animal model for vaccine testing was the lethal transgenic mouse model expressing human dpp (hdpp -tg mice). these mice were chosen for analysis because they are very susceptible to mers-cov and also because preventing disease in these mice is a stringent test of efficacy. the immunization schedule is shown in supplementary fig. b. briefly, hdpp -tg mice were immunized with each of the rbd vaccines and challenged with mers-cov, and the survival rate and weight changes of the mice were recorded. the efficacies of the rbd vaccines were evaluated based on the morbidity and mortality of the immunized and challenged mice.
first, hdpp -tg mice immunized with the negative-epitope-masked rbd vaccine (that is, rbd containing t n mutation) all survived mers-cov challenge ( % survival rate), whereas hdpp -tg mice immunized with the wild type rbd vaccine and with the positive-epitope-masked rbd vaccine (that is, rbd containing r n/e t mutations) demonstrated survival rates of and %, respectively, after mers-cov challenge (fig. a). second, mers-cov challenge did not cause any weight loss in hdpp -tg mice immunized with the negative-epitope-masked rbd vaccine, but led to significant weight loss in hdpp -tg mice immunized with either the wild type rbd vaccine or the positive-epitope-masked rbd vaccine (fig. b). the experiments were further repeated twice and similar results were obtained. these results revealed the enhanced efficacy of the negative-epitope-masked rbd vaccine. current vaccine design lacks an effective approach to evaluate the neutralizing immunogenicity of epitopes on viral subunit vaccines. in this study, we have developed a novel approach to measure vaccine epitopes' neutralizing immunogenicity. using the mers-cov rbd as a model, we singly mask selected epitopes using host-derived glycan probes, and then measure the corresponding changes in the vaccine's overall neutralizing immunogenicity. we have also developed a method for calculating the nii for the selected epitopes. an epitope's neutralizing immunogenicity has two components: neutralizing capacity and immunogenicity. on the one hand, an epitope's neutralizing capacity is determined by the physical overlap of the epitope with the receptor-binding region and the potential role of the epitope in receptor binding. on the other hand, an epitope's immunogenicity is determined by its immune selfness (that is, how similar or dissimilar the viral epitope is to a host-originated epitope), protrusion, and other physical and chemical properties of the epitope.
logically, an epitope's nii is correlated with a combination of factors such as immune selfness, protrusion, potential overlap with receptor-binding region, and more. because of the complex nature of nii, it is unlikely that the nii can be reliably predicted by software; instead, this study demonstrates that nii can be experimentally measured using the glycan probe approach. as proof-of-concept, we measured the nii for four distinct epitopes on the mers-cov rbd vaccine, and also characterized the protrusion index, receptor binding, and monoclonal antibody binding of the rbds each with an epitope masked by a glycan probe. the results revealed that the epitopes with a high and low protrusion index tend to have an nii with a high and low absolute value, respectively. in addition, epitopes within the receptor-binding region tend to have a positive nii, and the epitopes located outside the receptor-binding region tend to have a negative nii. we cannot correlate the immune selfness of epitopes with nii because there is no good method to evaluate the immune selfness of conformational epitopes. overall, in rational design of viral subunit vaccines, the epitopes with a high positive nii should be preserved and exposed, while those with a high negative nii should be eliminated via deletion or masking. indeed, our study has identified an epitope containing thr as one with a high negative nii on mers-cov rbd. thr is located on a protruding loop and away from the receptor-binding region, both of which contribute to its high negative nii. importantly, thr is buried inside the full-length coronavirus spike proteins, and only becomes exposed on the surface of the recombinant mers-cov rbd vaccine as an outcome of subunit vaccine design ( supplementary fig. ). to overcome this limitation of subunit vaccine design, the newly exposed epitopes with a high negative nii need to be masked or deleted. 
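the design rule above (preserve epitopes with a high positive nii, mask or delete those with a high negative nii) can be illustrated with a short python sketch. the text describes the nii only as the relative change in neutralizing antibody titre when an epitope is masked, so the formula used here, (wild type titre minus masked titre) divided by wild type titre, is an assumption chosen to match that description and the stated sign convention; it is not necessarily the exact calculation used in the study. the titres, epitope labels, and the 0.5 cut-off are hypothetical.

```python
def nii(titre_wt, titre_masked):
    """Relative change in neutralizing titre when one epitope is masked.

    Assumed form: (wt - masked) / wt. Positive means masking lowered the
    titre (the epitope helps); negative means masking raised it.
    """
    if titre_wt <= 0:
        raise ValueError("wild type titre must be positive")
    return (titre_wt - titre_masked) / titre_wt


def recommend(nii_value, cutoff=0.5):
    """Map an nii value to a design action; the cutoff is an arbitrary example."""
    if nii_value >= cutoff:
        return "preserve and expose"
    if nii_value <= -cutoff:
        return "mask or delete"
    return "no strong effect"


if __name__ == "__main__":
    wt_titre = 400  # hypothetical neutralizing titre for the unmodified vaccine
    masked_titres = {  # hypothetical titres after masking each epitope
        "epitope_a": 100,
        "epitope_b": 1600,
        "epitope_c": 380,
    }
    for name, titre in masked_titres.items():
        value = nii(wt_titre, titre)
        print(f"{name}: nii = {value:+.2f} -> {recommend(value)}")
```

under this assumed formula, an epitope whose masking quarters the titre scores 0.75, and one whose masking quadruples the titre scores −3.0, so the sign carries the direction of the contribution and the magnitude carries its size.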
to apply the nii strategy to vaccine design, we successfully enhanced the efficacy of the mers-cov rbd vaccine in virus challenge studies by masking its strong negative epitope (that is, the epitope containing thr , with an nii of − . ) with a glycan probe. this engineered vaccine effectively protected hdpp -transgenic mice from a lethal mers-cov infection. compared with mice given the wild type rbd vaccine, mice immunized with the engineered rbd vaccine showed increased neutralizing antibody responses in their sera; when challenged by mers-cov, they also demonstrated a higher survival rate and less weight loss. these results prove that negative epitopes should be eliminated in vaccine design. in contrast, another engineered vaccine with a positive epitope masked (that is, the epitope containing arg , with an nii of . ) showed reduced efficacy in virus challenge studies, confirming that positive epitopes should be preserved and exposed in vaccine design. taken together, we validated both the significance and feasibility of the nii strategy in vaccine design by successfully engineering a variant form of the mers-cov rbd vaccine with significantly enhanced efficacy. overall, our study contributes to viral subunit vaccine design in the following ways. first, our study introduces a new concept, nii, for the evaluation of how individual epitopes contribute to the overall neutralizing immunogenicity of subunit vaccines. previous studies could not evaluate the neutralizing immunogenicity of conformational b-cell epitopes that dominate coronavirus rbd vaccines. second, using the nii strategy our study identified an immunodominant non-neutralizing epitope on the surface of the mers-cov rbd core structure. this result shows that exposure of previously buried epitopes on viral subunit vaccines poses a challenge for subunit vaccine design. this concept, although needing further investigation, may be critical for the development of many viral rbd-based vaccines.
third, our study demonstrates that masking an immunodominant non-neutralizing epitope with a negative nii value on the surface of the mers-cov rbd core structure can shift host immune responses towards the neutralizing epitopes in the rbm region, providing means to overcome the limitation of viral subunit vaccines from vaccine design. previous studies showed that hypervariable regions on hiv gp divert host immune responses and that masking these regions can shift host immune responses towards conserved neutralizing epitopes , , providing means to overcome the limitation of viral subunit vaccines from viral evolution. fourth, although the nii strategy was used in the current study to improve the efficacy of viral subunit vaccines, it can also be potentially helpful in other epitope-based vaccine research. for example, previous studies masked or resurfaced non-neutralizing epitopes on viral immunogens, and used the engineered immunogens as baits to screen from neutralizing sera for monoclonal antibodies that bind to conserved neutralizing epitopes [ ] [ ] [ ] [ ] . it is conceivable that the nii strategy can help identify immunodominant non-neutralizing epitopes on immunogens, allowing more targeted epitope modifications for efficient antibody screening. finally, our study suggests that a three-dimensional 'neutralizing immunogenicity map' (nim) can be drawn to describe the distribution of epitopes with different neutralizing immunogenicity on the surface of viral subunit vaccines. such an nim can guide targeted masking of multiple strong negative epitopes, further enhancing the efficacy of viral subunit vaccines. we believe that our approach can facilitate the rational subunit vaccine design not only for coronaviruses such as mers-cov and sars-cov, but also for other life-threatening viruses such as hiv, influenza virus, and ebola virus. cell lines. hek t (human embryonic kidney) and vero e (monkey kidney) cells were obtained from american type culture collection. 
huh- (human hepatoma) cells were kindly provided by dr charles m. rice at rockefeller university. these cell lines were cultured in dulbecco's modified eagle medium (dmem) supplemented with % fetal bovine serum (fbs), mm l-glutamine, units ml⁻¹ penicillin, and mg ml⁻¹ streptomycin (life technologies inc.). sf insect cells were purchased from life technologies inc., and cultured in sf- iii sfm medium supplemented with units ml⁻¹ penicillin and mg ml⁻¹ streptomycin (life technologies inc.). expression and purification of recombinant proteins. the expression and purification of recombinant mers-cov rbd was carried out as previously described. briefly, wild type (wt) rbd (residues - ; genbank accession number: afs . ) containing a c-terminal human igg fc tag was expressed in hek t cells, secreted into the cell culture supernatant, and purified by protein a affinity chromatography (ge healthcare). mutant rbd fragments containing engineered glycan probes were constructed via site-directed mutagenesis, and expressed and purified in the same way as the wild type rbd. the expression and purification of recombinant human dpp was carried out as previously described. briefly, human dpp ectodomain (residues - ; genbank accession no. np_ . ) containing an n-terminal human cd signal peptide and a c-terminal his tag was expressed in insect sf cells using the bac-to-bac expression system (life technologies inc.), secreted to cell culture medium, and purified sequentially on a hitrap nickel chelating hp column and a superdex gel filtration column (ge healthcare). sds gel electrophoresis. mg wild type or mutant mers-cov rbds were subjected to sds gel electrophoresis under denaturing conditions. protein bands were stained using coomassie brilliant blue r (sigma-aldrich), and images were captured using a myecl imager (life technologies inc.). mass spectrometry. wild type or mutant mers-cov rbds at mm concentration in mm tris-cl, ph . , mm nacl were ultrafiltered with deionized water five times using an amicon ultra centrifugal filter with a kda molecular weight cutoff. the desalted protein samples were subjected to maldi-tof mass spectrometry at tufts university core facility. mass spectrometry was performed in linear mode for molecular weight screening. alphascreen protein-protein binding assay. binding between recombinant mers-cov rbds and recombinant human dpp was measured using an alphascreen assay as previously described. briefly, nm wild type or mutant mers-cov rbd with a c-terminal fc tag was incubated with nm human dpp with a c-terminal his tag at room temperature for h. alphascreen protein a acceptor beads and nickel chelate donor beads (perkinelmer life sciences) were added to the mixture at a final concentration of mg ml⁻¹ each. after incubation at room temperature for h, the alphascreen signal was measured using an enspire plate reader (perkinelmer life sciences), reflecting the binding affinity between the two proteins. facs. the binding between recombinant mers-cov rbds and human dpp expressed on the huh- cell surface was measured using fluorescence-activated cell sorting (facs) as previously described. briefly, huh- cells were incubated with wild type or mutant mers-cov rbd ( . mg ml⁻¹ ) at room temperature for min, followed by addition of fitc-conjugated anti-human-igg-fc polyclonal antibody ( : dilution) (sigma-aldrich) for min. the amounts of rbd-bound huh- cells were measured using flow cytometry, and the binding affinity between rbd and cell-surface dpp was characterized as median fluorescence intensity. animal immunization and sample collection. animal immunization and sample collection were carried out as previously described. briefly, balb/c mice were subcutaneously immunized with wild type or mutant mers-cov rbd ( mg per mouse) in the presence of montanide isa adjuvant. pbs plus montanide isa was included as a negative control.
immunized mice were boosted twice with the same immunogen and adjuvant at a -week interval, and sera were collected days after the last immunization for detection of neutralizing antibodies. elisa. the binding between recombinant mers-cov rbd and neutralizing mabs was measured using elisa as previously described . briefly, elisa plates were pre-coated with the same amount of wild type or mutant rbd ( mg ml À ) overnight at °c. after blocking with % non-fat milk at °c for h, serially diluted mabs were added to the plates and incubated at °c for h. after washes, the plates were incubated at °c for h with horseradish-peroxidaseconjugated anti-human-igg-fab polyclonal antibody ( : , dilution) (sigma-aldrich). enzymatic reaction was carried out using substrate , , , tetramethylbenzidine (life technologies inc.) and stopped with n h so . absorbance at nm (a ) was measured using elisa plate reader (tecan group ltd.). the competition between neutralizing mabs and mutant-rbd-induced mouse serum for the binding of wild type mers-cov rbd was carried out using elisa as described above, except that the binding between wild type rbd and the neutralizing mab (hms- or m -fab at mg ml À concentration) was performed in the presence of serially diluted mouse serum (t n-rbdinduced, wild-type-rbd-induced, or pbs-induced). the rbd-mab binding was detected by addition of horseradish-peroxidase-conjugated anti-human-igg-fab polyclonal antibody ( : , dilution) and subsequent enzymatic reaction. live mers-cov neutralization assay. a micro-neutralization assay was carried out to test neutralizing antibodies against live mers-cov as previously described . briefly, serially diluted mouse sera were incubated at room temperature for h with b infectious mers-cov virions (emc- strain), and were then incubated with vero e cells at °c for h. the neutralizing capability of the mouse sera was measured by determining the presence or absence of virus-induced cytopathic effect (cpe). 
neutralizing antibody titres were expressed as the reciprocal of the highest dilution of sera that completely inhibited virus-induced cpe in at least % of the wells (nt ). mers-cov challenge studies. mers-cov challenge studies were carried out using human-dpp -transgenic mice as previously described . briefly, mice were intramuscularly immunized with wild type or mutant mers-cov rbd ( mg per mouse) in the presence of aluminium adjuvant , and boosted once weeks after the initial immunization. weeks after the second immunization, mice were challenged with mers-cov (emc- strain, tcid ), and observed for days for detection of survival rate and weight changes. statistical analyses. in fig. c -d, comparisons between wt rbd and each of the mutant rbds in their binding to recombinant dpp by alphascreen (fig. c) or to cell-surface dpp by facs (fig. d) were done using two-tailed t-test (***, p < . ; measurements for each rbd in figs c and measurements for each rbd in fig. d) . in figs a-d, nonlinear regression was performed using a log(inhibitor) versus normalized response-variable slope model. r² of curve fit is larger than . for all curves in fig. a- the extra sum-of-squares f test (***, p < . ; different dilutions of each mab, measurements at each dilution for each mab). in fig. a , comparisons between wt rbd and each of the mutant rbds in their capacity to induce neutralizing serum in mice were done using two-tailed t-test (*, p < . ; measurements for each rbd). in fig. , nonlinear regression was performed using a log(inhibitor) versus normalized response-variable slope model. r² of curve fit is larger than . for all curves in fig. . comparisons between wt-rbd-induced serum and t n-rbd-induced serum in their inhibition of rbd/mab binding by elisa were done using the extra sum-of-squares f test (***, p < . ; different dilutions of each serum, measurements at each dilution for each serum). all statistical analyses were performed using graphpad prism software. data availability.
all relevant data are available from the authors. rational design of vaccines to elicit broadly neutralizing antibodies to hiv- . cold spring harb structure-based antigen design: a strategy for next generation vaccines advances in structure-based vaccine design structural vaccinology starts to deliver the spike protein of sars-cov-a target for vaccine and therapeutic development receptor recognition mechanisms of coronaviruses: a decade of structural studies immunodominance of cd t cells to foreign antigens is peptide intrinsic and independent of molecular context: implications for vaccine design immunodominance: a pivotal principle in host response to viral infections immunodominance with progenitor b cell diversity in the neutralizing antibody repertoire to influenza infection immunodominance of conformation-dependent b-cell epitopes of protein antigens refocusing neutralizing antibody response by targeted dampening of an immunodominant epitope immune focusing and enhanced neutralization induced by hiv- gp chemical cross-linking designing immunogens to elicit broadly neutralizing antibodies to the hiv- envelope glycoprotein prediction of immunogenicity for therapeutic proteins: state of the art bcr-abl fusion regions as a source of multiple leukemiaspecific cd þ t-cell epitopes ctl responses of hla-a . -transgenic mice specific for hepatitis c viral peptides predict epitopes for ctl of humans carrying hla-a . 
a novel coronavirus associated with severe acute respiratory syndrome coronavirus as a possible cause of severe acute respiratory syndrome isolation of a novel coronavirus from a man with pneumonia in saudi arabia middle east respiratory syndrome coronavirus (mers-cov): announcement of the coronavirus study group structure of mers-cov spike receptor-binding domain complexed with human receptor dpp molecular basis of binding between novel human coronavirus mers-cov and its receptor cd structure of sars coronavirus spike receptor-binding domain complexed with receptor crystal structure of the receptor-binding domain from newly emerged middle east respiratory syndrome coronavirus angiotensin-converting enzyme is a functional receptor for the sars coronavirus dipeptidyl peptidase is a functional receptor for the emerging human coronavirus-emc receptor-binding domain of severe acute respiratory syndrome coronavirus spike protein contains multiple conformation-dependent epitopes that induce highly potent neutralizing antibodies identification of a receptor-binding domain in the s protein of the novel human coronavirus middle east respiratory syndrome coronavirus as an essential target for vaccine development receptor-binding domain of sars-cov spike protein induces long-term protective immunity in an animal model current advancements and potential strategies in the development of mers-cov vaccines pre-fusion structure of a human coronavirus spike protein cryo-electron microscopy structure of a coronavirus spike glycoprotein trimer assembly of asparagine-linked oligosaccharides searching for an ideal vaccine candidate among different mers coronavirus receptor-binding fragments-the importance of immunofocusing in subunit vaccine design an algorithm that identifies protruding atoms in proteins a conformation-dependent neutralizing monoclonal antibody specifically targeting receptor-binding domain in middle east respiratory syndrome coronavirus spike protein exceptionally 
potent neutralization of middle east respiratory syndrome coronavirus by human monoclonal antibodies junctional and allele-specific residues are critical for mers-cov neutralization by an exceptionally potent germline-like antibody single-dose treatment with a humanized neutralizing antibody affords full protection of a human transgenic mouse model from lethal middle east respiratory syndrome (mers)-coronavirus infection prefusion f-specific antibodies determine the magnitude of rsv neutralizing activity in human sera multi-organ damage in human dipeptidyl peptidase transgenic mice infected with middle east respiratory syndrome-coronavirus characterization and demonstration of the value of a lethal mouse model of middle east respiratory syndrome coronavirus infection and disease rational design of envelope identifies broadly neutralizing human monoclonal antibodies to hiv- a combinatorial mutagenesis approach for functional epitope mapping on phage-displayed target antigen: application to antibodies against epidermal growth factor high throughput functional epitope mapping: revisiting phage display platform to scan target antigen surface defining a protective epitope on factor h binding protein, a key meningococcal virulence factor and vaccine antigen receptor usage and cell entry of bat coronavirus hku provide insight into bat-to-human transmission of mers coronavirus and : a new generation of water in oil emulsions as adjuvants for human vaccines current adjuvants and new perspectives in vaccine formulation we thank drs. dimiter s. dimitrov and tianlei ying at the national institutes of health for providing m , m and m mabs this work was supported by nih grants r ai and r ai (f.l.), nih grant po ai (s.p.), nih grants r ai , u ai , r ai and the intramural fund of new york blood center nyb (l.d. 
and s.j.), nih grant r ai - and pilot grants from the center for biodefense and emerging infectious diseases and from the galveston national laboratory (c.k.t), china national l.d., s.j., y.z., f.l. designed the experiments; l.d., w.t., y.y., g.z., q.z., s.s., c.l. and x.t. performed the experiments; l.d., c.k.t., s.p., s.j., y.z., f.l. analysed the data; l.d., s.p., s.j., y.z. and f.l. wrote the paper. key: cord- - jeaqqn authors: ma, huan; zeng, weihong; he, hongliang; zhao, dan; yang, yunru; jiang, dehua; zhou, peigen; qi, yingjie; he, weihuang; zhao, changcheng; yi, ruting; wang, xiaofang; wang, bo; xu, yuanhong; yang, yun; kombe kombe, arnaud john; ding, chengchao; xie, jiajia; gao, yong; cheng, linzhao; li, yajuan; ma, xiaoling; jin, tengchuan title: covid- diagnosis and study of serum sars-cov- specific iga, igm and igg by a quantitative and sensitive immunoassay date: - - journal: nan doi: . / . . . sha: doc_id: cord_uid: jeaqqn background the current pandemic of the severe acute respiratory syndrome coronavirus (sars-cov- ) has caused a great loss in lives and economy. detecting viral rnas on nasopharyngeal and throat swabs is the standard approach for sars-cov- diagnosis with variable success. currently, there are only a few studies describing the serological diagnostic methods that involve the detection of sars-cov- -specific igm and igg. here, we aimed to develop a more quantitative and sensitive serological test for covid- diagnosis, monitoring and clinical investigation, based on the detection of antigen-specific iga as well as igm and igg in blood in response to sars-cov- infection. methods in this investigation, we report the development of a set of validated diagnostic kits for detecting serum iga, igm, and igg specific to sars-cov- nucleocapsid protein (np) and receptor-binding domain (rbd) of the spike protein by chemi-luminescence immuno-analysis. 
the kits were tested with a cohort of sera from laboratory-confirmed covid- patients, and sera from sars-cov- negative or healthy individuals as negative controls. a standard receiver operating characteristic (roc) analysis was conducted to evaluate the diagnostic accuracy. using the kits, serum levels of iga, igm, and igg were analyzed, in response to sars-cov- infection and covid- pathogenesis. findings the diagnostic kits based on the rbd antigen outperformed those based on the np. rbd-specific iga, igm, and igg detection kits showed sensitivities of . %, . %, and . %, and specificities of . %, . %, and . %, respectively. in addition, using purified rbd-specific immunoglobulins from a serum pool of covid- patients as standards, the serum concentrations of rbd-specific iga, igm, and igg proteins were determined. the concentrations varied widely among different patients. median concentration of iga and igm reached peaks at - days after illness onset at . μg/ml and . μg/ml, respectively, while median concentration of igg peaked during - days after illness onset at . μg/ml. furthermore, the serum iga level positively correlates with covid- severity. interpretation our immunoassay of measuring sars-cov- specific antibodies iga, igm, and igg in serum provides a better serological testing with improved sensitivity and specificity. data of iga, igm, and igg responses in blood of covid- patients may provide novel insight for the monitoring and treatments of covid- . the kits are also suitable for epidemiological studies and vaccine validations. cc-by-nc-nd .
international license it is made available under a is the author/funder, who has granted medrxiv a license to display the preprint in perpetuity. (which was not certified by peer review) the copyright holder for this preprint this version posted april , . . https://doi.org/ . / . . . doi: medrxiv preprint a sensitivity below %. - therefore, there is an urgent need for more reliable and rapid diagnostic methods to screen sars-cov- infected people including those who do not have overt symptoms. a serological test of virus-induced antibody production has unique advantages in clinical diagnostics, especially for identifying people who acquired immunity against pathogens without noticeable symptoms. when the virus invades host, the body produces large amounts of immunoglobulin (ig) by the immune system and released into blood, among them, igg, igm, and iga isotypes. it has been widely believed that igm is the first antibody to be transiently synthesized in response to the virus invasion. igg is a major class of immunoglobulins found in the blood, comprising % of total serum immunoglobulins and has long-term immunity and immunological memory. , therefore, a combination of igm and igg has been used in various serological tests for detecting infection of sars- cov- as previously used for sars and other coronaviruses. , , , [ ] [ ] [ ] [ ] [ ] in contrast, iga, which is mainly produced in mucosal tissues to hinder virus invasion and replication but also detected in blood (~ % of total immunoglobulins in blood), has not been widely used in serological tests for detecting coronavirus infection. iga's production kinetics and roles in anti-viral immunity of iga are even less known. currently, only a few published studies reported diagnosis of covid- by using elisa or "flow immunoassay" for detection of serum igm and igg with limited accuracy, , , - although sars-cov- specific iga in serum was also detected in recent papers or a preprint. 
the kinetics of antibody responses in covid- remain undefined, specifically for iga production. in this study, we designed and evaluated a set of sensitive and quantitative kits to measure serum iga,
underlying illnesses; the most common one was hypertension in eighteen patients ( · %). a total serum samples were taken from the covid- patients. negative controls and potentially interfering non-covid- patient serum samples were collected in order to evaluate the reliability of the kits. this cohort contains sera from obviously healthy people, fifteen sera from once suspected cases (rt-qpcr negative but with typical manifestations of pneumonia) and sera from other patients with different underlying diseases. all sera were stored at - °c.
diagnostic kit preparation and testing briefly, the purified np or rbd viral antigens were coated onto magnetic particles to capture sars-cov- specific iga, igm, and igg in patient sera. then a second antibody that recognizes iga, igm, or igg is
based on the clinical rt-qpcr diagnosis results of sars-cov- infection, receiver operating characteristic (roc) analysis was conducted using medcalc software to determine the optimal cut-off value (criterion) and evaluate the diagnostic value of np-or rbd-specific iga, igm, and igg kits.
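The cut-off selection described above can be illustrated with a short sketch. The source only states that MedCalc was used to find an "optimal cut-off value (criterion)"; the rule below (maximizing Youden's J = sensitivity + specificity - 1) is an assumption about what that criterion means, and the function names and sample values are hypothetical, not the authors' data or code.

```python
def sensitivity_specificity(values, is_case, cutoff):
    """Call a sample positive when its signal is >= cutoff, then compare
    the calls with the reference diagnosis (is_case)."""
    tp = sum(1 for v, c in zip(values, is_case) if c and v >= cutoff)
    fn = sum(1 for v, c in zip(values, is_case) if c and v < cutoff)
    tn = sum(1 for v, c in zip(values, is_case) if not c and v < cutoff)
    fp = sum(1 for v, c in zip(values, is_case) if not c and v >= cutoff)
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one case and one control")
    return tp / (tp + fn), tn / (tn + fp)


def youden_cutoff(values, is_case):
    """Scan every observed value as a candidate cut-off and keep the first
    one that maximizes Youden's J (assumed criterion, see lead-in)."""
    best = None
    for cutoff in sorted(set(values)):
        se, sp = sensitivity_specificity(values, is_case, cutoff)
        j = se + sp - 1.0
        if best is None or j > best[0]:
            best = (j, cutoff, se, sp)
    return best[1], best[2], best[3]


if __name__ == "__main__":
    # Hypothetical relative signal values: four controls, then four cases.
    signal = [0.2, 0.4, 0.5, 0.9, 1.5, 2.0, 3.1, 0.8]
    case = [False, False, False, False, True, True, True, True]
    print(youden_cutoff(signal, case))
```

Ties are resolved in favour of the lowest cut-off, which favours sensitivity; a screening or seroprevalence setting might reasonably prefer the opposite.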
the specificity and sensitivity of the antibody kits were calculated according to the following formulas: sensitivity = true positives / (true positives + false negatives) × 100%; specificity = true negatives / (true negatives + false positives) × 100%. in order to analyze the correlation of serum antibody levels and age with disease severity, we first used the kruskal-wallis test to test whether there is any significant difference in iga among the three groups (mild, moderate, severe). then dunn's test was used to perform pair-wise tests between groups, and the benjamini-hochberg procedure was used to adjust p-values. all the above analyses used r software version . . . a p value less than · was judged statistically significant.
results highly purified sars-cov- np and rbd proteins (supplementary figure ) were employed to develop a series of serological test kits, to detect the presence of np-and rbd-specific iga, igm, and igg, respectively (hereinafter referred to as "np kit" and "rbd kit"). a cohort of sera from sars-cov- infected patients was initially tested with both np and rbd kits, together with sera from non-sars-cov- infected patients as negative controls. the np kits for iga, igm, and igg showed diagnostic sensitivities of · %, · %, and · %, and specificities of · %, · %, and % respectively (supplementary figure a-c). however, the rbd kits for detecting iga, igm, and igg showed higher diagnostic sensitivities of · %, · %, and · %, and specificities of %, · %, and %, respectively (supplementary figure d-f). we conclude that the rbd based kits
igg kits, the sensitivity, specificity and overall agreement elevate to · %, %, and · %, respectively. this is much better than when igm and igg were combined. when iga, igm, or igg individual kit was used, we observed a total of ( . % to . %), ( . % to . %), and ( to
and non-covid- patients also indicate that our rbd-based detection kits did not cross-react with antibodies raised against other human coronaviruses presenting in ~ % of common cold cases and also causing pneumonia. taken together, our detection systems are highly specific to sars-cov- rbd. we attempted to analyze the kinetics of all the three isotypes of antibodies when multiple serum
were significantly older (median age of · ) than those patients with moderate (median age of ) and mild symptoms (median age of ). remarkably, we found that iga concentrations in severe cases were significantly higher than in mild or moderate cases (figure a). igg levels in moderate and severe covid- patients were also higher than in mild cases (p < · ) (figure c). the observation that serum igg levels were higher in severe and moderate than in mild covid- patients has been previously reported. we also provided here a novel observation that serum iga levels correlate with covid- severity (figure a). how the levels and roles of different types of antibodies relate to covid- severity remains to be determined.
discussion compared to sampling of nasopharyngeal or throat swabs, blood extraction is more convenient and reliable. furthermore, serum antibody test is more convenient, fast and accurate, and with other advantages over the detection of viral rna. we report here an improved serological kit that can sensitively and quantitatively detect serum levels of iga as well as igm and igg.
together with recent reports by others, the serological data that we obtained from serum samples of
the nucleocapsid protein (np) is the most abundant protein in coronaviruses; it was reported to be highly immunogenic and is often used as a diagnostic marker for coronaviruses such as sars-cov . the rbd of the spike protein on the viral surface is the ligand binding to the major host receptor ace ; therefore rbd could be a main target for neutralizing antibodies. in this study, we explored the possibility of using either np or rbd as an immobilized antigen for developing a clinical covid- diagnostic kit. our data (supplementary figure ) showed that rbd-based diagnostic kits performed better than np-based kits in detecting all three types of antibodies. a few previous studies reported that rbd-based igm and igg detection is better than np-based detection when a comparison was made, and that the measurement agrees with the titers measured by virus neutralization assays. we provide here evidence that rbd as an immobilized antigen is also better than np in detecting serum iga from covid- patients. the exact mechanisms behind the difference between the two types of viral antigens remain to be resolved. it could be that the np, as a highly basic protein, interacts less specifically with acidic residues in the complementarity determining regions of antibodies. it could also be due to the fact that the np antigen is expressed in bacteria, as most investigators do, whereas the rbd protein we used is expressed in a human cell line, enabling critical glycosylation and high-affinity binding to antibodies raised in covid- patients. nonetheless, we showed that our serological kits based on sars-cov- spike protein rbd as an immobilized antigen provide a high sensitivity and specificity for detecting iga, igm, and igg in a quantitative manner.
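The results report that combining the RBD-specific IgA and IgG kits raised sensitivity and overall agreement. The source does not say how the two results were merged; the sketch below assumes an "either kit positive" rule, and the per-sample calls are hypothetical values chosen only to show the arithmetic.

```python
def combine_either_positive(calls_a, calls_b):
    """Combined call is positive when at least one kit is positive
    (assumed rule; the source does not specify the combination)."""
    return [a or b for a, b in zip(calls_a, calls_b)]


def sens_spec(calls, is_case):
    """Sensitivity and specificity of boolean calls against the reference."""
    tp = sum(1 for p, c in zip(calls, is_case) if c and p)
    fn = sum(1 for p, c in zip(calls, is_case) if c and not p)
    tn = sum(1 for p, c in zip(calls, is_case) if not c and not p)
    fp = sum(1 for p, c in zip(calls, is_case) if not c and p)
    return tp / (tp + fn), tn / (tn + fp)


if __name__ == "__main__":
    # Hypothetical calls: four confirmed cases followed by four controls.
    is_case = [True, True, True, True, False, False, False, False]
    iga = [True, True, False, True, False, False, False, False]
    igg = [True, False, True, True, False, True, False, False]
    print("iga     ", sens_spec(iga, is_case))
    print("igg     ", sens_spec(igg, is_case))
    print("combined", sens_spec(combine_either_positive(iga, igg), is_case))
```

Under this rule the combined sensitivity can only stay equal or rise, while the combined specificity can only stay equal or fall, so the approach pays off when both individual kits are already highly specific.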
our serological kits have overall good performance
our kits have much higher accuracy than rt-qpcr (sensitivity less than %) for detecting viral rna and than published immunoassays such as " flow immunoassay" and elisa in earlier studies. when we combined rbd-specific iga and igg kits together, the sensitivity, specificity and overall agreement rose to · %, %, and · %, respectively (table ). in addition, this rbd-based detection kit may also help to screen and detect neutralizing antibodies targeting sars-cov- rbd, because this peptide domain is exposed on the viral surface and functions as a ligand binding to the host cell surface receptor ace .
although one serum collected at the th day after illness onset was diagnosed as positive by our igm kit (not iga or igg kits in this study), the igm kit overall showed a lower diagnostic specificity of · % compared to that of igg and iga (figure ). igm is known to have relatively lower affinity toward antigens compared with that of igg or iga. in addition, igm often causes false positive signals, as we also observed (supplemental table ), due to its pentameric structure. in contrast, iga and igg antibodies do not have this problem. our rbd-specific igg kit showed high specificity of · % (figure ) but relatively low sensitivity of · %. this is expected, because most ( / ) false negative cases were samples collected at - days after illness onset, when igg production is likely very low. our rbd-based iga kit showed high sensitivity and specificity of · % and · %, except two sera collected at the th day after illness onset. all other sera ( at the th day, at the th day, at the th
we observed the presence of high levels of rbd-specific iga in covid- patients' sera.
it is widely believed that mucosal plasma cells are a major production source of iga, which is rapidly transported
iga is traditionally recognized to play an anti-inflammatory role and prevent tissue damage at mucosal sites. however, recent reports also demonstrated that serum iga is involved in the formation of immune complexes to amplify inflammatory responses. serum iga induced proinflammatory cytokine production by macrophages, monocytes and kupffer cells in non-mucosal tissues including liver, skin and peripheral blood. in this study, we observed that iga was present in covid- patients' serum, and its levels positively correlated with covid- severity (figure a). in our cohort, we also observed that igg levels were associated with worse clinical outcomes (figure c), as previously described. the latter phenomenon has been suggestive of possible antibody-dependent enhancement (ade) of
these observations suggest that the gut may be an important site for anti-viral response to coronaviruses, and large amounts of secretory iga could be detected in these mucosal tissues in addition to that in blood.
weakness of this study the current study in its present form has several limitations. we used serum samples from confirmed covid- patients in this study, and serum samples were not available every day for each patient. the earliest collected serum is at the th day after self-reported illness onset, and the last one
there were severe and five critical cases, respectively, accounting for · % and · % respectively. there were also few cases of covid- patients whose symptoms remained mild and serum samples were collected during hospitalization.
therefore, this study of the correlation between antibody levels and disease severity needs further verification. in summary, this study reports a novel sensitive and quantitative serological testing kit of detecting iga as well as igm and igg, for the diagnostics of covid- . due to its high specificity and sensitivity, this kit could sensitively and quantitatively measure levels of iga in blood and other tissues. the serological study also provides valuable information for monitoring and understanding of covid- . acknowledgements
doi: medrxiv preprint coronavirus disease a novel coronavirus from patients with pneumonia in china covid- and the cardiovascular system a novel coronavirus associated with severe acute respiratory syndrome isolation of a novel coronavirus from a man with pneumonia in saudi arabia diabetes and covid- detection of novel coronavirus ( -ncov) by real- time rt-pcr clinical features of patients infected with the laboratory diagnosis of covid- infection: current issues and challenges a preliminary study on serological assay for severe acute respiratory syndrome coronavirus (sars-cov- ) in admitted hospital patients profiling early humoral response to diagnose novel coronavirus disease (covid- ). clinical infectious diseases comparison of throat swabs and sputum specimens for viral nucleic acid detection in cases of novel coronavirus (sars-cov- ) infected pneumonia (covid- ) detection of specific antibodies to severe acute respiratory syndrome (sars) coronavirus nucleocapsid protein for serodiagnosis of sars coronavirus pneumonia structure and function of immunoglobulins role of natural and immune igm antibodies in immune responses development and clinical application of a rapid igm-igg combined antibody test for sars-cov- infection diagnosis temporal profiles of viral load in posterior oropharyngeal saliva samples and serum antibody responses during infection by sars-cov- : an observational cohort study. the lancet infectious diseases antibody responses to sars-cov- in covid- patients: the perspective application of serological tests in clinical practice immune phenotyping based on neutrophil-to-lymphocyte ratio and igg predicts disease severity and outcome for patients with covid- interaction of antigens and antibodies at mucosal surfaces. 
annual review of microbiology sars-cov- specific antibody responses in covid- patients a serological assay to detect sars-cov- seroconversion in humans the use of solvent/detergent treatment in pathogen reduction of use of ranks in one-criterion variance analysis key: cord- -fufattj authors: den hartog, gerco; schepp, rutger m; kuijer, marjan; geurtsvankessel, corine; van beek, josine; rots, nynke; koopmans, marion p g; van der klis, fiona r m; van binnendijk, robert s title: sars-cov- –specific antibody detection for seroepidemiology: a multiplex analysis approach accounting for accurate seroprevalence date: - - journal: j infect dis doi: . /infdis/jiaa sha: doc_id: cord_uid: fufattj background: the covid- pandemic necessitates better understanding of the kinetics of antibody production induced by infection with sars-cov- . we aimed to develop a high-throughput multiplex assay to detect antibodies to sars-cov- to assess immunity to the virus in the general population. methods: spike protein subunits s and receptor binding domain, and nucleoprotein were coupled to microspheres. sera collected before emergence of sars-cov- (n = ) and of non-sars-cov- influenza-like illness (n = ), and laboratory-confirmed cases of sars-cov- infection (n = ) with various severities of covid- were tested for sars-cov- –specific igg concentrations. results: our assay discriminated sars-cov- –induced antibodies and those induced by other viruses. the assay specificity was . %– . % with sensitivity . %– . %. by merging the test results for all antigens a specificity of % was achieved with a sensitivity of at least %. hospitalized covid- patients developed higher igg concentrations and the rate of igg production increased faster compared to nonhospitalized cases. conclusions: the bead-based serological assay for quantitation of sars-cov- –specific antibodies proved to be robust and can be conducted in many laboratories. 
we demonstrated that testing of antibodies against multiple antigens increases sensitivity and specificity compared to single-antigen–specific igg determination. coronavirus disease (covid- ) caused by the newly emerged severe acute respiratory syndrome coronavirus (sars-cov- ) has resulted in a pandemic in a largely immune-naive population. the presence of specific antibodies is currently being investigated to assess the induction of an immune response in patients and to assess the degree of exposure and immunity in the general population [ ] [ ] [ ] . as it is a recently emerged coronavirus variant, the kinetics and degree of immunity induced following contact with the virus and covid- disease are largely unknown. sars-cov- expresses a spike protein, highly similar to spike of sars-cov, which binds to angiotensin converting enzyme (ace ) [ , ] . binding of antibodies to the receptor binding domain (rbd) of spike neutralizes the ability of the virus to infect cells [ ] . in addition, antibodies are detected against other viral proteins, including nucleoprotein (n) [ ] . n is shielded within the virion and therefore n-specific antibodies are probably unable to neutralize the virus. although n may not be involved in neutralization of the virus, antibodies to n could provide an indicator of exposure to the virus. antibodies to n induced by sars-cov reportedly recognize n of sars-cov- but not of seasonal coronaviruses [ ] . estimates of the prevalence of seroconversion as proxy for protection of the general population may support health decision making, including the decision to lift lockdown measures. to appropriately apply an assay for serosurveys we need to know the precision of the assay, that is the sensitivity and specificity, which are variable between currently available tests [ , ] . performing and sustaining large studies to assess changing population immunity requires high-throughput screening assays that are robust and accurate [ ] . 
many countries now aim to assess the protective status of the general population for covid- using antibody assays. to guarantee high specificity, the assay should be validated with a representative number of sera from patients infected with other coronaviruses and other pathogens causing influenza-like illness (ili), but this is often lacking [ ] [ ] [ ] . to date, covid- prevalence of seroconverted individuals is relatively low and there is a risk of significant overestimation if an assay has insufficient specificity (supplementary table ). thus, high specificity is important at this stage [ , ] . our laboratory has extensive experience in developing multiplex assays to quantify antibodies to many bacterial and viral pathogens in the general population, of which most are part of the national immunization program [ , [ ] [ ] [ ] [ ] . we developed a high-throughput and highly quantitative bead-based multiplex immunoassay to assess the prevalence of seropositivity in the general population, and also anticipating the introduction of future sars-cov- vaccines. by multiplexing a broader range of sars-cov- antigens in a single assay we may generate a better understanding of the proportion of persons that have seroconverted. moreover, in a multiplex assay positivity can be compared among antigens to provide a more detailed evaluation of the antibody levels and to enhance assay performance [ ] . the developed assay was tested on samples from covid- patients with various severities of disease collected at multiple timepoints to determine the kinetics of seroconversion. 
serum samples were obtained from the following cohorts: ( ) a random selection of individuals (n = ) from a national (dutch) cohort representing all age groups and obtained years prior to sars-cov- emergence (pienter study, netherlands trial register number nl ); ( ) individuals (supplementary table ) with proven non-sars-cov- ili caused by human coronaviruses (n = , hcov ili) or other viruses (n = , non-hcov ili) obtained from the national institute for public health and the environment (rivm), bilthoven, the netherlands (trial register number nl ) [ ] , and from erasmus medical center, rotterdam, collected prior to the sars-cov- outbreak and at least weeks after polymerase chain reaction (pcr) detection of the virus; and ( ) the steps in assay validation were similar to recently developed bead-based multiplex immunoassays for cmv, ebv, and rsv, with minor modifications as described below [ , ] . for the multiplex bead-based immune assay the following antigens obtained from sino biological were used: sars-cov- monomeric spike s ( -v h), rbd ( -v b), and nucleoprotein (n) ( -v b). microplex fluorescent beads were activated in mm -(n-morpholino)ethanesulfonic acid (mes) ph . . the proteins were diluted to a concentration of . mg/ml in phosphate-buffered saline (pbs) ph . and added at µg per µl of activated beads. an internal reference sample was created by pooling sera of covid- patients with varying immunoglobulin g (igg) concentrations. an arbitrary antibody concentration unit of was assigned on the basis of the mean fluorescence intensity (mfi) signal in the upper limit of linearity of a -fold serial dilution of the reference sample. sera ( µl) diluted : and : in sm buffer (surmodics) plus % fetal calf serum were incubated with antigen-coated beads for minutes at room temperature at rpm in the dark. following incubation, samples were washed times with pbs, incubated with phycoerythrin-conjugated goat anti-human igg for minutes and washed. 
samples were acquired on an lx or fm d (luminex). mfi was converted to arbitrary units (au/ml) by interpolation from a -parameter logistic standard curve, using bioplex manager . (bio-rad laboratories) software and exported to microsoft excel. different batches of antigen-conjugated beads were incubated with serially diluted sera to test linearity and parallelism between bead conjugations, reference, and serum samples. assay robustness was tested by analyzing a serum panel by different operators on independent days using different bead and reference batches. the ability to discriminate igg concentrations between covid- patients and controls was evaluated by receiver operator characteristic (roc) analyses. to select the optimal assay defaults, both the youden j statistic, which balances between sensitivity and specificity, and a specificity-optimized cutoff (specificity of at least . % for low-prevalence settings of %- %) were selected. data were entered into graphpad prism . . to generate graphs and perform statistical analyses. reproducibility of the assay was evaluated using r and coefficient of variation (% cv) calculated by standard deviation divided by average × . for the roc analyses antibody concentrations of cross-sectional pienter participants (n = ), ili patients with coronavirus (n = ), or other viral infection (n = ) were used as the negative control group and pcr-confirmed covid- samples (n = ) with various clinical severities were used in the positive group. we selected for serum samples that were obtained more than days post onset of disease symptoms to meet a reasonable degree of seroconversion, as shown in recent reports [ , ] . both the youden j statistic-determined cutoff and the specificity-optimized cutoff (specificity of at least . %) were determined. 
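the two cutoff rules described above can be sketched in a few lines. this is a minimal illustration and not the authors' graphpad prism workflow: the function names, the example concentrations, and the convention that a sample is positive when its concentration is at or above the cutoff are all assumptions made for the example.

```python
# Minimal sketch of ROC-style cutoff selection (illustrative, not the paper's code).
# Assumption: a sample is scored positive when its concentration >= cutoff.

def sens_spec(neg, pos, cutoff):
    """Return (sensitivity, specificity) for one cutoff."""
    true_pos = sum(v >= cutoff for v in pos)
    true_neg = sum(v < cutoff for v in neg)
    return true_pos / len(pos), true_neg / len(neg)


def candidate_cutoffs(neg, pos):
    """Every observed concentration is tried as a cutoff, in ascending order."""
    return sorted(set(neg) | set(pos))


def youden_cutoff(neg, pos):
    """Cutoff that maximizes Youden's J = sensitivity + specificity - 1."""
    best_cutoff, best_j = None, float("-inf")
    for c in candidate_cutoffs(neg, pos):
        sens, spec = sens_spec(neg, pos, c)
        j = sens + spec - 1
        if j > best_j:
            best_cutoff, best_j = c, j
    return best_cutoff


def specificity_cutoff(neg, pos, min_spec):
    """Lowest cutoff whose specificity is at least min_spec.

    Specificity never decreases as the cutoff rises, so the lowest
    qualifying cutoff is also the one that keeps the most sensitivity.
    Returns None if no observed value reaches min_spec.
    """
    for c in candidate_cutoffs(neg, pos):
        _, spec = sens_spec(neg, pos, c)
        if spec >= min_spec:
            return c
    return None


# Hypothetical concentrations in AU/ml (made up for the example).
neg_au = [0.2, 0.4, 0.5, 0.9, 1.1, 3.0]   # controls
pos_au = [0.8, 2.0, 4.0, 6.0, 9.0]        # PCR-confirmed cases

print(youden_cutoff(neg_au, pos_au))
print(specificity_cutoff(neg_au, pos_au, min_spec=1.0))
```

on these made-up data the specificity-constrained cutoff lies above the youden cutoff, mirroring the "lower" and "higher" cutoffs the text describes; the trade-off is a loss of sensitivity at the higher cutoff.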
to compare differences in concentrations, data were log-transformed and tested with either a t test between groups, or -way anova and tukey's multiple comparison test to compare multiple groups; adjusted p values are reported. antibody kinetics were fitted using a nonlinear -parameter least-squares fit in graphpad prism . . . we prepared a reference serum by pooling pcr-confirmed covid- sera and tested serial dilutions in the multiplex assay consisting of distinct fluorescent beads coupled to sars-cov- nucleoprotein (n), s , and the s subunit rbd (figure a). this was repeated for varying batches of beads to assess consistency of performance. the assay was able to quantify concentrations in a to -fold concentration range, using a single dilution of the serum. to reliably quantify antibody concentrations between the reference serum and test samples, we confirmed that the reference and a selection of samples display the same rate of decline of fluorescence signal with increasing dilutions, which is referred to as parallelism (figure b). these data show that the triplex assay is a highly quantitative assay to detect antibodies to sars-cov- . applying an assay in large population and longitudinal studies requires reproducibility of assay results. therefore, antibody concentrations were determined in independent experiments performed on different days, using a selection of samples for rbd and samples for n and s with different concentrations of sars-cov- antibodies (figure c). in addition, the reproducibility test was performed by different technicians using different bead batches and references to reflect the expected maximum variability of the assay over time. comparison of sample data determined in independent assay runs resulted in an r of . , . , and . for n, s , and rbd, respectively (figure c). the obtained % cvs were . , . , and . for n, s , and rbd, respectively, showing that the assay results were reproducible. 
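as a small worked example of the reproducibility metric, the sketch below computes % cv (standard deviation divided by the mean, expressed as a percentage) for repeated measurements of the same sample. the use of the sample standard deviation, the function names, and the concentrations are assumptions for illustration; the text does not state which standard deviation estimator was used.

```python
# Minimal sketch of a between-run %CV calculation (illustrative values only).
from statistics import mean, stdev


def percent_cv(values):
    """Coefficient of variation in percent: sample SD / mean * 100."""
    return stdev(values) / mean(values) * 100


def mean_percent_cv(runs_by_sample):
    """Average the per-sample %CV over all samples measured in several runs."""
    return mean(percent_cv(runs) for runs in runs_by_sample.values())


# Hypothetical concentrations (AU/ml) of two sera measured in three runs.
runs_by_sample = {
    "serum_a": [10.0, 12.0, 11.0],
    "serum_b": [100.0, 100.0, 100.0],
}

for name, runs in runs_by_sample.items():
    print(name, round(percent_cv(runs), 2))
print("mean %CV", round(mean_percent_cv(runs_by_sample), 2))
```

averaging per-sample % cv values is only one way to summarize an analyte; reporting the full distribution across samples would be equally defensible.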
sera of pcr-confirmed covid- patients after days of symptoms were tested in the assay and the results compared to a control panel of sera collected prior to the outbreak of sars-cov- . in covid- patients, high concentrations of igg were observed to all antigens (figure a ). despite clear discrimination of igg concentrations between groups of control and covid- patients, some samples overlapped between the groups. therefore, the specificity and sensitivity of the assay to discriminate between covid- patients and controls using igg concentrations was evaluated by an established statistical standard to analyze assay performance, the roc analyses. for the roc analyses, concentration data of hospitalized and nonhospitalized covid- disease cases were included to provide a realistic evaluation of the performance of the assay (figure a ). the area under the curves ranged from . to . ( figure b ). the roc generated cutoff concentrations of . , . , and . au/ml using the roc youden j statistic. to gain a higher specificity of the assay optimized for a low population seroprevalence, the cutoff concentrations were . , . , and . for n, s , and rbd, respectively ( figure c ). the latter cutoffs resulted in a specificity of . %, . %, and . % at a sensitivity of . %, . %, and . % for n, s , and rbd, respectively. to study how our assay discriminates between antibodies of individuals with different laboratory-confirmed viral infections, antibodies were measured in a cross-sectional population panel (n = ), a panel of noncorona ili patients (non-hcov ili, n = ), and non-sars-cov- corona ili patients (hcov ili, n = ) and compared to pcr-confirmed covid- patients' samples. some of the covid- patients were admitted to hospital (n = ) because of severe covid- and these were compared to nonhospitalized covid- cases (n = ). for each of the negative control groups the majority of the samples had concentrations below the cutoff for all antigens ( figure a ). 
the number of false-positive samples ranged from to out of or samples tested for the different antigens. the non-hcov and hcov ili panels were from persons infected with multiple different non-sars viruses, including different endemic coronaviruses (supplementary table ). the proportion of false positives did not increase by testing the convalescent sera from patients with a laboratory (pcr)-confirmed infection with either of the seasonal coronaviruses (figure b, and data not shown), indicating that the antigens used in the assay are selective for sars-cov- -induced antibodies. comparison of pcr-confirmed sars-cov- patients' samples shows that all hospitalized patients developed antibodies to n and the majority of hospitalized patients developed antibodies to s and rbd. the majority of the nonhospitalized cases showed antibody concentrations above the cutoff for n, whereas around % of the nonhospitalized patients did not produce antibodies above the cutoffs for s and rbd. overall, the concentrations of antibodies in serum samples from patients that were hospitalized were significantly higher compared to patients that were not hospitalized. following infection, an immune response is initiated, resulting in the production of serum antibodies. to study the time between onset of disease symptoms and the development of antibodies, paired serum samples were collected from the majority of patients. data were separated for patients that were either admitted to the hospital or not (figure a and b). apart from the paired samples from patients that were obtained before days after onset of disease, all other hospitalized cases showed seroconversion for all antigens tested (figure a). in line with other reports, hospitalized covid- patients seroconverted around day of disease onset. of nonhospitalized cases, seroconverted, whereas showed slight increases in concentrations but failed to formally cross the cutoff value for any of the analytes to be regarded as a specific seroconversion. 
hospitalized patients reached a plateau of antibody production shortly after weeks from onset of symptoms, which took at least days for the nonhospitalized cases ( - fold lower slope; figure c and d). as a consequence of the slower increase of antibody concentrations the time to detectable antibodies was delayed, especially with respect to antibodies reacting to s and rbd. the variance in the nonhospitalized cases was high compared to the hospitalized cases, which is illustrated by the lower r of the nonlinear least square fit of the patient groups.

figure . ability of the assay to identify covid- patients. a, control sera (n = ) and covid- sera (n = ) collected after day of symptoms were tested and compared for concentrations of igg. median concentration and % confidence intervals are shown. b, the sera tested in (a) were analyzed by roc. c, the roc data were used to determine youden j statistic cutoff (lower cutoff) and a specificity-optimized cutoff of at least . % specificity (higher cutoff). abbreviations: au, arbitrary unit; igg, immunoglobulin g; n, nucleoprotein; rbd, receptor binding domain; roc, receiver operator characteristic; s , spike protein subunit .

table ) and concentration data of ili patients are shown to confirm that the assay discriminates sars-cov- -specific antibodies from antibodies induced by various laboratory-confirmed viral infections. abbreviations: au, arbitrary unit; covid- , coronavirus disease ; hcov, human coronavirus; mers-cov, middle east respiratory syndrome coronavirus; n, nucleoprotein; non-hcov, noncoronavirus; rbd, receptor binding domain; rsv, respiratory syncytial virus; s , spike protein subunit .

the engagement of different structural sars-cov- proteins in serological determination (multiplex testing) instead of protein could improve the sensitivity and the specificity. if only analyte is analyzed, the sensitivities for hospitalized cases were . %, . 
%, and % for rbd, s , and n, respectively, using the specificity-optimized cutoff (table ). using the roc youden j statistic cutoff the sensitivities were . % for both s and rbd and % for n. nonhospitalized cases typically had lower concentrations of igg, which reduced the sensitivity: . % for s and up to . % for n using the specificity-optimized cutoff. using the youden j statistic cutoff, the sensitivity increased to . % for s . in this multiplex approach an increased sensitivity can be obtained by evaluating a sample as positive when any one of the antibody concentrations determined is higher than the set cutoff (logical or analysis in table ). any combination of antigens that included n reached a sensitivity of % in hospitalized cases; in nonhospitalized cases sensitivity ranged from . % (s or rbd) up to . % (n or s or rbd) using the specificity-optimized cutoff. applying the youden j statistic cutoff resulted in a sensitivity for nonhospitalized cases of at least . % (n or s ) up to . % (n or s or rbd). the specificity of the youden j analyses using n or s or rbd dropped to . %. this specificity is far too low for serosurveillance purposes in areas of low prevalence. the specificity-optimized cutoff ( . %- . %) is clearly better, which may be considered adequate if the true prevalence in the population is above %. because in most countries the overall covid- seroprevalence is currently under %, high specificity is required to provide reliable seroprevalence estimates (illustrated in supplementary table ). this could be achieved by defining a sample positive when at least antibody test results in multiplex are above the cutoff. this resulted in a specificity of % for any of the combinations and both the specificity-optimized and the youden j statistic-determined cutoffs (logical and; table ). as expected, this increased specificity comes at the expense of the sensitivity. here, if only s and rbd are taken into consideration, this combination resulted in the highest possible sensitivity of . 
% and . % for nonhospitalized and hospitalized patients, respectively. we aimed to develop a high-throughput quantitative assay to measure true concentrations of antibodies to spike s , spike rbd, and n of sars-cov- . the assay presented here uses a very small sample volume, which can be obtained from, for example, fingerstick blood, while retaining highly quantitative output. this bead-based multiplex immunoassay generates robust results and is able to discriminate covid- cases with different degrees of disease severity, especially from day of disease onward. the results presented here provide detailed insight into the performance of the assay in terms of parallelism between the references and sera containing different concentrations of antibodies. in addition, we show consistency of assay results when the same samples are measured on independent days, by different investigators using different batches of reagents, thereby incorporating all major sources of variability. large population studies are in high demand to provide insight into the spread of the virus and the protective status of the population, which can be used by policy makers to manage the pandemic or lift the lockdown measures [ , , , ] . assay results have to be accurate to generate reliable seroprevalence data for the general population. in addition to knowing the performance of an assay, we need to understand how the majority of infections in the general population relate to the induction of detectable antibodies. our data comparing hospitalized and nonhospitalized cases revealed that milder disease results in both lower levels of antibodies and later seroconversion, which is in line with previous reports [ , ] . also, comorbidities may play a role in the production of specific serum antibodies following infection, which warrants further study. 
approximately % of the nonhospitalized cases in our selection did not show any seroconversion at all, indicating that such mild infections may not be detected by serological assays. however, assay performance could be improved by adding other sars-cov- proteins or subunits of these to further improve the sensitivity of the assay to detect low seroconversion in some cases. essential performance characteristics of assays aiming to identify seroprevalence in the population are the specificity and sensitivity. the specificity and sensitivity determine the positive and negative predictive value (ppv and npv) of the assay given the prevalence of seropositivity in the population [ ] . in current low-prevalence settings insufficient specificity will generate a low ppv, resulting in a significant overestimation of the proportion of seropositive individuals (illustrated in supplementary table ). however, the accuracy of the reported sensitivity and specificity of an assay also highly depends on the patient selection used for this evaluation, for example, using sera of severe covid- patients will result in beneficial statistics of an assay because of the acknowledged higher antibody concentration and seroconversion rate [ ] . these statistics will not apply in a population serosurvey where the majority of persons will not develop severe covid- . for this reason, we included a heterogeneous group of covid- patients' samples, consequently reducing sensitivity. scoring samples positive if at least of the analytes generated positive results improved the specificity of the assay to % at a sensitivity > %. at a true seroprevalence of %, this would provide a seroprevalence estimate of . % and therefore would be much more accurate than using a single analyte. we recommend transparent reporting of underlying assay performance using heterogeneous panels of controls and covid- patients. 
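the relationship between the multiplex scoring rule, assay specificity, and the resulting seroprevalence estimate can be made concrete with a short sketch. everything named here is hypothetical: the analyte keys, cutoffs, and the sensitivity, specificity, and prevalence values are invented for illustration and are not the figures reported in this study. the formulas themselves are the standard ones: apparent prevalence = p × sensitivity + (1 − p) × (1 − specificity), and ppv = true positives / all positives.

```python
# Minimal sketch: "at least k analytes positive" scoring, apparent prevalence, PPV.
# All numbers below are hypothetical and chosen only to show the arithmetic.

def is_positive(concentrations, cutoffs, min_hits):
    """Score a sample positive if at least min_hits analytes reach their cutoff.

    min_hits=1 corresponds to the logical OR rule; a higher min_hits
    corresponds to the stricter AND-style rule.
    """
    hits = sum(concentrations[a] >= cutoffs[a] for a in cutoffs)
    return hits >= min_hits


def apparent_prevalence(true_prev, sens, spec):
    """Fraction of the population that tests positive (true + false positives)."""
    return true_prev * sens + (1 - true_prev) * (1 - spec)


def ppv(true_prev, sens, spec):
    """Positive predictive value: share of positive results that are true."""
    true_pos = true_prev * sens
    false_pos = (1 - true_prev) * (1 - spec)
    return true_pos / (true_pos + false_pos)


# Hypothetical sample and cutoffs (AU/ml).
sample = {"n": 5.0, "s1": 0.5, "rbd": 3.0}
cutoffs = {"n": 2.0, "s1": 1.0, "rbd": 2.5}
print(is_positive(sample, cutoffs, min_hits=1))   # OR rule
print(is_positive(sample, cutoffs, min_hits=3))   # all three required

# Hypothetical low-prevalence setting: 4% true seroprevalence.
for spec in (0.98, 1.0):
    print(spec,
          round(apparent_prevalence(0.04, 0.90, spec), 4),
          round(ppv(0.04, 0.90, spec), 4))
```

in this made-up setting a specificity of 98% inflates a true prevalence of 4% to an apparent 5.52% with a ppv of about 0.65, whereas a specificity of 100% yields an apparent 3.6% (an underestimate driven only by imperfect sensitivity) with a ppv of 1, which is the qualitative point made in the text.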
furthermore, implementation of international reference materials as being distributed by, for example, the national institute for biological standards and control, to facilitate comparison of seroepidemiological data between studies and countries is greatly recommended [ , ] . from an immunological point of view, it needs to be established which sars-cov- -specific antibodies correlate with protection. antibodies to rbd of s have been shown to associate with neutralization of the virus in vitro, and preliminary data indicate that the antibodies reported in our assay correlate quantitatively with virus neutralization in vitro as well [ ] . the data presented here show detection of total igg. another study has shown that igg subclasses are not equally induced by sars-cov- infection, with a bias towards the production of igg , at least in the first weeks after infection [ ] . infection with sars-cov- also induces the production of iga and igm, which can contribute to protection and in vitro neutralization of the virus, but these isotypes are currently not captured by our assay [ , , ] . follow-up studies are needed to establish the longevity of the production of antibodies, the degree of protection antibodies confer through various fc receptor-mediated and other mechanisms, and how b-cell memory is induced. such studies should also consider different viral loads detected in a patient and degree of severity of covid- . in conclusion, we developed a robust multiplex assay to detect antibodies to sars-cov- in small blood volumes. our study is unique in validating the assay against hcov and non-hcov ili panels. because of the differences in seroconversion rates and quantitative antibody concentrations among nonhospitalized covid- cases, which represents the majority of patients in the general population, further investigation is required to improve assay performance for serosurveys in general. 
we show the advantages of multiplexed analysis in determining seroconversion and provide a framework for reliable seroprevalence estimates in different settings.
supplementary materials are available at the journal of infectious diseases online. consisting of data provided by the authors to benefit the reader, the posted materials are not copyedited and are the sole responsibility of the authors, so questions or comments should be addressed to the corresponding author.
immune surveillance for vaccine-preventable diseases
developing antibody tests for sars-cov-
towards the next phase: evaluation of serological assays for diagnostics and exposure assessment
ace receptor expression and severe acute respiratory syndrome coronavirus infection depend on differentiation of human airway epithelia
structure of the sars-cov- spike receptor-binding domain bound to the ace receptor
neutralizing antibodies against sars-cov- and other human coronaviruses
temporal profiles of viral load in posterior oropharyngeal saliva samples and serum antibody responses during infection by sars-cov- : an observational cohort study
profiling early humoral response to diagnose novel coronavirus disease (covid- )
evaluation of nine commercial sars-cov- immunoassays. medrxiv
test performance evaluation of sars-cov- serological assays. medrxiv
the role of antibody testing for sars-cov- : is there one?
world health organization
a serological assay to detect sars-cov- seroconversion in humans
third national biobank for population-based seroprevalence studies in the netherlands, including the caribbean netherlands
immunogenicity of -valent pneumococcal conjugate vaccine administered according to different primary immunization schedules in infants: a randomized clinical trial
the development of a bead-based multiplex immunoassay for the detection of igg antibodies to cmv and ebv
development and standardization of a high-throughput multiplex immunoassay for the simultaneous quantification of specific antibodies to five respiratory syncytial virus proteins
influenza-like illness incidence is not reduced by influenza vaccination in a cohort of older adults, despite effectively reducing laboratory-confirmed influenza virus infections
severe acute respiratory syndrome coronavirus -specific antibody responses in coronavirus disease patients
antibody responses to sars-cov- in patients with covid-
diagnostic tests: how to estimate the positive predictive value
antibody responses to sars-cov- in patients of novel coronavirus disease
national institute for biological standards and control. coronavirus (covid- )-related research reagents available from the nibsc
detection of sars-cov- -specific humoral and cellular immunity in covid- convalescent individuals
longitudinal change of severe acute respiratory syndrome coronavirus antibodies in patients with coronavirus disease
acknowledgments. the authors acknowledge jorgen de jonge and puck van kasteren for critically reviewing our manuscript and gert-jan godeke for providing technical assistance.
financial support. this work was supported by the national institute for public health and the environment, the netherlands.
potential conflicts of interest. all authors: no reported conflicts of interest. all authors have submitted the icmje form for disclosure of potential conflicts of interest. 
conflicts that the editors consider relevant to the content of the manuscript have been disclosed. key: cord- - wfcc y authors: li, tingting; cai, hongmin; yao, hebang; zhou, bingjie; zhang, ning; gong, yuhuan; zhao, yapei; shen, quan; qin, wenming; hutter, cedric a.j.; lai, yanling; kuo, shu-ming; bao, juan; lan, jiaming; seeger, markus a.; wong, gary; bi, yuhai; lavillette, dimitri; li, dianfan title: a potent synthetic nanobody targets rbd and protects mice from sars-cov- infection date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: wfcc y sars-cov- , the causative agent of covid- , recognizes host cells by attaching its receptor-binding domain (rbd) to the host receptor ace – . neutralizing antibodies that block rbd-ace interaction have been a major focus for therapeutic development – . llama-derived single-domain antibodies (nanobodies, ∼ kda) offer advantages including ease of production and possibility for direct delivery to the lungs by nebulization , which are attractive features for bio-drugs against the global respiratory disease. here, we generated synthetic nanobodies (sybodies) by in vitro selection using three libraries. the best sybody, mr bound to rbd with high affinity (kd = . nm) and showed high neutralization activity against sars-cov- pseudoviruses (ic = . μg ml− ). structural, biochemical, and biological characterization of sybodies suggest a common neutralizing mechanism, in which the rbd-ace interaction is competitively inhibited by sybodies. various forms of sybodies with improved potency were generated by structure-based design, biparatopic construction, and divalent engineering. among these, a divalent mr conjugated with the albumin-binding domain for prolonged half-life displayed highest potency (ic = ng ml− ) and protected mice from live sars-cov- challenge. our results pave the way to the development of therapeutic nanobodies against covid- and present a strategy for rapid responses for future outbreaks. 
the binding through hydrophobic interactions and h-bonding that involves both side chains and main chains (fig. d). in addition, tyr , a framework residue, also participated in binding by forming an h-bond with the rbd gly backbone. mr also binds to the rbd at the 'seat' and 'backrest' regions but approaches the rbd from an almost exactly opposite direction to sr (fig. c, e), indicating divergent binding modes for these sybodies. the binding of mr to the rbd occurred on an . Å surface area with noticeable electrostatic complementarity (extended data fig. b). interestingly, this surface was largely shared with the sr binding surface (fig. f). the interactions between mr and the rbd were mainly mediated by h-bonding. apart from the three cdrs, two framework residues, lys and tyr , interacted with the same rbd residue glu , via a salt bridge with its side chain, and an h-bond with its main chain (fig. g).
molecular mechanism for neutralization
structure alignment of sr -, mr - and ace -rbd showed that both sybodies engage with rbd at the receptor-binding motif (rbm) (fig. a, b). superposing sr and mr to the s trimer showed both sybodies could bind to the 'up' conformation of rbd with no steric clashes (fig. c, d), and to the 'down' conformation with only minor clashes (extended data fig. ) owing to their minute sizes. consistent with the structural observations, both sr and mr inhibited the binding of ace to rbd, as revealed by bio-layer interferometry (bli) assays (fig. e, f). to probe the epitope for mr without a structure, competitive bli binding assays were carried out. the results showed that mr could block ace (fig. g), and sr and mr (fig. h, i), suggesting it also binds to at least part of the rbm, although the possibility of allosteric inhibition remains to be investigated. 
taken together, sr and mr , and probably mr , neutralize sars-cov- by competitively blocking the
for biparatopic fusion, we first identified two sybodies, namely lr and lr (fig. a, b), that could bind rbd in addition to mr using the bli assay. as lr showed higher affinity and neutralization activity than lr (fig. a), we fused this non-competing sybody to the n-terminus of mr with gs linkers of various lengths ranging from to amino acids (extended data table s ). interestingly, the linker length had little effect on neutralization activity and these biparatopic lr -mr sybodies were more potent than either sybody alone (fig. a) with an ic of . g ml - (fig. c). lr -mr may be more tolerant to escape mutants - owing to its ability to recognize two distinct epitopes.
this decreased ic by fold for fc-mr ( ng ml - ) and fold for fc-mr ( . g ml - ), respectively (fig. d, e). consistently, the fc fusion increased the apparent binding affinity for both sybodies, with a kd of . nm for fc-mr and less than pm for fc-mr (extended data fig. h, i). note, however, that fc-mr did not gain as much in neutralization potency as in apparent binding affinity.
table ). the optimal construct for mr m-mr m had the shortest linker ( -gs) (fig. d, e). by contrast, optimal neutralization activity was observed with the longest linker ( -gs) for mr -mr (fig. d, e). again, mr -mr was superior to mr m-mr m, showing a -fold higher neutralization activity with an ic of ng ml - (fig. e). compared to the monovalent mr (ic of . g ml - ), the divalent
the most potent divalent sybody (mr -mr ) was chosen to investigate the potential of nanobodies to protect mice from sars-cov- infection. nanobodies have very short serum half-lives of several minutes due to their minute size . 
to circumvent this, we fused mr -mr to the n-terminus of an albumin-binding domain (abd) which has been known to extend the circulating half-life of its fusion partners by increase in size and preventing intracellular degradation . conveniently, we expressed mr -mr -abd in pichia pastoris, which is the preferred host to express nanobody therapeutics owing to its robustness and its endotoxin-free production. small-scale expression of mr -mr -abd showed a secretion level of ~ mg l - with an apparent purity of > % without purification (fig. a) . note, this experiment was carried out using a shaker which gave cell density of od of . given its ability to grow to od of without compromising yield, the expression level of mr -mr -abd may reach . g l - in fermenters. the potential for simple and high-yield production is especially attractive for the pandemic at a global scale. the construct for the rbd with an avi-tag for biotinylation was made by fusing dna, from '-to '-end, of the encoding sequence for the honey bee melittin signal neutralization assay results for sars-cov- pseudovirus. (b) neutralization assay results for sars-cov pseudovirus. veroe -hace cells were infected with a premix of pseudotypes and sybodies at two concentrations ( m and nm) a-i) biotinylated rbd immobilized on a streptavidin-coated sensor was titrated with various concentrations (nm) of sybodies as indicated open-book' view of molecular electrical potential surfaces of the interface between the rbd and sr (a) and between the rbd and mr (b). the electrical potential maps were calculated by adaptive poisson-boltzmann solver (apbs) built-in in pymol structure-based design of a mr mutant (mr m) with improved affinity and potency. (a,b) neutralization assay for sars-cov- (a) or sybody concentrations were used at m (green) and nm (magenta) concentrations. data are from three independent experiments electrostatic repel and hydrophobic mismatch would make lys unfavorable at this position. 
according to the original library design, lys was unvaried , meaning that lys was not selected and hence opportunities for optimization. (e) the k y mutation fits the hydrophobic microenvironment well, as revealed by the crystal structure of mr m (extended data table ). (f) binding kinetics of mr m binding to rbd. bli signals were recorded under ic values (g ml - ) for sars-cov- are indicated in brackets. data for mr are from data are from three independent experiments extended data fig. . evaluation of in vivo stability and toxicity of nanobodies. (a) for neutralization assay, sera were preincubated with sars-cov- pseudovirus for h before infection at / dilution. the infection rates on veroe - hace were measure by facs days post infection. (b) body weight changes. the body weight data are presented as means  the sd of mice in each group (n= ). no significant differences are observed. (c) representative histopathology of the lungs, heart, liver, spleen, lungs, kidney, and thymus for the different sybodies injected the images and areas of interest are magnified ×. 
bars indicate m a novel coronavirus outbreak of global health concern structure, function, and antigenicity of the sars-cov- spike glycoprotein cryo-em structure of the -ncov spike in the prefusion conformation structure of the sars-cov- spike receptor-binding domain bound to the ace receptor structural basis of receptor recognition by sars-cov- structural and functional basis of sars-cov- entry by using human ace structural basis for the recognition of sars-cov- by full-length human ace a noncompeting pair of human neutralizing antibodies block covid- virus binding to its receptor ace potent neutralizing antibodies against sars-cov- identified by high-throughput single-cell sequencing of convalescent patients' b cells human neutralizing antibodies elicited by sars-cov- infection general strategy to humanize a camelid single-domain antibody and identification of a universal humanized nanobody scaffold isolation of potent sars-cov- neutralizing antibodies and protection from disease in a small animal model studies in humanized mice and convalescent humans yield a sars-cov- antibody cocktail an alpaca nanobody neutralizes sars-cov- by blocking receptor interaction. biorxiv neutralizing nanobodies bind sars-cov- spike rbd and block interaction with selection, biophysical and structural analysis of synthetic nanobodies that effectively neutralize sars-cov- synthetic nanobodies targeting the sars-cov- receptor-binding domain an ultra-high affinity synthetic nanobody blocks sars-cov- infection by locking spike into an inactive conformation. 
biorxiv nanobodies® as inhaled biotherapeutics for lung diseases the socio-economic implications of the coronavirus pandemic (covid- ): a review cell entry mechanisms of sars-cov- sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor convergent antibody responses to sars-cov- in convalescent individuals natural single-domain antibodies yeast surface display platform for rapid discovery of conformationally selective nanobodies synthetic single domain antibodies for the conformational trapping of membrane proteins generation of synthetic nanobodies against delicate proteins nali-h : a universal synthetic library of humanized nanobodies providing highly functional antibodies and intrabodies an improved yeast surface display platform for the screening of nanobody immune libraries selection, biophysical and structural analysis of synthetic nanobodies that effectively neutralize sars-cov- . biorxiv the therapeutic potential of nanobodies inference of macromolecular assemblies from crystalline state tracking changes in sars-cov- spike: evidence that d g increases molecular cancer therapeutics fusion to a highly stable consensus albumin binding domain allows for tunable pharmacokinetics. protein engineering, design & selection : peds generation of a broadly useful model for covid- pathogenesis, vaccination, and treatment a human neutralizing antibody targets the receptor-binding site of sars-cov- a sars-cov- infection model in mice demonstrates protection by neutralizing antibodies enzymatic assembly of dna molecules up to several hundred kilobases a fluorescence-detection size-exclusion chromatography- based thermostability assay for membrane protein precrystallization screening the protein complex crystallography beamline (bl u ) at the shanghai synchrotron radiation facility how good are my data and what is the resolution? 
phaser crystallographic software features and development of coot phenix: a comprehensive python-based system for macromolecular structure solution the pymol molecular graphics system, version . schrödinger, llc electrostatics of nanosystems: application to microtubules and the ribosome key: cord- -sgm q i authors: walter, justin d.; hutter, cedric a.j.; zimmermann, iwan; wyss, marianne; earp, jennifer; egloff, pascal; sorgenfrei, michèle; hürlimann, lea m.; gonda, imre; meier, gianmarco; remm, sille; thavarasah, sujani; plattet, philippe; seeger, markus a. title: sybodies targeting the sars-cov- receptor-binding domain date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: sgm q i the covid- pandemic, caused by the novel coronavirus sars-cov- , has resulted in a global health and economic crisis of unprecedented scale. the high transmissibility of sars-cov- , combined with a lack of population immunity and prevalence of severe clinical outcomes, urges the rapid development of effective therapeutic countermeasures. here, we report the generation of synthetic nanobodies, known as sybodies, against the receptor-binding domain (rbd) of sars-cov- . in an expeditious process taking only twelve working days, sybodies were selected entirely in vitro from three large combinatorial libraries, using ribosome and phage display. we obtained six strongly enriched sybody pools against the isolated rbd and identified unique anti-rbd sybodies which also interact in the context of the full-length sars-cov- spike ectodomain. among the selected sybodies, six were found to bind to the viral spike with double-digit nanomolar affinity, and five of these also showed substantial inhibition of rbd interaction with human angiotensin-converting enzyme (ace ). additionally, we identified a pair of anti-rbd sybodies that can simultaneously bind to the rbd. 
it is anticipated that compact binders such as these sybodies could feasibly be developed into an inhalable drug that can be used as a convenient prophylaxis against covid- . moreover, generation of polyvalent antivirals, via fusion of anti-rbd sybodies to additional small binders recognizing secondary epitopes, could enhance the therapeutic potential and guard against escape mutants. we present full sequence information and detailed protocols for the identified sybodies, as a freely accessible resource. the ongoing pandemic arising from the emergence of the novel coronavirus, sars-cov- , demands urgent development of effective antiviral therapeutics. several factors contribute to the adverse nature of sars-cov- from a global health perspective, including the absence of herd immunity [ ] , high transmissibility [ , ] , the prospect of asymptomatic carriers [ ] , and a high rate of clinically severe outcomes [ ] . moreover, a vaccine against sars-cov- is unlikely to be available for at least - months [ ] , despite earnest development efforts [ , ] , making alternative intervention strategies paramount. in addition to offering relief for patients suffering from the resulting covid- disease, therapeutics may also reduce the viral transmission rate by being administered to asymptomatic individuals subsequent to probable exposure [ ] . finally, given that sars-cov- represents the third global coronavirus outbreak in the past years [ , ] , development of rapid therapeutic strategies during the current crises could offer greater preparedness for future pandemics. akin to all coronaviruses, the viral envelope of sars-cov- harbors protruding, club-like, multidomain spike proteins that provide the machinery enabling entry into human cells [ ] [ ] [ ] . the spike ectodomain is segregated into two regions, termed s and s . 
the outer s subunit of sars-cov- is responsible for host recognition via interaction between its c-terminal receptor-binding domain (rbd) and human angiotensin converting enzyme (ace ), present on the exterior surface of airway cells [ , ] . while there is no known host-recognition role for the s n-terminal domain (ntd) of sars-cov- , it is notable that s ntds of other coronaviruses have been shown to bind host surface glycans [ , ] . in contrast to spike region s , the s subunit contains the membrane fusion apparatus, and also mediates trimerization of the ectodomain [ ] [ ] [ ] . prior to host recognition, spike proteins exist in a metastable pre-fusion state wherein the s subunits lay atop the s region and the rbd oscillates between "up" and "down" conformations that are, respectively, accessible and inaccessible to receptor binding [ , , ] . upon processing at the s /s and s ' cleavage sites by host proteases as well as engagement to the receptor, the s subunit undergoes dramatic conformational changes from the pre-fusion to the post-fusion state. such structural rearrangements are associated with the merging of the viral envelope with host membranes, thereby allowing injection of the genetic information into the cytoplasm of the host cell [ , ] . coronavirus spike proteins are highly immunogenic [ ] , and several experimental approaches have sought to target this molecular feature for the purpose of viral neutralization [ ] . the high specificity, potency, and modular nature of antibody-based antiviral therapeutics has shown exceptional promise [ ] [ ] [ ] , and the isolated, purified rbd has been a popular target for the development of anti-spike antibodies against pathogenic coronaviruses [ ] [ ] [ ] [ ] . 
however, binders against the isolated rbd may not effectively engage the aforementioned pre-fusion conformation of the full spike, which could account for the poor neutralization ability of recently described single-domain antibodies that were raised against the rbd of sars-cov- [ ]. therefore, to better identify molecules with qualities befitting a drug-like candidate, it would be advantageous to validate rbd-specific binders in the context of the full, stabilized, pre-fusion spike assembly [ , ].
single-domain antibodies based on the variable vhh domain of heavy-chain-only antibodies of camelids, generally known as nanobodies, have emerged as a broadly utilized and highly successful antibody fragment format [ ]. nanobodies are small, stable, and inexpensive to produce in bacteria and yeast [ ], yet they bind targets in a similar affinity range as conventional antibodies. due to their minimal size, they are particularly suited to reach hidden epitopes such as crevices of target proteins [ ]. we recently designed three libraries of synthetic nanobodies, termed sybodies, based on elucidated structures of nanobody-target complexes (fig. a) [ , ]. sybodies can be selected against any target protein within twelve working days, considerably faster than natural nanobodies, which require repeated immunization over a period of two months prior to binder selection by phage display [ ] (fig. c). a considerable advantage of our platform is that sybody selections are carried out under defined conditions; in the case of coronavirus spike proteins, this offers the opportunity to generate binders recognizing the metastable pre-fusion conformation [ , ]. finally, due to the feasibility of inhaled therapeutic nanobody formulations [ ], virus-neutralizing sybodies could offer a convenient and direct means of prophylaxis.
here, we report the in vitro selection and characterization of sybodies against the rbd of the sars-cov- spike protein.
two independently prepared rbd constructs were used for in vitro sybody selections, and resulting single clones that could bind the full spike ectodomain were sequenced, expressed, and purified. six unique sybodies show favorable binding affinity to the sars-cov- spike, and five of these were also found to substantially attenuate the interaction between the viral rbd and human ace . moreover, pairs of sybodies were identified that can simultaneously bind to the rbd. we present all sequences for these clones, along with detailed protocols to enable the community to freely produce and further characterize these sars-cov- binders. based on sequence alignments with isolated rbd variants from sars-cov- that were amenable to purification and crystallization [ , ] , a sars-cov- rbd construct was designed, consisting of residues pro -gly fused to venus yfp (rbd-vyfp). this construct was expressed and secreted from expi cells, and rbd-vyfp was extracted directly from culture medium supernatant using an immobilized anti-gfp nanobody [ ], affording a highly purified product with negligible background contamination. initial efforts to cleave the c-terminal vyfp fusion partner with c protease resulted in unstable rbd, so experiments were continued with full rbd-vyfp fusion protein. to account for the presence of the vyfp fusion partner, a second rbd construct, consisting of a fusion to murine igg fc domain (rbd-fc), was commercially acquired. to remove any trace amines, buffers were exchanged to pbs via extensive dialysis. proteins were chemically biotinylated, and the degree of biotinylation was assessed by a streptavidin gel-shift assay and found to be greater than % of the target proteins [ ] . we note that while both rbd fusion proteins were well-behaved, a commercially acquired purified full-length sars-cov- spike ectodomain construct (ecd) was found to be aggregation-prone. 
very recently, we also produced an engineered spike protein ectodomain containing two point mutations known to stabilize the pre-fusion state, an inactivated furin cleavage site, and a c-terminal trimerization motif [ , , ] . while this purified pre-fusion spike (pfs) had not yet been available for binder selections and characterization by grating-coupled interferometry, it was used to conduct elisas in order to identify selected sybodies which recognize the rbd in the pre-fusion context (see below). since both our rbd constructs bear additional fusion domains (fc of mouse igg and vyfp, respectively), sybody selections were carried out with a "target swap" approach (fig. b) . hence, selections with the three sybody libraries (concave, loop and convex) were started with the rbd-vyfp construct using ribosome display, and the rbd-fc construct was then used for the two phage display rounds (selection variant : rbd-vyfp/rbd-fc/rbd-fc) and vice versa (selection variant : rbd-fc/rbd-vyfp/rbd-vyfp). accordingly, there were a total of six selection reactions (table , fig. b) . to increase the average affinity of the isolated sybodies, we included an off-rate selection step using the preenriched purified sybody pool after phage display round as competitor. to this end, sybody pools of all three libraries of the same selection variant were sub-cloned from the phage display vectors into the sybody expression vector psb_init. subsequently, the two separate pools (all sybodies of selection variants and , respectively) were expressed and purified. the purified pools were then added to the panning reactions of the respective selection variant in the second phage display round. thereby, rebinding of sybody-phage complexes with fast off-rates was suppressed. enrichment of sybodies against the rbd was monitored by qpcr. already in the first phage display round, the concave and loop sybodies of selection variant showed enrichment factors of and , respectively (table ) . 
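the enrichment factors monitored by qpcr can be illustrated with a minimal sketch. it assumes that enrichment is the ratio of phage recovered from the target (rbd) to phage recovered from the background control (mbp), and that each qpcr cycle corresponds to a doubling of template; the function name, the efficiency default, and the cq values are hypothetical and are not taken from this study.

```python
def enrichment_factor(cq_target: float, cq_background: float,
                      efficiency: float = 2.0) -> float:
    """Fold enrichment of phage eluted from the target over the background.

    Assumes equal elution volumes for both panning reactions and that a
    lower Cq means more template. `efficiency` is the per-cycle
    amplification factor (2.0 = perfect doubling, an assumption).
    """
    if efficiency <= 1.0:
        raise ValueError("per-cycle amplification factor must be > 1")
    return efficiency ** (cq_background - cq_target)


if __name__ == "__main__":
    # Hypothetical Cq values, for illustration only.
    fold = enrichment_factor(cq_target=18.0, cq_background=23.0)
    print(f"enrichment over background: {fold:.1f}-fold")
```

with this definition, an enrichment factor of 1 means that the target and the background control recovered the same amount of phage.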
after the second phage display round (which included the off-rate selection step), strong enrichment factors in the range of - were determined. after sub-cloning the pools from the phage display vector pdx_init into the sybody expression vector psb_init, clones of each of the selection reactions (table , fig. b) were picked at random and expressed in small scale. our standard elisa was initially performed using rbd-vyfp (rbd), spike ectodomain containing s and s (ecd), and maltose binding protein (mbp) as an unrelated dummy protein. as outlined in the materials and methods section, elisa analysis revealed very high hit rates for the rbd and the ecd ranging from % to % and % to %, respectively (fig. , table ). the majority of the sybodies giving an elisa signal to the rbd also gave a clear signal for the full-length spike protein (fig. ). however, there was a total of hits that only gave an elisa signal for rbd-vyfp, but not for the ecd. this could be due to the presence of cryptic rbd epitopes that are not accessible in the context of the full-length spike protein, or the respective sybodies may recognize the vyfp portion of the rbd-vyfp construct, though the selection procedure clearly disfavors the latter explanation. importantly, background binding to the dummy protein mbp was not observed for any of the analyzed sybodies, clearly showing that the binders are highly specific. we then sequenced sybodies that were elisa-positive against rbd-vyfp as well as the full-length spike ( for each of the selection reactions numbered from sb# - , see also fig. b). subsequent to sybody sequencing, we also performed the elisa using engineered pre-fusion-stabilized spike ectodomain (pfs) (fig. ), which was not available at the onset of the project. overall, the elisa signals for the ecd and pfs are highly similar. however, there are around sybodies that bind to the ecd clearly more strongly than to the pfs (yet the opposite scenario was never observed).
this could be explained by the fact that the pfs forms a trimer, while the oligomeric state of the ecd is not clear. in addition, the ecd might adopt partially or completely a post-fusion state, whereas pfs is expected to predominantly adopt the pre-fusion state. trimer formation as well as pre-fusion stabilization might shield certain binding epitopes on the rbd in the context of the pfs, which might become accessible as the spike falls apart into monomers and/or transits to the post-fusion state. in light of our elisa data, the pfs construct will be a crucial element in any future sybody selection campaigns. sequencing results of out of sybody clones were unambiguous. out of these clones, were found to be unique and the respective clone names are indicated in the elisa figure (fig. , table ). of note, there were no duplicate binders identified in both selection variants, indicating that the two separate selection streams gave rise to completely different arrays of sybodies. as an additional note, one sybody identified from the supposed convex library turned out to belong to the concave library; spill-over of sybodies across libraries is occasionally observed. hence, there was a total of concave, loop and convex sybodies, which were then aligned according to their library origin . as a final analysis, all sybody sequences were aligned to generate a phylogenetic tree, which shows a clear segregation across the three libraries and indicates a large sequence variability of the identified sybodies ( fig. ). the unique sybodies were individually expressed in e. coli and purified via ni-nta affinity chromatography and gel filtration. ultimately, sybodies were sufficiently well-behaved, with respect to solubility, yield, and monodispersity, to proceed with further characterization. for a kinetic analysis of sybody interactions with the viral spike, we employed grating-coupled interferometry (gci) to probe sybody binding to immobilized rbd-vyfp or ecd. 
first, the purified sybodies were subjected to an off-rate screen, which revealed six sybodies (sb# , sb# , sb# , sb# , sb# , and sb# ) with strong binding signals and comparatively slow off-rates. binding constants were then determined by measuring on- and off-rates over a range of sybody concentrations, revealing affinities within a range of - nm to the sars-cov- spike (fig. , table ). of note, binding affinities were consistently equal to or higher for the ecd than for the rbd-vyfp, in particular in the case of sb# , for which the off-rate differs by more than two-fold. this might indicate a binding avidity effect arising from binding epitopes clustering in the context of the spike trimer, or differences with regard to the glycan structures (rbd-vyfp was produced in hek cells, whereas the ecd was produced in insect cells). to our surprise, the majority of purified and elisa-positive sybodies ( out of ) displayed binding affinities worse than nm. this may be attributed to the presence of complex heterogeneous asn-linked glycans within the rbd, which could hinder the isolation of specific high-affinity binders. alternatively, given that the final elisa step of the selection process resulted in a substantial number of positive clones, insufficiently stringent conditions may have favored the high positive hit rate of low-affinity binders.
since virulence of sars-cov- is dependent on the ability of the viral rbd to bind to human ace (hace ), we sought to determine which of the selected sybodies that were well-behaved upon purification could inhibit interaction between the isolated rbd and purified hace . for this assessment, elisa plates were coated with purified hace , and the binding of purified rbd to the immobilized hace was measured in the presence or absence of an excess of each purified sybody (fig. ).
while the absence of any added sybody resulted in a strong elisa signal corresponding to rbd association with hace , the pre-incubation of nearly all sybodies with the rbd resulted in an attenuated signal, implying that these binders inhibit rbd-hace association. this signal decrease relative to unchallenged rbd was modest for most sybodies, with an average signal reduction of about %, but five sybodies demonstrated exceptionally high apparent inhibition of rbd-hace interaction (sb# , sb# , sb# , sb# , and sb# ), showing ≥ % signal reduction. notably, the aforementioned kinetic analysis had shown that these sybodies were also among the strongest rbd binders. taken together, this data suggests that sb# , sb# , sb# , sb# , and sb# recognize a surface region on the rbd that overlaps with the hace binding site. while kinetic analysis had revealed sb# to be among the stronger binders to the sars-cov- ectodomain (kd ≈ nm, fig. , table ), the hace competition elisa revealed that sb# does not inhibit hace -rbd interaction to the same extent as other sybodies with comparable affinities ( % inhibition for sb# , compared to > % for sb# , sb# , sb# , and sb# ). therefore, it was hypothesized that sb# may interact with a non-or partially-overlapping surface on the rbd, relative to the more strongly-inhibiting sybodies. using sb# as a representative of the hace -inhibiting sybodies, we analyzed the ability of sb# and sb# to simultaneously associate with the rbd. first, elisa experiments demonstrate that incubation of sb# with the pre-fusion spike only slightly prevents the spike from binding to immobilized sb# , whereas pre-incubation with sb# , sb# , sb# , sb# , or sb# completely prevents spike interaction with immobilized sb# (fig. ). 
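the signal reductions quoted for the hace competition elisa can be expressed as a short calculation. the sketch below assumes that each absorbance is first background-subtracted and then normalized to the rbd signal obtained without added sybody, as described in the methods; the function names and all absorbance values are hypothetical.

```python
def relative_signal(a_sample: float, a_background: float,
                    a_rbd_only: float) -> float:
    """Background-subtracted absorbance, normalized to the RBD-only well."""
    reference = a_rbd_only - a_background
    if reference <= 0:
        raise ValueError("RBD-only signal must exceed the background")
    return (a_sample - a_background) / reference


def percent_inhibition(a_sample: float, a_background: float,
                       a_rbd_only: float) -> float:
    """Signal reduction relative to unchallenged RBD, in percent."""
    return 100.0 * (1.0 - relative_signal(a_sample, a_background, a_rbd_only))


if __name__ == "__main__":
    # Hypothetical absorbance readings, for illustration only.
    inhibition = percent_inhibition(a_sample=0.25, a_background=0.05,
                                    a_rbd_only=1.05)
    print(f"signal reduction: {inhibition:.0f} %")
```

a sybody that leaves the rbd-hace signal unchanged gives 0 % inhibition under this definition, and one that reduces it to background gives 100 %.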
in agreement with the elisa data, gci experiments revealed that co-injection of sb# and sb# results in a clear (but not fully additive) increase of the response signal, relative to sb# or sb# injected alone, implying simultaneous binding of sb# and sb# (fig. ). the control gci experiment involving the co-injection of sb# and sb# did not result in a similar signal increase (fig. ). in sum, these data suggest that sb# and sb# can simultaneously bind to the rbd. for the design of therapeutics against sars-cov- , the fusion of such a pair of non-overlapping binders could provide benefits via increased overall avidity to the spike protein.
we have demonstrated the ability of our rapid in vitro selection platform to generate sybodies against the sars-cov- rbd within a two-week timeframe. characterization of these sybodies has identified a high-affinity subset of binders that also inhibit the rbd-ace interaction. we anticipate that the presented panel of anti-rbd sybodies could be of use in the design of urgently required therapeutics to mitigate the covid- pandemic, particularly in the development of inhalable prophylactic formulations [ ]. furthermore, our identification of a pair of sybodies that can simultaneously associate with the rbd may offer an attractive foundation for the construction of a polyvalent sybody-based therapeutic. we have attempted to provide a complete account of the generation of these molecules, including full sequences and detailed methods, such that other researchers may contribute to their ongoing analysis. future work may include virus neutralization assays using the identified sybodies, as well as further selection campaigns targeting additional spike epitopes. finally, our recently described flycode technology could be utilized for deeper interrogation of selection pools, in order to facilitate discovery of exceptional sybodies that possess very slow off-rates or recognize rare epitopes [ ].
a gene encoding sars-cov- residues pro -gly (rbd, genbank accession qhd . ), downstream from a modified n-terminal human serum albumin secretion signal [ ] , was chemically synthesized (geneuniversal). this gene was subcloned using fx technology [ ] into a custom mammalian expression vector [ ] , appending a c-terminal c protease cleavage site, myc tag, venus yfp [ ] , and streptavidin-binding peptide [ ] onto the open reading frame (rbd-vyfp). - ml of suspension-adapted expi cells (thermo) were transiently transfected using expifectamine according to the manufacturer protocol (thermo), and expression was continued for - days in a humidified environment at °c, % co . cells were pelleted ( g, min), and culture supernatant was filtered ( . µm mesh size) before being passed three times over a gravity column containing nhsagarose beads covalently coupled to the anti-gfp nanobody k k [ ], at a resin:culture ratio of ml resin per ml expression culture. resin was washed with column-volumes of rbd buffer (phosphate-buffered saline, ph . , supplemented with additional . m nacl), and rbd-vyfp was eluted with . m glycine, ph . , via sequential . ml fractions, without prolonged incubation of resin with the acidic elution buffer. fractionation tubes were pre-filled with / vol m tris, ph . ( µl), such that elution fractions were immediately ph-neutralized. fractions containing rbd-vyfp were pooled, concentrated, and stored at °c. purity was estimated to be > %, based on sds-page (not shown). yield of rbd-vyfp was approximately - μg per ml expression culture. a second purified rbd construct, consisting of sars-cov- residues arg -phe fused to a murine igg fc domain (rbd-fc) expressed in hek cells, was purchased from sino biological (catalogue number: -v h, µg were ordered). 
purified full-length spike ectodomain (ecd) comprising s and s (residues val -pro ) with a c-terminal his-tag and expressed in baculovirus-insect cells was purchased from sino biological (catalogue number: -v b , µg were ordered). the prefusion ectodomain of the sars-cov spike protein (residues - ) [ ] , was transiently transfected into x suspension-adapted expicho cells (thermo fisher) using mg plasmid dna and mg of pei max (polysciences) per l procho medium (lonza) in a l erlenmeyer flask (corning) in an incubator shaker (kühner). one hour post-transfection, dimethyl sulfoxide (dmso; applichem) was added to % (v/v). incubation with agitation was continued at °c for days. l of filtered ( . um) cell culture supernatant was clarified. then, a ml gravity flow strep-tactin®xt superflow® column (iba lifescience) was rinsed with ml buffer w ( mm tris, ph . , mm nacl, mm edta) using gravity flow. the supernatant was added to the column, which was then rinsed with ml of buffer w (all with gravity flow). finally, six elution steps were performed by adding each time . ml of buffer bxt ( mm biotin in buffer w) to the resin. all purification steps were performed at °c. to remove amines, all proteins were first extensively dialyzed against rbd buffer. proteins were concentrated to µm using amicon ultra concentrator units with a molecular weight cutoff of - kda. subsequently, the proteins were chemically biotinylated for min at °c using nhs-biotin (thermo fisher, # ) added at a -fold molar excess over target protein. immediately after, the three samples were dialyzed against tbs ph . . during these processes (first dialysis/ concentrating/ biotinylation/ second dialysis), %, %, % and % of the rbd-vyfp, rbd-fc, ecd and pfs respectively were lost due to sticking to the concentrator filter or due to aggregation. biotinylated rbd-vyfp, rbd-fc and ecd were diluted to µm in tbs ph . , % glycerol and stored in small aliquots at - °c. biotinylated pfs was stored at °c in tbs ph . . 
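the amount of nhs-biotin needed for a given molar excess over the target protein follows from simple unit arithmetic. the sketch below is an illustration only: the function name, the stock concentration, and every numerical value are hypothetical and do not reproduce the amounts used in this study.

```python
def nhs_biotin_volume_ul(protein_conc_um: float, protein_volume_ul: float,
                         molar_excess: float, biotin_stock_mm: float) -> float:
    """Volume (µl) of NHS-biotin stock giving the requested molar excess.

    Units: µM * µl = pmol, and a 1 mM stock contains 1 nmol per µl.
    """
    if min(protein_conc_um, protein_volume_ul, molar_excess, biotin_stock_mm) <= 0:
        raise ValueError("all inputs must be positive")
    protein_nmol = protein_conc_um * protein_volume_ul / 1000.0  # pmol -> nmol
    biotin_nmol = protein_nmol * molar_excess
    return biotin_nmol / biotin_stock_mm


if __name__ == "__main__":
    # Hypothetical example: 200 µl of 25 µM protein, 10-fold excess, 10 mM stock.
    volume = nhs_biotin_volume_ul(25.0, 200.0, 10.0, 10.0)
    print(f"add {volume:.1f} µl of NHS-biotin stock")
```

because nhs esters react with primary amines, amine-containing buffer components such as tris would compete with the protein for the reagent, which is why the proteins are dialyzed into an amine-free buffer before labeling.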
sybody selections with the three sybody libraries concave, loop and convex were carried out as described in detail before [ ]. in short, one round of ribosome display followed by two rounds of phage display were carried out. binders were selected against two different constructs of the sars-cov- rbd: an rbd-vyfp fusion and an rbd-fc fusion. mbp was used as background control to determine the enrichment score by qpcr [ ]. in order to avoid enrichment of binders against the fusion proteins (yfp and fc), we switched the two targets after ribosome display (fig. b). for the off-rate selections we did not use non-biotinylated target proteins as described in the standard protocol, because we did not have enough purified protein at hand to do so. instead, we sub-cloned all three libraries for both selections after the first round of phage display into the psb_init vector ( clones) and expressed the six pools in e. coli mc cells. then the pools corresponding to the same selection were pooled for purification. the two final pools were purified by ni-nta resin using gravity flow columns, followed by buffer exchange of the main peak fraction using a desalting pd column in tbs ph . to remove imidazole. the pools were eluted with . ml instead of . ml tbs ph . in order to ensure complete buffer exchange. these two purified pools were used for the off-rate selection in the second round of phage display at concentrations of approximately µm for selection variant (rbd-fc) and µm for selection variant (rbd-vyfp). the volume used for off-rate selection was µl. just before the pools were used for the off-rate selection, . % bsa and . % tween- were added to each sample. off-rate selections were performed for minutes.
elisas were performed as described in detail before [ ]. single clones were analyzed for each library of each selection.
since the rbd-fc construct was incompatible with our elisa format due to the inclusion of protein a to capture an α-myc antibody, elisa was performed only for the rbd-vyfp ( nm) and the ecd ( nm) and later on with the pfs ( nm). of note, the three targets were analyzed in three separate elisas. as negative control to assess background binding of sybodies, we used biotinylated mbp ( nm). positive elisa hits were sequenced (microsynth, switzerland).
the unique sybodies were expressed and purified as described [ ]. in short, all sybodies were expressed overnight in e. coli mc cells in ml cultures. the next day the sybodies were extracted from the periplasm and purified by ni-nta affinity chromatography (batch binding) followed by size-exclusion chromatography (sec) using a sepax srt- c sec column equilibrated in tbs, ph . , containing . % (v/v) tween- (detergent was added for subsequent kinetic measurements). six out of the binders (sb# , sb# , sb# , sb# , sb# , sb# ) were excluded from further analysis due to suboptimal behavior during sec analysis (i.e. aggregation or excessive column matrix interaction).
kinetic characterization of sybodies binding to sars-cov- spike proteins was performed using gci on the wavesystem (creoptix ag, switzerland), a label-free biosensor. biotinylated rbd-vyfp and ecd were captured onto a streptavidin pcp-sta wavechip (polycarboxylate quasi-planar surface; creoptix ag) to a density of - pg/mm . sybodies were first analyzed by an off-rate screen performed at a concentration of nm (data not shown) to identify binders with sufficiently high affinities. the six sybodies sb# , sb# , sb# , sb# , sb# , and sb# were then injected at increasing concentrations ranging from . nm to μm (three-fold serial dilution, concentrations) in tbs buffer supplemented with . % tween- . sybodies were injected for s at a flow rate of μl/min per channel and dissociation was set to s to allow the return to baseline.
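the injection scheme above can be sketched in a few lines, assuming a simple 1:1 (langmuir) interaction model in which the equilibrium dissociation constant is the ratio of the off-rate to the on-rate. the function names, rate constants, and concentrations below are hypothetical and serve only to illustrate how a three-fold dilution series and the two phases of a sensorgram are described; this is not the fitting routine of the instrument software.

```python
import math


def serial_dilution(top_conc_nm: float, fold: float, n_points: int) -> list[float]:
    """Concentrations (highest first) of an n-point serial dilution."""
    return [top_conc_nm / fold**i for i in range(n_points)]


def equilibrium_kd(kon: float, koff: float) -> float:
    """K_D = k_off / k_on (in M if k_on is given in 1/(M*s) and k_off in 1/s)."""
    return koff / kon


def association_response(t: float, conc_m: float, kon: float, koff: float,
                         rmax: float) -> float:
    """Response during analyte injection for a 1:1 binding model."""
    k_obs = kon * conc_m + koff                      # observed rate constant
    r_eq = rmax * conc_m / (conc_m + koff / kon)     # plateau at this concentration
    return r_eq * (1.0 - math.exp(-k_obs * t))


def dissociation_response(t: float, r_start: float, koff: float) -> float:
    """Response after the injection ends (single-exponential decay)."""
    return r_start * math.exp(-koff * t)


if __name__ == "__main__":
    # Hypothetical kinetic constants and concentrations, for illustration only.
    kon, koff, rmax = 1e5, 1e-3, 100.0
    print(f"K_D = {equilibrium_kd(kon, koff):.1e} M")
    for conc_nm in serial_dilution(1350.0, 3.0, 4):
        r_end = association_response(120.0, conc_nm * 1e-9, kon, koff, rmax)
        print(f"{conc_nm:7.1f} nM -> response after 120 s: {r_end:.1f}")
```

in this model a binder with a slower off-rate decays more slowly in the dissociation phase, which is the property an off-rate screen ranks.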
sensorgrams were recorded at °c and the data analyzed on the wavecontrol (creoptix ag). data were double-referenced by subtracting the signals from blank injections and from the reference channel. a langmuir : model was used for data fitting. purified recombinant hace protein (mybiosource, cat# mbs ) was diluted to nm in phosphate-buffered saline (pbs), ph . , and μl aliquots were incubated overnight on nunc maxisorp -well elisa plates (thermofisher # - - ) at °c. elisa plates were washed three times with μl tbs containing . % (v/v) tween- (tbst). plates were blocked with μl of . % (w/v) bsa in tbs for h at room temperature. μl samples of biotinylated rbd-vyfp ( nm) mixed with individual purified sybodies ( nm) were prepared in tbs containing . % (w/v) bsa and . % (v/v) tween- (tbs-bsa-t) and incubated for . h at room temperature. these μl rbd-sybody mixtures were transferred to the plate and incubated for minutes at room temperature. μl of streptavidin-peroxidase (merck, cat#s ) diluted : in tbs-bsa-t was incubated on the plate for h. finally, to detect bound biotinylated rbd-vyfp, μl of development reagent containing , ′, , ′-tetramethylbenzidine (tmb), prepared as previously described [ ] , was added, color development was quenched after - min via addition of μl . m sulfuric acid, and absorbance at nm was measured. background-subtracted absorbance values were normalized to the signal corresponding to rbd-vyfp in the absence of added sybodies. purified sybodies carrying a c-terminal myc-his tag (sb_init expression vector) were diluted to nm in µl pbs ph . and directly coated on nunc maxisorp -well plates (thermofisher # - - ) at °c overnight. the plates were washed once with µl tbs ph . per well followed by blocking with µl tbs ph . containing . % (w/v) bsa per well. in parallel, chemically biotinylated prefusion spike protein (pfs) at a concentration of nm was incubated with nm sybodies for h at room temperature in tbs-bsa-t. 
the plates were washed three times with µl tbs-t per well. then, µl of the pfs-sybody mixtures were added to the corresponding wells and incubated for min, followed by washing three times with µl tbs-t per well. µl streptavidin-peroxidase polymer (merck, cat#s ) diluted : in tbs-bsa-t was added to each well and incubated for min, followed by washing three times with µl tbs-t per well. finally, to detect pfs bound to the immobilized sybodies, µl elisa developing buffer (prepared as described previously [ ] ) was added to each well, incubated for h (due to low signal) and absorbance was measured at nm. as a negative control, tbs-bsa-t devoid of protein was added to the corresponding wells instead of a pfssybody mixture. ( ) ( ) ) sb# belongs to the concave library (spill-over). ) two sequencing reactions failed. sb# qvqlvesggglvqaggslrlscaasgfpvrkanmhwyrqapgkerewvaaimskgeqtvyadsve grftisrdnakntvylqmnslkpedtavyycrvfvgwhyfgqgtqvtvs sb# qvqlvesggglvqaggslrlscatsgfpvyqanmhwyrqapgkerewvaaiqsygdgthyadsvk grftisrdnakntvylqmnslkpedtavyycravyvgmhyfgqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvnyktmwwyrqapgkerewvaaiwsyghtthyadsvk grftisrdnakntvylqmnslkpedtavyycvvwvghnyegqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvyaqnmhwyrqapgkerewvaaiyshgywtlyadsvk grftisrdnakntvylqmnslkpedtavyycevqvgawytgqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvfsghmhwyrqapgkerewvaailsngdsthyadsvk grftisrdnakntvylqmnslkpedtavyycrvhvgahyfgqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpveqgrmywyrqapgkerewvaaiishgtvtvyadsvk grftisrdnakntvylqmnslkpedtavyycyvyvgaqywgqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvlftymhwyrqapgkerewvaaiwssgnstwyadsvk grftisrdnakntvylqmnslkpedtavyycfvkvgnwyagqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvnagnmhwyrqapgkerewvaaiqsygrttyyadsvk grftisrdnakntvylqmnslkpedtavyycrvfvgmhyfgqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvssstmtwyrqapgkerewvaainsygwethyadsvk grftisrdnakntvylqmnslkpedtavyycyvyvggsyigqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvqshymrwyrqapgkerewvaaiestghhtayadsvk 
grftisrdnakntvylqmnslkpedtavyyctvyvgyeyhgqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvetenmhwyrqapgkerewvaaiyshgmwtayadsvk grftisrdntkntvylqmnslkpedtavyycevevgkwyfgqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvkasrmywyrqapgkerewvaaiqsfgevtwyadsvk grftisrdnakntvylqmnslkpedtavyycyvwvgqeywgqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvyasnmhwyrqapgkerewvaaiesqgymtayadsvk grftisrdnakntvylqmnslkpedtavyycwvivgeyyvgqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvqaremewyrqapgkerewvaaikstgtytayaysvk grftisrdnakntvylqmnslkpedtavyycyvyvgssyigqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvknfemewyrkapgkerewvaaiqsggvetyyadsvk grftisrdnakntvylqmnslkpedtavyycfvyvgrsyigqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvayktmwwyrqapgkerewvaaiesygikwtryadsv kgrftisrdnakntvylqmnslkpedtavyycivwvgaqyhgqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvagrnmwwyrqapgkerewvaaiyssgtyteyadsvk grftisrdnakntvylqmnslkpedtavyychvwvgslykgqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvkharmwwyrqapgkerewvaaidshgdttwyadsvk grftisrdnakntvylqmnslkpedtavyycyvyvgasywgqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvnshemtwyrqapgkerewvaaiqstgtvteyadsvk grftisrdnakntvylqmnslkpedtavyycyvyvgssylgqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpveqremewyrqapgkerewvaaidsngnytfyadsvk grftisrdnakntvylqmnslkpedtavyycyvyvgksyigqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvkhhwmfwyrqapgkerewvaaiksygygteyadsvk grftisrdnakntvylqmnslkpedtavyycfvgvgthyagqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvyaaemewyrqapgkerewvaaissqgtityyadsvk grftisrdnakntvylqmnslkpedtavyycfvyvgksyigqgtqvsvs sb# qvqlvesggglvqaggslrlscaasgfpvhawemawyrqapgkerewvaairsfgssthyadsvk grftisrdnakntvylqmnslkpedtavyycnvkdfgthhyaydywgqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvntwwmhwyrqapgkerewvaaitswgfrtyyadsvk grftisrdnakntvylqmnslkpedtavyycnvkdkgmavqwydywgqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvyntwmewyrqapgkerewvaaitshgyktyyadsvk grftisrdnakntvylqmnslkpedtavyycnvkdegdmftaydywgqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvyhstmfwyrqapgkerewvaaiyssgqhtyyadsvk grftisrdnakntvylqmnslkpedtavyycnvkdsgqwrqeydywgqgtqvtvs sb# 
qvqlvesggglvqaggslrlscaasgfpvehemawyrqapgkerewvaairsmgrktlyadsvkg rftisrdnakntvylqmnslkpedtavyycnvkdfgytwheydywgqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvtmawmwwyrqapgkerewvaairsegvrtyyadsvk grftisrdnakntvylqmnslkpedtavyycnvkdygqahayydywgqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvnshfmewyrqapgkerewvaaiqhssgfhtyyadsv kgrftisrdnakntvylqmnslkpedtavyycnvkdtgttedydywgqgtqvtvs sb# qvqldesggglvqaggslrlscaasgfpvyhawmewyrqapgkerewvaaitssgrhtyyadsvk grftisrdnakntvylqmnslkpedtavyycnvkdagrvynsydywgqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvahawmewyrqapgkerewvaaitsygyktyyadsvk grftisrdnakntvylqmnslkpedtavyycnvkdtgtyrfyydywgqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvwnqtmvwyrqapgkerewvaaiwsmghtyyadsvkg rftisrdnakntvylqmnslkpedtavyycnvkdagvynryydywgqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvehywmewyrqapgkerewvaaitsfgyrtyyadsvk grftisrdnakntvylqmnslkpedtavyycnvkdwgfashaydywgqgiqvtvs sb# qvqlvesggglvqaggslrlscaasgfpeiawemawyrqapgkerewvaairsfgertlyadsvk grftisrdnakntvylqmnslkpedtavyycnvkdfgwqhqeydywgqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvyhaymewyrqapgkerewvaaiysngehtyyadsvk grftisrdnakntvylqmnslkpedtavyycnvkdsgsfnqaydywgqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvewshmhwyrqapgkerewvaaivskggytlyadsvk grftisrdnakntvylqmnslkpedtavyycnvkdygvhfkrydywgqgtqvtvi sb# qvqlvesggglvqaggslrlscaasgfpvfhvwmewyrqapgkerewvaaidsagwhtyyadsvk grftisrdnakntvylqmnslkpedtavyycnvkdagnttsaydywgqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvyynwmewyrqapgkerewvaaihsngdetfyadsvk grftisrdnakntvylqmnslkpedtavyycnvkdidaeayaydywgqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvyhvwmewyrqapgkerewvaaitssgshtyyadsvk grftisrdnakntvylqmnslkpedtavyycnvkdsgqwrvqydywgqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvywhhmhwyrqapgkerewvaaiiswgwyttyadsvk grftisrdnakntvylqmnslkpedtavyycnvkdhgaqnqmydywgqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvyrdrmawyrqapgkerewvaaiysagqqtryadsvk grftisrdnakntvylqmnslkpedtavyycnvkdvghhyeyydywgqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvdngymhwyrqapgkerewvaaidsygwhtiyadsvk 
grftisrdnakntvylqmnslkpedtavyycnvkdkgqmraaydywgqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvswhsmywyrqapgkerewvaaifsegdwtyyadsvk grftisrdnakntvylqmnslkpedtavyycnvkdygssyykydywgqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvsqsvmawyrqapgkerewvaaiyskgqythyadsvk grftisrdnakntvylqmnslkpedtavyycnvkdagssywdydywgqgtqvtvs sb# qvqlvesgggsvqaggslrlscaasgsigqieylgwfrqapgkeregvaalntwtgrtyyadsvk grftvsldnakntvylqmnslkpedtalyycaaarwgrtkplntyyysywgqgtpvtvs sb# qvqlvesgggsvqaggslrlscaasgyidkivylgwfrqapgkeregvaalytlsghtyyadsvk grftvsldnakntvylqmnslkpedtalyycaaateghahalyrlhyywgqgtqvtvs sb# qvqlvesggglvqaggslrlscaasgfpvyqgemhwyrqapgkerewvaairstgvqtwyadsvk grftisrdnakntvylqmnslkpedtavyycrvwvgthyfgqgtqvtvs sb# qvqlvesgggsvqaggslrlscaasgniqriyylgwfrqapgkeregvaalmtytghtyyadsvk grftvsldnakntvylqmnslkpedtalyycaaayvgaenplpysmygywgqgtqvtvs sb# qvqlvesgggsvqaggslrlscaasgqishikylgwfrqapgkeregvaalitrwgqtyyadsvk grftvsldnakntvylqmnslkpedtalyycaaadygasdplwfihylywgqgtqvtvs sb# qvqlvesgggsvqaggslrlscaasgkiwtikylgwfrqapgkeregvaalmtrwgytyyadsvk grftvsldnakntvylqmnslkpedtalyycaaanygsnfplaeedywywgqgtqvtvs sb# qvqlvesgggsvqaggslrlscaasgnisqihylgwfrqapgkeregvaalntdygytyyadsvk grftvsldnakntvylqmnslkpedtalyycaaayyfgddiplwweaysywgqgtqvtvs sb# qvqlvesgggsvqaggslrlscaasgnistieylgwfrqapgkeregvaalytwhgqtyyadsvk grftvsldnakntvylqmnslkpedtalyycaaarwgrhmplsateysywgqgtqvtvs sb# qvqlvesgggsvqaggslrlscaasgniesiyylgwfrqapgkeregvaalwtgdgetyyadsvk grftvsldnakntvylqmnslkpedtalyycaaaawgnsaplttyryyywgqgtqvtvs sb# qvqlvesgggsvqaggslrlscaasgfiygitylgwfrqapgkeregvaalvtwngqtyyadsvk grftvsldnakntvylqmnslkpedtalyycaaadwgydwplwdewywywgqgtqvtvs sb# qvqlvesgggsvqaggslrlscaasgtiadikylgwfrqapgkeregvaalmtrwgstyyadsvk grftvsldnakntvylqmnslkpedtalyycaaanyganyplysqqysywgqgtqvtvs sb# qvqlvesgggsvqaggslrlscaasgsissikylgwfrqapgkeregvaalmtrwgmtyyadsvk grftvsldnakntvylqmnslkpedtalyycaaanyganeplqythynywgqgtqvtvs sb# qvqlvesgggsvqaggslrlscaasgeiesifylgwfrqapgkeregvaalytyvgqtyyadsvk grftvsldnakntvylqmnslkpedtalyycaaasygaahplsimryyywgqgtqvtvs sb# 
qvqlvesgggsvqaggslrlscaasgtiahikylgwfrqapgkeregvaalmtkwgqtyyadsvk grftvsldnakntvylqmnslkpedtalyycaaasyganfplkasdysywgqgtqvtvs sb# qvqlvesgggsvqaggslrlscaasgsiqaitylgwfrqapgkeregvaalvtwngqtyyadsvk grftvsldnakntvylqmnslkpedtalyycaaadwgydwplwdewywywgqgtqvtvs sb# qvqlvesgggsvqaggslrlscaasgsissitylgwfrqapgkeregvaalvtysgntyyadsvk grftvsldnakntvylqmnslkpedtalyycaaatwghswplyndeywywgqgsqvtvs sb# qvqlvesgggsvqaggslrlscaasgsissitylgwfrqapgkeregvaalitvnghtyyadsvk grftvsldnakntvylqmnslkpedtalyycaaaawgyawplhqddywywgqgtqvtvs sb# qvqlvesgggsvqaggslrlscaasgsissitylgwfrqapgkeregvaalntfngttyyadsvk grftvsldnakntvylqmnslkpedtalyycaaatwgyswpliaeynwywgqgtqvtvs sb# qvqlvesgggsvqaggslrlscaasgsissitylgwfrqapgkeregvaalktqagftyyadsvk grftvsldnakntvylqmnslkpedtalyycaaanwgyswplyeaddwywgqgtqvtvs the plasmids encoding for the six highest affinity binders will very soon be available through addgene (addgene # -# ). for each of the six independent selec on reac ons, clones were picked at random and analyzed by elisa. a non-randomized sybody was used as nega ve control (wells h and h , respec vely). sybodies that were sequenced are marked with the respec ve sybody name (sb# - ). please note that iden cal sybodies that were found - mes are marked with the same sybody name (e.g. sb# ) . elisa analyses shown in these graphs were performed on three different days: ( ) rbd and mbp, ( ) ecd, ( ) . . . . concave adsvkgrftisrdnakntvylqmnslkpedtavyycx-vxvgxxyxgqgtqvtvs phylogene c tree of rbd sybodies. a radial tree was generated in clc . . . sybodies inhibit rbd binding to ace . the effect of sybodies on rbd associa on with human ace was assessed with an elisa. individual sybodies ( nm, sybody number shown on x-axis) were incubated with bio nylated rbd-vyfp ( nm) and the mixtures were exposed to immobilized ace . bound rbd-vyfp was detected with streptavidin-peroxidase/tmb. 
each column indicates background-subtracted absorbance at nm, normalized to the signal corresponding to rbd-vyfp in the absence of sybody (dashed red line).

simultaneous binding of sb# and sb# . (a) simultaneous binding of sybodies was analyzed using grating-coupled interferometry on the wave system (creoptix ag, switzerland). biotinylated ecd was immobilized and the binders were injected alone and simultaneously at saturating concentrations (sb# : nm, sb# : nm, sb# : nm). superimposed sensorgrams are shown. (b) competition elisa. titles of the graphs indicate the sybody which was directly coated on the plate at a concentration of nm. the labels on the x-axes depict the sybody used for competition. to determine the background signal, buffer devoid of protein was added.

herd immunity - estimating the level required to halt the covid- epidemics in affected countries
the reproductive number of covid- is higher compared to sars coronavirus
estimation of the reproductive number of novel coronavirus (covid- ) and the probable outbreak size on the diamond princess cruise ship: a data-driven analysis
presumed asymptomatic carrier transmission of covid-
estimating clinical severity of covid- from the transmission dynamics in wuhan, china
predicting the future trajectory of covid-
preliminary identification of potential vaccine targets for the covid- coronavirus (sars-cov- ) based on sars-cov immunological studies.
viruses
the sars-cov- vaccine pipeline: an overview
use of antiviral drugs to reduce covid- transmission
a novel coronavirus outbreak of global health concern
a sars-like cluster of circulating bat coronaviruses shows potential for human emergence
structure, function, and evolution of coronavirus spike proteins
cryo-em structure of the -ncov spike in the prefusion conformation
structure, function, and antigenicity of the sars-cov- spike glycoprotein
sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor

we thank rony nehmé and andré heuer (creoptix ag, wädenswil, switzerland) for the acquisition, fitting and interpretation of gci measurements using the wavesystem. we thank florence projer, david hacker and kelvin lau (protein production and structure core facility, epfl, switzerland) for the production of the pre-fusion spike protein. we are grateful to jason mclellan (the university of texas at austin, u.s.) for having provided the pre-fusion-stabilized soluble spike expression vector.

kinetic characterization of the top six sybodies. (a) binding kinetics were measured by grating-coupled interferometry on the wave system (creoptix ag, switzerland). rbd-vyfp and ecd were immobilized and the sybodies were injected at increasing concentrations ranging from . nm to μm. data were fitted using a langmuir : model.

key: cord- -ndke agh
authors: gollapalli, pavan; b. s, sharath; rimac, hrvoje; patil, prakash; nalilu, suchetha kumari; kandagalla, shivanandha; shetty, praveenkumar
title: pathway enrichment analysis of virus-host interactome and prioritization of novel compounds targeting the spike glycoprotein receptor binding domain–human angiotensin-converting enzyme interface to combat sars-cov-
date: - -
journal: journal of biomolecular structure & dynamics
doi: . / . .
sha:
doc_id:
cord_uid: ndke agh

sars-cov- has become a pandemic causing a serious global health concern.
the absence of effective drugs for treatment of the disease has caused its rapid spread on a global scale. similarly to sars-cov, sars-cov- is involved in a complex interplay with the host cells. the infection is characterized by diffuse alveolar damage consistent with acute respiratory distress syndrome (ards). to explore the complex mechanisms of the disease at the system level, we used a network medicine approach. the protein-protein interactions (ppis) between sars-cov and the associated human cell proteins are crucial for the viral pathogenesis. since the cellular entry of sars-cov- is accomplished by binding of the spike glycoprotein receptor-binding domain (rbd) to the human angiotensin-converting enzyme (hace ), a molecule that can bind to the spike rbd-hace interface could block the virus entry. here, we performed a virtual screening of compounds to identify potential molecules that can bind to the spike glycoprotein and the spike-ace complex interface. it was found that the compounds ethyl -{ -[( , -dichlorobenzyl) carbamoyl]- -ethyl- -fluoro- -oxo- , -dihydro- -quinolinyl}- -piperidine carboxylate (the s ligand) and ethyl -{ -[( , -dichlorobenzyl) carbamoyl]- -ethyl- -fluoro- -oxo- , -dihydro- -quinolinyl}- piperazine carboxylate (the s ligand) form hydrophobic interactions with tyr a, tyr b and tyr b, leu a, phe b, respectively, of the spike glycoprotein, which are the hotspot residues in the spike glycoprotein rbd-hace binding interface. furthermore, molecular dynamics simulations and free energy calculations using the mm-gbsa method showed that the s ligand is a stronger binder than a known sars-cov spike inhibitor, ssaa e (n-( , -dioxo- , -dihydroanthracen- -yl) benzamide). communicated by ramaswamy h. sarma

covid- is a coronavirus disease caused by the severe acute respiratory syndrome coronavirus- (sars-cov- ).
the antiviral therapeutics acting on the virus exhibit various modes of action, such as disabling viral rna synthesis and virus replication, or blocking virus attachment to the host cell receptors (angiotensin-converting enzyme , ace ) or to the viral structural proteins in order to inhibit the viral self-assembly process (canrong et al., ). the host-sars-cov interaction is established through various strategies, and various host cellular mechanisms are utilized by the virus for its successful multiplication during the infection (zumla et al., ). the mechanism of the viral infection can be elucidated systematically by identifying the protein-protein interactions (ppis) during the virus-host interplay (yang et al., ; chuang et al., ). viruses induce malfunction of the host cell by mimicking interacting domains of the host proteins, thus manipulating the signalling networks and cellular responses for their benefit (pawson & warner, ). in this way, by interacting with the host proteins, viruses (e.g. sars-cov- ) alter host responses at a systems level. one of the most effective ways to develop a potential drug for sars-cov- within a short period of time is to repurpose existing compounds. recently, several compounds with anti-covid properties were identified by this approach. combination therapy was found to be more efficient in combating certain viruses, such as hiv and coronavirus, and hence the synergistic effect of lopinavir, oseltamivir, and ritonavir was used against the sars-cov- protease (mpro). a better insight into the interaction between these three drugs and mpro was obtained by performing molecular docking and molecular dynamics simulations (nisha muralidharan et al., ).
a comparative analysis of sars-cov- mpro with proteases of other viruses from the coronaviridae family, followed by virtual screening of phytochemicals, active ingredients of ayurvedic anti-tussive medicines in india, and synthetic anti-viral drugs, revealed several potential sars-cov- mpro inhibitors, such as delta d-viniferin, myricitrin, chrysanthemin, myristicin, taiwanhomoflavone a, lactucopicrin -oxalate, nympholide a, afzelin, biorobin, hesperidin and phyllaemblicin b. these molecules showed an equally strong binding to other sars-cov- targets, e.g. rdrp and hace- (joshi et al., ). recent studies using molecular dynamics (md) simulations of the isothymol-ace docked complex revealed that isothymol is a functional inhibitor of ace activity and that the components of ammoides verticillata essential oils can be used as potential inhibitors of the ace receptor-sars-cov- interaction (abdelli et al., ). in silico studies on the binding affinity of a truncated ace (tace ) for the spike glycoprotein rbd by protein-protein docking and md simulations demonstrated that tace has a higher binding affinity for the rbd than the intact ace and thus forms a more stable complex (basit et al., ). drugs that can interfere with the binding of the sars-cov- rbd to human ace (hace ) can potentially prevent sars-cov- from entering human cells. nine short peptides that have this potential were designed by liu et al. ( ), and md simulations of the free peptides and their sars-cov- rbd-bound forms showed a high binding affinity of the peptides to the sars-cov- spike glycoprotein (lupala et al., ). in the present work, we employed computational approaches to model protein-protein interactions of the host-virus complex, and functional enrichment and pathway analysis of the gene/protein set was performed.
as noted above, virus entry into the host cell is initiated by binding to human ace via the receptor-binding domain (rbd) of the spike glycoprotein, which therefore serves as a potential drug target (lupala et al., ). therefore, the genes/proteins which are the first neighbours of the spike glycoprotein in the interaction network were used to gain mechanistic insights into the virus-host interplay. this information was then used for the virtual screening of a small library of compounds against the spike glycoprotein rbd. the top hit molecules from this screening were then docked to the sars-cov- spike glycoprotein rbd-ace interface, after which molecular dynamics simulations of the top-scored compound and a reference ligand were performed to compare their binding affinities.

the search tool for the retrieval of interacting genes/proteins database specific for viral-host interactions (stringvirus v . ) was used to construct the network of the human-sars coronavirus protein-protein interactions (cook et al., ). given a set of viral proteins, the stringvirus database generates a ppi network between the query proteins and their associated human proteins, with emphasis on primary interactions. sars-cov- shares a high nucleotide sequence identity of . % with the human sars-cov. hence, human protein data associated with sars-cov were used here to construct the protein-protein interaction network. first, based on the virus seed proteins, an interaction network with the associated human proteins was constructed. these interactions were derived from different sources: text mining, experiments, databases, co-expression, neighbourhood, gene fusion, and co-occurrence, with a mean confidence level of . . later, the number of interactions was increased to . cytoscape . . (su et al., ) with default settings was used for the network visualization and to analyse and calculate the properties of the nodes. several topological measures, i.e.
degree (k), betweenness centrality (bc), eccentricity, closeness centrality (cc), network density, diameter, average number of clusters, average shortest path length, and clustering coefficient were adopted to evaluate nodes of the ppi network (albert & barabási, ; barabási & oltvai, ). these topological parameters were calculated using the networkanalyzer (fienner et al., ). the input and output values of the node are received as mathematical functions (jeanquartier et al., ).

a comprehensive analysis and visualization of a functionally enriched set of genes was performed using cluego (bindea et al., ), a cytoscape plug-in that significantly improves the biological interpretation of large lists of genes. a functionally organized go/pathway term network was created by integrating gene ontology (go) terms as well as kegg pathways. we considered the first neighbours of the hub spike glycoprotein for the functional enrichment analysis. a total of neighbours were found to interact with the spike glycoprotein. parameters specified for the protein/gene list enrichment analysis were set as follows: statistical test: enrichment/depletion (two-sided hypergeometric test); correction method: bonferroni step-down; min go level: ; max go level: ; kappa score threshold: . ; go fusion: false; go group: true; p-value threshold: . .

the drug-like compounds were collected from the zinc database (http://zinc .docking.org) (sterling & irwin, ) by using the search term 'spike glycoprotein'. the structures of all the compounds were obtained in smiles format. the compounds were protonated at physiological ph in their three-dimensional ( d) conformation, and biologically relevant tautomers were generated for each molecule.
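several of the topological measures listed above (degree, closeness centrality, eccentricity, network density, diameter and average shortest path length) can be illustrated on a toy undirected network. the sketch below is a plain-python illustration under stated assumptions: it is not the networkanalyzer implementation, it assumes a connected graph, closeness is taken as the reciprocal of the mean shortest path length to all other nodes, and the node labels are hypothetical.

```python
from collections import deque


def bfs_distances(graph, source):
    """Shortest path lengths (in edges) from source to every reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for neighbour in graph[node]:
            if neighbour not in dist:
                dist[neighbour] = dist[node] + 1
                queue.append(neighbour)
    return dist


def topology_summary(graph):
    """Per-node and global measures for an undirected, connected graph (n >= 2)."""
    n = len(graph)
    edges = sum(len(nbs) for nbs in graph.values()) // 2
    degree = {node: len(nbs) for node, nbs in graph.items()}
    closeness, eccentricity = {}, {}
    path_sum = 0
    for node in graph:
        others = [d for other, d in bfs_distances(graph, node).items() if other != node]
        closeness[node] = len(others) / sum(others)   # 1 / mean distance
        eccentricity[node] = max(others)              # farthest node
        path_sum += sum(others)
    return {
        "degree": degree,
        "closeness": closeness,
        "eccentricity": eccentricity,
        "density": 2 * edges / (n * (n - 1)),
        "diameter": max(eccentricity.values()),
        "average_shortest_path": path_sum / (n * (n - 1)),
    }


# Toy network with hypothetical labels: one hub and four peripheral nodes.
toy = {
    "hub": {"p1", "p2", "p3"},
    "p1": {"hub"},
    "p2": {"hub"},
    "p3": {"hub", "p4"},
    "p4": {"p3"},
}
summary = topology_summary(toy)
```

betweenness centrality and the clustering coefficient are omitted here to keep the sketch short.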
a known inhibitor of the sars-cov spike glycoprotein, ssaa e (n-( , -dioxo- , -dihydroanthracen- -yl) benzamide), which prevents the fusion of the viral membrane with the host cell membrane and blocks the interaction of the sars spike glycoprotein with the ace receptor (adedeji et al., ), was taken as the reference molecule. the d structure of the coronavirus spike glycoprotein receptor-binding domain (rbd) complexed with the ace receptor (pdb entry: lzg; resolution: . Å) was obtained from the rcsb protein data bank (wrapp et al., ; berman et al., ). both the receptor and the ligand molecules were prepared for docking using the ucsf chimera . program (pettersen et al., ). initial docking calculations were performed using autodock vina (trott & olson, ) with a modified python script on the ubuntu . lts platform. to detect the probable binding sites for all ligands on the spike glycoprotein rbd (chain b) and hace (chain a), we employed a blind docking procedure for both chains separately. thereafter, we opted for the spike glycoprotein rbd active pocket sites rather than the hace receptor, because hace is expressed in various types of human cells and targeting hace might cause more side effects. before docking, the hace domain (chain a) was deleted from the original pdb complex ( lzg). additionally, ligand and water molecules were removed from the structure, and polar hydrogen atoms and gasteiger charges were added. all ligand structures collected from the zinc database (sterling & irwin, ) and the reference ssaa e ligand were imported into ucsf chimera . as smiles strings, after which their structures were optimized using openbabel . . (o'boyle et al., ). as a final step before docking, receptor and ligand molecules were saved in the pdbqt format using autodocktools (adt; mgl . . ) (morris et al., ). initially, the spike glycoprotein rbd-hace complex was used to obtain the interface residues by using pdbsum (laskowski et al., ).
a grid map of size × × Å was generated with a . Å spacing to cover the interface area (centred at − . , . , . ). for initial compound screening, both exhaustiveness and the number of binding modes were set to . docking calculations were first performed for the spike glycoprotein rbd (chain b). the top five molecules underwent a second round of docking, with the exhaustiveness parameter set to for a better conformation search. docking was conducted for both the spike glycoprotein rbd alone (chain b) and the spike glycoprotein rbd-ace complex (at the interface). top hit molecules were then analysed and visualized using maestro . (schrödinger release ) and chimerax (goddard et al., ).

md simulations for all ligands (the reference ssaa e ligand and the s , s , s , s , and s ligands) were run in complex with the sars-cov- spike glycoprotein ( lzg). docked positions with the highest affinities to the protein were used as starting points for the md simulations. the amber ff sb force field (maier et al., ) was used to model the protein, and the gaff force field, as implemented in antechamber (wang et al., ), was used for the ligands. the protein-ligand complexes were solvated in a truncated octahedral box of tip p water molecules spanning a Å thick buffer, and na+ and cl− ions were added according to machado and pantano ( ) to achieve a neutral environment with a salt concentration of . m (number of water molecules , na+ ions , and cl− ions for the standard ssaa e ligand; number of water molecules , na+ ions , and cl− ions for the s ligand; number of water molecules , na+ ions , and cl− ions for the s and s ligands; number of water molecules , na+ ions , and cl− ions for the s ligand; and number of water molecules , na+ ions , and cl− ions for the s ligand). these structures were then submitted for geometry optimization in the amber program (case et al., ), employing periodic boundary conditions in all directions.
for the first cycles, the complex was restrained and only water molecules were optimized, after which another cycles of optimization followed in which both the water molecules and the complex were unrestrained. optimized systems were gradually heated from to k and equilibrated during ps using nvt conditions, followed by productive and unconstrained md simulations of ns employing a time step of fs at constant pressure ( atm) and temperature ( k), the latter held constant using a langevin thermostat with a collision frequency of ps⁻¹. bonds involving hydrogen atoms were constrained using the shake algorithm (ryckaert et al., ), while the long-range electrostatic interactions were calculated employing the particle mesh ewald method (darden et al., ). the non-bonded interactions were truncated at . Å. analysis of the trajectories was performed using the cpptraj module of ambertools (roe & cheatham, ).

the binding energy, ΔG_bind, of the simulated complexes was calculated using the mm-gbsa (molecular mechanics - generalized born surface area) protocol (genheden & ryde, ; hou et al., ), available as a part of ambertools (case, ). mm-gbsa is a method for the calculation of ΔG_bind from snapshots of an md trajectory (ferenczy, ) with an estimated standard error of - kcal/mol (genheden & ryde, ). ΔG_bind is calculated in the following manner:

$\Delta G_{\mathrm{bind}} = \langle G_{\mathrm{complex}} \rangle - \langle G_{\mathrm{protein}} \rangle - \langle G_{\mathrm{ligand}} \rangle$

where the symbol < > represents the average value over snapshots collected from a ns part of the corresponding md trajectories. the whole trajectory was divided into parts of ns length, and ΔG_bind was calculated for all parts of the simulation and reported as mean ± standard deviation. the calculated mm-gbsa binding free energies were decomposed into specific residue contributions on a per-residue basis according to established procedures.
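the reporting scheme described above (splitting the trajectory into equal parts, computing the binding energy for each part, and giving the mean ± standard deviation) can be sketched as follows. this is an illustrative sketch, not the ambertools workflow: the function name `block_average` and the per-snapshot energies are hypothetical, and the use of the sample standard deviation is an assumption, since the text does not specify which estimator was used.

```python
from statistics import mean, stdev


def block_average(values, n_blocks):
    """Split per-snapshot values into contiguous equal blocks.

    Returns the mean and the sample standard deviation of the block means.
    Trailing snapshots that do not fill a complete block are ignored.
    """
    if n_blocks < 2:
        raise ValueError("at least two blocks are needed for a standard deviation")
    size = len(values) // n_blocks
    if size == 0:
        raise ValueError("fewer snapshots than blocks")
    block_means = [mean(values[i * size:(i + 1) * size]) for i in range(n_blocks)]
    return mean(block_means), stdev(block_means)


# Hypothetical per-snapshot binding energies in kcal/mol (not data from the study).
energies = [-30.0, -32.0, -28.0, -30.0, -34.0, -32.0]
dg_mean, dg_sd = block_average(energies, n_blocks=3)
```

with these made-up values the three block means are −31, −29 and −33 kcal/mol, so the result would be reported as −31 ± 2 kcal/mol.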
this protocol calculates the contributions to ΔG_bind arising from each amino acid residue and identifies the nature of the energy change in terms of interaction and solvation energies, or entropic contributions (gohlke et al., ; rastelli et al., ). in this case, the entropy term was not calculated.

identification of the potential metabolic sites of a drug can give key information on its pharmacokinetic and pharmacodynamic characteristics. drugs are commonly metabolized by a special class of enzymes known as cytochrome p (cyp) enzymes. accordingly, metabolic sites for cyp-mediated metabolism were predicted for the top hit molecule using the smartcyp . tool (rydberg et al., ).

the protein-protein interaction network was constructed by assembling the sars-cov associated human proteins using the stringvirus database. based on various experimentally collected data, we obtained nearly human proteins associated with sars-cov (supplementary material, table ). it was also found that these proteins are involved in crucial pathways of the viral infection. the core part (the core network) of the human sars-cov-host ppi network (the giant network) generated by using the string database consisted of nodes and edges (figure ). the number of edges connected to a given node is termed its degree and reflects the significance of the protein in the biological interactions. the highest degree in the core network was found to be , while the average degree was . . the ppi network is characterized by a small number of highly connected nodes, while most of the nodes have only a few connections. the nodes whose degree or bc was in the top % were considered the key nodes, i.e. the critical points. out of nodes in the network, the top nodes with the highest bc values were: the spike glycoprotein, acvr b, cd , alb, myc, b m, creb , phb , stat , and il .
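a network with a few highly connected nodes and many sparsely connected ones is commonly summarized by a power-law fit of the form y = a·x^b, the form this section uses for the core interactome. the sketch below shows one common way to obtain such a fit: ordinary least squares on log-transformed values, using only points with positive coordinates. it is an illustration with hypothetical data, not the networkanalyzer code.

```python
import math


def fit_power_law(xs, ys):
    """Fit y = a * x**b by least squares on (log x, log y).

    Points with a non-positive coordinate are skipped, because their
    logarithm is undefined.
    """
    points = [(math.log(x), math.log(y)) for x, y in zip(xs, ys) if x > 0 and y > 0]
    n = len(points)
    if n < 2:
        raise ValueError("need at least two points with positive coordinates")
    mean_x = sum(px for px, _ in points) / n
    mean_y = sum(py for _, py in points) / n
    sxx = sum((px - mean_x) ** 2 for px, _ in points)
    if sxx == 0:
        raise ValueError("x values must not all be identical")
    sxy = sum((px - mean_x) * (py - mean_y) for px, py in points)
    b = sxy / sxx
    a = math.exp(mean_y - b * mean_x)
    return a, b


# Hypothetical degree distribution following y = 8 * x**-2; the point with
# x = 0 is ignored by the fit.
degrees = [1, 2, 4, 0]
counts = [8.0, 2.0, 0.5, 5.0]
a, b = fit_power_law(degrees, counts)
```

a negative exponent b corresponds to the pattern described above, in which the number of nodes drops quickly as the degree increases.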
these include both viral and human proteins, with the spike glycoprotein identified as the hub node that was further validated as an important target protein. to distinguish these nodes and their roles in the network, they are highlighted in a different colour (figure , supplementary material). the spike glycoprotein was identified as the hub protein with the highest degree and the second highest bc value, while acvr b is the second hub protein, with the highest bc value and the second highest degree. the proteins which interact directly with the spike glycoprotein (first neighbours) are shown in figure (supplementary material). the sars-cov spike glycoprotein, identified as the key protein, binds to the ace receptor, a human cell receptor, through its receptor-binding domain (rbd). the rbd-up conformation of the spike glycoprotein is a prerequisite for the formation of the rbd-ace complex (walls et al., ). the specific interaction between the receptor-binding domain and the ace receptor triggers a drastic conformational change in the s domain of the spike glycoprotein, which leads to viral fusion with the cellular membrane and release of the nucleocapsid into the cytoplasm of the host cell (lan et al., ). the second identified hub protein (acvr b) in the interactome is a transmembrane serine/threonine kinase activin type- receptor. network analyzer v. . . was employed to evaluate the confidence of the core interactome, using a power-law fit of the form y = ax^b. the power-law fit uses the least squares method to determine the topological parameters and considers only the points with positive coordinate values. the betweenness centrality (bc), closeness centrality (cc), and topology correlation coefficient scores of . , . , and . , respectively, were considered as network topology parameters. additionally, the neighbourhood connectivity ( .
) and the shortest path length distribution were also considered in the analysis. the hub proteins in the interactome were determined by using network topology parameters such as bc and the topological correlation coefficient, with cut-off values of . and . , respectively. the topological parameters obtained with these cut-off values were plotted graphically (figure a-d, supplementary material). the extended global network topological measures of the two protein-protein interaction networks, i.e. the giant or core network and the backbone or subnetwork, are presented in table . therefore, the biological process is essentially regulated by the bottleneck node in the interactome.

functional enrichment analysis was carried out using cluego, a plug-in for cytoscape. a total of go terms were collected, out of which , , and go terms corresponded to biological processes (figure (a)), molecular function (figure (b)) and pathways (figure (c)), respectively (listed in the supplementary material, tables - ). genes related to the specific go terms are presented in figure . the protein interaction network from stringvirus showed a dense network of the spike glycoprotein with first neighbour nodes. we carried out a functional enrichment analysis of these protein interactions, which showed that the proteins in the network play a major role in biological processes related to the viral entry into the host cell (go: ), the heterophilic cell-cell adhesion via plasma membrane cell adhesion molecules (go: ), the transition metal ion homeostasis (go: ), the natural killer cell-mediated immunity (go: ), the glycogen catabolic process (go: ) and the regulation of humoral immune response mediated by circulating immunoglobulins (go: ), as well as molecular functions, such as the viral receptor activity (go: ), the mhc protein binding (go: ), the phosphorylase kinase activity (go: ) and the mannose binding (go: ).
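the statistics behind a go term enrichment of this kind can be sketched with a hypergeometric test followed by a step-down bonferroni correction. the code below is a simplified illustration and not the cluego implementation: it computes only the one-sided (enrichment) tail, whereas the analysis here used a two-sided enrichment/depletion test; the step-down correction is implemented as holm's method, which is assumed to correspond to the "bonferroni step-down" option; all counts and p-values are hypothetical.

```python
from math import comb


def hypergeom_enrichment_p(population, annotated, sample, hits):
    """P(X >= hits) when drawing `sample` genes from `population` genes,
    of which `annotated` carry the GO term (one-sided enrichment tail)."""
    upper = min(annotated, sample)
    total = comb(population, sample)
    tail = sum(
        comb(annotated, k) * comb(population - annotated, sample - k)
        for k in range(hits, upper + 1)
    )
    return tail / total


def holm_step_down(p_values):
    """Holm (step-down Bonferroni) adjusted p-values, in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_max = 0.0
    for rank, idx in enumerate(order):
        candidate = min(1.0, (m - rank) * p_values[idx])
        running_max = max(running_max, candidate)  # keep adjusted values monotone
        adjusted[idx] = running_max
    return adjusted


# Hypothetical counts: 10 genes in total, 3 annotated with a term,
# 3 genes in the query list, 2 of them annotated.
p_term = hypergeom_enrichment_p(population=10, annotated=3, sample=3, hits=2)

# Hypothetical raw p-values for three terms.
adjusted = holm_step_down([0.01, 0.04, 0.03])
```

terms whose adjusted p-value falls below the chosen threshold would then be retained for the go/pathway network.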
enriched pathways (reactome) included immunoregulatory interactions between a lymphoid and a non-lymphoid cell, dap interactions, glycogenolysis, and the complement cascade. the go term related to these reactions covers the genes involved in exocytosis of tertiary granule membrane proteins (r-hsa- , clec a, clec a, and ola ). the interaction of sars-cov spike glycoprotein with cellular receptors is indispensable for the viral entry into the host cells. from these enriched go terms, we have identified the genes associated with the viral entry into the host cell and the viral receptor activity: cd , cd , clec m, clec a, and clecag. the clec m (c-type lectin domain family member m) acts as an attachment receptor for sars-cov (marzi et al., ). it was demonstrated that ace and clec m (cd ) are highly expressed in human lung microvascular endothelial cells and lymphatic endothelial cells, respectively (jing et al., ). the ace and other proteins are associated with clec m through a primary interaction and act as receptors for binding of the viral spike glycoprotein (figures and ). several studies have demonstrated that the interaction between the spike glycoprotein and the ace receptor is crucial for the viral entry into the host; thus, targeting this mechanism and identifying inhibitors that interrupt this interaction could yield promising lead compounds (zumla et al., ). recent reports showed that the sars-cov- spike glycoprotein uses the ace receptor to enter the host cell and that the rbds of sars-cov- and sars-cov spike glycoproteins bind with similar affinity to the human ace receptor (walls et al., ). the ligand-binding specificity of the spike glycoprotein rbd was detected by running blind docking and setting both the exhaustiveness and the number of modes to . using blind docking, we analysed the cavity at the spike glycoprotein rbd-hace interface.
the results suggest that the majority of ligands' best scored poses were found in the a and c pockets. additionally, most of the ligands bind at the interface (the c pocket) and have higher binding affinities compared to the a pocket (figure ). therefore, targeting this position may contribute to the interruption of the interaction between the spike glycoprotein rbd and the hace . due to its high ranking and position advantages, we used this cavity (the c pocket) for further studies under the assumption that targeting this region may induce conformational changes that could inhibit virus-host interactions and prevent viral entry. among the compounds identified from the zinc database, binding energies for most of the ligands were found to be between − and − kcal/mol ( ligands). one ligand (s ) showed a lower affinity with the binding energy of − . kcal/mol and eight ligands showed binding energies of − kcal/mol or lower (supplementary material, table ). among them, the top ligands were selected based on the interaction pattern with the spike glycoprotein rbd (chain b) for the second round of the docking by setting the exhaustiveness parameter to . analysis of the top molecules interacting with the spike glycoprotein alone showed a good inhibition potential. the s molecule forms two h-bonds, with asn b and gly b of spike glycoprotein, and has a binding energy of − . kcal/mol. similarly, calculated binding energies for the s and s ligands were − . and − . kcal/mol, respectively. however, each forms only one favourable h-bond interaction, with gly b and asn b, respectively. on the other hand, the other three molecules form no h-bonds with the spike glycoprotein rbd (table ). after this, we merged the hace domain with the spike glycoprotein rbd-s complex, and found a hydrophobic interaction between spike glycoprotein rbd and tyr a of the hace domain (yan et al., ).
figure . functional assessment analysis of the spike glycoprotein first neighbour proteins. the genes recognized as close neighbours of the spike glycoprotein are highlighted in different colours based on their functional enrichment: genes for the viral entry into the host cell and the viral receptor activity are shown in green, genes for the heterophilic cell-cell adhesion via plasma membrane cell adhesion molecules in grey, genes for the transition metal ion homeostasis in light blue, genes for the natural killer cell-mediated immunity in dark blue, genes for the glycogen catabolic process in red, genes for the regulation of humoral immune response mediated by circulating immunoglobulin in pink, genes for the mhc protein binding in sky blue, and genes for the immunoregulatory interactions between a lymphoid and a non-lymphoid cell in yellow. the genes shown in the subnetwork (orange) show the interaction of clec m protein with the human ace protein (receptor for the spike glycoprotein), and the genes which are involved in the viral entry into the host.
the binding analysis of the s ligand compared to the reference ssaa ligand is shown in figure . the results suggest that the h-bond interaction with the asn b residue of the spike glycoprotein may play a key role in destabilizing hace (chain a) by forming hydrophobic interactions with the tyr a residue, since tyr a was found to play a crucial role in the formation of the spike glycoprotein rbd-hace complex (lan et al., ; yan et al., ). these results encouraged us to investigate the complicated role of the spike glycoprotein rbd-hace complex in virus entry into the cells. to focus on the role of amino acid residues asn b and tyr a, we performed a docking simulation of the top ligands at the interface of the spike glycoprotein rbd-hace complex (amino acid residues were taken from pdbsum) by setting exhaustiveness to (supplementary material, table and figure ).
these calculations showed significant h-bond interactions of all ligands and the reference ssaa ligand with amino acids his a, arg b, asp a, asn a, and gly b at the interface. equally, all ligands form hydrophobic interactions with amino acids tyr b, tyr b, tyr b, pro a, ala a, val a, and leu a. the binding modes of s and the reference ssaa ligand at the interface of spike-ace are shown in figure . s , which has a similar fingerprint as s , forms two hbond interactions with asp a and gly b and forms favourable hydrophobic interactions with val a, leu a, tyr b, phe b, and tyr b. tyr b, tyr b, and tyr b were also found to play a key role in forming interaction with the spike glycoprotein alone. interestingly, robetta alanine scanning (kortemme et al., ) results also replicated the docking results by confirming the mutagenic amino acid residue tyr a (hace ) with a ddg complex score of . kcal/mol and tyr b (spike glycoprotein) with ddg complex score of . kcal/mol. the docking calculations at the spike glycoprotein rbd-hace interface showed an even better binding energy and formation of h-bond interactions, and the detailed interaction analysis for the top ligands is reported in table . among the top ligands, s and s were found to show good binding energies, hbond interactions, hydrophobic interactions, as well as alanine scanning by forming favourable interaction with tyr a and tyr b. therefore, apart from h-bond interacting amino acids asn b, asp a, asn a, gly b, the hydrophobic residues tyr a and tyr b were also found to be important hotspots responsible for the binding of the spike glycoprotein to the hace domain. interestingly, s forms interactions with hotspot residues tyr a and tyr b along with the h-bond interactions at the interface of the spike glycoprotein rbd (gly b) and the hace domain (asp a, asn a). 
these observations clearly explain the main reason behind the tighter affinity of the sars-cov- spike glycoprotein to hace when compared to that of sars-cov (masters, ). our results suggest that s is a potent inhibitor of both the spike glycoprotein alone and the spike glycoprotein rbd-hace complex. structure-based drug design showed that the interaction of s at the interface of the spike-hace involves forming hydrogen bonds and favourable hydrophobic interactions, which play a major role in destabilizing the spike-hace interaction, thus inhibiting viral entry into human cells. the top molecules were subjected to molecular fingerprinting analysis (fp) with the maccs and morgan circular fingerprint methods to check their similarity against all the molecules collected from the zinc database. the importance of fp method selection for virtual screening has been highlighted, and the differences in results obtained with different fp approaches have been analysed (cereto-massagué et al., ; matsuyama & ishida, ). the extended connectivity fingerprint diameter (ecfp) offers the highest precision on average, according to database search by compound similarity based on fp (riniker & landrum, ). in our analysis, compounds s and s are structurally similar, with both having piperazine, benzene, and h-quinolin- -one in the structure. the only difference between these two compounds is the presence of the nitrogen atom in the piperazine ring of the s compound, which makes the compound more rotatable. further, molecular fingerprinting analysis showed a similarity score of . between s and s in the morgan circular fingerprint. other lead compounds such as s , s and s also share common traits, such as the chloro-fluorobenzene functional group. the s compound, along with chloro-fluorobenzene, also contains the -azaspiro[ . ]nonane functional group, and the s compound contains pyrrolidine and cyclohexane rings along with the chloro-fluorobenzene group.
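the fingerprint similarity scores discussed here can be illustrated with a minimal sketch. the text does not state which similarity coefficient was used; the tanimoto coefficient, which is a common choice for maccs and morgan fingerprints, is assumed. fingerprints are represented as sets of on-bit indices, and the two example fingerprints are invented for illustration, not derived from the actual compounds.

```python
def tanimoto(fp_a, fp_b):
    """Tanimoto coefficient between two fingerprints given as on-bit indices.

    T = |A & B| / |A | B|. Two empty fingerprints are assigned 0.0 here,
    which is a convention chosen for this sketch.
    """
    a, b = set(fp_a), set(fp_b)
    union = a | b
    if not union:
        return 0.0
    return len(a & b) / len(union)


# Hypothetical on-bit indices for two closely related compounds.
fp_first = {1, 5, 9, 12}
fp_second = {1, 5, 9, 20}

# Three shared bits out of five distinct bits in total.
score = tanimoto(fp_first, fp_second)
print(f"tanimoto similarity: {score:.2f}")
```

in practice the bit sets would come from a cheminformatics toolkit; only the comparison step is sketched here.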
finally, s compound contains a piperidine ring along with the chloro-fluorobenzene group. among these, the s compound shows the similarity of score . and . with s and s , respectively (supplementary material, table ). md simulations were carried out for all the top five compounds (s , s , s , s , and s ) and the reference ssaa ligand complexed with the spike glycoprotein. all complexes were found to be stable throughout the entire duration of the simulation ( ns). however, since only the ligands s and s had dg bind significantly lower than the reference ssaa ligand, they will be discussed more thoroughly (the complete mm-gbsa results are shown in table supplementary material). figure shows the backbone mass-weighted root-meansquare deviation (rmsd) for the s , s and the reference ssaa ligand complexes through time. it can be seen that all three complexes achieve the equilibrium state very early and remain stable for the entire duration of the simulation. for the ssaa and the s ligand, this can also be seen in the intermolecular h-bond graph (figure ) , where the number of h-bonds for all three ligands remains constant through time (with the reference ssaa ligand forming on average . ± . , ligand s . ± . ), while the ligand s ( . ± . intermolecular h-bonds) undergoes a slight conformational change around the ns mark, but without any significant influence on the protein structure. this indicates that the docking procedure was successful in finding the correct binding poses for the ssaa and the s ligands since these ligands did not need to optimize their conformation inside the binding pocket, while the s ligand had to go through some conformational changes to obtain the optimal pose. this is in accordance with the docking results, where the binding affinity of the s and s ligands are almost the same, while the mm-gbsa results differ. binding energies for all the complexes were calculated using the mm-gbsa protocol. 
since all the complexes are stable throughout the entire simulation, their entire trajectories ( ns) were divided into ten segments of ns. dg bind was calculated for all segments individually and the final dg bind was calculated as mean ± standard deviation. for the reference ssaa ligand dg bind = − . ± . kcal/mol, for the s ligand dg bind = − . ± . kcal/mol, and for the s ligand dg bind = − . ± . kcal/mol (table ). the dg bind values for ligands s , s , and s are − . ± . kcal/mol, − . ± . kcal/mol, and − . ± . kcal/mol, respectively (supplementary material, table ). it has to be emphasized that since the entropy term was not calculated, these results are overestimated in their absolute terms (genheden & ryde, ). however, since all ligands bind to the same binding site of the same protein, the entropic contribution in all cases would also be approximately the same. therefore, this method can be used in predicting relative binding energies in biomolecular complexes and their comparison (homeyer & gohlke, ). for this reason, the obtained dg bind of the tested compounds should only be analysed relative to each other. that being said, the mm-gbsa results are in accordance with the h-bonds analysis: the s ligand forms a slightly higher number of intermolecular h-bonds than the s ligand and has a slightly higher binding affinity, similar to the docking results. additional decomposition of dg bind (supplementary material, table (a), (b), and (f)) shows that throughout the simulation time, s and s ligands are more stable in their binding sites, with a lower standard deviation of all types of interactions, excluding the non-polar solvation interaction for the s ligand, and with significantly more favourable van der waals and electrostatic interactions. from table it is also visible which amino acid residues contribute the most to the binding of the tested ligands.
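the segment-wise averaging described above (ten segments, dg bind per segment, final value reported as mean ± standard deviation) can be sketched as follows. the per-frame energies are hypothetical, the function name `block_average` is illustrative, and the use of the sample standard deviation across segments is an assumption, since the text does not specify it.

```python
import statistics


def block_average(values, n_blocks=10):
    """Split per-frame values into equal consecutive blocks and summarise them.

    Returns (mean of block means, sample standard deviation of block means,
    list of block means). Trailing frames that do not fill a complete block
    are dropped.
    """
    if n_blocks < 2:
        raise ValueError("need at least two blocks to compute a standard deviation")
    block_len = len(values) // n_blocks
    if block_len == 0:
        raise ValueError("not enough frames for the requested number of blocks")
    block_means = [
        statistics.mean(values[i * block_len:(i + 1) * block_len])
        for i in range(n_blocks)
    ]
    return statistics.mean(block_means), statistics.stdev(block_means), block_means


# Hypothetical per-frame binding energies in kcal/mol (20 frames, 2 per block).
frame_energies = [-30.0, -32.0] * 5 + [-32.0, -34.0] * 5
dg_mean, dg_sd, per_block = block_average(frame_energies, n_blocks=10)
print(f"dG_bind = {dg_mean:.2f} +/- {dg_sd:.2f} kcal/mol")
```

reporting the spread across segments rather than across individual frames gives a rough indication of how much the estimate drifts over the trajectory, which fits the relative comparison between ligands made in the text.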
in the case of the reference ligand and the s ligand, the most contributing residue is pro a, but for the s ligand, its contribution is significantly higher. additionally, the s ligand forms strong bonds with residues lys a and thr a, which are not present in the list of top contributing residues in the case of the standard ssaa ligand. on the other hand, the standard ssaa ligand forms a much stronger bond with the his a residue than the s ligand. however, even with these differences, the most important amino acid residues for these two complexes are more similar to each other than to those of the complex with the s ligand. in contrast, in the case of the s ligand, eight of the ten most important amino acid residues come from the b chain, which is a result of its position deeper inside the interaction pocket (figures and ). given that the structural difference between ligands s and s is only in one nitrogen atom (piperidine and piperazine rings, respectively), this shows how small structural changes can have a significant influence on ligand binding. an interesting observation can also be made when comparing the per residue root-mean-square fluctuation (rmsf) for all three complexes (figure ). while the a chain has practically the same rmsf in all three cases, rmsf for the chain b shows significant differences in residues - . for all amino acid residues except ser , rmsf is the lowest for the s ligand, followed by the s ligand and the reference ligand, which corresponds to their decreasing binding affinities. it also seems that the s ligand stabilizes the chain b the most, which is in accordance with the fact that, among the three ligands, it has the strongest interactions with it. however, this part of the chain b is not in the vicinity of the binding pocket, so the exact stabilization mechanism remains unknown.
figure . overlay of the spike glycoprotein complexes with the reference ssaa ligand (tan), the s ligand (light blue), and the s ligand (pink) after ns md simulation with top contributing amino acid residues (yellow) shown.
figure . binding interactions of the s and the s ligand at the interface of the spike glycoprotein rbd-hace complex after ns md simulations.
all pharmacokinetic properties of s and s were predicted using the preadmet and molinspiration online tools (jan et al., ). the results are in the acceptable range and follow lipinski's rule of five (supplementary material, tables and ). additionally, the metabolic site analysis of s (zinc ) and s (zinc ) predicted that they are not inhibitors of cytochrome enzymes but are cyp a substrates, with putative metabolic sites c , c , c , c and c , c , c , respectively, as depicted in supplementary material, figure a and b. smartcyp uses smiles arbitrary target specification (smarts)-based fragment matching in combination with an accessibility descriptor to rank the sites of metabolism (soms) (hashem & mahrouse, ). here, the primary site of cyp-mediated metabolism for the s ligand was predicted to be c and c of the amine group and c of the aryl fluoride group. similarly, the s ligand belongs to the quinoline- -carboxamide class of organic compounds, where the quinoline ring is substituted by one carboxamide group at the -position. these compounds were found to inhibit the nipah virus glycoprotein g/f-mediated cell-cell fusion expressed in african green monkey vero cells after h relative to untreated control (niedermeier et al., ).
in this study, we adopted a systems biology method to construct an extended ppi network of sars-cov and associated human proteins. our findings suggest that the spike glycoprotein has the highest degree and the second-highest bc, and acvr b has the second highest degree and the highest bc.
the spike glycoprotein is mainly involved in binding to the human ace receptor, while human acvr b is involved in the transmembrane receptor protein serine/threonine kinase signalling pathway. both proteins are essential in the viral entry and causing infection in humans. furthermore, studies on the sars-cov spike glycoprotein rbd inhibition with the top five ligands were successfully carried out using molecular dynamics approach. ligands s and s were found to be selectively interacting with the tyr a and tyr b hotspots inside the binding pocket via formation of an inclined tape over the binding site with the oh group. these results demonstrate the likelihood of the ethyl -f - [( , -dichlorobenzyl) carbamoyl]- -ethyl- -fluoro- -oxo- , dihydro- -quinolinylg- piperidine carboxylate (the s ligand) and ethyl -f -[( , -dichlorobenzyl) carbamoyl]- -ethyl- -fluoro- -oxo- , -dihydro- -quinolinylg- piperazine carboxylate (the s ligand) activity to block the virus spike glycoprotein rbd from docking to hace . the trajectory analysis of the spike rbd-hace -s /s complexes also displayed structural stability and lower binding free energy when compared to the complex with the reference ssaa ligand. however, these computationally validated results need to be investigated on in vivo models before classifying molecules as potential covid- inhibitors. authors provided critical feedback and helped shape the research, analysis and manuscript. no potential conflict of interest was reported by the author(s). 
in silico study the inhibition of angiotensin converting enzyme receptor of covid- by ammoides verticillata components harvested from western algeria novel inhibitors of severe acute respiratory syndrome coronavirus entry that act by three distinct mechanisms statistical mechanics of complex networks network biology: understanding the cell's functional organization truncated human angiotensin converting enzyme ; a potential inhibitor of sars-cov- spike glycoprotein and potent covid- therapeutic agent the protein data bank cluego: a cytoscape plug-in to decipher functionally grouped gene ontology and pathway annotation networks molecular fingerprint similarity search in virtual screening viruses.string: a virus-host protein-protein interaction database particle mesh ewald: an nÁlog(n) method for ewald sums in large systems computation of drug-binding thermodynamics the mm/pbsa and mm/gbsa methods to estimate ligand-binding affinities ucsf chimerax: meeting modern challenges in visualization and analysis insights into protein-protein binding by binding free energy calculation and free energy decomposition for the ras-raf and ras-ralgds complexes in vitro metabolism study of a novel p kinase inhibitor: in silico predictions, structure elucidation using ms/ms-i free energy calculations by the molecular mechanics poisson-boltzmann surface area method assessing the performance of the mm/pbsa and mm/gbsa methods. . 
the accuracy of binding free energy calculations based on molecular dynamics simulations integrated web visualizations for protein-protein interaction databases discovery of potential multi-target-directed ligands by targeting host-specific sars-cov- structurally conserved main protease computational alanine scanning of protein-protein interfaces structure of the sars-cov- spike receptor-binding domain bound to the ace receptor pdbsum: structural summaries of pdb entries computational network biology: data, models, and applications computational analysis on the ace -derived peptides for neutralizing the ace binding to the spike protein of sars-cov- split the charge difference in two! a rule of thumb for adding proper amounts of ions in md simulations ff sb: improving the accuracy of protein side chain and backbone parameters from ff sb dc-sign and dc-signr interact with the glycoprotein of marburg virus and the s protein of severe acute respiratory syndrome coronavirus the molecular biology of coronaviruses stacking multiple molecular fingerprints for improving ligand-based virtual screening autodock and autodocktools : automated docking with selective receptor flexibility computational studies of drug repurposing and synergism of lopinavir, oseltamivir and ritonavir binding with sars-cov- protease against covid- open babel: an open chemical toolbox oncogenic re-wiring of cellular signaling pathways ucsf chimera-a visualization system for exploratory research and analysis fast and accurate predictions of binding free energies using mm-pbsa and mm-gbsa open-source platform to benchmark fingerprints for ligand-based virtual screening ptraj and cpptraj: software for processing and analysis of molecular dynamics trajectory data numerical integration of the cartesian equations of motion of a system with constraints: molecular dynamics of n-alkanes smartcyp: a d method for prediction of cytochrome p -mediated drug metabolism a small-molecule inhibitor of nipah virus 
envelope protein-mediated membrane fusion schr€ odinger release - : maestro ( ). schr€ odinger, llc zinc -ligand discovery for everyone biological network exploration with cytoscape sars-cov- entry factors are highly expressed in nasal epithelial cells together with innate immune genes autodock vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading structure, function, and antigenicity of the sars-cov- spike glycoprotein automatic atom type and bond type perception in molecular mechanical calculations cryo-em structure of the -ncov spike in the prefusion conformation structural basis for the recognition of sars-cov- by full-length human ace understanding human-virus protein-protein interactions using a human protein complex-based analysis framework a pneumonia outbreak associated with a new coronavirus of probable bat origin coronaviruses -drug discovery and therapeutic options the authors are thankful to registrar, nitte (deemed to be university), mangalore, india for providing all the facilities to complete this work. the authors also acknowledge the university of zagreb, university computing centre (srce) for granting computational time on the isabella cluster. . per residue root-mean-square fluctuation (rmsf) for complexes with the reference ssaa , the s ligand, and the s ligand. key: cord- -h xjudm authors: nyon, mun peak; du, lanying; tseng, chien-te kent; seid, christopher a.; pollet, jeroen; naceanceno, kevin s.; agrawal, anurodh; algaissi, abdullah; peng, bi-hung; tai, wanbo; jiang, shibo; bottazzi, maria elena; strych, ulrich; hotez, peter j. title: engineering a stable cho cell line for the expression of a mers-coronavirus vaccine antigen date: - - journal: vaccine doi: . /j.vaccine. . . 
sha: doc_id: cord_uid: h xjudm abstract middle east respiratory syndrome coronavirus (mers-cov) has infected at least patients and caused deaths since its first appearance in , yet neither pathogen-specific therapeutics nor approved vaccines are available. to address this need, we are developing a subunit recombinant protein vaccine comprising residues – of the mers-cov spike protein receptor-binding domain (rbd), which, when formulated with the addavax adjuvant, induces a significant neutralizing antibody response and protection against mers-cov challenge in vaccinated animals. to prepare for the manufacture and first-in-human testing of the vaccine, we have developed a process to stably produce the recombinant mers s - protein in chinese hamster ovary (cho) cells. to accomplish this, we transfected an adherent dihydrofolate reductase-deficient cho cell line (adcho) with a plasmid encoding s - fused with the human igg fc fragment (s - -fc). we then demonstrated the interleukin- signal peptide-directed secretion of the recombinant protein into the extracellular milieu. using a gradually increasing methotrexate (mtx) concentration to μm, we increased protein yield by a factor of . the adcho-expressed s - -fc recombinant protein demonstrated functionality and binding specificity identical to those of the protein from transiently transfected hek t cells. in addition, hcd /dipeptidyl peptidase- (dpp ) transgenic mice vaccinated with addavax-adjuvanted s - -fc could produce neutralizing antibodies against mers-cov and survived for at least days after challenge with live mers-cov with no evidence of immunological toxicity or eosinophilic immune enhancement. to prepare for large-scale manufacture of the vaccine antigen, we have further developed a high-yield monoclonal suspension cho cell line. 
with over deaths and confirmed cases since its original appearance on the arabian peninsula in [ ] , middle east respiratory syndrome (mers) coronavirus (mers-cov) has emerged as an important global pathogen and potential pandemic threat. there remains a critical need for a vaccine targeting mers-cov [ ] , and the newly established coalition for epidemic preparedness innovation (cepi) has now designated research and development for the mers-cov vaccine as a global priority [ ] . recently, phase i studies of dna-based vaccines against mers-cov showed that % of vaccinated volunteers generated antibody against mers cov [ ] . however, to date there is no licensed dna vaccine for humans due in part to questions about their long-term safety, and their ability to induce high titers of protective or neutralizing antibodies relative to recombinant protein-based vaccines [ , ] . a lead candidate for such a protein-based vaccine is the receptor-binding domain (rbd) of the mers-cov spike (s) protein. the mers-cov rbd plays an essential role in host receptor binding, membrane fusion, and cell entry [ ] [ ] [ ] , thus making it an ideal vaccine target. moreover, focusing on the rbd component, rather than the full-length s protein, reduces the likelihood of eosinophilic-or antibody-dependent immune enhancement [ , ] . expressed and purified as a recombinant fusion protein with the human fc fragment, the addavax(mf -like)-adjuvanted rbd (residues - ) of mers-cov has been shown to elicit high neutralizing antibody titers in both mice and rabbits [ , [ ] [ ] [ ] [ ] [ ] . these antibodies displayed potent neutralizing activity against almost human and camel mers-cov strains, including those with amino acid mutations in the rbd region of their spike proteins [ , , ] . 
in a complementary approach, recombinant rbd mutants representing different human and camel virus isolates were all able to elicit broad-spectrum neutralizing antibodies against a wide range of human and camel mers-cov strains [ ]. antibodies against the rbd were able to block the binding of the rbd to the mers-cov's cellular receptor, dpp [ ], and thus block the viral entry into permissive human cells [ ]. most importantly, recent studies found that these neutralizing antibodies were indeed associated with protection, when vaccinated ad -hdpp -transduced mice and hdpp -transgenic mice were found to be immune against lethal mers-cov challenge [ , , ]. although the vaccine antigen has been produced at small, laboratory scale in a transient hek cell system [ ], little effort has been put forth to develop and scale up production of mers-cov rbd suitable for future vaccine manufacture. therefore, we have now engineered a stable cho cell line suitable for producing this mers-cov protein vaccine antigen.
the codon-optimized dna sequence encoding the human igg fc-fused s - and the signaling peptide of interleukin- (il ) residing at the n-terminus were synthesized and cloned into pjet . using xbai and noti restriction sites (genscript) (pjet . _il _s - -fc). sequences encoding signal peptides derived from igk light chain (igk), human serum albumin (sa), and azurocidin (azu) were incorporated into the mers s - -fc by touchdown pcr with ultramer oligomers (table , supp. table ). the pcr products were gel purified using a qiaquick pcr purification kit (qiagen) and subsequently cloned into the poptivec-topo vector (invitrogen), followed by escherichia coli top transformation. ampicillin-resistant transformants were selected on lb agar plates containing μg/ml of ampicillin and subsequently grown in lb broth. plasmid dna prepared from isolated colonies was sequenced. to construct the expression plasmid with the il signal peptide, pjet . _il _s - -fc was digested with xbai and noti restriction enzymes, and the gene cassette was gel purified. the expression plasmid poptivec was digested with the same enzymes and gel purified, followed by ligation with t dna ligase.
adherent, dihydrofolate reductase (dhfr)-deficient cho cells (adcho) (atcc® crl- ™, cho-dhfr) were cultured in iscove's modified dulbecco's medium (imdm, gibco) supplemented with mm l-glutamine (gibco), % fetal bovine serum (fbs, gibco) and . mm sodium hypoxanthine/ . mm thymidine (h/t, gibco). to establish s - -fc-expressing adcho cell lines, x cells were transfected with μg plasmid dna with different signal peptides using lipofectamine® (invitrogen), according to the manufacturer's instructions. all plasmid dnas were linearized with ahdi prior to transfection. stable transfectants were selected by culturing in selective medium (same medium as previously described without h/t supplementation). cells were passaged every days after reaching a maximum cell density of .  viable cells per ml (split ratio : ). to investigate the effect of the different signal peptides on protein expression, conditioned medium from each transfection was collected for quantitative analysis. for gene amplification, transfected adcho cells were grown to confluence ( days) in the selective medium supplemented with mtx. the concentration of mtx was increased gradually during each passage ( days per passage) of adcho cells ( nm > nm > nm > nm > nm > nm > nm > nm > nm > nm, [ ] ).
the recombinant mers s - -fc was loaded onto a hitrap protein-a hp column (ge) at a flow rate of ml/min. the column was washed with x pbs (ph . , amresco) for column volumes (cvs) and eluted with cvs of - % elution buffer ( mm citric buffer ph . ), followed by cvs of % elution buffer. the elution fractions ( . ml) were collected in tubes containing . ml of m tris-hcl (ph . ) to elevate the ph of the eluted protein to ph . . 
sds-page and western blotting analysis were performed with rabbit anti-mers-cov rbd ( : , [ ] ) and rabbit anti-bovine (fab) -biotin ( : , sigma) antibodies to identify the s - -fc protein in the elution fractions. the peak fractions containing the mers s - -fc protein were pooled and concentrated to mg/ml with amicon ultra- centrifugal filter unit (mwco kda) and buffer-exchanged into mm tris-hcl and mm nacl (ph . ). the protein secondary structure was predicted from circular dichroism (cd) spectra. samples for cd experiments were prepared in mm citrate phosphate at a concentration of . mg/ml. cd spectra were recorded with a jasco j- s spectrophotometer, scanning from nm to nm at nm/min with a bandwidth of nm and response time of s. experiments were performed using one quartz cuvette with a path length of . cm, keeping a constant temperature of °c. the average value was determined after five scans, and the spectrum of the matching 'buffer alone' sample served as the control. the secondary structure was predicted using the cdpro software by comparing with three reference sets (sp , sdp and smp ) and using two data fitting programs (contin and cdsstr). a real-time protein melt experiment was performed using protein thermal shift™ dye (thermo fisher scientific) in an applied biosystems viia real-time pcr system (thermo fisher scientific), yielding a fluorescence profile specific to purified mers s - -fc. to evaluate ptm, the purified protein was treated with peptide-n-glycosidase f (pngasef), o-glycosidase and neuraminidase in a nondenatured form and subsequently analyzed by gel electrophoresis.
table . overview of tested signal peptides in the adcho expression system (columns: signal peptide; signal peptide sequence; ref.).
a co-ip assay was carried out to analyze the interactions between adcho-expressed mers s - -fc and human dpp (hdpp ) receptor in huh- cell lysates, using mers s - -fc expressed in hek t cells [ ] as a positive control and hek t-expressing sars-cov rbd-fc as a negative control. the huh- cell lysates (  /ml in ml lysis buffer containing . % n-decyl-d-maltopyranoside-phosphate-buffered saline [ ] ) and sars-cov rbd-specific g mab (anti-sars-rbd, lg/ml, [ ] ). flow cytometry analysis was carried out to quantify the binding between mers-cov rbd and hdpp -expressing huh- cells. cells were incubated with s - -fc ( lg/ml), expressed either in adcho cells or hek t cells, for min at room temperature, followed by addition of fitc-labeled anti-human igg antibody (abcam) for min. cells were then analyzed by flow cytometry. to detect binding between mers-cov rbd and hdpp protein, -well elisa plates were pre-coated overnight at °c with ll of lg/ml purified s - -fc protein, expressed either in hek t cells [ ] or in adcho cells, and blocked with % fatfree milk at °c for h. serial dilutions of hdpp protein (histagged, ll/well) were then added to the plates and incubated at °c for . h, followed by four washes with pbs in . % tween- (pbst). subsequently, the plates were incubated with mouse anti-his primary antibody ( : , sigma) at °c for . h. after four washes with pbst, horseradish peroxidase (hrp)-conjugated anti-mouse igg antibody ( : , ge healthcare) was added to the wells and incubated at °c for min. finally, plates were washed with pbst, and binding was visualized by adding the colorogenic substrate , , , -tetramethylbenzidine (tmb, sigma). the reaction was stopped after min by adding n h so , and absorbance at nm (a ) was measured on an elisa plate reader (tecan). 
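The elisa readout above relies on serially diluted receptor protein and an absorbance reading per well. The sketch below shows one way to lay out such a series and check for dose dependence. The dilution factor, starting concentration, and blank value are assumptions for illustration only; the text does not state them, and the helper names are hypothetical.

```python
def serial_dilution(start_conc, factor, n_wells):
    """Concentrations for a series in which each well is diluted `factor`-fold from the last."""
    if start_conc <= 0 or factor <= 1 or n_wells < 1:
        raise ValueError("need start_conc > 0, factor > 1 and n_wells >= 1")
    return [start_conc / factor ** i for i in range(n_wells)]


def blank_corrected(absorbances, blank):
    """Subtract the reading of a no-analyte well from every absorbance."""
    return [a - blank for a in absorbances]


def is_dose_dependent(absorbances):
    """True if the signal never increases as the analyte is diluted further."""
    return all(a >= b for a, b in zip(absorbances, absorbances[1:]))


if __name__ == "__main__":
    concs = serial_dilution(8.0, 2, 4)  # assumed two-fold series, arbitrary units
    readings = blank_corrected([2.1, 1.3, 0.7, 0.2], blank=0.1)  # invented readings
    for conc, reading in zip(concs, readings):
        print(conc, round(reading, 3))
    print("dose dependent:", is_dose_dependent(readings))
```

A monotonic check of this kind is only a coarse screen; fitting a binding curve would be needed to compare affinities between the adcho- and hek t-expressed proteins.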
detection of the binding between mers-cov rbd and rbd-specific neutralizing mabs was performed following a protocol similar to that described above, except that the plates were precoated with µg/ml of purified s - -fc proteins, followed by sequential incubation with serially diluted mouse mab (mers-mab ) or human mabs (m -fab, m -fab, and m -fab) [ ] [ ] [ ] and hrp-conjugated anti-mouse igg ( : , for mouse mab, ge healthcare) and anti-human-fab ( : , for human fab-mabs, sigma) antibodies. the binding between denatured mers-cov rbd and the aforementioned rbd-specific neutralizing mabs or rbd-immunized mouse sera was tested by elisa as described above, except that the plates were pre-coated with s - -fc ( µg/ml) protein treated with dithiothreitol (dtt) ( mm, sigma) at °c for h, followed by incubation with iodoacetamide ( mm, sigma) at °c for h to stop the reaction [ ] . after three washes, the elisa was carried out as described above. all in vitro and in vivo studies required the usage of infectious mers-cov (emc/ strain) and were conducted within approved biosafety level (bsl- ) and animal bsl- laboratories at the galveston national laboratory, strictly following approved notification-of-usage (nou) and animal protocols and the guidelines and regulations of the national institutes of health and aaalac. for a "proof-of-principle" study to confirm that adcho-expressed mers s - -fc is an effective and safe vaccine, two groups of five age-matched cd /dpp transgenic (tg) mice were immunized twice, four weeks apart, via the intramuscular (i.m.) route, with either µg of mers s - -fc formulated with addavax (invivogen) or pbs/addavax only (as control). this immunization protocol was selected because it is optimized for mers-cov rbd proteins [ ] . the addavax adjuvant was chosen because, among several adjuvants tested in our previous studies, it enabled the rbd-fc protein to elicit the highest neutralizing antibody titers [ ] .
serum specimens were collected at day after the second immunization through the retro-orbital bleeding route to determine the prospective capacity to neutralize infectious mers-cov by using the standard vero e -based micro-neutralization test. immunized mice were subsequently challenged intranasally (i.n.) with x % lethal dose (ld ) ($ tcid ) of mers-cov (emc/ strain), a gift of heinz feldmann (nih, hamilton, mt) and ron a. fouchier (erasmus medical center, rotterdam, the netherlands), followed by daily monitoring for the onset of clinical manifestations (e.g., weight loss and other clinical manifestations) and mortality. three mice from each group were euthanized at day post-infection (p.i.) to assess lung viral loads by vero e -based infection assay and quantitative (q) pcr analysis targeting the upe gene of mers-cov for quantifying infectious virus and viral rna, respectively. additionally, de-paraffinized lung tissues were hematoxylin-and-eosin (h&e)-stained for routine histopathologic evaluations, as described. we continued to monitor the remaining two mice in each group for their overall well-being for a total of weeks until terminating the experiment. all methodologies required to assess the immunogenicity (neutralization antibody titers) and efficacy of mers s - -fc have been previously reported ( [ , ] , supplementary methods). suspension cho (cho dg , gibco, hereinafter termed sus-cho) cells were cultured in cd dg medium (gibco) supplemented with mm l-glutamine (gibco) and . % pluronic Ò f- prior to transfection. transfection was performed by combining lg ahdi-linearized plasmid popti_il _s - -fc and ll of freestyle tm max reagent (invitrogen) in . ml optipro sfm and incubated at room temperature for min, followed by dropwise addition to .  cells in ml of cd dg culture medium (non-selective) according to the manufacturer's instructions. after h, cells were transferred to selective medium (cd opti-cho, invitrogen), supplemented with mm l-glutamine and . 
% pluronic Ò f- (gibco), and cultivated until cell viability reached %. after selection, stably transfected suscho cells underwent dna amplification by gradually increasing mtx concentration ( - nm) in selective medium. all suspension culture flasks were maintained in a humidified incubator, °c/ % co on a shaker, at a constant rotation rate of rpm. the lm mtx-adapted suscho cell pools from serum-free medium were used for single-cell cloning by limited dilution at . - cells/well. cloning was performed in -well plates (falcon u-bottom untreated), utilizing a cloning medium composed of % hybridoma sfm (clonacell) and % conditioned media supplemented with . x cho acf supplement (clonacell) at °c/ % co for approximately days. conditioned medium from wells with actively growing single colonies was assayed by elisa as follows. a mixture of ll of conditioned medium and ll of coating buffer were added to a -well elisa plate (thermo fisher scientific) and incubated at °c overnight. after washing the plate with pbst, rabbit anti-human igg was used to detect s - -fc protein, followed by a biotinylated goat antirabbit antibody and streptavidin hrp. tetramethylbenzidine (kpl inc., vwr) was added ( ll) before the reaction was stopped with ll m hcl. elisa plates were read on an epoch microplate spectrophotometer (biotek instruments, inc) at nm. clones which gave high absorbance reading were further propagated in -well plates, utilizing a cd opticho medium with mm l-glutamine and . % pluronic Ò f- . conditioned medium from -well plates was also screened by elisa for confirmation, and the highest expressing clone was selected and expanded by passaging in shake flasks ( °c, % co in air on an orbital shaker platform rotating at rpm). clonal cell lines and heterogeneous suscho cell pools were collected daily (up to days) and counted using the acridine orange (ao) and propidium iodide (pi) nuclear staining dyes (nexcelom bioscience), which enter live cells and dead cells, respectively. 
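Single-cell cloning by limited dilution, as described above, depends on seeding so few cells per well that most growing wells start from one cell. A common way to reason about this is a Poisson model, sketched below. The model itself is an assumption not stated in the text (random, independent seeding and every seeded cell growing out), and the seeding densities used are illustrative because the actual values are not recoverable from the text.

```python
import math


def poisson_pmf(k, lam):
    """Probability of exactly k cells in a well when the mean is lam cells/well."""
    if lam <= 0 or k < 0:
        raise ValueError("need lam > 0 and k >= 0")
    return math.exp(-lam) * lam ** k / math.factorial(k)


def fraction_monoclonal(lam):
    """P(exactly one cell | at least one cell): share of growing wells that are clonal."""
    p_one = poisson_pmf(1, lam)
    p_growth = 1.0 - poisson_pmf(0, lam)
    return p_one / p_growth


if __name__ == "__main__":
    for density in (0.3, 0.5, 1.0):  # hypothetical cells/well
        empty = poisson_pmf(0, density)
        print(density, round(empty, 3), round(fraction_monoclonal(density), 3))
```

Lower seeding densities raise the share of clonal wells at the cost of more empty wells, which is why a second screening round is often used to confirm the selected clone.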
conditioned medium from each time point was analyzed by sds-page, and the protein concentration was estimated by densitometry, comparing it to protein standards using the chemidoc tm imaging system (biorad). statistical significance was calculated by student's t test using graphpad prism statistical software. ⁄⁄⁄ indicates p < . . in our previous studies, the human igg fc-fused s - fragment of the mers-cov s protein (genbank afs . ) (hereinafter termed mers s - -fc) had already been expressed in transiently transfected hek t cells [ ] . however, to establish stably expressing cell clones, we transfected an adherent cho (adcho) with a poptivec construct containing an internal ribosome entry site (ires)-driven dhfr gene for selection and copy number amplification, as well as mers s - -fc gene fusion. signal peptides were added to the n-terminal end of the s - -fc gene in order to drive secretion into the culture medium (fig. a) . addition of gradually increasing methotrexate (mtx) to the culture medium of transfected cells resulted in binding to and inactivation of dihydrofolate reductase (dhfr) activity. transfected adcho cells compensated for this reduced dhfr activity by increasing the dhfr copy number in the genome to overcome inhibition by mtx. since the mers s - -fc fusion gene was integrated into the same genetic locus as that of the dhfr gene, the s - -fc gene was amplified, as well, leading to increased production of the protein. in the course of developing the adcho cell line expressing the s - -fc protein, optimization of a purification protocol using hitrap protein-a hp was performed. on the purification chromatogram, two protein peaks were observed in the elution step (fig. b) . denaturing sds-page and western blotting analysis with anti-mers-rbd-specific antibodies confirmed the second peak to be the s - -fc fragment (fig. c) . 
further analysis using non-denaturing sds-page and western blotting with rb-anti-bovine (fab) antibodies showed that the first peak was mainly contaminating bovine igg (fig. d-e) that originated from the fetal bovine serum (fbs) supplemented in the culture medium. we estimated relative productivity of adcho cells by measuring the ratio of the area of first peak and second peak. different signal peptides have been shown to result in different expression levels in cho cell systems [ ] . therefore, we transfected adcho cells with linearized mers s - -fc expression plasmids with four different signal peptides at the n-terminus. peptides were derived from interleukin (il ), igk light chain (igk), human serum albumin (sa), and azurocidin (azu). conditioned media from confluent monolayers of each transfected cell line were collected, followed by protein purification using protein a to establish yield and estimate relative productivity. in these studies, we found that adcho cells with the signal peptide derived from il showed - % more secretion of s - -fc than cells with other signal peptides (fig. a) . elevated expression levels were achieved by gradually increasing mtx concentration during each cell passage from nm to lm. conditioned medium from transfected adcho cells was collected during the dna amplification process, followed by protein purification using protein a to estimate relative productivity (fig. b) . we observed the expected correlation between the expression of s - -fc and increased resistance to higher levels of mtx. compared to adcho cells with nm mtx, expression of s - -fc was increased -fold in the presence of - lm of mtx (fig. c ). the purified protein was analyzed by circular dichroism (cd) spectroscopy. the mers-cov s - -fc consists mainly of beta-sheet ( . %) and loop structures ( %) with limited helices ( . %) (fig. a) . the secondary structure of the protein starts to unfold at °c. 
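The relative productivity estimate mentioned above compares the areas of the two elution peaks, with the bovine igg peak serving as an internal reference. A minimal sketch of that calculation follows. The trapezoidal integration, the fixed time windows for each peak, and the direction of the ratio (target peak over igg peak) are assumptions made for illustration; the trace values are invented.

```python
def trapezoid_area(times, signal):
    """Area under a sampled trace using the trapezoidal rule."""
    if len(times) != len(signal):
        raise ValueError("times and signal must have the same length")
    if len(times) < 2:
        raise ValueError("need at least two points to integrate")
    return sum(
        (t1 - t0) * (y0 + y1) / 2.0
        for t0, t1, y0, y1 in zip(times, times[1:], signal, signal[1:])
    )


def window(times, signal, start, end):
    """Keep only the points whose time lies within [start, end]."""
    pairs = [(t, y) for t, y in zip(times, signal) if start <= t <= end]
    return [t for t, _ in pairs], [y for _, y in pairs]


def relative_productivity(times, signal, igg_window, target_window):
    """Ratio of the target-protein peak area to the bovine igg peak area."""
    igg_area = trapezoid_area(*window(times, signal, *igg_window))
    target_area = trapezoid_area(*window(times, signal, *target_window))
    if igg_area <= 0:
        raise ValueError("reference peak area must be positive")
    return target_area / igg_area


if __name__ == "__main__":
    # Hypothetical chromatogram: first peak (igg) near t=1, second peak near t=5.
    t = [0, 1, 2, 3, 4, 5, 6]
    a280 = [0, 2, 0, 0, 0, 4, 0]
    print(relative_productivity(t, a280, (0, 2), (4, 6)))
```

Because the reference peak comes from serum added at a constant concentration, the ratio is comparable between cultures only when the same medium and load volume are used.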
during thermal melt analysis, mers s - -fc had two endothermic transitions: . °c and . °c (fig. b) . a comparison between mers s - -fc expressed from hek t cells and adcho cells was carried out using different page analyses. both proteins appeared to be identical following reduced and non-reduced sds-page. on native page and ief gels, adchoexpressed mers s - -fc had a lower isoelectric point (pi . ) when compared to the protein expressed in hek t cells (pi . ) (fig. c) . after removal of n-linked glycans, the molecular size of mers s - -fc was slightly smaller than that of the n-linked glycosylated form (fig. d) . no change in molecular weight was observed after o-linked deglycosylation and desialylation (reduced sds-page). enzymatic treatment did not affect dimerization of mers s - -fc, as determined by nonreducing sds-page. the high pi (> . ) isomers of s - exhibited a lower shift (between pi . and pi . ) in electrophoretic mobility after n-linked deglycosylation, while no change of band pattern for the protein was seen after treatment with o-glycosidase. after removal of sialic acid through neuraminidase treatment, s - -fc isomers shifted to pi higher than . (fig. d) . three assays, including co-immunoprecipitation (co-ip), flow cytometry, and elisa, were performed to detect the binding of mers-cov rbd to its receptor, hdpp . co-ip demonstrated that similar to the hek t-expressed mers s - -fc protein, the rbd protein expressed in adcho bound strongly to hdpp expressing huh- cells. two clear bands were identified from immunoprecipitated mixture of s - -fc and huh- cell lysates, and these bands were recognized by both anti-hdpp antibody and anti-mers-rbd antibody. in contrast, only one band was identified in huh- cells only and s - -fc protein only samples, and it was reactive with either anti-hdpp antibody or anti-mers-rbd antibody, but not with both antibodies. as expected, sars-cov rbd-fc protein was only recognized by sars-cov rbd-specific mab g (fig. a) . 
flow cytometry analysis further quantified the binding between mers-cov rbd protein and hdpp receptor in huh- cells. results showed a similar strong binding for all mers s - -fc proteins from adcho and hek t, but not for sars-cov rbd-fc control (fig. b) . the elisa analysis demonstrated a dose-dependent binding between these mers-cov rbd proteins and hdpp protein, while no binding was observed between hfc control and hdpp (fig. c) . the antigenicity of mers-cov rbd proteins was carried out using elisa to test their binding with rbd-specific neutralizing antibodies (mersmab , m , m , and m ), which recognize epitopes at rbd residues f , d , r , w , d , y , r , or w , and demonstrate strong activity to block rbd-hdpp receptor binding and neutralize mers-cov infection [ ] [ ] [ ] ] . similar to the s - -fc protein expressed in hek t, the rbd proteins expressed in adcho bound strongly to mouse mab mersmab and human mabs m , m , and m in a dose-dependent manner (fig. d) , confirming their antigenicity. while these mabs bound strongly to the non-denatured (no dtt) s - -fc proteins expressed in both adcho and hek t, they had significantly reduced affinity to rbd proteins treated with dtt, a reducing agent cleaving disulfide bonds of rbds and thus disrupting a protein's native conformation (fig. e) . these results demonstrate that the neutralizing mabs recognize conformational structures of mers-cov rbd [ ] . transgenic mice expressing the human cd /dpp receptor (hdpp -tg) are a well-characterized animal model with which to evaluate the efficacy of vaccine candidates against mers-cov infection and disease [ , ] . the immunogenicity of the mers s - -fc-based subunit vaccine was verified in hdpp -tg mice. immune sera obtained from immunized mice were tested by elisa for the binding with denatured and non-denatured s - -fc protein or subjected to the vero e -based micro-neutralization assay to quantify their capacity to neutralize infectious mers-cov. 
to evaluate protective efficacy against viral infection, viral loads and histopathology of the lungs of three mice in immunized and control groups were measured at day after lethal challenge with x ld of mers-cov. the remaining two mice in each group were monitored for morbidity (weight loss) and mortality to determine if this vaccine formulation would sufficiently protect against the disease and lethality caused by mers-cov infection. mers s - -fc induced high titers of rbd-specific igg antibodies, which reacted strongly with non-denatured s - -fc protein. nevertheless, these rbd-specific antibodies had significantly reduced activity with s - -fc treated by dtt (fig. ) . this suggests that the rbd vaccine-induced antibody response was indeed directed towards one or more conformational epitopes. mice immunized with pbs/addavax uniformly failed to elicit any detectable neutralizing antibodies; however, those immunized with mers s - -fc/addavax consistently produced readily detectable titers of neutralizing antibody, ranging from - and - of neutralizing titer (nt)- (nt ) and nt- (nt ), respectively ( table ) . consistent with the readily detectable neutralization antibodies, mers s - -fc/addavaximmunized mice were fully protected against viral infection and disease, as evidenced by the absence of recoverable infectious virus and negligible focal inflammatory responses, if any (supp. fig. ) , within the infected lungs at days post-infection (dpi). importantly, the remaining two immunized mice did not suffer any significant morbidity (weight loss) and survived until dpi when the experiment was terminated. in contrast, pulmonary infectious viruses, albeit low in titers, were detected from all three unimmunized controls, with an average of . ± . tcid /g of mers-cov recovered at dpi, accompanied by mild inflammatory responses. the remaining two control mice suffered profound weight loss prior to succumbing to infection by dpi. 
taken together, results of this ''proof-of-principle" pilot study indicated not only immunogenicity of mers-cov s - -fc, but also its efficacy and safety in the protection of hdpp -tg mice against lethal challenge with mers-cov. similar to the adcho cell development described earlier, the dna copy number of stably transfected suspension cho dg dhfr-cells was amplified by gradual exposure to increased mtx concentrations of up to lm. the resulting heterogeneous suscho cell pools were subsequently cloned by limited dilution to obtain a monoclonal cell population. elisa analysis was performed on the supernatants from individual clones, leading to the identification of clone b as the highest expressing clone. by comparing the growth curves of clone b to the heterogeneous (non-clonal) suscho cells, we discovered that clone b and heterogeneous suscho cells reached their maximum growth on the seventh day with viable cell counts of  cells/ml and  cells/ml, respectively (fig. ) . through quantitative analysis using sds-page, we determined that the supernatant of clone b expressed approximately mg/l of mers s - -fc, while the heterogeneous non-clonal cell pools expressed about mg/l ( table , supp. fig. ) on the seventh day. the mers-cov rbd subunit fragment s - has been identified to be a critical neutralizing receptor-binding fragment and an ideal candidate for the development of an effective mers-cov recombinant protein vaccine [ ] . our aim here was to optimize expression and purification conditions suitable for pilot scale production of this rbd vaccine candidate. initially, both escherichia coli (bacteria) and pichia pastoris (yeast) expression systems proved well suited for recombinant vaccine production because of low production costs. however, our data showed that e. coli could not produce soluble mers s - , and yeast cells could not overexpress mers s - . 
thus, these two systems were not considered suitable to advance the mers vaccine candidate into process development and scale up production (see supplementary information for more detail). in previous reports, mers fc-fused s - had been expressed in transiently transfected hek t cells. however, transiently transfected cell lines may give low protein yield and potentially lose their production ability over time in continuously expressing recombinant proteins [ , ] . unlike transient transfection, dna is integrated into cells long-term through stable transfection. despite the fact that the development of stably expressing cell lines is laborious and time-consuming, production with stable cell lines can be scaled up easily and would be suitable for use in manufacturing processes. hence, we generated a stably transfected adcho cell line by transfecting mtxdriven poptivec into cells to produce recombinant mers s - -fc protein. fc-fused gp protein was first constructed in as a potential candidate for aids therapy [ ] . while there is no fda approved fc-fusion vaccine, vaccine development using fc-fusion proteins is active and ongoing. a number of studies have been initiated on the development of vaccines against ebola, hiv, influenza, as well as tuberculosis [ ] [ ] [ ] [ ] . it is noted that adverse side effects with vaccines are likely limited as biotherapeutic fc fusions have been repeatedly shown to be safe and biocompatible in humans. currently, all commercial therapeutics use the fc domain from human igg , although other options, such as igg , iga, and igm are also currently being explored [ ] . furthermore, the fc domain is known to increase plasma half-life and simplify the purification process [ ] . purification of s - -fc was performed using a protein a sepharose column, removing most of the impurities from the culture medium. 
bovine igg originates from fbs in the culture medium in a constant amount and was used as a standard to gauge expression levels of the s - -fc protein. this estimation method allowed us to select the most suitable signal peptide and to evaluate the effect of the mtx-induced dna amplification process. since the poptivec plasmid has no signal peptide, four different signal peptides were subsequently tested at the n-terminus of the s - -fc sequence to drive secretion of the protein, leading to the identification of il signal peptide as the most suitable signal sequence. although signal peptides derived from human sa and human azu have been reported to improve production rates in other adcho systems [ ] , the yield in our hands was lower than that with the il signal peptide. stable and highly productive cell pools were then isolated through a gradual increase in the mtx concentration in the culture [ ] . analysis of the relative productivity of adcho cells indicated an increase in s - -fc in media proportional to the mtx concentration in the medium, reaching a plateau at µm mtx. the biophysical and biochemical characterization of the mers s - -fc protein revealed that it was stable up to a temperature of . °c. after the first major unfolding event at . °c, another unfolding event occurred between and . °c, which could have resulted from destabilization of the ch -ch bond of the fc domain of the recombinant protein [ ] . no change in molecular weight was observed after removal of o-linked glycan on mers s - -fc, which suggested no o-linked glycosylation in the protein. after neuraminidase treatment, the recombinant protein band pattern showed fewer bands and a pi shift, suggesting that sialic acids might contribute to the charge of the protein.
interestingly, although the complexity of the multiple protein bands was greatly reduced after glycosidase and sialidase treatments, the pattern representing charge heterogeneity remained, suggesting the existence of additional ptms, such as mannose- phosphate, of the recombinant mers s - -fc protein. due to the charge differences between adcho- and hek t-expressed s - -fc proteins, we evaluated the functionality and antigenicity of the target protein. functionality studies, including co-ip assays, flow cytometry analyses, and elisa binding assay, confirmed that the adcho-expressed mers s - -fc protein maintained functionality equal to that expressed in hek t cells in binding the dpp receptor of mers-cov, both of which showed dose-dependent binding with the soluble hdpp protein. in addition, mers s - -fc expressed in either adcho or hek t demonstrated similar dose-dependent binding to rbd-specific neutralizing antibodies, an indicator that both s - -fc proteins could maintain sufficient antigenicity. we further investigated the protective efficacy of adcho-expressed s - -fc protein vaccine in protecting against mers-cov infection in the established transgenic mouse model expressing hdpp (hdpp -tg). when this s - -fc protein was formulated with addavax, all vaccinated animals produced neutralizing antibodies and survived a live viral challenge for days. taken together, we confirmed the absence of functional, antigenic and immunogenic differences between adcho- and hek t-expressed mers s - -fc proteins. moreover, mouse vaccinations with the rbd subunit vaccines did not appear to elicit eosinophilic or antibody-dependent immune enhancement. although we verified adcho-expressed mers s - -fc protein as an effective vaccine against mers-cov infection, the use of fbs in the growth medium proved unsuitable for a human vaccine antigen [ ] .
for both safety and compliance with future regulatory requirements, we therefore developed a stably transfected suspension cho cell line in serum-free medium. the adcho cell development process described here became the foundation for the establishment of the serum-free suspension cho cell line. subsequently, we transfected the poptivec expression plasmid with the il signal peptide into suscho cells and carried out the dna amplification as before. from the heterogeneous cell pools adapted to µm mtx, we isolated clone b , which was the highest expressing clone from a two-cycle screening process. in shake flasks, the growth of clone b was slightly slower than that of the heterogeneous cell pools, but b expressed % more mers s - -fc protein than the heterogeneous cell pool. typically, highly productive cell clones have lower growth rates since a significant portion of resources is used for expression of the recombinant protein [ ] . additional experiments in the transgenic mice and non-human primate models will be needed to further determine the immunogenicity of mers s - -fc protein that is produced by suscho. we envision that with a proper production process, the recombinant protein can be scaled up, manufactured, formulated and stockpiled as an efficient countermeasure against future mers-cov outbreaks.
table: s - -fc expressed from heterogeneous non-clonal suscho cell pools and from the monoclonal clone b . the protein concentration (mg protein per liter of culture supernatant) was determined by sds-page gel analysis (supp. fig. ).
who. 
middle east respiratory syndrome coronavirus the middle east respiratory syndrome coronavirus -a continuing risk to global health security vaccine development against prioritized epidemic infectious diseases inovio reports new positive clinical data on vaccine advances in the fight against emerging infectious diseases innate immune signaling by, and genetic adjuvants for dna vaccination the future of human dna vaccines receptor recognition mechanisms of coronaviruses: a decade of structural studies middle east respiratory syndrome: current status and future prospects for vaccine development vaccines for the prevention against the threat of mers-cov yeast-expressed recombinant protein of the receptor-binding domain in sars-cov spike protein with deglycosylated forms as a sars vaccine candidate roadmap to developing a recombinant coronavirus s protein receptor-binding domain vaccine for severe acute respiratory syndrome receptor-binding domain-based subunit vaccines against mers-cov searching for an ideal vaccine candidate among different mers coronavirus receptor-binding fragments-the importance of immunofocusing in subunit vaccine design identification of an ideal adjuvant for receptor-binding domain-based subunit vaccines against middle east respiratory syndrome coronavirus recombinant receptorbinding domains of multiple middle east respiratory syndrome coronaviruses (mers-covs) induce cross-neutralizing antibodies against divergent human and camel mers-covs and antibody escape mutants receptor-binding domain of mers-cov with optimal immunogen dosage and immunization interval protects human transgenic mice from mers-cov infection a truncated receptor-binding domain of mers-cov spike protein potently inhibits mers-cov infection and induces strong neutralizing antibody responses: implication for developing therapeutics and vaccines characteristics of early-and lateonset rapid eye movement sleep behavior disorder in china: a case-control study generation of a 
transgenic mouse model of middle east respiratory syndrome coronavirus infection and disease characterization and demonstration of the value of a lethal mouse model of middle east respiratory syndrome coronavirus infection and disease evaluation of stable and highly productive gene amplified cho cell line based on the location of amplified genes a recombinant receptorbinding domain of mers-cov in trimeric form protects human dipeptidyl peptidase (hdpp ) transgenic mice from mers-cov infection junctional and allelespecific residues are critical for mers-cov neutralization by an exceptionally potent germline-like antibody exceptionally potent neutralization of middle east respiratory syndrome coronavirus by human monoclonal antibodies a conformation-dependent neutralizing monoclonal antibody specifically targeting receptor-binding domain in middle east respiratory syndrome coronavirus spike protein immunization with inactivated middle east respiratory syndrome coronavirus vaccine leads to lung immunopathology on challenge with live virus optimized signal peptides for the development of high expressing cho cell lines mers-cov spike protein: a key target for antivirals evaluation of transfection methods for transient gene expression in chinese hamster ovary cells gene expression in mammalian cells and its applications designing cd immunoadhesins for aids therapy ebola virus glycoprotein fc fusion protein confers protection against lethal challenge in vaccinated mice a neonatal fc receptortargeted mucosal vaccine strategy effectively induces hiv- antigen-specific immunity to genital infection adjuvant-free immunization with hemagglutinin-fc fusion proteins as an approach to influenza vaccines apc targeting enhances immunogenicity of a novel multistage fc-fusion tuberculosis vaccine in mice fc-fusion proteins: new developments and future perspectives stabilisation of the fc fragment of human igg by engineered intradomain disulfide bonds a plea to reduce or replace fetal 
bovine serum in cell culture media advances in mammalian cell line development technologies for recombinant protein production identification of a receptorbinding domain in the s protein of the novel human coronavirus middle east respiratory syndrome coronavirus as an essential target for vaccine development novel vectors for the expression of antibody molecules using variable regions generated by polymerase chain reaction this study was supported through the us-malaysian vaccine development program, funded by the university of malaya, and grants from the nih (r ai - s and r ai ). we thank drs. dimiter s. dimitrov and tianlei ying for providing m , m , and m mabs. the authors are involved in the development of a vaccine against mers coronavirus. supplementary data associated with this article can be found, in the online version, at https://doi.org/ . /j.vaccine. . . . key: cord- -aes l s authors: steffen, tara l.; stone, e. taylor; hassert, mariah; geerling, elizabeth; grimberg, brian t.; espino, ana m.; pantoja, petraleigh; climent, consuelo; hoft, daniel f.; george, sarah l.; sariol, carlos a.; pinto, amelia k.; brien, james d. title: the receptor binding domain of sars-cov- spike is the key target of neutralizing antibody in human polyclonal sera date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: aes l s natural infection of sars-cov- in humans leads to the development of a strong neutralizing antibody response, however the immunodominant targets of the polyclonal neutralizing antibody response are still unknown. here, we functionally define the role sars-cov- spike plays as a target of the human neutralizing antibody response. in this study, we identify the spike protein subunits that contain antigenic determinants and examine the neutralization capacity of polyclonal sera from a cohort of patients that tested qrt-pcr-positive for sars-cov- . 
using an elisa format, we assessed binding of human sera to spike subunit (s ), spike subunit (s ) and the receptor binding domain (rbd) of spike. to functionally identify the key target of neutralizing antibody, we depleted sera of subunit-specific antibodies to determine the contribution of these individual subunits to the antigen-specific neutralizing antibody response. we show that epitopes within rbd are the target of a majority of the neutralizing antibodies in the human polyclonal antibody response. these data provide critical information for vaccine development and development of sensitive and specific serological testing. severe acute respiratory syndrome coronavirus (sars-cov- ) was initially identified in patients with severe pneumonia in wuhan, china in december of .
due to its initial zoonotic transmission and human to human spread within an immunologically naïve population, it has since caused over million confirmed cases and over , deaths worldwide (who ), with approximately % of all cases occurring in the united states as of july th - . infection with sars-cov- can result in a range of states from asymptomatic to symptomatic, with symptomatic cases ranging from mild non-specific symptoms, like malaise, to severe pneumonia and multiple organ failure - , , . sars-cov- is a positive sense, single stranded, enveloped rna virus with a ~ kb genome that is virologically similar to the zoonotic beta-coronaviruses sars-cov and mers-cov. the sars-cov- genome encodes non-structural proteins and structural proteins: spike (s), nucleocapsid (n), envelope (e), and membrane (m). the coronavirus n protein functions by interacting with viral rna to form the ribonucleoprotein, while e and m function in virion assembly and budding [ ] [ ] [ ] [ ] . spike is a homotrimeric transmembrane protein composed of two subunits per monomer, s and s , which are responsible for binding the host cell receptor and viral fusion, respectively. similar to the human coronavirus nl- and sars-cov, sars-cov- spike uses human angiotensin converting enzyme (ace ) to gain entry into target cells [ ] [ ] [ ] . specifically, the s subunit of sars-cov- contains the receptor binding motif (rbm) within the receptor binding domain (rbd) that makes direct contact with the ace receptor for receptor-mediated entry [ ] [ ] [ ] . importantly for antibody structural determinants, the pre-fusion conformation of the trimeric spike has a range of states that are described as "up" or "down" based on the angle of rbd within s . for a virion to be able to interact with ace and gain entry into host cells, rbd must be in the "up" conformation, between ° and °, which represents a receptor-binding active state , .
when the interaction between rbd and ace is disrupted, the entry of sars-cov- into susceptible cells is blocked , . spike is known to be a major antibody antigenic determinant for both mers-cov and sars-cov that leads to the generation of protective immune responses including the production of highly neutralizing antibodies [ ] [ ] [ ] . targets for these antibodies within spike include both conformation dependent and linear epitopes of rbd and the s fusion peptide. these neutralizing antibodies are proposed to block rbd-ace receptor interactions or prevent s fusion with host membranes , . spike being a major antigenic determinant for the antibody response against closely related beta-coronaviruses contributes to our hypothesis that the neutralizing polyclonal antibody response to sars-cov- will target spike and its sub-domains. to determine the current antigenic variation and display that variation within the structure of sars-cov- spike we interrogated , sars-cov- genomes derived from human samples available from gisaid on june , . the spike homotrimer contains multiple subunits, including s and s , both of which contain a total of glycosylated residues which can affect spike protein folding, receptor interactions and potentially block antibody recognition and are represented as lollipops in the schematic ( figure a ). the s subunit (residues - ) of spike contains the n terminal domain (ntd), c terminal domain (ctd), the receptor binding domain (rbd, residues - ), and the receptor binding motif (rbm, residues - ). the s subunit contains the fusion peptide (fp, residues - ), and heptad repeat and (hr , residues - , hr , residues - ), the transmembrane domain and cytoplasmic tail. in our analysis of naturally occurring amino acid (aa) variation, low quality sequences determined by gaps or ambiguous nucleotides > nt were removed (- sequences). 
the , remaining sequences were translated and aligned using muscle (supplemental file ), then duplicate sequences were removed. this resulted in a multiple sequence alignment of amino acids (aa), with , unique aa sequences and an overall pairwise identity of . %. the prevalence of aa variation per site and aa conservation was determined using the sequence variation tool (viprbrc) and the consurf server , respectively. the level of aa variation was measured by calculating the aa frequency at each position within the multiple sequence alignment, then shannon entropy was used to define aa conservation using data on the potential aa present. these conservation scores were broken down into discrete color coded categories with a score of being most variable, representing > , mutations at that site, and being most highly conserved with - mutations per site. the aa conservation was then displayed in the context of the spike pre-fusion trimer (pdb: vsb) to represent exposure to the human antibody response, where chain a displays the rbd within the up position and the b and c chains display the rbd in the down position ( figure b) . the spike trimer color coded for aa variation is located next to the spike protein where the subunits are color coded with s ntd in cyan, rbd in dark green, and s in light green. from the aa sequence variation analyses, we observed the well documented g d variation, which may have a fitness advantage. we also observed an additional positions that contained a range of variation from / ( . %) to / ( . %). once the aa variation was mapped onto the trimer structure, we observed that the greatest level of aa variation is found within the s ntd ( . % identity), while the lowest level of aa variation is within the rbd ( . % identity) and s ( . %). the low level of aa variation within rbd was also recently described by starr et al . 
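as an illustration of the per-site variation measure described above, the following python sketch computes amino acid frequencies and shannon entropy for each column of a small alignment. the toy sequences, the function names `column_entropy` and `alignment_entropy`, and the choice to ignore gap characters are assumptions made for illustration only; the study itself relied on the viprbrc sequence variation tool and the consurf server, whose exact scoring is not reproduced here.

```python
import math
from collections import Counter


def column_entropy(column, gap="-"):
    """Shannon entropy (in bits) of one alignment column, ignoring gaps."""
    residues = [aa for aa in column if aa != gap]
    if not residues:
        return 0.0
    counts = Counter(residues)
    total = len(residues)
    # H = -sum(p * log2(p)) over the residues observed at this position
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def alignment_entropy(sequences):
    """Per-position entropy for a list of equal-length aligned sequences."""
    if len({len(s) for s in sequences}) != 1:
        raise ValueError("aligned sequences must all have the same length")
    # zip(*sequences) walks the alignment column by column
    return [column_entropy(col) for col in zip(*sequences)]


# Hypothetical four-sequence alignment; positions 2 and 4 carry one variant each.
toy_alignment = ["MFVD", "MFVG", "MFVG", "MLVG"]
entropies = alignment_entropy(toy_alignment)
```

a fully conserved column yields an entropy of zero, and the value rises as more residues share the position more evenly, which is why low entropy across rbd would be read as high conservation.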
our data, in addition to that of starr et al, indicate that overall the rbd and s domains are highly conserved and are currently genetically stable targets for vaccine and therapeutic intervention. we next wanted to investigate the specificity and immunodominance of the polyclonal antibody response to the subunits of spike due to its potential as a target for the development of vaccines and therapeutics. this concept is based upon vaccines developed against sars-cov and mers-cov , , and the strongly neutralizing monoclonal antibodies that have been identified as spike specific [ ] [ ] [ ] [ ] [ ] [ ] . to address specificity and immunodominance, we analyzed serum samples from laboratory confirmed sars-cov- infection cases as determined by qrt-pcr, with of the subjects being admitted to the hospital. the sars-cov- positive serum samples analyzed were obtained from patients in saint louis, mo, cleveland, oh, and san juan, pr, with no subjects succumbing to infection. the median age of the individuals is . years, with males and females. specific demographics regarding the subjects are listed in table . the cohort controls were collected prior to the emergence of sars-cov- ( - ) from previous studies conducted at saint louis, mo, cleveland, oh, and san juan, pr, and had a similar age and sex distribution. to investigate and quantify the igg response to sars-cov- , we performed elisa assays using serum from sars-cov- + subjects and sars-cov- negative control subjects. we serially diluted sera from : to : , as four-fold dilutions and evaluated binding to recombinant s , s , and rbd (figure a -c). polyclonal sera from all sars-cov- subjects showed igg binding to each spike subunit by elisa. igg reactivity to the s subunit, which contains the rbd, ranged in optical density (od) from . to . , while the control subjects had an od range from . to . at the highest concentration tested. 
four subjects were responsible for the majority of the elisa binding to s within the control group ( figure a ). the overlap of these four negative subject samples with the sars-cov- + subjects suggests that there is antibody cross-reactivity to the sars-cov- s protein, most likely due to prior human coronavirus (hcov) infections (nl , hku , oc , e). however, the focus of our study is to functionally define the key targets of the neutralizing antibody response to sars-cov- , and further studies would need to be completed to define the nature of the cross-reactive response. we next interrogated the antibody response to the s subunit, where at a : dilution the sars-cov- positive subjects had an od range of . to . , while control subjects had an od range of . to . ( figure b ). only one control subject had antibody binding that overlapped with the lower range of the sars-cov- subjects. we used an identical approach to evaluate the antibody response to rbd ( figure c ). here we saw a robust antibody response with a range in od from . to . at a serum dilution of : , and we observed no antibody binding above background in the control group. multiple groups have observed similar responses to the rbd subunit, indicating the specificity of the rbd antibody response , , which could in part be due to the low level of conservation of the rbd amino acid sequences between sars-cov- and the hcovs that cause the common cold , . to further quantify differences in binding to the individual spike subunits we calculated the area under the curve (auc) of the s , s , and rbd elisa assays for each subject ( figure d -f, table ). the auc integrates antibody binding across multiple serum dilutions, capturing a combination of avidity and specificity of the sera for each subject. when assessing binding to s , the mean auc of sars-cov- subjects was . +/- . and the mean auc of controls was . +/- . (p< . ; figure d ).
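the auc summary described above can be sketched in python with the trapezoid rule. this is a minimal sketch under stated assumptions: the starting dilution, the od readings, the function name `elisa_auc`, and the decision to integrate over log10 of the dilution factor are all hypothetical choices for illustration, and the authors report computing auc in graphpad prism rather than with this code.

```python
import math


def elisa_auc(dilution_factors, od_values):
    """Trapezoidal area under an OD versus log10(dilution factor) curve."""
    if len(dilution_factors) != len(od_values) or len(od_values) < 2:
        raise ValueError("need at least two matched dilution/OD points")
    # Sort by dilution so the x axis is monotonically increasing
    points = sorted(zip((math.log10(d) for d in dilution_factors), od_values))
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2.0
    return area


# Hypothetical four-fold dilution series (reciprocal dilutions) and OD readings.
dilutions = [100 * 4 ** i for i in range(6)]
od = [2.0, 1.6, 1.0, 0.5, 0.2, 0.1]
auc = elisa_auc(dilutions, od)
```

because every dilution contributes to the area, a serum that binds strongly only at high concentration and one that keeps binding as it is diluted receive different scores even when their peak od is the same.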
upon comparing the auc for antibody binding to s , we observed a mean auc of . +/- . and . +/- . (p< . ) for sars-cov- + patients and controls, respectively ( figure e ). interestingly, when we assessed binding to rbd, we observed a mean auc of . +/- . and . +/- . (p< . ), showing minimal cross-reactivity from the negative subjects ( figure f ). the rbd binding data match recently described results , , which show that the antibody response to rbd is specific to sars-cov- infection, with no known cross-reactivity from antibodies derived from endemic hcov infection. overall, we show that s , s , and rbd from spike are targeted by the human polyclonal response in all individuals from our cohort. additionally, we observe potential cross-reactivity within the control group to the s domain outside of the rbd. this cross-reactivity is important to note for serological and vaccine evaluation, as using rbd as a target antigen may provide the most specific and sensitive test, with fewer false-positives. interestingly, this also highlights the potential for cross-reactive s antibodies to play a role in either protection or exacerbation of sars-cov- disease. it has been recently demonstrated that human mabs generated after sars-cov infection cross-react with and neutralize sars-cov- ( ), while sars-cov infection generates a polyclonal antibody response that is able to bind spike from sars-cov- but is not able to neutralize the virus . furthermore, mechanisms aside from neutralization that are dependent on the fc region of the antibody are capable of limiting viral infection , , . as there was a broad dynamic range of antibody binding to spike subunits from our sars-cov- + subjects, we stratified the samples based upon days post qrt-pcr positive test, because days post symptom onset was unavailable, to evaluate the potential role of time post infection on antibody binding and specificity.
samples were stratified into three groups: prior to day , day - , and day - post qrt-pcr + test. we compared the auc values from the elisa binding curves to s , s , and rbd over this time period and did not observe a role for time post infection on antibody binding and specificity with our limited sample set (supplemental figure a-c). the heterogeneity of antibody binding has been observed in other patient cohorts , , . additionally, we quantified the relative binding of the polyclonal antibody response between different spike subunits to determine if the subunits were equally targeted by the antibody response. to this end, we evaluated the correlation of auc between the s , s and rbd subunits (figure g-i). when we compared the auc of s and s we observed significant correlation (p= . , r= . ) ( figure g ). as expected, based on the location of rbd within s , s auc significantly correlated with rbd auc (p= . , r= . ), which may suggest epitopes for binding within s as well as rbd ( figure h ). based on the location of rbd within s we would anticipate correlation between their auc values ( figure i ), and indeed there is a significant correlation (p= . , r= . ) that would suggest that either the majority of binding to s occurs within rbd, or that there are antibody epitopes throughout s that drive a robust antibody response. the elisa antibody binding results indicate that all sars-cov- + patients within our cohort had antibodies which bound to each subdomain of the spike protein. antibody neutralization is one mechanism of protection from severe viral disease. the mechanism of action of neutralizing antibodies often include the targeting of viral proteins that interact with the host receptor for entry or viral proteins required for fusion with host cell membranes (reviewed in ). for sars-cov- , the multifunctional spike protein is required for entry and fusion. 
specifically, the s domain contains rbd, which is responsible for binding the human ace protein mediating entry , , while s contains the fusion peptide . it has been shown by other groups that monoclonal antibodies targeting spike can block infection with sars-cov- and that natural infection of humans often produces neutralizing antibodies - , , , , which are thought to prevent subsequent covid- . however, the specificity of human polyclonal neutralizing antibodies against infectious sars-cov- is only now beginning to be understood. to begin to understand the human polyclonal neutralizing antibody response we utilized a focus reduction neutralization test (frnt) ( figure a ) based upon the assay we had developed for multiple emerging infectious diseases [ ] [ ] [ ] and for sars-cov- ( figure a ). there are multiple advantages to the frnt assay over pseudotype-virus assays and plaque assays, including the use of infectious virus that may better reflect heterogeneity in the conformational structure of the virion, quantitative measurement of the reduction of viral replication and spread, as each focus diameter measured represents multiple cells, and finally the use of well plates, allowing for titers to be quantified using multiple technical and biological replicates. overall, this assay allows for a rigorous and quantitative determination of antibody neutralization potential. using the frnt assay, we determined the concentration of patient sera required to neutralize sars-cov- infection. based upon the antibody neutralization curve ( figure b ), the serum dilution necessary to neutralize % of the virus (frnt ) ranged from / to / with a mean of / ( figure c ). the serum dilution necessary to neutralize % of the virus (frnt ) ranged from / - / with a mean of / ( figure c) .
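to make the titer readout above concrete, the python sketch below estimates the reciprocal serum dilution at which a neutralization curve crosses a chosen cutoff. everything in it is an assumption for illustration: the dilution series and percent neutralization values are invented, the cutoffs of 50 and 90 percent are example settings, and log-linear interpolation between bracketing dilutions stands in for whatever curve fitting the authors used, which the text does not specify.

```python
import math


def frnt_titer(dilution_factors, percent_neutralization, cutoff=50.0):
    """Reciprocal dilution where neutralization falls to `cutoff` percent.

    Interpolates linearly on a log10(dilution) axis between the two
    dilutions that bracket the cutoff. Returns None if the curve never
    crosses the cutoff within the tested range.
    """
    pairs = sorted(zip(dilution_factors, percent_neutralization))
    for (d0, n0), (d1, n1) in zip(pairs, pairs[1:]):
        if n0 >= cutoff >= n1 and n0 != n1:
            frac = (n0 - cutoff) / (n0 - n1)
            log_d = math.log10(d0) + frac * (math.log10(d1) - math.log10(d0))
            return 10 ** log_d
    return None


# Hypothetical serum: reciprocal dilutions and percent reduction in foci.
dilutions = [20, 80, 320, 1280, 5120]
neutralization = [98.0, 90.0, 60.0, 40.0, 5.0]
frnt50 = frnt_titer(dilutions, neutralization, cutoff=50.0)
frnt90 = frnt_titer(dilutions, neutralization, cutoff=90.0)
```

a higher reciprocal dilution means the serum can be diluted further and still reach the cutoff, so the stricter cutoff always gives the lower titer for the same serum.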
notably, the sera from sars-cov- patients in our cohort were capable of neutralizing infectious virus independent of day post positive test ( figure b -c, table ), whereas sera from the majority of control subjects had no demonstrated antibody neutralization. one control subject, whose serum was cross-reactive in the s /s elisa binding assay, demonstrated % sars-cov- neutralization potential at a / dilution, but further investigation of cross-neutralization is beyond the scope of this current study. based upon the ability of the sars-cov- subjects to neutralize at least % of the virus, we show that the polyclonal antibody response has the breadth and specificity to completely neutralize sars-cov- infection. this would suggest that natural infection would be capable of controlling viral infection and limiting the potential of disease and transmission at the timepoints we assessed ( figure d ). in animal model studies, experiments in hamsters have demonstrated that immune sera can protect from challenge , although currently the mechanisms of that protection are unknown. to functionally determine which of the spike subunits are the main target of neutralizing antibodies, we performed a functional assay developed by the de silva lab for use in flaviviruses . in this approach individual spike subdomains are linked to beads and are used to deplete sera in an antigen-specific manner. in our studies his-tagged proteins are conjugated to cobalt coated magnetic beads and serum from sars-cov- subjects is incubated with the conjugated proteins. this allows a complex of antibody:antigen:bead to form and be pulled down by a magnet, leaving the serum depleted of that particular antibody specificity ( figure a ).
to understand the contribution of each individual subunit, antibodies specific to each spike subunit, s , s , and rbd, were depleted from human polyclonal sera, and the antibody binding and neutralization potential of polyclonal sera after depletion were determined by elisa and frnt, respectively. using the bead-based approach, sera from patients were depleted for s , s , and rbd individually by sequentially incubating serum two times with protein coated beads. to quantify the effects of the antigen-specific antibody depletions, the auc from elisa binding curves pre and post depletion (supplemental figure , table ) were quantified, and the values were paired per subject ( figure b ). after antigen specific depletions we observed a significant reduction in spike subunit antibodies, represented by a . (p= . ), . (p< . ) and . (p< . ) fold reduction in auc binding to s , s , and rbd, respectively ( figure b ). moreover, to confirm that the depletion protocol did not impact sars-cov- neutralization we performed depletions with an irrelevant protein, vacv a r ( figure b ). the subunit depletion protocol significantly reduced the level of subunit specific antibody, which allowed us to evaluate the contribution of each individual subunit to the neutralizing antibody response. to measure the functional effect of s , s , and rbd antibody depletion on virus specific neutralization we evaluated post-depletion neutralization activity by frnt (supplemental figure ). to confirm that the depletion protocol itself had no off-target effects on sars-cov- neutralization, a control depletion with vacv a r was completed and neutralization pre and post depletion was measured. the control depletion had a minimal effect on the ability of the polyclonal sera to neutralize sars-cov- ( figure c : frnt : . fold decrease; frnt : . fold decrease).
we then measured the antibody neutralization curves after depleting serum with s , rbd or s , and determined the serum dilution required to reduce infection by % (frnt ) and % (frnt ) ( table ). to take into account the effects of the antibody depletion protocol, we compared the frnt of the control depleted serum with the subunit depleted serum, and observed a . , . , and . fold reduction after s , rbd, and s depletion, respectively ( figure c ). based upon the frnt and frnt values, the depletion of s and rbd significantly reduced virus neutralization (p= . and p= . ). this suggests that polyclonal antibody binding to the rbd of the spike protein represents the key target of neutralizing antibody to sars-cov- after natural infection. since we observed a similar fold reduction after s and rbd depletion, it is likely that the majority of the neutralizing response is found within the rbd of s . however, this is the average neutralizing antibody response across our cohort. when we evaluate changes in individuals, there are two patients that have a strong rbd neutralizing response but also have an s specific neutralizing antibody response, with . and . fold change after s depletion. overall, these data demonstrate that natural sars-cov- infection generates a robust anti-rbd polyclonal neutralizing antibody response, with some individuals mounting a neutralizing antibody response to s . we conclude that the polyclonal neutralizing antibody response to sars-cov- primarily targets receptor interactions (s /rbd) in the majority of individuals. to compare the relative neutralizing differences between spike domains, we normalized the data based upon frnt values and represented the data as % subunit specific neutralizing antibodies.
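the comparison of control-depleted and subunit-depleted titers described above can be sketched as follows. this is a hedged illustration and not the authors' stated calculation: the titers are invented, the function names are hypothetical, and the percentage formula (the share of the control-depleted titer lost after subunit depletion) is an assumed normalization, since the text does not give the exact equation.

```python
def fold_reduction(control_titer, depleted_titer):
    """Fold drop in titer relative to the control-depleted serum."""
    if control_titer <= 0 or depleted_titer <= 0:
        raise ValueError("titers must be positive reciprocal dilutions")
    return control_titer / depleted_titer


def percent_subunit_specific(control_titer, depleted_titer):
    """Assumed normalization: percent of neutralizing titer lost on depletion.

    Clipped to the 0-100 range so that assay noise cannot yield a
    negative or greater-than-total contribution.
    """
    if control_titer <= 0 or depleted_titer <= 0:
        raise ValueError("titers must be positive reciprocal dilutions")
    lost = 1.0 - depleted_titer / control_titer
    return max(0.0, min(100.0, lost * 100.0))


# Hypothetical titers for one subject after each depletion.
titers = {"control": 640.0, "s1": 160.0, "rbd": 128.0, "s2": 512.0}
summary = {
    name: (
        fold_reduction(titers["control"], titers[name]),
        percent_subunit_specific(titers["control"], titers[name]),
    )
    for name in ("s1", "rbd", "s2")
}
```

using the control-depleted serum as the denominator, rather than untreated serum, is what accounts for any nonspecific loss caused by the bead protocol itself.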
this normalization allows us to calculate the percentage of neutralizing antibodies that bind to s , s , or rbd, while taking into account the impact of the depletion protocol, based on our control and subunit specific depletions ( figure d ). further confirming the paired frnt data, the highest percentages of neutralizing antibodies, % +/- and % +/- %, indeed bind to rbd and s , suggesting that preventing virus interaction with its receptor may be the dominant mechanism for antibody neutralization of sars-cov- after natural infection. additionally, s has the lowest percentage ( % +/- %) of binding antibodies capable of neutralization, suggesting that viral fusion with host membranes is not a dominant target of the neutralizing antibody response to sars-cov- after natural infection in our cohort of patients ( figure d ). these data have been further represented as % binding neutralizing antibodies based on the pre depletion frnt values (supplemental figure d) . overall, these data further confirm that a majority of neutralizing antibodies are targeted against the rbd within s . in this study we examined the antigenic targets of the sars-cov- igg neutralizing antibody response that develops during natural infection. we quantified the immunodominance of anti-spike subdomain antibodies for binding by elisa and for neutralization activity by antigen specific depletion followed by a sars-cov- neutralization assay. to define the specificity of the antibody response during natural infection, we needed to understand the amino acid variation present in the currently circulating sars-cov- human isolates. human sars-cov- isolates have a low frequency of amino acid variation within the spike protein, with the exception of the d g mutation, allowing us to estimate that the majority of known isolates permit effective polyclonal antibody binding and neutralization. the human polyclonal antibody response recognizes three subdomains (s , s , and rbd) of the spike protein as evidenced by elisa.
interestingly, we identified cross-reactive sera from sars-cov- naïve subjects to s , suggesting that conserved sequences in the s subunit of spike may impact non-neutralizing responses to sars-cov- as well as serological tests for sars-cov- . most importantly, our antigen-specific antibody depletion approach demonstrated that the rbd of the spike protein is responsible for % +/- . % of the human polyclonal neutralizing antibody activity to spike after natural sars-cov- infection. although our study shows that the dominant target of the igg neutralizing antibody response after natural sars-cov- infection is the rbd of the spike protein, we have evaluated a limited number (n= ) of patients by antigen-specific antibody depletion. there is the potential that immunodominance of the neutralizing antibody response may vary based upon a number of variables including viral load, age, co-morbidities such as obesity, and genetic background. additionally, we have only focused on the igg response, and it has been recently determined that the iga antibody response can neutralize sars-cov- virus and that the antigen specificity of that response could be different from that of the igg response . importantly, it has also been recently described that more than % of individuals who seroconvert generate detectable neutralizing antibody responses and that these igg responses are indeed sustained for up to three months , , which has the potential to protect against re-infection. it will be important to begin to evaluate the correlates of protection beyond antibody neutralization, and to investigate additional antibody mechanisms such as antibody dependent cellular cytotoxicity. as we detected antibodies targeted against s that are non-neutralizing, these could provide a different mechanism of protection that may be valuable when considering vaccine design.
there is also a strong t cell response established during natural infection [ ] [ ] [ ] [ ] , as well as a cross-reactive t cell response potentially from prior hcov infection , . currently the role of the human t cell response to sars-cov- has only begun to be dissected. overall our study describes the polyclonal igg response to sars-cov- from sera obtained from patients in a range of - days post positive qrt-pcr test. we focused on the relationship between antibody binding to the subdomains of spike and the neutralization capacity against infectious virus. we demonstrate that infection with sars-cov- elicits an antibody response with a similar amount of igg targeting spike subunits s , s , and rbd regardless of time post infection (supplemental figure ) . furthermore, we show that this response results in a neutralizing antibody response by days post positive qrt-pcr, as determined by frnt ( figure b) . finally, using a bead-based immune depletion approach, we show that the highest percentage of neutralizing antibodies against sars-cov- bind to the receptor binding domain (rbd) ( figure d ) that directly interacts with human ace . these findings are important for the further development and prioritization of therapeutics and vaccines.
plates were coated with ul of a ug/ml mixture of recombinant protein in carbonate buffer ( . m na co . m nahco ph . ) overnight at °c. the next day plates were blocked with blocking buffer (pbs + %bsa + . % tween) for hours at room temperature and washed x with wash buffer prior to plating of serially diluted polyclonal sera. sera were incubated for hour at room temperature in the elisa plate and washed x with wash buffer, followed by addition of goat-anti-human igg hrp (sigma) conjugated secondary ( : ) for hour at room temperature. the plate was washed again x with wash buffer and the elisa was visualized with ul of tmb enhanced substrate (neogen diagnostics) and placed in a dark space for minutes.
the reaction was quenched with n hcl and the plate was read at an optical density of nanometers on a biotek epoch plate reader. total peak area under the curve (auc) was calculated using graphpad prism .
antigen specific antibody depletions. antigen specific antibodies were depleted in a bead-based approach using ni-nta magnetic beads (thermo scientific) as described ( ). sars-cov his tagged proteins or vacv his tagged protein (control depletion) were conjugated to the his-specific magnetic beads as suggested by the manufacturer's protocol. briefly, mg of beads were washed with equilibration buffer followed by addition of ug of protein diluted in equilibration buffer. after addition of protein, the tube was rotated end over end for hour at °c. the beads were collected on a magnetic stand and washed twice with wash buffer followed by separation into two tubes of µl each. next, the human sera were diluted in tissue culture sterile pbs and placed into the first tube of beads and incubated end over end at °c for hour. once again, the beads were collected with a magnetic stand, and the supernatant was removed and transferred into the second tube for another end-over-end incubation at °c for hour. after incubation the beads were collected, and the supernatant was removed and placed at °c for subsequent elisas and frnts.
(a) schematic of the full-length sars-cov- spike protein with s and s highlighted. s is divided into the n-terminal domain and the c-terminal domain, which contains the receptor binding domain (rbd) subunit in dark green with the receptor binding motif displayed using black hashed lines. the separation between s and s is represented by a slash line. s contains the fusion peptide (fp) and heptad repeat one and two (hr and hr ) . the spike protein is shown with the surface reconstruction colored according to discrete groups, ranging from a score of , highly conserved ( - mutation per position), to a score of , highly diverse (> mutations per position).
the color coded bar describes the corresponding color for each range of mutations. next to each aa variation coded structure is the cryoem trimer structure with the individual trimers color coded to allow orientation. the forward facing trimer for the rbd up is color coded by subdomain, with rbd up being dark cyan, s as cyan, and s as pale green. the rbd down trimer is color coded with rbd down as brown, s as gold and s as pale yellow. we observed naturally occurring aa variations are less within the rbd as noted by the high level of purple colored ca residues, and greatest aa variations within the s -ntd as indicated by the white and green color residues. the epidemiology and clinical information about covid- clinical characteristics of coronavirus disease in china clinical features of patients infected with novel coronavirus in wuhan the epidemiology and pathogenesis of coronavirus disease (covid- ) outbreak clinical findings in a group of patients infected with the novel coronavirus (sars-cov- ) outside of wuhan, china: retrospective case series epidemiological and clinical characteristics of cases of novel coronavirus pneumonia in wuhan, china: a descriptive study coronavirus disease (covid- ): epidemiology, pathogenesis, diagnosis, and therapeutics structural basis of receptor recognition by sars-cov- structure, function, and antigenicity of the sars-cov- spike glycoprotein cryo-em structure of the -ncov spike in the prefusion conformation characterization of spike glycoprotein of sars-cov- on virus entry and its immune cross-reactivity with sars-cov cryo-em structure of the sars coronavirus spike glycoprotein in complex with its host cell receptor ace inhibition of sars-cov- infections in engineered human tissues using clinical-grade soluble human ace structures of human antibodies bound to sars-cov- spike reveal common epitopes and recurrent features of antibodies humoral immunogenicity and efficacy of a single dose of chadox mers vaccine candidate in 
dromedary camels neutralizing antibody and protective immunity to sars coronavirus infection of mice induced by a soluble recombinant polypeptide containing an n-terminal segment of the spike glycoprotein a dna vaccine induces sars coronavirus neutralization and protective immunity in mice vaccine efficacy in senescent mice challenged with recombinant sars-cov bearing epidemic and zoonotic spike variants receptor-binding domain of severe acute respiratory syndrome coronavirus spike protein contains multiple conformation-dependent epitopes that induce highly potent neutralizing antibodies structural definition of a neutralization epitope on the n-terminal domain of mers-cov spike glycoprotein vipr: an open bioinformatics database and analysis resource for virology research consurf : an improved methodology to estimate and visualize evolutionary conservation in macromolecules cryo-em structure of the -ncov spike in the prefusion conformation deep mutational scanning of sars-cov- receptor binding domain reveals constraints on folding and ace binding. biorxiv immunogenicity and structures of a rationally designed prefusion mers-cov spike antigen stabilized coronavirus spikes are resistant to conformational changes induced by receptor recognition or proteolysis epidemiology and outcomes of acute decompensated heart failure in children genomic and serologic characterization of enterovirus a brainstem encephalitis potent neutralizing monoclonal antibodies directed to multiple epitopes on the sars-cov- spike. biorxiv convergent antibody responses to sars-cov- in convalescent individuals isolation of potent sars-cov- neutralizing antibodies and protection from disease in a small animal model potently neutralizing human antibodies that block sars-cov- receptor binding and protect animals. 
biorxiv a serological assay to detect sars-cov- seroconversion in humans the receptor binding domain of the viral spike protein is an immunodominant and highly specific target of antibodies in sars-cov- patients cross-neutralization of sars-cov- by a human monoclonal sars-cov antibody cross-reactive antibody response between sars-cov- and sars-cov infections. biorxiv non-neutralizing antibodies from a marburg infection survivor mediate protection by fc-effector functions and by enhancing efficacy of other antibodies antibody-dependent cellular phagocytosis in antiviral immune responses sars-cov- infection induces robust, neutralizing antibody responses that are stable for at least three months. medrxiv antibody responses to viral infections: a structural perspective across three different enveloped viruses sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor structure, function, and antigenicity of the sars-cov- spike glycoprotein inhibition of sars-cov- (previously -ncov) infection by a highly potent pan-coronavirus fusion inhibitor targeting its spike protein that harbors a high capacity to mediate membrane fusion rapid isolation and profiling of a diverse panel of human monoclonal antibodies targeting the sars-cov- spike protein. biorxiv a potently neutralizing antibody protects mice against sars-cov- infection propagation, quantification, detection, and storage of west nile virus. current protocols in microbiology isolation and quantification of zika virus from multiple organs in a mouse heterotypic immunity against vaccinia virus in an hla-b* : transgenic mousepox infection model characterization of cells susceptible to sars-cov- and methods for detection of neutralizing antibody by focus forming assay. 
biorxiv syrian hamsters as a small animal model for sars-cov- infection and countermeasure development dissecting the human serum antibody response to secondary dengue virus infections the rbd of the spike protein of sars-group coronaviruses is a highly specific target of sars-cov- antibodies but not other pathogenic human and potent neutralizing antibodies from covid- patients define multiple targets of vulnerability iga dominates the early neutralizing antibody response to sars-cov- . medrxiv evidence for sustained mucosal and systemic antibody responses to sars-cov- antigens in covid- patients. medrxiv sars-cov- -specific t cell immunity in cases of covid- and sars, and uninfected controls phenotype and kinetics of sars-cov- -specific t cells in covid- patients with acute respiratory distress syndrome targets of t cell responses to sars-cov- coronavirus in humans with covid- disease and unexposed individuals clinical characteristics and outcomes for , patients with sars-cov- infection. medrxiv selective and cross-reactive sars-cov- t cell epitopes in unexposed humans sars-cov- -reactive t cells in healthy donors and patients with covid- oligomeric state of the zikv e protein defines protective immune responses key: cord- -fspmgg s authors: sehailia, moussa; chemat, smain title: antimalarial-agent artemisinin and derivatives portray more potent binding to lys and lys -binding hotspots of sars-cov- spike protein than hydroxychloroquine: potential repurposing of artenimol for covid- date: - - journal: journal of biomolecular structure & dynamics doi: . / . . sha: doc_id: cord_uid: fspmgg s medicinal herbs have proved along history to be a source of multiple cures. in this paper, we demonstrate how hydroxychloroquine can act as a good inhibitor of sars-cov- spike protein receptor-binding-domain using molecular docking studies. 
we also show how hydroxychloroquine can prevent lys in hace from interacting with the corresponding binding hotspot present on the spike protein. further screening of artemisinin & derived compounds produced better vina docking scores than hydroxychloroquine (- . kcal mol(− ) for artelinic acid vs. − . kcal mol(− ) for hydroxychloroquine). artesunate, artemisinin and artenimol showed two modes of interaction with the lys and lys binding hotspots of the spike protein. molecular dynamics analysis confirmed that the formed complexes are able to interact and remain stable in the active site of their respective targets. given that these molecules are effective antivirals with excellent safety track records in humans against various ailments, we recommend their potential repurposing for the treatment of sars-cov- patients after successful clinical studies. in addition, an extraction protocol for artemisinin from artemisia annua l. is proposed in order to cope with the potential urgent global demand. communicated by ramaswamy h. sarma the spread of the covid- pandemic has triggered a race to unveil the secrets of severe sars-cov- and its underlying acute respiratory syndrome. although different targets, such as the chymotrypsin-like protease ( clpro), also known as mpro (khan et al., ; muralidharan et al., ), or the envelope protein (e) (gupta et al., ; boopathi et al., ), have been functionally linked to the replication and maturation of coronaviruses, it has been confirmed that the binding of the viral trimeric surface spike glycoprotein (sprotein) of sars-cov- to the human receptor angiotensin-converting enzyme (hace ) is the first step in host infection . in fact, vankadari and wilce ( ) revealed that the s domain of the spike protein has an open conformation, enabling it to interact with target host proteins. yan et al. 
( ) identified the structure of hace as a dimer in complex with a membrane protein; they also showed that the two trimeric sproteins of the receptor binding domain (rbd) of sars-cov- bind very tightly to the hace dimer. the latter is activated by a specific cellular enzyme called furin (hasan et al., ) . medicinal plants represent an important source of complex active molecules that have been proposed, and sometimes traditionally used, for the treatment of several pathologies, such as cancer, autoimmune diseases, and infectious diseases. the scientific community is relying on the different modes and mechanisms of action of these molecules to halt sars-cov- severity. recently, remdesivir and chloroquine (cq) have been shown to inhibit the emerged novel coronavirus ( -ncov) in vitro in vero e cells . chloroquine has been envisaged for sars-cov- (de clercq, ) , and has long been used for malaria therapy, but it has been replaced with hydroxychloroquine (hcq) due to increased plasmodium falciparum parasite resistance, whereas an overdose of cq can cause acute poisoning and death (weniger, ) . the two drugs have shown similar activities in in-vivo assays against the malaria parasite, but accompanied by an increased risk of retinopathy (schrezenmeier & dörner, ) . nonetheless, hcq is considered safer than cq, as the latter mediates cardiotoxic effects, including rhythm disorders, and promotes the development of cardiomyopathy in patients with rheumatic diseases (schrezenmeier & dörner, ) . in turn, hcq has been found to be effective in inhibiting sars-cov- infection in vitro and in attenuating the inflammatory response . organic extracts of artemisia annua l. have been found to be more effective, faster, and less toxic than cq and hcq in treating malaria. a. 
annua contains a vital compound known as artemisinin, a sesquiterpene lactone with a peroxide linkage exhibiting low toxicity (table ) , which is also the parent compound for semisynthetic derivatives chemically modified at the c- position to produce artesunate, artemether, arteether, artenimol (dihydroartemisinin), and artelinic acid (table ) . these compounds, and in some cases their sodium salts, have been formulated as antimalarials for oral, rectal, and parenteral administration (woodrow et al., ) . several reports have shown the efficacy of artemisinin derivatives as potent antivirals against hpv (for the treatment of anal and cervical intraepithelial high-grade neoplasia), bovine viral diarrhea virus (bvdv), human herpes virus- (hhv- ), human immunodeficiency virus (hiv) and, more particularly for artesunate, human cytomegalovirus (hcmv) (d'alessandro et al., ) . this antiviral potency makes the artemisinin class of compounds promising candidates for the treatment of patients suffering from sars-cov- . the encouraging results generated from the use of hcq to treat patients suffering from covid- further raise many questions surrounding its mode of action (million et al., ) . at the cellular level, direct and indirect mechanisms of cq and hcq are believed to inhibit immune activation by reducing toll-like receptor signaling and cytokine production and, in t cells, reducing cd expression (schrezenmeier & dörner, ) ; however, the absence of binding assay studies between the sprotein and hace protein in the presence of hcq opens the door to two main possibilities (vincent et al., ) : the first possibility is that hcq prevents terminal glycosylation of the hace protein, which consequently impacts the final attachment between the sprotein and hace protein, whereas the second possibility is that hcq interacts with the receptor binding domain (rbd) of the sprotein, thus preventing its docking on the hace receptor. 
to further expand on the second possibility, we elected to perform computational studies based on molecular docking to help us understand more about the mode of interaction between hcq and the rbd of sars-cov- sprotein, and eventually, how such interaction prevents the sprotein from docking on the hace . another class of antimalarial and antiviral molecules, comprising artemisinin and artemisinin-derived compounds, is investigated to reveal how effectively these molecules interact with the binding sites of the sprotein and, ultimately, how their mode of interaction occurs. this study also aims to propose an extraction protocol for artemisinin from artemisia annua l. in order to cope with the potential urgent global demand. the pdb file of sars-cov- sprotein rbd-hace complex (pdb ref. lzg, version . ) was obtained from the research collaboratory for structural bioinformatics (rcsb) protein data bank (pdb) (http://www.rcsb.org/structure/ lzg). ucsf chimera . was used to visualise the structure of the ligand and/or protein-complex structure, to perform the various functions associated with ligand and protein preparations, and to act as an interface to enable molecular docking calculations using locally hosted autodock vina software (pettersen et al., ; trott & olson, ) . prior to molecular docking, the hace protein (part a, protein section coloured in green- figure ) was deleted from the pdb file of the complex. in addition, all non-standard residues, including water, were also removed. the structure of each ligand was incorporated into ucsf chimera using its smiles string, followed by structure minimisation. the pdbqt files of the sprotein rbd and each ligand were generated after adding all hydrogens and charges to each structure. the number of binding modes was set to with exhaustiveness of search set to . the maximum energy difference was set to kcal mol− . the best scoring pose for each molecule was analysed in terms of its interaction with the receptor binding motif (rbm). 
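the docking settings listed in this section (number of binding modes, exhaustiveness, maximum energy difference and a grid box over the rbm) map onto the keys of an autodock vina configuration file. the sketch below is only an illustration of how such a file could be assembled: the function name, the file names and every numeric value are hypothetical placeholders, because the values used in the study are not recoverable from this text, and the authors ran vina through the ucsf chimera interface rather than through a hand-written file.

```python
from typing import Tuple

Vec3 = Tuple[float, float, float]


def build_vina_config(
    receptor: str,
    ligand: str,
    center: Vec3,
    size: Vec3,
    num_modes: int,
    exhaustiveness: int,
    energy_range: float,
) -> str:
    """Return the text of an AutoDock Vina configuration file.

    All arguments are supplied by the caller; nothing here reproduces
    the (unrecoverable) values of the original study.
    """
    lines = [f"receptor = {receptor}", f"ligand = {ligand}"]
    # Grid box: centre and edge lengths along x, y, z (in angstroms).
    for axis, value in zip("xyz", center):
        lines.append(f"center_{axis} = {value}")
    for axis, value in zip("xyz", size):
        lines.append(f"size_{axis} = {value}")
    # Search settings described in the text.
    lines.append(f"num_modes = {num_modes}")            # number of binding modes
    lines.append(f"exhaustiveness = {exhaustiveness}")  # exhaustiveness of search
    lines.append(f"energy_range = {energy_range}")      # max energy difference, kcal/mol
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Placeholder file names and numbers, for illustration only.
    print(build_vina_config(
        receptor="rbd.pdbqt",
        ligand="ligand.pdbqt",
        center=(0.0, 0.0, 0.0),
        size=(20.0, 20.0, 20.0),
        num_modes=9,
        exhaustiveness=8,
        energy_range=3.0,
    ))
```

keeping the grid box and search settings as explicit arguments makes it easy to reuse the same box for every ligand, which is what a like-for-like comparison of docking scores requires.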
the x, y, z coordinates of the grid box covered the full area of the receptor binding motif of the sprotein, in our case with center (x = − . , y = . , z = − . ) and size (x = . , y = . , z = . ). in the case of the receptor, all hydrogen atoms were added to the structure, charges were merged and non-polar hydrogens were removed; water molecules and side chains of non-standard residues were also ignored. for the studied ligands, charges were merged and non-polar hydrogens were also removed. the obtained molecular docking results were then aligned with the pdbqt file of the sprotein rbd-hace complex in order to visualise the type of interactions of each docked molecule in the sprotein-hace binding interface. computations were performed at the al-farabi cluster computer of the ecole nationale polytechnique oran -maurice audin (algeria). all molecular dynamics (md) studies were performed using groningen machine for chemical simulations (gromacs), a software package designed to perform md simulations of proteins, lipids and nucleic acids. the amber force field was utilised during the parameterisation of each protein complex system, followed by solvation with tip p water molecules; the water.tip p force field in a cubic periodic box with Å spacing protein-box edge was applied. na+ ions were introduced to neutralise the overall charge. further energy minimisation was performed using steepest descent and conjugate gradient algorithms. the system was subjected to ns md at k and a pressure of bar; the latter was maintained using the berendsen barostat. the generated trajectory files were analysed using visual molecular dynamics (vmd) software. most explanations associated with hcq mode of action are based on findings revolving around the mode of action of cq on sars-cov infection (simmons et al., ; yang et al., ) . 
amongst the cited reasons are: (i) cq can increase the endosomal ph, which can reduce the transduction of sars-cov pseudo-type viruses (simmons et al., ; yang et al., ) , (ii) cq may affect endosome-mediated fusion if added at the initial stage of the infection (vincent et al., ) , and (iii) once the virus is inside the cell, cq can inhibit the production of glycoproteins in vesicular stomatitis virus, thus preventing virus replication (dille & johnson, ) . previously, vincent et al. ( ) showed that introduction of cq prevents terminal glycosylation of the ace receptor protein of the host cell, thus destabilizing the recognition mode of the sprotein on the surface of the virus, although this did not affect the level of expression of surface hace proteins of the host cell. therefore, we elected to perform computational docking studies of hcq, as a safe cq substitute, against the sprotein rbd of sars-cov- to further study the nature of such interaction and explore its inhibition potential against the sprotein. our molecular docking studies of hcq on the rbd of sars-cov- sprotein produced a vina score of − . kcal mol− (table ). the obtained score is relatively moderate given that the vina score of the best molecule, i.e. physalin b, previously docked on a homology model of sars-cov- sprotein rbd was − . kcal mol− (micholas & jeremy, ) . analysis of our docking results showed that the hydroxyl group (oh) of the hcq molecule formed strong hydrogen bonding with the asn side chain residing on one part of the receptor binding motif (rbm) of the sprotein (figure ). in the sprotein-hace complex, asn of sars-cov- sprotein forms favourable hydrogen bonding with tyr of hace while at the same time helping to neutralise the charge of lys (lan et al., ) . therefore, the resulting hydrogen bonding between the oh group of hcq and asn can play a role in destabilising other interactions with hace residues, e.g. 
tyr , lys , gly and asp , which were already found to play key roles in the interaction with the sprotein. furthermore, when our initial docking results were aligned with the structure of the sprotein-hace complex, we observed a significant clash between the aminoalkyl side-chain of hcq and the lys residue side-chain of hace ( figure ). equally, lys (o) was found to form one hydrogen bond with gln of the sprotein in the complex as reported by lan et al. ( ) ; at the same time, lys (n) forms important hydrogen bonding with gln while maintaining a salt bridge with asp (lan et al., ; yan et al., ) . in line with the findings of lan et al. ( ), lys and lys were both found to be important hotspots in hace responsible for binding to the sprotein of sars-cov- . it is believed that the latter species developed key mutations to help stabilise and/or neutralise both lysine hotspots via introduction of asn (to neutralise lys ) and gln & leu (to neutralise lys ) of hace protein, thus ensuring tighter incorporation of both hotspots deep into the hydrophobic pockets of the sprotein, which would explain the higher affinity of sars-cov- sprotein to hace compared to that of sars-cov (lan et al., ) . therefore, it is very likely that selective interaction of hcq with the surface of sars-cov- sprotein, through the formation of an inclined tape over the hydrophobic pocket responsible for hosting the lys hotspot (the oh group in this case is acting like a hook by forming a hydrogen bond with asn ), can be responsible for the prevention of tighter binding with hace protein by restricting penetration of lys into its final destination on the sprotein rbd (figure ). similar to asn in sars-cov- , thr in sars-cov sprotein was previously reported by shang et al. ( ) to interact with tyr , lys , gly and asp in hace protein. we advocate here that similarity in the hydrogen bond networking system between both types of sproteins (i.e. 
that of sars-cov and sars-cov- ) and that of hace may be used to explain the supposed efficacy of hcq in inhibiting sars-cov and sars-cov- interaction with hace . on the other hand, our molecular docking results also showed good interaction between the quinoline aromatic ring in hcq and his in hace ; in normal circumstances, the latter residue interacts well with leu and gln of the sprotein. by doing that, hcq can also disturb interaction in the middle region of the binding interface between the sprotein and hace (figure ) . previously, samarth and kirk ( ) studied the interaction of hydroxychloroquine/azithromycin with sars-cov- sprotein-hace complex using a virtualised quantum mechanical modelling approach. in agreement with our study, the authors found that hcq had low potency of interaction with the studied complex compared to azithromycin; they also recommended molecular docking studies to further strengthen their rationale. our approach to study the interaction of molecules with the receptor binding motif (rbm) of the sprotein prior to complexation with hace has helped properly analyse the type of interaction with the sprotein, and how such interaction could prevent hace from docking onto the rbm of sprotein. in addition, we successfully addressed some of the aspects surrounding the mechanism of action of hcq and artemisinin derivatives in preventing the interaction between the virus' sprotein and hace receptor via selectively interacting with the lys binding hotspot of sprotein. with this information on hand, we then elected to perform in-silico screening of other potent antimalarial compounds derived from the core structure of artemisinin; by doing so, we believe we can gain more insight about the potential use of such class of compounds as safer and more potent substitutes to hcq. in this regard, a total number of compounds were successfully screened against sprotein rbd using the same molecular docking approach previously followed with hcq. 
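screening several ligands with the same docking protocol comes down to collecting the best vina score of each run and sorting. autodock vina writes one `REMARK VINA RESULT:` line per binding mode in its output pdbqt file, with the predicted affinity (kcal/mol) as the first number. the following is a minimal sketch of that bookkeeping; the function names, ligand names and scores in the usage example are invented for illustration and do not correspond to the table of the paper.

```python
import re
from typing import Dict, List, Tuple

# First number after "REMARK VINA RESULT:" is the predicted affinity in kcal/mol.
VINA_RESULT = re.compile(r"^REMARK VINA RESULT:\s+(-?\d+(?:\.\d+)?)", re.MULTILINE)


def best_score(pdbqt_text: str) -> float:
    """Return the most favourable (most negative) affinity in a Vina output."""
    scores = [float(value) for value in VINA_RESULT.findall(pdbqt_text)]
    if not scores:
        raise ValueError("no 'REMARK VINA RESULT' line found")
    return min(scores)


def rank_ligands(outputs: Dict[str, str]) -> List[Tuple[str, float]]:
    """Sort ligands from best (most negative) to worst docking score."""
    ranked = [(name, best_score(text)) for name, text in outputs.items()]
    return sorted(ranked, key=lambda pair: pair[1])


if __name__ == "__main__":
    # Hypothetical outputs, truncated to the lines that matter here.
    demo = {
        "ligand_a": "MODEL 1\nREMARK VINA RESULT:      -6.0      0.000      0.000\n",
        "ligand_b": "MODEL 1\nREMARK VINA RESULT:      -7.5      0.000      0.000\n"
                    "MODEL 2\nREMARK VINA RESULT:      -7.1      1.2      2.3\n",
    }
    for name, score in rank_ligands(demo):
        print(f"{name}: {score} kcal/mol")
```

ranking by score alone is only a first filter; as the discussion that follows makes clear, the pose of each ligand relative to the binding hotspots still has to be inspected.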
the obtained results are shown in table . analysis of the data shows that artemisinin and its derivative compounds scored better than hcq, with the compounds in entry & of table producing the weakest vina score (- . kcal mol− ), the closest to that of hcq. on the other hand, artelinic acid (table , entry ) gave the best vina score of − . kcal mol− ; however, this compound was discarded due to its high toxicity levels (li et al., ) . although artemisinin and its derivative compounds resulted in good vina scoring (- . score . kcal mol− ), only those approved and prescribed as antimalarials were selected, namely artesunate (table , entry ), artemisinin (table , entry ) and artenimol (table , entry ). these were found to possess good clinical records with very promising antiviral properties, thus enhancing their potential to be repurposed for the treatment of sars-cov- . artesunate (table , entry ) had a vina score of − . kcal mol− . the calculated inhibition constant (ki) of each top scoring pose is also reported in table . analysis of the data shows that the artemisinin class of compounds possesses much lower ki than hcq, which makes them good antiviral candidates against sars-cov- . the elevated ki value of hcq also reflects the high cytotoxic concentration of hcq (cc > mm at multiplicity of infection (moi) = . ) required to eradicate the virus during in-vitro assay studies as previously reported by liu et al. ( ) . besides their antiviral activity, artemisinin derivatives hold immunoregulatory properties and modulate neutrophils, t-cell and b-cell components of the immune system (yao et al., ) , thus enhancing immunity and making them promising candidates to synergistically enhance their antiviral effect in vivo and treat inflammation-associated diseases (li et al., ) . 
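the text does not state how the inhibition constants were derived from the docking scores. one widely used convention, assumed here purely for illustration, treats the docking score as a binding free energy ΔG and applies ki = exp(ΔG / (r·t)), with r the gas constant and t the absolute temperature. the function name, the default temperature of 298.15 k and the example scores below are assumptions, not values from the paper.

```python
import math

# Gas constant in kcal mol^-1 K^-1.
GAS_CONSTANT_KCAL = 1.987204e-3


def inhibition_constant(delta_g_kcal_per_mol: float, temperature_k: float = 298.15) -> float:
    """Estimate Ki (in mol/L) from a binding free energy in kcal/mol.

    Uses Ki = exp(dG / (R * T)); a more negative dG gives a smaller Ki.
    The default temperature is an assumption, not a value from the study.
    """
    if temperature_k <= 0:
        raise ValueError("temperature must be positive (kelvin)")
    return math.exp(delta_g_kcal_per_mol / (GAS_CONSTANT_KCAL * temperature_k))


if __name__ == "__main__":
    # Arbitrary illustrative scores, not taken from the table.
    for score in (-6.0, -7.0, -8.0):
        ki_micromolar = inhibition_constant(score) * 1e6
        print(f"dG = {score} kcal/mol -> Ki ~ {ki_micromolar:.2f} uM")
```

because the relation is exponential, a difference of about 1.4 kcal/mol in score changes the estimated ki by roughly a factor of ten at room temperature, which is why modest differences in vina score translate into large differences in the reported ki values.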
the nature of interaction between the aforementioned three compounds and the rbd of sars-cov- sprotein was also analysed in order to see which molecule best prevents hace lys and lys from binding to the inner hydrophobic pockets of the sprotein. by analysing the top scoring pose of artesunate (table , entry ), we observe that the molecule binds far away from the hydrophobic regions of the lys and lys hotspot binding sites (figure ) . furthermore, upon alignment of the artesunate docking results with the structure of the sprotein-hace complex, no clashes were observed between the artesunate structure and the other side-chains present in the hace protein. however, the carboxylic acid moiety of the artesunate side chain was observed to form hydrogen bonding with lys (n), which can further neutralise the overall charge through the formation of a salt bridge; this could adversely lead to tighter interaction between hace and sprotein. therefore, in spite of the high vina scoring associated with artesunate, we predict that this molecule is unlikely to act as a good inhibitor, in its current form, of sars-cov- sprotein (figure ) . in the case of artemisinin's (table , entry ) top pose, although no hydrogen bonding is observed with the sprotein rbd, we notice a lateral incorporation of the six-membered cyclohexane ring of artemisinin into the lys hotspot binding pocket, with the peroxy-bridge facing the peripheral hydrophilic surface of the binding region ( figure (a) , region coloured in blue next to the peroxy-bridge). such a mode of interaction could well prevent the penetration of the lys side-chain into the hydrophobic pocket ( figure ). lateral penetration of artemisinin into these binding hotspots may reduce the random motion of the α-helix and loops present in the sprotein and its capability to attach to hace, as hypothesized by gupta et al. 
( ), where binding with phytochemicals reduces this motion in the envelope (e) protein of sars-cov , thereby inhibiting modulation of ion channel activity and stopping the pathogenesis caused by sars-cov . artemisinin was also found to interact with the lys hotspot binding pocket, although at a slightly lower vina score (- . kcal mol− ). this moderate score is perhaps attributable to the absence of hydrophilic surfaces close to the binding pocket ( figure ) . therefore, the selective interaction of artemisinin with both lys and lys hotspot binding regions raises the possibility of repurposing it for the treatment of sars-cov- patients following successful clinical trials. artemisinin is better tolerated by the human hepatoma cell line hepg , with cc of mm, than artesunate (cc of mm) (romero et al., ) . additionally, in vitro cytotoxicity levels of artesunate against human epithelial cells (hela cells) and human foreskin fibroblasts (hff cells) indicate very low tolerance, with cc of . mm and . mm respectively (he et al., ) . artenimol, on the other hand, showed a similar mode of interaction to that of artemisinin with both lys and lys hotspot binding sites, although at slightly lower vina scores of − . and − . kcal mol− , respectively (figure ) . both artemisinin and artesunate are susceptible to cyp reduction to generate artenimol once incorporated into the human body, albeit at different conversion rates. we therefore recommend that artenimol be prioritised for clinical trials to achieve the repurposing of this class of molecules for covid- ; however, careful consideration needs to be given to the water solubility characteristics of each compound (woodrow et al., ) . successful completion of md calculations permitted us to obtain two crucial graphs, i.e. that of root-mean square deviation (rmsd) vs. time (figure ) and the root-mean square fluctuation (rmsf) of each residue in the three protein complexes, i.e. 
that of the sprotein rbm in complexation with hcq, artemisinin or artenimol (figure ). rmsd represents a measure of the average change in the displacement of an atom for a particular frame compared to a reference frame. on the other hand, rmsf measures the local changes of each residue in the protein backbone. figure indicates good protein-ligand stability for all three complexes, with the hcq-protein complex showing the lowest rmsd value ( . Å) followed by the artenimol-protein complex ( . Å) and the artemisinin-protein complex ( . Å). similarly, the artemisinin-protein complex showed the highest deviation ( . Å) followed by the artenimol-protein complex ( . Å) and the hcq-protein complex ( . Å). those values imply the
currently, the major production of artemisinin is based on solvent extraction from a. annua, despite modest but insufficiently scalable trials to produce it chemically or semisynthetically via its precursor artemisinic acid in engineered bacteria (hale et al., ; dietrich et al., ). artemisinin is abundant in a. annua leaves ( . - . %) and production includes several steps, starting with screening and drying, before the biomass is processed, generally via solvent leaching or percolation at - °c using low polarity solvents like hexane, toluene or petroleum (chemat et al., ) . this operation is not selective for artemisinin; therefore terpenes, fatty acids and some pigments are inevitably co-extracted, which calls for secondary refining steps including adsorption, flash chromatography and sequential crystallization. herein, an indicative facile production setup is proposed to enhance worldwide production capacity ( figure ). however, fireproof equipment and facilities are a prerequisite to ensure that safety and security measures are met. the plant comprises an extraction step ( ) in which biomass is placed in hollow-fibre bags and processed at °c for min using a solvent mixture of hexane/ethyl acetate ( : v/v) with a solvent/biomass ratio of to ( l for each kg) (chemat et al., ) . 
this step can be conducted by means of a stirring tank or a percolation type reactor to ensure up to % extraction yield is achieved.
figure legend: : extraction reactor; : frame and plate filter-press (or vibrating-screener/decanter); : adsorption column bed; : clarification column bed; : crystallization stirred reactor; : spray dryer; : distillation column; : solvent storage tank
then, the mixture passes through a cloth or diaphragm plate and frame filter-press ( ) in order to discard fine biomass particles and recover the extract. the latter is submitted to an adsorption bed column ( ) filled with activated carbon aiming at the removal of pigments and tannins. another clarification stage is required to remove other impurities such as free fatty acids and pigments; for instance, an adsorption bed column ( ) filled with celite (merck) is recommended (chemat et al., ) . due to some affinity of artemisinin for activated carbon and celite , an artemisinin loss of - % is expected. after that, the purified extract should be concentrated to at least / th of its initial volume using an evaporator. the concentrated extract is fed into a jacketed crystallization reactor ( ) equipped with a stirring shaft set at a tip-speed in the range of - rpm to control breakage effects and to generate a narrow particle size distribution (huter et al., ) . the cooling rate is set to . °c/min to reach °c, and this temperature is held for min to let the artemisinin crystals settle. the crystals are sent into a spray drying system ( ) to recover high purity crystals of - % as a final product. the overall yield of artemisinin is expected to reach % of the initial content of artemisinin in a. annua leaves. the spent mother liquor is directed to another round of crystallization with a longer residence time. the recovered crude crystals are washed with cold ethanol to recover purer artemisinin and increase the final yield. 
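the overall yield quoted for the proposed plant can be read as the product of the stage-wise recoveries: extraction yield, the fraction surviving the adsorption and clarification beds, and the crystallization recovery. the sketch below works through that arithmetic. treating the stages as independent multiplicative factors is an assumption made here, the function names are hypothetical, and the numbers in the usage example are arbitrary placeholders because the actual percentages are not recoverable from this text.

```python
def overall_yield(extraction_yield: float,
                  adsorption_loss: float,
                  crystallization_recovery: float) -> float:
    """Overall artemisinin recovery as a fraction of the leaf content.

    Assumes each stage acts independently on the output of the previous one:
    overall = extraction * (1 - adsorption loss) * crystallization recovery.
    """
    for name, value in (("extraction_yield", extraction_yield),
                        ("adsorption_loss", adsorption_loss),
                        ("crystallization_recovery", crystallization_recovery)):
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"{name} must be a fraction between 0 and 1")
    return extraction_yield * (1.0 - adsorption_loss) * crystallization_recovery


def artemisinin_mass_kg(biomass_kg: float, leaf_content: float, yield_fraction: float) -> float:
    """Mass of artemisinin recovered from a batch of dried leaves."""
    return biomass_kg * leaf_content * yield_fraction


if __name__ == "__main__":
    # Placeholder fractions, for illustration only.
    y = overall_yield(extraction_yield=0.9, adsorption_loss=0.1, crystallization_recovery=0.8)
    print(f"overall yield: {y:.3f}")
    print(f"recovered from 1000 kg at 1% leaf content: {artemisinin_mass_kg(1000, 0.01, y):.2f} kg")
```

a calculation of this kind shows which stage dominates the losses; recycling the mother liquor through a second crystallization, as described above, acts on the last factor.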
the inhibition of sars-cov- sprotein rbd with hcq was successfully studied using molecular docking techniques. hcq was found to selectively interact with the lys hotspot binding pocket via the formation of an inclined tape over the binding site, with the oh group of hcq acting like a hook. the artemisinin class of compounds was also found to interact with the same binding pocket. in addition, artemisinin & derived molecules showed an extra mode of interaction with the lys binding hotspot, although at a slightly lower vina score. molecular dynamics studies confirmed that the formed complexes are able to interact and remain stable in the active site of their respective targets. these results demonstrate the likelihood of repurposing artemisinin as a less toxic substitute for hcq to block the sprotein rbd of the virus from docking onto hace , while at the same time enhancing the immune system of the patient. more focus should be directed to studying the in-vivo mode of action of artenimol, as most artemisinin derivatives are converted to this compound once incorporated into the body.
novel coronavirus structure, mechanism of action, antiviral drug promises and rule out against its treatment extraction mechanism of ultrasound assisted extraction and its effect on higher yielding and purity of artemisinin crystals from artemisia annua l. leaves biosynthesis of spathulenol and camphor stand as a competitive route to artemisinin production as revealed by a new chemometric convergence approach based on nine locations' field-grown artemisia annua l the use of antimalarial drugs against viral infection. 
microorganisms potential antivirals and antiviral strategies against sars coronavirus infections a novel semi-biosynthetic route for artemisinin production using engineered substrate-promiscuous p (bm ) inhibition of vesicular stomatitis virus glycoprotein expression by chloroquine in-silico approaches to detect inhibitors of the human severe acute respiratory syndrome coronavirus envelope protein ion channel microbially derived artemisinin: a biotechnology solution to the global problem of access to affordable antimalarial drugs a review on the cleavage priming of the spike protein on coronavirus by angiotensin-converting enzyme- and furin unique and highly selective anticytomegalovirus activities of artemisinin-derived dimer diphenyl phosphate stem from combination of dimer unit and a diphenyl phosphate moiety systematic and model-assisted process design for the extraction and purification of artemisinin from artemisia annua l identification of chymotrypsin-like protease inhibitors of sars-cov- via integrated computational approach structure of the sars-cov- spike receptor-binding domain bound to the ace receptor toxicity evaluation of artesunate and artelinate in plasmodium berghei-infected and uninfected rats hydroxychloroquine, a less toxic derivative of chloroquine, is effective in inhibiting sars-cov- infection in vitro repurposing therapeutics for covid- : supercomputer-based docking to the sars-cov- viral spike protein and viral spike protein-human ace interface early treatment of covid- patients with hydroxychloroquine and azithromycin: a retrospective analysis of cases in marseille computational studies of drug repurposing and synergism of lopinavir, oseltamivir and ritonavir binding with sars-cov- protease against covid- ucsf chimera-a visualization system for exploratory research and analysis potential inhibitors of the enzyme acetylcholinesterase and juvenile hormone with insecticidal activity: study of the binding mode via docking and molecular 
dynamics simulations effect of artemisinin/artesunate as inhibitors of hepatitis b virus production in an "in vitro" replicative system energetics based modeling of hydroxychloroquine and azithromycin binding to the sars-cov- spike (s)protein -ace complex mechanisms of action of hydroxychloroquine and chloroquine: implications for rheumatology structural basis of receptor recognition by sars-cov- characterization of severe acute respiratory syndrome-associated coronavirus (sars-cov) spike glycoprotein-mediated viral entry autodock vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization and multithreading emerging wuhan (covid- ) coronavirus: glycan shield and structure prediction of spike glycoprotein and its interaction with human cd . emerging microbes & infections chloroquine is a potent inhibitor of sars coronavirus infection and spread remdesivir and chloroquine effectively inhibit the recently emerged novel coronavirus ( -ncov) in vitro review of side effects and toxicity of chloroquine. the bulletin of the world health organization structural basis for the recognition of sars-cov- by full-length human ace ph-dependent entry of severe acute respiratory syndrome coronavirus is mediated by the spike glycoprotein and enhanced by dendritic cell transfer through dc-sign immunomodulation of artemisinin and its derivatives a pneumonia outbreak associated with a new coronavirus of probable bat origin the authors would like to acknowledge the support of the directorate general of scientific research and technological development of the ministry of high education and scientific research in algeria. we are thankful to dr. djamila benrezkallah (djillali liabes university of sidi bel abbes, algeria) and al-farabi cluster computer of the ecole nationale polytechnique oran -maurice audin for running the md computations and insightful analysis of the molecular dynamics calculation. 
no potential conflict of interest was reported by the author(s). http://orcid.org/ - - - key: cord- -nnv e gr authors: mulgaonkar, nirmitee; wang, haoqi; mallawarachchi, samavath; fernando, sandun; martina, byron; ruzek, daniel title: bcr-abl tyrosine kinase inhibitor imatinib as a potential drug for covid- date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: nnv e gr the rapid geographic expansion of severe acute respiratory syndrome coronavirus (sars-cov- ), the infectious agent of coronavirus disease (covid- ) pandemic, poses an immediate need for potent drugs. enveloped viruses infect the host cell by cellular membrane fusion, a crucial mechanism required for virus replication. the sars-cov- spike glycoprotein, due to its primary interaction with the human angiotensin-converting enzyme (ace ) cell-surface receptor, is considered as a potential target for drug development. based on in silico screening followed by in vitro studies, here we report that the existing fda-approved bcr-abl tyrosine kinase inhibitor, imatinib, inhibits sars-cov- with an ic of nm. we provide evidence that although imatinib binds to the receptor-binding domain (rbd) of sars-cov- spike protein with an affinity at micromolar, i.e., . ± . μm levels, imatinib does not directly inhibit the spike rbd:ace interaction – suggesting a bcr-abl kinase-mediated fusion inhibition mechanism is responsible for the inhibitory action. we also show that imatinib inhibits other coronaviruses, sars-cov, and mers-cov via fusion inhibition. based on promising in vitro results, we propose the abl tyrosine kinase inhibitor (atki), imatinib, to be a viable repurposable drug against covid- . in early december , the chinese health authorities reported several cases of pneumonia of unknown cause that had originated in wuhan, a city in the hubei province of china. 
the causative agent of this outbreak was identified as a virus of the sarbecovirus subgenus, orthocoronavirinae subfamily, which was initially referred to by its interim name, novel coronavirus ( -ncov) [ , ] , and was later named sars-cov- [ ] . due to the rapid spread of covid- , the world health organization (who) declared it a global pandemic in march [ ] . by mid-august , over million cases had been confirmed around the world, resulting in more than , deaths [ ] . unfortunately, there is no approved antiviral treatment or preventive vaccine for coronaviruses in humans. since supportive care is the only recommended interim treatment, it is imperative to identify repurposable lead compounds to rapidly treat covid- patients until a sars-cov- -specific drug and a vaccine are developed. although the coronavirus genome encodes numerous conserved druggable enzymes, including papain-like protease (plpro), c-like protease ( clpro), and the non-structural proteins rna-dependent rna polymerase (rdrp) and helicase, development of clinically approved antiviral therapies has proven to be a difficult task [ ] . the surface structural spike glycoprotein (s), a key immunogenic cov antigen essential for virus and host cell-receptor interactions, is an important target for therapeutic development. the spike protein consists of an n-terminal s subunit (receptor binding) and a c-terminal s subunit (membrane fusion). the s subunit contains the receptor-binding domain (rbd), which attaches to the host membrane and thus plays an important role in viral entry. sars-cov- utilizes the ace receptor for entry and the transmembrane protease, serine (tmprss ) for spike protein priming [ ] . crystallographic studies have shown that sars-cov- binds to the ace receptor with a binding mode nearly identical to that of sars-cov [ ] [ ] [ ] [ ] . 
the binding affinity of the ace receptor for the rbd of the sars-cov- spike protein is reported to be significantly higher than that for sars-cov [ , ] . given the importance of virus membrane fusion events in the viral life cycle and infectivity, the spike protein of sars-cov- was targeted for drug screening. this study uses in silico methodology followed by in vitro experimental validation to screen existing fda-approved small-molecule drugs against the rbd of the sars-cov- spike protein, in order to identify repurposable drugs for further clinical validation. a model of the sars-cov- spike protein was constructed using the crystal structure ( vsb_chain a) to correct missing residues. the amino acid sequence identity between the target sequence (genbank: qhd . ) and template ( vsb_chain a) was . %. the sars-cov- model showed an rmsd of . Å relative to the crystal structure ( vsb_chain a). structure assessment of the predicted model using the ramachandran plot showed . % of residues in the most favored regions with . % outliers. none of the outliers were residues present at the active site of the protein. the predicted model was then used for in silico studies. in virtual screening, a library of approximately , compounds was docked against the sars-cov- spike rbd protein. the output was analyzed for common classes of drugs with the highest (most negative) docking scores, which yielded seven compounds: three compounds from the enamine antiviral library, antiviral , antiviral and antiviral , with docking scores of - . ± . , - . ± . and - . ± . kcal/mol, and four compounds from the zinc fda library, ponatinib, imatinib, ergotamine, and glecaprevir, with docking scores of - . ± . , - . ± . , - . ± . and - . ± . kcal/mol, respectively. these libraries were chosen to help identify a repurposable drug that can potentially inhibit sars-cov- . 
the screened compounds had the highest scores within their respective sets and had one or more binding conformations at the ace binding domain of the spike protein. the most common class of drugs was found to be abl tyrosine kinase inhibitors (atki), and hence two drugs (ponatinib and imatinib) with the highest scores were selected for in vitro testing. the binding scores for the seven screened compounds at the rbd are shown in fig. a and detailed description of the screened drugs is given in table s under supplementary data. the high affinity of the screened compounds is visible when compared with the negative control dimethyl sulfoxide (dmso), which is ineffective against coronaviruses [ ] . based on promising in silico data, and initial viral plaque assay results (fig. a) , imatinib was chosen to be advanced for further experimental validation. (due to a supply-shortage ponatinib and ergotamine were unavailable for purchase and hence could not be included in the initial viral plaque assays). previous studies have shown imatinib to inhibit sars-cov and mers-cov by blocking endosomal fusion at the cell-culture level [ , [ ] [ ] [ ] [ ] . it has been suggested that tyrosinekinase inhibitors do not affect the cleavage of the spike protein but inhibit spike-mediated endosomal fusion [ , , ] . the high affinity of tyrosine-kinase inhibitors towards the spike protein is deduced from the initial docking results, where both imatinib and ponatinib have shown highly negative binding free energies. first, we evaluated the toxicity of imatinib when incubating the compound on vero cells for one hour or eight hours. in the experiments where the compound remained on the cells for eight hours toxicity was measured at concentrations of µm, . µm, . µm, and . µm. however, in the -hour design, no toxicity was observed. next, we evaluated the ability of imatinib to inhibit replication and entry. at concentrations as low as . 
µm the compound was effective in suppressing % of plaque formation in the -hr design, and the ic value determined using linear regression was nm. consistent with the toxicity data, toxicity was observed between and . µm. the compound also showed efficacy in the -hour design, with higher ic values. these data indicate that imatinib inhibits virus replication in vitro, as shown in fig. a and b. to evaluate whether imatinib inhibits viral entry, we performed two fusion assays: endosomal (vero) and plasma membrane (vero-tmprss ), as shown in fig. c and d, respectively. in terms of cytotoxicity, no toxicity was observed microscopically at concentrations below nm (red arrow in the graph). the vsv-g control revealed % infectivity (cytopathic effect) at every concentration below this, suggesting the inhibitor did not affect vsv-g entry. vsv-g particles do not carry spike proteins, and thus no significant entry inhibition occurred, suggesting that entry inhibition is likely mediated through the spike protein. however, the effect on vero-tmprss cells was less clear for any of the coronaviruses used when compared to the vsv-g control. a similar level of toxicity was observed in these cells. it is worth noting that toxicity is probably the result of incubating cells with imatinib for hours in the assay. taken together, there is evidence that imatinib inhibits spike fusion and prevents viral entry, possibly by preventing endosomal entry. the binding kinetics of imatinib to the rbd of the sars-cov- spike protein were evaluated using biolayer interferometry (bli), as shown in fig. a . the analysis showed that imatinib binds to the sars-cov- rbd protein with an on-rate (kon) of ( . ± . ) × m - s - and dissociates with an off-rate (koff) of ( . ± . ) × - s - . this resulted in an equilibrium dissociation constant (kd) of . ± . µm, which is calculated as the ratio of koff to kon. 
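the relation stated above can be illustrated with a minimal sketch. it computes kd as koff/kon and, assuming simple 1:1 binding, the fraction of binding sites occupied at a given ligand concentration. the rate constants below are placeholder values chosen for illustration only, not the values measured in this study, and the function names are hypothetical.

```python
def dissociation_constant(k_on, k_off):
    """Equilibrium dissociation constant Kd (M) = k_off (1/s) / k_on (1/(M*s))."""
    if k_on <= 0:
        raise ValueError("k_on must be positive")
    return k_off / k_on


def fractional_occupancy(ligand_conc, kd):
    """Fraction of sites bound under a 1:1 binding model: [L] / ([L] + Kd)."""
    if ligand_conc < 0 or kd <= 0:
        raise ValueError("ligand_conc must be >= 0 and kd must be > 0")
    return ligand_conc / (ligand_conc + kd)


# Placeholder rate constants (assumptions, not the study's measurements).
example_k_on = 1.0e3    # 1/(M*s)
example_k_off = 2.0e-3  # 1/s

example_kd = dissociation_constant(example_k_on, example_k_off)
print(f"Kd = {example_kd * 1e6:.2f} uM")

# Occupancy at a few ligand concentrations expressed as multiples of Kd.
for multiple in (0.1, 1.0, 9.0):
    theta = fractional_occupancy(multiple * example_kd, example_kd)
    print(f"[L] = {multiple:g} x Kd -> occupancy = {theta:.2f}")
```

under this model, half of the sites are occupied when the ligand concentration equals kd, which is why a micromolar kd implies that micromolar ligand concentrations are needed for substantial occupancy.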
the affinity values indicate that % of the rbds on the surface spike glycoproteins will be occupied at micromolar concentrations of imatinib. however, this value is too close to the toxicity levels observed in the above assays and very high compared to the ic value. together with the nanomolar affinity of ace for immobilized rbd (fig. s ), this suggests that imatinib is likely to inhibit spike fusion by the other previously suggested mechanism of action (moa) [ ] . in vitro colorimetric assays were performed over a pm to nm range of imatinib concentrations to assess the ability of imatinib to directly inhibit the rbd:ace interaction. the colorimetric signal of the positive control (no inhibitor) reaction was strong, and the blank wells exhibited an absorbance of ~ . at nm, as per the manufacturer's instructions. the test wells (with inhibitor) showed absorbance comparable to the positive control wells, indicating that imatinib did not affect the sars-cov- rbd:ace interaction in the indirect competitive enzyme-linked immunosorbent assay (elisa), as shown in fig. b . a pharmacophore analysis was performed to evaluate why imatinib showed promising in silico results yet failed to directly inhibit the rbd:ace interaction. the primary binding site of the sars-cov- rbd was revealed via docking. pharmacophore analyses were done to further elucidate the interactions between the drug molecules and their receptors (fig. ) . twenty-five pharmacophores were collected from the top five binding positions of imatinib at the primary active site of abl tyrosine kinase (native receptor), where each purple sphere represents a pharmacophore, as depicted in fig. a. similarly, an additional pharmacophores were collected from the first five binding sites of imatinib at the primary active site of the sars-cov- rbd. the rbd pharmacophores are represented as yellow spheres in fig. b. from the results of the elixir-a alignment, it is evident that four pharmacophores overlapped between abl kinase and the rbd (red spheres). 
this significant overlap explains why the compounds that were originally screened against the sars-cov- rbd also bind to abl kinase, ultimately resulting in the inhibitory action. this point is further supported by the . % identity between the active sites of abl (uniprotkb: p aa - ) and the sars-cov- spike rbd (uniprotkb: p dtc aa - ), as determined by a protein blast (blastp) [ , ] search. there is an urgent need to find a treatment for the current sars-cov- pandemic. health experts across the globe are trying to use existing clinically approved drugs to treat patients until a specific drug is developed. the present study, using a combination of computational techniques followed by in vitro studies, identified imatinib, an fda-approved anti-cancer drug, as a potential treatment for sars-cov- infection. the data indicate inhibition of sars-cov- replication at an ic of nm. our results suggest that imatinib prevents viral replication by inhibiting the virus at the fusion stage, possibly by preventing endosomal entry. binding studies revealed that the affinity of imatinib for the sars-cov- spike rbd protein is lower (higher kd value) than the previously published nanomolar-range values (ligand id: bdbm ) [ ] for imatinib on abl tyrosine kinase [ ] , and is in the range of the micromolar affinities of imatinib for the src-family kinases frk and fyn [ ] . although imatinib is not a promiscuous drug, it has been found to bind tightly to tyrosine kinases other than abl [ , ] . pharmacophore mapping between abl and the sars-cov- rbd, together with the . % identity at the active sites of the two proteins, explains why imatinib binds to the sars-cov- rbd as well. however, imatinib failed to directly inhibit the sars-cov- spike rbd:ace interaction in the competitive elisa assays. therefore, it is likely that imatinib inhibits virus fusion via a cellular kinase pathway, resulting in inhibition of virus replication, as previously described for other coronaviruses [ ] . 
the results provide further evidence supporting the recent clinical trials (clinicaltrials.gov identifier: nct , nct , nct , nct , and nct ) for covid- patients with imatinib. a swiss-model server [ ] was used to construct a homology model of the sars-cov- spike protein using the crystal structure of the sars-cov- spike protein (pdb: vsb_chain a) as the template [ ] . the genome sequence wuhan-hu- (genbank: mn . ) was used as a representative of the sars-cov- . spike protein sequence (genbank: qhd . ) was used as the target sequence [ ] . the swiss-model structure assessment tool was used to validate the quality of the predicted model. around , compounds, including , nucleoside-like compounds from the enamine targeted antiviral library (enamine.net) and , food and drug administration (fda)approved drugs from the zinc database [ ] were used for molecular docking. all molecules were prepared with obabel [ ] from .sdf or .mol format to .pdbqt format. the d compound structures from the enamine library were resolved by obabel --gen d command. the docking file of the protein model was prepared with mgltools v . . [ ] and the molecules were docked at the rbd of the spike protein via autodock vina . . [ ] . the grid box of × × size with . Å spacing was fixed around the rbd (thr -val ) of the spike protein. each docking was done in three replicates, and the conformation with the highest binding score was recorded. the batch processing of docking and data collection was performed using an in-house python script which is deposited in github. data were analyzed statistically using r studio [ ] and graphs were constructed with ggplot in r [ ]. the ligand-receptor interactions were studied using schrödinger maestro [ ] , and molecules with high docking scores were selected from each screening library for further studies. 
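the in-house batch script is not reproduced here. as a hedged sketch of only the bookkeeping step described above (three replicate dockings per compound, with the best score recorded), the following example summarizes replicate docking scores and ranks compounds. it assumes that "highest binding score" means the most negative score in kcal/mol, as stated in the results; the compound names, scores, and function names are hypothetical.

```python
from statistics import mean, stdev


def summarize_replicates(scores_by_compound):
    """Summarize replicate docking scores (kcal/mol) for each compound.

    The best score is taken as the most negative value; mean and sample
    standard deviation are reported across the replicates.
    """
    summary = {}
    for name, scores in scores_by_compound.items():
        if len(scores) < 2:
            raise ValueError(f"{name}: at least two replicates are required")
        summary[name] = {
            "best": min(scores),
            "mean": mean(scores),
            "sd": stdev(scores),
        }
    return summary


def rank_by_best(summary):
    """Return compound names ordered from most to least negative best score."""
    return sorted(summary, key=lambda name: summary[name]["best"])


# Made-up scores for two hypothetical compounds, three replicates each.
example_scores = {
    "compound_a": [-7.0, -7.2, -6.8],
    "compound_b": [-9.1, -8.9, -9.0],
}
example_summary = summarize_replicates(example_scores)
for compound in rank_by_best(example_summary):
    stats = example_summary[compound]
    print(f"{compound}: best={stats['best']:.1f}, "
          f"mean={stats['mean']:.2f} +/- {stats['sd']:.2f} kcal/mol")
```

keeping the per-replicate scores, rather than only the best one, makes it possible to report a mean ± sd alongside the recorded conformation.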
codon-optimized mers-cov (isolate emc, vg -g-n) and sars-cov (isolate cuhk-w ; vg -g-n) s expression plasmids (pcmv) were ordered from sino-biological and subcloned into pcaggs using the clai and kpni sites. the last amino acids of the sars-cov spike protein were deleted to enhance pseudovirus production. codon-optimized cdna encoding sars-cov- s glycoprotein (isolate wuhan-hu- ) with a c-terminal amino acid deletion was synthesized and cloned into pcagss in between the ecori and bglii sites. pvsv-egfp-dg (# ), pmd .g (# ), pcag-vsv-p (# ), pcag-vsv-l (# ), pcag-vsv-n (# ) and pcaggs-t opt (# ) were ordered from addgene. s expressing pcaggs vectors were used for the production of pseudoviruses, as described below. the cdna encoding human tmprss (nm_ ; ohu d) was obtained from genscript. the cdna fused to a c-terminal ha tag was subcloned into pqxcih (clontech) in between the noti and paci sites to obtain the pqxcih-tmprrs -ha vector. vero-tmprss cells were produced by retroviral transduction. to produce the retrovirus, μg pqxcih-tmprrs -ha was co-transfected with polyethylenimine (pei) with . μg pbs-gagpol (addgene # ) and μg pmd .g in a cm dish of % confluent hek- t cells in opti-mem i ( x) + glutamax. retroviral particles were harvested at hours post-transfection, cleared by centrifugation at x g, filtered through a . μm low protein-binding filter (millipore), and used to transduce vero cells. polybrene (sigma) was added at a concentration of μg/ml to enhance transduction efficiency. transduced cells were selected with hygromycin b (invitrogen). hek- t cells were maintained in dulbecco's modified eagle's medium (dmem, gibco) supplemented with % fetal bovine serum (fbs), x non-essential amino acids (lonza), mm sodium pyruvate (gibco), mm l-glutamine (lonza), μg/ml streptomycin (lonza) and u/ml penicillin. vero, vero-tmprss , and veroe cells were maintained in dmem supplemented with % fbs, . 
mg/ml sodium bicarbonate (lonza), mm hepes (lonza), mm l-glutamine, μg/ml streptomycin and u/ml penicillin. all cell lines were maintained at °c in a % co , humidified incubator. the protocol for vsv-g pseudovirus rescue was adapted from whelan and colleagues ( ). briefly, a % confluent cm dish of hek- t cells was transfected with µg pvsv-egfp-dg, µg pcag-vsv-n (nucleocapsid), µg pcag-vsv-l (polymerase), µg pmd .g (glycoprotein, vsv-g), µg pcag-vsv-p (phosphoprotein) and µg pcaggs-t opt (t rna polymerase) using pei at a ratio of : (dna:pei) in opti-mem i ( x) + glutamax. forty-eight hours post-transfection, the supernatant was transferred onto new plates that had been transfected with vsv-g hours earlier. after a further hours, these plates were retransfected with vsv-g. after hours, the resulting pseudoviruses were collected, cleared by centrifugation at x g for minutes, and stored at - °c. subsequent vsv-g pseudovirus batches were produced by infecting vsv-g-transfected hek- t cells with vsv-g pseudovirus at an moi of . . titres were determined by preparing -fold serial dilutions in opti-mem i ( x) + glutamax. aliquots of each dilution were added to monolayers of × vero cells in the same medium in a -well plate. three replicates were performed per pseudovirus stock. plates were incubated at °c overnight and then scanned using an amersham typhoon scanner (ge healthcare). individual infected cells were quantified using imagequant tl software (ge healthcare). all pseudovirus work was performed in a class ii biosafety cabinet under bsl- conditions at erasmus medical center. for the production of mers-cov, sars-cov, and sars-cov- s pseudovirus, hek- t cells were transfected with µg s expression plasmids. twenty-four hours post-transfection, the medium was replaced with opti-mem i ( x) + glutamax, and cells were infected at an moi of with vsv-g pseudotyped virus. 
two hours post-infection, cells were washed three times with opti-mem, and the medium was replaced with medium containing anti-vsv-g neutralizing antibody (clone g f ; absolute antibody) at a dilution of : , to block remaining vsv-g pseudovirus. the supernatant was collected after hours, cleared by centrifugation at x g for minutes and stored at °c until use within days. coronavirus s pseudovirus was titrated on veroe cells as described above. transduction experiments were carried out by incubating pseudovirus with imatinib at concentrations ranging from - nm in opti-mem i ( x) + glutamax for hour at °c. pseudovirus-imatinib mixes were added to monolayers of × vero or vero-tmprss cells in a -well plate. plates were incubated for hours before quantifying gfp-positive cells using an amersham typhoon scanner and imagequant tl software. to determine the toxicity profile of imatinib, we performed the mtt assay using a -hr and an -hr design. briefly, a serial dilution of imatinib was prepared and incubated on vero cells for hr at °c. subsequently, cells were washed and further cultured for eight hours. in the -hr design, cells were incubated with a serial dilution of imatinib for eight hours without a washing step. we tested serial dilutions of imatinib for their ability to neutralize sars-cov- (german isolate; gisaid id epi_isl ; european virus archive global # v- ) using a plaque reduction neutralization test (prnt) as previously described [ ] . fifty μl of the virus suspension ( spot forming units) was added to each well and incubated at °c for hr. following incubation, the mixtures were added onto vero cells and incubated at °c for either hr or hrs. the cells incubated for hr were then washed and further incubated in medium for hrs. after the incubation, the cells were fixed and stained with a polyclonal rabbit anti-sars-cov antibody (sino biological; : ). staining was developed using a rabbit anti-sars-cov serum and a secondary alexa-fluor-labeled conjugate (dako). 
the number of infected cells per well were counted using the imagequant tl software. the binding kinetics of imatinib on sars-cov- rbd protein were studied using a blitz® system (fortébio). experiments were conducted using the advanced kinetics mode, at room temperature and a buffer system consisting of x kinetics buffer (fortébio), % anhydrous dimethyl sulfoxide (dmso; sigma aldrich). recombinant his-tagged sars-cov- rbd protein ( -v h; sino biological) at a concentration of µg/ml was loaded on anti-penta-his (his k) biosensors (fortébio), followed by a washing step with assay buffer to block the unoccupied sensor surface. the association and dissociation profiles of imatinib (sigma aldrich) were measured at various concentrations (four-point serial dilutions from . µm to . µm). a reference biosensor loaded in the same manner with µm imatinib was used for baseline correction in each assay. the final binding curves were analyzed with the blitz pro . software (fortébio) using the : global-fitting model. the assay was repeated twice to validate the binding constants. here, data is represented as mean ± sd. similarly, sars-cov- rbd was immobilized on his k biosensors to study the binding kinetics of mfc-tagged hace ( -h h; sino biological) before being dipped into tubes containing the x kinetics buffer. various concentrations of ace (four-point serial dilutions from to nm) were used to measure the association and dissociation profiles. data were reference subtracted and fit to a : binding model using the blitz pro . software. the ability of imatinib to inhibit the interaction of spike rbd:ace proteins was evaluated by using the spike rbd (sars-cov- ): ace inhibitor screening colorimetric assay kit (bps bioscience). the avi-his-tagged spike s rbd (sars-cov- ) protein ng/well in pbs was coated onto -well microplate by overnight incubation at °c. blocking buffer was used to block the nonspecific binding sites by incubation for hour. 
different concentrations of imatinib were added and incubated for hour at room temperature with slow shaking. for the wells designated "blank" and "positive control", inhibitor buffer (pbs with . % dmso) was added. the reaction was initiated by adding ace his-avi-tagged biotin-labeled hip tm protein ( ng/well) in x immuno buffer to the "positive control" and "test inhibitor" wells by incubation for hour at room temperature with slow shaking. streptavidin-hrp (dilution : , in blocking buffer ) was added to each well and incubated at room temperature for hour with slow shaking. washing procedure ( × µl x immuno buffer ) was performed after each step. the chromogenic reaction was initiated by adding colorimetric hrp substrate to each well and incubated at room temperature until blue color was developed (approximately minutes) in the "positive control" well. after the blue color was developed, the reaction was terminated by adding n hcl, and absorbance at nm was measured using synergy h hybrid multi-mode microplate reader (biotek instruments). the pharmacophore model was generated using pharmit [ ] , an online interactive platform to elucidate pharmacophores from the receptor and ligand complex. top five binding conformations of drug-protein complexes were produced by autodock vina. the pharmacophores of the ligands interacting with the receptor were considered active pharmacophores while the rest were defined as inactive pharmacophores. the pharmacophores from the native receptor of imatinib (abl tyrosine kinase; pdb: gvu) and rbd of sars cov- were generated using imatinib as the ligand. using the enhanced ligand exploration and interaction recognition algorithm (elixir-a), the two sets of pharmacophores were merged and processed to identify any overlap in d space. a detailed description of elixir-a can be found in our previous work [ ] and the algorithm has been deposited in github. 
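elixir-a itself is not reproduced here. as a minimal sketch of the general idea of identifying overlap between two pharmacophore sets in 3d space, the example below pairs points of the same feature type that lie within a distance threshold. it assumes both sets are already aligned in a common coordinate frame; the 1.0 Å threshold, the feature types, the coordinates, and the function name are all illustrative assumptions, not parameters of elixir-a or pharmit.

```python
import math


def overlapping_pharmacophores(set_a, set_b, max_distance=1.0):
    """Return index pairs (i, j) of matching pharmacophore points.

    Each point is a tuple (feature_type, (x, y, z)). Two points match when
    they share a feature type and lie within max_distance (in angstroms).
    Both sets are assumed to be expressed in the same coordinate frame.
    """
    pairs = []
    for i, (type_a, xyz_a) in enumerate(set_a):
        for j, (type_b, xyz_b) in enumerate(set_b):
            if type_a == type_b and math.dist(xyz_a, xyz_b) <= max_distance:
                pairs.append((i, j))
    return pairs


# Hypothetical, pre-aligned pharmacophore points for two receptors.
example_native = [
    ("donor", (0.0, 0.0, 0.0)),
    ("aromatic", (5.0, 0.0, 0.0)),
    ("acceptor", (0.0, 3.0, 0.0)),
]
example_target = [
    ("donor", (0.5, 0.0, 0.0)),
    ("aromatic", (5.0, 2.0, 0.0)),
    ("donor", (0.0, 3.0, 0.2)),
]

matches = overlapping_pharmacophores(example_native, example_target)
print(f"{len(matches)} overlapping pharmacophore pair(s): {matches}")
```

requiring the same feature type as well as spatial proximity avoids counting, for example, a donor and an acceptor that merely happen to sit close together.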
the python script 'elixir-a-vina-batch-screening-module' used for running docking jobs in batch mode and elixir-a, the algorithm used for pharmacophore mapping have been deposited in github and will be made public upon publication of the manuscript. with md simulations. we are thankful to mart lammers for allowing us to use the fusion assays. funding: dr was supported by the ministry of health of the czech republic (project no. - - ). author contributions: sf, bm, and dr conceived and designed the study. first author nm designed the experiments, performed bli studies and immunoassays, reviewed literature, and compiled the manuscript and figures. co-first author hw conducted in silico experiments and compiled figures. sm did literature review on the resulting compounds and compiled the manuscript and figures. bm performed virology experiments. sf directed and verified studies and authored the manuscript. all authors reviewed and edited the paper. competing interests: all the authors declare that there are no conflicts of interest. percent inhibition compared to the amount of plaques on cells. here, -fold dilution of the compounds were done in duplo. then tcid of sars-cov- was added to each well, and plates were incubated at c for hour. then, the mixes were added onto vero cells and incubated for hours at c. subsequently, cells were fixed for min with % pfa, followed by another min fixation with % ethanol. fixed cells were stained with a monoclonal antibody, followed by alexafluor . here imatinib shows significant % inhibition as compared to other compounds tested. ( µg/ml) were incubated for hour at room temperature with slow shaking in the presence of various imatinib concentrations. streptavidin hrp ( : , ) was added to the reaction mixture. colorimetric substrate was added to initiate the chromogenic reaction, and minutes were allowed for color development. the reaction was terminated with the addition of n hcl and absorbance was measured at nm. 
positive control (no inhibitor) was assumed to represent % inhibition. values obtained from test wells (with imatinib) compared to the positive control showed % inhibition of rbd:ace interaction, indicating that imatinib does not inhibit spike fusion by direct inhibition. pharmacophore distribution of five most stable conformations on tyrosine kinase abl (purple spheres); and b] pharmacophore distribution (yellow spheres) on sars-cov- spike protein rbd with pharmacophores common to both receptors depicted in red. a novel coronavirus from patients with pneumonia in china the species severe acute respiratory syndrome-related coronavirus: classifying -ncov and naming it sars-cov- who declares covid- a pandemic an interactive web-based dashboard to track covid- in real time. the lancet infectious diseases coronaviruses-drug discovery and therapeutic options sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor functional assessment of cell entry and receptor usage for lineage b β-coronaviruses discovery of a novel coronavirus associated with the recent pneumonia outbreak in humans and its potential bat origin. biorxiv cryo-em structure of the -ncov spike in the prefusion conformation structure of the sars-cov- spike receptor-binding domain bound to the ace receptor coronavirus s protein-induced fusion is blocked prior to hemifusion by abl kinase inhibitors abelson kinase inhibitors are potent inhibitors of severe acute respiratory syndrome coronavirus and middle east respiratory syndrome coronavirus fusion corona virus drugs -a brief overview of past, present and future repurposing of clinically developed drugs for treatment of middle east respiratory syndrome coronavirus infection gapped blast and psi-blast: a new generation of protein database search programs protein database searches using compositionally adjusted substitution matrices. 
the febs journal bindingdb: a web-accessible database of experimentally determined protein-ligand binding affinities. nucleic acids research a quantitative analysis of kinase inhibitor selectivity a small molecule-kinase interaction map for clinical kinase inhibitors molecular therapeutics: is one promiscuous drug against multiple targets better than combinations of molecule-specific drugs? swiss-model: homology modelling of protein structures and complexes complete genome characterisation of a novel coronavirus associated with severe human respiratory disease in wuhan docking.org: over . billion compounds you can search and buy; million leadlike you can dock. abstracts of papers of the open babel: an open chemical toolbox autodock and autodocktools : automated docking with selective receptor flexibility software news and update autodock vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading gplots: various r programming tools for plotting data. . . release, s., : maestro. schrödinger, llc severe acute respiratory syndrome coronavirus -specific antibody responses in coronavirus disease pharmit: interactive exploration of chemical space a non-beta-lactam antibiotic inhibitor for enterohemorrhagic escherichia coli o : h pharmacophore distribution on sars-cov- spike protein rbd with imatinib we gratefully acknowledge the support from texas a&m high performance research computing (hprc) and tamu laboratory for molecular simulation (lms). we would like to thank dr. lisa perez (associate director for advanced computing enablement, hprc tamu) for guidance key: cord- - hvq qaj authors: nguyen-contant, phuong; embong, a. karim; kanagaiah, preshetha; chaves, francisco a.; yang, hongmei; branche, angela r.; topham, david j.; sangster, mark y. title: s protein-reactive igg and memory b cell production after human sars-cov- infection includes broad reactivity to the s subunit date: - - journal: mbio doi: . 
/mbio. - sha: doc_id: cord_uid: hvq qaj the high susceptibility of humans to severe acute respiratory syndrome coronavirus (sars-cov- ) infection, the cause of coronavirus disease (covid- ), reflects the novelty of the virus and limited preexisting b cell immunity. igg against the sars-cov- spike (s) protein, which carries the novel receptor binding domain (rbd), is absent or at low levels in unexposed individuals. to better understand the b cell response to sars-cov- infection, we asked whether virus-reactive memory b cells (mbcs) were present in unexposed subjects and whether mbc generation accompanied virus-specific igg production in infected subjects. we analyzed sera and peripheral blood mononuclear cells (pbmcs) from non-sars-cov- -exposed healthy donors and covid- convalescent subjects. serum igg levels specific for sars-cov- proteins (s, including the rbd and s subunit, and nucleocapsid [n]) and non-sars-cov- proteins were related to measurements of circulating igg mbc levels. anti-rbd igg was absent in unexposed subjects. most unexposed subjects had anti-s igg, and a minority had anti-n igg, but igg mbcs with these specificities were not detected, perhaps reflecting low frequencies. convalescent subjects had high levels of igg against the rbd, s , and n, together with large populations of rbd- and s -reactive igg mbcs. notably, igg titers against the s protein of the human coronavirus oc were higher in convalescent subjects than in unexposed subjects and correlated strongly with anti-s titers. our findings indicate cross-reactive b cell responses against the s subunit that might enhance broad coronavirus protection. importantly, our demonstration of mbc induction by sars-cov- infection suggests that a durable form of b cell immunity is maintained even if circulating antibody levels wane. 
the betacoronavirus severe acute respiratory syndrome coronavirus (sars-cov- ), the causative agent of a respiratory disease termed coronavirus disease , emerged in china in late and rapidly spread worldwide ( ). a pandemic was declared in march , and global deaths from covid- now exceed , . the rapid increase in cases in many countries has challenged health care systems, and shutdowns and quarantine measures introduced to slow virus spread have caused major disruptions to society and economies ( ). sars-cov- infection produces a wide spectrum of outcomes. a proportion of infections, likely more than %, remain asymptomatic. most clinical cases develop mild to moderate respiratory symptoms, but up to % progress to a more severe disease with extensive pneumonia ( , ). when sars-cov- emerged and began to spread, the severity of the threat was primarily attributed to the novelty of the virus to the human immune system and, consequently, a lack of preexisting immune memory to quickly clear virus and limit disease progression. four types of common cold coronavirus are endemic in humans, including the alphacoronaviruses e and nl and the betacoronaviruses oc and hku . however, limited relatedness between key structural proteins of these human coronaviruses (hcovs) and those of sars-cov- suggested that significant cross-reactive immunity was unlikely ( , ). initial studies of non-sars-cov- -exposed individuals found negligible levels of igg against the sars-cov- spike (s) protein, the viral attachment protein that binds receptor angiotensin converting enzyme (ace ) on host cells to initiate infection ( ). more recently, however, studies have provided evidence of sars-cov- -reactive b and t cell memory in unexposed subjects that could confer some protection against sars-cov- or modulate disease pathogenesis ( ) ( ) ( ). sera from non-sars-cov- -exposed individuals have been screened for igg binding to the s and s subunits of the sars-cov- s protein. 
the membrane-distal s subunit contains the receptor binding domain (rbd) for receptor recognition, and the membrane-proximal s subunit, which has higher homology among coronaviruses than does s ( , ), mediates membrane fusion to release viral rna into the host cell. in two large cohorts of unexposed subjects, approximately % had igg that bound s but not s or the rbd. approximately % of the subjects had igg against the sars-cov- nucleocapsid (n) protein, which is highly conserved among coronaviruses ( , ). although n is an internal viral protein and not a target of neutralizing antibodies (abs), coronavirus infections typically elicit strong anti-n ab production ( ). the idea that circulating hcovs elicit igg that cross-reacts with sars-cov- is supported by the finding that sars-cov- infection increases igg titers against the s proteins of multiple hcovs ( ). in t cell studies, cd + t cells in up to % of non-sars-cov- -exposed donors responded to epitopes in s and non-s proteins of sars-cov- ( , ). notably, s-reactive cd + t cells in unexposed subjects were mostly reactive to the conserved s subunit, consistent with cross-reactivity to circulating hcovs ( ). sars-cov- -reactive cd + t cells were also detected in unexposed donors, but the response was less marked than for cd + t cells ( ). sars-cov- -reactive memory b cells (mbcs) generated in b cell responses to hcovs are also likely to be present in non-sars-cov- -exposed individuals. indeed, mbcs might be more important than preexisting cross-reactive abs as a source of protection against sars-cov- . igg mbcs are more broadly reactive than bulk serum abs generated against the same antigen, they persist after circulating ab levels wane, and they are readily activated to generate strong ab responses or seed germinal centers for additional rounds of affinity maturation ( ). 
concurrent early production of virus-specific igm and igg in the response to sars-cov- infection suggests a response mediated by igg mbcs as well as by naive b cells ( , ). this picture is supported by identification of b cell subsets with high and low immunoglobulin v gene mutation frequencies during the response to sars-cov- infection ( ). however, little direct analysis of sars-cov- -reactive mbcs in unexposed subjects has been performed. characterization of populations of mbcs generated and/or expanded by sars-cov- infection can also provide insights into cross-reactivity between coronaviruses and participation of preexisting mbcs in the response. wec et al. ( ) used cells from a survivor of the sars-cov outbreak as a source of mbcs that bound the s protein of sars-cov- ; a comprehensive panel of abs expressed by the mbcs was cloned and characterized. notably, most of the highly mutated mabs bound the s subunit of multiple hcov s proteins, often with higher affinity than to the s of sars-cov- . a screening of healthy donors identified low frequencies of mbcs reactive to the s proteins of the sars-cov and sars-cov- ( ). these findings suggest that s -reactive mbcs generated by hcovs were activated and expanded by the sars-cov. rbd-binding mbcs sampled in the convalescent phase of sars-cov- infection expressed abs with relatively low numbers of v gene mutations, suggesting that this component of the response largely reflected naive b cell activation by novel epitopes ( ). to extend our understanding of the b cell response to sars-cov- infection, the current study compared ab and mbc immunities to sars-cov- in unexposed individuals and individuals in the convalescent phase of infection. in particular, we were interested in the presence of sars-cov- -reactive mbcs in unexposed subjects that could confer some protection against sars-cov- and in formation of mbcs by sars-cov- infection to provide durable protection against reinfection. 
most importantly, we demonstrate that sars-cov- infection generates both igg and igg mbcs reactive to the novel rbd and the conserved s subunit of the s protein. long-lived mbcs are thus likely to be available to mediate rapid protective ab responses if circulating ab levels wane and reinfection occurs. our results also draw attention to preexisting sars-cov- -cross-reactive b cell memory corresponding to the s subunit in sars-cov- -naive subjects. we speculate that the strong response to s after sars-cov- infection reflects preexisting s -reactive mbc activation and strengthens broad coronavirus protection. igg against sars-cov- proteins in unexposed subjects primarily targets the s subunit of the s protein. to investigate preexisting b cell immunity to sars-cov- in unexposed individuals and sars-cov- -reactive b cell immunity generated by infection, we analyzed sera and peripheral blood mononuclear cells (pbmcs) from (i) healthy donors sampled prior to the emergence of sars-cov- and (ii) nonhospitalized covid- convalescent subjects sampled to weeks after symptom onset. reactivity was measured against the s protein (including the rbd and s subunit) and n protein of sars-cov- and the s proteins of the human alphacoronavirus e and betacoronavirus oc . h influenza virus hemagglutinin and tetanus toxoid (ttd) were included as control antigens that humans are commonly exposed to through infection and vaccination. serum igg levels were measured by enzyme-linked immunosorbent assay (elisa). approximately one-third of non-sars-cov- -exposed subjects in the healthy donor cohort had low levels of serum igg against the s and n proteins of sars-cov- , likely reflecting cross-reactivity with seasonal hcovs (fig. a) . notably, % of unexposed subjects had igg against the highly conserved s subunit of the s protein. it is possible that inherent features of the bulky s reagent used in our analysis reduced binding by anti-s abs. 
igg that bound the highly novel rbd was not detected in unexposed subjects. all non-sars-cov- -exposed subjects had igg against s proteins of hcovs e and oc , indicating previous infection, and against control proteins h and ttd (fig. c to f). s-and n-specific igg production following sars-cov- infection includes a strong response to the s subunit. levels of igg against s, rbd, s , and n were markedly higher in convalescent subjects than in unexposed subjects, indicating strong induction of these abs by sars-cov- infection (fig. a). in a lower number of convalescent subjects, high anti-s igg titers were associated with low levels of anti-n igg. indeed, more than % of convalescent subjects had anti-n igg levels within the range seen in unexposed subjects, questioning the reliability of using anti-n igg measurement to identify previous sars-cov- infection in recovered patients ( ). notably, serum igg titers against s were consistently higher than against the rbd in convalescent subjects, perhaps reflecting the novelty of the rbd and a response dependent on naive b cell activation (fig. b). interestingly, titers of igg were higher against the s protein of the hcov oc in convalescent subjects than in unexposed subjects, but this was not the case for the s protein of hcov e (or for the control proteins h and ttd) (fig. c to f). the anti-oc s igg titers correlated with those against the sars-cov- s (r s = . , p = . ), rbd (r s = . , p = . ), and s (r s = . , p < . ), indicating a relationship with sars-cov- infection (fig. g). the particularly strong correlation between igg titers against oc s and the sars-cov- s suggests a cross-reactive response to the s subunit. since the healthy donor samples in our analysis were collected to years before the emergence of sars-cov- , we considered the possibility that a recently circulating hcov was responsible for the higher anti-oc s igg titers in the convalescent subjects. 
to exclude this possibility, we measured anti-oc s igg titers in sera collected from health care workers in . the health care workers had cared for hospitalized sars-cov- patients, but all were negative for igg against sars-cov- s and rbd, consistent with the effectiveness of personal protective equipment and appropriate work practices. oc s-reactive igg levels in health care worker sera were similar to those in non-sars-cov- -exposed healthy donor sera and significantly lower than those in sera from convalescent subjects (fig. c). taken together, our results indicate that sars-cov- infection generates a strong igg response that cross-reacts with the s of human betacoronaviruses. strong s-reactive mbc formation following sars-cov- infection includes reactivity to the rbd and s subunit. pbmcs from non-sars-cov- -exposed subjects and convalescent subjects were analyzed for the presence of mbcs reactive to sars-cov- proteins. circulating antigen-specific igg mbc populations were measured by in vitro stimulation of mbcs to induce differentiation into ab-secreting cells (ascs). poststimulation antigen-specific measurement of levels of mbc-derived ascs (mascs) by enzyme-linked immunosorbent spot (elispot) assay or of mbc-derived polyclonal abs (mpabs) by elisa provided a measure of the levels of precursor mbcs ( ). analysis of mascs by elispot assay was performed against the sars-cov- s, rbd, and n proteins and against influenza virus h and ttd. mpab levels were measured against those of antigens used in the elispot assay, as well as sars-cov- s and the s proteins of hcovs oc and e. antigen-specific igg mpab concentrations correlated strongly with the frequency of igg mascs derived from stimulated mbcs (determined for sars-cov- s, sars-cov- rbd, influenza virus h , and ttd, r s = . , . , . , and . , respectively, p ≤ . ), validating the use of the mpab concentration as a measure of the size of specific mbc populations. 
the presence of a low level of igg against the sars-cov- s, rbd, and n proteins in a proportion of unexposed subjects suggested that igg mbcs with the same specificity had also been formed. however, these mbcs were not detected (fig. c), possibly because of very low frequencies in the circulation. in contrast, igg mbcs reactive to the s proteins of the hcovs oc and e and the control proteins h and ttd were detected in nearly % or more of non-sars-cov- -exposed subjects, consistent with the higher levels of serum igg against these antigens (fig. e to h). as expected, sars-cov- rbd-reactive mbcs were not detected in unexposed subjects. in marked contrast to non-sars-cov- -exposed subjects, the vast majority of convalescent subjects had circulating igg mbcs reactive to sars-cov- s, rbd, and s , indicating strong induction by sars-cov- infection of mbcs reactive to novel and conserved regions of the s protein (fig. a and c). notably, numbers of igg mbcs reactive to the s protein of the hcov oc were higher in convalescent subjects than in unexposed subjects (fig. e), but there was no difference between the two subject groups in the levels of igg mbcs reactive to the hcov e s protein or influenza virus h or ttd (fig. b and f to h). s -reactive igg mbc numbers correlated well with levels of igg mbcs reactive to sars-cov- s (r s = . , p < . ) and rbd (r s = . , p = . ) and to s of hcov oc (r s = . , p = . ) but not with those reactive to s of hcov e (r s = − . , p = . ), influenza virus h (r s = . , p = . ), or ttd (r s = . , p = . ). the findings of our mbc analysis are consistent with serum igg measurement and indicate that sars-cov- infection generates igg mbcs reactive to the sars-cov- s that cross-react with the s of human betacoronaviruses. interestingly, only a small proportion of the convalescent subjects generated detectable n-reactive igg mbcs, even though most subjects produced high levels of anti-n igg in serum (fig. c and d). 
it is unclear whether this reflects a real difference between s-reactive mbc formation and n-reactive mbc formation or an effect of the sampling time. overall, we demonstrate that sars-cov- infection induces strong s-reactive mbc formation that would be expected to provide lasting protection against reinfection and, potentially, broad protection against betacoronaviruses. our goals in this study were to investigate sars-cov- -reactive b cell memory in unexposed subjects that could provide some protection against sars-cov- infection and the generation of b cell memory by sars-cov- infection that could provide lasting protection against reinfection. in particular, we were interested in igg mbcs, which respond to cognate antigens with rapid, vigorous, and high-affinity ab production. importantly, mbcs are long-lived cells that continue to provide strong protection when circulating ab levels wane. our approach was to analyze circulating igg as well as igg mbcs from the sars-cov- -naive and sars-cov- -convalescent subject groups. our key findings are as follows: (i) the presence of igg reactive to the s subunit of sars-cov- in most unexposed subjects, likely reflecting cross-reactivity to hcovs; (ii) markedly increased levels of igg against the sars-cov- s and n proteins, including reactivity to the rbd and s subunit of s, in convalescent subjects; (iii) increased igg binding to the s protein of the oc hcov, but not the e hcov, in convalescent subjects, reflecting greater cross-reactivity between s subunits of betacoronaviruses; (iv) strong formation of igg mbcs reactive with the rbd and s subunit of the sars-cov- s protein in convalescent subjects; and (v) formation of igg mbcs reactive with the s protein of oc , but not with that of e, in convalescent subjects, consistent with s subunit cross-reactivity between betacoronaviruses. approximately one-third of our cohort of non-sars-cov- -exposed subjects had low levels of igg against the sars-cov- s and n proteins. 
the low anti-n igg level likely reflects infection with hcovs, which have low-level ( % to %) homology with the sars-cov- n protein ( ). however, a protective function for anti-n abs has not been established ( ). notably, % of unexposed subjects had igg against the s subunit, reflecting homology with hcovs, but none had igg against the highly novel sars-cov- rbd ( , , ). abs that target the s subunit have been shown to have virus-neutralizing activity, raising the possibility that the presence of preexisting anti-s igg confers some protection against sars-cov- ( ). the processes that generate anti-s igg are also likely to generate s -reactive igg mbcs, and these might provide more significant protection than low levels of anti-s abs. however, s -reactive mbcs (or s-reactive and n-reactive mbcs) were not detected in non-sars-cov- -exposed subjects. taking those findings together with the identification of s-reactive mbcs in unexposed healthy donors ( ), it is likely that the levels of s -reactive mbcs were below the limit of detection in our assays. on the basis of an estimate of to igg mascs generated per igg mbc after in vitro stimulation ( ), our analysis suggests that the frequency of s -reactive mbcs, if present in unexposed healthy donors, would be < / pbmcs. most mbcs are resident in lymphoid tissues and, except for mbcs against frequently seen immunogenic antigens (for example, the influenza virus h or ttd in this study), are at very low frequencies in the circulation in the steady state ( , ). anti-rbd, anti-s, and anti-n igg levels were markedly higher in the convalescent subjects than in non-sars-cov- -exposed subjects, indicating strong induction by sars-cov- infection. perhaps notably, the majority of convalescent subjects had higher igg titers against the s than against the rbd. this is particularly surprising because of the accessibility of the rbd to b cells and the expected immunodominance over the s subunit ( , ). 
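The detection-limit argument made earlier in this discussion (an upper bound on the frequency of s -reactive mbcs when no mascs are detected) can be sketched as a short calculation. This is only an illustration under stated assumptions: the function name and every numeric value below are hypothetical placeholders, not the study's actual detection limit or burst-size estimate. The reasoning is that if fewer than L spots are seen per N stimulated pbmcs, and each mbc yields at least a mascs after stimulation, then the precursor frequency must be below L / (a × N).

```python
def mbc_frequency_upper_bound(lod_mascs, pbmcs_per_lod, min_mascs_per_mbc):
    """Upper bound on precursor MBCs per PBMC when mASCs stay below the detection limit.

    If fewer than `lod_mascs` spots are seen per `pbmcs_per_lod` stimulated PBMCs,
    and every MBC yields at least `min_mascs_per_mbc` mASCs, then the MBC
    frequency must be below lod_mascs / (min_mascs_per_mbc * pbmcs_per_lod).
    """
    if lod_mascs <= 0 or pbmcs_per_lod <= 0 or min_mascs_per_mbc <= 0:
        raise ValueError("all inputs must be positive")
    return lod_mascs / (min_mascs_per_mbc * pbmcs_per_lod)


# Hypothetical placeholder values; they are NOT the numbers used in the study.
bound = mbc_frequency_upper_bound(
    lod_mascs=10,             # assumed detection limit (spots)
    pbmcs_per_lod=1_000_000,  # assumed number of PBMCs the limit refers to
    min_mascs_per_mbc=5,      # assumed lower end of the mASC-per-MBC range
)
print(f"MBC frequency < {bound * 1_000_000:.1f} per million PBMCs")
```

Using the lower end of the mascs-per-mbc range gives the more conservative (larger) bound, which is the appropriate choice when arguing that undetected cells could still be present.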
our demonstration of strong anti-s igg production is consistent with the activation of a preexisting population of igg mbcs against the conserved s subunit in the absence of mbcs reactive to the novel rbd. however, we cannot exclude the possibility of inherent differences in the stability or antigenicity of rbd and s reagents as an explanation. igg levels against the s protein of hcov oc (but not e) were significantly higher in convalescent subjects than in non-sars-cov- -exposed subjects and correlated strongly with anti-s igg levels. these findings support the idea of stronger b cell cross-reactivity between the s subunits of sars-cov- and human betacoronaviruses than alphacoronaviruses ( ) . importantly, we demonstrated that sars-cov- infection generates rbd-reactive and s -reactive igg mbcs. recently, long et al. ( ) found that levels of sars-cov- reactive abs, including neutralizing abs, start to decrease within to weeks of infection, especially when the infection is asymptomatic. since mbc populations are maintained for many years, perhaps decades, our findings indicate that mbcs generated by sars-cov- infection would be available to rapidly generate protective abs if waning ab levels were to allow reinfection to occur ( ) . notably, three convalescent subjects in our analysis had undetectable rbd-reactive igg levels but nevertheless had rbd-reactive igg mbcs. this might reflect mbc production by germinal centers that remained active after recovery from infection ( ) . the proportion of subjects with mbcs reactive to the hcovs oc and e was greater for the convalescent group than for the unexposed group, likely reflecting the increase in levels of s -reactive mbcs in the convalescent group and cross-reactivity with hcovs. s -reactive mbc expansion mediated by sars-cov- infection could enhance protection against a broad range of coronaviruses ( ) . 
the level of n-reactive mbc formation in convalescent subjects was lower than expected given the large number of subjects with high titers of n-reactive igg, but additional sampling times are required to confirm this observation. the antigen-specific b cell response to infection and vaccination in humans is characterized by entry into the circulation of recently proliferated class-switched b cells, termed activated b cells (abcs), which are phenotypically and transcriptionally distinct from ascs ( ) . circulating abc frequencies peak at to weeks after antigen exposure and have substantially decreased by months. frequencies of antigen-specific resting mbcs (negative for markers of recent proliferation) increase together with those of abcs and decrease much more slowly ( , ) . abcs, like resting mbcs, were activated by the in vitro stimulation conditions used in our study to divide and differentiate into ascs ( ) . we therefore cannot exclude the possibility that abc activation contributes, to some degree, to measurement of what we designate mbcs. on the basis of the kinetics of abc and resting mbc formation and maintenance of immunoglobulin gene clonal lineages in the two populations, ellebedy et al. ( ) suggested that at least a subset of abcs form resting mbcs. however, the differentiation pathways of abcs are not well established ( ) and the proportion that becomes part of long-maintained mbc populations remains uncertain. in conclusion, our analysis investigated ab and mbc immunity to sars-cov- in unexposed subjects and individuals soon after recovery from sars-cov- infection. the findings emphasized the novelty of the sars-cov- s protein rbd in unexposed subjects. however, igg reactive to the s was widespread in unexposed subjects and likely resulted from exposure to hcovs. 
although our approach was unable to directly identify s -reactive mbcs in the unexposed subjects, we suggest that these cells were present and strongly contributed s -reactive igg early in the response to sars-cov- infection. the igg response in convalescent sars-cov- subjects was also strong against the rbd and, less consistently, against the n protein. importantly, the convalescent sars-cov- subjects had generated rbd-reactive and s -reactive igg mbcs. the rbd-reactive mbcs are likely to provide strong long-term protection if rbd-reactive neutralizing ab levels wane and reinfection occurs. additional studies are required to establish the importance of s -reactive igg in providing broad anticoronavirus activity and the influence of expanded s -reactive mbc populations on a de novo b cell response to the rbd. study participants and clinical samples. all study participants were recruited at the university of rochester medical center, rochester, ny, and provided written informed consent prior to inclusion in the studies. the studies were approved by the university of rochester human research subjects review board (protocols - , - , and - ) and conducted in accordance with the principles of good clinical practice. a prepandemic cohort of healthy donors (median age, years; interquartile range [iqr], to years) were enrolled from to (non-sars-cov- -exposed subjects). a cohort of health care workers (median age, years; iqr, to years) at strong memorial hospital, rochester, ny, were enrolled in may . the health care workers had not been diagnosed with covid- prior to enrollment. a cohort of nonhospitalized covid- convalescent subjects ( males and females) (median age, years; iqr, to years) were enrolled in may and consisted of pcr-confirmed patients and non-pcr-confirmed subjects who were contacts of confirmed cases or displayed covid- -like symptoms. the convalescent subjects were sampled to weeks after symptom onset. 
symptoms reported (percentages of subjects) were fever ( %), cough ( %), sore throat ( %), stuffy/runny nose ( %), difficulty breathing ( %), fatigue ( %), headache ( %), body aches ( %), nausea/vomiting ( %), and diarrhea/loose stool ( %). recombinant proteins. rbd and stabilized ectodomain s protein from sars-cov- (isolate wuhan-hu- ) were expressed in-house in hek cells using pcaggs plasmid constructs kindly provided by florian krammer (icahn school of medicine at mount sinai) ( ). baculovirus-expressed s subdomain and hek cell-expressed n protein were obtained from sino biological (chesterbrook, pa) and raybiotech (peachtree corners, ga), respectively. baculovirus-expressed s proteins from seasonal hcovs oc and e were obtained from sino biological. in-house hek cell-expressed hemagglutinin from egg-derived h n a/california/ / and ttd (milliporesigma, burlington, ma) were used as noncoronavirus control proteins. mbc analysis. measurement of levels of antigen-specific mbcs was essentially performed as described previously ( ). briefly, cryopreserved pbmcs were thawed and rested overnight at °c in complete medium. rested pbmcs were stimulated for days at × pbmcs/well in -well plates to induce mbc expansion and differentiation into ascs. the stimulation cocktail consisted of complete medium supplemented with g/ml r (sigma, st. louis, mo), ng/ml interleukin- (il- ) (gibco, gaithersburg, md), and ng/ml il- (stemcell technologies, vancouver, canada). after stimulation, cells were harvested and pelleted by centrifugation. the undiluted supernatant containing abs secreted by ascs generated from stimulated mbc precursors (mpabs) was collected and stored for analysis by elisa. supernatants from unstimulated cultures of rested pbmcs were collected to control for abs produced by preexisting ascs. antigen-specific ascs in the cell pellet (mascs) were enumerated by elispot assay. 
for each antigen, , stimulated pbmcs were analyzed by elispot assay and the limit of masc detection was set at spots (mascs)/ pbmcs. on the basis of elispot assay results, antigen-specific mbcs in peripheral blood were quantified as antigen-specific igg mascs as a proportion of stimulated pbmcs. antigen-specific igg concentrations in mpab samples (after subtraction of ab concentrations in supernatants from unstimulated pbmc control cultures) were also used as a measure of the relative sizes of reactive mbc populations. enzyme-linked immunosorbent assay (elisa). concentrations of ag-specific serum abs and mpabs were measured by elisa as previously described ( ). briefly, nunc maxisorp -well plates (thermo fisher, waltham, ma) were coated overnight with optimized concentrations of antigens. serially diluted samples were added to blocked plates and incubated for h at room temperature. alkaline phosphatase-conjugated anti-human igg (clone mt ; mabtech, stockholm, sweden) and p-nitrophenyl phosphate substrate (thermo fisher) were subsequently added to detect bound antigen-specific abs. absorbance was read at nm after color development. a weight-based concentration method was used to quantify antigen-specific ab levels in test samples as described previously ( , ). sera from healthy donors and convalescent subjects with high titers for test antigens were used to establish human serum standards. the cutoff for assay positivity was set at approximately × the mean optical density (od) value for negative wells. statistical analyses. the medians (with q and q ) were summarized by subject group and compared by the wilcoxon rank sum test. spearman correlation analysis was used together with corresponding robust regression models to assess monotonic associations among ab responses. multiple-test adjustment was not applied for this explorative study; thus, a p value of < . was considered significant for all analyses. 
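The spearman correlation named in the statistical analyses can be illustrated with a minimal, dependency-free sketch. It is not the authors' sas workflow: the function names are hypothetical, the titer values are made up for illustration, and no p value or robust regression is computed. The sketch ranks each variable (tied values share the mean of their positions) and then takes the pearson correlation of the two rank vectors, which is the standard definition of spearman's coefficient.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman_r(x, y):
    """Spearman coefficient: Pearson correlation of the two rank vectors."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have equal length of at least 2")
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    vx = sum((a - mx) ** 2 for a in rx)
    vy = sum((b - my) ** 2 for b in ry)
    if vx == 0 or vy == 0:
        raise ValueError("correlation is undefined when one variable is constant")
    return cov / sqrt(vx * vy)


# Made-up titers for illustration only (not data from the study).
titers_a = [120.0, 340.0, 95.0, 610.0, 280.0]
titers_b = [15.0, 52.0, 11.0, 48.0, 30.0]
print(round(spearman_r(titers_a, titers_b), 3))
```

Because only ranks enter the calculation, the coefficient captures monotonic rather than strictly linear association, which suits titer data spanning several orders of magnitude.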
statistical analyses were performed using sas . software (sas institute inc, cary, nc). we thank the staff of the university of rochester infectious disease research clinic for subject enrollment and sample collection and bei resources for providing some of the reagents used in this study. this project was funded in part with federal funds from the national institute of allergy and infectious diseases, national institutes of health, department of health and human services, under ceirs contract no. hhsn c. we declare no conflicts of interest.
a pneumonia outbreak associated with a new coronavirus of probable bat origin
sars-cov- vaccines: status report
clinical features of patients infected with novel coronavirus in wuhan
clinical and immunological assessment of asymptomatic sars-cov- infections
genome composition and divergence of the novel coronavirus ( -ncov) originating in china
phylogenetic analysis and structural modeling of sars-cov- spike protein reveals an evolutionary distinct and proteolytically sensitive activation loop
a serological assay to detect sars-cov- seroconversion in humans
sars-cov- -reactive t cells in healthy donors and patients with covid-
targets of t cell responses to sars-cov- coronavirus in humans with covid- disease and unexposed individuals
pre-existing and de novo humoral immunity to sars-cov- in humans
characterization of a novel coronavirus associated with severe acute respiratory syndrome
antibody response of patients with severe acute respiratory syndrome (sars) targets the viral nucleocapsid
virological assessment of hospitalized patients with covid-
b cell responses: cell interaction dynamics and decisions
antibody responses to sars-cov- in patients with covid-
kinetics of sars-cov- specific igm and igg responses in covid- patients
covid- serology at population scale: sars-cov- -specific antibody responses in saliva
deep sequencing of b cell receptor repertoires from covid- patients reveals strong convergent immune signatures
broad neutralization of sars-related viruses by human monoclonal antibodies
convergent antibody responses to sars-cov- in convalescent individuals
sensitivity in detection of antibodies to nucleocapsid and spike proteins of severe acute respiratory syndrome coronavirus in patients with coronavirus disease
broad hemagglutinin-specific memory b cell expansion by seasonal influenza virus infection reflects early-life imprinting and adaptation to the infecting virus
contributions of the structural proteins of severe acute respiratory syndrome coronavirus to protective immunity
an outbreak of human coronavirus oc infection and serological cross-reactivity with sars coronavirus
human monoclonal antibodies against highly conserved hr and hr domains of the sars-cov spike protein are more broadly neutralizing
high efficiency human memory b cell assay and its application to studying plasmodium falciparum-specific memory b cells in natural infections
the transcription factor t-bet resolves memory b cell subsets with distinct tissue distributions and antibody specificities in mice and humans
broad dispersion and lung localization of virus-specific memory b cells induced by influenza pneumonia
a sequence homology and bioinformatic approach can predict candidate targets for immune responses to sars-cov-
the receptor binding domain of the viral spike protein is an immunodominant and highly specific target of antibodies in sars-cov- patients
cutting edge: long-term b cell memory in humans after smallpox vaccination
role of memory b cells in hemagglutinin-specific antibody production following human influenza a virus infection
defining antigen-specific plasmablast and memory b cell subsets in human blood after viral infection or vaccination
activation dynamics and immunoglobulin evolution of pre-existing and newly generated human memory b cell responses to influenza hemagglutinin
assignment of weight-based antibody units to a human antipneumococcal standard reference serum, lot -s
key: cord- -h a xym authors: armijos‐jaramillo, vinicio; yeager, justin; muslin, claire; perez‐castillo, yunierkis title: sars‐cov‐ , an evolutionary perspective of interaction with human ace reveals undiscovered amino acids necessary for complex stability date: - - journal: evol appl doi: . /eva. sha: doc_id: cord_uid: h a xym the emergence of sars‐cov‐ has resulted in nearly , , infections and , deaths globally so far. this novel virus acquired the ability to infect human cells using the sars‐cov cell receptor hace . because of this, it is essential to improve our understanding of the evolutionary dynamics surrounding the sars‐cov‐ hace interaction. one way theory predicts selection pressures should shape viral evolution is to enhance binding with host cells. we first assessed evolutionary dynamics in select betacoronavirus spike protein genes to predict whether these genomic regions are under directional or purifying selection between divergent viral lineages, at various scales of relatedness. with this analysis, we determine a region inside the receptor‐binding domain with putative sites under positive selection interspersed among highly conserved sites, which are implicated in structural stability of the viral spike protein and its union with human receptor ace . next, to gain further insights into factors associated with recognition of the human host receptor, we performed modeling studies of five different betacoronaviruses and their potential binding to hace . modeling results indicate that interfering with the salt bridges at hot spot could be an effective strategy for inhibiting binding, and hence for the prevention of sars‐cov‐ infections. we also propose that a glycine residue at the receptor‐binding domain of the spike glycoprotein can have a critical role in permitting bat sars‐related coronaviruses to infect human cells. 
the recent emergence of the novel sars coronavirus (sars-cov- ) marked the third introduction of a highly pathogenic coronavirus into the human population in the twenty-first century, following the severe acute respiratory syndrome coronavirus (sars-cov) (drosten et al., ; who, ). mers-cov was the second emergence; it was first detected in saudi arabia in and resulted in nearly , human infections and deaths in countries (fehr, channappanavar, & perlman, ; zaki, boheemen, bestebroer, osterhaus, & fouchier, ). in december , sars-cov- , a previously unknown coronavirus capable of infecting humans, was discovered in the chinese city of wuhan, in hubei province (huang et al., ; zhu et al., ). sars-cov- is associated with an ongoing pandemic of atypical pneumonia, now termed coronavirus disease (covid- ), that has affected over , , people with , fatalities as of april , (who, ). both sars-cov and mers-cov are thought to have originated in colonies of bats and to have eventually been transmitted to humans, putatively facilitated by intermediate hosts such as palm civets and dromedary camels, respectively (cui, li, & shi, ). the genome of sars-cov- shares about % nucleotide identity with that of sars-cov and is % identical to the bat coronavirus batcov ratg genome, reinforcing the probable bat origin of the virus. however, better assessing the evolutionary dynamics of sars-cov- remains an active research priority worldwide. sars-cov- belongs to the genus betacoronavirus within the subfamily coronavirinae of the family coronaviridae. members of this family are enveloped viruses containing a single positive-strand rna genome of - kb in length, the largest known rna virus genome. the spherical coronavirus virion consists of four structural proteins: the spike glycoprotein (s-protein), the envelope protein, the membrane protein, and the nucleocapsid. the transmembrane trimeric s-protein plays a critical role in virus entry into host cells (gallagher & buchmeier, ; tortorici & veesler, ).
it comprises two functional subunits: s subunit, where the receptor-binding domain (rbd) is found, is responsible for binding host cell surface receptors and s subunit mediates subsequent fusion between the viral and cellular membranes (kirchdoerfer et al., ; yuan et al., ) . both sars-cov and sars-cov- interact directly with angiotensin-converting enzyme (ace ) to enter host target cells (hoffmann et al., ; li et al., ; walls et al., ; yan et al., ) . in the case of sars-cov, ace binding was found to be a critical determinant for the range of hosts the virus can infect, and key amino acid residues in the rbd were identified to be essential for ace -mediated sars-cov infection and adaptation to humans (li et al., (li et al., , . understanding the dynamics that permits a virus to shift hosts is of considerable interest and can further be an essential preliminary step toward facilitating the development of vaccines and the discovery of specific drug therapies. we employ a multidisciplinary approach to look for evidence of diversifying selection on the s-protein gene and model the interactions between human ace (hace ) and the rbd of selected coronavirus strains, which ultimately afforded us novel insights detailing virus and host cell interactions. given the rapid pace of discovery, we aim to add clarity to evolutionary dynamics of diseases strains by more precisely understand the dynamics at the s-protein and its interaction with hace . the most similar genomes to sars-cov- mn were retrieved using blastp (altschul et al., ) versus the nr database of genbank (table ) . genomes were then aligned using mauve (darling, mau, blattner, & perna, ) , and the s-protein gene was trimmed. the extracted genomic sections were aligned using the translation align option of geneious (kearse et al., ) with a mafft plugin (katoh & standley, ) . 
the phylogenetic reconstruction of the s-protein genes was performed with phyml (guindon et al., ), using a gtr + i + g model with nonparametric bootstrap replicates. both the alignment and the tree were used as inputs for paml codeml (yang, ). the presence of sites under positive selection was tested by comparing the m model (which allows for a proportion of sites under positive, neutral, and negative selection in the alignment) with the m model (which only allows a proportion of sites under neutral and negative selection) and the m model (ω follows a beta distribution plus a proportion of sites with ω > ) with the m model (ω follows a beta distribution), using the ete toolkit (huerta-cepas et al., ). datamonkey (weaver et al., ) was used to perform the hyphy analyses. the crystal structure of the sars-cov s-protein rbd (genbank id nc_ ) in complex with hace was retrieved from the protein data bank (code ajf) (berman et al., ). homology models were constructed using this structure as a template for the rbds of sars-cov- (sars , genbank id mn ), the bat sars-like coronavirus isolate rm (rm , genbank id dq ), and the bat sars-like coronavirus isolate rs (rs , genbank id ky ). one additional homology model was constructed for the g d mutant of the sars-cov- rbd (sars -mut). homology models were built with modeller v. (webb & sali, ) using its ucsf chimera interface (pettersen et al., ). five models were constructed for each target sequence, and the one with the lowest discrete optimized protein energy (dope) score was selected as the final model. all non-amino-acid residues were removed from the sars-cov rbd-hace complex to obtain a clean complex. the homology models of the sars , rm , rs , and sars -mut rbds were superimposed onto the sars-cov rbd to obtain their initial complexes with hace . these complexes were then subjected to molecular dynamics (md) simulations and estimation of their free energies of binding using amber (case et al., ).
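nested codeml models such as the site-model pairs described above are usually compared with a likelihood ratio test. the sketch below is a minimal illustration and not the authors' pipeline: it assumes the log-likelihoods of the null and alternative models have already been read from the codeml output, and the function name `lrt_pvalue` and the example values are hypothetical. the statistic, twice the log-likelihood difference, is compared with a chi-square distribution, commonly with two degrees of freedom for site-model pairs and one for a single-branch test.

```python
import math


def lrt_pvalue(lnl_null: float, lnl_alt: float, df: int = 2):
    """Likelihood ratio test for two nested models (hypothetical helper).

    Returns (statistic, p_value), where statistic = 2 * (lnL_alt - lnL_null).
    Only df = 1 and df = 2 are supported, because the chi-square survival
    function has a simple closed form in those two cases.
    """
    # Numerical noise can make the statistic slightly negative; clamp at 0.
    stat = max(2.0 * (lnl_alt - lnl_null), 0.0)
    if df == 1:
        p_value = math.erfc(math.sqrt(stat / 2.0))
    elif df == 2:
        p_value = math.exp(-stat / 2.0)
    else:
        raise ValueError("this sketch only supports df = 1 or df = 2")
    return stat, p_value


if __name__ == "__main__":
    # Illustrative log-likelihoods only; these are not values from the study.
    stat, p = lrt_pvalue(lnl_null=-1000.0, lnl_alt=-995.0, df=2)
    print(f"2*dlnL = {stat:.2f}, p = {p:.4g}")
```

a small p-value favours the model that allows a class of sites with ω above the neutral expectation. for other degrees of freedom a statistics library would be needed, since the closed forms used here do not generalize.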
for the free energy calculations, ace was considered as the receptor and the rbds as ligands. the protocol described below was employed for all complexes and, unless otherwise noted, default software parameters were used. system preparation was performed with the tleap program of the amber suite. each complex was enclosed in a truncated octahedron box extending Å from any atom. next, the boxes were solvated with tip p water molecules and na+ ions were added to neutralize the excess charge. systems were minimized in two steps, the first of which consisted of steps of the steepest descent algorithm followed by cycles of conjugate gradient with protein atoms restrained using a force constant of kcal/mol.Å . the pme method with a cutoff of Å was used to treat long-range electrostatic interactions. during the second minimization step, the pme cutoff was set to Å and it proceeded for , steps of the steepest descent algorithm followed by , cycles of conjugate gradient with no restraints. the same pme cutoff of Å was used in all simulation steps from here on. both minimization stages were performed at constant volume. the minimized systems were heated from to k at constant volume, restraining all protein atoms with a force constant of kcal/mol.Å . the shake algorithm was used to constrain all bonds involving hydrogens and their interactions were omitted from this step on. heating took place for , steps, with a time step of fs, and a langevin thermostat with a collision frequency of . ps − was employed. all subsequent md steps utilized the same thermostat settings. afterward, the systems were equilibrated for ps at a constant temperature of k and a constant pressure of bar. pressure was controlled with isotropic position scaling with a relaxation time of ps. the equilibrated systems were used as input for ns length production md simulations.
table . list of coronavirus isolates used for positive selection analysis (closer dataset).
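the neutralization step in the system preparation above can be illustrated with a rough estimate of how many counter-ions a solute needs. this is a simplified sketch under stated assumptions, not what tleap does internally: it counts lys and arg as +1 and asp and glu as -1 from a one-letter sequence, ignores histidine protonation, chain termini, and bound ions, and the function names and example sequence are hypothetical.

```python
def net_charge(sequence: str) -> int:
    """Approximate net charge at neutral pH from a one-letter sequence.

    Assumption: K and R carry +1, D and E carry -1; everything else
    (including histidine and the chain termini) is treated as neutral.
    """
    seq = sequence.upper()
    positive = sum(seq.count(res) for res in "KR")
    negative = sum(seq.count(res) for res in "DE")
    return positive - negative


def counter_ions(total_charge: int):
    """Return (ion_name, count) needed to bring the system to zero charge."""
    if total_charge < 0:
        return ("Na+", -total_charge)
    if total_charge > 0:
        return ("Cl-", total_charge)
    return (None, 0)


if __name__ == "__main__":
    # Toy sequence for illustration only; not a sequence from the study.
    charge = net_charge("MDDEKLLA")
    print(charge, counter_ions(charge))
```

in practice the preparation tool derives the total charge from the assigned force-field charges of every atom, so an estimate like this is only useful as a sanity check on the number of ions reported in the preparation log.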
the free energies of binding were computed under the mm-pbsa approach implemented in ambertools (case et al., ). a total of md snapshots were evenly selected, one every ps, from the last ns of the production run for the mm-pbsa calculations. the ionic strength was set to mm and the solute dielectric factor was set to for all systems. in order to detect branches and sites under positive/negative selection, two datasets were explored. the first ("closer" dataset) harbors the genomes most similar to the wuhan-hu- coronavirus (sars-cov- ) (mn ). for this dataset, several genomes were excluded from the analysis because they showed minimal variation relative to other sequences. we used a preliminary phylogeny to select a representative isolate of each clade (table ). in both datasets, we observed evidence of purifying selection in the majority of nodes of the tree. specifically, in the "closer" dataset, we identified nodes with evidence of negative selection, and under positive selection, when the free-ratios model of codeml was applied. to confirm the four nodes under positive selection, we used likelihood ratio tests (lrt) to contrast hypotheses using the branch-free, branch-neutral, and m models of codeml. using these approximations, any node predicted by the free-ratios model with ω > was significantly different from the purifying (ω < ) or neutral (ω = ) models. an equivalent analysis was performed using absrel of hyphy, which detected episodic diversifying selection in at least of nodes of the phylogenetic tree reconstructed with the "closer" dataset (figure ). interestingly, one of the divisions detected with diversifying selection was the branch that contains sars-cov- , pangolin coronavirus isolate mp and bat coronavirus ratg (called the sars-cov- group), but not the specific branch that contains sars-cov- . and (using sars-cov- s-protein as a reference) have drastic amino acid changes for alpha-helical tendencies.
in addition, the section between residues and shows radical changes in amino acids implicated in the equilibrium constant (ionization of cooh). in the structural analysis we performed, the section between and forms a loop that is not present in certain s-proteins of coronavirus isolated in bats. this loop extends the interaction area between rbd of s-protein and human ace ; in fact, the lack of this loop decreases the negative energy of interaction (increasing the binding) among these two molecules (see table ). these results, obtained from independent analyses, strongly highlight the importance of - region. additionally, important hace -binding residues in the rbd from sars-cov- obtained from the crystallography and structure determination performed by shang et al. ( ) are also present in the section we highlight here. we propose that this region is the most probable to contain the sites under positive selection due to predictions by our codeml and fubar models. in that sense, we refer to this section as region under positive selection (rps). it is important to additionally clarify that even inside the rps we found at least aa highly conserved between coronaviruses, several of them are predicted as sites under purifying selection. this shows that it is necessary to maintain conserved sites which are located around polymorphic sites, probably to maintain the protein structure and at the same time to have the ability to colonize more than one host. with a list of broader observations related to the role of selection across viral genomes, we aimed to specifically understand how these regions could affect virus/host interactions. to understand more in more detail the importance of rps in the evolution of sars-cov- , we quantified the relative importance of this region in the interaction between the rbds and hace . in that sense, md simulations were run for five complexes (listed in methods). 
simulations were initially performed for four rbds corresponding to the sars , sars, rs and rm coronaviruses. as discussed below in this section, the g d mutation is predicted to have a large negative influence on the stability of the rbd-hace complexes. to further clarify this influence, we added a g d sars mutant rbd-hace complex as a fifth system in our studies. in all cases, the systems were stable, with root mean square deviations (rmsd) of their backbones between . and . Å relative to the reference structure. the contribution of each residue in the studied coronaviruses that interacts with the hace receptor is shown in table . rows are presented in such a way that each of them contains the residues occupying the same position in the viral rbd structures as in the sars rbd structure. from here on, residue numbering will take that of sars as reference. to better interpret the influence of the key interactions between the rbds of the studied coronaviruses and the hace receptor, their complexes were visually analyzed. the predicted rbd-hace complexes for sars , sars, and sars -mut are depicted in figure . for each complex, the structure used to create figure is the representative one of the largest cluster formed by the md snapshots previously used in the mm-pbsa calculations. many studies have focused on coronavirus mutations that favor adaptation to the human host. for example, it has been shown that specific substitutions at positions , , , , and ( , , , , and in sars) of the rbd of sars favor the interaction between the rbd of sars and hace (cui et al., ).
figure . predicted interaction of sars (top), sars -mut (middle) and sars (bottom) with the human ace receptor. hace is depicted in gray and the rbd of the coronaviruses in cyan. oxygen atoms are colored red and nitrogen atoms blue. the numbering of the residues corresponds to that of the sequence of each spike glycoprotein.
likewise, homology modeling studies found favorable interactions between the residues occupying these positions in the sars rbd and the human receptor (wan, shang, graham, baric, & li, ). the cornerstone of these favorable interactions is the complementarity of the rbds with hot spots and . these are salt bridges between k and e and between d and k of ace , which are buried in a hydrophobic environment (see figure ). in the cases of sars table ) respectively, do not interact with any hot spot residue. instead, they interact with d of hace in the sars complex and with e of the human receptor in the sars complex. this could indicate that interactions additional to those previously identified with the hace hot spots could be critical for the stabilization of the rbd-human receptor complexes. finally, we analyzed the possible reasons for the predicted negative impact that the g d mutation has on the predicted free energies of binding of the rbd to hace . as depicted in figure , g directly interacts with k in hot spot and its mutation interferes with the d -k salt bridge. specifically, d of the rbd points toward d of hace , which yields a strong electrostatic repulsion between these amino acids. consequently, this portion of the rbd is pushed to a position further from hace than that observed in the wild type, resulting in a reduction of its network of contacts with k . as a result, the binding of the rbd to hace is considerably inhibited and unlikely to occur. we propose that blocking its interaction with the receptor d could be a promising strategy for future drug discovery efforts. the authors would like to thank daniela santander for her valuable comments on the manuscript. this work was supported by the none declared. the phylogenetic tree presented in this manuscript was uploaded to dryad (https://doi.org/ . /dryad.w r mp). the full networks of interactions between the coronaviruses and the hace receptor are provided as supporting information.
vinicio armijos-jaramillo https://orcid. org/ - - - justin yeager https://orcid.org/ - - - claire muslin https://orcid.org/ - - - yunierkis perez-castillo https://orcid. org/ - - - gapped blast and psi-blast: a new generation of protein database search programs the -new coronavirus epidemic: evidence for virus evolution the protein data bank origin and evolution of pathogenic coronaviruses mauve: multiple alignment of conserved genomic sequence with rearrangements identification of a novel coronavirus in patients with severe acute respiratory syndrome middle east respiratory syndrome: emergence of a pathogenic human coronavirus coronavirus spike proteins in viral entry and pathogenesis new algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of phyml . sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor discovery of a rich gene pool of bat sars-related coronaviruses provides new insights into the origin of sars coronavirus clinical features of patients infected with novel coronavirus in wuhan ete: a python environment for tree exploration more effective purifying selection on rna viruses than in dna viruses mafft multiple sequence alignment software version : improvements in performance and usability geneious basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data pre-fusion structure of a human coronavirus spike protein hyphy . 
-a customizable platform for evolutionary hypothesis testing using phylogenies identification of -ncov related coronaviruses in malayan pangolins in southern china angiotensin-converting enzyme is a functional receptor for the sars coronavirus animal origins of the severe acute respiratory syndrome coronavirus: insight from ace -s-protein interactions receptor and viral determinants of sars-coronavirus adaptation to human ace the effect of species representation on the detection of positive selection in primate gene data sets ucsf chimera-a visualization system for exploratory research and analysis structural basis for receptor recognition by the novel coronavirus from wuhan on the origin and continuing evolution of sars-cov- . national science review, advance online publication structural insights into coronavirus entry structure, function, and antigenicity of the sars-cov- spike glycoprotein receptor recognition by novel coronavirus from wuhan: an analysis based on decade-long structural studies of sars datamonkey . 
: a modern web application for characterizing selective and other evolutionary processes comparative protein structure modeling using modeller summary of probable sars cases with onset of illness from evidence of recombination in coronaviruses implicating pangolin origins of ncov- treesaap: selection on amino acid properties using phylogenetic trees isolation and characterization of -ncov-like coronavirus from malayan pangolins structural basis for the recognition of the sars-cov- by full-length human ace paml : phylogenetic analysis by maximum likelihood cryo-em structures of mers-cov and sars-cov spike glycoproteins reveal the dynamic receptor binding domains isolation of a novel coronavirus from a man with pneumonia in saudi arabia a pneumonia outbreak associated with a new coronavirus of probable bat origin a novel coronavirus from patients with pneumonia in china key: cord- - xmxymu authors: br, bharath; damle, hrishikesh; ganju, shiban; damle, latha title: in silico screening of known small molecules to bind ace specific rbd on spike glycoprotein of sars-cov- for repurposing against covid- date: - - journal: f res doi: . /f research. . sha: doc_id: cord_uid: xmxymu background: human coronavirus (sars-cov- ) is causing a pandemic with significant morbidity and mortality. as no effective novel drugs are available currently, drug repurposing is an alternative intervention strategy. here we present an in silico drug repurposing study that implements successful concepts of computer-aided drug design (cadd) technology for repurposing known drugs to interfere with viral cellular entry via the spike glycoprotein (sars-cov- -s), which mediates host cell entry via the hace receptor. methods: a total of known and approved small molecules were screened for interaction with sars-cov- -s through docking studies and lead molecules were shortlisted. 
additionally, streptomycin, ciprofloxacin, and glycyrrhizic acid (ga) were selected based on their reported anti-viral activity, safety, availability and affordability. the molecules were subjected to molecular dynamics (md) simulation. results: the md simulation results indicate that ga of plant origin may be repurposed for sars-cov- intervention, pending further studies. conclusions: repurposing is a beneficial strategy for treating covid- with existing drugs. it is aimed at using docking studies to screen molecules for clinical application and investigating their efficacy in inhibiting sars-cov- -s. sars-cov- -s is a key pathogenic protein that mediates pathogen-host interaction. hence, the molecules screened for inhibitory properties against sars-cov- -s can be clinically used to treat covid- since the safety profile is already known. the complete genome of severe acute respiratory syndrome coronavirus (sars-cov- ) is % identical to sars-cov, both viruses share a common clade encompassing the genus betacoronavirus as the root node , . currently no novel antivirals exist that are effective against either of the viruses - . drug repurposing is a commercially viable strategy, as it exploits existing drugs, thus significantly reducing the cost and time involved in developing effective therapeutics - . experimental approaches, however, at pre-clinical and clinical stages for drug repurposing involve high cost and time . computational approaches can offer quick, considerable, and novel testable hypotheses for systematic drug repositioning . current drugs in different phases of clinical trials are being investigated for inhibitory activity against viral targets that play a significant role in the coronavirus infection lifecycle. the drug targets might be involved in entry into the host (e.g. umifenovir and chloroquine), replication (e.g. lopinavir/ ritonavir), or rna synthesis (e.g. remdesivir/favipiravir). 
among these, targeting sars-cov- cellular entry via the spike glycoprotein (sars-cov- -s) has emerged as the leading option for repurposing . as sars-cov- -s is a surface protein involved in adhesion/fusion and entry into host cells, it has been identified as a potential drug target for both biologics and small molecules . the entry of covid- pathogen is mediated by the homotrimeric transmembrane protein sars-cov- -s. it is comprised of two functional subunits, s and s , which are non-covalently bound in the pre-fusion conformation. the s subunit interacts with the human ace receptor through the receptor binding domain (rbd), while the s subunit is one of the components of viral envelope [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] . apart from interacting with the ace receptor, the rbd site also contributes to the stabilisation of the prefusion state of the s subunit equipped with fusion machinery [ ] [ ] [ ] [ ] [ ] [ ] [ ] . in covs, the s-protein is cleaved by host proteases at the s site located above the fusion peptide , . this activates the protein via extensive irreversible conformational changes , , , , . it is well understood that the entry of cov into the susceptible host is a complex process that requires the vigorous actions of receptor binding and proteolytic processing of the s-protein to promote fusion with the pathogen . hence, the current study aims to predict and validate the structure of sars-cov- -s protein using computer-aided homology modelling tools and screen a library of small molecules for their interaction with the sars-cov- -s protein. the whole genome of sars-cov- (genbank accession number: mt . 
, length: bp) was retrieved from ncbi and used as a query to perform a sequence similarity search using ncbi-blast topological analysis of pathogen-host interactome for target validation drug target identification and validation were carried out using a network-based topological analysis method using the webbased application pathogen-host interaction search tool (phisto) by setting pathogen type to virus, family to coronaviridae, species to sars-related coronavirus and strain to sars-cov. the node properties like the degree of connectivity (k) and betweenness centrality (bc) were assessed , . the statistical significance of k and bc values were assessed by the fligner-killeen (median) test. the similarity between rbd domains of s-protein from sars-cov- (accession number qii , length: aa), sars-cov (accession number: afr , length: aa) and ratg (accession number: qhr , length: aa) was evaluated by using the multiple sequence alignment (msa) tool clustal omega from embl-ebi. conservation in ace receptor interaction was seen among all the three sequences aligned. this conservation aided in the active binding site prediction. the protein-protein interaction between sars-cov- -s and host ace receptor complex was studied using the crystal structure from protein data bank (pdb id: cs ). the amino acids involved in the interaction were identified as ligand binding sites for inhibitor molecules. homology modelling of sars-cov- -s protein homology modelling was performed with swiss-model for the protein sequence of sars-cov- -s using the crystal structure of sars-cov-s and ace complex (pdb id: acd) as a template. the modelled protein was validated for quality using a ramachandran plot and prepared for molecular docking studies using the protein preparation wizard feature of the schrodinger small molecule suite . this analysis could also have been performed using open source software such as autodock or swissdock . 
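the sequence similarity assessment described above reduces, at its simplest, to counting identical columns in a pairwise alignment. the sketch below is an illustration only and not the clustal omega scoring itself: it assumes two already-aligned sequences of equal length with `-` as the gap character, the function name `percent_identity` is hypothetical, and the denominator (columns where neither sequence has a gap) is one of several conventions in use.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over alignment columns where neither row has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for res_a, res_b in zip(aligned_a.upper(), aligned_b.upper()):
        if res_a == gap or res_b == gap:
            continue  # gapped columns are excluded from the denominator
        compared += 1
        if res_a == res_b:
            matches += 1
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Toy aligned fragments for illustration; not the study's sequences.
    print(percent_identity("ACD-FGH", "ACEGFGH"))
```

because the choice of denominator changes the reported value, identities computed this way are only comparable with values produced under the same convention.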
the modelled receptor was processed for docking studies by deleting crystallographic water molecules with fewer than three h-bonds. this could also be done manually by editing the .pdb file in a text editor. next, hydrogen atoms corresponding to neutral ph were added, taking into account the ionisation states of the amino acids. following this, coordinates for any missing side-chain atoms were added using prime v . , schrödinger - . finally, the energy of the modelled structure was minimised using the opls_ force field. this analysis could also have been performed using open source software such as autodock. the three-dimensional conformations of small molecule drugs already in use to treat various diseases and as nutritional supplements were downloaded from the drugcentral database and subjected to ligand minimisation using ligprep (ligprep, version . , schrödinger, llc, new york, ny, ). this analysis could also have been performed using open source software such as autodock. the compounds were minimised by assigning the opls_ force field, and stereoisomers were calculated after retaining specific chiralities. the absorption, distribution, metabolism and excretion (adme) predictions were performed for all ligands using the qikprop package. this analysis could also have been performed using open source software such as swiss-adme. the active site on the prepared receptor was defined around the selected residues (arg , tyr , pro , thr , gly , and tyr ) with a Å radius. this generated a grid box measuring x x Å. the docking of small molecules over sars-cov- -s was performed using glide v . , schrödinger - , in different modes sequentially, with defined and incremental precision and computational time. this analysis could also have been performed using open source software such as autodock.
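the grid definition above, a box placed around a set of selected residues, can be sketched as a bounding box with padding. this is a generic illustration and not the glide grid-generation algorithm: it assumes the cartesian coordinates of the selected residue atoms are already available as (x, y, z) tuples in Å, and the function name `grid_box`, the padding value, and the example coordinates are hypothetical.

```python
def grid_box(coords, padding: float):
    """Axis-aligned box enclosing all coordinates, enlarged by `padding`.

    coords  : iterable of (x, y, z) tuples, in angstroms
    padding : margin added on every side of the box, in angstroms
    Returns (center, size), each a 3-tuple of floats.
    """
    points = list(coords)
    if not points:
        raise ValueError("at least one coordinate is required")
    axes = list(zip(*points))  # three tuples: all x, all y, all z
    center = tuple((min(axis) + max(axis)) / 2.0 for axis in axes)
    size = tuple((max(axis) - min(axis)) + 2.0 * padding for axis in axes)
    return center, size


if __name__ == "__main__":
    # Made-up atom positions for illustration only.
    atoms = [(0.0, 0.0, 0.0), (2.0, 4.0, 6.0), (1.0, 1.0, 1.0)]
    center, size = grid_box(atoms, padding=1.0)
    print("center:", center, "size:", size)
```

open source docking tools generally take a box centre and edge lengths of this kind as input, so a helper like this is mainly useful for keeping the box definition reproducible across tools.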
the best-docked conformer with minimum glide energy and emodel energy was selected for each molecule, and the lowest-energy docked complexes of three known molecules, streptomycin, ciprofloxacin, and glycyrrhizic acid (ga), with sars-cov- -s were selected for molecular dynamics simulations. the md of the shortlisted complexes was studied using the opls_ force field in a plane tip p water model. md simulations were performed using desmond version . . this analysis could also have been performed using open source software such as gromacs. the system was built by solvating the streptomycin/sars-cov- -s, ciprofloxacin/sars-cov- -s, and ga/sars-cov- -s complexes in an orthorhombic box containing water molecules, allowing a buffer region of Å between the atoms and the box boundaries. the system was further minimised using the l-bfgs algorithm for a minimum of steepest descent steps and a maximum of iterations until a gradient threshold of kcal/mol/Å and a convergence threshold of . kcal/mol/Å were reached. for electrostatic interactions, the smooth particle mesh ewald method was employed at e- tolerance, with a Å cut-off radius for the short-range part. the built systems were gradually warmed up to k in the npt ensemble with a time step of fs. a ns md simulation in the npt ensemble was performed using a nose-hoover thermostat. the resulting root mean square deviation (rmsd) and root mean square fluctuation (rmsf) values were analysed.
multiple sequence alignment of complete genomes
the complete genomes of coronavirus sars-cov- , bat coronavirus ratg , pangolin coronavirus isolate mp , and sars-cov obtained as blast hits were aligned and a phylogenetic tree was constructed (figure ). the msa demonstrated the molecular similarities between the organisms. ratg was identified as the nearest neighbour genome of sars-cov- , which supports the hypothesis that the infection may have been transmitted from bats. meanwhile, the subsequent neighbours were pangolin mp and sars-cov.
this preliminary sequence alignment enabled the understanding of sequence similarities and evolutionary information, which is deeply fundamental to the process of drug discovery. a detailed investigation of the pathogen-host interactome can shed clear insights on the mechanism of viral infection and the pathology involved. due to a lack of interaction data on sars-cov- , the sars-cov proteome was considered and the sars-cov/human interactome was built by screening domain interactions between sars-cov/human protein-protein interactions, and then the network distribution, topological and functional analyses were performed ( figure ). the circular shapes correspond to proteins (nodes) which are labelled by uniprot_ids and details about the nodes are listed in table . among proteins of sars-cov, the majority of sars-cov/human interaction involves five non-structural proteins (ns b, ns , ns a, ns b and ns a with four, three, eight, three and one human proteins, respectively), three open reading frame (orf) polyproteins (orf b, a j l and a j l with four, one and one human proteins, respectively), two replicase proteins (r a and r ab with two and one human proteins, respectively), the membrane protein (vme with human ikkb), envelope membrane protein (vemp with human b cl ), nucleoprotein (ncap with four human proteins) and spike glycoprotein (spike with human ace ). with these observations, we determine the high specificity of membrane, envelope and spike glycoprotein interactions with the host through specific entry points. hence, these three sars-cov proteins can be a potential target to inhibit the pathogen-host interaction specifically, while other interactions are more versatile. to ensure the impact of inhibition of ikkb, b cl , and ace mediated interaction, the landscape of the sars-cov/ human interaction was further analysed for degree and betweenness centrality distributions of the host, as shown in table . 
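the degree and betweenness centrality measures used in this topological analysis can be computed directly from an adjacency list. the sketch below is a generic illustration, not the procedure behind the paper's table: it assumes an unweighted, undirected interaction network stored as a dict of neighbour sets, uses brandes' algorithm for unnormalized betweenness, and the node names in the example are hypothetical placeholders.

```python
from collections import deque


def degree(adj):
    """Number of direct neighbours of each node."""
    return {node: len(neighbours) for node, neighbours in adj.items()}


def betweenness(adj):
    """Unnormalized betweenness centrality (Brandes' algorithm).

    adj maps each node to the set of its neighbours; the graph is assumed
    to be undirected and unweighted, with every node present as a key.
    """
    scores = {node: 0.0 for node in adj}
    for source in adj:
        order = []                          # nodes in order of discovery
        preds = {node: [] for node in adj}  # predecessors on shortest paths
        paths = {node: 0 for node in adj}   # number of shortest paths
        dist = {node: -1 for node in adj}
        paths[source] = 1
        dist[source] = 0
        queue = deque([source])
        while queue:                        # breadth-first search
            v = queue.popleft()
            order.append(v)
            for w in adj[v]:
                if dist[w] < 0:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    paths[w] += paths[v]
                    preds[w].append(v)
        dependency = {node: 0.0 for node in adj}
        for w in reversed(order):           # accumulate from the farthest node
            for v in preds[w]:
                dependency[v] += paths[v] / paths[w] * (1.0 + dependency[w])
            if w != source:
                scores[w] += dependency[w]
    # Each unordered pair was counted from both endpoints, so halve the total.
    return {node: value / 2.0 for node, value in scores.items()}


if __name__ == "__main__":
    # Toy star-shaped network; names are placeholders, not data from the study.
    network = {
        "hub": {"p1", "p2", "p3"},
        "p1": {"hub"},
        "p2": {"hub"},
        "p3": {"hub"},
    }
    print(degree(network))
    print(betweenness(network))
```

in this toy network the hub has the highest degree and lies on every shortest path between the other nodes, which is the combination of hub and bottleneck properties that the analysis looks for in the host proteins.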
the degree of connectivity estimates the number of directly connecting neighbours to a particular node, while betweenness centrality estimates the frequency of nodes occurring on the shortest paths in the context of other nodes. in the protein interactomes, a node with a high degree of connectivity is identified as hub protein and a node with maximum betweenness centrality is identified as bottleneck protein. in the current topological analysis, the node with the lowest degree of distribution and betweenness centrality . was a a r . however, the molecular function of a a r (uniprot id: a a r ) is not well understood in both human physiology or pathology. hence, ace , with the degree of distribution and betweenness centrality , was the next most significant node, as shown in figure , and it was identified as a key node or key player in the sars-cov/host interaction. hence, the sars-cov-s interaction with host ace was identified as a potential drug target. as information about the sars-cov- /human interaction is not available, the sars-cov/human interaction data was used. we studied the similarity between sars-cov and sars-cov- by sequence analysis and rbd prediction. as depicted in figure , the alignment between the s-protein of sars-cov- and that of bat coronavirus ratg was closer than with the s-protein of sars-cov. the alignment at rbd site residues to was found to be more than % similar to sars-cov and ratg , particularly at major residues including tyr , thr , gly and tyr but excluding arg and pro , as shown in figure . considering the evolution, the available elucidated structure of the sars-cov/ace complex (pdb id: cs ) was used as a template for homology modelling. the residues involved in the interaction of sars-cov with ace were predicted using the prime module available in schrodinger small molecule suite and the major interactions are tabulated in table and shown in figure . 
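the two topological measures defined above, degree (number of direct neighbours) and betweenness centrality (how often a node lies on shortest paths between other nodes), can be illustrated with a minimal python sketch. the toy graph and its node names (`hub`, `leaf_a`, `leaf_b`, `leaf_c`) are hypothetical and do not come from the sars-cov/human interactome; betweenness is computed with brandes' algorithm for an undirected, unweighted graph and left unnormalised, which may differ from the scaling used by the network tool in the study.

```python
from collections import deque


def degree_centrality(graph):
    """Return the number of direct neighbours of every node."""
    return {node: len(neighbours) for node, neighbours in graph.items()}


def betweenness_centrality(graph):
    """Unnormalised betweenness for an undirected, unweighted graph.

    Uses Brandes' algorithm: one breadth-first search per source node,
    followed by back-propagation of shortest-path dependencies.
    """
    betweenness = {v: 0.0 for v in graph}
    for source in graph:
        stack = []
        predecessors = {v: [] for v in graph}
        n_paths = {v: 0 for v in graph}
        n_paths[source] = 1
        distance = {v: -1 for v in graph}
        distance[source] = 0
        queue = deque([source])
        while queue:
            v = queue.popleft()
            stack.append(v)
            for w in graph[v]:
                if distance[w] < 0:  # first visit
                    distance[w] = distance[v] + 1
                    queue.append(w)
                if distance[w] == distance[v] + 1:  # v is on a shortest path to w
                    n_paths[w] += n_paths[v]
                    predecessors[w].append(v)
        dependency = {v: 0.0 for v in graph}
        while stack:
            w = stack.pop()
            for v in predecessors[w]:
                dependency[v] += n_paths[v] / n_paths[w] * (1 + dependency[w])
            if w != source:
                betweenness[w] += dependency[w]
    # every undirected path was counted once from each end
    return {v: b / 2 for v, b in betweenness.items()}


# hypothetical star-shaped network: one central node, three peripheral nodes
toy_graph = {
    "hub": ["leaf_a", "leaf_b", "leaf_c"],
    "leaf_a": ["hub"],
    "leaf_b": ["hub"],
    "leaf_c": ["hub"],
}

degrees = degree_centrality(toy_graph)
bottlenecks = betweenness_centrality(toy_graph)
```

in this toy network the central node has both the highest degree and the highest betweenness, so it would be called a hub and a bottleneck at once; in a real interactome the two rankings need not coincide.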
a very strong interaction was seen between the smallest amino acid, gly , with lys , gly , and asp . this interaction is facilitated by two features; one hydrogen bond and . % buried solvent accessible surface area. remaining residues also showed significantly strong interactions with ace . hence, the same residues were made centric to generate the grid. the modelling of sars-cov- was performed using the crystal structure of sars-cov-s as a template, which was % identical to the query. the modelled protein shown in figure a was validated for quality and preparedness. the ramachandran plot generated using the protein preparation wizard confirmed the quality of modelled structure by plotting > % residues in the allowed region, as shown in figure b . the repurposing of small molecules as therapeutics to treat covid- requires knowledge of the interaction of the therapeutic molecule with sars-cov- -s. initial high-throughput virtual screening suggested molecules that exhibit reasonable interaction with sars-cov- -s. following this, the shortlisted molecules were docked in sp mode where the accuracy of prediction was improved. the docking in sp mode suggested top table as lead molecules. as hydroxychloroquine has been identified as a possible treatment for covid- , it was also subjected to subsequent docking in xp mode. all molecules showed better interaction than hydroxychloroquine with sars-cov- -s. the three molecules streptomycin, ciprofloxacin, and ga had low interaction penalties and displayed better interactions with the ace binding site on the rbd of sars-cov- -s, as shown in figure a -c, respectively. the three molecules were selected based on their reported anti-viral activity, safety, availability, and affordability - . for sars-cov- -s, the glide generated docking model showed that streptomycin could bind to sars-cov- -s in a manner highly similar to the sars-cov- -s and ace interaction. 
the binding pocket of streptomycin was in the rbd site, which has been observed to be an acceptor for ace . streptomycin was well-fitted with the shape of the pocket, as shown in figure a and figure a , with an xp score of - . , where it formed a total five hydrogen bonds, among which two hydrogen bonds were formed by donating electrons from n and n atoms to the glu side-chain atoms. simultaneously, two other hydrogen bonds were observed between the backbone atoms of leu by receiving electrons from hydroxyl groups at the th and th carbon atoms of the s six-carbon ring of streptomycin. the remaining h-bonds were formed between the backbone atom of ser and the hydroxyl group at the th carbon atom at the g group of streptomycin. however, the stability of the interaction cannot be pronounced without molecular dynamic simulations. the docking model of ciprofloxacin illustrated its binding mode on the rbd site, which has been observed to be a key interference site for virus-host interaction. the ciprofloxacin fit with reasonable steric complementarity into the rbd pocket, as shown in figure b and figure b , with an xp score of - . . the interaction of ciprofloxacin with sars-cov- -s was facilitated by two hydrogen bonds between val and phe ; each bond being formed by receiving and donating electrons from hydroxyl and ketone groups, respectively. the docking model of ga illustrated its binding mode on the rbd site, which has been observed to be a key site for interference of the virus-host interaction. the ga fit with steric complementarity in the rbd pocket, as shown in figure c and figure c , with an xp score of - . . the docking of ga with sars-cov- -s was facilitated by three hydrogen bonds with leu , val , and glu by receiving electrons from the hydroxyl groups of ga. additionally, the ketone group of ga formed a hydrogen bond, with the backbone atoms of phe receiving the electrons. 
as the sars-cov- -s receptor has aa, it requires enormous computational time to perform md simulation for the whole range of protein, hence, we confined this study only to the rbd portion, ranging from th residue to th residue, for md simulation. the rmsd can illustrate the average difference in the displacement of selected atoms in a particular frame compared to its reference frame. the plots in figure illustrate the evolution of a protein (left y-axis) and ligand (right y-axis) rmsd. post simulation, the protein and ligand frames are initially aligned over the backbone atom coordinates of the reference frame, and then the rmsd is extrapolated. the information on protein-ligand rmsd can dissect and demonstrate the conformational differences that occurred throughout the simulation. the rmsd of between - Å is fairly acceptable for small, globular proteins. an rmsd exceeding this indicates a major conformational change during the simulation and pronounces the instability of the complex. the rmsd plot for the streptomycin/sars-cov- -s complex, shown in figure a , attained equilibrium at ns and thereafter showed stability with a maximum rmsd of Å (peaks of . Å - . Å) up to ns. after ns a change in the equilibrium state was observed. however, the rmsd was within . Å, which is acceptable. similarly, the streptomycin rmsd (right y-axis) was observed to be significantly higher than the rmsd of the receptor at the rbd site. thus, it is likely that streptomycin diffuses from its initial binding site after ns. the rmsd plot for the ciprofloxacin/sars-cov- -s complex, shown in figure b , attained equilibrium at ns and thereafter showed stability with a maximum rmsd of . Å (peaks of . Å - . Å) up to ns. after ns a sudden change in equilibrium state was observed. however, the rmsd was within Å, which is acceptable. on the other hand, the rmsd values for ciprofloxacin were observed to be significantly in alignment with the rmsd of sars-cov- -s at the rbd site. 
thus, it is likely that ciprofloxacin can retain its initial binding site up to ns. the rmsd plot for the ga/sars-cov- -s complex, shown in figure c , attained equilibrium until ns. compared to sars-cov- -s complexes with streptomycin and ciprofloxacin, it was found that the sars-cov- -s complex with ga was stable until the end of the simulation without any drift in equilibrium. on the other hand, the rmsd values for ga were observed to be significantly in alignment with the rmsd of the sars-cov- -s rbd domain in almost all the frames. hence, it is likely that it remains in its initial binding site up to ns. it is predicted to inhibit sars-cov- -s at the rbd domain comparatively better than streptomycin and ciprofloxacin and for a longer duration, but its contact with key ligands has to be confirmed through rmsf and protein-ligand contact analysis. the rmsf helps characterise minute differences in the protein chain during the simulation. in rmsf plots, peaks correspond to the residues on the protein that fluctuate more during the course of a simulation. usually, terminals and loop regions fluctuate more than other secondary structures like alpha-helices and beta-strands. the secondary structure of the rbd of sars-cov- -s has the same secondary structural elements as the rbd from sars-cov, with % homologous residues. these residues are majorly formed of loops and are highly flexible. a unique phe residue in the loop plays a key role in ace interaction by occupying a deep hydrophobic pocket in ace . in the trimmed rbd structure this loop starts from th residue and ends at nd residue. since the ligand-binding site is located in this loop region, a higher rmsd was noticed. in the rmsf plot for the rbd domain of the streptomycin/sars-cov- -s complex, shown in figure a , the rmsf at the loop region was . Å with many ligand contacts (green-coloured vertical bars). this was on par with molecular docking interactions. 
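the rmsd and rmsf quantities discussed in this section can be sketched in a few lines of python. this is an illustration only, not the desmond implementation: it assumes every frame has already been superimposed on the reference frame, it measures rmsf about each atom's mean position (some tools use a reference frame instead), and the coordinates below are made up.

```python
import math


def rmsd(frame, reference):
    """RMSD between two frames given as lists of (x, y, z) tuples.

    Assumes the frame was already superimposed on the reference.
    """
    if not frame or len(frame) != len(reference):
        raise ValueError("frames must be non-empty and of equal length")
    squared = sum((a - b) ** 2
                  for p, q in zip(frame, reference)
                  for a, b in zip(p, q))
    return math.sqrt(squared / len(frame))


def rmsf(trajectory):
    """Per-atom RMSF about each atom's mean position over all frames."""
    if not trajectory:
        raise ValueError("trajectory must contain at least one frame")
    n_frames = len(trajectory)
    n_atoms = len(trajectory[0])
    values = []
    for i in range(n_atoms):
        mean = [sum(f[i][k] for f in trajectory) / n_frames for k in range(3)]
        mean_sq_fluct = sum((f[i][k] - mean[k]) ** 2
                            for f in trajectory
                            for k in range(3)) / n_frames
        values.append(math.sqrt(mean_sq_fluct))
    return values


# made-up two-atom example
reference = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
shifted = [(3.0, 4.0, 0.0), (4.0, 4.0, 0.0)]  # both atoms displaced by 5 units
trajectory = [
    [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
    [(0.0, 0.0, 0.0), (-1.0, 0.0, 0.0)],  # second atom swings, first stays put
]

frame_rmsd = rmsd(shifted, reference)
atom_rmsf = rmsf(trajectory)
```

rmsd gives one number per frame (how far the whole structure has moved from the reference), whereas rmsf gives one number per atom or residue (how much that position fluctuates over the trajectory), which is why loop residues stand out as peaks in rmsf plots.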
in the rmsf plot for the rbd domain of the ciprofloxacin/sars-cov- -s complex, shown in figure b , the rmsf at the loop region was . Å with a few ligand contacts (green-coloured vertical bars). this justifies the interactions seen in molecular docking. further, in the rmsf plot for the rbd domain of the ga/sars-cov- -s complex, shown in figure c , the rmsf at loop region was . Å with a high number of ligand contacts (green-coloured vertical bars), justifying the interactions seen in molecular docking. though the ligand contacts are seen in interactions, the simulation time coverage determines their stability. protein-ligand interactions can be traced throughout the simulation and can be categorised into four types: hydrogen bonds, hydrophobic, ionic, and water bridges, as summarised in figure a - c. the stacked bars in the plots are normalised over the course of the trajectory and help us to understand the retention of contact throughout the simulation time. the contacts with a value of more than . are expected to be retained for over % of the total simulation time. in the protein-ligand contact plot for the streptomycin/ sars-cov- -s complex, shown in figure a , residues glu and lys showed maximum interactions fractions, i.e. . facilitated by hydrogen bonds and water bridges. this suggests that the specific interaction is maintained for % of the simulation time, and such short interactions are not promising. hence, streptomycin cannot be a potential inhibitor of sars-cov- -s to offer anti-covid- activity. in the protein-ligand contact plot for the ciprofloxacin/ sars-cov- -s complex shown in figure b , residues phe , tyr , tyr , and phe were seen to have the interactions fractions . , . , . and . respectively facilitated by hydrophobic, hydrogen bonds and water bridges. this suggests that for %, %, % and % of the simulation time, the specific interaction is maintained by respective residues and such interactions are considered good. 
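the interaction fractions reported here can be read as the share of trajectory frames in which a given protein-ligand contact is present. a minimal python sketch of that reading follows; the residue labels and the per-frame contact flags are hypothetical, and the sketch assumes at most one contact of a given type per residue per frame, so that a fraction maps directly onto a percentage of simulation time.

```python
def interaction_fraction(contact_per_frame):
    """Fraction of frames in which a contact is present (True)."""
    if not contact_per_frame:
        raise ValueError("at least one frame is required")
    return sum(1 for present in contact_per_frame if present) / len(contact_per_frame)


# hypothetical per-frame contact flags for two residues over eight frames
contacts = {
    "res_a": [True, True, True, False, True, True, False, True],
    "res_b": [True, False, False, False, True, False, False, False],
}

fractions = {residue: interaction_fraction(flags)
             for residue, flags in contacts.items()}
```

with these made-up flags, `res_a` keeps its contact in 6 of 8 frames (0.75) and `res_b` in 2 of 8 frames (0.25), which mirrors the distinction the text draws between well-retained and short-lived interactions.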
hence, ciprofloxacin may be a potential inhibitor of sars-cov- -s and may offer anti-covid activity. in the protein-ligand contact plot for the ga/sars-cov- -s complex shown in figure c , residues val , glu , asn , cys , and phe had the interactions fractions . , . , . , . and . , respectively, facilitated by hydrophobic, hydrogen bonds and water bridges. this suggests that for %, %, %, % and % of the simulation time, the specific interaction is maintained by respective residues and such interactions are excellent and promising. hence, ga can be a potential inhibitor of sars-cov- -s and can offer anti-covid activity. through our topological analysis, we have determined the degree of distribution for viral proteins, and we show that, due to its low degree of distribution, ace is likely to be targeted by viruses like sars-cov. hence, the interaction between the viral protein sars-cov- -s and the host ace receptor is a potential drug target for the repurposing of known drugs. further, sequence alignment and domain analysis suggest that the rbd is the ligand-binding site. molecular docking studies have suggested streptomycin, ciprofloxacin, and ga as possible leads to inhibit sars-cov- -s. molecular dynamic simulation analysis has indicated that ga is a promising small molecule that could be repurposed as a potential inhibitor of sars-cov- -s to offer anti-covid activity. all data underlying the results are available as part of the article and no additional source data are required. current peer review status: july reviewer report https://doi.org/ . /f research. .r © hl r. this is an open access peer review report distributed under the terms of the creative commons attribution license, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. 
college of medical and health sciences, wollega university, nekemte, ethiopia the manuscript entitled " screening of known small molecules to bind ace specific rbd on in silico spike glycoprotein of sars-cov- for repurposing against covid- " addresses the repurposability of known drugs against covid- . spike glycoprotein is the most popular drug target for sars-cov- having a significant role in host entry. with this basic understanding, the manuscript is reviewed and the observations are listed below. abstract: conclusion: the statement "hence, the molecules screened for inhibitory properties against sars-cov- -s can be clinically used to treat covid- since the safety profile is already known." can be rewritten as "the ga of plant origin shown superior interaction with sars-cov- -s compared to rest other molecules, hence, ga can be clinically investigated to confirm its efficacy to treat covid- ." methods: homology modelling of sars-cov- -s protein the crystal structure of sars-cov-s and ace complex (pdb id: acd) is used as a template for homology modelling. it is expected to discuss the identity, query coverage between the query and template. discuss the criteria for template selection. in the figure legend for figures c, c , c, c, and c glycyrrhizic acid can be replaced by ga. the manuscript can be accepted for indexing with above-mentioned changes. are sufficient details of methods and analysis provided to allow replication by others? yes are all the source data underlying the results available to ensure full reproducibility? yes no competing interests were disclosed. reviewer expertise: phytochemical screening, pharmacology, molecular docking i confirm that i have read this submission and believe that i have an appropriate level of expertise to confirm that it is of an acceptable scientific standard. 
a pneumonia outbreak associated with a new coronavirus of probable bat origin. scalable algorithms for molecular dynamics simulations on commodity clusters. gromacs: high performance molecular simulations through multi-level parallelism from laptops to supercomputers. thomas db, susan ow: streptomycin as an antiviral agent: mode of action. possible antiviral effect of ciprofloxacin treatment on polyomavirus bk replication and analysis of non-coding control region sequences. in silico screening of known small molecules to bind ace specific rbd on spike glycoprotein of sars-cov- for repurposing against covid- in the present study authors have made an effort to repurpose the known drugs against sars-cov- . although the study is well organized and presented, a few minor changes are suggested for betterment. the name of the molecules should begin with uppercase. in an abstract, authors have mentioned about the anti-viral activity of streptomycin, ciprofloxacin, and glycyrrhizic acid. but, it requires the literature support. authors can cite the references for the anti-viral/anti-microbial activity in methods section. in results and discussions, authors could claim the better affinity of glycyrrhizic acid towards sars-cov- -s in comparison with other two molecules. 
change suggested: additionally, the ketone group of ga formed a hydrogen bond, with the backbone atoms of phe receiving the electrons, and ga possesses better affinity towards sars-cov- -s than streptomycin and ciprofloxacin. in the conclusion section, the authors state that ace is likely to be targeted by viruses like sars-cov. since the authors have tabulated the methods and references for the ace and sars-cov spike protein interaction in table , it would be better to rewrite this as "ace is known to be targeted by viruses like sars-cov". key: cord- -t ywnshj authors: premkumar, lakshmanane; segovia-chumbez, bruno; jadi, ramesh; martinez, david r.; raut, rajendra; markmann, alena; cornaby, caleb; bartelt, luther; weiss, susan; park, yara; edwards, caitlin e.; weimer, eric; scherer, erin m.; rouphael, nadine; edupuganti, srilatha; weiskopf, daniela; tse, longping v.; hou, yixuan j.; margolis, david; sette, alessandro; collins, matthew h.; schmitz, john; baric, ralph s.; de silva, aravinda m. title: the receptor binding domain of the viral spike protein is an immunodominant and highly specific target of antibodies in sars-cov- patients date: - - journal: sci immunol doi: . /sciimmunol.abc sha: doc_id: cord_uid: t ywnshj the severe acute respiratory syndrome coronavirus (sars-cov- ) that first emerged in late is responsible for a pandemic of severe respiratory illness. people infected with this highly contagious virus can present with clinically inapparent, mild, or severe disease. currently, the virus infection in individuals and at the population level is being monitored by pcr testing of symptomatic patients for the presence of viral rna. there is an urgent need for sars-cov- serologic tests to identify all infected individuals, irrespective of clinical symptoms, to conduct surveillance and implement strategies to contain spread. 
as the receptor binding domain (rbd) of the spike protein is poorly conserved between sars-covs and other pathogenic human coronaviruses, the rbd represents a promising antigen for detecting cov-specific antibodies in people. here we use a large panel of human sera ( sars-cov- patients and control subjects) and hyperimmune sera from animals exposed to zoonotic covs to evaluate rbd's performance as an antigen for reliable detection of sars-cov- -specific antibodies. by day after the onset of symptoms, the recombinant sars-cov- rbd antigen was highly sensitive ( %) and specific ( %) for antibodies induced by sars-covs. we observed a strong correlation between levels of rbd binding antibodies and sars-cov- neutralizing antibodies in patients. our results, which reveal the early kinetics of sars-cov- antibody responses, support using the rbd antigen in serological diagnostic assays and rbd-specific antibody levels as a correlate of sars-cov- neutralizing antibodies in people. the severe acute respiratory syndrome coronavirus (sars-cov- ) is responsible for an ongoing pandemic that has already killed over , people and paralyzed the global economy ( ) . currently, the main method for laboratory diagnosis of sars-cov- is pcr testing of nasopharyngeal swabs. there is an urgent need for highly specific and sensitive antibody detection assays to answer fundamental questions about the epidemiology and pathogenesis of sars-cov- and to implement and evaluate population-level control programs ( ) . efforts to understand the pathogenesis and define risk factors for severe sars-cov- disease have been hampered by our inability to identify all infected individuals, irrespective of clinical symptoms. to contain the pandemic, many countries resorted to the widespread quarantine of cities and regions. 
by deploying reliable antibody assays for population-level testing, it will be possible to obtain the high-resolution spatial data needed to implement policies for containing the epidemic and informing strategies for re-opening communities and cities. studies with sars-cov- and other human covs demonstrate that people rarely develop specific antibodies within the first days after onset of symptoms ( ) ( ) ( ) ( ) ( ) . by - days after onset of symptoms, greater than % of sars-cov- patients develop specific igg and igm ( ) ( ) ( ) ( ) . for sars-cov- and the more distantly related mers-cov, igg antibodies have been observed to persist for at least one year after infection ( , ) . these observations strongly support the feasibility of using antibody assays for identifying recent and remote sars-cov- infections and for conducting population-level surveillance. sars-cov- is a β-coronavirus, a subgroup that includes the closely related sars-cov- and the more distantly related mers-cov and the common-cold human covs (hcov-oc and hcov-hku ) ( ) . many companies have quickly developed tests for sars-cov- antibody detection. these assays utilize the inactivated whole virion, viral nucleocapsid protein or viral spike protein as antigens in elisa, lateral flow or other testing platforms. while the performance of these assays has not been fully evaluated, some assays appear quite sensitive when used days or more after the onset of symptoms ( , ) . the specificity of sars-cov- antibody assays has not been adequately addressed. humans are frequently infected with hcov-oc and hcov-hku and most adults have antibodies to these viruses ( ) . 
any antibody cross-reactivity between common hcovs and sars-cov- would result in false-positive results interfering with antibody-based testing and surveillance for sars-cov- . sars-cov- and hcov oc elicit antibodies that crossreact against related covs ( , ) . following the sars-cov- outbreak in , the overall specificity of serological assays utilizing the nucleocapsid protein of sars-cov- was poor, whereas assays based on the spike protein were more specific ( ) ( ) ( ) . in recent studies, the receptor binding domain (rbd) of the spike protein of sars-cov- has shown promise as an antigen for specific antibody detection ( , , ) . here we report the production of properly folded recombinant receptor binding domains (rbds) from the spike proteins of sars and common-cold hcovs in mammalian cells. we use these recombinant antigens and a large diverse panel of human and animal sera to evaluate the rbd as an antigen for sars-cov- serology. we demonstrate that the recombinant sars-cov- rbd antigen is highly sensitive and specific for detection of antibodies induced by sars-covs. we also observed a strong correlation between the levels of rbd-binding antibodies and levels of sars-cov- neutralizing antibodies in patients. our results support the use of rbd-based antibody assays for serology and as a correlate of neutralizing antibody levels in symptomatic people who have recovered from sars-cov- infections. the s and s subunits of the spike (s) protein of coronaviruses are required for viral entry. the surface accessible receptor binding domain (rbd) on the s subunit binds to receptors on target cells, whereas the exposure of the fusion loop in the s subunit induces fusion of the viral envelope to the host cellular membranes ( ) . the rbds of sars-covs, which bind to angiotensin-converting enzyme (ace ) receptor on the host cells, are also a major target of human antibodies ( fig. a and b) . 
as the rbd is a common target of human antibodies and poorly conserved between sars-covs and other pathogenic human coronaviruses (fig. c) , this domain is a promising candidate for use in antibody-based diagnostic assays. we expressed the rbds of the sars-covs and four common human coronaviruses (hcov-hku- , -oc , -nl and - e) as fusion proteins that were secreted from human cells. the recombinant rbds were purified from the cell culture medium by affinity chromatography and purity was confirmed by sds-page (fig. d ). we used sera and monoclonal antibodies from animals immunized with sars-cov- or - spike proteins to assess the structural integrity of the purified recombinant rbd antigens. pooled serum from mice immunized with sars-cov- spike protein had antibodies that bound well to the rbd of sars-cov- and poorly to the rbds of sars-cov- and other common hcovs (fig. e ). sera from mice or rabbits immunized with sars-cov- or cross-reactive monoclonal antibody c reacted with the rbds of sars-cov- and - but not common human covs (fig. e ). human serum collected before sars-cov- emerged contained antibodies to common α- and β-hcovs (nl and hku- ) but not to sars-cov rbd antigens (fig. e) . these results suggest that the purified recombinant rbd antigens retain native structures required for specific antibody binding. to evaluate the specificity of the recombinant sars-cov- rbd in serology, we used human sera collected from different populations before the current pandemic. the sera were tested at a high concentration ( : dilution) for binding to the recombinant rbds from sars-cov- , sars-cov- and common α- and β-hcovs (fig. ) . sera collected from healthy american adults (n = ) before the sars-cov- pandemic frequently had high levels of antibodies to the recombinant rbds of nl and hku- covs but not to sars-covs ( fig. a) . 
we also tested archived pre-sars-cov- pandemic sera collected from individuals in south asia, the caribbean and central america who had recently recovered from arbovirus infections. as in the case of healthy adults from the usa, most of the subjects from different parts of the world had high levels of antibodies to the rbd of common hcovs but no antibodies to the rbd of sars-covs (fig. b ). to assess if other human respiratory viruses stimulated antibodies that cross-reacted with the recombinant sars-cov rbd, we tested early convalescent sera from people with laboratory the known pathogenic human covs are members of the α-coronavirus and β-coronavirus genera (fig. a) . hcov-nl and e are two α-coronaviruses that frequently infect and cause a mild common-cold-like illness in most people. hcov-oc and hku- are two group a βcoronaviruses that also commonly infect people and cause mild disease. most adults (> %) have antibodies to these common-cold hcovs. sars-cov- and - and mers-cov are group b and c zoonotic β-coronaviruses that have recently crossed into humans and caused severe illness. the α-and βcoronavirus genera also contain a large number of zoonotic viruses that infect different animal hosts, which have not been implicated in human disease to date. to further assess the specificity of sars-cov- rbd for serology, we obtained and tested sera from people who had recently recovered from a laboratory-confirmed common-cold hcov infection and sera from guinea pigs immunized with different animal covs ( fig. b and c) . none of the immune sera from people exposed to recent hcov infections cross-reacted with the recombinant rbd of sars-covs. none of the guinea pigs vaccinated with different zoonotic covs had antibodies that cross-reacted with the recombinant sars-cov rbds ( fig. b and c). 
these results establish that most individuals, including people who have been recently exposed to acute common hcov infections, do not have detectable levels of cross-reactive antibodies to the recombinant rbd of sars-covs. to evaluate the sensitivity of the rbd of sars-cov- for identifying infected individuals, we obtained a total of serum samples from patients with laboratory-confirmed (i.e., pcr positive) sars-cov- infections collected at different times after the onset of symptoms. all the samples were tested for binding of total immunoglobulin (ig) and igm antibodies to recombinant rbd antigens from sars-covs and common-cold hcovs. the sensitivity of the assay was high ( % and % respectively for ig and igm) for specimens collected days or more after onset of symptoms (fig. a ). as expected, overall sensitivity was lower ( % and % respectively for ig and igm) for specimens collected between and days after onset of symptoms (fig. a ). with samples collected days or more after onset of symptoms, we observed some ig and igm antibody cross reactivity with the rbd of sars-cov- ( % and % respectively for ig and igm), which was anticipated as these viruses are closely related group b β-coronaviruses ( , ) . when the specimens were further analyzed to estimate the timing of seroconversion, we observed a marked transition from seronegative to seropositive for both ig and igm about days after the onset of symptoms ( fig. a and b ). by day after onset of symptoms, most patients had high end-point titers in the rbd ig elisa (fig. s ). to analyze the kinetics of all three of the major isotypes of serum antibodies within the first weeks after the onset of symptoms, we separately measured igg, iga, and igm in serum samples obtained from sars-cov- infected patients at > days after onset of symptoms. most individuals ( / ) developed igg responses (fig. c ). iga and igm responses were observed less frequently (iga = / , igm = / ) than igg (fig. c ). 
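the sensitivity and specificity figures quoted for the rbd assay follow the standard definitions: sensitivity is the share of truly infected (pcr-confirmed) individuals who test antibody-positive, and specificity is the share of uninfected controls who test negative. a minimal python sketch follows; the counts are hypothetical and are not taken from this study.

```python
def sensitivity(true_positives, false_negatives):
    """Share of truly infected samples that the assay calls positive."""
    total = true_positives + false_negatives
    if total == 0:
        raise ValueError("no infected samples")
    return true_positives / total


def specificity(true_negatives, false_positives):
    """Share of uninfected control samples that the assay calls negative."""
    total = true_negatives + false_positives
    if total == 0:
        raise ValueError("no control samples")
    return true_negatives / total


# hypothetical counts: 50 pcr-confirmed patients and 100 pre-pandemic controls
assay_sensitivity = sensitivity(true_positives=45, false_negatives=5)
assay_specificity = specificity(true_negatives=99, false_positives=1)
```

because seroconversion takes time, the same assay yields a lower sensitivity when the infected samples are drawn early after symptom onset, which is the pattern described above.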
for individuals with laboratory-confirmed sars-cov- infection, we had two specimens collected at different times early in the infection (fig. d ). two subjects (p and p ) were seronegative within the first days and seropositive for both ig and igm or more days after onset (fig. d ). for three subjects (p , p , p ) the acute samples were collected after days and the convalescent samples were collected days or more after onset. in these individuals both acute and convalescent samples were positive, and we observed an increase in ig and igm levels in the second specimen. for the remaining subjects, the acute specimen was collected on day after onset and the convalescent specimen was collected > days after onset. six out of the subjects already had specific ig, igm or both in the acute specimen collected on day . all the subjects except one (p ) seroconverted or had elevated levels of antibody in the convalescent sample collected > days after onset of symptoms. these results indicate that most people seroconvert between days and after onset of symptoms. subject p was an outlier and did not develop specific ig or igm antibodies. all the individuals with documented sars-cov- had ig but not igm antibodies that bound to the rbd of common hcovs, which is consistent with their high prevalence in humans (fig. a ). these results demonstrate that the rbd of sars-cov- is a highly sensitive antigen for antibody detection in patients days or more after onset of symptoms. the administration of convalescent plasma containing antibodies to sars-cov- is being evaluated for patients with severe disease. while the fda has not approved convalescent plasma therapy, on may , , the fda recommended that sars-cov- neutralizing titers of at least : should be used for human passive immunization studies. further, the fda also recommended that a titer of : may be acceptable if an alternative matched unit is not available. 
as the rbd of the s protein is critical for viral entry, antibodies targeting this domain of sars-cov- are likely to be neutralizing and potentially protective, as is seen in cell culture and animal models for other pathogenic covs ( , ) . to assess the relationship between the rbd-binding activity and the neutralizing antibody response, we tested pcr-confirmed sars-cov- patient immune sera in a sars-cov- luciferase neutralization assay (fig. ). as judged by the spearman test (ρ = . , p < . ), we observed that the magnitude of the total rbd-binding ig antibody strongly correlated with the levels of neutralizing antibodies in sars-cov- patients.

currently, patients who have had a documented sars-cov- infection identified by rt-pcr or a serologic test, and who are clear of symptoms for at least days, are recruited for convalescent plasma donation. we evaluated the neutralizing potency in patient samples collected between and days with a titer of at least : (fig. d ). we observed that % of patients ( / ) developed weak to no neutralizing antibodies even days after onset of symptoms, suggesting that the number of days since the start of symptoms is a poor determinant of the levels of sars-cov- neutralizing antibodies in the patients included in our study, particularly within the early convalescent phase (< weeks). to evaluate whether a simple rbd elisa can be used as a surrogate for neutralizing potency in sars-cov- patients, we analyzed the relationship between the level of total ig antibody to rbd and a neutralizing antibody titer of at least : . we observed that / people who had substantial total ig binding to rbd (> . od) also developed a robust neutralizing antibody titer (fig. e) . notably, only / people who developed a relatively weak rbd-binding antibody had a neutralizing antibody titer higher than : . one subject (p ) neither seroconverted for rbd antigen nor developed neutralizing antibodies to sars-cov- ( fig. d and e, and fig. s ).
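The spearman test used above compares the rank order of rbd-binding signal with the rank order of neutralizing titer, so it does not assume a linear relationship between od and titer. The following is a minimal plain-python sketch of that statistic (the study itself reports using graphpad prism); the function names and the example values are hypothetical and are not study data.

```python
from math import sqrt
from typing import List, Sequence


def average_ranks(values: Sequence[float]) -> List[float]:
    """Rank values from 1..n; tied values receive the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # positions i..j are 0-based
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman_rho(x: Sequence[float], y: Sequence[float]) -> float:
    """Spearman coefficient: Pearson correlation computed on the ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = sqrt(sum((a - mx) ** 2 for a in rx))
    sy = sqrt(sum((b - my) ** 2 for b in ry))
    return cov / (sx * sy)


# Illustrative, made-up values (NOT study data): ELISA OD and reciprocal titers.
od_values = [0.2, 0.5, 1.1, 1.8, 2.4]
nt_titers = [20, 40, 35, 320, 640]
print(f"rho = {spearman_rho(od_values, nt_titers):.2f}")  # rho = 0.90
```

Working on ranks is a sensible choice here because neutralization titers come from serial dilutions and span orders of magnitude, whereas od values saturate at the top of the plate reader's range.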
serology is critical to understanding the transmission, pathogenesis, mortality rate and epidemiology of emerging viruses. in the few months after the discovery of sars-cov- as a human pathogen, scientists have developed a large number of antibody assays and many commercial tests are now available. although none of the assays have been fully validated yet, the fda has granted emergency use authorization (eua) for multiple tests, while stressing the need for further validation. investigators have already encountered problems with the specificity and sensitivity of commercial assays rushed to market ( , ) . widespread use of inaccurate antibody assays could lead to policies that exacerbate the current sars-cov- pandemic instead of containing it. to address the need for reliable antibody-based diagnostic assays, we focused on the rbd domain of the spike protein because this region is poorly conserved between different covs and is also known to be a major target of human antibodies ( ) . a major concern with using a protein domain instead of a full-length protein or whole virion for antibody detection is possible reduction in assay sensitivity. however, we observed that over % of sars-cov- patients developed antibodies to the rbd days after onset of symptoms. although our study included only a few recent convalescent sera and relatively large numbers of presumably positive samples from past common human cov infections, the high specificity of the rbd antigen was also evident with the serum specimens from animals that were hyperimmunized with other zoonotic covs. some patients infected with sars-cov- had antibodies that cross-reacted with the rbd of sars-cov- . we have not tested the more distantly related rbd ag from mers cov or the serum samples from individuals with confirmed mers infection. since sars-cov- and mers cov seroprevalence are very low in humans, the sars-cov- antibody cross-reactivity with sars-cov- is unlikely to pose diagnostic challenges. 
other recent studies that have been published or are under peer review also support the high specificity and sensitivity of the sars-cov- rbd for antibody detection ( , , ) . amanat and colleagues tested samples from sars-cov- patients collected at the beginning of the epidemic in the usa and reported that the full length s protein and the rbd performed well for specific antibody detection ( ) . okba and colleagues compared the performance of different sars-cov- antigens for antibody detection using samples from sars-cov- patients in europe ( ) . for the sars-cov- spike rbd, they observed levels of specificity and sensitivity that were comparable to our results reported here. the s subunit, which comprises conserved regions between covs, was less specific than the rbd ( ). perera and colleagues evaluated the performance of the rbd for antibody detection using samples from sars-cov- patients in hong kong ( ) . they also observed high specificity and sensitivity when patients were tested days or more after onset of illness. our study with specimens from documented sars-cov- patients, which includes patients presenting to hospitals in north carolina and georgia with varying levels of severity, together with these recent studies conducted in new york, europe and hong kong, strongly supports the use of sars-cov- rbd as an antigen for antibody detection.

we designed the assay for separate detection of rbd-specific total ig and igm. as the pandemic is ongoing and most infections are likely to have occurred within the past few months, infected individuals have variable levels of antigen-specific igg, igm and iga (fig. c ). to maximize assay sensitivity and to prevent different antibody isotypes competing for binding sites and reducing assay signal, we measured total ig. we did not observe any decrease in assay specificity by designing the assay to monitor levels of total ig instead of igg binding to the rbd even at high serum concentration or with hyperimmune sera.
our study showed that igm and iga antibodies can also be detected using rbd-based serological assays. both iga and igm antibodies are relatively short lived and indicative of a recent exposure. when conducting large scale population level surveillance for sars-cov- antibodies, it will be possible to distinguish recent from remote infections by measuring both total ig and igm (or iga) binding to the rbd. antibody assays that correlate with protective immune responses in individuals who have recovered from sars-cov- infection and also reflect herd immunity at a population level are urgently needed to define each individual's risk of disease and to identify communities at high risk for new waves of infection. in animal studies with sars-cov- , virus-neutralizing antibodies were strongly correlated with protective immune responses ( ) . we observed a striking correlation between the levels of rbd antibodies in patients and the ability of patient sera to neutralize sars-cov- virus. other groups have recently reported finding a strong correlation between spike/rbd antibodies and sars-cov- neutralization in patients infected with sars-cov- ( , , ) . our results point out that roughly one-third of patients develop very low or no neutralizing antibodies to sars-cov- and that ig and igm antibodies are useful predictors of neutralizing antibody levels in patients in the early convalescent phase (< weeks). as people developing a high level of rbd-binding antibodies (> . od) also have a robust neutralizing response, a simple rbd-based elisa can be a useful tool to identify blood plasma donors. while further studies are needed to fully evaluate rbd antibodies as correlate of protective immunity, the results to date indicate that rbd antibodies are a promising correlate of protection in the early convalescent phase. 
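The proposal above, using a simple rbd elisa readout to pick out plasma donors with robust neutralizing titers, amounts to cross-tabulating samples by an od cutoff and a titer cutoff. The sketch below illustrates that cross-tabulation; both cutoffs, the function names, and the sample values are hypothetical placeholders chosen for the example, not the thresholds or data used in the study.

```python
from typing import Dict, Iterable, Tuple


def surrogate_table(samples: Iterable[Tuple[float, float]],
                    od_cutoff: float,
                    titer_cutoff: float) -> Dict[str, int]:
    """Cross-tabulate RBD binding (OD) against neutralizing titer.

    samples: (elisa_od, reciprocal_neutralizing_titer) pairs.
    Both cutoffs are assumptions supplied by the caller.
    """
    table = {"high_od_high_nt": 0, "high_od_low_nt": 0,
             "low_od_high_nt": 0, "low_od_low_nt": 0}
    for od, titer in samples:
        od_key = "high_od" if od > od_cutoff else "low_od"
        nt_key = "high_nt" if titer >= titer_cutoff else "low_nt"
        table[f"{od_key}_{nt_key}"] += 1
    return table


def positive_predictive_value(table: Dict[str, int]) -> float:
    """Fraction of high-OD samples that also reach the titer cutoff."""
    flagged = table["high_od_high_nt"] + table["high_od_low_nt"]
    return table["high_od_high_nt"] / flagged if flagged else float("nan")


# Illustrative, made-up samples and cutoffs (NOT study data).
demo_samples = [(2.5, 640), (2.1, 320), (1.9, 80),
                (0.8, 40), (0.4, 20), (0.6, 200)]
demo_table = surrogate_table(demo_samples, od_cutoff=1.5, titer_cutoff=160)
print(demo_table)
print(f"PPV = {positive_predictive_value(demo_table):.2f}")  # PPV = 0.67
```

For donor screening the positive predictive value is the most directly useful summary, because it answers how often a donor flagged by the elisa would actually meet the neutralizing titer requirement.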
a simple antibody detection assay that also predicts individual-level risk of disease will be a major advance for vaccine development and for assessing the immunogenicity of vaccines because sars-cov- neutralization assays are time-consuming and require bsl- containment. one sars-cov- patient (p ) who tested positive for viral rna and required hospitalization did not develop rbd-specific ig, igm or neutralizing antibodies, even at days after the onset of symptoms. this was the only person among the pcr-positive subjects who did not seroconvert by days after onset of symptoms in the rbd-based assay. while we cannot rule out the possibility of a false-positive pcr test result, others have also reported rare instances where people infected with sars-covs have atypical, dampened immune responses ( ) . further studies are needed to establish the frequency and significance of atypical antibody responses in sars-cov- patients and to characterize the serological repertoire and epitopes targeted by the antibodies in convalescent sera. as sars-cov- infections in the southeastern u.s. have started to increase relatively recently, all convalescent samples used in this study were collected within days following onset of symptoms. in most patients, the convalescent sera had high end-point titers (> : ) in the rbd ig elisa, supporting the utility of this assay even as antibody levels start to wane over time. we need to prioritize studies to prospectively monitor sars-cov- patients to determine the long-term kinetics of antibody levels and the performance of antibody detection assays over time. all the sars-cov- human immune sera used for this study were collected from symptomatic patients, including many with serious illness requiring hospitalization. the research community currently does not know if individuals experiencing mild/inapparent symptoms after sars-cov- infection have similar kinetics and levels of rbd-binding antibodies as those experiencing symptomatic infections.
studies must be done with individuals experiencing mild/inapparent sars-cov- infections to define the kinetics and levels of rbd antibodies before implementing large population-level antibody testing. the goal of the study was to evaluate the performance of rbd-based spike antigen for reliable detection of sars-cov- -specific antibodies. we produced properly folded rbd from the spike proteins of sars and common-cold hcovs in mammalian cells and used this antigen to evaluate a large panel of human sera from documented sars-cov- patients and control subjects, and hyperimmune sera from animals exposed to zoonotic covs. we also used a sars-cov- luciferase neutralization assay to assess the dynamics of the neutralizing antibody response and its association with the rbd-binding activity. the structure coordinate sets of the spike proteins, spike protein complexes with their cognate receptor ace and monoclonal antibodies were obtained from the protein data bank (pdb). the structures were aligned to the reference spike protein using the pymol molecular graphics system (version . r pre, schrödinger, llc). molecular figures were drawn using pymol. the pdb coordinates used for the containing human serum albumin secretion signal sequence, three purification tags ( xhistidine tag, halo tag, and twin-strep tag) and two tev protease cleavage sites was cloned into the mammalian expression vector pαh. s rbds were expressed in expi cells (thermofisher) and purified from the culture supernatant by nickel-nitrilotriacetic acid agarose (qiagen). to generate virus replicon particles (vrps), the sars-cov- s gene was inserted into pvr as previously described ( ) . in summary, the sars-cov- s gene was ligated into pvr following digestion by restriction endonuclease sites, paci and apai. t rna transcripts were generated using the sars-cov- -s-pvr construct in conjunction with plasmids containing the venezuelan equine encephalitis virus envelope glycoproteins and capsid protein. 
the rna transcripts were then electroporated into baby hamster kidney fibroblasts and monitored for cytopathic effect. vrp were harvested hours after electroporation and purified via high-speed ultra-centrifugation. to generate serum samples against sars-cov- , -week-old balb/c mice (jackson labs) were inoculated via footpad injection with the vrp and boosted with the same dose one time three weeks later. serum samples were then collected from individual animals at weeks post-boost and pooled for use in assays. all human specimens used in these studies were obtained after informed consent under good clinical research practices (gcp) and compliant with oversight by the relevant institutional review boards (irbs). a list of the sars-cov- patient samples included in the study with basic demographic and clinical information can be found in table s . emory university school of medicine specimens: specimens were obtained from patients with symptomatic illness and clinical testing confirming sars-cov- by pcr (cdc sars-cov- test). de-identified specimens were shared with researchers at unc consistent with local irb protocols (emory irb# and ). blood plasma donor study: convalescent sera was obtained from donors who volunteered for plasma collections at the unc donation center. fresh sera collected as part of the standard plasmapheresis procedure were saved for research from donors who signed informed consent. unc irb - is conducted under good clinical research practices (gcp) and is compliant with institutional irb oversight. all donors had confirmed sars-cov- infection by nasopharyngeal swab indicating the presence of sars-cov- rna as performed by eua approved qrt-pcr in a us laboratory with a clinical laboratory improvement amendments (clia) certification. all donors had recovered from their sars-cov- illness and were at least days post last symptoms. 
donors who presented for plasma collection prior to days from their last symptoms had a confirmed negative nasopharyngeal rt-pcr test done within hours prior to donation. healthy unexposed donors: samples from healthy u.s. adult donors were obtained by the la jolla institute for immunology (lji) clinical core or provided by a commercial vendor (carter blood care) for prior, unrelated studies between early and early , at least one year before the emergence of sars-cov- . the lji institutional review board approved the collection of these samples (lji; vd- ). samples from the caribbean, central america and south asia were obtained from archived samples at unc collected before december for other studies. human and animal specimens from bei resources: the following reagents were obtained through bei resources, niaid, nih as part of the human microbiome project: pooled sera obtained from rabbits dosed with a recombinant sars-cov spike protein (nrc- ), monoclonal anti-sars-cov s protein (similar to c) (nr- ), anti-porcine respiratory coronavirus (prcov; isu- ) serum obtained from pig (nr- ), anti-porcine transmissible gastroenteritis virus obtained from pig (nr- ), anti-porcine respiratory coronavirus (prcov; isu- ) serum obtained from guinea pig (nr- ), anti-sars coronavirus obtained from guinea pig (nr- ), anti-bovine coronavirus (mebus) obtained from guinea pig (nr- ), anti-feline infectious peritonitis virus, - obtained from guinea pig (nr- ), anti-avian table s ). in-house rbd ig and igm elisa all serum specimens tested by elisa assay were heat-inactivated at °c for min to reduce risk from any possible residual virus in serum. briefly, μl of spike rbd antigen at μg/ml in tris buffered saline (tbs) ph . was coated in the -well high-binding microtiter plate (greiner bio-one cat # ) for hour at °c. then the plate was washed three times with μl of wash buffer (tbs containing . % tween ) and blocked with μl of blocking solution ( % milk in tbs containing . 
% tween ) for hour at °c. the blocking solution was removed, and μl of serum sample at : or indicated dilutions in blocking buffer was added for hour at °c. the plate was washed in the wash buffer, and μl of alkaline phosphatase-conjugated goat anti-human secondary antibody at : dilution was added for hour at °c. for measuring total ig, a mixture of anti-igg (sigma cat # a ), anti-iga (abcam cat # ab ), and anti-igm (sigma cat # a ) was added. for measuring a specific antibody isotype, only secondary goat anti-human igg or iga or igm was used. the plate was washed, and μl p-nitrophenyl phosphate substrate (sigma fast, cat no n ) was added to the plate and absorbance was measured at nm using a plate reader (biotek epoch, model # ). for testing animal sera, the secondary antibody was matched to the species as follows: goat anti-mouse igg (sigma, a ), goat anti-rabbit igg (abcam, ab ), goat anti-pig igg (abcam, ab ), and goat anti-guinea pig igg (abcam, ab ).

full-length viruses expressing luciferase were designed and recovered via reverse genetics as described previously ( , ) . viruses were titered in vero e usamrid cells to obtain a relative light units (rlu) signal of at least x the cell-only control background. vero e usamrid cells were plated at , cells per well the day prior in clear bottom black-walled -well plates (corning ). neutralizing antibody serum samples were tested at a starting dilution of : , and were serially diluted -fold up to eight dilution spots. antibody-virus complexes were incubated at °c with % co for hour. following incubation, growth media was removed and virus-antibody dilution complexes were added to the cells in duplicate. virus-only controls and cell-only controls were included in each neutralization assay plate. following infection, plates were incubated at °c with % co for hours.
after the hour incubation, cells were lysed and luciferase activity was measured via nano-glo luciferase assay system (promega) according to the manufacturer's specifications. sars-cov- neutralization titers were defined as the sample dilution at which a % reduction in rlu was observed relative to the average of the virus control wells.

the data points in fig. e, fig. , fig. b and c, fig. and are presented as means of technical duplicates. the correlation of rbd binding and neutralization titers shown in fig. a and fig. b was evaluated using a spearman correlation coefficient (rs) and the associated two-tailed p-value (graphpad prism, version ). receiver operating characteristic (roc) analyses were performed to establish cutoff values for sars-cov- seropositivity using spss software. statistical analyses were performed using spss software ver. . (ibm, armonk, ny, usa).

immunology.sciencemag.org/cgi/content/full/ / /eabc /dc
fig. s . titration curves of sera from sars-cov- positive patients.
fig. s . seroconversion of sars-cov- neutralizing antibodies.
fig. s . estimation of rbd elisa assay cutoff.
table s . summary of samples tested and associated characteristics (excel spreadsheet).
table s . raw data file (excel spreadsheet).

covs. (e) binding characterization of the spike rbd antigens with immune sera and a monoclonal antibody. sars-cov- monoclonal antibody ( c), serum from a mouse immunized with vrp expressing sars-cov- or sars-cov- spike protein, serum from a rabbit immunized with sars-cov- spike protein and an archived human sample collected before sars-cov- were tested for binding against rbd spike antigens from sars-cov- , sars-cov- , hcovα (nl ) and hcovβ (hku- ). the cutoff values determined by the receiver operating characteristic (roc) curve analysis (fig s ) for the elisa assay are indicated by the broken line.

scatter plots were generated using individual serum binding to rbd antigen (y-axis) versus sars-cov- neutralizing antibody titers (x-axis).
the nonparametric spearman correlation coefficient (rs) and the associated two-tailed p-value were calculated (graphpad prism, version . ). (c) relationship between sars-cov- neutralizing antibody titer and days after onset of symptoms. (d) total ig antibody binding to rbd as a surrogate for identifying people with high sars-cov- neutralizing antibodies. a total of serum samples collected between and days after onset of symptoms from pcr-confirmed sars-cov- subjects were tested for ig and igm binding to spike rbd antigen and in the sars-cov- neutralization assay. the fda-recommended neutralizing antibody titer for plasma therapy ( : ) is indicated by the broken green line.

virology, epidemiology, pathogenesis, and control of covid-
the important role of serology for covid- control
profiling early humoral response to diagnose novel coronavirus disease (covid- )
severe acute respiratory syndrome coronavirus -specific antibody responses in coronavirus disease
temporal profiles of viral load in posterior oropharyngeal saliva samples and serum antibody responses during infection by sars-cov- : an observational cohort study
antibody responses to sars-cov- in patients of novel coronavirus disease
chronological evolution of igm, iga, igg and neutralisation antibodies after infection with sars-associated coronavirus
mers-cov antibody responses year after symptom onset
two-year prospective study of the humoral immune response of patients with severe acute respiratory syndrome
origin and evolution of pathogenic coronaviruses
development and clinical application of a rapid igm-igg combined antibody test for sars-cov- infection diagnosis
cross-reactive antibodies in convalescent sars patients' sera against the emerging novel human coronavirus emc ( ) by both immunofluorescent and neutralizing antibody tests
an outbreak of human coronavirus oc infection and serological cross-reactivity with sars coronavirus
false-positive results in a recombinant severe acute respiratory syndrome-associated coronavirus (sars-cov) nucleocapsid-based western blot assay were rectified by the use of two subunits (s and s ) of spike for detection of antibody to sars-cov
antigenic cross-reactivity between severe acute respiratory syndrome-associated coronavirus and human coronaviruses e and oc
a serological assay to detect sars-cov- seroconversion in humans
serological assays for severe acute respiratory syndrome coronavirus (sars-cov- )
the spike protein of sars-cov - a target for vaccine and therapeutic development
cross-reactive antibody response between sars-cov- and sars-cov infections
potent binding of novel coronavirus spike protein by a sars coronavirus-specific human monoclonal antibody
rapid point-of-care testing for sars-cov- in a community screening setting shows low sensitivity
neutralizing antibody responses to sars-cov- in a covid- recovered patient cohort and their implications. medrxiv
development of a broadly accessible venezuelan equine encephalitis virus replicon particle vaccine platform
reverse genetics with a full-length infectious cdna of the middle east respiratory syndrome coronavirus

competing interests: the authors declare that they have no competing interests.

data and materials availability: the recombinant rbd antigens from the spike proteins used in this study are available under a standard mta with the university of north carolina. please contact lakshmanane premkumar (prem@med.unc.edu) or aravinda m

acknowledgments: we gratefully acknowledge bei resources (https://www.beiresources.org) for the prompt processing and shipping of the reagents. we are grateful for the expert procedural care provided by the unc hospital and blood donor center and to the patients and blood donors providing samples for the study.
key: cord- - mo u authors: miersch, shane; li, zhijie; saberianfar, reza; ustav, mart; blazer, levi; chen, chao; ye, wei; pavlenco, alia; subramania, suryasree; singh, serena; ploder, lynda; ganaie, safder; leung, daisy; chen, rita e.; case, james brett; novelli, guiseppe; matusali, giulia; colavita, francesca; copabianchi, maria r.; jain, suresh; gupta, j.b.; amarasinghe, gaya; diamond, michael; rini, james; sidhu, sachdev s. title: tetravalent sars-cov- neutralizing antibodies show enhanced potency and resistance to escape mutations date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: mo u recombinant neutralizing antibodies (nabs) derived from recovered patients have proven to be effective therapeutics for covid- . here, we describe the use of advanced protein engineering and modular design principles to develop tetravalent synthetic nabs that mimic the multi-valency exhibited by iga molecules, which are especially effective natural inhibitors of viral disease. at the same time, these nabs display high affinity and modularity typical of igg molecules, which are the preferred format for drugs. we show that highly specific tetravalent nabs can be produced at large scale and possess stability and specificity comparable to approved antibody drugs. moreover, structural studies reveal that the best nab targets the host receptor binding site of the virus spike protein, and thus, its tetravalent version can block virus infection with a potency that exceeds that of the bivalent igg by an order of magnitude. design principles defined here can be readily applied to any antibody drug, including iggs that are showing efficacy in clinical trials. thus, our results present a general framework to develop potent antiviral therapies against covid- , and the strategy can be readily deployed in response to future pathogenic threats. 
to date, all clinically advanced candidate nabs against sars-cov- infection have been derived by cloning from b-cells of recovered covid- patients or from other natural sources , , - . here, we applied an alternative strategy using in vitro selections with phage-displayed libraries of synthetic abs built on a single human framework derived from the highly validated drug trastuzumab. this approach enabled the rapid production of high affinity nabs with properties optimized for drug development. moreover, the use of a highly stable framework enabled facile and modular design of ultra-high affinity nabs in tetravalent formats that retained favorable drug-like properties and exhibited neutralization potencies that greatly exceeded those of the bivalent igg format. these methods provide a general means to rapidly improve the potency of virtually any nab targeting sars-cov- and its relatives, and thus, our strategy can be applied to improve covid- therapies and can be adapted in response to future pathogenic threats.

results

using a phage-displayed human antigen-binding fragment (fab) library similar to the highly validated library f , we performed four rounds of selection for binding to the biotinylated rbd of sars-cov- immobilized on streptavidin-coated plates. screening of clones for binding to cov- rbd revealed fab-phage clones that bound to the rbd but not to streptavidin. fab-phage were screened by elisa and those that exhibited > % loss in binding to rbd in the presence of nm ace were sequenced, revealing unique clones (fig. a) , which were deemed to be potential nabs and were converted into the full-length human igg format for purification and functional characterization. to estimate affinities, elisas were performed with serial dilutions of igg protein binding to biotinylated s protein trimer captured with immobilized streptavidin, and these assays showed that three iggs bound with ec values in the sub-nanomolar range (fig. b,c and table ).
elisas also confirmed that each igg could partially block the binding of biotinylated ace to immobilized s protein (fig. d) . moreover, similar to the highly specific igg trastuzumab, elisas showed that the three iggs did not bind to seven immobilized proteins that are known to exhibit high non- specific binding to some iggs, and lack of binding to these proteins has been shown to be predictive of good pharmacokinetics in vivo (fig. e) , . we also used biolayer interferometry (bli) to measure binding kinetics and determine avidities more accurately, and all three antibodies exhibited sub-nanomolar dissociation constants (table , fig. s ), in close accord with the estimates determined by elisa. igg exhibited the highest avidity, which was mainly due to a two-or seven-fold faster on-rate than igg or , respectively, and thus, we focused further efforts on this ab. we took advantage of the precision design of our synthetic ab library to rapidly improve the affinity of ab . the synthetic library was designed with tailored diversification of key positions in all three heavy chain complementarity-determining regions (cdrs) and the third cdr of the light chain (cdr-l ). consequently, we reasoned that the already high affinity of ab could be further improved by recombining the heavy chain with a library of light chains with naïve diversity in cdr-l . following selection for binding to the rbd, the light chain library yielded numerous variants, of which were purified in the igg format and analyzed by bli (fig. s ) . several of the variant light chains resulted in iggs with improved binding compared with igg , and in particular, igg - (fig. b) exhibited significantly improved avidity (kd = or pm, respectively) due to an off-rate that was an order of magnitude slower (table , fig. s ) . to understand the molecular basis for antagonism of ace binding, we solved the x-ray crystal structures of the sars-cov- rbd in complex with fab or - at . or . Å resolution, respectively ( fig. 
a) . as expected, backbone superposition showed that the two complexes were essentially identical (rmsd = . Å). however, there were differences in side chain interactions due to sequence differences in the cdr-l loop, which explained the enhanced affinity of fab - compared with fab (fig. b) . although the side chains of tyr l in fab and his l in fab - both make hydrogen bonds with the side chain of tyr in the rbd, the bond mediated by his l is shorter, and thus, likely to be stronger. moreover, in fab - , the side chain of his l also makes an intramolecular hydrogen bond with the side chain of thr l , which tyr l and arg l are incapable of making in fab , and this interaction may stabilize the cdr-l loop of fab - in a conformation that is favorable for antigen recognition. thus, the crystal structures show that the two substitutions in the cdr-l loop of fab - relative to fab act in a cooperative manner to mediate favorable intermolecular contacts with the rbd, and also, intramolecular interactions that stabilize the loop in a conformation that may be better positioned to interact with the rbd. we next analyzed the structures to understand how the abs could function as antagonists of rbd binding to ace . binding of fab - to the rbd involves an extensive interface, with and Å of surface area buried on the epitope or paratope, respectively, and % or % of the structural paratope is formed by the light or heavy chain, respectively (fig. c) . comparison of the fab and ace epitopes on the rbd revealed extensive overlap, with % or % of the fab or ace epitope occluded by the other ligand (fig. c) . thus, direct steric hindrance explains the blockade of ace binding by fabs and - (fig. d) . we also used cryogenic electron microscopy to visualize fab in complex with the s protein trimer (fig. s a) .
this analysis revealed that all three rbds in a single trimer were positioned in an "up" conformation, which was similar to the conformation bound to ace , and the three rbds were bound to three fab molecules. notably, the c-termini of the three fabs were positioned close to each other and pointed away from the s protein, suggesting that a single igg may be able to present two fabs in a manner that would enable simultaneous engagement of two rbds on a single s protein. indeed, this was confirmed in single particle negative stain electron micrographs of igg and the s protein, which revealed that the two fabs of a single igg bound two rbds on a single s protein trimer with a pincer-like grip (fig. s b) . taken together, the x-ray crystallography and electron microscopy showed that fabs and - block ace binding to rbd by direct steric hindrance, and simultaneous binding of fabs to multiple rbds on the s protein trimer enables the iggs to inhibit ace binding with enhanced potency due to avidity. next, we explored whether we could further enhance the avidity of nabs by taking advantage of modular design strategies to engineer tetravalent formats. each sars-cov- particle displays multiple s protein trimers, suggesting that multivalent fab binding could enhance avidity, especially since a single igg molecule can utilize both fab arms to bind a single s protein trimer. we reasoned that additional fab arms added to an igg may further enhance avidity by interacting with rbds on s protein trimers close to the trimer engaged by the core igg. thus, we designed tetravalent versions of and - by fusing additional fabs to either the n- or c-terminus of the igg heavy chain to construct molecules termed fab-igg or igg-fab, respectively (fig. a) .
consistent with our hypothesis, the tetravalent molecules exhibited higher avidity, and consequently, greatly reduced off rates compared with their bivalent counterparts, and dissociation constants were in the low single-digit picomolar range (fig. b, table ). our ultimate aim was to produce therapeutic abs that could be used to treat covid- in patients. aside from high affinity and specificity, effective ab drugs must also possess favorable biophysical properties including high yields from recombinant expression in mammalian cells, high thermodynamic stability, and lack of aggregation and excessive hydrophobic surface area. all iggs and tetravalent molecules were produced in high yields by transient expression in expi f cells ( - mg/l, table ). all proteins were highly thermostable with melting temperatures of the ch /fab domain ranging from - °c, which exceeded the melting temperature of the trastuzumab fab ( . °c, table ). size exclusion chromatography revealed that all iggs eluted as a predominant monodisperse single peak with elution volumes nearly identical to that of trastuzumab (fig. c and table ), and the monomeric fraction was calculated to be to > % (table ).
to explore neutralization of potential escape mutants, we generated hiv-gag-based lentivirus-like particles (vlps) pseudotyped with the sars-cov- s protein. we confirmed ace -dependent uptake of the pseudotyped vlps by hek- cells stably over-expressing exogenous ace , and we showed that uptake was inhibited by either fc-tagged rbd (rbd-fc) or igg . within this system, we generated a panel of pseudotyped vlp variants, each containing a single alanine substitution at an rbd position within or close to the ace -binding site. twenty of these vlp variants exhibited a > -fold reduction in internalization compared with the wild-type (wt) vlp, suggesting that these wt side chains contributed favorably to the interaction between the rbd and ace .
the remaining vlp variants were internalized with high efficiency, and these represent good mimics of escape mutants, which maintain strong ace -mediated infectivity but may potentially reduce binding to nabs that compete directly with ace . with the panel of vlp variants that mimicked potential escape mutants, we surveyed the effects on cellular uptake after treatment with various nabs (fig. b) . we defined as escape mutants those vlp variants for which cellular uptake in the presence of nm nab was > % of the uptake in the absence of the nab. based on this definition, we found that of the mutations enabled escape from igg , whereas only three mutations enabled escape from igg - . presenting the paratope in tetravalent formats resulted in nabs that could neutralize more variants than igg , and most importantly, tetravalent nabs containing the - paratope strongly neutralized all variants except one. as expected, these results showed that enhancing the avidity of the igg paratope for the s protein enhanced both potency and resistance to escape mutations. moreover, similar enhancements were also achieved by the presentation of paratopes in tetravalent rather than bivalent formats, and the most effective nabs were those that presented the optimized paratope in the tetravalent format. discussion sars-cov- has wreaked havoc on global health and economics, and along with its relatives sars-cov and mers, has shown that viral outbreaks and pandemics will continue to plague the world in the future. consequently, it is essential for the scientific community to adapt the most advanced drug development technologies to combat not only covid- , but also, pathogenic disease in general. in this context, we have deployed advanced synthetic antibody engineering to rapidly develop human nabs, which are potent therapeutic candidates in the natural igg format, and are even better neutralizing agents in the synthetic tetravalent formats that our modular design strategies enable. 
most importantly, the enhanced affinities and potencies afforded by tetravalent nabs are achieved without compromising any of the favorable characteristics that make igg molecules ideal drugs. moreover, tetravalent nabs resist potential escape mutants, which further augments the power of these molecules as drugs to combat not only sars-cov- , but also, its relatives that may emerge in the future. covid- has also exposed the need for drug development to respond to viral outbreaks
ab for the rbd, residues in the ace -binding site are also shown as colored surfaces, and the following color scheme was used: red, contacts with both fab - and ace ; blue, contacts with fab - only; yellow, contacts with ace only. fab - residues that contact the rbd are colored magenta or cyan if they reside in the light or heavy chain, respectively. the cdr-l residues that differ between - and are shown as red spheres.
the vlps were treated with nm of the indicated nab (x-axis) and uptake by ace -expressing hek- cells was measured in triplicate and results are representative of n= independent experiments. the heat map shows uptake normalized to uptake in the absence of nab. boxed cells indicate vlps that represented escape mutants for a given nab, as defined by > % uptake with nab treatment compared with untreated control (the percent uptake is shown in each cell).
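the escape-mutant definition used for the heat map (uptake with nab expressed as a percentage of uptake without nab, compared against a cutoff) can be written as a short calculation. the following is a minimal sketch in python, not the authors' analysis code: the function names, the variant labels, and the example readings are hypothetical, and the cutoff is left as a parameter rather than fixed to any value.

```python
from statistics import mean


def normalized_uptake(with_nab, without_nab):
    """Return mean uptake with nAb as a percentage of mean uptake without nAb."""
    baseline = mean(without_nab)
    if baseline <= 0:
        raise ValueError("untreated uptake must be positive")
    return 100.0 * mean(with_nab) / baseline


def escape_variants(readings, threshold_pct):
    """Return the sorted names of variants whose normalized uptake exceeds the cutoff.

    readings maps a variant name to a pair (replicates with nAb, replicates without nAb).
    threshold_pct is the cutoff in percent; it is an input, not a value taken from the text.
    """
    flagged = []
    for variant, (with_nab, without_nab) in readings.items():
        if normalized_uptake(with_nab, without_nab) > threshold_pct:
            flagged.append(variant)
    return sorted(flagged)


# Hypothetical triplicate readings in arbitrary units: (with nAb, without nAb).
example_readings = {
    "variant_1": ([80.0, 90.0, 100.0], [100.0, 100.0, 100.0]),
    "variant_2": ([5.0, 10.0, 15.0], [100.0, 100.0, 100.0]),
}
```

with these made-up readings, `escape_variants(example_readings, 50.0)` flags only `variant_1`, since its uptake in the presence of nab stays at 90% of the untreated control.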
key: cord- -db rqz d authors: kalathiya, umesh; padariya, monikaben; mayordomo, marcos; lisowska, małgorzata; nicholson, judith; singh, ashita;
baginski, maciej; fahraeus, robin; carragher, neil; ball, kathryn; haas, juergen; daniels, alison; hupp, ted r.; alfaro, javier antonio title: highly conserved homotrimer cavity formed by the sars-cov- spike glycoprotein: a novel binding site date: - - journal: j clin med doi: . /jcm sha: doc_id: cord_uid: db rqz d an important stage in severe acute respiratory syndrome coronavirus (sars-cov- ) life cycle is the binding of the spike (s) protein to the angiotensin converting enzyme- (ace ) host cell receptor. therefore, to explore conserved features in spike protein dynamics and to identify potentially novel regions for drugging, we measured spike protein variability derived from viral genomes and studied its properties by molecular dynamics (md) simulation. the findings indicated that s subunit (heptad-repeat (hr ), central helix (ch), and connector domain (cd) domains) showed low variability, low fluctuations in md, and displayed a trimer cavity. by contrast, the receptor binding domain (rbd) domain, which is typically targeted in drug discovery programs, exhibits more sequence variability and flexibility. interpretations from md simulations suggest that the monomer form of spike protein is in constant motion showing transitions between an “up” and “down” state. in addition, the trimer cavity may function as a “bouncing spring” that may facilitate the homotrimer spike protein interactions with the ace receptor. the feasibility of the trimer cavity as a potential drug target was examined by structure based virtual screening. several hits were identified that have already been validated or suggested to inhibit the sars-cov- virus in published cell models. in particular, the data suggest an action mechanism for molecules including chitosan and macrolides such as the mtor (mammalian target of rapamycin) pathway inhibitor rapamycin. 
these findings identify a novel small molecule binding site formed by the spike protein oligomer that might assist in future drug discovery programs aimed at targeting the coronavirus (cov) family of viruses.
the global pandemic that has developed since december is caused by a strain of severe acute respiratory syndrome coronavirus (sars-cov- ), which causes coronavirus disease (covid- ). this emergent strain adds to the other coronavirus strains that can infect humans, including human coronavirus oc (hcov-oc ), human coronavirus hku (hcov-hku ), sars-cov, human coronavirus e (hcov- e), human coronavirus nl (hcov-nl ), and human coronavirus nl (hcov-nl ) [ ] [ ] [ ] [ ] [ ] [ ] . coronaviruses (covs) are positive-sense, enveloped, single-stranded rna viruses that are classified taxonomically in the family coronaviridae and order nidovirales [ ] . there are four genera of covs, namely αcov, βcov, δcov, and γcov; most δcovs and γcovs target avians, whilst αcovs and βcovs infect rodents and bats [ , , ] . severe acute respiratory syndrome cov (sars-cov) outbreaks have also emerged previously, creating an epidemic [ , , [ ] [ ] [ ] [ ] [ ] . although the mortality of mers-cov, sars-cov, and sars-cov- is substantial, there are no preventative vaccines or drugs available to treat patients infected with the virus [ , , ] . the world health organization (who) has declared a public health emergency of international concern (pheic) and has characterized sars-cov- (covid- ; a novel βcov) as a pandemic threat. the data obtained from who ( /may/ ) indicate that the virus has caused , , infections and , deaths, and that it has affected over countries. the sars-cov- genome, including its open reading frame ab (orf ab), encodes three proteins that are broadly recognized as drug targets, since they are key components for infection and disease progression: the sars-cov- protease [ , ] , the rna-dependent rna polymerase (rdrp) [ , , ] , and the sars-cov- spike (s) glycoprotein [ , [ ] [ ] [ ] .
the sars-cov- protease processes the polyproteins that are translated from the viral rna, and it has been heavily studied using small molecules inhibitors [ ] . to penetrate the host, the sars-cov- makes use of homotrimeric class i glycosylated fusion spike protein [ , , ] . fusion of the viral and host cell membranes is facilitated by the spike glycoprotein, which undergoes a significant conformational change upon fusion [ , , ] . sars-cov- studies suggest [ , , ] that the spike glycoprotein functions as a homotrimer. the recognition and subsequent fusion of the viral and cellular membranes are triggered by the s subunit of the spike protein, which binds the host cell receptor; angiotensin converting enzyme- (ace ) [ , [ ] [ ] [ ] [ ] [ ] [ ] [ ] . several insights from structural biology are consistent with the role for this domain in affecting the infection rate of the virus. this host-virus interaction is mediated by the receptor binding domain (rbd) domain from s subunit of sars-cov- spike glycoprotein that forms a hinge-like conformation [ , ] , i.e., "down" and "up" states that represents the host cell receptor-inaccessible and receptor-accessible [ ] . this receptor-accessible "up" conformation exists in a highly fluctuating state [ ] [ ] [ ] [ ] . binding to the host target destabilizes the pre-fusion homotrimer, which sheds off the s subunit, and allows for the transition of the s subunit to a highly stable postfusion conformation [ ] . interestingly, protein-mediated cell-cell fusion assays suggest that sars-cov- spike protein displays an elevated plasma membrane fusion capacity when compared to that of sars-cov [ , ] . several studies have aimed to define the mechanism of binding of sars-cov- to the host cell receptor [ ] . molecular dynamics simulations of the spike (rbd)-ace complex, over ns indicated that spike(rbd)-ace binding free energy for sars-cov- is better than for the sars-cov [ ] . 
similarly, other studies have shown that the sars-cov- spike protein has a better binding affinity to ace at two different "up" angles of the rbd domain than that of sars-cov [ ] . structural features at the spike-ace interface suggest that residues q and p from the spike rbd domain are responsible for maintaining protein-protein stability [ ] . using a virtual high-throughput screening approach, small molecules have been identified that can interact with the rbd domain of sars-cov- spike protein [ ] . natural compounds present in curcuma sp., citrus sp., alpinia galanga, and caesalpinia sappan could also target the rbd domain of the sars-cov- spike glycoprotein, the protease domain (pd) from ace , and the sars-cov- protease [ ] . a set of b cell and t cell epitopes derived from the spike and nucleocapsid proteins that map identically to sars-cov- proteins were identified as potential vaccine candidates [ ] . applying an integrative, antiviral drug repurposing methodology, the interplay between the cov-host interactome and drug targets in the human protein-protein interaction network has been defined [ ] . bioinformatics methodologies were used to identify neutralizing antibodies that might interact with interfaces formed by the spike glycoprotein and the ace host cell receptor [ ] . by targeting the rbd domain of the spike protein using docking experiments, kanishka et al. identified small molecule inhibitors [ ] . across these studies, the most common strategy is to target the interface formed by the sars-cov- spike glycoprotein and the ace host cell receptor (i.e., spike(rbd)-ace ). currently, there are no robust drugs available for widespread use against coronaviruses, including the sars-cov- virus. due to the relatively rapid spread in the current outbreak and the relatively high mortality rate ( . %), more rapid development of new or repurposed antiviral drugs is of high value.
although the majority of drug discovery programs target classically druggable enzymes encoded by the virus, such as the viral rna polymerase inhibited by remdesivir [ , , ] , there is a paucity of information concerning the other regions of spike glycoprotein outwith the ace -binding domains, especially the domains interacting with the viral membrane. the sars-cov- spike protein is a homotrimer composed of three monomers (chains a, b, and c; figure a ). each monomeric protein contains an n-terminal ace binding domain (receptor binding domain; rbd), a central helix/heptad repeat, and a c-terminal region that interacts with the plasma membrane [ ] . homotrimer spike protein assembly from monomeric forms can be rate limiting in cells, suggesting a possible space for intervention on the viral life cycle [ ] . our current study focuses on understanding the variability of the trimer spike glycoprotein in sars-cov- with respect to the genomes from other coronavirus strains, and identifying the changes in the molecular properties due to conformational flexibility in the spike protein. the analysis suggests that residues in the s subunit are less variable compared to the other regions. in addition, the molecular dynamics simulations (mds) identified that residues from the rbd domain obtained substantial flexibility which may be an obstacle in finding active hits. by contrast, residues in the s subunit (trimer cavity) showed the least flexibility representing a novel binding region for ligands. this information was used to identify potentially novel drug pockets or the active site regions specifically in the oligomeric sars-cov- spike glycoprotein. we performed md simulations on the monomeric and trimeric form of the sars-cov- spike glycoprotein, and developed a virtual screening using a food and drug administration (fda) approved chemical library. 
we identified and focused on an apparent cavity formed by three subunits (the homotrimer), that our simulations suggest can mediate dynamic movements that mimic a "bouncing spring" or a "sarrus linkage (converting a circular motion to a linear motion or vice versa)" when interacting with the ace host cell receptor. this motion might be important in the fusion of the virion and the host cell membrane. we hypothesized that such a cavity formed by three monomers or subunits of the spike protein (i.e., chains a, b, and c) might form an acceptor for small molecules, and we asked whether small molecules could be identified with a relatively high binding energy. we identified several known compounds with predicted binding energy of gbvi/wsa dg (generalized-born volume integral/weighted surface area) from − to − kcal/mol, some of which are already proposed for clinical trials including an mtor (mammalian target of rapamycin) pathway inhibitor, sirolimus (rapamycin; a macrolide type; nct not yet recruiting) [ ] [ ] [ ] and ritonavir (open-label trial in hospitalized adults with severe covid- ) [ , [ ] [ ] [ ] . a recent study that screened hundreds of approved molecules in a sars-cov- assay using artificial intelligence-enabled phenomic assays [ ] , also identified sirolimus (rapamycin) as a promising candidate. in addition to the macrolides, one of the top hits we have identified, chitosan, has a recently reported derivative inhibiting sars-cov- coronavirus replication in cell lines [ , ] . a previous study has also shown that the chitosan derivatives can interact with the spike protein and block its interaction with the host receptor [ ] . our data suggest a mechanism whereby chitosan (and possibly its derivatives), as well as macrolide type molecules, might bind to a pocket formed by the spike protein trimer and provide a novel domain to focus on for future drug discovery projects. [ , ] . (c) receptor binding domain (rbd) illustrating the "up" or "open" (pdb id. 
vsb [ ] ) and "down" or "closed" (pdb id. vxx [ ] ) conformation. a total of viral genome sequences were downloaded from the global initiative on sharing all influenza data platform (gisaid) [ ] , in order to define the evolutionary variability in different domains of the spike glycoprotein. only genomes with high coverage and complete sequences were selected. further filtering was applied to obtain complete sequences on the targeted domains which reduced the total number of strains to . total protein sequences were acquired from frame translation using the transeq tool from emboss (european molecular biology open software suite) package (version . . ) (european bioinformatics institute (embl-ebi), hinxton, cambridge, uk) [ ] . the amino acid chains from the spike glycoprotein were aligned to the reference protein (pdb id. vsb [ , ] ) using muscle [ ] . variations in the amino acid or the residue changes were scanned on the entire spike protein sequence, along with two areas of interest in the multiple alignment file, focusing on a subset of the s subunit (hr , ch, and cd domains) and the rbd domain ( figure and tables s -s ). the cryo-em (cryogenic electron microscopy) homotrimer structure of sars-cov- spike glycoprotein was retrieved from the protein data bank database (http://www.rcsb.org/pdb; pdb id. vsb; figure ) [ , ] . in addition, the missing amino acid (residues range: - , - , - , - , - , - , - , - , - , - , - , - , and - ) coordinates in the structure of sars-cov- spike glycoprotein were built using the swissmodel ( figure ) [ ] . molecular dynamics simulations were carried on the model systems as per the standardized pipelines [ ] [ ] [ ] (detailed method explained in the supplementary materials; file s ). the gromacs . . [ ] program (gromacs; groningen machine for chemical simulations, university of groningen, groningen, the netherlands) was used to perform md calculations assigning the charmm forcefield [ ] . 
we performed ns molecular dynamics simulations on two systems: (i) the monomeric form and (ii) the homotrimer form of the spike protein. in our analysis of the md simulations, the dynamics of the monomeric form of the spike protein serves as control to the homotrimer, which is the functional unit. initially, the model systems were energy minimized, which provides a base-line model structure and resolves poorly-resolved conformations often found in crystal structures [ , , , [ ] [ ] [ ] [ ] . a simulation box of solvent atoms is then added to enhance simulation realism. following that, using the npt (number of particles (n), system pressure (p), and temperature (t); isobaric-isothermal) thermodynamic ensemble, equilibration of the systems was performed to adjust solvent molecules with counter ions in the simulation box [ ] . these equilibrated systems were subsequently used to perform the final md production runs for ns, and results were analyzed using gromacs [ ] , biovia discovery studio (dassaultsystèmes, biovia corp., san diego, ca, usa), chimera, and visual molecular dynamics (vmd) tools [ ] [ ] [ ] . structure-based virtual screening (sbvs) is an application of in silico methods that identify promising lead molecules from chemical libraries or databases. these methods are computational counterparts of experimental biological evaluation methods, such as high-throughput screening (hts). fda approved drug libraries were retrieved from target molecule corp. (targetmol; www.targetmol.com) and selleck chemicals (selleckchem; www.selleckchem.com) vendors. the sbvs against the sars-cov- spike glycoprotein was performed using the molecular operating environment (moe; chemical computing group inc., montreal, qc, canada) package [ , ] . receptor-ligand binding or docking using the charmm forcefield [ ] was evaluated using the gbvi/wsa ∆g scoring function [ ] . the compounds showing best energies with the spike protein were selected for further analysis. 
gbvi/wsa ∆g is a forcefield-based scoring function that estimates the free energy of binding of the ligand from a given pose [ ] . in addition, we also selected the compounds that showed comparatively stable interactions with the homotrimer spike protein. applying the "triangle matcher" placement method, receptor-ligand docking was performed defining the receptor as rigid and ligands as flexible [ , ] .
we were interested in defining the evolutionary variance in the sars-cov- spike protein. understanding regions of high and low variance can identify domains that may be functionally conserved and potentially important to the virus life cycle, or those under positive evolutionary pressure whose selection might avoid the immune system. to examine the variability of the spike protein in sars-cov- and its different domains, a total of viral genome sequences were retrieved from gisaid [ ] . a global view of the mutation space of the virus is presented in figure a , which represents the amino acid substitutions in bins of amino acids across the spike glycoprotein. the hotspots of variation are mostly confined to the ntd and the rbd domains (figures b and b) . we investigated the variability in the entire sequence of the spike protein, focusing on the regions that showed low variability in the structure (figure ). across the entire spike protein sequence and all the regions of lower variability (figure a and table s ), the s subunit exhibited the lowest sequence variability (residue range: - ; figures b and c) . moreover, previous studies have identified that the active site region for this spike protein is located in the rbd domain, which interacts with the ace host cell receptor [ , , , [ ] [ ] [ ] , ] .
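the variability scan described above (aligning spike sequences to a reference and counting residue changes in bins along the protein) can be illustrated with a small calculation. this is a minimal sketch in python under stated assumptions, not the pipeline used in the study: the sequences are assumed to be already aligned to equal length, the gap character and the unknown-residue code "X" are assumptions, the toy sequences are made up, and the bin width is left as a parameter.

```python
def substitutions_per_position(reference, aligned_seqs, gap="-"):
    """Count, for each alignment column, the sequences that differ from the reference."""
    counts = [0] * len(reference)
    for seq in aligned_seqs:
        if len(seq) != len(reference):
            raise ValueError("all sequences must share the alignment length")
        for i, (ref_aa, aa) in enumerate(zip(reference, seq)):
            # Gaps and unknown residues are skipped so that missing data
            # is not scored as a substitution.
            if ref_aa == gap or aa in (gap, "X"):
                continue
            if aa != ref_aa:
                counts[i] += 1
    return counts


def bin_counts(counts, bin_size):
    """Sum per-position counts in consecutive windows of bin_size columns."""
    if bin_size <= 0:
        raise ValueError("bin_size must be positive")
    return [sum(counts[i:i + bin_size]) for i in range(0, len(counts), bin_size)]


# Toy alignment (hypothetical sequences, not spike data).
toy_reference = "MKTAYIAK"
toy_alignment = ["MKTAYIAK", "MRTAYIAK", "MKTA-IAQ"]
toy_counts = substitutions_per_position(toy_reference, toy_alignment)
```

regions with low binned counts across many genomes would correspond to the low-variability domains discussed in the text, while high counts mark hotspots.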
comparing the variability of the rbd domain and s subunit domains, the rbd domain was shown to contain more mutations in its region compared to the s subunit (hr , ch, and cd) domains (figure b ,c, and tables s -s ). these data suggest that during mutation by natural selection, the viral-host "arms race" might operate more frequently on the rbd domain. by contrast, the s subunit conservation is suggestive of an important core function where mutations cannot be tolerated. these findings prompted our focus on the s subunit as an important region to investigate for identifying potentially novel druggable pockets. we next traced the dynamics of different domains in the spike protein using md simulations (figure b ). the simulated model systems of the spike protein in the monomer and the homotrimer forms were first processed to check the stability of the protein. stability of the simulated spike protein in both forms in the solvent environment was traced by rmsds (root mean square deviation), a time dependent change in the non-hydrogen atoms (figure a ). the rmsd plots (figure a) suggest that the trimer form of the spike protein is more stable compared to the monomer form. in addition, chain a in trimer has a higher rmsd (~ Å) compared to the other two chains which is a consequence of the fact that the "up" (or ace -active) conformation [ ] induces flexibility. since the monomer form has a higher rmsds compared to the trimer (figure a) , we performed independent triplicates (mds was repeated three times) of md simulation for the monomer form ( figure s ). the findings from these replicates indicate that the monomer form has a higher rmsd compared to the trimer spike protein (figure a and figure s ). the root mean square fluctuations (rmsf) were computed on the cα atoms of each residue from the spike protein, in order to trace their flexibility and thereby define the motions of different domains (figure b ). 
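the two stability measures used here can be made concrete with a short example. the sketch below, in plain python, computes the rmsd of one frame against a reference and the per-atom rmsf about the time-averaged position. it assumes that every frame has already been least-squares fitted to the reference (the fitting step itself is not shown) and that coordinates are given as (x, y, z) tuples; the function names and the two-frame toy trajectory are hypothetical.

```python
import math


def rmsd(frame, reference):
    """Root mean square deviation between two pre-aligned coordinate sets."""
    if len(frame) != len(reference) or not frame:
        raise ValueError("coordinate sets must be non-empty and of equal size")
    squared = sum(
        (a - b) ** 2
        for atom, ref_atom in zip(frame, reference)
        for a, b in zip(atom, ref_atom)
    )
    return math.sqrt(squared / len(frame))


def rmsf(trajectory):
    """Per-atom root mean square fluctuation about the time-averaged position."""
    n_frames = len(trajectory)
    if n_frames == 0:
        raise ValueError("trajectory must contain at least one frame")
    n_atoms = len(trajectory[0])
    fluctuations = []
    for i in range(n_atoms):
        # Time-averaged position of atom i.
        mean_pos = [sum(frame[i][k] for frame in trajectory) / n_frames for k in range(3)]
        # Mean squared displacement of atom i from that average.
        msf = sum(
            (frame[i][k] - mean_pos[k]) ** 2 for frame in trajectory for k in range(3)
        ) / n_frames
        fluctuations.append(math.sqrt(msf))
    return fluctuations


# Toy trajectory: two frames, two atoms (hypothetical coordinates in angstroms).
toy_trajectory = [
    [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
    [(0.0, 0.0, 0.0), (3.0, 0.0, 0.0)],
]
```

in this toy case the first atom never moves and has an rmsf of zero, whereas the second atom oscillates about its mean position and has an rmsf of 1 Å; a flexible domain would show the second pattern across many residues.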
the rmsf findings in both forms (monomer and trimer) indicated that the amino acids in the rbd domain (residue range: - ) were highly fluctuating (figures b and b ). in addition, the triplicate md simulations of the monomer form also suggest that the rbd domain has a higher rmsf in all three simulation replicates (figure s ). these analyses correlate with previous studies [ ] [ ] [ ] [ ] ] . in particular, the amino acids in the range - , which are responsible for interacting with ace , were highly fluctuating. furthermore, examining other regions of the spike protein suggests that the s subunit domains (residue range: - ; hr , ch, and cd) showed the least fluctuations within the entire protein sequence (figure b ). this correlates with the cryo-em studies performed on the spike protein, which found that the s subunit is more stable [ ] compared to the rbd domain, and that this subunit is responsible for a highly stable postfusion conformation of the spike protein [ , ] . from the perspective of designing drugs, the more stable or less flexible a region is within a protein, the more reliably a hit molecule can be identified. in the case of the spike protein, the rmsf findings guided us towards focusing on the s subunit (figure b) . moreover, by tracing the residues involved in the h-bond interactions between two monomers (i.e., chains a-b, a-c, or b-c) of the homotrimer, we observed that the rbd domain residues were also involved in intermolecular interactions with each other, with high occupancy (%). this suggests that intermolecular interactions between chains in the homotrimer might equilibrate the spike protein, and might stimulate conversion from a "down" to an "up" conformation of the rbd domain that interacts with the ace receptor (figure s and table s ). the structural dynamics over the time course for the spike protein in the monomer and the homotrimer forms were monitored during md simulations (figure c,d) .
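hydrogen-bond occupancy, as used above for the inter-chain contacts, is the percentage of trajectory frames in which a given donor-acceptor pair is bonded. a minimal sketch in python follows, assuming the bonded pairs have already been detected per frame by some geometric criterion that is not shown; the function name and the residue labels in the example are hypothetical.

```python
def hbond_occupancy(pairs_by_frame):
    """Return the percentage of frames in which each residue pair is hydrogen bonded.

    pairs_by_frame is a list with one set per frame; each set holds the
    (residue_1, residue_2) pairs found to be hydrogen bonded in that frame.
    """
    n_frames = len(pairs_by_frame)
    if n_frames == 0:
        raise ValueError("at least one frame is required")
    counts = {}
    for pairs in pairs_by_frame:
        for pair in pairs:
            counts[pair] = counts.get(pair, 0) + 1
    return {pair: 100.0 * count / n_frames for pair, count in counts.items()}


# Hypothetical detection results for four frames (labels are chain:residue).
toy_frames = [
    {("A:R1", "B:D2")},
    {("A:R1", "B:D2"), ("A:K3", "C:E4")},
    set(),
    {("A:R1", "B:D2")},
]
toy_occupancy = hbond_occupancy(toy_frames)
```

here the a-b pair is present in three of four frames (75% occupancy) and the a-c pair in one (25%); pairs with high occupancy are the persistent inter-chain contacts.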
the monomeric spike protein in the solvent environment exhibited a movement from the "up" active state towards the "down" inactive state for the rbd domain (figure c, figure c , and movie s ). these observations are consistent with the previous findings that the rbd domain can form two different conformations, i.e., "down" and "up" states, which represent the host cell receptor-inaccessible and receptor-accessible states, respectively [ ] [ ] [ ] [ ] . the sars-cov- spike protein has a better binding affinity to the ace receptor at two different "up" angles of the rbd domain compared to that of sars-cov [ ] . figure c (and movie s ) describes the conformational change in other regions of the spike protein when the rbd domain moves from an "up" to a "down" state in the monomeric form. figure d (and movie s ) represents the dynamics of the homotrimer spike protein, suggesting that the rbd domain of chain a opens more widely in its "up" state. domains hr , ch, and cd, close to the viral transmembrane region, exhibited the least movement (figure d ) during md simulations. in addition, exploration of the structural orientation of these s subunit domains (figure d and movie s ) suggests that they form a large pocket or cavity using three chains (or monomers) from a homotrimer spike protein. the slight movement of this cavity observed in the homotrimer during md simulation (movies s and s ) and its structural orientation suggest that it could work as a "bouncing spring" or "sarrus linkage". one may postulate that, when the spike protein interacts with the ace receptor, this "bouncing spring" or "sarrus linkage" movement may be important in the fusion of the virion with the host membrane. additionally, this cavity from the spike protein could work as a platform for the design or development of new drug leads against this protein (figure d ). such molecules might alter the trimer stability upon viral entry or upon viral coat assembly.
several studies have been performed to design drugs specific for the sars-cov- spike protein [ , [ ] [ ] [ ] , ] ; however, most of them focus on the rbd domain. our md simulation and variability analysis (figures and ) show that the rbd domain is highly flexible and variable; drugging this variable site may therefore be an obstacle to finding active hit molecules. we suggest that targeting the less variable s region, such as the cavity formed by the homotrimer (figures and d), might be a novel approach to developing small-molecule drug leads. we next investigated the targetability of the trimer cavity formed by the s subunit (hr , ch, and cd domains) in the spike protein (figures and ) using the moe (chemical computing group inc.) package [ , ] , before using it for high-throughput virtual screening (structure-based virtual screening, sbvs) against a library of fda approved drugs. the "alpha shapes" geometric construction method [ , ] was used to compute the residues of this trimer cavity that can be considered for ligand docking (figure a ). high-throughput virtual screening is a powerful computational approach that is increasingly used in the drug discovery process for the in silico identification of novel hits from large compound databases [ ] . we applied the sbvs approach to dock the molecules to the trimer cavity and to check its feasibility as a target. ligand binding to this cavity might reduce or increase the "bouncing spring" movement in the spike protein observed in md simulations (figure , movies s and s ). this perturbation might affect its interactions with the host cell receptor or the hinge movement of the rbd domain. compounds that exhibited a relatively high binding affinity towards the sars-cov- spike glycoprotein trimer cavity, in the range − to − kcal/mol (gbvi/wsa dg), were recorded.
from the list of ligands showing the best binding, the compounds that were already validated or suggested to be active against the sars-cov- virus include: chitosan [ ] [ ] [ ] , rapamycin [ ] [ ] [ ] , everolimus (rad ) [ ] , paclitaxel [ ] , ritonavir [ , [ ] [ ] [ ] , selameerin (selamectin) [ ] , and danoprevir [ ] (table ) . among these molecules, rapamycin and everolimus were previously identified as mtor pathway inhibitors [ , , [ ] [ ] [ ] . the antibacterial and antiparasitic drugs from the list are chitosan [ ] and selameerin (selamectin) [ ] , respectively. paclitaxel has previously been found to target bcl- and microtubule-associated functions [ , ] . in addition, the fda approved drugs that target the protease are ritonavir [ ] and danoprevir (itmn- ) [ ] . docking known drugs within the trimer cavity of the spike protein suggests that the cavity is relatively selective: the majority of higher-affinity drugs have a molecular weight (mw) ≥~ g/mol (table and figure s ). this should be read with the understanding that compounds with high mw can form more interactions with the spike protein. in addition, our finding highlights the possibility that the trimer cavity can accommodate large ligands deep inside the binding pocket (figure a ). in particular, a specific class of ligands (mostly macrolide type) was found to fit the trimer cavity better (figure a); for example, rapamycin [ ] [ ] [ ] , everolimus (rad ) [ ] , paclitaxel [ ] , and selameerin (selamectin) [ ] (figure a ). the intermolecular interactions between the spike protein and the compounds suggest that residues from all three monomers (chains a, b, and c) are actively involved in binding to the drugs. in addition, the placement of the compounds inside the trimer cavity suggests that they make use of the pocket space (adopting different conformations) to form stable interactions with the spike protein (figure a ).
the ligands that were found interacting with the homotrimer cavity with high binding affinity were also docked with an interface formed by the spike proteins (rbd domain; pdb id. lzg [ ] ) that interact with the ace receptor. sbvs, structure-based virtual screening. in order to check the selectivity of these ligands for the trimer cavity, we docked this same subset (table ) with the rbd domain of the viral spike protein (figure b ; pdb id. lzg [ ] ). the rbd domain is involved in interacting with the ace host cell receptor [ , , , [ ] [ ] [ ] [ ] [ ] , ] . the docking suggests that all compounds from table have better binding affinity to the trimer cavity than to the rbd domain. in addition, chitosan [ ] [ ] [ ] (a linear polysaccharide; − . kcal/mol) could adopt a linear conformation when binding to the rbd domain (figure b and table ), whilst the same ligand (due to the nature of its molecular structure) can adopt a slightly folded shape (as shown in the d-diagram; figure a ) within the trimer cavity. by contrast, everolimus [ ] (a macrolide type) exhibits high affinity for the trimer pocket and very little selectivity for the rbd domain (table ) . moreover, ligands (figure b ) that interact with the rbd domain overlap with the region bound by cr (a neutralizing antibody isolated from a convalescent sars patient that interacts with the receptor binding domain of the sars-cov- spike protein [ ] ). the sars-cov- virus causing covid- disease uses the fusion spike glycoprotein to penetrate the host cell; this protein is therefore a critical intervention point in the viral life cycle, and a detailed understanding of it is needed. we interrogated the spike protein with a diversity of computational approaches. first, the variability in the spike protein from different viral genome sequences was evaluated. residues in the s subunit (residue range: - ; hr , ch, and cd domains) were found to be less evolutionarily variable compared to other regions or domains.
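the site-selectivity comparison described above (the same ligands docked to the trimer cavity and to the rbd domain) amounts to a per-ligand difference of docking scores. the sketch below illustrates that bookkeeping only; the function name `site_preference`, the ligand labels, and the score values are hypothetical placeholders, not values from the study. it assumes scores in kcal/mol where a more negative score means stronger predicted binding.

```python
def site_preference(cavity_scores, rbd_scores):
    """Per-ligand score difference between two docking sites.

    Both arguments map ligand name -> docking score in kcal/mol
    (more negative = stronger predicted binding). The returned value
    is rbd_score - cavity_score, so a positive number means the ligand
    is predicted to prefer the trimer cavity over the RBD domain.
    Ligands missing from either table are skipped.
    """
    return {
        name: rbd_scores[name] - cavity_scores[name]
        for name in cavity_scores
        if name in rbd_scores
    }


# Hypothetical placeholder scores, for illustration only.
example_cavity = {"ligand_a": -9.0, "ligand_b": -7.5}
example_rbd = {"ligand_a": -6.0, "ligand_b": -7.5}
example_pref = site_preference(example_cavity, example_rbd)

# Rank ligands from most to least cavity-selective.
example_ranked = sorted(example_pref, key=example_pref.get, reverse=True)
```

a difference of docking scores is only a rough ranking heuristic; scores from different binding sites are not guaranteed to be directly comparable.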
by contrast, residues h y, q k, v f, v a, s p, k p, and v p were found to be the most common amino acid substitutions in the spike protein from related viruses. secondly, md simulations revealed that residues in the rbd domain (residue range: - ) were more flexible than residues in the s subunit, which complicates drug design strategies. an examination of less variable regions revealed that the hr , ch, and cd domains (s subunit), located close to the viral transmembrane, form a large cavity or pocket built from three spike monomers. the md simulation traced an "up" active state and a "down" inactive state of the spike protein in its monomer form. slight movement of the trimer cavity within this structural orientation suggests that it could work as a "bouncing spring" or "sarrus linkage" when interacting with the host cell receptor. the conversion between "up" and "down" states in the monomer form of the spike protein observed with the in silico methods is considered relatively fast by the standards of the md field. nevertheless, different structural isoforms of the spike protein have been identified using different experimental methods or virus strains. this indicates that, although conversion may be quick, there are structural endpoints which are "stable". in the recent cryo-em structure of the sars-cov- spike protein [ ] , an asymmetric hinge-like movement was observed in only one of the three rbd domains in the s subunit, which was also observed in mers-cov and sars-cov [ , ] . however, there are also other structures where all three rbd domains are in the "up" or "down" conformation [ , , , , ] . these data suggest that the heterogeneous conformational dynamics of the protein are physiologically relevant.
for example, asymmetric conformational flexibility might have a functional role, perhaps in evading the exposure of b-cell epitopes (only one rbd domain is in the "up" conformation) and/or in optimizing interaction with the ace receptor depending on virus strain. in addition, because of the "bouncing spring" mechanism (communication between the trimer pocket and the rbd domain conformation), it is possible that these different spike protein conformational isoforms provide another avenue for drug discovery programs that exploit and/or circumvent these dynamics. our investigation into the genomic variation within virus strains, as well as our findings from the md simulations, identified a conserved trimer cavity or pocket formed by the s subunit in the spike protein. these findings suggest that a novel target, "the trimer cavity formed by spike protein oligomerization", may be suitable for manipulating viruses of this class. targeting the trimer pocket might identify a new functional class of drugs against this protein. applying the sbvs approach, we docked drug libraries against the trimer cavity with the hypothesis that such a ligand might perturb the predicted "bouncing spring" movement and the homotrimer formation. protein-ligand docking identified several hits that have already been published or proposed to inhibit the sars-cov- virus in cell systems. for example, our studies suggest a mechanism of action for molecules such as chitosan and macrolide types (e.g., rapamycin). based on the sequence variability of the coronavirus, together with our findings from md simulations of the spike protein, a conserved trimer cavity (hr , ch, and cd domains) is a feature of the spike protein in most coronaviruses. consistent with this, previous work has shown that the molecule ek exhibited potent inhibitory activity against all human coronaviruses (hcovs) tested through binding to the c-terminal hr domain [ ] .
additionally, the "up" and "down" conformations of the rbd domain observed during md simulations support the concept that the spike protein can also be a target of a possible igg therapeutic [ ] . from the list of the top compounds identified that dock into the trimer cavity, some have already been validated or suggested as sars-cov- virus inhibitors in cells, including a chitosan derivative [ ] [ ] [ ] , rapamycin [ ] [ ] [ ] , everolimus (rad ) [ ] , paclitaxel [ ] , ritonavir [ , [ ] [ ] [ ] , selameerin (selamectin) [ ] , and danoprevir [ ] . among these, a modified polymeric version of the chitosan drug (a top hit in our analysis) was recently shown to inhibit cov replication, with evidence that the molecule inhibits the binding of the viral spike protein to the host ace receptor [ ] [ ] [ ] . the protein-protein interaction map and the network-based methodologies [ , ] suggest that sirolimus (rapamycin) emerges as a common potential drug lead for repurposing against covid- . this rapamycin (mtor inhibitor) drug was previously found to disrupt larp (la-related protein ) and mtorc (mammalian target of rapamycin complex ) binding, and has been shown to reduce mers infection by ~ % in vitro [ ] . the postulated geroprotectors, such as sirolimus (rapamycin) and its close derivative, the rapalog everolimus (rad ), decreased infection rates in a small sample of elderly patients [ ] . moreover, the drugs sirolimus (rapamycin) and ritonavir are currently in clinical trials for repurposing against covid- [ , , ] . sirolimus (rapamycin) is registered in a clinical trial (nct not yet recruiting) designed to evaluate adjunctive use of sirolimus (rapamycin) and oseltamivir in patients hospitalized with influenza [ , ] . ritonavir, an hiv protease inhibitor, is in an open-label trial in hospitalized adults with severe covid- [ , ] . the data from this small-sample clinical study showed that danoprevir boosted by ritonavir is safe and well tolerated in all patients [ ] .
selamectin is a potential drug for treating covid- that was found to be active against the pangolin coronavirus gx_p v, a workable model for sars-cov- research [ ] . the antitumor drug paclitaxel increases cellular methylglyoxal to virucidal levels, providing a rationale for repurposing doxorubicin and paclitaxel for covid- treatment [ ] . nevertheless, whether the hit molecules we have identified dock into the trimer cavity and affect the virus life cycle requires orthogonal validation. we hope the findings of our study can help to understand the function of the highly conserved spike interspecies transmission and emergence of novel viruses: lessons from bats and birds severe acute respiratory syndrome coronavirus as an agent of emerging and reemerging infection middle east respiratory syndrome coronavirus: another zoonotic betacoronavirus causing sars-like disease genomic characterization of the novel human-pathogenic coronavirus isolated from a patient with atypical pneumonia after visiting wuhan coronavirus hku and other coronavirus infections in hong kong perspectives on monoclonal antibody therapy as potential therapeutic intervention for coronavirus disease- (covid- ). 
asian pac a novel bat coronavirus reveals natural insertions at the s /s cleavage site of the spike protein and a possible recombinant origin of hcov- composition and divergence of coronavirus spike proteins and host ace receptors predict potential intermediate hosts of sars-cov- responding to global infectious disease outbreaks: lessons from sars on the role of risk perception, communication and management characterization and complete genome sequence of a novel coronavirus, coronavirus hku , from patients with pneumonia coronavirus as a possible cause of severe acute respiratory syndrome mers coronavirus induces apoptosis in kidney and lung by upregulating smad and fgf an orally bioavailable broad-spectrum antiviral inhibits sars-cov- and multiple endemic, epidemic and bat coronavirus a sars-cov- -human protein-protein interaction map reveals drug targets and potential drug-repurposing crystal structure of sars-cov- main protease provides a basis for design of improved α-ketoamide inhibitors structure of the sars-cov- spike receptor-binding domain bound to the ace receptor mechanism of inhibition of ebola virus rna-dependent rna polymerase by remdesivir cryo-em structure of the -ncov spike in the prefusion conformation perspectives on therapeutic neutralizing antibodies against the novel coronavirus sars-cov- revealing the potency of citrus and galangal constituents to halt sars-cov- infection structure, function, and evolution of coronavirus spike proteins the coronavirus spike protein is a class i virus fusion protein: structural and functional characterization of the fusion core complex preliminary identification of potential vaccine targets for the covid- coronavirus (sars-cov- ) based on sars-cov immunological studies spike protein binding prediction with neutralizing antibodies of sars-cov- angiotensin-converting enzyme is a functional receptor for the sars coronavirus the novel coronavirus ( -ncov) uses the sars-coronavirus receptor ace and the cellular 
protease tmprss for entry into target cells receptor recognition by the novel coronavirus from wuhan: an analysis based on decade-long structural studies of sars coronavirus a pneumonia outbreak associated with a new coronavirus of probable bat origin structural basis for the recognition of the -ncov by human ace structure, function, and antigenicity of the sars-cov- spike glycoprotein tectonic conformational changes of a coronavirus spike glycoprotein promote membrane fusion cryo-electron microscopy structures of the sars-cov spike glycoprotein reveal a prerequisite conformational state for receptor binding immunogenicity and structures of a rationally designed prefusion mers-cov spike antigen unexpected receptor functional mimicry elucidates activation of coronavirus fusion cryo-em structures of mers-cov and sars-cov spike glycoproteins reveal the dynamic receptor binding domains inhibition of sars-cov- infection (previously -ncov) by a highly potent pan-coronavirus fusion inhibitor targeting its spike protein that harbors a high capacity to mediate membrane fusion research and development on therapeutic agents and vaccines for covid- and related human coronavirus diseases molecular mechanism of evolution and human infection with the novel coronavirus ( -ncov) exploring the binding mechanism and accessible angle of sars-cov- spike and ace by molecular dynamics simulation and free energy calculation interaction of the spike protein rbd from sars-cov- with ace : similarity with sars-cov, hot-spot analysis and effect of the receptor polymorphism repurposing therapeutics for covid- : supercomputer-based docking to the sars-cov- viral spike protein and viral spike protein-human ace interface network-based drug repurposing for novel coronavirus -ncov/sars-cov- virtual screening of inhibitors against spike glycoprotein of novel corona virus: a drug repurposing approach the antiviral compound remdesivir potently inhibits rna-dependent rna polymerase from middle east 
respiratory syndrome coronavirus assembly of coronavirus spike protein into trimers and its role in epitope expression antiviral potential of erk/mapk and pi k/akt/mtor signaling modulation for middle east respiratory syndrome coronavirus infection as identified by temporal kinome analysis assessment of evidence for covid- -related treatments: updated / / geroprotective and senoremediative strategies to reduce the comorbidity, infection rates, severity, and lethality in gerophilic and gerolavic infections coronavirus information. iuphar/bps guide to pharmacology first clinical study using hcv protease inhibitor danoprevir to treat naive and experienced covid- patients identification of potential treatments for covid- through artificial intelligence-enabled phenomic analysis of human cells infected with sars-cov- htcc as a highly effective polymeric inhibitor of sars-cov- and mers-cov htcc: broad range inhibitor of coronavirus entry novel polymeric inhibitors of hcov-nl data, disease and diplomacy: gisaid's innovative contribution to global health emboss: the european molecular biology open software suite the rcsb protein data bank: redesigned web site and web services multiple sequence alignment with high accuracy and high throughput swiss-model: homology modelling of protein structures and complexes recognition dynamics of cancer mutations on the erp -tapasin interface insights into the effects of cancer associated mutations at the upf and atp-binding sites of nmd master regulator: upf structural, functional, and stability change predictions in human telomerase upon specific point mutations : a high-throughput and highly parallel open source molecular simulation toolkit implementation of the charmm force field in gromacs: analysis of protein stability effects from correction maps, virtual interaction sites, and water models particle mesh ewald: ann·log(n) method for ewald sums in large systems lincs: a linear constraint solver for molecular simulations canonical 
sampling through velocity rescaling polymorphic transitions in single crystals: a new molecular dynamics method a leap-frog algorithm for stochastic dynamics vmd: visual molecular dynamics ucsf chimera-a visualization system for exploratory research and analysis docking and scoring in virtual screening for drug discovery: methods and applications molecular operating environment (moe) . ; chemical computing group: montreal, qc, canada generalized born model: analysis, refinement, and applications to proteins the union of balls and its dual shape virtual screening in drug design vulnerabilities of the sars-cov- virus to proteotoxicity-opportunity for repurposed chemotherapy of covid- infection repurposing of clinically approved drugs for treatment of coronavirus disease in a -novel coronavirus ( -ncov) related coronavirus model targeted therapy and promising novel agents for the treatment of advanced soft tissue sarcomas synergistic activity of the mtor inhibitor ridaforolimus and the antiandrogen bicalutamide in prostate cancer models genome sequencing identifies a basis for everolimus sensitivity trimethylatedchitosans as non-viral gene delivery vectors: cytotoxicity and transfection efficiency efficacy of selamectin, spinosad, and spinosad/milbemycin oxime against the ks ctenocephalidesfelis flea strain infesting dogs. 
parasites vectors prior acquired resistance to paclitaxel relays diverse egfr-targeted therapy persistence mechanisms paclitaxel directly binds to bcl- and functionally mimics activity of nur pharmacological and therapeutic properties of ritonavir-boosted protease inhibitor therapy in hiv-infected patients danoprevir, a small-molecule ns / a protease inhibitor for the potential oral treatment of hcv infection the first-in-class peptide binder to the sars-cov- spike protein structural basis of receptor recognition by sars-cov- a highly conserved cryptic epitope in the receptor-binding domains of sars-cov- and sars-cov this article is an open access article distributed under the terms and conditions of the creative commons attribution (cc by) license the international centre for cancer vaccine science project is carried out within the international research agendas programme of the foundation for polish science co-financed by the european union under the european regional development fund. authors would also like to thank the pl-grid infrastructure, poland for providing their hardware and software resources. the authors declare no conflict of interest. the funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results. key: cord- -bbvlowyo authors: sang, eric r.; tian, yun; gong, yuanying; miller, laura c.; sang, yongming title: integrate structural analysis, isoform diversity, and interferon-inductive propensity of ace to predict sars-cov susceptibility in vertebrates date: - - journal: heliyon doi: . /j.heliyon. .e sha: doc_id: cord_uid: bbvlowyo the current new coronavirus disease (covid- ) has caused globally near . / million confirmed deaths/infected cases across more than countries. as the etiological coronavirus (a.k.a. 
sars-cov ) may putatively have a bat origin, its intermediate reservoir between bats and humans, and especially its tropism in wild and domestic animals, remain mostly unknown. this constitutes a major public health concern for the current pandemic and for potential zoonosis. previous reports using structural analysis of the viral spike protein (s) binding its cell receptor, angiotensin-converting enzyme (ace ), indicate a broad potential of sars-cov susceptibility in wild and particularly domestic animals. through integration of key immunogenetic factors, including the existence of s-binding-void ace isoforms and the disparity of ace expression upon early innate immune response, we further refine the sars-cov susceptibility prediction to fit recent experimental validation. in addition to showing a broad susceptibility potential across mammalian species based on structural analysis, our results also reveal that domestic animals including dogs, pigs, cattle and goats may evolve ace -related immunogenetic diversity to restrict sars-cov infections. thus, we propose that domestic animals may be unlikely to play a role as amplifying hosts unless the virus undergoes further species-specific adaptation. these findings may relieve relevant public concerns regarding covid- -like risk in domestic animals, highlight virus-host coevolution, and suggest disease intervention through targeting ace molecular diversity and interferon optimization. erupting in china last december, the novel coronavirus disease has become a worldwide pandemic and caused near . million confirmed deaths and million infected cases across countries by the end of may [ , ] . the etiological virus, designated as severe acute respiratory syndrome coronavirus (sars-cov ), has been identified [ ] and related to the viruses that previously caused sars or middle east respiratory syndrome (mers) in humans in and , respectively [ ] . 
these three human-pathogenic coronaviruses putatively evolved from bat coronaviruses, but have different animal tropisms and intermediate reservoirs before transmission to humans [ , ] . whereas civet cats and camels were retrospectively determined to be reservoirs for sars and mers, respectively, there is as yet no conclusion about which animal species passed sars-cov to humans [ , ] . investigations indicated that carnivora animals including raccoon dogs, red foxes, badgers and minks, as well as swine to a lesser extent, are susceptible to sars virus infections [ , ] . although viral nucleic acids and antibodies to mers were detectable in multiple ruminant species including sheep, goat, and donkeys, virus inoculation studies did not result in a productive infection for mers disease in these domestic ruminants, nor in horses [ , ] . as a group of obligate pathogens, viruses need to engage cell receptors to enter cells and to race against host immunity to replicate and spread effectively and so initiate a productive infection [ ] . in this context, the spike proteins protruding from the coronavirus surface are responsible for cell receptor binding and for mediating viral entry [ , , ] . for example, mers-cov adopts dipeptidyl peptidase (dpp , a.k.a cd ) and sars-cov uses angiotensin-converting enzyme (ace ) as primary receptors for cell attachment and entry [ , , , , , ] . several groups have reported that sars-cov uses the same ace receptor as sars-cov, but exerts higher affinity to human ace , which may account for the efficacy of sars-cov infection in humans [ , ] . after cell attachment via the receptor binding domain (rbd) in the n-terminal s region of the s protein, the c-terminal s region engages in membrane fusion. further cleavage of s from s by a furin-like protease will release and prime the virus for entering the recipient cells. 
several furin-like proteases, especially a broadly expressed trans-membrane serine protease (tmprss ), are adopted for priming sars-cov entry [ , ] . studies showed that, compared with sars-cov, the sars-cov spike protein has also evolutionarily obtained an additional furin-like proteinase cleavage site within the s /s junction region for efficient release from the cell surface and entry into the cells [ , , , ] . because tmprss is widely expressed, the tissue-specific expression of ace has been shown to determine sars-cov cell tropism in humans [ , ] . namely, human nasal secretory cells, type ii pneumocytes, and absorptive enterocytes are ace -tmprss double positive and highly permissive to sars-cov infection [ , ] . regarding cross-species animal tropism, the potential infectivity of sars-cov in both wild and domestic animals has raised a major public health concern following the prevalence of sars-cov infections in humans [ , ] . this concern involves two aspects: ( ) screening to identify the animal species that served as the virus reservoir originally passing sars-cov to humans; and ( ) the existing risk of infected people passing the virus to animals, particularly domestic species, thus potentially amplifying the zoonotic cycle and worsening sars-cov evolution and prevalence [ , ] . by diagnosing animals in close contact with covid- patients or screening animal samples in some covid- epidemic zones, studies detected that domestic cats and dogs could be virally or serologically positive for sars-cov [ , , , , , , ] , as was a reported infection in a zoo tiger [ ] . using controlled experimental infection with human sars-cov isolates, several studies demonstrated that ferrets, hamsters, domestic cats and some non-human primate species are susceptible to human sars-cov strains [ , , , , , , , ] . obviously, it is impractical to test sars-cov susceptibility experimentally in all animal species. 
using structural simulation based on published structures of the viral s-rbd/ace complex, studies have predicted a broad spectrum of vertebrate species with high potential for sars-cov susceptibility, which, if true, entails unexpected risks in both public and animal health, and warrants further critical evaluation [ , , ] . ace is a key enzyme catalyzing the further conversion of angiotensin (agt) into numerous active forms of agt - , which are hormonal mediators in the body's renin-angiotensin system (ras) [ , ] . thus, ace plays a regulatory role in blood volume/pressure, body fluid balance, and sodium and water retention, as well as immune effects on apoptosis, inflammation, and generation of reactive oxygen species (ros) [ , ] . in line with this, the expression of ace is also inter-regulated by immune mediators pertinent to its systemic function. multiple physio-pathological factors, including pathogenic inflammation, influence the ras by acting on ace expression [ , , ] . the interferon (ifn) response, especially that mediated by type i and type iii ifns, comprises a frontline of antiviral immunity that restricts viral spreading from the initial infection sites, and therefore primarily determines whether a viral exposure becomes controlled or develops into a productive infection [ ] . most recent studies indicated that sars-cov has evolved viral antagonisms against ifn responses; however, viral infection was significantly inhibited in vitro or at the early phase in vivo using human ifn-α, ifn-β or type iii ifn-λ, indicating that ifns are potential anti-covid prophylactics [ , , ] . several recent studies revealed that the human ace gene behaves like an interferon-stimulated gene (isg) and is stimulated by viral infection and ifn treatment; however, the mouse ace gene is not [ , , ] . 
therefore, to determine the cell tropism and animal susceptibility to sars-cov , the cross-species ace genetic and especially epigenetic diversity in regulation of ace expression and functionality should be evaluated [ , , , , , , , , ] . in this study, through integration of structural analysis and key immunogenetic factors that show species-dependent differences, we critically refine the sars-cov susceptibility prediction to fit recent experimental validation [ , , , , , , , , , ] . along with showing a broad susceptibility potential across mammalian species based on structural analysis [ , , ] , our results further reveal that domestic animals including dogs, pigs, cattle and goats may evolve previously unexamined immunogenetic diversity to restrict sars-cov infections. the amino acid sequences of ace proteins and the dna sequences of the proximal promoters of each ace gene were extracted from ncbi gene and relevant databases (https://www.ncbi.nlm.nih.gov/gene). ace genes and corresponding transcripts have been well annotated in most representative vertebrate species. in most cases, the annotations were double verified through the same gene entries at ensembl (https://www.ensembl.org). the protein sequences were collected from all non-redundant transcript variants and further verified for expression using relevant rna-seq data (ncbi geo profiles). the proximal promoter region spans ~ . kb before the predicted transcription (or translation) start site (tss) of ace or other genes. the protein and dna sequences were aligned using the multiple sequence alignment tools clustalw or muscle through an embl-ebi port (https://www.ebi.ac.uk/). other sequence management was conducted using programs at the sequence manipulation suite (http://www.bioinformatics.org). sequence alignments were visualized using jalview (http://www.jalview.org) and megax (https://www.megasoftware.net). sequence similarity calculation and plotting were done using sdt . 
(http://web.cbio.uct.ac.za/~brejnev). unless otherwise indicated, all programs were run with default parameters. the phylogenetic analysis and tree visualization were performed using megax and an online program, evoview. the evolutionary history was inferred using the neighbor-joining method. the percentage of replicate trees in which the associated taxa clustered together in the bootstrap test ( , replicates) was also computed. the evolutionary distances were computed using the p-distance method and are in units of the number of amino acid differences per site. unless otherwise indicated, all programs were run with the default parameters that the programs suggested. the structure files of human ace protein and its interaction with sars-cov s-rbd were extracted from the protein data bank under the files m and m j. the residue mutation and structure simulation were performed using ucsf chimera and pymol, available at https://www.cgl.ucsf.edu/chimera/ and https://pymol.org/, respectively. structural visualization was done using pymol. the binding affinity energy (Δg), dissociation constant (kd) and interfacial contacts between s-rbd and each ace were calculated using the prodigy algorithm at https://bianca.science.uu.nl/prodigy/. the regulatory elements (and pertinent binding factors) in the ~ . kb proximal promoter regions were examined against both human/animal tfd databases using the program nsite (version . , at http://www.softberry.com). the mean position weight matrix (pwm) of key cis-elements in the proximal promoters was calculated using pwm tools through https://ccg.epfl.ch/cgi-bin/pwmtools, and the binding
in particular, the expression of porcine ace isoforms and other relevant genes was examined in the porcine lung macrophage datasets. significantly differentially expressed genes (degs) between two treatments were called using the edger package and visualized using heatmaps or bar charts as previously described [ ].

the phylogenic tree of major identified ace orthologs/variants from different species was built with a neighbor-joining approach and visualized using the evoview program under default parameter settings. the prediction of sars-cov susceptibility is based on the sequence similarity of each ace to human ace in the s-rbd binding region and simulated using a published human ace -rbd structure ( m j), and refers to two recent publications using similar procedures but different structural models [ , ]. compared with the currently available experimental data, incongruence of the predicted sars-cov susceptibility is clearly demonstrated in pangolin, ferret, tiger, cat and horseshoe bat, indicating that some other factors besides ace -rbd affinity should be considered.

figure . prediction of sars-cov susceptibility in major livestock species based on the conservation of key interacting residues and binding capacity between the viral spike (s) protein and the host ace receptor. (a) sars-cov- uses the cell receptor angiotensin-converting enzyme (ace ) for entry and the serine protease tmprss and furin for s protein priming. (b) as tmprss is broadly expressed and active with a furin-like cleavage activity, the affinity adaptation of the s receptor binding domain (rbd) and ace receptor determines the viral permissiveness. the contacting residues of human ace (a distance cutoff . 
Å) at the sars-cov- rbd/ace interfaces are shown, and the contacting network involves at least residues in ace (listed in the table cells and referred to the aligned residue positions in human ace) and residues in the sars-cov- rbd (blue circles with residue labels), which are listed and connected with black lines (indicating hydrogen bonds) and a red line (representing a salt-bridge interaction). the cross-species residue identity (%) of these interacting residues in ace is listed in a broad range ( - %) [ , , ]. (c) we also detected several short ace isoforms (underlined) in the domestic animals including dog, pig, goat and cattle, which have an n-terminal truncation spanning - key residues in the contacting network to s-rbd but keep the enzyme active sites (indicated by yellow triangles), thus resulting in little engagement by the viral s protein and predicting an unexpected evolutionary advantage for relieving potential covid- risk caused by the viral engagement and functional distortion on the classical long ace isoforms in these animal species. the ncbi accession numbers of the ace orthologs are listed as in figure .

panels: human ace -rbd; dog-ace l-rbd.

table . prediction of binding affinity energy (Δg), dissociation constant (kd) and interfacial contacts of the sars-cov s-rbd with ace orthologs of major livestock species simulated using the human ace /cov -rbd structure ( m j).

most residues involved in binding are highlighted as magenta (ace) or orange (s) sticks and labeled as one-letter amino-acid codes plus residue numbers in bold or regular font for s or ace residues, respectively. the dotted/blue lines indicate intermolecular salt bridges or hydrogen bonds between interacting residues (generated and visualized with ucsf chimera and pymol from protein data bank file m j). (b) to (d) rbd interaction with the simulated structures of ace long isoforms from the dog, pig and cattle, respectively. 
amino acid exchanges in ace from another species compared with human ace are highlighted in red. (e) prediction of binding affinity energy (Δg), dissociation constant (kd) and interfacial contacts of the sars-cov s-rbd with ace orthologs of major livestock species. most domestic animals ace including that from mouse and rat (species known not to be susceptible to sars-cov ) have a binding affinity (Δg) at - . to - . kcal/mol that is within the range ( . - . kcal/mol) between the rbd and the ace from the known susceptible species (underlined in the left part of the table), indicating that some other factors, especially those from genetic divergence and natural immunity, contribute to the sars-cov susceptibility of different animal species. figure . detection of several short ace isoforms (ace -s) in the domestic animals including dog, pig, goat and cattle. (a) in contrast to most splicing isoforms such as in cats and humans, which share a common proximal promoter and encode ace proteins with similar sequences containing all key rbd-interacting residues, these short ace -s isoforms in domestic animals truncate for (cattle/goat ace -s) or (dog/pig ace -s) residues at their n-termini compared with human ace or the long ace isoforms in these species, thus destroying - key residues in the contacting network to s-rbd but retaining all enzyme active sites (yellow triangles in the blue ace domain bar). this results in little chance to be engaged by the viral s protein binding and predicts an unexpected evolutionary advantage to relieve potential covid- risk caused by the viral engagement and functional distortion on the classical long ace isoforms in these animal species. (b), (c) and (d) paired structural comparison between the human ace structure ( m ) with each simulated ace -s structure from pig (b), dog (c) and cattle/goat (d). human ace structure are in green, and each compared animal ace -s structure in magenta. 
the n-terminal residues of both compared structures are in cyan (arrows indicating n-termini of the ace -s isoforms) and shared c-termini are in red. the key s-interacting residues in human ace are shown in blue sticks. in general, all ace -s orthologs, particularly the porcine one, show high structural similarity to the human ace except for the n-terminal truncations.

results and discussions

vertebrate ace orthologs share a functional constraint but experience intra-species diversification in livestock with unknown selective pressure

sequence comparison among ace orthologs across representative vertebrate species shows a pairwise identity range of - % (figure a and supplemental fig. s and excel sheet), which is - % higher than the average value of - % generated through a similarity analysis of gene orthologs at a genome-wide scale [ ]. this indicates that ace exerts a similar and basic function across species, consistent with its systemic and regulatory role as a key enzyme in ras, an essential regulatory axis underlying the circulatory and excretory systems in vertebrates [ , , ]. a comparison of evolutionary rates of major genes within ras, including angiotensinogen (agt), ace, and several receptors of the processed angiotensin hormones, showed that ace actually evolves slightly faster than ace [ , and unpublished data]. this implies that ace may bear pressure for adaptive evolution of ras according to species-dependent physiological and pathological requirements [ , , ]. this evolutionary adaptability of ace genes is demonstrated by the existence of numerous genetic polymorphisms [ ] and several transcript isoforms, particularly in humans and major livestock species (figure b and supplemental fig. s and excel sheet). we identified (and verified by rna-seq data analysis) four transcripts of ace isoforms in humans (figure b) that primarily differ in the c-terminal residues within the collectrin domain. 
particularly, - short ace isoforms were identified in dogs, pigs, cattle, and goats in addition to the longer ace isoforms that match the human consensus (designated as -s or -l, respectively, after the animal common names in figure b and thereafter). these livestock ace -s isoforms have a - residue truncation at their n-terminal peptidase domains, which also spans the region interacting with the sars-cov spike protein. the selective mechanisms driving the evolution of these short ace isoforms in livestock are unknown, but may relate to previous pathogenic exposure or unprecedented physio-pathological pressure. in support of this reasoning, short ace isoforms are detected in both domestic bos taurus and hybrid cattle, but not in the wild buffalo and bison; and ace isoforms from each species are generally paralogous and sister to each other within a clade in the phylogenic tree (figure b). phylogenic analysis of vertebrate ace orthologs/paralogs reveals a general relationship aligning with the animal cladistics (figure b). in this context, homologs from the fish, frog and chicken form a primitive clade. all ungulate homologs form parallel clades next to each other. the homologs from the glires, primates and carnivores cluster into a big clade (marked with a yellow triangle node), which contains all the sars-cov susceptible species that have been verified via natural exposure or experimental infections (figure b, marked with red/orange circles). we examined and merged several previous studies on the prediction of sars-cov susceptibility in vertebrates based on the simulated structural analysis of the s-rbd-ace complex [ , , ]. 
while numerous vertebrate species were predicted to have high or low potential (figure b, labeled as red h or green l) for sars-cov susceptibility, incongruence between the predicted susceptibility and validation by infection is apparent in pangolin, ferret, tiger, cat and horseshoe bat, indicating that some other factors besides ace -rbd affinity should be considered [ , , , ]. we, therefore, refined the prediction matrix to include the rbd-binding evasion of some ace orthologs identified in major livestock species and the interferon-stimulated ace expression in priming sars-cov infections [ , , , ]. several recent studies have elegantly demonstrated the structural interaction of the viral s protein or its rbd in complex with the human ace receptor [ , ], showing that the contacting residues at the rbd/ace interface (figure a) involve at least residues in ace (figure b, listed in the table cells and referred to the aligned residue positions in human ace) and residues in the sars-cov- rbd (figure b, blue circles with residue labels above the table) [ , , ]. the cross-species residue identity (%) of these interacting residues in ace is dispersed over a broader range ( - %) than the whole ace sequence identity rate of - % [ ], indicating a faster evolution rate of this virus-interacting region. notably, the s-binding region spans a large part of the n-terminal peptidase domain; thus s-binding may competitively block a majority of active sites to inhibit the physiological action of ace (figure c). using a similar structural analysis procedure [ , ], we modeled the ace structures of animal species of interest and simulated their interaction with sars-cov s-rbd based on a published rbd-human ace structure (protein data bank file m j) [ ]. figure demonstrates the s-rbd interaction with the simulated structures of ace long isoforms from the dog, pig and cattle, respectively. 
the major changes of the rbd-ace interacting interfaces are from the residual exchanges in ace from other species compared with human ace ( figure b - d, highlighted in red). in addition, the exchange of n t (in pigs) and n y (in cattle and sheep) would destroy the n-glycosylation site in human ace . ace from goat (supplement fig. s ) exhibits identical amino acid exchanges as in cattle in the rbd-ace interfacial contacts. in contrast, when compared with human ace , ace from cats (supplement fig. s ) conserves all relevant glycosylation sites in human ace [ , ] . we also calculated the interfacial contacts using parameters of protein-protein interaction including the predictable binding affinity energy (Δg), dissociation constant (kd) and number of different interfacial contacts within the s-rbd and ace contact. although the exact numbers may differ from the previous reports [ ] , they provide a very comparable matrix generated using the same algorithm ( figure e ) [ ] . data show that the ace of most domestic animals, including that from mouse and rat (the species known to be unsusceptible to human sars-cov ) have a binding affinity (Δg) at - . to - . kcal/mol. this is within the binding affinity range ( . - . kcal/mol) between the rbd and the ace from known susceptible species ( figure e , underlined in the left part of the table). this indicates that other factors, conceivably from genetic divergence and/or natural immunity, also contribute to sars-cov susceptibility in animal species. therefore, an effective prediction matrix should include the critical immunogenetic factors to further determine virus susceptibility in addition to the sequence/structural similarity of ace receptors (figure and fig. s ) [ , , ] . we detected several short ace isoforms in the domestic animals including dog, pig, goat and cattle that have an n-terminal truncation spanning - key residues in the contacting network to s-rbd but retain the enzyme active sites ( figure a ). 
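the binding affinity energy and dissociation constant discussed above are two views of the same quantity, linked by the standard thermodynamic relation Δg = rt ln(kd). the sketch below illustrates that conversion only; it does not reproduce how prodigy predicts Δg from interfacial contacts. the function names, the 25 °c default temperature and the example value of -10 kcal/mol are assumptions for illustration and are not values from this study.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol*K)


def kd_from_delta_g(delta_g_kcal: float, temp_celsius: float = 25.0) -> float:
    """Dissociation constant (molar) from binding free energy (kcal/mol)."""
    temp_kelvin = temp_celsius + 273.15
    return math.exp(delta_g_kcal / (R_KCAL * temp_kelvin))


def delta_g_from_kd(kd_molar: float, temp_celsius: float = 25.0) -> float:
    """Binding free energy (kcal/mol) from dissociation constant (molar)."""
    if kd_molar <= 0:
        raise ValueError("kd must be positive")
    temp_kelvin = temp_celsius + 273.15
    return R_KCAL * temp_kelvin * math.log(kd_molar)


# hypothetical example value, not taken from the table in this paper
example_kd = kd_from_delta_g(-10.0)
print(f"kd = {example_kd:.2e} M")
```

because kd depends exponentially on Δg, a difference of about 1.4 kcal/mol at this temperature corresponds to roughly a tenfold change in kd, which is why orthologs with similar Δg values are hard to separate on affinity alone.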
most of the splicing isoforms from ace genes, such as those in zebrafish, cats and humans, share a common proximal promoter and encode ace proteins containing all key rbd-interacting residues [ , ]. however, these short ace -s isoforms in domestic animals are truncated by (cattle/goat ace -s) or (dog/pig ace -s) residues at their n-termini compared with the long ace isoforms in the same species (figure and fig. s ). therefore, these short ace isoforms lose - key residues in the contacting network to s-rbd but likely retain ace enzymatic function in ras. paired structural comparison between the human ace structure (extracted from m ) and each simulated ace -s structure from the pig, dog, and cattle/goat reveals that all these ace -s orthologs from domestic animals, particularly the porcine one, show high structural similarity to the human ace except for the n-terminal truncations (figure b-d). this indicates that these short ace isoforms in domestic animals have little chance of being engaged by viral s-binding, and predicts an unexpected evolutionary advantage to allay potential covid- risk resulting from viral engagement and functional distortion on the classical long ace isoforms in these animal species [ , ].

sars-cov infection induces a weak ifn response but the production of high amounts of inflammatory cytokines, including interleukin (il)- and chemokine cxcl , in most severe covid- patients [ , , , ]. studies of sars and mers showed that these pathogenic coronaviruses share similar viral antagonisms including the endoribonuclease (endou) encoded by nonstructural protein (nsp ), which directly blunts cell receptors responding to viral dsrna and in turn weakens the acute antiviral response [ ]. several recent studies revealed that sars-cov seems more cunning in not only evading or antagonizing but also exploiting the ifn response for efficient cell attachment [ , , , ]. 
as ace is a key enzyme in ras, the expression of the ace gene has been primarily investigated for its physiological response to circulatory regulation, and a response to pathological inflammation is also expected [ , , ]. however, the expression of the human ace gene was highly responsive to both viral infection and the host ifn response, i.e. the human ace gene seems to be an unstudied ifn-stimulated gene (isg) [ , ]. surprisingly, the isg propensity of ace genes is species-dependent; for example, the mouse ace gene is less ifn responsive, which may partly explain the mouse insusceptibility to sars-cov infection [ ]. to categorize the different ifn-inductive propensity of ace genes in vertebrates, particularly in major livestock species, we profiled the regulatory cis-elements and relevant transcription factors in the proximal promoter region of each ace gene ( . kb before tss or atg). figure illustrates major regulatory cis-elements located in ace genes from major livestock animals and several reference animal species. data show that animal ace gene promoters differ evolutionarily in containing ifn- or virus-stimulated response elements (isre, prdi, ifrs, and/or stat / factors) and cis-elements responsive to proinflammatory mediators. all these cis-elements recruit corresponding transcription factors (tf) to mediate differential ace responses to antiviral ifns and inflammation that is associated with covid- disease [ , , ]. we found that ace genes have acquired species-specific isg propensity in response to ifn and inflammatory stimuli. in most (if not all) of the sars-cov susceptible species, the ace genes carry ifn-responsive elements intermediate between those of typical robust and tunable ifn-stimulated genes (isg) [ ]. in general, the robust isgs (isg as an example here) are stimulated in the acute phase of viral infection and play a more antiviral role; in contrast, the later-responding tunable isgs (irf as an example) contribute more to the anti-proliferative activity of ifn [ ]. 
in addition, unlike the promoters of the short ace isoforms in cattle and goats, which share most promoter regions with their paralogous long isoforms, the short ace isoforms of dogs (dog-s) and pigs (pig-s) have proximal promoter regions (and ifn responsivity) distinct from those of the paralogous long ace isoforms (figures and ). results indicate that the short ace isoforms in pigs and dogs have diversified from their long paralogs at the levels of both genetic coding and epigenetic regulation to adapt to some evolutionary pressure, such as that from pathogenic interaction (figure ) [ , ].

matrix scores of interferon-inductive elements in ace gene promoters correspond to sars-cov susceptibility

the position weight matrix (pwm) is a position-specific scoring model for the binding specificity of a transcription factor (tf) on dna sequences [ ]. using the online pwm toolsets (https://ccg.epfl.ch/cgi-bin/pwmtools), we evaluated the mean pwm scores of key cis-elements in the proximal promoters of ace genes that contain binding sites for canonical ifn-dependent transcription factors, which include isre/stat, irf , irf / and irf , as well as c/ebp, representing a core transcription factor for pro-inflammation. these ifn-dependent transcription factors, particularly irf / and isre/stat for ifn stimulation, are differentially enriched in the promoter regions of ace genes in a species-dependent way. higher enrichment of isre/stat / and/or irf / binding sites is detected in most sars-cov /covid susceptible species (indicated with solid orange or red circles, respectively). in contrast, the pwm scores for irf and c/ebp, which regulate inflammation, differ less among ace promoters from different animal species, indicating that ace genes are more universally regulated by inflammation than by viral infection or ifn induction, which act in a species-dependent way (figure ). 
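to make the pwm idea above concrete, the following minimal sketch builds a log-odds matrix from a handful of aligned binding sites and slides it along a sequence to find the best-scoring window. it is an illustration under stated assumptions, not the pwmtools algorithm: the function names, the pseudocount of 1, the uniform 0.25 background and the toy sites and sequence are all hypothetical and do not represent real isre or irf motifs.

```python
import math

BASES = "ACGT"


def build_pwm(sites, pseudocount=1.0, background=0.25):
    """Log2-odds position weight matrix from equal-length aligned sites."""
    width = len(sites[0])
    if any(len(site) != width for site in sites):
        raise ValueError("all sites must have the same length")
    total = len(sites) + 4 * pseudocount
    pwm = []
    for pos in range(width):
        column = [site[pos] for site in sites]
        pwm.append({
            base: math.log2((column.count(base) + pseudocount) / total / background)
            for base in BASES
        })
    return pwm


def scan(pwm, sequence):
    """Score every window of the sequence; returns (start, score) pairs."""
    width = len(pwm)
    hits = []
    for start in range(len(sequence) - width + 1):
        window = sequence[start:start + width]
        score = sum(column[base] for column, base in zip(pwm, window))
        hits.append((start, score))
    return hits


def best_hit(pwm, sequence):
    """Highest-scoring window as a (start, score) pair."""
    return max(scan(pwm, sequence), key=lambda hit: hit[1])


# hypothetical toy data: four aligned sites and a short promoter fragment
TOY_SITES = ["GAAA", "GAAA", "GAAC", "GAAA"]
start, score = best_hit(build_pwm(TOY_SITES), "TTGAAATT")
print(start, round(score, 2))
```

a per-promoter summary, such as the mean score over windows above a threshold, could then be compared across species; the choice of threshold and summary statistic would be a further assumption.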
compared with the promoters of a typical human robust isg and tunable irf genes, these data indicate that ace genes (particularly the primate ones) are not typical robust or tunable isgs as represented by isg or irf , but respond differently to viral infection (through irf / ) or ifn auto-induction (via isre/stat) in a species-dependent manner (figure ) [ ]. higher enrichment of isre/stat / and/or irf / corresponds to sars-cov susceptibility in experimentally validated mammalian species, especially primates, but not in phylogenically distant species such as zebrafish, which has very low potential for sars-cov susceptibility due to the high disparity of ace structures (figure and fig. s ). in addition, the proximal promoters of the pig and dog ace -s genes differ considerably in their ifn-responsive elements from most ace promoters in mammals (figures and ). however, they are phylogenically sister to the ace promoters from the primitive vertebrates (frog, chicken and zebrafish) (figure , phylogenic tree). this indicates that the expression of these short ace isoforms is more conservative than that of the long ace paralogs, which represent a more recent evolution that acquired epigenetic regulation of ace by ifn signaling (figure ) [ ].

studies show that affinity adaptation of the viral s-rbd and ace receptor determines the cellular permissiveness to the virus [ , , ]. sars-cov has not only adapted to bind human ace with high affinity for cell attachment, but also antagonizes the host antiviral interferon (ifn) response and utilizes the ifn-stimulated property of the human ace gene to boost spreading [ , , , ]. 
in addition to structural analysis of the simulated s-rbd-ace interaction, we propose that several immunogenetic factors, including the evolution of s-binding-void ace isoforms in some domestic animals, the species-specific ifn system, and epigenetic regulation of the ifn-stimulated property of host ace genes, contribute to the viral susceptibility and the development of covid- -like symptoms in certain animal species [ , , , ]. a computational program in development that incorporates this multifactorial prediction matrix and in vitro validation of sars-cov susceptibility in major vertebrate species will be necessary to address public concerns relevant to sars-cov infections in animals (figure ). it will also lead to the development of better animal models for anti-covid investigations [ ]. in addition, several ifn-based therapies for treatment of covid have been proposed and are in clinical trials [ , , , ]. considering the viral exploitation of the ifn-stimulated property of human ace , a timely and subtype-optimized ifn treatment should be delivered rather than a general injection of typical human ifn-α/β subtypes [ , , , ]. in this line, domestic livestock like pigs and cattle have a highly evolved ifn system containing numerous unconventional ifn subtypes. some of these unconventional ifn subtypes could be utilized for developing effective antiviral therapies: for example, some porcine ifn-ω exert much higher antiviral activity than ifn-α even in human cells, and most ifn-λ retain antiviral activity with less pro-inflammatory activity [ , ]. in summary, a prediction matrix, which integrates the structural analysis of the s-rbd-ace interface and the species-specific immunogenetic diversity of ace genes, was used to predict the sars-cov susceptibility and fit current knowledge about the infectious potential already validated in different animal species (figure ). more extensive validation experiments are needed to further improve this prediction matrix. 
our current results demonstrate several previously unstudied immunogenetic properties of animal ace genes and imply that some domestic animals, including dogs, pigs and cattle/goats, may have acquired some immunogenetic diversity to confront sars-cov infection and face less covid risk than previously thought. however, immediate biosecurity practices should be applied in animal management to reduce animal exposure to the virus and prevent potential species-specific adaptation (figure ).

figure . scores of mean position weight matrix (pwm) of key cis-elements in the proximal promoters of ace genes that contain binding sites for canonical ifn-dependent transcription factors, which include isre/stat / , irf , irf / and irf , as well as c/ebp as a core transcription factor for the pro-inflammatory response. these ifn-dependent transcription factors, particularly irf / and isre/stat, which are critical for ifn stimulation, are differentially enriched in the promoter regions of ace genes in a species-dependent way. in particular, increased enrichment of isre/stat / and irf / binding sites is detected in the sars-cov /covid susceptible species (indicated with solid orange or red circles, respectively). in contrast, the pwm scores for irf and c/ebp, which regulate inflammation, are less differentially enriched in ace promoters from different animal species. the promoters of a typical human robust interferon-stimulated gene (isg) and irf (a typical tunable isg) are used as references. higher enrichment of isre/stat / and irf / corresponds to sars-cov susceptibility in experimentally validated animal species and humans. abbreviations: c/ebp, ccaat/enhancer binding protein; irf, interferon-regulatory factor; isre, interferon-sensitive response element; stat, signal transducer and activator of transcription; pwm, position weight matrix. the pwm tools are used through https://ccg.epfl.ch/cgi-bin/pwmtools. 
for livestock breeding programs that target disease resistance to respiratory viruses, consideration of the genetic and epigenetic diversity of ace genes as well as antiviral isgs is highly recommended [ , , , ].

in conclusion, sars-cov has evolved to fit well with the human (and nonhuman primate) ace receptor through structural interfacial affinity, immunogenetic diversity and epigenetic expression regulation, which results in high infectivity [ , , , , , , ]. most mammalian animals, especially those that belong to glires, primates and carnivores, have a higher potential for sars-cov susceptibility, but in a species-specific manner based on the existence of s-binding-void ace isoforms and the difference in the ifn-inductive propensity of the major ace genes. most ungulate animals appear to have a low susceptibility potential, with the exception of horses and sheep, which have a high potential (figure ). current development of ifn-based anti-covid therapies should consider the isg property of the human ace gene to optimize for timely application using a highly antiviral subtype that potentially has less inflammatory (or even anti-inflammatory) activity [ ]. evolution of the ifn complex and functional diversity in domestic animals (such as pigs and cattle) provides a natural model for optimizing ifn antiviral regulation and therapy development [ , ].

eric r. sang, yongming sang: conceived and designed the experiments; performed the experiments; analyzed and interpreted the data; wrote the paper.
yun tian: performed the experiments; analyzed and interpreted the data; wrote the paper.
yuanying gong: contributed reagents, materials, analysis tools or data; wrote the paper.
laura c. miller: conceived and designed the experiments; contributed reagents, materials, analysis tools or data; wrote the paper. 
covid- dashboard by the center for systems science and engineering (csse) at johns hopkins university (jhu) a familial cluster of pneumonia associated with the novel coronavirus indicating person-to-person transmission: a study of a family cluster a new coronavirus associated with human respiratory disease in china the emergence of sars, mers and novel sars- coronaviruses in the st century a genomic perspective on the origin and emergence of sars-cov- origin and evolution of pathogenic coronaviruses animal origins of the severe acute respiratory syndrome coronavirus: insight from ace -s-protein interactions middle east respiratory syndrome coronavirus (mers-cov): animal to human interaction mers-cov: the intermediate host identified? how viral and intracellular bacterial pathogens reprogram the metabolism of host cells to allow their intracellular replication sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor a highly conserved cryptic epitope in the receptor binding domains of sars-cov- and sars-cov the proximal origin of sars-cov- sars-cov- reverse genetics reveals a variable infection gradient in the respiratory tract sars-cov- receptor ace is an interferon-stimulated gene in human airway epithelial cells and is detected in specific cell subsets across tissues susceptibility of ferrets, cats, dogs, and other domesticated animals to sars-coronavirus covid- : animals, veterinary and zoonotic links transmission of sars-cov- in domestic cats pathogenesis and transmission of sars-cov- in golden hamsters serological survey of sars-cov- for experimental, domestic, companion and wild animals excludes intermediate hosts of different species of animals animal models of mechanisms of sars-cov- infection and covid- pathology first detection and genome sequencing of sars-cov- in an infected cat in france simulation of the clinical and pathological manifestations of coronavirus disease (covid- ) in golden syrian hamster 
model: implications for disease pathogenesis and transmissibility infection and rapid transmission of sars-cov- in ferrets complete genome sequence of sars-cov- in a tiger from a u spike protein recognition of mammalian ace predicts the host range and an optimized ace for sars-cov- infection predicting the angiotensin converting enzyme (ace ) utilizing capability as the receptor of sars-cov- covid- : epidemiology, evolution, and crossdisciplinary perspectives the pivotal link between ace deficiency and sars-cov- infection physiological and pathological regulation of ace , the sars-cov- receptor renin-angiotensin system at the heart of covid- pandemic type i and type iii interferons -induction, signaling, evasion, and application to combat covid- increasing host cellular receptorangiotensin-converting enzyme (ace ) expression by coronavirus may facilitate -ncov (or sars-cov- ) infection sars-cov- entry factors are highly expressed in nasal epithelial cells together with innate immune genes antiviral regulation in porcine monocytic cells at different activation states a genomic survey of angiotensinconverting enzymes provides novel insights into their molecular evolution in vertebrates evolutionary constraints on structural similarity in orthologs and paralogs ace receptor polymorphism: susceptibility to sars-cov- , hypertension, multi-organ failure, and covid- disease outcome structural basis for the recognition of sars-cov- by full-length human ace structure of the sars-cov- spike receptor-binding domain bound to the ace receptor prodigy: a web server for predicting the binding affinity of protein-protein complexes imbalanced host response to sars-cov- drives development of covid- weak induction of interferon expression by sars-cov- supports clinical trials of interferon lambda to treat early covid- impaired type i interferon activity and exacerbated inflammatory responses in severe covid- patients, medrxiv ( ) dysregulation of type i interferon responses in covid- 
heightened innate immune responses in the respiratory tract of covid- patients heliyon xxx (xxxx) xxx coronavirus endoribonuclease activity in porcine epidemic diarrhea virus suppresses type i and type iii interferon responses covid- : lambda interferon against viral load and hyperinflammation multifaceted activities of type i interferon are revealed by a receptor antagonist epigenetic dysregulation of ace and interferon-regulated genes might suggest increased covid- susceptibility and severity in lupus patients pwmscan: a fast tool for scanning entire genomes with a position-specific weight matrix therapeutic options for the novel coronavirus ( -ncov) triple combination of interferon beta- b, lopinavir-ritonavir, and ribavirin in the treatment of patients admitted to hospital with covid- : an open-label, randomised, phase trial type interferons as a potential treatment against covid- interferon beta- b for covid- cross-species genome-wide analysis reveals molecular and functional diversity of the unconventional interferon-ω subtype porcine interferon complex and co-evolution with increasing viral pressure after domestication covid- and emerging viral infections: the case for interferon lambda heliyon xxx (xxxx) xxx the authors declare no conflict of interest. supplementary content related to this article has been published online at https://doi.org/ . /j.heliyon. .e . key: cord- -a r k a authors: zhang, shuyuan; qiao, shuyuan; yu, jinfang; zeng, jianwei; shan, sisi; lan, jun; tian, long; zhang, linqi; wang, xinquan title: bat and pangolin coronavirus spike glycoprotein structures provide insights into sars-cov- evolution date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: a r k a in recognizing the host cellular receptor and mediating fusion of virus and cell membranes, the spike (s) glycoprotein of coronaviruses is the most critical viral protein for cross-species transmission and infection. 
here we determined the cryo-em structures of the spikes from bat (ratg ) and pangolin (pcov_gx) coronaviruses, which are closely related to sars-cov- . all three receptor-binding domains (rbds) of these two spike trimers are in the “down” conformation, indicating they are more prone to adopt this receptor-binding inactive state. however, we found that the pcov_gx, but not the ratg , spike is comparable to the sars-cov- spike in binding the human ace receptor and supporting pseudovirus cell entry. through structure and sequence comparisons, we identified critical residues in the rbd that underlie the different activities of the ratg and pcov_gx/sars-cov- spikes and propose that n-linked glycans serve as conformational control elements of the rbd. these results collectively indicate that strong rbd-ace binding and efficient rbd conformational sampling are required for the evolution of sars-cov- to gain highly efficient infection. studies revealed that the sars-cov- s trimer, similar to that of sars-cov, needs to have at least one rbd in an "up" conformation to bind hace - . therefore, a spike trimer with all three rbds "down" is in a receptor-binding inactive state, and the conformational change of at least one rbd from "down" to "up" switches the spike trimer to a receptor-binding active state.
overall structures of ratg and pcov_gx spikes
the overall structures of homotrimeric ratg and pcov_gx spikes resemble the previously reported pre-fusion structures of coronavirus spikes (fig. a). both spikes have a mushroom-like shape (~ Å in height and ~ Å in width), consisting of a cap mainly formed by β-strands and a stalk mainly formed by α-helices (fig. a). like other coronaviruses, the ratg and pcov_gx spike monomers are composed of the s and s subunits with a protease cleavage site between them (fig. b, c). 
the structural components of the spike include the n-terminal domain (ntd), rbd (also called the c-terminal domain, ctd), subdomain (sd ) and subdomain (sd ) in the s subunit; and the upstream helix (uh), fusion peptide (fp), connecting region (cr), heptad repeat (hr ), central helix (ch), β-hairpin (bh), subdomain (sd ) and heptad repeat (hr ) in the s subunit (fig. d, fig. s ). table s . 
fig. the residues and glycans interacting with one rbd of the different spikes. (a) the residues and glycans interacting with one rbd are shown as salmon spheres. the ratg rbd is colored in magenta, pcov_gx rbd in green, sars-cov- (pdb id: vxx) rbd in wheat, and sars-cov- (pdb id: zge) rbd in marine; remaining regions shown in gray. (b) detailed structures of the rbd-glycans interface are shown; rbds ( zge/ vxx) are colored the same as in a. glycans are shown as red sticks and asn-linked glycans are labeled. sequence alignment of the sars-cov- , ratg and pcov_gx rbd-interacting glycosylation sites is shown in the bottom panel. some sequences between the three sites are omitted and indicated by black dots. amino acid positions of asparagines are indicated above the sequences. asparagines (n) are colored red and threonines (t) are colored blue. binding affinities and cell entry of the different spikes. (a) binding curves of immobilized hace with the sars-cov- , pcov_gx or ratg spike. data are shown as different colored lines and the best fit of the data to a : binding model is shown in black. (b) the cell entry efficiencies of pseudotyped viruses as measured by luciferase activity. (c) the representative micrographs and d classification results of negative-staining em. both spikes were incubated with -fold molar ratio of hace . the red box shows the complex of the pcov_gx spike with hace . we thank the tsinghua university branch of china national center for protein key: cord- -mwxkvwaz authors: li, wei; schäfer, alexandra; kulkarni, swarali s.; liu, xianglei; martinez, david r.; chen, chuan; sun, zehua; leist, sarah r.; drelich, aleksandra; zhang, liyong; ura, marcin l.; berezuk, alison; chittori, sagar; leopold, karoline; mannar, dhiraj; srivastava, shanti s.; zhu, xing; peterson, eric c.; tseng, chien-te; mellors, john w.; falzarano, darryl; subramaniam, sriram; baric, ralph s.; dimitrov, dimiter s. title: high potency of a bivalent human vh domain in sars-cov- animal models date: - - journal: cell doi: . /j.cell. . . sha: doc_id: cord_uid: mwxkvwaz novel covid- therapeutics are urgently needed. we generated a phage-displayed human antibody vh domain library from which we identified a high-affinity vh binder ab . bivalent vh, vh-fc ab bound with high avidity to membrane-associated s glycoprotein and to mutants found in patients. 
it potently neutralized mouse adapted sars-cov- in wild type mice at a dose as low as mg/kg and exhibited high prophylactic and therapeutic efficacy in a hamster model of sars-cov- infection, possibly enhanced by its relatively small size. electron microscopy combined with scanning mutagenesis identified ab interactions with all three s protomers and showed how ab neutralized the virus by directly interfering with ace binding. vh-fc ab did not aggregate and did not bind to human membrane-associated proteins. the potent neutralization activity of vh-fc ab combined with good developability properties and cross-reactivity to sars-cov- mutants provide a strong rationale for its evaluation as a covid- therapeutic. the global outbreak of a severe acute respiratory distress (sars) coronavirus (sars-cov- ) associated disease requires rapid identification of therapeutics and vaccines. while many vaccines are in clinical development, the time to market can be relatively long and immunogenicity can be limited for high-risk groups (amanat and krammer, ) . alternatively and complementarily, antibodies can be used as safe and effective prophylactics and therapeutics (pelegrin et al., ) . convalescent plasma from covid- patients inhibited sars-cov- infection and alleviated symptoms of newly infected patients (casadevall and pirofski, ; rojas et al., ) suggesting that potent neutralizing monoclonal antibodies (mabs) may be even more effective. sars-cov- genome shares more than % homology to the sars-cov . similar to sars-cov, sars-cov- uses the spike (s) envelope glycoprotein to enter into host cells. the viral entry is initiated by the receptor binding domain (rbd) of the s protein binding to its receptor, angiotensin-converting enzyme (ace ), leading to conformational change of the s subunit and formation of six helical-bundle resulting in membrane fusion between viral and host cells yan et al., ) . 
the sars-cov rbd contains immune-dominant epitopes that can elicit neutralizing antibodies conferring protection against sars-cov infection (he et al., ). a recent bioinformatics study showed that sars-cov- rbd has several b cell epitopes (grifoni et al., ). sars-cov- rbd based immunogens were able to elicit neutralizing sera in animals (quinlan et al., ). thus, sars-cov- rbd is a good target for developing potent neutralizing mabs. we and others have identified such potent neutralizing human mabs targeting the rbd of sars-cov (zhu et al., ) and the middle east respiratory syndrome coronavirus (mers-cov) (ying et al., a). recently, several groups have reported the isolation of potent neutralizing antibodies from convalescent human donors, but all are in an immunoglobulin g (igg ) format with a molecular mass of about kda (ju et al., ; rogers et al., ; shi et al., ; zost et al., ). antibody domains and fragments such as fab (fragment antigen binding, molecular weight of kda), scfv (single-chain variable fragment, kda) and v h (heavy chain variable domain, kda) are attractive antibody formats as candidate therapeutics (nelson, ). for example, isotope labeled antibody fragments are more suitable for bio-imaging due to their better tissue penetration and faster clearance compared to full-size antibodies (freise and wu, ). single antibody domains (sabd), e.g., camelid v h h ( kda), exhibit strong antigen binding and high stability (harmsen and de haard, ). we and others have demonstrated that human igg heavy chain variable domain (v h ) can be engineered to achieve high stability and affinity to antigens (nilvebrant et al., ), as exemplified by the v h , m . , targeting the human immunodeficiency virus type (hiv- ) envelope glycoprotein co-receptor binding site (chen et al., a). the small size of v h domains could improve therapeutic efficacy for infectious diseases, such as covid- , because of greater penetration to sites of infection. 
the conformation of the sars-cov- s trimer is dynamic with only one rbd in the "up" conformation presenting neutralizing epitopes while epitopes in the other two rbds may be masked. small v h s may achieve binding to the cryptic rbd epitopes during the dynamic "breathing" of the s trimer. in addition, v h s may have an advantage for treatment of respiratory virus infections because v h s could efficiently penetrate tissue, especially when using direct delivery through inhalation (detalle et al., ). to identify potent neutralizing v h s against sars-cov- , we panned our large ( clones) and diverse phage-displayed human v h antibody library against recombinant rbd. several v h binders were isolated and screened for their affinities, ace competition and stabilities. one of those v h s, ab , in an fc (human igg , crystallizable fragment) fusion format, showed potent neutralization activity and specificity against sars-cov- both in vitro and in two animal models. to our knowledge, this is the first report of high potency of a human antibody domain (v h ) in two animal models of infection. we generated a large phage-displayed human v h library where heavy chain complementarity-determining regions (hcdr , , s) were grafted into their cognate positions of a stable scaffold based on the germline v h - (figure s a). it was panned against recombinant rbd antigens with two different tags (avi-his and human igg fc tag) which were sequentially used to avoid phage enrichment to tags and related epitopes. the quality of the rbd used for panning was confirmed by ace binding (figure s b and c). after three rounds of panning, a panel of v h binders was obtained. among the highest affinity binders, we selected one, v h ab , which did not aggregate during a six-day incubation at °c as tested by dynamic light scattering (dls) (figure s d). 
to increase the v h ab avidity and extend its in vivo half-life, it was converted to a bivalent antibody domain by fusion to the human igg fc (v h -fc ab ) (figure s e ). v h ab bound to sars-cov- rbd and s with half-maximal binding concentrations (ec s) of nm as measured by elisa (figure a and d) and an equilibrium dissociation constant (k d ) of nm as measured by the biolayer interferometry (blitz system) ( figure b) . the relatively fast dissociation rate constant (k d = . × - s - ) was significantly ( -fold) decreased by the conversion to a bivalent fc fusion format (k d = . × - s - ) ( figure e ) resulting in high avidity. v h -fc ab bound to sars-cov- rbd and s subunit of s protein with ec s of . nm and . nm, respectively, and a k d of . nm ( figure e ). it specifically bound to t cells expressing s, but not to control t cells ( figure c and figure s a ). the binding of v h -fc ab was higher than that of igg cr , an anti-sars-cov antibody cross-reactive with sars-cov- (tian et al., ) . the v h -fc ab 's halfmaximal facs measured binding concentration (fc ) of . nm was higher than that of recombinant human ace -fc (fc = . nm) ( figure f ). these data demonstrate that ab selected by an isolated rbd can bind to cell surface associated native s trimer. the binding of v h -fc ab to the s protein was significantly improved compared to that of the v h ab through avidity effect. competition with human ace for binding to rbd is a surrogate indicator for antibody neutralization activity. v h -fc ab outcompeted human ace -fc with a half-maximal inhibitory concentration (ic ) of . nm ( figure a ). note that the v h -fc ab was much more effective in outcompeting ace -fc than v h ab , consistent with its enhanced binding. ace can also block v h ab for binding to rbd ( figure s b ) and cell surface associated s ( figure s c) . v h -fc ab also significantly decreased the kinetics of ace binding as measured by blitz ( figure b ). 
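the link between the dissociation rate constant and the equilibrium dissociation constant described above can be illustrated with a minimal sketch, assuming a simple 1:1 binding model in which the equilibrium constant equals the off-rate divided by the on-rate. all rate constants below are hypothetical placeholders, not the measured values, and treating the bivalent fc fusion as a 1:1 binder with a slower apparent off-rate is a simplification of avidity.

```python
def equilibrium_kd(k_off: float, k_on: float) -> float:
    """Equilibrium dissociation constant (M) for a 1:1 interaction.

    k_off: dissociation rate constant in 1/s
    k_on:  association rate constant in 1/(M*s)
    """
    if k_off <= 0 or k_on <= 0:
        raise ValueError("rate constants must be positive")
    return k_off / k_on


def fraction_bound(conc: float, kd: float) -> float:
    """Equilibrium fraction of antigen occupied at free binder concentration conc (M)."""
    if conc < 0 or kd <= 0:
        raise ValueError("conc must be >= 0 and kd must be > 0")
    return conc / (conc + kd)


# Hypothetical values chosen only for illustration (not taken from the study).
K_ON = 1.0e5               # 1/(M*s), assumed identical for both formats
K_OFF_MONOVALENT = 1.0e-2  # 1/s, assumed off-rate of the single domain
K_OFF_BIVALENT = 1.0e-4    # 1/s, assumed slower apparent off-rate from avidity

kd_mono = equilibrium_kd(K_OFF_MONOVALENT, K_ON)
kd_bi = equilibrium_kd(K_OFF_BIVALENT, K_ON)

if __name__ == "__main__":
    print(f"monovalent KD: {kd_mono:.2e} M")
    print(f"bivalent apparent KD: {kd_bi:.2e} M")
    print(f"occupancy at 10 nM: {fraction_bound(1e-8, kd_mono):.2f} "
          f"vs {fraction_bound(1e-8, kd_bi):.2f}")
```

under these assumptions, slowing the off-rate at a fixed on-rate lowers the apparent equilibrium constant by the same factor, which is the sense in which the fc fusion gains avidity.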
v h -fc ab did not bind to the sars-cov rbd (figure c) and did not compete with cr for binding to rbd (figure d). the cr epitope is located in a conserved region on the rbd core domain distal from the ace binding interface, as seen in the crystal structure of the fab cr -rbd complex. these results indicate that the ab epitope may overlap with the ace binding site on rbd. currently, nine prevalent rbd mutants have been found in covid- patients (priyanka et al., ). six of these mutations (f l, n d, n d/d y, v f, r i, w r) are located in the rbd core domain and three, k r, g s and v a, are in the receptor binding motif (rbm) (figure a). v h -fc ab bound to all mutants similarly to wild type rbd as measured by elisa (figure b). to map the ab epitope, we also generated several mutations in non-conserved positions compared to sars-cov spanning the footprint of ace on rbm (n a, g l, l a, f a, a i, f a, q a, q a, n a, y a) (figure c). most of these mutants retained v h -fc ab binding except f a, f a and a i (figure d and e). the f a mutation significantly decreased binding without affecting the overall rbd conformation (figure s c and s d), indicating that f directly interacts with ab . the f a and a i mutations decreased the binding by % and %, respectively, but they also affected the rbd conformation (figure s c and s d). these results suggest that a portion of the v h ab epitope could be in the rbm distal loop tip where f is located (figure f). to explore structural aspects of sars-cov- neutralization by v h ab , we performed negative stain electron microscopic analysis of the complex formed between the s protein ectodomain and v h ab or soluble ace (figure ). 
the density maps showed that both v h ab and ace were in a quaternary conformation in which two of the protomers in the trimer are in the "down" conformation with the third one in the "up" conformation ( figures a and b) , similar to the quaternary conformation of the reported ace -bound s ectodomain (pdb id: vyb) (walls et al., ) . one molecule of the v h ab was observed bound to each rbd domain ( figure a ). in the ace -s complex, one molecule of ace was bound to the s protein trimer, straddling one "up" and one "down" rbd region ( figure b ). there appears to be a noticeable shift of the "up" rbd domain when it is bound to v h ab ( figure a ). this shift is not observed when ace is bound to the trimer ( figure b) . superposition of the two density maps reveals that the binding site of v h ab directly overlaps with the ace one, precluding simultaneous occupancy on the s protein ectodomain ( figure c ). we also found that when ace was added subsequent to the addition of v h ab , only the v h ab bound state was observed, further confirming the ace competition with v h ab . to better understand the spatial relationship between the site of v h ab binding and that of ace binding, we created a molecular model for ace bound s trimer by aligning the rbd region of the crystal structure of sars-cov- rbd bound ace (pdb id: m j) (lan et al., ) to the "up" rbd region in the cryo-em structure of the trimer (pdb id: yvb) (wrapp et al., ) . superposition of this chimeric structure with the density map of v h ab -bound s protein trimers reveals that the bound ace has extensive overlap with the space occupied by bound v h ab ( figure d ). the direct spatial overlap between bound v h ab and ace provides a structural mechanism for the observed effect of ab on blocking ace binding. 
the structural findings also showed that the rbm distal loop, which has f at its tip, is directly covered by the footprint of the bound v h ab , consistent with the epitope mapping results showing that f is a direct contacting residue for ab . we used four different assays to evaluate v h -fc ab mediated inhibition of sars-cov- infection in vitro: a βgalactosidase (β-gal) reporter gene-based quantitative cell-cell fusion assay (xiao et al., ) ; an hiv- backbonebased sars-cov- pseudovirus assay ; and two different replication-competent virus neutralization assays (a luciferase reporter gene assay and a microneutralization (mn)-based assay) (scobey et al., ; yount et al., ) . v h -fc ab inhibited cell-cell fusion much more potently than v h ab ( figure a ). the inhibitory activity of v h -fc ab was also higher than that of ace -fc. the control anti mers-cov antibody igg m did not show any inhibitory activity. v h -fc ab neutralized pseudotyped sars-cov- virus (ic = . µg/ml) more potently than ace -fc (ic = . µg/ml) and v h ab (ic = . µg/ml) ( figure b ). the pseudovirus neutralization ic for ace -fc in our assay is comparable to the one reported by changhai lei et al. ( . - . µg/ml) (lei et al., ) . interestingly, the maximum neutralization by v h ab was only % compared to the % by v h -fc ab and ace -fc, which was also observed for another antibody s (pinto et al., ) . the complete neutralization by v h -fc ab /ace -fc emphasizes the role of bivalency and related avidity in neutralization (klasse and sattentau, ) . furthermore, in the reporter gene assay v h -fc ab neutralized live sars-cov- with an ic of . µg/ml ( figure c ), which is much lower than that for ace -fc (ic of . µg/ml) and v h ab (ic = µg/ml). ace -fc seemed to be much less potent against the live virus compared to the pseudovirus, which is also observed by others (ic = . µg/ml) and may relate to the s expression levels and rbd/s conformation on the virus surface. 
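the half-maximal inhibitory concentrations compared above are read off dose-response curves. a minimal sketch of one common way to estimate such a value, log-linear interpolation between the two dilutions that bracket 50% inhibition, is given below. the dilution series and inhibition percentages are hypothetical, and this is not necessarily the fitting procedure used in the study; it also shows why a curve that plateaus below the chosen level yields no estimate.

```python
import math
from typing import Optional, Sequence


def interpolate_ic(concs: Sequence[float],
                   inhibition: Sequence[float],
                   level: float = 50.0) -> Optional[float]:
    """Concentration at which percent inhibition first crosses `level`.

    Interpolates linearly in log10(concentration) between the two
    neighbouring data points that bracket `level`. Returns None when
    the series never crosses the level (for example, a plateau below it).
    """
    if len(concs) != len(inhibition):
        raise ValueError("concs and inhibition must have the same length")
    if any(c <= 0 for c in concs):
        raise ValueError("concentrations must be positive")
    points = sorted(zip(concs, inhibition))
    for (c_lo, y_lo), (c_hi, y_hi) in zip(points, points[1:]):
        if y_lo < level <= y_hi:
            frac = (level - y_lo) / (y_hi - y_lo)
            log_c = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10.0 ** log_c
    return None


# Hypothetical ten-fold dilution series (ug/ml) and percent inhibition.
example_concs = [0.01, 0.1, 1.0, 10.0]
full_curve = [5.0, 30.0, 70.0, 95.0]     # reaches near-complete inhibition
partial_curve = [5.0, 20.0, 40.0, 45.0]  # plateaus below 50%

if __name__ == "__main__":
    print(interpolate_ic(example_concs, full_curve))
    print(interpolate_ic(example_concs, partial_curve))
```

interpolating on a log scale matches how serial dilutions are spaced; a full four-parameter fit would be preferable when enough points are available.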
we also confirmed the high v h -fc ab live virus neutralization potency by a microneutralization (mn) assay- % neutralization (nt ) at . µg/ml ( figure d ). the nt from the mn assay ( . µg/ml) was close to the ic ( . µg/ml) from the reporter gene assay suggesting consistency in the live virus neutralizing activity of v h -fc ab obtained with two independent assays at two different laboratories. these results suggest that v h -fc ab is a potent neutralizer of sars-cov- , which correlates with its strong competition with ace for binding to rbd. to evaluate the prophylactic efficacy of v h -fc ab in vivo, we used a recently developed mouse ace adapted sars-cov- infection model, in which wild type balb/c mice are challenged with sars-cov- carrying two mutations q t/p y at the ace binding interface in the rbd . it was shown that in this model, the aged balb/c mice exhibited more clinically relevant phenotypes than those seen in hace transgenic mice . groups of mice each were administered , , mg/kg v h -fc ab prior to high titer ( pfu) sars-cov- challenge followed by measurement of virus titer in lung tissue days post infection. v h -fc ab effectively inhibited sars-cov- in the mouse lung tissue in a dose dependent manner ( figure a ). there was complete neutralization of infectious virus at the highest dose of mg/kg, and statistically significant reduction by -fold at mg/kg. remarkably, even at the lowest dose of mg/kg it significantly decreased virus titer by fold (two tailed, unpaired t test, p = . ). to exclude possible effects of residual ab on viral titration, we performed another experiment in which mouse lungs were perfused with ml of pbs before harvesting for titration. the perfusion did not affect to any significant degree the infectious virus in the lungs ( figure b ). the v h -fc ab completely neutralized the virus in the lungs at mg/kg and significantly reduced infectious virus at mg/kg. 
v h -fc ab also reduced viral rna in the lungs ( figure c ). these results demonstrate the neutralization potency of v h -fc ab in vivo. they also suggest that the double mutations q t/p y on rbd did not influence v h -fc ab binding and contribute to the validation of the mouse adapted sars-cov- model for evaluation of neutralizing antibody efficacy. recently hamsters were demonstrated to recapitulate clinical features of sars-cov- infection (chan et al., ) (imai et al., ) . to evaluate the v h -fc ab efficacy in hamsters, it was intraperitoneally administered either hours before (prophylaxis) or hours after (therapy) intranasal tcid virus challenge. in the therapeutic group, the rationale for administration of the antibody six hours post viral infection is based on the replication cycle length of - hours after initial infection for sars-cov in veroe cells (keyaerts et al., ) . six hours after challenge with a high dose of tcid , approximately the same number of susceptible cells could become infected and likely produce much more infectious virus, which would need to be neutralized by the antibody to prevent subsequent cycles of infection. nasal washes and oral swab at , , days post infection (dpi) and different lung lobes at dpi were collected. v h -fc ab decreased viral rna by . log in the lung when administered prophylactically. the lung viral rna decrease in the therapeutic groups was slightly lower (by . log) ( figure d) . interestingly, the viral rna load in the therapeutic groups was to some extent tissue location dependent ( figure f ). the variation of the viral load in different lung lobes may relate to nonuniform antibody transport and viral spread inside the lung. remarkably, v h -fc ab alleviated hamster pneumonia and reduced the viral antigen in the lung (h&e staining, figure a and c and immunohistochemistry figure b and d). 
the control hamsters exhibited severe interstitial pneumonia characterized by extensive inflammatory cell infiltration, presence of type ii pneumocytes, alveolar septal thickening and alveolar hemorrhage. both prophylactic and therapeutic treatment with v h -fc ab reduced the lesions of alveolar epithelial cells, focal hemorrhage and inflammatory cell infiltration. v h -fc ab also reduced the shedding from mucosal membranes, including in nasal washes and oral swabs (figure s ). the decrease in viral rna in nasal washes and oral swabs was not as large as the decrease observed in the lung tissue, similar to a recent finding in hamsters (imai et al., ). overall, the prophylactic treatment was more effective than the therapeutic treatment in decreasing viral load in nasal washes and oral swabs. notably, prophylactic administration of v h -fc ab effectively reduced the infectious virus in the oral swab at dpi, while the post-exposure treatment did not (figure s c and g). interestingly, viral reduction (except the viral titer in the oral swab at dpi) was more effective at and dpi compared to that at dpi, likely due to the infection peak occurring before day as reported in hamsters (sia et al., ). a striking finding is that v h -fc ab given therapeutically at a dose as low as mg/kg can still decrease viral loads in the lung, nasal washes and oral swabs (figure s ). we measured the v h -fc ab concentrations at both doses ( and mg/kg) in the sera at dpi and dpi in the post-exposure treatment groups (figure s c). the higher dose ( mg/kg) resulted in higher antibody concentration and better inhibitory activity than the lower dose ( mg/kg). the relatively high concentration of v h -fc ab five days after administration also indicates good pharmacokinetics. 
furthermore, we also compared the v h -fc ab concentration in both the sera and lung with that of igg ab , which has a similar affinity to sars-cov- and similar degree of competition with the receptor ace as v h -fc ab . we found that the concentration of v h -fc ab in hamster sera is significantly higher than that of igg ab at and dpi after postexposure administration of the same dose of mg/kg ( figure e ), possibly indicating more effective delivery of v h -fc ab from the peritoneal cavity to the blood than that of igg ab . we also found that the v h -fc ab concentration in all hamster lung lobes was higher than that of the igg ab ( figure f ), suggesting that v h -fc ab appears to penetrate the lung tissue more effectively than igg ab . these results indicate that the in vivo delivery of v h -fc ab may be more effective than that of full-size antibodies in an igg format. the v h -fc ab propensity for aggregation was measured at °c by dynamic light scattering (dls), which detects particle size distributions in the nanometer range (stetefeld et al., ) . it displayed a single peak at . nm which is the size of a monomeric v h -fc protein ( figure s a ). the absence of large-size peaks corresponding to large molecular weight species (aggregates) in solution, indicates that v h -fc ab is highly resistant to aggregation at high concentration ( mg/ml) and relatively long times of incubation ( days) at °c. the v h -fc ab propensity for aggregation was also evaluated by size exclusion chromatography (sec), which showed that > % of v h -fc ab was eluted in a peak at a position corresponding to a monomeric state with a molecular weight of kda ( figure s b ). antibody nonspecificity and polyreactivity can be an obstacle for developing an antibody into a clinically useful therapeutic. polyreactivity may not only cause off-target toxicities and interfere with normal cellular functions, but may also reduce antibody half-life (chuang et al., ) . 
to test for potential polyreactivity of v h -fc ab , a membrane proteome array (mpa) platform was used, in which , different human membrane protein clones were separately overexpressed in t cells in a matrix array achieving a high-throughput detection of binding by facs. v h -fc ab did not bind to any of those proteins (figure s c), demonstrating its lack of polyreactivity and nonspecificity. interestingly, we did not detect v h -fc ab binding to the human fcγria, which is probably due to the relatively low expression level of fcγria on hek- t cell surface without concomitant expression of the common γ chain (van vugt et al., ). in addition, we found that v h -fc ab bound to the fcγrs much more weakly than igg (figure s ), likely due to the different conformation in the lower hinge region for fc fusion proteins compared to that of igg s (ying et al., b). for the fc fusion proteins (even with the same hinge sequence as igg ), binding to fcγrs may be different from that of igg , and can be affected by the fusion partners (lagassé et al., ). the importance of antibody binding to fcγrs for therapeutic or prophylactic efficacy or toxicity in sars-cov- infection is unknown. neutralizing mabs are promising for prophylaxis and therapy of sars-cov- infections. recently, many potent neutralizing antibodies from covid- patients were identified that neutralize pseudovirus with ic s ranging from to ng/ml, and replication-competent sars-cov- with ic s from to ng/ml (ju et al., ; rogers et al., ; shi et al., ; zost et al., ). by comparison, the v h -fc ab reported here exhibited comparable or better neutralizing potency against sars-cov- pseudovirus and live virus (ic s of ng/ml and ng/ml respectively). of note, ic s can vary widely between different assays and laboratories because there is no generally accepted standardized assay. in addition, there are many factors that contribute to potency and efficacy in vivo. 
animal models are a more comprehensive and likely more reliable predictor of potential efficacy in humans than in vitro neutralization assays. to our knowledge v h -fc ab is the first human antibody domain whose activity was validated in two animal models. in the mouse ace adapted sars-cov- infection model, v h -fc ab significantly decreased infectious virus by -fold at days post infection even at a very low dose of mg/kg ( figure a ). it also exhibited both prophylactic and therapeutic efficacy in a hamster model. it not only reduced the viral load in the lung and alleviated pneumonia; but it also reduced shedding in the upper airway (nasal washes and oral swab), which could potentially reduce transmission of sars-cov- . impressively, v h -fc ab was active therapeutically even at mg/kg. the finding that v h -fc ab persisted for days post administration at significant levels indicates that the pharmacokinetics of v h -fc ab is comparable to that of a full size antibody; the half-lives of fc fusion proteins were reported to vary from those of igg s and can range from hours to days (unverdorben et al., ) . the molecular weight of v h -fc ab ( kda) is half of that of full-size igg which suggests an advantage in terms of smaller quantities needed to be produced compared to those for igg s to reach similar number of molecules and efficacy. in addition, it was shown that decreasing binder's size exponentially increases its diffusion through normal and tumor tissues (jain, ) . thus, decreasing the size two-fold can increase diffusion through tissues by four-fold. we found that after administration at the same dose, the concentration of v h -fc ab was higher than that of igg ab in both hamster sera and lung tissue. this result might suggest that the v h -fc ab diffusion from the peritoneal cavity to the blood and penetration of lung may be faster than that of igg ab . this may further explain its efficacy at low doses in animals. 
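the size arguments above reduce to simple arithmetic, sketched below under stated assumptions. the molecular weights are hypothetical placeholders (a nominal full-size igg and a binder of half that mass), not values taken from the study, and the inverse-square dependence of diffusion on size is only the rule of thumb implied by the two-fold/four-fold statement in the text, not a measured law for these molecules.

```python
AVOGADRO = 6.02214076e23  # molecules per mole


def molecules_per_mg(mw_kda: float) -> float:
    """Number of molecules in 1 mg of a protein of molecular weight mw_kda (kDa)."""
    if mw_kda <= 0:
        raise ValueError("molecular weight must be positive")
    grams_per_mole = mw_kda * 1000.0   # 1 kDa corresponds to 1000 g/mol
    moles = 1.0e-3 / grams_per_mole    # 1 mg = 1e-3 g
    return moles * AVOGADRO


def relative_diffusion(size_ratio: float, exponent: float = 2.0) -> float:
    """Fold change in diffusion when size is multiplied by size_ratio.

    Assumes diffusion scales as size**(-exponent); exponent=2 reproduces
    the 'half the size, four-fold faster' statement in the text.
    """
    if size_ratio <= 0:
        raise ValueError("size_ratio must be positive")
    return (1.0 / size_ratio) ** exponent


# Hypothetical molecular weights, for illustration only.
FULL_SIZE_KDA = 150.0
HALF_SIZE_KDA = 75.0

molar_advantage = molecules_per_mg(HALF_SIZE_KDA) / molecules_per_mg(FULL_SIZE_KDA)
diffusion_gain = relative_diffusion(HALF_SIZE_KDA / FULL_SIZE_KDA)

if __name__ == "__main__":
    print(f"molecules per mg, relative to full size: {molar_advantage:.2f}x")
    print(f"assumed diffusion gain: {diffusion_gain:.2f}x")
```

at equal mass dose, the smaller binder therefore delivers proportionally more molecules, which is the basis of the production-quantity argument.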
although the low dose showed efficacy in the small animal models, it should be noted that in humans higher doses could be required to achieve a comparable degree of efficacy. another caveat is that in the hamster post-exposure experiment, the v h -fc ab was administered at a time (six hours) when the first round of virus replication was likely completed (keyaerts et al., ) , but before the infection peak at - days (sia et al., ) . because it inhibits infection of new cells, its administration at around the infection peak or later may not be as effective unless it also kills infected cells in vivo, which is under investigation.

recently, antibody domains including human v h and camelid v h h were reported to have varying neutralization potency (chi et al., ; sun et al., ; wrapp et al., ; wu et al., a) . compared to those domains, v h -fc ab is unique in terms of potency, aggregation resistance and specificity. v h -fc ab exhibited good developability properties including stability at high concentrations and during long incubation at °c, as well as absent or very low aggregation. in addition, v h -fc ab did not bind to the human cell line t even at high concentration ( µm), which is about -fold higher than its k d , indicating absence of non-specific binding to many membrane-associated human proteins. a similar result was obtained by the membrane protein array assay showing that v h -fc ab did not bind to any of , human membrane-associated proteins, indicating its lack of non-specific binding and thus low potential for off-target toxicity when used in vivo. besides, unlike camelid v h hs, the v h ab sequence is fully human and therefore likely less immunogenic.

multiple structures are now available for the sars-cov- s protein trimer in complex with various neutralizing antibodies, offering insight into antigenic epitopes and inhibitory mechanisms critical for s protein neutralization.
epitopes on the sars-cov- s protein rbd have emerged as effective targets, as evidenced by the action of several rbd binding antibodies including cr , b , c , cb , h , and s (barnes et al., ; lv et al., ; pinto et al., ; shi et al., ; wu et al., b) . while b , c , and cb directly compete with ace for binding sites on the rbd surface, h occupies a position distinct from these binding sites, precluding ace binding via steric inhibition . s targets the rbd of the s protein both in closed and open s protein conformations, exhibiting a different mechanism of neutralization (pinto et al., ) . a recent study of the structure of the s protein trimer in complex with the nanobody h -d (pdb id: z ) revealed full occupancy of the nanobody on all three rbds in a "one up and two down" conformation (huo et al., ) , similar to what we report here. our structural analysis demonstrates that the location of the v h ab bound to the trimeric s ectodomain directly overlaps the region that would be occupied by ace when bound to the s protein. the ace blocking is likely the major mechanism of the v h -fc ab neutralizing activity, which is significantly augmented by avidity effects due to its bivalency. the narrow neutralization concentration range in the live virus neutralization ( - ng/ml for %- % neutralization) ( figure d ) indicates a plausible cooperative neutralization mechanism, probably due to the synergistic binding of v h molecules in v h -fc ab to rbds. due to its small size, v h may facilitate targeting occluded epitopes on rbd that are otherwise inaccessible to full-length iggs, which is important because the sars-cov- s protein is conformationally heterogenous, exposing neutralizing epitopes to varying degrees . the structural analysis shows that v h ab is able to simultaneously target all three rbd epitopes in both "up" and "down" conformations, which may provide a structural basis for a unique cooperative neutralization mechanism for v h -fc ab . 
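the link drawn above between a narrow neutralization concentration range and cooperativity can be illustrated with a hill-type dose-response model. this is a sketch under the assumption that neutralization follows a hill curve; it is not the analysis performed in this study, and the parameter names (`ic50`, `n`) are illustrative.

```python
def hill_fraction(conc: float, ic50: float, n: float) -> float:
    """Fraction neutralized at concentration `conc` for Hill coefficient `n`."""
    return conc ** n / (ic50 ** n + conc ** n)


def fold_range(low: float, high: float, n: float) -> float:
    """Fold increase in concentration needed to go from fraction `low` to `high`.

    Inverting the Hill equation gives conc = ic50 * (f / (1 - f)) ** (1 / n),
    so the ratio of the two concentrations does not depend on ic50.
    """
    def odds(f: float) -> float:
        return f / (1.0 - f)
    return (odds(high) / odds(low)) ** (1.0 / n)
```

under this model, moving from 10% to 90% neutralization takes an 81-fold concentration increase when `n` is 1 but only a 9-fold increase when `n` is 2, so a steep, narrow curve is what a cooperative mechanism would be expected to produce.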
v h -fc ab with a long flexible linker between v h and fc may allow two v h molecules to bind simultaneously two protomers in the same s trimer or cross-link two different protomers from different s trimers. the ab epitope is distal to the cr epitope, explaining its lack of competition with cr . the ab contact residue f (l in sars-cov) is not conserved, which likely explains its lack of cross-reactivity to sars-cov. from the gisaid and ncbi databases, we found nine mutations in rbd with relatively high frequencies in currently circulating sars-cov- . six of them are in the core domain (f l, n d, n d/d y, v f, r i and w r) and three in the rbm (k r, g s, v a). the core domain mutations are far away from the ab epitope, and thus these mutations do not affect v h -fc ab binding to rbd. the three rbm mutations also did not affect ab binding although they are close to the ab epitope, suggesting that these mutations may not affect ab neutralizing activity, although neutralization of whole virus carrying these mutations is needed to demonstrate this definitively. interestingly, v h -fc ab effectively inhibited the mouse ace adapted sars-cov- with a q t/p y mutation in rbd, indicating that this double mutation also does not affect v h -fc ab binding to rbd. these results suggest that v h -fc ab may be a broadly cross-reactive sars-cov- neutralizing antibody.

in conclusion, we identified a fully human antibody v h domain that shows strong competition with ace for binding to rbd and potent neutralization of sars-cov- in vitro and in two animal models. this potent neutralizing activity combined with its specificity and good developability properties warrants its further evaluation for prophylaxis and therapy of sars-cov- infection. our elucidation of its unique epitope and mechanism of neutralization could also help in the discovery of more potent inhibitors and vaccines.
hamsters were bled at one and five dpi for measuring antibody concentrations in sera by sars-cov- s elisa. sera were diluted : and binding was detected by using goat anti-human igg-hrp. (f) viral rna levels in different lung lobes. rna quantity is presented as the tcid equivalence. experiments were performed in duplicate and the error bars denote ± sd, n = .

detailed methods are provided in the online version of this paper and include the following:
• key resources table

further information and requests for resources and reagents should be directed to and will be fulfilled by the lead contact, dimiter dimitrov (mit @pitt.edu). this includes antibodies, viruses, plasmids and proteins. all reagents will be made available on request after completion of a material transfer agreement. the antibody nucleotide sequence has been deposited in genbank with an accession number of mt . the antibody is only allowed for non-commercial use. all data supporting the findings of this study are available within the paper and from the corresponding author upon request.

vero e (crl- , american type culture collection (atcc)) and t (atcc) cells were cultured at °c in dulbecco's modified eagle medium (dmem) supplemented with % fetal bovine serum (fbs), mm hepes ph . , mm sodium pyruvate, and u/ml of penicillin-streptomycin. t cells stably expressing sars-cov- and human ace were cultured in dmem medium containing µg/ml zeocin. hek f and expi f cells were cultured in freestyle serum free medium (thermofisher, cat# ) and expi ™ expression medium (thermofisher, cat# a ), respectively. the sars-cov- spike pseudotyped hiv- backboned virus was packaged in t cells after transfection with the pnl - .luc.re and pcdna . s plasmids.
the sars-cov- (us_wa- / ) and sars-cov /canada/on/vido- / obtained from the centers for disease control and prevention were propagated in vero e cells. the recombinant sars-cov- -seattlenluc virus and the mouse ace adapted sars-cov- virus (carrying a q t/p y mutation in rbd) recovered by reverse genetics were produced in vero e cells. all work with infectious sars-cov- was performed in institutional biosafety committee approved bsl facilities using appropriate positive pressure air respirators and protective equipment.

the recombinant proteins sars-cov- rbd-his, rbd mutants, rbd-fc and ace -hfc were subcloned into pcdna . expression plasmids and expressed in expi f cells. proteins with a his tag were purified by ni-nta affinity chromatography and proteins with an fc tag were purified by protein a chromatography. protein purity was estimated as > % by sds-page and protein concentration was measured spectrophotometrically (nanovue, ge healthcare). the v h ab antibody was identified by panning of the phage library. v h -fc ab was constructed by fusing v h to human igg fc with the native igg hinge. igg ab was obtained by our lab through panning of a fab phage library. mers-cov-specific igg m and sars-cov antibody igg cr sequences from other groups were subcloned into the pdr plasmid for expression. v h ab (in a phagemid pcomb x with a flag tag) was expressed in hb e. coli and purified by ni-nta affinity chromatography. all other iggs were expressed in expi cells and purified with protein a chromatography.

for the mouse model, balb/c mice purchased from envigo (balb/cannhsd, stock# , immunocompetent, - months of age, female) were used for all experiments. they were drug/test naïve and negative for pathogens. bedding is biofresh with crinkle bedding added. hamsters have access to food and water ad libitum. food is lab diet p prolab rmh . cages are changed weekly or as needed and spot cleaned.
for the experiment, hamsters were intraperitoneally treated with v h -fc ab either hrs before (prophylaxis) or hrs after (therapy) intranasal challenge with × tcid of sars-cov- . nasal washes and oral swabs were collected at day , and post infection (dpi). hamsters were bled at and dpi. all hamsters were euthanized on dpi. at euthanasia, lungs were collected for rna isolation. for viral titer determination, a vero e cell tcid assay was used. for testing viral rna, rt-qpcr was used. for testing antibody concentration in sera and lung, sars-cov- s elisa was used. for histopathology, % formalin fixed and paraffin embedded tissues were processed with either hematoxylin and eosin stain (h&e) or immunohistochemistry (ihc). lung lobes were scored based on pathology using microscopy.

the sars-cov- s and the anti-sars-cov antibody igg cr and genes were synthesized by idt (coralville, iowa). mers-cov-specific igg m antibody was expressed in mammalian cells as described previously (ying et al., a) . briefly, igg m light chain and heavy chain fd were subcloned into the pdr vector containing dual promoters and an igg fc cassette. the recombinant plasmid was sequenced and transfected into expi cells for expression. the human angiotensin converting enzyme (ace ) gene was ordered from origene (rockville, md). the rbd domain (residues - ), s domain (residues - ) and ace (residues - ) genes were cloned in frame to human igg fc in the mammalian cell expression plasmid pcdna . . the rbd protein with an avitag followed by a ×his tag at the c-terminus was subcloned similarly. these proteins were expressed with the expi expression system (thermo fisher scientific) and purified with protein a resin (genscript) or nickel-nitrilotriacetic acid (ni-nta) resin (thermo fisher scientific). the fab cr antibody gene with a his tag was cloned into the pcat plasmid (developed in house) for expression in hb bacteria and purified with ni-nta resin.
protein purity was estimated as > % by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (sds-page) and protein concentration was measured spectrophotometrically (nanovue, ge healthcare).

unlike camel v h hs, which naturally evolved to be autonomously stable, human v h is usually unstable and prone to aggregation in the absence of v l (li et al., ; nguyen et al., ) . however, human v h can be selected or engineered for high stability and solubility. to facilitate identification of stable v h binders, we chose engineered germline v h - as our library scaffold (chen et al., b) . our human v h phage display library was made by grafting heavy chain cdr , , genes derived from healthy donors' peripheral blood mononuclear cells (pbmcs) and splenocytes (takara, cat. no. ) into their cognate positions of a stable scaffold (based on the germline v h - ) in a manner similar to the method we previously described but without mutagenesis of cdr (chen et al., a) . briefly, cdrs were pcr-amplified by using primers with degenerate adaptors covering the cdr edge regions from diverse v h families at one end, and with sequences annealing to the v h - framework (fr) regions at the other end. the pcr products were then assembled by overlap extension pcr using primers with homologous ends. the whole v h was assembled by overlapping fr -cdr -fr -cdr and fr -cdr -fr fragments. after assembly, the v h fragment was digested with sfi i and ligated into sfi i linearized pcomb x phagemid. the recombinant phagemid was then purified, desalted and concentrated for electroporation of tg bacteria, from which the v h phage particles were rescued and produced. the library size was determined by titering transformants. the library quality (diversity) was checked by sanger sequencing of hundreds of randomly picked v h clones and also evaluated by panning against diverse antigens. this library contains a very large number of clones ( ).
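determining library size by titering transformants, and checking diversity from sequenced clones, both come down to simple arithmetic. the sketch below shows one common way to do it; the function names and the example numbers are hypothetical and are not taken from this study.

```python
def library_size(colonies: int, dilution_factor: float,
                 plated_volume_ml: float, total_volume_ml: float) -> float:
    """Estimate total transformants from one countable titer plate.

    colonies: colonies counted on the plate
    dilution_factor: fold dilution of the recovery culture before plating
    plated_volume_ml: volume of diluted culture spread on the plate
    total_volume_ml: total volume of the electroporation recovery culture
    """
    cfu_per_ml = colonies * dilution_factor / plated_volume_ml
    return cfu_per_ml * total_volume_ml


def unique_fraction(sequences: list[str]) -> float:
    """Fraction of sequenced clones that are distinct (a crude diversity check)."""
    if not sequences:
        raise ValueError("no sequences supplied")
    return len(set(sequences)) / len(sequences)
```

for example, 120 colonies from 0.1 ml of a million-fold dilution of a 10 ml recovery culture would correspond to about 1.2e10 transformants; these are made-up numbers used only to show the calculation.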
for panning, the v h library was alternately panned against biotinylated rbd-his and rbd-fc proteins. rbd biotinylation occurred through biotin ligase (bira) mediated enzymatic conjugation of a single biotin on the avitag (glndifeaqkiewhe) (fairhead and howarth, ) . the panning was performed for rounds with input antigens of µg rbd-his, µg rbd-fc and . µg rbd-his for the st , nd and rd round, respectively. the panning process began with incubation of antigens with v h phage particles followed by washing with phosphate-buffered saline (pbs) containing . % tween- . bound phage pulled down by streptavidin-m -dynabeads were rescued by log-phase tg cells with the m ko helper phage. after the rd round of panning, positive clones were selected by soluble expression monoclonal (sem) elisa followed by sequencing (chen et al., b) . v h binders were further screened for their binding affinity, stability and ace competition. for conversion to fc-fusion, the v h gene was subcloned into the psectag b vector containing the human igg fc fragment. v h -fc ab was expressed as described above.

enzyme-linked immunosorbent assays (elisas). for detection of rbd biotinylation efficiency, horseradish peroxidase (hrp) conjugated streptavidin was used. for confirmation of the function of rbd-his after biotinylation, ng ace -fc was coated onto the plates followed by addition of serially diluted biotinylated rbd-his. hrp conjugated streptavidin was used for detection. for other elisas, the sars-cov- rbd (residues - ) protein was coated on -well plates (costar) at ng/well in pbs overnight at °c. for screening by sem elisa, clones randomly picked from the infected tg cells were incubated with immobilized antigen. bound phages were detected with hrp-conjugated mouse anti-flag tag ab (sigma-aldrich). for the v h -fc binding assay, hrp-conjugated goat anti-human igg fc (sigma-aldrich) was used for detection.
for the competition elisa with hace , nm of human ace -mouse fc was incubated with serially diluted v h , or v h -fc, and the mixtures were added to rbd coated wells. after washing, bound ace -mouse fc was detected by hrp-conjugated anti-mouse igg (fc specific) (sigma-aldrich). for evaluation of ace blocking of v h ab binding to rbd, nm v h ab was incubated with coated rbd in the presence of various concentrations of ace -his (sino biological), and the bound v h ab was detected by hrp conjugated anti-flag antibody. for evaluation of conformational changes of the epitope mapping rbd mutants, we used a mouse polyclonal anti sars-cov- rbd antibody (sino biological, cat. no. -mp ) and the human igg cr antibody. for measuring the binding of v h -fc ab to rbd mutants, ng rbd mutant was coated on -well plates and incubated with v h -fc ab , with binding detected by using hrp conjugated anti-human fc antibody. to evaluate the binding of v h -fc ab and igg ab to human fcγrs, recombinant human fcγria, iia, iiia were coated on -well plates followed by addition of biotinylated v h -fc ab and igg ab . binding was detected by streptavidin-hrp. all colors were developed by , ′, , ′tetramethylbenzidine (tmb, sigma) and stopped by m h so followed by recording absorbance at nm. experiments were performed in duplicate and the error bars denote ± sd.

blitz. antibody affinities and avidities were analyzed by the biolayer interferometry blitz (fortebio, menlo park, ca). for measuring v h ab affinity, the rbd-fc was mounted on the protein a sensor (fortebio: - ). nm, nm and nm v h ab were used for association. for measuring avidity of v h -fc ab , biotinylated rbd-fc was immobilized on streptavidin biosensors (fortebio: - ) for min and equilibrated with dulbecco's phosphate-buffered saline (dpbs) (ph = . ) to establish baselines. nm, nm and nm v h -fc ab were chosen for association.
the association was monitored for min and then the antibody was allowed to dissociate in dpbs for min. the association (k a ) and dissociation (k d ) rate constants were derived from sensorgram fitting and used to calculate the equilibrium dissociation constant (k d ). for the competitive blitz, nm v h -fc ab was loaded onto the rbd-fc coated sensor for s to reach saturation followed by dipping the sensor into a nm ace -fc or fab cr solution in the presence of nm v h -fc ab . the association was monitored for s. the signals from nm hace or cr binding to the rbd-fc coated sensor in the absence of v h -fc ab were independently recorded in parallel. competition was determined by the ratio of the signal in the presence of v h -fc ab to the signal in the absence of v h -fc ab (< . is considered to be competitive) (wu et al., a) .

(agilent, cat. no. ) . mutants were expressed and purified according to the abovementioned rbd purification procedures. elisa was used to evaluate the binding of these mutants compared to the wild type rbd.

a. expression and purification. the codon optimized sars-cov- p s protein ectodomain construct (genbank: yp_ . ) was c-terminally tagged with xhis and a twin strep tag and cloned into the mammalian expression vector pcdna . (synbio). hek f cells were grown in suspension culture using freestyle media (thermofisher) at °c in a humidified co incubator ( % co ). cells were transiently transfected at a density of x cells/ml using branched polyethylenimine (pei) (sigma) (portolano et al., ) . media was exchanged after h and supplemented with . mm valproic acid. supernatant was harvested by centrifugation after days, filtered and loaded onto a ml histrap hp column (cytiva). the column was washed with buffer ( mm tris ph . , mm nacl, mm imidazole) and the protein was eluted with buffer ( mm tris ph . , mm nacl, mm imidazole). purified protein was concentrated (amicon ultra kda cut off, millipore sigma) and loaded onto a superose column (cytiva) equilibrated with gf buffer ( mm tris ph . and mm nacl).
peak fractions were pooled and concentrated to . mg/ml (amicon ultra kda cut off, millipore sigma). purified s protein ectodomain ( . mg/ml) was mixed with v h ab ( . mg/ml) or soluble ace ( . mg/ml) and incubated on ice for mins. for the competition experiment, the s protein ( . mg/ml) was first incubated on ice with v h ab ( . mg/ml) for mins then followed by addition of ace ( . mg/ml) for another mins. the mixtures ( . µl) were applied to mesh copper grids coated with continuous ultrathin carbon. grids were plasma cleaned using an h /o gas mixture for s in a solarus plasma cleaner (gatan inc.) prior to adding the sample. samples were allowed to adsorb for s before blotting away excess liquid, followed by a brief wash with milliq h o. grids were stained by three successive applications of % (w/v) uranyl formate ( s, s, s). grids containing s protein ectodomain with v h ab , and s protein ectodomain mixed with both v h ab and soluble ace were imaged using a kv glacios transmission electron microscope (thermofisher scientific) equipped with a falcon camera operated in linear mode. using epu automated acquisition software (thermofisher scientific), -frame movies were collected at , x magnification (corresponding to a physical pixel size of . -) over a defocus range of - . to - . µm with an accumulated total dose of e -/Å /movie. grids containing purified s protein ectodomain ( . mg/ml) with soluble ace ( . mg/ml) were imaged using a kv glacios transmission electron microscope equipped with a ceta m cmos camera (thermofisher scientific). micrographs were collected at , x magnification (physical pixel . -) over a defocus range of - . to - . µm with a total dose of e -/Å using epu automated acquisition software. c. image processing. motion correction and ctf estimation were performed in relion ( . ) (scheres, ) . particles were picked by cryolo ( . . ) (wagner et al., ) with pre-trained model for negative stain data. 
after extraction, particles were imported to cryosparc live (v . . ) (punjani et al., ) and subjected to d classification and d heterogeneous classification. final density maps were obtained by d homogeneous refinement. figures were prepared using ucsf chimera (pettersen et al., ) .

after washing, v h ab binding was detected by pe conjugated anti-flag tag antibody.

to test antibody mediated inhibition of cell fusion, the β-galactosidase (β-gal) reporter gene based quantitative cell fusion assay was used (xiao et al., ) . in this assay, t-s cell expression of t rna polymerase was achieved by infection with vaccinia virus vtf . , while t-ace cell expression of t promoter controlled β-gal was obtained by infection with vaccinia virus vcb r. β-gal is expressed only after fusion of the two types of cells, which can be monitored by chromogenic reactions using a β-gal substrate. to assay cell-cell fusion, t cells stably expressing sars-cov- s ( t-s) were infected with t polymerase-expressing vaccinia virus (vtf - ), and t cells stably expressing ace ( t-ace ) were infected with vaccinia virus (vcb r lac-z) encoding t promoter controlled β-gal. two hours after infection, cells were incubated with fresh medium and transferred to °c for overnight incubation. the next day, t-s cells were pre-mixed with serially diluted antibodies or ace -fc at °c for h followed by incubation with t-ace cells at a : ratio for h at °c. cells were then lysed, and the β-gal activity was measured using the β-galactosidase assay kit (substrate cprg, g-biosciences, st. louis, mo) following the manufacturer's protocol. the fusion inhibition percentage was calculated from the sample reading (f), normalized to the maximal fusion reading (f max ) of t-s and t-ace cells in the absence of antibodies, using this formula:

$$\text{fusion inhibition}\ (\%) = \frac{f_{max} - f}{f_{max} - f_{blank}} \times 100\%$$

in which f blank refers to the od reading of t-s and t incubation wells. fusion inhibition percentage was plotted against antibody concentrations.
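the fusion inhibition formula above translates directly into code. the following is a minimal sketch; the function and argument names are illustrative, and the readings in the usage note are made-up values, not data from this study.

```python
def fusion_inhibition_percent(f: float, f_max: float, f_blank: float) -> float:
    """Percent inhibition of cell-cell fusion from plate readings.

    f:       reading of the sample well (cells plus antibody)
    f_max:   reading of maximal fusion (no antibody)
    f_blank: reading of the blank wells (no fusion expected)
    """
    window = f_max - f_blank
    if window == 0:
        raise ValueError("f_max and f_blank are equal; the assay has no signal window")
    return (f_max - f) / window * 100.0
```

a sample reading equal to `f_max` returns 0% inhibition and a reading equal to `f_blank` returns 100%; for instance, readings of f = 0.6, f max = 1.0 and f blank = 0.2 correspond to 50% inhibition.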
experiments were performed in duplicate and the error bars denote ± sd.

pseudovirus neutralization assay. the pseudovirus neutralization assay was performed based on previous protocols . briefly, hiv- backbone based pseudovirus was produced in t cells by co-transfection with a plasmid encoding sars-cov- s protein and a plasmid encoding the luciferase expressing hiv- genome (pnl - .luc.re) using pei. pseudovirus-containing supernatants were collected h later and concentrated using the lenti-x™ concentrator kit (takara, ca). the pseudovirus neutralization assay was then performed by incubation of sars-cov- pseudovirus with serially diluted antibodies or ace -fc for h at °c, followed by addition of the mixture into pre-seeded t-ace cells. the mixture was then centrifuged at × g for hour at room temperature. the medium was replaced hrs later. after h, luciferase expression was determined by bright-glo kits (promega, madison, wi) using a biotek synergy multi-mode reader (winooski, vt). cells only and virus only wells were included and used for normalization. the % pseudovirus neutralizing antibody titer (ic ) was calculated using graphpad prism . experiments were performed in duplicate and the error bars denote ± sd.

a microneutralization (mn) assay was used as previously described (agrawal et al., a; agrawal et al., b; du et al., ; du et al., ) . briefly, serial three-fold dilutions of individual monoclonal antibodies (mabs), in duplicate, were incubated with pfu of sars-cov or sars-cov- at room temperature for h before transferring into designated wells of confluent vero e cells grown in -well microtiter plates. vero e cells cultured with medium with or without virus were included as positive and negative controls, respectively. the mers-cov rbd-specific neutralizing m mab (ying et al., a) was used as an additional control. after incubation at °c for days, individual wells were observed under the microscope for virus-induced cytopathic effect.
the efficacy of individual mabs was expressed as the lowest concentration capable of completely preventing virus-induced cytopathic effect in % of the wells.

full-length viruses expressing luciferase were designed and recovered via reverse genetics as described previously (scobey et al., ; yount et al., ) . briefly, the sars-cov- rna from infected cell culture was reverse-transcribed and constructed into seven contiguous genomic cdna subclones with interconnecting junctions, which were then bsai/bsmbi digested and ligated into a full-length sars-cov- genome cdna through the cohesive ends. a silent mutation of t a was introduced into a conserved region in nsp to differentiate our recombinant viruses from the circulating sars-cov- strains through sanger sequencing. the reporter virus was synthesized by replacing a -bp region in orf with a gfp-fused nanoluciferase (nluc) gene. after assembly into full-length cdna, full-length rna was in vitro transcribed and electroporated into vero e cells. virus stocks were propagated on vero e cells in minimal essential medium containing % fetal bovine serum (hyclone) and supplemented with penicillin/kanamycin (gibco). viruses were titered in vero e usamrid cells to obtain a relative light units (rlu) signal of at least × the cell only control background. ab or ace -fc were serially diluted -fold up to eight dilution spots, starting at µg/ml, and were incubated with sars-cov-urbaninluc and sars-cov- -seattlenluc viruses at °c with % co for hour. then virus-antibody dilution complexes were added to the pre-seeded e usamrid cells ( , ) in duplicate. virus-only controls and cell-only controls were included in each neutralization assay plate. following infection, plates were incubated at °c with % co for hours. then cells were lysed and luciferase activity was measured via the nano-glo luciferase assay system (promega) according to the manufacturer's specifications.
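the luciferase readout described above, with virus-only and cell-only control wells on each plate, can be converted into percent neutralization as sketched below. the second function estimates the concentration that reaches a chosen neutralization level by log-linear interpolation between the two flanking dilutions; this is a simple stand-in and not the non-linear curve fit performed in graphpad prism. all names are illustrative, and the target level is left as an explicit argument rather than assumed.

```python
import math


def percent_neutralization(rlu: float, virus_only: float, cell_only: float) -> float:
    """Percent reduction in RLU relative to the virus-only control, background-subtracted."""
    window = virus_only - cell_only
    if window <= 0:
        raise ValueError("virus-only signal must exceed cell-only background")
    return (1.0 - (rlu - cell_only) / window) * 100.0


def interpolate_concentration(concs, neut_percent, target):
    """Concentration at which `target` % neutralization is reached.

    Interpolates linearly in log10(concentration) between the two dilutions
    that bracket the target. Returns None if the target is never bracketed.
    """
    points = sorted(zip(concs, neut_percent))
    for (c_lo, n_lo), (c_hi, n_hi) in zip(points, points[1:]):
        if n_lo != n_hi and min(n_lo, n_hi) <= target <= max(n_lo, n_hi):
            frac = (target - n_lo) / (n_hi - n_lo)
            log_c = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_c
    return None
```

replicate wells would normally be averaged before calling `percent_neutralization`, and a proper dose-response fit remains preferable when enough dilution points are available.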
sars-cov and sars-cov- neutralization ic were defined as the sample concentration at which a % reduction in rlu was observed relative to the average of the virus control wells. experiments were performed in duplicate and ic was obtained by non-linear fitting of neutralization curves in graphpad prism .

the mouse ace adapted sars-cov- variant was constructed by introduction of two amino acid changes (q t/p y) at the ace binding pocket in rbd. virus stocks were grown on vero e cells and viral titer was determined by plaque assay . groups of each of to -month old female balb/c mice (envigo, # ) were treated prophylactically ( hours before infection) by intraperitoneal injection with , , or mg/kg of v h -fc ab , respectively. mice were challenged intranasally with pfu of mouse-adapted sars-cov- . two days post infection, mice were sacrificed and lung viral titer was determined by the plaque assay. to exclude the impact of residual lung antibody on viral titration, mice were euthanized and perfused with ml of pbs via cardiac puncture before lung harvest for viral titration. for virus titration, the caudal lobe of the right lung was homogenized in pbs. the resulting homogenate was serially diluted and inoculated onto confluent monolayers of vero e cells, followed by agarose overlay. plaques were visualized via staining with neutral red on day post infection. to measure the viral rna in the lung, tissue homogenate lysed in trizol ls (thermofisher) was processed with the thermofisher trizol rna isolation protocol followed by rt-qpcr using the quantifast probe rt-pcr kit (qiagen) to amplify a portion of the upe gene. the % tissue culture infectious dose (tcid ) equivalents were estimated by running serial dilutions of known tcid standards.

infection. sars-cov /canada/on/vido- / was propagated on vero' cells using dmem with % fbs and µg/ml l-(tosylamido- -phenyl) ethyl chloromethyl ketone (tpck) trypsin.
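estimating tcid equivalents from serial dilutions of known standards, as mentioned above for the rt-qpcr readout, is usually done with a standard curve. the sketch below assumes the readout is a cycle threshold (ct) value that falls linearly with log10 of the input titer; that assumption, the function names and the example values are illustrative and are not details given in this paper.

```python
def fit_standard_curve(log10_titers, ct_values):
    """Ordinary least-squares fit of ct = slope * log10(titer) + intercept."""
    n = len(ct_values)
    if n < 2 or n != len(log10_titers):
        raise ValueError("need at least two paired standards")
    mean_x = sum(log10_titers) / n
    mean_y = sum(ct_values) / n
    sxx = sum((x - mean_x) ** 2 for x in log10_titers)
    sxy = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(log10_titers, ct_values))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def titer_equivalents(ct_sample: float, slope: float, intercept: float) -> float:
    """Invert the standard curve to get the titer equivalent of a sample."""
    return 10 ** ((ct_sample - intercept) / slope)
```

with made-up standards at log10 titers of 1, 2, 3 and 4 giving ct values of 35, 31.5, 28 and 24.5, the fitted slope is -3.5 per log10 and a sample at ct 28 maps back to a titer equivalent of 1000.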
infectious work with sars-cov- was approved by the biosafety protocol approval committee (bpac) at the university of saskatchewan and performed in the high containment laboratories at vido-intervac. male hamsters ( -week-old) were obtained from charles river (montreal, qc). for evaluation of prophylactic efficacy, all hamsters (n= ) were injected intraperitoneally with mg/kg of v h -fc ab hours prior to intranasal challenge with µl/nare containing a total of × tcid of sars-cov- . for the therapeutic group, hamsters were infected as above and treated intraperitoneally with mg/kg (n= ) or mg/kg (n= ) of v h -fc ab hours post-infection. untreated hamsters were kept as a control. nasal washes and oral swabs were collected at day , and post infection (dpi). hamsters were bled at and dpi. all hamsters were euthanized on dpi. at euthanasia, lung lobes were collected for virus titration and rna isolation.

for viral titer determination, nasal washes were diluted in a -fold dilution series and adsorbed on vero' cells in triplicate for hour at °c. the inoculum was removed and replaced with fresh dmem containing % fbs, pen/strep and µg/ml tpck. cytopathic effect was scored on day and day post infection. the limit of detection is . tcid . for testing viral rna, viral rna was isolated from nasal and oral swabs using the qiaamp viral rna mini kit (qiagen), and the quantifast probe rt-pcr kit (qiagen) was used to amplify a portion of the upe gene. for rna levels in tissues, mg of tissue homogenate in buffer rlt were processed with the rneasy kit (qiagen) followed by rt-qpcr as above. tcid equivalents were estimated by running serial dilutions of known tcid standards.

for testing ab concentrations post injection in hamster sera and lung tissue, sars-cov- spike- elisa was used. s protein was coated at µg/ml overnight at °c in pbs onto maxisorp plates (nunc). the following day plates were blocked with % skim milk and . % tween .
serum collected on day and day post-challenge was diluted : and absorbed for hour at °c. plates were washed and goat anti human igg-hrp was added. plates were washed and subsequently developed with opd (o-phenylenediamine dihydrochloride) substrate. optical density was measured at nm after mins of incubation. for lung tissues, after blocking homogenates were diluted : and absorbed overnight at °c followed by detection with anti-human igg-hrp and substrate as stated above. the control hamster lung homogenate was used for background correction. for histopathology on day p.i., % formalin fixed and paraffin embedded tissues were processed with either hematoxylin and eosin stain (h&e) or immunohistochemistry (ihc) for detection of sars-cov antigen; in ihc after blocking tissue slides were treated with anti-nucleocapsid rabbit polyclonal antibodies followed with anti-rabbit hrp antibody. (tucker et al., ) . the entire library of plasmids is arrayed in duplicate in a matrix format and transfected into hek- t cells, followed by incubation for h to allow protein expression. before specificity testing, optimal antibody concentrations for screening were determined by using cells expressing positive (membrane-tethered protein a) and negative (mock-transfected) binding controls, followed by flow cytometric detection with an alexa fluor-conjugated secondary antibody (jackson immunoresearch laboratories). based on the assay setup results, v h -fc ab ( µg/ml) was added to the mpa. binding across the protein library was measured on an ique (ann arbor, mi) using the same fluorescently labeled secondary antibody. to ensure data validity, each array plate contained positive (fc-binding; sars-cov- s protein) and negative (empty vector) controls. identified targets were confirmed in a second flow cytometric experiment by using serial dilutions of the test antibody. the identity of each target was also confirmed by sequencing. 
for the mouse model, the statistical significance of difference between v h -fc ab treated and control mice lung virus titers was determined by the two-tailed, unpaired, student t test calculated using graphpad prism . . a p value < . was considered significant. ** p < . . for the mice lung viral titer after perfusion, viral rna and hamster lung viral rna, statistical significance was determined by the mann-whitney u test. a p value < . was considered significant. ns: p > . , *p < . , **p < . , ***p < . . for comparing v h -fc ab and igg ab concentration, significance analysis was determined by the two-way anova followed by tukey test in graphpad prism . . a p value < . was considered significant. ns: p > . , *p < . , **p < . , ***p < . , ****p < . .
immunization with inactivated middle east respiratory syndrome coronavirus vaccine leads to lung immunopathology on challenge with live virus
passive transfer of a germline-like neutralizing human monoclonal antibody protects transgenic mice against lethal middle east respiratory syndrome coronavirus infection
sars-cov- vaccines: status report
structures of human antibodies bound to sars-cov spike reveal common epitopes and recurrent features of antibodies
potent neutralizing antibodies against sars-cov- identified by high-throughput single-cell sequencing of convalescent patients' b cells
the convalescent sera option for containing covid-
neutralizing antibody and soluble ace inhibition of a replication-competent vsv-sars-cov- and a clinical isolate of sars-cov-
simulation of the clinical and pathological manifestations of coronavirus disease (covid- ) in golden syrian hamster model: implications for disease pathogenesis and transmissibility
human domain antibodies to conserved sterically restricted regions on gp as exceptionally potent cross-reactive hiv- neutralizers
construction of a large phage-displayed human antibody domain library with a scaffold based on a newly identified highly soluble, stable heavy chain variable domain
humanized single domain antibodies neutralize sars-cov- by targeting spike receptor binding domain
eliminating antibody polyreactivity through addition of n-linked glycosylation
detection of novel coronavirus ( -ncov) by real-time rt-pcr
generation and characterization of alx- , a potent novel therapeutic nanobody for the treatment of respiratory syncytial virus infection
a mouse-adapted sars-cov- model for the evaluation of covid- medical countermeasures
a truncated receptor-binding domain of mers-cov spike protein potently inhibits mers-cov infection and induces strong neutralizing antibody responses: implication for developing therapeutics and vaccines
a conformation-dependent neutralizing monoclonal antibody specifically targeting receptor-binding domain in middle east respiratory syndrome coronavirus spike protein
site-specific biotinylation of purified proteins using bira
in vivo imaging with antibodies and engineered fragments
a sequence homology and bioinformatic approach can predict candidate targets for immune responses to sars-cov-
properties, production, and applications of camelid single-domain antibody fragments
identification of a critical neutralization determinant of severe acute respiratory syndrome (sars)-associated coronavirus: importance for designing sars vaccines
sars-cov- reverse genetics reveals a variable infection gradient in the respiratory tract
neutralizing nanobodies bind sars-cov- spike rbd and block interaction with ace
syrian hamsters as a small animal model for sars-cov- infection and countermeasure development
physiological barriers to delivery of monoclonal antibodies and other macromolecules in tumors
an emerging coronavirus causing pneumonia outbreak in wuhan, china: calling for developing therapeutic and prophylactic strategies
occupancy and mechanism in antibody-mediated neutralization of animal viruses
fc-fusion drugs have fcγr/c q binding and signaling properties that may affect their immunogenicity
structure of the sars-cov- spike receptor-binding domain bound to the ace receptor
neutralization of sars-cov- spike pseudotyped virus by recombinant ace -ig
potent neutralization of sars-cov- in vitro and in an animal model by a human monoclonal antibody. biorxiv : the preprint server for biology
antibody aggregation: insights from sequence and structure. antibodies (basel)
bat origin of a new human coronavirus: there and back again
neutralizing antibodies isolated by a site-directed screening have potent protection on sars-cov- infection
structural basis for neutralization of sars-cov- and sars-cov by a potent therapeutic antibody. science
antibody fragments: hope and hype
camel heavy-chain antibodies: diverse germline v(h)h and specific mechanisms enlarge the antigen-binding repertoire
engineered autonomous human variable domains
antiviral monoclonal antibodies: can they be more than simple neutralizing agents?
ucsf chimera--a visualization system for exploratory research and analysis
cross-neutralization of sars-cov- by a human monoclonal sars-cov antibody
recombinant protein expression for structural biology in hek f suspension cells: a novel and accessible approach
mutations in spike protein of sars-cov- modulate receptor binding
cryosparc: algorithms for rapid unsupervised cryo-em structure determination
the sars-cov- receptor-binding domain elicits a potent neutralizing response without antibody-dependent enhancement
isolation of potent sars-cov- neutralizing antibodies and protection from disease in a small animal model
convalescent plasma in covid- : possible mechanisms of action
relion: implementation of a bayesian approach to cryo-em structure determination
reverse genetics with a full-length infectious cdna of the middle east respiratory syndrome coronavirus
a human neutralizing antibody targets the receptor-binding site of sars-cov-
pathogenesis and transmission of sars-cov- in golden hamsters
dynamic light scattering: a practical guide and applications in biomedical sciences
potent neutralization of sars-cov- by human antibody heavy-chain variable domains
potent binding of novel coronavirus spike protein by a sars coronavirus-specific human monoclonal antibody
isolation of state-dependent monoclonal antibodies against the -transmembrane domain glucose transporter using virus-like particles
pharmacokinetic properties of igg and various fc fusion proteins in mice
fcr gamma-chain is essential for both surface expression and function of human fc gamma ri (cd ) in vivo
sphire-cryolo is a fast and accurate fully automated particle picker for cryo-em
structural basis for potent neutralization of betacoronaviruses by single-domain camelid antibodies
identification of human single-domain antibodies against sars-cov-
a noncompeting pair of human neutralizing antibodies block covid- virus binding to its receptor ace
the sars-cov s glycoprotein: expression and functional characterization
structural basis for the recognition of sars-cov- by full-length human ace
exceptionally potent neutralization of middle east respiratory syndrome coronavirus by human monoclonal antibodies
monomeric igg fc molecules displaying unique fc receptor interactions that are exploitable to treat inflammation-mediated diseases
reverse genetics with a full-length infectious cdna of severe acute respiratory syndrome coronavirus
a highly conserved cryptic epitope in the receptor-binding domains of sars-cov- and sars-cov
a safe and convenient pseudovirus-based inhibition assay to detect neutralizing antibodies and screen for viral entry inhibitors against the novel human coronavirus mers-cov
potent cross-reactive neutralization of sars coronavirus isolates by human monoclonal antibodies
potently neutralizing and protective human antibodies against sars-cov-
key: cord- - g zcxaa authors: chi, xiaojing; liu, xiuying; wang, conghui; zhang, xinhui; li, xiang; hou, jianhua; ren, lili; jin, qi; wang, jianwei; yang, wei title: humanized single
domain antibodies neutralize sars-cov- by targeting the spike receptor binding domain date: - - journal: nat commun doi: . /s - - - sha: doc_id: cord_uid: g zcxaa severe acute respiratory syndrome coronavirus (sars-cov- ) has spread worldwide, causing an unprecedented medical burden and loss of life. neutralizing antibodies efficiently block viral infection and are a promising category of biological therapies. here, using the sars-cov- spike receptor-binding domain (rbd) as bait, we generate a panel of humanized single domain antibodies (sdabs) from a synthetic library. these sdabs bind with equilibrium dissociation constants (k(d)) of . – . nm. the monomeric sdabs show half maximal neutralization concentrations (ec( )) of . – . µg/ml and . – . µg/ml against sars-cov- pseudotypes and authentic sars-cov- , respectively. competitive ligand-binding experiments suggest that the sdabs either completely block or significantly inhibit the association between the sars-cov- rbd and the viral entry receptor ace . fusion of the human igg fc to the sdabs improves their neutralization activity by up to ten times. these results support neutralizing sdabs as a potential alternative for antiviral therapies. coronavirus disease is caused by infection with the emerging severe acute respiratory syndrome-associated coronavirus (sars-cov- ) and has been declared by the world health organization to be the first coronavirus pandemic in human history. the severity of covid- symptoms ranges from asymptomatic or mild to severe, with an estimated mortality rate from less than % to up to % of patients depending on various factors. sars-cov- is spreading rapidly and sustainably around the world, urging prompt global action to develop vaccines and antiviral therapeutics. the sars-cov- polyprotein shares ~ . % identity with sars-cov (genbank id: aas . ) and is classified into the genus betacoronavirus in the family coronaviridae.
sars-cov- is an enveloped, positive-sense, single-stranded rna virus with a large genome of approximately , nucleotides in length. the virus-encoded membrane (m), spike (s), and envelope (e) proteins constitute the majority of the protein that is incorporated into the sars-cov- envelope lipid bilayer. the s protein forms homotrimers that protrude from the envelope, giving the virion its coronal appearance, and mediates invasion of susceptible cells by binding the sars-cov- entry receptor angiotensin converting enzyme (ace ). recently, researchers have resolved the molecular structure of the sars-cov- s protein. it is composed of amino acids and structurally belongs to the type i membrane fusion proteins, with two regions, s and s . the s region mainly includes the receptor binding domain (rbd), while the s region is necessary for membrane fusion. the rbd structure determines its binding efficiency with ace and provides an important target for neutralizing antibody recognition. single domain antibodies (sdabs), also known as nanobodies, were initially identified from camelid or cartilaginous fish heavy-chain-only antibodies devoid of light chains, where antigen binding is mediated exclusively by a single variable domain (vhh). sdabs are therefore the smallest fragments that retain the full antigen-binding capacity of the antibody, with advantageous properties as drugs, imaging probes and diagnostic reagents. short development time, flexible formatting and robust production efficiency make sdabs a powerful means to combat infectious disease pandemics. for therapeutic purposes, relatively sophisticated humanization techniques have been adopted to change the camelid-specific amino acid sequences in the framework to their human heavy chain variable domain equivalents, reducing species heterogeneity without altering the sdab's biological and physical properties. as sars-cov- is an emerging human virus, the whole population is susceptible due to the lack of protective antibodies.
the existing neutralizing antibodies in convalescent plasma have been adopted as powerful therapeutic alternatives for covid- patients. in this study, using a synthetic humanized sdab discovery platform, we obtain several high-affinity sars-cov- rbd-targeting sdabs with the desired neutralization activities. the results illustrate the potential of a synthetic sdab library as a resource for antiviral molecules that can be rapidly accessed in a pandemic. these sdabs are potential candidates for future anti-sars-cov- antibody cocktails. identification of sars-cov- rbd binding sdabs. sars-cov- uses its envelope s glycoprotein to gain entry into host cells through binding ace . recent cryo-em research revealed that the s protein forms an asymmetrical homotrimer with a single rbd in the "up" conformation and the other two "down". antibodies may take advantage of this rbd structure to block virus entry. to enrich for sars-cov- rbd binding sdabs, we performed four rounds of biopanning with recombinant rbd protein using an in-house, fully synthetic, humanized phage display library. after phage elisa identification of clones, a number of sdabs exhibited an excellent affinity for sars-cov- rbd (supplementary table ). five distinct sdab sequences ( e , f , f , d , and f ) were cloned into a prokaryotic expression vector and recombinant sdab proteins were purified by nickel-charged sepharose affinity chromatography (fig. a). the humanized sdabs obtained in this study are about amino acids long, with a single vhh domain and an average molecular weight of less than kda (fig. a). the sdabs consist of three complementarity determining regions (cdrs) and four framework regions (frs). the amino acids in the frameworks have been maximally humanized, except for residues phe- and ala- (numbers refer to the international immunogenetics information system amino acid numbering (imgt.cines.fr)) in framework- , which were retained to maintain proper antigen affinity and best stability.
framework residues are illustrated in supplementary fig. . surface plasmon resonance (spr) technology is widely accepted as a gold standard for characterizing antibody-antigen interactions. to determine the kinetic rate and affinity constants, detailed analysis of spike rbd binding to purified sdab proteins was carried out by spr. the sars-cov- or sars-cov rbd protein was immobilized on the surface of a biacore cm chip. then, various concentrations of purified sdabs were prepared and injected over the surface. the sensorgram data were fitted to a : steady-state binding model. the spr results demonstrated that the equilibrium dissociation constants (k d ) of sdabs e , f , f , d , and f for the sars-cov- rbd protein were . nm, . nm, . nm, . nm, and . nm, respectively (fig. b-f, h). however, the sdabs showed no binding to sars-cov rbd, except for clone f , which showed a relatively low affinity with k d = . nm (fig. g, h). overall, as monovalent antibody fragments, the sdabs identified in this study reveal a satisfactory binding performance in a sars-cov- specific manner. neutralization of sars-cov- by rbd-specific sdabs. to further evaluate the neutralization activity of these sdabs, a sars-cov- spike-pseudotyped particle (sars-cov- pp) infectivity assay was first established. pseudotyped particles are chimeric virions that consist of a surrogate viral core with a heterologous viral envelope protein at their surface; they can be handled at biosafety level (bsl- ) and are a frequently used tool for studying virus entry mechanisms and neutralizing antibodies. we observed that all five sdabs inhibited sars-cov- pp infection, with ec (half maximal neutralization concentration) ranging from . to . µg/ml (fig. a). we next tested the neutralization activity of the sdabs with sars-cov- live virus (fig. b). the copy number of viral rna present in the cell culture supernatant was used as a proxy for viral replication.
similarly, these sdabs showed comparable neutralization efficiency, with ec at approximately . - . µg/ml. overall, these monovalent sdabs demonstrated encouraging neutralization activity against both pseudotyped and authentic virus, although the neutralization potencies in the two assays do not match completely (fig. c). this phenomenon has also been reported for middle east respiratory syndrome coronavirus (mers-cov) neutralizing antibodies and may be explained by differences in the rbd spatial epitope recognized by each sdab or by the steric hindrance formed by the antigen-antibody complex. interference of the ace -rbd interaction by the sdabs. within the sars-cov- rbd, the receptor binding motif (rbm) directly contacts ace . a recent report demonstrated that sars-cov- uses ace as its receptor with a much stronger affinity ( -fold to -fold higher) than sars-cov. to determine whether the sdabs targeted different antigenic regions on the sars-cov- rbd surface, we performed a competition-binding assay using a real-time biosensor (fig. ). we tested all five sdabs in a competition-binding assay in which human ace was attached to a cm biosensor. compared with a non-related isotype control sdab (fig. a), addition of e and d completely prevented binding of sars-cov- rbd to ace (fig. b, e), whereas sdabs f , f , and f partially competed with the rbd/receptor association (fig. c, d, f). these data suggest that the sdabs can be divided into rbm-targeting and non-rbm-targeting groups, although this grouping is not directly associated with either affinity or virus neutralization activity. this lays a foundation for further development of bispecific neutralizing antibodies to overcome potential virus mutation in the future. inhibition of sars-cov- entry by fc-fused sdabs.
sdabs can be readily fused to a human igg fc domain to overcome the limitations of monovalent sdabs, such as the short blood residence time and the lack of antibody-dependent cell-mediated cytotoxicity and complement-dependent cytotoxicity. in addition, bivalent sdabs can be obtained via disulfide bond formation in the fc hinge area, which has been reported to increase sdab activity. to further explore the possibility of sdab-based antiviral therapeutics and enhance neutralization activity, we constructed human heavy chain antibodies by fusing the human igg fc region to the c-terminus of the sdabs (fig. a, b). these fc fusion sdabs were produced in mammalian cells with supernatant yields of around - µg per milliliter in shaking flasks. fc fusion sdabs in culture supernatants were affinity purified with hitrap protein a hp antibody purification columns (supplementary fig. ) and analyzed under both reducing and non-reducing conditions by western blot using an anti-human igg to detect fc. as shown in fig. c, the size of the intact sdab-fc is around kda under non-reducing conditions, but a kda monomer was observed after treatment under reducing conditions to break the disulfide bonds. this suggests correct expression and secretion of the heavy chain antibodies, consistent with our design. neutralization assay results showed that genetic fusion of human fc could maintain or increase the neutralization activity of these sdabs by up to -fold in molar concentration of ec in the sars-cov- pp entry assay (fig. d and supplementary fig. ). importantly, all fc-fused sdabs demonstrated potency with ec at the sub-nanomolar level (fig. d). finally, we showed that some of the sdabs are suitable for immunofluorescence staining (supplementary fig. ) and western blot to detect ectopically expressed sars-cov- s protein (supplementary fig. ).
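because the fold improvement of the fc fusions is stated in molar terms while the monomer ec values are reported in µg/ml, the comparison involves a mass-to-molar conversion. the following is a minimal sketch of that arithmetic; the function names, molecular weights and ec values are hypothetical placeholders chosen for illustration and are not taken from the study.

```python
def ug_per_ml_to_nM(conc_ug_per_ml, mw_kda):
    """Convert a mass concentration to nanomolar.

    1 ug/ml = 1e-3 g/L; dividing by the molar mass (mw_kda * 1000 g/mol)
    gives mol/L, and multiplying by 1e9 gives nM, i.e. conc * 1000 / mw_kda.
    """
    return conc_ug_per_ml * 1000.0 / mw_kda


def molar_fold_improvement(ec50_mono_ug_ml, mw_mono_kda, ec50_fc_ug_ml, mw_fc_kda):
    """Ratio of monomer EC50 to Fc-fusion EC50, both expressed in nM."""
    mono_nM = ug_per_ml_to_nM(ec50_mono_ug_ml, mw_mono_kda)
    fc_nM = ug_per_ml_to_nM(ec50_fc_ug_ml, mw_fc_kda)
    return mono_nM / fc_nM


# Hypothetical values: a 15 kDa monomer and an 80 kDa bivalent Fc fusion.
print(ug_per_ml_to_nM(0.3, 15.0))                      # monomer EC50 in nM
print(molar_fold_improvement(0.3, 15.0, 0.16, 80.0))   # molar fold improvement
```

the point of the sketch is that a fusion protein several times heavier than the monomer can look only modestly better in µg/ml and still be markedly more potent per molecule.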
given the disease severity and rapid global spread of covid- , there is an urgent need for development of vaccines, monoclonal antibodies, and small-molecule direct-acting antiviral medications. neutralizing antibodies directly target the viral envelope protein, precisely block the virus-receptor association, and inhibit virus entry through a variety of molecular mechanisms. in this study, we isolated and characterized several humanized neutralizing sdabs that exhibit one-digit to two-digit nanomolar or even subnanomolar ec against sars-cov- using both pseudotyped and infectious viruses. sdabs have been investigated as important therapeutic alternatives against viral infection because of their high yield, low cost and intrinsic stability. for mers-cov, neutralizing sdabs were isolated from immunized dromedary camels or llamas and demonstrated ec values between . and . µg/ml with low k d values ( . - nm). comparable inhibition efficiency on sars-cov- pp and affinity kinetics were obtained for the sdabs identified in this study using a nonimmune library, which can speed up the discovery of neutralizing antibodies in an emergent outbreak. with further optimization and an increase in library size and diversity, synthetic sdab library technology will accelerate the discovery of powerful therapeutic antibodies. the fda approved the first sdab-based medicine for adults with acquired thrombotic thrombocytopenic purpura in . considering the cost and potential risks of full human antibodies in some viral diseases, such as dengue virus infection, sdab fragments are a novel category of therapeutic molecules that can be readily reconstructed in a tandemly linked format to increase their blood residence time and biological activity, and to eliminate underlying concerns about antibody-dependent enhancement (ade) of viral infection.
in addition to being used as an injectable drug, the stable sdabs can also be developed into aerosolized inhalation and disinfection products for the prevention of covid- . moreover, until covid- vaccines succeed, sdab-based adenovirus or adeno-associated virus gene therapy might provide long-term passive immune protection in vulnerable populations, health care workers, or severely affected areas. since mature covid- animal models have not yet been developed, this study did not involve in vivo studies. as a next step, crystal structure analysis of the antigen-antibody complexes is planned. in conclusion, the neutralizing antibodies discovered in this study could lead to new specific antiviral treatments and shed light on the design and optimization of covid- vaccines. library design and construction. a synthetic sdab phage display library was used for the screening of sars-cov- neutralizing antibodies. to minimize a possible antigenic effect from camelid sequences, sdab frameworks (frs) for library construction were determined according to a universal humanized scaffold architecture, and the sequences of the frs are illustrated in supplementary fig. . briefly, residues in frs , , and were mutated to match human heavy chain vh to the maximum extent. in fr , humanization of residues at positions and was adopted to increase the stability of the sdabs, whereas residues and were kept camelid because of their critical impact on antigen affinity and/or stability (supplementary fig. ). for the design of variable regions, we analyzed a robust cdr repertoire from immune or naïve llama vhh clones. synthetic diversity was introduced in the three cdrs by positioned nucleotide assembly, avoiding cysteine and stop codons. a constant length of amino acids was selected for cdr and cdr , and amino acids for cdr (supplementary fig. ).
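the constraint stated above for cdr diversification (no cysteine and no stop codons) can be illustrated with a small sketch. this is not the positioned nucleotide assembly used for the library: it assumes uniform sampling over the 19 remaining amino acids, and the cdr length of 8, the function names and the random seed are hypothetical illustration values.

```python
import itertools
import random

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}

# Drop cysteine codons and stop codons ("*"), as the library design requires.
ALLOWED_CODONS = {c: aa for c, aa in CODON_TABLE.items() if aa not in "C*"}
ALLOWED_RESIDUES = sorted(set(ALLOWED_CODONS.values()))


def random_cdr(length, rng):
    """Return (dna, protein) for one randomized CDR of the given length.

    Assumption: each position picks a residue uniformly, then one of its codons.
    """
    dna, protein = [], []
    for _ in range(length):
        residue = rng.choice(ALLOWED_RESIDUES)
        codons = sorted(c for c, aa in ALLOWED_CODONS.items() if aa == residue)
        dna.append(rng.choice(codons))
        protein.append(residue)
    return "".join(dna), "".join(protein)


def theoretical_diversity(length):
    """Number of distinct protein sequences for one CDR of this length."""
    return len(ALLOWED_RESIDUES) ** length


rng = random.Random(0)          # fixed seed only to make the sketch repeatable
dna, protein = random_cdr(8, rng)
print(dna, protein, theoretical_diversity(8))
```

comparing `theoretical_diversity` for all three cdrs with the number of transformants obtained gives a rough sense of how sparsely a real library samples the designed sequence space.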
frameworks and cdrs were assembled using only cycles of overlapping polymerase chain reaction (pcr) to prevent drift during amplification. the diversified sdab mixture was cloned into the phagemid vector fadl- (antibody design labs, san diego, ca, usa) using sfii/bgli sites, with the pelb leader peptide sequence fused to the n-terminus of the sdabs. the ligation product was purified and used to transform electrocompetent e. coli tg cells. a total of electroporations were performed at v, mf, w. each electroporation was resuspended in ×yt and incubated with shaking for h at °c; the cultures were then combined and plated onto more than a thousand agar petri dishes ( mm) to ensure a sufficient library size. library size was calculated by plating serial dilution aliquots, and at least . × individual recombinant clones were obtained. quality control was carried out by sequencing more than clones. more than clones were full-length and unique sdabs, and fewer than clones showed various errors, such as vector self-ligation, reading frame shift and fragment deletion. antibody selection by phage display. screening for sars-cov- rbd targeting antibodies was performed by panning in both immunotubes and native conditions using a proprietary fully synthetic library of humanized sdabs with high diversity, according to a standard protocol. briefly, for the nd and th panning rounds, the purified sars-cov- rbd protein fused with mouse fc was coated on nunc maxisorp immuno tubes (thermofisher) at around µg/ml in pbs overnight. for the st and rd panning rounds, rbd protein was first biotinylated with ez-link™ sulfo-nhs-lc-biotin (thermofisher) and then selected with streptavidin-coated magnetic dynabeads™ m- (thermofisher). the tubes or beads were blocked using % w/v skimmed milk powder in pbs (mpbs). after rinsing with pbs, about × flowed over the chip surface. after each cycle, the sensor surface was regenerated with mm glycine-hcl ph . .
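for the spr affinity measurements, a 1:1 steady-state analysis amounts to fitting the equilibrium response at each analyte concentration to a langmuir isotherm, req = rmax * c / (k d + c). the study used the biacore evaluation software; the following is only a dependency-free sketch of the same model, in which rmax is solved in closed form for each trial k d and k d is located by a golden-section search on a log scale. the function name, search bounds and synthetic data are assumptions made for illustration.

```python
import math


def fit_steady_state(concs_nM, responses):
    """Least-squares fit of Req = Rmax * C / (KD + C); returns (KD_nM, Rmax)."""

    def rmax_and_sse(log10_kd):
        kd = 10.0 ** log10_kd
        occupancy = [c / (kd + c) for c in concs_nM]
        # For a fixed KD the model is linear in Rmax, so Rmax has a closed form.
        rmax = sum(r * f for r, f in zip(responses, occupancy)) / sum(f * f for f in occupancy)
        sse = sum((r - rmax * f) ** 2 for r, f in zip(responses, occupancy))
        return rmax, sse

    # Golden-section search for the KD that minimizes the residual sum of squares.
    lo = math.log10(min(concs_nM) / 100.0)
    hi = math.log10(max(concs_nM) * 100.0)
    ratio = (math.sqrt(5.0) - 1.0) / 2.0
    for _ in range(100):
        left = hi - ratio * (hi - lo)
        right = lo + ratio * (hi - lo)
        if rmax_and_sse(left)[1] < rmax_and_sse(right)[1]:
            hi = right
        else:
            lo = left
    best = (lo + hi) / 2.0
    return 10.0 ** best, rmax_and_sse(best)[0]


# Synthetic, noise-free example with a hypothetical KD of 20 nM and Rmax of 100 RU.
concs = [3.125, 6.25, 12.5, 25.0, 50.0, 100.0]
resp = [100.0 * c / (20.0 + c) for c in concs]
kd, rmax = fit_steady_state(concs, resp)
print(round(kd, 3), round(rmax, 3))
```

the search assumes a single minimum within two decades either side of the tested concentration range, so a k d far outside the titration would not be recovered reliably.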
the data were fitted to a : interaction steady-state binding model using the biaevaluation . software. for competition-binding assays, the ace protein was diluted in mm sodium acetate buffer, ph . , and immobilized on the chip at about response units. for the analyses, the his-tagged sars-cov- rbd protein was diluted in hbs-ep buffer or hbs-ep buffer with nm antibody ( e , f , f , d , or f ). the rbd in the different buffers was flowed over the chip surface at gradient concentrations ( , . nm, nm, nm, and nm). after each cycle, the sensor surface was regenerated with mm glycine-hcl ph . . the binding kinetics were analyzed with the biaevaluation software using a : binding model. sars-cov- spike pseudotyped particle (sars-cov- pp). to produce sars-cov- pp, hek t cells were seeded day prior to transfection at . × cells in a -cm plate. the next day, cells were transfected using lipofectamine (thermofisher). the plasmid dna transfection mixture ( ml) was composed of µg of pnl- . -luc-e − r − and µg of pcdna-sars-cov- -s, which was purchased from sino biologicals and reconstructed by deletion of the amino acid cytoplasmic tail. a nonenveloped lentivirus particle (bald virus) was also generated as a negative control. h after transfection, the media was replaced with fresh media supplemented with % fbs. supernatants containing sars-cov- pp were typically harvested at - h after transfection and then filtered through a syringe filter ( . µm) to remove any cell debris. sars-cov- pp was used fresh or aliquoted and frozen at − °c. to conduct the virus entry assay, t cells were transiently transfected with a human ace expression plasmid, and × cells were seeded in each well of a -well plate day prior to transduction. the next day, µl of supernatant containing sars-cov- pp was added into each well in the absence or presence of serially diluted sdabs or human igg fc-fused sdabs.
forty-eight hours after transduction, the cells were lysed in µl of passive lysis buffer and µl of lysate was incubated with µl of luciferase assay substrate according to the manufacturer's instructions (promega, madison, wi, usa). ethics statement and virus isolation. sars-cov- was isolated from bronchoalveolar lavage fluid (balf) from a covid- patient in the jin yin-tan hospital of wuhan as reported previously. briefly, the patient was a -year-old man who reported a high fever and cough, with little sputum production, at the onset of illness. he had a continuous fever and developed severe shortness of breath days later. the balf sample was collected from this hospitalized patient by nurses according to a standard procedure in which a bronchoscope is passed through the mouth into the lungs to obtain cells and other components from the bronchial and alveolar spaces. the clinical protocol was conducted in accordance with the declaration of helsinki and was approved by the national health commission of china and the ethics commission of the jin yin-tan hospital of wuhan (no. ky- - . ). clearing the airway and collection of balf were performed as standard of care and for clinical etiological diagnosis. therefore, the requirement for written informed consent was waived by the ethics commission of the designated hospital, given the context of emerging infectious diseases. for the isolation and identification of potential pathogens, the balf specimens were filtered and inoculated onto vero cells. all cultures were observed daily for a cytopathic effect (cpe). cpe was observed in % of vero cells after two passages. the viral particles in culture supernatants were characterized by negative-staining electron microscopy. the isolated sars-cov- was obtained from the patient by dr.
lili ren and the full-length virus sequence was deposited in the gisaid database with accession id epi_isl_ , which is identical to genbank accession number mn . gisaid is a globally recognized virus database and more than , viral genomic sequences of hcov- have been shared via gisaid since the start of the covid- outbreak. sars-cov- neutralization assay. the % tissue culture infectious dose (tcid ) assay was performed for sars-cov- in vero cells. briefly, cells were seeded h before infection in a -well plate at a density of × cells/well. viruses were serially diluted at : dilution. after h of incubation, the media were removed, and cells were fixed and stained with crystal violet. the tcid /ml titer was determined. for the antibody neutralization assay, vero cells were seeded in -well plates day prior to infection. serially diluted sdabs were mixed with sars-cov- at tcid per well and incubated at °c for h. the antibody-virus mixture was incubated on vero cells at °c for h. unbound sars-cov- virions were removed by washing the cells with fresh medium, and the cells were then incubated for h at °c. the culture supernatants were collected for viral nucleic acid quantification. viral rna quantification was carried out by taqman real-time rt-pcr as reported, with standard curves plotted using in vitro transcribed rna. briefly, the viral rna was isolated using trizol ls reagent (invitrogen, carlsbad, ca) according to the manufacturer's protocol. rna was extracted from µl of culture supernatant and eluted in µl of dnase/rnase-free water. the viral nucleocapsid gene-based quantification assay was developed using the taqman production of human igg fc fusion sdabs. the sequences of selected sdabs were cloned into a mammalian expression vector under the control of the hef -htlv promoter and fused with an n-terminal interleukin- signal peptide and a c-terminal fc region comprising the ch and ch domains of the human igg heavy chain and the hinge region.
maxiprepped plasmids were transiently transfected into -f cells (thermofisher) and the cells were further cultured in suspension for days before harvesting antibody-containing supernatant. fc-fused sdabs were prepared with prepacked hitrap® protein a hp column (ge healthcare). the produced fc-fusion protein was analyzed by sds-page and the western blot using standard protocols for dimerization, yield and purity measurement. the primary antibody used for western blot was a horseradish peroxidase conjugated goat anti-human igg (sigma-aldrich, st. louis, mo, usa).
fig. inhibition of sars-cov- entry by fc-fused sdabs. a representation of the human igg fc-fused sdabs in this study. sdab-fc fusion construction generates a bivalent molecule with an approximate molecular weight of kda. b homology modeling of the bivalent f -fc molecule with swiss-model server (https://swissmodel.expasy.org). the template structure for f modeling was based on a humanized camelid sdab in the pdb database ( eak). the structure is depicted as cartoons and colored with secondary structure. three cdrs, hinge region and fc were indicated. c five fc-fused sdabs were analyzed by western blot with gradient sds-page in reducing (with β-mercaptoethanol) or nonreducing (without β-mercaptoethanol) condition. d summary of ec value of fc-fused sdabs neutralization against sars-cov- pp. ec fold increases versus the corresponding monovalent sdabs were calculated.
immunofluorescence microscopy and western blot. cultured t cells on coverslips were transfected with either sars-cov- s expression plasmid or empty vector for h and then fixed using % paraformaldehyde for min at room temperature, permeabilized with . % triton x- (sigma-aldrich) in pbs for min. the cells were then incubated with each sdab overnight at °c. after three washes with pbs, the cells were incubated with alexa fluor -conjugated x-his tag monoclonal antibody (his.h ) (thermofisher, ma - -a , : ) for h at room temperature.
the nuclei were stained with dapi ( : , ) diluted in pbs for min and mounted with an antifade reagent (thermofisher). images were acquired with a leica tcs sp confocal microscope system. for western blot, t cells in a -well plate were transfected with sars-cov- s, sars-cov- s or empty vector individually. twenty-four h post transfection, cell lysates were prepared, and the samples were boiled with × sds loading buffer and loaded onto a % polyacrylamide gel. after electrophoresis, the separated proteins were transferred onto a nitrocellulose membrane (bio-rad, hercules, ca, usa). the resulting blots were probed with an sdab as the primary antibody and an hrp-linked x-his tag antibody (thermofisher, his.h , ma - -hrp, : ) as the secondary antibody. the antibody against β-actin was from sigma-aldrich (a , : ). the ecl reagent (amersham biosciences, piscataway, nj, usa) was used as the substrate for detection.
statistics and reproducibility. data were analyzed using graphpad prism . (graphpad software, san diego, ca, usa). the values shown in the graphs are presented as means ± sd. one representative result from at least two independent experiments is shown. antibody neutralization experiments typically used two to four replicate wells for each treatment. for the sars-cov- pp entry assay and sars-cov- infection, the infectivity data were first converted to neutralization activity. each neutralization data set was normalized to the background control (no virus) to define the value for % neutralization. after transformation to neutralization, the lowest concentration point of antibody treatment was set to % neutralization. then, a -parameter nonlinear regression model was fitted to report ec values. all experiments were performed independently at least twice and similar results were obtained.
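the normalization and curve-fitting steps described above can be sketched in python as follows. this is a minimal illustration and not the authors' graphpad prism workflow. it assumes that the no-virus background defines 100% neutralization, that the lowest antibody concentration defines 0%, and that the model has the common four-parameter logistic form. for simplicity it fixes the top and bottom plateaus and finds ec50 and the hill slope by a coarse grid search instead of a full nonlinear least-squares fit. all function names are hypothetical.

```python
import math


def to_neutralization(signals, background):
    """Convert infectivity readouts to percent neutralization.

    signals: readouts ordered from lowest to highest antibody concentration.
    background: readout of the no-virus control (assumed to be 100% neutralization).
    The first entry of `signals` is used as the 0% reference (assumption).
    """
    zero_ref = signals[0]
    span = zero_ref - background
    if span == 0:
        raise ValueError("reference and background readouts are identical")
    return [100.0 * (zero_ref - s) / span for s in signals]


def four_pl(conc, bottom, top, ec50, hill):
    """Four-parameter logistic curve; conc and ec50 must be positive."""
    return bottom + (top - bottom) / (1.0 + (ec50 / conc) ** hill)


def fit_ec50(concs, neut, bottom=0.0, top=100.0):
    """Grid search for (ec50, hill) with fixed plateaus, minimizing squared error."""
    lo = math.log10(min(concs)) - 1.0
    hi = math.log10(max(concs)) + 1.0
    best = None
    for i in range(401):  # ec50 candidates, evenly spaced on a log10 scale
        ec50 = 10 ** (lo + (hi - lo) * i / 400)
        for j in range(1, 61):  # hill slope candidates from 0.1 to 6.0
            hill = j * 0.1
            sse = sum(
                (four_pl(c, bottom, top, ec50, hill) - y) ** 2
                for c, y in zip(concs, neut)
            )
            if best is None or sse < best[0]:
                best = (sse, ec50, hill)
    return best[1], best[2]
```

in practice a dedicated least-squares routine would replace the grid search, and all four parameters would usually be left free when the data cover both plateaus.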
reporting summary. further information on research design is available in the nature research reporting summary linked to this article.
the sequences of sdabs have been deposited in genbank with the accession numbers mt -mt . the isolated sars-cov- full-length sequence was deposited in the gisaid database with accession id epi_isl_ , which is identical to genbank accession number mn . all other data are available from the corresponding author upon reasonable request. source data are provided with this paper.
received: march ; accepted: august ;
this work was supported by cams initiative for innovative medicine grant -i m- - and -i m- - .
w.y., x.c., l.r., q.j., and j.w. designed experiments and interpreted the data. w.y., x.c., x.l., c.w., x.l., j.h., and x.z. performed experiments and analyzed the data. w.y. conceived the study, supervised the work, and wrote the paper. all authors read and approved the final manuscript.
a patent application has been filed on march on single domain antibodies targeting sars-cov- (china patent application no. . ).
supplementary information is available for this paper at https://doi.org/ . /s - - - .
correspondence and requests for materials should be addressed to q.j., j.w. or w.y.
peer review information nature communications thanks the anonymous reviewers for their contribution to the peer review of this work.
peer reviewer reports are available.
reprints and permission information is available at http://www.nature.com/reprints
publisher's note springer nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
open access this article is licensed under a creative commons attribution . international license, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the creative commons license, and indicate if changes were made. the images or other third party material in this article are included in the article's creative commons license, unless indicated otherwise in a credit line to the material. if material is not included in the article's creative commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. to view a copy of this license, visit http://creativecommons.org/ licenses/by/ . /.
key: cord- -cy vyb j authors: ripperger, tyler j.; uhrlaub, jennifer l.; watanabe, makiko; wong, rachel; castaneda, yvonne; pizzato, hannah a.; thompson, mallory r.; bradshaw, christine; weinkauf, craig c.; bime, christian; erickson, heidi l.; knox, kenneth; bixby, billie; parthasarathy, sairam; chaudhary, sachin; natt, bhupinder; cristan, elaine; el aini, tammer; rischard, franz; campion, janet; chopra, madhav; insel, michael; sam, afshin; knepler, james l.; capaldi, andrew p.; spier, catherine m.; dake, michael d.; edwards, taylor; kaplan, matthew e.; scott, serena jain; hypes, cameron; mosier, jarrod; harris, david t.; lafleur, bonnie j.; sprissler, ryan; nikolich-Žugich, janko; bhattacharya, deepta title: orthogonal sars-cov- serological assays enable surveillance of low prevalence communities and reveal durable humoral immunity. date: - - journal: immunity doi: . /j.immuni. . . 
sha: doc_id: cord_uid: cy vyb j we conducted a serological study to define correlates of immunity against sars-cov- . relative to mild covid- cases, individuals with severe disease exhibited elevated virus-neutralizing titers and antibodies against nucleocapsid (n) and the receptor binding domain (rbd) of spike protein. age and sex played lesser roles. all cases, including asymptomatic individuals, seroconverted by weeks post-pcr confirmation. spike rbd and s and neutralizing antibodies remained detectable through - months post-onset, whereas α-n titers diminished. testing of members of the local community revealed only sample with seroreactivity to both rbd and s that lacked neutralizing antibodies. this fidelity could not be achieved with either rbd or s alone. thus, inclusion of multiple independent assays improved the accuracy of antibody tests in low seroprevalence communities and revealed differences in antibody kinetics depending on the antigen. we conclude that neutralizing antibodies are stably produced for at least - months after sars-cov- infection. reduction neutralization test (prnt) titers, which we quantified as the final dilution at which % viral neutralization occurred (prnt ) ( figure a) . rbd to determine if rbd was capable of distinguishing between sars-cov- exposed and uninfected individuals and to set preliminary thresholds for positive calls, we initially tested : serum dilutions of samples from pcr+ sars-cov- infected individuals and samples collected prior to september, , well before the onset of the current pandemic ( figure s d) . using this test data set, we established a preliminary positive cutoff od value of . , equal to standard deviations above the mean values of the negative controls. we next used this preliminary threshold to test an expanded cohort of negative control samples collected prior to . ( figure b ). reactivity to rbd was clearly distinguishable for the majority of positive samples from negative controls ( figure b) . 
however, . % of the expanded negative control group displayed rbd reactivity that overlapped with pcr+ individuals (figure b, blue shade) , some of whom may have been early into disease and had not yet generated high levels of antibodies. to quantify the sensitivity of the assay relative to time of diagnosis, we measured antibody levels to rbd and plotted these values against time following sars-cov- pcr+ confirmation. whereas the sensitivity was modest within the first two weeks, after weeks, of samples showed high elisa signal ( figure c ). based on these data, samples were considered seropositive at od numbers above . , a value slightly above the highest od obtained from the subjects in the negative control group (figure b) . sera were considered negative at od values below . . finally, we created an indeterminate call at od values between . - . , as we observed some overlap between negative controls and pcr- confirmed samples in this range (figure b, blue shade) . we next applied this assay to community testing and obtained serum samples from to improve the positive predictive value, we considered the use of an orthogonal antigenically distinct test. previous studies have used full length s protein as a secondary screen following rbd elisas (amanat et al., ). while this improves the sensitivity of the assay and is perfectly reasonable in high seroprevalence communities such as new york city, rbd is part of s and is not antigenically distinct. thus, a false positive for rbd would presumably also be apparent in s elisas. we therefore first tested nucleocapsid (n) protein, as several other commercial serological tests quantify antibodies to this antigen (bryan et al., ; burbelo et al., ) . igg antibody titers to n protein in our collected sample cohort showed a strong correlation to prnt titers (figure a) . a weaker correlation was observed between n- reactive igm levels and prnt titers ( figure s a) . 
we next assayed reactivity to n antigen overlapped substantially between negative and positive controls ( figure b) . moreover, confirmed covid- samples showed very weak reactivity to n ( figure b) . because of the relatively poor performance of n protein as an antigen in our hands, we next tested the s domain of s protein as another candidate to determine seropositivity. rbd is located on the s domain, rendering s antigenically distinct (bosch et al., ; li, ; wrapp et al., ) . igg antibody titers to s correlated well with prnt titers ( figure c) , consistent with reports of s -specific neutralizing antibodies to sars-cov- and sars-cov- (duan et al., ; song et al., ). assessment of s serum reactivity in the pre- cohort revealed that approximately . % of these samples overlapped with signals in pcr-confirmed covid- samples ( figure d) . we thereafter employed a threshold of od > . , as our cutoff for s positivity, which was standard deviations above the average seroreactivity from the original -samples from the negative control cohort. specificity control testing using negative control sera showed that reactivities of negative samples against rbd and s were largely independent of one another, as samples with high signal for one antigen rarely showed similar background for the other (figure e ). based on these data, we chose to rely on combined rbd and s -reactivities as accurate indicators of prior sars-cov- exposure. with this improved combinatorial rbd and s assay to exclude false positives, we re- examined the original samples from the cohort of subjects that displayed rbd od values greater than . ( figure d-e) . of the non-neutralizing samples that displayed high (od > . ) rbd reactivity, lacked s reactivity ( figure f ). in contrast, the remaining rbd+ neutralizing samples all displayed substantial reactivity to s ( figure f) . five of the samples that fell below the rbd cutoff, yet still neutralized virus, displayed strong reactivity to s ( figure f ). 
based on these data, we established a scoring criterion of rbd od > . , s and all other samples as seronegative. applying these criteria to samples obtained prior to would lead to negative, indeterminate, and positive calls. using these same (atyeo et al., ). we therefore examined our data for these trends. first, in our pcr- confirmed cohort, we plotted igg titers relative to the time of disease onset, stratified by disease severity. severe disease (hospital admission) correlated with significantly higher antibody titers against rbd and n than those with mild disease, who were symptomatic but did not require hospital admission, whereas s titers were not statistically significantly different (figure a-c) . neutralizing titers were also higher in those with severe disease relative to mild cases ( figure d). through campus screening efforts, we also identified pcr+ individuals who either never developed symptoms or had only a brief and mild headache or anosmia. although previous reports suggested that such individuals may infrequently seroconvert or frequently serorevert ko et al., ) . given that older adults, as well as those of male sex, exhibit disproportional morbidity and mortality from covid- , we also sought to test whether humoral immunity in these subjects may be quantitatively reduced . contrary to this expectation, we did not observe any adverse impact of advanced age on humoral immunity (figure e-h) . before settling to a more stable nadir at later timepoints, as would be expected for all acute viral infections. we considered the possibility that we may have missed subjects that had seroreverted prior to their antibody test, thereby incorrectly raising our estimates of the durability of antibody production. therefore, to examine the duration of igg production in more depth, a subset of seropositive individuals with relatively low titers was tested longitudinally up to days post-onset. 
these data again revealed stable rbd and s igg levels at later stages of convalescence (figures a-b) . however, n-reactive igg levels were quite variable and most samples approached the lower limit of detection at later timepoints ( figure c) . a direct comparison in matched subjects of the changes in rbd, s , and n igg titers over time confirmed the variability in n responses and rapid decline in a subset of individuals ( figure d ). of time in all but one subject ( figure e) , which showed evidence of neutralizing antibodies that did not quite reach a prnt titer of ( figure s ) . these data suggest persistent neutralizing, rbd, and s -specific antibodies, but variable and often declining n-reactive titers during convalescence. together, these data are consistent with the maintenance of functionally important antibody production for at least several months after infection, and caution against the use of α-n antibodies to estimate immunity or seroprevalence. here, we demonstrated that using two antigenically distinct serological tests can greatly remedy specificity problems that are exacerbated in low sars-cov- seroprevalence communities. rbd and s seroreactivity behaved independently for sars-cov- -unexposed individuals, thereby suggesting that the theoretical false positive rate of the overall assay is the product of the two tests. using neutralization assays to confirm these results, we found our empirically determined false positive rate to be < . % ( / ), consistent with the independence of the rbd and s tests. the tight co-incidence between rbd/s positivity and the presence of neutralizing antibodies, even in low seroprevalence populations, is especially valuable for identifying individuals who likely have some degree of immunity. surprisingly, nucleocapsid (n), which is used by several commercial serological tests as an antigen, did not perform as well in our assays, with high false positive and negative rates. 
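the argument above, that requiring positivity on two independent tests multiplies their false positive rates and that this matters most at low seroprevalence, can be made concrete with a short python sketch. the numbers used in the usage note are purely hypothetical and are not taken from the study, and the function names are illustrative only.

```python
def combined_false_positive_rate(fpr_a, fpr_b):
    """False positive rate when a sample must be positive on BOTH tests.

    Valid only if the two tests err independently in unexposed subjects,
    which is the assumption stated in the text.
    """
    return fpr_a * fpr_b


def positive_predictive_value(sensitivity, specificity, prevalence):
    """Fraction of positive calls that are true positives (Bayes' rule)."""
    true_pos = sensitivity * prevalence
    false_pos = (1.0 - specificity) * (1.0 - prevalence)
    return true_pos / (true_pos + false_pos)
```

for example, with a hypothetical false positive rate of 2% for each single-antigen test, the combined rate is 0.04%. at a hypothetical seroprevalence of 1% and a sensitivity held at 95% for simplicity, a single test with 98% specificity gives a positive predictive value of about 32%, whereas a specificity of 99.96% raises it to about 96%.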
the reasons for the differences in antibody responses across antigens are difficult to explain, given the identical inflammatory environment in which these responses arose. one possibility is that the avidities of germline precursors differ for n- and s-protein specificities. for both memory and plasma cells, there appears to be a 'sweet spot' of antigen avidity that
taken together, we have reported a highly specific serological assay for sars-cov- exposure that is usable in very low seroprevalence communities, and that returns positive results that are highly co-incident with virus neutralization. using this assay, we characterize the responses in different subject populations by age, sex and disease severity, we demonstrate that antibody production persists for at least months, and we suggest explanations for some reports that concluded otherwise.
limitations of current study: one caveat to our study is that in our community testing cohort we may have missed individuals who were seropositive initially but then seroreverted by the time of the antibody test. second, the latest timepoint post-disease onset in our study is days. it remains possible that antibody titers will wane substantially at later times. additional serial sampling of pcr-confirmed mild cases will be required to test these possibilities. another
the graphical abstract for this study was created on biorender.com.
further information and requests for resources and reagents should be directed to and will be fulfilled by the lead contact, deepta bhattacharya (deeptab@arizona.edu). this study did not generate new unique reagents. the data generated in this study and corresponding analyses have been described in main and table , as well as below in the text.
subjects were recruited in three ways. 
first, targeted recruitment was used to recruit subjects with a confirmed positive covid- pcr test and severe covid- , defined as disease that required hospitalization at the banner-university medical center. second, targeted recruitment was used to recruit subjects with a confirmed positive covid- pcr test who did not require hospitalization (mild/moderate covid- cases). aliquoting, serum was used for the elisa assay with or without freezing and thawing as described below. finally, sera from subjects recruited into the above two irb protocols prior to september, , served as negative controls for assay development. based on local and general prevalence, it would be expected that - % of these subjects have previously encountered seasonal coronaviruses (gorse et al., ). freezing and thawing had no effect on levels of antibodies detected by elisa or prnt.
lenti-x™ t cells (takara bio usa) were grown at ºc, % co in high glucose dmem supplemented with % fetal bovine serum, non-essential amino acids, penicillin/streptomycin, glutamine, and sodium pyruvate. the serum/plasma dilution that contained or fewer plaques was designated as the nt titer.
statistical analyses were performed in graphpad prism (v ) and microsoft excel (v . ). the threshold for indeterminate seropositivity to rbd was calculated as standard deviations above the average od value of the pre-pandemic negative control group. rbd seropositivity was established with an indeterminate range from an od value standard deviations above the mean od value of the negative control cohort (od = . ) to an od slightly above the highest od value observed in the negative control cohort (od = . ). readings above od = . were considered seropositive. the seropositive threshold to s was determined by calculating the od value standard deviations above the average od (od = . ) of the pre-pandemic negative control cohort. 
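the cutoff rules described above can be sketched in python as follows. this is an illustration under stated assumptions and not the authors' analysis: the multiplier `k` and all od values are placeholders because the actual numbers are not given here, the sample standard deviation is assumed, and the function names are hypothetical.

```python
from statistics import mean, stdev


def cutoff_from_negatives(negative_ods, k):
    """Cutoff defined as mean + k * SD of the negative-control OD values.

    k is a placeholder: the multiplier used in the study is not stated here.
    Uses the sample standard deviation (an assumption).
    """
    return mean(negative_ods) + k * stdev(negative_ods)


def call_rbd(od, lower, upper):
    """Three-way RBD call with an indeterminate band between two cutoffs.

    lower: mean + k * SD cutoff from the negative controls.
    upper: value slightly above the highest negative-control OD.
    """
    if od > upper:
        return "positive"
    if od >= lower:
        return "indeterminate"
    return "negative"
```

a typical use would compute `lower` once from the pre-pandemic cohort with `cutoff_from_negatives`, choose `upper` from the highest negative-control reading, and then apply `call_rbd` to each sample.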
correlation r values between antibody titers and neutralizing titers were determined using a pearson correlation. p values to compare non-linear regression fits of antibody and neutralization titers over time grouped by disease severity, patient age, and patient sex were calculated in graphpad prism. the null hypothesis was that a single curve fits all subject groups, which was rejected with less than % confidence. loess smoothing splines were generated in graphpad prism. pseudo-r values were calculated using the squared correlation between the predicted outcomes and the actual outcomes from the fitted model (efron, ).
the coronavirus spike protein is a class i virus fusion protein: structural and functional characterization of the performance characteristics of the abbott architect detection of nucleocapsid antibody to sars-cov- is more sensitive than antibody to spike protein in covid- patients the time course of the immune response to experimental coronavirus infection of man disappearance of antibodies to sars-associated coronavirus after recovery information for laboratories about coronavirus (covid- ) early release -antibody responses to sars-cov- at weeks a human sars-cov neutralizing antibody against epitope on s protein regression and anova with zero-one data: measures of residual variation retroviral vectors pseudotyped with severe acute respiratory syndrome coronavirus s protein prevalence of antibodies to four human coronaviruses is lower in nasal secretions than in serum complete mapping of mutations to the sars-cov- spike receptor-binding domain that escape antibody recognition immune response to sars-cov- in iceland persistence of igg antibodies in sars-cov infected healthcare workers serology detection of igm and igg antibodies in patients with coronavirus disease rapid decay of anti antibodies in persons with mild covid- evidence for sustained mucosal and systemic antibody responses to sars-cov- antigens in covid- patients dynamics and 
significance of the antibody response to sars-cov- infection zbtb restricts the duration of memory b cell recall responses murine cytomegalovirus infections, but not other repetitive challenges furin cleavage site is key to sars neutralizing antibody production in asymptomatic and mild comparison with pneumonic covid- patients findings from investigation and analysis of re- positive cases identification and characterization of the constituent human serum antibodies elicited by vaccination structure, function, and evolution of coronavirus spike proteins clinical features of covid- in elderly patients: a comparison with young and middle-aged patients clinical and immunological assessment of asymptomatic sars-cov- infections antibody responses to sars-cov- in patients with covid- lifetime of plasma cells in the bone marrow ene-covid): a nationwide, population-based seroepidemiological study. the lancet the receptor-binding domain of the viral spike protein is an immunodominant and highly specific target of antibodies in sars-cov- patients memory b cells, but not long-lived plasma cells, possess antigen specificities for viral escape mutants igm antibodies against severe acute respiratory syndrome clinical infectious diseases the behaviour of recent isolates of human respiratory coronavirus in vitro and in volunteers: evidence of heterogeneity among e-related strains cumulative incidence and diagnosis of sars-cov- infection in new york measuring sars-cov- neutralizing antibody activity using pseudotyped and chimeric viruses robust t cell immunity in convalescent individuals with asymptomatic or mild covid- longitudinal evaluation and decline of antibody responses in sars-cov- infection humoral immunity due to long- the extent of affinity maturation differs between the memory and antibody-forming cell compartments in the primary immune response cross-reactive serum and memory b cell responses to spike protein in sars-cov- and endemic coronavirus infection structural 
genomics of sars-cov- indicates evolutionary conserved functional regions of viral proteins seroconversion of a city: longitudinal monitoring of sars-cov- seroprevalence in new york city. medrxiv . . hospital-wide sars-cov- antibody screening in staff in a tertiary center iga dominates the early neutralizing antibody response to sars-cov- serocov-pop): a population-based study. the lancet intrinsic constraint on plasmablast growth and extrinsic limits of plasma cell survival a sars-cov- surrogate virus neutralization test based on antibody-mediated blockage of ace -spike protein-protein interaction covid- re-infection by a phylogenetically distinct sars-coronavirus- strain confirmed by whole genome sequencing repeated in vivo stimulation of t and b cell responses in old mice generates protective immunity against lethal west nile virus encephalitis sars-cov- infection induces robust, neutralizing antibody responses that are stable for at least three months divergent transcriptional programming of class-specific b cell memory by escape from neutralizing antibodies by sars-cov- spike protein variants the emergence of sars-cov- in europe cryo-em structure of the -ncov spike in the prefusion conformation viral rna level, serum antibody responses, and transmission risk in discharged covid- patients with recurrent positive sars-cov- rna test results: a population-based observational cohort study protective 'immunity' by pre-existent neutralizing antibody titers and preactivated t cells but not by so-called 'immunological memory cd and pd-l define functionally distinct memory b cell subsets that are independent of antibody isotype key: cord- - jwma y authors: xiu, siyu; dick, alexej; ju, han; mirzaie, sako; abdi, fatemeh; cocklin, simon; zhan, peng; liu, xinyong title: inhibitors of sars-cov- entry: current and future opportunities date: - - journal: j med chem doi: . /acs.jmedchem. 
c sha: doc_id: cord_uid: jwma y [image: see text] recently, a novel coronavirus initially designated -ncov but now termed sars-cov- has emerged and raised global concerns due to its virulence. sars-cov- is the etiological agent of “coronavirus disease ”, abbreviated to covid- , which despite only being identified at the very end of , has now been classified as a pandemic by the world health organization (who). at this time, no specific prophylactic or postexposure therapy for covid- are currently available. viral entry is the first step in the sars-cov- lifecycle and is mediated by the trimeric spike protein. being the first stage in infection, entry of sars-cov- into host cells is an extremely attractive therapeutic intervention point. within this review, we highlight therapeutic intervention strategies for anti-sars-cov, mers-cov, and other coronaviruses and speculate upon future directions for sars-cov- entry inhibitor designs. coronaviruses (covs) are enveloped positive-stranded rna viruses. they belong to the order of nidovirales and are classified into four genera: α, β, γ, and δ. coronaviruses are animal viruses with circulating reservoirs in mammals and birds. for most coronaviruses, the lifecycle can be dissected into four steps, including viral entry, replication, assembly, and release. until last year, six strains of coronaviruses have been identified that are pathogenic to humans. among them are cov-nl , cov-oc , cov-hku , and cov- e that could cause mild respiratory tract diseases. however, two of the β-covs, the severe acute respiratory syndrome coronavirus (sars-cov), and the middle east respiratory syndrome coronavirus (mers-cov) have caused severe epidemics in the past. , in april , sars-cov was responsible for infections, with a fatality rate of ∼ % by the end of september . mers-cov emerged from its zoonotic reservoir in and infected people with a fatality rate of ∼ % by the end of . 
the high fatality rates of both outbreaks highlight the need for surveillance of coronavirus emergence. while efforts to develop antivirals against sars-cov or mers-cov are still in progress, a new coronavirus (sars-cov- ) has emerged from an epicenter located in wuhan, china, in december . sars-cov- is highly contagious and has quickly spread in and beyond china. as of may , , there have been more than diagnosed cases around the world, with confirmed deaths (figure ). the united states of america and brazil have reported the majority of the confirmed cases in the americas, with and cases, respectively. recently, the genome of sars-cov- was determined, revealing % identity with that of some sars-cov strains (gz , bj , tor , sz , pc - ) and, interestingly, % identity to the bat coronavirus batcov ratg . the receptor-binding spike (s) protein is highly divergent from other covs and displays nucleotide sequence identities of % or less to all other previously described sars-covs. however, again, the new sars-cov- s protein shares . % identity to the ratg s protein. the glycoprotein or s protein is responsible for receptor recognition and viral entry into host cells. the spike protein can be divided into two domains; s is responsible for recognition of angiotensin-converting enzyme ii (ace ), the recently identified host cell receptor, and s mediates membrane fusion (figure ). structural alignment of sars-cov- s protein with sars-cov s protein shows that both s proteins are similar, with a root-mean-square deviation (rmsd) of . Å over cα atoms, while the s domain, responsible for membrane fusion, displays the most substantial similarity with an rmsd of . Å (figure c). engagement of the host cell receptor ace is important for viral entry; however, subsequent entry steps can vary and are cell-type specific. 
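for readers unfamiliar with the rmsd values quoted above, the quantity is the square root of the mean squared distance between paired atoms. the python sketch below illustrates the calculation only. it assumes that the two structures have already been superposed and that the cα atoms are listed in matching order, it does not perform the structural alignment itself, and the function name is hypothetical.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two paired sets of 3D coordinates.

    coords_a, coords_b: equal-length sequences of (x, y, z) tuples, in angstroms,
    already superposed and listed in matching atom order (assumption).
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(coords_a))
```

a smaller rmsd over the same set of atoms indicates closer structural agreement, which is the sense in which the membrane fusion domain is described as the most similar region.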
sars-cov can enter the host cell via both clathrin (endosomal) and nonclathrin (nonendosomal) pathways; however, both pathways are dependent upon ace binding. the clathrin-mediated pathway includes the s protein binding to ace and subsequent dynamin/clathrin-mediated internalization of endosomal vesicles that mature to late endosomes. within the late endosomes and lysosomes, acidification of the internalized endosomes and h+-dependent activation of the cellular cathepsin l proteinase take place; cathepsin l cleaves and activates the s protein, thereby initiating viral fusion with the endosomal/lysosomal membrane (figure ). in the case of sars-cov, cell culture studies revealed that the entry process is delayed with a lag phase of around min, suggesting substantial maturation requirements. in accordance with findings that mouse hepatitis coronavirus (mhv) and feline coronavirus (fcv) infections of hela cells are also heavily dependent on endosomal maturation, clathrin-dependent entry and endosomal maturation are key to entry across coronaviridae. for sars-cov- , a recent study also confirms that the virus can use the host cell receptor cd , in addition to ace , to gain entry into host cells. in addition to the endosome-mediated entry pathway, host proteases also play critical roles in the nonendosomal entry of coronaviruses. host proteases such as the transmembrane protease serine (tmprss ) and tmprss d can cleave the s protein at the s /s cleavage site (figure ) to prime and activate the s protein for membrane fusion during the nonendosomal pathway. a recent study also confirms that tmprss -expressing veroe cells are highly susceptible to sars-cov- infection, highlighting the importance of tmprss in the replication cycle. mers-cov can also be activated by furin (serine endoprotease) to initiate the nonclathrin-mediated membrane fusion event.
figure . entry model of sars-cov- into the host cell. binding of the s domain within the spike (s) protein to the cellular ace receptor triggers conformational changes in the s domain that result in internalization and subsequent membrane fusion ((a) endosomal/clathrin-dependent pathway). the endosomal pathway is facilitated by a low ph and the ph-dependent cysteine protease cathepsin l. alternatively, sars-cov- can enter the cell via the nonendosomal/clathrin-independent pathway (b). during this route, ace recognition by the sars-cov- s protein (comparable to route a) is followed by additional activation/cleavage of the s protein into s and s domains by cell membrane-associated serine proteases such as tmprss and tmprss d. the figure was prepared with https://biorender.com/.
figure . genome organization of sars-cov- . genome organization of sars-cov- and location of the central genes within the genome (numbers in brackets). the figure was prepared with https://biorender.com.
interestingly, in the new sars-cov- s protein, additional amino acid insertions at the s /s cleavage site result in an "rrar" furin recognition site absent in sars-cov s protein. this polybasic insertion sequence has possible implications for the sars-cov- replication cycle and its increased pathogenicity. indeed, polybasic furin sites have been observed in hemagglutinin (ha) proteins of highly virulent avian and human influenza viruses, and similar furin-like processing events are also observed for other rna viruses such as ebola virus and marburg virus, human immunodeficiency virus (hiv), and flaviviruses. to activate the s protein for membrane fusion with the cellular membrane, structural rearrangements within the s domain are required. two heptad repeats, hr (dark blue in figure ) and hr , can interact to form a six-helix bundle ( -hb), a common postfusion structure shared by all type i viral glycoproteins, to bring viral and cellular membranes into close proximity. 
additionally, the s domain contains a membrane-interacting domain or fusion peptide that is exposed upon specific triggers such as receptor binding or low endosomal ph. to date, three membrane-interacting regions with host-membrane destabilizing effects have been identified in the sars-cov s protein: two sequences conserved across coronaviridae, with residues − and residues − , both c-terminal positioned at the second cleavage site in the s protein termed s ′ at arg , and a less conserved third region with membrane-disordering properties (residues − ). once in the host cell, the viral particle uncoats and is ready for transcription and translation. the first orf codes for approximately % of the genome and is separated into open reading frames (orf) a and b (figure ). orf a and orf b are translated into polyproteins pp a ( amino acids) and pp ab ( amino acids) that are processed by -c-like protease ( clpro) and papain-like protease (plpro). the processing of these polyproteins produces a variety of nonstructural proteins (nsps), including rna-dependent rna polymerase (rdrp) and helicase, to catalyze viral genome replication and protein synthesis. the remaining orfs in the sars-cov- genome code for accessory and structural proteins. following further assembly, the mature virions are transported to the cell surface in vesicles and released by exocytosis. any protein involved in the replication process could be a potential target for the development of antiviral agents. as mentioned previously, zhang et al. determined the full-length genome sequence of sars-cov- and revealed that the virus was very similar ( . % nucleotide similarity) to a group of sars-like coronaviruses. simultaneously, shi et al. found that sars-cov- shares % sequence identity at the whole-genome level with a bat coronavirus, and importantly, they confirmed that sars-cov- utilizes the same cell entry receptor, ace , as sars-cov.
recently, the cryo-em structure of full-length human ace bound to the rbd of sars-cov- was solved, providing an important structural foundation for intervention strategies. conservation analysis also revealed that the rdrp and the clpro are highly conserved between sars-cov- and sars-cov. therefore, it is widely accepted that sars-cov- would behave similarly to sars-cov with regard to viral entry and replication. being the first step in the infection process, the entry of pathogenic viruses into susceptible cells is an extremely attractive intervention point. as with other well-known viruses, such as hiv- and ebola, viral entry of coronaviruses is a complex multistep process with numerous interactions and processing points that, in theory, could be targeted. in this review, we summarize case studies and highlight efforts in designing entry inhibitors against sars-cov, mers-cov, and other coronaviruses that can provide important information to combat the current sars-cov- outbreak.
■ host cell ace receptor recognition by the sars-cov- spike (s) as a promising antiviral target
binding of the sars-cov- spike (s) protein to the cellular ace receptor represents the first encounter (in both the endosomal and nonendosomal pathways) in the viral replication cycle and provides prophylactic intervention opportunities. the sars-cov- spike (s) protein recognizes the cellular ace receptor through its rbd with high affinity (k d = . nm), as judged by surface plasmon resonance (spr) interaction analysis, and intervention at the rbd-ace interface can potentially disrupt infection efficiency. recently, the cryo-em and crystal structures of the sars-cov- rbd in complex with ace were solved and provide important structural guidance for inhibitor design ( figure ). the interface can be divided into three contact sides, mainly polar in nature, and is similar to the sars-cov-ace complex.
in this structure, an extended loop of the rbd contacts an arch-like helix α of the proteolytic domain (pd) of ace via an n-terminal (cluster ), central (cluster ), and c-terminal (cluster ) portion ( figure purple box). additionally, helix α and loop − (connecting β and β ) of ace provide limited contacts. at the n terminus of α (cluster ), gln , thr , and asn of the rbd interact via hydrogen bonds with tyr , gln , lys , and arg from ace . the middle portion (cluster ) of the rbd loop contacts, via tyr , the ace pd at residue his . at the c terminus of α (cluster ), gln of the rbd contacts gln of ace , and phe of the rbd interacts with met of ace through van der waals interactions ( figure ). the structures of the rbds from the sars-cov- -ace complex and the sars-cov-ace complex are quite similar, with an rmsd of . Å over cα atoms ( figure ). a comparison of both structures, however, also highlights some deviations at all three clusters, summarized in table . these deviations need to be considered carefully during the inhibitor design process.
in addition, helix α and the loop − connecting β and β also contribute to the interface. sars-cov- s protein monomer was obtained from pdb vsb and rbd-ace complex from pdb vw . boxes , , and highlight polar clusters , , and , respectively.
■ targeting the rbd
peptide analogues, monoclonal antibodies, and protein chimeras as rbd inhibitors. both sars-cov and sars-cov- use ace to gain entry into the host cells. as such, this critical interaction can be blocked to stop viral entry. this strategy was first demonstrated by hsiang et al. using a biotinylated enzyme-linked immunosorbent assay (elisa), they reported the disruption of the sars-cov s protein-ace interaction by small peptides. from a total of designed peptides, peptides sp- , sp- , and sp- ( figure and table ) significantly blocked the interaction of the sars-cov s protein with ace , with ic values of . , . , and . nm, respectively.
additional immunofluorescence assay (ifa) studies with s-protein-pseudotyped retroviruses revealed a novel mechanism of infection inhibition of vero e cells by sp- . in light of the successful inhibition of sars-cov with this linked peptide, a similar strategy could potentially be effective against the new sars-cov- . the recently solved cryo-em structure of sars-cov- in complex with the human ace receptor can provide a structural rationale for the peptide design. monoclonal antibodies (mabs) have potential applications for diagnosis, prophylaxis, and treatment of established and evolving viral infections. prabhakar et al. isolated specific antibodies from b cells in xenomouse immunized with sars-cov. further investigation revealed that several abs directly react with the rbd, and a combination of two abs ( d and c ) displayed near-complete neutralization efficiency as compared to a single ab application. two additional potent monoclonal antibodies, mab and mab , could be isolated from transgenic mice immunized with the soluble ectodomain of the sars-cov s protein. these mabs could bind the sars-cov s protein directly with affinities of nm (mab ) and nm (mab ), as judged by spr analysis. mice that received mg/kg of mab or mab before sars-cov infection showed complete protection from reinfection of lung tissues. cross-reactivity of mabs is highly desirable, and dimitrov et al. identified the human mab m that binds sars-cov with high affinity (k d = nm). mice that received μg of m were nearly completely protected from infection by urbani and gd virus strains. m competed with the sars-cov receptor, ace , for binding to the rbd, suggesting that m inhibits sars-cov-ace binding as the predominant mechanism of action. however, sars-cov- poses some complications for rbd-directed antibodies. for instance, wrapp et al. tested the cross-reactivity of three antibodies, including s , m , and r, against the sars-cov- rbd.
despite the considerable structural homology between sars-cov- and sars-cov, no binding to the sars-cov- rbd was detected for any of the three antibodies at the concentration of μm. it can be concluded that sars-cov antibodies will not necessarily be cross-reactive for sars-cov- . in a different approach, hu et al. recently generated a novel chimeric recombinant protein by connecting the extracellular domain of human ace to the fc region of human immunoglobulin igg . these chimeric constructs displayed high affinity for the sars-cov- and sars-cov rbds and potently neutralized sars-cov and sars-cov- in vitro, with ic values between . and . μm, respectively. these recombinant chimeras also showed cross-reactivity and could therefore have useful applications for diagnosis, prophylaxis, and treatment of sars-cov- .
journal of medicinal chemistry pubs.acs.org/jmc perspective
using the velocimmune platform, pascal et al. generated several human, noncompeting monoclonal antibodies that target the mers-cov s protein and block viral entry into host cells. among them, two antibodies, regn and regn , can significantly inhibit mers-cov pseudoparticles, with ic values of and pm, respectively. in addition, regn and regn performed well in a novel transgenic mouse model, which was developed by replacing the mouse dpp coding sequence with that encoding human dpp . results suggested that both regn and regn were able to potently reduce mers-cov-specific rna levels in the lungs at a μg per mouse dose compared with the isotype control antibody. at the μg dose, regn was more effective at decreasing mers-cov rna levels than regn at the same dose. recently, in the common marmoset model of mers-cov infection, de wit et al. tested the prophylactic and therapeutic efficacy of regn and regn . the data suggested that these antibodies might be more effective when given prophylactically than as a treatment of mers-cov.
in the latest attempt, chen et al. identified sars-cov- rbd-specific antibodies from samples of recovered covid- patients using an rbd-specific elisa binding study. among them, mab- b and mab- d effectively neutralized pseudovirus entry, with ic values of . and . μm, respectively. recently, wang et al. used an elisa-based (cross-)reactivity assay to assess antibody-containing supernatants from a collection of sars-s hybridomas derived from immunized transgenic h l mice that encode chimeric immunoglobulins, and identified a chimeric mab, d , that targets the rbd. d exhibited cross-neutralizing activity against sars-cov-s protein and sars-cov- -s protein pseudotyped vsv infection, with ic values of . and . μm, respectively. brouwer et al. used cross-sectional blood samples from three pcr-confirmed sars-cov- -infected individuals to screen for binders to a soluble prefusion-stabilized s protein of sars-cov- using an elisa-based approach. all three blood samples bound to the prefusion-stabilized s protein, which prompted subsequent sorting of sars-cov- s protein-specific b cells for mab isolation. nineteen nabs were identified that target a diverse range of antigenic sites on the s protein and showed remarkable picomolar inhibiting activities, with the two most potent ic values being . and . μg/ml (cova - and cova - , respectively) against live sars-cov- virus. large antibody libraries are crucial in response to rapidly emerging pathogens. using eight large phage-displayed vh, scfv, and fab libraries and panning against the rbd of sars-cov- , li et al. identified an exceptionally potent mab, igg ab (k d to rbd of pm as judged by biolayer interferometry), that competes with ace in vitro and protected transgenic mice expressing hace from high-titer intranasal sars-cov- challenge. in two different assays using replication-competent sars-cov- , % neutralization at < nm was reported in a microneutralization-based assay and an ic of nm in a luciferase reporter gene assay.
moreover, transgenic mice expressing human ace that were administered . mg of ig ab prior to intranasal infection with sars-cov- did not show any detectable replication-competent virus, demonstrating the preventive effect of igg ab .
small molecules targeting the rbd. besides peptides, mabs, and protein chimeras, small molecules are still the preferred modality for a drug. this is due to improved pharmacokinetics, stability, and dosage logistics compared to proteins or peptides. in addition, small molecules have advantages over peptides/proteins regarding dissemination logistics in remote areas and the high expense of peptide/protein production. to identify small molecule entry inhibitors against the sars-cov s protein, sarafianos et al. screened a chemical library composed of compounds according to lipinski's rule of five and identified an oxazole-carboxamide derivative, ssaa e ( , table ), that blocks the binding of the rbd of
lundin et al. screened a library of diverse compounds and found a small molecule inhibitor, k ( ), which was able to inhibit hcov- e with an ic value of . μm and a cc value of μm. mechanistic studies showed that k targeted a very early step in the hcov- e life cycle and may interact with viral particles, thus inactivating their binding.
■ targeting the cellular receptor
peptide analogues as ace inhibitors. human angiotensin-converting enzyme (ace) is a highly glycosylated type i integral membrane protein that has been identified as a fundamental regulator of the renin−angiotensin system (ras) in humans and is an important target in the regulation of blood pressure homeostasis. ace is a human homologue of ace. it contains a single zinc-binding catalytic domain, which is % similar to the human ace active region.
ace can catalyze the cleavage of angiotensin i into angiotensin - , and of angiotensin ii into the vasodilator angiotensin - , and its organ- and cell-specific expression also suggests a role in the regulation of cardiovascular and renal function and fertility. ace is a functional receptor for sars-cov during viral entry, and recent research demonstrated that sars-cov- also utilizes ace for infection. however, ace cannot be inhibited by ace inhibitors, so there is an urgent need to develop specific ace inhibitors that would prevent infection by both sars-cov and sars-cov- . one of the first efforts to target the ace receptor was documented by liu et al. using a novel epitope assembling assay, they identified linear b-cell immuno-cross-reactive epitopes of the sars-cov s protein by synthesizing longer peptides. five of these peptides showed high serological cross-reactivity in all tested sars patient sera. among them, peptide s - could significantly block the binding of the rbd to ace . s - , derived from the s fragment ( figure and table ), could target ace and showed antiviral activity against sars-cov infection in vitro, with an ec value of . μm, providing an important basis to explore the antiviral potential of s - against sars-cov- . another peptide derived from the rbd, rbd- b, located in s of the sars-cov s protein, is crucial for binding to the host cell's ace receptor (figure and table ). given the vital role of this motif, meyer et al. confirmed the binding to ace of a synthesized peptide mimicking this region ( ykyryl ) with a k d of around μm. moreover, rbd- b displays no toxicity on veroe cells, as judged by an mtt ( -( , )dimethylthiahiazo-(-z-y )- , -di-phenytetrazoliumbromide) cell proliferation assay. in addition, rbd- b showed antiviral activity against hcov-nl , which also uses ace as a functional receptor, at a peptide concentration of mm in caco cells.
constrained peptides are receiving more attention in the drug development field, combining the best attributes of antibodies and small molecules. linear peptides are often highly flexible and unstructured in solution, only forming structures upon target binding. this can reduce the affinity of such peptides for their target through an entropic penalty. however, stabilization methods such as cyclization or hydrocarbon stapling can improve the physicochemical characteristics and drug-like properties while negating the entropic penalty of binding, with a positive impact on affinity. using a constrained peptide library displayed on filamentous phages, ladner et al. identified several peptides inhibiting ace function, the most potent being dx (table ). dx , an n-terminally acetylated and c-terminally amidated peptide, was a potent ace peptide inhibitor with an ic value of nm and a k i value of . nm. dx did not inhibit ace activity and thus is specific to ace . in addition, dx was chemically stable and not hydrolyzable by ace . although it is not clear whether dx can inhibit coronaviruses, it is an effective ace inhibitor, and anticoronavirus tests should be conducted in the future.
small molecules as ace inhibitors. as discussed previously, peptide and constrained peptide inhibitors have inherent caveats concerning their use as drugs. therefore, screening for small molecule inhibitors, guided by information gleaned from the previous studies, is the next logical step. a virtual screen targeting the ace catalytic site with around compounds, combined with a molecular docking approach, led to the identification of naae (n-( -aminoethyl)- aziridine-ethanamine) ( , table ). naae showed a dose-dependent inhibition of ace catalytic activity with an ic value of μm and a k i of μm. despite its micromolar potency in inhibiting a sars-cov pseudotyped virus, cytotoxicity data are not available to date.
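dose-dependent inhibition and ic values of the kind reported for these inhibitors are commonly related through a hill-type dose-response curve. the following is a minimal sketch of that relationship, assuming a simple one-site model with a hill slope of 1 by default; the function name, the slope, and the concentrations are hypothetical illustrations and do not reproduce any measurement from the studies discussed here.

```python
def fractional_inhibition(conc, ic50, hill=1.0):
    """Fraction of activity inhibited at inhibitor concentration `conc`.

    Hill-type model: inhibition = 1 / (1 + (ic50 / conc) ** hill).
    `conc` and `ic50` must share the same units. By construction,
    inhibition is 0.5 when conc == ic50.
    """
    if conc < 0 or ic50 <= 0 or hill <= 0:
        raise ValueError("conc must be >= 0; ic50 and hill must be > 0")
    if conc == 0:
        return 0.0  # no inhibitor, no inhibition
    return 1.0 / (1.0 + (ic50 / conc) ** hill)


# hypothetical inhibitor with an ic50 of 2.0 (arbitrary concentration units)
for c in (0.0, 0.2, 2.0, 18.0):
    print(c, fractional_inhibition(c, ic50=2.0))
```

under this model a compound reaches 90% inhibition only at about nine times its ic value, which is one reason potency and cytotoxicity need to be compared on the same concentration scale.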
chloroquine ( ) currently has applications in malaria and amoebiasis treatment. interestingly, nichol et al. showed that chloroquine could also block the interaction of the rbd of sars-cov with ace under cell culture conditions, with an ed value of . μm. recently, wang et al. found that chloroquine blocked sars-cov- virus infection, with an ic value of . μm and a cc > μm in vero e cells. chloroquine possibly raises endosomal ph, impairing the acidification required for virus/cell fusion, and interferes with the terminal glycosylation of the cellular ace receptor, thereby reducing the affinity of sars-cov/sars-cov- for ace . besides its antiviral activity, chloroquine has immune-modulating activity that may synergistically enhance its antiviral effect in vivo. at present, chloroquine is under clinical investigation in china for the treatment of sars-cov- (chictr ). hydroxychloroquine ( ) is an analogue of chloroquine that shares the same mechanism of action but displays a more tolerable safety profile. recent studies suggest that chloroquine and hydroxychloroquine could cause ventricular arrhythmias, qt prolongation, retinopathy, and other cardiac-related toxicity, which may pose a particular risk to critically ill patients. although both show antiviral activity, their safety and effectiveness require further clinical research. turner et al. identified that the sars-cov receptor, ace , undergoes proteolytic shedding, releasing an enzymatically active ectodomain during viral entry. further research identified that a disintegrin and metalloproteinase (adam ) is responsible for the regulation of ace shedding. inhibiting adam activity with the adam-specific inhibitor gw x ( ) reduced shedding of ace at nm against sars-cov. another enzyme involved in ace shedding is tace (tnf-α converting enzyme, a member of the adam family). two tace inhibitors, tapi- ( ) and tapi- ( ), reduced ace shedding against sars-cov, with ic values of and nm, respectively.
perhaps the most promising small molecule described to date is the very potent ace inhibitor mln- ( ). it can inhibit the catalytic activity of ace with an ic of around pm. the crystal structures of apo and inhibitor-bound ace revealed a significant movement of the n-terminal and c-terminal subdomains of ace upon binding. this movement is important to position critical residues to stabilize the bound inhibitor. its high potency makes it a very attractive candidate for sars-cov- interference; however, no anticoronaviral data are available at this time. milewska et al. synthesized several polymer-based compounds showing prominent anticoronaviral activity. among them, a cationically modified chitosan derivative, n-( hydroxypropyl)- -trimethylammonium chitosan chloride (htcc, ), and hydrophobically modified htcc (hm-htcc, ) were found to inhibit hcov-nl replication. for both tested polymers, the ic values were relatively low in llc-mk cells, amounting to ∼ nm for and ∼ nm for . cc values were ∼ . and ∼ μm for and , respectively. recent research showed that both polymers blocked the interaction of hcov-nl with its ace receptor and thus interfered with the process of viral entry. despite the availability of many compounds with inhibitory effects on ace , the corresponding admet data in a preclinical model are not available. regardless, direct inhibition of ace is probably not a viable therapeutic modality. this is due to its important normal physiological roles, in addition to its protective role against lung injury in acute respiratory distress syndrome from a variety of causes, including sars-cov infection. as such, directly inhibiting ace as an antiviral strategy appears to be physiologically unsound, and virally targeted blockers of its interaction with the sars-cov/sars-cov- s protein hold greater promise.
membrane fusion is a crucial step in the mers/sars infection cycle in both described pathways (see section ).
within the endosomal/clathrin-dependent route, internalized viral particles need to fuse with the endosomal membrane to escape the endosomal/lysosomal environment. this is achieved via a conformational change of the s protein (s domain) within the acidic milieu, followed by membrane fusion activation by the host protease cathepsin l. membrane fusion is also essential during the nonendosomal/clathrin-independent route, where fusion with cellular membranes is facilitated by cleavage of the s protein by cell membrane-associated host proteases such as tmprss . in conclusion, the s domain of the sars-cov s protein and host proteases such as
hr and hr can interact with each other to form a -hb to bring viral and cellular membranes close (for exact location, see figure ). on the basis of this requirement, bosch et al. obtained peptides corresponding to region hr within the hr. in an infection inhibition assay with pseudotyped sars-cov s protein in vero cells, hr - displayed an ec value of μm (figure and table ). moreover, hr - demonstrated concentration-dependent inhibition of hcov-nl infection with an ic value of . μm and a cc value of μm. on the basis of these initial results, further development of the hr - peptide is necessary to obtain a more potent human coronavirus (hcov) peptide inhibitor. similarly, ngai et al. obtained three hr-derived peptides, hr -a, gst-removed-hr , and hr peptide, with remarkable inhibitory activity against sars-cov ( figure and table ). virus entry inhibition studies suggested that hr -a, derived from the hr region, had an ec value of . μm. gst-removed-hr peptide and hr peptide, derived from the hr region, had ec values of . and . μm, respectively. hr p, spanning residues − in hr domains, could effectively inhibit mers-cov infection and s protein-mediated membrane fusion (figure and table ). this study indicates that hr p could specifically inhibit mers-cov in vero cells, with an ic value of ∼ . μm and a cc value of > μm.
hr p also demonstrated high selectivity, as indicated by its high selectivity index (si > ). importantly, the introduction of arg, lys, or glu residues into the hr p peptide increased stability, solubility, and anti-mers-cov activity. to improve the stability, solubility, and antiviral activity of hr p, channappanavar et al. designed and synthesized an hr p analogue named hr p-m . hr p-m strongly blocked s protein-mediated cell−cell fusion in a dose-dependent manner, with an ic value of . μm in vitro. in vivo, intranasal administration of hr p-m to ad / hdpp transgenic mice protected them from mers-cov infection and reduced the lung viral titers by more than fold. moreover, combination treatment with ifn-β was demonstrated to enhance the protective effect. the development of a drug with broad-spectrum hcov inhibitory activity is increasingly becoming an attractive approach. xia et al. found that the ek peptide showed pan-cov fusion inhibitory activity against multiple hcovs ( figure and table ). further investigation revealed that ek directly interacts with the hr region and can competitively inhibit viral -hb formation. the pseudovirus assay showed that ek inhibited hcov-oc , hcov-nl , and hcov- e infection with ic values of . , . , and . μm, respectively. an in vitro cytotoxicity assay determined that ek is not cytotoxic at concentrations up to mm. mice that received mg/kg of ek were nearly completely protected from infection by hcov-oc , as were mice that received μg of ek against mers-cov infection. recently, this team found that ek could also potentially inhibit sars-cov- , with an ic value of . μm in a pseudovirus assay and an ic value of . μm in a fusion inhibition assay. to improve the inhibitory activity of ek against sars-cov- , they conjugated a cholesterol molecule to the ek peptide and found that the new peptide, ek c , exhibited highly potent inhibitory activity against sars-cov- s-mediated membrane fusion and pseudovirus infection, with ic values of . and . nm; the cc of ek c was μm, and the selectivity index was > . in the oc -infected mouse model, mice that received . mg/kg of ek c were nearly completely protected from infection by hcov-oc . these data suggested that ek c could be used for inhibition and treatment of infection by currently circulating sars-cov- . mers- hb, a polypeptide derived from the hr and hr regions, was synthesized by gong et al.; affinity analysis demonstrated a low k d value of . nm, an ic value of μm against mers-cov, and a cc > μm. hr-derived peptides are a highly promising strategy for viral fusion inhibition. successful hr peptides have been used in the past to block entry of other virus families such as hiv, with the gp -derived peptide fuzeon (t ) being the only approved fusion inhibitor for hiv- treatment to date. therefore, hr-derived peptides represent a promising strategy for developing inhibitors against the new sars-cov- . xia et al. reported that two peptide-based membrane fusion inhibitors, e-hr p and e-hr p (figure and table ), targeting the hcov- e s protein hr and hr domains, could competitively inhibit the viral autologous -hb formation and inhibit hcov- e s protein-mediated virus-cell membrane fusion with ic values of . and . μm, respectively. moreover, neither e-hr p nor e-hr p had significant cytotoxicity to huh- and a cells at concentrations up to μm. in addition, e-hr p potentially inhibited pseudotyped and live hcov- e infection with ic values of . and . μm, respectively. the s domain is the most conserved motif between the sars-cov and the new sars-cov- s proteins. it represents an ideal immunogen for generating novel, or repurposing existing, sars-cov s domain-targeting mabs with cross-reactive potential. sasazuki et al., for example, successfully isolated the human mab h from immunized kunming (km) mice. h displayed an anti-sars-cov neutralizing activity of around μg/ml.
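the selectivity index (si) quoted for these peptides is conventionally the ratio of the cytotoxic concentration (cc) to the inhibitory concentration (ic), with both expressed in the same units. the following is a minimal sketch of that calculation; the function name and the numbers are hypothetical and are not values from the studies summarized here.

```python
def selectivity_index(cc50, ic50):
    """Selectivity index: cytotoxic concentration divided by inhibitory concentration.

    Both values must be positive and in the same units (for example, both
    in micromolar). A larger value means a wider window between antiviral
    activity and cytotoxicity.
    """
    if cc50 <= 0 or ic50 <= 0:
        raise ValueError("cc50 and ic50 must be positive")
    return cc50 / ic50


# hypothetical compound: cc50 of 100.0 and ic50 of 0.5, same units
print(selectivity_index(100.0, 0.5))
```

when only a lower bound on cc is known (no toxicity at the highest concentration tested), the same ratio gives a lower bound on si, which is why several values here are reported as "si >".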
cell fusion assays indicate that h can inhibit viral fusion and entry rather than viral attachment to the surface of host cells or cleavage of the s protein. consequently, the s protein of sars-cov might be the direct target of h ; however, further studies are required to confirm this hypothesis. tan et al. identified mab a (ic value between and μg/ml), an anti-sars-cov s domain mab that binds to a conserved loop region between the hr and hr domains of the s domain. tsunetsugu-yokota et al. found that antibody skot can inhibit sars-cov with an ec value of μg/ml in vero e cells. mutational studies indicate that skot restricts conformational changes within the s domain that are essential for viral entry. on the basis of this approach, they identified two small molecules, tgg ( , table ) and luteolin ( ), that can bind avidly to the sars-cov s protein and inhibit viral entry of sars-cov into vero e cells with ic values of . and . μm, respectively. cytotoxicity assays showed that the cc values of and were . and . mm, respectively. therefore, the selectivity indices of and were . and . , respectively. further acute toxicity studies suggested that the % lethal doses of and were ∼ and . mg/kg, respectively. these results indicated that these small molecules could be used at relatively high concentrations in mice. quercetin ( ), an analogue of luteolin, also showed antiviral activity against sars-cov, with an ic
arbidol ( ), a broad-spectrum drug, has been licensed for decades in russia and china against influenza; it binds to the ha protein to block virus−cell fusion. recently, wang et al. identified that arbidol efficiently inhibited sars-cov- virus infection in vitro with an ic value of . μm, a cc value of . μm, and an si of . . vankadari compared protein sequences and found that a small region of the s domain (aa −aa ) of the sars-cov- spike glycoprotein resembles that of the influenza virus h n ha.
thus, the mechanism of arbidol may be to target the sars-cov- spike glycoprotein and block its trimerization, which may inhibit host cell adhesion and hijacking. in january , in wuhan, china, a clinical pilot trial was conducted in which patients with sars-cov- virus infection received mg three times a day for days; untreated sars-cov- patients served as a control group. in this trial, treated patients showed a tendency toward decreased viral load, as determined by rt-pcr, and reduced mortality ( % vs %) compared to the control group. the hr regions of the sars-cov and sars-cov- s proteins share a high degree of conservation, and the small molecules described as fusion inhibitors may have potential applications in inhibiting sars-cov- fusion. indeed, targeting virus surface proteins is a promising antiviral strategy, whether by inhibiting the rbd or the s domain.
during clathrin-dependent viral entry, the host cellular cathepsin l protease plays a key role in infection efficiency by activating the s protein into a fusogenic state to escape the late endosomes, and cathepsin l (lysosomal endopeptidase) cleavage is believed to expose a hydrophobic fusion peptide essential to initiate membrane fusion. in light of its vital role in the sars-cov infection cycle, cathepsin l is a desirable target to interfere with virus−cell entry. cathepsin l consists of a pro- and a mature-domain. in a low ph milieu, the pro-domain is autocatalytically cleaved to obtain the papain-like folded mature-domain consisting of an n-terminal helical domain and a c-terminal β-sheet domain (figure ). a well-conserved cys-his-asn triad in the active site is crucial for substrate binding and catalysis. in light of its importance in the sars-cov- replication cycle, cathepsin l is a highly desirable target that will be described in the following section. teicoplanin is a glycopeptide antibiotic, with applications in the treatment of serious infections caused by gram-positive bacteria such as streptococcus and staphylococcus aureus.
interestingly, teicoplanin was shown to block the entry of sars, mers, and ebola virus by specifically inhibiting cathepsin l activity. more recently, zhang et al. showed that teicoplanin could also block the entry of the new sars-cov- pseudoviruses with an ic value of . μm. as a routinely used clinical antibiotic, teicoplanin could potentially be used immediately to combat the current sars-cov- outbreak. small molecules as cathepsin l inhibitors. human cathepsin l plays numerous critical roles in diverse cellular settings associated with human diseases. previous studies also highlighted the feasibility of targeting this cysteine endopeptidase with small molecules, with implications for possible intervention strategies against sars-cov- infection. a high-throughput screen (hts) of a -compound library by bates et al. resulted in the identification of mdl ( , table ); in an antiviral activity assay, specifically inhibited cathepsin l-mediated substrate cleavage and blocked sars-cov viral entry, with an ic value of . nm and an ec value in the range of nm. however, despite its potent inhibitory activity, no cytotoxicity data for are currently available. two small molecules, cid ( ) and cid ( ), were reported by diamond et al. as viral entry inhibitors of sars-cov. in a cathepsin l inhibition assay, could block cathepsin l with an ic value of . nm, while showed slightly weaker potency with an ic value of nm. interestingly, besides inhibiting sars-cov, compound (ec value of nm) showed some inhibitory activity against ebola virus infection (ec value of nm) of human embryonic kidney t cells. importantly, did not show any sign of toxicity to human aortic endothelial cells at μm. these data offer a promising new starting point for the treatment of sars and ebola virus infections.
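the selectivity index (si) quoted for several entry inhibitors in this section is the ratio of the cytotoxic concentration (cc50) to the inhibitory concentration (ic50), so the two values must be expressed in the same unit before dividing (the text reports some cc values in mm and the matching ic values in μm). the following is a minimal python sketch of that arithmetic; the function names, the unit table, and the example concentrations are hypothetical illustrations and are not taken from any of the cited studies.

```python
import math

# Conversion factors to mol/L for the units used in the text.
UNIT_TO_MOLAR = {"nM": 1e-9, "uM": 1e-6, "mM": 1e-3}


def to_molar(value, unit):
    """Convert a concentration to mol/L. `unit` must be a key of UNIT_TO_MOLAR."""
    return value * UNIT_TO_MOLAR[unit]


def selectivity_index(cc50, ic50):
    """Return SI = CC50 / IC50. Both arguments must be in the same unit."""
    if cc50 <= 0 or ic50 <= 0:
        raise ValueError("concentrations must be positive")
    return cc50 / ic50


# Hypothetical compound (placeholder numbers, not from the source):
# CC50 = 3 mM and IC50 = 10 uM.
si = selectivity_index(to_molar(3.0, "mM"), to_molar(10.0, "uM"))
print(f"SI = {si:.0f}")
```

a larger si means a wider margin between the antiviral and the cytotoxic concentration; when only a lower bound on cc50 is reported (for example, no toxicity observed up to the highest tested concentration), the same ratio gives a lower bound on si.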
recently, in a cell-based assay screen of ∼ compounds, ssaa e ( ) was identified as a compound that could specifically bind to the cathepsin l proteinase and interfere with the sars s protein during viral entry, with an ic value of . μm. in a pseudotype-based assay in t cells, the ec value of was around . μm, and no cytotoxicity was detected below μm. using sars-cov entry assays, zhou et al. screened cysteine protease inhibitors with confirmed activity against human cathepsins. among them, k ( ) demonstrated the most robust activity. results demonstrated that blocked sars-cov pseudovirus entry at an ic value of . nm, while no toxicity was observed (cc value > μm). interestingly, for other coronaviruses, showed broad-spectrum antiviral activity with ic values of . , . , and . nm against hcov- e, hcov-nl , and mers-cov, respectively. inhibitors of cell membrane-associated tmprss . either the endosomal cysteine protease cathepsin l or the cell membrane-associated serine protease tmprss can facilitate sars-cov virus entry into host cells by cleavage of the viral s protein. this cleavage exposes fusion-competent motifs known as fusion peptides, and importantly, for sars-cov, inhibition of both proteases is required for efficient inhibition of virus replication. matsuyama et al. identified camostat ( , table ), a commercially available serine protease inhibitor that can efficiently prevent sars-cov infections at μm by inhibiting tmprss activity. however, even at high concentrations ( μm) of , the inhibition of viral entry via sars s protein-mediated cell fusion never exceeded % (inhibition efficiency), indicating that despite the inhibition of tmprss , % of virus entry takes place via the endosomal cathepsin pathway. therefore, they examined the activity of pseudotyped viruses when treated with a combination of ( , )trans-epoxysuccinyl-l-leucylamido- methylbutane ethyl ester (est, a cathepsin inhibitor) and .
the results suggested that simultaneous treatment with est and remarkably blocked infection (> %). similarly, pöhlmann et al. reported that could prevent the viral entry of sars-cov- . importantly, full inhibition efficiency was attained when cells were treated with both and e- d (a cathepsin inhibitor). both studies indicate that sars-cov and sars-cov- enter cells via a similar mechanism, showing the potential of as a promising candidate for further development as a sars-cov- treatment. inhibitors of the furin cleavage site in the coronavirus spike proteins. elevated levels of furin expression were able to facilitate mers-cov pseudovirion infection, and viral entry could be reduced by furin sirna silencing. decanoyl-rvkr-chloromethylketone ( , dec-rvkr-cmk), a furin inhibitor, was shown to block mers-cov s protein-mediated entry as well as virus infection, with an ic value of μm in hek- t cells. furthermore, when the serine protease inhibitor camostat was used in combination with , a significant reduction in infectivity was observed compared to camostat alone. recently, bestle et al. showed that the potent peptidomimetic inhibitor mi- ( ) could prevent proteolytic processing of the s protein from sars-cov- by endogenous furin in hek cells. however, no antiviral data are available for yet. the peculiar furin-like cleavage site (s /s -site in figure ) in sars-cov- , which is absent in sars-cov and other sars-like covs, indicates that furin inhibitors could play a significant role in blocking the viral entry process. host factor inhibitors. sars-cov- cell entry also relies on host cell factors. therefore, these host cell factors can play an essential role as targets for sars-cov- inhibition. chlorpromazine ( table ) is an antipsychotic drug developed for the treatment of schizophrenia. it has also been reported to inhibit infection by hepatitis c virus (hcv), mouse hepatitis virus (mhv- ), and alphavirus.
( ), an abelson kinase signaling pathway inhibitor that could inhibit abelson tyrosine−protein kinase (abl ) to block mers-cov virion fusion with endosomal membranes with an ic value of μm. showed no cytotoxic effects in vero cells at μm. another abl inhibitor, dasatinib ( ), was active against both mers-cov and sars-cov, with ic values of . and . μm, respectively. on the basis of an hts assay using cytopathic-effect ( phenotypic screening methods are usually used to identify first-in-class drugs without knowing the actual target and mechanism of action of the drug, while target-based screening identifies best-in-class drugs. although the phenotypic screening approach is often limited in capacity compared to in silico target-based screening, it has the advantage of identifying cell-active compounds and providing information on drug solubility or cell uptake. many drugs, especially natural products, have an unknown mechanism of action but were shown to inhibit coronavirus entry. hsiang et al. screened a library of chinese herbs using a biotinylated enzyme-linked immunosorbent assay to search for active compounds that could potentially inhibit sars-cov s protein binding to ace . further studies identified that emodin ( , table ), the active component from polygonum multiflorum and rheum officinale, could block the interaction of the sars-cov s protein with ace , with an ic value of μm in an s protein-pseudotyped retrovirus assay using vero e cells. however, the mechanism of action of still needs to be determined. sarafianos et al. found that ssaa e ( ), a benzamide derivative, could prevent sars-cov virus−cell membrane fusion in pseudotype-based and antiviral-based assays, with an ic value of . μm, but a cc value of μm indicates additional unknown cellular targets. ve ( ) was identified in a phenotype-based hts of a structurally diverse small-molecule compound library.
a pseudotype virus entry assay suggested that ve can specifically inhibit sars-cov virus entry into cells with an ec value of μm, and it inhibited sars-cov plaque formation with an ic of . μm. a similar hts approach was employed by zhang et al. for screening a compound library consisting of structurally diverse small molecules. eighty-four compounds were identified with significant anticoronavirus potential. further studies revealed that compounds inhibited virus entry, while others interfered with viral replication. natural products should, however, be considered with caution due to their unknown mechanism of action and possible toxic side effects. the recent sars-cov- outbreak, with its high fatality rate, has raised global concerns and was declared a global pandemic by the who. the number of infections continues to rise, and numerous research groups around the globe have prioritized the identification and development of new covid- treatments. still, there are no effective treatments to date. viral entry is the first step in the viral life cycle and represents an attractive intervention point, either by blocking the coreceptor interaction or the virus−cell membrane fusion event. sars-cov- and other coronaviruses have similar infection mechanisms. this is especially true for sars-cov and hcov-nl , which share the same human ace receptor crucial for viral entry. therefore, already developed inhibitors against known hcovs could potentially be used to combat sars-cov- . these efforts identified a large number of inhibitors, including peptides, antibodies, small-molecule compounds, and natural products with anticoronavirus activity. although many inhibitors demonstrated efficacy in inhibiting coronavirus infection, no specific prophylactic or postexposure therapy is currently available for hcovs. one of the main reasons is that most of the potential agents were not adequately evaluated in in vitro and in vivo studies.
most drugs are in the preclinical stage and were stopped in animal models due to poor bioavailability, safety, and pharmacokinetics, so that few entered human trials. in light of the urgency of the current outbreak, repositioning of already approved drugs is increasingly becoming a promising approach, especially with toxicity and safety data in hand. the most effective measure to prevent viral diseases is vaccination. coronavirus vaccine development has mainly focused on the s protein, and some of the reported vaccines can inhibit sars and mers. although vaccination strategies were developed in the context of previous epidemics, no vaccine for sars-cov- infections is yet available. since the recent sars-cov- outbreak, research groups around the world have been stepping up efforts to develop vaccines targeting sars-cov- , and vaccine research routes include nucleic acid vaccines, viral vector vaccines, inactivated vaccines, and recombinant protein vaccines. typical vaccine development is costly in time, resources, and money, although this pandemic has created initiatives that hope to speed the development of a sars-cov- vaccine. even under the most optimistic views, an effective sars-cov- vaccine is at least one year away. even after creation, other hurdles for a sars-cov- vaccine include global implementation and distribution, and different strategies for containing this contagion should be explored simultaneously with the vaccine efforts. in addition to small-molecule inhibitors, monoclonal antibodies, and vaccine development, convalescent sera from sars-cov- survivors (convalescent-phase sera) are an additional option for covid- treatment. passive immunization is well established for viral infection prophylaxis. a meta-analysis of studies of the h n influenza epidemic demonstrated that early treatment with convalescent blood products decreased the risk ratio associated with pneumonia from % to %.
nevertheless, the titer of convalescent-phase serum antibody required for therapeutic efficacy against sars-cov- remains to be determined. research carried out with mers-cov suggested that sera from patients recovering from infections did not contain sufficient antibody titers for therapeutic use. recent initiatives such as the governmental (usa) operation warp speed (ows), which supports the development, manufacturing, and distribution of covid- vaccines, therapeutics, and diagnostics, or the accelerating covid- therapeutic interventions and vaccines (activ) public−private partnership coordinated by the national institutes of health (nih) are crucial milestones in a coordinated effort to accelerate and prioritize the development of the most promising vaccines and treatments. initiatives like these that bridge government, academia, and industry should also be continued past the current covid- crisis so that we can respond to future novel outbreaks rapidly and adequately.
key: cord- -oa jfots authors: taka, e.; yilmaz, s. z.; golcuk, m.; kilinc, c.; aktas, u.; yildiz, a.; gur, m. title: critical interactions between the sars-cov- spike glycoprotein and the human ace receptor date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: oa jfots severe acute respiratory syndrome coronavirus (sars-cov- ) enters human cells upon binding of its spike (s) glycoproteins to ace receptors and causes the coronavirus disease (covid- ).
therapeutic approaches to prevent sars-cov- infection are mostly focused on blocking s-ace binding, but critical residues that stabilize this interaction are not well understood. by performing all-atom molecular dynamics (md) simulations, we identified an extended network of salt bridges, hydrophobic and electrostatic interactions, and hydrogen bonding between the receptor-binding domain (rbd) of the s protein and ace . mutagenesis of these residues on the rbd was not sufficient to destabilize binding but reduced the average work to unbind the s protein from ace . in particular, the hydrophobic end of rbd serves as the main anchor site and unbinds last from ace under force. we propose that blocking this site via neutralizing antibody or nanobody could prove an effective strategy to inhibit s-ace interactions. the covid- pandemic is caused by sars-cov- , which is a positive-sense single-stranded rna betacoronavirus. phylogenetic analyses demonstrated that the sars-cov- genome shares ~ % sequence identity with severe acute respiratory syndrome coronavirus (sars-cov), and ~ % with the middle-east respiratory syndrome coronavirus (mers-cov) ( ). despite these similarities, sars-cov- is much more infectious and fatal than sars-cov and mers-cov together ( ) . sars-cov- consists of a kb single-stranded rna genome that is encapsulated by a lipid bilayer and three distinct structural proteins that are embedded within the lipid membrane: envelope (e), membrane (m), and spike (s). host cell entry is primarily mediated by homotrimeric s glycoproteins located on the viral membrane ( fig. a) ( ) . each s protomer consists of s and s subunits that mediate binding to the host cell receptor and fusion of the viral envelope, respectively ( , ) . the receptor-binding domain (rbd) of s undergoes a large rigid body motion to bind to ace . in the closed state, all rbds of the s trimer are in the down position, and the binding surface is inaccessible to ace . 
the switching of one of the rbds into a semi-open intermediate state is sufficient to expose the ace binding surface and stabilize the rbd in its up position (fig. b) ( ) . the s protein binds to the human angiotensin-converting enzyme (ace ) receptor, a homodimeric integral membrane protein expressed in the epithelial cells of lungs, heart, kidneys, and intestines ( ) . each ace protomer consists of an n-terminal peptidase domain (pd), which interacts with the rbd of the s protein through an extended surface (fig. a , c) ( ) ( ) ( ) . upon ace binding, proteolytic cleavage of the s protein by the serine protease tmprss separates the s and s subunits ( ) . the s protein exposes fusion peptides that insert into the host membrane and promote fusion with the viral membrane ( ) . to prevent sars-cov- infection, there is a global effort to design neutralizing antibodies ( ) , nanobodies ( ) , peptide inhibitors ( ) , and small molecules ( ) that target the ace binding surface of the s protein. yet, only a limited number of studies were performed to investigate critical interactions that facilitate s-ace binding using md simulations. initial studies have constructed a homology model of sars-cov- rbd in complex with ace , based on the sars-cov crystal structure ( , ) and performed conventional md (cmd) simulations totaling ns ( , ) and ns ( , ) in length to estimate binding free energies ( , ) and interaction scores ( ) . more recent studies used the crystal structure of sars-cov- rbd in complex with ace to perform coarse-grained ( ) and all-atom ( - ) md simulations. the effect of the mutations that disrupt close contact residues between sars-cov- rbd and ace on binding free energy was investigated by post-processing of the md trajectories ( , , , ) or by using bioinformatic methods ( ) . 
the work required to unbind the s protein from ace would provide a more accurate estimate of the binding strength, but this has not been calculated under low pulling velocities using the structure of sars-cov- rbd in complex with ace . in addition, a systematic analysis of the critical residues that stabilize s-ace binding, and of how mutagenesis of these interaction sites reduces the binding strength and alters the way the s protein detaches from ace under force, has not yet been performed. in this study, we performed a comprehensive set of all-atom md simulations totaling . µs in length using the recently solved structure of the rbd of the sars-cov- s protein in complex with the pd of ace ( ). simulations were performed in the absence and presence of external force to investigate the binding characteristics and estimate the binding strength. these simulations revealed interactions between the rbd and pd in addition to those observed in the crystal structure ( ). an extensive set of alanine substitutions and charge reversal mutations of the rbd amino acids involved in ace binding was performed to quantify how mutagenesis of these residues weakens binding in the presence and absence of force in simulations. we showed that the hydrophobic end of rbd primarily stabilizes s-ace binding, and targeting this site could potentially serve as an effective strategy to prevent sars-cov- infection. to model the dynamic interactions of the s protein-ace binding interface, we used the co-structure of the rbd of the sars-cov- s protein in complex with the pd of human ace ( ) (fig. c). the structure was solvated in a water box containing a physiologically relevant salt concentration ( mm nacl). two sets of cmd simulations, each of ns in length, were performed to determine the formation of salt bridges ( ) and hydrogen bonds, as well as electrostatic and hydrophobic interactions, between rbd and pd (table s ).
a cutoff distance of Å between the basic nitrogens and acidic oxygens was used to score a salt bridge formation ( ) . for hydrogen bond formation, a maximum . Å distance between hydrogen bond donor and acceptor and a ° angle between the hydrogen atom, the donor heavy atom, and the acceptor heavy atom was used ( ) . interaction pairs that satisfy the distance, but not the angle criteria were analyzed as electrostatic interactions. for hydrophobic interactions, a cutoff distance of Å between the side chain carbon atoms was used ( ) ( ) ( ) . using these criteria, we identified eleven hydrophobic interactions ( fig. a) , eight hydrogen bonds (fig. b) , two salt bridges and six electrostatic interactions (fig. c ) between rbd and pd. observation frequencies were classified as high and moderate for interactions that occur in % and above and between - % of the total trajectory, respectively. f and y of rbd formed hydrophobic interactions with f , l , m , and y of pd, while l , f , y , and a of rbd formed hydrophobic interactions with t of pd at high frequencies (fig. d ). salt bridges between k -d (rbd-pd) and e -k , and hydrogen bonds between n -y , t -d , and q -e were observed at high frequencies, whereas hydrogen bonds y -d , q -k , t -y , y -e , and q -e were observed at moderate frequencies (fig. d ). residue pairs y -h , n -q , t -y , n -k , q -k , and y -q exhibited electrostatic interactions throughout the simulations (fig. d ). the interaction network we identified in our md simulations were mostly consistent with reported interactions in the rbd-pd crystal structure ( ) . however, our simulations identified four hydrogen bonds (q -k , t -d , y -e , and q -q ), one hydrophobic interaction (l -t ), and two electrostatic interactions (y -h and n -k ) that are not present in the crystal structure. in turn, we did not detect frequent hydrogen bonding between g -q , g -k , and y -r and an electrostatic interaction between g -k observed in the crystal structure ( ). 
this discrepancy may be due to radically different thermodynamic conditions between crystallization solutions and cmd simulations ( ) . we divided the rbd-pd interaction surface into three contact regions (cr - , fig a- the core region (cr ) comprised significantly fewer interactions than the ends of the rbd binding surface (cr and cr ). remarkably, out of interactions we detected in cr were hydrophobic, which were proposed to play a central role in anchoring of rbd to pd ( ) . unlike cr , cr formed only a single hydrophobic interaction with pd, whereas cr did not form any hydrophobic interactions. to estimate the binding strength of the s protein to ace , we performed steered md (smd) simulations to pull rbd away from pd at a constant velocity of Å − along the vector pointing away from the binding interface (fig. a) . steering forces were applied to the cα atoms of the rbd residues on the binding interface, whereas cα atoms of pd residues at the binding interface were kept fixed. because part of the work applied is lost to the irreversible processes as we pull rbd away from pd at a finite velocity, the second law of thermodynamics indicates that unbinding free energy difference between the initial and final states cannot be larger than the average work required for unbinding. therefore, our calculations report relative changes in the binding free energy of wild-type (wt) and mutant rbd under the same velocity and thermodynamic conditions. in smd simulations (each ns, totaling ns in length, table s ), the average work applied to unbind rbd from pd was . ± . kcal/mol (mean ± s.d.), demonstrating that the s protein binds stably to ace (fig. b) . to investigate the contribution of each of the interactions we identified to the overall binding strength, we introduced point mutations on the rbd. salt bridges were eliminated by charge reversals (k e and e k). 
we also replaced each amino acid with alanine (table s ) to disrupt the pairwise interactions ( ) , with minimal perturbations the protein backbone ( ) . two sets of cmd simulations (a total of . µs in length) were performed for each point mutant. we first quantified the root mean square fluctuation (rmsf) of the cα atom of the rbd residues located on the pd binding surface (fig. c) . the rigid body motions were eliminated by aligning the rbd interaction surface of pd for each conformer (see methods). out of mutations increased the residue fluctuations compared to wt (fig. s a), suggesting that disrupting the interactions between rbd and pd results in floppier binding. largest fluctuations were observed for mutations in cr (f a, and n a), mutations in cr (y a and y a) and mutation in cr (l a) (fig. c) . mutation of these residues also increased the fluctuations in their neighboring region. while mutations in cr increased fluctuations in cr significantly, mutations in cr had little to no effect on the fluctuations in cr ( fig. d and fig. s b ). we next performed smd simulations modeling the unbinding of rbd of each point mutant from pd ( simulations for each mutant, a total of . µs in length, table s ). f a, y a, y a, n a, and y a mutations substantially decreased the work requirement to unbind rbd-pd by . %, . %, . %, . % and . %, respectively ( fig. e-f, fig. s ). we note that most of these mutations also led to the largest increase in residue fluctuations on the binding surface (fig. c ). of these residues (f , n , and y ) are located in cr , whereas y is located in cr . these results highlight the primary role of hydrophobic interactions in cr to stabilize the s-ace binding. to further characterize critical interactions of the s-ace binding interface, we introduced double mutants to neighboring residues of rbd that form critical interactions with pd. we performed a total of . µs of cmd and . µs of smd simulations for double mutants (table s ). 
in particular, double mutants in cr resulted in out of the highest increase in rmsf ( fig. a and fig. s ). the f a/n a mutation at cr resulted in the largest increase in fluctuations in both cr and cr (fig. b and fig. s ). in smd simulations, out of double mutations also further decreased the average work to unbind rbd from pd ( fig. c-d, and fig. s ). similar to the rmsf analysis, double mutants in cr (f a/n a, e a/y a, e a/f a, and l a/f a) resulted in out of the largest decreases in average work (fig. d) . a charge reversal of k e in combination with either q a or y a also resulted in a large decrease in work values (fig. d) . we also used jarzynski equality ( , ) to construct the free energy profiles as a function of a reaction coordinate, referred to as the potential of mean force (pmf) ( ) . based on the estimated pmf ( fig. s ), double mutants in cr resulted in the largest decrease in the binding energy by - % compared to wt. collectively, these results show that two salt bridges (e -k and k -d ) and the network of hydrophobic interactions in cr involving f , y , and f residues are the most significant contributors of binding strength between the s protein and ace . to test whether cr anchors rbd to pd ( ), we investigated the order of events that result in detachment of rbd from pd in smd simulations. the unbinding process appears to perform a zipper-like detachment starting from cr and ending at cr in % of the simulations (fig. a) . in only % of the simulations, cr released last from pd (fig. a) . because unbinding simulations can reveal features characteristic for the reverse process of binding ( ) ( ) ( ) ( ) ( ) , these results suggest that cr binding is the first and critical event for the s protein binding to ace . mutagenesis of the critical residues in cr , in general, resulted in a substantial decrease in the percentages of unbinding events that terminate with the release of cr from pd. 
in alanine replacement of the hydrophobic residues (f a, f , and y ), cr was released last for %, %, and % of the smd simulations, respectively (fig. b) . the probability of cr to release last under force was further reduced in double mutants of e a/f a ( %) and l a/f a ( %) (fig. b ). unlike these mutants, f a and f a/n a mutants in cr increased the probability of cr to release last, but this could be attributed to a large increase in fluctuations in cr upon these mutations ( fig. s b ). these results indicate that single and double mutants of the critical residues in cr substantially reduce the binding free energy of this region to ace . it remains unclear whether higher infectivity of sars-cov- than sars-cov can be attributed to stronger interactions between s and ace in sars-cov- ( , ) . to test this possibility, we performed two sets of md simulations for the rbd of sars-cov s protein bound to the pd of ace (pdb id: ajf ( )), and compared these results to that of sars-cov- . similar to sars-cov- , rbd of sars-cov makes an extensive network of interactions with pd. we identified eleven hydrophobic interactions (fig. a) , six hydrogen bonds (fig. b) , and seven electrostatic interactions (fig. c) . out of these interactions, only are conserved in sars-cov and the following mutations have taken place: l /f (sars-cov/sars-cov- ), f /y , p /a , p /e , l /f v /k , n /q , y /q , and t /n . similar to sars-cov- , l and y of sars-cov rbd formed a total of seven hydrophobic interactions at a high frequency with the hydrophobic pocket of ace (fig. d) . unlike sars-cov- , sars-cov rbd did not form any salt bridges with ace . we next modeled the unbinding of rbd of sars-cov from pd by performing smd simulations (totaling ns in length, table s ). the average total unbinding work of sars-cov ( . ± . kcal/mol, mean ± s.d., fig. e ) was identical but more broadly distributed than that of sars-cov- ( . ± . kcal/mol, fig. b ). 
unlike sars-cov- , cr released last from pd in only % of the unbinding events of rbd of sars-cov, whereas the unbinding of cr was the last event in the remaining % (fig. f) . these results indicate that the s protein binds stably to ace in both sars-cov and sars-cov- and the higher infectivity of sars-cov- cannot be explained by an increase in binding strength. higher variability in unbinding work values and the absence of a clear order in unbinding events of rbd of sars-cov suggest that sars-cov has a more variable binding mechanism to ace than sars-cov- . we performed an extensive set of in silico analysis to identify critical residues that facilitate binding of the rbd of the sars-cov- s protein to the human ace receptor. mutagenesis of these residues and pulling the rbd away from pd at a low velocity enabled us to estimate the free energy of binding and the order of events that result in the unbinding of rbd from pd. our simulations showed that the pd interacting surface of rbd can be divided into three contact regions (cr - ). hydrophobic residues of cr strongly interact with the hydrophobic pocket of pd in both sars-cov and sars-cov- . cr of sars-cov- also forms a salt bridge with ace that is not present in sars-cov. based on our smd simulations, we did not observe a major difference in binding strength of the s protein to ace between sars-cov and sars-cov- , indicating that higher infectivity of sars-cov- is not due to tighter binding of s to the ace receptor. these results are consistent with a recent md simulation that applied the generalized born and surface area continuum solvation approach (mm-gbsa) ( ) , coarse-grained simulations ( ) , and biolayer interferometry ( ). our analysis suggests that cr is the main anchor site of the sars-cov- s protein to ace , and blocking the cr residues f , e , f , n , and y could significantly reduce the binding affinity. 
consistent with this prediction, the llama-based nanobody h -h neutralizes sars-cov- ( ) by interacting with % and % of the critical residues we identified in cr and cr , respectively. similarly, the human antibodies ha ( ) and vh-fc ab ( ) neutralize sars-cov- by interacting with the f , a , and f residues on cr , which were among the strongest interactions we detected between rbd and pd. experimental studies revealed that antibodies against sars-cov induce limited neutralizing activity against sars-cov- ( , ) . this may be attributed to the low sequence conservation of the cr region between sars-cov and sars-cov- . in particular, the s protein of sars-cov- contains critical phenylalanine (f ) and glutamate (e ) residues, not present in sars-cov, that form hydrophobic interactions and a salt bridge with ace , respectively. it remains to be determined whether this difference plays a role in the higher infectivity of sars-cov- compared with sars-cov. our simulations show that single and double mutants of cr are not sufficient to disrupt the binding of rbd to ace , but they reduce the binding free energy of this region. because rbd makes multiple contacts with ace through an extended surface, small molecules or peptides that target a specific region in the rbd-ace interaction surface may not be sufficient to prevent binding of the s protein to ace . instead, blocking a larger surface of the cr region with a neutralizing antibody or nanobody is more likely to introduce steric constraints that prevent the s protein-ace interactions.
materials and methods
md simulations system preparation. for cmd simulations, the crystal structure of sars-cov- s protein rbd bound to ace at . Å resolution (pdb id: m j) ( ) was used as a template. the chloride ion, zinc ion, glycans, and water molecules in the crystal structure were kept in their original positions. single and double point mutants were generated using the mutator plugin in vmd ( ) .
each system was solvated in a water box (using the tip p water model) having a Å cushion in the positive x-direction and Å cushions in the other directions. this puts a Å water cushion between the rbd-pd complex and its periodic image in the x-direction, creating enough space for unbinding simulations. ions were added to neutralize the system and the salt concentration was set to mm to construct a physiologically relevant environment. the size of each solvated system was ~ , atoms. all system preparation steps were performed in vmd ( ) . all md simulations were performed in namd . ( ) using the charmm ( ) force field with a time step of fs. md simulations were performed under n, p, t conditions. the temperature was kept at k using langevin dynamics with a damping coefficient of ps - . the pressure was maintained at atm using the langevin nosé-hoover method with an oscillation period of fs and a damping time scale of fs. periodic boundary conditions were applied. a Å cutoff distance was used for van der waals interactions. long-range electrostatic interactions were calculated using the particle-mesh ewald method. for each system, , steps of minimization followed by ns of equilibration were first performed while keeping the protein fixed. the complete system was then minimized for an additional , steps, followed by ns of equilibration with constraints applied to the cα atoms. subsequently, these constraints were released and the system was equilibrated for an additional ns before initiating the production runs. the length of the equilibration steps is expected to account for the structural differences due to the radically different thermodynamic conditions of crystallization solutions and md simulations ( ) . md simulations were performed in comet and stampede using ~ million core-hours in total.
rmsf calculations. rmsf values were calculated as $\langle \Delta R_i^2 \rangle^{1/2} = \langle (R_i - \langle R_i \rangle)^2 \rangle^{1/2}$, where $\langle R_i \rangle$ is the mean atomic coordinate of the $i$-th cα atom and $R_i$ is its instantaneous coordinate.
smd simulations.
smd ( ) simulations were used to explore the unbinding process of rbd from ace on time scales accessible to standard simulation lengths. smd simulations have been applied to explore a wide range of processes, including domain motion ( , ) , molecule unbinding ( ) , and protein unfolding ( ) . in smd simulations, a dummy atom is attached to the center of mass of the 'steered' atoms via a virtual spring and pulled at constant velocity along the 'pulling direction', resulting in a force $F$ applied to the smd atoms along the pulling vector ( ),
$$F = -\nabla U, \qquad U = \frac{1}{2} k \left[ v t - \left( \mathbf{R}(t) - \mathbf{R}_0 \right) \cdot \mathbf{n} \right]^2,$$
where $U$ is the guiding potential, $k$ is the spring constant, $v$ is the pulling velocity, $t$ is time, $\mathbf{R}(t)$ and $\mathbf{R}_0$ are the coordinates of the center of mass of the steered atoms at time $t$ and $0$, respectively, and $\mathbf{n}$ is the direction of pulling ( ) . the total work ($W$) performed in each simulation was evaluated by integrating $F$ over the displacement $\xi$ along the pulling direction as $W = \int F(\xi)\, d\xi$. in smd simulations of sars-cov- , cα atoms of ace residues s -s , t -p , q -n , g -i , and p -r were kept fixed, whereas cα atoms of rbd residues k -i , g -f , y -a , and n -y were steered (fig. a) . steered atoms were selected as the region comprising the interacting residues. for sars-cov smd simulations, the same ace residues were kept fixed. however, two slightly different selections of steered atoms were applied: i) using the same residue positions as for sars-cov- , which are v -i , t -l , f -s , and n -y , and ii) selecting the region comprising the interacting residues, which are t -l , f -d , and n -y . the total numbers of fixed and steered atoms were identical in all simulations. the pulling direction was selected as the vector connecting the centers of mass of the fixed and steered atoms. the pulling direction also serves as the reaction coordinate ξ for free energy calculations. each smd simulation was performed for ns using a Å − pulling velocity.
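the spring force and work integral described above can be sketched in a few lines of python. this is a minimal illustration under stated assumptions, not the authors' analysis code: the function names `smd_force` and `pulling_work`, the trapezoidal integration scheme, and any numerical inputs are hypothetical choices made for the example.

```python
from typing import Sequence


def smd_force(k: float, v: float, t: float,
              r_t: Sequence[float], r_0: Sequence[float],
              n: Sequence[float]) -> float:
    """Spring force along the unit pulling vector n.

    Implements F = k * [v*t - (R(t) - R_0) . n], i.e. the magnitude of
    -grad U for the harmonic guiding potential U = 0.5*k*[...]**2.
    """
    # Projection of the center-of-mass displacement onto the pulling direction.
    projection = sum((a - b) * c for a, b, c in zip(r_t, r_0, n))
    return k * (v * t - projection)


def pulling_work(forces: Sequence[float], displacements: Sequence[float]) -> float:
    """Trapezoidal estimate of W = integral of F(xi) d(xi) (assumed scheme)."""
    if len(forces) != len(displacements):
        raise ValueError("forces and displacements must have the same length")
    work = 0.0
    for i in range(1, len(forces)):
        step = displacements[i] - displacements[i - 1]
        work += 0.5 * (forces[i] + forces[i - 1]) * step
    return work
```

in practice the force and displacement series would be read from the smd output of the simulation package; here they are plain python sequences so that the arithmetic is easy to follow.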
at a spring constant of − Å − , the center of mass of the steered atoms followed the dummy atom closely while the spring was still soft enough to allow small deviations. for each system, conformations were sampled with a ns frequency from their cmd simulations ( conformers from each set of the cmd simulations listed in table s md - a-b). these conformations served as separate starting conformations, $\mathbf{R}_0$, for each set of smd simulations (table s md - c-d).
potential of mean force for unbinding of rbd. work values to unbind rbd from ace at low pulling velocities along the reaction coordinate were analyzed using the jarzynski equality, which provides a relation between equilibrium free energy differences and the work performed through non-equilibrium processes ( ) ( ) ( ) :
$$e^{-\Delta F / k_B T} = \left\langle e^{-W / k_B T} \right\rangle,$$
where $\Delta F$ is the helmholtz free energy difference, $k_B$ is the boltzmann constant and $T$ is the temperature. because work values sampled in our smd simulations differ by more than $k_B T$ ( fig. s and s ) , the average calculated in eq. will be dominated by small work values that are only rarely sampled. for a finite number ($N$) of smd simulations, the finite-sample estimate $-k_B T \ln\left( \frac{1}{N} \sum_{i=1}^{N} e^{-W_i / k_B T} \right)$ did not converge to the value given by the ensemble average $\left\langle e^{-W / k_B T} \right\rangle$. thus, eq. provides an upper bound on $\Delta F$, which was used as an estimate of the pmf.
fig. s . rmsf values of single and double mutants of rbd of sars-cov- .
fig. s . distribution of work values obtained from smd simulations for each single point mutant system of rbd of sars-cov- .
fig. s . distribution of work values obtained from smd simulations for each double point mutant system of rbd of sars-cov- .
fig. s . pmf and Δf values of wt and six mutants of rbd of sars-cov- .
table s . starting conformations and durations of the md simulations performed.
movie s . cr releasing last when sars-cov- rbd was pulled away from ace pd.
movie s . cr releasing last when sars-cov- rbd was pulled away from ace pd.
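as an illustration of the potential-of-mean-force analysis in the methods, the finite-sample jarzynski estimate can be computed as follows. this is a hedged sketch rather than the authors' code: the function name `jarzynski_free_energy`, the log-sum-exp rearrangement, and the example value of `kT` are assumptions introduced here.

```python
import math
from typing import Sequence


def jarzynski_free_energy(works: Sequence[float], kT: float) -> float:
    """Return -kT * ln( (1/N) * sum_i exp(-W_i / kT) ).

    `works` holds the work values from N independent pulling runs and
    `kT` is the thermal energy in the same units (both hypothetical inputs).
    """
    if not works:
        raise ValueError("at least one work value is required")
    # Log-sum-exp rearrangement: factor out the smallest work value so the
    # exponentials stay in a numerically safe range.
    w_min = min(works)
    total = sum(math.exp(-(w - w_min) / kT) for w in works)
    return w_min - kT * math.log(total / len(works))


if __name__ == "__main__":
    # Illustrative numbers only; kT = 0.6 is an assumed thermal energy.
    sample_works = [1.0, 2.0, 3.0]
    estimate = jarzynski_free_energy(sample_works, kT=0.6)
    mean_work = sum(sample_works) / len(sample_works)
    print(estimate <= mean_work)
```

the estimate always lies between the smallest sampled work and the mean work, which makes the dominance of rare low-work trajectories noted in the text easy to see with small inputs.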
identification of a novel coronavirus causing severe pneumonia in human: a descriptive study
structure, function, and antigenicity of the sars-cov- spike glycoprotein
mechanisms of coronavirus cell entry mediated by the viral spike protein
stabilized coronavirus spikes are resistant to conformational changes induced by receptor recognition or proteolysis
conformational transition of sars-cov- spike glycoprotein between its closed and open states
structural basis for the recognition of the sars-cov- by full-length human ace
structure of the sars-cov- spike receptor-binding domain bound to the ace receptor
structure of sars coronavirus spike receptor-binding domain complexed with receptor
sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor
key residues of the receptor binding motif in the spike protein of sars-cov- that interact with ace and neutralizing antibodies
neutralizing nanobodies bind sars-cov- spike rbd and block interaction with ace
computational design of ace -based peptide inhibitors of sars-cov-
repurposing approved drugs as inhibitors of sars-cov- s-protein from molecular modeling and virtual screening
cryo-em structure of the sars coronavirus spike glycoprotein in complex with its host cell receptor ace
molecular mechanism of evolution and human infection with sars-cov-
sars-cov- , an evolutionary perspective of interaction with human ace reveals undiscovered amino acids necessary for complex stability
computational prediction of mutational effects on the sars-cov- binding by relative free energy calculations
the sars-cov- exerts a distinctive strategy for interacting with the ace human receptor
critical differences between the binding features of the spike proteins of sars-cov- and sars-cov
effect of mutation on structure, function and dynamics of receptor binding domain of human sars-cov- with host cell receptor ace : a molecular dynamics simulations study
dynamics of the ace -sars-cov- /sars-cov spike protein interface reveal unique mechanisms
the mers-cov receptor dpp as a candidate binding target of the sars-cov- spike
enhanced receptor binding of sars-cov- through networks of hydrogen-bonding and hydrophobic interactions
zipping and unzipping of adenylate kinase: atomistic insights into the ensemble of open↔closed transitions
hbonanza: a computer algorithm for molecular-dynamics-trajectory hydrogen-bond analysis
direct and quantitative afm measurements of the concentration and temperature dependence of the hydrophobic force law at nanoscopic contacts
a study of the preferred environment of amino acid residues in globular proteins
molecular dynamics simulation of antimicrobial peptide arenicin- : b-hairpin stabilization by noncovalent interactions
why protein conformers in molecular dynamics simulations differ from their crystal structures: a thermodynamic insight
rapid mapping of protein functional epitopes by combinatorial alanine scanning
comparing experimental and computational alanine scanning techniques for probing a prototypical protein-protein interaction
equilibrium free-energy differences from nonequilibrium measurements: a master-equation approach
nonequilibrium equality for free energy differences
free energy calculation from steered molecular dynamics simulations using jarzynski's equality
molecular dynamics simulations suggest that electrostatic funnel directs binding of tamiflu to influenza n neuraminidases
molecular dynamics study of unbinding of the avidin-biotin complex
steered molecular dynamics simulations reveal the likelier dissociation pathway of imatinib from its targeting kinases c-kit and abl
unbinding of nicotine from the acetylcholine binding protein: steered molecular dynamics simulations
computational insights into the mechanism of ligand unbinding and selectivity of estrogen receptors
high potency of a bivalent human vh domain in sars-cov- animal models
vmd: visual molecular dynamics
scalable molecular dynamics with namd
optimization of the additive charmm all-atom protein force field targeting improved sampling of the backbone ϕ, ψ and side-chain χ and χ dihedral angles
steered molecular dynamics and mechanical functions of proteins
steered molecular dynamics simulation of the rieske subunit motion in the cytochrome bc complex
computational design of new peptide inhibitors for amyloid beta (aβ) aggregation in alzheimer's disease: application of a novel methodology
unfolding of titin immunoglobulin domains by steered molecular dynamics simulation
shielding and beyond: the roles of glycans in sars-cov- spike protein. biorxiv
data and materials availability: data and the analysis software are available from the corresponding author upon request.
the structure of the full-length s protein in complex with ace . the s protein is a homotrimer (green, purple, and grey) and embedded into the viral membrane. ace is a homodimer (blue and orange) and embedded into the host cell membrane. the full-length structure of the s protein in complex with ace was modeled using the full-length s protein model ( ) and the crystal structure of the s protein rbd in complex with ace (pdb id: m ). both proteins were manually inserted into the membrane by their transmembrane domains. (b) the structure of an s protomer in the down and up position of its rbd. s /s and s ' are the cleavage sites of the s protomer upon ace binding. (c) md simulations were performed for rbd of the s protein in complex with the pd of ace . catalytic residues of ace , glycans, and zn + and cl- ions are shown in brown, red, yellow and purple, respectively.
(a) hydrophobic interactions, (b) hydrogen bonds, and (c) salt bridges and electrostatic interactions between rbd (green) and pd (blue) are shown on a conformation obtained from md simulations in the left panels. the interaction surface is divided into three distinct regions (cr - ).
normalized distributions of the distances between the amino-acid pairs that form hydrophobic interactions (red), hydrogen bonds (purple), salt bridges (orange), and electrostatic interactions (green) are shown in the right panels. lines with colored numbers represent maximum cutoff distances for these interactions.
key: cord- -l w xk b authors: rathore, jitendra singh; ghosh, chaitali title: severe acute respiratory syndrome coronavirus- (sars-cov- ), a newly emerged pathogen: an overview date: - - journal: pathog dis doi: . /femspd/ftaa sha: doc_id: cord_uid: l w xk b
coronavirus disease (covid- ) is a viral pneumonia responsible for the recent pandemic; it originated in wuhan, china, in december . the causative agent of the outbreak was identified as a coronavirus and designated severe acute respiratory syndrome coronavirus (sars-cov- ). a few years back, the severe acute respiratory syndrome coronavirus (sars-cov) and the middle east respiratory syndrome coronavirus (mers-cov) were reported to be highly pathogenic and caused severe infections in humans. sars-cov- has now become the third highly pathogenic coronavirus and is responsible for the present outbreak in the human population. at the time of this review, there were more than confirmed covid- patients, associated with over deaths, in more than countries across the globe (as reported by the world health organization). in this review we discuss sars-cov, mers-cov and sars-cov- , their reservoirs, the role of spike proteins and immunogenicity. we also cover the diagnosis, therapeutics and vaccine status of sars-cov- .
on december , , several cases of severe pneumonia were reported from wuhan, china. the causative agent of the outbreak was identified as a betacoronavirus. genome sequencing revealed that it is closely related to sars-cov (severe acute respiratory syndrome coronavirus), which had emerged in , and it was designated sars-cov- (gorbalenya et al. ; zhou et al.
) . in a very short time, more than infectious cases, including more than deaths, were reported in china as of march , . at the time of this review ( may, ), the disease, termed covid- (corona virus disease ), had already become a pandemic and spread to more than countries and territories, including community transmission in countries such as the united states, germany, france, spain, japan, singapore, south korea, iran, italy and india. as of july , more than cases and deaths had been reported globally, with rapid growth of numbers in many countries. for up-to-date information about covid- , visit the world health organization (who) website (https://www.who.int/emergencies/diseases/novel-coronavirus- ). bats are likely to be the origin of sars-cov- , but the role of an intermediate host cannot be ruled out at this stage. initial studies showed that sars-cov- can use angiotensin-converting enzyme (ace ) from bats, cats, civet cats, swine, ferrets, non-human primates (nhps) and humans as a receptor (letko, marzi and munster ; wan et al. a; zhou et al. ) . a pet dog in hong kong and a tiger at the bronx zoo in the united states of america tested positive for sars-cov- infection, indicating that canine ace can also be recognized by sars-cov- . pangolins, which are endangered animals and are illegally imported into southern china (guangdong and guangxi provinces), have been considered a potential intermediate host (lam et al. ; zhang et al. b) . the initial reports showed that most covid- cases involved mild to moderate infection. however, approximately % of the cases were reported as severe (chen et al. ; wang et al. a) . in this review, we discuss sars-cov, mers-cov and sars-cov- , together with the various reservoirs associated with them. finally, we cover the role of spike proteins and their immunogenicity, along with the diagnosis, therapeutics and vaccine status of sars-cov- .
zoonotic coronaviruses are becoming a global concern following the emergence of two earlier coronaviruses, sars-cov and mers-cov (middle east respiratory syndrome coronavirus), which created havoc, and the recent emergence of a third highly pathogenic coronavirus, sars-cov- . members of the family coronaviridae are known to infect a wide range of vertebrates, including humans. before the outbreak of sars (severe acute respiratory syndrome), only two coronaviruses, hcov- e and hcov-oc , were known to infect humans. since the sars outbreak, however, the sars coronavirus (sars-cov), human coronavirus hcov-nl , human coronavirus hcov-hku and mers-cov have been isolated from humans. similar to sars-cov and mers-cov, the newly isolated sars-cov- is highly pathogenic in humans and causes severe acute respiratory distress (shi, guo and rottier ) . coronaviruses have a positive-sense, single-stranded rna genome of about kb. the terminus encodes the enzyme viral replicase/transcriptase, which is involved in virus replication, whereas the terminus encodes viral structural proteins and virus group specific accessory proteins. detailed functional studies of these viral proteins are essential for antiviral drug screening and vaccine development. the earliest available genome sequencing data of sars-cov- made it possible to compare it with the genomes of sars-cov and other coronaviruses. results showed that sars-cov- belongs to the genus betacoronavirus and subgenus sarbecovirus, which also includes sars-cov. mers-cov, however, belongs to another subgenus, merbecovirus (lu et al. ; zhou et al. ) . the comparison also showed that there is % nucleotide similarity between sars-cov- and sars-cov. the surface glycoprotein of sars-cov- known as the spike (s) protein, which is essential for host cell receptor binding, showed only % similarity with sars-cov at the nucleotide level.
the genomic organization of sars-cov- resembles those of other betacoronaviruses, including '-orf ab-s (surface glycoprotein)-orf a-e (envelope)-m (membrane glycoprotein)-n- ' as shown in fig. . comparative genome analysis of ratg , a virus from a rhinolophus affinis (i.e. horseshoe) bat sampled from yunnan province in china in , with sars-cov- , showed that sars-cov- has % similarity at the nucleotide sequence level. although sars-cov- and ratg have high similarity, they differ in some genomic features; for example, sars-cov- contains a polybasic (furin) cleavage site insertion (residues prra) between the s and s subunits of the surface glycoprotein s protein (coutard et al. ) . the polybasic insertion may increase the infectivity of the virus, as it is absent in other related betacoronaviruses. however, similar polybasic insertions are observed in different human coronaviruses, such as hcov-hku , and in highly virulent strains of avian influenza viruses. therefore, whether the polybasic insertion between the s and s subunits of the s protein arose through natural evolution in sars-cov- or by other means is likely to remain a topic of debate. however, an independent insertion of the amino acids paa at the s /s cleavage site was also observed in the rmyn virus (having % similarity in spike protein and % similarity in replicase nucleotide), isolated from a rhinolophus bat in yunnan province in mid- , indicating that these insertion events may be a natural part of ongoing coronavirus evolution. the receptor-binding domain (rbd) of the s protein is essential for interaction with the ace receptor present on the surface of the target cells of the host. therefore, comparative sequence analysis was performed, and the results showed that there is % similarity in rbd between sars-cov- and ratg , but they share only one amino acid among the six key amino acid residues.
further, structural comparisons of the spike protein suggested that the rbd of sars-cov-2 is well suited to interact with the human ace2 receptor. the same receptor is also used by sars-cov to cause infection (wrapp et al.). sars-cov-2 is closely related to sars-cov and mers-cov, all of which have bats as reservoirs, but it differs greatly from the other two in its biology. sars-cov-2 is markedly more infectious and has very different epidemiological dynamics. moreover, mers-cov has never fully adapted to human transmission (sabir et al.), whereas sars-cov-2 has spread remarkably both locally and globally. in the case of sars and mers, intermediate hosts (civets and camels, respectively) played an important role and may be considered true reservoir hosts (sabir et al.). therefore, given the ecological separation between bats (the reservoir) and humans, an 'intermediate' or 'amplifying' mammalian host is thought to be required for sars-cov-2 to acquire the mutations essential for efficient human transmission. to identify the intermediate host, wider sampling is needed of animals that live close to human populations or are sold in wet markets for human consumption. viruses closely related to sars-cov-2 were discovered in malayan pangolins (manis javanica) illegally imported into southern china (guangdong and guangxi provinces). the rbd of the guangdong pangolin viruses is particularly closely related to that of sars-cov-2: there is % amino acid sequence similarity, and these viruses contain all six key residues that are essential for binding to the ace2 receptor. however, the rest of the genome is highly divergent from sars-cov-2.
hence, the evolution of coronaviruses in animal reservoirs as well as in intermediate hosts must be understood to explain the emergence of sars-cov-2 in humans. because infection can be asymptomatic, the virus could have acquired some of its essential mutations during a period of ''cryptic'' spread in humans before it was first detected in december . recombination is another possibility that cannot be ruled out, as sarbecoviruses, like other coronaviruses, undergo widespread recombination. sarbecovirus genomes recombine at multiple locations, including within the spike protein gene. some studies have shown that recombination does occur among sars-cov-2, ratg13 and the guangdong pangolin covs (lam et al.). the genome of rmyn02 has also been affected by recombination. because the recombinant regions are small and may change as more viruses related to sars-cov-2 are sampled, it is difficult to determine the pattern and genomic ancestry of recombination. a total of cases with deaths in countries including china were reported during - for the human respiratory disease caused by sars-cov. studies showed that bats acted as a natural reservoir of the sars-cov that caused the outbreaks (chan-yeung and xu). later, antibodies against sars-cov-like viruses and similar genomic sequences were also discovered in rhinolophus bats, such as r. ferrumequinum, r. pearsoni, r. sinicus, r. pusillus and r. macrotis (lau et al.). comparative genome studies revealed that bat sars-like covs (sl-cov) have - % nucleotide sequence identity with sars-cov and with each other, and hence display great genetic diversity. further, phylogenetic analysis indicated that a virus from rhinolophus bats might be the direct progenitor of human sars-cov (hon et al.). various sars-cov groups were isolated in different epidemic periods and from different hosts, and several methods have been used to investigate the selective pressure acting on them.
the results showed that most functional proteins of sars-cov followed a stepwise adaptive evolutionary pathway. for example, the spike protein showed strong positive selection in the early and middle phases, but not in the late phase. the replicase, however, experienced positive selection only in humans, and the assembly proteins experienced it in the middle and late phases. interestingly, no such positive selection was observed in any protein of bat sars-like cov. however, specific amino acid sites that may be targets of positive selection in each group were identified (tang et al.). later, in , a study suggested the presence of two distinct genotypes of bt-slcov in r. sinicus (i.e. rp /rs and hku /rs ). the results also provided evidence for the recombinant origin of rp and rs . the phylogenetic study showed that their major parent is relatively closely related to hu-scovs. therefore, a bt-scov lineage that is the direct ancestor of hu-scovs may exist in r. sinicus, as reported earlier (hon et al.). however, these speculations are based on a limited number of strains, and an extensive analysis of the prevalence of such genotypes is required to substantiate them (yuan et al.). in , human cases, resulting in deaths, had been confirmed globally by june , , for a disease with symptoms similar to sars that emerged in saudi arabia (who). the disease was later found to be caused by a novel human coronavirus, designated mers-cov. phylogenetic data showed that it belongs to lineage c of the genus betacoronavirus and is highly similar to the bat coronaviruses hku4 (tylonycteris pachypus) and hku5 (pipistrellus pipistrellus; lau et al.). comparative genomic results showed that mers-cov has % nucleotide identity across the entire genome with hku4 and hku5. moreover, the rna-dependent rna polymerase (rdrp) gene has % nucleotide identity.
later, studies of the mode of cell entry confirmed that mers-cov uses dipeptidyl peptidase 4 (dppiv), also known as cd26, as its receptor. because dppiv is evolutionarily conserved among mammals, mers-cov can infect a broad range of mammalian cells (human, pig, monkey and bat) and may be efficient in cross-host transmission (raj et al.). as with sars-cov and mers-cov, the bat remains the probable species of origin of sars-cov-2, which shares % whole-genome identity with a bat cov, batcov ratg13, from rhinolophus affinis in yunnan province. however, sars-cov and mers-cov passed through intermediate hosts, such as civets or camels, before entering humans (cui, li and shi). this suggests that sars-cov-2 was probably also transmitted to humans by other animals. a comparison of overall genome identity showed that the pangolin-cov genome sequence is . % identical to ratg13 and . % identical to sars-cov-2, whereas the identity between sars-cov-2 and ratg13 is . %. other sars-like covs (sl-covs) also showed similarity to pangolin-cov, which was . % similar to zxc21 and . % similar to zc45. in a comparative genome analysis between pangolin-cov and sars-cov-2 (genbank: mn ), the coverage ranged from . to % (average coverage . %). moreover, pangolin-cov genes shared high average nucleotide ( . %) and amino acid ( . %) identity with sars-cov-2 (genbank: mn ). similar results were obtained when pangolin-cov genes were compared with ratg13, where . % nucleotide and . % amino acid identity were observed (zhang, wu and zhang). interestingly, some pangolin-cov genes showed higher amino acid sequence identity to sars-cov-2 genes than to ratg13 genes. for example, orf b of pangolin-cov is . %, the spike (s) protein . %, orf a . % and orf . % identical to the corresponding sars-cov-2 proteins. similarly, orf b is . %, the spike (s) protein . %, orf a . % and orf . % identical to those of ratg13.
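the identity percentages quoted above come from pairwise comparisons of aligned sequences. as a minimal sketch of the arithmetic only, and not of the pipeline used in the cited studies, the following python function computes percent identity over the aligned columns in which neither sequence has a gap. the function name, the gap convention and the toy sequences are assumptions made for illustration; published identities depend on the alignment tool and on how gaps are counted.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over columns where neither aligned sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")

    compared = 0  # columns without a gap in either sequence
    matches = 0   # compared columns with identical residues
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == gap or y == gap:
            continue
        compared += 1
        if x == y:
            matches += 1

    if compared == 0:
        raise ValueError("no gap-free columns to compare")
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Hypothetical toy alignment: 3 of 4 compared columns match.
    print(percent_identity("ACGT", "ACGA"))
```

counting gap columns as mismatches instead of skipping them would give a lower figure for the same alignment, which is one reason identities reported by different studies are not always directly comparable.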
the high amino acid identity of the s protein points to functional similarity between sars-cov-2 and pangolin-cov. a comprehensive phylogenetic analysis was performed based on the nucleotide sequences of the whole genome, the rna-dependent rna polymerase gene (rdrp), the non-structural protein genes orf1a and orf1b, and the main structural proteins encoded by the s and m genes. in all phylogenetic trees, pangolin-cov, ratg13 and sars-cov-2 clustered into a well-supported group, designated the ''sars-cov-2 group'', which represents a novel betacoronavirus group. within this group, ratg13 and sars-cov-2 were grouped together, and pangolin-cov was their closest common ancestor (zhang, wu and zhang). recently, an extensive study including localized genomic analysis and the pattern of evolutionary recombination was carried out. the results showed that strong purifying selection among coronaviruses from distinct host species, together with cross-species infections, is responsible for the origin of sars-cov-2. we may therefore summarize the origin and intermediate hosts of sars-cov, mers-cov and sars-cov-2 as shown in fig. . in the s1 domain of s protein, followed by fusion with cell membrane. sars-cov causes severe acute respiratory syndrome. sars-cov uses the angiotensin-converting enzyme 2 (ace2) receptor present on the surface of host cells, as shown in fig. . sars-like covs (sl-covs) and sars-covs have identical genetic organizations with high sequence identities. a schematic representation of the spike protein (s) of sars-cov-2 is shown in fig. . an important exception, however, is the n-terminus of the spike protein (s), which is essential for receptor binding in covs. one study investigated receptor usage by the full-length s of sl-cov, the s of sars-cov and a series of s chimeras, using a human immunodeficiency virus-based pseudovirus system and cell lines expressing ace2 from human, civet or horseshoe bat.
several important observations were made in the study. first, the sl-cov s was unable to use any of the three receptors. second, the sars-cov s was unable to enter cells expressing bat ace2. third, the chimeric s proteins entered cells via human ace2 with efficiencies that differed between constructs. fourth, a minimal insert region (amino acids - ) was sufficient to convert the sl-cov s from non-ace2 binding to human ace2 binding, indicating that the sl-cov s is largely compatible with the sars-cov s protein in both structure and function (ren et al.). a detailed structural study of the human sars-cov rbd complexed with the human ace2 receptor was also performed. it revealed that truncations in the receptor-binding motif (rbm) region of the sl-cov spike protein abolish its ability to bind human ace2 (li). we may therefore hypothesize that the sl-cov found in horseshoe bats is not the direct ancestor of human sars-cov. moreover, the spike proteins of human sars-cov and of the closely related civet sars-cov were unable to use ace2 from a horseshoe bat (r. pearsoni) as a receptor for cell entry (ren et al.). these findings highlight a critical missing link (an intermediate host) in the bat-to-civet/human transmission chain of sars-cov (hou et al.). an earlier study showed that ace2 from a horseshoe bat could not function as a receptor for sars-cov. however, changing three amino acids (positions , and ) from she to fyq was sufficient to convert the non-functional bat ace2 into a fully active receptor for sars-cov. further, an ace2 molecule from a fruit bat, which naturally has the fyq motif, supports sars-cov entry into cells and thus infection. this result indicates that sars-cov-related viruses may have a wide host range among different bat populations (yuan et al.). in the case of sars-cov-2, structural bioinformatics approaches accurately predicted that the sars-cov-2 spike binds human ace2 (wan et al. b).
when cell lines over-expressing the transmembrane protein angiotensin-converting enzyme 2 (ace2) from humans, bats, pigs or civet cats were infected with sars-cov-2, they became hypersensitized to infection, indicating that ace2 is a sars-cov-2 receptor. binding studies also revealed that the receptor-binding domain of the sars-cov-2 s protein has a high affinity for human ace2 (wrapp et al.), which may make the virus more virulent. apart from the ace2 interaction, however, the n-terminal domain (ntd) of the sars-cov-2 s protein may bind alternative host-cell receptors. the sars-cov-2 s protein has also acquired a furin protease cleavage site through the acquisition of several basic residues (rrar/s). this furin substrate site facilitates the priming cleavage step, which sensitizes the s protein to the subsequent activating cleavages that occur on susceptible target cells, and ultimately facilitates virus entry and infection (qing and gallagher). human sars-cov and sars-like coronavirus (sl-cov) in bats have a similar genomic organization, and their corresponding gene products are highly conserved. the s protein, however, has only - % sequence identity in the n-terminal region, which is the region of the coronavirus s protein responsible for receptor interaction. when the immunogenicity of the sl-cov s protein was analyzed and compared with that of sars-cov, the two shared only a limited number of immunogenic epitopes in their s proteins. moreover, their major neutralization epitopes were also different. in another study, a pseudovirus expressing the full-length sl-cov s protein was used to raise mouse sera and a monoclonal antibody. a series of constructs expressing truncated s protein was prepared and analyzed by elisa and western blot. the results showed that amino acids - and amino acids - are two immunogenic determinants in mice.
further, amino acids - were shown to be more immunogenic, as this region was recognized by polyclonal as well as monoclonal antibodies. earlier studies also showed that amino acids - of sars-cov are immunodominant determinants (he et al.). owing to the high sequence similarity with the sl-cov s protein in the same region, amino acids - of the s protein also elicited an immune response in mice (zhou et al.). in a cross-reactivity test with antibodies against the rbd of sars-cov, some sl-cov strains (wiv1) gave positive results, whereas others (shc014) failed to react. this difference in reactivity is due to the low sequence identity between the rbd of shc014 and that of sars-cov, and the high sequence identity between the rbd of wiv1 and that of sars-cov (zeng et al.). the novel coronavirus is detected by various molecular biology techniques, including real-time reverse transcription pcr (rrt-pcr), reverse transcription pcr (rt-pcr), reverse transcription loop-mediated isothermal amplification (rt-lamp), multiplex nucleic acid amplification, real-time rt-lamp and microarray-based assays. the who also recommended a pan-coronavirus assay for characterization and confirmation. viral culture and rt-pcr are among the most efficient and reliable methods for the diagnosis of sars-cov-2 infection. however, these methods are time-consuming: they generally take hours to detect the nucleic acid and many days to isolate the virus from samples. they also require specialized equipment and expertise. to overcome these limitations, sars-cov-2 infection can be diagnosed rapidly with rapid antigen detection (rad) tests. in rad tests, a sars-cov-2 antibody immobilized on the device detects viral antigen in the sample. the results of rad tests are available promptly and can be interpreted without specialized instruments. hence, rad tests could help to reduce the workload in diagnostic laboratories and hospitals (mak et al.).
however, according to the who, rad tests for sars-cov-2 antigen detection need further evaluation and are not recommended for clinical diagnosis (laboratory testing strategy recommendations for covid-19: interim guidance). the immune response to sars-cov in the early weeks of infection can be detected using enzyme-linked immunosorbent assay (elisa), automated chemiluminescence immunoassay (clia), lateral flow immunoassay (lfia), plaque reduction neutralization tests (prnt), or a combination of these methods (espejo et al.). the antigens most commonly used in these assays are the spike glycoprotein s, including the receptor-binding domain (jin et al.), the nucleocapsid protein, or both (pang et al.). applying an inhibitor to halt the interaction of the virus with the host may be one prophylactic approach. in this direction, an engineered pan-cov fusion inhibitor, designated the ek1 peptide, has been designed. it has shown promising results in mice, inhibiting infection by five human coronaviruses, including sars-cov and mers-cov, and by three bat-sl-covs (xia et al.). intranasal application of the ek1 peptide before or after viral infection has also been reported to protect human dpp4-transgenic mice against mers-cov infection, indicating its potential prophylactic and therapeutic effect. another approach is the design of neutralizing antibodies that block the interaction with the host cell. the s proteins of sars-cov and mers-cov are immunogenic. the rbds of sars-cov and mers-cov are known to contain non-sequential epitopes that induce more potent neutralizing antibodies and confer protection against sars-cov and mers-cov (du et al.; zhou et al.). structure-based modification of amino acids in the mers-cov s-rbd has improved its efficacy against mers-cov infection (zhou et al.).
therefore, we suggest that the sars-cov-2 s-rbd, or a modified s-rbd of another related coronavirus, could be used as a target to develop a vaccine against sars-cov-2. recently, neutralizing monoclonal antibodies and nanobodies against the rbd of the s protein showed protection against sars-cov and mers-cov (du et al.; zhou et al.). although the ntd and the s2 subunit of the s protein from sars-cov and/or mers-cov were also studied as targets for neutralizing antibodies, their efficacy was found to be very low (du et al.). therefore, the rbd of the sars-cov-2 s protein would be a key target for developing neutralizing antibodies, as shown in fig. . cross-protection by antibodies developed against sars-cov has been observed against bat-sl-cov-wiv1 and bat-sl-cov-shc014 (zeng et al.). the development of cross-neutralizing antibodies may therefore be another way to achieve urgent prevention and treatment of sars-cov-2 infection. plasma therapy, in which polyclonal antibodies from recovered sars-cov-2-infected patients are used to treat sars-cov-2 infection, is also being considered. researchers are working to develop monoclonal antibodies (mabs); once such antibodies are produced, the next step will involve in vitro testing for neutralizing and/or cross-neutralizing activity as well as in vivo evaluation of protective efficacy in available covid-19 animal models. preclinical and clinical trials testing their safety and efficacy are also necessary before they are approved for clinical application. recently, memory b cells specific to the sars-cov-2 s protein or rbd have been purified. among these, antibodies were positive in antigen-binding assays, with the top binders, specific for the rbd, having ec values below nm. further, among neutralizing antibodies, eight showed an ic value within nm, and the best, - , had an ic of . nm. epitope mapping identified three main epitopes in the rbd that are recognized by the monoclonal antibodies.
interestingly, the - monoclonal antibody from the same study also showed cross-neutralizing activity in the sars-cov pseudovirus assay (wan et al. a). in another study, sars-cov-2-neutralizing monoclonal antibodies were isolated from five infected patients. among them, were positive in an in vitro neutralization assay, and nine showed % virus inhibition at concentrations of - ng/ml. epitope mapping showed that both the receptor-binding domain (rbd) and the n-terminal domain (ntd) are immunogenic. further, structural studies of these monoclonal antibodies showed that one targets the rbd, a second targets the ntd and a third bridges the rbd and ntd. several of these monoclonal antibodies are therefore promising candidates for clinical development as potential therapeutic and/or prophylactic agents against sars-cov-2 (l et al.). owing to the high sequence identity between the s proteins of sars-cov-2 and the closely related sars-cov, sars-cov neutralizing antibodies (nabs) have been tested for cross-reactivity and/or cross-neutralizing activity against sars-cov-2 infection. interestingly, a sars-cov rbd-specific human neutralizing mab, cr3022, binds the sars-cov-2 rbd with high affinity and may recognize an epitope on the rbd that does not overlap with the ace2-binding site (tian et al.). further, sars-cov-2 entry and infection may be blocked by cross-reactive sera from convalescent sars patients or by sars-cov-specific animal sera (hoffmann et al.). moreover, polyclonal antibodies against the rbd of sars-cov have been observed to cross-react with the rbd protein of sars-cov-2 and to cross-neutralize sars-cov-2 infection in hek293t cells expressing the human ace2 receptor. such findings may open new avenues for the development of sars-cov rbd-based vaccines that might eventually prevent both sars-cov-2 and sars-cov infection (tai et al.).
sars-cov rbd-targeting neutralizing antibodies could possibly be applied for treatment or prophylaxis of sars-cov-2 infection in the current absence of a specific vaccine against sars-cov-2. remdesivir has recently been recognized as a promising antiviral drug against rna viruses, including sars-cov and mers-cov, in cultured cells, mice and nonhuman primate (nhp) models (sheahan et al.). it is currently in clinical trials for the treatment of ebola virus infection (mulangu et al.). recent studies have shown that the ec value of remdesivir against 2019-ncov in vero e6 cells was . μm. these data suggest that its working concentration is likely to be achieved in nhps. remdesivir has thus shown efficient in vitro antiviral activity against sars-cov-2. however, the evidence of clinical improvement in severe covid-19 patients is controversial. in a recent report from france, five covid-19 patients admitted to the icu were treated with remdesivir. treatment was associated with a significant reduction of the sars-cov-2 viral load in the upper respiratory tract in most cases, but two patients died with active sars-cov-2 replication in the lower respiratory tract. remdesivir treatment was interrupted because of side effects in four patients, given the complexity of such critically ill patients (dubert et al.). the first covid-19 case in the united states was treated with intravenous (iv) remdesivir (holshue et al.). within h of remdesivir treatment, the patient showed signs of recovery. because the viral load was already decreasing before remdesivir treatment, it cannot be determined whether the further viral load reduction and clinical improvement were a direct result of the treatment. in another study, compassionate use of remdesivir (n = ) was associated with improved oxygenation in %, discharge in % and death in % of patients. the significance of this study is limited because it lacked a paired control group (grein et al.).
a recent study at the national institutes of health (nih) reported preliminary results of the adaptive covid-19 treatment trial (actt, n = ). in this randomized controlled trial (rct), remdesivir treatment resulted in a % faster time to recovery than in the placebo group (p < . ). the mortality rate was also lower in the remdesivir group, but the difference was not statistically significant ( % vs. . %, p = . ). so far, remdesivir has not shown any significant benefit in reducing mortality. currently, remdesivir is recommended by the nih for hospitalized patients with severe covid-19, as defined by oxygenation needs (clinical management of covid-19). another drug, chloroquine (cq), has recently been reported as a potential broad-spectrum antiviral drug (savarino et al.). it inhibits virus infection by increasing the endosomal ph required for virus/cell fusion, as well as by interfering with the glycosylation of cellular receptors of sars-cov (vincent et al.). recent studies demonstrated that chloroquine is effective at both the entry and post-entry stages of sars-cov-2 infection in vero e6 cells. apart from its antiviral activity, chloroquine also has immune-modulating activity, which may synergistically enhance its antiviral effect in vivo. after oral administration, chloroquine is widely distributed throughout the body, including the lungs. the ec value of chloroquine against sars-cov-2 in vero e6 cells was . μm, which could be clinically achievable. the in vitro effects of hydroxychloroquine (hcq) and chloroquine (cq) were also tested systematically by yao et al., who divided the experiment into two parts: a treatment study and a prophylaxis study. in the treatment study, the ec values for chloroquine were . μm and . μm at and h, respectively. for hydroxychloroquine, the ec values were . μm and . μm at and h, respectively.
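the effective-concentration (ec) values quoted in this section summarize dose-response curves. as a worked sketch, assuming a standard hill model parameterized by the half-maximal effective concentration (ec50), the following python function returns the fraction of maximal inhibition expected at a given drug concentration. the function name, the default hill coefficient of 1 and the example concentrations are hypothetical and are not values from the cited studies.

```python
def fraction_inhibited(conc_um: float, ec50_um: float, hill: float = 1.0) -> float:
    """Hill model: fraction of maximal effect at a given concentration (in uM)."""
    if conc_um < 0:
        raise ValueError("concentration must be non-negative")
    if ec50_um <= 0 or hill <= 0:
        raise ValueError("ec50 and hill coefficient must be positive")
    if conc_um == 0:
        return 0.0
    c = conc_um ** hill
    return c / (ec50_um ** hill + c)


if __name__ == "__main__":
    # Hypothetical example: by definition, the effect is half-maximal at conc == ec50.
    print(fraction_inhibited(2.0, 2.0))
    # A lower ec50 gives a larger effect at the same concentration.
    print(fraction_inhibited(2.0, 0.5))
```

under this model, a drug with a lower ec50 reaches a given level of inhibition at a lower concentration, which is the sense in which in vitro ec values are used to compare compounds; whether that concentration is achievable in patients is a separate pharmacokinetic question.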
in the prophylaxis study, on the other hand, the ec values for chloroquine were more than μm and . μm at and h, respectively. similarly, for hydroxychloroquine, the ec values were . μm and . μm at and h, respectively (yao et al.). hence, they found hydroxychloroquine to be more effective in vitro than chloroquine for both prophylaxis and treatment. in a study in the united states, covid-19 patients hospitalized within h of diagnosis were treated with hydroxychloroquine alone (hcq), with hydroxychloroquine and azithromycin (hcq + azm), or with no hcq. among patients, there was no significant reduction in mortality or in the need for ventilation with hydroxychloroquine alone or with hydroxychloroquine and azithromycin (magagnoli et al.). a new york hospital reported qtc prolongation associated with hcq + azm (n = ; chorin et al.). the qtc increased from a baseline of ± ms to a maximal value of ± ms (p < . ) on day . ± . of treatment. to date, researchers have presented conflicting data on treatment with cq and hcq. large randomized controlled trials (rcts) with improved study designs are therefore required to weigh the efficacy and clinical benefits of hcq/cq treatment against its risks. currently, the nih recommends against cq/hcq and hcq + azm treatment for covid-19, except in clinical trials. because of its potential toxicity, the nih also recommends against high-dose cq ( mg twice daily for days) in all settings (coronavirus disease (covid-19) treatment guidelines). nitazoxanide has demonstrated potent in vitro activity against sars-cov-2, with an ec at h of . μm in vero e6 cells. this potent activity is consistent with the ec values for nitazoxanide and its active metabolite, tizoxanide, against mers-cov in llc-mk2 cells, where ec values of . μm and . μm, respectively, have been demonstrated (rossignol). dexamethasone, a synthetic glucocorticoid, has anti-inflammatory and immunosuppressive properties.
a hyperinflammatory response is involved in the clinical course of patients with pneumonia due to sars-cov-2. in sars-cov-2 patients given short-term corticosteroids, the elevated level of c-reactive protein (crp) decreased significantly, from . to . mg/l, by the time of discharge. % of the patients were discharged home, with a mean length of stay of . days. none of the patients required escalation of care leading to mechanical ventilation (selvaraj et al.). recently, a randomized controlled clinical trial in the united kingdom showed that dexamethasone saved the lives of people seriously ill with covid-19, reducing the number of deaths by one-third (ledford). dexamethasone may be useful in the short term in severe sars-cov-2 patients, although it also inhibits the protective function of t cells and blocks b cells from making antibodies (theoharides and conti). the development of a successful vaccine for humans can take several years. as no coronavirus vaccine is currently available on the market, developing one for the first time can be difficult and time-consuming. however, an mrna-based vaccine has been co-developed by moderna and the vaccine research center at the national institutes of health. in this vaccine, mrna encoding the target antigen is encapsulated in lipid nanoparticles and injected into the vaccinee, and the antigen is expressed in vivo. the phase i clinical trial has recently started (clinicaltrials.gov: nct ). curevac is using the same platform but is still in the pre-clinical phase. apart from this, there are various other approaches, including dna vaccines, recombinant protein-based vaccines, recombinant vector vaccines, inactivated vaccines and attenuated vaccines. several companies, universities and institutes are targeting the s protein of sars-cov-2 to develop recombinant protein-based vaccines, including novavax, expres2ion, ibio, sichuan clover biopharmaceutical, baylor college of medicine and the university of queensland.
similarly, cansino biologics, geovax, vaxart and the university of oxford are using viral-vector-based vaccine platforms focused on the s protein. applied dna sciences and inovio are using dna vaccine platforms, again focused on the s protein (amanat fatima). apart from the above-mentioned platforms, whole-microorganism vaccine platforms, such as inactivated and attenuated virus vaccines, are also under consideration. codagenix, with the serum institute of india, is using a live attenuated vaccine platform. johnson and johnson has adopted a recombinant vector-based platform (adenovirus vector), and sanofi is also using a recombinant vector platform (recombinant influenza vector) to develop a vaccine against sars-cov-2. at this stage, it is difficult to predict the best platform for a vaccine against sars-cov-2, as all of the above-mentioned platforms have advantages as well as disadvantages (amanat fatima). coronaviruses have shown the capability to jump species boundaries and adapt to new hosts, so we may face more outbreaks of this kind in the future. the role of the intermediate host is also of major importance, as it provides a direct pathway for virus transmission to humans. the enormous diversity of viruses in animals and their ongoing evolution make it important to limit our exposure to animal pathogens as much as possible. based on metagenomic data, pangolin-cov is predicted to be most closely related to sars-cov-2. the pangolin-cov genome showed . % nucleotide identity with the sars-cov-2 genome. because knowledge of this novel virus is very limited, it is difficult to explain the significant number of amino acid substitutions that occurred between sars-cov-2 and sars or sl-covs. for example, in sars-s-cov, six mutations occurred in regions other than the rbd, but interestingly no amino acid substitutions were present in the receptor-binding motifs that directly interact with the human receptor ace2 protein.
such differences, which could affect the transmission properties of sars-cov-2 compared with sars-cov, are therefore important subjects for future investigation. sars-cov-2 continues to infect people globally, so it is imperative to develop safe, accurate, fast and simple new technologies for detecting it. apart from diagnosis, effective prophylactic and therapeutic agents are also required to control and prevent infection. various therapeutic agents, including dexamethasone, have shown promising results in in vitro studies. however, there is an urgent need to develop vaccines against coronaviruses. in this direction, studies on neutralizing antibodies from sars-cov and mers-cov against the s protein and its fragments, including the s1-ntd, rbd and s2, may provide important guidelines for the development of a vaccine against sars-cov-2. apart from neutralizing antibodies against the s protein, other approaches, including dna vaccines, recombinant vector vaccines, inactivated vaccines and attenuated vaccines, are also in the pipeline. so far, traditional public health measures, including detection and isolation of active cases, tracing and quarantine of all contacts, social distancing and community quarantine, have been found to be successful. only after this pandemic ends will we be in a position to assess the social, health and economic impact of such a massive outbreak. we must learn lessons for the future from such outbreaks, as new viruses will keep emerging. none declared.
sars-cov- vaccines: status report epidemiological and clinical characteristics of cases of novel coronavirus pneumonia in wuhan, china: a descriptive study the qt interval in patients with covid- treated with hydroxychloroquine and azithromycin clinical management of covid- covid- ) treatment guidelines the spike glycoprotein of the new coronavirus -ncov contains a furin-like cleavage site absent in cov of the same clade origin and evolution of pathogenic coronaviruses case reports study of the first five patients covid- treated with remdesivir in france the spike protein of sars-cov -a target for vaccine and therapeutic development review of current advances in serologic testing for covid- the species severe acute respiratory syndrome-related coronavirus: classifying -ncov and naming it sars-cov- compassionate use of remdesivir for patients with severe covid- identification of immunodominant sites on the spike protein of severe acute respiratory syndrome (sars) coronavirus: implication for developing sars diagnostics and vaccines sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor first case of novel coronavirus in the united states evidence of the recombinant origin of a bat severe acute respiratory syndrome (sars)-like coronavirus and its implications on the direct ancestor of sars coronavirus angiotensin-converting enzyme (ace ) proteins of different bat species confer variable susceptibility to sars-cov entry diagnostic value and dynamic variance of serum antibody in coronavirus disease laboratory testing strategy recommendations for covid- : interim guidance identification of -ncov related coronaviruses in malayan pangolins in southern china genetic characterization of betacoronavirus lineage c viruses in bats reveals marked sequence divergence in the spike protein of pipistrellus bat coronavirus hku in japanese pipistrelle: implications for the origin of the novel middle east respiratory syndrome 
coronavirus severe acute respiratory syndrome coronavirus-like virus in chinese horseshoe bats coronavirus breakthrough: dexamethasone is first drug shown to save lives functional assessment of cell entry and receptor usage for sars-cov- and other lineage b betacoronaviruses structural analysis of major species barriers between humans and palm civets for severe acute respiratory syndrome coronavirus infections potent neutralizing monoclonal antibodies directed to multiple epitopes on the sars-cov- spike bats are natural reservoirs of sars-like coronaviruses emergence of sars-cov- through recombination and strong purifying selection genomic characterisation and epidemiology of novel coronavirus: implications for virus origins and receptor binding outcomes of hydroxychloroquine usage in united states veterans hospitalized with covid- evaluation of rapid antigen test for detection of sars-cov- virus a randomized, controlled trial of ebola virus disease therapeutics potential rapid diagnostics, vaccine and therapeutics for novel coronavirus ( -ncov): a systematic review dipeptidyl peptidase is a functional receptor for the emerging human coronavirus-emc difference in receptor usage between severe acute respiratory syndrome (sars) coronavirus and sars-like cronavirus of bat origin nitazoxanide, a new drug candidate for the treatment of middle east respiratory syndrome coronavirus co-circulation of three camel coronavirus species and recombination of mers-covs in saudi arabia new insights into the antiviral effects of chloroquine short-term corticosteroids in sars-cov patients: hospitalists' perspective broad-spectrum antiviral gs- inhibits both epidemic and zoonotic coronaviruses coronavirus: epidemiology, genome replication and the interactions with their hosts characterization of the receptorbinding domain (rbd) of novel coronavirus: implication for development of rbd protein as a viral attachment inhibitor and vaccine differential stepwise evolution of sars 
coronavirus functional proteins in different host species dexamethasone for covid- ? not so fast potent binding of novel coronavirus spike protein by a sars coronavirus-specific human monoclonal antibody chloroquine is a potent inhibitor of sars coronavirus infection and spread remdesivir and chloroquine effectively inhibit the recently emerged novel coronavirus ( -ncov) in vitro human igg neutralizing monoclonal antibodies block sars-cov- infection receptor recognition by the novel coronavirus from wuhan: an analysis based on decade-long structural studies of sars coronavirus who| novel coronavirus summary and literature update -as of cryo-em structure of the -ncov spike in the prefusion conformation a pan-coronavirus fusion inhibitor targeting the hr domain of human coronavirus spike in vitro antiviral activity and projection of optimized dosing design of hydroxychloroquine for the treatment of severe acute respiratory syndrome coronavirus (sars-cov- ) intraspecies diversity of sars-like coronaviruses in rhinolophus sinicus and its implications for the origin of sars coronaviruses in humans cross-neutralization of sars coronavirus-specific antibodies against bat sars-like coronaviruses recent advances in the detection of respiratory virus infection in humans probable pangolin origin of sars-cov- associated with the covid- outbreak identification of immunogenic determinants of the spike protein of sars-like coronavirus immunogenicity difference between the sars coronavirus and the bat sars-like coronavirus spike (s) proteins a pneumonia outbreak associated with a new coronavirus of probable bat origin advances in mers-cov vaccines and therapeutics based on the receptor-binding domain key: cord- -h ukuu authors: olotu, fisayo a.; omolabi, kehinde f.; soliman, mahmoud e.s. 
title: leaving no stone unturned: allosteric targeting of sars-cov- spike protein at putative druggable sites disrupts human angiotensin-converting enzyme interactions at the receptor binding domain. date: - - journal: inform med unlocked doi: . /j.imu. . sha: doc_id: cord_uid: h ukuu
the systematic entry of sars-cov-2 into host cells, as mediated by its spike (s) protein, is essential for pathogenicity in humans. hence, targeting the viral entry mechanisms remains a major strategy for covid-19 treatment. although recent efforts have focused on the direct inhibition of s-protein receptor-binding domain (rbd) interactions with human angiotensin-converting enzyme 2 (hace2), allosteric targeting remains an unexplored possibility. therefore, in this study, for the first time, we employed an integrative meta-analytical approach to investigate the allosteric inhibitory mechanisms of the sars-cov-2 s-protein and its association with hace2. findings revealed two druggable sites (sites and ) located at the n-terminal domain (ntd) and s2 regions of the protein. two high-affinity binders, zinc (fosaprepitant, site ) and zinc (lomitapide, site ), were discovered via site-directed high-throughput screening against a library of ∼ fda approved drugs. interestingly, we observed that allosteric binding of both compounds perturbed the prefusion s-protein conformations, which in turn resulted in unprecedented hace2 displacement from the rbd. estimated Δg(binds) for both compounds were highly favorable due to high-affinity interactions at the target sites. in addition, site residues r , h , k , k , i , r , i , f , l , v and w were identified for their crucial involvement in the binding and stability of zinc . likewise, the energy contributions of q , n , q , l , y , q , l , v , n and a corroborated their importance to zinc binding at the predicted site .
we believe these findings will pave the way for the structure-based discovery of allosteric sars-cov-2 s-protein inhibitors for covid-19 treatment.
the novel coronavirus disease, also referred to as covid-19, is caused by sars-cov-2 (severe acute respiratory syndrome coronavirus 2), with cases first reported in wuhan, china, in december . the disease has persisted until mid- , spreading across countries with over , , reported cases and more than , deaths globally. sars-cov-2 belongs to a large group of coronaviruses known to cause respiratory infections and related complications. these rna viruses are spherical, pleomorphic, positive-sense, single-stranded and polyadenylated. of all known viruses, coronaviruses (covs) have the largest rna genomes, with diverse pathogenic effects in animals and humans. this virus class is divided into four genera, namely alpha-cov, beta-cov, gamma-cov and delta-cov, with the beta-cov genus prominent for its disease-causing effects in humans (hcovs). seven hcovs have been characterized to date, of which four (hcov-hku1, hcov-oc43, hcov-nl63 and hcov-229e) cause very mild respiratory symptoms. on the other hand, mers-cov, sars-cov and sars-cov-2 cause severe respiratory and gastrointestinal infections which, in many cases, can be fatal. although sars-cov-related infections were zoonotically transmitted into human populations, human-to-human transmission has further contributed to viral super-spread via respiratory aerosols. the entry of sars-cov-2 into target human cells, together with its replication, is achieved through a cohort of viral components, chiefly the non-structural and structural proteins that make up the virus.
generally, about non-structural proteins (nsps) mediate diverse pro-pathogenic functions such as replication, processing and proof-reading of genomic frames, and host immune evasion, among many others, as previously reported. moreover, covs comprise four major structural proteins that are integral to their pathogenesis. these are the nucleocapsid (n), envelope (e), membrane (m) and spike (s) proteins. the n protein makes up the nucleocapsid and participates in other viral genome-related processes, while the m protein is the most abundant of the four, playing major roles in maintaining viral structural integrity as well as coordinating the other structural proteins. the e protein, on the other hand, is crucial to the maturation of the virus, while the trimeric s protein mediates viral entry into the host cell via the endosomal or non-endosomal route. two domains make up the s protein, namely the n-terminal s1 domain and the c-terminal, membrane-anchored s2 domain. the s2 region is extensively conserved in covs, while the constituent residues of the s1 region are highly divergent across cov strains. these domains have been further characterized into subdomains with specific functions in host receptor recognition and binding (s1) and in membrane fusion and entry (s2) (figure ). similar to the sars-cov architecture, some recent reports have sub-categorized the sars-cov-2 s1 ectodomain into the n-terminal domain (ntd), a conserved receptor-binding domain (rbd) that recognizes the human angiotensin-converting enzyme 2 (hace2), and subdomains 1 and 2 (sd1 and sd2). during infection, proteolytic cleavage, or priming, of the s protein is crucial for viral fusion and entry into host cells. this process is mediated by host cell proteases such as the transmembrane serine protease 2 (tmprss2) and cathepsin l at the s1/s2 (boundary between the s1 and s2 subunits) and s2' (immediately upstream of the s2 fusion peptide, fp) cleavage sites.
the s protein primarily exists in a metastable prefusion complex prior to cleavage, after which notable conformational rearrangements occur in order to fuse the viral membrane with that of the host cell. in addition, the rbd adopts disparate conformational motions to engage the host cell receptor, described as the up and down conformations. the up conformation corresponds to the hace2-accessible state, while the down state cannot engage the host cell receptor. the s2 domain, on the other hand, consists of the functionally important fusion peptide (fp), which is critical for viral fusion and formation of the post-fusion complex; heptad repeats 1 and 2 (hr1 and hr2); the transmembrane domain (tm); and the cytoplasmic tail (ct). the hrs of the s-protein trimer interact to form a six-helical bundle fusion core, which helps bring the membranes of the virus and host cell into close proximity for fusion and entry. the roles of the sars-cov-2 s-protein therefore present it as an important therapeutic target whose inhibition would prevent viral entry and fusion in host cells. numerous studies reported over the past months have examined the possibility of blocking direct interactions between the sars-cov-2 s-protein and hace2. most of these studies aimed to target the s-protein rbd with antibodies, peptides or small-molecule compounds that bind with a much higher affinity and thereby block s-protein-hace2 interactions. targeting host proteases such as tmprss2 was also explored in a recent study, with consequential impediments to sars-cov-2 entry. identification of other functional (allosteric) sites on the prefusion s protein could present another dynamic and effective approach to preventing sars-cov-2 infectivity relative to its interaction with the host cell ace2 and proteases.
this alternative targeting approach for the sars-cov-2 s protein is important because its rbd (similar to that of other covs) has been associated with a high mutational propensity, which may in turn alter the affinity of small-molecule inhibitors or peptides designed to bind therein. allosteric targeting was explored in a recent study wherein the cov-conserved s2 hr1 region was identified as an important target site for the development of broad-spectrum inhibitors of human covs. the resulting peptide inhibitor (ek1) was evaluated in vivo and exhibited desirable safety and efficacy. moreover, the protein contact network (pcn) paradigm was used to map functional allosteric loci on the sars-cov s protein. accordingly, this study was implemented to (i) identify potential druggable sites across the s1 and s2 domains of the sars-cov-2 s protein other than the rbd-hace2 interface, (ii) perform high-throughput (virtual) screening of ~ fda approved drugs against the most druggable site(s), and (iii) investigate the binding dynamics and interaction mechanisms of the compounds and their consequential effects on the s-protein rbd-ace2 complex. we believe this systematic study will provide structural and molecular insights into possible allosteric sites on the sars-cov-2 s protein suitable for selective targeting and structure-
computational methodologies
the three-dimensional structure of the sars-cov-2 s-protein (prefusion) was retrieved from the pdb with entry vsb. this, as previously reported, represents the s-protein rbd conformation in its up (open) state, which is most suitable for hace2 binding. also, to model binding interactions between the prefusion sars-cov-2 s-protein (s1/s2) and hace2, a crystallized structure with pdb entry m j was separately retrieved. this complex depicts binding between the (truncated) rbd of the sars-cov-2 s-protein and the protease domain (pd) of hace2.
co-crystallized molecules not relevant to this study were removed, while missing residues (gaps) in the structures were filled using the modeller algorithm. this preparation was performed in the ucsf chimera graphical user interface (gui). subsequently, using the structural superposition method, we modeled a complex between the prefusion s-protein (s1/s2) monomer (rbd in the up conformation) and the hace2 protein (figure ). possible druggable sites other than the sars-cov-2 rbd interface were predicted using approaches previously reported. herein, we employed multiple tools for site identification and validation, which include sitemap, fpocket, discovery studio client and prankweb. sitemap is an exhaustive tool that ranks protein pockets based on properties such as druggability, surface exposure, hydrophobicity and hydrophilicity, among others. these details were used to characterize the predicted pockets, after which the other predictive algorithms were used complementarily for cross-validation. two highly ranked sites were then selected for further analyses. following the rationale of the study, we mapped out the two most druggable sites on the target protein and virtually screened against them a large chemical library of fda approved drugs (~ compounds) derived from the zinc repository (http://zinc.docking.org/substances/subsets/fda/). this screening was performed using autodock vina integrated with high-performance computing, prior to which the coordinates of the predicted sites were mapped using grid boxes. the corresponding binding scores were retrieved from the resulting .pdbqt files and used to filter down to the topmost compounds for each of the predicted sites and . subsequently, the two compounds with the highest binding scores (most negative) were selected for the two predicted sites, yielding complexes that were subjected to further simulation studies. as explained in .
, the prefusion s-proteins (ligand-bound and unbound) were superimposed with the rbd-hace2 complex ( m j), after which the single truncated rbd was removed. by so doing, we obtained models of the allosterically-bound and unbound prefusion s-protein-ace2 complexes. this, as aimed in this study, would provide structural and dynamical insights into the mechanistic effects of allosteric targeting on the sars-cov-2 host entry machinery. although computationally expensive ( residues), we proceeded with long-timescale md simulation runs for the systems using the graphics processing unit (gpu) version of amber and its embedded modules. protein parameters were defined using the ff sb forcefield, while ligand parameters were generated with the antechamber and parmchk modules. likewise, the leap program was used to define coordinate and topology files for the ligand-bound and unbound protein complexes. this program was also used to neutralize the systems (by addition of na+ and cl- counter-ions) and solvate them in a tip p water box of size Å. structural minimization was first carried out partially for steps with a restraint potential of kcal mol⁻¹ Å⁻², followed by another steps of full minimization with no restraints. a canonical (nvt) ensemble with a kcal mol⁻¹ Å⁻² harmonic restraint was used to heat the systems gradually from - k for ps, after which the systems were equilibrated for ps at a constant temperature of k without restraints in an npt ensemble. atmospheric pressure was maintained at bar with a berendsen barostat, while each protein system was subjected to a production run of ns. the studied systems include zinc -s-protein-hace2 (allosteric site ), zinc -s-protein-hace2 (allosteric site ), and the unbound s-protein-hace2. corresponding trajectories were saved at every ps time-frame until the end of the simulation, followed by data plot analyses using the microcal origin software.
snapshots were also taken and analyzed to monitor structural events and ligand interaction dynamics across the trajectories in the ucsf chimera graphical user interface (gui) and discovery studio client. the molecular mechanics/generalized born surface area (mm/gbsa) method was used to evaluate binding affinities of the predicted allosteric s-protein binders at their target sites. binding energy profiles for both compounds, inclusive of their energy components, were estimated using snapshots from the terminal ns of the md trajectories, where conformational stabilities were visible. this approach was important in order to minimize the effects of conformational disorder or entropy on ligand interactions. the equations below mathematically express the binding energy calculations:
$\Delta G_{bind} = G_{complex} - (G_{receptor} + G_{ligand})$
$\Delta G_{bind} = \Delta G_{gas} + \Delta G_{sol} - T\Delta S$
$\Delta G_{gas} = \Delta E_{int} + \Delta E_{ele} + \Delta E_{vdw}$
$\Delta G_{sol} = \Delta G_{ele,sol} + \Delta G_{np,sol}$
$\Delta G_{np,sol} = \gamma \cdot SASA + \beta$
as shown, the internal (∆e int ), electrostatic (∆e ele ) and van der waals (∆e vdw ) energies sum up to the gas-phase energy (∆g gas ), while the solvation free energy (∆g sol ) is defined by the polar solvation (∆g ele,sol ) and non-polar solvation (∆g np,sol ) terms. the generalized born (gb) model was used to estimate ∆g ele,sol , while the linear relationship between the surface tension proportionality constant (γ = . kcal mol⁻¹ Å⁻²), the solvent accessible surface area (sasa, Å²) and the β constant was used to solve for ∆g np,sol . furthermore, the estimated ∆g bind was decomposed into individual residue energies, most especially for those residues that constitute the predicted allosteric pockets where the ligands were bound. this method was essential to identify specific residues that contribute crucially to the stability and inhibitory activities of potential allosteric inhibitors. based on the study rationale, we set out to identify possible sites for drugging the target protein (table ). the architectures of these pockets are shown in figure .
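the energy bookkeeping in the mm/gbsa description above can be illustrated with a short python sketch. it is not the authors' implementation (the study used amber's embedded modules); the class name `DeltaTerms`, the function names and all numeric values are hypothetical and chosen only for illustration, and the entropy term is left out as a simplifying assumption.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DeltaTerms:
    """Complex-minus-(receptor + ligand) energy differences, in kcal/mol.

    All field names are hypothetical labels for the terms named in the text.
    """
    e_int: float      # internal energy difference
    e_ele: float      # electrostatic energy difference
    e_vdw: float      # van der Waals energy difference
    g_ele_sol: float  # polar (generalized Born) solvation difference
    g_np_sol: float   # non-polar solvation difference


def nonpolar_solvation(sasa: float, gamma: float, beta: float = 0.0) -> float:
    """Linear SASA model: gamma * SASA + beta."""
    return gamma * sasa + beta


def gas_phase(t: DeltaTerms) -> float:
    """Gas-phase energy: internal + electrostatic + van der Waals."""
    return t.e_int + t.e_ele + t.e_vdw


def solvation(t: DeltaTerms) -> float:
    """Solvation free energy: polar + non-polar contributions."""
    return t.g_ele_sol + t.g_np_sol


def binding_energy(t: DeltaTerms) -> float:
    """Gas-phase plus solvation energy (entropy term omitted by assumption)."""
    return gas_phase(t) + solvation(t)


if __name__ == "__main__":
    # Made-up numbers, not results from the study.
    terms = DeltaTerms(e_int=0.0, e_ele=-20.0, e_vdw=-40.0,
                       g_ele_sol=35.0, g_np_sol=-5.0)
    print(f"dG_gas  = {gas_phase(terms):.2f} kcal/mol")
    print(f"dG_sol  = {solvation(terms):.2f} kcal/mol")
    print(f"dG_bind = {binding_energy(terms):.2f} kcal/mol")
```

in practice each term would be averaged over the snapshots taken from the stable part of the trajectory before being summed.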
furthermore, the druggability of a site on a target protein depends on its size (volume) and hydrophobicity (with minimal hydrophilicity), whereas high hydrophilicity, reduced hydrophobicity, small pocket size and shallowness characterize "difficult-to-drug" and undruggable pockets. while high hydrophilicity could have repulsive effects on ligand mobility at the binding site, a small or shallow cavity would impede ligand access, fit, optimal binding and stability. from table , sites → rank above the . halgren dscore threshold, making them suitable for therapeutic targeting. relatively, site appears to be highly surface-exposed with a score of . , while the large pocket size and volume of site could favor the use of large-molecule compounds. taken together, high surface exposure coupled with relatively large volumes, hydrophobicity and favorable donor/acceptor properties for sites and could account for their suitability as targetable allosteric regions on the s-protein other than the rbd (figure ). these presumptions are also reflected by the estimated dscore and sitescore values. in addition, since these predicted sites are highly functional, particularly the overlapping fp, hr and cr, targeting them could
high-throughput screening and identification of potential allosteric binders to the predicted sites and
high-throughput screening using a library of ~ fda approved drug compounds (http://zinc.docking.org/substances/subsets/fda/) was performed against the two predicted allosteric sites. results for the top compounds with the highest binding scores are presented in supplementary table s and supplementary table s for sites and , respectively. from the screening results, the overall highest scores were estimated for zinc (- kcal/mol) at site and zinc (- . kcal/mol) at site .
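the ranking step described above (retrieving binding scores from docking output and keeping the most negative ones) can be sketched in python as follows. this is a minimal illustration, not the authors' pipeline: it assumes that each docked compound's .pdbqt text contains `REMARK VINA RESULT:` lines whose first number is the score in kcal/mol, and the function names and compound labels are hypothetical.

```python
import re
from typing import List, Mapping, Optional, Tuple

# Assumed line format in docking output:
#   "REMARK VINA RESULT:      -9.2      0.000      0.000"
_VINA_RESULT = re.compile(r"^REMARK VINA RESULT:\s+(-?\d+(?:\.\d+)?)")


def best_score(pdbqt_text: str) -> Optional[float]:
    """Return the most negative score found in one output file, or None."""
    scores = []
    for line in pdbqt_text.splitlines():
        match = _VINA_RESULT.match(line)
        if match:
            scores.append(float(match.group(1)))
    return min(scores) if scores else None


def rank_compounds(results: Mapping[str, str],
                   top_n: int = 10) -> List[Tuple[str, float]]:
    """Rank compounds by best score, most negative first.

    `results` maps a compound label to the text of its docking output.
    Compounds without any parsable score are skipped.
    """
    scored = []
    for name, text in results.items():
        score = best_score(text)
        if score is not None:
            scored.append((name, score))
    return sorted(scored, key=lambda item: item[1])[:top_n]


if __name__ == "__main__":
    # Hypothetical compound labels and scores, for illustration only.
    demo = {
        "compound_a": "REMARK VINA RESULT:      -9.2      0.000      0.000\n"
                      "REMARK VINA RESULT:      -8.7      1.200      2.300\n",
        "compound_b": "REMARK VINA RESULT:     -10.1      0.000      0.000\n",
    }
    for name, score in rank_compounds(demo):
        print(f"{name}\t{score:.1f} kcal/mol")
```

running the ranking once per predicted site would give one shortlist per site, from which the top-scoring compound can be carried forward.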
as highlighted in our methods, md simulations were performed for the prefusion s-protein-hace2 complexes bound distinctly at the two potential allosteric sites. this approach was essential to investigate the likely effects of allosteric targeting on the entry/fusion mechanisms of sars-cov-2 via host hace2. however, this conformation appeared distorted in the allosterically-bound s-proteins, which could account for the displacement motions of the interacting hace2 from the rbd interface. therefore, the allosteric-mediated disruption of the sars-cov-2 s-protein rbd and its interaction with hace2, as reported herein, is a major finding that could indicate the viability of allosteric targeting in sars-cov-2 therapy. furthermore, we measured structural stabilities across the ligand-protein complexes relative to the unbound system using the rmsd metric. as shown in figure , structural instability was highest in the unbound s-protein, while its associated hace2 was relatively stable by comparison. this could indicate the structural effects of allosteric targeting on the s-protein and its interaction with hace2. estimated mean rmsds, as presented in table , corroborate the conformational variations among the unbound and bound protein complexes. to minimize the effects of structural disorder (entropy) in our calculations, we selected, from the md trajectories, the terminal time-frames ( - ns) over which the systems appeared to stabilize. these were defined as the finally equilibrated (fe) time-frames and were used for subsequent structural analyses (table ). from the resulting fe-rmsd plots, the unbound s-protein was highly unstable while its associated hace2 exhibited low structural motion, in line with the rmsd calculations, which could also imply that the binding of the s-protein stabilized hace2.
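the rmsd analysis described above, including the restriction to a terminal time window, can be sketched as follows. this is a simplified illustration and not the authors' workflow: it assumes every frame has already been superimposed on the reference structure, uses plain python lists of (x, y, z) tuples, and the function names and window bounds are hypothetical.

```python
import math
from statistics import mean
from typing import List, Sequence, Tuple

Coord = Tuple[float, float, float]


def rmsd(frame: Sequence[Coord], reference: Sequence[Coord]) -> float:
    """Root-mean-square deviation between two pre-aligned coordinate sets."""
    if len(frame) != len(reference) or not frame:
        raise ValueError("frame and reference must be non-empty and equal in length")
    squared = sum((a - b) ** 2
                  for p, q in zip(frame, reference)
                  for a, b in zip(p, q))
    return math.sqrt(squared / len(frame))


def rmsd_series(trajectory: Sequence[Sequence[Coord]],
                reference: Sequence[Coord]) -> List[float]:
    """RMSD of every frame in a trajectory against one reference."""
    return [rmsd(frame, reference) for frame in trajectory]


def window_mean(values: Sequence[float], times_ns: Sequence[float],
                start_ns: float, end_ns: float) -> float:
    """Mean of the values whose time stamp lies in [start_ns, end_ns]."""
    selected = [v for v, t in zip(values, times_ns) if start_ns <= t <= end_ns]
    if not selected:
        raise ValueError("no frames fall inside the requested window")
    return mean(selected)


if __name__ == "__main__":
    # Toy two-atom system; coordinates and times are made up.
    reference = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
    trajectory = [
        [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0)],  # shifted by 1 along z
        [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],  # identical to the reference
    ]
    series = rmsd_series(trajectory, reference)
    print(series)
    print(window_mean(series, times_ns=[10.0, 20.0], start_ns=15.0, end_ns=25.0))
```

comparing the windowed means of the bound and unbound systems is one simple way to express the stability differences discussed in the text.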
in contrast, the allosterically-bound s-proteins (sites and ) were notably stable, while their corresponding hace2 showed high structural instability that could correlate with their systemic motions at the s-protein rbd, as mentioned earlier. structural analyses of ligand orientations at the respective allosteric sites of the sars-cov-2 s-protein were performed using averaged structures from the md trajectories (figure ). findings reveal that the allosteric binding of zinc (fosaprepitant) was stabilized at the ntd. fosaprepitant contains a terminal triphosphate group that orients towards residues such as n , k , n , r and h . likewise, its trifluoromethyl group oriented towards d , while the constituent -o and -nh groups mediate interactions with q and n , among others. together, these could facilitate the high-affinity interactions accountable for its stability and allosteric inhibitory effects against sars-cov-2 and the associated hace2. binding affinities of the compounds were determined using the mm/gbsa technique, which also allowed us to measure the energy contributions of interacting residues at the predicted allosteric sites. energy calculations, as presented in table , were performed using relatively stable time-frames ( - ns) to minimize entropic effects that may interfere with ligand binding activities. in addition, we observed that electrostatic effects contributed most notably to the allosteric binding of zinc at the ntd region, while van der waals contributions had the highest effect on the binding of zinc at the predicted site pocket. electrostatic contributions at site could be due to the high number of electropositive residues that constitute the pocket, as shown in figure , which may form high-affinity interactions with electronegative moieties of the compound.
calculations further revealed that ∆e vdw and ∆e ele were more favorable in the gas phase for zinc , while polar solvation energies were more favorable for zinc at the s2 region of the s-protein. this could imply that while the former was buried in the deep hydrophobic pocket of the ntd, the latter was surface-exposed due to its trans-domain binding activity, as reported earlier. to understand the mechanistic binding of the compounds at both predicted sites, we decomposed the binding free energies into individual contributions of the interacting residues. these were juxtaposed with structural analysis that showed the type and (π-alkyl) interactions. moreover, a π-π stacked interaction between y and a benzene ring (of the -tri-fluoromethyl- , '-biphenyl group) could be highly crucial for the stability of the compound. taken together, electrostatic energies favored the binding of zinc at site while vdw energies favored zinc binding at site , which, consequently, were able to perturb the s-protein rbd and allosterically disrupt hace2 interactions.
the systemic entry of sars-cov-2 into the human host cell is a crucial process that underlies its virulence and pathogenicity in humans and the other animals it infects. this mechanism is mediated by its interaction with the host ace2 (hace2) via attachment and fusion. potential intervention approaches in sars-cov-2 treatment include therapeutic strategies that could prevent sars-cov-2 s-protein binding to hace2. in this study, we implemented an exhaustive approach to identify drug molecules that could potentially bind to the sars-cov-2 s-protein at sites other than the rbd. pertinent to the allosteric targeting approach implemented herein was the identification of highly druggable sites inherent in the s-protein (s1/s2), which was carried out using multiple pocket prediction algorithms for the identification and validation of possible allosteric sites.
predicted pockets were then characterized based on their attributes, after which two highly probable pockets were selected. these were then screened distinctly against a library of ~ fda approved drugs retrieved from the zinc database. microscale thermophoresis (mst) can be employed for further validation. these implementations will provide additional insights into the targetability and suitability of these pockets for novel covid-19 therapeutics. findings from this study pave the way for novelty in the structure-based design of high-affinity allosteric inhibitors or disruptors of sars-cov-2 association with host hace2, thereby preventing viral entry.
the authors thank the college of health sciences, university of kwazulu-natal, south africa, for providing infrastructural support, and also acknowledge the center for high performance computing (chpc), cape town, south africa, for providing computational resources. the authors declare no conflict of interest. this research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.
) a novel coronavirus from patients with pneumonia in china ) coronavirus cases coronavirus replication and pathogenesis: implications for the recent outbreak of severe acute respiratory syndrome (sars), and the challenge for vaccine development february) a molecular arms race between host innate antiviral response and emerging human coronaviruses june) epidemiology, genetic recombination, and pathogenesis of coronaviruses genomic characterization of the novel human-pathogenic coronavirus isolated from a patient with atypical pneumonia after visiting wuhan origin and evolution of pathogenic coronaviruses studies with human coronaviruses ii.
fao conceptualized, implemented, analyzed, interpreted and wrote the manuscript; kfo performed the molecular dynamics simulation; mes revised and approved the manuscript for submission.
the authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
key: cord- -jdvb px authors: hanke, leo; vidakovics perez, laura; sheward, daniel j.; das, hrishikesh; schulte, tim; moliner-morro, ainhoa; corcoran, martin; achour, adnane; karlsson hedestam, gunilla b.; hällberg, b. martin; murrell, ben; mcinerney, gerald m. title: an alpaca nanobody neutralizes sars-cov- by blocking receptor interaction date: - - journal: nat commun doi: . /s - - - sha: doc_id: cord_uid: jdvb px sars-cov- enters host cells through an interaction between the spike glycoprotein and the angiotensin converting enzyme (ace ) receptor. directly preventing this interaction presents an attractive possibility for suppressing sars-cov- replication. here, we report the isolation and characterization of an alpaca-derived single domain antibody fragment, ty , that specifically targets the receptor binding domain (rbd) of the sars-cov- spike, directly preventing ace engagement. ty binds the rbd with high affinity, occluding ace . a cryo-electron microscopy structure of the bound complex at . Å resolution reveals that ty binds to an epitope on the rbd accessible in both the ‘up’ and ‘down’ conformations, sterically hindering rbd-ace binding. while fusion to an fc domain renders ty extremely potent, ty neutralizes sars-cov- spike pseudovirus as a . kda nanobody, which can be expressed in high quantities in bacteria, presenting opportunities for manufacturing at scale. ty is therefore an excellent candidate as an intervention against covid- . sars-cov- was first identified as the etiologic agent of the novel pneumonia covid- , in late .
in the comparatively short time since then, it has achieved pandemic status, causing more than . million cases, leading to more than , deaths. accordingly, the who declared the pandemic to be a public health emergency of international concern. a safe and effective vaccine is urgently needed, but requires time to develop. in the meantime, and indeed also in the post-vaccine era, highly specific and potent antiviral interventions are needed. many generic or repurposed candidates are in trials, but so far results have been unremarkable. since the virus is newly emerged, specifically designed drugs have not yet reached late phase trials. when available, specific antiviral drugs or antibody therapies will be used to protect individuals at risk and their widespread use will allow immunologically naïve populations to exit lockdowns more safely. the virus is closely related to sars-cov- , both being members of the lineage betacoronaviruses. cell entry of both viruses is achieved by first binding to the cell surface expressed receptor angiotensin-converting enzyme (ace ), followed by conformational changes in the viral spike glycoprotein trimer and subsequent membrane fusion. the affinity of sars-cov- receptor-binding domain (rbd) for ace is considerably higher than that for sars-cov- , , supporting efficient cell entry and likely contributing to pathogenesis. the rbd is a globular domain situated on the distal surface of the spike protein. two conformations have been observed in the stabilized trimer. specifically, one conformation where one rbd is ace accessible while the other two are not, and one conformation where all three rbds are down, i.e. receptor inaccessible , . as the receptor-engaging part of the spike, the rbd is an attractive target for coronavirus neutralization, and a number of conventional neutralizing monoclonal antibodies that target the rbd and block receptor binding have already been isolated from convalescent patients [ ] [ ] [ ] . 
camelid-derived single domain antibody fragments, also called vhhs or nanobodies, offer several advantages over conventional antibodies as candidates for specific therapies. despite being approximately one-tenth of the size of a conventional antibody, they retain specificity and affinity similar to conventional antibodies, while being far easier to clone, express, and manipulate. they are readily expressed in bacteria in large quantities and show high thermal stability and solubility, making them easily scalable and cost effective. their modularity means that they can be oligomerized to increase avidity or to increase serum half-life . critical to their use as antivirals in humans, they can easily be humanized with existing protocols . importantly, they have proven to be highly potent inhibitors of viral infections in vivo; particularly respiratory infections , . here, we describe the isolation, evaluation, and molecular determination of an alpaca-derived nanobody, ty , directed to the rbd of the sars-cov- spike glycoprotein. we demonstrate that the monomeric . kda ty molecule potently neutralizes sars-cov- spike pseudovirus. the nanobody binds with high affinity to the rbd in a manner that occludes ace interaction. we have also determined the mechanism of neutralization to be due to direct interference with rbd binding to ace . altogether, these results highlight the great potential of ty as a sars-cov- antiviral agent. isolation of a sars-cov- neutralizing nanobody. we immunized one alpaca with sars-cov- s -fc and rbd on a day immunization schedule. we generated a phage display library and performed two consecutive rounds of phage display, followed by an elisa-based binding screen (fig. a) . we isolated one nanobody, ty , that binds specifically to the rbd of the sars-cov- spike glycoprotein. 
in parallel we performed next generation sequencing (ngs) on the baseline and post-enrichment libraries, and quantified variant frequency before and after each enrichment step. ty exhibited the greatest fold-change in frequency among all nanobody variants, increasing over , -fold from baseline to after the second enrichment round (fig. b). we report the amino acid sequence of ty in fig. c. to determine whether ty neutralized sars-cov- we employed an in vitro neutralization assay using lentiviral particles pseudotyped with the sars-cov- spike protein. ty neutralized sars-cov- pseudotyped viruses at an ic of . µg/ml ( nm) (fig. a). no neutralization of a lentivirus pseudotyped with vsv-g by ty was evident, and a control nanobody produced and purified in the same way, but specific for influenza a virus nucleoprotein , showed no evidence of neutralization of sars-cov- pseudotyped viruses. when ty was expressed in mammalian cells as an fc-fusion protein, the neutralization potency could be further increased to ~ ng/ml (fig. a). to confirm that ty is directed specifically against the sars-cov- spike protein, we characterized the specificity of ty by flow cytometry. we site-specifically conjugated a fluorophore to the c-terminus of ty by means of a sortase a reaction and copper-free click chemistry (ty -as p) and stained untransfected cells and cells transiently transfected with sars-cov- spike under permeabilizing conditions (fig. b). while untransfected and unstained cells displayed similar signals, cells expressing the viral spike protein showed a strong shift in fluorescence intensity when stained with ty -as p. the apparent double peak likely reflected the varying efficiency of this transient transfection. to determine if the same probe can be exploited to recognize the viral spike protein in immunofluorescence, we infected vero e cells with infectious sars-cov- at moi for h, and stained the fixed and permeabilized cells with ty -as p and anti-dsrna antibody (fig.
c). while uninfected cells showed no signal, infected cells were strongly labeled with both dsrna antibody and ty -as p. thus, ty recognized the viral spike glycoprotein with high specificity in its native conformation in sars-cov- -infected cells. importantly, the low background in both experiments also suggested that ty is a highly specific and suitable tool for research, diagnostics, and therapy. to understand the mechanism of neutralization, we evaluated the effect of ty on rbd binding to ace . we site-specifically conjugated a fluorophore to the c-terminus of the rbd (rbd-as p) and used this probe to stain ace expressing hek t cells (fig. d). preincubation of rbd-as p with unlabeled ty resulted in a strong reduction of ace staining, while preincubation with the control nanobody np-vhh , specific for influenza a virus nucleoprotein np, had no such effect. this result indicated that ty directly prevents binding of sars-cov- rbd to its host cell receptor ace .
figure caption (title not recovered). a infectivity relative to cells infected with pseudotyped virus in the absence of nanobody is shown. neutralization by ty was repeated in duplicate across six assays, neutralization by ty -fc was repeated in duplicate across two assays, and the error bars represent the standard deviation. b cells were transfected with a plasmid harboring the sars-cov- spike for h. cells were fixed, permeabilized, and stained with ty -as p (black and red) or left unstained (gray). cells were analyzed by flow cytometry. cell counts are presented as % of max (representative histogram). c vero e cells were infected with sars-cov- at a moi of for h. cells were fixed, permeabilized, and stained for dna (blue), dsrna (green), and with ty -as p (red). pictures were taken by fluorescence microscopy and representative examples are shown. scale bar, µm. d ace expressing hek t cells were trypsinized, fixed, and stained with rbd-as p alone (blue), or preincubated with np-vhh (green) or ty (red). cells were analyzed by flow cytometry.
ty binds to rbd with high affinity. specific and high-affinity binding of ty to the rbd was also demonstrated in kinetic bio layer interferometry (bli) experiments. dipping of surface-immobilized nanobodies into monomeric rbd solutions at nm yielded binding responses with fast association kinetics and amplitudes reaching . nm only for ty but not for np-vhh (red and blue curves, respectively, in fig. a). titration experiments performed under normal ( mm) and high salt ( mm) conditions revealed concentration-dependent kinetic response curves for binding of rbd to ty (fig. b and supplementary fig. a , respectively). the derived semi-log concentration-response curves revealed sigmoidal line-shapes with fitted apparent k d values of ± . and ± . nm (mean value ± standard deviation) for binding at normal and high salt conditions, respectively. local fits to individual sensorgrams applying the standard : binding model appeared reasonable for the association phases at lower to intermediate rbd concentrations, as well as for all dissociation curves when fits were allowed to stay above zero (gray lines fig. b and s a st panel). however, the model deviated from the observed data at higher rbd concentrations. instead almost perfect fits were obtained when the same data were analyzed in terms of a bayesian two-dimensional distribution of k d and k off -rate constants to address heterogeneous ligand site populations on the sensor surface [ ] [ ] [ ] . for the two titrations at normal and high salt conditions, distinct peaks at k d and k off -rate values of - nm and - × − s − were obtained (fig. b and supplementary fig. a th panel). in both conditions, a second elevated plateau with k d and k off -rate values of about nm and - × − contributed significantly to the observed sensorgrams.
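For readers less familiar with sensorgram analysis, the standard one-to-one (1:1) binding model referred to above can be sketched in a few lines. This is a minimal illustration of the textbook model only, not the manufacturer's fitting software and not the Bayesian distribution analysis the authors used. The function names and the rate constants in the usage example are hypothetical and are not values from this study.

```python
import math


def equilibrium_kd(k_on, k_off):
    """Equilibrium dissociation constant K_D (M) from k_on (1/(M*s)) and k_off (1/s)."""
    return k_off / k_on


def association(t, conc, k_on, k_off, r_max):
    """Response at time t (s) while the sensor is dipped into analyte at `conc` (M).

    Textbook 1:1 model: R(t) = R_eq * (1 - exp(-k_obs * t)),
    with k_obs = k_on * conc + k_off and R_eq = R_max * conc / (conc + K_D).
    """
    k_obs = k_on * conc + k_off
    r_eq = r_max * conc / (conc + equilibrium_kd(k_on, k_off))
    return r_eq * (1.0 - math.exp(-k_obs * t))


def dissociation(t, r0, k_off):
    """Response at time t (s) after moving the sensor to buffer, starting from r0."""
    return r0 * math.exp(-k_off * t)


if __name__ == "__main__":
    # Hypothetical rate constants, chosen only to illustrate the calculation.
    k_on, k_off = 1e5, 1e-3
    kd = equilibrium_kd(k_on, k_off)
    for seconds in (0, 60, 300, 600):
        print(seconds, association(seconds, kd, k_on, k_off, r_max=1.0))
```

In this model a single pair of rate constants describes every curve in a titration, which is why systematic misfits at high analyte concentration, as reported above, are usually taken as a sign of surface heterogeneity or rebinding rather than of a different affinity.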
since most high-affinity protein-protein interactions in the nmrange have dissociation rates in the × − s − range , we attribute the first defined peak as the relevant ty :rbd interaction. the second broad plateau is likely caused by rbd competition and rebinding effects on the sensor surface, as well as heterogeneous ligand populations , , . the orthogonal biophysical method isothermal titration calorimetry (itc) confirmed the high affinity binding of ty to rbd with a k d of nm (with estimated bounds of and nm) characterized by an exothermic enthalpy of about − ± . kcal/mole (fig. c , left panel). exothermic binding was already evident from the three initial relatively constant negative spikes that were caused by the injection of ty to rbd (fig. c , right panel). the amplitude of the following three to four spikes returned to baseline demonstrating saturation of the available rbd sites by ty binding. notably, return to baseline was accompanied by the appearance of preceding positive spikes (fig. c , left panel and supplementary fig. b) . these spikes were also detected when ty was injected into the buffer (hbs) and thus treated as ty dilution effects during data analysis. injection of np-vhh into rbd did not cause any binding or dilution heat changes above background noise. it should be noted that the itc measurements were performed at the lowest possible protein concentrations to derive k d -values in the low nm range, while still being able to detect interaction heat above background noise signals that were at about − . μcal/s (maximum spike amplitude) and ± . μcal/s, respectively. altogether, we concluded from these results that rbd bound to ty with high affinity of about - nm. ty binds to the rbd in 'up' and 'down' conformation. to understand the structural basis underlying the potent neutralization of sars-cov- we performed a cryo-em structure determination of the prefusion-stabilized spike ectodomain in complex with ty . 
the cryo-em reconstruction reaches an overall resolution of . Å ( . fsc; supplementary table ) with strong variation of estimated local resolution from high resolution in the core of the spike trimer to relatively low resolution in the top of the spike ( fig. a and supplementary fig. ). nevertheless, this reconstruction clearly shows that the spike retains only one main conformation with one rbd 'up' and two rbds 'down'. importantly, all three rbds are decorated in their upper parts with a ty nanobody. the nanobodies retain a similar binding orientation to the rbd whether the rbd is found in the 'up' or 'down' conformation ( fig. a, b) and each has a solvent-excluded surface area of~ Å , which is in line with the strong affinity observed in the biophysical-interaction studies. primary interactions with the rbd are through the cdrs. specifically, cdr interacts primarily with rbd t and v -e , and cdr interacts primarily with rbd y , f , and q . interestingly, cdr does not form any major interactions with the rbd, instead it stabilizes the conformation of cdr in the rbd bound mode and thereby acts indirectly to potentiate the ty -rbd binding. since ace can only be bound by an rbd in the 'up' conformation, the cryo-em reconstruction clearly shows that ace binding is sterically hindered from two sides (fig. c) . specifically, ace binding is blocked both by the ty nanobody bound to the rbd in the 'up' conformation and the neighboring rbd in the 'down' conformation. hence, ace binding is sufficiently hindered with any two of the available three binding rbd sites in the spike trimer. the current coronavirus pandemic has drastic consequences for the world's population, and vaccines, antibodies, or antivirals are urgently needed. neutralizing antibodies can block virus entry at an early step of infection and potentially protect individuals that are at high risk of developing severe disease. 
we report the identification and characterization of a sars-cov- rbdspecific single domain antibody fragment (nanobody) termed ty that potently neutralizes the virus. we identified ty by binding assay after two consecutive rounds of phage display, simultaneously monitoring sequence enrichment by ngs. although ty exhibited the greatest fold-enrichment in the ngs analysis, multiple additional nanobodies exhibited enrichment of varying extent across both rounds. as the correlation between phage display enrichment and neutralization is likely imperfect, further analyses of our libraries may yield other potent sars-cov- neutralizing nanobodies. in addition to neutralization activity, we also show that ty can be used as a detection reagent in flow cytometry and immunofluorescence demonstrating its suitability as a research tool and for diagnostics. glycans on spike glycosylation sites n , n , and n shield the rbd from antibodies, especially when the rbd is in down conformation , . indeed, in the rbd-down conformation, the glycan on n points towards the ty -binding epitope, likely not leaving sufficient space to accommodate a conventional antibody. in agreement with that, fab fragments from convalescent patients bound the rbd only in the up conformation and to an epitope that only minimally overlaps with the ty epitope . it should be noted that the nanobody ty can be readily produced in bacteria at very high yield (in excess of mg/l culture), making it an excellent candidate for a low-cost, scalable antiviral agent against sars-cov- , and we provide the amino acid sequence, encouraging direct exploitation as such. interestingly, while ty contains the hallmark (hydrophobic) amino acids of variable-heavy chains in framework , only one arginine (instead of tryptophan) in framework demonstrates that this antibody fragment derives from a heavy-chain only antibody . 
nevertheless, ty expresses extremely well, but exchanging the hydrophobic residues in framework may further improve this nanobody. while nanobodies capable of binding sars-cov- spike have recently been isolated, these were generated after sars-cov- spike immunization , or pcr maturation . also, in both cases a fusion to human fc domain is required for neutralization of sars-cov- , precluding expression in bacterial culture. naive libraries of human single-domain antibodies (sdabs) have also been screened to identify sars-cov- spike-specific nanobodies , , but they lack detailed structural information. other synthetic rbd-specific nanobodies have been published, but they lack information on their neutralization potential . ty represents the first single-domain antibody isolated from an animal specifically immunized with a sars-cov- protein. future work will aim to improve the potency and potential efficacy of ty through various strategies. for example, mutational scanning may yield potency improvements to ty . also, since ty already neutralizes as a monomeric protein, the generation of homodimeric or trimeric fusion constructs is expected to further increase its neutralization activity. indeed, fusion of ty to a human igg -fc dramatically improved the ic of this molecule, to ~ ng/ml. additional strategies will explore linker-based constructs that chain multiple copies of ty together, which may provide similar improvements in potency while retaining the possibility of being expressed in bacteria. ty may additionally be a useful component of a bi-specific or tri-specific antibody, which could combine epitope specificities to increase the mutational barrier to viral escape. based on our work, we hope that ty will be investigated as a candidate for antiviral therapy. cells and virus.
vero e cells (atcc-crl- ) and hek t cells (atcc-crl- ) were maintained in dulbecco's modified eagle medium (gibco) supplemented with % fetal calf serum and % penicillin-streptomycin and cultured at °c in a humidified incubator with % co . a hek t cell line engineered to overexpress human ace (hek t-ace ) was generated by the lentiviral transduction of hek t cells. briefly, lentiviruses were produced by cotransfecting hek t cells with a plasmid encoding vsv-g (addgene cat# ), a lentiviral gag-pol packaging plasmid (addgene cat# ), and a human ace transfer plasmid. virions were harvested from the supernatant, filtered through . µm filters, and used to transduce hek t cells. all cell lines used for experiments were negative for mycoplasma as determined by pcr. infectious sars-cov- was propagated in vero e cells and titrated by plaque assay. proteins and probes. the plasmid for expression of the sars-cov- prefusionstabilized spike ectodomain with a c-terminal t fibritin trimerization motif was obtained from ref. . the plasmid was used to transiently transfect freestyle f cells using freestyle max reagent (thermo fisher scientific). the s ectodomain was purified from filtered supernatant on streptactin xt resin (iba lifesciences), followed by size-exclusion chromatography on a superdex in mm tris ph , mm nacl. the rbd domain (rvq-vnf) of sars-cov- was cloned upstream of an enterokinase cleavage site and a human igg fc. this plasmid was used to transiently transfect freestyle f cells using the freestyle max reagent. the rbd-fc fusion was purified from filtered supernatant on protein g sepharose (ge healthcare). the protein was cleaved using bovine enterokinase (genscript) leaving a flag-tag at the c-terminus of the rbd. enzyme and fc-portion were removed on his-pur ni-nta resin (thermo fisher scientific) and protein g sepharose (ge healthcare), respectively, and the rbd was purified by sizeexclusion chromatography on a superdex in mm tris ph , mm nacl. 
in addition, the rbd domain (rvq-vnf) was cloned upstream of a sortase a recognition site (lpetg) and a xhis tag and expressed in freestyle f cells as described above. rbd-his was purified from filtered supernatant on his-pur ni-nta resin, followed by size-exclusion chromatography on a superdex . the nanobodies were cloned for expression in the phen plasmid with a cterminal sortase recognition site (lpetg) and a xhis tag. this plasmid was used to transform bl cells for periplasmic expression. expression was induced with mm iptg at od = . ; cells were grown overnight at °c. nanobodies were retrieved from the periplasm by osmotic shock and purified by ni-nta affinity purification and size-exclusion chromatography. biotinylated and fluorescent probes were generated using sortase a as described in refs. , . in brief, nanobodies were site-specifically biotinylated on the cterminus using sortase a m. nanobody at a concentration of μm was incubated with sortase a m ( μm), gggk-biotin ( μm) in mm tris, ph . , mm nacl, mm cacl , for h at °c. unreacted nanobody and sortase was removed with ni-nta resin and excess gggk-biotin was removed using zeba spin desalting columns ( . ml, k mwco, thermo fisher scientific). to generate the fluorescently labeled probes, first a dibenzocyclooctyne-amine (dbco-amine, sigma aldrich) was attached via sortase a to the nanobody or the rbd (reaction conditions: μm rbd or nanobody, μm sortase a m, mm dbco-amine in mm tris ph . , mm nacl, mm cacl , h, °c). unreacted probe, sortase and excess dbco-amine were removed using ni-nta resin and pd- columns (ge healthcare), respectively. abberior star p-azide (abberior gmbh) was attached to the dbco-labeled proteins in a copper-free click chemistry reaction. unreacted fluorophore was removed on pd- column (rbd) or size-exclusion chromatography (nanobody). for mammalian expression, the sequence encoding the nanobody ty was cloned upstream of a human igg . 
this plasmid was used to transiently transfect freestyle f cells using the freestyle max reagent. the ty -fc fusion was purified from filtered supernatant on protein g sepharose followed by sizeexclusion chromatography. alpaca immunization. alpaca immunization and phage display was performed similarly as described in refs. , . in brief, the adult male alpaca tyson at pre-clinics, germany, was immunized four times in a -day immunization schedule. sars-cov- s -sheep-fc (native antigen company, sku: rec ) was used for the first two immunizations, and sars-cov- rbd produced in freestyle f cells was used for the last two immunizations. the animal study protocol was approved by the preclinics animal welfare officer commissioner and registered under the registration no. . - - - a at the lower saxony state office for consumer protection and food safety-laves and is compliant with the directive / /eu on animal welfare. library generation and nanobody isolation. four days after the final boost, rna was isolated from pbmcs (rna plus mini kit, qiagen). for cdna synthesis, superscript iii rt (thermo fisher scientific) was used with a combination of oligo (dt), random hexamers, or gene-specific primers (al.ch , atggagaggac gtccttgggt, and al.ch . ttcggggggaagayraagac) . all primer sequences are listed in supplementary table . nanobody sequences were pcr amplified and cloned into a phagemid vector for expression as piii fusion. tg cells (lucigen) were transformed with this library by electroporation. cells were inoculated with vcsm helper phage, and the resulting phage was enriched in two consecutive rounds of phage display on rbd immobilized on magnetic beads. after the second round of phage display, individual bacterial colonies were picked in a -well format, grown until od = . and nanobody expression was induced by addition of mm iptg. after h incubation at °c, bacterial supernatant was used as primary detection reagent in an elisa coated with rbd or s ectodomain. 
bound nanobodies were detected with anti-e tag (bethyl laboratories, : , ) secondary antibody. positive clones were sequenced and cloned into the phen expression vector for further characterization.
amino acid sequence of ty .
qvqlvetggglvqpggslrlscaasgftfssvymnwvrqapgkgpewvsrispnsgnigytdsvkgrftisrdnakntlylqmnnlkpedtalyycaiglnlssssvrgqgtqvtvss
ngs and analysis of nanobody libraries. plasmids from nanobody libraries before enrichment, and after each enrichment step, were amplified for cycles using q high-fidelity x master mix (neb) according to manufacturer's instructions, using primers nb-ngs-fw: cactctttccctacacgacgctcttccgatctctcgcggcccagccggccatgg and nb-ngs-rv: ggagttcagacgtgtgctcttccgatctaccggcgcaccactagtgca, annealing at °c. illumina indexing primers were added using an additional cycles, with kapa hifi. amplicons were size selected using agencourt ampure xp beads (bead ratio: : ), and were pooled at ratios of : : for pre:post- :post- libraries, to account for the reduction in diversity expected during enrichment, and sequenced on an illumina miseq using the miseq reagent kit v ( × ) ms- - . paired-end reads were merged using usearch , and then processed in the julia language, primarily using the nextgensequtils.jl package (analysis code is available here: https://github.com/murrellgroup/ty ). briefly, reads are trimmed of primer sequences, and deduplicated, maintaining read frequencies. variant frequencies are calculated as combined frequency of any reads matching a variant within % nucleotide divergence, using a kmer-based distance approximation for rapid database search. any reads with counts > from the second enrichment library are searched for their variant frequencies across all databases. when calculating enrichment, to avoid zeros due to sampling and to regularize against over-sensitivity to low-frequency baseline variants, all frequencies are increased by the reciprocal of the size of the pre-enrichment database.
neutralization assay.
pseudotyped viruses were generated by the co-transfection of hek t cells with plasmids encoding the sars-cov- spike protein harboring an amino acid truncation of the cytoplasmic tail , a plasmid encoding firefly luciferase, and a lentiviral packaging plasmid (addgene cat# ) using lipofectamine (invitrogen). media was changed - h after transfection, and pseudotyped viruses were harvested at and h post transfection, filtered through a . µm filter, and stored at − °c until use. pseudotyped virus neutralization assays were adapted from protocols previously validated to characterize the neutralization of hiv , but with the use of hek t-ace cells. briefly, pseudotyped viruses sufficient to generate~ , rlus were incubated with serial dilutions of nanobodies for min at °c. approximately , hek t-ace cells were then added to each well and the plates were incubated at °c for h. luminescence was then measured using bright-glo (promega) per the manufacturer's instructions on a gm- luminometer (promega) with an integration time of . s. flow cytometry. cells were trypsinized and fixed in % formaldehyde/pbs and stained with rbd-as p under non-permeabilizing conditions or with ty -as p under permeabilizing conditions. fluorescence was quantified using a bd facscelesta and the flowjo software package. immunofluorescence. vero e cells were seeded onto coverslips in a -well plate and incubated overnight at °c/ % co . cells were infected with sars-cov- at a moi of for h. cells were fixed with % (v/v) formaldehyde, permeabilized in . % triton x- and blocked in % horse serum. cells were incubated with anti-dsrna antibody ( : , j scicons, rnt-sci- ) for h at room temperature followed by h staining with the secondary antibody anti-mouse-alexa fluor ( : , thermo fisher scientific, a- ), hoechst ( : , invitrogen) and ty -as p ( . µg/ml). coverslips were mounted in mounting media and images were obtained using zeiss axiovert microscope and processed using adobe photoshop. 
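the regularized enrichment calculation described in the ngs analysis above (all frequencies increased by the reciprocal of the size of the pre-enrichment database) can be sketched as follows. this is a minimal illustration, not the authors' nextgensequtils.jl code: the function names are hypothetical, python is used in place of julia, and "size of the database" is assumed here to mean the total read count of the pre-enrichment library.

```python
def variant_frequencies(counts):
    """Convert raw read counts per variant into frequencies."""
    total = sum(counts.values())
    return {variant: n / total for variant, n in counts.items()}


def regularized_enrichment(pre_counts, post_counts):
    """Fold enrichment of each post-enrichment variant over baseline.

    A pseudo-frequency equal to 1 / (pre-enrichment library size) is added
    to both numerator and denominator. This avoids division by zero for
    variants unseen before enrichment and damps the ratio for variants
    with very low baseline frequency.
    Assumption: library size = total read count of the pre-enrichment set.
    """
    pre_freq = variant_frequencies(pre_counts)
    post_freq = variant_frequencies(post_counts)
    pseudo = 1.0 / sum(pre_counts.values())
    return {
        variant: (post_freq[variant] + pseudo)
        / (pre_freq.get(variant, 0.0) + pseudo)
        for variant in post_counts
    }


# hypothetical counts, for illustration only
pre = {"variant_a": 1, "variant_b": 99}
post = {"variant_a": 50, "variant_c": 50}
enrichment = regularized_enrichment(pre, post)
```

in this toy input, `variant_c` is absent from the pre-enrichment library, yet its enrichment stays finite because the pseudo-frequency bounds the denominator away from zero.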
biophysical bli and itc. bli was performed using single-use high-precision streptavidin biosensors (sax) on an eight-channel octet red instrument according to manufacturer's protocols (fortebio) . assays were performed in xpbs comprising . % tween- (pbst). biotinylated nanobodies ty and np-vhh were loaded at concentrations between and nm followed by quenching using biocytin to reach final sensor loads of between . and . nm. for the comparative binding test, the eight sensors were divided into two sets, each comprising double sample as well as single reference and single control sensors. sample and reference sensors were loaded with respective nanobodies. the sax control was only quenched. loading of the two sets was performed consecutively to reach similar immobilization levels, while subsequent association and dissociation phases were performed simultaneously. for association, the sample and control sensors were dipped into rbd, while the reference sensor was dipped into pbst. for titration experiments, all sensors were loaded simultaneously. during association one of the sensors was used as reference and only dipped into pbst. raw data were preprocessed, analyzed, and fitted by applying the : binding model as implemented in the manufacturer's software. bayesian analysis to obtain the two-dimensional distribution of k d and k off -rate values were performed using evilfit [ ] [ ] [ ] . the shown titration data were processed applying reference sensor subtraction and savitzky-golay filter operations. for itc, proteins were exchanged to xhbs-buffer ( mm hepes, mm nacl, ph . ) and isolated as single peak populations by superdex- hr / size-exclusion chromatography. itc measurements were performed using an itc calorimeter (ge healthcare). the cell temperature was set to °c and the syringe stirring speed to rpm. before each experiment, the rbd and nanobodies were loaded into the cell and syringe at concentrations of and μm, respectively. 
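for orientation, the one-to-one binding model applied to the bli sensorgrams above has a simple closed form for the association and dissociation phases. the sketch below is an assumption-laden illustration in plain python, not the manufacturer's software or evilfit: the function names and all rate constants are hypothetical, and effects such as mass transport or surface heterogeneity are ignored.

```python
import math


def association(t, conc, k_on, k_off, r_max):
    """Response during association for a 1:1 (Langmuir) interaction.

    t      time since start of association (s)
    conc   analyte concentration (M)
    k_on   association rate constant (1/(M*s))
    k_off  dissociation rate constant (1/s)
    r_max  response at full saturation of the sensor
    """
    k_d = k_off / k_on                 # equilibrium dissociation constant
    r_eq = r_max * conc / (conc + k_d)  # plateau response at this concentration
    k_obs = k_on * conc + k_off         # observed rate of approach to plateau
    return r_eq * (1.0 - math.exp(-k_obs * t))


def dissociation(t, r0, k_off):
    """Response during dissociation, starting from response r0 at t = 0."""
    return r0 * math.exp(-k_off * t)


# hypothetical parameters, chosen only to illustrate the shape of the curves
K_ON, K_OFF, R_MAX = 1e5, 1e-3, 1.0
plateau_at_kd = association(1e5, K_OFF / K_ON, K_ON, K_OFF, R_MAX)
half_life_signal = dissociation(math.log(2) / K_OFF, 1.0, K_OFF)
```

when the analyte concentration equals the dissociation constant, the plateau sits at half of `r_max`, and the dissociation signal halves after ln(2)/k_off seconds; both properties are useful sanity checks on a fit.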
data and binding parameters were analyzed using the microcal peakitc software (malvern). the integrated heat versus molar ratio plots of the ty :rbd interactions were obtained by subtracting the ty dilution heat uptake from the binding data. the np-vhh :rbd data were only baseline-corrected, since dilution effects were not evident. raw and processed bli/itc data were imported into rstudio for visualization and further analysis [ ] [ ] [ ] . data along with analysis r scripts will be made publicly available via github and/or datadryad. cryo-em sample preparation and imaging. spike trimer ( . mg/ml) and ty ( . mg/ml) were mixed in a : molar ratio and incubated on ice for min. a -μl aliquot of the sample solution was applied to glow-discharged cryomatrix holey grids with amorphous alloy film (zhenjiang lehua technology) in a vitrobot mk iv (thermo fisher scientific) at °c and % humidity (blot s, blot force ). cryo-em data collection was performed with epu . (thermo fisher scientific) using a krios g i transmission-electron microscope (thermo fisher scientific) operated at kev in the karolinska institutet d-em facility. images were acquired in nanoprobe eftem mode with a slit width of ev using a gif energy filter (ametek) and a k detector (ametek) during . s with a dose rate of . e − /px/s resulting in a total dose of e − /å fractionated into movie frames. motion correction, ctf-estimation, fourier binning (to . Å/px), picking and extraction in pixel boxes were performed on the fly using warp . a total of , micrographs were selected based on an estimated resolution cutoff of Å and defocus below microns and , particles were picked by warp. extracted particles were imported into cryosparc v . . for d classification, d classification, and non-uniform d refinement. the particles were processed with c symmetry throughout. after d classification ( classes) , particles were retained and used to build three ab-initio d reconstructions. 
these were further processed for heterogeneous refinement that resulted in one reconstruction showing high-resolution structural features in the core of the spike. one round of homogenous refinement followed by non-uniform refinement resulted in a final reconstruction to an overall resolution of . Å ( . fsc) using , particles. localized reconstruction were performed using particles where all parts of the spike except the n-terminal domains, the rbds, and the nanobodies had been subtracted . the combined effects of these two approaches significantly increased the level of density detail in the upper part of the spike. model building and structure refinement. a structure of the -ncov spike protein trimer (pdb: vsb) was used as a starting model for model building. the model was extended and manually adjusted in coot . the nanobody structure was homology modeled using swiss-model taking pdb: jmr as a template. the missing regions of the rbd domains were built based on the rbd-ace crystal structure (pdb: lzg) . for model building and refinements, a composite map was made using phenix utilizing the particle center-of-mass focused reconstruction and the map from the localized reconstruction described above. structure refinement and manual model building were performed using coot and phenix in interspersed cycles with secondary structure and geometry restrained. all structure figures and all em density-map figures were generated with ucsf chimerax . reporting summary. further information on research design is available in the nature research reporting summary linked to this article. the sequence of ty is deposited in the ncbi genbank sequence data base and is available under the accession code mt . bli and itc data are available in https:// github.com/derpaule/ty _octet_itc and https://doi.org/ . /dryad.gb mkkwmz, respectively. next generation sequencing data is deposited at the sra, under bioproject id prjna . 
jupyter notebooks to reproduce the ngs data processing are available at: https://github.com/murrellgroup/ty . the cryo-em density map of sars-cov- spike glycoprotein with ty nanobodies bound was deposited in the electron microscopy data bank (emdb) with accession code emd- . the corresponding model was deposited in the protein data bank (pdb) with accession code zxn. source data are provided with this paper. received: june ; accepted: august ;
the authors declare no competing interests. supplementary information is available for this paper at https://doi.org/ . /s - - - . peer review information nature communications thanks the anonymous reviewers for their contribution to the peer review of this work. peer reviewer reports are available. reprints and permission information is available at http://www.nature.com/reprints publisher's note springer nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
open access this article is licensed under a creative commons attribution . international license, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the creative commons license, and indicate if changes were made. the images or other third party material in this article are included in the article's creative commons license, unless indicated otherwise in a credit line to the material. if material is not included in the article's creative commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.
to view a copy of this license, visit http://creativecommons.org/ licenses/by/ . /. key: cord- - mxcssjj authors: ishay, yuval; kessler, asa; schwarts, asaf; ilan, yaron title: antibody response to sars‐co‐v‐ , diagnostic and therapeutic implications date: - - journal: hepatol commun doi: . /hep . sha: doc_id: cord_uid: mxcssjj the immune response against sars‐cov‐ is comprised of both cellular and humoral arms. while current diagnostic methods are mainly based on pcr, they suffer from insensitivity. therefore, antibody‐based serological tests are being developed to achieve higher sensitivity and specificity. current efforts in treating sars‐cov‐ infection include blocking of viral entry into the host cells, prohibiting viral replication and survival in the host cells, or reducing the exaggerated host immune response. administration of convalescent plasma containing anti‐viral antibodies was proposed to improve the outcome in severe cases. in this paper, we review some of the aspects associated with the development of antibodies against sars‐cov‐ and their potential use for improved diagnosis and therapy. sars-cov- is an infectious rna virus responsible for causing the covid- disease ( ) . while current diagnostic methods for covid- diagnosis are mainly based on pcr, they suffer from insensitivity. widespread reports of both false positive tests and false negative tests have been reported. therefore, serological tests are being developed to identify patients suffering from covid- , and to assist in identifying subjects who have been diseased and may now be immune to re-infection or to severe disease. the host immune response mounted towards the virus contributes to disease severity. the immune response towards sars-cov- is comprised of both the cellular and humoral arms. current evidence points to the severe manifestation of covid- disease as being driven by inappropriate hyperactivation of the immune system, associated cytokine storm, and end organ damage ( , ) . 
current efforts for the treatment of covid- include blocking viral entry into the host cells, inhibiting viral replication and survival in the host cells, or reducing the exaggerated host immune response. however, these strategies have shown limited efficacy ( ) . administration of convalescent plasma was proposed to improve patient outcomes in severe cases. in this paper, we review some of the aspects associated with the development of antibodies against sars-cov- , their biology, potential uses, and expected advantages and disadvantages. sars-cov- is an enveloped, single-stranded rna virus. the viral genome encodes four structural proteins, the spike (s), envelope (e), membrane (m), and nucleocapsid (n), as well as non-structural proteins. the s protein of sars-cov- consists of two subunits, s and s . acting as a homotrimer, the heavily glycosylated s protein binds its cellular receptor, angiotensin converting enzyme (ace ), present on pneumocytes and enterocytes, via the receptor binding domain (rbd) in the c-terminal domain of the s subunit ( , ) . the s protein extends outward from the viral membrane. while the s subunit extends furthest from the viral membrane, the inner s subunit consists of a mostly helical structure leading towards the viral membrane. the interaction of s with the ace receptor leads to conformational changes in the helical s subunit. the next event in viral binding and entry is cleavage of the s /s protein subunits by cellular proteases. this proteolytic activity may be performed by the furin protease, a feature that is not unique to sars-cov- among the coronaviruses but is absent in sars-cov ( ) . the cleaving protease, which dictates the exact viral amino acid sequence exposed, also determines the pattern of viral-cell fusion ( , ) . the release of newly constructed virions, and the later activities of these new virions, are also dependent on specific protease activity ( ) .
among the sites enumerated in this description, several appear as attractive targets for biologically active antibodies. of note, while new data specifically regarding sars-cov- are continuously and vigorously obtained, much of the functional data regarding coronavirus activity and mechanisms comes from research on sars-cov and mers-cov. this is particularly pertinent where homologies in structure and function between these viruses are sought. while sequence and biological similarities are common, major differences exist that influence virus function and antibody biology. these range from cleavage by similar proteases, where sars-cov- shows unique furin sensitivity, to receptor binding, where it shares the affinity towards ace with sars-cov through highly conserved rbd residues ( ) . the final event of protective and effective antibody production is the differentiation of b-cells into plasma cells, a change accompanied by robust antibody production. a fraction of these cells will differentiate into memory b-cells, allowing for an early antibody response upon re-infection; such cells have been demonstrated after sars-cov infection ( ) . presumably, the "first contact" of sars-cov- with the immune system occurs upon introduction of viable viral particles into the airways. the very first responding part of the immune system may be the epithelial cells themselves, both acting as antigen presenting cells (apcs) ( ) and internally expressing antiviral proteins, specifically type-i interferons ( ) . type-i interferon signaling is usually initiated via toll like receptors (tlrs). variance in vulnerability to the virus, namely men being more vulnerable, has been attributed partially to superior tlr signaling in women, possibly resulting in enhanced antibody production ( ) . notably, tlr functions in b-cells as well, and may contribute to enhanced function and differentiation of plasma cells ( ) .
following initial contact with the epithelium, innate immune cells come into contact with the virus and with infected cells. the superficial intraepithelial dendritic cells (dcs) in the lungs adjacent to the airways are required for antibody production ( ) . after antigen encounter, they move to the regional lymph nodes and help trigger robust antibody production by activating cd "follicular helper" t-cells, which support b-cell function ( ) . some dc functions, including type-i interferon secretion in response to viral stimulation, are also dependent on tlr signaling ( ) . while existing research is focused on the endogenous immune response to sars-cov- and its possible beneficial manipulation, isolation of neutralizing antibodies (nabs) from infected persons, or laboratory manufacturing of these antibodies, is another subject of intense interest. monoclonal antibodies (mabs) with some neutralizing activity were demonstrated in infected human sera ( ) . nabs may be characterized in various ways, commonly by the antibody concentration required to prevent or decrease infectivity ( ) . the most attractive antibodies are those targeting the s protein, whether in the rbd or in other regions, including the s /s proteolytic cleavage site ( ) . it is plausible that antibodies targeting these sites will block essential viral functions, including viral-antigen binding (expected from s -rbd antibodies), and/or interfere with s protein-mediated viral fusion or cell entry ( ) ( ) ( ) . multiple specific regions in sars-cov- show high homology to sars-cov, suggesting potential b and t cell epitopes for sars-cov- ( ) . a set of b cell and t cell epitopes was derived from the s and n proteins which, excluding notable differences, are generally conserved between sars-cov and sars-cov- .
the lack of mutations in these identified epitopes allows assessment of possible sars-cov- immune targets ( ) . this study showed that no mutations occurred between sars-cov and sars-cov- in these sequences, supporting the possibility of antibody cross-reactivity and humoral immunity. in spite of this high homology, cross-reactivity of sars-cov antibodies between the two viral s proteins is limited ( , ) . murine polyclonal sars-cov antibodies directed against the s protein inhibited sars-cov- entry into cells, indicating that cross-nabs targeting conserved s epitopes can be produced ( ) . s -targeting mabs from immunized transgenic mice expressing human ig variable heavy and light chains can neutralize sars-cov- and sars-cov infections ( , ) . in a previously mentioned trial, sars-cov- rbd-specific mabs were generated, among which only two clones showed significant blocking of viral entry, associated with a high competitive capacity against ace receptor binding ( ) . similar results were observed in studies using sera from recovered sars and covid- patients, where limited cross-neutralization occurred, suggesting that cross-nabs are either incompletely reactive or insufficient for disease prevention ( ) . prior to and concurrently with the isolation of specific antibodies, sars-cov s -specific serum from convalescent sars patients or from animals was proposed to cross-neutralize sars-cov- infection by reducing s protein-mediated sars-cov- entry ( ) . cross-reactivity of antibodies from sars-cov- patients against the s proteins, but not against the rbd, of sars-cov and mers-cov has been documented. the roles played by the rbd in the invasion of sars-cov- into host cells make it a potential target for nabs. blocking the binding between the rbd and its receptor may restrict the conformational change of s, or hamper s -mediated membrane fusion, thereby inhibiting viral infection of host cells ( ) . the human nabs s .
and m were isolated from sars-cov-infected patients. they neutralize sars-cov infection by interacting with the rbd and blocking the binding between the viral rbd and the ace receptor ( ) . the sars-cov rbd-specific human nab cr binds the sars-cov- rbd with high affinity and recognizes an epitope on the rbd that does not overlap with the ace -binding site ( ) . the s . and s . mabs can neutralize infectious clones of sars-cov and protect mice against four different homologous and heterologous sars-cov strains ( , ) . of note, such mabs, produced in chimeric mouse cells and originating from sars-cov patients, were shown to neutralize sars-cov- virus particles by an ace -independent mechanism, which probably involves interference with s protein fusion or proteolysis, preventing viral fusion ( ) . while these studies hold both promise and interest, isolation and analysis of neutralizing antibodies remains a difficult task. a majority of patients recovered from covid- showed high titers of sars-cov- s -specific igg antibodies when tested by enzyme-linked immunosorbent assay (elisa) ( ) . however, only out of these patients manifested effective blockade of sars-cov- rbd binding to hace when tested in vitro ( ) . the transient and dynamic conformational states of the s protein have been suggested to provide only a narrow window for exposure of the immunogenic epitopes of the rbd to b lymphocytes ( ) . early and transient peak levels of anti-s antibody response were associated with a less favorable outcome for patients, compared with a more delayed and sustained response ( ) . the phage display method, which allows rapid and wide display of proteins directly linked to their associated genes, can detect nabs against sars-cov from both naïve and immune antibody libraries. these nabs are capable of blocking the binding of the s domain, thereby showing virus neutralization and prophylaxis capability either in vitro or in animal models ( , , ) .
another method, possibly allowing the production and utilization of existing nabs, is the use of epstein-barr virus (ebv) transformation of human b cells to improve the isolation of nabs from memory b cells harvested from sars-cov-infected patients ( ) . transgenic mice with human immunoglobulin genes are being developed to produce nabs against sars-cov by antigen immunization; these nabs are effective for virus prophylaxis in animal models ( , ) . cloning of human mabs using samples from covid- -recovered patients whose sera showed hace receptor binding inhibition has been reported ( ) . following antibody cloning, three pairs of igg variable heavy chain and light chain expression plasmids were expressed and named mab- b , mab- d , and mab- b . all three mabs bind to the rbd protein. mab- b and mab- d blocked the sars-cov- rbd-hace interaction and neutralized a sars-cov- s pseudotyped lentiviral particle ( ) . mab- b and mab- d also neutralized pseudovirus entry into host cells ectopically expressing hace ( ) . several nabs, such as b , f , and e , directed towards epitopes on sars-cov s manifested neutralization properties ( , ) . nucleocapsid-specific antibodies have also been demonstrated in the sera of infected patients. most studies assessing nucleocapsid antibodies have not differentiated these antibodies from other antibodies directed against sars-cov- ; studies that have done so seem to show kinetics similar to those of the general antibody response ( ) . no studies have shown the occurrence of definitive nabs directed at the n protein or the nature of the immune response triggered by such antibodies. serum igg, igm, and iga antibodies against sars-cov appeared in patients after primary sars infection ( ) . data on the production of igg and igm are important for improved diagnosis of covid- ( ) . several studies have described the dynamics of antibody production in these patients.
while it is too early to definitively summarize the characteristics of antibody dynamics, certain conclusions seem consistent across these studies. broadly, antibody titers increase and the prevalence of viral rna decrease as time progresses from the symptomatic disease onset ( , ) . elisa-based diagnostic kits often report a specificity of ~ % ( ), with some trials reporting higher percentage ( ) . while this is an impressive figure by itself, it may yield a relatively poor positive predictive value (ppv) when employed on a large scale to a disease with relatively low prevalence. elisa tests were argued to be efficient when trying to augment the sensitivity of testing of close contacts ( , ) , or deciding to allow a person to leave from quarantine. this specificity may be further reduced when testing a person recently exposed to the milder coronaviruses circulating within humans and livestock. however, to our knowledge, this question has not been directly assessed. igg and igm antibodies may appear simultaneously or sequentially, with cases of igm antibodies appearing last being described in some of the studies ( ) . conversion from seronegativity to seropositivity is likely to occur between - days after the onset of symptoms. data from some of these studies show that the patients with more severe illness were more likely to mount a high-titer and high-affinity antibody response, which was not necessarily associated with a reduction in the viral rna assayed from their blood ( ) . this is supported by the reports of recurring pcr positivity after igg seroconversion ( ) . if these studies become the prevalent findings, they may stand in a stark contrast to well established viral disease behaviors where high igg levels are thought to denote virtual immunity to the disease, allowing, at most, a mild manifestation upon re-exposure. 
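the point made above, that a seemingly high specificity can still yield a poor positive predictive value (ppv) when prevalence is low, follows directly from bayes' rule. the sketch below works through it; the function name and all numeric values are hypothetical and chosen only for illustration, since the figures reported in the cited studies are not reproduced here.

```python
def positive_predictive_value(sensitivity, specificity, prevalence):
    """Probability that a positive test result is a true positive.

    PPV = (sens * prev) / (sens * prev + (1 - spec) * (1 - prev))
    All three arguments are fractions between 0 and 1.
    """
    true_positive_rate = sensitivity * prevalence
    false_positive_rate = (1.0 - specificity) * (1.0 - prevalence)
    return true_positive_rate / (true_positive_rate + false_positive_rate)


# hypothetical test characteristics (not taken from the cited studies)
SENSITIVITY = 0.90
SPECIFICITY = 0.95

# same test, two hypothetical settings
ppv_low_prevalence = positive_predictive_value(SENSITIVITY, SPECIFICITY, 0.01)
ppv_high_prevalence = positive_predictive_value(SENSITIVITY, SPECIFICITY, 0.20)
```

with these assumed values, the same test gives a ppv of roughly 0.15 at 1% prevalence and roughly 0.82 at 20% prevalence, which is why testing close contacts (a higher-prevalence group) is a more favorable use than mass screening.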
it seems that in covid- , as our current understanding stands, antibody titers should be thought of as disease markers and not as definitive markers of immunity or disease resolution in the actively ill. in antibody detection, different elisa kits, based on the recombinant sars-cov- nucleocapsid protein (rn) and recombinant spike protein (rs), show variable results. in a study of patients with confirmed covid- , % were diagnosed with rn-based igm, % with rn-based igg, % with rs-based igm, and % with rs-based igg tests. the positive rates for rn-based and rs-based elisa detections were % for igm and % for igg. the sensitivity of the rs-based elisa for igm was higher than that of the rn-based test. here also, antibody positivity increased as the disease progressed ( ) . another result expected from antibody testing is the identification of immune and recovered persons who may be able to work in critical locations during a pandemic. the ability to definitively identify specific nabs in the serum of recovered patients could also allow identification of potential plasma donors for the development of passive immunization, and may assist in evaluating the effectiveness of various treatments and in determining prognosis ( ) . most convalescent plasmas obtained from individuals who recover from covid- do not contain high levels of nabs. a recent analysis of covid- convalescent individuals evaluated plasmas collected an average of days after the onset of symptoms, showing variable half-maximal pseudovirus neutralizing titers: below : in % and below : , in %. only % showed titers above : , . expanded clones of rbd-specific memory b cells expressing closely related abs in different individuals were identified. the abs were directed against three distinct epitopes on the rbd. rare but recurring rbd-specific antibodies with potent antiviral activity were identified in all recovered subjects ( ) .
the relevance of the titers for the clinical effect is yet to be determined. a recent review analyzed the diagnostic accuracy of antibody tests for sars-cov- infection, for assessing past infections and for use in seroprevalence surveys ( ) . a total of publications reporting cohorts with , samples, of which were from cases of sars-cov- infection, were evaluated. substantial heterogeneity was found in the sensitivities of iga, igm and igg abs, or combinations thereof, for results aggregated across different time periods post-symptom onset. pooled results for igg, igm, iga, total antibodies and igg/igm showed low sensitivity during the first week after the onset of symptoms, rising in the second week and reaching their highest values in the third week. the sensitivity of antibody tests was judged too low in the first week after symptom onset for them to have a primary role in diagnosis, but they were suggested to have a role complementing other testing in individuals presenting later, when rt-pcr tests are negative. antibody tests are useful for detecting previous sars-cov- infection if used or more days after the onset of symptoms ( ) . several currently available covid- antibody tests that are used in diagnostics and epidemiology are summarized in table , with a focus on their strengths and weaknesses. the lack of specific sars-cov- -targeted treatments and vaccines poses great challenges for the management of patients with severe illness. igg levels against sars-cov, measured in affected patients, reach peak serum concentration during the convalescent phase and decline following recovery ( ) . while the concentration required for antibodies to neutralize the virus was highly variable, some antibodies indeed showed neutralizing capability and have been shown to provide protection against re-infection in a mouse model ( ) . use of convalescent plasma and development of nabs are attractive methods for the treatment of viral infections ( , ) .
blocking mabs with high antigen specificity were proposed as potential candidates for neutralizing infections ( ) ( ) ( ). convalescent plasma has intermittently emerged during the last few decades as a treatment for various infectious diseases ( ) ( ) ( ), enjoying attention whenever diseases prove resistant to more conventional treatment methods. plasma-derived nabs can provide passive immune responses to viral infections and were effective in patients with severe illnesses caused by other viruses ( , ). a meta-analysis showed that mortality was reduced in patients with severe acute respiratory infections who received various doses of convalescent plasma, with no adverse events or complications after treatment ( ). antibodies from convalescent plasma were proposed to reduce viremia by enhancing viral clearance, blocking infection of new cells, and contributing to the clearance of infected cells ( , , ). during the sars epidemic, severely ill patients who deteriorated despite treatment with methylprednisolone were given convalescent plasma at around the th day after disease onset. earlier plasma administration correlated with a better prognosis and a higher rate of hospital discharge at day ( ). convalescent plasma or immunoglobulins were effective in sars patients whose condition continued to deteriorate. some studies suggested a shorter hospital stay and lower mortality rate following convalescent plasma administration ( , , ( ) ( ) ( ) ( ) ( ) . a similar trend for treatment timing was described in patients with lassa fever in nigeria treated with convalescent plasma ( ). the empirical use of convalescent plasma for ebola virus disease showed some positive results ( ) ( ) ( ).
experimental and clinical data on the use of convalescent plasma products and humanized monoclonal antibodies for h n influenza infection have also shown positive outcomes, and this treatment was proposed as a means of overcoming anti-viral drug resistance ( , , ). in a study involving patients with severe pandemic influenza a (h n ) virus infection, administration of convalescent plasma reduced respiratory tract viral load, serum cytokine response, and mortality ( ). a prospective cohort study during the pandemic showed a reduction in the relative risk of mortality in patients treated with convalescent plasma, along with reduced viral loads and no adverse effects ( ). a randomized trial of convalescent plasma failed to achieve its primary end point, a reduction of mortality; however, a subgroup multivariate analysis performed on of the patients enrolled in the trial demonstrated that h-ivig treatment was the only factor independently associated with reduced mortality ( ). development of nabs against sars-cov- was proposed as a method for developing therapeutic agents for covid- ( , , , ). several sars-cov- proteins (discussed above) are attractive targets for nabs. the sars-cov- s protein is a target for developing nabs to block its binding and fusion ( ). currently, no sars-cov- -specific nabs have been reported ( ). however, polyclonal antibodies from recovered sars-cov- -infected patients are being used to treat patients with severe infections. while many patients will develop an antibody response following their illness, specific characterization of these antibodies and their properties as nabs has yet to be determined ( , ). early administration of convalescent plasma was advised in order to maximize its viral clearance effect ( ).

plasma collection is done via apheresis.
in order to qualify for donation, the donor must meet several conditions: the diagnosis of prior covid- infection must be confirmed by pcr; donation needs to take place - days after resolution of the symptoms, followed by two consecutive negative pcr results; donors need to be tested for the absence of transmissible pathogens; and donors should be male or nulliparous female, with no previous exposure to blood products, in order to minimize the risk of transfusion-associated acute lung injury (trali). plasma ( - ml) is donated according to abo compatibility. pathogen inactivation measures need to be undertaken ( ). it is advised to administer up to two units of plasma, possibly from two different donors ( ). several studies that described the administration of convalescent plasma to critically ill covid- patients suggested post-transfusion viral elimination and clinical improvement. a study of critically ill patients (n= ) reported clinical improvement in patients' status and laboratory indication of viral clearance for up to days post-transfusion of two consecutive doses of convalescent plasma (total ml) ( ). three of the patients were on mechanical ventilation and two on ecmo, and the treatment was provided between - days after hospitalization, following which improvement in fever, pao /fio ratio, and viral clearance were noted. three patients were discharged from the hospital, and two were in stable condition at the end of the follow-up period. although a clinical effect was obtained, the delay of up to three weeks in administration and the concurrent use of other therapies make it difficult to assess the effect of plasma ( ). administration of convalescent plasma in six critically ill patients was followed by discontinued viral shedding three days after infusion, without a reduction in mortality ( ).
a study in six covid- patients showed clinical, radiological, and laboratory improvement following administration of abo-compatible convalescent plasma, suggesting that this therapy is effective and specific ( ). in a study of patients with severe disease, administration of ml of convalescent plasma improved clinical, laboratory, and radiological status without severe adverse effects ( ). in this study, the antibody titers of the donors' plasma were assessed and found to be elevated in the majority of donors, along with a concurrent increase in nab titers in the patients' sera following transfusion. treatment within two weeks of initial symptoms improved the response ( ). differences in outcomes between the studies may reflect temporal variations in administration, including the time lag between plasma donation and administration as well as the time from disease detection to treatment. safety evaluation of candidate antibodies must not be overlooked. although antibodies are generally protective, the antibody-dependent enhancement (ade) phenomenon of viral infections is documented for dengue virus and other viruses ( ). in sars-cov infection, ade is mediated by the engagement of fc receptors (fcrs) expressed on various immune cells, including monocytes, macrophages and b cells ( ). pre-existing sars-cov-specific antibodies were proposed to promote viral entry into fcr-expressing cells. internalization of virus-antibody immune complexes may induce inflammation and tissue injury by activating myeloid cells via fcrs ( ). figure presents several putative and proven nab interactions in covid- , such as antibody targets and functions including those associated with disruption and non-disruption binding mechanisms, and those targeting the virus itself.
in addition, non-neutralizing antibodies, cross-reactive antibodies, and antibodies with low specificity or low titers, which are unable to act as nabs, are also generated. several large trials using convalescent plasma are being conducted ( ). identifying and cloning mabs that target viral proteins to block entry into host cells is being explored for preventing and treating covid- ( , ). computational simulation of antibody-antigen complexes can improve the design of these therapies. key residues between rbd and nabs can be identified, and models are being used to assess the interaction between s protein and human ace or antibodies ( , , ( ) ( ) ( ) ( ) . several methods for improving the effectiveness of convalescent plasma or nabs are being considered. the outcomes of passive convalescent plasma therapy from recovered donors are unpredictable due to variability among the donors in both the levels and types of antibodies ( ). appropriate selection of donors is required for improving the quality of the collected plasma. antibody titers need to be assessed prior to harvesting because of the marked variability in titers among donors. titers correlate with disease severity, timing of donation, and use of steroids during acute illness ( , ), the quality of the antibodies (i.e. whether they are nabs or not) notwithstanding. timing of plasmapheresis is a major factor, as lower levels of antibodies are detected within the first two weeks following recovery ( ). more data are required on the amount of virus neutralization by antibodies upon exposure to convalescent plasma. in vitro testing for neutralizing and/or cross-neutralizing activity, and in vivo evaluation in available covid- animal models for protective efficacy, along with preclinical studies and clinical trials testing the safety and efficacy, are needed for optimizing this therapeutic option ( ). the gender of the donor also plays a role in mounting a significant response.
the degree of activation of the immune cells is higher in women than in men, which correlates with the triggering of tlr and production of proinflammatory cytokines. tlr is expressed in innate immune cells, which recognize single-stranded rna viruses and promote the production of antibodies against the virus and the generation of the inflammatory cytokines il- and il- . tlr expression is higher in women than in men and may lead to better immune responses and increased resistance to viral infections ( ). pairing hla-typing with covid- was proposed to improve the assessment of disease severity and assist in preferred donor selection ( ). the use of hyper-immune globulin rather than whole plasma was proposed for improving efficacy and validity of the therapy. the main advantage is the ability to provide patients with controlled quantities of antibodies in lower volumes ( ). similar techniques for concentrating antibodies are being used for the treatment and prevention of other diseases ( ). this is similar to the concept of using hyper-immune globulin for various indications, including viral diseases in immunocompetent and immunocompromised hosts ( ) ( ) ( ). a "cocktail antibody approach" for sars-cov- was proposed based on studies suggesting that the combination of antibodies from diverse donors may exert a synergistic neutralization effect ( ). a mixture of two antibodies showed a synergistic neutralization effect due to recognition of different epitopes on rbd ( ). the use of immune adjuvants may also improve the response to the antibodies ( ). sphingolipid-based adjuvants, when administered with antibodies, augmented the anti-viral response ( ) and improved the systemic anti-inflammatory effects of antibodies ( ). the use of hyper-immune bovine colostrum composed of antibodies and sphingolipids was effective in reducing systemic inflammation ( ) ( ) ( ).
the mode of antibody administration may also affect the outcome of antibody-based therapy. oral administration of antibodies ameliorated viral-mediated chronic inflammation via promotion of regulatory t cells ( ), and oral administration of viral antigens augmented anti-viral immunity while reducing inflammation ( , ). the data on the possible harmful effects of antibody-mediated immune response in the development of pulmonary complications of sars-cov are controversial. several patients who died of sars manifested strong nab responses and pulmonary inflammation, suggesting that the nabs could be associated with the deterioration of the lung disease ( , ). similar notions have been proposed to explain the more severe phenotype of covid- prevalent in china. this may be related to a higher degree of exposure to milder coronaviruses and a "priming" of the immune system by pre-existing antibodies, leading to immune dysfunction and over-function ( ). this notion is supported by the mild disease manifestations in patients with agammaglobulinemia ( ). previous exposure to coronaviruses may also explain a relatively high prevalence of spike protein-reactive cd cells in the healthy donors in a study ( ). a major obstacle to implementing immune-based therapies for viruses, including the administration of mabs, is the development of viral resistance through immune evasion mechanisms, which the virus generates in response to the immune pressure imposed on it by the immunomodulatory agents ( ). prolonged exposure to anti-viral drugs is associated with drug resistance, leading to persistent viremia or severe disease. in cases where anti-viral treatment is highly effective, leading to viral elimination, resistance is less likely to occur.
however, immunotherapy, including the administration of antibodies, is associated with selective pressure that may result in rapid viral and host adaptations leading to resistance to the therapy ( ). both host and viral factors are associated with the development of resistance. viral-related tools include mechanisms of viral replication, genomic inference, and high rates of viral mutations ( , ) , ( ) . an immune adaptation process towards antibody-induced pressure on the virus or on anti-viral humoral and cellular responses may limit the efficacy and longevity of these therapies. combination of several potent nabs could improve the sensitivity to neutralization ( ). methods for overcoming resistance by implementing host-tailored variability are being developed based on data generated from the use of these methods for improving the effects of other immunomodulatory drugs ( ) ( ) ( ) ( ). these include implementing artificial intelligence methods for overcoming host compensatory responses in sepsis and its sequelae ( ), and for improving the effects of adjuvants ( ). algorithm-controlled treatment regimens are now being used in several clinical trials for overcoming drug resistance (nct ; nct ).

the lack of accurate diagnostic and effective therapeutic methods for sars-cov- -infected patients created a need to develop humoral-based approaches. while this approach holds promise, more data are needed for optimizing antibody-based diagnosis, and for improving the implementation of convalescent plasma and other antibody-based therapies. the potential development of effective vaccines will benefit from the results achieved from these diagnostic and therapeutic attempts. immunotherapeutic methods are expected to require targeting the cellular arm of the immune system, either in addition to or as part of the design of antibody-based approaches, mainly for alleviating the immune-mediated target organ damage in covid- . this article is protected by copyright.
all rights reserved sars-cov- : severe acute respiratory syndrome coronavirus ; s protein: spike protein; s and s : s protein subunits; rbd: receptor binding domain; abs: antibodies; nabs: neutralizing antibodies; and ace : angiotensin-converting enzyme . how to reduce the likelihood of coronavirus- (cov- or sars-cov infection and lung inflammation mediated by il- sars-cov- : a storm is raging immune responses and pathogenesis of sars-cov- during an outbreak in iran: comparison with sars and mers therapeutic opportunities to manage covid- /sars-cov- infection: present and future coronavirus membrane fusion mechanism offers a potential target for antiviral development function, and antigenicity of the sars-cov- spike glycoprotein cell entry mechanisms of sars-cov- sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor crystal structure of sars-cov- main protease provides a basis for design of improved α-ketoamide inhibitors a pneumonia outbreak associated with a new coronavirus of probable bat origin an efficient method to make human monoclonal antibodies from memory b cells: potent neutralization of sars coronavirus epithelial mhc class ii expression and its role in antigen presentation in the gastrointestinal and respiratory tracts type i interferons as regulators of lung inflammation coronavirus cov- /sars-cov- affects women less than men: clinical response to viral infection tlr -and tlr -responsive human b cells share phenotypic and genetic characteristics all rights reserved . lambrecht bn, hammad h. 
lung dendritic cells in respiratory viral infection and asthma: from protection to immunopathology dendritic cells and humoral immunity in humans sars coronavirus papain-like protease inhibits the type i interferon signaling pathway through interaction with the sting-traf -tbk complex neutralization of virus infectivity by antibodies: old problems in new perspectives neutralizing antibodies against sars-cov- and other human coronaviruses mers-cov spike protein: a key target for antivirals an emerging coronavirus causing pneumonia outbreak in wuhan, china: calling for developing therapeutic and prophylactic strategies a sequence homology and bioinformatic approach can predict candidate targets for immune responses to sars-cov- preliminary identification of potential vaccine targets for the covid- coronavirus (sars-cov- ) based on sars-cov immunological studies cryo-em structure of the -ncov spike in the prefusion conformation potent binding of novel coronavirus spike protein by a sars coronavirus-specific human monoclonal antibody characterization of spike glycoprotein of sars-cov- on virus entry and its immune cross-reactivity with sars-cov human monoclonal antibodies block the binding of sars-cov- spike protein to angiotensin converting enzyme receptor sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor potent cross-reactive neutralization of sars coronavirus isolates by human monoclonal antibodies structural basis for potent cross-neutralizing human monoclonal antibody protection against lethal human and zoonotic severe acute respiratory syndrome coronavirus challenge potent neutralization of severe acute respiratory syndrome (sars) coronavirus by a human mab to s protein that blocks receptor association immunogenicity and structures of a rationally designed prefusion mers-cov spike antigen serological assays for sars-cov- infectious disease: benefits, limitations and perspectives perspectives on therapeutic 
neutralizing antibodies against the novel coronavirus sars-cov- generation and characterization of human monoclonal neutralizing antibodies with distinct binding and sequence features against sars coronavirus using xenomouse development and characterization of a severe acute respiratory syndrome-associated coronavirus-neutralizing human monoclonal antibody that provides effective immunoprophylaxis in mice rapid generation of fully human monoclonal antibodies specific to a vaccinating antigen a human sars-cov neutralizing antibody against epitope on s protein human monoclonal antibodies against highly conserved hr and hr domains of the sars-cov spike protein are more broadly neutralizing severe acute respiratory syndrome coronavirus −specific antibody responses in coronavirus disease longitudinal profile of immunoglobulin g (igg), igm, and iga antibodies against the severe acute respiratory syndrome (sars) coronavirus nucleocapsid protein in patients with pneumonia due to the sars coronavirus sars-cov- infection: response of human immune system and possible implications for the rapid test and treatment antibody detection and dynamic characteristics in patients with covid- profiling early humoral response to diagnose novel coronavirus disease (covid- ) diagnostic value and dynamic variance of serum antibody in coronavirus disease antibody responses to sars-cov- in patients with covid- antibody responses to sars-cov- in patients of novel coronavirus disease letter to the editor: three cases of re-detectable positive sars-cov- rna in recovered covid- patients with antibodies evaluation of nucleocapsid and spike protein-based elisas for detecting antibodies against sars-cov- the important role of serology for covid- control convergent antibody responses to sars-cov- in convalescent individuals antibody tests for identification of current and past infection with sars-cov- disappearance of antibodies to sars-associated coronavirus after recovery collecting and evaluating 
convalescent plasma for covid- treatment: why and how convalescent plasma as a potential therapy for covid- monoclonal antibodies for emerging infectious diseases -borrowing from history monoclonal antibody-based therapies for microbial diseases convalescent plasma treatment reduced mortality in patients with severe pandemic influenza a (h n ) virus infection read in the section on diseases of children, at the forty-fourth annual meeting of the american medical association meta-analysis: convalescent blood products for spanish influenza pneumonia: a future h n treatment? feasibility, safety, clinical, and laboratory effects of convalescent plasma therapy for patients with middle east respiratory syndrome coronavirus infection: a study protocol use of convalescent plasma therapy in sars patients in hong kong the use of tkm- and convalescent plasma in patients with ebola virus disease in the united states the effectiveness of convalescent plasma and hyperimmune immunoglobulin for the treatment of severe acute respiratory infections of viral etiology: a systematic review and exploratory meta-analysis hiv- therapy with monoclonal antibody bnc elicits host immune responses against hiv- enhanced clearance of hiv- -infected cells by broadly neutralizing antibodies against hiv- in vivo use of convalescent plasma therapy in sars patients in hong kong treatment of severe acute respiratory syndrome retrospective comparison of convalescent plasma with continuing high-dose methylprednisolone treatment in sars patients profile of specific antibodies to the sars-associated coronavirus viral shedding and antibody response in patients with middle east respiratory syndrome coronavirus infection chronological evolution of igm, iga, igg and neutralisation antibodies after infection with sarsassociated coronavirus the use of lassa fever convalescent plasma in nigeria use of convalescent whole blood or plasma collected from patients recovered from ebola virus disease for transfusion, as 
an empirical treatment during outbreaks: interim guidance for national health authorities and blood transfusion services: world health organization administration of brincidofovir and convalescent plasma in a patient with ebola virus disease the use of tkm- and convalescent plasma in patients with ebola virus disease in the united states passive immunoprophylaxis and therapy with humanized monoclonal antibody specific for influenza a h hemagglutinin in mice treatment with convalescent plasma for influenza a (h n ) infection convalescent plasma treatment reduced mortality in patients with severe pandemic influenza a (h n ) virus infection hyperimmune iv immunoglobulin treatment: a multicenter double-blind randomized controlled trial for patients with severe influenza a(h n ) infection a pneumonia outbreak associated with a new coronavirus of probable bat origin therapeutic strategies in an outbreak scenario to treat the novel coronavirus originating in wuhan, china antibody responses to sars-cov- in patients of novel coronavirus disease challenges of convalescent plasma therapy on covid points to consider in the preparation and transfusion of covid- convalescent plasma treatment of critically ill patients with covid- with convalescent plasma convalescent plasma to treat covid- : possibilities and challenges effect of convalescent plasma therapy on viral shedding and survival in covid- patients treatment with convalescent plasma for covid- patients in wuhan effectiveness of convalescent plasma therapy in severe covid- patients effectiveness of convalescent plasma therapy in severe covid- patients the potential danger of suboptimal antibody responses in covid- antibody-dependent sars coronavirus infection is mediated by antibodies against spike proteins convalescent serum lines up as first-choice treatment for coronavirus affinity maturation of t-cell receptor-like antibodies for wilms tumor peptide greatly enhances therapeutic potential engineered antibody ch domains 
binding to nucleolin: isolation, characterization and improvement of aggregation alteration of electrostatic surface potential enhances affinity and tumor killing properties of anti-ganglioside gd monoclonal antibody hu f affinity maturation of antibodies assisted by in silico modeling could intravenous immunoglobulin collected from recovered coronavirus patients protect against covid- and strengthen the immune system of new patients? anti-sars-cov- virus antibody levels in convalescent plasma of six donors who have recovered from covid- different longitudinal patterns of nucleic acid and serology testing results based on disease severity of covid- patients human leukocyte antigen susceptibility map for sars-cov- treatment of critically ill patients with covid- with convalescent plasma safety and efficacy results of simulated post-exposure prophylaxis with human immune globulin (hrig; kedrab) co-administered with active vaccine in healthy subjects: a comparative phase / trial hyperimmune globulin in pregnancy for the prevention of congenital cytomegalovirus disease ri- , an intravenous immunoglobulin containing high titer neutralizing antibody to rsv and other respiratory viruses for use in primary immunodeficiency disease and other immune compromised populations the immunology of posttransplant cmv infection: potential effect of cmv immunoglobulins on distinct components of the immune response to cmv human monoclonal antibody combination against sars coronavirus: synergy and coverage of escape mutants improving vaccine performance with adjuvants beta-glycoglycosphingolipid-induced augmentation of the anti-hbv immune response is associated with altered cd and nkt lymphocyte distribution: a novel adjuvant for hbv vaccination induction of regulatory t cells decreases adipose inflammation and alleviates insulin resistance in ob/ob mice oral administration of immunoglobulin g-enhanced colostrum alleviates insulin resistance and liver injury and is associated with 
alterations in natural killer t cells alleviation of insulin resistance and liver damage by oral administration of imm -e is mediated by increased tregs and associated with increased serum glp- and adiponectin: results of a phase i/ii clinical trial in nash imm- e improves metabolic endotoxemia and markers of liver injury in nonalcoholic steatohepatitis oral anti-cd immunotherapy for hcv-nonresponders is safe, promotes regulatory t cells and decreases viral load and liver enzyme levels: results of a phase- a placebo-controlled trial induction of oral immune regulation towards liver-extracted proteins for treatment of chronic hbv and hcv hepatitis: results of a phase i clinical trial treatment of chronic hepatitis b virus infection via oral immune regulation toward hepatitis b virus proteins anti-spike igg causes severe acute lung injury by skewing macrophage responses during acute sars-cov infection is covid- receiving ade from other coronaviruses? a possible role for b cells in covid- ?: lesson from patients with agammaglobulinemia sars-cov- -reactive t cells in patients and healthy donors targeting sars-cov- receptors as a means for reducing infectivity and improving antiviral and immune response: an algorithm-based method for overcoming resistance to antiviral agents antiviral drug resistance as an adaptive process antiviral peptides as promising therapeutic drugs targeting viral entry as a strategy for broadspectrum antivirals a personalized signature and chronotherapy-based platform for improving the efficacy of sepsis treatment introducing patterns of variability for overcoming compensatory adaptation of the immune system to immunomodulatory agents: a novel method for improving clinical response to anti-tnf therapies beta-glycosphingolipids as mediators of both inflammation and immune tolerance: a manifestation of randomness in biological systems generating randomness: making the most out of disordering a false order into a real one different longitudinal 
patterns of nucleic acid and serology testing results based on disease severity of covid- patients rapid diagnosis of sars-cov- infection by detecting igg and igm antibodies with an immunochromatographic device: a prospective single-center study assessment of immune response to sars-cov- with fully automated maglumi -ncov igg and igm chemiluminescence immunoassays serology characteristics of sars-cov- infection since the exposure and post symptoms onset antibody detection and dynamic characteristics in patients with covid- re: profile of specific antibodies to sars-cov- : the first report evaluation of enzymelinked immunoassay and colloidal goldimmunochromatographic assay kit for detection of novel coronavirus (sars-cov- ) causing an outbreak of pneumonia (covid- ) this article is protected by copyright. all rights reserved

key: cord- -cha ndv authors: horspool, a. m.; kieffer, t.; russ, b. p.; dejong, m. a.; wolf, m. a.; karakiozis, j. m.; hickey, b. j.; fagone, p.; tacker, d. h.; bevere, j. r.; martinez, i.; barbier, m.; perrotta, p. l.; damron, f. h. title: interplay of antibody and cytokine production reveals cxcl- as a potential novel biomarker of lethal sars-cov- infection date: - - journal: nan doi: . / . . . sha: doc_id: cord_uid: cha ndv the sars-cov- pandemic is continuing to impact the global population. this study was designed to assess the interplay of antibodies with the systemic cytokine response in sars-cov- patients.
we demonstrate that significant anti-sars-cov- antibody production to receptor binding domain (rbd), nucleocapsid (n), and spike s subunit (s ) of sars-cov- develops over the first to days of infection. the majority of patients produced antibodies against all three antigens ( / sars-cov- positive patient specimens, %), suggesting a broad response to viral proteins. patient mortality, sex, blood type, and age were all associated with differences in antibody production to sars-cov- antigens, which may help explain variation in immunity between these populations. to better understand the systemic immune response, we analyzed the production of cytokines by sars-cov- patients over the course of infection. cytokine analysis showed that sars-cov- positive patients exhibited increases in proinflammatory markers (il- , il- , il- ) and chemotactic markers (ip- , sdf- , mip- β, mcp- , and eotaxin) relative to healthy individuals. patients who succumbed to infection produced decreased il- , il- , il- , il- , rantes, tnf-, gro-, and mip- relative to patients who survived infection. we also observed that the chemokine cxcl was particularly elevated in patients who succumbed to infection. cxcl is involved in b cell activation, germinal center development, and antibody maturation, and we observed that cxcl levels in blood trended with anti-sars-cov- antibody production. furthermore, patients who succumbed to infection produced high cxcl and also tended to have a high ratio of nucleocapsid to rbd antibodies. this study provides insights into sars-cov- immunity, implicating the magnitude and specificity of response in relation to patient outcomes.

loading, plates were incubated for minutes at room temperature shaking at rpm. plates were then washed four times with pbs-t. secondary antibody buffer ( µl of % milk diluted in pbs-t containing : goat anti-human-igg-hrp; invitrogen part #: ) was added immediately following the washing procedure.
the plates were incubated for minutes at room temperature shaking at rpm. plates were washed five times with pbs-t. sigmafast opd substrate (sigma part#: p ) was prepared in milliq ( . mΩxcm) water and µl was aliquoted into each well. ten minutes after loading the substrate, µl of stop solution ( n hcl) was added to end colorimetric development. the absorbance of the substrate in each well was measured on a synergy h (biotek) spectrophotometer at nm. antibody concentration was calculated based on area under the curve analyses of a vs. dilution factor plots for each sample. prepared for analysis by heating at °c for hour. samples were then centrifuged at , x g for min to pellet aggregates. samples ( µl) were diluted : with universal assay buffer and incubated at room temperature on an orbital shaker at rpm for hour. select samples (based on sample quantity) were diluted : or : with the universal assay buffer, which was taken into account during analysis. a standard curve was generated using antigen standards provided by the manufacturer. samples were resuspended in µl wash buffer prior to running on a magpix (luminex) instrument, and µl was analyzed per sample. bead cytokine production and antibody production were pooled into microsoft excel and imported to clustvis . data were transformed by the ln(x) transformation provided in the webtool and grouped with a % confidence interval. groups were based on patient . cc-by-nc-nd . international license it is made available under a is the author/funder, who has granted medrxiv a license to display the preprint in perpetuity. (which was not certified by peer review) the copyright holder for this preprint this version posted august , .
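The area-under-the-curve readout described above for the rapid ELISA can be illustrated with a minimal sketch. It assumes a plain trapezoidal rule on a linear dilution-factor axis; the source does not state the integration method, the axis scaling, or the software used, so the function name `auc_trapezoid` and the example values are hypothetical.

```python
def auc_trapezoid(dilution_factors, absorbances):
    """Area under an absorbance vs. dilution-factor curve (trapezoidal rule).

    Assumption: linear x-axis and simple trapezoids; the study does not
    specify how its area-under-the-curve values were integrated.
    """
    if len(dilution_factors) != len(absorbances):
        raise ValueError("each dilution factor needs one absorbance reading")
    if len(dilution_factors) < 2:
        raise ValueError("at least two points are needed to define an area")
    # Sort by dilution factor so the curve is integrated left to right.
    points = sorted(zip(dilution_factors, absorbances))
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2.0
    return area


# Hypothetical serial dilution (illustrative values, not study data).
example_auc = auc_trapezoid([1, 2, 4], [1.0, 0.5, 0.25])
```

A single summary value per sample makes it possible to compare sera that were titrated over the same dilution series; if the dilution series differs between samples, the areas are not directly comparable under this sketch.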
in-patient anti-sars-cov- antibody production: antibody binding target and the timing of the antibody response are critical factors in mediating immunity. we evaluated anti-sars-cov- antibody production to antigens (rbd, n, and s ) in in-patients (table ) by developing a novel rapid-elisa technique. our rapid-elisa technology evaluates igg antibody production to the sars-cov- rbd, n, and s proteins in approximately hour with greater than % accuracy (supplementary table ). our survey of sars-cov- positive patients demonstrated that antibody (igg) production to rbd, n, and s proteins developed over the first to days post-symptom onset (figure a supplementary data file). to better understand the kinetics of the antibody response, we plotted igg production of every patient over time to rbd, n, or s . patients produced igg against rbd rapidly after symptom onset with the peak igg response occurring days after symptom onset (figure g). anti-s igg production escalated over a slightly larger period ( days, figure i) and anti-n igg production was slower than either anti-rbd or anti-s antibody production ( days, figure h). taken together, these data describe the breadth and timing of the igg response to sars-cov- antigens. populations, we analyzed patient groups based on sex, patient mortality, blood type, and age against anti-rbd, anti-n, or anti-s antibody production. as igg production is more consistently detectable after ten days post-symptom onset , , we assessed differences in igg production beyond ten days post-symptom onset. limiting sample analysis to those greater than ten days post-symptom onset did not significantly impact the mean antibody production of the patients (supplementary figure ).
patients who did not survive sars-cov- hospitalization produced significantly more antibodies to sars-cov- n than patients that survived infection (figure a). furthermore, patients that did not survive sars-cov- infection did not produce different quantities of anti-n antibodies than surviving patients during early infection (supplementary figure ). to accurately assess differences in antibody production independently of disease outcome, we quantified anti-sars-cov- igg production in patients who survived infection grouped by biological sex, blood type, and age. we determined that, in our cohort, females produced significantly more anti-s igg than males (figure b). we also observed that blood type was significantly associated with anti-sars-cov- igg production (figure c). blood type b+ patients produced significantly more igg to rbd and s than a+ or o+ patients (figure c) and a+ patients produced the lowest quantities of anti-rbd and anti-s igg. o+ patients produced reduced anti-n igg relative to a+ or b+ patients. previous studies have identified that age impacts antibody production to sars-cov- , . our study demonstrates that antibody production against rbd or s antigens increased with age (figure d). in contrast, antibody production to n increased in patients over years old but did not continue to increase with age after years of age. this is particularly evident when examining pearson correlations between age and anti-sars-cov- igg production for each antigen (supplementary figure ). overall, these data document a significant impact of patient demographics on anti-sars-cov- antibody production.
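The Pearson correlations between age and igg production mentioned above can be sketched as follows. This is a minimal illustration using the standard product-moment formula; the function name `pearson_r` and the example values are hypothetical, and the source does not say which software computed its correlations.

```python
import math


def pearson_r(xs, ys):
    """Pearson product-moment correlation between two equal-length samples."""
    n = len(xs)
    if n != len(ys):
        raise ValueError("samples must have the same length")
    if n < 2:
        raise ValueError("at least two paired observations are required")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of cross-products and squared deviations from the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sample")
    return sxy / math.sqrt(sxx * syy)


# Hypothetical ages and antibody readouts (illustrative values, not study data).
ages = [30, 45, 60, 75]
igg_auc = [1.0, 1.5, 2.0, 2.5]
r_age_igg = pearson_r(ages, igg_auc)
```

A coefficient near 1 would correspond to antibody production rising steadily with age, while a value near 0 would be consistent with the plateau described for anti-n igg, under the assumption that the relationship is roughly linear.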
changes in sars-cov- patient cytokine responses correlate with disease severity: antibody production represents the antigen-specific response to pathogens but is only one facet of immunity. we examined the broader immunological response to sars-cov- infection by quantifying the production of cytokines involved in a representative subset of sars-cov- or healthy patients. sars-cov- patients exhibited significantly increased pro-inflammatory cytokine production (il- , il- , il- ) and increased chemotactic cytokine production (ip- , sdf- , mip- , mcp- and eotaxin) relative to non-infected individuals (figure ). of the sars-cov- -infected patients, mortality was associated with increased il- , il- , il- , ip- , and mcp- production. patients who succumbed
this response is critical for eradicating many pathogens. as many sars-cov- patients produced robust antibody responses to multiple antigens, we hypothesized that germinal center formation would be increased in these patients. to quantify this, we measured the serum concentration of cxcl , a critical mediator of germinal center formation and a biomarker of this immunological response , , , . we observed that cxcl production primarily correlated with peak antibody production to rbd and s antigens across sars-cov- infected patients (figure a-c). additionally, we observed that there was a significant increase in average production of cxcl in positive patients relative to negative sars-cov- patients. in addition, we discovered that cxcl production was
significantly increased in patients that did not survive sars-cov- infection compared to those that did (figure d). when we compared antibody and cxcl production based on patient survival over time, we observed that patients who did not survive sars-cov- infection exhibited a sustained increase in antibody and cxcl production relative to surviving patients (figure ef).
to n increased over a longer period than antibodies against rbd, or the s domain. this could be due to a variety of factors including antigen immunodominance , , incongruent antigen processing and availability , , differences in antibody utility and turnover, or prior exposure to similar rbd/s antigens of other coronaviruses. theoretically, as n is not expressed on the viral surface, b cells producing antibodies against this antigen may not be selected for as rapidly as those that are specific to the rbd or s antigens and may not possess neutralizing function. as infection worsens, more cells lyse. this may increase the local concentration of free nucleocapsid available for antigen processing and presentation, particularly in lymphoid tissue . in this respect, a more robust antibody response to nucleocapsid later in infection may be due to increased cellular damage. this may initiate a positive feedback loop where infected cells lyse and release nucleocapsid, which induces a less functional anti-nucleocapsid antibody response that fails to alleviate the cell lysis. more evidence is required to support these hypotheses, but these are interesting paradigms to consider in the context of anti-sars-cov- immunity.
lethal sars-cov- infection is significantly correlated with higher antibody production , , and is described further in this study. in analyzing antibody production between patient demographics, it was important to eliminate increased antibody production due to lethal infection as a source of bias. as such, our analyses presented here describe igg production of sars-cov- survivors grouped by demographic. there are a multitude of studies reporting differences in igg production between demographics including: trends in anti-sars-cov- antibody production between sexes , , - , a correlation of genetically encoded blood type with sars-cov- immunity , and variability in antibody production in the aging population , . from these prior studies and others , it is known that biological sex can impact antibody production during infection. we observed this phenomenon when quantifying sex-specific anti-s igg production. the anti-viral response is mediated in part by toll-like receptors which are differentially regulated between the sexes , . a higher frequency of anti-s igg in females would suggest an increased neutralizing response to the virus which has not been thoroughly evaluated to date. our data exhibited a modest difference in antibody production between sexes. as a result, we do not consider biological sex to be a major contributor to anti-
it is documented that red blood cell phenotypes can influence microbial pathogenesis as antigens can function as receptors and/or co-receptors for pathogenic organisms . historically, an association was identified between abo type and pathogen cov spike protein binding to ace .
although the underlying mechanism relating blood type to sars-cov- pathogenesis remains unclear, it appears there may be a relationship between abo blood type and coronavirus infection. recent data identified the q . locus (abo blood group locus) as potentially involved in susceptibility to covid- respiratory failure with evidence that type a phenotypes are at higher risk while type o phenotypes are partially protected . the data generated in these studies show an interesting pattern that may reinforce blood type related outcomes in severe disease due to a previously unreported association to the level and type of antibody response. as seen in figure c, the relative quantity of anti-rbd and anti-s antibodies was highest among anti-n antibodies. this is further accentuated by evaluating the ratio of anti-rbd or anti-s versus anti-n in our patient cohort which shows that higher n:rbd or n:s ratios are associated with poor prognosis (supplementary figure ). it is plausible that type-a individuals may have a misdirected humoral response due to antigenic homology between n-acetyl-galactosamine sugar moieties on the a antigen and spike protein resulting in molecular mimicry. this would result in type-o and -b individuals registering more spike protein epitopes as foreign and eliciting a more robust humoral response; in turn, this putative mechanism could reduce infectious dose and decrease the risk of mortality. further studies evaluating physiologic modifications of spike protein and its antigenic moieties would help support or disprove this theory.
as the conclusions from these observations are currently theoretical, a more extensive review of comorbid conditions, with a multivariate analysis and estimations of associated odds ratios, may reveal other associations outside of blood type. the aging process is associated with decreased t-cell functionality , resulting in hyperactive b-cell proliferation that does not confer immunity . we discovered that older patients typically produced more antibodies to rbd and s than younger patients. the lack of increase in antibody production to nucleocapsid in the elderly may be a function of antigen availability. to speculate, if elderly patients have higher viral loads due to decreased remediation of virus, this would increase the relative abundance of surface-exposed antigens (rbd and s ), but not necessarily hidden antigens (n). increased antibody production would therefore predominantly occur to rbd and s , and not n. other challenges are associated with studying this population, including co-presentations of multiple diseases, which complicate this analysis. regardless, our study has identified several patient demographics associated with differences in the anti-sars-cov- antibody response. the anti-viral immune response depends on a variety of signaling pathways mediated by cytokines and chemokines. many of the pro-inflammatory cytokines
of cardiac distress. separately, we discovered increased eotaxin production in sars-cov- patients. eotaxin was increased or similar to healthy patients during sars-cov-
antibody maturation signaling has not been investigated in the context of sars-cov- . we assessed the activity of the antibody maturation pathway by measuring cxcl concentrations in the serum of sars-cov- patients. increased cxcl in
to rbd (a), n (b), or s (c). correlation of antibody production to rbd vs. n (d) or s (e). correlation of antibody production to n vs. s (f). antibody production of anti-rbd (g), anti-n (h), or anti-s (i) antibodies by sars-cov- positive patients vs. days post sars-cov- disease onset.
analysis of serum cytokines in patients with severe acute respiratory syndrome the immunobiology of sars candidate genes associated with susceptibility for sars- cxcl /ip- in infectious diseases pathogenesis and potential therapeutic implications highly efficacious lymphocyte chemoattractant, stromal cell-derived factor (sdf- ) sdf- α and cxcr as therapeutic targets in cardiovascular disease stromal cell-derived factor- α is cardioprotective after myocardial infarction stromal derived factor α: a chemokine that delivers a two-pronged defence of the myocardium sars-cov- and cardiovascular complications: from molecular mechanisms to pharmaceutical management outcomes of cardiovascular magnetic resonance imaging in patients recently recovered from coronavirus disease clinical features of patients infected with novel coronavirus in wuhan role of eotaxin- (ccl ) and cc chemokine receptor in bleomycin-induced lung injury and fibrosis influenza virus a stimulates expression of eotaxin by nasal epithelial cells the β-chemokine receptors ccr and ccr facilitate infection by primary hiv- isolates t cell exhaustion networking at the level of host immunity: immune cell interactions during persistent viral infections
maturation of anti-sars-cov- antibodies. the significant increase of cxcl in patients with lethal disease suggests this may be an emergency response to uncontrolled infection. it is possible that sustained infection stimulates increased antibody affinity maturation that is unable to keep pace with viral replication and the cytokine storm. in this sense, cxcl could be used as a marker of sars-cov- disease severity.
there is a precedent for the utility of cxcl as a biomarker that is predictive of immune activation during hiv exposure , , . this adds credibility and feasibility for this utility, but further studies are required to validate this approach. we have provided a schematic of how the cxcl response interplays with our other observations of sars-cov- immunity in figure . to summarize, this study provides insight into the breadth of the immunological response against sars-cov- . we demonstrated increasing antibody production to multiple sars-cov- antigens over the first ten days of infection using a rapid-elisa assay. our results show that patient mortality, sex, blood type, and age impact antibody production to sars-cov- , adding to what is known about sars-cov- pathogenesis. furthermore, lethal sars-cov- infection triggers a pro-inflammatory cytokine response, in combination with the secretion of several chemotactic agents. interestingly, patients with lethal sars-cov- disease exhibited divergent cytokine production compared to patients with non-lethal disease. finally, we discovered that a marker of germinal center activity (cxcl ) is upregulated in sars-cov- patients, and that this upregulation is amplified in lethal disease. ultimately, these studies help to elucidate the interplay
-plex cytokine assays. pf produced rbd used in this study. amh analyzed and compiled assay data and figures. all authors took part in writing and editing the manuscript. we would like to thank bei resources for providing the following reagents (nr- ). we would finally like to express our gratitude to drs. laura gibson and clay marsh for enabling this research during the global pandemic.
production by sars-cov- production is compared to anti-rbd (a), anti-n (b), or anti-s (c) igg quantity over the course of patient disease. red arrows represent cxcl maxima, and green arrows represent local igg maxima. cxcl production was compared between sars-cov- negative (-) and positive (+) patients, and sars-cov- positive survivors (s), or non-survivors (d) (d). examples of a surviving patient producing low cxcl and low anti-rbd igg response (e) or deceased patient producing high cxcl and high anti-rbd igg response (f). statistical significance was assessed with a brown-forsythe and welch's one-way anova followed by tukey's multiple comparison test. **** = p< . , n.s. = not significant.
key: cord- -yovp squ authors: duan, liangwei; zheng, qianqian; zhang, hongxia; niu, yuna; lou, yunwei; wang, hui title: the sars-cov- spike glycoprotein biosynthesis, structure, function, and antigenicity: implications for the design of spike-based vaccine immunogens date: - - journal: front immunol doi: . /fimmu. . sha: doc_id: cord_uid: yovp squ the ongoing pandemic of coronavirus disease (covid- ), caused by severe acute respiratory syndrome coronavirus (sars-cov- ), poses a grave threat to global public health and imposes a severe burden on the entire human society. like other coronaviruses, the sars-cov- genome encodes spike (s) glycoproteins, which protrude from the surface of mature virions. the s glycoprotein plays essential roles in virus attachment, fusion and entry into the host cell. surface location of the s glycoprotein renders it a direct target for host immune responses, making it the main target of neutralizing antibodies. in the light of its crucial roles in viral infection and adaptive immunity, the s protein is the focus of most vaccine strategies as well as therapeutic interventions. in this review, we highlight and describe the recent progress that has been made in the biosynthesis, structure, function, and antigenicity of the sars-cov- s glycoprotein, aiming to provide valuable insights into the design and development of the s protein-based vaccines as well as therapeutics. the coronavirus disease (covid- ) global pandemic represents an unprecedented public health, social and economic challenge ( , ) .
the etiological agent of covid- is a new member of the coronaviridae family that is closely related to severe acute respiratory syndrome coronavirus (sars-cov) and was recently referred to as sars-cov- by the coronavirus study group of the international committee on taxonomy of viruses ( ). the virus has spread rapidly and sustainably around the globe, resulting in over twenty-one million cases and more than , deaths as of august , ( ). coronaviruses (covs) are enveloped positive-sense rna viruses ( ). enveloped covs enter host cells and initiate infection through the fusion of viral and cellular membranes ( , ). membrane fusion is mediated by the large type i transmembrane s glycoprotein on the viral envelope and the cognate receptor on the surface of host cells ( ) ( ) ( ). the surface-exposed location of the s glycoprotein not only allows it to carry out membrane fusion but also renders it a direct target for host immune responses, making it the major target of neutralizing antibodies ( ). because of its central roles in viral infection and eliciting protective humoral and cell-mediated immune responses in hosts during infection ( ), the s protein is the primary target for vaccine design as well as antiviral therapeutics ( ). here, we provide a comprehensive overview of the wealth of research related to the sars-cov- s glycoprotein biosynthesis, structure, function, and antigenicity, aiming to provide useful insights into the design and development of the s protein-based vaccines as well as therapeutics to prevent or treat the ongoing global spread of sars-cov- /covid- . the sars-cov- s glycoprotein is synthesized as a -amino acid polyprotein precursor on the rough endoplasmic reticulum (rer) (figure ) ( ).
the unprocessed precursor harbors an endoplasmic reticulum (er) signal sequence located at the n terminus, which targets the s glycoprotein to the rer membrane and is removed by cellular signal peptidases in the lumen of the er ( , ). a single stop-transfer, membrane-spanning sequence located at the c terminus of the s protein prevents it from being fully released into the lumen of the er and subsequent secretion from the infected cell ( , ). co-translationally, n-linked, high-mannose oligosaccharide side chains are added during synthesis ( , ). shortly after synthesis, the s glycoprotein monomers trimerize, which is thought to facilitate transport from the er to the golgi complex. once in the golgi complex, most of the high-mannose oligosaccharide side chains are modified to more complex forms ( , ), and o-linked oligosaccharide side chains are also added ( , ). in the trans-golgi network, the sars-cov- s glycoprotein is proteolytically cleaved by cellular furin or furin-like proteases at the s /s cleavage site, comprising multiple arginine residues that are not found in the closely related sars-cov ( , ). cleavage at the s /s site yields a surface subunit s , which attaches the virus to the host cell surface receptor, and a transmembrane subunit s , which mediates the fusion of viral and host cell membranes ( ). the s and s subunits remain associated through noncovalent interactions in a metastable prefusion state ( ). furin-like cleavage is essential for the s protein-mediated cell-cell fusion and viral infectivity, and is required for efficient sars-cov- infection of human lung cells ( ) and airway epithelial cells ( ).
following cleavage, an er retrieval signal (errs) consisting of a conserved kxhxx motif ( ) located at the extreme c terminus ensures that the mature sars-cov- s protein accumulates near the er-golgi intermediate compartment (ergic) ( , ), where, driven by interactions with another structural protein, the membrane (m) protein, the s protein participates in virus particle assembly and is incorporated into the virus envelope (figure ) ( , ). in addition, a fraction of mature sars-cov- s proteins travel through the secretory pathway to the plasma membrane, where they can mediate fusion of infected cells with uninfected cells to form multinucleated giant cells (syncytia) ( , ). this may allow direct spreading of the virus between cells and potentially alter the virulence of sars-cov- ( ). notably, a deletion of ~ amino acids containing the errs from the cytoplasmic tail of the sars-cov- s protein has been shown to increase the infectivity of single-cycle vesicular stomatitis virus (vsv)-s pseudotypes ( ) and replication-competent recombinant vsvs bearing the s glycoprotein ( , ), which likely could be translated straightforwardly to single-cycle human immunodeficiency virus (hiv)-s or other retrovirus-s pseudotypes ( ). presumably, this deletion may enhance the cell surface expression of the sars-cov- s glycoprotein ( ), thereby facilitating the s protein incorporation into pseudovirions and replication-competent virions. as mentioned above, the sars-cov- s glycoprotein plays pivotal roles in viral infection and pathogenesis. mature s glycoprotein on the viral surface is a heavily glycosylated trimer, each protomer of which is composed of amino acids (residues - ) (figure a). the surface subunit s is composed of amino acids (residues - ) and organized into four domains: an n-terminal domain (ntd), a c-terminal domain (ctd, also known as the receptor-binding domain, rbd), and two subdomains (sd and sd ) (figure a) ( ).
the transmembrane s subunit is composed of amino acids (residues - ) and contains an n-terminal hydrophobic fusion peptide (fp), two heptad repeats (hr and hr ), a transmembrane domain (tm), and a cytoplasmic tail (ct), arranged as fp-hr -hr -tm-ct (figure a) ( ). as a typical class i viral fusion protein ( ), the sars-cov- s glycoprotein shares common structural, topological and mechanistic features with other class i fusion proteins, including hiv envelope (env) glycoprotein and influenza virus haemagglutinin (ha) ( ) ( ) ( ). like other class i viral fusion proteins, the sars-cov- s glycoprotein is also a conformational machine that mediates viral entry by rearranging from a metastable unliganded state, through a prehairpin intermediate state, to a stable postfusion state ( , ). since the first genome sequence of sars-cov- became publicly available ( ), a number of structures have been determined for the sars-cov- s glycoprotein trimer fragments in both the prefusion and postfusion states (figures b-d) ( , , ). the overall architecture of the prefusion sars-cov- s ectodomain stabilized by two consecutive proline mutations in two conformations determined by single particle cryo-electron microscopy (cryo-em) is a ~ Å long trimer with a triangular cross-section, with the s subunit adopting a "v" shape contributing to the overall triangular appearance and the s subunit forming the stalk (figures b, c) ( , ). the structural difference between these two conformations only lies in the position of one of the three s rbds (figures b, c) ( ). when all three rbds are in the "down" position, the resulting s ectodomain trimer assumes a closed conformation, in which the receptor-binding surface of the s rbd is buried at the interface between protomers and is inaccessible to its receptor (figure b) ( ).
the s ectodomain trimer with a single rbd in the "up" position assumes a partially open conformation and represents the functional state, as the receptor-binding surface of the "up" rbd can be fully exposed ( figure c ) ( , ) . the structural information provides a blueprint for structure-based design of vaccine immunogens and entry inhibitors of sars-cov- .
figure legend: the life cycle of sars-cov- begins with membrane fusion occurring at the plasma membrane or within acidified endosomes after endocytosis, which is mediated by conformational changes in the s glycoprotein triggered by angiotensin-converting enzyme (ace ) binding. following viral entry, sars-cov- releases its genomic rna into the host cell cytoplasm. genomic rna is first translated into viral replicase polyproteins (pp a and ab), which are further cleaved by viral proteases into a total of nonstructural proteins. a replication-transcription complex (rtc) is formed based on many of these nonstructural proteins. in the process of genome replication and transcription mediated by rtc, the negative-sense (− sense) genomic rna is synthesized and used as a template to produce positive-sense (+ sense) genomic rna and subgenomic rnas. the nucleocapsid (n) structural protein and viral rna are replicated, transcribed, and synthesized in the cytoplasm, whereas other viral structural proteins, including the s protein, membrane (m) protein and envelope (e) protein, are transcribed and then translated in the rough endoplasmic reticulum (rer) and transported to the golgi complex. in the rer and golgi complex, the sars-cov- glycoprotein is subjected to co-translational and post-translational processing, including signal peptide removal, trimerization, extensive glycosylation and subunit cleavage. the n protein subsequently associates with the positive-sense genomic rna to become a nucleoprotein complex (nucleocapsid), which, together with s, m, and e proteins as well as other viral proteins, is further assembled and buds into the lumen of the er-golgi intermediate compartment (ergic) to form mature virions. finally, the mature virions are released from the host cell, waiting for a new life cycle to start. this figure is adapted from the template in biorender (https://biorender.com/).
in the closed sars-cov- s ectodomain trimer, interprotomer interactions occur through the s ctd packed against the other two s ctds and one ntd from an adjacent protomer because of domain swapping and through s , primarily via helical interactions formed by the upstream and central helices from each subunit around the trimer axis ( figure b ) ( ) . the s subunits rest above the s trimer, stabilizing the latter in the prefusion conformation ( figure b ) ( ) . when the s ectodomain trimer adopts a partially open conformation, the rbd in the "up" position abolishes its contacts with the s subunit of an adjacent protomer, destabilizing the partially open conformation ( figure c ) ( , ) . this favors dissociation of the s subunit and facilitates the conformational rearrangements that the s trimer undergoes to mediate viral entry. prefusion structures of human coronavirus hku (hcov-hku ) and mouse hepatitis virus s protein ectodomains without two consecutive proline mutations reveal only the fully closed conformation ( , ) , similar to that observed for a full-length, wild-type prefusion form of the sars-cov- s glycoprotein ( ) . notably, it is well established that trimeric prefusion hiv- env primarily resides in a closed configuration that is conformationally masked to evade antibody-mediated neutralization ( , ) and can spontaneously sample a transient, functional configuration ( ) . 
it can thus be speculated that native cov s glycoproteins on mature and infectious virions share a similar conformational masking feature ( ) , concealing the receptor-binding surface (for those utilizing ctds as rbds) ( figure c ), which is further discussed below.
several lines of research have established that angiotensin-converting enzyme (ace ) is an entry receptor for sars-cov- ( ) ( ) ( ) . detailed interactions between the sars-cov- rbd and its receptor ace have been revealed by several structures of ace in complex with rbd ( ) ( ) ( ) ( ) . structurally, rbd consists of two subdomains: a core and an external subdomain ( , ) . an extended loop (residues - ), which lies on one edge of the core subdomain, presents a gently concave surface to cradle the n-terminal helix (a ) of ace . analysis of the interface between the sars-cov- rbd and ace reveals that a total of residues in rbd are in contact with amino acids in ace , forming a network of hydrophilic interactions that are suggested to dominate virus-receptor engagement ( ) . outside this extended loop, residue lys , located in helix a of the core subdomain, was shown to form ionic interactions with asp of ace . as the extended loop contains almost all the amino acids of the sars-cov- rbd that contact ace , it is referred to as the receptor-binding motif (rbm) ( ) . it has been proposed that inhibiting the interaction between rbd and ace might be useful in treating sars-cov- infection. recombinant soluble ace ( ) and ace -fc ( , ) have been shown to have potential applications in the prevention and treatment of sars-cov- infection in vitro. as the interaction between the rbd and ace is extensive, small molecules are unlikely to block virus entry effectively by targeting the interaction interface. however, peptides would be able to engage most of the residues belonging to rbm ( ) . 
a pioneering study demonstrated that a -amino acid peptide (residues - ), derived from the n-terminal helix (a ) of ace , specifically associates with the sars-cov- rbd with low nanomolar affinity and disables receptor interactions ( ), representing a promising strategy for preventing the virus from invading human cells. in another study, a -amino acid peptide (residues - ), derived from the n-terminal back-to-back helices (a and ) and composed of most of the residues of ace that mediate interactions with the s protein, shows a similar but probably more potent inhibitory effect ( ) . the formation of a trimer-of-hairpins structure (also known as six-helix bundle) comprising hr and hr in the postfusion conformation is a unifying feature of class i viral fusion proteins ( ) . the crystal structure of a protein construct in which sars-cov- hr and hr were connected by a six-residue hydrophilic flexible linker was determined to be a canonical six-helix bundle structure with a rod-like shape ∼ Å in length and ∼ Å in diameter ( ) . three hr helices form a parallel central coiled-coil with three hr helices packing in an oblique, antiparallel manner against deep hydrophobic grooves on the surface of the central coiled-coil ( ) . notably, when a full-length s protein construct bearing the native furin-like cleavage site was transiently expressed by expi f cells, the purified s proteins contained the dissociated s trimer in the postfusion conformation ( ) . the cryo-em structure of this trimeric postfusion s shows that the central helix (ch) extended regular helices from the central coiled-coil, oriented toward target cells ( figure d ) ( ) , which forms the longest central triple helical coiled-coil (~ Å) among all known class i transmembrane subunit structures. 
the sars-cov- s trimer in the pre-hairpin intermediate state is very unstable and is only transiently present in vivo after triggering by ace engagement, stymieing structural characterization of the s protein in this state ( ) . however, although this fusion-intermediate phase is very short, it is long enough for inhibitory peptides to associate with the prehairpin intermediate and block the six-helix bundle formation ( ) . furthermore, it has already been shown that the hr regions in various human covs are highly conserved ( ) , and therefore could serve as an attractive target for the design and development of potent and broad-spectrum inhibitors of pan-covs, including sars-cov- . a highly potent pan-coronavirus fusion inhibitor, ek c , has been reported to have good prophylactic and therapeutic potential against sars-cov- infection ( ).
as mentioned earlier, the sars-cov- s proteins are heavily decorated by heterogeneous n-linked glycans projecting from the s trimer surface. the sars-cov- s sequence encodes up to n-linked glycan sequons per protomer, which likely play an important role in protein folding ( ) and host immune evasion as a glycan shield ( ) . of the potential n-linked glycosylation sites on the s protein, were identified to be predominantly occupied by processed, complex-type glycans ( ) . the remaining eight sites were found to be dominated by oligomannose-type glycans, which are divergent from those found on host glycoproteins ( ) . although glycosylation sites (n , n , n ) proximal to the receptor-binding sites on the sars-cov- s protein can be observed, ace bound to the glycosylated and deglycosylated s ectodomains with nearly identical affinity ( . nm vs . nm), as determined by a biolayer interferometry binding assay ( ) . this observation suggests that the high binding affinity between the sars-cov- s protein and ace does not depend on the s protein glycosylation. 
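the n-linked glycan sequons counted above follow the general n-x-s/t pattern, where x is any residue except proline. as a minimal sketch of how such sites could be listed from a protein sequence, the following python snippet scans for that pattern. the function name and the toy sequence are hypothetical and are not taken from the cited studies; a sequon marks only a potential site, and whether it is actually occupied has to be determined experimentally, as in the site-specific analyses described above.

```python
import re

# N-linked glycosylation sequon: Asn-X-Ser/Thr, where X is any residue except Pro.
# The lookahead keeps the match one residue wide, so overlapping sequons
# (for example the two Asn residues in "NNST") are both reported.
SEQUON = re.compile(r"N(?=[^P][ST])")


def find_sequons(protein_seq):
    """Return the 1-based positions of the Asn in every N-X-[S/T] sequon (X != Pro)."""
    seq = protein_seq.upper()
    return [match.start() + 1 for match in SEQUON.finditer(seq)]


# Hypothetical toy sequence for illustration only; it is not the spike protein.
toy_sequence = "MNGTAANPSLLNNSTK"
print(find_sequons(toy_sequence))  # [2, 12, 13]
```

in the toy sequence, the asparagine at position 7 is skipped because it is followed by proline, while the adjacent asparagines at positions 12 and 13 are both reported.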
when the site-specific n-linked glycans are mapped onto the prefusion structure of the sars-cov- s ectodomain ( ), the resulting model exhibits substantially higher levels of glycan-free surface than that revealed by structures of fully glycosylated, trimeric hiv- env ectodomains ( , ) . this suggests that the sars-cov- s protein is covered by a less dense and less effective glycan shield compared to viral glycoproteins from hiv- ( , ) and lassa virus ( ) , which may be beneficial for the induction of humoral immunity and could be good news for a sars-cov- vaccine ( ) . notably, it has been shown that multiple major viral surface antigens have neutralizing epitopes that are partly or even exclusively composed of carbohydrate moieties ( , ) , exemplified by the hiv- env spike, which can be recognized by a large number of carbohydrate-binding antibodies, including g , pg , pg , ch , pgt , pgt , pgt , and pgt ( , ) . in the case of sars-cov- , more recently a potent neutralizing antibody against both sars-cov and sars-cov- , s , has been shown to recognize a highly conserved glycan-containing rbd epitope ( ) . these observations suggest that carbohydrate moieties could be immunogenic and highlight the need for immunogens to display the glycans important for the recognition of neutralizing antibodies ( ) ; in support of this, specific n-linked glycans on hemagglutinin have been shown to be essential for the elicitation of broadly neutralizing antibodies against influenza ( ) . accordingly, there has been mounting interest in exploring the potential of immunogenic glycan moieties as vaccine candidates against multiple viruses, including sars-cov- ( , ) .
membrane fusion and viral entry of sars-cov- are initiated by binding of rbd in the viral s glycoprotein transiently sampling the functional conformation to ace on the surface of target cells (figure ) ( ). 
after receptor engagement at the plasma membrane or ensuing virus endocytosis by the host cell ( ), a second cleavage (s ′ cleavage site) is generated, which is mediated by a cellular serine protease tmprss ( ) or endosomal cysteine proteases cathepsins b and l ( ) (figure ) . protease cleavage at the s ′ site frees the fusion peptide from the new s n-terminal region, further destabilizes the sars-cov- s glycoprotein and may initiate the s -mediated membrane fusion cascade. following the second cleavage, the fusion peptide at the n terminus of the s trimer is inserted into the host membrane ( ) , forming the pre-hairpin intermediate state ( ) . since the pre-hairpin intermediate state is extremely unstable, the s fusion protein is refolded quickly and irreversibly into the stable postfusion state ( , ) . these large conformational rearrangements pull the viral and host cell membranes into close proximity, leading ultimately to membrane fusion ( , ) .
since sars-cov- was identified as the causative agent of covid- , and its first genome sequence was released immediately and freely by a chinese research group ( ), sars-cov- vaccine candidates based on various vaccine platforms, such as inactivated or live attenuated vaccines, dna and mrna vaccines, viral vector-based vaccines, and recombinant protein-based vaccines, have been developed ( , ) . most of these vaccine strategies are based on the full-length s glycoprotein, the major viral surface antigen ( ) . when a vaccine strategy requires that the sars-cov- s protein be recombinantly expressed in the human body, the errs should be omitted to enhance the cell surface expression level of the resulting protein. 
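as a rough illustration of the errs omission recommended above, the following python sketch checks whether a sequence ends in a kxhxx-type motif (lysine, any residue, histidine, then any two residues at the extreme c terminus) and removes a chosen number of c-terminal residues. the function names, the toy tail sequence and the truncation length are all hypothetical; the length of the deletion used in the cited studies is not reproduced here, so it is left as a parameter.

```python
import re

# "KxHxx" at the extreme C terminus: Lys, any residue, His, any two residues, end.
ERRS_PATTERN = re.compile(r"K.H..$")


def has_c_terminal_errs(protein_seq):
    """Return True if the last five residues match the K-x-H-x-x pattern."""
    return ERRS_PATTERN.search(protein_seq.upper()) is not None


def truncate_c_terminus(protein_seq, n_residues):
    """Return the sequence with n_residues removed from its C-terminal end."""
    if not 0 <= n_residues <= len(protein_seq):
        raise ValueError("n_residues must be between 0 and the sequence length")
    return protein_seq[: len(protein_seq) - n_residues]


# Hypothetical cytoplasmic-tail fragment for illustration only.
toy_tail = "GSAGSKLHYT"
print(has_c_terminal_errs(toy_tail))      # True
trimmed = truncate_c_terminus(toy_tail, 5)
print(trimmed)                            # GSAGS
print(has_c_terminal_errs(trimmed))       # False
```

a check of this kind only confirms that the motif is absent from the designed construct; whether surface expression actually increases would still need to be measured.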
theoretically, the native hiv- env trimer present on the surface of intact virions is thought to be an ideal immunogen ( ) , as most of the neutralizing antibodies described thus far recognize and bind to the prefusion form of trimeric hiv- env, although such neutralizing antibodies against this glycan-covered, sequence-variable native form are induced only with great difficulty ( ) . for sars-cov- , different lines of research have shown that convalescent sera from sars-cov and sars-cov- patients show no or limited cross-neutralization activity against these two viruses in pseudotyped and authentic viral infection assays, despite significant cross-reactivity in binding to the s glycoproteins of both viruses ( , ( ) ( ) ( ) . similar results were also observed in infected or immunized animals ( , , ) . although the sars-cov- s protein shares a high degree of amino acid sequence identity with that of sars-cov (~ % overall), the rbm is less conserved (~ % identity) than any other functional region or domain ( ) . taken together, these findings suggest that the rbm carries the most immunodominant neutralizing epitope(s) of the whole s protein, capable of readily eliciting strong neutralizing antibody responses. however, the native trimeric sars-cov- s protein could conceal each of its immunodominant rbms by adopting the closed conformation ( , ) . therefore, sars-cov- also evades immune surveillance through conformational masking, which is well documented for hiv- ( , ) ; at the same time, the s protein could transiently sample the functional state to engage ace , consistent with the notion that the fusion glycoproteins of highly pathogenic viruses have evolved to perform their functions while evading host neutralizing antibody responses. 
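the overall and rbm-level identity values quoted above are the kind of figure that can be derived from a pairwise alignment of the two s protein sequences. the sketch below shows one common way to compute percent identity from two already aligned sequences. it assumes that gapped columns are excluded from the denominator, which is only one of several conventions and may differ from the one used in the cited analysis; the function name and the toy alignment are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over alignment columns in which neither sequence has a gap.

    Assumption: gapped columns are skipped entirely. Other conventions divide by
    the full alignment length or by the length of the shorter sequence.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap or b == gap:
            continue  # skip columns where either sequence has a gap
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Hypothetical toy alignment; these are not real spike sequences.
seq_a = "ACDEFGHIKL"
seq_b = "ACDQFGH-KL"
print(round(percent_identity(seq_a, seq_b), 1))            # whole alignment: 88.9
print(round(percent_identity(seq_a[2:6], seq_b[2:6]), 1))  # a sub-region: 75.0
```

slicing the same alignment to a region of interest, as in the second call, gives a region-level identity that can be compared with the whole-alignment value, which mirrors the comparison between the rbm and the full s protein made in the text.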
another concern for vaccine candidates based on the full-length s glycoprotein of sars-cov- is raised by the observation that the s subunit could spontaneously dissociate from the s glycoprotein, probably as a trimer that still assumes the rbd closed conformation, leaving only the postfusion s trimer ( ) . the resulting s and s subunits might expose immunodominant, non-neutralizing epitopes that sars-cov- uses as decoys to distract the host immune system, inducing a large proportion of ineffective antibody responses, as documented for hiv- ( ) and respiratory syncytial virus (rsv) ( ) . it should be noted that although vaccine candidates based on the full-length s protein of the closely related sars-cov could elicit neutralizing antibody responses against sars-cov infection, they may also induce harmful immune responses, including liver damage in vaccinated animals, infection of human immune cells by sars-cov, and antibody-dependent enhancement of sars-cov infection ( ) ( ) ( ) ( ) ( ) . therefore, although the s proteins of both sars-cov and sars-cov- are thought to be promising vaccine immunogens for generating protective immunity, optimizing antigen design is critical to ensure an optimal immune response by exposing more neutralizing epitopes and displaying fewer weakly neutralizing or non-neutralizing epitopes ( ) . vaccines containing or expressing the full-length s protein or its soluble ectodomain form should thus be engineered to sample an rbd(s) "up" conformation while the rest is still kept in the prefusion state ( , ) . apart from recombinant, soluble, stabilized ectodomains that are engineered to expose the immunodominant rbd by adopting the rbd(s) "up" conformation, rbd proteins of sars-cov and sars-cov- have also been widely used as recombinant protein-based vaccines ( , ( ) ( ) ( ) . 
the rbd of sars-cov is highly immunogenic ( , ) and is targeted by most of the neutralizing monoclonal antibodies that have been characterized ( ) . based on the observation that a -amino acid fragment (residues - ) was previously identified to be the minimal rbd region of sars-cov ( ), a corresponding -amino acid fragment (residues - ) can be readily selected as the minimal rbd region of sars-cov- and has already been characterized ( ) . this minimal form of rbds of both viruses could serve as a vaccine candidate ( ) . however, a conserved cysteine residue is located immediately upstream of the minimal rbd fragments of both viruses and forms a disulfide bond in nearly all published structures containing this residue ( , ) ; this is also the case for middle east respiratory syndrome coronavirus (mers-cov) ( , ) and hcov-hku ( ), consistent with the observation that all rbds of these viruses share a conserved structural core. the disulfide bond contributes to stabilization of the rbd structure and likely modulates the protein immunogenicity. this notion is consistent with the observation that mice immunized with a longer form of the sars-cov rbd (residues - ) produced a higher titer of neutralizing antibodies compared with mice immunized with the minimal rbd region (residues - ) ( ) . therefore, when the minimal rbd fragment of sars-cov or sars-cov- is used as a vaccine candidate, the critical cysteine residue should be included ( ) . besides the rbd, which has been shown to be a major target for human neutralizing antibody responses ( ), the ntd was recently identified as a new vulnerable site of the sars-cov- s protein for antibody neutralization and therefore could also serve as a recombinant protein-based vaccine ( ) ( ) ( ) . as expected, ntd-specific neutralizing antibodies could target the s protein in both closed and open conformations ( ) . 
in addition, the apparent accessibility of the fusion peptide and hr region in published structures of the sars-cov- s ectodomain trimer, as well as their high sequence conservation among covs, suggests that they would be good immunogen candidates for epitope-focused vaccine design aimed at raising broadly neutralizing antibodies against covs ( ) . epitope-focused vaccine design has proven to be successful in generating neutralizing antibodies against rsv fusion glycoprotein ( ). however, neutralizing antibodies targeted against these two regions still need to be isolated from infected individuals to support this notion. unlike the wild-type full-length s protein of sars-cov- , the above monomeric fragments do not induce any infection-enhancing antibodies or harmful immune or inflammatory responses ( , ) , all of which could be potentially avoided through structure-based immunogen design to improve immunogenicity ( , ) . however, the wild-type full-length or soluble ectodomain form of the sars-cov- s protein could trigger stronger cellular immune responses ( ) , which have been demonstrated to play an important role in controlling diseases caused by covs ( , ) , including sars-cov- ( ) , and are probably also an important determinant of effective vaccines against sars-cov- ( , ) . additionally, when more than one rbd of the s protein trimer is engineered to be locked in the "up" conformation ( , ) , the antigenicity and immunogenicity of the resulting rbds would be significantly enhanced compared to the monomeric rbd form ( , ) . moreover, improved protection is likely to be achieved by vaccination with the full-length or soluble ectodomain form of the sars-cov- s protein, in that both forms can elicit neutralizing antibodies directed against non-rbd sites, as observed for mers-cov ( ) . 
genetic variation has been used by many viruses that have rna genomes ( ) , including hiv and influenza, as a mechanism to avoid antibody-mediated immunity, and is partially responsible for the great difficulty in developing effective and durable vaccines against these viruses ( ) . as an rna virus, however, sars-cov- has a very low mutation rate overall ( ) , likely because covs have a genetic proofreading mechanism ( ) . all reported variations in the sars-cov- s glycoprotein have a prevalence of no more than % ( ) , with the exception of d g, which has become the most prevalent genotype in the global covid- pandemic ( ) . fortunately, although the d g mutation of the sars-cov- s protein has been shown to enhance viral infectivity ( ) ( ) ( ) , there is so far no evidence that infection with sars-cov- carrying the g variant is associated with greater disease severity ( , ) . furthermore, assays using both monoclonal and polyclonal antibodies generated from individuals naturally infected with d - or g -carrying viruses demonstrated that the d g mutation retains or even increases viral susceptibility to neutralization ( , , , ) . this suggests that the d g mutant maintains or favors an open, functional conformational state ( ) . although at an extremely low frequency, natural variations, including l r a v, v a, and f l, that render the s glycoprotein resistant to certain neutralizing antibodies targeting the rbd emerged in the absence of selection pressure exerted by approved vaccines, neutralizing antibodies or entry inhibitors ( , ) . however, it has been shown that sars-cov- escape mutants could be easily selected and quickly amplified under the selection pressure of single antibody treatment ( ) . 
these observations suggest that a combination of at least two neutralizing antibodies that recognize and bind to distinct and non-overlapping epitopes on the sars-cov- s glycoprotein (e.g., rbd and ntd, as well as hr and glycan) is required to restrict the possible occurrence of viral escape mutants and potential subsequent loss of single antibody-mediated neutralization ( ) ( ) ( ) ( ) . when these observations are taken into consideration for vaccine design and development, an ideal sars-cov- immunogen should contain as many exposed neutralizing epitopes as possible, although the rbd also possesses extra epitope(s) besides the epitope in the rbm region ( , ( ) ( ) ( ) .
sars-cov- is a highly contagious pathogen that continues to spread quickly around the globe, making covid- one of the worst pandemics in recorded history. a safe and efficacious vaccine represents one of the best ways to reduce or eliminate the covid- pandemic ( ) . unfortunately, no vaccines for any of the known human covs have been licensed ( , ) , although several potential sars-cov and mers-cov vaccines have been in human clinical trials for years ( , ) , suggesting that the development of effective vaccines against human covs has always been challenging. however, it has been shown that both sars-cov and sars-cov- could readily induce neutralizing antibodies following natural infection or immunization ( ) ( ) ( ) ( ) . moreover, a growing number of neutralizing monoclonal antibodies targeting the sars-cov- s glycoprotein with high potency have been isolated from many convalescent donors ( ) as well as humanized mice ( , ) , some of which have been shown to afford protection against sars-cov- challenge in animal models. it thus seems that vaccine candidates designed to elicit such neutralizing antibodies are feasible. it is widely accepted that the s protein of sars-cov- is a highly promising immunogen for producing protective immunity ( ) . 
however, it is likely that the s protein has evolved to perform its functions while evading host neutralizing antibody responses and thus should be engineered to ensure an optimal immune response ( , ) . the immunogen design strategies described in this review, which draw on the wealth of research on the biosynthesis, structure, function, antigenicity and immunogenicity of the sars-cov- s glycoprotein, will likely contribute to the ultimate success of safe and efficacious vaccines against sars-cov- /covid- .
covid- : emergence, spread, possible treatments, and global burden highlight of immune pathogenic response and hematopathologic effect in sars-cov, mers-cov, and sars-cov- infection coronaviridae study group of the international committee on taxonomy of viruses. the species severe acute respiratory syndrome-related coronavirus: classifying -ncov and naming it sars-cov- an interactive web-based dashboard to track covid- in real time origin and evolution of pathogenic coronaviruses viral membrane fusion cell entry mechanisms of sars-cov- coronavirus membrane fusion mechanism offers a potential target for antiviral development characterization of spike glycoprotein of sars-cov- on virus entry and its immune cross-reactivity with sars-cov mechanisms of coronavirus cell entry mediated by the viral spike protein structure, function, and antigenicity of the sars-cov- spike glycoprotein sars-cov- vaccines: status report composition and divergence of coronavirus spike proteins and host ace receptors predict potential intermediate hosts of sars-cov- n-linked protein glycosylation in the endoplasmic reticulum protein folding in the endoplasmic reticulum important role for the transmembrane domain of severe acute respiratory syndrome coronavirus spike protein during entry snapshot: n-glycosylation processing pathways across kingdoms n-linked protein glycosylation in the er intracellular functions of n-linked glycans mechanisms and principles of n-linked protein 
glycosylation glycosylation quality control by the golgi structure the proximal origin of sars-cov- snapshot: o-glycosylation pathways across kingdoms a multibasic cleavage site in the spike protein of sars-cov- is essential for infection of human lung cells the spike glycoprotein of the new coronavirus -ncov contains a furin-like cleavage site absent in cov of the same clade tmprss and furin are both essential for proteolytic activation and spread of sars-cov- in human airway epithelial cells and provide promising drug targets intracellular targeting signals contribute to localization of coronavirus spike proteins near the virus assembly site the intracellular sites of early replication and budding of sarscoronavirus the cytoplasmic tail of the severe acute respiratory syndrome coronavirus spike protein contains a novel endoplasmic reticulum retrieval signal that binds copi and promotes interaction with membrane protein the contribution of the cytoplasmic retrieval signal of severe acute respiratory syndrome coronavirus to intracellular accumulation of s proteins and incorporation of s protein into viruslike particles properties of coronavirus and sars-cov- a replication-competent vesicular stomatitis virus for studies of sars-cov- spike-mediated cell entry and its inhibition measuring sars-cov- neutralizing antibody activity using pseudotyped and chimeric viruses cryo-em structure of the -ncov spike in the prefusion conformation the coronavirus spike protein is a class i virus fusion protein: structural and functional characterization of the fusion core complex structure and immune recognition of trimeric pre-fusion hiv- env pre-fusion structure of a human coronavirus spike protein common features of enveloped viruses and implications for immunogen design for next-generation vaccines viral membrane fusion a new coronavirus associated with human respiratory disease in china distinct conformational states of sars-cov- spike protein cryo-electron microscopy 
structure of a coronavirus spike glycoprotein trimer hiv- evades antibody-mediated neutralization through conformational masking of receptor-binding sites conformational masking and receptor-dependent unmasking of highly conserved env epitopes recognized by non-neutralizing antibodies that mediate potent adcc against hiv- conformational dynamics of single hiv- envelope trimers on the surface of native virions immunogenicity and structures of a rationally designed prefusion mers-cov spike antigen a pneumonia outbreak associated with a new coronavirus of probable bat origin sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor functional assessment of cell entry and receptor usage for sars-cov- and other lineage b betacoronaviruses structural basis of receptor recognition by sars-cov- structure of the sars-cov- spike receptor-binding domain bound to the ace receptor structural and functional basis of sars-cov- entry by using human ace structural basis for the recognition of sars-cov- by full-length human ace inhibition of sars-cov- infections in engineered human tissues using clinical-grade soluble human ace neutralization of sars-cov- spike pseudotyped virus by recombinant ace -ig novel ace -igg fusions with increased activity against sars-cov- . 
all authors listed have made a substantial, direct, and intellectual contribution to the work, and approved it for publication. we would like to thank prof. xinqi liu for critical reading of the manuscript; and drs. yanbin feng, mengyuan xu, jing ma and jianrong feng for helpful comments and discussions on the manuscript. 
the authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. copyright © duan, zheng, zhang, niu, lou and wang. this is an open-access article distributed under the terms of the creative commons attribution license (cc by). the use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. no use, distribution or reproduction is permitted which does not comply with these terms.

key: cord- -jaom p s authors: he, yuxian; li, jingjing; jiang, shibo title: a single amino acid substitution (r a) in the receptor-binding domain of sars coronavirus spike protein disrupts the antigenic structure and binding activity date: - - journal: biochemical and biophysical research communications doi: . /j.bbrc. . . sha: doc_id: cord_uid: jaom p s

abstract the spike (s) protein of severe acute respiratory syndrome coronavirus (sars-cov) has two major functions: interacting with the receptor to mediate virus entry and inducing protective immunity. coincidently, the receptor-binding domain (rbd, residues – ) of the sars-cov s protein is a major antigenic site for inducing neutralizing antibodies. here, we used rbd-fc, a fusion protein containing the rbd and human igg fc, as a model and found that a single amino acid substitution in the rbd (r a) could abolish the ability of the rbd to induce neutralizing antibodies in immunized mice and rabbits. with a panel of anti-rbd mabs as probes, we observed that the r a substitution disrupted the majority of neutralizing epitopes in the rbd, suggesting that this residue is critical for the antigenic structure responsible for inducing protective immune responses. 
we also demonstrated that the rbd-fc bearing the r a mutation could not bind to soluble or cell-associated angiotensin-converting enzyme (ace ), the functional receptor for sars-cov, and failed to block s protein-mediated pseudovirus entry, indicating that this point mutation also disrupted the receptor-binding motif (rbm) in the rbd. taken together, these data provide direct evidence that a single amino acid residue at a key position in the rbd can determine the major functions of the sars-cov s protein, with implications for designing sars vaccines and therapeutics.

the spike (s) protein of severe acute respiratory syndrome coronavirus (sars-cov), similar to those of other coronaviruses, is a large type i transmembrane glycoprotein, which is incorporated into the viral envelope and provides the virion with a corona-like appearance [ , ] . unlike those of many other coronaviruses, the s protein of sars-cov may not be cleaved in the virus-producing cells [ , ] ; however, two domains corresponding to the n-terminal s subunit and the c-terminal s subunit of processed coronaviruses can be defined by sequence alignment [ , ] . the s subunit of coronavirus s protein forms the surface knob-like structure, whereas the s subunit is membrane-anchored and forms the stem-like structure beneath the knob [ , ] . the binding of a coronavirus to its specific receptor on the target cell is the initial step of infection [ ] [ ] [ ] . angiotensin-converting enzyme (ace ) is a functional receptor for sars-cov [ ] [ ] [ ] . the s protein of sars-cov can bind to ace with high affinity and mediates viral entry. a residue fragment within s domain (residues - ) has been defined as a minimal receptor-binding domain (rbd) (fig. ) [ , , ] . the crystal structure of an independently folded rbd bound to human ace reveals that residues - constitute the receptor-binding motif (rbm) [ ] . 
the s domain of sars-cov s protein contains a putative fusion peptide and two heptad repeat regions (hr and hr ), which can associate to form a six-helix bundle comprised of three helices from hr that run antiparallel to three helices from hr [ ] [ ] [ ] . the second major function of coronavirus s protein is its capacity to elicit neutralizing antibodies and sterilizing immunity, and it is therefore considered a critical immunogen for vaccine development [ , ] . similarly, it has been shown that the s protein of sars-cov is a major protective antigen among four structural proteins [ ] . several live virus and dna vaccines expressing the s protein have been tested in preclinical studies [ ] [ ] [ ] . coincidently, the rbd of sars-cov s protein is a major target of neutralizing antibodies induced in patients infected with sars-cov and in animals immunized with inactivated viruses or s proteins [ ] [ ] [ ] . we previously demonstrated that rbd-fc, a fusion protein containing the rbd linked to the fc portion of human igg , is a potent inducer of neutralizing antibodies and has the potential to be developed as a subunit vaccine [ , ] . several conformation-dependent neutralizing epitopes (conf i-vi) were identified in the rbd [ ] . although the rbd is a major neutralizing domain of sars-cov, the s protein also contains neutralizing epitopes in other regions [ ] [ ] [ ] . interestingly, it was recently reported that the major functions of the full-length s protein, i.e., mediating viral entry and inducing neutralizing antibodies, could be abolished by single amino acid substitutions in the rbd (e.g., r a) [ ] . in this study, we used the rbd-fc as a model molecule to further investigate the impact of the r a substitution on the immunogenicity and receptor-binding activity of the independently folded rbd. our data are important for understanding how a single amino acid residue determines the major functions of sars-cov s protein.

recombinant s proteins and mabs. 
the plasmid encoding the receptor-binding domain (residues - ) of sars-cov s protein tor (accession no. ay ), fused with the fc portion of human igg (rbd-fc), was previously described [ , ] . rbd-fc bearing the r a substitution (designated as rbd-r a) was generated by mutagenesis using the quikchange xl kit (stratagene) and verified by dna sequencing. each of the recombinant fusion proteins was expressed in t cells transfected with the plasmid using fugene reagents (boehringer mannheim, indianapolis, in) according to the manufacturer's protocol and purified by protein a-sepharose fast flow (amersham biosciences, piscataway, nj). the full-length s protein (fl-s) of sars-cov urbani (accession no. ay ) was expressed in expressf+ insect cells with recombinant baculovirus d by the protein sciences corporation (bridgeport, ct). a panel of mabs specific for the rbd of sars-cov s protein was prepared in our laboratory, which include mabs isolated from mice immunized with rbd-fc [ ] , mabs isolated from mice immunized with fl-s, and mabs isolated from mice immunized with inactivated sars-cov.

immunization of mice and rabbits. rbd-fc and its mutant rbd-r a were each used to immunize mice and rabbits. four female balb/c mice ( weeks old) per group were subcutaneously immunized with μg of purified proteins re-suspended in pbs plus mlp + tdm adjuvant (sigma, st. louis, mo) and boosted with μg of the same antigen plus the mlp + tdm adjuvant at -week intervals. four nzw rabbits ( weeks old) per group were immunized intradermally with μg of purified proteins re-suspended in phosphate-buffered saline (pbs, ph . ) in the presence of freund's complete adjuvant (fca), and boosted three times with a freshly prepared emulsion of μg immunogen and freund's incomplete adjuvant (fia) at -week intervals. pre-immune sera were collected before starting the immunization and antisera were collected days after each boost. sera were kept at °c before use.

enzyme-linked immunosorbent assay. 
the reactivity of mouse and rabbit antisera or anti-rbd mabs with s proteins (rbd-fc, rbd-r a or fl-s) was determined by enzyme-linked immunosorbent assay (elisa). briefly, μg/ml recombinant protein was used to coat -well microtiter plates (corning costar, acton, ma) in . m carbonate buffer (ph . ) at °c overnight. after blocking with % non-fat milk, serially diluted antisera or mabs were added and incubated at °c for h, followed by four washes with pbs containing . % tween . bound antibodies were detected with hrp-conjugated goat anti-mouse igg or goat anti-rabbit igg (zymed) at °c for h, followed by washes. the reaction was visualized by addition of the substrate , , , -tetramethylbenzidine (tmb) and absorbance at nm was measured with an elisa plate reader (tecan us, research triangle park, nc).

generation of sars pseudovirus and neutralization assay. the sars-cov pseudovirus system was developed in our laboratory as previously described [ , ] . in brief, hek t cells were co-transfected with a plasmid encoding the s protein corresponding to the sars-cov tor isolate and a plasmid encoding an env-defective, luciferase-expressing hiv- genome (pnl - .luc.re) by using fugene reagents (boehringer mannheim). supernatants containing sars pseudovirus were harvested h post-transfection and used for single-cycle infection of human ace transfected t ( t/ace ) cells. briefly, t/ace cells were plated at cells/well in -well tissue-culture plates and grown overnight. the supernatants containing pseudovirus were pre-incubated with serially diluted mouse or rabbit antisera at °c for h before addition to cells. the culture was re-fed with fresh medium h later and incubated for an additional h. cells were washed with pbs and lysed using the lysis reagent included in a luciferase kit (promega, madison, wi). aliquots of cell lysates were transferred to -well costar flat-bottomed luminometer plates (corning costar, corning, ny), followed by addition of luciferase substrate (promega). 
relative light units (rlu) were determined immediately in the ultra luminometer (tecan us).

receptor-binding assays. binding of rbd-fc or rbd-r a protein to soluble ace was measured by elisa. briefly, recombinant soluble ace (r&d systems, inc., minneapolis, mn) at μg/ml was coated onto -well elisa plates (corning costar) in . m carbonate buffer (ph . ) at °c overnight. after blocking with % non-fat milk, serially diluted rbd-fc or rbd-r a was added to the wells and incubated at °c for h. after washing, the hrp-conjugated goat anti-human igg (zymed) was added and incubated for an additional h. after washing, the substrate tmb was used for detection. binding of rbd-fc or rbd-r a to ace -expressing cells was measured by flow cytometry. briefly,

amino acid residue (arginine) is essential for the immunogenicity of rbd to induce neutralizing antibodies

it was recently reported that the full-length s protein of sars-cov bearing the r a substitution failed to induce neutralizing antibodies against s protein-pseudotyped viruses [ ] . we were interested in investigating whether this point mutation also affects the immunogenicity of rbd-fc, a potent inducer of neutralizing antibodies [ , ] . the rbd-r a mutant was generated by mutagenesis as described. both wild-type rbd-fc and rbd-r a were expressed in t cells and purified to homogeneity by protein a chromatography (fig. c) . for comparison, the wild-type and mutant proteins were each used to immunize mice and rabbits. as shown in fig. , both mice and rabbits developed robust antibody responses against the corresponding immunogens after the third boosting immunization. by comparison, rbd-fc induced higher titers of antibodies in both mice (mean end-point titer was / , ) and rabbits (mean end-point titer was / , , ), whereas rbd-r a induced antibodies with mean end-point titers at / , in mice and at / , in rabbits. 
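the end-point titers quoted above are read from serial-dilution elisa curves. the source does not state how the positivity cutoff was chosen, so the following is only a minimal sketch under stated assumptions: the cutoff rule (mean of pre-immune absorbances plus three standard deviations), the function names, and all numbers are hypothetical and chosen for illustration.

```python
import statistics


def cutoff_from_preimmune(preimmune_absorbances, n_sd=3.0):
    """Hypothetical cutoff rule: mean + n_sd * SD of pre-immune readings.

    The paper does not specify its cutoff; this is an assumed convention.
    """
    mean = statistics.mean(preimmune_absorbances)
    sd = statistics.stdev(preimmune_absorbances)
    return mean + n_sd * sd


def endpoint_titer(reciprocal_dilutions, absorbances, cutoff):
    """Return the last reciprocal dilution that is still above the cutoff.

    Dilutions are walked from most concentrated to most dilute; the walk
    stops at the first reading at or below the cutoff. Returns None if
    even the most concentrated sample is not above the cutoff.
    """
    titer = None
    for dilution, absorbance in sorted(zip(reciprocal_dilutions, absorbances)):
        if absorbance <= cutoff:
            break
        titer = dilution
    return titer


# Illustrative (made-up) data: four-fold serial dilutions of one antiserum.
dilutions = [100, 400, 1600, 6400, 25600]
readings = [2.1, 1.5, 0.8, 0.3, 0.08]
cutoff = cutoff_from_preimmune([0.05, 0.06, 0.07])
titer = endpoint_titer(dilutions, readings, cutoff)
print(f"cutoff={cutoff:.3f}, end-point titer=1/{titer}")
```

a group mean end-point titer would then be an average of the per-animal values; whether an arithmetic or geometric mean was used is not stated in the source.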
we used a recombinant full-length s protein (fl-s) as a coating antigen to measure the titers of antibodies specific for the rbd in the antisera collected after the third boost (fig. ) . surprisingly, while the rbd-fc could induce high titers of rbd-specific antibodies in mice and rabbits (mean end-point titers were / , and / , , respectively), the r a mutant only induced rbd-specific antibody titers at / in mice and / in rabbits. we then tested whether mouse and rabbit antisera had neutralizing activities against s protein-mediated viral entry by using sars pseudovirus. consistent with our previous findings [ , ] , both mouse and rabbit antibodies induced by rbd-fc could potently neutralize sars pseudovirus with mean % neutralizing titers at / , and / , , respectively (fig. ) . however, the antibodies induced by the rbd-r a mutant in both mice and rabbits could not significantly inhibit sars pseudovirus at / or higher dilutions. these results indicate that the r a substitution severely impairs the ability of rbd-fc to elicit rbd-specific neutralizing antibodies in immunized animals.

we previously identified six groups of conformation-dependent neutralizing epitopes (conf i-vi) with a panel of anti-rbd mabs isolated from mice immunized with rbd-fc [ ] . we have recently isolated a set of novel anti-rbd mabs from mice immunized with inactivated sars-cov or recombinant full-length s protein, and their epitopes have been initially grouped by binding competition (designated as group a-c and group a-c, respectively) (fig. ) . consistently, these novel anti-rbd mabs possess potent neutralizing activity, but they may target different epitopes as shown by their unique specificity to block receptor binding (data not shown). to probe the antigenic structure of the rbd bearing the r a mutation, representative anti-rbd mabs from each epitope group were used in elisa to measure their reactivity with both wild-type and mutant proteins. as shown in fig.
, all anti-rbd mabs strongly reacted with rbd-fc, but only three conformation-dependent mabs ( f , g , and s ) were able to recognize the rbd-r . we have known that the epitope for s (group d) differs from those of the conf v mabs ( f and g ) as shown by their binding competition and their capacity to block receptor binding (data not shown). it was of interest to note that the reactivity of one mab ( d ) targeting the linear epitope (residues - ) was significantly reduced by the r a substitution, while the mab h targeting the residues - reacted with both proteins equally. these results suggest that the r a substitution can disrupt the majority of conformation-dependent neutralizing epitopes in the rbd, and that the conf v and group d epitopes are relatively stable and conserved. to further characterize the antigenicity of the rbd-r a mutant, we compared it with wild-type rbd-fc for reactivity with mouse antisera induced against the fl-s by elisa. as expected, the rbd-fc reacted strongly with the mouse anti-s sera (mean end-point titer at / , ), comparable with the fl-s protein ( / , ) (fig. ) . however, the rbd-r a mutant did not significantly react with mouse anti-fl-s sera, confirming that the r a mutation disrupts the antigenic structure in the rbd.

the s protein of sars-cov is responsible for binding to the receptor and mediates viral entry into target cells [ ] . however, pseudovirus expressing the full-length s protein with the r a substitution completely loses infectivity [ ] . we hypothesized that the substitution of r might disrupt the receptor-binding motif in the rbd, leaving the pseudovirus unable to bind to the receptor. first, we used an elisa-based assay to compare the binding activities of rbd-fc and rbd-r a to soluble ace . as shown in fig.
, rbd-fc could bind to the soluble ace in a dose-dependent manner, whereas rbd-r a had no binding activity at a concentration up to μg/ml, at which the wild-type rbd-fc reached a reactive plateau. we then measured whether rbd-fc and rbd-r a bind to cell-associated ace by flow cytometry. consistent with our previous report, rbd-fc bound to ace -expressing t cells efficiently; however, the rbd-r a could not bind to ace expressed on the cells (fig. ) . rbd-fc itself had inhibitory activity against pseudovirus entry, suggesting its potential application as an antiviral therapeutic [ ] . in parallel, we tested the inhibitory activity of both rbd-fc and rbd-r a on sars pseudovirus. as expected, rbd-fc was able to inhibit sars pseudovirus infection with an ic of . μg/ml (fig. ) , whereas the rbd-r a had no inhibitory effect at a concentration as high as μg/ml, suggesting that the rbd mutant could not compete with the virion for binding to the receptor. taken together, these data suggest that the r a substitution may damage the receptor-binding motif in the rbd.

sars-cov emerged in the winter of - and killed approximately people, % of those infected [ , , [ ] [ ] [ ] . although there are no recent sars outbreaks, the need to develop effective vaccines remains of high importance to prevent future epidemics caused by sars-cov, which may re-emerge from animal reservoirs [ ] [ ] [ ] [ ] [ ] . to this end, structural and functional characterization of sars-cov is one of the highest priorities. yi et al. [ ] recently reported that single amino acid substitutions in the sars-cov s protein determine viral entry and the immunogenicity to induce neutralizing antibodies, but the mechanisms that caused these phenotypes remain to be elucidated. 
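the ic value reported above for rbd-fc inhibition of pseudovirus entry, like the % neutralizing titers of the antisera, is the kind of quantity usually read off a luciferase dose-response curve. the source does not describe its fitting procedure, so the following is a minimal sketch under stated assumptions: percent inhibition is computed relative to a virus-only control, and the half-maximal point is found by log-linear interpolation between the two bracketing concentrations. function names and all numbers are hypothetical.

```python
import math


def percent_inhibition(rlu_sample, rlu_virus_only, rlu_cells_only=0.0):
    """Percent reduction in luciferase signal relative to the virus-only control.

    rlu_cells_only is an optional background reading (assumed, not from the paper).
    """
    signal = rlu_sample - rlu_cells_only
    max_signal = rlu_virus_only - rlu_cells_only
    return 100.0 * (1.0 - signal / max_signal)


def interpolate_ic50(concentrations, inhibitions):
    """Estimate the concentration giving 50% inhibition.

    Uses linear interpolation on log10(concentration) between the two
    adjacent points that bracket 50%. Returns None if the curve never
    crosses 50% (e.g. an inactive protein such as a non-binding mutant).
    """
    pairs = sorted(zip(concentrations, inhibitions))
    for (c_lo, i_lo), (c_hi, i_hi) in zip(pairs, pairs[1:]):
        if i_lo < 50.0 <= i_hi:
            frac = (50.0 - i_lo) / (i_hi - i_lo)
            log_ic50 = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_ic50
    return None


# Illustrative (made-up) readings for an inhibitor titrated over three concentrations.
virus_only = 10000.0
rlu = {0.1: 9000.0, 1.0: 6000.0, 10.0: 1000.0}
concs = sorted(rlu)
inhib = [percent_inhibition(rlu[c], virus_only) for c in concs]
print(interpolate_ic50(concs, inhib))
```

the same interpolation applies to serum neutralization if reciprocal dilutions are replaced by serum concentrations; a four-parameter logistic fit would be the more common choice when enough points are available, but it needs a fitting library and is not shown here.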
in this study, we used the rbd-fc as a model to study how a single residue mutation in the rbd can abolish the major functions of the full-length s protein, since this molecule can efficiently bind to the receptor ace and contains multiple conformation-dependent epitopes (conf i-vi) capable of inducing highly potent neutralizing antibodies [ ] . we converted arginine to alanine (r a), a change that was shown to disrupt both the ability of the full-length s protein to induce neutralizing antibodies and s protein-mediated viral entry [ ] , and evaluated its effect on the antigenic structure and binding function of the rbd. first, we found that the r a substitution could completely abolish the ability of the rbd to induce neutralizing antibodies in immunized mice and rabbits. we then probed the antigenic epitopes in the rbd bearing r a with a panel of anti-rbd mabs recognizing different epitopes in the rbd and found that this mutation could disrupt the major neutralizing epitopes. these results provide direct evidence to explain why the rbd-r a mutant could not induce neutralizing antibodies. although the conf v and group d neutralizing epitopes in the rbd-r a retained partial reactivity with the corresponding mabs, they failed to elicit functional antibodies in either mice or rabbits. these data indicate that the residue r is essential for maintaining the antigenic structure in the rbd, which confers the immunogenicity to induce neutralizing antibodies. it is understandable that a single residue change in the rbd (e.g., r a) could abolish its ability to induce functional antibodies by disrupting its major conformation-dependent neutralizing epitopes, but how the r a substitution determines the immunogenicity of the full-length s protein is poorly understood [ ] . although the rbd of sars-cov is a major target of neutralizing antibodies, the s protein also contains some neutralizing epitopes in the other domains. 
for example, it was reported that the linear epitopes in the hr region of the s subunit, which is far away from the rbd, could induce antibodies with moderate neutralizing activity [ ] . we have recently found that the n-terminal region (residues - ) of s protein also contains neutralizing epitopes (data not shown). therefore, further structural characterization of the s protein may provide important information for understanding why a single point mutation in the rbd affects the immunogenicity of the entire s protein. we subsequently documented that the r a mutation was able to completely abolish rbd-mediated binding to the receptor ace . the rbd-r a molecule could not bind to either soluble or cell-associated ace as shown by elisa and flow cytometry-based assays, respectively. moreover, the rbd with the r a substitution also lost its inhibitory ability against viral entry. therefore, our data indicate that the residue r is not only essential for the antigenic structure in the rbd, but also critical for the receptor-binding motif (rbm). the crystal structure of the rbd in complex with human ace reveals that only a few of the many contacting residues in the large interface between the s protein and receptor determine the efficiency of virus binding and infection [ ] . the ace is bound by an extended loop in the s protein that projects from a compact core formed by residues - , the rbm. in particular, a methyl group from a threonine residue at position of the s protein at the interface extends into a hydrophobic pocket in ace that contains a lysine residue at position . although the r is not one of the residues on the loop that contact residues on human ace [ ] , its substitution might change the configuration of the rbm and thereby disrupt the interaction of residues between the rbm and ace . subsequently, the conformational change of the rbd might result in a dramatic alteration in its antigenic structure. 
therefore, retaining the critical residues and proper antigenic conformations in rbd is important for developing sars vaccines. the sites in rbd containing critical residues, e.g., r , can be used as targets for rational design of therapeutics. characterization of a novel coronavirus associated with severe acute respiratory syndrome retroviruses pseudotyped with the severe acute respiratory syndrome coronavirus spike protein efficiently infect cells expressing angiotensin-converting enzyme the sars-cov s glycoprotein: expression and functional characterization coronavirus spike proteins in viral entry and pathogenesis molecular modelling of s and s subunits of sars coronavirus spike glycoprotein cellular entry of the sars coronavirus sars-associated coronavirus coronavirus spike proteins in viral entry and pathogenesis angiotensin-converting enzyme is a functional receptor for the sars coronavirus a model of the ace structure and function as a sars-cov receptor expression cloning of functional receptor used by sars coronavirus amino acids to of the severe acute respiratory syndrome coronavirus spike protein are required for interaction with receptor a -amino acid fragment of the sars coronavirus s protein efficiently binds angiotensin-converting enzyme structure of sars coronavirus spike receptor-binding domain complexed with receptor structural characterization of the fusion-active complex of severe acute respiratory syndrome (sars) coronavirus interaction between heptad repeat and regions in spike protein of sars-associated coronavirus: implications for virus fusogenic mechanism and identification of fusion inhibitors structural characterization of the sars-coronavirus spike s fusion protein core severe acute respiratory syndrome vaccine development: experiences of vaccination against avian infectious bronchitis coronavirus coronavirus immunogens contributions of the structural proteins of severe acute respiratory syndrome coronavirus to protective immunity 
severe acute respiratory syndrome coronavirus spike protein expressed by attenuated vaccinia virus protectively immunizes mice mucosal immunisation of african green monkeys (cercopithecus aethiops) with an attenuated parainfluenza virus expressing the sars coronavirus spike protein for the prevention of sars a dna vaccine induces sars coronavirus neutralization and protective immunity in mice recombinant modified vaccinia virus ankara expressing the spike glycoprotein of severe acute respiratory syndrome coronavirus induces protective neutralizing antibodies primarily targeting the receptor binding region inactivated sars-cov vaccine elicits high titers of spike protein-specific antibodies that block receptor binding and virus entry identification of a critical neutralization determinant of severe acute respiratory syndrome (sars)-associated coronavirus: importance for designing sars vaccines receptor-binding domain of sars-cov spike protein induces highly potent neutralizing antibodies: implication for developing subunit vaccine receptor-binding domain of sars coronavirus spike protein contains multiple conformational-dependant epitopes that induce highly potent neutralizing antibodies monoclonal antibodies targeting the hr domain and the region immediately upstream of the hr of the s protein neutralize in vitro infection of severe acute respiratory syndrome coronavirus b-cell responses in patients who have recovered from severe acute respiratory syndrome target a dominant site in the s domain of the surface spike glycoprotein an exposed domain in the severe acute respiratory syndrome coronavirus spike protein induces neutralizing antibodies single amino acid substitutions in the severe acute respiratory syndrome coronavirus spike glycoprotein determine viral entry and immunogenicity of a major neutralizing domain identification of a novel coronavirus in patients with severe acute respiratory syndrome a novel coronavirus associated with severe acute respiratory 
syndrome coronavirus as a possible cause of severe acute respiratory syndrome virus detectives seek source of sars in china's wild animals isolation and characterization of viruses related to the sars coronavirus from animals in southern china cross-host evolution of severe acute respiratory syndrome coronavirus in palm civet and human bats are natural reservoirs of sars-like coronaviruses severe acute respiratory syndrome coronavirus-like virus in chinese horseshoe bats key: cord- - wfy f authors: gobeil, sophie m-c.; janowska, katarzyna; mcdowell, shana; mansouri, katayoun; parks, robert; manne, kartik; stalls, victoria; kopp, megan; henderson, rory; edwards, robert j; haynes, barton f.; acharya, priyamvada title: d g mutation alters sars-cov- spike conformational dynamics and protease cleavage susceptibility at the s /s junction date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: wfy f the sars-cov- spike (s) protein is the target of vaccine design efforts to end the covid- pandemic. despite a low mutation rate, isolates with the d g substitution in the s protein appeared early during the pandemic, and are now the dominant form worldwide. here, we analyze the d g mutation in the context of a soluble s ectodomain construct. cryo-em structures, antigenicity and proteolysis experiments suggest altered conformational dynamics resulting in enhanced furin cleavage efficiency of the g variant. furthermore, furin cleavage altered the conformational dynamics of the receptor binding domains (rbd) in the g s ectodomain, demonstrating an allosteric effect on the rbd dynamics triggered by changes in the sd region, that harbors residue and the furin cleavage site. our results elucidate sars-cov- spike conformational dynamics and allostery, and have implications for vaccine design. highlights sars-cov- s ectodomains with or without the k p, v p mutations have similar structures, antigenicity and stability. the d g mutation alters s protein conformational dynamics. 
d g enhances protease cleavage susceptibility at the s protein furin cleavage site. cryo-em structures reveal an allosteric effect of changes at the s /s junction on rbd dynamics. the severe acute respiratory syndrome coronavirus (sars-cov- ) belongs to the β-coronavirus family of enveloped, positive-sense single-stranded rna viruses, and has one of the largest genomes among rna viruses (de wit et al., ). of the seven known coronaviruses that infect humans, four (hcov- e, hcov-oc , hcov-nl , cov-hku ) circulate annually causing generally mild respiratory symptoms in otherwise healthy individuals, while the sars-cov- and middle east respiratory syndrome coronavirus (mers-cov), which are closely related to sars-cov- , have resulted in the - sars and mers epidemics (zumla et al., ), respectively. the ongoing pandemic of coronavirus disease of , is a global public health emergency with more than million cases and million deaths recorded worldwide (dong et al., ) (https://coronavirus.jhu.edu). the surface of the sars-cov- is decorated with the spike (s) glycoprotein (turonova et al., ) that is the target of most current vaccine development efforts (sempowski et al., ). in its prefusion conformation the sars-cov- s protein is a large homo-trimeric glycoprotein forming a crown (from the latin corōna) at the surface of the virus capsid. each s protomer is subdivided into two domains, s and s , which are delimited by a furin cleavage site at residue - (figure ). the s domain comprises the n-terminal domain (ntd), an ntd-to-rbd linker (n r), the receptor binding domain (rbd), and subdomains and (sd and sd ). the s domain contains a second protease cleavage site (s ') followed by the fusion peptide (fp), heptad repeat (hr ), the central helix (ch), the connector domain (cd), heptad repeat (hr ), the transmembrane domain (tm) and a cytoplasmic tail (ct) (figure ). the s domain is responsible for recognition and binding to the host-cell angiotensin-converting enzyme (ace ) receptor. 
the s domain is responsible for viral-host-cell membrane fusion and undergoes large conformational changes (hoffmann et al., a) , but only upon furin cleavage and further essential processing by cleavage at the s ' site by tmprss and related proteases (bestle et al., ; hoffmann et al., b; matsuyama et al., ) . previous reports have demonstrated the central role of the dynamics of the rbd domains between a "closed" (or all rbd-down receptor inaccessible conformation) and "open" (or rbd-up) conformations for recognition and binding to the host cell ace receptor (gui et al., ; shang et al., ; yuan et al., ) . since the early stages of the covid- pandemic, virus evolution has been followed by large-scale sequencing of the virus genomes isolated from patients, and several mutations that arose and propagated within different populations have been identified even though the virus has genetic proofreading mechanisms (elbe and buckland-merrett, ; korber et al., ) . the d g mutation in particular has attracted attention since it has quickly become the dominant strain of sars-cov- circulating worldwide (korber et al., ) . the d g mutation of the s protein has been associated in numerous reports with increased fitness and/or infectivity of the virus (korber et al., ; li et al., ; weissman et al., ) . cryo electron microscopy (cryo-em) structures of the s glycoprotein ectodomain have revealed that d is a surface residue in the vicinity of the furin cleavage site. mutation of this residue to a glycine is expected to disrupt a critical interprotomer hydrogen bond involving t of the s domain (korber et al., ) and resulting in a shift in the observed equilibrium between the open and closed state of the s protein ectodomain (johnson et al., ; weissman et al., ; yurkovetskiy et al., ) ( figure ). 
most structures of the sars-cov- s ectodomain currently available include two modifications: one to disrupt the furin cleavage site (rrar to gsas = s-gsas), and a double proline mutation (pp) of residues - , designed to prevent conformational change to the post-fusion state (wrapp et al., ). originally designed for the mers s protein (pallesen et al., ), introduction of two consecutive proline mutations at the junction of the hr and ch regions stabilized the pre-fusion conformation of the mers, sars and hcov-hku spikes, and increased protein expression and immunogenicity for the mers s protein (pallesen et al., ). based on these prior data, introduction of two consecutive proline residues at the beginning of the central helix was postulated as a general strategy for retaining β-coronavirus s proteins in the prefusion conformation. thus, the pp mutations were carried over to the sars-cov- ectodomain (wrapp et al., ) that is currently widely used in the field for vaccine and structural studies, and is also a component of a vaccine candidate. although shown to stabilize the pre-fusion conformation of other coronaviruses, the effect of the pp mutations has not been systematically studied for the sars-cov- s ectodomain. with the goal of investigating the biophysical and structural consequences of the d g mutation, and to prevent the engineered pp mutations from confounding our observations, we produced two sars-cov- s ectodomain constructs with the native k and v residues, incorporating either a d or a g at position (figure ). the rrar sequence in the furin cleavage site was replaced by a gsas sequence, thus rendering the s constructs furin-cleavage deficient. to probe the effect of the d g substitution on furin cleavage of the s protein, we either reinstated the native furin sequence or replaced it with an exogenous hrv c proteolysis signal. 
we determined the cryo-em structures of the uncleaved d and g s ectodomains, as well as the structure of the fully cleaved g s ectodomain of the currently globally dominant sars-cov- . our results demonstrate the effect of the d g substitution on the conformational dynamics and furin cleavage susceptibility of the s ectodomain, and reveal insights into the allostery between the rbd and distal regions of the s protein. while the sars-cov- s ectodomain construct that includes mutations of residues k and v , between the hr and ch subdomains (s domain), to prolines (pp) (named s-gsas/pp in this study) (figure ) is widely used in the field, the design of this pp construct was based upon the stabilization of the pre-fusion conformation of other coronavirus spikes (pallesen et al., ; walls et al., ; wrapp et al., ). here, we generated an analogous s ectodomain construct that had the native k and v residues (named s-gsas) (figure ). in our f expression system (see methods for details), both the s-gsas/pp and s-gsas constructs expressed at similar levels, yielding about mg final protein per l of culture. both proteins also showed similar migration profiles on sds-page and by size exclusion chromatography (sec) on a superose column (figure a, b). similar to the s-gsas/pp construct (henderson et al., ; wrapp et al., ), s-gsas showed - % intact prefusion spike trimers by negative stain electron microscopy (nsem) (figure c). this finding is in contrast to previous observations for mers and the sars-cov- ectodomains, which showed a mixture of the prefusion and postfusion conformations unless the pp mutation was included (pallesen et al., ). 
binding of s-gsas and s-gsas/pp measured by elisa to ace and cr , both requiring an rbd-up conformation, ab and ab , two antibodies isolated from a covid- convalescent donor with epitopes mapping to the ace binding site and s domain respectively, and g , binding to a quaternary s glycan epitope, were all nearly identical demonstrating that both constructs showed similar antigenic behavior ( figure d ). using differential scanning fluorimetry to measure the spike thermostability, we found the s-gsas and s-gsas/pp ectodomains showed similar melting temperatures ( figure e ). we next solved cryo-em structures of the s-gsas ectodomain (figures f-h , s -s , table s ), to compare with the s-gsas/pp structures (walls et al., ; wrapp et al., ) and to visualize the impact that the engineered pp mutations had on the structure of the sars-cov- spike ectodomain. two populations of the s-gsas s ectodomain were identified in the cryo-em dataset -a -rbd-up (or open) and a -rbd-down (or closed) conformation ( figure f and table s ). both structures were similar to the corresponding structures of s-gsas/pp (walls et al., ) , with overall rmsds of . Å and . Å for the -up and -down structures, respectively. in the region around the pp mutations, we found the s-gsas structures to be similar to the corresponding s-gsas/pp structures ( figure h ). in the s-gsas -rbd-up structure, we observed that the k side chain was appropriately positioned to make an interprotomer salt bridge with the d residue of the rbd of the adjacent protomer, an interaction that would be abrogated in the pp construct. the corresponding residues in the mers s protein, v and l are non-polar, and the adjacent protomers too far to interact with these residues (figure s ). in the sars-cov- s protein cryo-em structure (pdb xlr, x b) the residues d -d (equivalent to sars-cov- d -d ) lie further from k suggesting that this putative salt bridge interaction may be more transient in sars-cov- . 
overall, our data show that for the sars-cov- s ectodomain, the s-gsas construct showed structural, antigenic and stability behavior similar to that of the s-gsas/pp construct that included the k p and v p mutations at the junction of the ch and hr regions. while these and analogous mutations had proved beneficial for the expression and stability of other covs (pallesen et al., ), for the sars-cov- s protein other compensating interactions may help confer stability to the pre-fusion form in the absence of the pp mutations. for the rest of this study we have used the s-gsas construct as the platform for introducing mutations and other modifications of interest. to understand the molecular details of the spike d g mutation that arose and quickly dominated circulating sars-cov- isolates globally, we sought to assess the impact of the d g mutation on the structure and antigenicity of the sars-cov- s ectodomain. the d g mutated s-gsas construct (s-gsas/d g) yielded an average of ~ mg of purified protein per l of culture (n = ). the sds-page, sec and dsf profiles of the s-gsas/d g (figure a) were similar to those of the s-gsas s ectodomain (figure a, b). nsem of the s-gsas/d g s ectodomain revealed typical and well-dispersed pre-fusion s particles (figure b). to visualize structural details at higher resolution, we determined the cryo-em structures of the s-gsas/d g construct (figure c-e, table s , figures s -s ). two major populations of the s ectodomain were identified in the cryo-em dataset: one population with one rbd in the "up" or ace receptor accessible conformation and the other with all three rbds in the "down" or receptor inaccessible conformation (figure c). this is consistent with our previous observations made with nsem data that showed an increase in the rbd-up population for the s-gsas/d g s ectodomain (weissman et al., ). 
our results show that the d g mutation in the sd domain, even though distal from the rbd region, has an allosteric effect on rbd dynamics leading to alteration of up/down rbd dispositions. to understand the nature of this allostery, we examined changes in the s protein that accompany the up and down rbd transition (figure ) by comparing the rbd-up chain in the -rbd-up structure to the down chains in the -up and the -down structures ( figure c ). in each s protein protomer, the polypeptide chain folds into domains as it traverses the length of the s subunit and enters the s subunit i.e. the ntd (residues - ) followed by the rbd (residues - ), the sd (residues - ) and sd (residues - ) domains ( figure a ). the ntd and rbd are connected via a -residue linker spanning residues - (named n r) that stacks against the sd and sd domains ( figure a-d) , as it makes its way from the ntd to the rbd, essentially connecting all the individual domains in the s subunit, and forming "super" subdomains sd ' and sd ', respectively . upon overlaying the protomers with the rbd in the up position with the protomers with their rbds in the down position by using the s subunit residues - for superpositions, we found that the down-to-up rbd motion is accompanied by a rigid body movement of the sd ' domain resulting in a shift of up to ~ . Å of the sd domain ( figure d ), relative to its position in the rbd-down protomers and a shift of up to a Å in the n r linker as it hinges to enter into the rbd. this results in a ~ ° tilt of residues - of the n r linker region that forms part of the sd ' super subdomain, while residues - of the linker that associate with the sd subdomain remained virtually unmoved, showing only a slight tilt in the b beta strand that accompanied large movements in the rbd and adjoining sd ' domain ( figure d ). 
indeed, the sd ' super subdomain that harbors the d g mutation appears to form a conformationally invariant anchor with the mobile rbd and ntd domains at either end (figure d). additionally, the s subunit remains invariant between the different protomers, showing that the large movements that occur in the s subunit are effectively arrested by the conformationally invariant anchor formed by the sd ' super subdomain. these observations are mirrored in the difference distance matrices (ddm) comparing the rbd-up and down chains (figure e and figure s ). ddm analyses (richards and kundrot, ) provide superposition-free comparisons between a pair of structures by calculating the differences between the distances of each pair of cα atoms in a structure and the corresponding pair of cα atoms in the second structure. the ddm analysis not only shows the large movement in the rbd region and the movement in the ntd, it also captures the movement in the n r linker and the sd domain observed in the structures. overall, these analyses show that the d g mutation is acquired within a key region encompassing the sd domain and an additional β-strand contributed by residues - of the n r linker that forms a region of relative structural stillness separating the mobile ntd and rbd, as well as isolating the motions in s from the s subunit. this distal mutation altering rbd conformational dynamics shows that small changes in this region can translate into large allosteric effects, and suggests a role for the sd domain in modulating rbd dynamics. in addition to the d g mutation, the sd subdomain also harbors a furin cleavage site (residues - ) that separates the s and s subunits (figure ). cleavage of the s protein by furin at this site is essential for virus transmission (shang et al., ). 
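the difference distance matrix calculation described above can be sketched in a few lines. this is a minimal illustration and not the authors' implementation: the function names and the toy coordinates are assumptions made up for this example, and a real analysis would read cα coordinates for residue-matched chains from the deposited models.

```python
import math


def distance_matrix(coords):
    """Pairwise Euclidean distances between points (e.g. C-alpha atoms)."""
    return [[math.dist(a, b) for b in coords] for a in coords]


def difference_distance_matrix(coords_a, coords_b):
    """Return DDM[i][j] = d_B(i, j) - d_A(i, j).

    Both inputs must list the same residues in the same order.
    """
    if len(coords_a) != len(coords_b):
        raise ValueError("structures must contain the same residues in the same order")
    da = distance_matrix(coords_a)
    db = distance_matrix(coords_b)
    n = len(coords_a)
    return [[db[i][j] - da[i][j] for j in range(n)] for i in range(n)]


# Toy coordinates in angstroms (hypothetical, not taken from any model).
down = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0)]
up = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (3.8, 3.8, 0.0)]

ddm = difference_distance_matrix(down, up)
# Only the pair (0, 2) changes its separation: the third point has hinged
# towards the first one, so ddm[0][2] is negative.
```

because only intra-structure distances enter the calculation, translating or rotating either structure leaves the matrix unchanged, which is what makes the comparison superposition-free.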
the proximity of the d g mutation to the furin cleavage site and the increased flexibility observed in the cryo-em dataset of the s-gsas/d g ectodomain ( figure c -e), prompted us to examine the effect of the d g substitution on furin cleavage. since our expression system (i.e. freestyle cells) endogenously expresses furin, in order to obtain uncleaved spike that we could then test for protease cleavage in vitro, we engineered a hrv c site ( amino acids long) to replace the furin cleavage site ( amino acids long) at the s /s junction, resulting in the s-hrv c and s-hrv c/d g s ectodomain constructs ( figure a ). both proteins expressed in f cells but at lower yields compared to the s-gsas constructs ( µg/l and µg/l for the s-hrv c and s-hrv c/d g proteins, respectively). sec and sds-page profiles were similar to the s-gsas and s-gsas/d g proteins confirming well-folded and homogeneous spike preparations ( figure a , b). nsem micrographs showed characteristic kite-shaped particles for the pre-fusion s protein, and d-classification of particles from nsem revealed well folded spikes, further confirming that s-hrv c spikes retained the overall fold and structure of the s-gsas spikes (figure c, d) . to test the susceptibility of the hrv c site engineered at the junction of the s and s subunits to protease cleavage, we incubated the purified s-hrv c and s-hrv c/d g spikes with the hrv c enzyme and followed the digestion by analyzing aliquots taken at different time-points by sds-page ( figure e -g). we found that the digestion of the s-hrv c/d g spike ( figure f -g) proceeded at a faster rate than that of the s-hrv c spike ( figure e-g) with the s-hrv c/d g spike almost % digested within the first minutes of incubation, whereas, the s-hrv c constructs only achieved % of cleavage after hours, and a substantial portion remained uncleaved even upon addition of more enzyme followed by additional hours of incubation. 
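one simple way to quantify a digestion time course like the one above is to express, for each time-point, the intensity of the two cleavage-product bands as a fraction of the total spike signal in the lane. the source does not state how band intensities were quantified, so the sketch below is an assumption for illustration only; the function names and all intensity values are hypothetical.

```python
def fraction_cleaved(intact, fragment_a, fragment_b):
    """Fraction of the lane signal found in the two cleavage-product bands.

    Arguments are background-subtracted band intensities (arbitrary units).
    """
    total = intact + fragment_a + fragment_b
    if total <= 0:
        raise ValueError("total band intensity must be positive")
    return (fragment_a + fragment_b) / total


def first_time_reaching(time_course, threshold):
    """Earliest time-point whose cleaved fraction is >= threshold, else None.

    time_course is a list of (time, fraction) pairs sorted by time.
    """
    for time, fraction in time_course:
        if fraction >= threshold:
            return time
    return None


# Hypothetical lanes: (time in minutes, intact, fragment A, fragment B).
lanes = [(0, 100.0, 0.0, 0.0), (30, 60.0, 25.0, 15.0), (60, 10.0, 55.0, 35.0)]
course = [(t, fraction_cleaved(i, a, b)) for t, i, a, b in lanes]
half_time = first_time_reaching(course, 0.5)
```

comparing such curves for two constructs digested under identical enzyme and substrate concentrations gives a rough, model-free readout of relative cleavage susceptibility.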
these results suggested that the d g mutation increased the susceptibility of protease cleavage at the s /s junction. to study the effect of the d g substitution on protease cleavage at the s /s junction with the native furin site, we next generated spike ectodomain constructs where the furin site was restored to the native sequence, resulting in two constructs named s-rrar and s-rrar/d g (figure a). the proteins were expressed and purified using our usual methodology for the furin cleavage-deficient constructs (see methods). the sec profiles (figure a) showed a higher proportion of the first higher molecular weight peak. a second peak eluting at a similar molecular weight as the s-gsas spike (at ~ . ml elution volume) was used for further characterization. the sec profile of the s-rrar spike preparation showed small populations of lower molecular weight peaks that were not observed for the s-rrar/d g protein (figure a). on sds-page (figure b), the peak corresponding to the s ectodomain showed the s-rrar construct as having one major band at the molecular weight corresponding to the spike monomer and some fainter bands corresponding to the s and s subunits, while the s-rrar/d g protein showed a band corresponding to the spike monomer and the two bands corresponding to the molecular weights of the s and s subunits. the smaller molecular weight bands corresponding to the s and s subunits were in higher proportions in the s-rrar/d g spike preparation compared to the s-rrar preparation. in summary, the sec and sds-page profiles showed that, although both the s-rrar and s-rrar/d g constructs were cleaved by endogenous furin (figure b) during protein expression, the s and s subunits remained together in solution (figure a). 
consistent with the enhanced cleavage observed for the s-hrv c/d g spike relative to the s-hrv c spike, in the furin-site restored spikes we observed a higher proportion of cleaved spike in s-rrar/d g relative to s-rrar, suggesting that the d g mutation makes the spike more susceptible to furin cleavage. nsem of the purified s-rrar (figure c) and s-rrar/d g (figure d) confirmed that both of these furin site-restored spikes formed well-folded spike ectodomains. we next digested the sec-isolated fractions of the s-rrar and s-rrar/d g ectodomains (figure a-d) in vitro by adding furin (figure e). as observed for the s-hrv c constructs, the d version of the spike was less susceptible to cleavage than the g mutant for the same incubation time with the enzyme. sec purification of the fully digested s-rrar/d g ectodomain revealed a peak corresponding to the ectodomain (figure f). on sds-page, this peak migrated as two distinct bands corresponding to the s and s domains, thus confirming isolation of only the cleaved portion of the protein (figure g). nsem showed fully folded ectodomains for the furin digested and sec purified s-rrar/d g protein (figure h). in summary, these results show that acquisition of the d g mutation in the s protein sd domain resulted in increased furin cleavage of the s ectodomain. to visualize the structure of the furin-cleaved s ectodomain at atomic level resolution, we obtained a cryo-em dataset, and resolved two populations of the furin-cleaved s ectodomain: a -rbd-up and a -rbd-down population (figure a, figure s and s and table s ). we observed an increased proportion of the -rbd-down s compared to the uncleaved d g s ectodomain, thus reporting a change in the rbd conformational dynamics upon furin cleavage. consistent with this result, we observed reduced binding to ligands such as ace- and cr that require the rbd to be in the up conformation for binding (figure b). 
as expected, a decrease in binding was also observed with antibody , isolated from a convalescent covid- donor, with an epitope overlapping with the ace binding site. antibody g that binds a quaternary glycan epitope in the s subunit showed a small decrease in binding with the furin-cleaved s ectodomain, whereas another covid- -derived s antibody showed an increase in binding with the furin-cleaved s ectodomain. we compared the different protomers in the two structures by overlaying three protomers in the asymmetric -rbd-up structure and one protomer from the symmetric -rbd-down structure using residues - (comprising the ch and hr regions) for superposition (figure c). similar to observations made with the s-gsas/d g s ectodomain structure, the rbd up/down motion in the furin-cleaved g s ectodomain was associated with a movement in the sd domain and in the region of the rbd-to-ntd linker that joined the sd β sheet (figure c, s b). as observed for s-gsas/d g, the sd domain showed little conformational change and formed a stable motif anchoring the mobile ntd and rbd domains. these observations reinforce the divergent roles of the sd and sd domains in rbd motion. we next examined the region of the sd domain proximal to the ntd, and asked whether we could detect any structural changes in this region and, if so, whether these could be related to ntd motion. in the symmetric -rbd-down s ectodomain, all ntds are identical, each stacking against the down rbd of the adjacent protomer. in the asymmetric -rbd-up structure, the ntds were distinct. to distinguish between these, we named the ntds as follows: the ntd that was part of the protomer with the rbd in the up conformation was named ntd . ntd stacked against a down rbd that contacted the up-rbd at one end and the second down-rbd at the other. the ntd stacked against a down-rbd that contacted a down-rbd at one end, and the ntd had the least amount of rbd contact by virtue of contacting the up-rbd (figure a). 
observing the ntd-proximal region on the sd domain (marked by a dotted square on figure c) that also contacted the rbd-to-ntd linker, we noted shifts in the t - loop between the different protomers. while the shifts were modest (with a maximal displacement of . Å), interestingly, identical trends were observed in the -rbd-up structures of the s-gsas, s-gsas/d g and furin-cleaved s-gsas/d g s ectodomains, suggesting that this region of the sd domain responds to ntd motion and adopts a different conformation depending on the ntd environment (figure d). thus, these data provide further evidence for allostery in the s protein, with changes in the sd domain impacting the rbd conformational dynamics. while the sd domain remains almost structurally invariant, we observe small but reproducible changes in sd loops in response to rbd/ntd movement, suggesting that small changes in the sd region may translate to large motions in the rbd/ntd region. stabilized ectodomain constructs have proven to be useful tools to understand conformational dynamics of cov s proteins. in particular, these have enabled high-resolution structural determination and atomic level understanding of the s ectodomain. they are also key components in vaccine development pipelines. the structural similarities in the s proteins of diverse covs have often enabled quick translation of structural rules and ideas from one cov s ectodomain to another. indeed, after the onset of the recent and ongoing covid- pandemic, the sars-cov- s ectodomain could be rapidly stabilized and structurally characterized by exploiting its similarities with other covs and following strategies that had proved successful previously (pallesen et al., ; wrapp et al., ). 
some of these stabilization strategies, such as introduction of proline residues in the fusion subunit to prevent transition from pre- to post-fusion, have been successful in stabilizing the pre-fusion conformation of diverse class i fusion proteins including rsv f (krarup et al., ), hiv- env (sanders et al., ), ebola and marburg gp (rutten et al., ), influenza ha (qiao et al., ) and lassa gpc (hastie et al., ). while the underlying hypothesis for the stabilization of the s ectodomain was that introduction of proline residues at the junction of the ch and hr helices would arrest conformational transition to the post-fusion form, we found that even without the pp mutations, the sars-cov- s ectodomain retained its pre-fusion form. moreover, even following furin cleavage the s ectodomain retained its pre-fusion conformation. these differences between the observed behavior of the sars-cov- s relative to other covs suggest that even though they retain similar overall topology and structural folds, there are differences between these covs that profoundly affect their structural and biological properties. studying and accounting for these will be essential not only to understand sars-cov- but also to appreciate the nature and origin of these differences between covs for anticipating, preparing for and rapidly combating future cov pandemics. viral surface proteins that are involved in receptor binding mediated cellular entry typically consist of flexible and moving parts that exhibit large conformational changes. while this conformational flexibility is necessary for function, structural checkpoints are required to prevent premature activation and destabilization or unfolding of the protein structure. conformationally-silent structural islands provide the necessary stabilizing anchors for adjacent regions undergoing large motions. 
in this study we have identified the sd domain in the sars-cov- s protein as such a conformational anchor that is spatially interspersed between the highly mobile ntd and rbd regions, while itself remaining relatively invariant in its conformation. this conformational invariability of the sd subdomain is reminiscent of the beta sandwich structure in the hiv- envelope glycoprotein that connects and anchors a mobile layered architecture of the gp inner domain (pancera et al., ) . the conformationally invariant sd also serves to contain the movements of the rbd and ntd to the s subunit, such that the s subunit was unchanged between the various rbd "up" and "down" protomers ( figure and figure s ). this suggests a role for the sd domain in preventing premature triggering due to the stochastic up/down rbd motions in the sars-cov- s protein, as well as the importance of downstream events such as ace receptor engagement and tmprss protease cleavage (bestle et al., ; hoffmann et al., b; matsuyama et al., ) in orchestrating the full extent of pre-to post-fusion transformation. in this study, we also assigned a key role to the n r linker that connects the ntd to the rbd within a protomer. rather than just being a connector, this -residue linker is also a modulator of conformational changes that are critical for receptor engagement. the linker contributes a beta strand to each of the sd and sd subdomains thus connecting all the structural domains in the s subunit. in addition to the much discussed d g mutation, the sd subdomain also houses the multibasic furin cleavage site that demarcates the s and s subunits. furin cleavage is an essential processing step for the s protein and is necessary for viral infection and transmission (hoffmann et al., a; shang et al., ) . 
we provide evidence in this study that the d g mutation enhances susceptibility of the sars-cov- s ectodomain to furin cleavage, thus raising the possibility that this is a contributor to increased fitness and transmissibility of d g isolates. in this paper, we study the effect of the d g mutation on rbd dynamics and susceptibility to furin cleavage. we find that the d g mutation results in increased furin cleavage susceptibility, which could be responsible for the increased transmissibility of the sars-cov- with the d g mutation. it is important to consider though that these results are further information and requests for resources and reagents should be directed to priyamvada acharya (priyamvada.acharya@duke.edu). data and code availability cryo-em reconstructions and atomic models generated during this study are available at wwpdb and emdb (https://www.rcsb.org; http://emsearch.rutgers.edu) under the accession codes pdb ids kdg, kdh, kdk, kdl, kdi, kdj, ke , ke , ke , ke , ke , kea, keb, kec and emdb ids emd- , emd- , emd- , emd- , emd- , emd- , emd- , emd- , emd- , emd- , emd- , emd- , emd- , emd- . gibco freestyle -f cells (embryonal, human kidney) were incubated at °c and % co in a humidified atmosphere. cells were incubated in freestyle expression medium (gibco) with agitation at rpm. plasmids were transiently transfected into cells using turbo (speedbiosystems) and incubated at °c, % co , rpm for days. on the day following transfection, hyclone cdm hek media (cytiva, ma) was added to the cells. antibodies were produced in expi cells (embryonal, human kidney). cells were incubated in expi expression medium at °c, rpm and % co in a humidified atmosphere. plasmids were transiently transfected into cells using the expifectamine transfection kit and protocol (gibco). all genes in this study were synthesized and sequenced by geneimmune biotechnology (rockville, md). 
the sars-cov- spike protein ectodomain constructs used comprised the protein residues − (genbank: mn ) with or without the d g mutation, with or without the furin cleavage site rrar (residue - ) mutated to gsas or levlfqgp (hrv c protease site), a c-terminal t fibritin trimerization motif, a c-terminal hrv c protease cleavage site (except for the constructs where the furin site was mutated to an hrv c site), a twinstreptag and an xhistag. all spike ectodomain constructs were cloned into the mammalian expression vector pαh (wrapp et al., ). for the ace- construct, the c-terminus was fused to a human fc region. protein purification spike ectodomains were harvested from filtered and concentrated supernatant using streptactin resin (iba) and further purified by sec using a superose / gl increase column preequilibrated in mm tris, ph . , mm nacl, . % sodium azide. all protein purification steps were performed at room temperature in a single day. the purified proteins were flash frozen and stored at - °c in single-use aliquots. each aliquot was thawed by incubation (~ min) at °c before use. antibodies were produced in expi f cells and purified by protein a affinity. ace- with human fc tag was purified by protein a affinity chromatography. negative-stain electron microscopy samples were diluted to µg/ml in mm hepes ph . , mm nacl, % glycerol, . mm glutaraldehyde and incubated for minutes before quenching the glutaraldehyde by the addition of m tris (to a final concentration of mm) and minutes incubation. a -µl drop of sample was then applied to a glow-discharged carbon-coated grid for - seconds, blotted, stained with % uranyl formate, blotted and air-dried. images were obtained using a philips em electron microscope at kv, , × magnification, and a . Å pixel size. the relion (scheres, ) program was used for particle picking, d and d class averaging. differential scanning fluorimetry dsf assay was performed using tycho nt. (nanotemper technologies). 
spike ectodomains were diluted to approximately . mg/ml. intrinsic fluorescence was measured at nm and nm while the sample was heated from to °c at a rate of °c/min. the ratio of fluorescence ( / nm) and inflection temperatures (ti) were calculated by the tycho nt. apparatus. elisa assays spike samples were pre-incubated at different temperatures then tested for antibody- or ace- binding in elisa assays as previously described. assays were run in two formats. in the first format antibodies or ace protein were coated on -well plates at µg/ml overnight at °c, washed, blocked and followed by two-fold serially diluted spike protein starting at µg/ml. binding was detected with polyclonal anti-sars-cov- spike rabbit serum (developed in our lab), followed by goat anti-rabbit-hrp and tmb substrate. absorbance was read at nm. in the second format, serially diluted spike protein was bound in individual wells of -well plates, which were previously coated with streptavidin at µg/ml and blocked. proteins were incubated at room temperature for hour, washed, then human mabs were added at µg/ml. antibodies were incubated at room temperature for hour, washed and binding detected with goat anti-human-hrp and tmb substrate. cryo-em purified sars-cov- spike preparations were diluted to a concentration of ~ . mg/ml in mm tris ph . , mm nacl and . % nan . a . -µl drop of protein was deposited on a quantifoil- . / . grid that had been glow discharged for seconds in a pelco easiglow™ glow discharge cleaning system. after a seconds incubation in > % humidity, excess protein was blotted away for . seconds before being plunge frozen into liquid ethane using a leica em gp plunge freezer (leica microsystems). frozen grids were imaged in a titan krios (thermo fisher) equipped with a k detector (gatan). no statistical analyses were performed in this study. 
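for the dsf assay described in the methods above, the instrument reports the fluorescence ratio and the inflection temperatures directly. as a rough illustration of what an inflection temperature is, the sketch below takes the ratio of the two emission channels and locates the temperature at which its first derivative is largest in magnitude. this is an assumed, simplified procedure and not the instrument's algorithm; the function names, the temperature ramp and the synthetic curve are all hypothetical.

```python
import math


def fluorescence_ratio(channel_num, channel_den):
    """Point-by-point ratio of two fluorescence channels."""
    return [a / b for a, b in zip(channel_num, channel_den)]


def first_derivative(x, y):
    """Finite-difference dy/dx: central inside, one-sided at the ends."""
    n = len(x)
    if n < 2 or n != len(y):
        raise ValueError("need two or more matched points")
    deriv = []
    for i in range(n):
        lo, hi = max(i - 1, 0), min(i + 1, n - 1)
        deriv.append((y[hi] - y[lo]) / (x[hi] - x[lo]))
    return deriv


def inflection_temperature(temps, ratio):
    """Temperature at which |d(ratio)/dT| is largest."""
    deriv = first_derivative(temps, ratio)
    best = max(range(len(deriv)), key=lambda k: abs(deriv[k]))
    return temps[best]


# Synthetic single-transition unfolding curve centered at 60 (arbitrary ramp).
temps = [35.0 + i for i in range(61)]
numerator = [(0.8 + 0.2 / (1.0 + math.exp(-(t - 60.0) / 2.0))) * 1000.0 for t in temps]
denominator = [1000.0] * len(temps)

ratio = fluorescence_ratio(numerator, denominator)
ti = inflection_temperature(temps, ratio)
```

a curve with several unfolding transitions has several local extrema in the derivative; this sketch returns only the dominant one, and real data would normally be smoothed before differentiation.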
table reagent or resource source identifier antibodies ace n/a cr n/a g n/a ab n/a ab n/a goat anti-rabbit-hrp abcam ab goat anti-human-hrp jackson immunoresearch laboratories first derivative (ratio) figure c with the s subunit colored by domain and the s subunit colored grey. rbd is colored red, ntd green, sd dark blue, sd orange and the linker between the ntd and rbd colored cyan. b. overlay of the individual protomers in the -rbd-up structure and a protomer in the c symmetric -rbd-down structure shown in figure c . the structures were superimposed using s subunit residues - (spanning the hr and ch regions). the domain colors of the up-rbd chain are as described in panel a. the down-rbds are colored salmon, the sd domains from the down rbd chains are colored light blue. the linker between the ntd and rbd in the down rbd chains are colored deep teal. c. zoomed-in view showing the association of the linker connecting the ntd and rbd with the sd and sd domains. d. zoomed-in views of individual domains marked in panel b. the n r linker spanning residues - connects the ntd and the rbd. residues - of the n r linker contribute a b-strand to the sd subdomain together forming the sd ' "super" subdomain. residues - of the n r linker contribute a b-strand to the sd subdomain together forming the sd ' "super" subdomain. e. difference distance matrices (ddm) showing structural changes between different protomers for the structures shown in figure c . the blue to white to red coloring scheme is illustrated at the bottom. ab (rbd-directed neutralizing antibody) and ab (s -directed non-neutralizing antibody) to s-gsas/d g (in blue) and the furin-cleaved s-rrar/d g ectodomain (in green) measured by elisa. the assay format was the same as in figure d . c. overlay of the individual protomers in the -rbd-up structure and a protomer in the c symmetric -down-rbd structure shown in panel a. rbd-up chain with the s subunit colored by domain and the s subunit colored grey. 
rbd is colored red, ntd colored green, sd dark blue, sd orange and the linker between the ntd and rbd colored cyan. the down rbds are colored salmon, the sd domains from the down rbd chains are colored light blue. the linker between the ntd and rbd in the down rbd chains are colored deep teal. insets show zoomed-in views of individual domains similar to the depiction in figure d . d. (left) the protomers of the -rbd-up structure of the furin-cleaved s-rrar/d g ectodomain superimposed using residues - and colored by the color of their ntd as depicted in panel a. zoomed-in views show region of the sd domain proximal to the ntd. a glycan cluster on the sars-cov- spike ectodomain is recognized by fab-dimerized glycan-reactive antibodies. biorxiv real-space refinement inphenixfor cryo-em and crystallography tmprss and furin are both essential for proteolytic activation of sars-cov- in human airway cells sars-cov- mrna vaccine design enabled by prototype pathogen preparedness sars and mers: recent insights into emerging coronaviruses an interactive web-based dashboard to track covid- in real time cold sensitivity of the sars-cov- spike ectodomain data, disease and diplomacy: gisaid's innovative contribution to global health features and development ofcoot ucsf chimerax: meeting modern challenges in visualization and analysis the bio d packages for structural bioinformatics cryo-electron microscopy structures of the sars-cov spike glycoprotein reveal a prerequisite conformational state for receptor binding structural basis for antibody-mediated neutralization of lassa virus controlling the sars-cov- spike glycoprotein conformation a multibasic cleavage site in the spike protein of sars-cov- is essential for infection of human lung cells sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor structures and distributions of sars-cov- spike proteins on intact virions tracking changes in sars-cov- spike: evidence that d g 
increases infectivity of the covid- virus a highly stable prefusion rsv f vaccine derived from structural analysis of the fusion mechanism the impact of mutations in sars-cov- spike on viral infectivity and antigenicity macromolecular structure determination using x-rays, neutrons and electrons: recent developments in phenix enhanced isolation of sars-cov- by tmprss -expressing cells immunogenicity and structures of a rationally designed prefusion mers-cov spike antigen structure of hiv- gp with gp -interactive region reveals layered envelope architecture and basis of conformational mobility ucsf chimera?a visualization system for exploratory research and analysis cryosparc: algorithms for rapid unsupervised cryo-em structure determination specific single or double proline substitutions in the "spring-loaded" coiled-coil region of the influenza hemagglutinin impair or abolish membrane fusion activity identification of structural motifs from protein coordinate data: secondary structure and first-level supersecondary structure structure-based design of prefusion-stabilized filovirus glycoprotein trimers stabilization of the soluble, cleaved, trimeric form of the envelope glycoprotein complex of human immunodeficiency virus type a bayesian view on cryo-em structure determination processing of structurally heterogeneous cryo-em data in relion nih image to imagej: years of image analysis pandemic preparedness: developing vaccines and therapeutic antibodies for covid- cell entry mechanisms of sars-cov- situ structural analysis of sars-cov- spike reveals flexibility mediated by three hinges. science function, and antigenicity of the sars-cov- spike glycoprotein d g spike mutation increases sars cov- susceptibility to neutralization. 
medrxiv cryo-em structure of the -ncov spike in the prefusion conformation cryo-em structures of mers-cov and sars-cov spike glycoproteins reveal the dynamic receptor binding domains structural and functional analysis of the d g sars-cov- spike protein variant coronaviruses -drug discovery and therapeutic options rbd "up" ( -rbd-up s) vs rbd "down" protomer # ( -rbd-up s) rbd "up" ( -rbd-up s) vs rbd "down" protomer ( -rbd-down s) rbd "up" ( -rbd-up s) vs rbd "down" protomer # ( -rbd-up s) rbd "down" protomer # ( -rbd-up s) vs rbd "down" protomer # ( -rbd-up s) rbd "down" protomer # ( -rbd-up s) vs rbd "down" protomer ( -rbd-down s) cryo-em data were collected at the national center for cryo-em access and training (nccat) and the simons electron microscopy center located at the new york structural biology center, supported by the nih common fund transformative high resolution cryo-electron microscopy program (u gm ) and by grants from the simons foundation key: cord- -jpkxjn e authors: brielle, esther s.; schneidman-duhovny, dina; linial, michal title: the sars-cov- exerts a distinctive strategy for interacting with the ace human receptor date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: jpkxjn e the covid- disease has plagued over countries and has resulted in over , deaths within weeks. we compare the interaction between the human ace receptor and the sars-cov- spike protein with that of other pathogenic coronaviruses using molecular dynamics simulations. sars-cov, sars-cov- , and hcov-nl recognize ace as the natural receptor but present a distinct binding interface to ace and a different network of residue-residue contacts. sars-cov and sars-cov- have comparable binding affinities achieved by balancing energetics and dynamics. the sars-cov- –ace complex contains a higher number of contacts, a larger interface area, and decreased interface residue fluctuations relative to sars-cov. 
these findings expose an exceptional evolutionary exploration exerted by coronaviruses toward host recognition. we postulate that the versatility of cell receptor binding strategies has immediate implications on therapeutic strategies. one sentence summary molecular dynamics simulations reveal a temporal dimension of coronaviruses interactions with the host receptor. to gain access to host cells, coronaviruses rely on spike proteins, which are membrane-anchored trimers containing a receptor-binding s segment and a membrane-fusion s segment ( ) . the s segment contains a receptor binding domain (rbd) that recognizes and binds to a host cell receptor. the angiotensin-converting enzyme (ace ) was identified as the critical receptor for mediating sars- entry into host cells ( , ) . binding of the spike protein to the receptor is a critical phase where the levels of the ace expressed on the cell membrane correlates with viral infectivity, and govern clinical outcomes ( ) . consistent with the clinical pulmonary manifestation, ace is widely expressed in almost all tissues, with the highest expression levels in the epithelium of the lung ( ) . similar to the sars- virus, the covid- virus enters the host cell by rbd binding to the host cell ace receptor ( , , ) . host receptor recognition for cell entry is, however, not specified by the cov genus classification. mers-cov is a member of the bcov genus but does not recognize the ace receptor. in contrast, hcov-nl is a member of the acov genus and does recognize the ace receptor ( ) . herein, we analyze the binding of several cov rbds to ace with molecular dynamics (md) simulations and compare the stability, relative interaction strength, and dynamics of the interaction between the viral spike protein and the human ace receptor. the covid- rbd (residues - ) shares a . % sequence identity and high structural similarity with the sars- rbd ( table ). in contrast, the rbd of hcov-nl is only . 
% identical to that of covid- and there are no significant structural similarities between them (fig. s ). remarkably, the rbd of mers-cov, which is structurally similar to that of covid- ( . % sequence identity, % structure similarity), recognizes a different host receptor (dpp ) for its cell entry and does not bind ace ( ). we ran ns molecular dynamics (md) simulations of ace in complex with the rbds of the covid- , sars- , and hcov-nl viruses to quantify the energetics and the dynamics of the different rbd-ace interactions. the simulation trajectory snapshots at ps intervals ( , frames) were analyzed by a statistical potential to assess the probability of the rbd-ace interaction (soap score, ( )), with lower values corresponding to higher probabilities and thus higher affinities. the interaction scores for covid- rbd-ace were comparable to those of sars- , with medians of - . and - . , respectively (fig. a). the rbd-ace interaction scores of hcov-nl are higher than those of both sars-covs (median of - . ). mers, which is structurally similar to covid- (table ), does not bind ace ; it binds dipeptidyl peptidase- (dpp , also known as cd ( )), and its rbd-ace interaction scores indicate extremely weak affinity (median of - . ), as expected from a non-cognate receptor interaction. covid- has the largest buried surface area at the interface ( Å ), followed by the interface area for sars- ( Å ) and hcov-nl ( Å ). the number of ace contacting residues maintains the same order, with , , and for covid- , sars- , and hcov-nl , respectively (fig. c). the three rbds exploit specific binding sites on ace based on the analysis of the md trajectories (fig. , c and d; movie s ). there is a significant overlap of ace interacting residues between covid- and sars- (at least %), while hcov-nl shares only % and % of contacts with sars- and covid- , respectively.
these findings suggest that the coronaviruses exert different interaction strategies with their cognate receptors to achieve the affinity that is required for effective cell entry. an ace residue is considered as part of the interface if one of its atoms is within Å from any rbd atom in at least % of the , md simulation frames. (d) overlay of snapshots for each of the three rbds. the ace is in surface representation (gray). the frames were aligned using the n-terminal fragment of ace that contains the two helices participating in the rbds binding. while the sequence identity between the rbds of covid- and sars- is % (table ) , we observe a significantly higher residue substitution rate at the interaction interface with the ace receptor. out of rbd interface residues, only residues ( %) in covid- are conserved with respect to sars- ( fig. a , table s , fig. s ). similarly, only residues ( %) in sars- are conserved with respect to covid- . to investigate these interface residues, we construct and overlay the contact maps for the rbd-ace interfaces for covid- and sars- (fig. b) . we define a residue-residue contact frequency (cf) as the fraction of md trajectory frames in which the contact appears. remarkably, only out of the total residue-residue interface contacts have comparable (< % difference) contact frequencies between the covid- -ace and sars- -ace interfaces ( fig. b , colored gray). furthermore, we find two interaction patches unique to covid- ( fig. b , patches and ) and another patch unique to sars- ( fig. b , patch ). covid- has a significant and unique contact site between residues - of the rbd and residues - of ace (fig. , b and c). covid- also creates a new interaction patch with the middle of the n-terminal ace helix (fig. , b and c), while sars- has a unique interaction patch with the end of the same helix (fig. , b and c) . 
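The contact frequency defined above (the fraction of md trajectory frames in which a residue-residue contact appears) and the notion of "comparable" contacts between two complexes can be sketched as follows. This is a minimal illustration, not the authors' pipeline: it assumes that contacts have already been extracted per frame as (rbd residue, ace residue) pairs, that the two complexes are keyed in a shared residue numbering, and that the difference threshold is passed in as `max_diff` because its value is not recoverable from the text. All names and numbers are hypothetical.

```python
from collections import Counter


def contact_frequencies(frames):
    """Fraction of frames in which each residue-residue contact appears.

    frames: list with one collection of (rbd_residue, ace_residue)
    pairs per trajectory frame.
    """
    counts = Counter()
    for contacts in frames:
        counts.update(set(contacts))  # count each pair once per frame
    n_frames = len(frames)
    return {pair: count / n_frames for pair, count in counts.items()}


def comparable_contacts(cf_a, cf_b, max_diff):
    """Pairs whose contact frequencies differ by less than max_diff.

    A pair missing from one complex is treated as frequency 0.0 there.
    """
    pairs = set(cf_a) | set(cf_b)
    return {p for p in pairs
            if abs(cf_a.get(p, 0.0) - cf_b.get(p, 0.0)) < max_diff}


# Hypothetical four-frame trajectory with made-up residue indices.
frames = [
    {(1, 10), (2, 11)},
    {(1, 10)},
    {(1, 10), (3, 12)},
    {(2, 11)},
]
cf = contact_frequencies(frames)
other = {(1, 10): 0.7, (2, 11): 0.1}
shared = comparable_contacts(cf, other, max_diff=0.1)
```

Pairs outside the comparable set are the candidates for complex-specific interaction patches of the kind described in the text.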
the rest of the changes in the interface contact frequencies are due to the different interface loop conformations (covid- residue numbers - , sars- residue numbers - ) (fig. , a and b , table s ). covid- has a significantly higher number of well-defined contact pairs compared to sars- : vs. contacts (with and unique pairs, excluding the ones with similar cfs) were found for rbd-ace of the covid- and sars- , respectively (fig. b) . results from fig. expose the accelerated evolution among the key anchoring residues of the rbd-ace interface. this comparison raises the following question: how does sars- rbd reach an ace binding affinity that is comparable to that of covid- but with fewer contact pairs and a smaller interface area? the distribution of soap scores throughout the simulation trajectory has a larger fluctuation range for sars- , relative to covid- ( fig. , a and b; fig. s a ) suggesting that sars- -ace interaction is fluctuating between several structural states. moreover, analysis of contact frequencies along the entire trajectory reveals that none of the sars- contacts are maintained over % of the frames while covid- still maintains about half of its contacts at % of the trajectory (fig. s b) . to investigate the dynamics of covid- binding compared to sars- , we calculate the root-mean-square fluctuation (rmsf) of each residue with respect to the lowest energy snapshot from their respective ns md simulation trajectory. the interface region in the rbd contains two loops (loop : residues - , loop : residues - ; using covid- numbering, fig. d ) that bind to the ace n-terminal helix on both of its ends. these two loops are highly flexible in the sars- rbd (fig. , a and d) . while loop is also fluctuating in the covid- rbd, albeit much less, loop remains relatively rigid in the covid- rbd. in addition, we find that in the covid- -rbd, a region centered around k leads to further stability relative to the corresponding region in sars- . 
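The per-residue root-mean-square fluctuation used above can be written, for residue i over T frames and relative to a reference snapshot, as RMSF_i = sqrt((1/T) * sum_t |r_i(t) - r_i_ref|^2). The sketch below computes this quantity under simplifying assumptions that are not stated in the source: each residue is represented by a single coordinate (for example its Cα atom), and all frames have already been superimposed on the reference. The function name and the toy coordinates are hypothetical.

```python
import math


def rmsf(trajectory, reference):
    """Per-residue RMSF relative to a reference snapshot.

    trajectory: list of frames, each a list of (x, y, z) tuples,
                one tuple per residue, already aligned to the reference.
    reference:  list of (x, y, z) tuples for the reference snapshot
                (in the text, the lowest-energy frame).
    """
    n_frames = len(trajectory)
    values = []
    for i, ref in enumerate(reference):
        total = 0.0
        for frame in trajectory:
            # squared displacement of residue i in this frame
            total += sum((a - b) ** 2 for a, b in zip(frame[i], ref))
        values.append(math.sqrt(total / n_frames))
    return values


# Toy example: residue 0 never moves, residue 1 moves 2 units in one of two frames.
reference = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
trajectory = [
    [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
    [(0.0, 0.0, 0.0), (1.0, 2.0, 0.0)],
]
fluctuations = rmsf(trajectory, reference)
```

Loops that sample several conformations give large values, whereas an anchored region stays close to zero, which is the contrast the text draws between the two rbds.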
we attribute this difference to the unique interaction of covid- at position k with the middle of the n-terminal ace helix, which serves as an anchor site to the receptor (fig. c and fig. a). the contribution of k to ace binding is observed in a recent cryoem structure of the covid- spike protein bound to ace ( ). overall, covid- is more rigid compared to sars- (fig. , a and d). we investigate the dynamics of a designed sars (sars-des) variant ( ) table s ). the l f mutation is of special interest for the covid- rbd as well because it has this same substitution. our md simulation analysis reveals that sars-des has substantially lower interaction scores with ace (median of - . , fig. s ), as expected for an optimized human ace -binding rbd design. we observed that these two mutations not only enhance the binding affinity to ace , but also lead to a substantial stabilization of the interaction interface. the fluctuation signatures along the rbd of sars-des are surprisingly similar to those recorded for covid- (fig. , b and c). thus, the switch from a flexible binding mode (for sars- ) to a stable one (covid- and sars-des, fig. b) highlights the remarkable capacity of the rbd to adopt alternative receptor binding strategies driven by a minimal number of amino acid substitutions. this analysis reveals the critical role of l f (sars-des residue f ) in stabilizing the covid- -ace interface and in reducing the number of states of the covid- spike protein bound to an ace receptor. experimental affinity measurements (e.g. surface plasmon resonance, spr) confirm the high affinity of sars- rbd-ace binding, with an equilibrium dissociation constant (kd) of ~ mm ( ) ( ) ( ) ( ), similar to the binding affinity of ace and the covid- rbd ( , ). our md-based calculation is consistent with sars- displaying a similar but slightly higher affinity relative to covid- (fig. a, fig. s and table s ).
binding affinity is achieved through a combination of interface contact optimization and protein stability (fig. e). while the rbd-ace complex can be resolved at high resolution by cryo-em ( , ), md simulations provide orthogonal information about the interaction dynamics on a nanosecond timescale. in the case of covs, md simulations reveal an exceptional versatility of viral receptor binding strategies (fig. e). covid- adopted a different strategy for achieving comparable affinity to sars- : the interface of covid- is significantly larger than that of sars- ( Å vs. Å) with a remarkable number of interacting residues (ace : vs. , fig. c). in contrast, sars- is more flexible in its interaction with ace , interacting through fewer contacts that serve as "hot spots". therefore, we predict that sars- rbd neutralizing antibodies will not be effective for covid- . the failure of several of these antibodies to neutralize the binding of covid- rbd to its receptor is consistent with our findings ( , ). the fluctuation from high- to low-affinity conformations in sars- leads to an increased efficacy for inhibiting peptides ( ) and high-affinity antibodies ( ) compared to covid- . this implies that the enhanced rigidity of the covid- rbd relative to that of sars- poses a therapeutic challenge. the geometric and physicochemical properties of rbd-ace interfaces resemble those of antibody-antigen interactions. in both cases the interface benefits from long loop plasticity, bulky aromatic side chains as anchoring sites, and the stabilization of the complex by distributed electrostatic interactions ( ). both covid- and sars- interfaces contain long flexible loops and nine aromatic residues (tyr, trp, phe) in the interface with ace (fig. a). moreover, in the sars designed variant (sars-des ( )), the addition of an aromatic residue (l f substitution) significantly improved the interaction scores and interface stability (fig. , b and d).
our findings shed light on the accelerated evolution of spike protein binding to the ace receptor similar to the rapid evolution along the antibody-antigen affinity maturation process. structural modeling the structural model of the covid- spike protein receptor binding domain (rbd) in complex with ace was generated by comparative modeling using modeller . ( ) with the covid- sequence (refseq: yp_ . ). we relied on the crystal structure of the spike protein receptor-binding domain from a sars coronavirus designed human strain complexed with the human receptor ace (pdb sci, resolution . Å) as a template for comparative modeling. the sars- spike protein rbd and hcov-nl in complex with ace were taken from pdb ajf (resolution . Å) and kbh (resolution . Å), respectively. missing residues were added in modeller. mers rbd structure was taken from the complex with the neutralizing antibody cdc -c (pdb c z, resolution . Å) and structurally aligned onto sars- rbd in complex with ace receptor. the designed variant is from pdb sci. the md simulations were performed with gromacs software ( ) using the charmm m force field ( ) . each of the complexes was solvated in transferable intermolecular potential with points (tip p) water molecules and ions were added to equalize the total system charge. the steepest descent algorithm was used for initial energy minimization until the system converged at fmax < , kj/(mol · nm). then water and ions were allowed to equilibrate around the protein in a two-step equilibration process. the first part of equilibration was at a constant number of particles, volume, and temperature (nvt). the second part of equilibration was at a constant number of particles, pressure, and temperature (npt). for both md equilibration parts, positional restraints of k = , kj/(mol · nm ) were applied to heavy atoms of the protein, and the system was allowed to equilibrate at a reference temperature of k, or reference pressure of bar for ps at a time step of fs. 
following equilibration, the production simulation duration was nanoseconds with fs time intervals. altogether , frames were saved for the analysis at intervals of ps. we superimposed several md snapshots on the recently submitted to the pdb x-ray structure ( vw , resolution . Å) of covid- -ace complex. the average rmsd over the interface ca atoms is ~ Å. interaction scores between the virus spike rbd and ace were calculated for each frame of the trajectory using the soap statistical potential ( ) . in the interface contact analysis, a residue-residue contact was defined based on the inter-atomic distance, with a cutoff of Å. table s . rbd-ace interface evaluated by several methods for analysis of protein-protein interactions movie s . overlay of random snapshots from the md trajectories of covid- -ace , sars- -ace , and hcov-nl -ace complexes. for clarity only one copy of ace is shown (gray), covid- , sars- , and hcov-nl are colored blue, red, and green, respectively. characterization of a novel coronavirus associated with severe acute respiratory syndrome return of the coronavirus: -ncov coronavirus pathogenesis and the emerging pathogen severe acute respiratory syndrome coronavirus epidemiology and clinical characteristics of human coronaviruses oc , e, nl , and hku : a study of hospitalized children with acute respiratory tract infection in guangzhou, china a new coronavirus associated with human respiratory disease in china structural basis for human coronavirus attachment to sialic acid receptors a crucial role of angiotensin converting enzyme (ace ) in sars coronavirusinduced lung injury susceptibility to sars coronavirus s protein-driven infection correlates with expression of angiotensin converting enzyme and infection can be blocked by soluble receptor exogenous ace expression allows refractory cell lines to support severe acute respiratory syndrome coronavirus replication tissue distribution of ace protein, the functional receptor for sars coronavirus. 
a first step in understanding sars pathogenesis receptor recognition by novel coronavirus from wuhan: an analysis based on decade-long structural studies of sars human coronavirus nl employs the severe acute respiratory syndrome coronavirus receptor for cellular entry the s proteins of human coronavirus nl and severe acute respiratory syndrome coronavirus bind overlapping regions of ace structure of mers-cov spike receptor-binding domain complexed with human receptor dpp tm-align: a protein structure alignment algorithm based on the tm-score optimized atomic statistical potentials: assessment of protein interfaces and loops structural basis for the recognition of the -ncov by human ace . biorxiv computational characterization and design of sars coronavirus receptor recognition and antibody neutralization receptor and viral determinants of sars-coronavirus adaptation to human ace potent binding of novel coronavirus spike protein by a sars coronavirusspecific human monoclonal antibody crystal structure of nl respiratory coronavirus receptor-binding domain complexed with its human receptor structure, function and antigenicity of the sars-cov- spike glycoprotein. biorxiv cryo-em structure of the -ncov spike in the prefusion conformation a hexapeptide of the receptorbinding domain of sars corona virus spike protein blocks viral entry into host cells via the human receptor ace the spike protein of sars-cov--a target for vaccine and therapeutic development the indistinguishability of epitopes from protein surface is explained by the distinct binding preferences of each of the six antigen-binding loops comparative protein structure modeling using modeller gromacs . : a high-throughput and highly parallel open source molecular simulation toolkit charmm m: an improved force field for folded and intrinsically disordered proteins acknowledgments. the authors gratefully acknowledge barak raveh for useful suggestions. 
tables s -s movie s references ( ) ( ) ( ) residue key: cord- - tn al authors: ni, ling; ye, fang; cheng, meng-li; feng, yu; deng, yong-qiang; zhao, hui; wei, peng; ge, jiwan; gou, mengting; li, xiaoli; sun, lin; cao, tianshu; wang, pengzhi; zhou, chao; zhang, rongrong; liang, peng; guo, han; wang, xinquan; qin, cheng-feng; chen, fang; dong, chen title: detection of sars-cov- -specific humoral and cellular immunity in covid- convalescent individuals date: - - journal: immunity doi: . /j.immuni. . . sha: doc_id: cord_uid: tn al summary the world health organization has declared sars-cov- virus outbreak a world-wide pandemic. however, there is very limited understanding on the immune responses, especially adaptive immune responses to sars-cov- infection. here, we collected blood from covid- patients who have recently become virus-free and therefore were discharged, and detected sars-cov- -specific humoral and cellular immunity in newly discharged patients. follow-up analysis on another cohort of patients weeks post discharge also revealed high titers of igg antibodies. in all patients tested, displayed serum neutralizing activities in a pseudotype entry assay. notably, there was a strong correlation between neutralization antibody titers and the numbers of virus-specific t cells. our work provides a basis for further analysis of protective immunity to sars-cov- , and understanding the pathogenesis of covid- , especially in the severe cases. it has also implications in developing an effective vaccine to sars-cov- infection. *these authors contributed equally to this work. #to whom correspondence should be addressed: chen dong, chendong@tsinghua.edu.cn; or fang chen, anzhenchenfang@ .com; cheng-feng qin, qincf@bmi.ac.cn. introduction at the end of , patients with coronavirus disease were identified in wuhan, china (wang et al., ) , infected by a novel coronavirus, now named as severe acute respiratory syndrome coronavirus (sars-cov- ). 
the world health organization (who) first declared this outbreak a public health emergency of international concern (phelan et al., ) and subsequently a world-wide pandemic (di pierro et al., ). the genome sequence of sars-cov- bears % (zhou et al., ) and . % identity to that of a bat coronavirus and sars-cov, respectively (zhu et al., ). like sars-cov and mers-cov, sars-cov- belongs to the beta genus coronavirus in the coronaviridae family (lu et al., ). clinically, several papers showed that most covid- patients developed lymphopenia as well as pneumonia with higher plasma levels of pro-inflammatory cytokines in severe cases (chan et al., ; huang et al., ; wu et al., ), suggesting that the host immune system is involved in the pathogenesis (mahallawi et al., ; nicholls et al., ). patients infected by sars-cov or mers-cov were previously reported to have antibody responses (ko et al., ; shi et al., ; wang et al., (thevarajan et al., ). one covid- patient in finland was shown to possess a low level of neutralizing antibody titer (haveri et al., ). however, virus-specific t lymphocytes and their relationships with neutralizing antibody titers in covid- patients remain uncharacterized. in this study, we collected blood from covid- patients who had recently become virus-free and were therefore discharged, and analyzed their sars-cov- -specific antibody and t cell responses. whereas the remaining were weeks post discharge (follow-up patients, patients # - ). only three travelled to wuhan city within the past months. in line with the previous reports (wang et al., ), patients (# , ) showed lymphopenia (normal range is . - . x e cells per l). sera from three healthy donors (wang et al., ) were obtained before the sars-cov- outbreak (healthy donor # - ). additional healthy donors (# - ) who had been in close contact with the patients were recruited in this study.
human ab serum collected from healthy male ab donors in the us (gemcell, ca) was used as a negative control. in order to detect anti-viral immune responses, we first constructed recombinant pet -n- xhis by linking copies of his tag to the c-terminus of np in the pet -n vector (biomed, cat. number: bm ). escherichia coli transformed with pet -n- xhis was lysed and tested by coomassie blue staining to confirm np expression at . kda. np was further purified by ni-nta affinity chromatography and gel filtration. the purity of np was approximately % (figure s a). the presence of np was subsequently confirmed by anti-flag antibody (figure s b). the receptor-binding domain (rbd) of s protein (s-rbd) and main protease (lan et al., ) were produced by a baculovirus insect expression system and purified to a purity of % (figure s a). using sera from patients and healthy donors, igg and igm against sars-cov- np, main protease and s-rbd antigens were analyzed. there was no significant antibody response to main protease in sera from several patients (data not shown), suggesting that it may not serve as an antigen for humoral immunity. we thus focused on np and s-rbd. the individual serum samples were then serially diluted to determine the optimal dilutions (figure a). a dilution of : was used for igm and : for igg. np- and s-rbd-specific igm and igg antibodies were both detected in the sera of newly discharged patients, compared with healthy donor groups. anti-sars-cov- igg antibodies were also more readily detected than igm in the follow-up patients (# - ), when compared with healthy donors (figure b). in addition, values from the serum dilution curves were used to determine the area under the curve (auc). compared to control sera, covid- patient sera showed significantly higher auc for np- and s-rbd-specific igg antibodies (figure c).
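The area under the curve (auc) of a serum dilution series, as used above to compare patient and control sera, can be illustrated with a short sketch. The text does not state how the auc was computed, so two assumptions are made here and should be read as such: the x-axis is the base-10 logarithm of the dilution factor, and the area is obtained with the trapezoidal rule. The function name and the od readings are hypothetical.

```python
import math


def dilution_auc(dilution_factors, od_values):
    """Trapezoidal area under an od-versus-log10(dilution) curve.

    dilution_factors: e.g. 10 for a 1:10 dilution (must be > 0).
    od_values:        optical density measured at each dilution.
    """
    points = sorted(zip((math.log10(d) for d in dilution_factors), od_values))
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2.0  # one trapezoid per interval
    return area


# Made-up readings for a patient-like and a control-like serum.
dilutions = [10, 100, 1000]
patient_auc = dilution_auc(dilutions, [2.0, 1.0, 0.0])
control_auc = dilution_auc(dilutions, [0.2, 0.1, 0.0])
```

Summarizing each curve as one number lets sera be compared without picking a single dilution, at the cost of depending on the dilution range that was measured.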
taken together, these findings indicate that covid- patients mounted igg and igm responses to sars-cov- proteins, especially np and s-rbd, and also suggest that infected patients could maintain their igg amounts, at least for two weeks after discharge. in addition, igg isotypes were further tested in patients and controls. as shown in figure d, anti-np and s-rbd igg was mainly of the igg isotype, and the newly discharged and follow-up patients showed similar amounts of anti-np igg . of interest, one patient (pt# ) showed higher amounts of anti-np igg , whereas anti-s-rbd igg was detected in two patients (pt# - ). however, we did not detect igg to either np or s-rbd proteins (data not shown). since the rbd of the s protein has been shown to bind to human angiotensin converting enzyme (ace ) (zhou et al., ), the existence of antibodies against it may suggest neutralization of sars-cov- infection. to assess this, we performed a pseudovirus particle-based neutralization assay, since there was a significantly positive correlation in the neutralizing antibody titers between pseudovirus and sars-cov- (figure a). as shown in figure b and c, patients # , , , and , all within the newly discharged group, had high neutralizing antibody titers. these results demonstrate that most recently discharged patients had strong humoral immunity to sars-cov- . among the follow-up patients, all had neutralizing antibody titers except patient # , who was negative. as expected, there was a significant correlation between neutralizing antibody titers and auc of anti-s- to explore cellular immune responses to sars-cov- , we isolated peripheral blood mononuclear cells (pbmcs) from the whole blood and phenotypically analyzed them by flow cytometry (figure a). we found that compared to newly discharged patients, there was a trend towards an increased frequency of nk cells in the follow-up patients (figure b).
however, there was no significant difference in terms of the percentages of t cells among those two groups and the healthy donors. to assess virus-specific cellular immunity, we then treated pbmcs with recombinant np, main protease and s-rbd, followed by ifn-γ elispot analysis. the results were considered positive if there were at least -fold increase in the numbers of ifn-γ-secreting t cells in the subject than in the healthy donors. as shown in figure c , compared with healthy donors, the numbers of ifn-γ-secreting np-specific t cells in patients # , , , and were much higher than other patients, suggesting that they had developed sars-cov- -specific t cell responses. of note, patients # , , , and developed both strong humoral and cellular immune responses. main protease-specific t cells were detected in patient # , and , while patients # , , , , , and showed s-rbd-specific t cells. although the numbers of ifn-γ-secreting s-rbd specific t cells were much lower than those of np-specific t cells, they could be detected in more patients than those for other viral proteins. in the follow-up patients, only patient # who showed lymphopenia before treatment still had a high number of ifn-γ-secreting t cells in response to np, main protease and s-rbd ( figure c ), which suggests that anti-viral t cells may not be maintained at high numbers in the pbmcs in the recovered patients. more interestingly, when combining all patients in our analysis, there was a significant correlation between the neutralizing antibody titers and the numbers of np- in this study, we characterized sars-cov- -specific humoral and cellular immunity in recovered patients. both were detected in newly discharged patients. in addition, the neutralizing antibody titers significantly correlated with the numbers of np-specific t cells. these findings suggest both b and t cells participate in immune-mediated protection to viral infection. 
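the elispot positivity rule and the titer-versus-t-cell correlation described above can be illustrated as follows. this is a hedged sketch, not the authors' method: the fold threshold is not recoverable from the text, so it is a required argument; the healthy-donor baseline is assumed to be the mean of the donor counts; and pearson correlation is assumed because the text does not name the correlation method. the function names `elispot_positive` and `pearson_r` are hypothetical.

```python
import math


def elispot_positive(subject_spots, healthy_spots, fold_threshold):
    """True if the subject's IFN-gamma spot count reaches the fold threshold.

    Assumption: the baseline is the mean spot count of the healthy donors.
    fold_threshold must be supplied by the caller (not stated in the text).
    """
    if not healthy_spots:
        raise ValueError("need at least one healthy-donor value")
    baseline = sum(healthy_spots) / len(healthy_spots)
    return subject_spots >= fold_threshold * baseline


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for constant input")
    return sxy / math.sqrt(sxx * syy)
```

in use, `pearson_r` would take the per-patient neutralizing antibody titers as one sequence and the per-patient numbers of np-specific t cells as the other; a rank-based coefficient could be substituted if the data are not approximately linear.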
our work has thus provided a basis for further analysis of protective immunity to sars-cov- , and understanding the pathogenesis of covid- , especially in the severe cases. it has also implications in designing an effective vaccine to protect and treat sars- in our study, production of s-rbd-specific antibodies were readily detected in recovered patients. moreover, we observed virus-neutralization activities in these recovered patients. not surprisingly, a significant correlation between neutralizing antibody titers and auc of anti-s-rbd igg, but not anti-np igg, was observed. anti-s-rbd igg might be useful in nonetheless, in our study and the one mentioned above, most patients developed measurable neutralization antibodies after infection, suggesting that the viral infection does not curtail adaptive immunity. however, unlike the above-mentioned study, we did not find any correlation between neutralizing antibody titers and patient's age, which could be due to our small sample size. our results thus need further confirmation in a large cohort of covid- patients. in addition, our analysis could not differentiate cd + and cd + t cell responses, due to the limitation in the amounts of pbmcs obtained and availability of instrumentation. the plasmid (pet -n- xhis) generated in this study will be made available on request from the lead contact without restriction. the study did not generate any unique dataset or code. committee at tsinghua university. informed consent was obtained from all subjects for being included in the study. all patient data were anonymized before study inclusion. see table for full patient information, including age, sex, and health status. cell lines huh- cells originally taken from a liver tumor in a japanese male were cultured in dmem supplemented with % fbs. cells were grown at °c in a % co setting. the od value at nm was calculated. 
neutralizing antibody assay pseudovirus expressing the sars-cov- s protein was produced as described previously (deng et al., ) . pnl luci and gp-pcaggs were co-transfected into t cells. highlights: . sars-cov- -specific antibodies are detected in covid- convalescent subjects. . most covid- convalescent individuals have detectable neutralizing antibodies. . cellular immune responses to sars-cov- are found in covid- convalescent human convalescence sera notes: pt, patient; f, female; m, male; p, positive; n, negative; bt, before treatment; na key: cord- -ufyzqgqk authors: aguilar-pineda, jorge alberto; albaghdadi, mazen; jiang, wanlin; lopez, karin j. vera; del-carpio, gonzalo davila; valdez, badhin gómez; lindsay, mark e.; malhotra, rajeev; lino cardenas, christian l. title: structural and functional analysis of female sex hormones against sars-cov cell entry date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: ufyzqgqk emerging evidence suggests that males are more susceptible to severe infection by the sars-cov- virus than females. a variety of mechanisms may underlie the observed gender-related disparities including differences in sex hormones. however, the precise mechanisms by which female sex hormones may provide protection against sars-cov- infectivity remains unknown. here we report new insights into the molecular basis of the interactions between the sars-cov- spike (s) protein and the human ace receptor. we further observed that glycosylation of the ace receptor enhances sars-cov- infectivity. importantly estrogens can disrupt glycan-glycan interactions and glycan-protein interactions between the human ace and the sars-cov thereby blocking its entry into cells. in a mouse model, estrogens reduced ace glycosylation and thereby alveolar uptake of the sars-cov- spike protein. 
these results shed light on a putative mechanism whereby female sex hormones may provide protection from developing severe infection and could inform the development of future therapies against covid- . the novel coronavirus disease (covid- ) global pandemic caused by infection with the severe acute respiratory syndrome coronavirus (sars-cov ) virus has infected nearly million people worldwide resulting in nearly , deaths as july , . emerging data suggests that males are more susceptible to covid- infection and are at higher risk of critical illness and death than females [ ] [ ] [ ] . there has been consistent evidence of an increased case fatality rate (cfr) among males in nearly every country with available sex-disaggregated data including peru, france, greece, italy, mexico, pakistan, philippines and spain amounting to a . times higher cfr than females . understanding the mechanisms underlying enhanced covid- susceptibility and disease severity in males is key to developing new therapies and guiding vaccine development. changes in sex hormone concentration over an individual's lifetime and associated risk of comorbid conditions, such as cardiovascular diseases, may also contribute to variability in disease susceptibility and severity . it has been postulated that the male-biased sex divergence in covid- deaths could be, in part, explained by the strict relationship between sex hormones and the expression of the entry receptor for sars-cov , the angiotensin converting enzyme (ace ) receptor , . molecular studies have demonstrated that the male hormone testosterone regulates the expression of ace and the transmembrane serine protease (tmprss ) which is an androgen-responsive serine protease that cleaves the sars-cov- spike (s) protein and facilitates viral entry via ace binding [ ] [ ] [ ] . androgen-driven upregulation of ace levels may therefore be associated with increased vulnerability to severe infections in male patients with covid- . 
paradoxically, ace plays an important role in lung protection during injury which is attenuated by the binding of sars-cov- . the presence of a male-biased dependence in covid- susceptibility may imply the presence of a protective factor against sars-cov- infectivity in women. in addition to the ability of sex hormones to modulate expression of proteins related to entry into host cells, both estrogens and androgens are also able to directly modulate immune cell function via receptor-mediated effects , . additionally, sex chromosomes may mediate more favorable outcomes among women compared to men affected with covid- . x-linked genes associated with immune function tend to be expressed more often in females who generally have two x chromosomes compared to males . additional clues to the possible protective effects of estrogens have been suggested by differences in dietary patterns among countries with different cfrs . interestingly, countries with the lowest cfrs including japan and korea are the largest consumers of isoflavones-based foods, also known as phytoestrogens, that may also mediate favorable effects on ace expression and therefore covid- risk [ ] [ ] [ ] . the observation that females and those individuals consuming higher levels of isoflavones may be protected from covid- infection and adverse consequences indicates a potential protective role of estrogens against sars-cov- . here, we examine the role of two estrogen molecules ( β-diol and s-equol) to modulate the ace -dependent membrane fusion protein and reduce cell entry of the sars-cov spike protein into lung cells. to the best of our knowledge, we report new findings regarding the importance of molecular interactions between hace and the viral spike (s) protein. furthermore, we provide insights into the molecular basis for our observations that estrogens impair sars-cov entry and highlight the potential for estrogens as an agent in patients with covid- . 
glycosylation site-mapping of human ace and sars-cov- spike interactions. recent studies , have shown the ability of the sars-cov virus to utilize a highly glycosylated spike (s) protein to elude the host's immune system and bind to its target membrane receptor, ace , thus enabling entry into human cells. based on the structural complementarity and steric impediments between the s protein and human ace (hace ) protein membranes, we mapped the glycosylation sites of both models [ ] [ ] [ ] [ ] and performed molecular dynamics simulations (mds) by ns to stabilize the glycosylated sars-cov spike (s) and hace complex (suppl. table ., suppl. figure and figure a ). these analyses revealed that glycosylation of the ace protein increases the affinity of the virus s protein to interact with the receptor via glycan-glycan interactions, glycan-protein interactions, hydrogen, and hydrophobic bonds (suppl. table . and figure b) . notably, glycan-glycan interactions occur between the ace glycan at n and n and glycans found on the spike's receptor binding domain (s-rbd) at n and n (figure c , left panel). despite the close interaction between ace and s-rbd glycans, their affinity to anchor with highly negatively charged molecules such as the ace protein remains unalterable (figure c right panel) suggesting that glycan and electrostatic-dependent surface tethering may represent a plausible mechanism for ace-s-rbd binding and cell infection. the glycan-protein interactions occur between the ace glycan at n and the residues of the s-rbd at n , s , n , l , v , g , v , q , t and q (figure d ). while ace residues at d , y , w and g form hydrogen bonds with residues of the s-rbd at n , d , s , n and e (figure f ). multiple distinct clusters of hydrophobic residues at the ace surface were also found to interact with the s-rbd protein (suppl. figure ). 
importantly one key hydrophobic region on the ace surface at t interacts with five residues of the s-rbd (p , g , f , g and y ), (figure g ). given the insights afforded by our in silico mds experiments, we sought to explore the impact of ace glycosylation on s-rbd cell entry using cultured human umbilical vein endothelial cells (huvecs). a variety of saccharide substrates were utilized for their ability to modulate glycosylation profiles in cells. the glycosylation pattern of the endogenous ace was increased in nearly all treated cells ( figure h ). notably co-incubation of huvecs with ug of recombinant s-rbd (rs-rbd) protein revealed that glucose ( mm) pre-treatment was associated with the greatest degree of rs-rbd entry into the cells by ~ fold compared with hypoglycemic media (hbss or optimen) cells ( figure g ). this model indicates that glycosylated residues surrounding the cavity at the top of the ace molecule could increase binding by the s-rbd. given the possibility that occupancy at glycosylated residues or s-rbd binding sites by estrogens could modify the affinity of the sars-cov virus and alter entry into the cell thereby reducing infectivity, we sought to further examine these interactions using a range of complementary experimental approaches (see table s ). estrogens bind to hace and stimulate its stabilization and internalization. in an effort to explore the potential protective effects of female sex hormones against sars-cov- infection, we examined the impact of estradiol ( β-diol) and a dietary-derived phytoestrogen (s-equol) on hace structure and protein expression by a combination of in silico modeling, in vitro, and in vivo analysis. specifically, in light of the importance of glycan-glycan interactions that mediate virus-ace interactions, we sought to analyze the effect of estrogens on key molecular viral and receptor binding sites. in agreement with yan et al. 
, we identified three important regions on the ace surface that are utilized for sars-cov- binding. the environment of these regions is composed of a high density of glycans, including a helix α from residues i to t , a helix α from residues v to m , and one loop from residues k to g (suppl. figure and figure a ). we then homogenously solvated the glycosylated hace structure with . mm of β-diol or . mm of s-equol followed by ns of mds. remarkably we found that the βdiol molecules interact with residues at f , y , q , t , q , m and the s-equol molecules interacts with residues at q , k , t , f , k , e , l , n , d , k , a , f , e , q and l (figure b , supp. table # ). both estrogen molecules energetically stabilized the α and α helices by physical interactions and thereby minimized the fluctuation of the ace chains a and b (figure c , supp #). importantly , our calculation of free-energy landscape (fel), demonstrated that the surface of chain b of ace (s-rbd's preferred interaction region) loses its interaction energy with the s-rbd protein from . kj/mol to . kj/mol ( %) for the β-diol system and to . kj/mol ( %) for the s-equol system ( figure d ). in addition, binding of either estrogen molecules to the surrounding hydrophobic pocket of ace at the residue t promotes a decrease in energy by ~ % which may have a negative impact on the attachment of the s-rbd protein to the receptor (suppl. figure ) . we also observed estrogen-glycan interactions particularly at the glycan-protein interactions between the ace (n ) and the s-rbd (n ) ( figure. a) . indeed, glycans are highly polar structures due to their high content of hydroxyl groups which make them suitable for attachment to the ace protein (mostly negatively charged) or the sars-cov- s-protein (polarly charged). the density functional theory (dft) calculation shows an important decrease of the glycan's molecular electrostatic potential (mep) due to the interactions with either estrogen molecules. 
therefore, estrogen-glycan interactions could decrease the adhesive effect of glycans that enhance s-rbd and ace receptor interactions (suppl. figure and figure. b). these structural analyses suggest that estrogens could act as putative ace ligands due to their ability to bind to highly energetic pockets at the top of the ace surface protein which may increase its conformational equilibria and potentially boost its internalization to the cytoplasm. to support our in-silico analyses, we treated huvecs with β-diol ( nm) or s-equol ( nm) overnight under normal physiologic conditions. immunofluorescent staining demonstrated that estrogen-treated cells have less ace membrane cellular localization ( figure c ). immunoblot analysis revealed that endogenous and dietary estrogens promote ace internalization and degradation through the endocytosis process as assessed by lc b and lamp protein activation in treated cells ( figure. d). to test the hypothesis that lower levels of estrogens are associated with increased levels of ace protein in the respiratory tract, we administrated intratracheally either β-diol ( . μm) or s-equol ( μm) to male mice. histologic analysis of lung sections demonstrated that both forms of estrogens decrease ace membrane expression levels in lung alveoli and also reduced the glycosylation of the ace receptor ( figure e & f) estrogens interfere with sars-cov- receptor binding and block entry into the cell. to determine if the decline of conformational gibbs free energy and gain in stabilization of ace due to estrogen binding could affect the ability of the s protein to interact with the ace receptor and thereby its entry into cells, we performed a refinement step of ace -free or ace -estrogen models with ns of mds followed by molecular docking with the sars-cov- s protein. from structures obtained, with top scores were chosen for further analysis (suppl. table ). 
the ace - β-diol model promoted the shift of s-rbds from the binding surface toward the lateral side of the ace protein, decreasing the number of contact residues. notably, s-rbds completely lose contact with key ace -glycosylated residues at n , n , n and n . we also observed that the contact between the s-rbd and the helix α and α of ace moved toward the n-terminal of the helix and thus affected the ability to bind the receptor. in the same manner, the ace -s-equol model demonstrated that s-equol blocks the contact between the s-rbds and the receptor's surface, notably promoting novel interactions at the c-terminal of the helix α , causing nonspecific contacts with the receptor at residues q -i and p -n . interestingly, we found that the β-diol interacts with residues on the surface of the receptor and notably forms a cluster on glycans at n (chain a) and n (chain b). on the other hand, the s-equol molecules tend to interact more widely, accounting for a total of interactions, including on residues on the chain a and residues on the chain b (for better visualization, only the top scored s-rbd structures are shown in figure a ). the nonspecific binding by the s-rbds could be explained by the susceptibility of ace to interact with polar molecules and especially to electrophilic attacks. the fact that the β-diol or s-equol contain few polar groups but are deficient in negative charge renders them more susceptible to attack the surface of hace , thereby blocking s-rbd from binding correctly. in addition, we computed the binding score of these models using the atomic energy contact function and, in agreement with our previous docking results, observed that both estrogen molecules significantly reduced the atomic energy contact between virus and receptor. remarkably, the β-diol reduced the atomic contact by % and the s-equol by %, indicating that the entry of the virus may be affected by the presence of either estrogen molecule (figure b ).
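the percentage reductions quoted above follow from a simple relative-change calculation between the estrogen-free and estrogen-bound models. the sketch below shows one way to compute it, assuming the reduction is taken on the magnitude of the score, so that a less negative contact or interaction energy counts as a loss. the function name `percent_reduction` is hypothetical, and no values from the study are reproduced here.

```python
def percent_reduction(reference, treated):
    """Percent loss in magnitude going from the reference to the treated model.

    reference: score or energy for the estrogen-free model.
    treated:   score or energy for the estrogen-bound model.
    Assumption: magnitudes are compared, so the sign convention of the
    docking score or energy does not matter.
    """
    ref_mag = abs(reference)
    if ref_mag == 0:
        raise ValueError("reference magnitude must be non-zero")
    return 100.0 * (ref_mag - abs(treated)) / ref_mag
```

a positive result means the virus-receptor contact weakened in the presence of the ligand; a negative result would mean it strengthened.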
to validate our in-silico prediction, we pre-treated huvecs with either estrogen molecule followed by incubation with μg of rs-rbd protein overnight. importantly, either low or high concentration of β-diol (low= nm & high= nm) or s-equol (low= nm & high= nm) blocked more than % of the rs-rbd protein entry into the cell as assessed by immunofluorescence and colocalization with lamp , a lysosome marker ( figure c ). in addition, immunoblot demonstrated a decrease of rs-rbd levels in the cytoplasm of huvecs for both estrogen-based treatments (figure d ). together, these results suggest a potential molecular mechanism by which estrogens may provide protection against severe infection in covid among women and individuals with phytoestrogen intake. next, we sought to test the ability of estrogens to block key interactions between ace and the sars-cov- s protein and thereby infection of the respiratory tract. male wild-type mice were treated with β-diol ( . μm) or s-equol ( μm) via intratracheal instillation for hrs before tissue collection. an elisa-based binding assay showed a significant decrease of ace affinity to sars-cov- s protein in lungs from mice treated with either estrogen molecule compared with the control group (figure a ). we then evaluated in vivo whether intratracheal estrogen treatment would reduce internalization of the s protein in male mice. we observed that intratracheal instillation of both estrogen molecules hrs before intratracheal instillation of rs-rbd ( μg, overnight treatment) increased signal for the rs-rbd on the surface of lung cells, which likely results from reduced binding to ace in estrogen-treated mice compared to the untreated group. in contrast, control (dmso-treated) lungs showed normal ace membrane localization and cytoplasmic rs-rbd signal, indicating the unperturbed intake of the rs-rbd protein.
the observed increase in extracellular rs-rbd in alveolar cells from lungs of estrogen-treated mice suggests estrogen-mediated reduction in internalization of the s-protein. indeed, we observed that pretreatment with estrogen resulted in rs-rbd protein accumulation on the surface of the alveolar cells (figure b ) rather than internalization into the cytoplasm, which would otherwise support viral replication and disease progression. our data show that estrogens may interfere with sars-cov- infection in the respiratory tract through direct interaction with the ace receptor in vivo. increased susceptibility and risk of adverse clinical outcomes among males affected by covid- have been reported in multiple epidemiological studies [ ] [ ] [ ] [ ] . androgens can effectively upregulate viral target proteins that may increase viral entry and pathogenicity in patients following exposure to the sars-cov virus, and sex-related hormones can modulate immune response. a detailed understanding of the molecular and cellular mechanisms modulated by estrogen that contribute to viral pathogenicity is therefore critical to the development of new therapies to combat the covid- pandemic. besides the epidemiologic evidence suggesting that females are protected from severe infection, a recent study has demonstrated that the female reproductive tract expresses very low levels of the ace receptor and almost undetectable tmprss , suggesting that the virus is unlikely to infect the female reproductive tract, where female sex hormones are produced , . in the current study, we utilized in silico, in vitro, and in vivo studies to characterize important glycosylation-mediated interactions between the sars-cov virus spike (s) protein and the human ace receptor that can be modulated by endogenous or dietary estrogens in a manner that may be protective against the sars-cov entry into human cells.
previous studies have highlighted the critical role of glycosylation in viral pathobiology, host immune system evasion, and infectivity in a range of human viral illnesses . in many of these viruses, the viral envelope and secreted proteins are extensively glycosylated which is necessary for structural integrity and functionality of these proteins. viral proteins may be glycosylated by the host cell as viruses are able to hijack cellular glycosylation processes. however little data exists on the impact of glycosylation of host proteins necessary for viral entry, such as ace , on viral infectivity. using a novel molecular simulation approach, we demonstrated that ace glycosylation augments binding of the viral s protein by supporting multiple types of interactions including glycan-glycan and glycan-protein interactions, thereby facilitating the stability and affinity of viral binding to the target host receptor. we extend these in silico observations by also demonstrating that entry of the s-rbd can be augmented in vitro by exposure of cultured huvecs to a hyperglycemic environment that increases ace glycosylation. these observations provide insights into the enhanced susceptibility of diabetic patients to severe infections and death in covid- [ ] [ ] [ ] . based on these findings that ace glycosylation enhances interaction with the viral s protein in silico, we explored whether the predominant endogenous form of estrogen, β-diol, may provide a protective effect as assessed using in silico modeling of viral s protein-ace interaction and in vitro and in vivo models of viral infectivity. in addition, we used an identical approach to understand the potential protective mechanisms of dietary phytoestrogens on sars-cov infectivity observed in populations with low cfrs where consumption of these foods is high. we found that both endogenous and dietary estrogens compete with the s-rbd protein to bind specific sites on hace that are used by the virus to bind the receptor. 
indeed, estrogens were found to bind at almost all sites including ace glycans causing a reduction of energy on the surface of the receptor rendering the receptor less susceptible to interact with other molecules via reduced cell surface expression including the viral s protein interactions. our findings that estrogens interfere with s protein and ace interactions in silico that is associated with reduced s protein uptake in an in vitro model of sars-cov- infectivity in cultured human endothelial cells are consistent with prior studies demonstrating that estrogens have antiviral properties against hiv, ebola and hepatitis viruses . additionally, recent evidence indicates that decreased levels of estrogens in post-menopausal women are an independent risk factor for disease severity in female covid- patients . the findings of the current study thus represent novel findings in our understanding of the molecular mechanisms underlying reduced susceptibility to sars-cov- among females or individuals with depressed estrogen levels and in countries where dietary estrogens are high. we then examined the ability of estrogen molecules to interfere with s protein uptake into pulmonary epithelial cells using an in vivo model of sars-cov infectivity. in agreement with our cellular experiments, lungs from mice treated with dietary or endogenous estrogens demonstrated a dramatic reduction in the uptake of s-rbd. in addition, we observed a remarkable reduction of ace binding possibly due to the low protein levels of ace in those lungs possibly in response to estrogen-mediated degradation. in conclusion we provide a molecular basis that helps elucidate the potential protective effect of estrogens in women infected by the sars-cov- virus which could inform the development of future therapeutic measures to protect against sars-cov- infection including the design of suitable blocking antibodies, estrogen-related treatments, and vaccine development. 
for immunofluorescence, huvec cells were cultured into -well lab-tek ii chamber slides (nunc) and were then treated with either β-diol at nm or s-equol at nm. cells were rinsed twice with ice-cold pbs, fixed with % paraformaldehyde in pbs (pfa, boston bioproducts) for min at rt, and were permeabilized with . % triton-x (sigma-aldrich) for min. the slides were blocked with % donkey-serum and . m glycine in pbs-tween ( . %) for h at rt. subsequently, the antibodies anti-ace ( : ), s-rbd-his-tag ( : ), anti-lamp ( : ) and anti-lc b ( : ) were added and slides were incubated overnight at °c. the slides were then washed times for min each with pbs-t and were incubated with secondary antibodies at : dilution for hr at room temperature. following immunostaining, slides were mounted with diamond mounting medium containing dapi (thermo fisher). slides were then visualized with the leica tcs sp confocal microscopy station and the pictures were digitized with the leica application suite x software. huvec cells were rinsed twice with ice-cold pbs and proteins were extracted with m-per for whole cell lysis (thermo fisher). these lysis buffers contained halt protease and phosphatase inhibitors and edta (thermo fisher). the protein concentration was determined by the colorimetric bicinchoninic acid assay (bca assay, thermo fisher). equal amounts of total protein from cell lysates were separated by sds-page ( μg or μg for ace , lamp , lc b and rs-rbd-his-tag, respectively). proteins from the gel were then electro-transferred onto . μm nitrocellulose and . μm pvdf membranes. the membranes were then blocked for h at room temperature with either % non-fat powdered milk dissolved in tbs-t or % bovine serum albumin in tbs-t, for the nitrocellulose and pvdf membranes, respectively. following blocking, membranes were incubated overnight at °c with the primary antibodies anti-ace ( : ), anti-lamp ( : ), anti-lc b ( : ) and anti-his-tag ( : ).
the odyssey infrared western system was used to detect target proteins. band intensity was quantified using imagej software. all experiments involving mice were approved by the partners subcommittee on research animal care. personnel from the laboratory carried out all experimental protocols under strict guidelines to ensure careful and consistent handling of the mice. mouse model of sars-cov- s protein entry. weeks old male c bl/ mice were purchased from the jackson laboratories, usa. to induce the recombinant s-rbd protein. briefly, mice were anesthetized with sevoflurane inhalation (abbott) and placed in dorsal recumbency. transtracheal insertion of a -g animal feeding needle was used to instill estrogen molecules, rs-rbd or vehicle (dmso), in a volume of µl. mice were sacrificed hrs after instillation of rs-rbd and lungs were removed for further analysis. histology. lungs were then fixed in formalin ( %) for hours before transfer to % ethanol for photography, prior to paraffin embedding and sectioning ( μm). slides were produced for tissue staining and quantitative analysis. saccharides treatment: hypoglycemic media consisted of hbss buffer or opti-mem media. normal media contained complete endothelial cell growth media. for hyperglycemic media, opti-mem was supplemented with d-glucose at mm, d-galactose at mm, d-ribose at μm, d-mannose at μm, or d-fructose at μm. huvecs at %- % confluence were supplemented with hypoglycemic, normal or hyperglycemic media hours before incubation with μg of recombinant s-rbd-his-tag overnight. estrogen treatment: huvecs at %- % confluence were supplemented with opti-mem hours before treatment with complete growing media containing β-diol at a concentration of nm or s-equol at a concentration of nm for hrs. fresh media containing rs-rbd ( μg) was supplied the next day. prior to cellular collection, cells were washed with sterile pbs, and protein extraction was performed as described above.
rs-rbd-ace binding assay. μg of total protein extracts from mouse lungs were cleaned up with iga/igg agarose beads for hr at c on a rotator, followed by resuspension in assay diluent at x. then μl of each lysate containing , , , , , or μg of total protein were placed into the corresponding well of a covid s-protein microplate (cat#: cov-sace , ray biotech, inc.) for overnight incubation at c on a rotator. the supernatant was then removed, and wells were washed x followed by incubation with x hrp-conjugated secondary antibody solution for hr at room temperature. then μl of tmb one-step substrate reagent was added to each well for min at room temperature. before reading, μl of stop solution was added and the microplate was read at nm. results are given as mean ± sd. student's t test ( -tailed) was applied to determine the statistical significance of the difference between control and treated groups (*p < . , **p < . and ***p < . ). for all experiments, at least experimental replicates were performed. violin plot graphs show mean ± sd. data were analyzed, and graphs were prepared with prism . (graphpad software). p values of less than . were considered statistically significant. the crystal structures used in this work were pdb id: vxx for the spike protein (sp, trimeric form) of sars-cov- and pdb id: m (of rbd/ace -b at complex) for the ace protein (dimeric form), both obtained from the rcsb protein data bank. the missing residues for sp located on the n- and c-terminal domains (m- to p and f to h , respectively) were not considered in the molecular simulations. therefore, each sp chain was made up of residues (a to s ). in our sp model, another disulfide bond was recognized between missing residues c and c and was considered in md simulations. for ace , residues on the n-terminal domain were excluded (m- to t ) and on the c-terminal domain only extramembrane residues were considered (i to g ). the ace structure contains two zinc ions in the peptidase domain, which were considered in this work.
the remaining missing residues of both proteins were added using the swissmodel server (s-t ). the ace and sp structures are considered glycoproteins and the glycan-linked residues have already been reported [ ] [ ] [ ] [ ] . the opls-aa based doglycans software was used for building all models (s-t ). for sp, there are n-glycosylation residues on its surface, but the n , n , n and n sites were excluded due to the residues considered in our sp model. o-glycosylation sites were not included either. in ace , all n-glycosylation sites were considered. the glycosylation process was carried out using the glycan glcnac man model, a glycoside sequence composed of n-acetyl glucosamines and mannoses (s-f b). this glycan type is the most common core sugar sequence on n-glycans , . estrogen solvated systems. the systems were constructed with the average structure of glycosylated ace , obtained from the last ns of the md trajectories. β-diol and s-equol structures were quantum optimized and their force fields were constructed using the ligpargen server [ ] [ ] [ ] . the previous simulation box of ace was enlarged by . nm in all directions and, with the protein centered in the box, was solvated twice using the gmx solvate module. the first solvation was performed homogeneously with β-diol and s-equol molecules ( . and . mm solutions, respectively). in the second solvation, explicit water molecules were added to fill the simulation box. during solvation, we made sure that the estrogen molecules were not close to the protein at the start of the md simulations. all quantum simulations were performed using density functional theory (dft) at the b lyp/ tzvp level , . the self-consistent reaction field (scrf) theory was used to describe solvent effects on the molecules in water solutions. the calculations were performed in the electronic structure program gaussian and results were visualized in gaussview v. .
the molecular structures of β-diol and s-equol were optimized and it was ensured that they were at a global minimum through frequency analysis. these optimized structures were used in the molecular dynamic simulations. in meps analysis, single point calculations were carried out and total electron densities was mapped on molecular electrostatic potential surface. to address the structural interactions, we performed molecular dynamics simulations using gromacs (v. . ) with the opls/aa force field parameters . the protein complexes were solvated with tip p explicit water model . in addition to na + counterions used to neutralize the total charge in the simulation box, we used a mm nacl concentration to mimic physiological conditions. all molecular systems were built in a triclinic simulation box considering periodic boundary conditions (pbc) in all directions (x, y and z). minimum distance of the surface atoms of proteins to the edge of periodic box was . nm for ace receptor and sars-cov- spike protein, and . nm for ace -estrogen solvated systems. the equations of motions were integrated with the leap-frog integrator using a time step of fs. temperature in the simulations was maintained at . k using modified berendsen thermostat (v-rescale algorithm) with = . ps coupling constant with protein and water-ions coupled separately. pressure was maintained at bar using the parrinello-rahman barostat with a compressibility of . x − bar - and a coupling constant of = . ps. all simulations were carried out with a short-range non-bonded cut-off of . nm and the particle mesh ewald (pme) method was used for computing long-range electrostatic interactions with a tolerance of x for contribution in real space. the verlet neighbor searching cut-off scheme was applied with a neighbor-list update frequency of steps ( fs). bonds involving hydrogen atoms are constrained using the linear constraint solver (lincs) algorithm . 
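as a rough illustration of the leap-frog scheme named above, the following sketch integrates a one-dimensional harmonic oscillator. it is not the gromacs implementation: the function name, the unit frequency, the step size and the number of steps are assumptions chosen only to keep the example self-contained.

```python
import math

def leapfrog_oscillator(x0, v0, omega, dt, n_steps):
    """Integrate x'' = -omega**2 * x with the leap-frog scheme.

    Positions live on whole time steps and velocities on half steps.
    """
    accel = -omega ** 2 * x0
    # Shift the starting velocity back by half a step:
    # v(-dt/2) ~ v(0) - a(0) * dt / 2.
    v_half = v0 - 0.5 * dt * accel
    x = x0
    for _ in range(n_steps):
        accel = -omega ** 2 * x   # force evaluated at the current position
        v_half += dt * accel      # v(t + dt/2) = v(t - dt/2) + a(t) * dt
        x += dt * v_half          # x(t + dt)   = x(t) + v(t + dt/2) * dt
    return x

# Hypothetical parameters: unit frequency, small step, roughly one period.
steps = 628
final_x = leapfrog_oscillator(x0=1.0, v0=0.0, omega=1.0, dt=0.01, n_steps=steps)
exact_x = math.cos(steps * 0.01)  # analytical solution for comparison
```

comparing `final_x` with `exact_x` gives a quick check that the staggered update stays close to the analytical trajectory for a small time step.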
simulations were first energy minimized using the steepest descent algorithm for a maximum of , steps. the equilibration was conducted in two steps: the first step was ns of dynamics in the nvt (isothermal-isochoric) ensemble, and the second step continued for another ns in the npt (isothermal-isobaric) ensemble. production runs were performed in the npt ensemble for ns for ace and sars-cov- spike protein and ns for ace -estrogen solvated systems. structure and data analysis. the structural interactions were obtained by carrying out a rigid-body docking analysis using the patchdock server in order to obtain the contact residues between the s-rbd and ace systems. the patchdock algorithm discards all unacceptable complexes, and results are sorted by geometric shape complementarity score. in addition, patchdock calculates the effective atomic contact energies according to zhang et al. the molecular docking was done taking the ace protein as the receptor molecule and the spike protein as the ligand molecule. the clustering rmsd value and complex type were selected according to the recommended parameters for protein-protein interactions ( . Å and default mode). for ace -estrogen systems, the docking was performed in the presence of the estrogen molecules bound to the ace structure, β-diol molecules and s-equol molecules, respectively. from the total results obtained in molecular docking, those structures that had steric impediments (intermembranal clashes) were discarded. the steric impediments were calculated based on the sars-cov- virus size , , whose diameter varies from about to nm and whose spike protein is about to nm in length ( and nm on average, respectively). molecular interactions were analyzed with ligplot software, and the required pdb files were constructed with in-house fortran programs and the statistical data results.
statistical results, rmsd, rmsf, rg, sasa, hydrogen bonds, free energies, matches, structures, movies, and b-factor maps were obtained using gromacs modules and their different tool options. the analysis of structural properties was performed using md trajectories from the last ns of each simulation; visualizations of the md simulations were created using visual molecular dynamics (vmd) software, and the graphs were plotted with the xmgrace software . each molecular conformation during an md simulation has an associated energy, and this can be observed using free energy landscape (fel) maps. these maps are usually represented by two variables related to atomic position and one energetic variable, typically the gibbs free energy. this free energy can be estimated from probability distributions of the system with respect to the chosen variables, which are then converted to a free-energy value by boltzmann inversion of multi-dimensional histograms. when represented in three dimensions, the fel maps show the energy range of all conformations obtained during a simulation. in this work, we considered two substructures of the ace protein for fel map analysis, the alpha - region (i to y ) and loop regions l - and l - (d to r ). the fel maps were plotted using the gmx sham module, with the rmsd and radius of gyration taken as atomic position variables with respect to the average structure, and figures were constructed using wolfram mathematica . . . estrogens bind to ace glycans to promote its internalization. (a) glycan-estrogen interactions stabilize ace structure through high-energy contacts involving ace glycan-residues at e , n , k and v (red color). (b) mep maps show the electrostatic impact of estrogen molecules on the surface of ace glycans. energy scale ranging from - . μa (red) to . μa (blue).
(c) immunofluorescence staining of human ace (magenta) and the lysosome marker lamp (green) shows loss of ace membrane levels in huvecs treated with β-diol or s-equol compared with control cells (dmso). (d) immunoblot of lysates isolated from huvecs showing decreased levels of total ace protein with estrogen treatment. reduced ace protein levels were associated with increased endocytosis activity as evidenced by immunoblot for lc b and lamp . (e) histologic analysis of mouse lungs after hrs of intratracheal instillation with β-diol or s-equol shows loss of ace expression (red) on the membrane of alveolar cells. estrogen-treated lungs showed greater ace -lamp colocalization (white arrows) indicating internalization of the receptor. (f) immunoblot showing decreased levels of total and glycosylated ace proteins in estrogen-treated lungs from male mice compared to control lungs. quantification of protein levels of three replicate experiments is shown. student's t-test, tails. bar graphs are presented as mean with error bars (±sd). surface interacting with top scored s-rbds (top -blue, -red, -orange, -purple and top yellow). s-rbds were scored based on shape complementarity principles. (b) heatmap of atomic contact energy between ace and s-rbds shows spontaneous energy structures from most favorable (green) to less favorable s-rbd structures (red). energy scale ranging from kcal/mol to - kcal/mol. (c) immunofluorescence analysis of s-rbd entry into huvecs pretreated with β-diol or s-equol followed by treatment with μg/ml of recombinant s-rbd (red) demonstrates that estrogen-treated cells had reduced entry of s-rbd into cells in conjunction with a reduction in ace internalization as shown by colocalization with lamp (green). (d) immunoblot of isolated proteins from cultured huvecs shows a % reduction of s-rbd entry into cells in estrogen-treated cells. quantification of protein levels of three replicate experiments is shown. student's t-test, tails.
bar graphs are presented as mean with error bars (±sd). (a) elisa-based binding assay using lung protein lysates shows reduced sars-cov- s protein affinity for the ace receptor after treatment with either β-diol or s-equol. (b) immunofluorescence analysis of wild-type mouse lung treated with β-diol ( . μm) or s-equol ( μm) compared with control lung (dmso) demonstrates rs-rbd protein accumulation on the surface of the alveolar cells rather than being internalized intracellularly where viral replication may occur. treatment with either estrogen also reduced ace protein expression (quantification in lower right panel). lamp (orange), ace (green) and rs-rbd (red). quantification levels of three replicate experiments is shown. student's t-test, tails. bar graphs are presented as mean with error bars (±sd).
key: cord- -a q vp m authors: chowdhury, surid mohammad; talukder, shafi ahmad; khan, akib mahmud; afrin, nadia; ali, md ackas; islam, rajib; parves, rimon; al mamun, abdulla; sufian, md. abu; hossain, md nayeem; hossain, mohammed akhter; halim, mohammad a. title: antiviral peptides as promising therapeutics against sars-cov- date: - - journal: j phys chem b doi: . /acs.jpcb. c sha: doc_id: cord_uid: a q vp m over peptides, which were known to inhibit sars-cov- , were computationally screened against the receptor-binding domain (rbd) of the spike protein of sars-cov- . based on the binding affinity and interaction, peptides were selected, which showed higher affinity compared to the α-helix of the human ace receptor.
molecular dynamics simulation demonstrated that two peptides, s p and s p , were the most promising candidates, which could potentially block the entry of sars-cov- . tyr and tyr residues present in the “finger-like” projections of the rbd were found to be critical for peptide interaction. hydrogen bonding and hydrophobic interactions played important roles in prompting peptide–protein binding and interaction. structure–activity relationship indicated that peptides containing aromatic (tyr and phe), nonpolar (pro, gly, leu, and ala), and polar (asn, gln, and cys) residues were the most significant contributors. these findings can facilitate the rational design of selective peptide inhibitors targeting the spike protein of sars-cov- . a new type of coronavirus was first detected in december at wuhan city, the capital of the hubei province of china. this virus is designated as severe acute respiratory syndrome-related coronavirus- (sars-cov- ). the pneumonia-like disease caused by the virus is globally known as covid- . apart from china, covid- has spread to countries and killed over , people in total as of today ( june, ) . with the case count and death toll rising each day, there is an urgent need for antiviral drugs or vaccines against sars-cov- . the sars-cov- is a positive-sense single-stranded rna virus. it is a member of the same family belonging to sars-cov and middle east respiratory syndrome (mers-cov). the sars-cov- virion has a diameter of − nm. like other coronaviruses, sars-cov- has four structural and many nonstructural proteins. the structural proteins are called spike (s), envelope (e), membrane (m), and nucleocapsid (n) proteins. s, e, and m proteins perform together to form the viral envelope. the spike protein has a crown-like (corona) appearance. the spike (s) protein allows the virus to be attached into the host surface by interacting with human angiotensin-converting enzyme- (hace ) receptors present in the upper and lower respiratory system. 
, hace receptors are expressed in many organs including the lung, small intestine, testis, and kidney. ace , which acts as an exopeptidase, catalyzes the conversion of angiotensin i to angiotensin i−ix and angiotensin ii to angiotensin i−vii. − cryo-electron microscopy analysis has indicated that unlike other coronaviruses, the s protein of sars-cov- has − times greater affinity to the hace receptors, resulting in greater transmissibility than others. , upon performing sequence alignment and homology modeling, it is evident that the s protein of sars-cov and sars-cov- share % sequence identity. , the s protein comprises s and s domains. the s domain is responsible for binding to ace receptors via its receptorbinding domain (rbd), whereas the s domain performs the fusion, enabling viral genome entry. electron microscope imaging revealed that the s glycoprotein forms a clove-shaped spike with three s heads and a s trimeric stalk. the rbd has greater variability. six amino acids (l , f , q , s , n , and y ) in the rbd are extensively responsible for the efficient binding. the s protein of sars-cov- is glycosylated containing predicted n-glycosylation in the sequence, having one site less than the sars-cov- at n . the rbd of sars-cov- shares % sequence identity with that of sars-cov at the protein level. the interaction between the s protein and the ace receptor is the critical route of entry for the virus. therefore, the s protein is a potential target for drug or vaccine development. small molecules or peptides can be designed as therapeutics that will disrupt the interaction between the s protein and the ace receptor; however, small molecules are not ideal for targeting large protein−protein interactions (ppis). peptides, on the other hand, can disrupt the ppis effectively as they possess a larger surface compared to small molecules and thus specifically bind to the interface-binding region. 
in this context, a team from the massachusetts institute of technology (mit) developed -mer peptide against the spike protein. a research group from the university of illinois at chicago designed four ace -based peptide inhibitors of sars-cov- . while this early stage of peptide inhibitor development shows great promise, only few ace- -based peptides were screened and proposed. in this work, we, therefore, computationally screened antiviral peptides that were known to work against sars-cov- , targeting the rbd of the s protein of sars-cov- . peptides that showed higher s protein-binding affinity compared to the α-helix (ah) of the ace peptidase were further analyzed with molecular dynamics (md) simulation and the structure− activity relationship (sar) in order to achieve a high-affinity binder for the s protein. ■ methods molecular docking. a total of peptides were selected from the antiviral peptide database avpdb, which were experimentally verified to be effective against the sars-cov- . all the peptides were modeled by the cabs-fold. the crystal structure m j of the rbd was retrieved from the rcsb protein data bank (pdb). peptides were docked to the rbd using patchdock, and initial peptide−rbd complexes obtained from patchdock were then refined by firedock. peptides were further docked using cluspro and haddock . , with an aim to reach a consensus score. peptides that exhibited better binding scores in all three docking modes were subsequently analyzed by md simulation. md simulations. a ns md simulation was conducted for apo-rbd, ah-rbd, s p -rbd, s p -rbd, s p -rbd, s p -rbd, s p -rbd, and s p -rbd complexes to evaluate the peptide−protein conformational dynamics and interaction. md simulation was performed three times for each case. yasara dynamics software was used, and amber force field was considered for all calculations. , water molecules ( . g/cm density) were added, and the system was neutralized by adding nacl salt at . 
% concentration at k temperature. the particle-mesh ewald method was used for long-range electrostatic interaction calculation. a berendsen thermostat was used to control the simulation temperature. periodic boundary condition was employed for performing the simulation, and the cell size was Å larger than the protein−peptide complex in all cases. a simulated annealing method was used for the initial energy minimization process of each simulation system, using the steepest gradient approach ( cycles). a . fs time step was used for the overall simulations. finally, ns md simulation was performed for each system, and the snapshots were saved at every ps. bond distance, bond angle, dihedral angles, solvent-accessible surface area (sasa), coulombic and van der waals (vdw) interactions, root-mean-square-deviation (rmsd), root-mean-square-fluctuation (rmsf), and values for backbone, alpha carbon, and heavy atoms were analyzed from md simulation. md snapshots were collected to evaluate the interactions in peptide−protein complexes over ns. a total of md snapshots were selected for the binding free energy calculations by prodigy server, which measures the free energy based on intermolecular contacts and properties derived from the noninterface surface. different multivariate energy factors were analyzed by employing the principle component analysis (pca) method to understand the structural and energetic changes of proteins in the presence of the peptide during md simulation. the structural and energy information including bond distances, bond angles, dihedral angles, planarity, vdw energies, and electrostatic energies were considered. pca analysis can disclose the hidden structural and energy profile among different groups. , the last ns of the md trajectory data for both apo-rbd and peptide−protein complexes were considered for pca analysis. data were preprocessed using centering and scaling prior to this analysis. 
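the centering and scaling step mentioned above can be sketched as follows. this is a minimal illustration in python rather than the r code used in the study; the function name and the descriptor values are hypothetical, and scaling by the sample standard deviation is an assumption about what "scaling" means here.

```python
import statistics

def autoscale(rows):
    """Center each column to zero mean and scale it to unit sample standard deviation.

    Assumes every column varies; a constant column would divide by zero.
    """
    columns = list(zip(*rows))
    means = [statistics.mean(col) for col in columns]
    stdevs = [statistics.stdev(col) for col in columns]
    return [
        [(value - m) / s for value, m, s in zip(row, means, stdevs)]
        for row in rows
    ]

# Hypothetical descriptors for four snapshots: (bond distance, angle, energy).
snapshots = [
    (1.52, 109.1, -310.0),
    (1.50, 110.4, -295.5),
    (1.55, 108.7, -322.3),
    (1.49, 111.0, -301.2),
]
scaled = autoscale(snapshots)
```

after this step every variable contributes on the same scale, so quantities with large numerical ranges such as energies do not dominate the analysis simply because of their units.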
the multivariate factors were arranged in the x matrix and reduced into a product of two new matrices by using the following equation: $X = T_k P_k^{T} + E$. here, $T_k$ is the matrix of scores, which signifies the relation of samples with each other, $P_k$ is the matrix of loadings, carrying information about the relation of variables to each other, $k$ is the number of factors in the model, and $E$ is the unmodeled variance. all calculations were performed with in-house codes written in r. peptide sar analysis. peptide sar was performed considering the best peptides. relevant peptide properties including acidic (a), basic (b), aromatic (ar), polar (p), nonpolar (np) amino acids, net charge at ph , molecular weight, and approximate volume (table s ) were calculated using the protparam tool. initially, stepwise multiple linear regression (mlr) was performed considering these properties as variables to predict the calculated binding affinity of the test peptides with the rbd of the sars-cov- spike protein. subsequently, pca was performed taking the five most important peptide properties to cluster the test peptides in a biplot to explore the structural variance. peptide-binding affinity and interaction. the amino acid sequence, length, and in vitro inhibition efficiency (these data are collected from the peptide database avpdb ) against sars-cov- of peptides are summarized in table s . all peptides were docked to the rbd of the sars-cov- spike protein using patchdock. the binding pockets of the rbd were specified during the molecular docking. the best peptide−protein complexes obtained from patchdock were submitted to firedock for subsequent refinement. only the complexes that showed the higher binding affinity and the expected binding interaction were chosen as the best candidates ( table ). the ah of the ace peptidase domain (pd) is considered as a control peptide.
the binding affinity of all peptides is tabulated in out of peptides, peptides showed satisfactory binding interaction when docked using cluspro . (table s ) . moreover, strong binding affinity was also observed for s p and s p , which agreed with firedock results. although s p , s p , s p , and s p exhibited better affinity, they shifted away from the binding pocket. in haddock results, nine peptides displayed more favorable docking scores, namely, s p , s p , s p , s p , s p , s p , s p , s p , and s p . however, none of these peptides exceeded the control ah in terms of binding affinity (table s ) . various residues including glu , tyr , and tyr present in the ace binding site of the rbd were involved in noncovalent interaction with the antiviral peptides ( figure a) . notably, glu and tyr exhibited multiple interactions with several antiviral peptides, indicating that these residues might be crucial for attachment with peptides ( figure c) . other important residues that also interacted with the antiviral peptides are gln , leu , tyr , and tyr . hydrogen bonding played a crucial role in peptide−rbd interaction, contributing % of all interactions (figure b) . besides hydrogen bonding, hydrophobic interactions contributed to %, while electrostatic interactions were involved in only % of the total interactions. md simulation. md simulations of apo-rbd and complexes of ah, s p , s p , s p , s p , s p , and s p were performed. s p and s p showed significant changes in rmsd (figure a) . when md snapshots were analyzed, it became clear that such changes in rmsd were not due to the change in protein conformation, rather it could be attributed to the movement of the peptide. both peptides, s p and s p , were found to be deviated from their binding interface, although s p was more deviated than s p . the ah-rbd complex remained stable over the simulation period, as indicated by its rmsd profile and respective snapshots (figure a ). 
although s p , s p , s p , and s p complexes exhibited a slightly greater rmsd, their respective md snapshots (figure b ,c) revealed that these peptides occupied the binding interface and remained as stable complexes throughout the simulation period. the s p -rbd complex exhibited lower radius of gyration (rg) values compared to the ah-rbd complex, indicating that this peptide induced compactness in the rbd upon binding (figure b ). overall, a trend of reduction in the sasa was detected for all complexes (figure c) . nonetheless, the most prominent reduction in the sasa was observed in the case of s p -rbd complexes, thus confirming the induction of (figure d) . a high rmsf value illustrated the s p displacement from its binding site. however, high fluctuations were observed in regions spanning residue numbers − and − in the rbd for all complexes. this is not unexpected because these regions correspond to loops which lack any definite geometry. binding free energies of ah-rbd, s p -rbd, s p -rbd, s p -rbd, and s p -rbd complexes were calculated, for which ah showed an average binding energy of − . ± . kcal/mol, which was the highest compared to other peptides ( figure a ). the average binding affinities of s p and s p were found to be better than those of s p and s p . overall, md simulation suggests that s p and s p could be our potential candidates. a pca model including eight training sets (apo-rbd and seven peptide−rbd complexes) is generated to understand structural and energy changes in the peptide−protein complexes relative to apo-rbd during md simulation. here, the first two pcs explain . % of variance, where pc explains % and pc explains . % of variance. in the score plot of pc and pc (figure c ), apo-rbd shows a major rightward shift relative to all the peptide−protein complexes along pc .
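the share of variance attributed to a principal component, as quoted above, is its eigenvalue divided by the sum of all eigenvalues of the covariance matrix. the sketch below shows this for the first component only, using plain power iteration in python. it is an illustration under stated assumptions, not the r code used in the study: the function names and the two-variable data set are hypothetical, and power iteration is a stand-in for whatever decomposition the authors' code used.

```python
import math

def covariance_matrix(rows):
    """Sample covariance matrix of the columns of `rows`."""
    n, p = len(rows), len(rows[0])
    means = [sum(row[j] for row in rows) / n for j in range(p)]
    centered = [[row[j] - means[j] for j in range(p)] for row in rows]
    return [
        [sum(c[i] * c[j] for c in centered) / (n - 1) for j in range(p)]
        for i in range(p)
    ]

def leading_eigenpair(matrix, iterations=200):
    """Largest eigenvalue and its unit eigenvector by power iteration.

    Assumes the uniform start vector is not orthogonal to the leading eigenvector.
    """
    p = len(matrix)
    vec = [1.0 / math.sqrt(p)] * p
    for _ in range(iterations):
        product = [sum(matrix[i][j] * vec[j] for j in range(p)) for i in range(p)]
        norm = math.sqrt(sum(v * v for v in product))
        vec = [v / norm for v in product]
    product = [sum(matrix[i][j] * vec[j] for j in range(p)) for i in range(p)]
    eigenvalue = sum(v * w for v, w in zip(vec, product))  # Rayleigh quotient
    return eigenvalue, vec

def first_pc_variance_share(rows):
    """Fraction of total variance carried by the first principal component."""
    cov = covariance_matrix(rows)
    eigenvalue, loading = leading_eigenpair(cov)
    total_variance = sum(cov[i][i] for i in range(len(cov)))  # trace = sum of eigenvalues
    return eigenvalue / total_variance, loading

# Hypothetical, strongly correlated two-variable data set.
data = [(1.0, 2.0), (2.0, 4.1), (3.0, 5.9), (4.0, 8.2)]
share, loading = first_pc_variance_share(data)
```

for data this strongly correlated nearly all variance falls on the first component; the same ratio computed for the second eigenvalue gives the second component's share.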
this clustering pattern is usual because majority of the variables, that is, coulomb energy, angle, bond distance, and vdw energies (figure d ) have largely influenced the variance along pc . the rbd-s p complex is at the farthest interaction of s p , s p , and ah with the rbd obtained from md simulation. in the s p -rbd complex, tyr and tyr in the rbd exhibited remarkable interactions over the ns simulation period (figure a ), whereas asn , leu , tyr , gln , and phe also interacted frequently. most notable residues found in the s p peptide were pro , tyr , cys , and tyr (figure b) . interaction between s p and the rbd was elevated by both hydrogen bonding and hydrophobic interactions (figure c ). tyr in the rbd formed hydrophobic interaction and hydrogen bonds with gly and tyr in s p (figure d ). however, tyr showed hydrophobic interaction only with tyr and pro in the s p peptide. hydrophobic interaction ( %) and hydrogen bonding ( %) contributed to most of the interactions between s p and the rbd. in the s p -rbd complex, ala and arg present in the rbd were involved in multiple interactions during the simulation period ( figure s a ). other residues such as tyr , tyr , tyr , and phe were also detected. in the s p peptide, tyr , cys , his , cys , and phe residues were involved in such interactions ( figure s b ). figure s c illustrates that the interactions between s p and the rbd were mostly governed by hydrogen bonds covering around % of the total interactions. arg in the rbd showed major interaction with glu in s p through electrostatic and hydrogen bonding ( figure s d ). in the ah-rbd complex, arg and lys residues in the rbd showed significant interactions during the ns md simulation. the rbd residue arg interacted with the glu in ah through electrostatic and hydrogen bonding, whereas lys interacted with asp and his by electrostatic, hydrophobic, and hydrogen bonding ( figure s ). 
md simulation results suggested that the arg and lys in the rbd were essential for strong binding of ace with the rbd of the spike protein. besides these residues, asn , gln , tyr , tyr , gln , gly , and leu were involved in interaction with ah residues including glu , his , gln , lys , gln , tyr , and asp ( figure s ). overall, tyr and tyr residues in the rbd commonly participated in all stable peptide−protein complexes ( figure b ). these residues are present in the "finger-like" projections of the rbd (involved in ace receptor binding), which suggests that these projections and specifically these residues are crucial in peptide−rbd binding. structure−activity relationship. mlr analysis is executed with the most relevant peptide properties to sort out the significant predictors of the binding affinity of the test peptides (table s ) . aromatic, nonpolar, and polar residues are found to be the most significant predictors, which explains the observed dominance of hydrogen and hydrophobic interactions ( % in total) of the peptide−rbd complexes. in other words, tyrosine and polar residues stabilize the peptide−protein complexes by hydrogen bonding interactions, whereas nonpolar and other aromatic residues stabilize the complexes by hydrophobic interactions. this regression model also holds true in the peptide−rbd dynamic interactions in ns md simulation as high-frequency residues of the best-performing peptides (s p and s p ) in contact with the rbd are predominantly aromatic, nonpolar, or polar (table s ). in addition, the clustering behavior of the top peptides based on the most significant five peptide properties are analyzed to get an insight into their structural variance. in the generated biplot, the clustering pattern of s p , s p , s p , s p , and ah replicated the energy score plot of their complexes with the rbd, except that s p is close to ah ( figure ). 
besides, nonpolar, polar, and aromatic residues play a significant role in the clustering pattern of the test peptides, as nonpolar and polar residues are heavily loaded onto pc and the aromatic residue onto pc , which altogether explains . % structural variance.

designing and developing high-affinity antiviral peptides represents a promising therapeutic strategy for covid- treatment. encompassing the extended protein contact interface, high-affinity antiviral peptides can strongly inhibit the rbd of the spike protein, thus blocking sars-cov- from entering cells and subsequently replicating. although ace -based peptide inhibitors have been suggested, our results demonstrated that over peptides can show better binding affinity than the ah of ace . details from md simulation indicate that s p and s p could be the most promising antiviral peptides for sars-cov- . some critical residues of both the rbd and the peptides are also observed by analyzing the residue-specific contact maps of these peptides. sar reveals that by combining aromatic, polar, and nonpolar residues, one can further optimize these peptides to improve their binding affinity for the s protein. we anticipate that these peptides can serve as the next-generation antiviral therapeutics for the treatment of the covid- disease. in addition, these antiviral peptides can be conjugated to gold nanoparticles that are expected to act as potent nanoinhibitors enhancing the antiviral activity. our study provides valuable information for the rational design and development of peptide inhibitors against sars-cov- that can show high in vitro and in vivo efficacy.

the supporting information is available free of charge at https://pubs.acs.org/doi/ . /acs.jpcb. c . sequence, length, inhibition efficiency, binding affinity, stepwise mlr analysis, interaction residues, and distribution of noncovalent interactions (pdf)
the authors declare no competing financial interest. we are grateful to our donors who supported to build a computational platform (http://grc-bd.org/donate/). the authors like to acknowledge the world academy of science (twas) to purchase the high-performance computer for performing md simulation. the journal of physical chemistry b pubs.acs.org/jpcb article key: cord- -u amf oh authors: parsons, lisa m.; bouwman, kim m.; azurmendi, hugo; de vries, robert p.; cipollo, john f.; verheije, monique h. title: glycosylation of the viral attachment protein of avian coronavirus is essential for host cell and receptor binding date: - - journal: journal of biological chemistry doi: . /jbc.ra . sha: doc_id: cord_uid: u amf oh avian coronaviruses, including infectious bronchitis virus (ibv), are important respiratory pathogens of poultry. the heavily glycosylated ibv spike protein is responsible for binding to host tissues. glycosylation sites in the spike protein are highly conserved across viral genotypes, suggesting an important role for this modification in the virus life cycle. here, we analyzed the n-glycosylation of the receptor-binding domain (rbd) of ibv strain m spike protein and assessed the role of this modification in host receptor binding.
ten single asn–to–ala substitutions at the predicted n-glycosylation sites of the m –rbd were evaluated along with two control val–to–ala substitutions. cd analysis revealed that the secondary structure of all variants was retained compared with the unmodified m –rbd construct. six of the glycosylation variants lost binding to chicken trachea tissue and an elisa-presented α , -linked sialic acid oligosaccharide ligand. lc/ms(e) glycomics analysis revealed that glycosylation sites have specific proportions of n-glycan subtypes. overall, the glycosylation patterns of most variant rbds were highly similar to those of the unmodified m –rbd construct. in silico docking experiments with the recently published cryo-em structure of the m ibv spike protein and our glycosylation results revealed a potential ligand receptor site that is ringed by four glycosylation sites that dramatically impact ligand binding. combined with the results of previous array studies, the glycosylation and mutational analyses presented here suggest a unique glycosylation-dependent binding modality for the m spike protein. avian coronaviruses of poultry cause significant disease with subsequent economic losses in several commercially farmed bird species. avian infectious bronchitis virus (ibv) is a gam-macoronavirus that predominantly affects domestic fowl, primarily chickens (gallus gallus). the virus initially infects upper airway epithelium tissues, and depending on the ibv strain, disease outcomes range from mild respiratory disease to kidney failure and death ( ) . the viral envelope of ibv contains the highly-glycosylated spike (s) protein that is post-translationally cleaved into two domains, s and s . this s glycoprotein is the major adhesion molecule of the virus. it is a class i viral fusion protein, in which the variable s domain is involved in host cell receptor binding, and the more conserved s domain mediates the fusion of the virion with the cellular membrane ( , ) . 
the role of spike in host cell attachment and the induction of protective immunity has been reviewed ( ) . the spike protein monomer is a transmembrane glycoprotein with a molecular mass of kda before glycosylation ( ) . a cleavable n-terminal signal peptide ( ) directs the s protein toward the endoplasmic reticulum (er), where it is extensively modified with n-linked glycosylation ( , ) . after glycosylation in the er, the monomers oligomerize to form trimers ( - ) . the n-terminal amino acids of s were shown to encompass the receptor-binding domain (rbd) of ibv strain m ( ) , which interacts with sialyl-α , -substituted glycans present on the host's cell surface ( , ) . ten n-linked glycosylation sites are predicted to exist on the m -rbd ( ), of which most are highly conserved (fig. s ). it is interesting that of the sites are - % conserved. sites asn- and asn- were less conserved at and %. however, each had a nearby alternative site that was also highly conserved. alternative site asn- was conserved % of the time, and one or both asn- and asn- was present in % of the sequences. site asn- was conserved at %. in % of the sequences, either asn- or asn- was present but never together. therefore, all sites, including the alternatives, likely serve important functions. the n-glycosylation of viral glycoproteins is known to modulate the ability of viruses to infect host cells and to be recognized by the host's immune system ( ) . recently, zheng et al. ( ) studied extracted spike proteins and mutant viruses with asn-to-asp (asparagine to aspartate) and asn-to-gln (asparagine to glutamine) mutations at predicted glycosylation sites in the s protein of the beaudette ibv strain ( ) . their results indicate that glycosylation at some sites on the beaudette s -rbd was important for viral fusion and infectivity, which may include host recognition.
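as a rough illustration of how predicted n-glycosylation sites such as those discussed above can be located, the sketch below scans a protein sequence for the n-x-(s/t) sequon and builds a single asn-to-ala variant. it is not the prediction method used in this work; the exclusion of proline at the x position is a commonly applied rule assumed here, and the function names `find_sequons` and `asn_to_ala` and the toy sequence are hypothetical.

```python
import re

# N-X-(S/T) sequon with X being any residue except proline (assumed rule).
# The lookahead lets overlapping sequons (e.g. "NNTS") be reported separately.
SEQUON = re.compile(r"N(?=[^P][ST])")


def find_sequons(seq):
    """Return 1-based positions of Asn residues that start an N-X-(S/T) sequon."""
    return [m.start() + 1 for m in SEQUON.finditer(seq.upper())]


def asn_to_ala(seq, position):
    """Return `seq` with the Asn at the given 1-based position replaced by Ala."""
    if seq[position - 1].upper() != "N":
        raise ValueError(f"residue {position} is not Asn")
    return seq[: position - 1] + "A" + seq[position:]


# Made-up sequence for illustration only; it is not part of any spike protein.
toy = "MKNGSAVNPTLLNNTSQ"
sites = find_sequons(toy)          # candidate glycosylation sites
variant = asn_to_ala(toy, sites[0])  # single-site "deletion" variant
print(sites, variant)
```

a sequon match only marks a potential site; whether it is actually occupied has to be measured experimentally.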
however, the beaudette strain is a cell culture-adapted strain, is nonvirulent in chickens ( ) , and does not bind chicken tissues known to be important for infectivity ( ) , making it difficult to extrapolate these results to clinically relevant ibvs. to characterize and assess the role that glycosylation plays when interacting with host tissues through the rbd of pathogenic ibv strain m , we used a combination of molecular and analytical techniques, including histochemistry, elisa, circular dichroism (cd), ms, and docking analyses as listed in table . systematic deletion of each glycosylation site and histochemical analysis of each variant revealed which of the glycosylation sites affect the binding of ibv s protein to host epithelial tissue. site occupancy analysis by lc/ms e indicated that at least of predicted n-glycosylation sites in the m -rbd domain are glycosylated. analysis of site occupancy and signature n-glycan patterns at each site in combination with single glycosylation site deletions provided insight toward the biological relevance of each of those sites in binding to host tissue receptors. overall, our data confirm that n-glycosylation plays a critical and likely unique role in binding of the ibv spike domain to its host tissue receptors. to analyze the role of glycosylation of m -rbd in receptor binding, missense mutants (asn-to-ala) were generated on a site-by-site basis at each of the predicted n-glycosylation sites. recombinantly produced glycovariant rbd proteins migrated with the same electrophoretic mobility as unmodified m -rbd (fig. ) . the rbd proteins were evaluated by cd spectroscopy to assess similarity to the wt secondary structure. wt m -rbd, all glycosylation-site variants, and two nonglycosylation variants, v a and v a, were analyzed for secondary structure differences at °c. thermal melts were performed on each construct from to °c followed by full scans collected at °c and again at °c after the melt. 
overlays of all the cd spectra can be found in fig. s . visually, all spectra at all temperatures follow the same curve. the n a spectra were generated at higher protein concentrations but aligned well to cd spectra of all other variants when normalized to the percent of maximum signal. likewise, all the proteins had analogous broad melting curves suggesting the proteins were similarly stable. protein folding was reversible for all proteins, with comparable recovery rates (see cd- °c-aftermelt-normalized in fig. s ). dichroweb ( ) was used to calculate the percent of α-helix, β-strand, turn, and unordered portions of the protein in the initial °c spectra to estimate secondary structure differences between the proteins (fig. ) . the percent of α-helix varied with the extremes being unmodified rbd and n a. n a exhibited . ± . % α-helix character as compared with wt, which has . ± . %. interestingly, n a gave a very strong signal in the histochemical assay ( fig. a ) and had the most notably different released glycans' signature compared with the other constructs. we conclude that all proteins maintained a very similar structure and therefore suggest that single n-glycosylation sites are by themselves not indispensable for protein folding or stability. because we established that all variant m -rbd proteins are folded, we investigated their abilities to bind tissue receptors. recombinant proteins were incubated with chicken trachea tissue sections and examined by histochemical analysis. n a, n a, n a, n a, v a, and v a bound ciliated epithelial cells of the chicken trachea with similar staining intensity as the unmodified rbd with the most intense staining associated with the n a construct (fig. a ). in contrast, binding of constructs n a, n a, n a, n a, n a, and n a to trachea tissue was not detectable. removal of sialic acids by treatment of the trachea tissues with arthrobacter ureafaciens neuraminidase (auna) abrogated binding of all constructs as shown in fig.
s . these results demonstrate that glycosylation on the rbd affects binding to sialyl ligands on chicken trachea tissue. the interaction of the variants with neu ac(␣ - )gal (␤ - )glcnac, a previously established ligand for m ( ) , was assayed by elisa. n a, n a, n a, and n a variants were able to bind the ligand in a concentration-dependent manner (fig. b ) like unmodified rbd. binding affinities of n a, n a, n a, n a, n a, and n a were significantly reduced compared with unmodified rbd and comparable with that of a negative control protein, the s of turkey coronavirus, with specificity for nonsialylated dilacnac glycans ( ) . fig. c shows the elisa absorbance at the nmol of ligand concentration for each construct. no significant difference was observed for variants n a, n a, n a, and n a compared with unmodified rbd (shown in dark gray bars in fig. c ). all other variants (shown in light gray bars in fig. c ) demonstrated significantly lower affinity for the receptor, consistent with histochemistry and ligand titration plot results. six of the single glycosylation site variants lost the ability to bind ligand. to investigate whether global changes in glycosylation may have affected binding, we analyzed release glycans from each protein. matrix-assisted laser desorption/ionizationtime of flight (maldi-tof) mass spectrometry (ms) analysis of enzymatically released and permethylated glycans allows for semi-quantitative analysis of glycan compositions. the method is particularly useful for samples containing sialylated glycans because they are stabilized by permethylation. the percent abundances of glycans identified in each sample are shown in fig. . the majority of the asn-to-ala variants, as well as the v a and v a control variants, had similar maldi-tof-ms permethylation profiles (fig. ) . over glycan compositions were identified ranging from high-mannose glycans to large complex ones. 
nearly half of the glycans contained at least one and up to three sialic acid molecules in all samples. the most intense glycoforms clustered in five groups with increasing amounts of complexity as reflected by the number of n-acetyl glucosamines (hexnacs). these include high-mannose, complex, and hybrid forms as follows: i, hex - hexnac (high mannose); ii, neuac - hex - dhex - hexnac (complex and hybrid); iii, neuac - hex dhex hexnac (complex); iv, neuac - hex dhex hexnac (complex); and v, neuac hex dhex hexnac (complex). high-mannose glycans were less abundant in unmodified m than in variant rbds. the n a, n a, and n a variants contained diminished amounts of the group v high-mass complex glycans. the n a variant was the most atypical with less defined clustering in the common clustering regions of the spectrum and higher abundances in spectral regions where compositions had less hex and more hexnac overall. for instance, cluster iv was shifted from glycans with hexoses (neuac - hex dhex hexnac ) to glycoforms with - hexoses (neuac - hex - dhex hexnac ). more abundance was observed in regions containing hexnac residues (neuac - hex - dhex hexnac ). to better understand the difference between n a and the other constructs, we calculated the monosaccharide percent mass and average mass for each construct. the average mass percent for glycans across all released glycan pools was hex ( . %), hexnac ( . %), dhex ( . %), and neuac ( . %). the n a construct had the lowest amount of hex ( . %) and the highest amounts of hexnac ( . %) and neuac ( . %). the former two were s.d. or greater from the mean (see table s ). this indicates that the n a construct likely had shorter, more branched, and more highly-charged glycans on average than the other constructs. two other variants had values more than s.d. from the mean. n a (normal binding) was most abundant in hex ( . %) and least abundant in hexnac ( . %) and dhex ( . %), probably due to its higher high-mannose content. 
n a (normal binding) had the lowest amount of neuac ( . %). this is perhaps a reflection of the missing sugars in this variant because site asn- in other variants was populated with many sialylated glycoforms based on site-specific analysis (table s ).

figure . tissue-binding assay and elisas. histochemical assays of recombinant unmodified m -rbd and single asn-to-ala and val-to-ala glycosylation variants to trachea tissue (a) and elisa-presented neu acα - galβ - glcnac (b and c). b, concentration dependence of binding. c, absorbance for each protein at the -nmol concentration. two-way anova showed significantly less binding by variant n a, n a, n a, n a, n a, and n a rbd proteins compared with unmodified rbd (compare light gray bars (variant) to unmodified (black bar)). no significant (n.s.) difference was observed for variants with dark gray bars. data points are averaged from three separate assays. ****, p < . .

to assess the differences in glycosylation on a site-to-site basis, glycopeptide lc/ms analysis was carried out on unmodified m and two single glycosylation site variants, n a and n a, that represented a nonbinder and a binder of trachea tissue, respectively. m -rbd had predicted glycosylation sites, whereas the variant rbds had nine each. n a was also of specific interest due to the unique glycosylation pattern observed in its free glycan profile. as cleavage with trypsin alone resulted in glycopeptides with more than one glycosylation site, we also analyzed glycopeptides after an additional treatment with chymotrypsin, which resulted in one glycosite per peptide, the identification of more glycopeptides, and decreased ambiguity concerning glycosylation site assignment. although a protein may contain the sequence (nx(s/t)), where n-glycosylation is known to occur, it may not actually be glycosylated, or it may be glycosylated only part of the time. potential glycosylation sites, their predicted glycosylation state, and their measured site occupancy are shown in table . of the glycosites, all but asn- were predicted to be glycosylated (occupied) based on netnglyc analysis (http://www.cbs.dtu.dk/services/netnglyc- . /). percent occupancy was analyzed by lc/ms; however, a poor signal was obtained for the asn- site in m and n a, and therefore, occupancies were not calculated. all other sites were estimated to be occupied at % or greater in m and n a. the n a variant exhibited site occupancy at all expected sites, including asn- , although signal intensity at that site was low. two sites had much lower occupancy in n a as compared with the other samples. site asn- dropped to % occupancy and site asn- to % occupancy compared with nearly complete occupancy in the n a and m proteins. overall site occupancy was high for all sites. the difficulty in detecting some of the peptides, particularly asn- , may be due to hydrophobicity. ionization is partially driven by hydrophobicity, and asn- only had % hydrophobic character after the two digestions, which may, in part, explain its low detectability. by comparison, glycopeptides containing asn- , asn- , and asn- were short and between and % hydrophobicity, whereas glycopeptides containing other sites had predicted hydrophobicity ranging from to % and tended to produce higher intensity spectra. glycoform relative abundances at each site are listed in table s . fig. shows the location of each glycosylation site on the rbd of m . overall compositions at each site were similar in charge and size across the three constructs. a representative glycan is shown at each site based on peak intensity. the n a construct had glycoforms like those identified by maldi-tof ms with more hexnac and fewer hex compared with m and n a. fewer overall glycan compositions were detected on glycopeptides by lc/ms compared with the free glycans observed by maldi-tof ms ( versus compositions).
this can be expected because the technology of instrumentation used and the physiochemical characteristics of permethylated glycans and glycopeptides differ significantly. the forms detected overlapped between the two analyses. during our investigation, the first structure of the m spike protein was solved using electron microscopy (em) ( ) . mapping the glycosylation sites onto the structure did not lead to a clear understanding of how the mutations affect binding. although em structural resolution is limited, and the precise coordinates for the attached glycans are not known, an attempt was made to dock a series of potentially sialylated ligands to a glycan-stripped structure of the rbd and a structure that was populated with glycans based on our data. the glycan chosen for each site on the rbd was based on the predominant glycans identified at each site by lc/ms (see fig. ). seventeen oligosaccharide ligands were chosen based on a previous glycan array study of m ( ) and elisa data (this work). both strong and weak binders were selected (fig. ) . each ligand was docked times against both the sugarstripped and in silico glycosylated m -rbd coordinates. there was no statistically significant difference between the docked binding energies of ligands that did and did not bind on the array. all oligosaccharide ligands, except for , , , , , and , docked seven or more times to one or more of the four sites on the m sugar-stripped structure with no clear pattern differentiating between them (fig. ). in the sugar-stripped structure, all binding occurred at sites a and b. site a is under the galectin fold near site asn- , and site b encompasses asn- and asn- . all three glycosylation sites are required for binding to trachea tissue. the docking pattern changed dramatically when glycans were modeled onto the structure. the most dramatic change was seen at site d where eight ligands bound seven or more times, whereas interactions at all other sites decreased. 
there were no binders at site a, only two at site c ( and ) and three at site b ( , , and ) . all of the ligand oligosaccharides that docked at site d were sialylated, consistent with ligands identified by array and elisa. no control ligand ( and uncharged; and kdn-charged) bound at site d. the interaction at site d involved both sugar-protein and sugar-sugar contacts, and in some docking runs, the interaction was completely sugar-sugar. site d is in the center of a circle of glycosylation sites that showed altered binding profiles when mutated; n a, n a, and n a lost the ability to bind, whereas n a gave a very strong signal in the histochemical assay. of note, no ligands docked in the site at the top of the galectin fold where many structural homologs of m are thought to bind sugars, such as the bovine coronavirus rbd ( ) . for comparison, we docked neu ac(␣ - )gal(␤ - )glcnac (␤-ome) against the crystal structure of the bovine rbd. twenty five of times the glycan docked in the proposed binding site at the top of the galectin fold in the negatively-charged area of the bovine rbd control near asn- (fig. b ). previously, we established that the ibv m s protein binds sialic acid-substituted glycoconjugate ligands in chicken trachea and lung tissue ( ) . intriguingly, the m rbd is highlyglycosylated with potential glycosylation sites, and glycosylation appears to be necessary for binding to host tissues because treating the protein with a neuraminidase diminishes binding ( ) . this study extends our investigation toward determining the role of glycosylation in the function of the rbd, which encompasses the n-terminal region of the native protein. each of the potential glycosylation sites was individually ablated, and each construct was examined for its ability to bind tissue and an elisa-presented ligand. 
in addition, the global glycosylation profile of every construct was surveyed, and glycosylation of three representative constructs was examined on a site-specific basis. six of the glycosylation sites in the rbd domain of ibv m were essential for binding to chicken trachea tissue and an elisa-presented sialylated oligosaccharide ligand. cd analysis demonstrated that both secondary structure and stability were similar across all the rbd constructs indicating the proper fold was likely retained for all. globally, percent abundances of sialylated glycans differed across mutants, but the differences were not associated with loss of binding. for example, and % of the glycans in binding mutants n a and n a, respectively, and and % of the glycans in the nonbinders n a and n a, respectively, were sialylated (summed from fig. ) . by comparison, % of the glycans in the unmodified rbd construct were sialylated. on a site-specific basis, some glycosylation sites had more sialylation than others (table s ). on average, each of glycosites asn- , asn- , asn- , and asn- were sialylated at least % of the time. sites asn- and asn- were in the less-ordered region of the protein away from the galectin fold where binding is associated in the docking study. site asn- is at the bottom of the galectin fold and is required for ligand binding. site asn- is at the top of the galectin fold and is also required for binding. although we cannot conclude that sialylation is required at asn- and asn- , it is clear that glycosylation at these sites serves a role in ligand binding.

figure . site-specific glycosylation of m , n a, and n a. the s -n-terminal receptor binding domain residues - from pdb entry cv is represented as gray ribbons. the asparagines of glycosylation sites that could still bind trachea tissue after mutation to alanine are in cyan, and those that could not are in dark red. glcnac residues from the structure are dark blue balls and sticks. the most predominant glycan for each site across all three constructs is shown to the right. glycoforms shown on the right are based on our data, and inferred structural detail is based on accepted knowledge of the cell type used in protein production. monosaccharides are represented as follows: mannose (green circles); galactose (yellow circles); glcnac (blue squares); fucose (red triangles); and sialic acid (purple diamonds). numbering of the sites is based on the mature sequence. the figure was made with ccp mg ( ) and gimp.

( ) and referenced in the figure as array score . white columns were against structure without sugars, and gray columns were lc/ms-identified where the sugars were modeled. bottom, rbd-binding domain of m from pdb structure cv . glycosylation sites are shown as cyan balls. sites where two or more oligosaccharides docked seven or more times are indicated as colored space-filled amino acids. colors and labels match the table above. b is a turned ° toward the user. structure representations were made in ccp -mg ( ) . sugar symbols were rendered with drawglycan-snfg (www.virtualglycome.org/drawglycan/) ( ) .

the publication of the cryo-em structure of m ( ), the first structure of a spike protein from a gammacoronavirus, made it possible to visualize the distribution of the glycosylation sites in the tertiary structure of the protein. the study verified the site occupancy we observed on m -rbd because of of the glycosylation sites in the em structure were occupied. site asn- , not occupied in the em structure, is on a β-strand in the em structure, and it forms close contacts with the s c-terminal domain in the native protein. the c-terminal domain was not part of our construct. therefore, asn- in the recombinant constructs was likely in an environment much different from that found in the full-length protein. many human galectins, and also the bovine β-coronavirus spike protein ( ) , bind sugars at what is the top of the β-sandwich near site asn- in the rbd constructs (see fig. ).
the bovine rbd site asn- closely aligns with site asn- of m (see fig. ). in the bovine protein, this demarks the region of proposed ligand binding. loss of asn- in the m rbd abrogates binding to trachea tissue. although ablation of asn- diminishes ligand binding, our docking study gave no evidence that this is the sialyl ligand-binding site in m . evaluation of the charge distribution in the proposed binding sites indicates that the bovine site is negatively charged, whereas the negative charge in the same region in m is sparse (fig. ) . this difference in charge near asn- may explain the lack of ligand docking in this region (gray ␤-strands in fig. b ) during docking simulations. the precise ligand-binding region of proteins with a galectin fold varies. rotavirus protein vp , for example, binds sialic acid in a groove between the ␤-sheets of the sandwich ( ) . the clustering of five of six required n-glycosylation sites suggests the location of the ligand-binding site may be on the right of the galectin fold as shown in fig. . our docking experiments studying possible oligosaccharide ligands to m were not conclusive in terms of binding energies but did identify four potential saccharide-binding regions (fig. ) . docking also demonstrated that glycosylation affects binding in silico because one potential site (site a; see fig. ) lost favor, whereas another one, site d, dramatically gained favor when the protein was glycosylated. site d is in the center of three glycosylated asparagines required for binding (asn- , asn- , and asn- ), and one whose loss results in a very strong histochemical signal and has a protein-wide effect on glycosylation with increased sialylation (asn- ) . in addition, the site d region is negatively charged (see fig. a ) like the proposed sialyl ligandbinding site on the bovine protein (fig. b) ( ) . all the ligands that interacted with site d were sialylated and included the glycan that bound in our elisa studies. 
interestingly, carbohydrate-carbohydrate contacts were detected in the rbd-ligand interactions at site d. this is an intriguing result because carbohydrate-carbohydrate interactions, although not common, have been reported between nonfucosylated antibodies and their receptor, in cell-cell adhesion interactions, between tumor antigens, and between bacterial receptors and mucin ( ) ( ) ( ) ( ) ( ) . a literature search did not uncover any reported carbohydrate-carbohydrate interactions between virus and host. although our docking study must be evaluated in the context of the higher root mean square deviations typical of em structures, and the inexactness of modeled oligosaccharides, results suggest that a combination of carbohydratecarbohydrate and carbohydrate-protein interactions should be considered in the binding mechanism. in conclusion, we have shown that glycosylation of six sites on the m ibv rbd are necessary for the interaction of m with both trachea tissue and neu ac(␣ - )gal(␤ - ) glcnac ligand in elisa. based on occupancy data, at least nine sites were glycosylated in the recombinant m -rbd. deletion of individual glycosylation sites had little effect on secondary structure, but it did have some effect on overall glycosylation profiles of some variants, especially n a. some differences can be expected because one site, with specific glycans, is lost from each variant, thus mildly altering overall profiles. in silico docking suggests that glycosylation may guide ligand binding. especially intriguing is site d, where glycosylation is required for in silico docking at that site. the interaction of m ibv with sialyl ligand may prove to be a unique interaction involving both carbohydrates and protein. further investigation is warranted. the tissues used for this study were obtained from the tissue archive of the veterinary pathologic diagnostic center (department of pathobiology, faculty of veterinary medicine, utrecht university, the netherlands). 
this archive is composed of paraffin blocks with tissues maintained for diagnostic purposes; no permission from the committee on the ethics of animal experiments is required.

[figure legend fragment] a and pink boxes on b. y , e , w , and h in b are involved in binding to sialic acid. the large asterisk in a indicates possible binding site based on structural comparison between the two proteins. images were made with ccp -mg ( ) . bovine coordinates are from pdb code h .

the pcd vector containing ibv m -rbd in-frame with a c-terminal gcn trimerization motif and strep-tag has been described previously ( ) . site-directed mutagenesis using the q technology (new england biolabs) was performed to mutate the asparagine-encoding residues of the n-linked glycosylation sequence motif nx(s/t) into alanine or valine using the primers in table . sequences of the resulting rbds were confirmed by sanger sequencing (macrogen, the netherlands). hek t (atcc crl- ) cells were transfected with pcd plasmids using polyethyleneimine at a : ratio. the recombinant proteins were purified using strep-tactin-sepharose beads, as described previously ( ) , and their production was confirmed by western blotting using strep-tactin hrp antibody (iba, germany). recombinant m and its variants were prepared for cd spectroscopy by buffer exchange and concentration with four centrifugation cycles through -kda mwco amicon ultra . -ml centrifugal filters (ufc ) into mm sodium phosphate, ph . . final concentrations were measured with a thermo fisher scientific nanodrop spectrophotometer. cd spectra were collected on a jasco j- spectropolarimeter with a peltier thermostated fluorescence temperature controller module. samples were diluted to . mg/ml, and four scans were accumulated from to nm with a scanning speed of nm/min, digital integrated time -s, bandwidth nm, and standard sensitivity at °c. a thermal melt was done from to °c with a ramp rate of °c/min. measurements were taken every ° at , , , , , , , and nm.
a full cd scan was collected at °c. the temperature was then lowered to °c. after allowing the protein to refold for min at °c, a third cd scan was taken at °c to measure recovery. a savitzky-golay filter was used to smooth cd data at different temperatures for visual comparison (fig. s ) . secondary structure was calculated from the cd data collected at °c before the thermal melt with dichroweb ( ) using the cdsstr ( ), selcon ( ) , and continll ( ) algorithms with protein reference set . results from the three algorithms were averaged and plotted in fig. . histochemistry was performed as described previously ( ) . briefly, chicken trachea tissues from a -week-old broiler chicken were sectioned at m before incubation with rbd proteins at g/ml. desialylated tissues were prepared by pre-treatment with milliunits of neuraminidase (sialidase) from a. ureafaciens (auna, sigma, germany) in mm potassium acetate, . mg/ml triton x- , ph . , at °c overnight before protein application. chicken trachea tissues were from a -week-old broiler chicken (g. gallus) obtained from the tissue archive of the veterinary pathologic diagnostic center (department of pathobiology, faculty of veterinary medicine, utrecht university, the netherlands). sialic acids (neu acα - galβ - glcnac-paa, -sialc-paa, glyconz, russia) were coated ( g/well) in a -well maxisorp plate (nunc, sigma) at °c overnight, followed by blocking with % bsa (sigma) in pbs- , % tween. rbd proteins ( g/ml) were preincubated with strep-tactin-hrpo ( : ) for min on ice before being applied to the plates for h at room temperature. 3,3′,5,5′-tetramethylbenzidine was used as a peroxidase substrate to visualize binding, after which the reaction was terminated using n h2so4. absorbances (a nm ) were measured in a fluostar omega (bmg labtech) microplate reader, and mars data analysis software was used for analysis. protein samples of each recombinant protein were measured at each concentration in triplicate.
statistical analysis was performed by comparing each protein to the unmodified rbd using two-way anova with dunnett's multiple comparisons test where ␣ was set to . . the workflow is shown in fig. s . aliquots between and g of m , n a, and n a and g of the remaining proteins were digested with trypsin as per an and cipollo ( ) . approximately - -g aliquots of protease-digested proteins were processed for deglycosylated glycopeptide and permethylated glycan analyses. samples were resuspended in mm ammonium bicarbonate, ph . . glycans were released by digestion with units/l pngase f (glycerol-free from new england biolabs) for h at °c. the samples were adjusted to ph . with - l of mm hcl. to maximize glycan release, samples were further digested with . milliunits/l pngase a overnight at °c. free glycans and deglycosylated peptides were separated using c spe cartridges (thermo fisher scientific). intact glycopeptide analyses were performed using - g of hilic-enriched glycopeptides as per an and cipollo ( ) . following data collection on the trypsinized glycopeptides, the remainder of the m , n a, and n a sam- lc/ms e data were collected on trypsinized peptides deglycosylated with pngase f as described under n-glycan release. asparagines that are deglycosylated by pngase f are converted to aspartate with a mass gain of . da due to the replacement of -nh with -oh. the percent occupancy for each site is calculated by comparing the intensity of peptides with asn to those with asp. however, spontaneous deamidation of unmodified asn to asp can also occur. o-water, which results in mass shift of . da, was used to ensure calculated percent occupancy was not skewed due to spontaneous deamidation. this experiment allows for examination of both spontaneous and enzymatically catalyzed deamidation, and therefore, accurate estimations of percent occupancy of glycosites can be determined. 
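the occupancy comparison described above reduces to the ratio of the deglycosylated (asp-containing) peptide signal to the summed signal of both peptide forms, expressed as a percentage. the following is a minimal sketch of that arithmetic only; the function name, the site labels, and the intensity values are hypothetical, and the sketch does not include the o-water correction for spontaneous deamidation.

```python
def percent_occupancy(dg_intensity: float, ng_intensity: float) -> float:
    """Percent glycosite occupancy from peptide intensities.

    dg_intensity: signal of the deglycosylated (Asn -> Asp) peptide.
    ng_intensity: signal of the nonglycosylated (Asn) peptide.
    """
    total = dg_intensity + ng_intensity
    if total <= 0:
        raise ValueError("summed intensity must be positive")
    return 100.0 * dg_intensity / total


# Hypothetical intensities (arbitrary units), for illustration only;
# these are not values from the study.
example_sites = {
    "site_1": (9.0e5, 1.0e5),
    "site_2": (2.5e5, 7.5e5),
}

occupancy = {
    site: percent_occupancy(dg, ng) for site, (dg, ng) in example_sites.items()
}
```

in practice the intensities would first be averaged over replicate injections, and any asp signal attributable to spontaneous deamidation would be subtracted before this ratio is taken.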
percent occupancy was calculated by comparing the intensities of the deglycosylated (dg) and nonglycosylated (ng) peptides using the equation: $\mathrm{occupancy}\,(\%) = \dfrac{\mathrm{dg}}{\mathrm{dg} + \mathrm{ng}} \cdot 100$. pngase-released n-glycans were applied to c spe and eluted with . % formic acid, leaving the deglycosylated peptides bound to the c column. the glycan eluate fractions were combined, and butanol was added to a final concentration of %. the samples were then loaded onto -mg porous graphite columns prepared first by sequential washes of ml of % acetonitrile (acn), ml of % acn in water, ml of % acn in water, and ml of water. all solutions contained . % trifluoroacetic acid (tfa). the loaded columns were washed three times with ml of . % tfa in water, then eluted with % acn, . % tfa, water, followed by % acn, . % tfa, and water. the eluents were pooled and dried in glass vials by rotary evaporation. permethylation was done following the method of ciucanu and costello ( ) and ciucanu and kerek ( ) . maldi-tof analysis of permethylated n-glycans was performed on a bruker autoflex™ speed mass spectrometer in positive polarity reflectron mode. , -dihydroxybenzoic acid was used as a matrix, and malto-oligosaccharides were used as an external calibrant. data were processed using flexanalysis™ . each sample was spotted three times, and scans were collected in positive reflectron mode. peaks were picked and assigned, and intensities were averaged across each set of spots using in-house software. assignments were based on glycans known to be present in hek t cells. each peptide or glycopeptide sample was analyzed three times. a c column (beh nanocolumn m inner diameter × mm, . -m particle, waters corp.) was used for nanolc/ms e analyses. a nanoacquity uplc system (waters corp.) was used for automatic sample loading and flow control. load buffer was % acn, % water. peptides were eluted via a -min gradient from to % acn with a flow of . l/min. all chromatography solutions included . % formic acid.
the eluent flowed to an uncoated -m inner diameter picotip emitter (new objective inc., woburn, ma). the mass spectrometer was a synapt g hdms system (waters corp.). applied source voltage was v. data were collected in positive polarity mode using data-independent ms e acquisition, which consists of a starting -v scan followed by a scan ramping from to v in . s. to calibrate internally, every s fmol/l glu-fibrinopeptide b with pmol/l leucine enkephalin in % acetonitrile, . % formic acid, . % water was injected through the lockmass channel at a flow rate of nl/min. initial calibration of the mass spectrometer was performed in ms mode using glu-fibrinopeptide b and tuned for a minimum resolution of , full-width at half-maximum. nanolc/ms e data were processed using biopharmalynx . (waters corp.) and glymps (in-house software) ( , ) to identify specific glycans on each peptide. the search settings included trypsin digest with up to one missed cleavage, fixed cysteine carbamidomethylation, variable methionine oxidation, and variable n-glycan modifications based on a building block glycan library. assignment inclusion criteria were as follows: ) the presence of a core fragment (peptide, peptide ϩ hexnac, peptide ϩ hexnac , peptide ϩ dhex hexnac , and peptide ϩ hex hexnac ); ) the presence of three or more peptide fragments; ) the presence of three or more assigned glycopeptide fragments; ) assignment is made in at least of injections; and ) the existence of the glycan in glyconnect (https://glyconnect.expasy.org). residues - of the m spike em structure were extracted from the published structure (pdb code cv ) ( ) . this corresponds to the m -rbd used in this paper. glycamweb's glycoprotein-builder program ( ) was used to add the major oligosaccharide found at each glycosylation site onto the protein in silico. all glycosites in the m em structure were occupied except asn- ; however, asn- was occupied in our data and was populated accordingly. 
all glycosites were glycosylated in the new pdb file based on best evidence from our ms data. the coordinates of m -rbd without glycans, m -rbd with modeled glycans, and bovine rbd (pdb code h ) were used in docking experiments. a virtual library of oligosaccharides representing a variety of binding epitopes was created based on the cfg array version . (see fig. for a list). raw models of the oligosaccharide ligands were created with the amber tool tleap (www.ambermd.org) utilizing the glycam force field ( ), then energy minimized using yasara ( ) . dock screening of the library was performed with the yasara implementation of autodock vina ( ) with default parameters. a molecular dynamics simulation with explicit water (tp ) but with fixed coordinates for the backbone atoms was run on the glycosylated m rbd model to allow the amino acid side chains to accommodate the added glycans and to find low energy conformations. two models were extracted from the glycosylated md rbd run at and ns, which were used for dock screening with the virtual library. each oligosaccharide ligand was docked against the structures times. docking results shown in fig. are for the -ns model. results were similar in the -ns models. 
the long view: years of infectious bronchitis research the coronavirus spike protein is a class i virus fusion protein: structural and functional characterization of the fusion core complex coronaviridae the avian coronavirus spike protein cloning and sequencing of the gene encoding the spike protein of the coronavirus ibv coronavirus ibv: structural characterization of the spike protein coronavirus ibv glycopolypeptides: size of their polypeptide moieties and nature of their oligosaccharides assembly of coronavirus spike protein into trimers and its role in epitope expression quaternary structure of coronavirus spikes in complex with carcinoembryonic antigen-related cell adhesion molecule cellular receptors mapping of the receptor-binding domain and amino acids critical for attachment in the spike protein of avian coronavirus infectious bronchitis virus binding of avian coronavirus spike proteins to host factors reflects virus tropism and pathogenicity sialic acid is a receptor determinant for infection of cells by avian infectious bronchitis virus glycan-protein interactions in viral pathogenesis identification of n-linked glycosylation sites in the spike protein and their functional impact on the replication and infectivity of coronavirus infectious bronchitis virus in cell culture the pathogenesis of virulent and avirulent avian infectious bronchitis virus protein secondary structure analyses from circular dichroism spectroscopy: methods and reference databases novel receptor specificity of avian gammacoronaviruses that cause enteritis cryo-em structure of infectious bronchitis coronavirus spike protein reveals structural and functional evolution of coronavirus spike proteins crystal structure of bovine coronavirus spike protein lectin domain the rhesus rotavirus vp sialic acid binding domain has a galectin fold with a novel carbohydrate binding site unique carbohydrate-carbohydrate interactions are required for high affinity binding between fc␥riii and 
antibodies lacking core fucose model system for cell adhesion mediated by weak carbohydrate-carbohydrate interactions carbohydrate-carbohydrate interaction as a major force initiating cell-cell recognition tn and stn are members of a family of carbohydrate tumor antigens that possess carbohydrate-carbohydrate interactions are lewis b and h type on helicobacter pylori involved in binding of bacteria to muc mucin? adv variable selection method improves the prediction of protein secondary structure from circular dichroism spectra a self-consistent method for the analysis of protein secondary structure from circular dichroism an unbiased approach for analysis of protein glycosylation and application to influenza vaccine hemagglutinin elimination of oxidative degradation during the per-o-methylation of carbohydrates asimpleandrapidmethodforthepermethylation of carbohydrates glycosylation analysis of engineered h n influenza a virus hemagglutinins with sequentially added historically relevant glycosylation sites glycosylation characterization of an influenza h n hemagglutinin series with engineered glycosylation patterns: implications for avian coronavirus glycosylation - structure-function relationships glycam : a generalizable biomolecular force field. carbohydrates autodock vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization and multithreading presenting your structures: the ccp mg molecular-graphics software drawglycan-snfg: a robust tool to render glycans and glycopeptides with fragmentation information key: cord- -q yqnlyl authors: armijos-jaramillo, vinicio; yeager, justin; muslin, claire; perez-castillo, yunierkis title: sars-cov- , an evolutionary perspective of interaction with human ace reveals undiscovered amino acids necessary for complex stability date: - - journal: biorxiv doi: . / . . . 
sha: doc_id: cord_uid: q yqnlyl the emergence of sars-cov- has resulted in more than , infections and nearly , deaths globally so far. this novel virus is thought to have originated from an animal reservoir, and acquired the ability to infect human cells using the sars-cov cell receptor hace . in the wake of a global pandemic it is essential to improve our understanding of the evolutionary dynamics surrounding the origin and spread of a novel infectious disease. one way theory predicts selection pressures should shape viral evolution is to enhance binding with host cells. we first assessed evolutionary dynamics in select betacoronavirus spike protein genes to predict where these genomic regions are under directional or purifying selection between divergent viral lineages at various scales of relatedness. with this analysis, we determine a region inside the receptor-binding domain with putative sites under positive selection interspersed among highly conserved sites, which are implicated in structural stability of the viral spike protein and its union with human receptor hace . next, to gain further insights into factors associated with coronaviruses recognition of the human host receptor, we performed modeling studies of five different coronaviruses and their potential binding to hace . modeling results indicate that interfering with the salt bridges at hot spot could be an effective strategy for inhibiting binding, and hence for the prevention of coronavirus infections. we also propose that a glycine residue at the receptor binding domain of the spike glycoprotein can have a critical role in permitting bat variants of the coronaviruses to infect human cells. 
the recent emergence of the novel sars coronavirus (sars-cov- ) marked the third introduction of a highly pathogenic coronavirus into the human population in the twenty-first century, following the severe acute respiratory syndrome coronavirus (sars-cov) and the middle east respiratory syndrome coronavirus (mers-cov). the first, sars-cov emerged in november in the guangdong province of china and spread globally during - , infecting more than people and causing deaths (drosten et al., ; who, ) . mers-cov was the second emergence and was first detected in saudi arabia in and resulted in nearly human infections and deaths in countries (fehr et al., ; zaki et al., ) . in december , sars-cov- , a previously unknown coronavirus capable of infecting humans was discovered in the chinese city of wuhan, in the hubei province zhu et al., ) . sars-cov- is associated with an ongoing pandemic of atypical pneumonia, now termed coronavirus disease (covid- ) that has affected over , people with fatalities as of march , (who, . both sars-cov and mers-cov are thought to have originated in colonies of bats, eventually transmitted to humans, putatively facilitated by intermediate hosts such as palm civets and dromedary camels, respectively (cui et al., ) . the genome of sars-cov- shares about % nucleotide identity with that of sars-cov and is % identical to the bat coronavirus batcov ratg genome, reinforcing the probable bat origin of the virus . however, better assessing the evolutionary dynamics of sars-cov- is an active research priority worldwide. sars-cov, mers-cov and sars-cov- belong to the genus betacoronavirus within the subfamily coronavirinae of the family coronaviridae. members of this family are enveloped viruses containing a single positive-strand rna genome of - kb in length, the largest known rna virus genome. 
the coronavirus spherical virion consists of four structural proteins: the spike glycoprotein (s-protein), the envelope protein, the membrane protein and the nucleocapsid. the transmembrane trimeric s-protein plays a critical role in virus entry into host cells (gallagher & buchmeier, ; tortorici & veesler, ) . it comprises two functional subunits: the s subunit, where the receptor-binding domain (rbd) is found, is responsible for binding host cell surface receptors, and the s subunit mediates subsequent fusion between the viral and cellular membranes (kirchdoerfer et al., ; yuan et al., ) . both sars-cov and sars-cov- interact directly with angiotensin-converting enzyme (ace ) to enter host target cells (hoffmann et al., ; li et al., ; walls et al., ; yan et al., ) . in the case of sars-cov, ace binding was found to be a critical determinant of the virus host range, and key amino acid residues in the rbd were identified as essential for ace -mediated sars-cov infection and adaptation to humans (li et al., ; li et al., ) . understanding the dynamics that permit a virus to shift hosts is of considerable interest, and is an essential preliminary step towards the development of vaccines and the discovery of specific drug therapies. we employ a multidisciplinary approach to look for evidence of diversifying selection on the s-protein gene, and model the interactions between human ace (hace ) and the rbd of selected coronavirus strains, which ultimately afforded us novel insights into virus and host cell interactions. given the rapid pace of discovery, we aim to add clarity to the evolutionary dynamics of disease strains by more precisely understanding the dynamics at the s-protein and its interaction with hace .

the most similar genomes to sars-cov- mn were retrieved using blastp (altschul et al., ) vs the nr database of genbank (table ) . genomes were then aligned using mauve (darling et al., ) and the s-protein gene was trimmed.
the extracted genomic sections were aligned using a translation align option of geneious (kearse et al., ) with a mafft plugin (katoh & standley, ) . the phylogenetic reconstruction of s-protein genes was performed with phyml (guindon et al., ) , using a gtr+i+g model, using non-parametric bootstrap replicates. both, the alignment and the tree were used as input for paml codeml (yang, ) . the presence of sites under positive selection was tested by the comparison of m (it allows a proportion of positive, neutral and negative selection sites in the alignment) vs m (it allows a proportion of neutral and negative selection sites in the alignment) and m (ω follows a beta distribution plus a proportion of sites with ω> ) vs m (ω follows a beta distribution) models using the ete toolkit . (huerta-cepas et al., ) . the presence of tree nodes under positive selection was obtained with the free branch model and then tested by the comparison of branch free (different ω for each selected branches) vs m (negative selection for all sites and branchesnull model) and branch free vs branch neutral (ω= for selected branches) models. the presence of sites with positive selection under specific branches of the tree was tested with bsa (proportion of sites with positive selection in a specific branch of the tree) vs bsa (proportion of sites with neutral and purifying selection in a specific branch of the tree) models. likelihood ratio test (lrt) was performed (p≤ . ) to compare the hypothesis contrasted by each model. we used the set of programs available in hyphy (kosakovsky pond et al., ) , fast unconstrained bayesian approximation (fubar) to detect overall sites under positive selection, and fixed effects likelihood (fel) to detect specific sites under positive selection in specific branches. 
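the model comparisons above rest on the likelihood ratio test between nested codeml models. as a minimal sketch, the statistic is twice the difference in log-likelihood between the alternative and null models, compared against a chi-square distribution. the log-likelihood values below are hypothetical, and the choice of degrees of freedom (commonly 2 for site-model pairs and 1 for branch-site comparisons) is an assumption that the text does not state. only the closed-form chi-square tails for 1 and 2 degrees of freedom are implemented, so no external library is needed.

```python
import math


def lrt_statistic(lnl_null: float, lnl_alt: float) -> float:
    """Likelihood ratio statistic 2 * (lnL_alt - lnL_null) for nested models."""
    return 2.0 * (lnl_alt - lnl_null)


def chi2_sf(x: float, df: int) -> float:
    """Upper-tail probability of a chi-square variable (df = 1 or 2 only)."""
    if x <= 0:
        return 1.0
    if df == 1:
        # X = Z^2 with Z standard normal, so P(X > x) = erfc(sqrt(x / 2)).
        return math.erfc(math.sqrt(x / 2.0))
    if df == 2:
        # Chi-square with 2 df is exponential with mean 2.
        return math.exp(-x / 2.0)
    raise ValueError("this sketch only covers df = 1 or df = 2")


# Hypothetical log-likelihoods for a null and an alternative site model;
# these are not values from the study.
lnl_null, lnl_alt = -12000.0, -11990.0
stat = lrt_statistic(lnl_null, lnl_alt)
p_value = chi2_sf(stat, df=2)  # df = 2 is an assumption for this model pair
```

a p-value below the chosen threshold favors the model that allows a class of sites with ω above one; for branch-site tests the same statistic would be evaluated with one degree of freedom.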
we used the mixed effects model of evolution (meme) to detect episodic positive/diversifying selection and adaptive branch site rel (absrel) to detect branches in the tree under positive selection. the web server datamonkey (weaver et al., ) was used to perform the hyphy analyses. finally, treesaap . (woolley et al., ) was used to detect sites under adaptation (in terms of physicochemical properties). the same alignment and tree described above were used for this analysis. all these analyses were repeated using the s-protein genes of a shorter list of accessions more distantly related to sars-cov- (broad dataset: ay , ay , dq , fj , ky , mg , mg , mn , nc_ ) to test the reproducibility of the predicted branches and sites under positive selection. the crystal structure of the sars-cov s-protein rbd (genbank id nc_ ) in complex with hace was retrieved from the protein data bank (code ajf) (berman et al., ) . homology models were constructed using this structure as template for the rbds of sars-cov- (sars , genbank id mn ), the bat sars-like coronavirus isolate rm (rm , genbank id dq ) and the bat sars-like coronavirus isolate rs ( rs , genbank id ky ). one additional homology model for the g d mutant of the sars-cov- rbd (sars -mut) was constructed. homology models were built with modeller v. (webb & sali, ) using its ucsf chimera interface (pettersen et al., ) . five models were constructed for each target sequence and the one with the lowest dope score was selected as the final model. all non-amino-acid residues were removed from the sars-cov rbd-hace complex to obtain a clean complex. the homology models of the sars , rm , rs rbds and sars -mut were superimposed onto the sars-cov rbd to obtain their initial complexes with hace . these complexes were then subjected to molecular dynamics (md) simulations and estimation of their free energies of binding using amber (case et al., ) .
for the latter, ace was considered as the receptor and the rbds as ligands. the protocol described below was employed for all complexes and, unless otherwise noted, default software parameters were employed. systems preparation was performed with the tleap program of the amber suite. each complex was enclosed in a truncated octahedron box extending Å from any atom. next, the boxes were solvated with tip p water molecules and na+ ions were added to neutralize the excess charge. systems were minimized in two steps, the first of which consisted of steps of the steepest descent algorithm followed by cycles of conjugate gradient with protein atoms restrained using a force constant of kcal/mol·Å². the pme method with a cutoff of Å was used to treat long range electrostatic interactions. during the second minimization step the pme cutoff was set to Å and it proceeded for steps of the steepest descent algorithm followed by cycles of conjugate gradient with no restraints. the same pme cutoff of Å was used in all simulation steps from here on. both minimization stages were performed at constant volume. the minimized systems were heated from to k at constant volume, restraining all protein atoms with a force constant of kcal/mol·Å². the shake algorithm was used to constrain all bonds involving hydrogens and their interactions were omitted from this step on. heating took place for steps, with a time step of fs, and a langevin thermostat with a collision frequency of . ps⁻¹ was employed. all subsequent md steps utilized the same thermostat settings. afterward, the systems were equilibrated for ps at a constant temperature of k and a constant pressure of bar. pressure was controlled with isotropic position scaling with a relaxation time of ps. the equilibrated systems were used as input for ns-long production md simulations. the free energies of binding were computed under the mm-pbsa approach implemented in ambertools (case et al., ) .
a total of md snapshots were evenly selected, one every ps, from the last ns of the production run for mm-pbsa calculations. the ionic strength was set to mm and the solute dielectric factor was set to for all systems. in order to detect branches and sites under positive/negative selection, two datasets were explored. the first ('closer' dataset) harbors the most similar genomes to wuhan-hu- coronavirus (sars-cov- ) (mn ). for this dataset, several genomes were excluded from the analysis because they showed minimal variation to other sequences. we used a preliminary phylogeny to select a representative isolate of each clade (table ) in order to exclude highly similar sequences. the second dataset ('broad' dataset) includes some accessions of the first dataset plus isolates less related to sars-cov- , like sars-like coronavirus isolates from different countries (see methods). we compare the results of two dataset because the phylogenetic distance between orthologues in a given dataset has been demonstrated to alter the ability to detect selection in paml and meme (mcbee et al., ) . in both datasets, we observed evidence of purifying selection in the majority of nodes of the tree. specifically, in the 'closer' dataset we identified nodes with evidence of negative selection, and under positive selection when free ratios model of codeml model was applied. to confirm the four nodes under positive selection we use ltr test for contrasting hypothesis using branch free, branch neutral and m models of codeml. using these approximations, any node predicted by free ratios model with ω> was significantly different to the purifying (ω< ) or neutral (ω= ) models. an equivalent analysis was performed using absrel of hyphy, observing episodic diversifying selection in at least of nodes of the phylogenetic tree reconstructed with the 'closer' dataset ( figure ). 
interestingly, one of the divisions detected with diversifying selection was the branch that contains sars-cov- , pangolin coronavirus isolate mp and bat coronavirus ratg (called sars-cov- group) but not the specific branch that contains sars-cov- . under positive selection in sars-cov- using the closer dataset without pangolin coronavirus isolate mp . it is interesting despite the influence of the dataset in the results, because site f is directly involved in hace -rbd interaction , explaining at least in part strong selection at this site. moreover, the branch-site model bsa (positive selection) vs bsa (relaxation) of codeml were compared to find evidence of sites under positive selection in branch of sars-cov- using the 'closer' dataset, but bsa does not show significant differences with bsa (p> . ) indicating selection cannot be confidently implicated, but it was when other datasets were used (including f ). in summary, we do find evidence of sites under positive/episodic selection in branches of close related strains of wuhan-hu- isolate coronavirus. however, there is not strong evidence of specific sites under positive selection in sars-cov- using the tools mentioned in this work. this result does not disregard the presence of positive selection sites in sars-cov- , nonetheless, it shows the limitation of the methods to identify with precision specific sites under positive selection in a precise taxon of a phylogenetic tree. we further warn researchers need to be conservative with interpretations of studies utilizing these methodologies, given the equivocal results can be generated by datasets varying in genetic similarity. to complement our analyses looking for evidence of selection among lineages, we specifically analyzed for patterns of selection across sites in the s-protein genes, we used the sites models available in codeml and hyphy. model m of codeml detected . 
% of sites under positive selection (ω> ) and models m and m detected % of sites under purifying selection (ω< ). model m explains the significant data better (p= e- ) than m model, that takes in account only sites with neutral and purifying selection. to resolve these ambiguities in positive selection sites we calculate putative selection sites with codeml (using bayes empirical bayes from m and m models) and fubar with different datasets reflecting the addition of novel sequences to online repositories (broad, closer, closer without mn and mt and closer without mt ) and we obtain different results. it is becoming increasingly clear that predictions of positive selected sites are highly influenced simply by the diversity of the individual sequences included in the datasets. in any case, the majority of predicted sites converge in the region between to , a section of the rbd. additionally, we used treesaap to detect important biochemical amino acid properties changes over regions and/or sites along betacoronavirus s-protein. using a sliding window size of (increasing by ) we detect that the region between to (using sars-cov- s-protein as a reference) have drastic amino acid changes for alpha-helical tendencies. in addition, the section between to residues registers radical changes in amino acids implicated in the equilibrium constant (ionization of cooh). in the structural analysis we performed, the section between to forms a loop that is not present in certain s-proteins of coronavirus isolated in bats. this loop extends the interaction area between rbd of s-protein and human ace , in fact, the lack of these loop decreases the negative energy of interaction (increasing the binding) among these two molecules (see table ). these results obtained from independent analysis strongly highlight the importance of to section. 
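the sliding-window scan mentioned above can be illustrated generically: a per-site score is averaged over a fixed-width window that advances along the sequence, and windows with high averages flag candidate regions. this is only a sketch of the windowing idea, not the treesaap implementation; the function name, the per-site scores, and the window and step sizes are all hypothetical.

```python
from typing import List, Sequence, Tuple


def sliding_window_means(
    scores: Sequence[float], window: int, step: int = 1
) -> List[Tuple[int, int, float]]:
    """Mean score per window as (first_site, last_site, mean), 1-based sites."""
    if window <= 0 or step <= 0:
        raise ValueError("window and step must be positive")
    results = []
    for start in range(0, len(scores) - window + 1, step):
        chunk = scores[start:start + window]
        results.append((start + 1, start + window, sum(chunk) / window))
    return results


# Hypothetical per-site scores (e.g. magnitude of a physicochemical change);
# these are not values from the study.
site_scores = [0.0, 0.0, 1.0, 3.0, 3.0, 1.0, 0.0, 0.0]
windows = sliding_window_means(site_scores, window=3, step=1)

# First window with the highest mean score.
peak = max(windows, key=lambda w: w[2])
```

with real data, the window boundaries would be reported in the numbering of the reference s-protein so that flagged regions can be compared with the sites predicted by codeml and fubar.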
additionally, important hace -binding residues in the rbd from sars-cov- obtained from the crystallography and structure determination performed by shang et al. ( ) are also present in the section we highlight here. we propose that this region is the most probable to contain the sites under positive selection due to predictions by our codeml and fubar models. in that sense, we refer to this section as region under positive selection (rps). it is important to additionally clarify that even inside the rps we found at least aa highly conserved between coronaviruses, several of them are predicted as sites under purifying selection. this shows that it is necessary to maintain sites without change around polymorphic sites, probably to conserve the protein structure and at the same time to have the ability to colonize more than one host. interestingly, the rps of the pangolin coronavirus isolate mp differs only in one amino acid with the homologous region of sars-cov- , whereas in contrast the bat coronavirus ratg (the overall most similar isolate to sars-cov- sequenced at the moment) shows differences in the same region. several explanations could derive from this observation. the hypothesis of recombination inside the pangolin between a native coronavirus strain and a bat coronavirus (like ratg ) is congruent with our observation. this scenario was proposed and discussed as the origin of sars-cov- by (lam et al., ; wong et al., ; xiao et al., ) , however, other explanations are possible. if the sars-cov- , ratg and pangolin coronavirus mp isolate are closely related as shown in the tree of the figure , we are observing the ancestral sequence of rps in human and pangolin coronaviruses, and a mutated version in bat virus. elucidating the origin of sars-cov- is beyond the scope of this work, nevertheless sequencing of new coronavirus isolates in the near future could resolve this question. 
with a list of broader observations related to the role of selection across viral genomes we aimed to specifically understand how these regions could affect virus/host interactions. to understand in more depth the importance of the rps in the evolution of sars-cov- , we quantified the relative importance of this region in the interaction between the rbd and hace . to that end, md simulations were run for five complexes (listed in methods). in all cases the systems were stable, with root mean square deviations (rmsd) of their backbones between . Å and . Å relative to the initial complex structures during the last ns of the production run. we first investigated the network of contacts between the ligands (coronavirus rbds) and the receptor (hace ). overall, all complexes present a large number of contacts between the ligands and the receptor in at least % of the md snapshots selected for mm-pbsa calculations. common interactions with t , f , k , h , y , k , g , d and r of the receptor are observed in all systems. the full networks of interactions between the coronaviruses and the hace receptor are provided as supporting information. next we estimated the free energies of binding of the coronaviruses' rbds to hace ; the results of these evaluations are summarized in table . these calculations show that the sars , sars and rs viruses are predicted to bind favorably to the hace receptor, while the rm and sars -mut variants present unfavorable free energies of binding. the fact that the bat coronavirus rs , in addition to sars and sars , presents a favorable interaction with hace is in accordance with the previous observation that it is able to infect human cells expressing this protein (hu et al., ) . to gain more insight into the contribution of the receptor and the rbds to the binding process, we performed energy decomposition experiments. the contributions of each residue in the studied coronaviruses that interacts with the hace receptor are shown in table .
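the contact analysis described above keeps only ligand-receptor contacts that persist across a minimum fraction of md snapshots. a minimal sketch of that bookkeeping follows, assuming each snapshot has already been reduced to a set of (rbd residue, receptor residue) pairs; the residue labels and the threshold used in the example are hypothetical, and how a contact is defined (for instance a distance cutoff) is left to the upstream analysis.

```python
from collections import Counter


def contact_occupancy(snapshots):
    """Return, for each contact, the fraction of snapshots containing it.

    snapshots: list of iterables of (ligand_residue, receptor_residue) pairs,
    one iterable per MD snapshot.
    """
    if not snapshots:
        return {}
    counts = Counter()
    for contacts in snapshots:
        # set() guards against a pair being listed twice in one snapshot.
        counts.update(set(contacts))
    total = len(snapshots)
    return {pair: count / total for pair, count in counts.items()}


def persistent_contacts(snapshots, min_fraction):
    """Contacts present in at least `min_fraction` of the snapshots."""
    occupancy = contact_occupancy(snapshots)
    return sorted(pair for pair, frac in occupancy.items() if frac >= min_fraction)


# Hypothetical residue labels, for illustration only.
example = [
    {("lig_A", "rec_X"), ("lig_B", "rec_Y")},
    {("lig_A", "rec_X")},
    {("lig_A", "rec_X"), ("lig_B", "rec_Y")},
    {("lig_C", "rec_Z")},
]
print(persistent_contacts(example, min_fraction=0.5))
```

comparing the persistent contact lists of two complexes is then a plain set intersection, which is one way to arrive at a list of receptor residues shared by all systems.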
rows are presented in such a way that each of them contains the residues occupying the same position in the rbd structures of the viruses as in the sars rbd structure. from here on, residue numbering follows that of sars . in general, most rbd residues contribute negative (favorable) values to the free energies of binding to the human receptor. all studied rbds, except that of the rm coronavirus, have amino acids with large favorable contributions to the free energies of binding that directly interact with hace : k of sars and sars -mut, r in sars and r in rs . on the other hand, the g d mutation (d is present in bat coronavirus strains) makes an unfavorable contribution to the binding of the rbd to hace . this site was predicted to be under purifying selection by fubar analyses, and is located within the rps. strikingly, the g d mutation (sars numbering) has a large unfavorable influence on the free energy of binding in the two complexes that contain it. it is also worth noting that the three aspartic acid substitutions present in all systems contribute unfavorably to the stability of the systems. taking into account that the only difference between sars and sars -mut is the g d mutation, we postulate that this rbd position is critical for recognition of the human receptor by coronaviruses. to the best of our knowledge, no coronavirus having aspartic acid at this position is able to infect human cells. this result supports the prediction from fubar analyses indicating that the site g d is under purifying selection. combined, our results strongly suggest that the mutation of the d residue present in the coronaviruses from bats is critical for their rbds to recognize the hace receptor. additionally, this shows the importance of sites under purifying selection in the rps for rbd evolution. to better interpret the influence of the key interactions between the coronavirus rbds and the hace receptor, these interactions were analyzed.
to select the representative structure of each system, the md snapshots employed for mm-pbsa calculations were clustered. then, the representative structure of a system was selected as the centroid of the most populated cluster. the predicted rbd-hace complexes for sars , sars and sars -mut are depicted in figure . many studies have focused on coronavirus mutations that favor adaptation to human hosts. for example, it has been shown that specific substitutions at positions , , , and ( , , , and in sars) of the rbd of sars favor the interaction between the rbd of sars and hace (cui et al., ) . likewise, homology modeling studies found favorable interactions between the residues occupying these positions in the sars rbd and the human receptor. the cornerstone of these favorable interactions is the complementarity of the rbds with hot spots and . these are salt bridges between k and e and between d and k of ace , which are buried in a hydrophobic environment (see figure ). in the cases of sars and sars, q (n in sars) and n (t in sars) add support to the hot spots according to these previous studies. these observations should also hold for the rs strain; however, the n a change in the latter compared to sars (a in rs ) adds little support to hot spot . in this case, to continue permitting human infection, the large favorable contribution of r in rs to the free energy of binding could compensate for the weak support provided by a to hot spot . interestingly, k is the residue forming the largest network of contacts with the analyzed rbds among those belonging to both hot spots. our simulations also show that in sars and sars the rbd amino acids with the largest contribution to the free energy of binding, k and r (see table ) respectively, do not interact with any hot spot residue. instead, they interact with d of hace in the sars complex and with e of the human receptor in the sars complex.
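the selection of a representative structure described above (cluster the snapshots, then take the centroid of the most populated cluster) can be sketched as follows. the text does not state which clustering algorithm was used, so this sketch assumes a simple neighbour-counting scheme in which the cluster centre is the snapshot with the most neighbours within a cutoff; the pairwise distance matrix (for example backbone rmsd between snapshots) and the cutoff are hypothetical inputs.

```python
def most_populated_cluster(dist, cutoff):
    """Return (centre_index, member_indices) of the largest cluster.

    dist:   square matrix (list of lists) of pairwise distances between
            snapshots, e.g. backbone RMSD values.
    cutoff: two snapshots are neighbours when their distance is <= cutoff.
    The centre is the snapshot with the most neighbours; its neighbours
    (including itself) form the most populated cluster. Ties are resolved
    in favour of the lowest snapshot index.
    """
    best_centre, best_members = None, []
    n = len(dist)
    for i in range(n):
        members = [j for j in range(n) if dist[i][j] <= cutoff]
        if len(members) > len(best_members):
            best_centre, best_members = i, members
    return best_centre, best_members


# Hypothetical RMSD matrix for four snapshots, for illustration only.
rmsd = [
    [0.0, 1.0, 1.0, 5.0],
    [1.0, 0.0, 3.0, 5.0],
    [1.0, 3.0, 0.0, 5.0],
    [5.0, 5.0, 5.0, 0.0],
]
centre, members = most_populated_cluster(rmsd, cutoff=1.5)
print(centre, members)
```

using an actual snapshot as the centre, rather than an averaged set of coordinates, keeps the representative structure physically realistic.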
this could indicate that interactions additional to those previously identified with the hace hot spots could be critical for the stabilization of the rbd-human receptor complexes. finally, we analyzed the possible reasons for the predicted negative impact that the g d mutation has on the predicted free energies of binding of the rbd to hace . as depicted in figure , g directly interacts with k in hot spot and its mutation interferes with the d -k salt bridge. specifically, d of the rbd points toward d of hace , which yields a strong electrostatic repulsion between these amino acids. consequently, this portion of the rbd is pushed to a position further from hace than that observed in the wild type complex, resulting in the reduction of its network of contacts with k . as a result, the binding of the rbd to hace is considerably inhibited and unlikely to occur. a priority in ongoing research is to better understand coronavirus evolution, with specific interest in understanding the role of selection pressures in viral evolution and in clarifying how viral strains can infect novel hosts. our experiments suggest that there are sites under positive selection in the s-protein gene of sars-cov- and other betacoronaviruses, particularly in a region inside the rbd that we called the rps (region under positive selection). however, we have identified that, by and large, sites in this region (and overall, in the s-protein gene) are under purifying selection. particularly, for the site d g, the absence of aspartic acid seems indispensable for the interaction with hace . additionally, we performed md simulations and free energy of binding predictions for five different complexes of coronaviruses that do and do not infect human cells. our results suggest that as long as no disruptive interference occurs with the salt bridges at hot spots and , coronaviruses are able to bind to hace .
modeling results suggest that interference with the hot spot could be an effective strategy for inhibiting the recognition of the rbd of the sars-cov- spike protein by its human host receptor ace and hence for preventing infections. although additional simulations and experiments are required, all evidence suggests that the mutation of d in the bat variants of the coronaviruses permits infection of human cells. given the large contribution of sars k to the free energy of binding of the rbd to hace , we propose that blocking its interaction with the receptor d could be a promising strategy for future drug discovery efforts. gapped blast and psi-blast: a new generation of protein database search programs the -new coronavirus epidemic: evidence for virus evolution the protein data bank origin and evolution of pathogenic coronaviruses mauve: multiple alignment of conserved genomic sequence with rearrangements identification of a novel coronavirus in patients with severe acute respiratory syndrome middle east respiratory syndrome: emergence of a pathogenic human coronavirus coronavirus spike proteins in viral entry and pathogenesis new algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of phyml . . systematic biology sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor discovery of a rich gene pool of bat sars-related coronaviruses provides new insights into the origin of sars coronavirus clinical features of patients infected with novel coronavirus in ete: a python environment for tree exploration more effective purifying selection on rna viruses than in dna viruses mafft multiple sequence alignment software version : improvements in performance and usability geneious basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data pre-fusion structure of a human coronavirus spike protein hyphy .
-a customizable platform for evolutionary hypothesis testing using phylogenies identification of -ncov related coronaviruses in malayan pangolins in southern china animal origins of the severe acute respiratory syndrome coronavirus: insight from ace -s-protein interactions angiotensin-converting enzyme is a functional receptor for the sars coronavirus receptor and viral determinants of sars-coronavirus adaptation to human ace the effect of species representation on the detection of positive selection in primate gene data sets ucsf chimera-a visualization system for exploratory research and analysis structural basis for receptor recognition by the novel coronavirus from wuhan on the origin and continuing evolution of sars-cov- structural insights into coronavirus entry structure, function, and antigenicity of the sars-cov- spike glycoprotein receptor recognition by novel coronavirus from wuhan: an analysis based on decade-long structural studies of sars datamonkey . : a modern web application for characterizing selective and other evolutionary processes comparative protein structure modeling using modeller summary of probable sars cases with onset of illness from evidence of recombination in coronaviruses implicating pangolin origins of ncov- treesaap: selection on amino acid properties using phylogenetic trees isolation and characterization of -ncov-like coronavirus from malayan pangolins structural basis for the recognition of the sars-cov- by full-length human ace paml : phylogenetic analysis by maximum likelihood cryo-em structures of mers-cov and sars-cov spike glycoproteins reveal the dynamic receptor binding domains a pneumonia outbreak associated with a new coronavirus of probable bat origin a novel coronavirus from patients with pneumonia in china the authors declare that they have no conflicts of interest. key: cord- -x t h gu authors: madariaga, m. l. 
l.; guthmiller, j.; schrantz, s.; jansen, m.; christenson, c.; kumar, m.; prochaska, m.; wool, g.; durkin, a.; oh, w. h.; trockman, l.; vigneswaran, j.; keskey, r.; shaw, d. g.; dugan, h.; zheng, n.; cobb, m.; utset, h.; wang, j.; stovicek, o.; bethel, c.; matushek, s.; giurcanu, m.; beavis, k.; disabato, d.; meltzer, d.; ferguson, m.; kress, j. p.; shanmugarajah, k.; matthews, j.; fung, j.; wilson, p.; alverdy, j. c.; donington, j. title: clinical predictors of donor antibody titer and correlation with recipient antibody response in a covid- convalescent plasma clinical trial date: - - journal: nan doi: . / . . . sha: doc_id: cord_uid: x t h gu background: convalescent plasma therapy for covid- relies on the transfer of anti-viral antibody from donors to recipients via plasma transfusion. the relationship between clinical characteristics and antibody response to covid- is not well defined. we investigated predictors of convalescent antibody production and quantified recipient antibody response in a convalescent plasma therapy clinical trial. methods: multivariable analysis of clinical and serological parameters in confirmed covid- convalescent plasma donors days or more following symptom resolution was performed. mixed effects regression models with piecewise linear trends were used to characterize serial antibody responses in convalescent plasma recipients with severe covid- . results: mean symptom duration of plasma donors was . and . % ( / ) had been hospitalized. antibody titers ranged from to : , (anti-receptor binding domain (rbd)) and to : , (anti-spike). multivariable analysis demonstrated that higher anti-rbd and anti-spike titer were associated with increased age, hospitalization for covid- , fever, and absence of myalgia (all p< . ). fatigue was significantly associated with anti-rbd (p= . ) but not anti-spike antibody titer (p= . ). in pairwise comparison among abo blood types, ab donors had higher anti-rbd titer than o negative donors (p= . 
) and higher anti-spike titer than o negative (p= . ) or o positive (p= . ) donors. eight of the ten recipients were discharged, one remains on ecmo and one died on ecmo. no toxicity was associated with plasma transfusion. after excluding two ecmo patients and adjusting for donor antibody titer, recipient anti-rbd antibody titer increased on average % per day during the first three days post-transfusion (p= . ) and anti-spike antibody titer by . % (p= . ). conclusion: advanced age, fever, absence of myalgia, fatigue, blood type and hospitalization were associated with higher convalescent antibody titer to covid- . despite variability in donor titer, % of convalescent plasma recipients showed significant increase in antibody levels post-transfusion. a more complete understanding of the dose-response effect of plasma transfusion among covid- patients is needed to determine the clinical efficacy of this therapy. convalescent plasma therapy has historically been used as a treatment during epidemics ( ) . in this therapy, neutralizing anti-viral antibodies, as well as non-neutralizing antibodies and other immunomodulators, are transferred via plasma transfusion from those who have recovered from disease to those currently infected ( ) ( ) ( ) . for patients with severe covid- , convalescent plasma therapy has safely led to improvement in clinical and radiographic parameters ( ) ( ) ( ) ( ) ( ) ( ) . once adequate numbers of people convalesced and supply chain logistics were established, providing plasma therapy to a large number of patients has proven feasible ( ) . efficacy of convalescent plasma therapy relies on a robust antibody response in convalescent plasma donors. measurements of antibody response among patients with covid- demonstrate that the majority develop igm and igg within weeks of symptom onset, with specificity towards receptor binding domain (rbd) and spike protein viral epitopes correlating with virus neutralization ( ) ( ) ( ) . 
strikingly, a small proportion of recovered covid- patients show no detectable antibodies to these epitopes ( , ) . the relationship between host characteristics, disease course and variability in antibody response to covid- is poorly understood. the aim of this study was to establish a translational convalescent plasma program to investigate the relationship between clinical and serological parameters in convalescent plasma donors and define the antibody response of convalescent plasma recipients. this was a prospective open label clinical study to assess the feasibility, safety and immunological impact of delivering anti-sars-cov- convalescent plasma to hospitalized patients aged years or older with severe or life-threatening covid- disease within days from the onset of their illness. this study was conducted at university of chicago medicine (ucm) from april , to may , . the final date of follow-up was may , . we used existing hospital infrastructure and personnel to build the convalescent plasma program at a time when state-wide shelter-in-place orders were active, elective procedures were not being performed, and non-covid- -related research activities were halted. the donor enrollment team consisted of two surgeons, two surgical residents, and three physician assistants. a dedicated study coordinator was present at the ucm blood donation center to facilitate whole blood donation and collect research samples. recipients were selected during daily videoconference with infectious disease. one surgeon visited the hospital covid- unit daily to obtain consent and research samples. plasma donors were age or older, able to donate blood per standard ucm blood donation center guidelines, had a documented covid- polymerase chain reaction (pcr) positive test, and complete resolution of symptoms at least days prior to donation. recruitment occurred via social media, news outlets, word-of-mouth and announcements in university and community bulletins. 
the ucm infectious disease team provided an institutional list of patients with a positive pcr test for covid- , and their physicians were emailed to request permission to contact the patient for donor participation. interested plasma donors were directed to fill out a short screening survey online. potential donors meeting study criteria were screened for eligibility, reported symptoms and comorbidities, consented, and were scheduled for donation at the ucm blood donation center in a single telephone encounter. after meeting the ucm blood donation center eligibility, whole blood was collected and processed according to standard ucm blood donation center procedures. standard whole blood donation was used for plasma collection because it fit into preexisting ucm blood bank infrastructure and workflow therefore facilitating rapid deployment of a collection process, and allowed red blood cell and unused plasma units to be used in the regular blood bank inventory. during blood donation, a single research sample was collected at the same time as blood samples for standard immunohematology testing and infectious disease screening. leukocyte filters used in separation of constituent blood parts were also collected for research. eligibility for convalescent plasma recipients included: age or older, laboratoryconfirmed covid- , within days from the start of illness and severe or life-threatening covid- as defined by the united states food and drug administration (fda) ( ). severe covid- was defined as dyspnea, respiratory frequency ≥ /min, blood oxygen saturation ≤ %, partial pressure of arterial oxygen to fraction of inspired oxygen ratio < , and/or lung infiltrates > % within to hours. life-threatening covid- was defined as respiratory failure, septic shock, and/or multiple organ dysfunction or failure. patients who were pregnant, received pooled immunoglobulin in the past days or had a history of transfusion reaction were excluded from this study. 
recipients had routine pre-transfusion testing, in keeping with institution policies. on the day of enrollment, an emergency investigational new drug (eind) application was filed and approved for each recipient by the fda ( ). subsequently, one abo-compatible unit of convalescent plasma (~ ml) was transfused over hours. repeat administration of convalescent plasma occurred in one recipient (r ). blood samples and nasopharyngeal swabs were obtained at day , , , , post transfusion. the primary outcome was feasibility as defined by the collection of convalescent plasma and its administration into hospitalized patients. secondary outcomes included type and duration of respiratory support, cardiac arrest, transfer to intensive care unit (icu), length of stay, mortality, complications of plasma administration, process outcomes, and antibody titer of plasma donors and recipients. levels of anti-rbd and anti-spike antibodies were measured by enzyme-linked immunosorbent assay (elisa) in blood samples at time of donation and plasma recipients, as previously described ( ) . nasopharyngeal specimens were obtained by flocked swabs in plasma recipients and analyzed by rt-pcr to detect sars-cov- rna. study data were collected and managed using redcap electronic data capture tools hosted at ucm ( , ) . donor patient characteristics were compared using the chi-squared test for categorical variables and the two-sample t test for continuous variables. 
univariate regression analysis for antibody titer (anti-rbd and anti-spike) was conducted against age, sex, body mass index (bmi), previous pregnancy, previous blood donation, blood type, symptoms (fever, cough, sore throat, dyspnea, abdominal pain, ageusia, anosmia, fatigue, myalgia, headache), co-morbidities (respiratory, cardiovascular, renal, diabetes, autoimmune disease, cancer, liver disease), smoking history, travel in the past months to the united states, asia or europe, symptom duration, interval from symptom resolution to plasma donation, and hospitalization. pairwise comparison using t tests without adjusting for multiple comparisons was used to compare antibody titers among different abo blood groups. we conducted multivariable analyses to identify prediction models for anti-rbd and anti-spike antibody titers among convalescent plasma donors. the best subset variable selection method was chosen to identify the subset of predictors that maximizes the adjusted r-squared among all possible models. to compare daily change in recipient antibody response, we fit mixed effects regression models with a piecewise linear trend with a change point at days after intervention for log-transformed antibody titers. we considered recipients on extra-corporeal membrane oxygenation (ecmo) (r and r ) separately from recipients not on ecmo (r , , , , , , , ), because ecmo recipients had different baseline characteristics. data analysis was performed using r, version . . . mixed effects regression models were fit using the lmer function of the lme package ( ) . data analysis was conducted within the rstudio environment, and r markdown files with fully reproducible data analysis can be obtained from the authors upon request. this study was approved by the institutional review board (irb - ). all participants (plasma donors and plasma recipients) gave written informed consent prior to inclusion in the study. analysis was performed by mlm and mg.
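the piecewise linear trend with one change point mentioned above can be encoded with two time regressors: the day itself and a hinge term that is zero before the change point. the sketch below shows only that fixed-effects part, written in python for illustration; it is not the authors' r code, it omits the random effects of the mixed model, and the change point and coefficients in the example are hypothetical.

```python
def piecewise_terms(day, change_point):
    """Return the two time regressors of a linear trend with one change point.

    The first term carries the slope before the change point; the hinge
    term max(0, day - change_point) carries the change in slope afterwards.
    """
    return day, max(0.0, day - change_point)


def predicted_log_titer(day, change_point, intercept, slope_before, slope_change):
    """Fixed-effects prediction of the log-transformed titer on a given day.

    The slope is `slope_before` up to the change point and
    `slope_before + slope_change` after it.
    """
    t, hinge = piecewise_terms(day, change_point)
    return intercept + slope_before * t + slope_change * hinge


# Hypothetical coefficients and change point, for illustration only.
for d in range(0, 8):
    print(d, round(predicted_log_titer(d, 3, 1.0, 0.5, -0.7), 2))
```

with this coding the fitted line is continuous at the change point, and a test of whether the slope changes reduces to a test on the hinge coefficient.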
this clinical trial was registered at clinicaltrials.gov with identifier nct . all rights reserved. no reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medrxiv a license to display the preprint in perpetuity. the copyright holder for this preprint this version posted june , . . https://doi.org/ . / . . . doi: medrxiv preprint potential plasma donors were recruited to our study over days (table ). the average age was . years (range to ), the majority were female ( . %), and % had never donated blood before. potential donors with confirmed positive covid- pcr (n= , %) were more likely to be male, have ageusia and anosmia, and lack cough, sore throat and dyspnea compared to the symptomatic patients who had clinical signs of covid- but were never tested (table ) . among plasma donors (n= ) who donated as of publication, average symptom duration was . ± . days, ( . %) had respiratory comorbidities such as asthma, chronic obstructive pulmonary disease or obstructive sleep apnea, and ( . %) had been previously hospitalized for covid- ( table ). the average interval between symptom start and plasma donation was . ± . days. donor antibody titers measured on day of plasma donation ranged from to : , (anti-rbd) and from to : , . (anti-spike) ( table ). in univariable regression analysis, higher average anti-rbd and anti-spike antibody titers were associated with plasma donors who were older, male, had higher bmi, had fever and had been hospitalized (p< . , supplemental table ). in a pairwise comparison among abo groups without adjusting for multiple comparisons, ab donors had higher anti-rbd titer than o negative donors (p= . ) and higher anti-spike titer than o negative (p= . ) or o positive (p= . ) donors. 
to determine predictors of anti-rbd and anti-spike antibody titer, we performed best subset multivariable analysis including age, sex, blood type, history of previous blood donation, fever, cough, fatigue, myalgia, symptom duration, hospitalization and travel in the united states within the past months. significant predictors of anti-rbd antibody titer were age (p= . ), fever (p< . ), previous hospitalization (p< . ), lack of myalgia (p= . ), and fatigue (p= . ) (r-squared= . , adjusted r-squared= . , table ). significant predictors of anti-spike antibody titer were age (p= . ), fever (p= . ), previous hospitalization (p= . ), and absence of myalgia (p< . ) (r-squared= . , adjusted r-squared= . , table ). o positive blood type was associated with lower anti-rbd (p= . ) but did not meet significance threshold for antispike (p= . ). ten hospitalized patients with severe or life-threatening covid- received plasma on day ( figure , table ). plasma recipients were on average . years old (range to ) and % female. the average time from start of symptoms to plasma transfusion was days (range to ) and the average time from hospital admission to plasma transfusion was days (range to ). at the time of plasma transfusion, two patients were on ecmo, one patient was mechanically ventilated, two patients were on high-flow nasal cannula (hfnc), four patients were on nasal cannula and one patient was on room air. five patients had received other therapies for covid- before transfusion, including remdesivir, tocilizumab, anakinra and hydroxychloroquine. two plasma recipients were on chronic immunosuppression after transplantation. figure shows selected clinical and laboratory parameters of convalescent plasma recipients. only one recipient (r ) had fever prior to transfusion and this resolved by day post-transfusion. r and r remained on ecmo throughout the study period. in the remaining recipients, oxygen requirements improved to room air or nasal cannula. 
the sequential organ failure assessment (sofa) score ( ) was calculated for recipients on mechanical ventilation or ecmo and showed a general trend towards improvement; notably both ecmo patients were weaned off vasopressor and intra-aortic balloon pump support by days post-transfusion. levels of inflammatory marker c-reactive protein (crp) were variable. crp decreased in six recipients (r , r , r , r , r , r ). sars-cov np swab pcr remained positive in patients and turned negative in patients; patient (r ) had been positive for sars-cov days prior to plasma transfusion but was negative for sars-cov on day of transfusion ( figure ). at last follow-up, patient on ecmo remained in the hospital (r ), patient on ecmo was transitioned to comfort care and died on day after plasma transfusion (r ), patients were discharged to rehabilitation facilities and patients were discharged to their place of residence ( figure ). on day of transfusion, anti-rbd antibody titers were undetectable in recipients (r , r , r ) and anti-spike antibody titers were undetectable in recipients (r , r , r ) ( table and figure ). both patients on ecmo had very high antibody titer at day which decreased in the days after transfusion (figure ). the remaining plasma recipients showed increase in antibody titer within the first three days after transfusion (r , , , , , , ) with the exception of r who did not show any antibody titer until day (anti-spike) and day (anti-rbd) after transfusion ( figure ). we performed a mixed effects model for log-transformed reciprocal antibody titer adjusting for donor antibody titer level looking at the first days post-transfusion among the non-ecmo patients. after plasma transfusion, recipient anti-rbd antibody titer increased on average by % per day (p= . ) and recipient anti-spike antibody titer increased on average by . % per day (p= . ) (figure ). 
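because the model above is fit to log-transformed titers, a slope on the log scale translates into a percent change per day on the original scale. a minimal sketch of that conversion follows; the base of the logarithm is not stated in the text, so natural log is assumed by default and the base is exposed as a parameter, and the slope in the example is hypothetical.

```python
import math


def percent_change_per_day(slope, log_base=math.e):
    """Convert a per-day slope on the log-titer scale to percent change per day.

    If log_base(titer) increases by `slope` each day, the titer itself is
    multiplied by log_base ** slope each day.
    """
    return (log_base ** slope - 1.0) * 100.0


# Hypothetical slope, for illustration only.
print(round(percent_change_per_day(0.25), 1))
```

the same function handles decreasing titers: a negative slope yields a negative percent change per day.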
among the two ecmo recipients, recipient antibody response was not significantly changed until three days after plasma transfusion (decreasing by . % per day for anti-rbd titer and . % per day for anti-spike titer, p< . ) (figure ). we monitored the clinical status of the recipients before, during and immediately after transfusion. no recipients experienced toxicity associated with plasma transfusion. there was no clinical deterioration or worsening of disease status immediately related to plasma transfusion. convalescent plasma transfusion was safe in high-risk individuals in our study: immunosuppressed patients after stem cell and lung transplants and a patient with end-stage renal disease on dialysis. we developed a translational convalescent plasma treatment program within the existing hospital infrastructure during the covid- pandemic that provided a new therapeutic option for patients while assessing the antibody profile of both convalescent and hospitalized patient populations. our multivariable analysis demonstrated that clinical characteristics can predict the serological response of antibodies associated with virus neutralization ( ) . higher anti-rbd and anti-spike antibody titers were more likely to be found in convalescents who were older, hospitalized, had fever, and lacked myalgia. fatigue also significantly predicted higher anti-rbd but not anti-spike antibody titer. variability in convalescent populations and immune response to viral infection may explain why recovery is not always marked by seroconversion ( , ) . indeed, in our study four plasma donors (as well as four plasma recipients) had undetectable antibody titers. disparate plasma donor populations and geography may explain antibody variability in general, and in particular why symptom duration and elapsed time from symptom onset were associated with antibody response in new york city ( ) but not among our patients in chicago.
these data highlight that the impact of variability in antibody type and titer on virus-neutralizing activity and long-term immunity is unknown. interestingly, we found that antibody titers significantly differed across abo blood type groups ( ). further studies on the relationship between abo polymorphism and antibody titer may uncover genetic determinants of the host response to covid- . recipients received plasma with a range of antibody titer from : to : , (anti-rbd) and : to : , (anti-spike). despite this, % of recipients demonstrated a significant increase in anti-spike and anti-rbd antibody titer in the days post transfusion that was independent of donor antibody titer and were discharged after clinical improvement. interestingly, recipient antibody titer continued to increase up to days in four recipients (r , , , ); in contrast, the two most severely ill patients on ecmo, who had the highest antibody titers (up to : , anti-spike antibody in r ), showed a decrease in antibody titer after receiving plasma on day - of illness. importantly, we demonstrate the safety of transfusing convalescent plasma in immunosuppressed patients after lung transplantation and stem cell transplantation. none of the plasma recipients in this study deteriorated after convalescent plasma transfusion, consistent with the safety profile of other trials ( ) ( ) ( ) ( ) ( ) . a repeat plasma dose in recipient r was also well tolerated. pre-clinical models of sars-cov and clinical experience of other viral illnesses had raised concern about the potential for non-neutralizing antibody to cause antibody-dependent enhancement of disease, which was not seen here despite variable titers of donor antibodies ( ) ( ) ( ) . the variability in post-transfusion recipient antibody titer and clinical response seen here and in other studies ( , , , ) indicates that the therapeutic activity of convalescent plasma depends on the timing of treatment and the composition of convalescent plasma.
indeed, plasma contains more than , proteins, including albumin, immunoglobulins, complement, and coagulation factors as well as organic compounds such as cytokines ( ). convalescent plasma drawn shortly after natural infection ( , ( ) ( ) ( ) ( ) ) may be enriched for populations of protective antibodies not present in plasma derived from the long-recovered or rarely hospitalized donors studied here. furthermore, immunomodulatory and non-virus-neutralizing antibody effects, such as stimulation of the host humoral immune response and facilitation of viral uptake into cells via fc-receptors to increase viral antigen presentation to other effector cells, may contribute to disease recovery. taken together, while randomized controlled efficacy trials for convalescent plasma therapy in covid- are currently underway, establishing effective anti-covid- plasma-based therapy will require both an understanding of the precise dose and type of virus-neutralizing antibody and in-depth characterization of plasma donor-recipient pairs. the availability of a pre-existing hospital-based blood collection facility within our medical center significantly eased the procurement of convalescent plasma and will allow us to assess immunological characteristics of donor-recipient pairs in future studies. such hospital-based blood collection facilities have been declining in number across the united states for several decades ( ). cultivating a region-specific convalescent plasma inventory may facilitate the identification and isolation of antibodies with specific activity against local virus strains and be a useful model for future outbreaks. in addition, convalescent plasma derived from whole blood collection is a rapidly scalable technique that requires basic phlebotomy and blood separation rather than dedicated apheresis personnel and equipment. furthermore, a significant proportion ( . 
%) of our plasma donors had never donated blood before, indicating that a convalescent plasma donation program can serve as important community outreach during a time when patients avoid hospitals that are perceived as unsafe ( ). in summary, development of a convalescent plasma program is feasible, rapidly deployable and economical when existing resources of equipment, space and personnel are used. establishing the clinical predictors of high antibody titer and understanding the serological post-transfusion response may guide patient selection and shed light on antibody response to covid- . further work characterizing convalescent plasma donor and recipient pairs is needed to elucidate mechanisms of convalescent plasma therapy and demonstrate optimal viral epitope therapeutic targets. all rights reserved. no reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medrxiv a license to display the preprint in perpetuity. the copyright holder for this preprint this version posted june , . . , interval of symptoms to plasma donation, blood type were not significantly associated with anti-rbd or anti-spike antibody titer.
treatment of influenza pneumonia by the use of convalescent human serum: preliminary report
the convalescent sera option for containing covid- 
convalescent plasma as a potential therapy for covid- 
convalescent plasma in covid- : possible mechanisms of action
effectiveness of convalescent plasma therapy in severe covid- patients
treatment of critically ill patients with covid- with convalescent plasma
treatment with convalescent plasma for critically ill patients with sars-cov- infection
use of convalescent plasma therapy in two covid- patients with acute respiratory distress syndrome in korea
patients with convalescent plasma in
convalescent plasma treatment of severe covid- : a matched control study
early safety indicators of covid- convalescent plasma in , patients
humoral immune response and prolonged pcr positivity in a cohort of sars-cov patients in the new york city region
temporal profiles of viral load in posterior oropharyngeal saliva samples and serum antibody responses during infection by sars-cov- : an observational cohort study
antibody responses to sars-cov- in patients of novel coronavirus disease
seroconversion in humans: a detailed protocol for a serological assay, antigen production, and test setup
research electronic data capture (redcap)--a metadata-driven methodology and workflow process for providing translational research informatics support
the redcap consortium: building an international community of software platform partners
fitting linear mixed-effects models using lme
the sofa (sepsis-related organ failure assessment) score to describe organ dysfunction/failure. on behalf of the working group on sepsis-related problems of the european society of intensive care medicine
abo blood group and susceptibility to severe acute respiratory syndrome
relationship between the abo blood group and the covid- susceptibility
inhibition of the interaction between the sars-cov spike protein and its cellular receptor by anti-histo-blood group antibodies
anti-spike igg causes severe acute lung injury by skewing macrophage responses during acute sars-cov infection
current studies of convalescent plasma therapy for covid- may underestimate risk of antibody-dependent enhancement
treatment with convalescent plasma for influenza a (h n ) infection
use of convalescent plasma therapy in sars patients in hong kong
continued decline in blood collection and transfusion in the united states- 
delayed access or provision of care in italy resulting from fear of covid- 

we thank all the plasma donors for their willingness to help in a time of need and the blood bank staff for their excellent care. we thank samantha guerrero, alyssa anneken, bruce boehrnsen and rohit allada for helping us establish the infrastructure for this study. we thank the university of chicago and department of surgery, university of chicago for providing support for this study. this study was funded by the department of surgery, university of chicago and the national institute of allergy and infectious diseases (niaid) collaborative influenza

aka, above the knee amputation; chf, congestive heart failure; dm, diabetes mellitus; dvt, deep venous thrombosis; esrd, end-stage renal disease; htn, hypertension; nafld, non-alcoholic fatty liver disease; pe, pulmonary embolism; pvd, peripheral vascular disease.
key: cord- -rnq hfsj authors: liu, bingfeng; shi, yaling; zhang, wanying; li, rong; he, zhangping; yang, xiaofan; pan, yuejun; deng, xilong; tan, mingkai; zhao, lingzhai; zou, fan; zhang, yiwen; pan, ting; zhang, junsong; zhang, xu; xiao, fei; li, fang; deng, kai; zhang, hui title: recovered covid- patients with recurrent viral rna exhibit lower levels of anti-rbd antibodies date: - - journal: cell mol immunol doi: . /s - - - sha: doc_id: cord_uid: rnq hfsj
plasma samples were collected from these patients with covid- at the time of convalescence and assessed for antibodies against the following sars-cov- proteins: the spike glycoprotein (s); the receptor-binding domain (rbd); conserved heptad repeats (hr -hr ) in the s domain; and the nucleocapsid (n), membrane (m), and envelope (e) proteins. the concentrations of igg produced in response to these sars-cov- proteins varied among patients, with detection rates of . % ( / ), . % ( / ), . % ( / ), . % ( / ), . % ( / ), and . % ( / ) for the s, rbd, hr -hr , n, m, and e proteins, respectively (fig. a; fig. s ). the detection rates of igm to the s, rbd, hr -hr , and n proteins were . % ( / ), . % ( / ), . % ( / ), and % ( / ), respectively (fig. b; fig. s ). notably, significantly higher levels of sars-cov- -specific igg and igm developed against the s and n proteins (fig. a, b). to evaluate the effect of specific antibodies on rp status, we compared the levels of anti-sars-cov- igg to the s, rbd, hr -hr , n, and m proteins in these patients during their convalescent period (fig. c; fig. s ). the results showed that rp patients had significantly lower levels of anti-rbd igg than prn patients (p = . ) (fig. c). as all of these rp patients were in a moderate condition before recovery, the prn patients were further classified as moderate ( patients) or severe ( patients) according to their symptoms before recovery. 
the levels of anti-rbd igg in rp patients were still significantly lower than those of either prn-severe or prn-moderate patients (p = . and p = . , respectively; fig. d). in addition, the patients with severe symptoms within the prn group were more likely to have higher levels of anti-rbd igg (p = . ; fig. d), which is consistent with previous reports. in contrast, there were no significant differences either in igg to other viral proteins or in igm between prn and rp patients (fig. c, e; figs. s , s ), suggesting that the humoral response to rbd rather than to other regions of the s protein or the full-length s protein might have played an important role in preventing viral rebound during recovery. furthermore, we observed that the titers of igg to rbd among these recovered patients positively correlated with the spike-binding antibodies targeting the s, hr -hr , and n proteins (r = . , p < . ; r = . , p < . ; and r = . , p = . , respectively) but not with the m or e proteins (fig. s a). moreover, the level of igm to the rbd protein among these recovered patients also correlated with the s, hr -hr , and n proteins (r = . , p < . ; r = . , p < . ; and r = . , p < . , respectively) (fig. s b).
[figure caption] the y-axis represents optical density units at od nm, and the x-axis represents reciprocal plasma dilutions. c normalized od nm values of the anti-sars-cov- igg to the rbd, s, hr -hr , n, m, and e proteins are compared between prn and rp patients. the p value was calculated using a two-tailed mann-whitney u test or unpaired student's t test. d normalized od nm values of the anti-rbd igg were compared between prn-severe, prn-moderate, and rp patients. the p value was calculated using a two-tailed mann-whitney u test or unpaired student's t test. e normalized od nm values of the anti-rbd igm were compared between prn and rp patients. the p value was calculated using a two-tailed mann-whitney u test.
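the group comparisons above rely on a two-tailed mann-whitney u test. as a minimal sketch of how such a comparison works, the following computes the u statistic from pooled ranks and a two-sided p value from the normal approximation. the od values and the variable names `prn_od` and `rp_od` are hypothetical and are not data from this study, and the normal approximation is only rough for samples this small.

```python
import math


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole run of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def mann_whitney_u(group_a, group_b):
    """Two-sided Mann-Whitney U test (normal approximation, tie-corrected,
    no continuity correction). Returns (U, p)."""
    n1, n2 = len(group_a), len(group_b)
    pooled = list(group_a) + list(group_b)
    ranks = average_ranks(pooled)
    u_a = sum(ranks[:n1]) - n1 * (n1 + 1) / 2
    u = min(u_a, n1 * n2 - u_a)
    n = n1 + n2
    counts = {}
    for v in pooled:
        counts[v] = counts.get(v, 0) + 1
    tie_term = sum(c ** 3 - c for c in counts.values())
    sigma = math.sqrt(n1 * n2 / 12 * ((n + 1) - tie_term / (n * (n - 1))))
    if sigma == 0:
        return u, 1.0
    z = (u - n1 * n2 / 2) / sigma
    p = math.erfc(abs(z) / math.sqrt(2))
    return u, p


# Hypothetical normalized OD values, for illustration only.
prn_od = [1.2, 1.5, 0.9, 1.8, 1.4]
rp_od = [0.4, 0.6, 0.3, 0.8, 0.5]
u_stat, p_value = mann_whitney_u(prn_od, rp_od)
print(u_stat, p_value)
```

a rank-based test is a reasonable choice when od values are not normally distributed; for real analyses an exact small-sample method from a statistics library would be preferable to this approximation.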
in addition, a positive correlation was also observed between age and igg level to the rbd, s, hr -hr , and n proteins (r = . , p = . ; r = . , p = . ; r = . , p = . ; and r = . , p = . , respectively; fig. s ), indicating the important role of age in the generation of specific binding antibodies. because of the lack of clinical characteristics and the unknown significance of rp patients, it is critical to provide comprehensive serological profiling to guide the management of recovered covid- patients after discharge. an important feature of the rp patients was that they were younger than the prn patients, and the ages of these recovered patients positively correlated with titers of igg to the rbd protein. these observations are consistent with the conclusion that the level of igg to the rbd protein in rp patients is significantly lower than that in the prn group. based on our findings, the anti-rbd igg level could serve as an indicator of rp status. to minimize the risk of possible viral rebound and retransmission during the current pandemic, close monitoring of anti-rbd igg levels at viral shedding and long-term follow-up of patients with lower levels of rbd antibodies are needed. moreover, the relationship between anti-sars-cov- igg titers and rp status suggests that the interplay between the virus and the host immune response in coronavirus infections should be further investigated for the development of more accurate diagnostic technologies and effective vaccines against viral infection. 
positive rt-pcr test results in patients recovered from covid- 
clinical characteristics of the recovered covid- patients with redetectable positive rna test
recurrence of positive sars-cov- in patients recovered from covid- 
clinical course and risk factors for recurrence of positive sars-cov- rna: a retrospective cohort study from wuhan
prolonged sars-cov- rna detection in anal/rectal swabs and stool specimens in covid- patients after negative conversion in nasopharyngeal rt-pcr test
recurrence of sars-cov- pcr positivity in covid- patients: a single center experience and potential implications
assessment of patients who tested positive for covid- after recovery
recurrence of covid- after recovery: a case report from
virological assessment of hospitalized patients with covid- 
detection of sars-cov- -specific humoral and cellular immunity in covid- convalescent individuals
coronavirus disease test results after clinical recovery and hospital discharge among patients in china
viral kinetics and antibody responses in patients with covid- 
neutralizing antibody responses to sars-cov- in a covid- recovered patient cohort and their implications

the online version of this article (https://doi.org/ . /s - - - ) contains supplementary material. competing interests: the authors declare no competing interests.

key: cord- -mtq yh authors: rodrigues, joão pglm; barrera-vilarmau, susana; teixeira, joão mc; seckel, elizabeth; kastritis, panagiotis; levitt, michael title: insights on cross-species transmission of sars-cov- from structural modeling date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: mtq yh
severe acute respiratory syndrome coronavirus (sars-cov- ) is responsible for the ongoing global pandemic that has infected more than million people in more than countries worldwide. like other coronaviruses, sars-cov- is thought to have been transmitted to humans from wild animals. 
given the scale and widespread geographical distribution of the current pandemic, the question emerges whether human-to-animal transmission is possible and if so, which animal species are most at risk. here, we investigated the structural properties of several ace orthologs bound to the sars-cov- spike protein. we found that species known not to be susceptible to sars-cov- infection have non-conservative mutations in several ace amino acid residues that disrupt key polar and charged contacts with the viral spike protein. our models also predict affinity-enhancing mutations that could be used to design ace variants for therapeutic purposes. finally, our study provides a blueprint for modeling viral-host protein interactions and highlights several important considerations when designing these computational studies and analyzing their results. introduction sars-cov- , a novel betacoronavirus first identified in china in late , is responsible for the ongoing global pandemic that has infected more than million people worldwide and killed nearly . [ ] . based on comparative genomics, sars-cov- is thought to have been transmitted to humans from an animal host, most likely bats or pangolins [ ] . given the widespread human-to-human transmission across the globe, the question emerges whether humans can infect other animal species with sars-cov- , namely domestic and farm animals. identifying potential intermediate hosts that can act as reservoirs for the virus has both important global health, animal welfare, and ecological implications. during the course of this pandemic, there have been several news reports of domestic, farm, and zoo animals testing positive for sars-cov- infection. belgium [ ] and new york [ ] reported positive symptomatic cases in cats, the netherlands reported infection of minks in farms [ ] , and the bronx zoo in new york reported infections in lions and tigers [ ] . 
in all these cases, the vehicle of transmission appears to be an infected human owner or handler. more importantly, in the case of the mink farms in the netherlands, there is evidence of human-to-animal-to-human transmission. in addition to these reported cases, several groups put forward both pre-prints and peer-reviewed studies on animal susceptibility to sars-cov- under controlled laboratory conditions [ ] [ ] [ ] , two of which are of particular interest. the first study showed that cats, civets, and ferrets are susceptible to infection; pigs, chickens, and ducks are not, while the results for dogs were inconclusive [ ] . a second study, using human cells expressing recombinant sars-cov- receptor proteins showed that camels, cattle, cats, horses, sheep, and rabbit can be infected with the virus, but not chicken, ducks, guinea pigs, pigs, mice, and rats [ ] . together, these studies provide a dataset of confirmed susceptible and non-susceptible species that we can use to find molecular discriminants between the two groups. for simplicity, from here on we will refer to susceptible and non-susceptible species as sars-cov- pos and sars-cov- neg , respectively. like sars-cov- before, sars-cov- infection starts with the binding of the viral spike protein to the extracellular protease domain of angiotensin-converting enzyme (ace ) [ ] , a single-pass transmembrane protein expressed on the surface of a variety of tissues, including along the respiratory tract and the intestine. several biophysical and structural studies identified helices α and α , as well as a short loop between strands β and β in ace as the interface for the viral spike protein [ ] [ ] [ ] [ ] . these studies also identified key differences between the sequences of the receptor binding domains (rbd) of sars-cov- and sars-cov- , which explain the stronger interaction of the latter with human ace . 
as such, we can reasonably assume that sequence variation across ace orthologs might explain why some animal species are susceptible to infection while others are not. in addition, combining structural and binding data with the natural diversity of ace across species is likely to shine a light on the key aspects that drive ace interaction to viral rbds and ultimately help guide the development of therapeutic molecules against sars-cov- . unsurprisingly, several groups already published, or made available as preprints, multiple sequence and structure-based analyses of how sequence variation affects ace binding to sars-cov- rbd [ ] [ ] [ ] [ ] . two recent preprints, specifically, focus on the effects of ace variation on rbd binding. the first used an ace sequence library to select for mutants that bind rbd with high affinity, identifying several mutants that enhance or decrease affinity to the viral protein and providing a blueprint for engineering proteins and peptides with therapeutic purposes [ ] . while useful, we note that the authors carried out a single round of selection as opposed to the multiple rounds commonly carried out in similar studies. the second study used computational modeling to predict ΔΔg of mutations in animal species and assess their risk for infection [ ] . in addition, the authors also identified a number of locations on ace that contribute to binding the viral rbd, in particular residues , , , as well as a cluster of n-terminal hydrophobic amino acid residues. in this study, we aimed to leverage structural, binding, and sequence data to investigate how different ace orthologs bind to sars-cov- rbd. we selected animal species likely to encounter humans in a variety of residential, industrial, and commercial settings. for each of these species, we generated d models of ace bound to rbd and refined these models using short molecular dynamic simulations. 
after refinement, we found that models of sars-cov- pos species generally have a lower (better) score than those of sars-cov- neg species. further, we carried out a per-residue energy analysis that identified key locations in ace that are consistently mutated across sars-cov- neg species. collectively, our results provide a structural framework that explains why certain animal species are not susceptible to sars-cov- infection, and also suggests potential mutations that can enhance binding to the viral rbd. sequence conservation of ace orthologs we analyzed the sequence conservation of ace across our dataset, with respect to the entire sequence ( residues) and to the interface residues computed from a structure of ace bound to rbd (pdb id: m ) ( residues) ( table ). all orthologs are reasonably conserved, with global similarity values to the human ace sequence (hace ) ranging from % (goldfish) to . % (chimpanzee) (figure , left panel). all species coarsely cluster in three classes consistent with evolutionary distance to humans: primates have the highest similarity values, followed by other mammals, birds and reptiles, and finally fish. zooming in on the interface residues, we find substantially more variation (figure , right panel) . similarity values for these residues range from % (crocodile) to % (all primates) but, despite an overall correlation (pearson r of . ), do not always match global similarities. hedgehogs and sheep, for example, share . % and . % global similarity with hace , respectively, but % and . % for the interface region. in absolute numbers, these similarities mean that sheep share out of residues with hace at the interface with rbd, while hedgehogs share . the horseshoe bat, one of the proposed animal reservoirs for sars-cov- , shares . % interface similarity with hace , a comparable value to the . % of the sars-cov- neg mouse sequence. altogether, these results prompt two observations. 
first, neither global nor interface sequence similarity is predictive of sars-cov- susceptibility. second, that the interface of the viral rbd is substantially plastic and able to bind to sufficiently different ace orthologs. refinement of the hace :rbd complex in order to validate the refinement protocol used in our analysis, we created and refined models of human ace (hace ) bound to sars-cov- rbd. we used the cryo-em structure of full-length human ace bound to the rbd, in the presence of the amino acid transporter b at (pdb id: m ). compared to a high-resolution crystal structure of the same complex (pdb id: m j), the cryo-em structure lacks several key contacts between our two proteins of interest, which we attribute to poor density for side-chain atoms at the interface region. our refinement protocol restores the majority of these contacts (table s ) , yielding an average haddock score of - . (arbitrary units) for the best models of the best cluster. see materials and methods for further details on the protocol. these negative haddock scores suggest a favorable interface and agree with scores calculated for a reference set of transient protein-protein interactions (n= , haddock score=- . ± . ) [ ] . the interfaces in our models are dominated by hydrogen bond interactions involving the ace α helix and a small loop between strands β and β . there is one single salt-bridge involving hace d and rbd k consistently present in all our hace models. these observations all agree with the published crystal structure. further, the buried surface area of the refined models is also in agreement with published crystal structures (~ Å ). as such, considering the low quality of the interface region in our template structure, we are confident that our modeling and refinement protocol is robust enough to model all ace orthologs. 
refinement of orthologous ace :rbd complexes we modeled and refined complexes for all ace orthologs in our dataset (table ) using the same protocol as above. the representative models for each species ( best models of the best cluster) are available for visualization and download at https://joaorodrigues.github.io/ace -animal-models/. the haddock scores of all ace complexes (including hace ) range from - . (dog) to - . (mouse), a significant range that indicates substantial differences between these interfaces (table and figure ). the average haddock score is - . , very close to that of the human complex (- . ). overall, models of sars-cov- pos species have consistently lower (better) scores than those of sars-cov- neg species. although it is well-known that docking scores do not quantitatively correlate with experimental binding affinities [ ] , these scores suggest that sars-cov- neg species lack one or more key ace residues that contribute significantly to the interaction with rbd. to understand what forces drive the interactions between ace and sars-cov- rbd, we quantified the contribution of each component of the haddock scoring function to the overall score ( figure ). the haddock score is a linear combination of van der waals, electrostatics, and desolvation energy terms. in our models, electrostatics are the most discriminatory component (pearson r of . ), followed by desolvation ( . ), and finally van der waals ( . ). these correlations suggest that differences between the models of the different species originate primarily in polar and charged residues, in agreement with observations from experimental structures. in addition, the buried surface area of the models also correlates quite strongly with the haddock score (pearson r of . ), which is unsurprising since larger interfaces tend to make more contacts. 
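the decomposition described above, a score formed as a linear combination of energy terms followed by a correlation of each term with the total, can be sketched as follows. this is an illustration under stated assumptions: the weights in `WEIGHTS` are placeholders (this section does not state the values used), and the four models and their energy terms are invented for the example rather than taken from the study.

```python
import math

# Placeholder weights for the linear combination. These are assumptions for
# illustration; this section does not state the weights the authors used.
WEIGHTS = {"vdw": 1.0, "elec": 0.2, "desolv": 1.0}


def combined_score(terms, weights=WEIGHTS):
    """Weighted sum of energy terms (lower means more favorable)."""
    return sum(weights[name] * terms[name] for name in weights)


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sd_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    sd_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (sd_x * sd_y)


# Hypothetical energy terms for four made-up models (arbitrary units).
models = {
    "model_a": {"vdw": -60.0, "elec": -300.0, "desolv": 5.0},
    "model_b": {"vdw": -55.0, "elec": -250.0, "desolv": 8.0},
    "model_c": {"vdw": -58.0, "elec": -150.0, "desolv": 10.0},
    "model_d": {"vdw": -50.0, "elec": -100.0, "desolv": 12.0},
}

names = list(models)
scores = {name: combined_score(models[name]) for name in names}

# Correlate each component with the overall score across models.
correlations = {
    term: pearson_r([models[m][term] for m in names], [scores[m] for m in names])
    for term in WEIGHTS
}

for term, r in correlations.items():
    print(f"{term}: r = {r:.3f}")
```

because each component is itself part of the total, some correlation is expected by construction; the comparison is informative only about which term varies most in step with the score across models.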
most models bury between and Å , in agreement with the crystal and cryo-em structures, while the topscoring species (dog and goldfish) bury nearly Å and the lowest-scoring (mouse) bury only Å . finally, there is a weak correlation between the average haddock score of the representative models and the sequence similarity of the ace interface residues (pearson r of . ) ( figure s ). per-residue energetics of the ace :rbd interface to gain further insight on how ace sequence variation across the different orthologs affects binding to sars-cov- rbd, we calculated haddock scores for each interface residue in the refined models ( figure ). this high-resolution analysis reveals several sites that discriminate between sars-cov- pos and sars-cov- neg species. the first and most relevant of these sites is amino acid , which in hace (d ) interacts with rbd k to form the only intermolecular salt-bridge of the interface. in all sars-cov- pos species, this site is occupied by a negatively charged amino acid residue. in contrast, out of sars-cov- neg species have a hydrophobic or polar residue at this position. the goldfish ace sequence is an interesting outlier, with the second-best haddock score despite having a lysine at position that breaks the intermolecular salt-bridge. the loss of such an important site is compensated by the introduction of an alternative salt-bridge between e and rbd r . finally, the sequences of the top-scoring models also suggest that between aspartate and glutamate, the latter results in a stronger interaction, likely due to a stabilizing effect of the longer side-chain. the second site is amino acid , a lysine in hace , and in nearly all of the sars-cov- pos species, that interacts both with ace e and rbd q . the only exceptions are the civet and dromedary sequences, mutated to threonine and glutamate, respectively. in the case of the civet, our models show that t can still hydrogen bond with both e and rbd q . 
dromedaries, on the other hand, share e with chickens, guinea pigs, and ducks, all sars-cov- neg species. however, and quite beautifully, dromedaries compensate the possible electrostatic repulsion between e and e with a lysine at position (q in hace ) leading to the formation of an additional intramolecular salt-bridge that stabilizes the fold of ace and frees e to hydrogen bond with q in % of our models. all three sars-cov- neg species have an additional charge-reversal mutation at position , although with different outcomes in our models. in both chicken and duck ace , e is locked in an intramolecular salt-bridge with r in all of our models, losing the intermolecular hydrogen bond with rbd q . lastly, guinea pigs compensate k e with e k and remain able to hydrogen bond with rbd. the third discriminatory site between sars-cov- pos and sars-cov- neg species is amino acid , a histidine in hace and a polar residue in all sars-cov- pos species. in our hace models, h is doubly-protonated and forms an intramolecular salt-bridge/hydrogen bond with e and an intermolecular hydrogen bond with the hydroxyl group of rbd y . in addition, in most of our models, the aromatic ring of h is close enough (< . Å) to the aliphatic side-chain of rbd l to form productive hydrophobic interactions. our energetic analysis shows that substituting h by polar (serine, threonine) or hydrophobic (leucine, valine) residues destabilizes the interface, while substitution by a tyrosine substantially contributes to a stronger interaction. sars-cov- neg species except mouse and rat have hydrophobic residues at position , losing the ability to hydrogen bond with rbd y . in addition, the side-chain of rbd l is often out of range of hydrophobic interactions. 
in contrast, the h y substitution in the dog, ferret, and civet sequences loses the intramolecular hydrogen bond with e but compensates with hydrophobic interactions with nearby rbd residues and hydrogen bonds with rbd r (ferret), s (civet) or y (dog). in addition, the loss of aromatic residues at position leads to a steep decrease in desolvation energy of the models (figure s ). besides these three major discriminatory sites, we identified three other sites that are systematically mutated in sars-cov- neg species. the first of these sites is k (in hace ), which is involved in an intramolecular salt-bridge with d and an additional backbone hydrogen bond with rbd g . in rat and mouse ace , both sars-cov- neg species, this residue is mutated to a histidine, which weakens the interaction with d , possibly leading to increased conformational dynamics of the β -β loop and consequently lower binding affinity. then, position , a glutamine in hace and in most other species, hydrogen bonds with rbd y in the majority of our models. in canary, chicken, pigeon, hedgehog, duck, and crocodile ace sequences, this amino acid is mutated to a glutamate. this substitution introduces the possibility of an additional intramolecular salt-bridge with k , in ace helix α , which we observe in some of our models, preventing the formation of the intermolecular hydrogen bond. finally, y in hace is mutated to phenylalanine in canary, chicken, rat, duck, and mouse ace , mostly sars-cov- neg species. although our models do not offer a clear reason as to why this mutation could be damaging to rbd binding, the loss of the terminal hydroxyl group could have two negative consequences. first, there is the clear loss of two possible hydrogen bonds, with ace q and rbd n . 
then, the gain in hydrophobicity could lead the aromatic moiety to bury between both α and α helices, causing rbd f to lose a valuable interaction partner. all our models and the scoring statistics are available for visualization and download at https://joaorodrigues.github.io/ace -animal-models/. can structural modeling predict cross-species transmission of sars-cov- ? our computational modeling of vertebrate ace orthologs bound to sars-cov- rbd discriminates between previously reported sars-cov- pos and sars-cov- neg species. models of sars-cov- neg species -chicken, duck, guinea pig, mouse, and rat -generally have higher (worse) haddock scores than average (figure ), suggesting that these species' non-susceptibility to infection could stem from deficient rbd binding to ace . despite this clear trend, there are two notable outliers. our modeling ranks guinea pig ace (sars-cov- neg ) as a better receptor for sars-cov- rbd than for example, human, cat, horse, or rabbit ace (all sars-cov- pos species), despite experiments showing that there is negligible binding between the two proteins [ ] . then, the goldfish ace sequence ranks second among all models, despite reports that fishes are unlikely to be susceptible to infection due to their physiology and environment [ ] . these two results highlight the need for critical thinking when evaluating predictions from computational models. as noted earlier in the introduction, sars-cov- infection is a complex multi-step process [ ] . thus, while we can assume that impaired ace binding decreases odds of infection, we cannot state that ace binding is predictive of infection. for instance, experiments with recombinant ace show that the pig ortholog binds sars-cov- rbd and leads to entry of the virus in host cells [ ] , but tests in live animals returned negative results [ ] . in addition, our modeling protocol makes assumptions about the bound state of the two proteins, starting from the cryo-em template structure. 
however, cryo-em structures of the full-length sars-cov- spike protein [ ] highlight multiple unbound conformations for rbd, and coarse-grained simulations of the hace :rbd complex show that there is substantial flexibility in some of the interfacial rbd loops [ ]. altogether, these limitations show that computational models alone cannot predict whether certain animal species are at risk of infection. what our models do predict, however, is that there are distinctive molecular features characteristic of sars-cov- neg species. as the adage goes, 'all models are wrong, but some are useful.'

sars-cov- neg species lack important polar and charged ace residues

on further inspection, we find that sars-cov- neg models rank worse due to a substantial decrease in electrostatic energy (figure ), indicating loss of polar interface contacts, namely hydrogen bonds and salt-bridges (figure ). indeed, models of mouse, duck, rat, and chicken lack the ability to form an intermolecular salt-bridge with rbd due to the loss of hace d . these predictions are supported by experimental work, where mutants lacking a negative charge at this position are largely unable to bind rbd [ ]. non-conservative mutations at other sites on ace also contribute negatively to the interface scores. residues k and h (hace ) engage multiple neighboring residues in both intra- and intermolecular hydrogen bonds, contributing to ace fold stability and rbd binding, respectively. our models suggest that the introduction of a negatively charged residue at position is disruptive to binding, in agreement with experiments [ ]. in sars-cov- pos species, like dromedary camels, this mutation is more likely to be tolerated due to additional compensatory mutations that stabilize the ace fold and still allow for contacts with rbd. in all sars-cov- neg species except guinea pig, however, there are no additional mutations to compensate for this substitution.
as for position , our predictions contrast with experimental measurements [ ], which show that mutation to a hydrophobic residue improves binding between ace and rbd. in our models, the preference seems to be for aromatic residues (histidine, tyrosine) capable of both hydrogen bonds and hydrophobic interactions. we note, however, that our coverage of sequence space is limited to naturally occurring variants. unlike in the work referenced before [ ], where the selection driver is rbd binding, natural selection of ace might impose additional constraints on sequence variability. finally, our models suggest that reduced flexibility of ace might be a positive contributor to rbd binding affinity. disruption of an intramolecular salt-bridge between d and k, through substitution of k with a shorter polar amino acid residue, is a consistent feature in mice and rats, both sars-cov- neg species. these results support other computational modeling work [ ] that suggests that rbd mutants g d bind worse to ace because of the disruption of this intramolecular salt-bridge.

natural variants of ace encode potential affinity-enhancing mutations for sars-cov- rbd

in addition to identifying mutations that impair binding of sars-cov- rbd, our models suggest several hace variants that could be used to enhance affinity between the two proteins. the clearest affinity enhancer seems to be d e, a variant observed in of the best scoring species (figure ) and shown in experiments to increase binding to rbd [ , ]. the longer side-chain of a glutamate residue can help strengthen and stabilize the intermolecular salt-bridge with rbd k . the impact of such conservative mutations in stabilizing protein interactions has been reported previously for other systems [ ]. the second predicted enhancer is h y, which, as we discussed above, contrasts with experimental measurements.
in addition to maintaining hydrogen bonds and hydrophobic contacts, our models show that this mutation results in a substantial increase in desolvation energy (figure s ).

in summary, our protocol combines structural, sequence, and binding data to create a structure-based framework to understand sars-cov- susceptibility across different animal species. our models help rationalize the impact of naturally-occurring ace mutations on sars-cov- rbd binding and explain why certain species are not susceptible to infection with the virus. in addition, we propose possible affinity-enhancing mutants that can help guide engineering efforts for the development of ace -based antiviral therapeutics. despite the aforementioned limitations, our protocol and models can easily be replicated using freely-available tools and web servers and serve as a blueprint for future modeling studies on ace interactions with coronaviruses' rbds. finally, to prevent human-to-animal transmission, we recommend following the world organization for animal health guidelines: people infected with covid- should limit contact with their pets, as well as with other animals (including humans).

sequence alignment of ace orthologs

sequences of ace orthologs from species were retrieved from ncbi using the human gene as a reference (gene id: , updated on -apr- ) and the query term "ortholog_gene_ [group]". other species, such as rhinolophus sinicus, were manually included using custom queries. the sequences were aligned with mafft version [ , ], using the alignment method fft-ns-i (standard). some sequences had undefined amino acids ('x'), which we converted to glycine to allow modeling without any bias for amino acid identity. all species and the respective protein identifiers are listed in table .
biorxiv preprint. doi: https://doi.org/ . / . . .

all calculations were based on the alignments from mafft, restricted to the region used for modeling (residues - ). to calculate sequence similarity, we considered the following groups based on physico-chemical properties: charged-positive (arg, lys, his), charged-negative (asp, glu), aromatic (phe, tyr, trp), polar (ser, thr, asn, gln), and apolar (ala, val, ile, met). cys, gly, and pro residues were considered individual classes.

the modeling of ace orthologs was carried out using modeller . [ ] and custom python scripts (available upon request). we used the cryo-em structure of the sars-cov- rbd bound to human ace (pdb id: m ) [ ] as a template for all our subsequent models, including all glycans and the coordinates of rbd. to save computational resources, we modeled only the extracellular domain of ace , specifically residues - , which are known to be sufficient to bind to rbd. to avoid unwanted deviation from the initial cryo-em structure, we restricted the optimization and refinement of the models to the coordinates of atoms of mutated or inserted residues. we used the fastest library schedule for model optimization and the very_fast schedule for model refinement. for each species, we generated backbone or loop models and selected the one with the lowest normalized dope score as a representative. these final models were then processed to remove any sugar molecules in species where the respective asparagine residue had been mutated.

the initial complex models were prepared for refinement using the pdb-tools suite [ ]. each chain was separated into a different pdb file (pdb_selchain) and standardized with ter and end statements (pdb_tidy). we used haddock . [ ] to carry out the refinement of the models. the protein molecules were parameterized using the standard force field in haddock, while the sugars were parameterized using updated parameters for carbohydrates [ ].
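the class-based sequence similarity described in the methods above can be sketched in a few lines of python. this is only an illustration, not the authors' script: the function name `similarity`, the handling of gaps, and the treatment of residues outside the listed classes (for example leucine, which the text does not list) are assumptions.

```python
# Physico-chemical classes as listed in the text; Cys, Gly and Pro are
# each treated as their own class.
GROUPS = {
    "positive": "RKH",
    "negative": "DE",
    "aromatic": "FYW",
    "polar": "STNQ",
    "apolar": "AVIM",
    "cys": "C",
    "gly": "G",
    "pro": "P",
}
CLASS_OF = {aa: name for name, members in GROUPS.items() for aa in members}


def similarity(ref: str, other: str) -> float:
    """Percentage of aligned columns whose residues share a class.

    Assumptions (not stated in the source): columns with a gap ('-') in
    either sequence are skipped, and residues that are not in any listed
    class only count as similar when they are identical.
    """
    if len(ref) != len(other):
        raise ValueError("sequences must come from the same alignment")
    compared = 0
    matched = 0
    for a, b in zip(ref.upper(), other.upper()):
        if a == "-" or b == "-":
            continue
        compared += 1
        if a == b or (a in CLASS_OF and CLASS_OF.get(a) == CLASS_OF.get(b)):
            matched += 1
    return 100.0 * matched / compared if compared else 0.0


if __name__ == "__main__":
    # Toy aligned fragments, purely illustrative.
    print(similarity("KDF-A", "RES-L"))
```

restricting the input strings to the interface columns of the alignment would give the interface similarity instead of the global one.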
we used a modified version of the topology generation scripts to allow automatic detection of n-linked glycans and expand the range of the interface refinement ( Å distance cutoff). each initial homology model was refined through independent short molecular dynamics simulations in explicit solvent (solvshell=true). these refined models were then clustered using the fcc algorithm [ ] with default parameters and scored using the haddock score, a linear combination of van der waals, electrostatics, and desolvation energies. a lower haddock score is better. the top models of the top scoring cluster, ranked by its average haddock score, were selected as representatives of the complex.

analysis of interface contacts of refined ace :rbd complexes

we used the interfacea analysis library (version . ) (http://doi.org/ . /zenodo. ) to identify intermolecular contacts between hace and rbd, specifically hydrogen bonds, salt bridges, and aromatic ring stacking. hydrogen bonds were defined between any donor atom (nitrogen, oxygen, or sulfur bound to a hydrogen atom) within . Å of an acceptor atom (nitrogen, oxygen, or sulfur), if the donor-hydrogen-acceptor angle was between and degrees. salt bridges were defined between two residues with a pair of cationic/anionic groups within Å of each other. finally, two aromatic residues were defined as stacking if the centers of mass of the aromatic groups were within . Å (pi-stacking) or Å (t-stacking) and the angle between the planes of the rings was between and degrees (pi-stacking) or between and degrees (t-stacking). additionally, for pi-stacking interactions, the projected centers of both rings must fall inside the other ring. for each modelled species, we took the best models of the best cluster, judged by their haddock score, and aggregated all their contacts together. contacts present in at least models were considered representative.
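the final aggregation step (pool the contacts of the best models and keep those seen in a minimum number of models) can be sketched as below. the contact tuple layout, the function name `representative_contacts`, and the `min_models` parameter are hypothetical; the threshold itself is not legible in this copy of the text, so it is left as an argument.

```python
from collections import Counter
from typing import Iterable, Set, Tuple

# Hypothetical contact record: (receptor residue, rbd residue, contact type).
Contact = Tuple[str, str, str]


def representative_contacts(
    models: Iterable[Set[Contact]], min_models: int
) -> Set[Contact]:
    """Return contacts that occur in at least `min_models` of the models.

    Each model contributes a contact at most once, so the count for a
    contact is the number of models in which it was observed.
    """
    counts: Counter = Counter()
    for contacts in models:
        counts.update(set(contacts))
    return {contact for contact, n in counts.items() if n >= min_models}


if __name__ == "__main__":
    c1 = ("resA", "resX", "salt_bridge")
    c2 = ("resB", "resY", "hbond")
    c3 = ("resC", "resZ", "hbond")
    models = [{c1, c2}, {c1}, {c1, c3}]
    print(representative_contacts(models, min_models=2))
```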
per-residue decomposition of haddock scores

we used a custom cns [ ] script to calculate the haddock score of each residue at the interface between ace and rbd. briefly, the protocol was the following. for each model, since haddock uses a united-atom force field, we first added missing hydrogen atoms and minimized their coordinates, keeping all other atoms fixed. we marked a residue of ace as part of the interface if any of its atoms were within Å of any atom of rbd, and vice-versa. we then calculated the electrostatics, van der waals, and desolvation energies for each of these residues, considering only atoms belonging to the other protein chain. note that this protocol does not account for intramolecular effects of mutations. finally, we calculated the haddock score per residue, using the default scoring function weights, and averaged per-residue values for the best models of the best cluster of each species.

figure . sequence similarity of ace orthologs to human ace . global sequence similarity values range from - %, while similarity values for residues interacting with sars-cov- rbd (derived from m ) range from - %. species are ordered by decreasing global sequence similarity to human ace . colors indicate known susceptibility to infection: sars-cov- pos species in green, sars-cov- neg species in red, others in gray.

figure . haddock scores of modeled ace orthologs bound to sars-cov- rbd. the haddock score predicts the strength of the interaction between proteins. models of sars-cov- pos species (green) generally have better (more negative) scores than sars-cov- neg species (red), suggesting that impaired binding between the two proteins might explain differences in viral susceptibility. the scores shown here are the average of the best models for each ace ortholog.

figure . correlation of haddock score with individual energy terms and structural features. differences in electrostatics energy contribute the most towards discriminating sars-cov- pos species (green) from sars-cov- neg species (red), supporting observations of hydrogen bonding networks and charged interactions in experimental structures. the buried surface area of the models is also correlated with their haddock score. the units for van der waals and electrostatics energies, desolvation, and buried surface area are kcal.mol - , arbitrary units, and Å , respectively. the human complex is shown in black for reference.

figure . haddock score of individual ace interface residues. amino acid residues at positions , , , and are predicted to be the largest contributors to the stability of the interface. sars-cov- neg (red labels) species consistently show changes in these positions which could explain their non-susceptibility to the virus.
the top-scoring sars-cov- pos species (green labels) also suggest that hace d e and h y could potentially act as affinity enhancers. for each species, each block represents an interface residue of ace . the identity of the amino acid is shown in one-letter codes. the colors represent the average haddock score of each particular residue over the best models: lower scores (blue) indicate more favorable interactions. positive scores (dark red) indicate steric clashes or electrostatic repulsion. blank squares indicate that in that ortholog, that position is not part of the interface of the complex. residues marked with * and ** are observed to form hydrogen bonds or salt-bridges in the hace :rbd crystal structure, respectively. see materials and methods for additional details on definitions.

figure s . correlation between interface sequence similarity to hace and haddock score.

figure s . desolvation energy of individual ace interface residues. aromatic residues at position contribute the most to the gain in desolvation energy across all species of the complex, indicating that h y could be a potential affinity-enhancing mutation in hace . for each species, each block represents an interface residue of ace . the identity of the amino acid is shown in one-letter codes. the colors represent the average desolvation energy of each particular residue over the best models: lower scores (blue) indicate more favorable interactions.
blank squares indicate that in that ortholog, that position is not part of the interface of the complex. residues marked with * and ** are observed to form hydrogen bonds or salt-bridges in the hace :rbd crystal structure, respectively. see materials and methods for additional details on definitions.

an interactive web-based dashboard to track covid- in real time the proximal origin of sars-cov- a cat appears to have caught the coronavirus, but it's complicated mink infected two humans with coronavirus: dutch government. reuters. seven more big cats test positive for coronavirus at bronx zoo. in: animals [internet susceptibility of ferrets, cats, dogs, and other domesticated animals to sars-coronavirus potential host range of multiple sars-like coronaviruses and an improved ace -fc variant that is potent against both sars-cov- and sars-cov- . microbiology simulation of the clinical and pathological manifestations of coronavirus disease (covid- ) in golden syrian hamster model: implications for disease pathogenesis and transmissibility structural and functional basis of sars-cov- entry by using human ace structure of the sars-cov- spike receptor-binding domain bound to the ace receptor structural basis for the recognition of sars-cov- by fulllength human ace structural basis of receptor recognition by sars-cov- the sequence of human ace is suboptimal for binding the s spike protein of sars coronavirus .
biochemistry sars-cov- spike protein predicted to form stable complexes with host receptor protein orthologues from mammals receptor recognition by the novel coronavirus from wuhan: an analysis based on decade-long structural studies of sars coronavirus sars-cov- , an evolutionary perspective of interaction with human ace reveals undiscovered amino acids necessary for complex stability proteins feel more than they see: fine-tuning of binding affinity by properties of the non-interacting surface are scoring functions in protein−protein docking ready to predict interactomes? clues from a novel binding affinity benchmark viewpoint: sars-cov- (the cause of covid- in humans) is not known to infect aquatic food animals nor contaminate their products the trinity of covid- : immunity, inflammation and intervention structure, function, and antigenicity of the sars-cov- spike glycoprotein mafft: a novel method for rapid multiple sequence alignment based on fast fourier transform aleaves facilitates on-demand exploration of metazoan gene family trees on mafft sequence alignment server with enhanced interactivity comparative protein modelling by satisfaction of spatial restraints pdb-tools: a swiss army knife for molecular structures the haddock . web server: user-friendly integrative modeling of biomolecular complexes compatible topologies and parameters for nmr structure determination of carbohydrates by simulated annealing clustering biomolecular complexes by residue contacts similarity version . of the crystallography and nmr system sheep - acknowledgements jpglmr acknowledges support from the molecular sciences software institute (aci- ). jpglmr and ml acknowledge funding from the national institutes of health usa (r gm ). plk acknowledges funding from the federal ministry for education and research (bmbf, zik program) ( z hn ) and the european regional development funds for saxony-anhalt (efre: zs/ / / ). the authors thank t. dots, k. lindorff-larsen, j. puglisi, and r. 
fernandes for feedback and encouragement during the course of the project. key: cord- -lbmbp ca authors: hansen, c. b.; jarlhelt, i.; perez-alos, l.; hummelshoj landsy, l.; loftager, m.; rosbjerg, a.; helgstrand, c.; bjelke, j. r.; egebjerg, t.; jardine, j. g.; svaerke jorgensen, c.; iversen, k.; bayarri-olmos, r.; garred, p.; skjoedt, m.-o. title: sars-cov- antibody responses determine disease severity in covid- infected individuals date: - - journal: nan doi: . / . . . sha: doc_id: cord_uid: lbmbp ca globally, the covid- pandemic has had extreme consequences for the healthcare system and calls for diagnostic tools to monitor and understand the transmission, pathogenesis and epidemiology, as well as to evaluate future vaccination strategies. here we have developed novel flexible elisa-based assays for specific detection of sars-cov- antibodies against the receptor-binding domain (rbd): an antigen sandwich-elisa relevant for large population screening and three isotype-specific assays for in-depth diagnostics. their performance was evaluated in a cohort of convalescent participants with previous covid- infection, ranging from asymptomatic to critical cases. we mapped the antibody responses to different areas on protein n and s and showed that the igm, a and g antibody responses against rbd are significantly correlated to the disease severity. these assays-and the data generated from them-are highly relevant for diagnostics and prognostics and contribute to the understanding of long-term covid- immunity. coronaviruses (covs) are zoonotic pathogens primarily targeting the human respiratory system ( ) . while most human cov infections are mild, three coronaviruses have appeared in the past two decades that cause deadly pneumonia. severe acute respiratory syndrome coronavirus (sars-cov- ), first observed in china in , spread rapidly to countries infecting more than , people and causing deaths before being contained in ( ) . 
barely a decade later in , middle east respiratory syndrome coronavirus (mers-cov) was identified on the arabian peninsula and has caused more than , cases and deaths ( ) ( ) ( ). at the end of , a novel sars-cov strain (sars-cov- ) emerged in wuhan (china) and has been spreading at an unprecedented speed around the world ever since. the disease, named coronavirus disease (covid- ), accounts for more than million confirmed cases and , related deaths at the time of manuscript preparation ( ). the clinical features of the covid- infection are diverse, ranging from asymptomatic carriers of the infection to acute respiratory distress syndrome and multiple organ dysfunction ( ). sars-cov- can infect people of all ages; however, the elderly and people suffering from co-morbidities such as diabetes, cardiovascular disease, chronic respiratory disease or cancer are more likely to suffer from a severe disease progression ( ). sars-cov- is an enveloped rna virus with a diameter of - nm that accommodates one of the largest genomes of all known rna viruses ( . - . kb) ( ). one-third of its genome encodes four structural proteins: spike (s), membrane (m), envelope (e) and nucleocapsid (n) proteins. the protein s, which extends as homotrimers on the outer viral membrane, binds to the host angiotensin-converting enzyme (ace ) receptor and allows the virus to enter and infect the host target cell ( ). host proteases process the protein s into the s and s subunits: s is responsible for receptor recognition and is composed of an n-terminal and a c-terminal domain, the latter containing the receptor-binding domain (rbd) ( ), while the s mediates the fusion of the viral envelope with the membrane of the host cell. the protein s of sars-cov- shares % homology at the amino acid level with the protein s of sars-cov- and, while both interact with the ace receptor, sars-cov- does so with a - times higher affinity ( ), which may explain the higher transmission rate of covid- .
precise diagnosis of covid- with rt-pcr detection of sars-cov- nucleic acids remains crucial to identify symptomatic carriers to secure correct treatment and as a tool in quarantine strategies to limit the infection rate in asymptomatic carriers. serological detection of specific sars-cov- antibodies is a useful tool to identify convalescent individuals that have developed immunity and thereby might be protected against reinfection, although this issue is still not resolved ( ). moreover, serological testing is necessary to understand the transmission, pathogenesis and epidemiology of sars-cov- , providing critical data to inform public health authorities for controlling the spread of covid- and eventually re-opening societies. another question that has arisen is whether an overwhelming antibody response may even aggravate covid- infection in patients ( ). due to urgency and demand, many serological tests have been developed rapidly and made commercially available with only limited validation on clinical samples ( ). we have developed a flexible elisa-based platform for rapid and specific detection of sars-cov- antibodies. the platform includes an indirect rbd sandwich elisa (s-elisa) for pan immunoglobulin (ig) detection suitable for large scale antibody surveillance and direct elisas for in-depth analyses of the igm, iga and igg isotype antibody responses towards rbd and protein n. moreover, we set out to characterize the antibody response levels in relation to symptom characteristics and disease severity, to elucidate the immunological response in covid- convalescent individuals.

the copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medrxiv a license to display the preprint in perpetuity. it is made available under a cc-by-nc-nd . international license. this version posted july , . medrxiv preprint. doi: https://doi.org/ . / . . .

the assays were optimized with a specific focus on diminishing nonspecific binding and the final setup was chosen based on the optimal intensity (od) of the absorbance signal and signal-to-noise (s/n) ratio between the positive sample and the negative quality control. the developed elisa-based assays proved to be suitable for automatization and were used in high-throughput setups with both -well and -well formats, with results from the two formats correlating significantly.

a total of convalescent plasma samples from individuals previously infected with sars-cov- (positive samples) and plasma samples from healthy individuals collected before the pandemic outbreak (negative controls) were subjected to antibody measurement in the rbd s-elisa (figure a) and in the direct elisa setups (figure b-d). the s/n ratio between the od of a positive sample and the od of the negative quality control and receiver operating characteristic (roc) curves were assessed to calculate the best-fit cut-off and to estimate the performance of the assay. the rbd s-elisa performed with a . % sensitivity and . % specificity (figure a). the sensitivities and specificities of the direct elisas were . %, . % for igm (figure b), . %, . % for iga (figure c) and . %, . % for igg (figure d), respectively. the intra- and inter-assay variation were found to be acceptable (< %) for all four assays (table ). the limit of detection was determined by interpolating the cut-off od value and converting it into antibody concentrations. the resulting values indicate an estimated -fold higher sensitivity of the direct elisa setup ( . ng/ml) compared to the s-elisa ( . ng/ml) (table ).
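as an illustration of how a cut-off can be derived from roc analysis, the sketch below scans candidate thresholds over s/n ratios and keeps the one that maximizes sensitivity plus specificity (youden's index). the choice of youden's index is an assumption made here for the example; the text only states that a best-fit cut-off was calculated. the function names and the toy values are likewise hypothetical.

```python
from typing import Sequence, Tuple


def sens_spec(
    pos: Sequence[float], neg: Sequence[float], cutoff: float
) -> Tuple[float, float]:
    """Sensitivity and specificity when a sample is called positive
    if its value is greater than or equal to `cutoff`."""
    true_pos = sum(v >= cutoff for v in pos)
    true_neg = sum(v < cutoff for v in neg)
    return true_pos / len(pos), true_neg / len(neg)


def best_cutoff(pos: Sequence[float], neg: Sequence[float]) -> float:
    """Observed value that maximizes sensitivity + specificity.

    Ties are resolved in favour of the lowest candidate, because max()
    returns the first maximal element of the sorted candidate list.
    """
    candidates = sorted(set(pos) | set(neg))
    return max(candidates, key=lambda c: sum(sens_spec(pos, neg, c)))


if __name__ == "__main__":
    # Toy s/n ratios, not data from the study.
    positives = [0.8, 1.2, 1.5, 0.9]
    negatives = [0.1, 0.2, 0.3, 0.5]
    cut = best_cutoff(positives, negatives)
    print(cut, sens_spec(positives, negatives, cut))
```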
detection of igm, iga and igg antibodies against sars-cov- protein n was evaluated by analyzing positive samples and negative controls and roc curve analyses were assessed to estimate the assay performance.

to provide a better insight into antibody seroconversion during sars-cov- infection and reactivity against different locations on protein s and protein n, we conducted igm, iga and igg detection in positive samples against protein fragments and short peptides located on the protein s and protein n structures, full-length rbd, protein s and protein n (figure a ). the heatmap shown in figure b indicates a different reactivity of igm, iga and igg towards the proteins/peptides analyzed. figure c represents a simplified overview of the % of individuals with antibodies recognizing each protein/fragment. a clear tendency towards a higher prevalence of igg responses against the s subunit part of protein s and the middle part of protein n was observed. parts of the s subunit and most of the n terminal part of rbd showed little immunogenicity. the cut off was calculated as the average of the negative controls plus three times the standard deviation. levels of antibodies against sars-cov- were measured from plasma samples of recovered individuals with covid- . the rbd s-elisa measures total anti-rbd igs present in the samples using a single dilution ( figure a figure b ). in comparison, responses of igm and iga isotypes were detected in and of the individuals, respectively ( figure c and figure d ).
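the cut-off rule quoted above (mean of the negative controls plus three standard deviations) is simple enough to write out directly. the sketch below uses the sample standard deviation from python's `statistics` module; whether the authors used the sample or the population form is not stated, so that choice is an assumption, as are the function name and the toy od values.

```python
from statistics import mean, stdev
from typing import Sequence


def cutoff_mean_plus_sd(negative_controls: Sequence[float], k: float = 3.0) -> float:
    """Cut-off = mean of the negative controls + k * standard deviation.

    Uses the sample standard deviation (n - 1 denominator); this is an
    assumption, since the text does not specify the form used.
    """
    return mean(negative_controls) + k * stdev(negative_controls)


if __name__ == "__main__":
    # Toy od readings for negative controls, not data from the study.
    negatives = [0.10, 0.12, 0.08, 0.10]
    threshold = cutoff_mean_plus_sd(negatives)
    for od in (0.11, 0.30):
        print(od, "positive" if od > threshold else "negative")
```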
a dynamic range of antibody titers expressed as arbitrary units (a.u.)/ml was obtained when od values were interpolated by regression analysis using four-parameter logistic curve fitting. the convalescent individuals were classified according to disease severity (figure a-c) and symptom onset, calculated as the time from the first self-perceived symptom related to covid- to the moment of blood sampling (figure d-f). we observed a highly significant difference in the antibody titers between the disease severity groups (figure a-c) (p < . ), with a clear association between increasing antibody levels and more severe disease symptomatology for all three antibody isotypes. igg shows the most significant increase between the severity groups (figure c). in contrast, asymptomatic individuals appeared not to follow the same severity tendency. this could be explained by the low number of individuals in this group, which may therefore not be representative. when we assessed the difference in antibody titers between groups based on the time of symptom onset (figure d-f), we observed a significant increase in igg levels continuously over the time of sampling (figure f), while igm (figure d) and iga levels (figure e) did not change significantly. table depicts the correlation between self-perceived covid- symptomatology and the igm, iga and igg titers, as well as the disease severity, sex and age. symptoms such as fever, shortness of breath and lack of appetite were significantly and positively correlated with igm and igg levels. in contrast, iga levels were significantly negatively correlated with the loss of sense of smell/taste and headache.
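the interpolation of od values through a four-parameter logistic (4pl) standard curve, mentioned at the start of the passage above, can be illustrated as follows. the sketch assumes the curve parameters have already been fitted to the calibrator dilutions (the fitting itself is not shown), and the parameter values in the example are invented for illustration only.

```python
def four_pl(x: float, a: float, b: float, c: float, d: float) -> float:
    """Four-parameter logistic curve.

    a: response at zero concentration, d: response at infinite
    concentration, c: inflection point, b: slope factor.
    """
    return d + (a - d) / (1.0 + (x / c) ** b)


def inverse_four_pl(y: float, a: float, b: float, c: float, d: float) -> float:
    """Concentration (e.g. in a.u./ml) whose predicted response equals `y`.

    Only defined for responses strictly between the two asymptotes.
    """
    low, high = min(a, d), max(a, d)
    if not low < y < high:
        raise ValueError("response outside the range of the standard curve")
    return c * ((a - d) / (y - d) - 1.0) ** (1.0 / b)


if __name__ == "__main__":
    # Hypothetical fitted parameters, not values from the study.
    params = dict(a=0.05, b=1.2, c=100.0, d=3.0)
    od = four_pl(50.0, **params)
    print(od, inverse_four_pl(od, **params))
```

samples whose od falls outside the asymptotes cannot be interpolated, which is why such samples are normally re-measured at another dilution.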
age, sex (male) and disease severity were all significantly positively correlated with the level of all three antibody isotypes. when adjusting the analysis between antibody levels and disease severity for age and sex, there was no longer a significant correlation between iga and severity, whereas the correlation remained highly significant for igm and igg. we have developed an elisa-based platform for detection of sars-cov- antibodies comprising an indirect rbd s-elisa for pan ig detection and direct elisas for in-depth analyses of the igm, iga and igg isotype responses towards rbd and protein n. rbd was chosen as the primary antigen for the screening and estimation of antibody titers for several reasons. it is regarded as a sufficiently representative part of protein s to induce an immunogenic response ( ) and we did not detect additional sensitivity improvements by employing the full protein s as a ligand-target in the direct elisa (to be described in detail elsewhere). in a recent phase trial, antibody responses against a vaccine candidate (s- p antigen) and the rbd were assessed, finding similar ig responses in pattern and magnitude between both antigens ( ) . one of the advantages of using rbd instead of full-length protein s is the more efficient production and higher stability of recombinant rbd due to its reduced size and simple tertiary structure. the relatively unique primary sequence of the sars-cov- rbd ( ) also reduces the risk of cross-reactive antibody signals derived from prior b cell responses against other types of coronaviruses.
it is, however, important to note that this setup is highly flexible, making it possible to substitute the antigen upon a change in demand or in case of viral mutations. the use of protein n as a target antigen for serological screening has several theoretical pitfalls: the location of protein n makes it less accessible for b cell receptor interaction on the naïve b cell and probably requires degradation of the viral membrane. furthermore, protein n shows higher sequence conservation ( . %) and thereby increases the risk of false-positive detection ( ) . nevertheless, large commercial providers have chosen to use protein n as the serological target in different types of sars-cov- antibody assays ( ) . in our study, the detection of igm antibodies towards protein n or its fragments was weaker than that of igg, suggesting a fast seroconversion of igm into igg. furthermore, around - % of the convalescent individuals did not develop any detectable antibody response, which is in good agreement with a previous study ( ) . the direct elisa setup allows the use of different sars-cov- proteins in their full-length, shortened variants or fragmented immunogens, offering a useful tool to study the different reactivity patterns of igm, iga and igg towards specific exposed areas on the viral antigens. we measured several different antigen areas on protein n and s to establish a heatmap of the antibody landscape. based on the results, we could demonstrate a tendency towards immunodominant areas in the s unit and the central part of protein n. however, it is important to highlight that the heatmap does not represent the full sequences of the cov antigens and that the dissection of antigens into shorter fragments could have a significant impact on the antibody reactivity. it could, however, in a more extensive setup, provide valuable information towards a targeted .
we show that igg levels increase during the first weeks after symptom onset, while the igm and iga levels remain stable for the same period of time. this was surprising as we would have expected a more pronounced decrease in igm levels over time. this observation could indicate an importance of the two isotypes, reinforced by the fact that severity is correlated with high levels of antibodies of all three isotypes. it has previously been shown that both igg independently and total antibody levels correlate with disease severity in patients during hospitalization ( ) , but to our knowledge, the prolonged clear correlation between iga, igm and igg titers and disease severity has not been reported before. the data illustrate that individuals with mild symptoms during infection with sars-cov- will, in general, mount a lower antibody response compared to individuals with moderate and severe symptoms several months after recovering from a covid- infection. this observation gives an essential insight into the immunological response regarding clinical disease presentation, which further highlights the demand for more quantitative assays. in this study, all three antibody isotypes correlate positively with age and sex (male), which can be explained by the fact that severe covid- infection is more pronounced in the elderly male population ( ) .
we show that the antibody titers, especially igg levels, are correlated with specific symptom characteristics, including fever, pharyngalgia, shortness of breath and nausea, which again shows the link between clinical manifestations and the immunological response. the iga level, on the other hand, was negatively correlated with loss of sense of taste/smell and surprisingly showed no other correlation to the symptomatology of the upper respiratory tract in this study. the role of iga, which is considered the predominant antibody involved in mucosal immunity, remains to be fully understood. iga is suggested to mediate anti-viral defense functions at different anatomic levels in relation to mucosal epithelium ( ) . however, the mechanisms behind this remain unknown and often gain limited attention during infectious studies. in this respect, it could be interesting to examine the iga levels in mucosal tissue during sars-cov- infection and determine whether the mucosa-associated iga plays a significant role during sars-cov- infection. our findings provide support to the notion that antibodies towards sars-cov- represent a double-edged sword. antibodies are important in viral neutralization, but also in fc receptor-mediated phagocytosis, antibody-dependent cellular cytotoxicity (adcc) and complement-dependent cellular cytotoxicity (cdcc) and subsequent elimination of pathogens. however, it is known that particularly adcc and cdcc can drive harmful and systemic pro-inflammatory responses that can have severe pathophysiological consequences. thus, based on our findings and others, it may be suggested that an unwanted immune response towards sars-cov- may be one of the mechanisms causing hyperactivation of macrophages and monocytes, leading to the deadly cytokine storm, which seems to be a hallmark of covid- ( ) . whether a previously infected individual can expect stable long-term protection against reinfection with sars-cov- remains unknown. 
a recent study found a correlation between the production of neutralizing antibodies against rbd and elevated igg titers in convalescent covid- individuals ( ), reinforcing the use of rbd as the candidate to analyze for neutralizing antibodies in these individuals. the durability of neutralizing antibodies (primarily igg) against sars-cov- has yet to be defined, but persistence for up to days from symptom onset has been described ( , ) . in comparison, following infection with sars-cov- , concentrations of igg remained high for approximately to months before subsequently declining slowly during the next to years ( ) . it is uncertain whether an individual with low antibody titers, mainly igg, has a higher risk of reinfection compared to an individual with high levels of antibodies post disease. it has recently been suggested that in the absence of detectable antibodies, an individual could be protected against reinfection by the presence of memory t cells ( ) . furthermore, the major proportion of convalescent individuals included in this study indeed show a dominating igg response, suggesting that affinity maturation, isotype class switching and b cell memory responses have occurred. these b cell populations could, together with the memory cd + and cd + cells, secure a fast and efficient response to a secondary exposure to sars-cov- . this study is limited by relying on participants' retrospectively self-reported symptomatology and symptom debut, which allows for an unknown amount of misclassification. moreover, with this design, we could not monitor the antibody response concerning survival.
however, because of the detailed analysis of the antibody responses, and the clear associations despite the retrospective design, we assume that the associations would be even stronger in a carefully conducted prospective study. in conclusion, we have established robust, flexible and specific elisa-based platforms for detection of sars-cov- antibodies and presented novel insight into the link between antibody responses and covid- disease severity.
the following buffers were used: pbs ( . mm na hpo , . the coding sequence for protein s rbd (qic . , aa r -s ) was synthesized by genscript and cloned into a pcdna . expression vector with an n-terminal human vh - signal peptide and a c-terminal xhis tag followed by an avitag (-gsg-hhhhhhhhhh-gsg-glndifeaqkiewhe). the sequence of protein n (qld . ) was optimized as described elsewhere ( ) and synthesized by geneart (thermo fisher scientific, waltham, ma, usa) into a pcdna . vector with the human serum albumin signal peptide and a c-terminal xhis tag (-gs-hhhhhhhh). both constructs were expressed using a mammalian transient expression system. on day five after transfection, the supernatants were harvested by centrifugation and sterile filtered. the recombinant proteins were purified using a -step automated purification method set up on an Äkta express chromatography system with an immobilized metal affinity excel histrap column and a size exclusion superdex column (chromatography system and columns from cytiva, marlborough, ma, usa). the purified proteins were stored in a buffer composed of mm hepes, mm nacl, ph . . a portion of the rbd was specifically biotinylated in the avitag sequence using a bira kit (avidity llc, co, usa).
a total of recovered individuals who had previously tested rt-pcr positive for sars-cov- were included in the study. the department of cardiology at herlev university hospital in denmark recruited the participants. the rt-pcr positive participants comprised males and females aged from - , and the course of disease ranged from asymptomatic to critically ill. all participants were invited to complete an electronic self-report questionnaire providing additional information about symptom onset, characteristics and disease severity divided into the following groups: asymptomatic, mild, moderate, severe or critical. mild disease was defined as having few symptoms and generally feeling well, moderate disease as being bedridden at home, severe disease as the need for hospitalization and critical disease as the need for admission to the intensive care unit (icu) for mechanical ventilation. characteristics of the study participants are detailed in table . serum and edta plasma samples were stored in aliquots and kept frozen at − °c . plates were washed four times with pbs-t between the steps mentioned above. all -well plates were handled by the biomek fx automated workstation (beckman coulter, brea, ca, usa).
development and validation: specificity of the signal, limit of detection, variation, parallelism and clinical performance
the rbd s-elisa and the direct elisas were subjected to optimization with regard to dilution range, the use of blocking buffers and variations in incubation times. the final conditions were chosen based on s/n ratios.
specificity and sensitivity were calculated based on roc curve representation and selection of the most appropriate cut off, prioritizing specificity. assay sensitivity regarding the limit of detection was determined as the concentration obtained by interpolating the od value of the cut off. the calibrator was prepared by spiking µg/ml of recombinant human monoclonal igg antibody against sars-cov- spike (a , genscript, piscataway, nj, usa) into normal human serum and preparing a -fold dilution series in serum. samples were treated as patient samples and further diluted : in the s-elisa and : in the direct elisa in pbs-t, followed by incubation as stated above. intra-assay variation was evaluated from the individual coefficients of variation (cv) of all the duplicates in a total of samples. inter-assay variation was evaluated by calculating the cv of two samples run in duplicates in separate plates on three different days. the parallelism between serum and plasma samples was evaluated by comparing pairs of serum and plasma samples using spearman rank correlation tests. to evaluate whether the antibody levels correlated with the disease severity and/or the days after symptom onset, sample absorbances were logistically transformed and a four-parameter logistic curve fitting was applied to calibrate the antibody levels into a.u./ml. the appropriate dilution for each sample was chosen based on the best fit in the linear range of the calibrator. the interpolated value in a.u./ml was corrected by the dilution factor ( , or ). a sample od value below . was automatically given the value of a.u./ml.
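the calibration step described above (interpolating a sample od on a four-parameter logistic calibrator curve, then correcting for the dilution factor) can be sketched as follows. this is a minimal sketch under stated assumptions, not the authors' implementation: the curve parameters are taken as already fitted, all names and numeric values are hypothetical, and the od floor and the value assigned below it are passed in explicitly because the source does not preserve them.

```python
def four_pl(x, a, b, c, d):
    """Four-parameter logistic curve: OD as a function of concentration x >= 0.

    a = response at zero concentration, d = response at infinite concentration,
    c = inflection point, b = slope factor.
    """
    return d + (a - d) / (1.0 + (x / c) ** b)


def inverse_four_pl(y, a, b, c, d):
    """Solve four_pl(x) = y for x; y must lie strictly between a and d."""
    low, high = min(a, d), max(a, d)
    if not low < y < high:
        raise ValueError("OD outside the range of the calibrator curve")
    return c * ((a - d) / (y - d) - 1.0) ** (1.0 / b)


def titer_au_per_ml(od, params, dilution_factor, od_floor, floor_value):
    """Interpolate an OD on the calibrator curve and correct for sample dilution.

    ODs below od_floor are assigned floor_value (both are hypothetical inputs).
    """
    if od < od_floor:
        return floor_value
    return inverse_four_pl(od, *params) * dilution_factor


# Hypothetical fitted parameters (a, b, c, d), for illustration only.
params = (0.05, 1.2, 50.0, 3.0)
od_at_50 = four_pl(50.0, *params)
recovered = inverse_four_pl(od_at_50, *params)
titer = titer_au_per_ml(od_at_50, params, dilution_factor=100,
                        od_floor=0.1, floor_value=0.0)
```

inverting the fitted curve analytically avoids a numerical root search, but it only works for od values strictly inside the calibrator range, which is why the appropriate dilution has to be chosen per sample.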
a total of different sars-cov- protein fragments on protein s and s and protein n, coupled via an n-terminal cysteine and maleimide conjugation to recombinant human serum albumin (rhsa) (albix, novozymes, bagsvaerd, denmark), together with short proteins from protein s and full-length protein s, protein n and rbd, were analyzed for immunogenicity on the direct elisa. protein details are illustrated in figure a . nunc™ maxisorp flat-bottom nonsterile -well plates were coated with µg/ml of the proteins in pbs overnight at °c. a total of rt-pcr positive samples and six negative controls were diluted : in dilution buffer and incubated as mentioned above. detection and development procedures were followed, as described in subsection . . statistical analyses were performed using graphpad prism version . statistical differences between disease severity and symptom onset groups were analyzed using the kruskal-wallis test (nonparametric one-way anova) with dunn's multiple comparison test. spearman rank correlation tests were used to determine the correlation between different experimental parameters. data are represented as the average of sample duplicates and the median. significance levels: * = p < . , ** = p < . , *** = p < . , **** = p < . ; p < . was considered statistically significant. all procedures involving the handling of human samples are in accordance with the principles described in the declaration of helsinki and were ethically approved by the regional ethical committee of the capital region of denmark (h- ) .
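the spearman rank correlation named in the statistical analysis above can be sketched in plain python as follows. the authors used graphpad prism; this block is only an illustration of the statistic itself (pearson correlation of tie-averaged ranks), it does not compute a p value, and the severity codes and titers are hypothetical.

```python
from statistics import mean


def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the current run of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman's rho: Pearson correlation computed on the ranks of x and y."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    if var_x == 0 or var_y == 0:
        raise ValueError("rho is undefined when one variable is constant")
    return cov / (var_x * var_y) ** 0.5


# Hypothetical data: severity coded 0-3 and antibody titers in a.u./ml.
severity = [0, 1, 1, 2, 3]
titers = [5.0, 20.0, 15.0, 40.0, 80.0]
rho = spearman_rho(severity, titers)
```

because only ranks enter the calculation, the result is unchanged by any monotonic transformation of the titers, which suits skewed antibody data.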
kruskal-wallis test was performed. p value < . was considered significant. * = p < . , ** = p < . , *** = p < . , **** = p < . .
tables
table . intra- and inter-assay variation and limit of detection for the s-elisa setup and the direct elisa setup.
spearman rank correlation analysis was performed. p value < . was considered significant. * = p < . , ** = p < . , *** = p < . , **** = p < . . a n = participants. b n = participants. c partial correlation adjusted for age and sex.
origin and evolution of pathogenic coronaviruses
sars - beginning to understand a new virus
identification of a novel coronavirus in patients with severe acute respiratory syndrome
middle east respiratory syndrome coronavirus (mers-cov): announcement of the coronavirus study group
sars and mers: recent insights into emerging coronaviruses
coronavirus disease (covid- ) pandemic
clinical features of patients infected with novel coronavirus in wuhan, china
estimates of the severity of coronavirus disease : a model-based analysis
covid- infection: origin, transmission, and characteristics of human coronaviruses
the covid- pandemic: biological evolution, treatment options and consequences
structural and functional basis of sars-cov- entry by using human ace
cryo-em structure of the -ncov spike in the prefusion conformation. science ( -. )
antibody responses to sars-cov- in patients of novel coronavirus disease
dissecting antibody-mediated protection against sars-cov-
the receptor binding domain of the viral spike protein is an immunodominant and highly specific target of antibodies in sars-cov- patients
an mrna vaccine against sars-cov- - preliminary report
the coronavirus nucleocapsid is a multifunctional protein
biochemical characterization of sars-cov- nucleocapsid protein
diagnostic accuracy of serological tests for covid- : systematic review and meta-analysis
seroprevalence of antibodies against sars-cov- among health care workers in a large spanish reference hospital
gender differences in patients with covid- : focus on severity and mortality
multiple functions of immunoglobulin a in mucosal defense against viruses: an in vitro measles virus model
dysregulation of immune response in patients with covid- in wuhan, china
detection of sars-cov- -specific humoral and cellular immunity in covid- convalescent individuals
immune phenotyping based on neutrophil-to-lymphocyte ratio and igg predicts disease severity and outcome for patients with covid-
duration of antibody responses after severe acute respiratory syndrome
robust t cell immunity in convalescent individuals with asymptomatic or mild covid-
chimeric proteins containing map- and functional domains of c b-binding protein reveal strong complement inhibitory capacities

the authors thank jytte bryde clausen and bettina eide holm for excellent technical assistance and josé juan almagro armenteros (university of copenhagen, copenhagen, denmark) for his statistical advice. this work was financially supported by the carlsberg foundation (cf - ) and the novo nordisk foundation ( a ). the authors have declared that no conflict of interest exists. the assays were developed in a non-commercial collaboration between rigshospitalet and novo nordisk a/s.
key: cord- -y a z e authors: bhattacharya, rajarshi; gupta, aayatti mallick; mitra, suranjita; mandal, sukhendu; biswas, swadesh r. title: a natural food preservative peptide nisin can interact with the sars-cov- spike protein receptor human ace date: - - journal: virology doi: . /j.virol. . . sha: doc_id: cord_uid: y a z e nisin, a food-grade antimicrobial peptide produced by lactic acid bacteria, has been examined for its probable interaction with the human ace (hace ) receptor, the site where the spike protein of sars-cov- binds. among the eight nisin variants examined, nisin h, nisin z, nisin u and nisin a showed a significant binding affinity towards hace , higher than that of the rbd (receptor-binding domain) of the sars-cov- spike protein. the molecular interaction of nisin with hace was investigated by homology modeling and docking studies. further, the binding efficiency of the most potent variant, nisin h, was evaluated through the interaction of the hace :nisin h complex with the rbd of sars-cov- and that of the hace :rbd complex with nisin h. here, nisin h acted as a potential competitor of rbd for access to the hace receptor. the study shows for the first time that a globally used food preservative, nisin, has the potential to bind to hace . the ongoing global outbreak of covid- , a severe life-threatening infectious respiratory disease caused by a recently discovered severe acute respiratory syndrome coronavirus (sars-cov- ), has drastically affected human life with over eighteen million cases of infection globally (https://coronavirus.jhu.edu/map.html). until now, no specific antiviral medication has been available for covid- , but extensive efforts are underway worldwide. although vaccines are thought to be the most powerful weapon to fight against virus invasion, they may take quite a long time to go from the lab to successful application in humans.
considering the acute crisis of the covid- pandemic, there is an urgent need to develop effective antiviral therapeutics for the prevention and treatment of covid- . it is well accepted that the spike protein on the outer surface of sars-cov- is a crucial recognition factor for its attachment and entry to the host cells (shang et al., ) . the viral infection in humans is initiated by binding of the rbd (receptor-binding domain) of the spike protein to the human angiotensin-converting enzyme (hace ) receptor (wang et al., ) . therefore, a therapeutic agent that blocks hace might prevent its interaction with the spike protein of sars-cov- and thereby could reduce the establishment of infection. although small non-proteinaceous molecules are commonly preferred as therapeutics, they are not effective in blocking protein-protein interactions (ppis), particularly where a deep binding pocket is missing at the interface (arkin et al., ) . in contrast, peptides are more suitable for disrupting ppis by specifically interacting with the interfaces. more importantly, small peptides have reduced immunogenicity (sorolla et al., ) . hence, peptides are potentially ideal candidates for application as novel therapeutics. the recently described peptides are all small, synthetic and costly, and have not produced promising results against sars-cov- (du et al., ) . the peptides recently designed computationally (han and král, ) against sars-cov- have to be synthesized prior to practical application; hence such peptides are neither natural nor food-grade. the present study attempts to investigate the ability of food-grade nisin a and its natural variants to block the interaction between hace and the spike protein of sars-cov- , a key step in the initiation of covid- . nisin, a pentacyclic antibacterial peptide with residues, is produced by certain strains of food-grade lactococcus lactis, widely used for cheese manufacturing (fox and wood, ; lubelski et al., ; juncioni et al., ) .
nisin belongs to a group of cationic peptide antimicrobials collectively called type a (i) lantibiotics (smith and hillman, ) . it was first identified in fermented milk cultures and is now used around the world as a natural and safe food preservative in a variety of food products, such as processed cheese, dairy desserts, milk, fermented beverages, meat and canned foods (hurst, ; fons et al., ; mitra et al., ) . it has been approved by the european union (e ), the world health organization (who) and the us food and drug administration (fda). currently, nisin is licensed in over countries (shin et al., ) . because of its high safety profile over the past years of usage and its strong antimicrobial action against a wide range of food spoilage and pathogenic bacteria, nisin has been extensively studied. it also has multiple applications in biomedicine, including bacterial infections, cancer and oral diseases, as well as in veterinary medicine and research (shin et al., ) . eight natural variants of nisin are now known, including nisin a itself: nisin a, z, f, q, h, u, u and p (garcia-gutierrez et al., ) . nisin z producing organisms are very common in nature (mitra et al., ; vos et al., ) . the structures of the eight variants of nisin were analyzed in the present study. all nisin peptides were aligned to show their identity and modeled on the swiss-model web server. hace and the rbd of sars-cov- were also modeled on the same platform to increase the acceptability of the structures. all the peptides and the rbd were docked with hace using the haddock server. the binding of the peptides was examined by docking analysis based on z-score, binding affinity and buried surface area. structurally, nisin is a unique molecule containing unusual amino acids including dehydroalanine and dehydrobutyrine, formed by dehydration of serine and threonine residues, respectively.
these two residues are stereo- and regio-specifically coupled to the thiol groups of cysteines to form lanthionine and β-methyl lanthionine, which are introduced enzymatically at the post-translational level (cotter et al., ) . nisin is thus a thioether-bridged pentacyclic peptide. no crystal structure of nisin has been determined. the peptide adopts different conformations depending on the environment. the structure of nisin cannot be described in terms of regular secondary-structure elements, due to the presence of the ring systems in which % of the residues are incorporated. however, an nmr structure is available in the pdb database, which was used in this study as the template to generate the model structures of the nisin variants. the nmr structure of nisin revealed two structured domains: an n-terminal domain (residues - ) containing three lanthionine rings, a, b and c; and a c-terminal domain (residues - ) containing two intertwined lanthionine rings, d and e (hilbers, ) . these domains are flanked by regions showing structural flexibility. the four-residue rings b, d and e of nisin all show a β-turn structure, which is closed by the thioether linkage. the backbones of rings b and d form type i β-turns. the c-terminal domain consists of three consecutive β-turns. the nmr data help to locate the residues in nisin that interact with hace . the present study attempts to evaluate the potential of nisin variants to interact with hace by predicting the nisin binding site through nisin-hace docking computations with the nmr structure of nisin in the pdb database. this is the first report on the potential of the widely used food-grade antibacterial peptide nisin to bind hace , and it predicts the possibility of nisin as a therapeutic against covid- . the work is significant in the search for a way to prevent infection by the novel coronavirus sars-cov- . amino acid sequences of eight nisin variants: nisin z ( (park et al., ) .
espript software (robert and gouet, ) was used to represent the msa using the blosum scoring matrix. homology models of all nisin variants were built using the swiss-model web server (waterhouse et al., ) with nisin z (smtl id: wco. ) as template. the stereochemical quality of each model was evaluated by ramachandran plot using the volume, area, dihedral angle reporter (vadar) server (willard et al., ) (fig. s ) . similarly, the rbd (receptor-binding domain) of the spike protein of sars-cov- and the hace receptor were modeled using smtl id: lzg. and smtl id: m . , respectively. all the models of nisin variants were superimposed to determine their structural differences using read scoring matrix in pymol software (the pymol molecular graphics system). molecular docking was performed to test the binding affinity of all nisin variants towards hace . in order to compare binding strengths, multi-body docking was done between the hace :nisin h complex and the rbd of sars-cov- , and between the hace :rbd complex and nisin h. the solvated docking software haddock (melquiond et al., ) was used without defining any restraints for this study. the most reliable model was selected by the lowest haddock score. the score is calculated as a weighted sum of energy terms, where evdw is the intermolecular van der waals energy, eelec the intermolecular electrostatic energy and edesol an empirical desolvation energy. active site residues of hace (k , e , d , m ) responsible for binding the spike rbd were selected for docking. the residues surrounding the active loci were considered passive. the interacting residues were visualized using discovery studio (systèmes, ). the prodigy@bonvin lab web server (xue et al., ) was used to calculate Δg to predict the affinity of nisin h for hace at °c, with other parameters left at their defaults. the grand average of hydropathy (gravy) score of hace was calculated with the expasy protparam web server (gasteiger et al., ). in the multiple sequence alignment (fig.
) of amino acid residues of eight nisin variants (nisin a, z, q, h, p, u, u and f), nisin z shared . % amino acid sequence similarity with nisin h, whereas nisin p, u, u , q and f shared only . %, . %, . %, . % and . %, respectively, with nisin h (table s ). nisin a was found to be closely related to nisin z ( . % identity), with only a single amino acid difference (his asn). in contrast, nisin h differs from nisin a by five amino acids at positions , , , and , with . % identity. nisin p is shorter than nisin h ( residues) by three residues at the c-terminus. nisin h differs from nisin f by residues: f i, m l, t g, y m, h n, i v and k h. nisin q differs from nisin h by the presence of isoleucine, leucine, valine, glycine, leucine, asparagine, valine and histidine at positions , , , , , , and , respectively. nisin u and u differed from nisin h by ten amino acids. the residue surface accessibility is shown at the bottom of the alignment (fig. ). the model structures of all nisin variants, hace , and the rbd of the spike protein built with the swiss-model web server were validated for stereochemical properties using ramachandran plots (fig. s ). we counted the amino acids in the disallowed regions, excluding glycine and proline because glycine is achiral and proline carries an imino group. the homology models of nisin p and u had no residues in disallowed regions. nisin h and u had only one residue in a disallowed region, whereas two residues were found in disallowed regions for nisin a, f, q and z. the rmsd (c-alpha) over all the superimposed nisin variants was found to be . . this indicates that all the nisin models were structurally similar to one another. the binding efficiency of the nisins with hace was further evaluated by docking. the best haddock model of each nisin variant in complex with hace was analyzed for three parameters, viz. z-score, buried surface area, and binding affinity.
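the ranking step described above can be sketched in a few lines of python. this is only an illustration, not the authors' pipeline: the weights (1.0, 0.2, 1.0) are assumed from haddock's documented defaults for the final water-refinement stage, the restraint term is left out because the text says no restraints were defined, and the use of the population standard deviation for the cluster z-score is likewise an assumption. the function names and the cluster scores in the usage note are hypothetical.

```python
from statistics import mean, pstdev

# Assumed weights (HADDOCK defaults for the water-refinement stage);
# the source text does not state them.
W_VDW, W_ELEC, W_DESOL = 1.0, 0.2, 1.0


def haddock_score(e_vdw, e_elec, e_desol):
    """Weighted sum of the three energy terms named in the methods."""
    return W_VDW * e_vdw + W_ELEC * e_elec + W_DESOL * e_desol


def cluster_z_scores(cluster_scores):
    """Z-score of each cluster: (score - mean) / standard deviation.

    `cluster_scores` maps a cluster name to its average score.
    More negative z-scores indicate better-ranked clusters.
    """
    values = list(cluster_scores.values())
    mu = mean(values)
    sigma = pstdev(values)  # population standard deviation (assumption)
    if sigma == 0:
        raise ValueError("z-scores are undefined when all clusters score equally")
    return {name: (score - mu) / sigma for name, score in cluster_scores.items()}
```

for example, `cluster_z_scores({"c1": -100.0, "c2": -80.0, "c3": -60.0})` assigns the most negative z-score to the hypothetical cluster `c1`, which would be selected as the most reliable.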
the z-score indicates how many standard deviations a cluster lies from the average of all clusters in terms of score (the more negative the better). the z-scores of hace -sars-cov- rbd, hace -nisin a, hace -nisin z, hace -nisin h, hace -nisin q, hace -nisin u, hace -nisin u , hace -nisin f, and hace -nisin p were predicted as − . , − . , − . , − . , − . , − . , − . , − . , and − . , respectively. hence, the z-scores of nisin h and nisin z were lower than those of the other nisin variants and of the rbd of the spike protein. the buried surface areas of nisin z and nisin h with hace were higher, . Å² and . Å², respectively, in contrast to Å² for the rbd. this suggests that nisin h and nisin z had better binding efficiency for hace . the binding affinity of the docked structures of all eight nisin variants in complex with hace was calculated as Δg with prodigy for each complex and compared with that of the rbd of the spike protein of sars-cov- . the Δg values of hace -sars-cov- , hace -nisin a, hace -nisin z, hace -nisin h, hace -nisin q, hace -nisin u, hace -nisin u , hace -nisin f, and hace -nisin p were − kcal/mol, − . kcal/mol, − . kcal/mol, − . kcal/mol, − . kcal/mol, − . kcal/mol, − . kcal/mol, − . kcal/mol, and − . kcal/mol, respectively. thus the Δg values of hace -nisin z and hace -nisin h are larger in magnitude than that of hace -rbd, indicating stronger predicted binding affinity. the gravy scores of nisin a, z, h, q, u, u , f, p and rbd-sars-cov- were calculated as . , . , . , . , . , . , . , . , and − . , respectively (table ). according to the gravy scores, nisin h is more hydrophilic than nisin a and nisin z and is thus predicted to interact more potently with the hydrophobic groove of hace than the other nisin variants. the docking analysis indicates that nisin z and nisin h interact with hace more efficiently. the interacting residues and atoms are given in table s .
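to make the Δg comparison above more tangible, a predicted binding free energy can be converted into a dissociation constant through the standard relation kd = exp(Δg / rt). the sketch below is a minimal illustration of that relation; the default temperature of 25 °c is an assumption (the temperature used in the study is not legible in the text), and the function names are hypothetical rather than part of prodigy.

```python
import math

R_KCAL = 0.0019872  # gas constant in kcal/(mol*K)


def kd_from_delta_g(delta_g_kcal_per_mol, temp_celsius=25.0):
    """Dissociation constant in mol/L from a binding free energy.

    Uses Kd = exp(dG / (R * T)); a more negative dG gives a smaller Kd,
    i.e. tighter binding. The 25 degC default is an assumption.
    """
    rt = R_KCAL * (temp_celsius + 273.15)
    return math.exp(delta_g_kcal_per_mol / rt)


def delta_g_from_kd(kd_molar, temp_celsius=25.0):
    """Inverse relation: dG = R * T * ln(Kd), in kcal/mol."""
    rt = R_KCAL * (temp_celsius + 273.15)
    return rt * math.log(kd_molar)
```

with these definitions, an illustrative Δg of −10 kcal/mol corresponds to a kd in the tens-of-nanomolar range at 25 °c, and each additional −1.36 kcal/mol or so tightens the predicted kd roughly tenfold.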
hydrogen bonds (k :c , k :t , k : k , e :k , e :c , e :n , d :k , d :c , m :c , k : n ) and hydrophobic interactions (m :i , m :c , k :c , y :c , k : c ) are the major binding forces in the hace -nisin z interaction. the interacting residues of nisin z were predicted as i , c , k , t , c , n , k , and c . all interacting residues of nisin z were hydrophilic in nature. the residues in nisin h that interact with hace form hydrogen bonds t :k , c :k , k :k , t :k , p :k , k : e , k :d , n :e , c :d , h :d , c :k , t :k (fig. ) and hydrophobic interactions c :k , y :k , c :m , a : k , c :k , c :k . among all these interacting residues, t , p , c , k , t , c , k , c were highly conserved among all the nisin variants. as with the rbd, the surface-accessible hydrophilic residues t , p , c , k , t , k , and c were found to be involved in binding to the hydrophobic groove of hace . nisin z and nisin h recognized five common residues (k , e , d , m , k ) in hace that were also recognized by the rbd of spike. the binding efficiency of the preformed hace :nisin h complex was assessed by competitive three-body docking with the rbd of sars-cov- (fig. ). because nisin h had already occupied the active site residues of hace through strong hydrogen bonds and hydrophobic interactions (table s ), the rbd of sars-cov- could not access the active residues of hace with reasonable efficiency, as this would require overcoming the binding strength of the hace :nisin h interaction, with a Δg of − . kcal/mol (the binding affinity of the rbd:hace complex is − kcal/mol). conversely, when nisin h was allowed to interact with the hace :rbd complex, nisin h was able to interact with active residues (k and m ) of hace in the hace :rbd complex (table s ). nisin h, being the more potent candidate, could therefore interfere with the rbd-hace interaction.
there is a high possibility that nisin would be able to competitively displace bound sars-cov- because of its higher predicted binding affinity towards the ace receptor compared to that of the virus. furthermore, nisin is a non-synthetic molecule of small size, which is expected to favor high bioavailability. based on this study, we hypothesize that nisin h, z, a and u could be eligible competitors of the rbd of sars-cov- because they share the same binding patch in hace . recently, several peptides computationally designed to target the spike protein of sars-cov- have been reported (han and král, ; baig et al., ) as a strategy to prevent its interaction with the ace receptor and thereby tackle covid- infection. from an application perspective, nisin would have several advantages as a treatment option over the reported designed peptides, including its natural occurrence, food-grade status, extreme stability, ease of manufacturing through microbial fermentation, cost effectiveness, and delivery at high concentration. however, further experimental validation is required to confirm nisin binding to hace . among all analyzed nisin variants, nisin z, nisin a, nisin u and nisin h were most effective in interacting with the human endothelial cell surface receptor hace , the site where the rbd of the spike of sars-cov- binds to initiate infection. compared to the rbd of the viral spike protein, nisin is predicted to bind the hace receptor with higher affinity. because nisin is a low molecular weight peptide and readily bioavailable in the system, its binding to hace is expected to outcompete the interaction of the rbd of the spike of sars-cov- with hace and could essentially exclude virus entry into the host cell.
since nisin is a heat stable natural food grade peptide, can be produced cost effectively, even in large quantity through microbial fermentation, the present work will create greater interest among researchers to develop a new nisin-based treatment strategy for covid- , either through oral or nasal applications. however, further experimental validation is necessary to determine its doses and mechanistic application to check the competition of nisin and spike protein of sars-cov- for accessing the human. the authors declare that they have no conflict of interest. small-molecule inhibitors of protein-protein interactions: progressing toward the reality identification of a potential peptide inhibitor of sars-cov- targeting its entry into the host cells bacteriocins -a viable alternative to antibiotics? molecular modeling and chemical modification for finding peptide inhibitor against severe acute respiratory syndrome coronavirus main proteinase microbial ecology in health and disease mechanisms of colonisation and colonisation resistance of the digestive tract part : bacteria/bacteria interactions mechanisms of colonisation and colonisation resistance of the digestive tract first evidence of production of the lantibiotic nisin protein identification and analysis tools on the expasy server computational design of ace -based peptide inhibitors of sars-cov- surface location and orientation of the lantibiotic nisin bound to membrane-mimicking micelles of dodecylphosphocholine and of sodium dodecylsulphate introduction, i. a. hurst nisin biotechnological production and application: a review biosynthesis, immunity, regulation, mode of action and engineering of the model lantibiotic nisin the haddock . 
web server: user-friendly integrative modeling of biomolecular complexes potential application of the nisin z preparation of lactococcus lactis w in preservation of milk the embl-ebi search and sequence analysis tools apis deciphering key features in protein structures with the new endscript server structural basis of receptor recognition by sars-cov- biomedical applications of nisin therapeutic potential of type a ( i ) lantibiotics , a group of cationic peptide antibiotics precision medicine by designer interference peptides: applications in oncology and molecular therapeutics dassault syst mes biovia properties of nisin z and distribution of its gene , nisz , in lactococcus lactis structural and functional basis of sars-cov- entry by using human ace swiss-model: homology modelling of protein structures and complexes vadar: a web server for quantitative evaluation of protein structure quality structural bioinformatics prodigy: a web server for predicting the binding affinity of protein-protein complexes we most sincerely acknowledge the research grant (bt/pr / get/ / / ) received from dbt, department of biotechnology, govt. of india, new delhi. supplementary data to this article can be found online at https://doi. org/ . /j.virol. . . . rb curated, analyzed and interpreted the data. amg helped in docking studies. s mitra, s mandal and srb supervised the work. all the authors write, review and edited the manuscript. 
key: cord- -wivk bm authors: schoof, michael; faust, bryan; saunders, reuben a.; sangwan, smriti; rezelj, veronica; hoppe, nick; boone, morgane; billesbølle, christian b.; puchades, cristina; azumaya, caleigh m.; kratochvil, huong t.; zimanyi, marcell; deshpande, ishan; liang, jiahao; dickinson, sasha; nguyen, henry c.; chio, cynthia m.; merz, gregory e.; thompson, michael c.; diwanji, devan; schaefer, kaitlin; anand, aditya a.; dobzinski, niv; zha, beth shoshana; simoneau, camille r.; leon, kristoffer; white, kris m.; chio, un seng; gupta, meghna; jin, mingliang; li, fei; liu, yanxin; zhang, kaihua; bulkley, david; sun, ming; smith, amber m.; rizo, alexandrea n.; moss, frank; brilot, axel f.; pourmal, sergei; trenker, raphael; pospiech, thomas; gupta, sayan; barsi-rhyne, benjamin; belyy, vladislav; barile-hill, andrew w.; nock, silke; liu, yuwei; krogan, nevan j.; ralston, corie y.; swaney, danielle l.; garcía-sastre, adolfo; ott, melanie; vignuzzi, marco; walter, peter; manglik, aashish title: an ultra-potent synthetic nanobody neutralizes sars-cov- by locking spike into an inactive conformation date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: wivk bm without an effective prophylactic solution, infections from sars-cov- continue to rise worldwide with devastating health and economic costs. sars-cov- gains entry into host cells via an interaction between its spike protein and the host cell receptor angiotensin converting enzyme (ace ). disruption of this interaction confers potent neutralization of viral entry, providing an avenue for vaccine design and for therapeutic antibodies. here, we develop single-domain antibodies (nanobodies) that potently disrupt the interaction between the sars-cov- spike and ace . by screening a yeast surface-displayed library of synthetic nanobody sequences, we identified a panel of nanobodies that bind to multiple epitopes on spike and block ace interaction via two distinct mechanisms. 
cryogenic electron microscopy (cryo-em) revealed that one exceptionally stable nanobody, nb , binds spike in a fully inactive conformation with its receptor binding domains (rbds) locked into their inaccessible down-state, incapable of binding ace . affinity maturation and structure-guided design of multivalency yielded a trivalent nanobody, mnb -tri, with femtomolar affinity for sars-cov- spike and picomolar neutralization of sars-cov- infection. mnb -tri retains stability and function after aerosolization, lyophilization, and heat treatment. these properties may enable aerosol-mediated delivery of this potent neutralizer directly to the airway epithelia, promising to yield a widely deployable, patient-friendly prophylactic and/or early infection therapeutic agent to stem the worst pandemic in a century. monoclonal antibodies disclosed to date. our lead neutralizing molecule, mnb -tri, blocks sars-cov- entry in human cells at picomolar efficacy and withstands aerosolization, lyophilization, and elevated temperatures. mnb -tri provides a promising approach to deliver a potent sars-cov- neutralizing molecule directly to the airways for prophylaxis or therapy. synthetic nanobodies that disrupt spike-ace interaction to isolate nanobodies that neutralize sars-cov- , we screened a yeast surface-displayed library of > x synthetic nanobody sequences. our strategy was to screen for binders to the full spike protein ectodomain, in order to capture not only those nanobodies that would compete by binding to the ace -binding site on the rbd directly but also those that might bind elsewhere on spike and block ace interaction through indirect mechanisms. we used a mutant form of sars-cov- spike (spike*,) as the antigen ( ). spike* lacks one of the two activating proteolytic cleavage sites between the s and s domains and introduces two mutations to stabilize the pre-fusion conformation. spike* expressed in mammalian cells binds ace with a kd = nm ( supplementary fig. 
) , consistent with previous reports ( ). next, we labeled spike* with biotin or with fluorescent dyes and selected nanobody-displaying yeast over multiple rounds, first by magnetic bead binding and then by fluorescence-activated cell sorting (fig. a) . three rounds of selection yielded unique nanobodies that bound spike* and showed decreased spike* binding in the presence of ace . closer inspection of their binding properties revealed that these nanobodies fall into two distinct classes. one group (class i) binds the rbd and competes with ace (fig. b) . a prototypical example of this class is nanobody nb , which binds to spike* and to rbd alone with a kd of nm and nm, respectively ( fig. c ; table ). another group (class ii), exemplified by nanobody nb , binds to spike* (kd = nm), but displays no binding to rbd alone (fig. c, table ). in the presence of excess ace , binding of nb and other class i nanobodies is blocked entirely, whereas binding of nb and other class ii nanobodies is decreased only moderately (fig. b) . these results suggest that class i nanobodies target the rbd to block ace binding, whereas class ii nanobodies target other epitopes and decrease ace interaction with spike allosterically or through steric interference. indeed, surface plasmon resonance (spr) experiments demonstrate that class i and class ii nanobodies can bind spike* simultaneously (fig. d) . analysis of the kinetic rate constants for class i nanobodies revealed a consistently greater association rate constant (ka) for nanobody binding to the isolated rbd than to full-length spike* (table ) , which suggests that rbd accessibility influences the kd. we next tested the efficacy of our nanobodies, both class i and class ii, to inhibit binding of fluorescently labeled spike* to ace -expressing hek cells (table , fig. e ). 
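the link drawn above between the association rate constant and affinity follows from the definition of the equilibrium dissociation constant as the ratio of the dissociation and association rate constants. the sketch below spells this out; the function name and the rate constants in the usage note are hypothetical and are not values from the study.

```python
def equilibrium_kd(k_on, k_off):
    """Equilibrium dissociation constant KD in mol/L.

    k_on  : association rate constant, 1/(M*s)
    k_off : dissociation rate constant, 1/s
    """
    if k_on <= 0:
        raise ValueError("k_on must be positive")
    return k_off / k_on
```

for instance, with an identical `k_off` of 1e-3 per second, a nanobody that associates five times faster with the isolated rbd (`k_on` of 1e6 versus 2e5 per molar per second) has a fivefold lower kd, which is how restricted rbd accessibility in full-length spike* can weaken the measured affinity without changing the off-rate.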
class i nanobodies emerged with highly variable activity in this assay with nb and nb as two of the most potent clones with ic values of and nm, respectively (table ) to define the binding sites of nb and nb , we determined their cryogenic electron microscopy (cryo-em) structures bound to spike* ( fig. a state rbds only contacts a single rbd (fig. d) . nb interacts with the spike s domain external to the rbd our attempts to determine the binding site of nb by cryo-em proved unsuccessful. we therefore turned to radiolytic hydroxyl radical footprinting to determine potential binding sites for nb . spike*, either apo or bound to nb , was exposed to - milliseconds of synchrotron x-ray radiation to label solvent-exposed amino acids with hydroxyl radicals. radical-labeled amino acids were subsequently identified and quantified by mass spectrometry of trypsin/lys-c or glu- c protease digested spike*( ). two neighboring surface residues on the s domain of spike (m and h ) emerged as highly protected sites in the presence of nb we assessed multivalent nb binding to spike* by spr. both bivalent nb with a amino acid linker (nb -bi) and trivalent nb with two amino acid linkers (nb -tri) dissociate from spike* in a biphasic manner. the dissociation phase can be fitted to two components: a fast phase with kinetic rate constants kd of . x - s - for nb -bi and . x - s - for nb -tri, which are of the same magnitude as that observed for monovalent nb (kd = . x - s - ) and a slow phase that is dependent on avidity (kd = . x - for nb -bi and kd < . x - s - for nb -tri, respectively) ( fig. a) . the relatively similar kd for the fast phase suggests that a fraction of the observed binding for the multivalent constructs is nanobody binding to a single spike* rbd. by contrast, the slow dissociation phase of nb -bi and nb -tri indicates engagement of two or three rbds. 
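the biphasic dissociation described above is commonly modeled as the sum of two exponential decays, one for the fast (monovalent-like) population and one for the slow (avidity-dependent) population. the following is a minimal sketch of such a model, not the fitting procedure used by the authors; the function and parameter names are hypothetical.

```python
import math


def biphasic_dissociation(t, r0, frac_fast, k_fast, k_slow):
    """Response remaining at time t (seconds) during dissociation.

    r0        : response at the start of the dissociation phase
    frac_fast : fraction of the signal in the fast phase (0 to 1)
    k_fast    : dissociation rate constant of the fast phase, 1/s
    k_slow    : dissociation rate constant of the slow phase, 1/s
    """
    if not 0.0 <= frac_fast <= 1.0:
        raise ValueError("frac_fast must lie between 0 and 1")
    fast = frac_fast * math.exp(-k_fast * t)
    slow = (1.0 - frac_fast) * math.exp(-k_slow * t)
    return r0 * (fast + slow)
```

in this picture, a multivalent construct whose slow phase does not measurably dissociate behaves as if `k_slow` were close to zero, so the signal plateaus at `r0 * (1 - frac_fast)` once the fast phase has decayed.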
we observed no dissociation for the slow phase of nb -tri over minutes, indicating an upper boundary for kd of x - s - and subpicomolar affinity. this measurement remains an upper- bound estimate rather than an accurate measurement because the technique is limited by the intrinsic dissociation rate of spike* from the chip imposed by the chemistry used to immobilize spike*. we reasoned that the biphasic dissociation behavior could be explained by a slow interconversion between up-and down-state rbds, with conversion to the more stable down- state required for full trivalent binding. according to this view, a single domain of nb -tri engaged with an up-state rbd would dissociate rapidly. the system would then re-equilibrate as the rbd flips into the down-state, eventually allowing nb -tri to trap all rbds in closed spike*. to test this notion directly, we varied the time allowed for nb -tri binding to spike*. indeed, we observed an exponential decrease in the percent fast-phase with a t / of s ( table ). nb -tri shows a -fold enhancement of inhibitory activity, with an ic of . nm, whereas trimerization of nb and nb resulted in more modest gains of - and -fold ( nm and nm), respectively (fig. c) . we next confirmed these neutralization activities with a viral plaque assay using live sars- nb -tri proved exceptionally potent, neutralizing sars-cov- with an average ic of pm (fig. d ). nb -tri neutralized sars-cov- with an average ic of nm (fig. d) . we further optimized the potency of nb by selecting high-affinity variants. to this end, we prepared a new library, starting with the nb coding sequence, in which we varied each amino acid position of all three cdrs by saturation mutagenesis (fig. a) . after two rounds of magnetic bead-based selection, we isolated a population of high-affinity clones. sequencing revealed two highly penetrant mutations: i y in cdr and p y in cdr . 
we incorporated these two mutations into nb to generate matured nb (mnb ), which binds with -fold increased affinity to spike* as measured by spr (fig. b) . as a monomer, mnb inhibits both pseudovirus and live sars-cov- infection with low nanomolar potency, a ~ -fold improvement compared to nb ( fig. i -j, table ). a . Å cryo-em structure of mnb bound to spike* shows that, like the parent nanobody nb , mnb binds to closed spike (fig. c, supplementary fig. ) . the higher resolution map allowed us to build a model with high confidence and determine the effects of the i y and p y substitutions. mnb induces a slight rearrangement of the down-state rbds as compared to both previously determined structures of apo-spike* and spike* bound to nb , inducing a ° rotation of the rbd away from the central three-fold symmetry axis (fig. h) ( , ) . this deviation likely arises from a different interaction between cdr and spike*, which nudges the rbds into a new resting position. while the i y substitution optimizes local contacts between cdr in its original binding site on the rbd, the p y substitution leads to a marked rearrangement of cdr in mnb (fig. f-g) . this conformational change yields a different set of contacts between mnb cdr and the adjacent rbd (fig. d) . remarkably, an x-ray crystal structure of mnb alone revealed dramatic conformational differences in cdr and cdr between free and spike*-bound mnb , suggestive of significant conformational heterogeneity for the unbound nanobodies and induced-fit rearrangements upon binding to spike* (fig. e) . the binding orientation of mnb is similar to that of nb , supporting the notion that our multivalent design would likewise enhance binding affinity. unlike nb -tri, trivalent mnb (mnb -tri) bound to spike with no observable fast-phase dissociation and no measurable dissociation over ten minutes, yielding an upper bound for the dissociation rate constant kd of . x - s - (t / > days) and a kd of < pm (fig. b) . 
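the half-life quoted alongside the dissociation rate constant follows from first-order kinetics, where t1/2 = ln 2 / k_off. because only an upper bound on k_off is available when no dissociation is observed, the result is a lower bound on the half-life. the sketch below illustrates the conversion; the function names and the example rate constant are hypothetical.

```python
import math

SECONDS_PER_DAY = 86400.0


def half_life_seconds(k_off):
    """Half-life of a complex with first-order dissociation rate k_off (1/s)."""
    if k_off <= 0:
        raise ValueError("k_off must be positive")
    return math.log(2) / k_off


def half_life_days(k_off):
    """Same half-life expressed in days."""
    return half_life_seconds(k_off) / SECONDS_PER_DAY
```

as an illustration, an upper bound of 1e-6 per second on `k_off` implies a half-life of at least about eight days.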
as above, more precise measurements of the dissociation rate are precluded by the surface chemistry used to immobilize spike*. mnb -tri displays further gains in potency in both pseudovirus and live sars-cov- infection assays with ic values of pm ( . ng/ml) and pm ( . ng/ml), respectively (fig. h-i, table ). given the sub-picomolar affinity observed by spr, it is likely that these viral neutralization potencies reflect the lower limit of the assays. mnb -tri is therefore an exceptionally potent sars-cov- neutralizing antibody, among the most potent molecules disclosed to date. nb , nb -tri, mnb , and mnb -tri are robust proteins one of the most attractive properties that distinguishes nanobodies from traditional monoclonal antibodies is their extreme stability ( ). we therefore tested nb , nb -tri, mnb , and mnb -tri for stability regarding temperature, lyophilization, and aerosolization. temperature denaturation experiments using circular dichroism measurements to assess protein unfolding revealed melting temperatures of . , . , . , and . °c for nb , nb -tri, mnb and mnb -tri, respectively ( fig a) . aerosolization and prolonged heating of nb , mnb , and mnb -tri for hour at °c induced no loss of activity (fig b) . moreover, mnb and mnb -tri were stable to lyophilization and to aerosolization using a mesh nebulizer, showing no aggregation by size exclusion chromatography and preserved high affinity binding to spike* (fig. c-d) . there is a pressing need for prophylactics and therapeutics against sars-cov- infection. most recent strategies to prevent sars-cov- entry into the host cell aim at blocking the ace -rbd interaction. high-affinity monoclonal antibodies, many identified from convalescent patients, are leading the way as potential therapeutics ( - ). while highly effective in vitro, these agents are expensive to produce by mammalian cell expression and need to be intravenously administered by healthcare professionals ( ). 
moreover, large doses are likely to be required for prophylactic viral neutralization, as only a small fraction of systemically circulating antibodies cross the epithelial cell layers that line the airways ( ). by contrast, single domain antibodies (nanobodies) provide significant advantages in terms of production and deliverability. they can be inexpensively produced at scale in bacteria (e. coli) or yeast (p. pastoris). furthermore, their inherent stability enables aerosolized delivery directly to the nasal and lung epithelia by self-administered inhalation ( ). monomeric mnb is among the most potent single domain antibodies neutralizing sars-cov- discovered to date. multimerization of single domain antibodies has been shown to improve target affinity by avidity ( , ) . in the case of nb and mnb , however, our design strategy enabled a multimeric construct that simultaneously engages all three rbds, yielding profound gains in potency. furthermore, because rbds must be in the up-state to engage with ace , conformational control of rbd accessibility can serve as an added neutralization mechanism. indeed, our nb -tri and mnb -tri molecules were designed with this functionality in mind. sars-cov- seroconversion in humans: a detailed protocol for a serological assay, antigen production, and test setup trimeric sars-cov- spike interacts with dimeric ace with limited intra- spike avidity. 
biorxiv yeast surface display platform for rapid discovery of conformationally selective nanobodies automated electron microscope tomography using robust prediction of specimen movements motioncor : anisotropic correction of beam-induced motion for improved cryo-electron microscopy cryosparc: algorithms for rapid unsupervised cryo-em structure determination new tools for automated high-resolution cryo-em structure determination in relion- grigorieff, cistem, user-friendly software for single-particle image processing structure of the sars-cov- spike receptor-binding domain bound to the ace receptor structure of a nanobody-stabilized active state of the β( ) adrenoceptor rosettaes: a sampling strategy enabling automated interpretation of difficult cryo-em maps coot: model-building tools for molecular graphics isolde: a physically realistic environment for model building into low- resolution electron-density maps phenix: a comprehensive python-based system for macromolecular structure solution allosteric nanobodies reveal the dynamic range and diverse mechanisms of g-protein-coupled receptor activation the beamline x c of the center for synchrotron biosciences: a national resource for biomolecular structure and dynamics experiments using synchrotron footprinting fast quantitative analysis of timstof pasef data with msfragger and ionquant msstats: an r package for statistical analysis of quantitative mass spectrometry-based proteomic experiments automatic processing of rotation diffraction data from crystals of initially unknown symmetry and cell constants phaser crystallographic software buster version . . . . cambridge, united kingdom: global phasing ltd figure . cryo-em structures of nb and nb bound to spike. a, cryo-em maps of spike*- nb complex in either closed (left) or open (right) spike* conformation. b, cryo-em maps of spike*-nb complex in either closed (left) or open (right) spike* conformation. 
the top views show receptor binding domain (rbd) up-or down-states. c, nb straddles the interface of two down-state rbds, with cdr reaching over to an adjacent rbd. d, nb binds a single rbd in the down-state (displayed) or similarly in the up-state nb in either rbd up-or down-state. e, comparison of rbd epitopes engaged by ace (purple), nb (red), or nb (green) multivalency improves nanobody affinity and inhibitory efficacy. a, spr of nb and multivalent variants. red traces show raw data and black lines show global kinetic fit for nb and independent fits for association and dissociation phases for nb -bi and nb -tri dissociation phase spr traces for nb -tri after variable association time ranging from curves were normalized to maximal signal at the beginning of the dissociation phase. percent fast phase is plotted as a function of association time (right) with a single exponential fit. n = independent biological replicates. c, inhibition of pseudotyped lentivirus infection of ace expressing hek t cells. n = biological replicates for all but nb -tri (n = ) d, inhibition of live sars-cov- virus. representative biological replicate with n = (right panel) or (left panel) technical replicates per concentration. n = biological replicates for all but nb and nb - tri (n = ) dissociation was observed for mnb -tri over minutes. c, cryo-em structure of spike*-mnb comparison of receptor binding domain (rbd) engagement by nb and mnb demonstrating changes in nb and mnb position and the adjacent rbd. e, comparison of mnb complementarity determining regions in either the cryo-em structure of the spike*-mnb complex or an x-ray crystal structure of mnb alone. f, cdr of nb and mnb binding to the rbd. as compared to i in nb nb and mnb binding to the rbd demonstrating a large conformational rearrangement of the entire loop in mnb . h, comparison of closed spike* bound to mnb and rotational axis for rbd movement is highlighted. 
i, inhibition of pseudotyped lentivirus infection of ace expressing hek t cells by mnb and mnb -tri. n = biological replicates. j, mnb and mnb -tri inhibit sars-cov- infection of veroe cells in a plaque assay. representative biological replicate with n = technical replicates per concentration. n = biological replicates for all samples. average values from n = biological replicates for nb , nb , and nb -tri are presented. c, average values from n = biological replicates for nb , nb -bi, and nb -tri; n = biological replicates for all others. nb, no binding; nc, no competition; np, not performed.
we thank the entire walter and manglik labs for facilitating the development and rapid execution of this large-scale collaborative effort. we thank sebastian bernales and tony de fougerolles for advice and helpful discussion, and jonathan weissman for input into the project and reagent and machine use. we thank jim wells for providing the ace ecd-fc construct, jason mclellan for providing spike, rbd, and ace constructs, and florian krammer for providing an rbd construct. we thank jesse bloom for providing the ace expressing hek t
[table: binding kinetics and inhibition data for class i and class ii nanobodies, ace , and mnb ; the numeric entries did not survive extraction]
key: cord- -spiuqngp authors: huang, yuan; yang, chan; xu, xin-feng; xu, wei; liu, shu-wen title: structural and functional properties of sars-cov- spike protein: potential antivirus drug development for covid- date: - - journal: acta pharmacol sin doi: . /s - - - sha: doc_id: cord_uid: spiuqngp coronavirus disease is a newly emerging infectious disease currently spreading across the world. it is caused by a novel coronavirus, severe acute respiratory syndrome coronavirus (sars-cov- ). the spike (s) protein of sars-cov- , which plays a key role in receptor recognition and the cell membrane fusion process, is composed of two subunits, s and s . the s subunit contains a receptor-binding domain that recognizes and binds to the host receptor angiotensin-converting enzyme , while the s subunit mediates viral cell membrane fusion by forming a six-helical bundle via the two-heptad repeat domain. in this review, we highlight recent research advances in the structure, function and development of antivirus drugs targeting the s protein. the epidemic of novel coronavirus disease (covid- ), caused by a new coronavirus, emerged in december and has since spread worldwide and turned into a global pandemic [ ]. covid- was quickly discovered to be caused by a coronavirus later named severe acute respiratory syndrome coronavirus (sars-cov- ) [ ], which belongs to the β coronavirus family. it is the seventh known coronavirus to infect humans; four of these coronaviruses ( e, nl , oc , and hku ) cause only the mild symptoms of the common cold. conversely, the other three, sars-cov, mers-cov, and sars-cov- , are able to cause severe symptoms and even death, with fatality rates of %, %, and %, respectively. although a large number of studies and clinical trials on covid- are being launched around the world [ , ], no evidence from randomized clinical trials has shown that any potential therapy improves outcomes in patients [ ].
as the epidemic spreads, it is critical to find a specific therapeutic for covid- , and vaccines targeting various sars-cov- proteins are under development. sars-cov- is a single-stranded rna-enveloped virus [ ]. an rna-based metagenomic next-generation sequencing approach has been applied to characterize its entire genome, which is , bp in length (genbank no. mn ), encoding amino acids [ ]. gene fragments express structural and nonstructural proteins. the s, e, m, and n genes encode structural proteins, whereas nonstructural proteins, such as -chymotrypsin-like protease, papain-like protease, and rna-dependent rna polymerase, are encoded by the orf region [ ]. a large number of glycosylated s proteins cover the surface of sars-cov- and bind to the host cell receptor angiotensin-converting enzyme (ace ), mediating viral cell entry [ ]. when the s protein binds to the receptor, tm protease serine (tmprss ), a type tm serine protease located on the host cell membrane, promotes virus entry into the cell by activating the s protein. once the virus enters the cell, the viral rna is released, polyproteins are translated from the rna genome, and replication and transcription of the viral rna genome occur via protein cleavage and assembly of the replicase-transcriptase complex. viral rna is replicated, and structural proteins are synthesized, assembled, and packaged in the host cell, after which viral particles are released (fig. d) [ ]. these proteins are critical to the viral life cycle and provide potential targets for drug therapies. for example, ace -based peptide, clpro inhibitor ( clpro- ), and a novel vinylsulfone protease inhibitor have been experimentally demonstrated to be effective against sars-cov- [ ]. the sars-cov- s protein is highly conserved among all human coronaviruses (hcovs) and is involved in receptor recognition, viral attachment, and entry into host cells.
due to its indispensable functions, it represents one of the most important targets for covid- vaccine and therapeutic research. in this review, we summarize advances in research of the sars-cov- s protein and its therapeutic targeting. with a size of - kda, the s protein consists of an extracellular n-terminus, a transmembrane (tm) domain anchored in the viral membrane, and a short intracellular c-terminal segment [ ] . s normally exists in a metastable, prefusion conformation; once the virus interacts with the host cell, extensive structural rearrangement of the s protein occurs, allowing the virus to fuse with the host cell membrane. the spikes are coated with polysaccharide molecules to camouflage them, evading surveillance of the host immune system during entry [ ] . the total length of sars-cov- s is aa and consists of a signal peptide (amino acids - ) located at the n-terminus, the s subunit ( - residues), and the s subunit ( - residues); the last two regions are responsible for receptor binding and membrane fusion, respectively. in the s subunit, there is an n-terminal domain ( - residues) and a receptor-binding domain (rbd, - residues); the fusion peptide (fp) ( - residues), heptapeptide repeat sequence (hr ) ( - residues), hr ( - residues), tm domain ( - residues), and cytoplasm domain ( - residues) comprise the s subunit ( fig. a) [ ] . s protein trimers visually form a characteristic bulbous, crown-like halo surrounding the viral particle (fig. a) . based on the structure of coronavirus s protein monomers, the s and s subunits form the bulbous head and stalk region [ ] . the structure of the sars-cov- trimeric s protein has been determined by cryo-electron microscopy at the atomic level, revealing different conformations of the s rbd domain in opened and closed states and its corresponding functions (fig. b , c) [ , ] . in the native state, the cov s protein exists as an inactive precursor. 
during viral infection, target cell proteases activate the s protein by cleaving it into s and s subunits [ ], which is necessary for activating the membrane fusion domain after viral entry into target cells [ ]. similar to other coronaviruses, the s protein of sars-cov- is cleaved into s and s subunits by cellular proteases, and the serine protease tmprss primes the s protein. although the cleavage site of sars-cov is known, that of sars-cov- s has not yet been reported [ , ].

structure of the s subunit

the binding of virus particles to cell receptors on the surface of the host cell initiates virus infection; therefore, receptor recognition is an important determinant of viral entry and a drug design target. rbd situated in the s subunit binds to the cell receptor ace in the region of aminopeptidase n. the s region contains the ntd and ctd, and atomic details at the binding interface demonstrate key residue substitutions in sars-cov- -ctd. in addition, the sars-cov- s ctd binding interface has more residues that directly interact with the receptor ace than does sars-rbd ( versus ), and a larger surface area is buried with sars-cov- s ctd in complex with ace than with sars s rbd. mutations of key residues play an important role in enhancing the interaction with ace . f in sars-cov- , instead of i in sars rbd, forms strong aromatic-aromatic interactions with ace y , and e in sars-cov- -ctd, instead of p in sars rbd, forms ionic interactions with k , which leads to higher affinity for receptor binding than rbd of sars-cov (fig. d) [ , , , ]. the rbd region is a critical target for neutralizing antibodies (nabs), and sars-cov- and sars-cov rbd are ~ %- % similar in sequence. nine ace -contacting residues in cov rbd are fully conserved, and four are partially conserved.
analysis of the rbm (receptor-binding motif, a portion of rbd making direct contacts with ace ) of sars-cov and sars-cov- revealed that most residues essential for ace binding in the sars-cov s protein are conserved in the sars-cov- s protein. however, some studies showed that murine monoclonal antibodies (mabs) and polyclonal antibodies against sars-rbd are unable to interact with the sars-cov- s protein, revealing differences in antigenicity between sars-cov and sars-cov- [ ]. similarly, a sars-cov rbd-specific antibody failed to block infection mediated by the s protein of sl-cov-shc [ ], which suggests that the s rbd, being highly mutable, may not be an ideal target for broad-spectrum anti-cov drugs.

structure of the s subunit

the s subunit, composed successively of a fp, hr , hr , tm domain, and cytoplasmic domain (ct), is responsible for viral fusion and entry. fp is a short segment of - amino acids conserved across the viral family, composed mainly of hydrophobic residues, such as glycine (g) or alanine (a), which anchor to the target membrane when the s protein adopts the prehairpin conformation. previous research has shown that fp plays an essential role in mediating membrane fusion by disrupting and connecting lipid bilayers of the host cell membrane [ ].

[fig. caption, panels b-e: b-c, the s protein rbd in closed and opened states. d, the s protein binds to ace with the opened rbd in the s subunit. e, the six-helix structure formed by hr and hr of the s subunit.]

hr and hr are composed of a repetitive heptapeptide: hpphcpc, where h is a hydrophobic or traditionally bulky residue, p is a polar or hydrophilic residue, and c is another charged residue [ ]. hr and hr form the six-helical bundle ( -hb) (fig. e), which is essential for the viral fusion and entry function of the s subunit [ ]. hr is located at the c-terminus of a hydrophobic fp, and hr is located at the n-terminus of the tm domain [ ].
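the hpphcpc heptad pattern described above can be illustrated with a small scoring routine. this is a minimal sketch, not a method from the review: the residue groupings (which amino acids count as hydrophobic, polar, or charged) and the function names are assumptions chosen for illustration, and real heptad-repeat prediction tools use more elaborate scoring.

```python
# Assumed residue classes (illustrative only; other groupings are possible).
HYDROPHOBIC = set("AVILMFWY")
CHARGED = set("DEKRH")
POLAR = set("STNQGCP") | CHARGED  # "polar or hydrophilic" includes charged residues

CLASSES = {"H": HYDROPHOBIC, "P": POLAR, "C": CHARGED}


def heptad_score(seq, pattern="HPPHCPC", offset=0):
    """Fraction of residues whose class matches the repeating pattern.

    `offset` shifts the register, i.e. which pattern position the first
    residue of `seq` is compared against.
    """
    seq = seq.upper()
    if not seq:
        return 0.0
    hits = sum(
        1
        for i, aa in enumerate(seq)
        if aa in CLASSES[pattern[(i + offset) % len(pattern)]]
    )
    return hits / len(seq)


def best_register(seq, pattern="HPPHCPC"):
    """Try every register and return (offset, score) for the best one.

    Ties are broken in favour of the smallest offset.
    """
    scores = [(heptad_score(seq, pattern, k), k) for k in range(len(pattern))]
    score, offset = max(scores, key=lambda t: (t[0], -t[1]))
    return offset, score


if __name__ == "__main__":
    toy = "LSTLEKELSTLEKE"  # hypothetical peptide built to fit the pattern
    print(heptad_score(toy))
    print(best_register("STLEKELSTLEKEL"))
```

the toy peptide is constructed to match the pattern exactly; scanning a real hr sequence would give a score below 1, and the threshold for calling a region heptad-like would have to be chosen empirically.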
the downstream tm domain anchors the s protein to the viral membrane, and the s subunit ends in a ct tail [ ] . rbd binds to ace , and s changes conformation by inserting fp into the target cell membrane, exposing the prehairpin coiledcoil of the hr domain and triggering interaction between the hr domain and hr trimer to form -hb, thus bringing the viral envelope and cell membrane into proximity for viral fusion and entry [ ] . hr forms a homotrimeric assembly in which three highly conserved hydrophobic grooves on the surface that bind to hr are exposed. the hr domain forms both a rigid helix and a flexible loop to interact with the hr domain. in the postfusion hairpin conformation of covs, there are many strong interactions between the hr and hr domains inside the helical region, which is designated the "fusion core region" (hr core and hr core regions, respectively). targeting the heptad repeat (hr) has attracted the greatest interest in therapeutic drug discovery. the s protein is an important target protein for the development of specific drugs, while the s rbd domain is part of a highly mutable region and is not an ideal target site for broad-spectrum antiviral inhibitor development [ ] . in contrast, the hr region of the s subunit plays an essential role in hcov infections and is conserved among hcovs, as is the mode of interaction between hr and hr [ ] . a synthetic peptide derived from the stem region of the zikv envelope protein was demonstrated in to potently inhibit infection by zikv and other flaviviruses in vitro [ ] , implying antiviral efficiency of peptides derived from conserved regions of viral proteins. peptides derived from the hr region of class i viral fusion proteins of enveloped viruses competitively bind to viral hr and effectively inhibit viral infection [ ] . therefore, hr is a promising target for the development of fusion inhibitors against sars-cov- infection. 
the s protein on the surface of the virus is a key factor involved in infection. it is a trimeric class i tm glycoprotein responsible for viral entry, and it is present in all kinds of hcovs, as well as in other viruses such as hiv (hiv glycoprotein , env), influenza virus (influenza hemagglutinin, ha), paramyxovirus (paramyxovirus f), and ebola (ebola virus glycoprotein) [ ] . similar to other coronaviruses, the s protein of sars-cov- mediates receptor recognition, cell attachment, and fusion during viral infection [ , , , [ ] [ ] [ ] . the trimer of the s protein located on the surface of the viral envelope is the basic unit by which the s protein binds to the receptor [ , ] . the s domain contains the rbd, which is mainly responsible for binding of the virus to the receptor, while the s domain mainly contains the hr domain, including hr and hr , which is closely related to virus fusion [ ] . receptor binding as mentioned above, the sars-cov- s protein binds to the host cell by recognizing the receptor ace [ ] . ace is a homolog of ace, which converts angiotensin i to angiotensin - [ ] . ace is distributed mainly in the lung, intestine, heart, and kidney, and alveolar epithelial type ii cells are the major expressing cells [ ] . ace is also a known receptor for sars-cov. the s subunit of the sars-cov s protein binds with ace to promote the formation of endosomes, which triggers viral fusion activity under low ph (fig. a, b) [ ] . interaction between the s protein and ace can be used to identify intermediate hosts of sars-cov- , as ace from different species, such as amphibians, birds, and mammals, has a conserved primary structure [ ] . luan et al. compared the binding affinities between ace and sars-cov- s from mammals, birds, snakes, and turtles and found that the ace of bovidae and cricetidae interacted well with sars-cov- s rbd but that ace from snakes and turtles could not. 
the s protein binds to ace through the rbd region of the s subunit, mediating viral attachment to host cells in the form of a trimer [ ]. sars-cov- s binds to human ace with a dissociation constant (k d ) of . nm, whereas that of sars-cov s is . nm [ ], indicating that sars-cov- s binds ace with higher affinity than does sars-cov s. through the identification of sars-cov- proteins, researchers found ~ % difference in s between sars-cov- and sars-cov, whereas that of rbd is ~ % [ ].

viral fusion

viral fusion refers to fusion of the viral membrane and host cell membrane, resulting in the release of the viral genome into the host cell. cleavage of the sars-cov- s and s subunits is the basis of fusion. the s protein is cleaved into two parts, the s subunit and s subunit, by host proteases, and the subunits exist in a noncovalent form until viral fusion occurs [ ]. researchers have found that the specific furin cleavage site is located in the cleavage site of sars-cov- but not in other sars-like covs [ , ]. mutation of the cleavage site in sars-cov- or sars-like covs has revealed that the s protein of sars-cov- exists in an uncleaved state but that the others are mainly in a cleaved state. sars-cov- s has multiple furin cleavage sites, which increases the probability of being cleaved by furin-like proteases and thereby enhances its infectivity [ , ]. the furin-like cleavage domain is also present in highly pathogenic influenza virus and is related to its pathogenicity, as observed in the avian influenza outbreak in hong kong in [ , ]. in addition, host cell proteases such as tmprss are essential for s protein priming, and they have been shown to be activated in the entry of sars-cov and influenza a virus [ , , ]. another host cell protease that has been proven to cleave viral s protein is trypsin [ ]. in summary, the s protein of sars-cov- is similar to that of sars-cov, and host cell proteases are essential for promoting s protein cleavage of both sars-cov- and sars-cov.
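a sequence scan makes the idea of a furin-like cleavage site more concrete. the sketch below assumes the commonly cited minimal furin consensus r-x-x-r (arginine, any two residues, arginine); that motif is brought in for illustration and is not stated in the text above, and the function name and example peptide are hypothetical. a motif match only flags a candidate site; it does not show that cleavage occurs.

```python
import re

# Assumed minimal furin-like consensus: R-X-X-R.
# The lookahead lets overlapping candidates be reported as well.
FURIN_LIKE = re.compile(r"(?=(R..R))")


def find_furin_like_sites(seq):
    """Return (0-based start index, matched motif) for every R-X-X-R window."""
    return [(m.start(1), m.group(1)) for m in FURIN_LIKE.finditer(seq.upper())]


if __name__ == "__main__":
    peptide = "NSPRRARSVAS"  # illustrative peptide containing one R-X-X-R window
    for start, motif in find_furin_like_sites(peptide):
        print(start, motif)
```

stricter variants of the motif (for example requiring a basic residue at the third position) can be tested by changing the regular expression alone.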
the presence of a specific furin cleavage site on sars-cov- s might be one reason that sars-cov- is more contagious than sars-cov. the formation of -hb is essential for viral fusion. the fp in the n-terminus of sars-cov- and the two hr domains on s are essential for viral fusion [ ]. after cleavage of the s protein, the fp of sars-cov- is exposed and triggers viral fusion. under the action of some special ligands, the fusion protein undergoes a conformational change and then inserts into the host cell membrane (fig. c) [ ]. for example, the ligand for influenza a virus is h+, while the ligand for hiv is a coreceptor such as ccr or cxcr [ ]. the distance between the viral membrane and host cell membrane is shortened, and the hr domain of the s protein is in close proximity to the host cell membrane, whereas the hr domain is closer to the viral membrane side. then, hr folds back to hr , the two hr domains form a six-helix structure in an antiparallel format of the fusion core, the viral membrane is pulled toward the host cell membrane and tightly binds to it, and the two membranes fuse [ ]. the fundamental role of the s protein in viral infection indicates that it is a potential target for vaccine development, antibody-blocking therapy, and small molecule inhibitors. considering the similarity with sars-cov and mers-cov, potential nabs and inhibitors targeting sars-cov- s are summarized below (fig. ).

antibodies based on the sars-cov- s protein

the s protein is the main antigen component in all structural proteins of sars-cov- . unlike other functional proteins of sars-cov- , it is responsible for inducing the host immune response, and nabs targeting the s protein can induce protective immunity against viral infection.
similar to sars-cov and mers-cov, research on nabs of sars-cov- mainly includes mabs, antigen-binding fragments, single-chain variable region fragments, and single-domain antibodies (nbs), which target s rbd, s -ntd, or s regions to prevent s -mediated fusion [ , ]. on the other hand, multiple sars-cov- vaccine types are under development, including rna/dna-based formulations, recombinant viral epitopes, adenovirus-based vectors, and purified inactivated virus [ ]. the sequence and striking structural similarity between the sars-cov- and sars-cov s proteins emphasize the close relationship between these two viruses, which provides the possibility to treat covid- with antibodies targeting the sars-cov s protein [ ]. compared with sars-cov- rbd, sars-cov- interacts with hace via the c-terminal domain (sars-cov- -ctd), showing higher affinity for receptor binding. rbd can induce highly potent nab responses and has the potential to be developed as an effective and safe subunit vaccine against sars-cov- . sars-cov s polyclonal antibodies obtained from immunized mice completely inhibited the invasion of sars-cov s-mlv (murine leukemia virus), whereas the invasion rate of sars-cov- s-mlv was reduced to ~ % [ ]. the polyclonal anti-sars s antibody t inhibits the entry of sars-cov s but not that of sars-cov- s pseudovirus particles [ ]. consistently, recent studies have reported similar results, showing that three sars rbd-directed mabs, s , m , and r, were unable to bind to sars-cov- rbd [ , , ]. on the other hand, several mabs have shown promising results in neutralizing sars-cov- . cr , a sars-cov-specific human mab, binds potently with sars-cov- (k d of . nm, measured by bli in octetred ), suggesting that cr has the potential to be developed as a candidate therapeutic, alone or in combination with other nabs, for the prevention and treatment of sars-cov- infection [ ].
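the dissociation constants quoted for s protein and antibody binding can be related to occupancy through the standard one-site equilibrium expression, fraction bound = [l] / ([l] + k_d). the sketch below assumes simple 1:1 binding at equilibrium with the ligand in excess; the function name and the concentrations in the example are placeholders for illustration, not values from the studies cited.

```python
def fraction_bound(ligand_nm, kd_nm):
    """Equilibrium fraction of binding sites occupied for 1:1 binding.

    ligand_nm: free ligand concentration in nM (assumed to be in excess).
    kd_nm:     dissociation constant in nM; a lower K_d means tighter binding.
    """
    if ligand_nm < 0 or kd_nm <= 0:
        raise ValueError("concentration must be >= 0 and K_d must be > 0")
    return ligand_nm / (ligand_nm + kd_nm)


if __name__ == "__main__":
    ligand = 10.0  # placeholder concentration in nM
    for kd in (1.0, 10.0, 100.0):  # placeholder K_d values in nM
        print(f"K_d = {kd:6.1f} nM -> fraction bound = {fraction_bound(ligand, kd):.2f}")
```

at a fixed ligand concentration, the binder with the lower k_d occupies a larger fraction of its target, which is the sense in which a lower k_d is read as higher affinity.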
a mab targeting s prepared from immunized transgenic mice expressing human ig variable heavy and light chains has recently been shown to neutralize both sars-cov- and sars-cov infections via an unknown mechanism that is independent of the blockade of rbd-hace interaction [ ] . recently, many human blocking mabs ( mab- b , mab- d , d , n , n , s , p c- f , p b- f , b , h ) have been successfully cloned from single memory b cells from recovered covid- patients [ ] [ ] [ ] [ ] [ ] [ ] . these mabs specifically bind to sars-cov- s to effectively neutralize infection. in addition, sera from sars patients during rehabilitation or animals specifically immunized with sars-cov s may cross-neutralize sars-cov- and reduce s protein-mediated sars-cov- entry (fig. ) [ ] . the stability of the sars-cov- s protein is lower than that of sars-cov s [ ] . the mapping of multiple s sequences of the subgenus sarbecovirus underscores that the s fusion region is more conserved than the s subunit and that the s subunit is more exposed at the viral surface [ ] . the sars-cov s subunit plays a key role in mediating virus-cell fusion and its integration into host cells, where hr and hr interact to form -hb, thus enabling the virus to bind to and fuse with the cell membrane [ ] . sequence alignment shows that sars-cov- hr has the same sequence as sars-cov hr . therefore, sars-cov- hr p ( - residues) was designed to inhibit sars-cov- fusion and entry into a target cell. surprisingly, hr p showed inhibitory activity against sars-cov- s-mediated fusion and sars-cov- pseudovirus, with ic values of . and . μm, respectively [ ] . notably, ek is a pancoronavirus fusion inhibitor targeting the hr domain of hcov s [ ] . the x-ray crystal structure of the -hb core of the sars-cov- s subunit hr and hr domains has been solved, indicating that several mutant residues in the hr region may be related to enhanced interaction in the hr region [ ] . 
subsequently, ek c , a lipopeptide derived from ek , was generated and verified to inhibit sars-cov- s-mediated cell-cell fusion. as expected, the entry of sars-cov- s pseudovirus was also inhibited by ek c , with an ic of . nm,~ -fold more potent than the original ek peptide. another sequence-based lipopeptide fusion inhibitor, ipb , potently inhibits sars-cov- s protein-mediated cell-cell fusion and pseudovirus infection [ ] . in addition to peptide fusion inhibitors, nelfinavir mesylate (viracept), a currently prescribed anti-hiv protease inhibitor, suppresses both sars-cov- s and sars-cov s-mediated cell-cell fusion. viracept is the first reported small molecule fusion inhibitor in addition to peptide fusion inhibitors. moreover, nelfinavir may inhibit the function of tmprss involved in activation of the s protein [ ] . this discovery makes possible clinical applications of anti-sars-cov- therapeutics, especially in the early stage of infection. protease inhibitors targeting sars-cov- s cleavage sites sars-cov- entry requires cleavage of the s protein at the s /s and s sites. proteolysis by tmprss and cathepsin b and l plays an important role in priming sars-cov- s for entry. camostat mesilate is a potent serine protease inhibitor of tmprss . utilizing research on the sars-cov and sars-cov- cell entry mechanism, it has been demonstrated that sars-cov- cellular entry can be blocked by camostat mesilate [ , ] . there are currently five clinical trials registered to evaluate the efficacy of camostat mesilate (clinicaltrials.gov identifier: nct , nct , nct , nct , nct ). in addition, cathepsins in lysosomes are crucial for sars-cov entry via endocytosis. e- d, an inhibitor of cathepsin l, blocks infection with sars-cov and sars-cov- psv [ ] [ ] [ ] . future trials with covid- patients may help to confirm the efficacy of e- d therapy. phosphatidylinositol -phosphate -kinase (pikfyve) is the main enzyme synthesizing pi ( , ) p in early endosomes [ ] . 
apilimod, a potent inhibitor of pikfyve, can significantly reduce the entry of sars-cov s pseudovirus into /hace cells via early endosomes in a dose-dependent manner [ ]. treating /hace cells with another pikfyve inhibitor, ym [ ], also had a similar effect. moreover, a major downstream effector of pi ( , )p , two-pore channel subtype (tpc ) [ ], is important for sars-cov- entry, and tetrandrine (an inhibitor of tpc ) inhibits the activity of sars-cov- s pseudovirus. furin (proprotein convertase (pc) subtilisin kexin , pcsk ), as a member of the pc family, catalyzes the hydrolysis of peptide and protein substrates at paired basic residues [ ]. strikingly, sars-cov- s harbors a furin cleavage site ( - residues) at the s /s boundary, which may increase the efficiency of sars-cov- transmission [ ]. the furin-like cleavage site in the s protein of sars-cov- may have implications for the viral life cycle and pathogenicity. therefore, furin inhibitors can be used as a drug therapy for sars-cov- [ ]. patent literature since describes the use of furin or its inhibitors in the treatment of diseases, and some furin inhibitors have been reported, including α- -pdx (α -antitrypsin portland) [ ], hexa-d-arginine (d r) [ ], serpin proteinase inhibitor (pi ) [ ], and a peptidomimetic furin inhibitor [ ]. the sars-cov- s protein binds to the host cell receptor and induces virus-cell membrane fusion, which plays a vital role in the process of virus invasion. moreover, the high affinity between the s protein and ace increases the infectivity of sars-cov- . mammals including pangolins, pets (dogs and cats), and members of cricetidae may be important for determining key residues for association with s from sars-cov and sars-cov- [ ]. further
understanding of the structure and function of sars-cov- s will allow for additional information regarding invasion and pathogenesis of the virus, which will support the discovery of antiviral therapeutics and precision vaccine design. structural information will also assist in evaluating mutations of the sars-cov- s protein and will help in determining whether these residues have surface exposure and map to known antibody epitopes of s proteins from other coronaviruses. in addition, structural knowledge ensures that the proteins produced by constructs are homogeneous and participate in the prefusion conformation, which should maintain the most neutralization-sensitive epitopes when used as a candidate vaccine or b-cell probe for isolating neutralizing human mabs. furthermore, atomic-level details will enable the design and screening of small molecules that inhibit fusion. since sars-cov- and sars-cov rbd domains share % amino acid sequence identity, future work will be necessary to evaluate whether any of these abs neutralize the newly emerged coronavirus. overall, interaction between the s protein of sars-cov- and ace should be further studied to contribute to elucidation of the mechanism of sars-cov- infection. similarly, focusing on high expression of the s protein or its receptor binding region is also of great significance for the development of vaccines. the s subunit of sars-cov- shows % sequence homology with the sars-cov s domain and is structurally conserved. therefore, the development of antibodies targeting this functional motif may cross-bind and neutralize these two viruses and related covs. antiviral peptides prevent sars-cov- membrane fusion and can potentially be used for the prevention and treatment of infection. it is worth mentioning that ek c , which targets the highly conserved hr domain of the s subunit, is expected to have therapeutic potential against sars-cov- .
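sequence identity figures such as those quoted for the rbd and the s subunit are computed from a pairwise alignment. the sketch below shows one common convention, assuming the two sequences are already aligned to equal length with "-" marking gaps and that gapped columns are excluded from the denominator; other conventions (for example dividing by the full alignment length) give different percentages, and the function name and example strings are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over alignment columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (x, y)
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x != gap and y != gap
    ]
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


if __name__ == "__main__":
    # Hypothetical aligned fragments, not real spike sequences.
    print(percent_identity("ACDE", "ACDF"))
    print(percent_identity("AC-E", "ACDE"))
```

because the result depends on the alignment and on how gaps are counted, identity values from different studies are comparable only when the same convention is used.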
more importantly, ek c can be used as a nasal drop, which increases its medicinal properties; it also possesses a high genetic barrier to resistance and does not easily induce drug-resistant mutations. on the other hand, peptide fusion inhibitors may not be widely used clinically and have low bioavailability. therefore, the development of oral small molecule fusion inhibitors is a major direction. in the course of virus epidemics, the ability to adapt to external pressure is an important factor affecting the spread of the virus. regarding the envelope s protein, recombination or mutation in the gene of its rbd can occur to promote transmission between different hosts and lead to a higher fatality rate [ ]. mutation of the aspartate (d) at position to glycine (g ) results in a more pathogenic strain of sars-cov- [ ], which makes it more difficult to develop antibodies or vaccines that target nonconservative regions. to effectively prevent disease, combinations of different mabs that identify different epitopes on the sars-cov- s surface can be assessed to neutralize a wide range of isolates, including escape mutants [ ]. currently, no specific therapeutic or prophylactic has been used clinically to treat or prevent sars-cov- infection. nonspecific antiviral drugs, such as ifn-α (recombinant human ifn-α b, ifn-α a), remdesivir, chloroquine, favipiravir, and lopinavir-ritonavir (aluvia), have been clinically used to treat covid- in china [ ]. nevertheless, niaid-vrc scientists are developing a candidate vaccine expressing sars-cov- s protein in mrna vaccine platform technology. clinical trials of the vaccine are expected in the coming months. continued strengthening of the monitoring of the sars-cov- s protein is of great significance for subsequent new drug development and protection against covid- .
a novel coronavirus from patients with pneumonia in china analysis of therapeutic targets for sars-cov- and discovery of potential drugs by computational methods research and development on therapeutic agents and vaccines for covid- and related human coronavirus diseases pharmacologic treatments for coronavirus disease (covid- ): a review genomic characterisation and epidemiology of novel coronavirus: implications for virus origins and receptor binding rna based mngs approach identifies a novel human coronavirus from two individual pneumonia cases in wuhan outbreak genomic characterization of the novel human-pathogenic coronavirus isolated from a patient with atypical pneumonia after visiting wuhan functional assessment of cell entry and receptor usage for sars-cov- and other lineage b betacoronaviruses coronaviruses: an overview of their replication and pathogenesis learning from the past: possible urgent prevention and treatment options for severe acute respiratory infections caused by -ncov the coronavirus spike protein is a class i virus fusion protein: structural and functional characterization of the fusion core complex site-specific glycan analysis of the sars-cov- spike fusion mechanism of -ncov and fusion inhibitors targeting hr domain in spike protein coronavirus membrane fusion mechanism offers a potential target for antiviral development cryo-em structure of the -ncov spike in the prefusion conformation structure, function, and antigenicity of the sars-cov- spike glycoprotein tmprss activates the human coronavirus e for cathepsin-independent host cell entry and is expressed in viral target cells in the respiratory epithelium sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor cleavage of spike protein of sars coronavirus by protease factor xa is associated with viral infectivity structural and functional basis of sars-cov- entry by using human ace structure of the sars-cov- spike receptor-binding 
this project was supported by grants from guangzhou science and technology program (# to wx), the fund of natural science foundation of guangdong province (# a to wx), and grants from major scientific and technological projects of guangdong province (# b to swl).

competing interests: the authors declare no competing interests.

key: cord- -roj ksvc authors: lan, jiaming; deng, yao; chen, hong; lu, guangwen; wang, wen; guo, xiaojuan; lu, zhuozhuang; gao, george f.; tan, wenjie title: tailoring subunit vaccine immunity with adjuvant combinations and delivery routes using the middle east respiratory coronavirus (mers-cov) receptor-binding domain as an antigen date: - - journal: plos one doi: . /journal.pone. sha: doc_id: cord_uid: roj ksvc

the development of an effective vaccine is critical for prevention of a middle east respiratory syndrome coronavirus (mers-cov) pandemic. some studies have indicated the receptor-binding domain (rbd) protein of mers-cov spike (s) is a good candidate antigen for a mers-cov subunit vaccine. however, highly purified proteins are typically not inherently immunogenic. we hypothesised that humoral and cell-mediated immunity would be improved with a modification of the vaccination regimen. therefore, the immunogenicity of a novel mers-cov rbd-based subunit vaccine was tested in mice using different adjuvant formulations and delivery routes.
different vaccination regimens were compared in balb/c mice immunized times intramuscularly (i.m.) with a vaccine containing µg of recombinant mers-cov rbd in combination with either aluminium hydroxide (alum) alone, alum and polyriboinosinic acid (poly i:c) or alum and cytosine-phosphate-guanine (cpg) oligodeoxynucleotides (odn). the immune responses of mice vaccinated with rbd, incomplete freund’s adjuvant (ifa) and cpg odn by a subcutaneous (s.c.) route were also investigated. we evaluated the induction of rbd-specific humoral immunity (total igg and neutralizing antibodies) and cellular immunity (elispot assay for ifn-γ spot-forming cells and splenocyte cytokine production). our findings indicated that the combination of alum and cpg odn optimized the development of rbd-specific humoral and cellular immunity following subunit vaccination. interestingly, robust rbd-specific antibody and t-cell responses were induced in mice immunized with the rrbd protein in combination with ifa and cpg odn, but only low levels of neutralizing antibodies were elicited. our data suggest that murine immunity following subunit vaccination can be tailored using adjuvant combinations and delivery routes. the vaccination regimen used in this study is promising and could improve the protection offered by the mers-cov subunit vaccine by eliciting effective humoral and cellular immune responses.

in a novel human coronavirus, middle east respiratory syndrome coronavirus (mers-cov), caused outbreaks of a sars-like illness in the middle east, and is now considered a threat to global public health [ , ]. as of july , , the world health organization (who) reported confirmed cases of mers-cov infection, including deaths (a case fatality rate of . %) [ ]. studies now show that camels are a likely primary source of the mers-cov that is infecting humans [ , , ].
however, the routes of transmission between camels and people, which are the key to stopping transmission of the virus, are far from clearly understood. the continued threat of mers-cov necessitates the development of an effective vaccine. some studies have indicated that recombinant receptor-binding domain (rrbd) protein of mers-cov spike (s) is a good candidate antigen for a mers-cov subunit vaccine [ , , , ]. however, highly purified proteins are typically not inherently immunogenic, as they usually lack the means to directly stimulate the innate immune system [ ]. in addition, they are often prone to degradation. hence, they call for efficient delivery systems and potent immunostimulants, jointly denoted as adjuvants, to evoke the desired antigen-specific immune response phenotype and enable successful vaccination [ ].

aluminium, one of the most common adjuvants in non-living vaccines, has a record of successful use in human vaccination, where it promotes antibody-mediated protective immunity [ ]. another classic adjuvant type is based on a water-in-oil emulsion formulation, such as incomplete freund's adjuvant (ifa). recently, research has focused on adjuvants that signal through pattern recognition receptors (prrs), such as toll-like receptors (tlrs) [ ]. cytosine-phosphate-guanine (cpg) oligodeoxynucleotides (odns), which activate b cells and plasmacytoid dendritic cells via tlr and induce both innate and adaptive immunity, are currently being developed as a vaccine adjuvant [ ]. another frequently used adjuvant is polyriboinosinic acid (poly(i:c)), a synthetic dsrna that mimics the effects of naturally occurring dsrna, a tlr agonist [ , ]. besides enhancing the immune response, adjuvants can tailor its polarization. for example, polarized th -type immunity can be achieved by the addition of freund's adjuvant or cpg dna to an antigen.
on the other hand, th antibody responses can be induced by alum, as indicated by increased igg relative to igg a [ , ]. however, in situations where both th and th responses are required for protection, the choice of one regimen over another might be counterproductive. this has led to additional research into alternative adjuvants or adjuvant combinations that promote balanced, mixed th /th responses [ ]. in recent years, the combination of antigens with more than one adjuvant, called the adjuvant system approach, has produced vaccines with the ability to generate effective immune responses adapted to both the pathogen and the target population [ ]. by using multiple adjuvants in combination, antigen presenting cell (apc) activation is influenced at more than one level, guiding the subsequent adaptive pathways and ultimately inducing a more robust immune response [ ]. the induction of robust humoral (including potent neutralizing antibody) and cellular immune responses is likely essential for immediate and sustained protective immunity in mers-cov vaccine design. in this study, different adjuvant combination regimens including alum, ifa, cpg and poly(i:c) were compared in a murine model, in an effort to promote a balance between th and th immune responses to bystander rrbd antigen spanning residues - of mers-cov s and to develop an effective vaccine against mers-cov infection.

animal studies were carried out in strict compliance with the guide for the care and use of laboratory animals of the people's republic of china. the study protocol was approved by the committee on the ethics of animal experiments of the chinese centre for diseases control and prevention. all procedures were performed under ethyl ether anesthesia, and all efforts were made to minimize suffering.
mers-cov rrbd protein, containing a -amino-acid fragment spanning residues - (figure a) of genbank number jx , was prepared using a bac-to-bac baculovirus expression system as described in detail previously [ ]. the resulting rrbd was verified by sds-page (figure b) and by western blot with a mouse polyclonal antibody against the spike of mers-cov (figure c). before vaccination, the rrbd protein was quantified by the bradford method. the rrbd protein was combined with different adjuvants immediately prior to immunisation. aluminium hydroxide was kindly provided by the north china pharmaceutical group corporation genetech biotechnology development company. the odn motif containing unmethylated cpg ( -tccatgacgttcctgacgtt- ) was synthesised by takara bio inc. poly(i:c) and ifa were purchased from sigma (st. louis, mo). a single dose ( µg) of rrbd protein ( µl) was combined with either µg of alum alone (rbd/a), alum plus µg of cpg (rbd/a+c), alum plus µg of poly(i:c) (rbd/a+p) or µg of cpg and µl of ifa (rbd/i+c).

six-to-eight-week-old female balb/c mice (animal care centre, chinese academy of medical science, beijing, china) were randomly distributed into eight groups. eight mice in each group were vaccinated three times with rrbd protein at -week intervals by either an intramuscular (i.m.) or a subcutaneous (s.c.) route (table ). sera were collected weeks after each vaccination and heat-inactivated at °c for min before detection of rbd-specific and neutralizing antibodies. mice were sacrificed weeks after the last immunisation, and their lungs and spleens were harvested for analysis. a schematic of the vaccination and analysis timeline is shown in figure .

elisa was used to detect the mers-cov rbd-specific antibody response in immunised mice. briefly, -well elisa plates were pre-coated with rrbd protein ( ng/well) overnight at °c and blocked with % non-fat milk for h at °c.
serially diluted sera from the eight mice in each group were added to the plates and incubated at °c for h, followed by four washes with phosphate-buffered saline (pbs) containing . % tween (pbst). bound antibodies were incubated with hrp-conjugated anti-mouse igg, igg , igg a or igg b ( : , , sigma) for h at °c. the reaction was visualised using , , , tetramethylbenzidine (tmb) peroxidase substrate solution (invitrogen) and stopped by addition of m h so . absorbance at nm was measured using an elisa plate reader (wellscan mk ). the cut-off value was set . -fold above that of the negative control.

antibody avidity was determined using the elisa method described by vermont et al. [ ]. briefly, sera were diluted to a titre of : , and an ascending concentration of the chaotropic agent nascn ( - m) was added to the plate. plates were incubated for min at room temperature (rt) before washing and development to determine total igg. as a control for antibody specificity, elisa was used to measure the total anti-mers-cov igg titres of pre- and post-vaccination serum samples.

the conventional neutralization assay using live mers-cov is cumbersome and has to be performed in biosafety level- facilities. therefore, we adapted a mers-cov pseudovirus system, which is sensitive and quantitative and can be conducted in biosafety level- facilities, as reported by zhao et al. [ ]. in brief, t cells were co-transfected with a plasmid encoding codon-optimized mers-cov s protein and a plasmid encoding an env-defective, luciferase-expressing hiv- genome (pnl - r-e-luc) using fugene hd reagents (roche, basel, switzerland). supernatants containing mers-cov pseudovirus were harvested h post-transfection and used for single-cycle infection. huh . cells were plated at cells/well in -well tissue-culture plates and grown overnight. the supernatants containing pseudovirus were pre-incubated with -fold serially diluted mouse sera at °c for h before addition to cells.
the culture was refed with fresh medium h later and incubated for an additional h. cells were washed with pbs and lysed using the lysis reagent included in a luciferase kit (promega). aliquots of cell lysates were transferred to -well costar flat-bottom luminometer plates (corning costar), followed by addition of luciferase substrate (promega). relative light units were determined immediately in the glomax luminometer (promega). all experiments were carried out in triplicate. the pseudovirus inhibition (pi) rate was calculated as: (relative luciferase units of mock sera - relative luciferase units of immune serum at a given dilution) / relative luciferase units of mock sera.

to evaluate the antigen-specific t-cell response induced by the vaccination regimes, an ifn-γ elispot assay was performed as described previously [ ]. briefly, -well plates were coated with µl per well of µg/ml anti-mouse ifn-γ antibody (bd pharmingen) overnight at °c and then blocked for h at rt. freshly harvested splenocytes ( per well) or lung lymphocytes from the eight mice in each group were isolated as described previously [ ]. then, µg/ml of a synthesised -mer peptide library, which overlapped the mers-cov s rbd by amino acids, was added to the wells in triplicate. next, a biotinylated detection antibody (bd pharmingen) and streptavidin-horseradish peroxidase were added. spots were developed by the addition of an aec ( -amino- -ethylcarbazole) substrate solution, which produced a coloured spot after -min rt incubation in the dark. finally, ifn-γ spot-forming cells (sfcs) were counted. phorbol -myristate -acetate (pma) and ionomycin were added to the positive-control group, whereas the negative-control group received no stimuli. the number of peptide-specific ifn-γ-secreting t cells was calculated by subtracting the negative-control value from the sfc count.

a cba analysis was conducted to investigate the levels of th - and th -type cytokine secretion [ ] in mice after three immunizations.
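the two background-corrected readouts described in these methods, the pseudovirus inhibition rate and the peptide-specific spot count, can be sketched in a few lines. this is a minimal illustration and not the authors' analysis code: the function names and the example readings are assumptions made for demonstration, and only the two formulas themselves come from the text.

```python
def pseudovirus_inhibition(mock_rlu: float, immune_rlu: float) -> float:
    """Fraction of pseudovirus entry inhibited at one serum dilution.

    PI = (RLU of mock sera - RLU of immune serum) / RLU of mock sera
    """
    if mock_rlu <= 0:
        raise ValueError("mock-sera RLU must be positive")
    return (mock_rlu - immune_rlu) / mock_rlu


def peptide_specific_sfc(stimulated_sfc: float, negative_control_sfc: float) -> float:
    """Peptide-specific spot-forming cells: stimulated wells minus unstimulated wells."""
    return stimulated_sfc - negative_control_sfc


# Hypothetical readings, for illustration only (not data from the study).
mock = 20000.0    # relative luciferase units with mock sera
immune = 5000.0   # relative luciferase units with immune serum at one dilution
print(pseudovirus_inhibition(mock, immune))  # 0.75
print(peptide_specific_sfc(120, 8))          # 112
```

a pi rate computed this way is a fraction between 0 and 1 when the immune serum reduces the signal; a negative value would mean the immune-serum well read higher than the mock well.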
in brief, splenocytes ( per well) from the eight mice in each group were distributed in -well plates and stimulated with µg/ml of pooled rbd peptide. plates were incubated for h at °c and supernatants were harvested. the concentrations of cytokines, including il- , il- , il- , il- , tnf-α, il- a and ifn-γ, were measured using a mouse th /th /th cytokine kit (bd biosciences) and a facs calibur flow cytometer (becton dickinson). data were analysed using the fcap array software (becton dickinson).

statistical analyses were conducted using the one-way anova function in the spss . software package. a p-value less than . was considered to indicate statistical significance.

the vaccination regime affects the rbd-specific igg response in mice

to assess the humoral immune response to different immunisation regimens, mice were immunised with rrbd protein combined with different adjuvants three times at -week intervals. serum samples were collected weeks after each vaccination and total anti-mers-cov rbd igg antibody titres were determined by elisa. the results indicated that rrbd protein combined with any adjuvant, including alum, ifa, cpg or poly(i:c), could induce an rbd-specific igg antibody response in the majority of mice after the second immunisation. in a few vaccinated mice, rbd-specific igg antibodies could be detected even after the first immunisation. the seroconversion rates of the different groups following the first and the second immunisations are shown in table . as shown in figure a, there was no discernible increase in igg titres after the third immunisation compared to the second immunisation. among the vaccination regimes, rbd/a+c and rbd/i+c elicited the highest total igg titres (p < . , figure a), and the difference in igg titre between these two groups was not significant (p ≥ . , figure a). similarly, as shown in figure a, the difference in igg titre between the rbd/a and rbd/a+p groups was not significant.
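the group comparisons reported here rest on the one-way anova named in the statistical methods. as a rough sketch of what that test computes, the f statistic can be derived from between-group and within-group sums of squares. the authors used spss; the pure-python function below is only an illustrative stand-in, its name and the example values are hypothetical, and it stops at the f statistic because the p-value additionally requires the f distribution from a statistics package.

```python
from statistics import mean


def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a one-way ANOVA.

    `groups` is a list of lists, one list of measurements per group.
    """
    if len(groups) < 2 or any(len(g) < 2 for g in groups):
        raise ValueError("need at least two groups with two values each")
    all_values = [x for g in groups for x in g]
    grand_mean = mean(all_values)
    # Between-group sum of squares: group size times squared deviation of the group mean.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Within-group sum of squares: squared deviations from each group's own mean.
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within


# Hypothetical measurements for three groups (illustration only, not study data).
f_stat, df_b, df_w = one_way_anova_f([[1, 2, 3], [2, 3, 4], [5, 6, 7]])
print(round(f_stat, 6), df_b, df_w)  # 13.0 2 6
```

a large f value means the group means differ by more than the scatter within groups would explain; the significance threshold is then applied to the p-value derived from f and the two degrees of freedom.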
the rbd-specific antibody titres were lower than : in the adjuvant control groups at each of the three vaccinations.

the responses to the various vaccination regimes were investigated using nascn antibody-displacement elisa to measure antibody avidity (figure b). mice that received the rbd/a+c or rbd/i+c regimes had higher antibody avidity weeks after the final vaccination than those that received the rbd/a or rbd/a+p regimes. it was also noteworthy that high antibody avidity correlated with a high igg titre in mice.

to further characterise the immune response to the different vaccination regimes, igg isotype analyses were performed weeks after the final vaccination using secondary antibodies against igg , igg a and igg b. as shown in figures c, d, e and f, mice immunised with rbd/a+c or rbd/i+c produced higher igg and igg a titres than mice immunised with rbd/a or rbd/a+p. also, the igg to igg a ratio revealed a th -skewed response in mice that received the rbd/a+c or rbd/i+c regimes. in contrast, the rbd/a and rbd/a+p regimes produced a higher igg /igg ratio, indicating a th response. the titres of rbd-specific igg b antibodies, however, were not significantly different among the vaccination groups (p ≥ . ).

neutralizing antibodies in the sera of mice immunised with different vaccination regimes were evaluated with a pseudovirus-based neutralization assay. low levels of neutralizing antibodies were detected weeks after the first or second vaccination in all of the sera tested, although the total igg antibody levels had almost peaked after the second vaccination. the highest level of neutralizing antibodies was induced after the last vaccination (figure ). the pseudovirus inhibition (pi) rates are shown in figure . as shown, the rbd/a+c regime had the highest neutralizing antibody activity (p < . ). surprisingly, only a low level of neutralizing antibody was detectable in the sera of mice immunised s.c.
with the rbd/i+c regime, although sera from this group had high total igg titres.

to characterise the cellular immune responses elicited by the vaccination regimes, single ifn-γ-producing cells were quantified by elispot. both systemic and local cellular immune responses were assessed using lymphocytes from the spleen and lungs of immunised mice. the peptide library used to stimulate the lymphocytes was described in the materials and methods section. results are expressed as the number of sfcs per input cells. adjuvants without rrbd did not elicit a clear cellular response in the spleen weeks after the third immunisation (figure a). neither rbd/a nor rbd/a+p induced a significant cellular immune response. in contrast, the rbd/a+c and rbd/i+c regimens induced a detectable systemic cellular immune response. furthermore, the rbd/i+c regimen induced the greatest cellular immune response, with the greatest number of ifn-γ-producing cells in the spleen (p < . ). a significant cellular immune response in the lung was induced only by the rbd/i+c regime, although a few ifn-γ-producing cells were detected in all immunised mice (figure b). we therefore concluded that while both the rbd/a+c and rbd/i+c regimes could induce a systemic cellular immune response in mice, only the rbd/i+c regime could elicit a significant local cellular immune response in the lung.

a higher frequency of rbd-specific tnf-α- and il- -producing t cells was induced with alum and cpg via an i.m. route

the cytokine profiles of spleen cells from immunised mice were analysed after stimulation with rbd-specific peptides. in the cba, splenocytes from mice immunised with rbd/a+c or rbd/i+c produced ifn-γ (figure a). in contrast, il- was produced by splenocytes following immunisation with rrbd combined with any of the adjuvants (figure b). however, the differences in ifn-γ and il- production among the groups were not significant (p ≥ . ).
compared with the other groups, splenocytes from mice immunised with rbd/a+c produced significantly higher levels of tnf-α (figure c) and il- (figure d) (p < . ). together, these results indicated that the combination of alum and cpg could induce mixed th and th immune responses in the rrbd antigen model of mers-cov, although the responses showed a th polarization in the isotype elisa and elispot assays. similarly, the high levels of ifn-γ, il- and il- indicated that mixed th and th immune responses could also be induced by the rbd/i+c regime (figure a, b, e). in contrast, as shown in figure f, the rbd/a+p regime induced the highest level of il- (p < . ), which indicated a th -inclined response consistent with the igg isotype results. however, il- a was not detected in any of the vaccination groups (data not shown).

coronaviruses can adapt rapidly to new hosts, and an adaptation of mers-cov that allowed the virus to replicate efficiently in humans would be a major public health concern, since such an adaptation could trigger a pandemic [ ]. the development of an effective vaccine is critical to prevent a potential mers-cov pandemic. previous studies have shown that vaccination with the sars-cov rbd induces highly potent neutralizing antibodies and significantly inhibits sars-cov infection [ , ]. it was therefore proposed that vaccination with the rbd of mers-cov, which belongs to the same betacoronavirus genus as sars-cov [ , ], might also inhibit mers-cov infection and induce a neutralizing antibody response against mers-cov. du et al. [ ] showed that a recombinant protein containing a -amino acid fragment (residues - ) of the truncated rbd of the mers-cov spike protein fused with a human igg fc fragment (s - -fc) was able to induce strong mers-cov s-specific antibodies in vaccinated mice; these antibodies block the binding of the rbd to dipeptidyl peptidase (dpp ), the human mers-cov receptor [ ], and effectively neutralize mers-cov infection.
they also showed [ ] that residues to in the s protein of mers-cov induced significant neutralizing antibody responses, suggesting that this region has the potential to be developed as a mers-cov vaccine. mou et al. [ ] showed that rabbit polyclonal antibodies raised against a -amino-acid rbd fragment of the s protein (residues to ) efficiently neutralized virus infectivity. however, none of these studies evaluated the immunogenicity of rrbd protein systematically in an animal model. recently, ma et al. [ ] suggested the possibility of developing a recombinant rbd protein containing residues - into an effective and safe mucosal mers vaccine, delivered through the intranasal route with poly(i:c) as the only adjuvant in a mouse model. meanwhile, the need for vaccines with the ability to generate an effective immune response has led to the combination of antigens with more than one adjuvant, the 'adjuvant system' approach. the adjuvant system approach aids in the development of vaccines that generate effective immune responses [ , ].

in this study, the roles of four adjuvants (alum, ifa, cpg and poly(i:c)) in rrbd subunit vaccination were investigated, with the aim of inducing an effective immune response through the use of tailored adjuvant combinations and delivery routes. consistent with the above studies, all vaccination regimes containing rrbd induced rbd-specific cellular and humoral immune responses. however, a more robust immune response was elicited when mice were immunised with the rbd/a+c and rbd/i+c regimes. an unexpected result was the absence of neutralizing antibodies in the sera of rbd/i+c immunised mice, despite anti-rbd specific igg titres being similar for the rbd/i+c and rbd/a+c regimes. to investigate this discrepancy, we measured the antibody avidity induced by the different vaccination regimes using avidity elisa. however, the results showed that high antibody avidity correlated with a high igg titre in mice of both the rbd/i+c and rbd/a+c groups.
we therefore speculate that the adjuvants in the rbd/i+c regimen may have disrupted the conformation of rrbd and masked the antigen binding sites. another probable cause of the low titre of neutralizing antibodies in the sera of rbd/i+c immunised mice is the subcutaneous delivery route. subcutaneous delivery may be associated with degradation at the injection site, which leads to decreased bioavailability [ ]. further studies are in progress.

compared with other studies, the regimes in this study induced lower titres of neutralizing antibodies. for example, the pi of the alum plus cpg group, which showed the highest titre of neutralizing antibody among all immunization groups, was : , whereas the rrbd protein in the above studies achieved a neutralizing antibody titre of : in mice. the differences may be caused by the methods used to detect neutralizing antibodies. as described in the materials and methods section, the neutralizing antibodies in this study were detected with a pseudovirus system that can be used in biosafety level- facilities. nevertheless, the differences in induced neutralizing antibodies among the groups are clearly shown.

the subclass of immunoglobulin induced after immunization is an indirect measure of the relative contribution of th -type cytokines vs. th -type cytokines [ ]. to characterise the immune response to the different vaccination regimes, igg isotype analyses covering igg , igg a and igg b were performed. as expected, the rbd/a regime produced a th response with a high igg /igg ratio. in contrast, mice that received the rbd/a+c or rbd/i+c regimes showed a th -skewed response. consistently, the rbd/a+c and rbd/i+c regimes induced a systemic cellular immune response in mice, as shown by elispot analysis. the high level of ifn-γ and il- in the cba was further evidence of the cellular immune response in mice. in addition, the mice in the rbd/a+c group had high levels of il- and il- , which are an index of a th -skewed response.
taken together, the rbd/a+c regime induced mixed th and th immune responses, although the responses had a th inclination. a mixed th /th response for better protection was our original aim. similarly, the rbd/i+c regime induced mixed th and th responses. unfortunately, the rbd/i+c regime could not induce effective neutralizing antibodies, which are the most important factor for a prophylactic vaccine.

overall, in this study, mers-cov s rrbd combined with the adjuvants alum and cpg produced the most robust immune response. this indicates that the combination of alum and cpg was the optimal strategy for i.m. rrbd antigen delivery in a murine model. this result will facilitate future mers-cov vaccine design. the results of the present study also support the importance of the adjuvant system approach, although adjuvant combinations do not always produce the desired response, as seen with rbd/i+c. consistent with the results of the present study, cpg plus alum was found to induce protective humoral, as well as cellular, immunity in mice immunised with a recombinant haemagglutinin vaccine that protected against influenza virus challenge [ ]. the favourable immunity induced by the cpg and alum combination may be the result of mutual complementation of these two adjuvants. it is well known that alum can promote antibody-mediated protective immunity. however, alum is a poor inducer of cellular immune responses [ ]. recently, adjuvants including oil-in-water emulsions have shown improved efficacy for avian influenza protection, suggesting that even for diseases where humoral immunity can confer protection, cellular immune responses may be necessary in vaccine design [ ]. the key features of cpg-odn used as a vaccine adjuvant include the ability to elicit th cell responses and, only under certain conditions, cd + cytotoxic t cell responses, as well as an additional ability to divert the pre-existing th response in neonates and elderly mice toward a th phenotype [ ].
thus, we expect that the combination of alum and cpg will prove applicable in a range of infectious diseases that have defeated current immunisation strategies.

apart from the choice of adjuvants in combination with an optimal protective antigen, practical factors such as the antigen:adjuvant ratio, dose, vaccination regimen and often the route of administration strongly affect both the effectiveness and the safety of the vaccine formulation. in most cases, an experimental vaccine will be initially tested in an animal model [ ]. to evaluate the immunogenicity of rrbd protein thoroughly, it is necessary to test the protective effects of rrbd subunit immunisation in an animal model of mers-cov infection. to date, rhesus macaques have been reported to develop pneumonia-like symptoms within h of mers-cov infection [ ], and we are testing the effects of rrbd immunisation in rhesus macaques. considerable efforts are being made to establish a small animal model of mers-cov infection. although the lung cells of the syrian hamster express the receptor for mers-cov, they are not susceptible to mers-cov infection [ , ]. recently, a mouse model of mers-cov infection was reportedly generated by transduction of mice with adenoviral vectors expressing dpp [ ]. in the future, the protective effect of the rbd/a+c vaccination should be investigated in this murine model of mers-cov infection.
isolation of a novel coronavirus from a man with pneumonia in saudi arabia middle east respiratory syndrome coronavirus (mers-cov): challenges in identifying its source and controlling its spread middle east respiratory syndrome coronavirus (mers-cov) -update evidence for camel-to-human transmission of mers coronavirus concerns about misinterpretation of recent scientific data implicating dromedary camels in epidemiology of middle east respiratory syndrome (mers) middle east respiratory syndrome coronavirus (mers-cov) rna and neutralising antibodies in milk collected according to local customs from dromedary camels intranasal vaccination with recombinant receptor-binding domain of mers-cov spike protein induces much stronger local mucosal immune responses than subcutaneous immunization: implication for designing novel mucosal mers vaccines a truncated receptor-binding domain of mers-cov spike protein potently inhibits mers-cov infection and induces strong neutralizing antibody responses: implication for developing therapeutics and vaccines identification of a receptorbinding domain in the s protein of the novel human coronavirus middle east respiratory syndrome coronavirus as an essential target for vaccine development the receptor binding domain of the new middle east respiratory syndrome coronavirus maps to a -residue region in the spike protein that efficiently elicits neutralizing antibodies vaccine adjuvants: mode of action ) ph-triggered microparticles for peptide vaccination immunomodulatory properties of the vaccine adjuvant alum tlr-based immune adjuvants cpg motif-based adjuvant as a replacement for freund's complete adjuvant in a recombinant lhrh vaccine synthetic double-stranded rnas are adjuvants for the induction of t helper and humoral immune responses to human papillomavirus in rhesus macaques poly(i:c)/alum mixed adjuvant priming enhances hbv subunit vaccine-induced immunity in mice when combined with recombinant adenoviral-based hbv vaccine 
boosting the adjuvanticity of an o. volvulus-derived rov-asp- protein in mice using sequential vaccinations and in non-human primates adjuvant-dependent modulation of th and th responses to immunization with beta-amyloid unmet needs in modern vaccinology: adjuvants to improve the immune response molecular basis of binding between novel human coronavirus mers-cov and its receptor cd antibody avidity and immunoglobulin g isotype distribution following immunization with a monovalent meningococcal b outer membrane vesicle vaccine a safe and convenient pseudovirus-based inhibition assay to detect neutralizing antibodies and screen for viral entry inhibitors against the novel human coronavirus mers-cov rapid generation of a mouse model for middle east respiratory syndrome receptor-binding domain as a target for developing sars vaccines roadmap to developing a recombinant coronavirus s protein receptor-binding domain vaccine for severe acute respiratory syndrome is the discovery of the novel human betacoronavirus c emc/ (hcov-emc) the beginning of another sars-like pandemic? 
genetic characterization of betacoronavirus lineage c viruses in bats reveals marked sequence divergence in the spike protein of pipistrellus bat coronavirus hku in japanese pipistrelle: implications for the origin of the novel middle east respiratory syndrome coronavirus dipeptidyl peptidase is a functional receptor for the emerging human coronavirus-emc randomized, double-blind, phase a trial of falciparum malaria vaccines rts,s/as b and rts,s/as a in malaria-naive adults: safety, efficacy, and immunologic associates of protection subcutaneous drug delivery: a route to increased safety, patient satisfaction, and reduced costs insect cell-expressed hemagglutinin with cpg oligodeoxynucleotides plus alum as an adjuvant is a potential pandemic influenza vaccine candidate trends in vaccine adjuvants adjuvant activity of cpg-odn formulated as a liquid crystal pneumonia from human coronavirus in a macaque model the middle east respiratory syndrome coronavirus (mers-cov) does not replicate in syrian hamsters key: cord- -auook y authors: zhao, guangyu; he, lei; sun, shihui; qiu, hongjie; tai, wanbo; chen, jiawei; li, jiangfan; chen, yuehong; guo, yan; wang, yufei; shang, jian; ji, kaiyuan; fan, ruiwen; du, enqi; jiang, shibo; li, fang; du, lanying; zhou, yusen title: a novel nanobody targeting middle east respiratory syndrome coronavirus (mers-cov) receptor-binding domain has potent cross-neutralizing activity and protective efficacy against mers-cov date: - - journal: j virol doi: . /jvi. - sha: doc_id: cord_uid: auook y the newly emerged middle east respiratory syndrome coronavirus (mers-cov) continues to infect humans and camels, calling for efficient, cost-effective, and broad-spectrum strategies to control its spread. nanobodies (nbs) are single-domain antibodies derived from camelids and sharks and are potentially cost-effective antivirals with small size and great expression yield. 
in this study, we developed a novel neutralizing nb (nbms ) and its human-fc-fused version (nbms -fc), both of which target the mers-cov spike protein receptor-binding domain (rbd). we further tested their receptor-binding affinity, recognizing epitopes, cross-neutralizing activity, half-life, and efficacy against mers-cov infection. both nbs can be expressed in yeasts with high yield, bind to mers-cov rbd with high affinity, and block the binding of mers-cov rbd to the mers-cov receptor. the binding site of the nbs on the rbd was mapped to be around residue asp , which is part of a conserved conformational epitope at the receptor-binding interface. nbms and nbms -fc maintained strong cross-neutralizing activity against divergent mers-cov strains isolated from humans and camels. particularly, nbms -fc had significantly extended half-life in vivo; a single-dose treatment of nbms -fc exhibited high prophylactic and therapeutic efficacy by completely protecting humanized mice from lethal mers-cov challenge. overall, this study proves the feasibility of producing cost-effective, potent, and broad-spectrum nbs against mers-cov and has produced nbs with great potentials as anti-mers-cov therapeutics. importance therapeutic development is critical for preventing and treating continual mers-cov infections in humans and camels. because of their small size, nanobodies (nbs) have advantages as antiviral therapeutics (e.g., high expression yield and robustness for storage and transportation) and also potential limitations (e.g., low antigen-binding affinity and fast renal clearance). here, we have developed novel nbs that specifically target the receptor-binding domain (rbd) of mers-cov spike protein. they bind to a conserved site on mers-cov rbd with high affinity, blocking rbd's binding to mers-cov receptor. through engineering a c-terminal human fc tag, the in vivo half-life of the nbs is significantly extended. 
moreover, the nbs can potently cross-neutralize the infections of diverse mers-cov strains isolated from humans and camels. the fc-tagged nb also completely protects humanized mice from lethal mers-cov challenge. taken together, our study has discovered novel nbs that hold promise as potent, cost-effective, and broad-spectrum anti-mers-cov therapeutic agents.

ities, neutralization mechanisms, cross-neutralizing activity against divergent mers-cov strains, half-life, and protective efficacy against lethal mers-cov infection in an established hdpp -tg mouse model ( ). this study reveals that efficacious, robust, and broad-spectrum nbs can be produced to target mers-cov s protein rbd and that they hold great promise as potential anti-mers-cov therapeutics.

identification and characterization of mers-cov-rbd-specific nbs. to construct the nb (i.e., vhh) library, we immunized llama with recombinant mers-cov rbd (residues to , emc strain) containing a c-terminal human igg fc tag (i.e., rbd-fc) and isolated peripheral blood mononuclear cells (pbmcs) from the immunized llama. after four rounds of bio-panning and screening using mers-cov rbd-fc, we isolated a positive clone with the highest binding affinity for the rbd. the gene encoding this rbd-specific nb was subcloned into a yeast expression vector to construct nbms (which contains a c-terminal his tag) and nbms -fc (which contains a c-terminal human igg fc tag) nbs (fig. ). both nbms and nbms -fc were expressed in yeast cells, secreted into the cell culture supernatants, and purified to homogeneity (fig. a, left). the estimated molecular weights were about kda for nbms and kda for nbms -fc, since the latter formed a dimer. these mers-cov rbd-specific nbs from llama, but not severe acute respiratory syndrome coronavirus (sars-cov) rbd-specific mabs from mice, were recognized by anti-llama antibodies (fig. a, right). thus, the yeast-expressed nbs maintained their native conformation and antigenicity. to characterize their functions, we examined how the nbs interact with mers-cov rbds. first, we evaluated the binding between the nbs and mers-cov rbd using elisa. the result showed that both nbs bound strongly to recombinant mers-cov rbd containing a c-terminal foldon tag (rbd-fd) and mers-cov s containing a c-terminal his tag (s -his) in a dose-dependent manner (fig. b). second, we determined the binding affinity of the two nbs for mers-cov rbd using surface plasmon resonance (spr). the result showed that the k d between nbms and rbd-fc was . nm, whereas the k d between nbms -fc and s -his was . nm (fig. c). third, we carried out a mers-cov neutralization assay. the result showed that the nbs efficiently neutralized the infection of live mers-cov (emc strain) in vero cells. the measured % neutralization doses (nd ) were . μg/ml for nbms and . μg/ml for nbms -fc (fig. d). taken together, the nbs strongly bound to mers-cov rbd and neutralized mers-cov infection.

[figure legend] nbms -fc nbs. blood was collected from mers-cov rbd-fc protein-immunized alpaca after the last immunization to isolate pbmcs. rna was then extracted to synthesize cdna via rt-pcr. this was followed by pcr amplification of the n-terminal igg heavy-chain fragment (~ bp), including the vhh gene, while the latter was used as the template to amplify the vhh gene fragment (~ to bp). the vhh dna sequence was further ligated into phagemid vector pcantab e and transformed into e. coli tg competent cells to construct the vhh library. vhh phage display was carried out to isolate rbd-specific clones. after four rounds of bio-panning, the rbd-specific vhh coding sequence was confirmed from the selected positive clones. the identified vhh coding gene containing a c-terminal his or human igg fc was inserted into pichia pastoris yeast expression vector ppiczαa to construct nbms and nbms -fc, respectively, for further soluble expression and purification.

[figure legend] the nbs were subjected to sds-page (left) or western blotting (right), followed by detection using anti-llama antibody. the molecular weight marker (in kda) is indicated on the left. (b) detection of binding between nbms or nbms -fc and mers-cov s (mers-s ) or rbd (mers-rbd) protein by elisa. the plates were coated with mers-cov s -his or rbd-fd protein ( μg/ml), followed by sequential incubation with respective nbs and goat anti-llama and hrp-conjugated anti-goat igg antibodies. the data are presented as mean a values ± the standard deviation (sd) (n = ). significant differences (*, **, and ***) are shown in the binding of nbs to mers-s or mers-rbd at various concentrations. (c) the binding kinetics between nbms or nbms -fc and mers-cov rbd or s protein were measured by spr. mers-cov rbd-fc protein was used for binding to nbms (containing a c-terminal his ), and s -his protein was used for binding to nbms -fc (containing a c-terminal human fc). (d) detection of nbms and nbms -fc neutralizing activity against mers-cov infection (emc strain) by a microneutralization assay. the nb-mers-cov mixtures were incubated with vero e cells and observed for the presence or absence of cpe. neutralizing activity of nbs was recorded as the concentration of nbs in complete inhibition of mers-cov-induced cpe in at least % of the wells (nd ). the data are expressed as mean nd ± the sd (n = ). the experiments were repeated twice, and similar results were obtained. the "(−) control" in panels a, b, and d refers to sars-cov g mouse mab.

molecular mechanism underlying the neutralizing activities of nbs. to investigate the mechanism underlying the neutralizing activities of nbs, we evaluated the competition between the nbs and hdpp for the binding to mers-cov rbd. first, we carried out a flow cytometry assay where recombinant mers-cov rbd interacted with cell-surface-expressed dpp in the presence or absence of recombinant nbs.
the result showed that both nbs significantly blocked the binding of rbd to cell-surface dpp in a dose-dependent manner (fig. a and b). as a negative control, sars-cov-rbd-specific g mab did not block the binding between mers-cov rbd and cell surface dpp (fig. a and b). second, we carried out an enzyme-linked immunosorbent assay (elisa) where recombinant mers-cov rbd and recombinant hdpp interacted in the presence or absence of recombinant nbs. the result showed that both nbs, but not g mab, blocked the binding between mers-cov rbd and dpp in a dose-dependent manner. moreover, compared to nbms , nbms -fc blocked the rbd-dpp binding more efficiently (fig. c). these data reveal that the nbs can compete with hdpp for the binding to mers-cov rbd, suggesting that the nb-binding site and the dpp -binding site overlap on the mers-cov rbd.

[figure legend] were coated with mers-cov rbd-fc protein ( μg/ml), followed by sequential incubation with serial dilutions of nbs or hdpp protein ( μg/ml), goat anti-hdpp , and hrp-conjugated anti-goat igg antibodies. the percent inhibition was calculated from the rbd-hdpp binding in the presence and absence of nbs according to the following formula: (1 − rbd-hdpp -nb/rbd-hdpp ) × 100. a significant difference (***) occurred between nbms and nbms -fc in inhibition of rbd-hdpp binding. the "(−) control" in panels b and c refers to sars-cov g mab. the data are presented as the mean percent inhibition ± the sd (n = ). the experiments were repeated twice, and similar results were obtained.

to map the binding site of the nbs on mers-cov rbd, we performed alanine scanning on the surface of mers-cov rbd and detected the binding of nbs to the alanine-containing rbd mutants.
the results showed that nbms demonstrated tight binding to mers-cov rbd containing the single mutations l a, d a, r a, e a, e a, w a, v a, and e a and slightly reduced binding to rbd containing triple mutations l f-d g-v a, suggesting that these rbd residues do not play significant roles in nb binding. instead, single mutation d a and double mutations e a-d a on mers-cov rbd both ablated the binding of nbms to the rbd (fig. a), suggesting that rbd residue asp plays an important role in nb binding. we further investigated the role of asp in nb binding using the mers-cov pseudovirus entry assay. neither nbms nor nbms -fc could neutralize the cell entry of mers-cov pseudovirus bearing the d a mutation, again confirming that residue asp is critical for nb binding (fig. b). to examine the role of the d a mutation in dpp binding, we carried out an elisa to detect the binding between dpp and mers-cov rbd bearing the d a mutation. the result showed that the d a mutation significantly reduced the binding of the rbd to dpp (fig. c). overall, these results demonstrate that nbs recognize the asp -containing epitope on mers-cov rbd and that this epitope also plays an important role in dpp binding. therefore, the nbs and dpp compete for the same region on mers-cov rbd, and mutations in this region can reduce the binding of both the nbs and dpp .

[figure legend] the plates were coated with rbd-fd protein ( μg/ml) and treated with or without dtt, followed by sequential incubation with serial dilutions of nbms or nbms -fc and goat anti-llama and hrp-conjugated anti-goat igg antibodies. the data are presented as mean a values ± the sd (n = ). the "(−) control" in panels b and d refers to sars-cov g mab. the above-described experiments were repeated twice, and similar results were obtained.

to investigate whether nb-recognized epitopes on mers-cov rbd are conformational or linear, we detected the binding of nbs to mers-cov rbd with its conformational structure disrupted.
to this end, we treated mers-cov rbd with reducing agent dithiothreitol (dtt) to break the disulfide bonds in the protein, and performed an elisa on the binding between nbs and dtt-treated rbd. the result showed that neither nbms nor nbms -fc bound to the dtt-treated rbd (fig. d). as a control, both nbs bound to untreated rbd with high affinity. thus, the nbs recognize a conformational epitope on the rbd. to understand the structural mechanism underlying the neutralizing activities of the nbs, we examined the competitive interactions among the nbs, dpp , and mers-cov rbd using structural modeling (fig. ). in the absence of the nbs, mers-cov rbd binds tightly to the dpp receptor, with d of rbd serving as a key residue at the binding interface (fig. a). here, rbd residue d forms a critical salt bridge with dpp , and it interacts with the surrounding key rbd residues via van der waals contacts and hydrogen bonds (fig. b), enabling rbd and dpp to maintain strong binding interactions. the nbs bind tightly to the rbd in the same d -containing region, abolishing the binding between rbd and dpp (fig. c).

cross-neutralizing activity of nbs against divergent mers-cov strains. to investigate the cross-neutralizing activity of nbs against divergent mers-cov isolates, we performed a mers-cov pseudovirus entry assay in the presence of the nbs where the pseudoviruses encode the s gene of various mers-cov isolates from different countries (saudi arabia, qatar, and south korea), hosts (human and camels), and time periods ( to ). these mers-cov strains all contain mutations in their rbds. the results showed that both nbs potently neutralized the cell entry of all of the mers-cov pseudoviruses, with the nd values ranging from . to . μg/ml (for nbms ) and from . to . μg/ml (for nbms -fc) (table ).
therefore, although the nbs were developed using the rbd from one mers-cov strain (emc ), they have broad-spectrum cross-neutralizing activity against existing mers-cov strains, as well as potentially future emerging mers-cov strains. in vivo half-life of nbs. to evaluate the in vivo half-life of the nbs, we injected the nbs into mice, collected the sera from the mice after different time intervals, and measured the binding between the sera and recombinant mers-cov s using elisa. the results showed that the sera collected from nbms -injected mice gradually lost their binding affinity for mers-cov s , and completely lost their binding for mers-cov s days postinjection (fig. a ). in comparison, nbms -fc demonstrated stable binding for recombinant mers-cov s at days postinjection (fig. b) . as a control experiment, sera collected from pbs-injected mice showed no binding for recombinant mers-cov s (fig. c) . thus, compared to monomeric nb, fc-fused nb has a significantly extended in vivo half-life likely due to its dimeric structure, which increases the molecular weight of nb from to kda and hence may slow down its renal clearance. prophylactic and therapeutic efficacy of nb in transgenic mice. because mers-cov does not infect wild-type mice, we previously developed hdpp -tg mice ( ) as the susceptible animal model for mers-cov research. to evaluate the prophylactic efficacy of nbms -fc, mice were injected with a single dose of nbms -fc days before they were infected with a lethal dose of mers-cov and were subsequently monitored for their weight and survival. trastuzumab, an antibody used for treating breast cancer, was used as a control. the result showed that after mers-cov infection, mice treated with nbms -fc had a % survival rate (fig. a, above) and steady weight (fig. a, below) . in comparison, mice treated with trastuzumab all died on day postinfection, and their weight also sharply decreased starting from day postinfection (fig. a) . 
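as a minimal sketch of how survival outcomes such as these can be summarized, the kaplan-meier product-limit estimate (the survival analysis named in the statistical methods of this study) is computed below. the group sizes, days of death, follow-up length, and the function name `kaplan_meier` are hypothetical illustrations and are not data or code from the study; the log-rank comparison between curves is not shown.

```python
from collections import Counter


def kaplan_meier(times, died):
    """Kaplan-Meier product-limit estimate.

    times: day of death, or day of last observation for survivors.
    died:  True if the animal died on that day, False if censored.
    Returns [(day, survival_probability)] for each day with a death.
    """
    deaths = Counter(t for t, d in zip(times, died) if d)
    survival = 1.0
    curve = []
    for day in sorted(deaths):
        at_risk = sum(1 for t in times if t >= day)  # still under observation
        survival *= 1.0 - deaths[day] / at_risk
        curve.append((day, survival))
    return curve


# hypothetical groups of six mice each (illustrative values only)
control = kaplan_meier([7, 8, 8, 9, 9, 10], [True] * 6)
treated = kaplan_meier([14] * 6, [False] * 6)  # all alive at end of follow-up
print(control)
print(treated)  # empty list: no deaths, so survival stays at 1.0
```

a group in which every animal survives produces no steps at all, which is why a fully protected group appears as a flat line at the top of a survival plot.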
to evaluate the therapeutic efficacy of nbms -fc, mice were first infected with mers-cov and then treated with single-dose nbms -fc either or days postinfection. the result showed that mice treated with nbms -fc on day postinfection had a % survival rate and steady weight (fig. b). in addition, mice treated with nbms -fc on day postinfection also had a % survival rate (fig. c, above); although their weights first decreased on day postinfection, they rebounded on day postinfection (fig. c, below). in comparison, mice receiving trastuzumab all died on day after infection, and their weights continuously decreased (fig. b and c). overall, nbms -fc has potent prophylactic and therapeutic efficacy in protecting susceptible animal models against lethal mers-cov challenge.

[figure legend] virus-challenged mice were monitored for days to evaluate survival rate (above) and body weight changes (below). the body weight data are presented as means ± the sd of mice in each group (n = ). significant differences (** and ***) are indicated between the nbms -fc and control groups.

mers-cov continues to infect humans with a high fatality rate. because camels likely serve as the transmission hosts for mers-cov and also because humans have contact with camels, the constant and continuing transmissions of mers-cov from camels to humans make it difficult to eradicate mers-cov from the human population. thus, efficacious, cost-effective, and broad-spectrum anti-mers-cov therapeutic agents are needed to prevent and treat mers-cov infections in both humans and camels. nbs have been gaining acceptance as antiviral agents because of their small size, good tissue permeability, and cost-effective production, storage, and transportation. however, their small size may also lead to relatively low antigen-binding affinity and quick clearance from the host body.
in this study, we have developed a novel mers-cov-targeting nb, nbms , and its fc-fused version, nbms -fc, both of which demonstrate great promise as anti-mers-cov therapeutic agents. nbms and nbms -fc present superior characteristics common to other nbs. they target the mers-cov rbd, which plays an essential role in cell entry of mers-cov by binding to its receptor hdpp . both nbs can be expressed in yeast cells with high purity and yields and are soluble in solutions. all of these properties suggest cost-effective production, easy storage, and convenient transportation of these nbs in potential commercial applications. the mers-cov rbd-targeting nbs developed also demonstrate good qualities comparable to previously reported mers-cov rbd-specific conventional iggs. first, the nbs bind to mers-cov rbd with high affinities. the k d values for nbms and nbms -fc to bind mers-cov rbd were . × − m and . × − m, respectively. the k d values for rbd-targeting conventional iggs to bind mers-cov rbd range from . × − m to . × − m ( , , ). moreover, the nd values for nbms and nbms -fc to neutralize mers-cov (emc strain) infection in cultured cells were . and . μg/ml, respectively. the nd values for rbd-specific conventional iggs to neutralize various mers-cov strains ranged from micrograms/ml to nanograms/ml ( , , , , ). thus, the nbs developed in this study and conventional iggs reported previously have comparable mers-cov rbd-binding affinities and mers-cov-neutralizing activities. structural comparisons of conventional iggs and nbs have shown that the antigen-binding site of iggs consists of paired heavy-chain and light-chain variable (vh-vl) domains, whereas nbs lack the light chain and hence cannot form the paired vh-vl domains ( , ). instead, nbs have an extended cdr region (> amino acid residues), longer than that of the vhs of conventional iggs (average length amino acid residues) ( ) ( ) ( ).
moreover, the nbs developed here contain a -amino-acid cdr ; the extended cdr enables the nbs to bind to the antigens with higher affinity ( ). furthermore, although the single-domain nb (i.e., nbms ) is small and can be cleared from the serum relatively quickly, the fc-fused nb (i.e., nbms -fc), with its increased size, demonstrates an extended in vivo half-life. therefore, the potentially short half-life of nbs can be overcome by adding an appropriate tag to the nbs. overall, the present study has shown the feasibility of overcoming the potential limitations of nbs. the mers-cov rbd-targeting nbs potently neutralize mers-cov entry into host cells. the k d values between the nbs and mers-cov rbd are significantly lower than the k d between mers-cov rbd and the hdpp receptor. as a result, the nbs can outcompete hdpp for the binding of mers-cov rbd, thereby blocking the binding of mers-cov to dpp , as well as mers-cov entry into host cells. it is worth noting that the rbd on the mers-cov s trimer frequently undergoes conformational changes, switching between a lying-down, receptor-inaccessible conformation and a standing-up, receptor-accessible conformation. hence, in the context of the virus particles where the rbd is part of the s protein, the nbs would need to bind the rbd when the rbd is in the standing-up conformation ( ). importantly, the nbs demonstrate strong cross-neutralizing activities against various mers-cov strains isolated from different hosts (humans and camels) and from different time points during mers-cov circulation in humans (from years to ). nbms had a relatively high nd against the agv / strain containing a v a mutation, which is consistent with the slightly reduced binding affinity between nbms and mers-cov rbd containing the v a mutation (fig. a).
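the competition argument made above (a tighter-binding nb displaces the receptor from the rbd) can be illustrated with the standard equilibrium expression for two ligands competing for one site. this is a sketch under stated assumptions: binding is 1:1 and mutually exclusive, both the receptor and the nb are in excess over rbd so that free and total concentrations are treated as equal, and every concentration and k d value below is hypothetical rather than a measurement from this study. the function name `receptor_occupancy` is likewise only illustrative.

```python
def receptor_occupancy(receptor_conc, kd_receptor, nb_conc=0.0, kd_nb=1.0):
    """Fraction of RBD sites occupied by receptor at equilibrium.

    Two ligands (receptor and Nb) compete for the same site:
        theta_receptor = (R/K_R) / (1 + R/K_R + N/K_N)
    All concentrations and K_D values must share one unit (e.g. nM).
    """
    r = receptor_conc / kd_receptor
    n = nb_conc / kd_nb
    return r / (1.0 + r + n)


# hypothetical numbers: receptor present at its own K_D, Nb binding 10x tighter
without_nb = receptor_occupancy(receptor_conc=10.0, kd_receptor=10.0)
with_nb = receptor_occupancy(receptor_conc=10.0, kd_receptor=10.0,
                             nb_conc=10.0, kd_nb=1.0)
print(round(without_nb, 3), round(with_nb, 3))
```

in this toy setting, adding an equal concentration of a competitor with a tenfold lower k d reduces receptor occupancy from one half to one twelfth, which is the qualitative behaviour the text describes.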
the broad neutralizing spectrum of the nbs results from the binding site of the nbs on mers-cov rbd, which is located in the asp containing region that plays a critical role in dpp binding. interestingly, several mers-cov rbd-specific conventional iggs also bind to the same epitope ( , ) , suggesting that this region is a hot spot for immune recognition. although mutations in this region can eliminate the binding of the nbs to mers-cov rbd and hence lead to viral immune evasion, they also reduce the binding of mers-cov rbd to receptor dpp and hence decrease the efficiency of viral entry. thus, viral immune evasion from the inhibition of the nbs through mutations can be costly to mers-cov itself. indeed, residue asp in s protein rbd is highly conserved in almost all of the natural mers-cov strains published to date (fig. ) . therefore, the mers-cov-specific nbs can potentially be developed into broad-spectrum anti-mers-cov therapeutic agents. despite the above analysis, this study did not examine all possible mutations in the nb-binding region (since the atomic structures of mers-cov rbd complexed with the nbs are still unknown), and thus it is possible that future escape mutations may occur to residues that this study did not cover. in that case, a combination of the current nbs and other antibodies targeting other s regions or various rbd epitopes may be helpful in battling the emergence of immune escape mers-cov strains. in sum, the mers-cov-specific nbs developed in the present study possess superior qualities common to all nbs such as their small size and cost-effective production. they also overcome potential limitations of other nbs by maintaining a high binding affinity for their target mers-cov rbd and an optimized half-life. moreover, they recognize a functionally important region on mers-cov rbd, rendering viral immune evasion costly and at the same time making themselves good candidates as broad-spectrum anti-mers-cov therapeutics. 
we have confirmed the effectiveness of the nbs by showing that the fc-fused nb completely protected animal models from lethal mers-cov challenge. thus, the nbs can potentially be used in both humans and camels to prevent and treat mers-cov infections in either of these hosts and also block the camel-to-human transmission of mers-cov. overall, our study proves the feasibility of developing highly effective nbs as anti-mers-cov therapeutic agents and points out strategies to preserve the advantages of nbs, as well as to overcome the potential limitations of nbs.

construction of vhh library and screening for mers-cov-rbd-specific nbs. construction of the nb (i.e., vhh) library and screening of mers-cov-rbd-specific nbs were performed as previously described ( ). briefly, male and female alpacas (llama pacos, year) were subcutaneously immunized with recombinant rbd-fc ( μg/alpaca) ( ) plus freund complete adjuvant, and boosted three times with the same immunogen plus freund incomplete adjuvant (invivogen). blood was collected days after the last immunization, and then pbmcs were isolated using ficoll-paque gradient centrifugation (ge healthcare). total rna was extracted with trizol reagent (invitrogen). cdna was synthesized by reverse transcription-pcr (rt-pcr) using a transscript cdna synthesis supermix (transgen biotech, china), followed by pcr amplification of the n-terminal igg heavy-chain fragment (~ bp), using the forward primer vhh-l-f (5′-ggtggtcctggctgc-3′) and the reverse primer ch -r (5′-ggtacgtgctgttgaactgttcc-3′). the vhh gene (~ to bp) was further amplified using the above dna fragment as the template and the forward primer vhh-fr -d-f (5′-tttctattactaggcccagccggccgagtctggaggrrgcttggtgca-3′) and the reverse primer vhh-fr -d-r (5′-aaaccgttggccataatggcctgaggagacgrtgacstsggtc-3′) (the sfii restriction site is underlined).
the sfii-digested vhh dna fragment was then inserted into phagemid vector pcantab e (bio-view shine biotechnology, china) to construct the vhh phage display library ( ) . phage particles were analyzed by elisa using recombinant mers-cov rbd-fc and fc of human igg proteins as the positive and negative target proteins, respectively, to screen for rbd-specific nbs. after four rounds of bio-panning, one of five positive clones, cab , with the highest binding to mers-cov rbd, was selected for further analyses (fig. ) . expression of mers-cov-rbd-specific nbs in yeast cells. nbms and nbms -fc nbs containing a c-terminal his and fc of human igg , respectively, were constructed based on the aforementioned cab vhh. the dna sequences encoding nbab and nbab -fc were synthesized (genscript) and inserted into the pichia pastoris secretory expression vector, ppicz␣a (invitrogen) (fig. ) . the recombinant nbms and nbms-fc were expressed in pichia pastoris gs cells and purified using a ni-nta column (for nbms ; ge healthcare) and a protein a sepharose fast flow column (for nbms -fc; ge healthcare), respectively. sds-page and western blotting. the purified anti-mers-cov-rbd nbs were analyzed using sds-page and western blotting ( , ) . briefly, nbs ( g) were loaded onto % tris-glycine sds-page gels and stained using coomassie brilliant blue or transferred to nitrocellulose membranes. after being blocked overnight at °c with % nonfat milk/phosphate-buffered saline-tween ( % pbst), the membranes were incubated sequentially with goat anti-llama igg ( : , ; abcam) and horseradish peroxidase (hrp)-conjugated anti-goat igg ( : , ; r&d systems) antibodies for h at room temperature and then with ecl western blot substrate reagents. finally, the membranes were visualized using amersham hyperfilm (ge healthcare). a sars-cov-rbd-specific mab, g ( ) , was used as a control. elisa. elisa was performed to detect the binding between nbs and mers-cov s or rbd proteins ( , ) . 
briefly, elisa plates were coated overnight at °c, respectively, with recombinant mers-cov s -his ( ), rbd-fc ( ), rbd-fd ( ), or one of the mutant rbds containing a c-terminal human fc tag ( ) . after being blocked with % pbst for h at °c, the plates were further incubated sequentially with serially diluted nbs (containing a c-terminal his or fc tag), either goat anti-llama ( : , ) or mouse anti-his ( : , ) antibody (sigma) and either hrp-conjugated anti-goat igg ( : , ) or hrp-conjugated anti-mouse igg ( : , ) antibody (ge healthcare) for h at °c. elisa substrate ( , =, , =tetramethylbenzidine [tmb]; invitrogen) was added to the plates, and the reactions were stopped with n h so . the absorbance at nm (a ) was measured using a tecan infinite pro microplate reader (tecan). to detect the binding between nbs and denatured mers-cov rbd protein, elisa plates were coated with rbd-fd protein ( g/ml) overnight at °c and then sequentially incubated with dtt ( mm) and iodoacetamide ( mm) (sigma) for h at °c ( ) . after three washes using pbst, elisa was performed as described above. inhibition of the binding between mers-cov rbd and hdpp proteins by nbs was performed using elisa as described above, except that recombinant hdpp protein ( g/ml; r&d systems), and serially diluted nbs were added simultaneously to the rbd-fc-coated plates. the binding between rbd and dpp was detected using goat anti-hdpp antibody ( : , ; r&d systems) and hrp-conjugated anti-goat igg ( : , ). the percent inhibition was calculated based on the a values of rbd-hdpp binding in the presence or absence of nbs. sars-cov g mab was used as a negative control to nbs. surface plasmon resonance. the binding between nbs and mers-cov s or rbd protein was detected using a biacores instrument (ge healthcare) as previously described ( ) . 
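as a rough sketch of the percent-inhibition calculation described for the inhibition elisa above, the snippet below applies the usual form, (1 − signal with nb / signal without nb) × 100, to absorbance readings. the numerals 1 and 100 are assumed here because they are what a percentage requires; the absorbance values, concentrations, and the function name `percent_inhibition` are hypothetical and do not come from the study.

```python
def percent_inhibition(signal_with_nb, signal_without_nb):
    """Percent inhibition of RBD-receptor binding from two absorbance values.

    signal_with_nb:    absorbance of RBD-receptor binding with Nb present.
    signal_without_nb: absorbance of RBD-receptor binding with no Nb.
    """
    if signal_without_nb <= 0:
        raise ValueError("uninhibited signal must be positive")
    return (1.0 - signal_with_nb / signal_without_nb) * 100.0


# hypothetical absorbance readings at three Nb concentrations (not study data)
uninhibited = 1.20
readings = {10.0: 0.15, 1.0: 0.60, 0.1: 1.08}
for conc, absorbance in readings.items():
    print(conc, round(percent_inhibition(absorbance, uninhibited), 1))
```

a reading equal to the uninhibited signal gives 0% inhibition, and a reading of zero gives 100%, so a dose-dependent blocker traces a curve between those limits as its concentration rises.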
briefly, recombinant fc-fused mers-cov rbd-fc protein or nbms -fc nb ( g/ml) was captured using a sensor chip protein a (ge healthcare), and recombinant his -tagged mers-cov s -his protein or nbms nb at various concentrations was flown over the chip surface in a running buffer containing mm hepes (ph . ), mm nacl, mm edta, and . % surfactant p . the sensorgram was analyzed using biacore s software, and the data were fitted to a : binding model. flow cytometry. this assay was performed to detect the inhibition of the binding between mers-cov rbd and cell surface hdpp by nbs ( ) . briefly, huh- cells expressing hdpp were incubated with mers-cov rbd-fc protein ( g/ml) for min at room temperature in the absence or presence of nbs at various concentrations. cells were incubated with fluorescein isothiocyanate-labeled antihuman igg antibody ( : , sigma) for min and then analyzed by flow cytometry. the percent inhibition was calculated based on the fluorescence intensity of rbd-huh- cell binding in the presence or absence of nbs. mers pseudovirus neutralization assay. neutralization of mers pseudovirus entry by nbs was performed as previously described ( , ) . briefly, t cells were cotransfected with a plasmid encoding env-defective, luciferase-expressing hiv- genome (pnl - .luc.re) and a plasmid encoding mers-cov s protein. the mers pseudoviruses were harvested from supernatants at h posttransfection and then incubated with nbs at °c for h before being added to huh- cells. after h, the cells were lysed in cell lysis buffer (promega), incubated with luciferase substrate (promega), and assayed for relative luciferase activity using tecan infinite pro luminator (tecan). the nd of the nbs was calculated as previously described ( ) . mers-cov microneutralization assay. neutralization of mers-cov infection by nbs was performed as previously described ( , ) . 
briefly, mers-cov (emc strain) at an amount equal to median tissue culture infective doses (tcid ) was incubated with nbs at different concentrations for h at °c. the nb-virus mixture was then incubated with vero e cells for h at °c in the presence of % co . the cytopathic effect (cpe) was observed daily. the neutralizing activity of nbs was reported as the nd . the reed-muench method was used to calculate the nd value for each nb ( ) . measurement of half-life of nbs. male and female c bl/ mice ( to weeks old) were intravenously injected with nbs ( g in l per mouse) into the tail vein. sera were collected at different time points ( min, h, h, day, days, and days postinjection). the concentrations of nbs in the sera were detected by elisa, as described above. briefly, mers-cov s -his protein ( g/ml) was used to coat elisa plates, and then sera, goat anti-llama antibodies ( : , ), and hrp-conjugated anti-goat igg antibodies ( : , ) were sequentially added for elisa reactions. evaluation of protective efficacy of nbms -fc nb. the prophylactic and therapeutic efficacy of nbms -fc was evaluated in hdpp -tg mice as previously described ( ) . briefly, male and female mice ( to weeks old) were intraperitoneally anesthetized with sodium pentobarbital ( mg/kg of body weight) before being intranasally inoculated with lethal dose of mers-cov (emc strain, . tcid ) in l of dulbecco modified eagle medium. either days preinfection or or days postinfection, the mice were intraperitoneally injected with nbms -fc ( mg/kg). trastuzumab mab was used as a control to the nb. the infected mice were observed daily for days, and their body weights and survivals were recorded. statistical analysis. statistical analysis was performed using graphpad prism version . . to compare the binding of nbs to mers-cov s or rbd protein, as well as the rbds with or without d a mutation to hdpp receptor, a two-tailed student t test was used. 
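the reed-muench endpoint calculation named above for the microneutralization assay can be sketched as follows. this is a minimal illustration, not the authors' script: the function name `reed_muench_nd50`, the well counts, and the concentrations are hypothetical, and the endpoint is interpolated on a log10 concentration scale between the two concentrations that bracket 50% protection.

```python
import math


def reed_muench_nd50(concentrations, protected, unprotected):
    """Estimate the 50% neutralizing dose by the Reed-Muench method.

    concentrations: antibody concentration at each dilution step (any order)
    protected:      wells without cytopathic effect at each concentration
    unprotected:    wells with cytopathic effect at each concentration
    Each concentration is assumed to have at least one well.
    """
    # Sort from highest to lowest concentration.
    rows = sorted(zip(concentrations, protected, unprotected), reverse=True)
    concs = [r[0] for r in rows]
    prot = [r[1] for r in rows]
    unprot = [r[2] for r in rows]
    n = len(rows)

    # A well protected at a low concentration is assumed to stay protected at
    # every higher concentration, so protected wells accumulate upward.
    cum_prot = [sum(prot[i:]) for i in range(n)]
    # A well unprotected at a high concentration is assumed to stay
    # unprotected at every lower concentration.
    cum_unprot = [sum(unprot[: i + 1]) for i in range(n)]

    pct = [100.0 * p / (p + u) for p, u in zip(cum_prot, cum_unprot)]

    # Find the adjacent pair that brackets 50% and interpolate on a log scale.
    for i in range(n - 1):
        if pct[i] >= 50.0 > pct[i + 1]:
            distance = (pct[i] - 50.0) / (pct[i] - pct[i + 1])
            log_step = math.log10(concs[i]) - math.log10(concs[i + 1])
            return 10 ** (math.log10(concs[i]) - distance * log_step)
    raise ValueError("50% endpoint is not bracketed by the tested range")


# Hypothetical plate: two-fold dilution series, four wells per concentration.
example_nd50 = reed_muench_nd50(
    concentrations=[10.0, 5.0, 2.5, 1.25, 0.625],
    protected=[4, 4, 3, 1, 0],
    unprotected=[0, 0, 1, 3, 4],
)
```

in this made-up plate the cumulative protection is 80% at 2.5 and 20% at 1.25, so the endpoint falls halfway between them on the log scale.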
one-way analysis of variance was used to compare the inhibition of nbs to rbd-hdpp binding. statistical significance between survival curves was analyzed using kaplan-meier survival analysis with a log-rank test. p values lower than . were considered statistically significant. in the figures, "*," "**," and "***" indicate p < . , p < . , and p < . , respectively. data availability. all data needed to evaluate the conclusions presented here are included. additional data related to this study may be requested from the authors.

camelid and shark single domain antibodies: structural features and therapeutic potential
nanobody-based products as research and diagnostic tools
global analysis of vhhs framework regions with a structural alphabet
application of camelid heavy-chain variable domains (vhhs) in prevention and treatment of bacterial and viral infections
nanobodies® as inhaled biotherapeutics for lung diseases
generation and characterization of alx- , a potent novel therapeutic nanobody for the treatment of respiratory syncytial virus infection
nanobodies as therapeutics: big opportunities for small antibodies
nanobodies: natural single-domain antibodies
caplacizumab for acquired thrombotic thrombocytopenic purpura
phase i study of ga-her -nanobody for pet/ct assessment of her expression in breast carcinoma
the titan trial-assessing the efficacy and safety of an anti-von willebrand factor nanobody in patients with acquired thrombotic thrombocytopenic purpura
isolation of a novel coronavirus from a man with pneumonia in saudi arabia
middle east respiratory syndrome coronavirus (mers-cov): animal to human interaction
molecular evolution of mers coronavirus: dromedaries as a recent intermediate host or longtime animal reservoir?
mers-cov spike protein: a key target for antivirals
mers-cov spike protein: targets for vaccines and therapeutics
receptor recognition mechanisms of coronaviruses: a decade of structural studies
structure-based discovery of middle east respiratory syndrome coronavirus fusion inhibitor
molecular basis of binding between novel human coronavirus mers-cov and its receptor cd
structure of mers-cov spike receptor-binding domain complexed with human receptor dpp
dipeptidyl peptidase is a functional receptor for the emerging human coronavirus-emc
structure, function, and evolution of coronavirus spike proteins
recombinant receptor-binding domains of multiple middle east respiratory syndrome coronaviruses (mers-covs) induce cross-neutralizing antibodies against divergent human and camel mers-covs and antibody escape mutants
intranasal vaccination with recombinant receptor-binding domain of mers-cov spike protein induces much stronger local mucosal immune responses than subcutaneous immunization: implication for designing novel mucosal mers vaccines
identification of an ideal adjuvant for receptor-binding domain-based subunit vaccines against middle east respiratory syndrome coronavirus
vaccines for the prevention against the threat of mers-cov
introduction of neutralizing immunogenicity index to the rational design of mers coronavirus subunit vaccines
a conformation-dependent neutralizing monoclonal antibody specifically targeting receptor-binding domain in middle east respiratory syndrome coronavirus spike protein
single-dose treatment with a humanized neutralizing antibody affords full protection of a human transgenic mouse model from lethal middle east respiratory syndrome (mers)-coronavirus infection
prophylactic and postexposure efficacy of a potent human monoclonal antibody against mers coronavirus
pre- and postexposure efficacy of fully human antibodies against spike protein in a novel humanized mouse model of mers-cov infection
a humanized neutralizing antibody against mers-cov targeting the receptor-binding domain of the spike protein
b -n, a monoclonal antibody against mers-cov, reduces lung pathology in rhesus monkeys following intratracheal inoculation of mers-cov jordan-n /
efficacy of antibody-based therapies against middle east respiratory syndrome coronavirus (mers-cov) in common marmosets
potent neutralization of mers-cov by human neutralizing monoclonal antibodies to the viral spike glycoprotein
exceptionally potent neutralization of middle east respiratory syndrome coronavirus by human monoclonal antibodies
nanobodies and nanobody-based human heavy chain antibodies as antitumor therapeutics
multi-organ damage in human dipeptidyl peptidase transgenic mice infected with middle east respiratory syndrome-coronavirus
junctional and allele-specific residues are critical for mers-cov neutralization by an exceptionally potent germline-like antibody
identification of human neutralizing antibodies against mers-cov and their role in virus adaptive evolution
single-domain antibodies as versatile affinity reagents for analytical and diagnostic applications
single domain antibodies: promising experimental and therapeutic tools in infection and immunity
single domain camel antibodies: current status
engineering a camelid antibody fragment that binds to the active site of human lysozyme and inhibits its conversion into amyloid fibrils
cryo-em structures of mers-cov and sars-cov spike glycoproteins reveal the dynamic receptor binding domains
structural basis for the neutralization of mers-cov by a human monoclonal antibody mers-
isolation of antigen specific llama vhh antibody fragments and their high level secretion by saccharomyces cerevisiae
searching for an ideal vaccine candidate among different mers coronavirus receptor-binding fragments: the importance of immunofocusing in subunit vaccine design
single domain antibodies derived from dromedary lymph node and peripheral blood lymphocytes sensing conformational variants of prostate-specific antigen
receptor-binding domain of severe acute respiratory syndrome coronavirus spike protein contains multiple conformation-dependent epitopes that induce highly potent neutralizing antibodies
a recombinant receptor-binding domain of mers-cov in trimeric form protects human dipeptidyl peptidase (hdpp ) transgenic mice from mers-cov infection
a safe and convenient pseudovirus-based inhibition assay to detect neutralizing antibodies and screen for viral entry inhibitors against the novel human coronavirus mers-cov
theoretical basis, experimental design, and computerized simulation of synergism and antagonism in drug combination studies
receptor-binding domain of mers-cov with optimal immunogen dosage and immunization interval protects human transgenic mice from mers-cov infection
rapid human metapneumovirus microneutralization assay based on green fluorescent protein expression

this study was supported by the national key plan for scientific research and summarized and analyzed the data. j.s. and f.l. performed the structural analysis. g.z., f.l., l.d., and y.z. wrote the manuscript. s.j., f.l., l.d., and y.z. revised the manuscript.

key: cord- -oj re k authors: zhou, haixia; chen, yingzhu; zhang, shuyuan; niu, peihua; qin, kun; jia, wenxu; huang, baoying; zhang, senyan; lan, jun; zhang, linqi; tan, wenjie; wang, xinquan title: structural definition of a neutralization epitope on the n-terminal domain of mers-cov spike glycoprotein date: - - journal: nat commun doi: . /s - - - sha: doc_id: cord_uid: oj re k

most neutralizing antibodies against middle east respiratory syndrome coronavirus (mers-cov) target the receptor-binding domain (rbd) of the spike glycoprotein and block its binding to the cellular receptor dipeptidyl peptidase (dpp ). the epitopes and mechanisms of mabs targeting non-rbd regions have not been well characterized yet.
here we report the monoclonal antibody d that binds to the n-terminal domain (ntd) of the spike glycoprotein and inhibits the cell entry of mers-cov with high potency. structure determination and mutagenesis experiments reveal the epitope and critical residues on the ntd for d binding and neutralization. further experiments indicate that neutralization by d does not depend solely on the inhibition of dpp binding: the antibody also acts after viral cell attachment, inhibiting the pre-fusion to post-fusion conformational change of the spike. these properties give d a wide neutralization breadth and help explain its synergistic effects with several rbd-targeting antibodies.

middle east respiratory syndrome coronavirus (mers-cov), a novel lethal human virus in the family coronaviridae, was first identified in saudi arabia in june . infection by this pathogen causes an acute respiratory disease designated as mers, with symptoms that are very similar to those of sars. globally, mers-cov infections have been confirmed in countries, causing deaths (http://www.who.int/emergencies/mers-cov/en/). interspecies transmission from dromedary camels to humans is considered to be one major route of transmission in the middle east region. however, many infected patients without camel exposure and a recent mers outbreak in korea demonstrated that large-scale human-to-human transmission can occur through close contact. due to its potential for mutating toward efficient human-to-human transmission and causing a pandemic, mers-cov was listed as a category c priority pathogen by the us national institute of allergy and infectious diseases. monoclonal antibodies (mabs) with potent neutralizing activity have become promising candidates for both prophylactic and therapeutic interventions against viral infections. in coronaviruses, the component primarily targeted by mabs is the homotrimeric spike (s) glycoprotein of the virion.
as a typical class i fusion glycoprotein, the s trimer of highly pathogenic coronaviruses such as mers-cov and sars-cov, which mediates receptor recognition and membrane fusion during viral entry [ ] [ ] [ ] [ ] [ ] [ ] , undergoes protease cleavage into the s and s subunits, positional change of the receptor-binding domain (rbd) in the s subunit for receptor binding, dissociation of the s -receptor complex, and finally formation of a six-helix bundle by the s subunits. a series of rbd-targeting antibodies against mers-cov, which block the binding of the s trimer to the cellular receptor dpp , have been reported and characterized [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] . these antibodies exhibited high potency in inhibiting the infectivity of pseudotyped and live mers-cov in cells and animal models. the neutralizing epitopes and mechanisms of antibodies including c , d , m , mers- , jc - , cdc-c , mers- , and mers-gd were further elucidated at the atomic level by structural and functional studies [ ] [ ] [ ] [ ] [ ] [ ] [ ] . sequence comparisons of different mers-cov strains have shown that most naturally occurring mutations of the s glycoprotein are located on the rbd of the s subunit and the s subunit. considering the rapid evolution and high genome variation of rna viruses, more mutations on the rbd may enable the new strains to escape neutralization by currently known rbd-targeting antibodies. therefore, new mabs targeting other functional regions of the mers-cov s glycoprotein and/or neutralizing by different mechanisms are important for developing effective prophylactic and therapeutic interventions against mers-cov infection. although several mabs targeting non-rbd regions have recently been reported, their neutralizing epitopes and mechanisms remain unclear , , . in this study, we isolated and characterized the mouse mab d by combining structural, biochemical, and functional studies. 
the d antibody recognizes the ntd of mers-cov s glycoprotein and neutralizes the infectivity of pseudotyped and live virus with a potency comparable to that of the most active rbd-targeting antibodies. we also found that the epitope and mechanism of d , which are different from those of rbd-targeting antibodies, give it a better neutralizing breadth and enable it to work synergistically with other antibodies against different mers-cov strains. all these results indicate that d is a very promising candidate for the future combined use of different antibodies in our battle against mers-cov.

characterization of neutralizing mab d targeting the ntd. to generate mers-cov neutralizing mabs with epitopes outside the rbd, mice were immunized with recombinant mers-cov s protein (residues - ). subsequently, the splenocytes were harvested and fused with sp / myeloma cells, and the hybridoma cell lines were screened for positive clones by elisa with the s protein. the positive clones were further tested for their reactivity to different s fragments, including the s subunit ntd (residues - ), rbd (residues - ), and the s subunit (residues - ). one ntd-specific mab, named d , was finally isolated with an ec of approximately . μg ml − in elisa (fig. a). it exhibited no cross-reactivity with the rbd at a concentration of μg ml − (fig. b). we further assessed the potential of d , in the form of crude extracts from mouse ascites, for inhibiting mers-cov entry into susceptible huh cells and vero e cells with either pseudotyped or infectious viruses. as expected, d was able to neutralize the infectivity of pseudotyped and live mers-cov (fig. c, d). the neutralizing activity of d was dose-dependent, with an ic of approximately . μg ml − against pseudotyped virus and practically the same ic of approximately . μg ml − against live virus (emc strain) (fig. c, d).
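a half-maximal inhibitory concentration such as the one reported above is read off a dose-response curve. the sketch below shows one simple way to do that, assuming percent-neutralization values measured at a series of antibody concentrations; the function names and the example values are hypothetical, and log-linear interpolation is used here only as an illustration (the source does not state which curve-fitting procedure produced its ic values).

```python
import math


def hill_neutralization(conc, ic50, slope=1.0):
    """Percent neutralization predicted by a Hill (logistic) curve."""
    return 100.0 * conc ** slope / (ic50 ** slope + conc ** slope)


def ic50_by_interpolation(concs, percents):
    """Interpolate the 50% point on a log10 concentration axis.

    concs:    antibody concentrations (same unit throughout, all > 0)
    percents: percent neutralization measured at each concentration
    """
    pairs = sorted(zip(concs, percents))
    # Walk adjacent points from low to high concentration and stop at the
    # first pair whose responses bracket 50%.
    for (c_lo, p_lo), (c_hi, p_hi) in zip(pairs, pairs[1:]):
        if p_lo < 50.0 <= p_hi:
            frac = (50.0 - p_lo) / (p_hi - p_lo)
            log_lo, log_hi = math.log10(c_lo), math.log10(c_hi)
            return 10 ** (log_lo + frac * (log_hi - log_lo))
    raise ValueError("50% neutralization is not bracketed by the data")


# Hypothetical titration generated from a Hill curve with IC50 = 0.1.
doses = [0.01, 0.03, 0.1, 0.3, 1.0]
responses = [hill_neutralization(c, ic50=0.1) for c in doses]
estimated_ic50 = ic50_by_interpolation(doses, responses)
```

interpolating on the log scale matches the way dilution series are usually plotted; a full four-parameter fit would be the more common choice when enough data points are available.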
images illustrating the reduced pfu formation, corresponding to the rate of neutralization of live mers-cov, are shown in fig. e. antibody isotyping showed that d belongs to the igg subtype. sequencing further determined that the heavy chain germline v and j segments are ighv - * and ighj * , while those of the light chain are igkv - * , igkj * , and igkj * , respectively (supplementary table ). we also generated a chimeric version of d ( d -h) by combining the v segments of d with the human igg backbone; it was efficiently expressed in and purified from freestyle -f cells (supplementary fig. a). the bio-layer interferometry (bli) experiment showed that the affinity constant of the binding between d -h and ntd was approximately nm (table and supplementary fig. b). the ic of the purified d -h against cell entry by pseudotyped mers-cov was approximately . μg ml − (supplementary fig. c). we also investigated the protective efficacy of d -h against infection by pseudotyped mers-cov using the r -hdpp mouse model, in which a human dpp is inserted into the rosa locus by crispr/cas and which can be productively infected by high-titer mers-cov pseudovirus, with effects comparable to authentic infection. bioluminescence of the fluc reporter showed that the pseudovirus infection in the mice was clearly prevented by d -h and by the rbd-specific mab mers- when both antibodies were administered by intraperitoneal injection at a dose of μg per mouse (supplementary fig. d). the recombinant chimeric d -h, which retained the activities of the mouse d and protected r -hdpp mice against challenge with pseudotyped mers-cov, was used in subsequent binding and neutralization experiments.

overall structure of the d scfv bound to the ntd. to structurally characterize d and its binding to the spike protein, we determined the crystal structure of the antibody scfv ( d -scfv) in complex with the ntd at a resolution of . Å with a final r work of . and r free of . .
statistics of diffraction data collection, processing, and structure refinement are listed in table . there were three complexes of d -scfv bound to ntd per asymmetric unit. the refined model contains residues tyr to ser of mers-cov ntd, glu to ser of the v h and asp to lys of the v l . n-linked glycans attached to asn , asn , asn , asn , asn , asn , asn , and asn of the ntd are also included in the model. it has been previously shown that the mers-cov ntd folds into a galectin-like structure, which can be separated into top, core and bottom subdomains (fig. a) . upon binding, the d -scfv contacts the top subdomain of the ntd and the asn -linked glycans with its heavy and light chains ( fig. a and supplementary fig. ). all three cdrs of the heavy chain and the cdr and cdr of the light chain participate in the binding (fig. a) . the buried surface between the d -scfv and the ntd encompasses approximately Å for the heavy chain and Å for the light chain. structural features of the interface between d and ntd. the binding interface between d -scfv and ntd consists of residues and asn -linked glycans from the ntd, as well as residues from all cdrs except for lcdr (fig. b, c) . the interacting residues from the ntd are tyr , asp , pro , asp , val , ser , glu , ser , asn , lue , arg , and asn . together with the asn -linked nag , nag and man , they form the conformational epitope recognized by d (fig. b) . the residues recognizing d are ser , tyr , asn from the hcdr , tyr , asn , and ser from the hcdr , arg , tyr , asn , tyr , and tyr from the hcdr , tyr , and tyr from the lcdr , and arg and asp from the lcdr (fig. c) . specifically, d hcdr residues ser , tyr , and asn interact with pro and asp from the ntd, and a formed hydrogen bond is from d asn to ntd asp ( fig. d and supplementary fig. crystal structure of d -scfv bound to ntd and the binding interface. 
a an overall structure of the ntd/ d -scfv complex in which the ntd, n -linked glycans on the ntd, d v l , and d v h are colored in blue, gray, magenta, and cyan, respectively. b epitope on the ntd recognized by d . the ntd is represented as blue surface, on which the protein region bound by d is displayed in orange and the n -linked glycans are displayed as gray sticks. c d residues that are involved in the binding. the v l and v h are colored in magenta and cyan, respectively, and the residues interacting with d are displayed in orange. d interactions between the d v h residues and the corresponding residues of ntd. e interactions between the d v l residues and the corresponding residues of ntd. f zoom-in view of interactions between n -linked glycans and d interacting with tyr , asp , pro , asp , and arg of the ntd (fig. d ). tyr and asn of d form two hydrogenbonding interactions with asp of the ntd (supplementary table ). for the light chain, the lcdr and lcdr residues tyr , tyr , arg , and asp interact with glu , ser , arg , and asn of the ntd, and a salt bridge is formed between arg of lcdr and glu of the ntd (fig. e and supplementary table ) . a prominent feature at the interface is the extensive recognition of asn -linked glycans by all three heavy chain cdrs ( fig. f and supplementary table ). specific hydrogen-bonding interactions occur between tyr and arg of d and the nag and man glycans, respectively ( fig. f and supplementary table ). confirmation of the neutralizing epitope. to confirm the epitope and its critical residues, we performed a mutagenesis study by introducing single mutations to all ntd recognized residues including trp , asp , pro , asp , val , ser , glu , ser , asn , asn , lue , arg , and asn . we first examined the effects of these ntd mutations on the binding by d -h. the d -h bound the wild-type ntd with an affinity of approximately nm (table and supplementary fig. ). 
by contrast, the d a and r a mutations dramatically reduced the binding, to a level that was undetectable by bli (table and supplementary fig. ). the e a and n q mutations reduced the binding affinity by -fold to . μm and -fold to . μm, respectively (table and supplementary fig. ). the other nine mutations had varying effects on the binding, reducing the affinity by - to -fold (table and supplementary fig. ). the effects of these mutations on the neutralizing activity of d -h were consistent with the changes in binding affinity. pseudotyped mers-cov bearing the d a, e a, or r a mutation in the spike glycoprotein escaped neutralization by d -h (table and supplementary fig. ). the ic values of d against pseudotyped mers-cov bearing the d a, v a, or n q mutation were increased by approximately -, -, and -fold (table and supplementary fig. ). the binding and neutralization assays collectively revealed that asp , val , glu , arg , and the asn -linked glycans are critical for recognition and neutralization of mers-cov by d . sequencing of multiple clinical isolates has revealed that the mers-cov s glycoprotein is evolving at an average rate of . × − substitutions per site per year. alignments of the sequences deposited in the ncbi identified naturally changing residues relative to the prototype emc sequence, including v f, v i, v a, d y, l f, t i, a y, l f, d g, v l, v a, e k, d e, v i, q r, q h, r h, r q, a s, t i, g s, and v a, which are located in the ntd (residues - ) and rbd (residues - ) of the s subunit, and the s subunit (residues - ). several residue changes on the rbd, such as those occurring on d , d , and e , indeed enabled mers-cov to escape neutralization by antibodies targeting the rbd. considering that most of the mutations are outside the ntd, we speculated that d -h would have a better tolerance for these naturally occurring mutations.
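the fold reductions in affinity quoted above are ratios of the mutant dissociation constant to the wild-type one. a minimal sketch of that arithmetic follows; the function name, the mutant labels, and the numbers are hypothetical placeholders, and `None` stands for binding that was undetectable.

```python
def affinity_fold_changes(wild_type_kd, mutant_kds):
    """Return {mutant: fold reduction in affinity relative to wild type}.

    wild_type_kd: dissociation constant of the wild-type interaction
    mutant_kds:   mapping of mutant label -> dissociation constant in the
                  same unit, or None when binding was undetectable
    """
    changes = {}
    for name, kd in mutant_kds.items():
        # A larger KD means weaker binding, so KD_mutant / KD_wt is the
        # fold reduction in affinity. Undetectable binding has no ratio.
        changes[name] = None if kd is None else kd / wild_type_kd
    return changes


# Hypothetical values in nM; labels are placeholders, not real mutations.
folds = affinity_fold_changes(
    wild_type_kd=10.0,
    mutant_kds={"mutant_a": 200.0, "mutant_b": 10.0, "mutant_c": None},
)
```

keeping undetectable binding as a separate value avoids reporting an arbitrary fold change for mutants whose affinity could not be measured.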
we generated pseudotyped mers-cov bearing the emc strain s glycoproteins and its mutants harboring all the listed residue changes. the neutralization assays showed that d -h showed effective neutralizing activity against almost all pseudotyped mers-cov variants. only the two mutations v f and v a on the ntd increased the ic value of d -h by more than -fold and significantly reduced its neutralization activity (fig. a, b) , which confirmed the results of the structural and biochemical studies of the binding interface. all other naturally occurring mutations, most of them on the rbd and the s subunit did not affect the neutralization capability of d -h (fig. a, b) , indicating that d would have a wide neutralization breadth against different variants of mers-cov. combination of d with other rbd-targeting antibodies. the current available mers-cov antibody epitopes with solved structures are all on the rbd, which can be grouped into three categories: ( ) epitope of mers- ; ( ) epitopes of mers- , d , c , and jc - ; and ( ) epitopes of m , mca , cdc-c , and the newly reported mers-gd ( supplementary fig. ) . in our study of the rbd-specific mab mers- , we also found synergism with the ntd-targeting mab f . thus, the elucidation of the epitope targeted by d , which added a category outside the rbd ( supplementary fig. ), prompted us to study the combined effect of d together with the three representative antibodies mers- , mers- , and mers-gd in the neutralization of pseudotyped mers-cov by titrating the neutralizing potency of an equimolar mixture of the two antibodies and comparing the dose response with that observed in neutralization assays performed with the individual antibody alone. as shown in the fig. , the combination index (ci) values of mers-gd combined with d at fa values of effective dose %, %, %, and % (ed , ed , ed , and ed , respectively) were . , . , . , and . , respectively. 
as a ci value of indicates an additive effect, < indicates synergism, and > indicates antagonism, the combination of d and mers-gd worked in a clearly synergistic manner. meanwhile, the combination index (ci) values of combined mers- with d at fa values of effective dose %, %, %, and % (ed , ed , ed , and ed ) were . , . , . , and . , respectively. thus, the combination of mers- and d also demonstrated synergism, in particular at relatively lower concentrations. however, the percent neutralization obtained using combined mers- and d showed no obvious difference of half maximal inhibitory concentration (ic ) compared with that of d alone. the combination index (ci) values of combined mers- and d at fa values of effective dose %, %, %, and % (ed , ed , ed , and ed ) were . , . , . , and . , respectively. it indicated that the combination of d with mers- exhibited neither synergy nor antagonism. mechanism of d neutralization. a major reported mers-cov neutralization mechanism relies on inhibiting the binding of the s trimer with the cellular receptor dpp . the epitopes of these reported antibodies all reside in the rbd responsible for receptor binding. the fact that the d epitope is outside the rbd indicated that it may have a different neutralizing mechanism. we first examined if d is still able to inhibit the receptor binding by the s trimer. the facs analysis of cellsurface staining showed that the scfv and fab fragments of d -h did not inhibit the staining of huh cells by the s trimer, while the d -h slightly reduced the staining (fig. a , supplementary table and supplementary fig. ). by contrast, the rbdtargeting mab mers- was much more potent than d -h in inhibiting the binding of the s trimer to huh cells. moreover, the fab and scfv fragments of mers- retained nearly the same potency in the inhibition ( fig. a and supplementary table ). 
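the combination index used earlier in this section follows the chou-talalay convention, in which ci = 1 is additive, ci < 1 is synergistic, and ci > 1 is antagonistic. the sketch below is one way to compute it from median-effect parameters, assuming the mutually exclusive form of the index and a fixed-ratio mixture. the function names, the parameter tuples `(dm, m)`, and the example numbers are hypothetical; the source does not state which software or model variant was used.

```python
def dose_for_effect(fa, dm, m):
    """Median-effect equation: dose that produces fraction affected fa.

    dm: median-effect dose (dose giving fa = 0.5)
    m:  slope of the median-effect plot
    """
    if not 0.0 < fa < 1.0:
        raise ValueError("fa must lie strictly between 0 and 1")
    return dm * (fa / (1.0 - fa)) ** (1.0 / m)


def combination_index(fa, single_a, single_b, combo, fraction_a=0.5):
    """Combination index at effect level fa for a fixed-ratio mixture.

    single_a, single_b: (dm, m) fitted for each antibody alone
    combo:              (dm, m) fitted for the mixture (total dose)
    fraction_a:         share of the mixture dose contributed by antibody A
                        (0.5 for an equimolar mixture in molar units)
    """
    dx_a = dose_for_effect(fa, *single_a)    # dose of A alone for effect fa
    dx_b = dose_for_effect(fa, *single_b)    # dose of B alone for effect fa
    d_total = dose_for_effect(fa, *combo)    # mixture dose for effect fa
    d_a = fraction_a * d_total
    d_b = (1.0 - fraction_a) * d_total
    return d_a / dx_a + d_b / dx_b


# Hypothetical parameters: the mixture needs half the total dose of either
# antibody alone to reach the same effect.
ci_at_ed50 = combination_index(0.5, (1.0, 1.0), (1.0, 1.0), (0.5, 1.0))
ci_at_ed90 = combination_index(0.9, (1.0, 1.0), (1.0, 1.0), (0.5, 1.0))
```

a sham combination, in which an antibody is mixed with itself, returns a ci of 1 at every effect level, which is a useful sanity check on any implementation.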
surface plasmon resonance (spr) analysis confirmed these conclusions by showing that d -h, and not its fab or scfv fragments, could interfere with the binding of the s trimer to chipcoupled dpp in a dose-dependent manner ( supplementary fig. ) , while the igg, fab, and scfv of mers- all inhibited the binding ( supplementary fig. ) . to investigate why the igg, fab, and scfv of d inhibit receptor binding differently, we constructed models of their binding to the s trimer. the mers-cov s trimer structure was determined by cryo-em with the rbd in standing or lying positions, and only the standing rbd could bind to the dpp receptor. after superimposing the ntd/ d -scfv crystal structure onto the s trimer, we observed no steric clashes between three ntd-bound scfv fragments and one or two rbdbound dpp receptors ( fig. b and supplementary fig. ). the s trimer with three rbd-bound receptors was not considered because the cryo-em study of the mers-cov s trimer only revealed conformations with one or two standing rbds. when the scfv was replaced with the fab, there were also no steric clashes between the fab and dpp receptor (fig. c ). it is more complicated to model the binding of d -h to the s trimer, considering that the igg form has two binding sites and the intrinsic flexibility. we found that binding of the d -h igg to the ntd in certain orientations could inhibit the binding of dpp due to steric clashes, while there were still no steric clashes with the d -h bound in some other orientations (fig. d, e) . these results provided a structural explanation for the inability of d -h scfv and fab to inhibit the binding of the s trimer to the dpp receptor. they may also explain why the d -h igg form is not as potent as the mers- igg, fab, and scfv which all directly bind to the rbd. in parallel with biochemical studies, we also examined the neutralizing activities of d -h igg, fab, and scfv. 
the d -h fab and scfv did not interfere with the binding of the s trimer to the dpp receptor. however, they were still able to inhibit the cell entry of pseudotyped mers-cov with ic value of . μg ml − and . μg ml − , respectively (fig. a) . although the d -h fab and scfv are less active than the igg in infection inhibition, they were still comparable to the fab or scfv fragments of several reported rbd-targeting antibodies such as mers- fab (ic : . μg ml − ) and mers- scfv (ic : . μg ml − ) ( supplementary fig. a ). these results collectively indicated that neutralization by d -h involves other mechanism besides interfering with the initial receptor binding. we tested and compared the neutralizing activity of d in pre-attachment and post-attachment settings. after the cell attachment, d was still able to inhibit infection by pseudotyped mers-cov with an ic of . μg ml − (fig. b) . in comparison, mers- , which is more potent than d in inhibiting receptor binding, exhibited very weak neutralization after receptor binding (supplementary fig. b) . the above results, especially the retaining activity of d after viral attachment indicated that d would also interfere with the prefusion to postfusion conformational transition of the s glycoprotein required for membrane fusion. this transition , . we showed that the mers-cov s glycoprotein in the prefusion state is sensitive to the digestion of proteinase k (fig. c) . previous studies have demonstrated that cleavage at the s /s site by trypsin and the binding with cellular receptor greatly enhanced the prefusion to postfusion transition of the spike glycoprotein . consistently, the amount of a kda and proteinase-k-resistant band of the s glycoprotein representing the postfusion six-helix bundle was at the maximum level in the presence of trypsin and dpp (fig. c) . and the addition of d -h fab obviously reduced the intensity of the band (fig. c) . 
meanwhile, we analyzed the full-length mers-cov s trimer embedded in the membrane of pseudotyped virus, and the trigger we used to induce the conformational transition was incubation with huh cells that endogenously express the dpp receptor. after the pseudotyped virus was incubated with huh cells for h at °c, a proteinase-k-resistant band appeared on the sds-page gel, and the addition of d -h, d -h fab, or d scfv each clearly decreased the intensity of this band (supplementary fig. ). thus, these biochemical results strongly suggest that d could also exert its neutralizing activity in the post-attachment stage after receptor binding by inhibiting the conformational transition of the s glycoprotein required for membrane fusion (fig. d). since dpp , which is a critical step for viral cell attachment. in this study, we first isolated the neutralizing mouse antibody d targeting the ntd of the s glycoprotein. neutralization assays showed that d is highly potent and its activity is comparable to that of the most potent rbd-targeting antibodies. structural determination of d scfv bound to the ntd and mutagenesis studies revealed the epitope and key residues on the ntd for binding and neutralization at the atomic level. comparisons of d scfv, fab, and igg forms in dpp -binding competition and neutralization assays indicated that its activity is not solely dependent on the inhibition of dpp binding. further experiments indicated that the neutralizing activity of d after cell attachment is through the inhibition of the prefusion to postfusion conformational transition of the s glycoprotein trimer, which mediates the fusion of viral and cell membranes. we also showed that d has a wide neutralization breadth against mers-cov variants bearing naturally occurring mutations and exhibits synergistic effects with several rbd-targeting antibodies.
these results collectively revealed an antibody epitope and neutralization mechanism on the s glycoprotein, which would contribute to the global efforts to control mers-cov infection and transmission by providing alternatives for mers-cov immunotherapy. similar to the ntds of the s proteins of other betacoronaviruses such as mhv, bcov and hku , that of mers-cov also folds into a galectin-like structure. although the galectin domain is a typical carbohydrate-recognition domain, the betacoronavirus ntds can include structural variations that enable more diverse functions in viral infection. examples include the ntd of bcov, which retains the glycan-binding activity recognizing -n-acetyl- -o-acetylneuraminic acid (neu , ac ), and the ntd of mhv, which evolved specific protein-protein interactions with its cellular receptor ceacam ; both interactions are important for viral cell attachment , . however, there is still no report on glycan-binding or protein-binding activities of the mers-cov ntd. in fact, crystallographic structure determination showed that the glycan-binding site on the mers-cov ntd is occupied by a short helix (residues - ) and the asn -linked glycan, indicating that it is not able to bind glycans in the same way as the ntd of bcov . notably, the asn -linked glycan is involved in the recognition by d , whereby nag and man undergo specific hydrogen-bonding interactions with tyr and arg of d , respectively. the ntd n q mutation also dramatically reduced the binding and neutralization by d , but did not dramatically affect the cell infection of pseudotyped mers-cov (supplementary fig. ). therefore, the asn -linked glycan serves as an important anchor point for the binding of d to the mers-cov ntd.
figure legend. a d -h igg was tested for neutralizing activity against pseudotyped mers-cov before or after receptor binding. vrc mab was used as unrelated control. c the effect of d -h fab on the conformational change of the mers-cov s trimer was probed by western blotting using an anti-mers-cov s polyclonal antibody. refolding to the postfusion conformation was detected by the appearance of a proteinase-k-resistant band. trypsin was used at μg ml − and proteinase k at μg ml − . digestion experiments and western blots were performed in triplicate, and a representative result is shown for each of them. d a cartoon representation designed by us showing the neutralization mechanism by which d blocks mers-cov entry. on the one hand, some virus particles cannot bind to dpp due to steric hindrance caused by d binding. on the other hand, d still recognizes the particles when the up receptor-binding domain (rbd) binds to dpp , and may inhibit the prefusion to postfusion transition of the s subunit and the initiation of membrane fusion. source data are provided as a source data file.

as the largest class i viral fusion protein, the coronavirus s glycoprotein is expected to undergo a prefusion to postfusion conformational transition to mediate the interaction between viral and cellular membrane proteins, although structural studies have only recently begun to shed light on this. the s glycoproteins of betacoronaviruses mhv and hku , whose structures have been determined by cryo-em, all adopt a similar prefusion homotrimeric architecture , . interestingly, in the prefusion architecture of the s trimer of highly pathogenic mers-cov and sars-cov, two major conformational states were observed. the major difference between them is the change of the rbd in the s subunit from a down to an up position, which was proposed to be a prerequisite for the binding of the s trimer to their respective cellular receptors dpp and ace .
this proposal was recently confirmed by our cryo-em study of the sars-cov s trimer in complex with ace , and we also showed that ace binding could induce the dissociation of the s subunit, which results in the falling apart of the prefusion s trimer and the transition to the postfusion state of the s subunit . a major neutralization mechanism of antibodies against mers-cov is to directly or indirectly compete with the cellular receptor dpp for binding to the rbd. in theory, antibodies that interfere with steps of the coronavirus membrane fusion process other than receptor binding would also have neutralizing activity, and the ntd-targeting d mab we studied is one such example. here, we showed that d neutralization is not solely dependent on dpp -binding competition, and that its inhibition of the s trimer conformational transition after cell attachment also plays a significant role in neutralization. we suggest that the binding of d may stabilize the prefusion architecture of the s trimer, even after the binding of the dpp receptor. the stabilization of a viral fusion protein in one conformational state as a means of neutralization has also been observed and studied in other viruses such as hiv. a recent study revealed that the hiv env trimer is intrinsically dynamic, with three major and distinct prefusion conformations . among them, the closed, ground-state conformation is dominant and can be remodeled to the two other conformations by cd receptor binding, which is essential for the subsequent prefusion to postfusion transition . the binding of neutralizing antibodies, whether they inhibit the binding of the cd receptor (such as vrc ) or not (such as g and pgt ), all resulted in the stabilization of the ground-state conformation of env, which ultimately disfavors the prefusion to postfusion transition required for viral entry , .
to the best of our knowledge, our study offers the first structural definition of the neutralizing epitope of an antibody targeting the s ntd of mers-cov. as summarized in supplementary table , a total of six anti-ntd mabs have been reported , , , . all of them neutralize the infection of pseudotyped mers-cov emc strain with high potency except for mab . f . the mab f and our d showed the same neutralizing activity against live mers-cov in plaque reduction neutralization testing. notably, the mouse mab g can greatly relieve the symptoms of dpp -transgenic mice following mers-cov infection, and our d -h can inhibit the infection of pseudotyped mers-cov in r -hdpp mice. however, the specific neutralizing epitopes and mechanisms of f , g , jc - , and fib-h are largely unknown. in addition, the combination of different antibodies is expected to be an effective strategy to combat mers-cov infection as the virus continues to spread among multiple animal species and to probe and adapt to the human population . an effective combination would require the candidate antibodies to bind to disparate epitopes or to act through distinct mechanisms, and hence to display additive or synergistic effects, as with the mabs mers- and f mentioned above . although the exact mechanism that leads to the synergistic or additive effect is uncertain, our d -h combined with mers-gd or mers- antibodies demonstrated synergy in inhibiting the infectivity of pseudotyped mers-cov, while d -h and mers- antibodies together had an additive effect. consequently, d is currently the most comprehensively studied ntd-targeting mab, with a different epitope and working mechanism, which makes it an excellent candidate, alone or in combination with rbd-targeting neutralizing antibodies, in our battle against mers-cov infection.

three weeks after the initial immunization, these mice were boosted twice at -week intervals.
cells collected from the spleens of sacrificed animals were fused with cultured sp / cells at a : ratio in the presence of peg (sigma). hat selection medium was used for the fused hybridoma cultures. after -weeks of incubation, the positive hybridomas were selected via s-coated elisa, and the positive clones were subjected to limiting dilution and downstream validation. for large-scale mab production, ascites fluid from mice inoculated with the hybridomas was collected and purified by the caprylic acid-ammonium sulfate precipitation method.

protein expression and purification. the coding sequence of the mers-cov spike glycoprotein ectodomain (emc strain, spike residues - ) was ligated into the pfastbac-dual vector (invitrogen) with a c-terminal t fibritin trimerization domain and a hexa-his-strep tap tag to facilitate further purification. briefly, the protein was prepared using the bac-to-bac baculovirus expression system and purified by sequentially applying strep-tactin and superose columns (ge healthcare) with hbs buffer ( mm hepes, ph . , mm nacl). fractions containing mers-cov s glycoprotein were pooled and concentrated for subsequent biochemical analyses. the sequence encoding mers-cov s ntd (residues - ) with a c-terminal hexa-his tag was inserted into the eukaryotic expression vector pvax. freestyle -f cells were transfected with the plasmid using polyethylenimine (pei) (sigma). after h, the supernatant was collected and the ntd was purified using nta sepharose (ge healthcare) and a superdex high performance column (ge healthcare) with hbs buffer ( mm hepes, ph . , mm nacl). the sequences encoding the d v l and v h were separately cloned into the backbone of antibody expression vectors containing the constant regions of human igg . the chimeric antibody d -h was expressed in freestyle -f cells by transient transfection and purified by affinity chromatography using protein a sepharose followed by gel-filtration chromatography.
the purified d -h was exchanged into phosphate-buffered saline (pbs) and digested with papain protease (sigma) overnight at °c. the digested antibody was then passed back over protein a sepharose to remove the fc fragment, and the unbound fab in the flow-through was further purified using a superdex high performance column (ge healthcare). the gene encoding the d v l followed by v h, with a connecting triple gggs linker and a c-terminal hexa-his tag, was synthesized and cloned into the eukaryotic expression vector pvrc . freestyle -f cells were transfected with the plasmid in the presence of pei (sigma). the cell-culture supernatant was collected h after transfection, and the d scfv was captured on nta sepharose (ge healthcare). the bound d scfv was eluted with hbs buffer containing mm imidazole and was then further purified by gel-filtration chromatography using a superdex high performance column (ge healthcare).

complex preparation and crystallization. the mers-cov ntd and the scfv fragment of d were mixed at a molar ratio of : . , incubated for h at °c and further purified by gel-filtration chromatography. the purified complex was concentrated to approximately mg ml − in hbs buffer ( mm hepes, ph . ,

data collection and structure determination. to collect the diffraction data, all crystals were flash-cooled in liquid nitrogen after being incubated in reservoir solution containing % (v/v) glycerol. the diffraction images were collected on the bl u beamline at the shanghai synchrotron radiation facility (ssrf) at a wavelength of . Å. all images were processed with hkl . the structure was solved by molecular replacement using phaser from the ccp suite . the search models were the mers-cov ntd structure (pdb id: vyh) and the structures of the heavy-chain and light-chain variable domains available in the pdb with the highest sequence identities. subsequent model building and refinement were performed using coot and phenix, respectively , .
in the final refined model, % of residues lie in the most favored regions of the ramachandran plot, . % in allowed regions and . % in disallowed regions. all structural figures were generated using pymol .

neutralizing assay of pseudotyped mers-cov. t cells cultured in mm dish were co-transfected with μg of pcdna . -mers-spike or its mutants and μg of pnl - .luc.re. the supernatants containing sufficient pseudotyped mers-cov were harvested - h post-transfection. subsequently, the % tissue culture infectious dose (tcid ) was determined by infection of huh cells. for the neutralization assay, tcid per well of pseudotyped virus were incubated with or serial : dilutions of purified antibodies, fabs or scfvs for h at °c, after which huh cells (about . × per well) were added. after incubation for h at °c, the neutralizing activities of antibodies were determined from the luciferase activity and presented as ic , calculated using the dose-response inhibition function in graphpad prism (graphpad software inc.).

cell entry of pseudotyped virus. the concentration of the harvested pseudotyped virions was normalized by p elisa kit (beijing quantobio biotechnology co., ltd., china) before infecting the target huh cells. the infected huh cells were lysed at h after infection, and viral entry efficiency was quantified by comparing the luciferase activity between pseudotyped viruses bearing the mutant and wild-type mers-cov spike glycoproteins.

postattachment neutralization assay. for the postattachment pseudotyped virus neutralization assay, huh cells, upon reaching a density of . × per well in a -well plate, were incubated with tcid per well of pseudotyped virus at °c for h. after removing the supernatant, μl of pbs was added twice to each well to wash away the unbound pseudotyped viruses. a total of serial : dilutions of purified antibodies in dmem ( % fbs) were then added to the huh cells with attached pseudotyped viruses, with dmem ( % fbs) alone as control.
neutralization activities were determined from the luciferase activity after incubation for h at °c and also presented as ic , calculated using the dose-response inhibition function in graphpad prism (graphpad software inc.).

cooperativity of mabs for neutralization. synergistic, additive, and antagonistic interactions between d and mers-gd , d and mers- , as well as d and mers- for virus neutralization were evaluated by the median-effect analysis method using compusyn software as previously reported , . the measured neutralization values were input to the program as fractional effects (fa) ranging between . and . for each of the two antibodies and for both in combination. combination index (ci) values were calculated in relation to fa values. a logarithmic ci value of indicates an additive effect, < indicates synergism, and > indicates antagonism.

live mers-cov neutralization assay. the neutralizing activity of the mabs against live mers-cov was also determined in dpp -expressing vero e cells. upon reaching a density of × per well in a -well plate, cell monolayers were infected with - plaque-forming units (pfu) of live virus in the presence or absence of the mab. after three days of incubation at °c, the inhibitory capacity of the mabs was assessed by determining the numbers of plaques compared with the potent mers-cov anti-rbd and anti-n mabs.

murine model of mers-cov pseudovirus infection. the mers-cov-susceptible animal model used in this experiment was the hdpp -knockin mouse, which was established by inserting human dipeptidyl peptidase (hdpp ) into the rosa locus using crispr/cas , resulting in global expression of the transgene in a genetically stable mouse line . mice (n = ) were challenged by intraperitoneal (i.p.) injection with doses of . × . tcid of pseudotyped mers-cov. d -h and mers- were administered i.p. to r -hdpp mice at a dose of μg per mouse prior to challenge with pseudovirus.
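the median-effect analysis used above for antibody cooperativity can be illustrated with a short sketch. this is not the compusyn implementation; it is a minimal python version of the chou-talalay median-effect equation, fa/(1 − fa) = (d/dm)^m, and of the combination index ci = d1/dx1 + d2/dx2 for two agents. the function names, the example doses and the tolerance band used to call an effect "additive" are assumptions made for illustration only. on this scale ci = 1 (log ci = 0) is additive, ci < 1 is synergistic and ci > 1 is antagonistic.

```python
import math


def fit_median_effect(doses, fractions):
    """Fit log10(fa/(1-fa)) = m*log10(D) - m*log10(Dm) by least squares.

    Returns (m, dm): the slope m and the median-effect dose Dm (the IC50).
    """
    xs = [math.log10(d) for d in doses]
    ys = [math.log10(fa / (1.0 - fa)) for fa in fractions]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    m = sxy / sxx
    intercept = mean_y - m * mean_x
    dm = 10 ** (-intercept / m)
    return m, dm


def dose_for_effect(fa, m, dm):
    """Dose of a single agent needed to reach fractional effect fa."""
    return dm * (fa / (1.0 - fa)) ** (1.0 / m)


def combination_index(fa, d1, d2, params1, params2):
    """CI for a mixture (d1, d2) that produces fractional effect fa.

    params1 and params2 are the (m, dm) pairs fitted for each agent alone.
    """
    dx1 = dose_for_effect(fa, *params1)
    dx2 = dose_for_effect(fa, *params2)
    return d1 / dx1 + d2 / dx2


def interpret_ci(ci, tol=0.1):
    """Label a CI value; the +/- tol band around 1 is an arbitrary choice."""
    if ci < 1.0 - tol:
        return "synergism"
    if ci > 1.0 + tol:
        return "antagonism"
    return "additive"


if __name__ == "__main__":
    # Hypothetical single-antibody titrations (doses in arbitrary units).
    mab_a = fit_median_effect([0.5, 1, 2, 4, 8], [0.2, 1 / 3, 0.5, 2 / 3, 0.8])
    mab_b = fit_median_effect([0.5, 1, 2, 4, 8], [0.2, 1 / 3, 0.5, 2 / 3, 0.8])
    # Hypothetical mixture: 0.5 units of each antibody gives 50% neutralization.
    ci = combination_index(0.5, 0.5, 0.5, mab_a, mab_b)
    print(ci, math.log10(ci), interpret_ci(ci))
```

the fit is done on the linearized form of the median-effect equation, which is why fa values must lie strictly between 0 and 1, consistent with the restricted fa range entered into the program in the protocol above.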
mice (n = for the pbs group and n = for the c group) were also administered pbs or control mab c (mab of anti-na of h n , at a dose of μg per mouse) and challenged using the same i.p. dose of pseudovirus. the ivis-lumina ii imaging system (xenogen, baltimore, md, usa) was used to detect bioluminescence. prior to measuring luminescence, the mice were anesthetized using an i.p. injection of sodium pentobarbital ( mg kg − ). the exposure time was s, and fluorescence intensity in regions of interest was analyzed using living image software (caliper life sciences, baltimore, md, usa). different wavelengths were used for detecting pseudovirus and tdtomato fluorescence. the substrate, d-luciferin ( mg kg − , xenogen-caliper corp., alameda, ca, usa), was injected i.p. and imaging was conducted min later. the relative intensities of emitted light were represented as colors ranging from red (intense) to blue (weak) and quantitatively presented as photon flux in photons s − cm − sr − . binding studies using bli. binding kinetics of mers-cov ntd and its mutants with d were studied using a fortébio octet htx instrument. assays with agitation set to rpm in hbs buffer ( mm hepes, ph . , mm nacl) supplemented with . % (v/v) tween were performed at °c in solid black tilted-bottom -well plates (greiner bio-one). d ( μg ml − ) was used to load anti-human igg fc capture probes for s to capture levels of . - nm. biosensor tips were then equilibrated for s in hbs buffer supplemented with . % (v/v) tween prior to binding assessment with different concentrations of wild-type or mutant mers-cov ntd for s, followed by dissociation for s. data analysis and curve fitting were performed using octet software, version . . binding competition assays by spr. real-time binding and analysis by spr were conducted on a biacore t instrument with cm chips (ge healthcare) at room temperature. for all the analyses, hbs buffer consisting of mm hepes, ph . , mm nacl and . 
% (v/v) tween was used, and all proteins were exchanged into the same buffer. the blank channel of the chip was used as the negative control. dpp ( μg ml − ) was immobilized on the chip at about response units. soluble mers-cov spike trimer (s) at the same gradient, in the presence or absence of a concentration gradient of iggs, fabs, or scfvs, was flowed over the chip surface. after each cycle, the sensor surface was regenerated with . mm naoh. data were analyzed using the biacore t evaluation software by fitting to a : langmuir binding model.

facs analysis of cell-surface staining. the binding between recombinant soluble mers-cov spike trimer (s) and human dpp expressed on the surface of huh cells was measured using fluorescence-activated cell sorting (facs). all cell-surface staining experiments were performed at room temperature. soluble mers-cov spike trimer (s) with strep-tag ( μg) was incubated with monoclonal antibodies (mabs) in advance at molar ratios of : , : , : , and : for h. huh cells were trypsinized and then incubated with s or with mixtures of s and mabs for h. after washing away the unbound s with pbs times, the huh cells were stained with streptavidin apc (bd ebioscience) for another min. cells were subsequently washed with pbs times and analyzed by flow cytometry on a facs aria iii machine (bd ebiosciences).

western blots. in total, μl of pseudotyped mers-cov was thawed and mixed with μg of antibodies (igg, fab or scfv) for h. the virus alone or the mixture was incubated with μl of huh cell suspension for another h at °c. an equal volume of buffer and proteinase k (final concentration of μg ml − ; thermo fisher) was then added and incubated for h at °c. for the soluble s, μg of the s trimer was incubated with μg of the dpp ectodomain or μg of d fab for h on ice. trypsin (final concentration of μg ml − ; thermo fisher) was then added to these samples and incubated for min at °c.
subsequently, the samples were supplemented with μg ml − proteinase k and incubated for min at °c. × sds-page loading buffer was then added to all samples prior to boiling at °c. samples were run on a - % gradient tris-mops gel (genscript) and transferred to polyvinylidene fluoride membranes. an anti-s mers-cov s polyclonal antibody ( : dilution; thermo fisher; cat#pa - ) and an hrp-conjugated goat anti-rabbit secondary antibody ( : dilution; huaxingbio; cat#hx ) were used for western blotting. ai was used to develop images.

reporting summary. further information on research design is available in the nature research reporting summary linked to this article.

the source data underlying figs. a-d, , , and a-c and supplementary figs. a-c, , , , - are provided as a source data file. the crystal structures presented in this work have been deposited in the protein data bank (pdb) and are available with accession code j .

we would like to thank dr. changfa fan (division of animal model research, institute for laboratory animal resources, national institutes for food and drug control) for help in providing the r -hdpp mouse model and experimental method. we thank dr. jianhua he and the staff scientists at the ssrf bl u beam line, as well as dr. shilong fan at the x-ray crystallography platform of the tsinghua university technology center, for assistance in diffraction data collection. this work was supported by the national key plan for scientific research and development of china (grants yfd and yfd ), the national natural science foundation of china (grants and u ), and the national major project for control and prevention of infectious disease in china ( zx - ).

h.z., w.t., l.z. and x.w. designed the experiments. y.c., k.q. and w.t. isolated the antibody d and sequenced the corresponding v l and v h genes. b.h. carried out the neutralizing assay with live mers-cov. h.z. and s.z. expressed, purified, and crystallized the protein, and h.z. carried out the bli and spr analysis. h.z. conducted all the neutralizing assays based on pseudotyped mers-cov with the help of w.j. h.z. conducted the dpp -competition assays and the western blot analysis. p.n. performed the protection assay in mice. h.z. and j.l. collected the diffraction data. h.z. and x.w. processed the diffraction data and determined and analyzed the structure. h.z. and x.w. wrote the paper with contributions from l.z. and w.t.

supplementary information accompanies this paper at https://doi.org/ . /s - - - .

competing interests: the authors declare no competing interests.

peer review information: nature communications thanks the anonymous reviewers for their contribution to the peer review of this work. peer reviewer reports are available.

open access: this article is licensed under a creative commons attribution . international license, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the creative commons license, and indicate if changes were made.

key: cord- -pqn ojj authors: yao, hebang; cai, hongmin; li, tingting; zhou, bingjie; qin, wenming; lavillette, dimitri; li, dianfan title: a high-affinity rbd-targeting nanobody improves fusion partner’s potency against sars-cov- date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: pqn ojj

a key step in sars-cov- infection is the attachment of its spike receptor-binding domain (s rbd) to the host receptor ace .
considerable research has been devoted to the development of neutralizing antibodies, including llama-derived single-chain nanobodies, that target the receptor-binding motif (rbm) and block ace -rbd binding. simple and effective strategies to increase potency are desirable for such studies when antibodies are only modestly effective. here, we identify and characterize a high-affinity synthetic nanobody (sybody, sr ) as a fusion partner to improve the potency of rbm-antibodies. crystallographic studies reveal that sr binds to rbd at a conserved and ‘greasy’ site distal to rbm. although sr distorts rbd at the interface, it does not perturb the rbm conformation, and hence displays no neutralizing activity itself. however, fusing sr to two modestly neutralizing sybodies dramatically increases their affinity for rbd and their neutralization activity against sars-cov- pseudovirus. our work presents a tool protein and an efficient strategy to improve nanobody potency.

sars-cov- , the pathogenic virus for covid- , has caused a global pandemic since it was first reported in early december in wuhan, china ( ), posing a grave crisis for health and for economic and social order. sars-cov- is heavily decorated by its surface spike (s) ( , ), a single-pass membrane protein that is key for the host-virus interactions. during infection, s is cleaved by host proteases ( , ), yielding the n-terminal s and the c-terminal s subunit. s binds to angiotensin-converting enzyme (ace ) ( - ) on the host cell membrane via its receptor-binding domain (rbd), causing conformational changes that trigger a secondary cleavage needed for the s -mediated membrane fusion at the plasma membrane or in the endosome. because of this essential role, rbd has been a hot spot for the development of therapeutic monoclonal antibodies (mabs) and vaccines ( ). llama-derived heavy chain-only antibodies (nanobodies) are attractive biotherapeutics ( ).
these small (~ kda) proteins are robust, straightforward to produce, and amenable to engineering such as mutation and fusion. owing to their ultra-stability, nanobodies have been reported to survive nebulization, a feature that has been explored for the development of inhaled nanobodies to treat respiratory viral diseases ( , ), a category that includes covid- . owing to their high sequence similarities with human type vh domains (vh ), nanobodies are known to cause little immunogenicity ( ). for the same reason, they can be humanized with relative ease to reduce immunogenicity when needed. nanobodies are therefore increasingly recognized as biotherapeutics. examples of nanobody drugs include caplacizumab ( ) for the treatment of acquired thrombotic thrombocytopenic purpura, and ozoralizumab and vobarilizumab, which are in clinical trials for rheumatoid arthritis ( , ). recently, several groups have independently reported neutralizing nanobodies ( , ) or single-chain vh antibodies ( ) against sars-cov- with variable potencies. we have recently reported several synthetic nanobodies (sybodies) which bind rbd with various affinities and neutralizing activities ( ). affinity and neutralizing activity are very important characteristics for therapeutic antibodies, and they can be improved in a number of ways, such as random mutagenesis ( , ) and structure-based design. previously, in the case of one modestly neutralizing sybody, mr , we determined its structure and designed a single mutant that improves its potency by over -fold ( ). the rational design approach, while very effective, inevitably requires high-resolution structural information, which is non-trivial to obtain. generally applicable tools would therefore be welcome. here, we report a strategy to increase sybody potency by biparatopic fusion with sr , a sybody that binds rbd tightly with a kd of . nm.
as revealed by the crystal structure, sr engages the rbd at a conserved site that is distal to the rbm. as such, it does not neutralize sars-cov- but forms non-competing pairs with several other rbm-binders and increases their neutralization potency when conjugated. sr may be used as a general affinity-enhancer for both detection and therapeutic applications. a high-affinity rbd binder without neutralizing activity previously, we generated sybodies from three highly diverse synthetic libraries by ribosome and phage display with in vitro selections against the sars-cov- rbd. most of the sybody binders showed neutralizing activity. interestingly, about sybodies bound rbd but showed no neutralizing activity ( ) even at m concentration. one such sybody, named sr , was characterized in this study. in analytic fluorescence-detection size exclusion chromatography (fsec), sr caused earlier retention of rbd (fig. a) , which was included at a low concentration ( . m), suggesting nanomolar affinity for sr -rbd binding. this was confirmed by bio-layer interferometry analysis (fig. b) , which showed a kd of . nm and an off-rate of × - s - . consistent with its inability to neutralize sars-cov- pseudovirus, sr did not affect rbd-ace binding (fig. c) . to characterize the sr -rbd interactions in detail, we purified the complex (fig. d ), and obtained crystals (fig. d ) that diffracted to . Å resolution ( table ) . the structure was solved by molecular replacement using the published rbd and sybody structures (pdb ids m j and m ) ( , ) as search models. the structure was refined to rwork/rfree of . / . ( table ) . the asymmetric unit contained one molecule each of the rbd and sr , indicating an expected : stoichiometry. sr binds to the rbd sideways with a buried surface area of , . Å ( fig. a) , which is significantly larger than that for the previously reported sybodies sr ( . Å ) and mr ( . Å ) ( ) . the binding surface is near a heavily decorated glycosylation site, asn ( fig. 
a- c) , which, although at an apparent strategic position to possibly divide the accessible surfaces for immune surveillance, does not show clashes with sr . all three cdrs participated in the interaction by providing five (cdr ), three (cdr ), and nine h-bonds (cdr ) (fig. e- g) . notably, the cdr , which contains a cluster of hydrophobic side chains that include met , val , phe , trp , and tyr , inserted into a greasy pocket (fig. b ) in the rbd that was lined with twelve hydrophobic/aromatic residues (fig. f) . unlike salt bridges, hydrophobic interactions are more tolerant of environmental changes such as changes in ph and ionic strength. in addition, they are less specific and thus less likely to be affected by mutations. this binding mode thus makes sr an attractive candidate for detection purposes. most rbd-targeting neutralizing antibodies, including neutralizing nanobodies characterized so far ( , - , , , - , - , , , ) , engage the rbd at the receptor-binding motif (rbm) (fig. a) , thus competing off ace and preventing viral entry. aligning the ace structure to the sr -rbd structure showed that the sr -binding epitope is distant from the rbm (fig. a) . comparing the epitopes of existing monoclonal antibodies showed that the sr epitope partly overlaps with cr ( ) and the recently identified ey a ( ) (fig. b, c ). it has been established that the binding of the bulky cr and ey a at the interface between rbd and the n- . taken together, the structural data rationalize the high-affinity binding between sr and rbd, and its inability to neutralize sars-cov- . because nanobodies are relatively easy to produce, the availability of nanobodies that recognize a wide spectrum of epitopes can be a useful toolkit to probe the binding mode of uncharacterized antibodies using competitive binding assays. 
they may also be used to select binders with new epitopes by including them as pre-formed sybody-rbd complexes during in vitro selection (and thus excluding binders at the same site). [fig. legend fragment: other rbd-targeting nanobodies ( , , , ) and mabs ( - , , , , , - ). red, the collective epitope of rbm-binders; blue, the sr epitope; magenta, the collective epitope of cr and ey a; orange, the overlap (legend truncated).] structure alignment of sr -rbd with ace -rbd revealed that the two rbd structures were overall very similar with a cα rmsd of . Å (fig. a) . nevertheless, significant structural rearrangements at the binding interface were observed (fig. a, b) . specifically, the small α-helix α - (numbers mark start-end) moves towards the rbm by a dramatic ~ . Å and transforms to a short β-sheet (β - ), which in turn forms a parallel β-sheet pair with β - in the cdr region. in addition, nudged by the cdr , the short helix α - swings towards the rbd core by ~ . Å. remarkably, these dramatic rearrangements neither caused a noticeable conformational change of the rbm (fig. a) nor affected ace binding (fig. c) . given that rbd is a relatively small entity, and that the two surfaces are relatively close (~ Å), this was somewhat unexpected. a probable explanation is that rbd is very rigid and hence stable. indeed, as shown in fig. c , rbd showed ultra-stability, with an apparent melting temperature of greater than °c ( -min heating). intriguingly, the rearrangement happens at a region that is rich in disulfide bonds. specifically, α - is tethered between the disulfide pairs cys -cys and cys -cys , and α - bridges cys -cys and cys -cys (fig. d) . thus, the three disulfide bonds segregate the two local motifs from the rest of rbd, preventing these conformational changes from propagating through the domain. the non-neutralizing nature of sr suggests that it could bind rbd simultaneously with rbm binders such as mr and sr ( ) . indeed, bli assays showed no competition between sr and mr (fig. 
a) , indicating a 'sandwich complex' where the rbd is bound by both sybodies. this non-competing feature was also observed in the case of mr (fig. b) , which has also been shown to have neutralizing activity ( ) . as further proof of the simultaneous binding, we determined the structure of the sandwich complex sr -rbd-mr (fig. c, table ) to . Å resolution. the sandwich complex was similar to the individual mr - and sr -rbd complexes, with an overall cα rmsd of . and . Å, respectively. aligning the sandwich complex with the mr -rbd structure revealed no noticeable changes at the mr -binding surface (fig. c) , reinforcing the idea that sr binding neither allosterically changes the rbm surface nor affects rbm binders. [fig. legend fragment: to the two-component complex structure (rbd (green) and mr , pdb id c w) ( ).] although sr does not neutralize sars-cov- pseudovirus itself, its high affinity may help increase the affinity of other neutralizing nanobodies through an avidity effect upon fusion. indeed, the biparatopic fusion sr -mr displayed a remarkable increase in binding affinity compared to sr or mr alone. its kd of . nm (fig. a ) was lower than that of mr (kd = . nm) ( ) by folds and lower than that of sr (kd = . nm) by folds. consistently, sr -mr neutralized sars-cov- pseudovirus times more effectively (in molarity) than mr alone (fig. b) . that sr can enhance the potency of its fusion partner was also demonstrated in the case of mr . in its free form, mr bound to rbd with a kd of . nm (fig. c) , and showed modest neutralizing activity with an ic of . g ml - ( . nm). fusing it to sr increased its affinity by over folds, giving a kd of . nm (fig. d) . in line with this, sr -mr showed a -fold higher neutralization activity compared to mr , with an ic of . nm ( . g ml - ) (fig. e) . interestingly, when sr was fused to mr , a neutralizing antibody that had higher affinity (kd = . nm) than sr , the neutralizing activity decreased by folds (fig. f) . 
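because the fusion proteins are heavier than the individual sybodies, the comparisons above are made in molarity rather than in mass concentration. a minimal sketch of that bookkeeping follows; the function names and all numbers in the usage note are hypothetical placeholders and are not values from this study.

```python
def ug_per_ml_to_nM(conc_ug_per_ml: float, mw_kda: float) -> float:
    """Convert a mass concentration (ug/mL) to a molar concentration (nM).

    ug/mL equals mg/L, and mg/L divided by a molecular weight in kDa
    (i.e. kg/mol) gives umol/L; multiplying by 1000 gives nmol/L.
    """
    if mw_kda <= 0:
        raise ValueError("molecular weight must be positive")
    return conc_ug_per_ml / mw_kda * 1000.0


def fold_improvement(reference: float, improved: float) -> float:
    """Fold change for lower-is-better quantities such as K_D or IC50.

    Both arguments must be in the same (molar) unit.
    """
    if improved <= 0:
        raise ValueError("improved value must be positive")
    return reference / improved


# Hypothetical example: a 15 kDa nanobody at 1.5 ug/mL (placeholder numbers).
example_nM = ug_per_ml_to_nM(1.5, 15.0)
```

with these placeholder inputs, `example_nM` evaluates to about 100 nm. a fusion of roughly twice the mass at the same ug/ml would be only half as concentrated in molar terms, which is why fold changes are best computed after conversion.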
possible reasons include steric incompatibility caused by improper linker length, and allosteric effects. these hypotheses warrant future structural investigation. binding affinity and neutralizing activity are important characteristics of therapeutic antibodies. for modestly neutralizing nanobodies, the potency can be increased in a number of ways, including random mutagenesis ( ) , structure-based design ( ) , and fusion ( , , ) . compared with the other two approaches, the fusion technique is more rapid, less involved, and does not rely on prior structural information. depending on whether the two fusion partners are the same, divalent nanobodies can be categorized into two types: monoparatopic and biparatopic. biparatopic fusions recognize two distinct epitopes on the same target. therefore, they are more likely to be resistant to escape mutants because simultaneous mutations at two epitopes should occur at a much lower rate than at a single epitope. because of its minute size, sr could be used as an 'add-on' to monoclonal antibodies, scfv fragments, and other nanobodies to enhance their affinity and potency, especially for those with modest neutralizing activities. in addition, due to its small size and high stability, sr may be chemically modified as a vector to deliver small-molecule inhibitors specifically targeting sars-cov- . in summary, we have structurally characterized sr , a high-affinity nanobody against sars-cov- rbd. although lacking neutralizing activity alone, sr is an attractive biparatopic partner for rbm-binders because its epitope is distinct from the rbm. our work presents a generally useful strategy and offers a simple and fast approach to enhance the potency of modestly active antibodies against sars-cov- . the authors claim no conflict of interest. sars-cov- rbd was expressed essentially as described ( ) . 
briefly, a dna fragment encoding, from n- to c-terminus, residues - of sars-cov s, a gly-thr linker, the c protease site (levlfqgp), a gly-ser linker, the avi tag (glndifeaqkiewhe), a ser-gly linker, and a deca-his tag was cloned into the pfastbac-based vector. baculovirus was generated in sf cells following the invitrogen bac-to-bac transfection protocol. high five insect cells were infected with p virus. for crystallization, sr or sr -mr was mixed with rbd at a : . molar ratio. the mixture was then loaded onto a superdex column for gel filtration. fractions containing the complex were pooled and concentrated to mg ml - . to screen rbd binders by size exclusion chromatography (sec) using unpurified sybodies, rbd was fluorescently labelled as follows. first the avi-tagged rbd was for - s, before moving into sybody-free buffer for dissociation. the bli signal was monitored during the whole process. data were fitted with a : stoichiometry using the built-in software (analysis . ) for kinetic parameters. for the competitive binding assay of the rbd between sr and ace , the rbd-coated sensor was saturated in nm of sr , before being soaked in nm sr with or without nm of ace . as a control, bli assays were also carried out by soaking the rbd-coated sensor in ace without sr . competitive rbd-binding assays for different sybodies were carried out in the same manner as described above. desired crystals were cryo-protected, harvested using a mitegen loop under a microscope, and flash-cooled in liquid nitrogen before diffraction. x-ray diffraction data were collected at beamline bl u ( ) at shanghai synchrotron radiation facility with a x μm beam on a pilatus m detector, with oscillation of . ° and a wavelength of . Å. data were integrated using the software xds ( ) , and scaled and merged using aimless ( ) . the sr -rbd structure was solved by molecular replacement using phaser ( ) with pdb ids m j and m ( ) as the search models. 
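the one-site fitting mentioned in the bli methods above rests on a simple langmuir model, in which kd is the ratio of the off-rate to the on-rate. the following is a minimal sketch of that model, not the instrument software used in the study; the function names and any rate constants passed to them are hypothetical.

```python
import math


def kd_from_rates(k_on: float, k_off: float) -> float:
    """Equilibrium dissociation constant K_D = k_off / k_on (units: M)."""
    if k_on <= 0:
        raise ValueError("k_on must be positive")
    return k_off / k_on


def association(t: float, conc: float, k_on: float, k_off: float, r_max: float) -> float:
    """Sensor response at time t (s) during association at analyte conc (M).

    One-site model: R(t) = R_eq * (1 - exp(-k_obs * t)), where
    k_obs = k_on * conc + k_off and R_eq = r_max * conc / (conc + K_D).
    """
    k_obs = k_on * conc + k_off
    r_eq = r_max * conc / (conc + kd_from_rates(k_on, k_off))
    return r_eq * (1.0 - math.exp(-k_obs * t))


def dissociation(t: float, r0: float, k_off: float) -> float:
    """Sensor response at time t (s) after transfer to analyte-free buffer."""
    return r0 * math.exp(-k_off * t)
```

in this model the dissociation phase depends only on the off-rate, so a slow off-rate shows up directly as a nearly flat dissociation trace, and at an analyte concentration equal to kd the equilibrium response is half of `r_max`.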
the sr -mr -rbd structure was solved using the sr -rbd and mr structure ( ) as search models. the models were manually adjusted as guided by the fo-fc maps in coot ( ) , and refined using phenix ( ) . structures were visualized using pymol ( ). the structure factors and coordinates were deposited in the protein data bank (pdb) under accession codes d z (sr +rbd) and d (sr -mr +rbd). a novel coronavirus outbreak of global health concern cryo-em structure of the -ncov spike in the prefusion conformation structure, function, and antigenicity of the sars-cov- spike glycoprotein cell entry mechanisms of sars-cov- sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor structure of the sars-cov- spike receptor-binding domain bound to the ace receptor structural basis of receptor recognition by sars-cov- structural and functional basis of sars-cov- entry by using human ace structural basis for the recognition of sars-cov- by full-length human ace conformational dynamics of sars-cov- trimeric spike glycoprotein in complex with receptor ace revealed by cryo-em. biorxiv structural basis for the neutralization of sars-cov- by an antibody from a convalescent patient a highly conserved cryptic epitope in the receptor binding domains of sars- a noncompeting pair of human neutralizing antibodies block covid- virus binding to its receptor ace a human neutralizing antibody targets the receptor-binding site of sars-cov- isolation of potent sars-cov- neutralizing antibodies and protection from disease in a small animal model convergent antibody responses to sars-cov- in convalescent individuals the receptor-binding domain of the viral spike protein is an immunodominant and highly specific target of antibodies in sars-cov- patients cross-neutralization of sars-cov- by a human monoclonal sars-cov antibody potent neutralizing antibodies against multiple epitopes on sars-cov- spike. 
microbe neutralizing nanobodies bind sars-cov- spike rbd and block interaction with ace studies in humanized mice and convalescent humans yield a sars-cov- antibody cocktail potent neutralizing antibodies against sars-cov- identified by high-throughput single-cell sequencing of convalescent patients' b cells a potent neutralizing human antibody reveals the n-terminal domain of the spike protein of sars-cov- as a site of vulnerability. biorxiv potent neutralizing antibodies from covid- patients define multiple targets of vulnerability structures of human antibodies bound to sars-cov- spike reveal structural basis for potent neutralization of sars-cov- and role of antibody affinity maturation. biorxiv : the preprint server for biology the therapeutic potential of nanobodies delivery of alx- by inhalation greatly reduces respiratory syncytial virus disease in newborn lambs nanobodies® as inhaled biotherapeutics for lung diseases caplacizumab treatment for acquired thrombotic thrombocytopenic purpura emerging therapies in rheumatoid arthritis: focus on monoclonal antibodies. f res , f faculty rev- structural basis for potent neutralization of betacoronaviruses by single- potent synthetic nanobodies against sars-cov- and molecular basis for neutralization. biorxiv an ultra-high affinity synthetic nanobody blocks sars-cov- infection by locking spike into an inactive conformation. biorxiv an alpaca nanobody neutralizes sars-cov- by blocking receptor interaction synthetic nanobodies targeting the sars-cov- receptor-binding domain. biorxiv selection, biophysical and structural analysis of synthetic nanobodies that effectively neutralize sars-cov- . 
biorxiv identification of human single-domain antibodies against sars-cov- synthetic single domain antibodies for the conformational trapping of membrane proteins generation of synthetic nanobodies against delicate proteins the protein complex crystallography beamline (bl u ) at the shanghai synchrotron radiation facility how good are my data and what is the resolution? phaser crystallographic software features and development of coot phenix: a comprehensive python-based system for macromolecular structure solution key: cord- - ijlj so authors: li, wenhui; zhang, chengsheng; sui, jianhua; kuhn, jens h; moore, michael j; luo, shiwen; wong, swee-kee; huang, i-chueh; xu, keming; vasilieva, natalya; murakami, akikazu; he, yaqing; marasco, wayne a; guan, yi; choe, hyeryun; farzan, michael title: receptor and viral determinants of sars-coronavirus adaptation to human ace date: - - journal: the embo journal doi: . /sj.emboj. sha: doc_id: cord_uid: ijlj so human angiotensin-converting enzyme (ace ) is a functional receptor for sars coronavirus (sars-cov). here we identify the sars-cov spike (s)-protein-binding site on ace . we also compare s proteins of sars-cov isolated during the – sars outbreak and during the much less severe – outbreak, and from palm civets, a possible source of sars-cov found in humans. all three s proteins bound to and utilized palm-civet ace efficiently, but the latter two s proteins utilized human ace markedly less efficiently than did the s protein obtained during the earlier human outbreak. the lower affinity of these s proteins could be complemented by altering specific residues within the s-protein-binding site of human ace to those of civet ace , or by altering s-protein residues and to residues conserved during the – outbreak. collectively, these data describe molecular interactions important to the adaptation of sars-cov to human cells, and provide insight into the severity of the – sars epidemic. 
sars coronavirus (sars-cov) is the etiological agent of severe acute respiratory syndrome (sars), an acute pulmonary syndrome that, when it emerged in the winter of - , resulted in the death of approximately individuals, close to % of those infected fouchier et al, ; ksiazek et al, ; kuiken et al, ) . despite concerns that sars-cov would re-emerge, last winter ( ) ( ) , only a handful of individuals were found infected by the virus. these individuals appeared to have much less severe symptoms, and no secondary transmission was observed peiris et al, ; song et al, ) . severe cases of sars were also reported in , but these resulted from laboratory infections (normile, ) . the coronavirus spike (s) protein mediates infection of receptor-bearing cells (gallagher and buchmeier, ; holmes, ; . angiotensin-converting enzyme (ace ) is a functional receptor for sars-cov, and binds the sars-cov s protein with high affinity . several lines of evidence suggest that ace is a physiologically relevant receptor during infection. tissue expression of the receptor corresponds to the localization of virus in infected individuals and animals (harmer et al, ; chan et al, ; ding et al, ; hamming et al, ) . also, the efficiency of infection in humans, mice, and rats correlates with the ability of the ace of each species to support viral replication subbarao et al, ; wentworth et al, ) . antibodies that block ace association (sui et al, ) protect mice against infection (sui et al, ) . finally, although many cell lines do not express ace , most cell lines shown to support sars-cov infection or replication detectably express this receptor nie et al, ) . although dc-signr (l-sign, cd l) and dc-sign (cd ) have recently been shown to enhance infection of ace -expressing cells (jeffers et al, ; marzi et al, ; yang et al, ) , these proteins do not appear to mediate efficient infection in the absence of ace (jeffers et al, ; marzi et al, ) . 
unlike many type i fusion proteins, including those of other coronaviruses, the s protein of sars-cov is not cleaved in the virus-producing cell (xiao et al, ; moore et al, ) . however, two domains corresponding to the s and s proteins of processed coronaviruses can be defined (gallagher and buchmeier, ) . the s domain mediates receptor association, whereas the s domain is membrane-associated and likely undergoes structural rearrangements that mediate membrane fusion. a discrete receptor-binding domain (rbd) of the s protein has been defined at residues - of the s domain (xiao et al, ; babcock et al, ; wong et al, ) . this rbd binds ace with higher affinity than does the full s domain . early cases of sars in were reported to have occurred in animal traders and restaurant workers handling wild mammals, and sars-cov has been isolated from two such mammals, palm civets (paguma larvata) and raccoon dogs (nyctereutes procyonoides) (guan et al, ) . palm civets have also been implicated in the minor - sars outbreak (zhong, ) . the sequences of the s-protein genes of viruses isolated from palm civets, and one from raccoon dogs, have been determined (guan et al, ; song et al, ) . interestingly, the rbds of these animal s proteins are highly conserved except at residue , which varies between asparagine and basic amino acids, but differ at several positions from the rbds of viruses isolated during the - outbreak. although different from palm-civet-derived rbds, the latter rbds were themselves highly conserved in the more than s-protein genes obtained during the severe human outbreak (zhang et al, ) . full sequences have also been published for two s-protein genes obtained during the mild - outbreak (song et al, ) . these genes are nearly identical and contain elements common to s protein isolated during the earlier human outbreak and to that isolated from palm civets. here we describe the s-protein-binding domain of human ace by characterizing chimeras of human and rat ace . 
by introducing four residues of human ace into rat ace , we convert this receptor, which supports little or no s-protein-mediated infection, into a receptor that binds the s domain and supports infection, with efficiency close to that of human ace . by characterizing additional ace variants, we localize the s-protein-binding domain primarily to α-helix of ace and to a loop leading to β-sheet . we then show that a representative s protein from the mild - outbreak, and one from palm civets, mediate more efficient infection of cells expressing palm-civet ace than of cells expressing the human receptor. in contrast, s protein from the severe - outbreak efficiently binds and utilizes both receptors. two regions of the s-protein-binding site on ace , and two residues in the rbd of these s proteins, largely determine these differences. our data describe s-protein adaptations, and their receptor counterparts, that permit efficient infection of human cells. these adaptations, absolutely conserved during the - outbreak, may in part account for the unusual severity of sars. we have previously shown that s-protein-mediated infection of cells expressing human ace is substantially more efficient than that of cells expressing the same levels of rat ace . we first investigated whether this difference localized to the ace enzymatic or collectrin domain. the latter domain is defined by its close homology with a small, kidney-expressed protein of the same name (zhang et al, ) . figure a shows that a fusion protein comprising the sars-cov s domain and the fc domain of human igg (s -ig) efficiently precipitated human ace , as well as an ace chimera with the human catalytic domain and the rat collectrin domain. in contrast, s -ig could not precipitate rat ace or an ace chimera with the rat catalytic domain and the human collectrin domain. the ability of these chimeric receptors to bind s -ig was also reflected in their ability to support s-protein-mediated infection ( figure c ). 
these data indicate that differences in the ability of rat and human ace to support infection are localized to the catalytic domain of the receptor. guided by the structure of the ace catalytic domain (towler et al, ) , we made a series of human ace variants in which one or a few solvent-exposed residues were altered to their rat ace counterparts. two variants consistently bound s -ig less efficiently than did wild-type human ace . introduction of rat residues - (nfs), which include a glycosylation site at asparagine not present in human ace , partially inhibited s -ig association ( figures b and a ) and s-protein-mediated entry ( figure c ). alteration of lysine to a histidine residue present on the rat receptor interfered more dramatically with s -ig association, and also partially inhibited s-proteinmediated entry. introduction of human residues - (myp) and lysine into rat ace resulted in substantial figure introduction of rat ace residues into the human ace catalytic domain interferes with s-protein association. (a) hek t cells were transfected with plasmids encoding human or rat ace , or chimeras of these receptors in which residues - , corresponding to the ace catalytic domain, were exchanged. transfected cells were metabolically labeled with [ s]cysteine and [ s]methionine, and lysed. lysates were immunoprecipitated with protein a-sepharose together with either the s domain of the sars-cov (tor ) s protein fused to the fc domain of human igg (s -ig), or with an antibody ( d ) recognizing a tag present at the carboxyterminus of each of the ace variants, and analyzed by sds-page. (b) hek t cells were transfected with plasmids encoding human ace , rat ace , or human ace variants in which residues corresponding to those of rat ace were introduced at the indicated position. transfected cells were analyzed as in (a) and precipitated ace was quantified by phosphorimaging. values indicate the ratio of protein precipitated by s -ig to that precipitated by d . 
error bars indicate the range of two or more experiments. increases in s -ig binding, as assayed both by immunoprecipitation and by flow cytometry (figure a and b). combination of both sets of residues resulted in a rat ace variant that bound s -ig and supported s-protein-mediated infection comparably to human ace (figure a -c). these data suggest that residues - and, more so, lysine participate in s-protein association with human ace . we subsequently altered a number of additional residues of human ace in the vicinity of residues - and to alanine or, in some cases, aspartic acid. alteration of two residues on the first helix of ace , at lysine and tyrosine , substantially interfered with the s -ig association, as did residues adjacent to lysine , at aspartic acid and at arginine , both within ace b-sheet ( figure a) . figure b -d shows three views of the crystal structure of human ace , in which residues that convert rat ace to an efficient sars-cov receptor are shown in red, and additional residues whose alteration interferes with s -ig association are shown in yellow. green indicates residues whose alteration did not affect s -ig binding. the c-terminal collectrin domain is not well ordered in the structure, but is used here to position the molecule with respect to the cell membrane. as shown in the figures, the s-protein-binding site on ace is localized above the deep cleft that harbors the catalytic site and on the upper left of the structure when viewed facing that cleft. the crystal structure of ace has been solved in two distinct conformations, an open one ( figure b -d) and a conformation in which the cleft is closed around the ace inhibitor mln- (dales et al, ; towler et al, ) . these two conformations likely reflect free and substrate-bound states of the enzyme, respectively. we investigated whether the large conformational change associated with inhibitor binding interfered with s -ig binding or s-protein-mediated infection. 
as previously reported, mln- inhibits ace activity in the low nanomolar range (dales et al, ) , and no ace enzymatic activity was detected in the presence of nm inhibitor ( figure a ). however, nm mln- did not interfere with immunoprecipitation of ace by s -ig, nor did this inhibitor interfere with s-protein-mediated infection ( figure b and c). consistent with these observations, the distances among a and b carbons of residues , , , , and in the s-protein-binding site of ace varied by less than . Å between the inhibitor-bound and -unbound structures (towler et al, ) . in contrast, distances between these residues and residues across the cleft typically varied by greater than Å in the two structures. these data suggest that the s-protein-binding region of ace is not perturbed by inhibitors or substrates that induce large conformational changes in the receptor. consistent with these studies, we have also observed, using the assay shown in figure a , that s -ig does not interfere with the enzymatic activity of ace (data not shown). the s protein used in the studies above was obtained from a patient infected during the severe - sars-cov outbreak. another outbreak, during the winter of - , caused much less severe symptoms in the few individuals figure introduction of human ace residues into rat ace converts rat ace to an efficient sars-cov receptor. (a) hek t cells transfected with plasmids encoding the indicated human and rat ace variants or with vector alone were analyzed as in figure (light gray), or by flow cytometry (dark gray). flow cytometry values indicate the ratio of mean fluorescence intensity (m.f.i.) observed using s -ig to that using the e antibody, which recognizes a tag present at the amino-terminus of each ace variant. error bars indicate the range of two or more experiments. (b) representative example of an immunoprecipitation experiment used in (a). 
(c) murine leukemia viruses (mlv) expressing green fluorescent protein (gfp), lacking its endogenous envelope glycoprotein (mlv-gfp), and pseudotyped with the s protein of sars-cov (tor isolate) were incubated with hek t cells transfected with plasmids encoding the indicated human or rat ace variants. amount of ace -expressing plasmid was adjusted to maintain comparable receptor expression levels, as indicated, by flow cytometry using the antibody e that recognizes an amino-terminal myc tag on these receptors. gfp expression in cells was quantified by flow cytometry to measure infection of cells by pseudotyped viruses. error bars indicate range of two experiments. infected and resulted in no documented human-to-human transmissions liang et al, ; zhong, ; song et al, ) . we compared an s protein of virus isolated during this latter outbreak (gd t ; accession number ay ; denoted gd herein) with that of virus obtained during the - outbreak (tor ; ay ) and with that isolated from palm civets (sz ; ay ) (guan et al, ; marra et al, ; rota et al, ; he et al, ; song et al, ) . figure a shows that the s domains of all three s proteins efficiently bound palm-civet ace (accession number ay ), whereas only the s domain of tor efficiently bound human ace . of note, the s domain of virus isolated during the - outbreak bound palmcivet ace much more efficiently than it bound human ace . we then investigated the rbd of each of these s-protein variants. the ability of these s domains to bind palm-civet and human ace was reflected in the ability of their respective rbds to bind these receptors ( figure b ). we also assayed the ability of the entire s protein to mediate infection of cells expressing human or palm-civet ace . the efficiency of entry was consistent with the ability of the s domain of each variant to bind each ace ( figure c ). 
these data are consistent with the hypothesis that the palm civet is a source of sars-cov, and suggest that the apparent lack of severity of disease during the - outbreak may be due in part to incomplete adaptation of gd virus to human ace . the rbds of tor and sz differ by four residues ( figure a ). we investigated which of these residues contribute to the ability of tor rbd to bind efficiently human ace . each residue in the tor rbd was altered to its sz counterpart. alteration of two residues, at positions and , interfered with the association of the tor rbd with human ace ( figure b ). surface plasmon resonance studies further demonstrated a greater than -fold decrease of affinity for human ace when either residue or , but not when residue or , is altered to its palm-civet counterpart (table i and supplementary figure ) . notably, alteration of threonine to serine also affected the ability of the tor rbd to associate with palm-civet ace ( figure b ). introduction of tor residues at or substantially increased the ability of the full sz s protein to infect cells expressing human ace . introduction of sz residues at these positions into the tor s protein resulted in a -to -fold decrease in infection of these cells figure except that the indicated solvent-accessible residues common to human and rat ace were modified in human ace to either alanine or aspartic acid. (b) representation of the crystal structure of human ace , with the collectrin domain oriented downward and viewed from the side of the cleft bearing the enzymatic active site. residues of rat ace whose alteration to the corresponding human residues converted rat ace to an efficient sars-cov receptor are shown in red. human ace residues whose alteration substantially decreased s -ig association are shown in orange. residues whose alteration did not affect s -ig association are shown in green. 
low-resolution electron density associated with the collectrin domain is represented by a small b-sheet and a-helix at the base of the figure. (c) a view identical to that in (b) except that the molecule has been rotated about the vertical axis. (d) a view identical to that in (c) except that the molecule has been rotated about the horizontal axis. ( figure c ). these data suggest that adaptation of s protein to human ace is facilitated by alteration of residue to asparagine and of to threonine. we also investigated determinants on civet ace that participate in its ability to facilitate efficient infection by gd and sz . two regions within the s-protein-binding site differed significantly between human and palm-civet ace (see figure d and e). a region of a-helix (residues - ) varied at six residues between human and civet ace . likewise, a loop initiating ace a-helix differed by four residues ( - ). a glycosylation site at asparagine of human ace is part of this latter region and is not present in palm-civet ace . figure a shows that the s domains of gd and sz bound substantially more efficiently when either of these regions was introduced into human ace , and that introduction of both regions resulted in receptor binding comparable to that with wild-type palm-civet ace . these observations were also reproduced in infection assays ( figure b ). these data demonstrate that residues within the s-protein-binding domain of ace largely determine the efficiency with which gd and sz s proteins bind and utilize human and palm-civet ace . figure c compares the ability of eight rbd variants to bind to human and civet ace , as well as to the chimeric molecules assayed in figure a and to a point-mutation variant of palm-civet ace , in which aspartic acid was altered to a glycine present in human ace . rbd variants were generated from that of tor (left panels) or sz (right panels), and altered at positions , , or both, as indicated. 
the panels of figure c permit several conclusions. first, the rough equivalence between the left and right panels indicates that, as implied by figure b and c, s-protein residues and account for most of the differences between tor and sz rbd. second, no consistent differences were observed between palm-civet ace and its variant with glycine at residue , indicating little or no contribution of this residue to s-protein association. third, consistent with infection data in figure c , the presence of threonine at residue enhanced the affinity of most rbds for civet and human ace , and for variants of these receptors. (compare, for example, k /s rbd variants with k /t variants for their ability to precipitate each ace variant.) fourth, and in contrast, substitution of lysine for asparagine in most contexts increased the ability of each rbd variant to associate with human, but not with palm-civet, ace . (compare the binding of n /t rbd with that of k /t variants for binding to human ace (lane ) and palm-civet ace (lane ); likewise for n /s and k /s variants.) this enhancement was also observed for ace chimeras containing human a-helix residues (lanes and ), but not those of palm civet (lanes , , and ), whereas residues - did not determine sensitivity to rbd residue . fifth, consistent with infection data in figure b and supplementary figure , all rbds bound ace variants bearing residues - of palm-civet ace substantially more efficiently than they bound equivalent variants with human ace residues at these positions (compare lanes and and lanes and in each panel). thus, the data of figure c indicate that a lysine at s-protein residue interferes with rbd association with human, but not palmcivet, ace . supplementary figure shows data consistent with a steric interaction between lysine of human ace and lysine of the sz s protein. our data also show that alteration of s-protein serine to threonine increases rbd affinity for both human and civet ace . 
finally, they suggest that no s protein studied has fully adapted to human ace residues - , consistent with a recent zoonotic transmission of the virus. figure d -f summarizes our findings. figure d shows amino-acid sequences of regions critical to s-protein association for palm-civet, rat, and human ace . figure e shows ace oriented with the c-terminal membrane-associated collectrin domain facing away from the viewer. red indicates residues whose alteration transformed rat ace to an efficient sars-cov receptor. orange indicates additional residues common to rat and human ace whose alteration also interferes with s-protein association. yellow indicates residues along the a-helix ridge that are unique to palm-civet ace , and which permit efficient association with rbd isolates from palm civet and likely interact with lysine of the palm-civet rbd. k of human ace , which interferes with palm-civet rbd lysine , is labeled with white text in figure e . four residues at the beginning of a-helix that permit more efficient binding and infection by all s proteins assayed are shown in cyan, and the glycosylation site in this region, present in human but not palm-civet ace , is shown in green. ace is a functional receptor for sars-cov, and is likely to play a critical role in viral replication in an infected host (hamming et al, ; nie et al, ) . here we describe the s-protein-binding domain of ace . in particular, residues along the first a-helix, and lysine and proximal residues at the n-terminus of b-sheet , participate in s-protein binding and in infection. by altering histidine in rat ace and modifying a glycosylation site that may alter the shape of a-helix , we converted rat ace to an efficient receptor for sars-cov. 
this s-protein-binding region of ace remains intact in the presence of an inhibitor that dramatically alters the overall conformation of ace (dales et al, ; towler et al, ) , consistent with the inability of this inhibitor to block infection, and with the inability of the s protein to modulate ace activity. although there can be multiple constraints on interspecies transmission of viruses (webby et al, ) , s-protein alterations are sufficient to extend or alter the host range of a number of coronaviruses (kuo et al, ; casais et al, ; haijema et al, ; schickli et al, ) . we have shown that entry is the primary barrier to sars-cov infection of murine surface plasmon resonance experiments in which the indicated rbd-ig tor variants shown in figure b bound to immobilized anti-human antibody were assayed for association with soluble human ace . the experiment is representative of two performed with similar results. table listing amino-acid differences among the rbds of the s proteins of the indicated isolates. residues critical to the differential association of these rbds with palm-civet and human ace are shown in gray. (b) experiment similar to that shown in figure b except that individual residues within the tor rbd have been altered to the corresponding residues in the sz rbd. (c) hiv- -luciferase pseudotyped with s protein of the tor , gd, or sz viruses, or with the indicated sz or tor variant, was incubated with hek t cells transfected with plasmid encoding human ace or with palm-civet ace . infection, measured as luciferase activity of cell lysates, was assayed days postinfection. the figure shows the mean and range of two experiments. species-specific ace determinants of differential s-protein association. (a) hek t cells transfected with plasmid encoding human or palm-civet ace , with human ace bearing the indicated palm-civet residues, or with vector alone were analyzed by flow cytometry using s -ig variants of the tor , gd, and sz isolates. 
error bars indicate the range of two or more experiments. (b) hek t cells transfected with plasmid encoding the ace variants used in (a) were incubated with hiv- -luciferase virus pseudotyped with the s proteins of tor , gd, or sz viruses. infection was assayed as in figure c . (c) hek t cells transfected with plasmid encoding human ace , human ace variants bearing the indicated palm-civet residues, palm-civet ace , or the palm-civet ace variant d g were metabolically labeled and lysed. cell lysates were immunoprecipitated with an anti-tag antibody recognizing an amino-terminal tag on these ace variants (a-myc), or with rbd-ig of tor or sz , or with their variants with the indicated alterations of residues and . the experiment is representative of at least two with similar results. (d) amino-acid content of critical regions of ace from human, palm civet, and rat. orange indicates human-ace residues whose alteration interferes with tor s-protein association. red indicates rat-ace residues whose alteration to their human counterparts converts rat ace to an efficient sars-cov receptor. yellow indicates residues of palm-civet ace that accommodate s-protein lysine of sars-cov isolated from palm civets. cyan indicates additional residues of palm-civet ace that, when introduced into human ace , result in more efficient association with all s proteins assayed. this effect may be due to the loss of glycosylation at asparagine of human ace , shown in green. cells . these observations suggest that s-protein changes may be critical to or sufficient for the adaptation of sars-cov to human cells. accordingly, we compared the s proteins derived from the - outbreak (tor ), from the less severe - outbreak (gd), and from apparently healthy palm civets (sz ) (guan et al, ; he et al, ) . strikingly, the receptor-binding regions of each of these s proteins bound palm-civet ace efficiently, but only that from the - outbreak bound human ace with comparable efficiency. 
these data are consistent with the absence of human-to-human transmission during the - outbreak, and with recent transmission of sars-cov from palm civets to humans (guan et al, ; zhong, ; song et al, ) . differences among these s proteins permitted identification of key changes necessary for adaptation to the human receptor. in particular, changes at s-protein residues and appear to be critical for high-affinity association with human ace . the alteration at to a small, uncharged residue is a consistent property of all described sars-cov obtained from humans, whereas most civet-derived viruses retain a basic residue at this position (guan et al, ; marra et al, ; rota et al, ; he et al, ; zhang et al, ; song et al, ) . our data indicate that residue interacts with residues along a ridge formed by ace a-helix , and in particular with lysine , which is present in human but not palm-civet ace . alteration of s-protein residue to the asparagine found in virus isolated from humans appears to accommodate this human ace lysine. differences at s-protein residue are also of interest. a threonine at position is absolutely conserved in all of the more than s proteins isolated during the severe - outbreak. in contrast, the s proteins of viruses isolated during the - outbreak, and all animal sars-cov isolated, had a serine at this position (guan et al, ; he et al, ; zhang et al, ; song et al, ) . a threonine at position increased affinity of most rbds assayed for both human and palm-civet ace , and all chimeras thereof, and substantially enhanced the efficiency with which palm-civet-derived s protein infected cells expressing human ace . these observations indicate that the additional methyl group of threonine participates in the efficiency of infection of human and non-human cells. s-protein alterations at residues and are important for high-affinity association with human ace , and for efficient infection of cells expressing this receptor. 
knowledge of these residues may be useful in assessing the risk posed by any new sars-cov outbreak. our data also show that, even with these and other changes outside the rbd, sars-cov is imperfectly adapted to its human receptor. in particular, introduction of residues - of civet ace into the human receptor increased binding of, and infection mediated by, all s proteins assayed. this effect may be due to removal of a glycosylation site at position to which no sars-cov has fully adapted. this observation raises the possibility that soluble human ace lacking this glycosylation would more effectively inhibit sars-cov replication than wild-type human ace . we have previously shown that replication of sars-cov in a murine cell line is limited by the low affinity of the s protein for murine ace (li et al, ) . moreover, the affinity of s protein for the receptors of rats, mice, and humans correlates with the ability of virus to replicate in these animals. the lower affinity of palm-civet-derived s protein for the palm-civet receptor is consistent with this pattern in that no overt disease was manifest in animals from which this virus was isolated (guan et al, ) , but disease was observed in palm civets challenged with isolates obtained during the - outbreak . together, these observations suggest that the affinity of s protein for ace is an important determinant in the overall rate of viral replication and in the severity of disease. if so, adaptations within the s protein that are critical for high-affinity association with human ace may have contributed to the unusual severity of sars. plasmid encoding a codon-optimized form of the sars-cov s protein of the tor isolate (accession number ay ) has been previously described (moore et al, ) . 
plasmids encoding the corresponding s proteins of the gd t isolate, isolated during the mild - outbreak (accession number ay ; denoted gd herein), and the sz isolate, isolated from palm civets (accession number ay ), were generated de novo by recursive pcr. plasmids encoding the s domain (residues - ) and the rbd (residues - ) of the tor s protein, fused to the fc domain of human igg (s -ig and rbd-ig, respectively), have been previously described (wong et al, ) . corresponding s -ig and rbd-ig variants of the gd and sz isolates and variant ace molecules were generated by mutagenesis using the quikchange method (invitrogen). human, rat, and palm-civet ace molecules were amplified from cdna of corresponding tissue by pcr, and cloned into a vector encoding previously described amino- and carboxy-terminal tags . association of s -ig or rbd-ig with ace variants was determined by flow cytometry and by immunoprecipitation. flow cytometry using ace -expressing cells has been previously described (wong et al, ) . briefly, hek t cells were transfected with a plasmid encoding ace variants, or with vector alone. at days post-transfection, cells were detached in pbs/ mm edta and washed with pbs/ . % bsa. s -ig or rbd-ig, or variants thereof, or the anti-tag antibody e , were added to cells, and the mixture was incubated on ice for h. cells were washed three times with pbs/ . % bsa, and then incubated for min on ice with anti-human igg fitc conjugate (sigma). cells were again washed with pbs/ . % bsa, and analyzed. immunoprecipitations were performed as previously described (wong et al, ) . briefly, hek t cells were transfected with plasmid encoding ace variants and radiolabeled with [ s]cysteine and [ s]methionine. after days, transfected cells were harvested and lysed in pbs buffer containing % chapso. 
cell lysates were incubated with protein a-sepharose beads together with mg s -ig or rbd-ig variants, or with the antibodies d , recognizing a carboxy-terminal c tag on ace , or e , recognizing an amino-terminal myc tag. protein a-sepharose beads were washed three times in pbs/ . % chapso, and analyzed by sds-page. immunoprecipitated ace variants were quantified by phosphorimaging. tor rbd-ig variants were also assayed by surface plasmon resonance using a biacore . a nm portion of purified rbd-ig of tor variants was bound to an anti-human antibody (sigma i- ) immobilized on a cm sensor chip. soluble human ace in hbs-ep buffer (biacore) was introduced at a flow rate of ml/min at concentrations of , , , , . , and nm. kinetic parameters were determined with bia-evaluation software (biacore). infection with s-protein-pseudotyped retrovirus mlv expressing gfp and pseudotyped with sars-cov s-protein variants has been previously described . briefly, mlv virions were generated by cotransfecting plasmid encoding mlv gag and pol genes, the pqcxix vector (bd sciences) expressing gfp, and plasmid encoding s-protein variants. at h post-transfection, cell supernatants were normalized for reverse transcriptase activity and incubated with hek t cells transfected with ace variants. at h postincubation, gfp fluorescence of infected cells was measured by flow cytometry. in some cases, cells were preincubated for h with the ace inhibitor mln- or with nh cl before infection, and equivalent concentrations were maintained during infection. infection was also assayed with a lentivirus expressing a luciferase reporter gene and pseudotyped with s-protein variants, as previously described (sui et al, ) . briefly, t cells were cotransfected with plasmid encoding s-protein variants, a plasmid (pcmvdr . ) encoding hiv- gag-pol, and a plasmid (phiv-luc) encoding the firefly luciferase reporter gene under control of the hiv- long terminal repeat. 
at days post-transfection, viral supernatants were harvested and µl of s-protein-pseudotyped virus was used for infection of ace -expressing t cells in a -well plate. infection efficiency was quantitated by measuring the luciferase activity in the target cells with an eg&g berthold microplate luminometer lb v. the enzymatic activity of ace was assayed using a fluorogenic substrate, -methoxycoumarin-yvadapk( , -dinitrophenyl)-oh (r&d systems). cleavage of this peptide by ace removes the , -dinitrophenyl moiety that quenches the fluorescence of the -methoxycoumarin moiety. a mg portion of a soluble form of ace was incubated in mm tris buffer with varying concentrations of the ace inhibitor mln- (dales et al, ) . fluorescence was monitored at min intervals using an excitation wavelength of nm and emission wavelength of nm. supplementary data are available at the embo journal online. amino acids to of the severe acute respiratory syndrome coronavirus spike protein are required for interaction with receptor recombinant avian infectious bronchitis virus expressing a heterologous spike gene demonstrates that the spike protein is a determinant of cell tropism persistent infection of sars coronavirus in colonic cells in vitro substrate-based design of the first class of angiotensin-converting enzyme-related carboxypeptidase (ace ) inhibitors organ distribution of severe acute respiratory syndrome (sars) associated coronavirus (sars-cov) in sars patients: implications for pathogenesis and virus transmission pathways identification of a novel coronavirus in patients with severe acute respiratory syndrome aetiology: koch's postulates fulfilled for sars virus coronavirus spike proteins in viral entry and pathogenesis isolation and characterization of viruses related to the sars coronavirus from animals in southern china switching species tropism: an effective way to manipulate the feline coronavirus genome tissue distribution of ace protein, the functional receptor for sars 
coronavirus. a first step in understanding sars pathogenesis quantitative mrna expression profiling of ace , a novel homologue of angiotensin converting enzyme molecular evolution of the sars coronavirus during the course of the sars epidemic in china susceptibility to sars coronavirus s protein-driven infection correlates with expression of angiotensin converting enzyme and infection can be blocked by soluble receptor cellular entry of the sars coronavirus sars-associated coronavirus cd l (l-sign) is a receptor for severe acute respiratory syndrome coronavirus a novel coronavirus associated with severe acute respiratory syndrome newly discovered coronavirus as the primary cause of severe acute respiratory syndrome retargeting of coronavirus by substitution of the spike glycoprotein ectodomain: crossing the host cell species barrier efficient replication of severe acute respiratory syndrome coronavirus in mouse cells is limited by murine angiotensin-converting enzyme angiotensin-converting enzyme is a functional receptor for the sars coronavirus laboratory diagnosis of four recent sporadic cases of community-acquired sars the genome sequence of the sars-associated coronavirus dc-sign and dc-signr interact with the glycoprotein of marburg virus and the s protein of severe acute respiratory syndrome coronavirus retroviruses pseudotyped with the severe acute respiratory syndrome coronavirus spike protein efficiently infect cells expressing angiotensin-converting enzyme highly infectious sars-cov pseudotyped virus reveals the cell tropism and its correlation with receptor expression infectious diseases. 
mounting lab accidents raise sars fears severe acute respiratory syndrome characterization of a novel coronavirus associated with severe acute respiratory syndrome the nterminal region of the murine coronavirus spike glycoprotein is associated with the extended host range of viruses from persistently infected murine cells cross-host evolution of severe acute respiratory syndrome coronavirus in palm civet and human prior infection and passive transfer of neutralizing antibody prevent replication of severe acute respiratory syndrome coronavirus in the respiratory tract of mice potent neutralization of severe acute respiratory syndrome (sars) coronavirus by a human mab to s protein that blocks receptor association evaluation of human mab r in immunoprophylaxis of sars by an animal study, epitope mapping and analysis of spike variants ace x-ray structures reveal a large hinge-bending motion important for inhibitor binding and catalysis molecular constraints to interspecies transmission of viral pathogens mice susceptible to sars coronavirus a -amino acid fragment of the sars coronavirus s protein efficiently binds angiotensin-converting enzyme civets are equally susceptible to experimental infection by two different severe acute respiratory syndrome coronavirus isolates the sars-cov s glycoprotein: expression and functional characterization ph-dependent entry of severe acute respiratory syndrome coronavirus is mediated by the spike glycoprotein and enhanced by dendritic cell transfer through dc-sign collectrin, a collecting duct-specific transmembrane glycoprotein, is a novel homolog of ace and is developmentally regulated in embryonic kidneys reconstruction of the most recent common ancestor sequences of sars-cov s gene and detection of adaptive evolution in the spike protein management and prevention of sars in china key: cord- -wgt kg f authors: diego-martin, borja; gonzález, beatriz; vazquez-vilar, marta; selma, sara; mateos-fernández, rubén; gianoglio, silvia; 
fernández-del-carmen, asun; orzáez, diego title: pilot production of sars-cov- related proteins in plants: a proof of concept for rapid repurposing of indoors farms into biomanufacturing facilities date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: wgt kg f the current covid- crisis is revealing the strengths and the weaknesses of the world’s capacity to respond to a global health crisis. a critical weakness has resulted from the excessive centralization of the current biomanufacturing capacities, a matter of great concern, if not a source of nationalistic tensions. on the positive side, scientific data and information have been shared at an unprecedented speed fuelled by the preprint phenomenon, and this has considerably strengthened our ability to develop new technology-based solutions. in this work we explore how, in a context of rapid exchange of scientific information, plant biofactories can serve as a rapid and easily adaptable solution for local manufacturing of bioreagents, more specifically recombinant antibodies. for this purpose, we tested our ability to produce, in the framework of an academic lab and in a matter of weeks, milligram amounts of six different recombinant monoclonal antibodies against sars-cov- in nicotiana benthamiana. for the design of the antibodies we took advantage, among other data sources, of the dna sequence information made rapidly available by other groups in preprint publications. mabs were all engineered as single-chain fragments fused to a human gamma fc and transiently expressed using a viral vector. in parallel, we also produced the recombinant sars-cov- n protein and its receptor binding domain (rbd) in planta and used them to test the binding specificity of the recombinant mabs. finally, for two of the antibodies we assayed a simple scale-up production protocol based on the extraction of apoplastic fluid. 
our results indicate that gram amounts of anti-sars-cov- antibodies could be easily produced in little more than weeks in repurposed greenhouses with little infrastructure requirements using n. benthamiana as production platform. similar procedures could be easily deployed to produce diagnostic reagents and, eventually, could be adapted for rapid therapeutic responses. the current pandemic is exposing several weaknesses in our ability to respond to a global crisis, one of which is the insufficient and heavily centralized distribution of the world manufacturing capacity of bioproducts such as antibodies, vaccines and other biological reagents, especially proteins. since it is economically impracticable to ensure readiness by maintaining inactive infrastructures during long periods of normality, the development of dual-use systems has been proposed, which would serve regular production needs in normal times but could be rapidly repurposed to strategic manufacturing requirements in times of crisis. ideally, such adaptable infrastructures should be widespread to serve local demand in case of emergency. recombinant protein production in plants is a technologically mature bioengineering discipline, with most current plant-based bioproduction platforms making use of non-food crops, mainly the nicotiana species n. tabacum and n. benthamiana, as biomanufacturing chassis (moon et al., ; capell et al., ) . n. benthamiana is most frequently used in association with agrobacterium-mediated transient expression, also known as agroinfiltration, a technology that dramatically reduces the time required for product development. briefly, agroinfiltration consists of the massive delivery of an agrobacterium suspension culture to the intercellular space of plant leaves, either by pressure, using a syringe (small scale), or by applying vacuum to plants whose aerial parts have been submerged in a diluted agrobacterium culture (large scale). 
agrobacterium transfers its t-dna to the cell nucleus, thereby massively reprogramming the plant cell machinery towards the synthesis of the t-dna-encoded protein(s)-of-interest. transient expression of the transgene is often assisted by self-replicating deconstructed virus vectors that amplify the transgene dose, thus boosting protein production by several orders of magnitude (gleba et al., ) . other systems, such as the peaq system, rely on viral genetic elements for boosting expression without resorting to viral replication (sainsbury et al., ). transient expression in n. benthamiana has become the standard in plant-based recombinant protein production due to a unique combination of advantages, with speed and high yield as the most obvious ones. maximum production levels in the g/kg fresh weight (fw) range for certain highly stable proteins such as antibodies have been reported (marillonnet et al., ) . regarding speed, the in-planta incubation times required to obtain maximum yield of recombinant protein are no more than two weeks. an important, often insufficiently highlighted feature of n. benthamiana transient expression is its relatively small infrastructure requirements, partially overlapping with those employed in more conventional, medium/high-tech indoor agriculture, such as hydroponics, vertical farming, etc. (buyel, ) . in this context, when confronted with the covid- crisis, we decided to partially reorient the activities in our academic lab, which is equipped with a multipurpose glass greenhouse facility, towards the production of sars-cov- antigens and antibodies against the virus. here we describe the recombinant production, purification and analysis of six anti-sars-cov- monoclonal antibodies at laboratory scale, plus a pilot upscaling of two of those six antibodies. next to production scale, a critical parameter to assess was the response time. 
the process described here started in mid-april with the selection of literature-available antibody variable sequences and finalized nine weeks later with approximately . g of anti-sars-cov- antibody (ab) produced in modular batches of n. benthamiana plants and formulated as one litre of ab-enriched plant apoplastic fluid. based on this experience, we estimate that the same process can be shortened to - weeks with small pre-adaptations, a remarkably short reaction time for a de novo antibody production system. absolutely key for this fast reaction is the immediate availability of scientific data, including antibody sequences, in pre-print repositories. this is, in our opinion, one of the most positive lessons that can be extracted from the covid- crisis. we discuss here the possible applications of the fast plant-produced antigens and antibodies in diagnostics and therapy and propose repurposing high-tech agricultural facilities as an alternative for decentralized biomanufacturing in times of crisis. both nicotiana benthamiana wild type plants and , -xylosyltransferase/alpha , fucosyltransferase (Δxt/ft) rnai knock down lines (strasser et al., ) were grown in the greenhouse. growing conditions were °c (light)/ °c (darkness) with a -h-light/ -h-dark photoperiod. all sequences were cloned and assembled using the goldenbraid (gb) assembly system (https://gbcloning.upv.es). antibody sequences were obtained from literature (see table ). all antibodies were cloned as single chain antibodies fused to the human igg fc domain. those antibodies derived from synthetic or camelid single domain vhh libraries (sybody , sybody and nanobody ) were designed as direct fusions. cr , cr and cr human monoclonal antibodies were redesigned as single chain variable fragments (scfv) by connecting the variable light (vl) and heavy (vh) chains with a ggggsggggsggggssgggs peptide linker. 
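the scfv redesign described above amounts to joining two variable-domain sequences with the quoted linker. a minimal sketch of that step follows, under stated assumptions: the vl-linker-vh order is assumed (the text does not state the orientation), the function name `build_scfv` is hypothetical, and the toy input sequences are placeholders, not the variable regions of any antibody used in this work. fusion to the human igg fc domain happens in the expression vector and is not modelled here.

```python
# illustrative sketch only; vl-linker-vh order and the toy sequences are assumptions
LINKER = "GGGGSGGGGSGGGGSSGGGS"  # peptide linker quoted in the text

# the twenty standard amino acids, one-letter code
VALID_RESIDUES = set("ACDEFGHIKLMNPQRSTVWY")


def build_scfv(vl: str, vh: str, linker: str = LINKER) -> str:
    """Join a light-chain and a heavy-chain variable domain into one scfv sequence."""
    parts = [p.strip().upper() for p in (vl, linker, vh)]
    for part in parts:
        if not part:
            raise ValueError("empty sequence")
        bad = set(part) - VALID_RESIDUES
        if bad:
            raise ValueError(f"non-standard residues: {sorted(bad)}")
    return "".join(parts)


if __name__ == "__main__":
    toy_vl = "DIQMTQSPSS"  # placeholder fragment, not a real variable domain
    toy_vh = "EVQLVESGGG"  # placeholder fragment, not a real variable domain
    print(build_scfv(toy_vl, toy_vh))
```

checking residues before concatenation catches stray characters (gaps, stop symbols) that would otherwise propagate into the codon-optimisation step.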
antibody sequences were codon optimised for n. benthamiana with the idt optimization tool at http://eu.idtdna.com/codonopt . the sars-cov- antigen sequences used (n protein, yp_ . ; and s-protein rbd domain, yp_ . , aa - ) derive from the wuhan strain nc_ . four different versions of rbd were designed corresponding to (i) the native sequence with a c-terminal xhis-tag or (ii) an n-terminal xhis-tag and a c-terminal kdel sequence for er retention, and (iii and iv) their corresponding n. benthamiana codon optimised counterparts, using the same tool as above. dna sequences were domesticated as level phytobricks for gb cloning and ordered for synthesis as double-stranded dna fragments (gblocks, integrated dna technologies). gblocks were first cloned into the domestication vector pupd (vazquez-vilar et al., ) in a bsmbi golden gate restriction/ligation reaction ( °c - min, x ( °c - min / °c - min), °c - min, °c - min). the ligation product was transformed into e. coli top electrocompetent cells and positive clones were verified by restriction digestion analysis and sequencing. pupd level phytobricks were then cloned into the expression vectors pgreen sp-higg (antibody sequences), pcambiav (rbd sequences) or pcambiav (n sequences). pgreen sp-higg is a pgreen vector based adaptation of the magnicon® ' provector pich (icon genetics) that is designed for bsai cloning of gb (b -b ) standard parts as in-frame fusions with the tobacco ( - )-beta-glucanase signal peptide and the human igg fc domain. similarly, pcambiav and pcambiav are pcambia based adaptations of the magnicon® ' provector pich that are designed for bsai cloning of gb standard parts as in-frame fusions with tobacco ( - )-beta-glucanase signal peptide (pcambiav , for expression of secreted proteins) or without any subcellular localization signal (pcambiav , for expression of cytoplasmic proteins). assembly reactions were performed as above, and the ligation reactions were transformed into e. 
coli top electrocompetent cells. positive clones were verified by restriction digestion analysis. all level parts generated in this work are listed in supplementary table . for transient expression in n. benthamiana, the plasmids were transformed into agrobacterium tumefaciens strain gv c c by electroporation. the same strain but carrying the psoup helper plasmid was employed to allow the replication of the pgreen vectors which encode the antibodies. overnight grown exponential cultures were collected by centrifugation and the bacterial pellets were resuspended in agroinfiltration solution ( mm mes, mm mgcl , µm acetosyringone, ph . ) and incubated for h at rt in a horizontal rolling mixer. for small scale agroinfiltration, culture optical density at nm was adjusted to . with agroinfiltration solution and the bacterial suspensions harbouring the ' antibody or antigen modules, the integrase (pich ), and the ' module (pich ) were mixed in equal volumes. control samples were agroinfiltrated with pich _dsred and integrase module. agroinfiltration of to -week-old n. benthamiana plants was carried out through the abaxial leaf surface using a ml needle-free syringe (becton dickinson s.a.). for pilot scale production, the bacterial suspensions were prepared as above except that a lower od was used ( . for sybody agroinfiltration and . for nanobody agroinfiltration). additionally, for sybody agroinfiltration, a bacterial suspension of pich _dsred, a magnicon® ´module encoding the fluorescent protein dsred, was added to the final agrobacterium infiltration solution in a ratio : : . : . (pich :pich :pgreensp-sybody -higg :pich _dsred). delivery of agrobacterium to the plant cells was carried out by vacuum infiltration in a vacuum degassing chamber (model dp , applied vacuum engineering) provided with a l infiltration tank. 
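the od adjustment and mixing steps above reduce to simple dilution arithmetic (c1·v1 = c2·v2) followed by a proportional split of the final volume. the sketch below illustrates this; the function names and every number in the usage lines are hypothetical and are not the od values or ratios used in the study.

```python
def dilution_volumes(od_stock: float, od_target: float, final_volume_ml: float):
    """Return (culture_ml, diluent_ml) computed from C1*V1 = C2*V2."""
    if od_stock <= 0 or od_target <= 0 or final_volume_ml <= 0:
        raise ValueError("all inputs must be positive")
    if od_target > od_stock:
        raise ValueError("target OD exceeds stock OD; dilution cannot raise OD")
    culture_ml = od_target * final_volume_ml / od_stock
    return culture_ml, final_volume_ml - culture_ml


def split_by_ratio(total_volume_ml: float, ratios):
    """Split a total volume into parts proportional to the given ratios."""
    if not ratios or any(r <= 0 for r in ratios):
        raise ValueError("ratios must be positive")
    total = sum(ratios)
    return [total_volume_ml * r / total for r in ratios]


# Hypothetical numbers, chosen only to make the arithmetic easy to follow.
print(dilution_volumes(od_stock=2.0, od_target=0.5, final_volume_ml=100.0))  # (25.0, 75.0)
print(split_by_ratio(90.0, [1, 1, 1]))  # equal volumes: [30.0, 30.0, 30.0]
```

the same `split_by_ratio` call covers both the equal-volume small-scale mix and an unequal pilot-scale mix, provided the ratio list follows the order of the modules being combined.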
the aerial part of whole plants (seven plants at a time) was immersed into the agrobacterium infiltration solution; vacuum was applied for min at a vacuum pressure of . bar and then slowly released. days post-vacuum agroinfiltration leaves were excised and then infiltrated with mm phosphate buffer ( . mm nah po , . mm na hpo . h o, ph ), without (sybody ) or with (nanobody ) . mm pmsf (sigma-aldrich, # ), following the same procedure as the vacuum agroinfiltration. after eliminating the buffer excess with tissue paper, the leaves were introduced into mesh zipped bags and then centrifuged using a portable cloth dryer orbegozo sc . thus, the apoplastic fluid was obtained from the drain tube. the apoplastic fluid was centrifuged ( min, x g, at °c) to remove any cell debris and agrobacterium, the supernatant was collected and then fractions of ml were concentrated times using kda amicon ultra- k centrifugal filters (millipore) after centrifugation ( min, x g, at °c). protein crude extracts were obtained by homogenizing ground frozen leaf tissue with cold pbs buffer ( mm nah po , mm na hpo . h o, mm nacl, ph . ) in a : (w/v) and were centrifuged at rpm for min at °c. for antibody purification, g of ground agroinfiltrated tissue were extracted in ml of cold mm phosphate buffer. samples were centrifuged at x g for min and the supernatant was transferred to a clean tube and further clarified by filtration through a . µm membrane filter. the recombinant antibodies were purified by affinity chromatography with protein a agarose resin (abt technology) following a gravity-flow procedure according to the manufacturer's instructions. mm citrate buffer ph was used for elution and m tris-hcl ph was used for neutralization of the eluted sample ( . µl for each µl elution fraction). purified antibodies were quantified using the bio-rad protein assay following the manufacturer's instructions and using bsa for standard curve preparation. the n. 
benthamiana leaves infiltrated with the different sars-cov- proteins were collected (rbd) or - (n protein) days post infiltration (dpi). leaves were frozen in liquid nitrogen and stored at - °c until used. protein extraction was performed using - g of ground frozen tissue in volumes of cold extraction buffer. three different buffers were tested as a first approach, in order to optimize the purification yields. buffer a: pbs buffer with mm imidazole, ph . buffer b: buffer a supplemented with % triton x- , and buffer c: buffer b supplemented with % glycerol, % sucrose and . % -β-mercaptoethanol. samples were vigorously vortexed and centrifuged at x g for min at °c. the supernatant was carefully transferred to a clean tube and filtered through a . µm syringe filter. protein purification was carried out by ni-nta affinity chromatography as described previously (fernandez-del-carmen et al., ). purified proteins were quantified using the bio-rad protein assay following the manufacturer's instructions and using bsa for standard curve preparation. proteins were separated by sds-page on nupage % bis-tris gels (invitrogen) using mes-sds running buffer ( mm mes, mm tris-base, . mm sds, mm edta, ph . ) under reducing conditions. gels were visualized by coomassie blue staining. for western blot analysis, proteins were transferred to pvdf membranes (amersham hybond™-p, ge healthcare) by semi-wet blotting (xcell ii™ blot module, invitrogen, life technologies) following the manufacturer's instructions. blots were blocked with % ecl prime blocking agent (ge healthcare) in pbs-t (pbs buffer supplemented with . % (v/v) tween- ). for anti-sars-cov- antibody detection, the blots were incubated with : hrp-conjugated rabbit anti-human igg (sigma-aldrich, #a ). for sars-cov- antigen detection, the blots were incubated with : anti-his mouse monoclonal primary antibody (qiagen, # ) and then incubated with : peroxidase-labelled anti-mouse igg secondary antibody (ge healthcare).
blots were developed with ecl prime western blotting detection reagent (ge healthcare) and visualised using a fujifilm las- imager. the overnight coating of costar well eia/ria plates (corning) was carried out at °c with µl of µg/ml rbd (raybiotech, # - ) or bsa (used as control) in coating buffer ( mm na co , mm nahco , ph . ). after washes with µl of pbs, the plate was blocked with µl of a % (w/v) ecl advance blocking reagent (ge healthcare) solution in pbs-t (pbs supplemented with . % (v/v) tween- ) for h at rt. the plate was washed times with pbs, and then, starting at µg of the purified antibody per well ( µl), : serial dilutions in blocking solution were incubated for h min at rt. after washing steps with pbs-t (pbs buffer supplemented with . % tween- ), : hrp-labelled rabbit anti-human igg (sigma-aldrich, #a ) in blocking solution was added. after h, the plate was washed with pbs and the substrate o-phenylenediamine dihydrochloride sigmafast™ opd tablet (sigma-aldrich, #p ) was added (following manufacturer's instructions). reactions were stopped with µl m hcl per well and absorbance was measured at nm. the endpoint titer was determined as the last concentration of each purified antibody showing an absorbance value higher than the value defined as cutoff (mean blank + sd). blank is defined as the values from each elisa test against bsa (zrein et al., ; armbruster and pry, ) . the sandwich elisas were performed as described in the antigen elisa section with a few changes. the plates were coated with µl of µg/ml murine anti-his mab (qiagen, # ), and after blocking, the plates were incubated with µl of the crude extracts of the (rbd/n) antigen expressing leaves serially diluted ( : ) in bsa . %. wt crude extracts were used as negative control. the crude extracts were prepared by adding a volume of pbs buffer corresponding to times the mass of the ground tissue in liquid nitrogen. 
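the endpoint-titer rule given above for the antigen elisa (cutoff = mean of the bsa blanks plus a multiple of their sd) can be sketched as follows. the sd multiplier `k` is left as a parameter because its value is not legible in the text; the use of the sample sd, the function names and all readings are assumptions made for illustration.

```python
from statistics import mean, stdev


def elisa_cutoff(blank_values, k: float) -> float:
    """Cutoff = mean(blank) + k * SD(blank); k is supplied by the caller."""
    return mean(blank_values) + k * stdev(blank_values)  # sample SD (assumption)


def endpoint_titer(concentrations, absorbances, blank_values, k: float):
    """Return the lowest concentration of the series still above the cutoff.

    The series is walked from the most to the least concentrated well and
    stops at the first well at or below the cutoff. Returns None if even
    the most concentrated well fails.
    """
    cutoff = elisa_cutoff(blank_values, k)
    endpoint = None
    for conc, absorbance in sorted(zip(concentrations, absorbances), reverse=True):
        if absorbance <= cutoff:
            break
        endpoint = conc
    return endpoint


# Hypothetical readings for illustration; these are not data from the study.
conc = [8.0, 4.0, 2.0, 1.0, 0.5]
a_rbd = [2.0, 1.5, 0.8, 0.3, 0.1]   # wells coated with antigen
a_bsa = [0.10, 0.12, 0.08]          # blank wells coated with BSA
print(endpoint_titer(conc, a_rbd, a_bsa, k=3))  # k=3 is a placeholder value
```

stopping at the first well below the cutoff, rather than taking the lowest concentration that happens to exceed it anywhere in the series, avoids reporting a titer driven by a single noisy well at high dilution.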
then the mix was centrifuged ( rpm, °c, min) and the supernatant was subjected to sonication before use. the antigens were sandwiched with µg of the corresponding purified antibody (or µl of the apoplastic fluid in % blocking reagent) per well ( h min incubation, rt). the same procedure as in the antigen elisa was followed for the incubation with the conjugated secondary antibody, colorimetric reaction and measurement. six different antibody sequences were selected for recombinant production in n. benthamiana, following a plant deconstructed viral strategy based on magnifection technology, as described earlier (marillonnet et al., ) (see table ). four of those were directed against the receptor binding domain (rbd) of the sars-cov- spike (s) protein, whereas the remaining two were directed against the n protein. all six antibodies were engineered as single polypeptide chains fused to the human cɣ -cɣ constant immunoglobulin domains. three of them, those derived from single chain camelid or synthetic vhh antibody libraries, were produced as direct fusions. the other three, derived from full-size human monoclonal antibodies, were redesigned as scfvs, using a linker peptide that connects vh and vl regions (see fig a) . the nucleotide sequences of the different variable regions were all obtained from the literature, then chemically synthesized with appropriate extensions and cloned into a destination magnifection-adapted vector using a type iis restriction enzyme strategy. the cloning cassette was flanked by a β-endoglucanase signal peptide for apoplastic localization in n-terminal and the human cɣ -cɣ domains of the human igg in the c-terminal side. the resulting vectors were transferred to agrobacterium cultures and agroinfiltrated in n. benthamiana leaves in combination with a ´ magnicon® module, containing the rna polymerase and movement protein, and with an integrase module (fig b) . for antibody production we used wild-type and rnai Δxt/ft glycoengineered n. 
benthamiana plant lines, the latter lacking plant-specific xylose and fucose glycosylation (strasser et al., ). infiltrated leaves were examined daily, and only minimal damage was observed in the agroinfiltrated tissues during the incubation period. after seven days, leaf samples were collected, ground, and crude extracts were analyzed by sds-page. as can be observed in fig a-b (upper panel), all samples produced coomassie-detectable bands of the expected antibody size. scfv-igg - kda antibodies migrated slightly above the - kda rubisco large subunit, partially masking its detection. vhh-igg antibodies migrated at the expected - kda size. the identity of the coomassie bands was confirmed by western blot using an anti-human igg antibody for detection (fig a-b, lower panel). as shown in fig a-b, under reducing conditions lower molecular weight (mw) bands were also detected, probably as a result of partial proteolytic degradation. small-scale affinity purification was carried out for all six antibodies produced in Δxt/ft plants using protein a affinity chromatography (fig c-d). the resulting purified antibodies were used to estimate the yield of the final product, which ranged from . µg/g fw (cr antibody) to . µg/g fw (nanobody antibody) (see table ). the in-planta production of sars-cov- rbd and n protein antigens was also assayed in parallel using a strategy similar to that described for antibody production. for this purpose, two versions of the expression vector were designed for rbd, one with the native viral sequence and the other with an n. benthamiana codon-optimized sequence. for the n protein, only the n. benthamiana codon-optimized sequence was employed. for rbd, native and codon-optimized versions were targeted to the apoplast with the tobacco glucan endo- , -beta-glucosidase signal peptide, and versions containing a kdel peptide for er retention were also generated.
all nucleotide sequences were chemically synthesized with a small nucleotide extension coding for a histidine tag for detection (fig a). as described for antibody production, magnicon®-derived ´ vector modules encoding rbd and n proteins were agroinfiltrated in combination with an integrase module and a ´-module lacking any additional subcellular localization signal. shorter incubation times were chosen for antigen production than for antibodies because antigen constructs produced different degrees of necrotic lesions in the leaves, ranging from mild symptoms in n protein to severe necrosis after four days in native rbd. for those constructs producing more severe lesions, incubation time was reduced to five days, and for the rest the incubation period was extended to seven days. rbd was extracted and purified using small-scale affinity chromatography with nickel columns, and the resulting coomassie and western blot analyses are shown in fig b. rbd can be detected as a major band of an estimated kda, with the presence of higher mw bands that suggest multimerization. er retention did not improve expression levels of rbd for the native version, nor for the n. benthamiana optimized one (data not shown). addition of % triton x- to the standard extraction buffer (see materials and methods) did not improve the yield, which was estimated as - µg/g fw (table ). n protein was extracted from agroinfiltrated leaves and affinity purified following the same procedure described for rbd. a major kda band was detected both in the crude extract and upon purification (fig b). small-scale affinity chromatography with nickel columns gave an estimated yield of µg/g fw for n protein (table ). binding activities of affinity-purified anti-rbd antibodies were analysed by antigen elisa as shown in fig a. as expected, all assayed antibodies were active in binding their respective antigen. endpoint dilution titers were calculated for anti-rbd antibodies using a commercial antigen.
sybodies and and nanobody showed high dilution titers, ( . µm, . µm and . µm, respectively), but the performance of cr was significantly lower ( . µm). in a parallel experiment, we tested the ability of plant-made antibodies to selectively detect our own plant-made antigens, including here also the n protein, using a sandwich elisa approach. for this analysis, elisa plates were coated with a murine anti-his mab, incubated with serial dilutions of crude plant extracts from antigen-producing plants and sandwiched with purified plant-made antibodies. as shown in fig b- c, all antigen-producing plant extracts gave sandwich-elisa signals significantly above the background when assayed using their cognate antibodies, thus evidencing the capacity of both, antibodies and antigens, to function as potent diagnostic tools. background signals in this experiment are likely due to cross-reaction of the anti-human igg secondary antibody with the murine anti-his mab, and could be easily reduced for more potent diagnostic applications by employing recombinant antibody formats other than igg. in the design of a pilot upscaling experiment, we favoured modularity and tried to maximize the affordability and adaptability of the process by reducing the requirements for highly specialized lab equipment. we carried out a final agroinfiltration for recombinant antibody production using a total of plants, equivalent to approximately . kilograms of fresh plant material. the plants were divided in two batches of plants each, and used to produce sybody and nanobody respectively, as these antibodies showed the most promising binding activities and yields. to facilitate the upscaling of the agroinfiltration process, plant seedlings were transplanted in growth modules, each module comprising seven pots kept together in a double layer of disposable plastic-board hexagons as shown in fig a. each production batch consisted in eight hexagonal modules. 
when plants were six weeks old, they were agroinfiltrated by submerging each hexagon upside down into a cm diameter cylindrical tank filled with l of an agrobacterium suspension, set inside a cylindrical vacuum degas chamber (fig a). in this way, seven plants at a time were vacuum-agroinfiltrated by slowly releasing vacuum while leaves remained submerged in the solution. next, plants were rinsed, brought back to the growth chamber and incubated for days before harvest. two different concentrations of the agrobacterium suspension were used in this experiment. one of them (sybody ) consisted of an od . final mix containing plasmids pich , pich , pgreensybody -igg , and pich _dsred at a : : . : . ratio, where pich _dsred is a magnicon® ´module encoding dsred. the fluorescent marker was added to the infiltration mix to monitor the extension of the viral infection foci. as described elsewhere (julve et al., ; julve parreño et al., ), superinfection exclusion among viral clones yields mosaic-like expression patterns of individual clones; therefore the tiles produced by red fluorescent proteins were used as an indication of the extension and distribution of the unlabelled foci producing the recombinant antibody. in parallel, nanobody upscaled production was undertaken by agroinfiltration of an od . agrobacterium culture mix containing pich , pich and pgreennanobody -igg at a : : ratio. after days, dsred tiles in the sybody experiment, clearly visible to the naked eye, had completed their expansion in most agroinfiltrated leaves, an indicator that the expression tiles had covered the whole leaf surface (fig b). at this stage, leaves were harvested and subjected to an apoplastic fluid recovery assay, where > . kg batches of detached leaves were vacuum infiltrated in mm phosphate buffer using the same vacuum device as described above.
once rinsed to remove the excess of buffer, leaves were packed in mesh zipped bags and spun in a portable spin clothes dryer, and the intercellular apoplastic fluid was recovered from the drain tube. with this simple procedure, between and millilitres of apoplastic fluid (sybody and nanobody , respectively) were recovered from . kg of detached leaves. a fraction of the apoplastic fluid of both antibodies was concentrated times in kda centricons, and the rest was kept refrigerated for further analysis. fig c-d show the coomassie staining and western blot analysis of crude extracts as well as apoplastic fluid preparations, and their corresponding purifications. crude extracts in this pilot experiment showed a vhh-igg band similar in intensity to that obtained in small scale experiments (data not shown). interestingly, apoplastic fluid consisted of a very simplified mix of proteins, with the recombinant antibody being among the most predominant ones. as shown, the different optical density of the agrobacterium culture, together with the presence of a competing dsred clone, clearly influenced the accumulation levels, with the yields of nanobody clearly outperforming those of its sybody counterpart. unfortunately, the antibodies seemed partially degraded, as indicated by the presence of two bands smaller than the expected vhh-cɣ -cɣ size, which could be compatible with degradation fragments. degradation was only partially solved with the addition of the protease inhibitor pmsf to the recovered phosphate buffer, as shown with nanobody production (fig d). despite degradation, from a densitometric analysis we estimate that the recovered apoplastic fluid contains . g per litre of intact full-size mab.
finally, we performed sandwich elisa tests of sybody and nanobody (fig e and fig f, respectively) using the total and concentrated apoplastic fluid as detection reagent against serial dilutions of crude plant extracts from rbd-producing plants, showing that this simple antibody preparation can be directly employed in detection procedures without the need for additional purification steps. several n. benthamiana-dedicated bioproduction facilities are functioning worldwide, such as those of leaf expression systems in the uk (dobon, ), icon genetics (giritch et al., ) and fraunhofer in germany (wirz et al., ) or kentucky bioprocessing in the us (olinger et al., ), among others. notably, medicago recently announced the building of a new sqm facility with capacity for around - million planned doses of flu vaccine per year. such facilities usually involve separate modules for upstream processing, namely a wet-lab module for preparation of the bacterial inoculum, a regular plant growth chamber, an agroinfiltration room, and a post-infiltration growth chamber. in addition, downstream processing facilities are often situated next to the production ones to minimize the handling time of fresh tissues. whereas the installed capacity of plant-dedicated biofactories is continuously growing, it is clearly insufficient to respond to global or even regional demands in times of crisis. we reasoned that, at least for upstream processes, the infrastructures required for medium scale n. benthamiana-based production are not radically different from those employed in high-tech agriculture practices such as hydroponics, aeroponics or vertical farming, and thus high-tech agriculture facilities could be easily repurposed as biomanufacturing facilities in a matter of days or weeks (mcdonald and holtz, ).
as an exercise to practically test the repurposing requirements, we describe here the partial adaptation of our research laboratory and greenhouse facilities to the production of sars-cov- -related antigens and antibodies using n. benthamiana agroinfiltration as manufacturing platform. in figure we show a chronogram of the activities undertaken by our team towards the production of sars-cov- antigens and antibodies, from the initial selection of the nucleotide sequences of the genes-of-interest to the production of one litre of plant apoplastic fluid of recombinant sybody and nanobody . in our hands, the whole process took a total of nine weeks with non-exclusive personnel dedication and partially restricted access to our facilities. the process can be divided into three periods: the first step (design), taking approximately ten days, was dedicated to construct design and gene synthesis. it was pivotal in this step to have open access to viral and antibody sequences deposited in pre-print repositories. particularly remarkable was the openness of academic labs that immediately released primary sequence information of partially characterized anti-sars-cov- monoclonal antibodies, an exercise that should serve as an example in the future. due to our limited testing capacity, the number of parallel designs per product was kept relatively low, and several design decisions (e.g. codon optimization, purification tags) were taken based on a best-guess approach. ideally, proper crisis preparedness should involve centralized automated equipment such as a biofoundry (hillson et al., ), with which the design space could be extended dramatically without causing delay. the second phase (build) was dedicated to cloning and construct building and lasted less than three weeks. our lab already had adapted plasmids and cloning procedures from previous projects (sarrion-perdigones et al., ; vazquez-vilar et al., ), therefore no significant time lag occurred in this step.
importantly, this period also involved seeding a new plant batch at the scale required for pilot production in week seven ( plants distributed in hexagonal modules in this case). in a third phase (test), starting on week , all constructs were infiltrated at a small scale (three replicate leaves each), shortly incubated ( or days) and then tested functionally in parallel analyses. this small-scale assay took two additional weeks, summing a total of approximately days for the complete process. the synthetic biology-inspired design-build-test (dbt) process described above is conceived as an iterative one, so that new dbt cycles can be run fuelled by the conclusion of previous cycles to generate new optimized versions of the product. based on this experience, we estimate that the whole dbt cycle could be shortened to days or less by optimising the pipeline (e.g. introducing centralized, automatized design and build phases), and by improving preparation and anticipation in the facilities (fig ) . for instance, note that moving from step to step without delay requires a small batch of plants be always maintained in the facility, as it was in our case to supply our research requirements. this only involves transplanting - seedlings every three weeks, and then disposing of them every other three weeks once they start flowering. if a continuous plant supply is not maintained, a minimum of three extra weeks needs to be considered to have plants ready for the first test iteration. whereas the first version of products shown here lack iterative optimization, it would serve eventually to respond to the most urgent demands. in our case, as the results of the first dbt process arose, the best performing version (v . ) of two of the products were taken to production phase. in the exercise shown here, the upscaling was relatively small ( plants, approximately . kg fw). post-agroinfiltration incubation time was extended to days to maximize yields. 
in the meantime, optimization of the purification/extraction methods was undertaken at small scale, so that the new knowledge acquired could be applied in the batch purification of the pilot experiment. in a crisis scenario, and given the modularity of the proposed scheme, several medium-size production modules can be replicated in a farming facility, and reproduced in several farms, allowing easy scalability. successive iterations with small scale agroinfiltration could be an effective way to maximize yields and reduce development times by comparing different small-scale strategies. it should be mentioned that the basic apoplast-based downstream processing proposed here could only be used, with the necessary adaptations, in a limited number of crisis-related applications, mainly related to detection and diagnosis. other uses, certainly therapeutic ones, would involve additional regulatory considerations including gmp downstream facilities, which are beyond the scope of this exercise. as a result of this experience, several improvements can be envisioned. we employed the magnicon® vector system with few adaptations for all the attempted proteins. although magnicon® produces maximum yields for many products, some proteins, particularly viral antigens, may express better with other (e.g. non-viral) systems. in our experiments, antigens showed rather low expression levels despite optimization attempts using codon optimization and different localization signals. in adapting to an emergency, it would be advisable to perform initial expression tests using different production platforms, also involving non-replicative methods (sainsbury and lomonossoff, ) or dna viruses (yamamoto et al., ; zhang and mason, ), and to incorporate them into the initial optimization test. as mentioned, this could be done in a centralized manner, later distributing expression clones to several repurposed production facilities.
in contrast to antigens, recombinant single-chain antibodies showed in general higher and more uniform expression levels, as could be expected from their more similar structure. we chose to adapt full human iggs to a scfv-igg format to facilitate cloning and expression procedures, since it has been described earlier in plants that single chain formats reproduce the binding activities of the original full-size antibodies from which they derive. this format also facilitates comparisons with vhh antibodies, also produced as igg fusions. the plant-made sars-cov- products described here have several potential applications in the diagnosis area. both rbd and n proteins can be used as reagents for serological assays (amanat et al., ; liu et al., ), although further yield optimization would be required. for those assays where antigen glycosylation is an important factor, glycoengineered plants (strasser et al., ) can provide a competitive alternative to mammalian cell cultures (o'flaherty et al., ). regarding antibodies, they can serve as internal references for the quantification of serological responses. with small modifications, the same antibodies can be adapted for sandwich elisa and employed in the detection and quantification of viral particles, a better proxy for infectivity than rna. we also show here that apoplastic fluid is an inexpensive antibody preparation suitable for certain applications that require low-cost preparations, e.g. the concentration of the virus from environmental samples. as shown here, the protein complexity in the apoplast is greatly reduced; the apoplast could therefore be regarded as a plant equivalent of hybridoma supernatant or ascites fluid, although at much lower cost. unfortunately, apoplastic preparations are prone to partial antibody degradation, probably due to endogenous proteases; however, this can be minimized using extraction buffers with appropriate protease inhibitors, as was shown for nanobody .
the current pandemic crisis has evidenced the power of new antibody selection procedures, either based on single-cell selection from human peripheral blood mononuclear cells, in the case of full-size antibodies, or based on ultra-high throughput selection of synthetic libraries (sybodies) in the case of camelid-derived nanobodies (zimmermann et al., ; walter et al., ). large collections of anti-sars-cov- , potentially neutralizing antibody sequences were made available to the scientific community in a matter of weeks rather than months. it does not go unnoticed that the combination of rapid antibody selection procedures with fast, modular and scalable plant expression also has implications in the therapeutic arena as an ideal system for passive immunization. intravenous polyclonal immunoglobulins (ivig) from recovered patients have been shown to be a very effective covid- treatment in several studies (montelongo-jauregui et al., , and references therein); however, the limited availability of patient sera hampers their application in practice. interestingly, we showed in a recent work that large recombinant polyclonal antibody cocktails (pluribodies), mimicking a mammalian immune response, can be produced in n. benthamiana with high batch-to-batch reproducibility (julve parreño et al., ). passive immunization with recombinant antibody cocktails resembles a natural response more than a monoclonal therapy, requires shorter development times and is probably more robust against the development of resistance. in conclusion, based on the results of the exercise described here, we propose the repurposing of indoor farms into plant-based biomanufacturing facilities as a viable option to respond to local and global shortages of bioproducts such as diagnostics and therapeutic reagents in times of crisis. the authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
all authors designed and performed the experiments, and analyzed the data. d.o. wrote this manuscript. all authors revised and edited the written manuscript.
this work was supported by h eu projects newcotiana and pharmafactory. s.s. is recipient of a fpi fellowship bio - -r from the spanish ministry of science and competitiveness and r.m. is recipient of a gva fellowship (acif/ / ).
zimmermann, i., egloff, p., hutter, c. a., arnold, f. m., stohler, p., bocquet, n., et al. (
the upper timeline represents the actual timespan of the experiments. note that the time points represent approximately the days required to produce and initially characterize the designated products; notwithstanding, some of the results shown in previous figures correspond to extended analysis obtained at a later stage, during the preparation of this manuscript. the lower timeline describes the estimated minimal timespan that would result by introducing some of the optimizations described in the text.
a serological assay to detect sars-cov- seroconversion in humans
limit of blank, limit of detection and limit of quantitation
plant molecular farming - integration and exploitation of side streams to
potential applications of plant biotechnology against sars-cov-
transient gene expression seeds plant-based bioproduction systems: leaf expression systems' hypertrans technology promises low costs and high yields
recombinant jacalin-like plant lectins are produced at high levels in nicotiana benthamiana and retain agglutination activity and sugar specificity
rapid high-yield expression of full-size igg antibodies in plants coinfected with noncompeting viral vectors
engineering viral expression vectors for plants: the 'full virus' and the 'deconstructed virus' strategies
building a global alliance of biofoundries
a coat-independent superinfection exclusion rapidly imposed in nicotiana benthamiana cells by tobacco mosaic virus is not prevented by depletion of the movement protein
a synthetic biology approach for consistent production of plant-made recombinant polyclonal antibodies against snake venom toxins
evaluation of nucleocapsid and spike protein-based enzyme-linked immunosorbent assays for detecting antibodies against sars-cov-
systemic agrobacterium tumefaciens-mediated transfection of viral replicons for efficient transient expression in plants
from farm to finger prick-a perspective on how plants can help in the fight against covid-
convalescent serum therapy for covid- : a th century remedy for a st century disease
development of systems for the production of plant-derived biopharmaceuticals. plants
mammalian cell culture for production of recombinant proteins: a review of the critical steps in their biomanufacturing
delayed treatment of ebola virus infection with plant-derived monoclonal antibodies provides protection in rhesus macaques
extremely high-level and rapid transient protein production in plants without the use of viral replication
peaq: versatile expression vectors for easy and quick transient expression of heterologous proteins in plants
goldenbraid: an iterative cloning system for standardized assembly of reusable genetic modules
goldenbraid . : a comprehensive dna assembly framework for plant synthetic biology
generation of glyco-engineered nicotiana benthamiana for the production of monoclonal antibodies with a homogeneous human-like n-glycan structure: xylt and fuct down-regulation in n. benthamiana
potent binding of novel coronavirus spike protein by a sars coronavirus-specific human monoclonal antibody
molecular and biological characterization of human monoclonal antibodies binding to the spike and nucleocapsid proteins of severe acute respiratory syndrome coronavirus
gb . : a platform for plant bio-design that connects functional dna elements with associated biological data
sybodies targeting the sars-cov- receptor-binding domain
automated production of plant-based vaccines and pharmaceuticals
structural basis for potent neutralization of betacoronaviruses by single-domain camelid antibodies
improvement of the transient expression system for production of recombinant proteins in plants
bean yellow dwarf virus replicons for high-level transgene expression in transgenic plants and cell cultures
the authors want to thank the staff of the ibmcp and the polytechnic university of valencia who helped us to safely access our laboratory and greenhouses during the covid- lockdown in spain, and especially eugenio grau for his readiness to help us with sanger sequencing during that period. we are grateful to prof. steinkellner and strasser for providing glycoengineered plant lines and to prof. gleba for sharing magnifection plasmids.
key: cord- -abanr authors: brigger, d.; horn, m.p.; pennington, l.f.; powell, a.e.; siegrist, d.; weber, b.; engler, o.; piezzi, v.; damonti, l.; iseli, p.; hauser, c.; froehlich, t.k.; villiger, p.m.; bachmann, m.f.; leib, s.l.; bittel, p.; fiedler, m.; largiadèr, c.; marschall, j.; stalder, h.; kim, p.s.; jardetzky, t.s.; eggel, a.; nagler, m. title: accuracy of serological testing for sars‐cov‐ antibodies: first results of a large mixed‐method evaluation study date: - - journal: allergy doi: . /all. sha: doc_id: cord_uid: abanr background: serological immunoassays that can identify protective immunity against sars‐cov‐ are needed to adapt quarantine measures, assess vaccination responses, and evaluate donor plasma. to date, however, the utility of such immunoassays remains unclear.
in a mixed‐design evaluation study, we compared the diagnostic accuracy of serological immunoassays that are based on various sars‐cov‐ proteins and assessed the neutralizing activity of antibodies in patient sera. methods: consecutive patients admitted with confirmed sars‐cov‐ infection were prospectively followed alongside medical staff and biobank samples from winter / . an in‐house enzyme‐linked immunosorbent assay utilizing recombinant receptor‐binding domain (rbd) of the sars‐cov‐ spike protein was developed and compared to three commercially available enzyme‐linked immunosorbent assays (elisas) targeting the nucleoprotein (n), the s domain of the spike protein (s ) and a lateral flow immunoassay (lfi) based on full‐length spike protein. neutralization assays with live sars‐cov‐ were performed. results: one‐thousand four‐hundred and seventy‐seven individuals were included comprising sars‐cov‐ positives (defined as a positive real‐time pcr result; prevalence . %). igg seroconversion occurred between day and day . while the elisas showed sensitivities of . % for rbd, . % for s , and . % for n protein, the specificity was above % for all tests. out of sars‐cov‐ positive individuals, . % showed full neutralization of live sars‐cov‐ at serum dilutions ≥ : , while none of the sars‐cov‐ negative sera revealed neutralizing activity. conclusions: elisas targeting rbd and s protein of sars‐cov‐ are promising immunoassays which shall be further evaluated in studies verifying diagnostic accuracy and protective immunity against sars‐cov‐ . governments worldwide are facing a unique challenge: to save thousands of lives threatened by coronavirus disease (covid- ) , while minimising economic and social damage caused by lockdown and other strict measures. serological immunoassays will play a central role in addressing these challenges for the following reasons . 
first, serological tests might improve the rate of diagnosis as real-time rt-pcr is associated with a high number of false-negative results due to pre-analytical and other issues. second, antibody assays may support intensive surveillance measures such as universal testing, active case-finding, contact tracing, and linking clusters and thereby may facilitate an exit strategy from lockdown. in patients with severe disease, extensive activation of cytokine-secreting cells from the innate and adaptive immune system has been reported to result in a cytokine storm contributing to acute respiratory distress syndrome and multiorgan failure. antibody responses against different sars-cov- antigens have been described in serological samples of infected patients. few patients with anti-viral antibodies have been identified in the first days following symptom onset but the positive rate rapidly increases thereafter. to date, antibody testing has focused primarily on two highly abundant structural antigens of sars-cov- , specifically the nucleoprotein (n) and the spike (s) protein. while the n phosphoprotein ensures the linkage of the viral rna to the membrane, the s glycoprotein binds to ace and thereby initiates viral entry into the host cell. neutralizing antibodies (nab) are typically generated against the s protein and often target the receptor binding domain (rbd). as demonstrated in a vaccination approach using inactivated virus, the rbd represents an immunodominant viral antigen since at least half of the detectable anti-s igg antibodies were directed against the rbd. in contrast, the amount of anti-n antibodies was -fold lower. lateral flow immunoassays (lfi) as well as enzyme linked immunosorbent assays (elisa) have been developed but not yet adequately evaluated.
while lfis are remarkably fast and only require minutes to perform, significant concern regarding their sensitivity and specificity has been raised. elisas are considered more robust but require highly specialized laboratories with the capacity to run automated high-throughput measurements. at the time of compiling this paper, the diagnostic performance of different immunoassays as well as their predictive value for protective immunity remains unclear. before a broad implementation of immunoassays can be justified, the following points need to be carefully assessed in adequately powered and designed diagnostic studies: (1) diagnostic accuracy (or sensitivity/specificity respectively) in the acute and subacute phase of the disease, (2) antibody kinetics over time in patients with confirmed covid- , (3) extent of cross-reactivity with other pathogens and in patients with autoimmune disorders, (4) reliability between different assay settings and material characteristics, as well as (5) correlate of protective immunity. with the present study, we aimed to comprehensively establish the utility and diagnostic accuracy of serological immunoassays for sars-cov- infection and to explore protective immunity as predicted by such immunoassays in a mixed-method observational study of hospital inpatients as well as medical personnel.
international guidelines on study design were strictly followed and cross-sectional, prospective observational, as well as case-control designs were used. participants were recruited via three different routes: (i) inpatients with a sars-cov- test result (real-time pcr; rt-pcr), (ii) medical personnel of the inselspital, and (iii) residual material from patients stored at the liquid biobank bern (www.biobankbern.ch).
inclusion criteria of inpatients were (i) hospitalisation in inselspital, (ii) tested positive for sars-cov- using rt-pcr (nasopharyngeal swab), (iii) aged or older and (iv) signed general consent (exemption was granted for a few patients). for this manuscript, only inpatients who had tested positive for sars-cov- with more than days of residual material available were considered. the temporal pattern of antibody response and seroconversion rate was assessed in a subgroup of inpatients; the first consecutive patients were selected. inclusion criteria of medical personnel were (i) medical staff at inselspital since february , (ii) aged or older, and (iii) signed informed consent. the personnel were recruited via mailing lists. a limited number of fully anonymized, residual biobank samples were also used for the purpose of this study with the inclusion criterion of having been collected from inpatients between december and february . a total of randomly selected sera from individuals who tested positive in any of the three elisa immunoassays as well as negative controls were assessed in a live sars-cov- neutralization assay (all collected in april ). the university hospital bern (inselspital) is one of the largest tertiary hospitals in switzerland covering a catchment area of more than million inhabitants. with several associated smaller hospitals, it provides the full spectrum of general as well as highly specialised medical services. more than , employees work at the insel gruppe ag. the study was supported by the local covid- task force. the study protocol was approved by the appropriate ethics committee and the authorities of the university hospital and conducted in accordance with the declaration of helsinki. the manuscript was prepared according to the standards for reporting diagnostic accuracy studies (stard) guideline.
blood was taken following an established in-house protocol to ensure adequate preanalytical conditions and samples were collected using plastic syringes (serum or lithium heparin respectively, s-monovette®, sarstedt, nümbrecht, germany). only residual material was used in the case of inpatients. two tubes (serum and lithium heparin respectively) were drawn in the case of medical personnel. samples were immediately transported to the central laboratory, processed using a glp laboratory track and centrifuged within minutes with an established protocol. with regard to inpatients, pseudonymized demographical, clinical as well as laboratory data were extracted and transferred by the insel data science center (idsc) from electronic patient documentation. limited data were collected for the purpose of this substudy: age, gender, time interval since rt-pcr (nasopharyngeal swab). a positive sars-cov- rt-pcr result was used as additional inclusion criterion. with regard to medical personnel, a redcap database survey was constructed collecting demographical data, covid- symptoms (presence, extent and date), comorbidities and risk factors, professional exposure, and date of rt-pcr. the s protein and rbd are regarded as ideal candidates for the development of diagnostic tests and vaccines targeting sars-cov- . the pcaggs plasmid containing the human codon-optimized sequence of the sars-cov- s protein receptor binding domain (rbd, amino acids r -f ) with native s signal sequence (amino acids m -s ) and a c-terminal hexahistidine tag was kindly provided by prof. florian krammer. plasmid dna was prepared using the gene elute hp plasmid maxiprep kit (sigma-aldrich). prior to transfection, expi f cells (thermo-fisher) were grown to a density of . x cells/ml in culture medium (a mixture of % expi and % freestyle- media from thermo-fisher). for each liter of transfection, . mg of plasmid dna was diluted in ml of culture medium, mixed with .
ml fectopro transfection reagent (polyplus), and incubated for minutes at room temperature prior to addition to cells. immediately following transfection, cells were supplemented with x d-glucose ( g/l) and x valproic acid ( mm) boost solutions. three days post transfection, the cell culture supernatants were harvested by centrifugation at , x g for min. supernatants were passed through a . µm filter and : diluted with pbs containing mm imidazole. for purification of his-tagged rbd protein, ml ni-nta resin (hispur ninta thermofisher) was washed three times with washing buffer (pbs with mm imidazole) and incubated on a stir plate at °c for hour. subsequently, the mixture was poured into a glass column with a frit and washed times with column volumes of washing buffer. the protein was then eluted three times with ml pbs containing mm imidazole. elutions were pooled and dialyzed overnight against pbs using . kda cutoff snakeskin dialysis tubing. the final protein concentration was determined by nanodrop measurement at a . the quality of recombinant rbd protein was analyzed by sds-page and analytical size-exclusion chromatography. all elisa assays were performed on a dsx automated elisa system device (dynex technologies). the in-house assay was prepared as follows: -well plates were coated overnight at °c with µl of µg/ml rbd protein in pbs. the following day, each well was blocked with µl of pbs/ . % casein at °c until use and at least overnight. subsequently, plates were washed twice with pbs and µl sera were added at a : dilution in pbs/ . % casein for hour at rt. after five washes with µl pbs/ . % tween, µl of hrp-labeled secondary polyclonal anti-human igm (sigma, a ) and anti-human igg (sigma, a ) antibodies were added in a : ' dilution for minutes at rt. again, the plates were washed times with pbs/ . % tween and µl of tmb substrate solution (sigma, t ) was added for minutes at rt.
the development was stopped by adding µl of . m h so and results were measured at od - nm. all samples with an od > . were assigned as positive. several commercial tests were conducted according to the manufacturers' instructions. an elisa produced by euroimmun ag, lübeck, germany targeting the s protein as the immobilized antigen for the detection of igg antibodies was employed. briefly, samples were diluted : in sample buffer and µl of diluted samples, pre-diluted positive and negative controls, as well as pre-diluted calibrator were added for hour at °c. after three wash steps with µl wash buffer, µl of hrp-labeled secondary anti-human igg antibodies were added for minutes at °c. the plates were washed again three times with wash buffer and µl of tmb solution was added for minutes at rt. the development was stopped by adding µl of . m h so and results were measured at od - nm. antibody values were expressed as a ratio (od sample / od calibrator). all samples with a ratio > . were assigned as positive.
comorbidities and risk factors, which will be used as covariables in subsequent phases of this study, will be extracted from electronic patient records and asked in the redcap
at university hospital bern we first established a carefully designed mixed-method diagnostic accuracy study (fig. ).
breathlessness, coughing, or loss of smell
recombinantly expressed rbd has been used to establish an in-house elisa for the detection of igm and igg anti-sars-cov- antibodies in human serum samples (supplementary fig. a,b). optimal serum dilutions were determined by titration of sera derived from six sars-cov- + and six sars-cov- - individuals.
the serum dilution of : allowed efficient discrimination between positive and negative outcome (supplementary fig. ). after automatization on a dynex dsx device, the intra-assay (within-run) and inter-assay (day-to-day) precisions of the in-house rbd elisa were assessed (supplementary fig. a- e,f). overall, the in-house rbd elisa assay showed high intra- and inter-assay reproducibility and demonstrated a high degree of agreement between plasma and serum samples. among a subgroup of sars-cov- + inpatients, seroconversion for igm and igg antibodies was observed between day and day after the rt-pcr result and between day and day after the start of symptoms (fig. ). interestingly, igm and igg antibody responses against rbd and s were substantially more pronounced as compared to n. assessment of the longitudinal dynamics of patient sera revealed a marked and consistent increase of igg antibodies for rbd and s (fig. a). igm antibodies were measured in the rbd and n elisa and were detectable for at least two weeks after seroconversion (fig. b). interestingly, the individual temporal igg and igm patterns showed a high degree of inter-individual variability with one group of patients
of these samples, all were negative for anti-rbd igm and igg, as well as anti-s igg. however, two biobank samples tested positive for anti-n igg (elisa; . %), and one tested positive for anti-n igm (elisa; %). all samples were negative for anti-s igg and igm ( %) as tested by lfi. the pooled study population consisted of individuals, of whom tested as rt-pcr positive (prevalence . %). sera from all individuals were tested in the three different elisa setups for igg and igm anti-sars-cov- antibodies (fig. a). a subgroup of samples (n= ) was additionally assessed on lfi (fig. b). both assay formats showed high specificity above % for igg and igm measurements (supplementary table ).
however, the sensitivity between assays and formats varied considerably. the highest sensitivities were reached for igg measurements with the s ( . %) and rbd ( . %) elisa, followed by igg measurements on n ( . %) elisa. sensitivities for igm measurements were all considerably lower for both elisa and lfi formats, which could be due to the more transient detectability of igm upon infection. to detect potential sources of variability, we additionally studied the antibody response in salient subgroups of rt-pcr positive individuals (fig. c).
in the tested inpatient population, we observed three "false-negative" (negative in s elisa despite positive rt-pcr) outcomes. among the three false-negative inpatients (p , p , and p ), two were measured at an early time-point (patient and ), and one patient (p ) might have experienced seroconversion at a very late time-point, as suggested by a significant increase of antibody titers at day (supplementary fig. and supplementary fig. ). in the assessed hospital staff, seven were classified as "false-negative". all of these reported mild disease and had symptoms clearly associated with covid- (fever, breathlessness, cough, and loss of taste or smell). twenty-two individuals in the hospital staff group tested "false-positive" (positive s elisa results despite negative rt-pcr). fourteen of them experienced one or more symptoms clearly associated with covid- . the remaining eight individuals were clearly positive in at least three assays. all other individuals were either classified as "true-positive" (positive in s elisa, and positive in rt-pcr), or as "true-negative" (negative in s elisa, and negative in rt-pcr). in terms of performance, the calculated area under the receiver operating characteristic (fig. b) .
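the classification of assay results against rt-pcr into true/false positives and negatives, and the accuracy measures derived from it, can be sketched in a few lines. this is a minimal illustration under stated assumptions, not the authors' analysis pipeline: the names `diagnostic_accuracy` and `roc_auc` and the toy inputs in the usage note are hypothetical, and the area under the roc curve is computed with the rank-based (mann-whitney) definition, which the text does not specify.

```python
from dataclasses import dataclass


@dataclass
class Accuracy:
    sensitivity: float  # true positives / all reference positives
    specificity: float  # true negatives / all reference negatives
    prevalence: float   # reference positives / all individuals


def diagnostic_accuracy(assay_positive, pcr_positive):
    """Compare paired assay calls with the RT-PCR reference standard."""
    if len(assay_positive) != len(pcr_positive):
        raise ValueError("assay and reference lists must have equal length")
    tp = fn = fp = tn = 0
    for assay, reference in zip(assay_positive, pcr_positive):
        if reference and assay:
            tp += 1  # "true-positive"
        elif reference and not assay:
            fn += 1  # "false-negative"
        elif not reference and assay:
            fp += 1  # "false-positive"
        else:
            tn += 1  # "true-negative"
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one reference positive and negative")
    total = tp + fn + fp + tn
    return Accuracy(tp / (tp + fn), tn / (tn + fp), (tp + fn) / total)


def roc_auc(values, pcr_positive):
    """Rank-based AUC: chance that a reference-positive sample has a higher
    assay value than a reference-negative one (ties count as one half)."""
    pos = [v for v, r in zip(values, pcr_positive) if r]
    neg = [v for v, r in zip(values, pcr_positive) if not r]
    if not pos or not neg:
        raise ValueError("need at least one reference positive and negative")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))
```

for example, `diagnostic_accuracy([True, True, False, True], [True, True, True, False])` counts two true positives, one false negative and one false positive. the positivity cut-off applied to od values or od ratios is chosen upstream of these functions and is not encoded here.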
a total of randomly selected sera from individuals who tested positive in any of the three elisa immunoassays as well as negative controls were assessed in a live sars-cov- neutralization assay using ace -expressing vero-e cells ( inpatient samples, and samples of medical personnel). full neutralization of viral infection was determined based on % inhibition of the cytopathic effect in a serial dilution of the sera (supplementary fig. ). the means of highest serum dilutions at which full neutralization was observed correlated remarkably well with the measured antibody responses in the elisa immunoassays (fig. a-c). importantly, . % of the sera from elisa positive individuals showed full inhibition at serum dilutions ≥ : . the two sera that did not show neutralization (p and p ) were drawn at an early time point where the patients did not yet show antiviral antibodies. both patients, however, fully neutralized the virus after seroconversion at a later time point (fig. d). further, all sera from elisa negative individuals showed no neutralizing activity. of note, one or two elisa assays were negative in samples with full neutralization. we report first results of a large, mixed-design evaluation study which was implemented to compare the diagnostic accuracy of serological immunoassays for sars-cov- antibodies. while the time to seroconversion varied substantially between infected individuals, the mounted igg responses were robust and stable over time in all assays relying on rbd, s as well as n. with regard to the elisa assays, the overall diagnostic accuracy was adequate with a high specificity. some "false-positive" results are likely due to a rather narrow diagnostic window and limited sensitivity of the rt-pcr as well as asymptomatic disease course.
"false-negative" results may be caused by a long seroconversion period observed in some patients and mild disease course in other individuals. the accuracy measures of lfi and n were inferior compared to elisa targeting s and rbd. strikingly, there is a high degree of correlation between antibody responses to these viral surface proteins and the neutralizing activity against live sars-
a few other studies have previously assessed the diagnostic accuracy of serological immunoassays. recently, long and colleagues studied the antibody response in patients with covid- using a magnetic chemiluminescent immunoassay. in accordance with their results, we observed high inter-individual variation in the time to seroconversion. in contrast to their study, we confirmed these findings with an appropriate diagnostic accuracy protocol using different serological immunoassays. in another case-control study, infantino et al. analyzed covid- inpatient samples and selected patients collected before using a magnetic chemiluminescent immunoassay. in agreement with their results, we found limited sensitivity but high specificity of the serological sars-cov- immunoassays. in a further study conducted at the geneva university hospital, samples of covid- patients were included as well as controls collected before , and analyzed with the same s elisa that we used in our study. similar to our results, they report a high specificity for igg, particularly with an adjusted cut-off value. in line with other studies, the accuracy and performance of lfis were rather weak.
the study presented here adds important value to previous reports as it (i) was designed as a comprehensive diagnostic accuracy study combining different research methods, (ii) directly compares major assay approaches, (iii) was fully approved by all appropriate authorities, (iv) was independently conducted at a university hospital, (v and indicate that such serological tests might even be used to predict protective immunity in near future. to draw further conclusions, however, sars-cov- positive patients have to be followed over an extended time period in future studies.
in line with previous studies, we observed that the antibody response is more pronounced in patients with severe disease than in patients without (figure , panel c; inpatients, hospitalized patients, older patients). however, the response was similar in patients with mechanical ventilation and hospitalized patients. this is most likely due to limitations in sensitivity, which does not contradict our general observations. in summary, we report the first results of a large, mixed-design evaluation study that has been conducted in an independent academic setting at the university hospital bern to assess the diagnostic accuracy of various immunoassays to determine antibody responses against sars-cov- . while antibody responses of individual covid- patients against rbd and s protein were similar, a weaker reactivity against n protein became apparent. the time to seroconversion varied substantially between covid- patients but the igg response was robust and stable in all three elisa setups. their overall diagnostic accuracy was adequate with a high specificity but limited sensitivity. the antibody responses measured in these elisas correlated remarkably well with sars-cov- neutralizing activity of the sera. on the other hand, accuracy measures of s protein based lfis were poor.
together, our results emphasize that appropriate serological immunoassays represent a valuable tool to identify a good portion of patients with previous sars-cov- infection, will help to facilitate exit strategies from lockdown and might even be used to predict immunity to sars-cov- in near future.
the dna plasmid encoding the sars-cov- receptor binding domain of the spike
all authors declare that there is no conflict of interests.
key: cord- -ypls zau authors: wan, jinkai; xing, shenghui; ding, longfei; wang, yongheng; gu, chenjian; wu, yanling; rong, bowen; li, cheng; wang, siqing; chen, kun; he, chenxi; zhu, dandan; yuan, songhua; qiu, chengli; zhao, chen; nie, lei; gao, zhangzhao; jiao, jingyu; zhang, xiaoyan; wang, xiangxi; ying, tianlei; wang, haibin; xie, youhua; lu, yanan; xu, jianqing; lan, fei title: human igg neutralizing monoclonal antibodies block sars-cov- infection date: - - journal: cell rep doi: . /j.celrep. . sha: doc_id: cord_uid: ypls zau summary covid- has become a worldwide threat to humans, and neutralizing antibodies have therapeutic potential. we have purified more than one thousand memory b cells specific to sars-cov- s or rbd (receptor binding domain), and obtain paired heavy and light chain fragments. among these, antibodies test positive for antigen binding, and the majority of the top binders with ec below nm are rbd binders. furthermore, we identify neutralizing antibodies, of which show an ic within nm, and the best one, - , with ic of . nm. through epitope mapping, we find main epitopes in rbd recognized by these antibodies, and epitope b antibody - could substantially enhance the neutralizing abilities of most of the other antibodies. we also find that - could cross-neutralize the sars-cov pseudovirus. altogether, our study provides potent human neutralizing antibodies for covid- as therapeutic candidates.
we screened sera samples from patients recently recovered from covid- , and found all individuals showed certain levels of serological responses, with # and # being the weakest, to sars-cov- spike rbd and s proteins (figure a). we also found that sera, except for , showed neutralization abilities against sars-cov- pseudoviral infection of hek t cells stably expressing human ace (figure b). such observations, i.e. that the sera from different individuals displayed a wide range of antibody responses, were consistent with a recent report (wu et al., a). of note, although the no. blood sample was obtained on the second day after hospitalization (table s ), the serum already showed weak s antigen response and pseudoviral neutralizing activity. the rbd domain in the s region of sars-cov- spike protein is the critical region mediating viral entry through host receptor ace . using recombinant rbd and s antigens, we then isolated rbd and s bound memory b cells for antibody identification using the pbmcs (peripheral blood mononuclear cells) from the individuals by fluorescence activated cell sorting (figure s a). sequences encoding immunoglobulin heavy (igh) and light (igl) chains were amplified from single b cell complementary dna samples after reverse transcription and then cloned through homologous recombination into mammalian expression vectors (robbiani et al., ). overall, naturally paired igh and igl clones were obtained, and the numbers of clones derived from each individual are listed in table s . in order to screen for sars-cov- spike antigen specific monoclonal antibodies, we used two primary assays based on elisa (enzyme linked immunosorbent assay) and fca (flow cytometry assay), respectively.
among the candidate antibodies expressed in hek e cells, were positive for rbd or s binding (figure a). all the positive clones were then sequenced. notably, almost all ( . %) of the sequences obtained were unique (figure b), similar to what was previously reported in ebola (zhang et al., ) and yellow fever (calvert et al., ) studies. based on the ranking of elisa values and fca positivity, we focused on antibodies for further characterization. we first measured the precise ec values by elisa, and identified strong binders for s-ecd (extracellular domain) and rbd with ec below nm, the most potent one showing an ec of . nm ( . ng/ml) (figure c and figure s a). of note, among these antibodies, antibodies ( - , - , - and - ) were fca negative but bound recombinant rbd relatively well in elisa (figure c). on the other hand, we also identified another antibodies ( - , - - , - and - ) that showed strong fca positivity but barely detectable elisa signals (figure c). these findings indicated certain conformational differences might exist between the recombinant and
we also noticed that the majority of the strong binders were rbd binders, except for - , - and - (figure c). and we further confirmed that - and
to identify neutralizing antibodies, we first employed pseudoviral infection assays using hek t-ace cells. from all the antibodies tested, we found a total of pseudoviral neutralizing antibodies (figure c, and figure a, column ). among these, could neutralize authentic virus entry into vero-e cells, and of them showed potent ic within nm (figure a, column ). we next characterized the best one, - , which was able to effectively block authentic viral entry at an ic of . nm (figure b, left). we also tested expression of - in cho cells, and found it could reach ~ mg/l without any optimization, suggesting great potential for therapeutic development.
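the ec and ic values discussed above are read off dose-response titrations. as a rough illustration only, the sketch below estimates a half-maximal concentration by linear interpolation on a log10 concentration axis between the two data points that bracket 50% response. this is a deliberately simpler stand-in for the nonlinear curve fit reported in the methods, not the authors' procedure; the function name and the example titration are hypothetical and not taken from the study.

```python
import math


def interpolate_half_max(concs_nm, responses_pct, target=50.0):
    """Estimate the concentration (nM) that gives `target` percent response.

    concs_nm: antibody concentrations in nM, in ascending order.
    responses_pct: percent response (e.g. percent neutralization) per concentration.
    Interpolates linearly in log10(concentration) between the first pair of
    adjacent points that bracket the target. Returns None if never crossed.
    """
    points = list(zip(concs_nm, responses_pct))
    for (c_lo, r_lo), (c_hi, r_hi) in zip(points, points[1:]):
        brackets = min(r_lo, r_hi) <= target <= max(r_lo, r_hi)
        if brackets and r_lo != r_hi:
            frac = (target - r_lo) / (r_hi - r_lo)
            log_c = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_c
    return None


# Hypothetical titration (not data from the paper).
example_concs = [0.1, 1.0, 10.0, 100.0]
example_resp = [5.0, 25.0, 75.0, 95.0]
ic50 = interpolate_half_max(example_concs, example_resp)
```

interpolation only uses two points, so it is more sensitive to noise than a full sigmoidal fit; it is mainly useful as a sanity check on fitted values.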
furthermore, the rbd binding affinity of - was also tested by bli (bio-layer interferometry assay), and showed comparable k d of . nm ( figure b, right) . since cdr are the most critical region for antibody diversity, we then aligned the cdr sequences of the heavy (cdr h ) and light (cdr l ) chains of the authentic viral neutralizing antibodies and found unique ones ( figure a to understand the neutralizing mechanism of the antibodies, we performed epitope mapping experiments for the rbd binders (note, - , - and - were non-rbd binders, figure c and figure a ). in order to do so, we first utilized rbd-ace blocking elisa, and found that of them could effectively compete ace binding to rbd with ic below nm ( figure a , column ), indicating their binding epitopes overlapping with ace binding surface ( figure c , middle left). we then carried out bli competition and mutagenesis assays for further analyses. due to the identical cdr sequence and high similarities among - , - and - mentioned above, we only chose - as the representative in these assays. based on bli competition results, the rbd binders could be classified into groups, replaced in the non-ace binding surface ( figure c , left) to locate these epitopes. all epitope a antibodies were largely unaffected by these mutations ( figure s c, left) , indicating that epitope a is limited within ace binding surface ( figure c , middle left) considering that they competed ace binding in blocking elisa mentioned above. epitope b antibody, - , was sensitive to f a, a t and c a mutations ( figure s c , middle) and also competed ace in blocking elisa ( figure a , column ), therefore we speculated that epitope b should include these residues and partially overlap with ace binding surface ( figure c , middle right). finally, residues critical for epitope c antibody binding were shown in figure c , right panel. 
since they did not compete with ace in the blocking elisa (figure a, column ), we proposed epitope c at the indicated area of rbd (figure c, right). notably, despite that the epitope c and non-rbd binders could not block ace binding for rbd ( figure a
the spike proteins of sars-cov- share % and % amino acid identity with sars-cov and mers-cov, respectively. therefore, we wondered whether our antibodies could cross-react with the s proteins of these two other coronaviruses. to test this, we overexpressed the s proteins of sars-cov- , sars-cov and mers-cov in hek t cells, and tested cross-reactivity by flow cytometry. from this exercise, we found antibodies, - , - and - , that cross-recognized sars-cov s, but not mers-cov s (figure a and b). - and - showed similar s protein affinities for sars-cov- and sars-cov, but - had much lower affinity towards sars-cov s than towards sars-cov- s (figure c).
elisa signals towards both rbd and s, but could robustly bind freshly expressed s protein in a membrane (figure c, column , ). these included two neutralizing antibodies, - (mentioned above) and - , indicating that the recombinant rbd or s protein may differ from the membrane bound s in terms of d conformation. we would have missed these antibodies if we had only used elisa for antibody triage. therefore, future antibody studies should consider multiple approaches for initial identification and quality control.
the fluorescently labeled s bait was prepared beforehand by incubating µg of his tag-s protein with anti his tag antibody-pe (phycoerythrin) for at least hr at °c in the dark; the rbd bait was prepared as before. pbmcs were stained using aad,
the cocktail neutralization assay was performed with antibodies at a : ratio (n:n), and the ic was calculated from the total antibody concentration. all experiments related to authentic virus were done in bsl- .
monoclonal antibodies were incubated with pfu sars-cov- sh at °c for hour before added into vero-e cell culture ( -well plate, x cells per well), and the cells were continued for hr before microscopic analyses for cpe (cytopathic effect). the descriptive statistics mean ± sem or mean ± sd were determined for continuous variables as noted. ec and ic values in this study were determined after log transformation of antibody concentration using a -parameters nonlinear fit analysis positive cells (%) assessing the pandemic potential of mers-cov. the neutralizing epitopes of the sars-cov s-protein cluster independent of repertoire, antigen structure or mab technology potent neutralizing antibodies from covid- patients define multiple targets of vulnerability. biorxiv the coronavirus pandemic in five powerful charts a humanized monoclonal antibody neutralizes yellow fever virus strain d- in vitro but does not protect a mouse model from disease role of the ebola membrane in the protection conferred by the three-mab cocktail mil potent neutralizing antibodies against sars-cov- identified by high-throughput single-cell sequencing of convalescent patients' b cells genomic characterization of the novel human-pathogenic coronavirus isolated from a patient with atypical pneumonia after visiting wuhan. 
emerging microbes & infections the zika outbreak of the st century convalescent plasma as a potential therapy for covid- epidemiological and clinical characteristics of cases of novel coronavirus pneumonia in wuhan, china: a descriptive study human monoclonal antibodies block the binding of sars-cov- spike protein to angiotensin converting enzyme receptor a potent neutralizing human antibody reveals the n-terminal domain of the spike protein of sars-cov- as a site of vulnerability the species severe acute respiratory syndrome-related coronavirus: classifying -ncov and naming it sars-cov- potent neutralizing monoclonal antibodies against ebola virus isolated from vaccinated donors a randomized controlled trial of zmapp for ebola virus infection refined protocol for generating monoclonal antibodies from single human and murine b cells sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor clinical features of patients infected with novel coronavirus in wuhan, china. 
the lancet characterization of spike glycoprotein of sars-cov- on virus entry and its immune cross-reactivity with sars-cov clinical progression and viral load in a community outbreak of coronavirus-associated sars pneumonia: a prospective study cross-neutralization of sars-cov- by a human monoclonal sars-cov antibody safe pseudovirus-based assay for neutralization antibodies against influenza a(h n ) virus recurrent potent human neutralizing antibodies to zika virus in brazil and mexico rapid generation of fully human monoclonal antibodies specific to a vaccinating antigen chimeric camel/human heavy-chain antibodies protect against mers-cov infection characterization of the receptor-binding domain (rbd) of novel coronavirus: implication for development of rbd protein as a viral attachment inhibitor and vaccine potent binding of novel coronavirus spike protein by a sars coronavirus-specific human monoclonal antibody function, and antigenicity of the sars-cov- spike glycoprotein a human monoclonal antibody blocking sars-cov- infection cryo-em structure of the -ncov spike in the prefusion conformation neutralizing antibody responses to sars-cov- in a covid- recovered patient cohort and their implications. 
medrxiv a new coronavirus associated with human respiratory disease in china identification of human single-domain antibodies against sars-cov- a noncompeting pair of human neutralizing antibodies block covid- virus binding to its receptor ace cryo-em structures of mers-cov and sars-cov spike glycoproteins reveal the dynamic receptor binding domains potent neutralizing monoclonal antibodies against ebola virus infection a pneumonia outbreak associated with a new coronavirus of probable bat origin key: cord- -welf eb authors: zhou, daming; duyvesteyn, helen me; chen, cheng-pin; huang, chung-guei; chen, ting-hua; shih, shin-ru; lin, yi-chun; cheng, chien-yu; cheng, shu-hsing; huang, yhu-chering; lin, tzou-yien; ma, che; huo, jiandong; carrique, loic; malinauskas, tomas; ruza, reinis r; shah, pranav nm; tan, tiong kit; rijal, pramila; donat, robert f.; godwin, kerry; buttigieg, karen; tree, julia; radecke, julika; paterson, neil g; supasa, piyasa; mongkolsapaya, juthathip; screaton, gavin r; carroll, miles w.; jaramillo, javier g.; knight, michael; james, william; owens, raymond j; naismith, james h.; townsend, alain; fry, elizabeth e; zhao, yuguang; ren, jingshan; stuart, david i; huang, kuan-ying a. title: structural basis for the neutralization of sars-cov- by an antibody from a convalescent patient date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: welf eb the covid- pandemic has had unprecedented health and economic impact, but currently there are no approved therapies. we have isolated an antibody, ey a, from a late-stage covid- patient and show it neutralises sars-cov- and cross-reacts with sars-cov- . ey a fab binds tightly (kd of nm) the receptor binding domain (rbd) of the viral spike glycoprotein and a . Å crystal structure of an rbd/ey a fab complex identifies the highly conserved epitope, away from the ace receptor binding site. residues of this epitope are key to stabilising the pre-fusion spike. 
cryo-em analyses of the pre-fusion spike incubated with ey a fab reveal a complex of the intact trimer with three fabs bound and two further multimeric forms comprising destabilized spike attached to fab. ey a binds what is probably a major neutralising epitope, making it a candidate therapeutic for covid- . conversion to the post-fusion form where the s subunit engages the host membrane whilst dispensing with s , . neutralising human monoclonal antibodies that recognise the ace receptor binding site for sars-cov- and sars-cov- are generally not cross-reactive between the two viruses and are susceptible to escape mutation - (indeed a natural mutation y n has already been identified at this site (gisaid : accession id: epi_isl_ wienecke-baldacchino et al.)). in contrast cr (derived from a sars-cov- patient) cross-reacts strongly with sars-cov- (methods, fig. ) and has been shown to recognise a cryptic, conserved epitope on the rbd distinct from the binding epitope of ace , [ ] [ ] [ ] . that this is not uncommon for sars-cov- antibodies is suggested by similar observations for d . to isolate sars-cov- spike-reactive monoclonal antibodies, we cloned antibody genes from blood-derived plasmablasts of a covid- patient in the convalescent phase. one of these, ey a was shown by elisa to bind s of sars-cov- and cross react with sars-cov- (fig. ) . binding of ey a to sars-cov- -infected cells was detected by immunofluorescence ( fig. ). surface plasmon resonance (spr) measurements for ey a fab showed high affinity binding to immobilised sars-cov- rbd (kd = nm, although the value for immobilised ey a igg was somewhat higher) as derived from the kinetic data (methods, extended data fig. , extended data table ). spr studies showed that there was some interdependence of ey a and cr binding, which varied depending on which component was immobilised on to the sensor chip; ace blocking assays confirmed a somewhat asymmetric blocking effect (extended data fig. ). 
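the kd values quoted above are described as derived from kinetic data. for a simple 1:1 binding model the equilibrium dissociation constant is the ratio of the dissociation and association rate constants, kd = k_off / k_on. the sketch below shows that arithmetic; the 1:1 model is an assumption made here for illustration, and the function names and rate constants are hypothetical rather than values from the study.

```python
def dissociation_constant_nm(k_on_per_m_s, k_off_per_s):
    """Return KD in nM for a 1:1 binding model.

    k_on_per_m_s: association rate constant in 1/(M*s).
    k_off_per_s: dissociation rate constant in 1/s.
    """
    if k_on_per_m_s <= 0:
        raise ValueError("association rate constant must be positive")
    kd_molar = k_off_per_s / k_on_per_m_s
    return kd_molar * 1e9  # convert M to nM


def fold_tighter(kd_a_nm, kd_b_nm):
    """How many times tighter binder A is than binder B (lower KD is tighter)."""
    return kd_b_nm / kd_a_nm


# Hypothetical rate constants (not measurements from the paper).
example_kd_nm = dissociation_constant_nm(k_on_per_m_s=1e5, k_off_per_s=1e-4)
```

a lower kd means tighter binding, so a comparison such as "an order of magnitude tighter" corresponds to a fold value of about 10 from `fold_tighter`.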
with rbd stably expressed on mdck-siat cells (mdck-rbd), ey a did not block binding of ace to the rbd, whereas with ace stably expressed on mdck-siat cells (mdck-ace ) ey a blocked the interaction of rbd with ace . in this assay, ey a exhibits around times stronger ace blocking than cr (ey a, ic = nm; cr , ic = nm) and has equivalent ace inhibition compared to ace -fc (ic = nm) and vhh -fc (ic = nm) . these observations are suggestive of an indirect effect by ey a once bound to the rbd, consistent with an allosteric or weak direct interaction. this was supported by an spr competition assay with immobilised cr , which binds distant from the ace binding site (extended data fig. ) . this showed complete competition with ey a for rbd binding suggesting they recognise the same or overlapping epitopes, and indicated that ey a binds the sars-cov- rbd more tightly. two independent neutralisation tests, both using live wild type sars-cov- showed strong neutralisation. a neutralisation test for ey a based on quantitative pcr detection of virus in the supernatant bathing infected vero e cells after days of culture, showed a ~ -fold reduction in virus signal (methods, extended data fig. ) indicating that it is highly neutralising. this was further corroborated by a plaque reduction neutralisation test (prnt) at phe porton down (methods and extended data table ) using sars-cov- virus and ey a which gave an nd of ~ . µg/ml ( nm) (calculated according to grist ) . a separate prnt implementation at oxford gave a slightly higher nd of ~ µg/ml, consistent with a shorter incubation time of antibody with virus at lower temperature (extended data fig. ). to elucidate the epitope of ey a, we determined the crystal structures of the deglycosylated sars-cov- rbd in complex with ey a fab alone and in a ternary complex incorporating a nanobody (nb) which has been shown to compete with ace (for data on a closely related nb see huo , submitted), as a crystallisation chaperone. 
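the neutralisation titres above are quoted both as a mass concentration (µg/ml) and as a molar concentration (nm). the two are related through the molecular mass of the antibody. the sketch below shows the conversion, assuming a molecular mass of about 150 kda, which is typical for a full-length igg; the actual mass of ey a is not stated in the text, so the default value and the function names are assumptions for illustration.

```python
IGG_MASS_KDA = 150.0  # assumed typical IgG mass; not taken from the paper


def ug_per_ml_to_nm(conc_ug_per_ml, mass_kda=IGG_MASS_KDA):
    """Convert a mass concentration in ug/ml to a molar concentration in nM.

    1 ug/ml equals 1e-3 g/l; dividing by the mass in g/mol gives mol/l.
    With the mass in kDa this simplifies to (ug/ml) / kDa * 1000 nM.
    """
    return conc_ug_per_ml / mass_kda * 1000.0


def nm_to_ug_per_ml(conc_nm, mass_kda=IGG_MASS_KDA):
    """Convert a molar concentration in nM back to ug/ml."""
    return conc_nm * mass_kda / 1000.0


# Hypothetical example value (not a measurement from the paper).
example_nm = ug_per_ml_to_nm(1.5)
```

for a fab fragment, which is roughly a third of the mass of an igg, the same mass concentration corresponds to a proportionally higher molar concentration, so the mass argument should be changed accordingly.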
the crystals of the binary complex diffracted to . Å resolution (methods, extended data table ) and those of the ternary complex to . Å. the interaction between ey a and the rbd was identical in both complexes (extended data fig. ). the higher resolution ternary complex, which showed that there was no interaction between ey a and the nb, permitted a full interpretation of the detailed interactions (figs. and ) and has been refined to give an rwork/r-free of . / . and good stereochemistry (methods, extended data table ). residues - of the rbd, - and - of the heavy chain and - of the light chain of ey a and - of the nb are well defined fig. a ,b. the nb recognises an epitope adjacent to and slightly overlapping the ace receptor binding site and binds the rbd orthogonally to ey a (fig. b,c) . ey a binds essentially the same epitope as cr , but with a different pose corresponding to a rotation of ° about an axis perpendicular to the rbd α helix (central to both epitopes) (fig. d,e) . the fab complex interface buries and Å of surface area for the cdrs of the heavy and light chains respectively. the ey a interaction is mediated by the cdr loops h , h , h , l and l contacting predominantly α but also α and the β -α , α -α and α -β loops of the rbd (fig. and extended data fig. ). a total of residues from the heavy chain and from the light chain participate in the interface together with residues from the rbd. for the heavy chain these form potentially hydrogen bonds and a single salt bridge between d (of h ) and k of the rbd and the light chain interface residues contribute an additional hydrogen bonds. hydrophobic interactions further increase the binding affinity (fig. ) . of the residues involved in the interaction are conserved between the cr and ey a epitopes ( fig. and extended data fig. ). conformational changes are introduced into the rbd by binding to ey a at the α (residues - ) and α (residues - ) helices (extended data fig. 
), similar to those seen for the cr complex . comparison of the epitope residues for ey a, cr and vhh- shows that there is a very substantial overlap (extended data fig. ), although the bulk of the molecules extend in different directions, such that vhh- directly blocks ace- binding . in the first pre-fusion spike structures (pdb ids: vsb , vxx, vyb ), where residues and in the linker between two helices in s were mutated to a pro-pro sequence to prevent the conversion to the post-fusion helical conformation, the rbds were found in either one 'up' two 'down' or all three 'down' configuration, and in both cases the epitope is inaccessible. in the 'down' position it is packed against another rbd of the trimer and the nterminal domain (ntd) of the neighbouring protomer. a recent publication for the wild type spike identifies a more closed form where the s portion of the spike is tightened up. the structure is not yet deposited however, and so we have looked at the role of the epitope in the down rather than fully closed form, which will be broadly similar. here the ey a epitope packs down tightly against the s 'knuckle' bearing the pro-pro mutations, forming a buried protein-protein interface and making the epitope completely inaccessible. we assume that in the closed form this interaction will be even tighter and is probably responsible for maintaining the spike in the pre-fusion state. even when the rbd is in the 'up' configuration, the epitope remains largely inaccessible and substantial further movement of the rbd would be required to permit interaction unless more than one rbd was in the up conformation . to investigate how the fab insinuates itself into the spike, we performed cryo-em analysis. spike ectodomain was mixed with a -fold molar excess of ey a fab and incubated at room temperature ( °c) with an aliquot taken at hours, applied to cryo-em grids and frozen (methods). 
unbiased d class averages revealed three major particle classes with over one-third comprising a trimeric spike/ey a complex (some of which are self-associated) (methods, extended data table and °. in addition, the orientations of the vh domains relative to their associated rbds differ slightly from those of the crystal structure (by °, ° and °, respectively). the quality of the density suggests that these are likely samples selected from a continuous distribution (extended data fig. ). the majority of the remaining particles form either a roughly -fold symmetric structure or a triangular association (methods, extended data table and figs. - ). reconstructions of these particles were anisotropic due to a preferential orientation of the particles on the grid, which was somewhat mitigated by collecting data with ° tilt to yield reconstructions at . Å and . Å, respectively, in the plane of the grid but significantly worse resolution perpendicular to the grid (extended data fig. ). the reconstructions were sufficiently clear to allow the unambiguous fitting of ey a-rbd complexes (extended data fig. ); however, the density for what we assume are the n-terminal domains is poor in both reconstructions and we did not attempt to fit a model. these structures likely represent a residual well-structured fragment from the unfolding of the pre-fusion state of the spike (sds page analysis shows that the spike polypeptide remains largely uncleaved, extended data fig. ). the 'dimeric' and 'trimeric' structures are formed by different lateral associations and these also differ from those seen for similarly structurally degraded spike-cr complexes (extended data fig. ).
convalescent serum has shown promise in patients severely ill with covid- ; thus immunotherapeutics have potential for treating covid- even at a relatively late stage in the disease.
to this end, it is desirable to find a combination of antibodies that neutralise the virus by different mechanisms to mitigate immune evasion and antibody-dependent enhancement. one neutralisation mechanism is blocking receptor attachment. we propose that attachment at the ey a epitope is a further major neutralisation mechanism. in support of this, the epitope recognised by ey a has been reported for several antibodies and nanobodies raised against sars-cov- , sars-cov- and mers. for sars-cov- , cr has also been shown to neutralise synergistically with ace blocking antibodies. despite the spatial separation of the ey a and ace epitopes, we find some cross-talk between the two binding events. the ey a epitope is extremely unusual, since it is completely inaccessible in the pre-fusion spike trimer. this raises the question of what the mechanism of neutralisation might be. in the pre-fusion state the ey a/cr epitope rests down upon the upper end of the helix-turn-helix between heptad repeat (hr ) and the central helix (ch) of s , essentially putting a lid on the spring-loaded extension of the helix that occurs on conversion to the post-fusion state, in the vicinity of the mutations designed to prevent conversion between the pre- and post-fusion conformations (fig. ). the residues of the epitope are crucial to these protein-protein interactions, and are therefore highly conserved, explaining why it has, to date, proved impossible to generate mutations that escape binding of the antibody. ey a binding to the isolated rbd is tight (at ~ nm it is roughly an order of magnitude tighter than cr ) and, remarkably, the binding pose on top of the spike allows three fabs to bind simultaneously around the central axis (whereas cr fab cannot be accommodated). in line with this, a major portion of spike molecules incubated for h with ey a are still in the intact pre-fusion state, with only about / being converted.
simple modelling suggests that a similar packing could occur for intact antibodies (extended data fig. ). in general, we would expect binders at this epitope to neutralise by displacing the 'lid' on the hr /ch turn, reducing the stability of the pre-fusion state and therefore reducing the barrier to conversion to the more stable post-fusion trimer. this conversion is hindered in the construct we have used by the presence of the proline mutations at the turn between the helices. premature conversion would prevent later attachment to the cell and block infectivity. the kinetics of this process will determine the effectiveness of the antibody in neutralisation and ultimately protection. since the rbd is a relatively small domain there might also be an interplay between separate epitopes, thus we saw allosteric effects between ey a and ace binding and similarly vhh- , which binds an overlapping epitope to ey a, strongly inhibits ace- binding by virtue of its different angle of attack . the reason for the cross-talk between this study was designed to isolate sars-cov- antigen-specific human mabs from peripheral plasmablasts in humans with natural sars-cov- infection, to characterize the antigenic specificity and phenotypic activity of sars-cov- spike-reactive mab, and to determine the structure of antibody in complex with viral antigen. fresh peripheral blood mononuclear cells (pbmcs) were separated from whole blood by density gradient centrifugation and cryopreserved pbmcs were thawed. pbmcs were stained with a mix of fluorescent-labelled antibodies to cellular surface markers, including anti-cd (bd biosciences, usa), anti-cd (bd biosciences, usa), anti-cd (bd biosciences, usa), anti-cd (bd biosciences, usa), anti-cd (bd biosciences, usa), anti-igg (bd biosciences, usa) and anti-igm (bd biosciences, usa). plasmablasts were selected by gating on cd -cd -cd +cd hicd hiigg+igm-events and were isolated in chamber as single cell as previously described . 
sorted single cells were used to produce human igg monoclonal antibodies as previously described . expression vectors that carry variable domains of heavy and light chains were transfected into the t cell line for expression of recombinant full-length human igg monoclonal antibodies in serum-free transfection medium. to determine the individual gene segments employed by vdj and vj rearrangements and the number of nucleotide mutations and amino acid replacements, the variable domain sequences were aligned with germline gene segments using the international immunogenetics (imgt) alignment tool (http://www.imgt.org/imgt_vquest/input). ey a igg used for neutralisation and making fab: antibody was expressed using nanobody: this was derived from a naïve library followed by affinity maturation as described deglycosylation of rbd: µl of endoglycosidase f (~ mg/ml) was added to rbd (~ mg/ml, ml) and incubated at room temperature for two hours. rbd was then loaded to a superdex hiload / gel filtration column (ge healthcare) for further purification using buffer mm hepes ph . , mm nacl. purified rbd was concentrated using a kda ultrafiltration tube (amicon) to mg/ml. the neutralization activity of monoclonal antibody-containing supernatant was measured using a sars-cov- (strain cdc- ) infection of vero e cells . briefly, vero e cells were preseeded in a well plate at a concentration of x cells per well. on the following day, monoclonal antibody-containing supernatant were mixed with an equal volume of tcid virus preparation and incubated at °c for hour. the mixture was added into seeded vero e cells and incubated at °c for days. the cell control, virus control, and virus back-titration were setup for each experiment. at day , the culture supernatant was harvested from each well and the viral rna was extracted by the automatic labturbo system (taigen, taiwan) following the manufacturer's instructions. 
for the most part, except that the specimen was pretreated with proteinase k prior to rna extraction. real-time reverse transcription polymerase chain reaction was performed in a -µl reaction containing µl of rna
sars-cov- (australia/vic / ) plaque reduction neutralization tests were performed using passage of sars-cov- victoria/ / . virus suspension at appropriate concentrations in dulbecco's modification of eagle's medium containing % fbs (d ; µl) was mixed with antibody ( µl) diluted in d at a final concentration of µg/ml, µg/ml, . µg/ml or . µg/ml, in triplicate, in wells of a well tissue culture plate, and incubated at room temperature for minutes. thereafter, . ml of a single cell suspension of vero e cells in d at x /ml was added, and incubated for h at °c before being overlain with . ml of d supplemented with carboxymethyl cellulose ( . %). cultures were incubated for a further days at °c before plaques were revealed by staining the cell monolayers with amido black in acetic acid/methanol.
purified and deglycosylated rbd and ey a fab were combined in an approximate molar ratio of : at a concentration of . mg/ml. nb was also combined with ey a- his fab and rbd in a : : molar ratio with a final concentration of . mg/ml. these two complexes were separately incubated at room temperature for one hour. initial screening of crystals was performed in crystalquick -well x plates (greiner bio-one) with a cartesian robot using the nanolitre sitting-drop vapour diffusion method as previously described. crystals were soaked in a solution containing % glycerol and % reservoir solution for a few seconds and then mounted in loops and frozen in liquid nitrogen prior to data collection. diffraction data were collected at k at beamline i of diamond light source, uk. diffraction images of . ° rotation were recorded on an eiger xe m detector with an exposure time of . s per frame, beam size × µm and % beam transmission.
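for the plaque reduction neutralization tests described earlier in this section, the basic readout is the reduction in plaque count at each antibody concentration relative to a virus-only control. the sketch below computes that percentage from triplicate counts. it is only an illustration of the readout: it does not reproduce the titre calculation the authors cite, and the function name and the counts are hypothetical.

```python
from statistics import mean


def plaque_reduction_pct(test_counts, control_counts):
    """Percent plaque reduction relative to the virus-only control.

    test_counts: plaque counts from replicate wells with antibody.
    control_counts: plaque counts from replicate virus-only wells.
    """
    control_mean = mean(control_counts)
    if control_mean <= 0:
        raise ValueError("control wells must contain plaques")
    return 100.0 * (1.0 - mean(test_counts) / control_mean)


# Hypothetical triplicate counts (not data from the paper).
example_reduction = plaque_reduction_pct([10, 12, 8], [40, 40, 40])
```

repeating this over a dilution series gives the dose-response points from which a 50% neutralising dose can then be estimated.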
data were indexed, integrated and scaled with the automated data processing program xia -dials , . the data set for the binary complex of ° was collected from a single frozen crystal to . Å resolution with -fold redundancy. the crystal belongs to space group p with unit cell dimensions a = b = . Å and c = . Å. the structure was determined by molecular replacement with phaser using search models of antibody cr fab and the rbd of the rbd/cr fab complex (pdb id yla; ). there are three rbd/ey a complexes in the crystal asymmetric unit, resulting in a crystal solvent content of ~ %. for the ternary complex, a data set of ° rotation with data extending to . Å was collected on beamline i of diamond with exposure time . s per . ° frame, beam size × µm and % beam transmission). the crystal also belongs to space group r but with unit cell dimensions (a = b = . Å and c = . Å). there is one rbd/ey a/nb complex in the asymmetric unit and a solvent content of ~ %. one cycle of refmac was used to refine atomic coordinates after manual correction in coot to the protein sequence from the search model. for both the binary and ternary complexes the final refinement used phenix resulting in rwork = . and rfree = . for all data to . Å resolution for the binary complex and to rwork = . and rfree = . for all data to . Å resolution for the ternary complex. there is well ordered density for a single glycan at the glycosylation site n in the rbd. data collection and structure refinement statistics are given in extended data table . structural comparisons used shp , residues forming the rbd/fab interface were identified with pisa , figures were prepared with pymol (the pymol molecular graphics system, version . r pre, schrödinger, llc). spike protein, following sec purification, was buffer exchanged into mm tris ph . , mm nacl, . % nan buffer using a desalting column (zeba, thermo fisher). a final concentration of . 
mg/ml was incubated with ey a fab (in the same buffer) in a : molar ratio (fab to trimeric spike) at room temperature for hrs. control grids of spike alone after incubation at room temperature for hrs were also prepared. each grid was prepared using µl sample applied to a freshly glow-discharged on high for grids were screened on a titan krios microscope using serialem operating at kv (thermo fisher). movies were collected on a k detector on a titan krios operating at kv in super resolution mode, with a calibrated super resolution pixel size of . a/pix at both ° and ° tilt. to compensate for the poorer contrast with tilted data, it was necessary to use a higher dose rate for the latter dataset. alignment and motion correction was performed using relion . 's implementation of motion correction , with a -by- patch-based alignment. all frames were binned by two, resulting in a final calibrated pixel size of . Å/pixel. contrast-transfer-function (ctf) of full-dose and non-weighted micrographs was estimated within a cryosparc wrapper for gctf-v . . images were then manually inspected and those with poor ctf-fits were discarded. particles were then picked by unbiased blob picking in cryosparc v. . . and subjected to rounds of d classification. for the spike-ey a dataset (structure a), , , spike-like particles were used to make a template to pick particles from the untilted dataset, which were then filtered by d classification to , particles and then further refined by d classification with an ab initio model set. for the ° dataset, , particles were used as a template, and filtered by d classification to a set of , particles and then, as before, further refined by unbiased d classification. the two particle sets were then refined together, with a final set of , particles. 
for b and c (triangular ring and 'dimeric' form), particles from both the zero and ° datasets were combined in a similar manner to the spike-ey a dataset using the 'exposure group utilities' module in cryosparc. both particle sets (b, particles and c, , particles) were then reclassified and the best class refined with non-uniform refinement. for b, c symmetry was imposed at this final refinement stage, resulting in an appreciable improvement in resolution, as indicated by inspection and gold-standard fsc = . ( . versus . Å, see extended data table ). the em density of spike/ey a was fitted with the structure of a closed form of spike (pdb id vxx) apart from the rbds and ey a fab which were fitted with rbd/ey a of the ternary crystal structure using coot . due to the lower resolution, rbd and ey a are only fitted to the 'dimeric' and 'trimeric' em density. the spike/ey a structure was refined with phenix real space refinement, first as a rigid body and then by positional and bfactor refinements. only rigid body refinement was applied to the 'dimeric' and the 'trimeric' complexes. the statistics of em data collection and structure refinement are shown in extended data table these authors contributed equally: d.z., hmed, c.-p.c. ey a binds the s subunit of sars-cov- and cross react with s of sars-cov- . b, antibody cr similarly binds the s subunit of sars-cov- and cross react with sars-cov- s , but with lower affinity. c, convalescent serum from a covid- patient was also included as a control and in this case binding to mers and oc spike proteins also investigated. d, binding of ey a on the sars-cov- infected cells in immunofluorescence assay. anti-influenza h mab bs a was included as a control. 
performed cryo-em sample preparation, screening and processing and j.raedecke performed cryo-em data collection, and j.ren refined the cryo-em structures helped prepare materials, perform experiments and analysed data. all authors read and approved the manuscript.
we acknowledge the bd facsaria™ cell sorter service provided by the core instrument the authors declare no competing interests. correspondence to david i. stuart or kuan-ying a. huang. the coordinates and structure factors of the sars-cov- rbd/ey a crystallographic complexes are available from the pdb with accession codes xxx and vvv respectively. em maps and structure models are deposited in emdb and pdb with accession codes xxx and yyy for the pre-fusion spike, and xxxxx and yyyy for the dimeric complex respectively. the data that support the findings of this study are available from the corresponding authors on request. key: cord- -q bngari authors: yepes-pérez, andres f.; herrera-calderon, oscar; quintero-saumeth, jorge title: uncaria tomentosa (cat’s claw): a promising herbal medicine against sars-cov- /ace- junction and sars-cov- spike protein based on molecular modeling date: - - journal: journal of biomolecular structure & dynamics doi: . / . . sha: doc_id: cord_uid: q bngari covid- is a novel severe acute respiratory syndrome coronavirus. currently, there is no effective treatment and vaccines seem to be the solution in the future. virtual screening of potential drugs against the s protein of severe acute respiratory syndrome corona virus (sars-cov- ) has provided small molecular compounds with a high binding affinity. unfortunately, most of these drugs do not attach with the binding interface of the receptor-binding domain (rbd)–angiotensin-converting enzyme- (ace- ) complex in host cells. 
molecular modeling was carried out to evaluate the potential antiviral properties of the components of the medicinal herb uncaria tomentosa (cat’s claw) focusing on the binding interface of the rbd–ace- and the viral spike protein. the in silico approach starts with protein–ligand docking of cat’s claw key components followed by molecular dynamics simulations and re-docked calculations. finally, we carried out drug-likeness calculations for the most qualified cat’s claw components. the structural bioinformatics approaches led to the identification of several bioactive compounds of u. tomentosa with potential therapeutic effect by dual strong interaction with interface of the rbd–ace- and the ace- binding site on sars-cov- rbd viral spike. in addition, in silico drug-likeness indices for these components were calculated and showed good predicted therapeutic profiles of these phytochemicals found in u. tomentosa (cat’s claw). our findings suggest the potential effectiveness of cat’s claw as complementary and/or alternative medicine for covid- treatment. communicated by ramaswamy h. sarma the severe acute respiratory syndrome corona virus (sars-cov- ) is a part of coronavirus family (cov) and was initially identified in wuhan, china. covid- (coronavirus disease ) is highly contagious in humans, which has rapidly spread and caused an unprecedented pandemic, with a large number of deaths and economic crisis in the world (prajapat et al., ) . according to the latest report of the world health organization (who), over . million cases and deaths of covid- were confirmed as of september , (world health organization, . in developing countries of latin america and the carribean, the public health has been the most affected because people do not have the opportunities to access a modern health system and medicines . phytotherapy based on natural products might be a proper alternative for treating viral diseases (akram et al., ) . 
according to who estimates, about % of the population in developing countries uses traditional medicine in primary health care, mainly medicinal plants (world health organization, ) . the selection of natural products for the study of their biological properties has been addressed through three fundamental methodologies: the selection of random natural sources, the selection based on chemotaxonomy (screening of similar compounds in organisms belonging to the same family or genus) and selection based on ethnomedicine (heinrich, ) . ethnomedicine has been considered the most effective therapy and consists of the study of natural products that have a long history of use in some communities for the treatment of certain diseases and are part of the phytotherapeutic arsenal of popular knowledge (wu & tan, ) . on the other hand, uncaria tomentosa (willd. ex schult.) dc. named cat's claw ('uña de gato' in spanish) is a woody vine indigenous to the peruvian amazon and other tropical areas of south and central america that belongs to rubiaceae family (sandoval et al., ) . currently, the raw material of u. tomentosa is dispensed in public hospitals of the social health insurance (essalud-peru) as complementary medicine service (cms) (gonzales et al., ) . traditionally, extracts prepared by roots and barks decoction are used against several diseases, such as allergies, arthritis, inflammations, rheumatism infections and cancer (araujo et al., ) . bioactive constituents of u. tomentosa extracts include proanthocyanidins [proanthocyanidin b (the main component), proanthocyanidin b , proanthocyanidin c , an epicatechin trimer, epiafzelechin- b! 
-epicatechin and an epicatechin tetramer] (batiha et al., ; navarro-hoyos et al., ) , oxindole alkaloids (isopteropodine, pteropodine, rhynchophylline, mytraphylline, speciophylline, uncarine f and uncarine e), indole alkaloidal glucosides (cadambine, dihydrocadambine and -isodihydrocadambine) (batiha et al., ; kura s et al., ; laus et al., ; lima-junior et al., ; lock et al., ; navarro et al., snow et al., ) , quinovic acid glycosides (pavei et al., ) , tannins (ostrakhovich et al., ) , polyphenols, catechins, beta sitosterol (aquino et al., ; navarro et al., ) and proteins (lenzi et al., ) , which individually or synergistically contribute to their therapeutic properties. in regards to the antiviral properties of u. tomentosa, the alkaloid fraction has been demonstrated to be the most effective on human monocytes infected with dengue virus- (denv) in vitro (reis et al., ) . another study revealed that only the alkaloidal fraction has inhibitory activity on dengue virus, and the negative effect was observed with the nonalkaloidal fraction (lima-junior et al., ) . in another study, the antiherpetic activity of u. tomentosa seems to be associated with polyphenols or with their synergistic effect with pentacyclic oxindole alkaloids or quinovic acid glycosides (caon et al., ) . u. tomentosa hydroethanolic extracts have demonstrated a significant in vitro inhibitory effect on the replication of herpes simplex virus type , and the inhibition of viral attachment in the host cells was characterized as the main mechanism of its antiviral activity (terlizzi et al., ) . sars-cov- contains four structural proteins, namely the spike (s), membrane (m), envelope (e) and nucleocapsid (n) proteins. the s protein is responsible for the host attachment and fusion of the viral and host-cell membranes (wu et al., ) . otherwise, the angiotensin-converting enzyme receptor (ace- r) is the host cellular receptor with a higher affinity to sars-cov- (jamwal et al., ) . 
this process is triggered when the s subunit of the s protein binds to a host-cell receptor (han & král, ) . to engage a host-cell receptor, the receptor-binding domain (rbd) of s undergoes transient hinge-like conformational motions between receptor-accessible and receptor-inaccessible states. u. tomentosa's constituents could block the virus from binding to human cell receptors and disrupt the viral cycle, helping to prevent protein maturation of sars-cov- and to limit the spread of infection. several molecular targets have been identified as the main druggable sites of sars-cov- for new antiviral discovery. moreover, its x-ray structure has recently been released, which makes computational analysis possible. in fact, several computational studies have already been undertaken on this system, including a long µs molecular dynamics (md) study and virtual screening of several databases (huang et al., ) . with neither drugs nor vaccines yet approved against covid- , finding strategies to diminish the impact of the pandemic is fundamental. medicinal herbs, particularly those with demonstrated antiviral activities such as u. tomentosa, could slow the spread of the disease. in developing countries in particular, where access to these plants is easier and more economically viable, adding these medicinal herbs to the general medical kit may be beneficial. here, our study relies on an in silico strategy reminiscent of those applied at the early stage of current state-of-the-art drug discovery pipelines and includes ( ) protein-ligand docking of all bioactive compounds of u. 
tomentosa, focusing both on the binding interface of the rbd-ace- complex and on the rbd of the sars-cov- spike protein, ( ) simulations of the ligand pathway of the best predicted compounds from step to evaluate a convenient entrance mechanism of the compounds to the binding site, ( ) md simulation to assess the stability of the best protein-ligand complexes from , ( ) calculation of pharmacokinetic parameters for the most qualified compounds resulting from the previous parts of the docking protocol. this study supports the antiviral potential of u. tomentosa-based products as a rapid phytotherapeutic option for covid- . the calculated binding affinity of the main constituents of u. tomentosa (table ) was explored for its ability to disrupt the sars-cov- /ace- complex and to inhibit the sars-cov- spike protein, with a view to a facile therapeutic option for anti-coronavirus therapy. to this purpose, the structures of the sars-cov- /ace- complex and the sars-cov- spike protein were downloaded from the protein data bank (pdb entry code m and vyb, respectively) (yan et al., ) and all bound ligands, ions and solvent molecules were manually removed using the ds visualizer . program. for docking studies, the structures of the selected proteins were parameterized using autodock tools (trott & olson, ) . to facilitate the formation of hydrogen bonds, polar hydrogens were added. ligands used in this study are major components of the u. tomentosa extracts and a sulfated heparin octasaccharide (taken from pdb ue ), a potent in vitro sars-cov- inhibitor reported in the literature (kwon et al., ) . the d structures of the cat's claw constituents were obtained as mol files from the zinc database (chemaxon, ) . the resultant compounds were submitted to marvinsketch . (morris et al., ) to correct the protonation states of the ligands at physiological ph . . in addition, the geometry optimization of all ligands was carried out at the hf/ - g* level of theory.
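the structure clean-up described above was done by hand in a visualizer. as a rough scripted equivalent, the sketch below drops every hetatm record (bound ligands, ions and waters) from pdb-format text and keeps the protein atoms. the function name and the three-record example are hypothetical and are not taken from the study; note that this simple filter would also drop glycans and modified residues, which are stored as hetatm records.

```python
def strip_heteroatoms(pdb_text: str) -> str:
    """Keep only polymer coordinate records from PDB-format text.

    ATOM records and TER/END markers are kept; HETATM records
    (bound ligands, ions, waters, glycans) are dropped.
    """
    kept = []
    for line in pdb_text.splitlines():
        record = line[:6].strip()  # PDB record name occupies columns 1-6
        if record in {"ATOM", "TER", "END"}:
            kept.append(line)
    return "\n".join(kept) + "\n"


# Hypothetical three-record fragment, for illustration only.
example = (
    "ATOM      1  N   MET A   1      11.104  13.207   2.100  1.00 20.00           N\n"
    "HETATM    2  O   HOH A 101      15.000  10.000   5.000  1.00 30.00           O\n"
    "HETATM    3 ZN    ZN A 201       1.000   2.000   3.000  1.00 25.00          ZN\n"
)
cleaned = strip_heteroatoms(example)
```

only the single protein atom survives in `cleaned`; the water and the zinc ion are removed.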
then, the structures were parameterized using autodocktools to add full hydrogens to the ligands, to assign rotatable bonds and to save the resulting structure in the format required by autodock. all possible flexible torsions of the ligand molecules were defined using autotors in pdb autodocktools (morris et al., ; walls et al., ) to promote the calculated binding with the target structure. our docking protocol was performed using autodock vina and default procedures to dock a flexible ligand to a rigid protein. docking simulation of ligands was carried out on the interface between sars-cov- and ace- (pdb code: m ) (yan et al., ) , where residues of both proteins are in proximity. next, we used the cryo-em structure of the sars-cov- spike protein (pdb code: vyb) in its open state (lipinski et al., ) to explore the potential inhibition by components of cat's claw, selecting the ace- -binding pocket for this study. once a potential binding site was identified, the compounds that are the major components of the cat's claw extracts were docked to this site to determine the most probable and most energetically favorable binding conformations. to accomplish rigorous docking simulations involving a grid box around the identified site, autodock vina . . (trott & olson, ) was used. the exhaustiveness (number of internal independent runs) was for each protein-ligand pair. the active site was surrounded by a docking box of × × Å with a grid spacing of Å. affinity scores (in kcal mol⁻¹) given by autodock vina for all compounds were obtained and ranked by predicted binding free energy (a more negative value means greater binding affinity). the resulting structures and the docking poses were graphically inspected to check the interactions using the ds visualizer . (http:// dsbiovia.com/ products/) or the pymol molecular graphics system . programs.
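as an illustration of the docking setup described above, the following sketch writes an autodock vina configuration file. the keys (`receptor`, `ligand`, `center_*`, `size_*`, `exhaustiveness`, `out`) are standard vina configuration options; the file names, box centre, box size and exhaustiveness value are placeholder assumptions, because the values used in the study did not survive in this text.

```python
def vina_config(receptor, ligand, center, size, exhaustiveness, out):
    """Return the text of an AutoDock Vina configuration file.

    center and size are (x, y, z) tuples in angstroms.
    """
    cx, cy, cz = center
    sx, sy, sz = size
    lines = [
        f"receptor = {receptor}",
        f"ligand = {ligand}",
        f"center_x = {cx}",
        f"center_y = {cy}",
        f"center_z = {cz}",
        f"size_x = {sx}",
        f"size_y = {sy}",
        f"size_z = {sz}",
        f"exhaustiveness = {exhaustiveness}",
        f"out = {out}",
    ]
    return "\n".join(lines) + "\n"


# Placeholder inputs (assumptions, not the values used in the study).
config_text = vina_config(
    receptor="receptor.pdbqt",
    ligand="ligand.pdbqt",
    center=(0.0, 0.0, 0.0),
    size=(20.0, 20.0, 20.0),
    exhaustiveness=8,
    out="ligand_docked.pdbqt",
)
```

the text can be saved to a file and passed to the program as `vina --config conf.txt`; one such file per protein-ligand pair keeps each run reproducible.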
molecular interaction stability of protein-ligand complexes obtained by docking simulations were verified through md simulations by using the gromacs program (abraham et al., ) considering the sars-cov- /ace- interface, as well as the sars-cov- spike protein active site and the best docking pose for proanthocyanidin c , qag- , proanthocyanidin b and -dihydrocadambine, respectively. force field parameters for protein and ligands were derived independently. for the selected protein, the amber force field was selected and assigned using the pdb gmx tool of the gromacs program packages, meanwhile ligand force field parameters were prepared with the generalized amber force field (gaff) using the molecular geometries previously optimized with the hf/ - g à level of theory in gas phase, (foresman et al., ; glendening et al., ; roothaan, ) with the gamess-us program (schmidt et al., ) . in addition, each ligand was verified as a minimum through a harmonic vibrational normal mode analysis. atomic charges were obtained with the merz-kollman scheme (singh & kollman, ) by fitting a restricted electrostatic potential (resp) model by the gamess-us program (bayly et al., ) , and the output file was used into the resp sub-program of the ambertools program package (cornell et al., ) . assignment of gaff force field parameters was carried out by the antechamber program (wang et al., ) and the required input files for molecular dynamics simulations were prepared using the acpype python interface. protein and protein-ligand complexes were solvated in a rectangular box of tip p waters. the obtained system was neutralized adding seven-sodium counter ions to neutralize the net negative charge of the protein, and then physiological conditions ( k, ph . , . % nacl solution) were established (hammad et al., ) . 
to remove spurious contacts, molecular geometries were optimized with the steepest descent algorithm with , steps, with protein backbone atoms restrained with a force constant of kj mol⁻¹. then, the md simulations were allowed to run for ps in the npt ensemble. in addition, ns in the npt ensemble were calculated for the production stage. all simulations were carried out under periodic boundary conditions. a cubic box with the size of × × nm was used. a Å cutoff distance was used to calculate nonbonded interactions. electrostatic interactions were treated with the particle mesh ewald (pme) method (nishizawa & nishizawa, ) , while van der waals interactions were introduced by using the cut-off scheme. finally, the v-rescale thermostat at k with a coupling constant of . ps was used and the pressure was kept constant at atm using the parrinello-rahman barostat (parrinello & rahman, ) with a coupling constant of . ps and a compressibility factor of . × 10⁻ bar⁻¹. all covalent bonds were constrained using the lincs algorithm and the contact list was updated every fs.

table . best binding energy (kcal mol⁻¹) based on autodock scoring of the main constituents of the u. tomentosa into the rbd/ace- interface and sars-cov- spike protein binding domain (rbd) (pdb id: vyb).

compound                      rbd/ace- interface    sars-cov- rbd
spiroxindole alkaloids
  uncarine f                  - .                   - .
  speciophylline              - .                   - .
  mitraphylline               - .                   - .
  pteropodine                 - .                   - .
  isopteropodine              - .                   - .
  isomitraphylline            - .                   - .
  rhynchophylline             - .                   - .
  isorhynchophylline          - .                   - .
indole glycosides alkaloids
  -isodihydrocadambine        - .                   - .
  cadambine                   - .                   - .
  -dihydrocadambine           - .                   - .
polyhydroxylated triterpenes
  uncaric acid                - .                   - .
  floridic acid               - .                   - .
  pht-                        - .                   - .
quinovic acid glycosides
  qag-                        - .                   - .
  qag-                        - .                   - .
  qag-                        - .                   - .
  qag-                        - .                   - .

a hepos: heparin octasaccharide taken from pdb ue was used as positive control. b estimated by kwon et al. ( ) .
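the simulation settings described above (pme electrostatics, cut-off van der waals, v-rescale thermostat, parrinello-rahman barostat, lincs constraints, periodic boundaries) map onto standard gromacs `.mdp` options. the sketch below renders such a parameter file from a dictionary. the option names are real gromacs keywords, but every numeric value is a placeholder assumption, because the values reported by the authors are missing from this text; note also that gromacs expects the reference pressure in bar, whereas the text quotes atm.

```python
def render_mdp(options):
    """Render a mapping of GROMACS .mdp options as aligned 'key = value' lines."""
    width = max(len(key) for key in options)
    body = "\n".join(f"{key:<{width}} = {value}" for key, value in options.items())
    return body + "\n"


# All numeric values are placeholders (assumptions), not the study's settings.
production_options = {
    "integrator": "md",
    "dt": 0.002,                      # time step in ps (assumed)
    "nsteps": 500000,                 # run length in steps (assumed)
    "pbc": "xyz",                     # periodic boundary conditions
    "cutoff-scheme": "Verlet",
    "coulombtype": "PME",             # particle mesh Ewald electrostatics
    "rcoulomb": 1.0,                  # nm (assumed)
    "vdwtype": "Cut-off",
    "rvdw": 1.0,                      # nm (assumed)
    "tcoupl": "V-rescale",
    "tc-grps": "System",
    "tau_t": 0.1,                     # ps (assumed)
    "ref_t": 300,                     # K (assumed)
    "pcoupl": "Parrinello-Rahman",
    "pcoupltype": "isotropic",
    "tau_p": 2.0,                     # ps (assumed)
    "ref_p": 1.0,                     # bar (assumed)
    "compressibility": 4.5e-5,        # bar^-1 (assumed, value for water)
    "constraints": "all-bonds",       # constrain all covalent bonds
    "constraint_algorithm": "lincs",
}
mdp_text = render_mdp(production_options)
```

keeping the settings in one dictionary makes it easy to derive the equilibration and production files from a single source while changing only the run length.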
drug-likeness prediction together with further adme properties offers a wide range of opportunities for rapid antiviral drug discovery. the drug-like and adme properties of the most active components of the u. tomentosa extract (the constituents with the highest binding affinity) were screened using open-access cheminformatics platforms: molinspiration (for the molecular weight -mw, rotatable bonds and polar surface area -psa descriptors), alogps . (for the log p o/w descriptor) and pre-admet . , which was used to predict four pharmaceutically relevant properties, namely intestinal permeability (app. caco- ), albumin binding (k hsa ), madin-darby canine kidney (mdck line) cell permeation and intestinal absorption (%hia). these parameters describe the movement, permeability, absorption and action of potential drugs (ertl et al., ) . the interpretation of both mdck and caco- permeability using preadmet is as follows: ( ) permeability lower than : low permeability; ( ) permeability between and : medium permeability; ( ) permeability higher than : high permeability. this study was performed to identify whether certain components of u. tomentosa extracts have potential therapeutic effects against covid- . to this purpose, a database of compounds that have shown prevalence in the therapeutic activity of the herb was generated ( figure ) (aquino et al., ; batiha et al., ; keplinger et al., ; kitajima et al., ; lima-junior et al., ; lock et al., ; montoro et al., ; navarro et al., ; pavei et al., ; peñaloza et al., ; snow et al., ; vera-reyes et al., ) . our initial hypothesis is that cat's claw should contain molecules with high therapeutic potential against sars-cov- , acting by disrupting the sars-cov- /ace- association or by inhibiting the sars-cov- spike protein. during covid- host infection, sars-cov- enters human epithelial cells through a first molecular recognition of the ace- protein by the rbd. 
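the three-level interpretation of permeability given above can be written as a small helper. this is a minimal sketch: the cutoffs are passed in as arguments because the numerical thresholds did not survive in this text, and the values in the usage lines are arbitrary placeholders, not the preadmet thresholds.

```python
def classify_permeability(value, low_cutoff, high_cutoff):
    """Classify a predicted permeability as 'low', 'medium' or 'high'.

    Values below low_cutoff are 'low', values above high_cutoff are 'high',
    and everything in between (cutoffs included) is 'medium'.
    """
    if low_cutoff >= high_cutoff:
        raise ValueError("low_cutoff must be smaller than high_cutoff")
    if value < low_cutoff:
        return "low"
    if value <= high_cutoff:
        return "medium"
    return "high"


# Arbitrary placeholder cutoffs, for illustration only.
labels = [classify_permeability(v, low_cutoff=10, high_cutoff=100) for v in (5, 10, 100, 150)]
```

with these placeholder cutoffs the four example values fall into the low, medium, medium and high classes respectively.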
when coronaviruses bind directly to the peptidase domain (pd) of ace- , the enzyme loses its primary physiological role, which includes vasoconstriction and blood pressure regulation. in consequence, binding of the sars-cov- rbd to the human ace- receptor is strongly associated with cardiovascular diseases, such as hypertension, heart attack and chronic nephropathies. blocking the binding of sars-cov- to the human ace- receptor may be the most promising approach to prevent virus entry into human cells. recently, the cryo-em structure of the rbd of sars-cov- in complex with human ace- has been solved (yan et al., ) , which opens the possibility of designing better and more specific inhibitors for suppression of viral infection. thus, to study the effectiveness of cat's claw against the sars-cov- /ace- complex, docking was carried out with the ace- -rbd binding interface as the druggable site, to establish the interaction between the selected site and the main constituents of cat's claw. we also performed molecular docking studies to find a potential association of constituents of cat's claw with the sars-cov- spike protein. this approach could also help block the interaction of the sars-cov- spike protein with the human receptor ace- . hence, in this article, the structure of the spike glycoprotein (pdb id: vyb) is considered as an additional druggable target. in addition, we docked a sulfated heparin octasaccharide (hepos) as a positive reference, which has recently been reported as an effective in vitro inhibitor of sars-cov- through its interaction with the spike protein rbd (kwon et al., ) . overall, the docking approach revealed that most components found in cat's claw could block the sars-cov- /ace- association, because they displayed significant binding affinity at the interface of the ace- -rbd complex, in the range from - . to - . kcal mol⁻¹ (table ) .
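ranking docking scores, reporting their range and picking the best compound of each chemical series are simple operations that can be sketched as follows. the compound names, series labels and scores in the example are hypothetical placeholders and are not the values of the table; the only convention taken from the text is that a more negative score means a higher predicted affinity.

```python
def summarize_scores(scores):
    """Rank docking results and pick the best compound per chemical series.

    scores is a list of (compound, series, affinity_kcal_per_mol) tuples.
    Returns (ranked, best_per_series, score_range), where ranked is sorted
    from most to least negative affinity.
    """
    ranked = sorted(scores, key=lambda row: row[2])
    best_per_series = {}
    for compound, series, affinity in ranked:
        # The first hit for a series in the ranked list is its best score.
        best_per_series.setdefault(series, (compound, affinity))
    score_range = (ranked[0][2], ranked[-1][2])
    return ranked, best_per_series, score_range


# Hypothetical scores, for illustration only.
example_scores = [
    ("compound_a", "series_1", -7.0),
    ("compound_b", "series_1", -9.5),
    ("compound_c", "series_2", -8.2),
    ("compound_d", "series_2", -6.1),
]
ranked, best_per_series, score_range = summarize_scores(example_scores)
```

here `best_per_series` keeps one representative per series, which mirrors the idea of following up only the top-scoring member of each chemical class.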
on the other hand, when the structures were docked against the sars-cov- spike protein, good docking scores were obtained (especially for the proanthocyanidin series), ranging from - . to - . kcal mol⁻¹. thus, we performed a rigorous exploration of the docking solutions obtained for these compounds against the sars-cov- -related proteins. based on the analysis of these results and on visual inspection, a clear behavior emerges from the molecular docking, which is summarized in the following sections. the recognition of sars-cov- by human ace- can be divided into several contacts. from the ace- structure: the α helix domain comprises critical amino acids, such as gln , thr , asp , lys , his , glu , asp , tyr , gln , glu , phe and gln ; the α helix domain located around met and four residues (lys , gly , asp and arg ) between the β and β sheets are needed to recognize sars-cov- . from the sars-cov- structure, ten residues are essential for ace- binding: lys , gln , tyr , tyr , tyr , asn , gln , thr , gln and phe . therefore, these key binding domains were considered in this article to explore the ability of u. tomentosa to disrupt the interaction of the sars-cov- rbd with human ace- (yan et al., ) . in general, all compounds of u. tomentosa show docked structures that fit well into the rbd/ace- interface, particularly along the α , α , β and β domains of human ace- ( figure ), with good predicted docking scores that range from - . to - . kcal mol⁻¹ (table ) . calculations revealed that most ligands were located between the β and β sheets, and a smaller group very close to the α and α helices of ace- . since the α , α , β and β domains are responsible for the recognition of sars-cov- by human ace- during viral infection, our findings open the possibility of using u. tomentosa against the sars-cov- /ace- association. notably, the interaction fingerprints of most predicted complexes include the critical residues of the ace- -rbd binding interface.
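the comparison between a ligand's interaction fingerprint and the critical interface residues can be expressed as a set overlap. the sketch below is a simplified illustration of that idea, not the analysis performed in the study; the residue labels are hypothetical placeholders because the residue numbers are missing from this text.

```python
def key_residue_coverage(contacts, key_residues):
    """Return the key residues contacted by a ligand and the covered fraction.

    contacts and key_residues are collections of residue labels such as
    'ACE2:res_1'; the fraction is relative to the number of key residues.
    """
    key_set = set(key_residues)
    if not key_set:
        raise ValueError("key_residues must not be empty")
    shared = set(contacts) & key_set
    return sorted(shared), len(shared) / len(key_set)


# Hypothetical residue labels, for illustration only.
ligand_contacts = {"ACE2:res_1", "ACE2:res_2", "RBD:res_9"}
interface_key_residues = {"ACE2:res_1", "ACE2:res_3", "RBD:res_9", "RBD:res_10"}
shared, coverage = key_residue_coverage(ligand_contacts, interface_key_residues)
```

in this example the ligand touches two of the four key residues, so the coverage is 0.5; a higher fraction would suggest a pose that overlaps more of the recognition surface.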
protein-ligand interaction analysis of the constituents docked to the rbd/ace- complex revealed that most constituents bound strongly to ace- through the his , asp , glu , phe , gln , lys , thr and glu residues, while on the sars-cov- rbd they made critical contacts with the tyr , tyr , lys and tyr residues. because these compounds are constituents of cat's claw and interact with key amino acids inside the ace- -rbd binding interface, we are encouraged to believe that u. tomentosa could affect the interaction of the sars-cov- spike protein with ace- . note that this approach may be useful as a rapid therapeutic option to prevent or treat covid- . overall, all ligands were able to interact with the critical amino acids involved in the molecular recognition of the ace- protein by the rbd, and at least one compound from each chemical series showed great ability to bind to the sars-cov- -rbd complex along the interface. hence, we focused on five compounds (those with the most negative docking energy in each series): proanthocyanidin c (- . kcal mol⁻¹), qag- (- . kcal mol⁻¹), -isodihydrocadambine (- . kcal mol⁻¹), uncarine f (- . kcal mol⁻¹) and uncaric acid (- . kcal mol⁻¹), which had an affinity higher than or comparable to that of the positive reference hepos (- . kcal mol⁻¹) for the sars-cov- -rbd complex (table ) . at this point it is worth mentioning that the docking calculation for the positive reference gives a value in very good agreement with the experimental one (- . kcal mol⁻¹), which provides a certain amount of confidence in the autodock scoring function used in this project. a close inspection of the d ligand-interaction plots for the ace- -rbd interface generated with the ds visualizer program revealed which interactions are formed by the best-scoring ligands and how their structures affect them. thus, the best-scoring ligand, proanthocyanidin c (which had an affinity of - . 
kcal mol⁻¹), was able to interact with the amino acids that are critical for binding of the sars-cov- spike to the human ace- receptor. we found that proanthocyanidin c establishes three strong hydrogen-bond interactions between the hydroxyl groups in the flavone moiety and the tyr (sars-cov- ) and asp (ace- receptor) residues, one π-alkyl interaction with the lys residue and several hydrophobic interactions between the molecule and the glu , gln , lys , phe , thr and his residues. in addition to these critical amino acids, proanthocyanidin c showed notable binding interactions with the human ace- receptor very close to the interface through four hydrogen-bond contacts, with ala , thr , lys and ala , which may also affect the interaction of the rbd with ace- (figure (a) ). furthermore, a closer look at the best binding pose of qag- (which displays a high docking score of - . kcal mol⁻¹) reveals strong interactions at the contact interface between the two proteins. qag- forms two hydrophobic interactions with the critical amino acids lys and asp located between the β and β sheets in the ace- protein. we also observed that qag- makes one interaction at the interface with val (from the spike protein) through a strong hydrogen bond. in addition, various hydrophobic contacts were observed between the molecule and ser , phe , val , cys , ile , gly , asp , phe , ala , arg and gly at the sars-cov- rbd-ace- interface (figure (b) ). note that the deprotonated carboxylate moiety does not form any important interaction and therefore does not play a specific role in the binding of qag- at the rbd-ace- interface. this can be explained primarily by the fact that no positively charged residues (lysine, arginine or histidine) surround this moiety at the interface. on the other hand, -isodihydrocadambine (- . kcal mol⁻¹) has key contacts with residues in the rbd/ace- binding interface (figure (a) ).
thus, molecule showed several interactions with critical residues of human ace- , which are essential for sars-cov- spike binding, among them, one hydrogen bond contact and one p-alkyl interaction with glu residue. importantly, the protonated nitrogen atom in the b-carboline moiety forms one salt bridge interaction with the carboxyl in asp (from ace- protein) and three hydrophobic interactions with phe , thr and gln . furthermore, -isodihydrocadambine also had a crucial h-bond contact with tyr amino acid of sars-cov- spike protein, which is well-reported as an initial contact point during ace- recognition. -isodihydrocadambine also showed one p-cation interaction between fused aromatic ring in the b-carboline moiety and the key residue lys and eight van der waals contacts with spike protein residues, including glu , asp , gln , phe leu , asp , tyr and tyr . further interactions were also observed that might promote the disruption of the interactions between sars-cov- -rbd and ace- , among them two h-bonds formed per arginine residues (from ace- protein) and (from spike protein) interacting with the sugar moiety. interestingly, uncarine f binds at rbd/ace- interface with binding affinity of À . kcal mol À through several interactions with those base residues for the recognition of sars-cov- to the human ace- (figure (b) ). besides two strong hydrogen contacts with his of human ace- and tyr from the sars-cov- spike protein, uncarine f showed several hydrophobic interactions with key aminoacids at the interface, such as lys , asp , phe , glu , tyr and gln . finally, analysis of the d-interaction map for uncaric acid (which had docking score of À . kcal mol À ) revealed crucial contacts with essential aminoacids residues for ace- -rbd binding. thus, uncaric acid binds at asp residue of ace- through a hydrogen bonding with one of the hydroxyl groups. 
furthermore, hydrophobic interactions between the molecule and five of the key interface residues were observed: his , tyr , phe , gln and tyr . notably, uncaric acid formed five π-alkyl interactions with other interface residues that could promote disruption of ace- -rbd binding, namely leu (d), val (d), lys (d), pro (d) and lys (f). in accordance with the above, our docking studies show that several components of u. tomentosa may be able to disrupt the association of the sars-cov- spike protein with the human ace- receptor. among these components, proanthocyanidin c (- . kcal mol - ), qag- (- . kcal mol - ), -isodihydrocadambine (- . kcal mol - ), uncarine f (- . kcal mol - ) and uncaric acid (- . kcal mol - ) had good predicted binding affinity for the binding interface and could access it without noticeable energetic cost. these findings suggest that u. tomentosa may be a viable treatment option during the initial stage of covid- infection. recently, cryo-em structures of the sars-cov- spike glycoprotein were resolved in the closed and open states (pdb id: vxx and vyb, respectively) (walls et al., ). the sars-cov- spike protomer consists of five functional domains, namely ntd, rbd, fp, hr and cd (figure ). because the rbd is responsible for binding to the host cell receptor (ace- ), we focused our docking investigations inside this domain. several residues in the sars-cov- rbd have been identified as essential for association with human ace- during coronavirus (covid- ) infection, including tyr , tyr , gly , thr , tyr , lys , gln , asn and gln (lan et al., ). therefore, to examine the ability of constituents of u. tomentosa to block binding of the sars-cov- spike protein to the human ace- receptor, we performed molecular docking studies around the aforementioned critical amino acids; that is, these docking runs were carried out inside the ace- binding surface of the rbd.
from a general perspective, promising docking scores were obtained when the major constituents from u. tomentosa bind to the rbd of sars-cov- (ranging from - . to - . kcal mol À ). thus, to compare the best binding pose of the most docking-active components of u. tomentosa and hepos (positive reference) inside sars-cov- rbd, figure illustrates the most stable binding poses based on autodock scoring listed in table for all constituents on rbd binding domain. the superposition of the positive reference (hepos) and the best conformation obtained theoretically for selected docked compounds showed that the major constituents in the ethanolic extract of u. tomentosa can accommodated themselves into stable conformations occupying this binding site during docking process. notably, all main constituents of u. tomentosa had at least two interactions with those key aminoacids for sars-cov- rbd binding to human ace- , through h-bonds, p-contacts or hydrophobic contacts, making this herb a promising treatment that may be used in the early stages of the covid- infection. among them, four exhibited high potential to bind rbd: -dihydrocadambine (in brown), proanthocyanidin b (in purple blue), proanthocyanidin b (in light blue) and proanthocyanidin c (in hot pink) ( figure ) , which had the highest docking score of - . , - . , - . and - . kcal mol À , respectively, that would be comparable to that reported for the potent inhibitor hepos of - . kcal mol À . as such, in searching those critical contacts that could blocks rbd-ace- interaction, an exhaustive analysis has been undertaken to the docking results for those components as mentioned in figure . as shown in figure , our findings revealed that the most docking active molecules complexed with the sars-cov- rbd had an interaction fingerprint involving seven critical residues implied in the sars-cov- spike attachment to the human receptor ace- : tyr , gly , lys , asn , tyr , gln and asn . 
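to make the comparison of docking scores above more tangible, a predicted binding free energy can be converted into an approximate dissociation constant through the standard relation ΔG = RT ln K_d. the following is only a minimal sketch under stated assumptions: the ligand names and scores are hypothetical placeholders (not values from this study), the temperature of 298.15 K is an assumption, and docking scores are rough estimates of ΔG at best.

```python
import math

# Gas constant in kcal mol^-1 K^-1 (standard value).
R_KCAL = 1.987204e-3


def kd_from_delta_g(delta_g_kcal: float, temperature_k: float = 298.15) -> float:
    """Dissociation constant (mol/L) implied by a binding free energy.

    Uses delta_G = R * T * ln(Kd); more negative energies give smaller Kd.
    """
    return math.exp(delta_g_kcal / (R_KCAL * temperature_k))


def rank_by_score(scores: dict) -> list:
    """Sort ligands from most to least favourable (most negative first)."""
    return sorted(scores.items(), key=lambda item: item[1])


if __name__ == "__main__":
    # Hypothetical scores in kcal/mol, for illustration only.
    example_scores = {"ligand_a": -9.0, "ligand_b": -7.5, "reference": -8.0}
    for name, score in rank_by_score(example_scores):
        print(f"{name}: {score:.1f} kcal/mol, Kd ~ {kd_from_delta_g(score):.2e} M")
```

because the relation is exponential, a difference of about 1.4 kcal/mol at room temperature corresponds to roughly a tenfold change in the implied K_d, which is why small score differences between ligands should be interpreted cautiously.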
thus, -dihydrocadambine displays two strong h-bond through binding between sugar and acetyl moiety and key tyr and gly residues, respectively. notably, tyr also established one p-p stacking interaction with the fused aromatic ring in the b-carboline moiety. in addition, critical residues lys and arg from rbd was involved in the binding event by forming three p-cation contacts with -dihydrocadambine, while hydrophobic interactions formed with asn , tyr , gln , tyr residues clearly favored its affinity for sars-cov- spike protein. however, a visual inspection to the d-protein-ligand interaction plot of -dihydrocadambine shows that the protonated nitrogen formed two unfavorable positive-positive interactions (represented in red dotted lines) that could have great significance in the stability of protein-ligand complex. proanthocyanidin b was well-fitted into the functional domain rbd and its hydroxyl groups formed four hydrogen bonding and four hydrophobic contacts with critical residues for binding to ace- , including gly , asn , gln and tyr , tyr , lys , gln , respectively. one further hbond with ser and one hydroxyl group were predicted by docking when proanthocyanidin b binds to sars-cov- rbd. finally, proanthocyanidin c was able to bind sars-cov- rbd through three strong h-bonds with those critical residues (gly , lys , asn ) crucial to association with human ace- . also, further van der waals interactions were predicted to form between proanthocyanidin c and crucial gln , tyr , gln , tyr , gln residues that may be stabilizing the binding event to the rbd domain. with higher confidence on the viability of our docking predictions of the best compound with highest binding affinity, such as proanthocyanidin c (- . kcal mol À ) and qag- (- . kcal mol À ) into the rbd/ace- interface and -dihydrocadambine (- . kcal mol À ) and proanthocyanidin b (- . 
kcal mol À ) within sars-cov- rbd binding domain, we further evaluated the stability of the docked complexes throughout md simulations for . ns. to accomplish this aim, we first calculated the root mean square deviation (rmsd) for ligands for ns of md simulation at real natural conditions into the selected binding pocket. the md simulation results (figure (a-d) ) showed that the rmsd of the systems reached equilibrium after around % ns of simulation time. in general, after equilibration, small fluctuations in the rmsd were observed, suggesting substantial stability for all complexes during the simulations, which fall within the ideal range around Å (smaller rmsd values indicate higher stability of the simulation) (gohlke et al., ; kramer et al., ) . a rigorous exploration of the rmsd values for ligand-sars-cov- rbd complexes shows that structures of -dihydrocadambine and proanthocyanidin b display good signals of stability during ns of md simulation with rmsds values of . ± . and . ± . Å, respectively (figure (a,b) ). importantly, similar behavior had the docked complex composed of proanthocyanidin c and qag- into the rbd/ace- interface, which showed remarkable stability throughout the simulation time period at rmsd values of . ± . and . ± . , respectively (figure (c,d) ). a closer look at rmsd plot for qag- into the rbd/ace- interface revealed that ligand gradually stabilized after ns, which is an indication of its higher conformational flexibility within the interface between rbd and ace- proteins. taken together, these findings suggest binding stability of ligands toward the active domains of the sars-cov- rbd and rbd/ ace- viral targets. the radius of gyration (r g ) represents the compactness of the protein structure and conformational stability of the whole systems (i.e. protein-ligand complexes). if the radius of gyration remained relatively constant, the complex was considered to be stably folded; otherwise, it was considered to be unfolded. 
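the two stability measures described above can be sketched in a few lines. this is an illustrative example only, not the analysis pipeline used in the study: it assumes each frame is a list of (x, y, z) coordinates in Å that has already been superimposed on the reference structure (no least-squares fitting is performed here), and all function names are hypothetical.

```python
import math
import statistics


def rmsd(coords, ref):
    """Root mean square deviation between two pre-aligned coordinate sets."""
    if len(coords) != len(ref) or not coords:
        raise ValueError("coordinate sets must be non-empty and equally sized")
    squared = sum(
        (a - b) ** 2
        for atom, ref_atom in zip(coords, ref)
        for a, b in zip(atom, ref_atom)
    )
    return math.sqrt(squared / len(coords))


def radius_of_gyration(coords, masses=None):
    """Mass-weighted radius of gyration; unit masses if none are given."""
    if masses is None:
        masses = [1.0] * len(coords)
    total = sum(masses)
    # Centre of mass, one component per axis.
    com = [sum(m * p[i] for m, p in zip(masses, coords)) / total for i in range(3)]
    squared = sum(
        m * sum((p[i] - com[i]) ** 2 for i in range(3))
        for m, p in zip(masses, coords)
    )
    return math.sqrt(squared / total)


def summarize(values):
    """Mean and sample standard deviation, as in a 'mean ± sd' report."""
    return statistics.mean(values), statistics.stdev(values)
```

in practice, `rmsd` would be evaluated for every frame against the starting structure and `summarize` applied to the post-equilibration part of the series; a roughly constant `radius_of_gyration` over the same frames is what the text above interprets as a stably folded complex.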
in this scenario, radius of gyration values were calculated in order to observe the conformational alterations and dynamic stability of each viral protein within the ns simulation time. figure (a-d) illustrates the r g values for the protein and ligand in the complexes, respectively. as shown in figure (a-d), the calculated r g values for the ligands in the sars-cov- rbd protein, -dihydrocadambine ( . ± . Å) and proanthocyanidin b ( . ± . Å), and inside the rbd/ace- binding interface, proanthocyanidin c ( . ± . Å) and qag- ( . ± . Å), remained relatively constant during the simulation; therefore, each protein-ligand complex could be considered to be stably folded. in addition, although the rmsd values indicate that the ligands undergo a significant shift within the active domain, the estimated r g values suggest that the overall shape of the protein was stable upon binding of the ligand during the -ns md simulation. finally, these observations are also supported by the d-binding interaction plots and d representations of the selected ligands in the binding pocket, respectively (see supporting information figures s -s ). in addition, these plots, which show the conformational changes of the ligands in the active sites of the viral proteins along the first ns window of the md simulation (figure ), revealed that after the md simulations the key protein-ligand interactions initially shown by the docking results were maintained, and that -dihydrocadambine and proanthocyanidin b within the sars-cov- rbd, as well as proanthocyanidin c at the rbd/ace- interface, remained stable in the binding pocket compared to the initial docking pose.

figure . backbone rmsd values of (a) -dihydrocadambine within the sars-cov- rbd active site (red) and protein without ligand (blue); (b) proanthocyanidin b within the sars-cov- rbd binding site (red) and protein without ligand (blue); (c) proanthocyanidin c at the rbd/ace- interface and rbd/ace- complex without ligand (blue); (d) qag- at the rbd/ace- interface (red) and rbd/ace- complex without ligand (blue).
thus, crucial ligand binding interactions with lys , arg , tyr , gln , asn , tyr , ser , tyr , tyr , phe and gly were maintained after the -ns md simulation in the binding pocket of the sars-cov- rbd. similarly, in comparison with the docking results, the key ligand interactions with amino acid residues identified as essential for maintaining rbd-ace- stability are also present after the md simulations. in addition, d representations of the selected ligands in each binding pocket were used to compare the best conformational poses from md simulation and docking; hence, we plotted the superposition of the docked complex d-structures before and after the -ns md simulation in the active cavity (see supporting information figures s -s ). in general, there is no significant difference between the structures extracted after the -ns md simulation and the docking poses of the ligands; only slight translational and rotational motions were observed. the md simulation results suggest that ( ) the conformations of the binding pocket and ligands were stable during the md simulations, ( ) the ligands did not leave the binding pocket during the md simulation and ( ) the active pocket in both selected viral targets favored ligand binding. these results support the rationality and validity of our docking studies and also suggest that many of these constituents of u. tomentosa could act as dual inhibitors of the sars-cov- spike protein and the rbd/ace- complex, which are mostly responsible for the attachment and internalization of the novel coronavirus in the human host. in the final stage, selected ligands were re-docked into the sars-cov- rbd ( -dihydrocadambine and proanthocyanidin b ) or inside the rbd/ace- interface (proanthocyanidin c and qag- ) starting from the mean geometries of the last ns of the md simulation trajectories, aiming to obtain the correct binding energies and poses. the vina re-docking results are summarized in table and the best ligand-bound receptor conformations are shown in the supporting information figures s -s .

figure . radius of gyration (r g ) graphs: (a) for -dihydrocadambine in the binding cavity (red) and sars-cov- spike protein without ligand (blue); (b) for proanthocyanidin b in the active site (red) and sars-cov- spike protein without ligand (blue); (c) for proanthocyanidin c within the binding cleft and rbd/ace- complex without ligand (blue and violet); (d) for qag- in the binding site (red) and rbd/ace- complex without ligand (blue and violet).

notably, after the re-docking process -dihydrocadambine displays a higher binding energy (- . kcal mol - ) than the initial predicted docking score (- . kcal mol - ). d and d diagrams of protein-ligand interactions from d coordinates showed that -dihydrocadambine underwent a significant conformational change in its binding mode in comparison with the binding poses initially predicted by docking. as can be seen from table , the interaction profile in the sars-cov- rbd binding site was substantially altered, showing contacts with critical residues including asn (h-bond), gly (three h-bonds), tyr (h-bond) and tyr (π-cation with the cationic nitrogen). on the other hand, proanthocyanidin b showed a re-docking score significantly lower than the initial docking score. however, proanthocyanidin b is well accommodated inside the rbd binding pocket, with key contacts to tyr , ser , lys , gln , arg , tyr , tyr , glu and phe that are essential for sars-cov binding to human ace- receptors. in addition, the predicted binding poses were compared in supporting information figure s ; the best binding pose obtained from the re-docking studies is considerably different from that retrieved from the initial docking, a finding that could be strongly associated with the lower binding energy after re-docking. interestingly, when proanthocyanidin c was re-docked using the mean geometry of the last ns md simulation trajectory, a higher binding score (- .
kcal mol À ) than the initial docking was obtained (- . kcal mol À ). as shown in supporting information figure s , proanthocyanidin c fitwell into the rbd/ace- interface interacting with those critical residues at the junction of sars-cov- to human ace- , such us glu , his , asp , gln , tyr , phe , lys , glu , lys . a d-comparison between the best initial docking pose and the best re-docking pose into the active site clearly revealed that this binding pose significantly favors proanthocyanidin c binding. finally, re-docking calculations into rbd/ace- interface for qag- showed a very similar binding affinity (- . kcal mol À ) to the initial docking prediction (- . kcal mol À ). a close view to d and d ligand-protein diagrams plot revealed that qag- fit-well inside the binding pocket, and is located closed to b and b sheets in the ace- protein. re-docked process confirms that qag- is capable of binding to aminoacids residues that are critical for the recognition of the sars-cov- by full-length human ace- . after the re-dock analysis, we identified that qag- formed two hydrogen bond interactions with key gly and tyr residues, respectively. similarly, ligand was able to bind through van der waals contacts with three critical residues asp , gln and glu , which could have an effect on stabilizing the binding event. as can be seen in supporting information figure s , d-plots comparison between starting pose docking and docking pose after re-docking for qag- revealed similar binding modes at rbd/ace- interface, only a slight shift was observed. taken together, these computational results clearly evidenced that constituents of u. tomentosa are capable of binding to sars-cov- spike protein through strong interactions with those key aminoacids crucial for the viral attachment to human ace- in stable complexes. 
our investigations could be a new strategy for inhibiting the recognition of the sars-cov- rbd by ace- , and therefore might interfere with the entry of coronavirus to its host cells. computational modeling demonstrated that components of this herb may cause ace- and spike protein cleavage, because they would interact with key residues within rbd/ace- interface forming very stable complexes. hence, we firmly believe that ethanolic extract of cat's claw may be a novel herbal-based therapeutic option to treat covid- infection because its components may cause disruption of sars-cov- rbd/ace- complex or could block attachment of sars-cov- to the entry receptor ace- . calculated drug-likeness profiles play a critical role in assessing the quality of novel antiviral candidates. early predictions of pharmacokinetic behavior of the promising antiviral compounds based on its structure could help find safer and effective leads for preclinical testing. here, we calculated and analyzed various drug-likeness indices for the most qualified components of u. tomentosa predicted by docking studies (table ) . to this purpose, ten pharmacokinetics parameters were calculated as drug-likeness filter for speciophylline, uncarine f, uncaric acid, cadambine, -isodihydrocadambine, -dihydrocadambine, proanthocyanidin b , proanthocyanidin c , proanthocyanidin b , epiafzelechin- b- , qag- , qag- , qag- , qag- and qag- , respectively. results obtained revealed the druggability of the selected components from ethanolic extract of cat's claw, demonstrating their potential as likely orally active antiviral. despite major components had more than two violations to the rule of and druggability, predicted values for speciophylline, uncarine f, uncaric acid and cadambine displayed favorable physiochemical profiles compared to positive reference hepos, which clearly displayed important violations in rof- . 
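the rule-of-five filter referred to above can be expressed as a short check. the sketch below uses the standard lipinski thresholds (molecular weight ≤ 500 da, log p ≤ 5, at most 5 hydrogen bond donors, at most 10 hydrogen bond acceptors) and tolerates at most one violation; the descriptor values in the example are hypothetical and are not those of any compound in this study, and the class and function names are illustrative.

```python
from dataclasses import dataclass


@dataclass
class Descriptors:
    """Pre-computed molecular descriptors for one compound."""
    mol_weight: float   # Da
    log_p: float        # octanol/water partition coefficient
    h_donors: int       # number of OH and NH groups
    h_acceptors: int    # number of N and O atoms


def ro5_violations(d: Descriptors) -> list:
    """Return the names of the Lipinski criteria that the compound violates."""
    violations = []
    if d.mol_weight > 500:
        violations.append("mol_weight")
    if d.log_p > 5:
        violations.append("log_p")
    if d.h_donors > 5:
        violations.append("h_donors")
    if d.h_acceptors > 10:
        violations.append("h_acceptors")
    return violations


def passes_ro5(d: Descriptors, max_violations: int = 1) -> bool:
    """True if the compound has no more than the allowed number of violations."""
    return len(ro5_violations(d)) <= max_violations


if __name__ == "__main__":
    # Hypothetical descriptor values, for illustration only.
    small_alkaloid = Descriptors(mol_weight=368.4, log_p=2.1, h_donors=1, h_acceptors=6)
    large_oligomer = Descriptors(mol_weight=866.8, log_p=1.5, h_donors=15, h_acceptors=18)
    print(passes_ro5(small_alkaloid), ro5_violations(large_oligomer))
```

a filter of this kind only flags likely problems with oral absorption; it says nothing about potency, so it is best read alongside the docking and md results rather than in place of them.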
the discussion focused on these four active compounds showed that according to lipinski rule of five (rof- ) (no more than one violation is acceptable) (lipinski et al., ) , compounds could be used as orally dosed drugs in humans. the predicted human intestinal absorption (% hia) for speciophylline, uncarine f and uncaric acid revealed greater hia value in the range between % and %, while cadambine having relatively low values of % may be acceptable within % of marketed drugs. note that, greater hia values denote that these compounds could be absorbed throughout the intestinal segments upon oral administration. on the other hand, we calculated the most important physicochemical property to correlated passive molecular transport through membranes and drug-membrane interactions, such as polar surface area (psa) (ertl et al., ) . predicted psa showed favorable values for compounds ranging from to Å , indicating they would penetrate more efficiently through the infected host cells. in addition, the partition coefficient between n-octanol and water (log p o/w ) was calculated in order to explore the ability of constituents to pass through lipid bilayers (veber et al., ) . notably, speciophylline, uncarine f and uncaric acid presented values within ideal range for approved-drugs (ranging from . to . ). binding to serum albumin (expressed as log k hsa ) is the most important parameter for distribution and transport of antiviral drugs in the systemic circulation (colmenarejo, ; zhivkova, ) . early prediction of this parameter reduces the amount of wasted time and resources for drug development candidates in the antiviral therapy and management. we found that ligands fit well within the recommended values range (ranging from À . to . ), showing log k hsa numbers between - . and . . finally, we also predicted the passive transmembrane permeation using caco- cell monolayers or mdck cells as models. 
currently, both models are used as simplified in vitro models for intestinal absorption in drug development (broccatelli et al., ; pham-the et al., ; press & di grandi, ). our results showed that, in comparison with reference drugs, speciophylline, uncarine f and uncaric acid have values of - nm/s. in line with these observations, these compounds displayed high values of human intestinal absorption (% hia) above %, very similar to the values of the reference drugs (above %). given the aforementioned results, we believe that at least four of the components of cat's claw reported here may have favorable drug-like characteristics; hence, u. tomentosa may be a promising option to fight covid- infection. by integrating in silico approaches, this article provides evidence that several components of u. tomentosa could act by disrupting the association of the sars-cov- spike protein with the human ace- receptor or by blocking the rbd-ace- interaction during covid- virulence. further, we identified that various constituents of cat's claw have suitable pharmacokinetic properties for oral use as a potential antiviral. therefore, we believe that the ethanolic extract of u. tomentosa should be taken into consideration as a rapid response to covid- during the early stages of infection.

a molecular weight of the hybrid ( - ).
b polar surface area (psa) ( . - Å ).
c n-on number of hydrogen bond acceptors < .
d n-ohnh number of hydrogen bond donors .
e octanol water partition coefficient (log p o/w ) (- . to . ).
f binding-serum albumin (k hsa ) (- . to . ).
g human intestinal permeation (< poor, > great).
h madin-darby canine kidney (mdck) cells permeation.
i human intestinal absorption (% hia) (> % is high, < % is poor).
j heparin octasaccharide used as positive reference in this work.

the covid- outbreak that emerged from wuhan, china has acquired pandemic status, and severe acute respiratory syndrome requires the attention of academics to discover possible safe and effective drugs to ameliorate its effects worldwide. in the present study, constituents of u. tomentosa were docked on the binding interface of rbd-ace- and inside the rbd of the spike protein of the novel coronavirus sars-cov- . it was observed that components of u. tomentosa such as proanthocyanidin c , qag- , -isodihydrocadambine, uncarine f and uncaric acid had a good predicted binding affinity for the rbd-ace- interface as compared to the sulfated heparin octasaccharide (hepos). likewise, -dihydrocadambine, proanthocyanidin b , proanthocyanidin b and proanthocyanidin c had the highest docking scores on the sars-cov- spike glycoprotein in its open state, whereas md simulations at ns supported both the feasibility of the binding free energy predicted by the docking protocols and the stability of the docked protein-ligand complexes. virtual adme prediction revealed that speciophylline, uncarine f and uncaric acid presented druggability values in accordance with the lipinski rule, indicating their potential bioavailability as likely orally active antivirals. based on our findings and its ancestral use in the traditional medicine of south american countries, u. tomentosa could be evaluated as an herbal supplement, with safety and efficacy parameters assessed at both preclinical and clinical stages, to determine its effectiveness in the treatment of novel coronavirus disease . furthermore, all components found in u. tomentosa could work in synergism by different mechanisms to combat the spread of sars-cov- .

no potential conflict of interest was reported by the author(s).

andres f. yepes-pérez http://orcid.org/ - - -
oscar herrera-calderon http://orcid.org/ - - -
jorge quintero-saumeth http://orcid.org/ - - -

gromacs: high performance molecular simulations through multi-level parallelism from laptops to supercomputers.
softwarex, - , - antiviral potential of medicinal plants against hiv, hsv, influenza, hepatitis, and coxsackievirus: a systematic review new polyhydroxylated triterpenes from uncaria tomentosa triterpenes and quinovic acid glycosides from uncaria tomentosa uncaria tomentosa improves insulin sensitivity and inflammation in experimental nafld uncaria tomentosa a wellbehaved electrostatic potential based method using charge restraints for deriving atomic charges: the resp model a second generation force field for the simulation of proteins, nucleic acids, and organic molecules predicting passive permeability of drug-like molecules from chemical structure: where are we? antimutagenic and antiherpetic activities of different preparations from uncaria tomentosa (cat's claw) chemaxon -software solutions and services for chemistry and biology. marvinsketch, version in silico prediction of drug-binding strengths to human serum albumin fast calculation of molecular polar surface area as a sum of fragment-based contributions and its application to the prediction of drug transport properties toward a systematic molecular orbital theory for excited states natural bond orbital methods. wires computational molecular science knowledge-based scoring function to predict protein-ligand interactions the world summit of harmonization on traditional, alternative and complementary medicine pharmacophore development, drug-likeness analysis, molecular docking, and molecular dynamics simulations for identification of new ck inhibitors computational design of ace -based peptide inhibitors of sars-cov- ethnomedicine and drug discovery de novo design of protein peptides to block association of the sars-cov- spike protein with human ace an updated insight into the molecular pathogenesis, secondary complications and potential therapeutics of covid- pandemic uncaria tomentosa (willd.) dc. 
-ethnomedicinal use and new pharmacological, toxicological and botanical results two new -hydroxyursolic acid-type triterpenes from peruvian "una de gato" (uncaria tomentosa) evaluation of the flexx incremental construction algorithm for protein-ligand docking effect of alkaloid-free and alkaloid-rich preparations from uncaria tomentosa bark on mitotic activity and chromosome morphology evaluated by allium test sulfated polysaccharides effectively inhibit sars-cov- in vitro structure of the sars-cov- spike receptor-binding domain bound to the ace receptor alkaloids of peruvian uncaria tomentosa effects of aqueous fractions of uncaria tomentosa (willd.) d.c. on macrophage modulatory activities searching therapeutic strategy of new coronavirus pneumonia from angiotensin-converting enzyme : the target of covid- and sars-cov uncaria tomentosa alkaloidal fraction reduces aracellular permeability, il- and ns production on human microvascular endothelial cells infected with dengue virus experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings bioactive compounds from plants used in peruvian traditional medicine identification and quantification of components in extracts of uncaria tomentosa by hplc-es/ms autodock and autodocktools : automated docking with selective receptor flexibility automated docking using a lamarckian genetic algorithm and an empirical binding free energy function fractioning of proanthocyanidins of uncaria tomentosa. composition and structure-bioactivity relationship proanthocyanidin characterization and bioactivity of extracts from different parts of uncaria tomentosa l. 
(cat's claw molecular dynamics simulation analyses of viral fusion peptides in membranes prone to phase transition: effects on membrane curvature, phase behavior and lipid-water interface destabilization antioxidant activity of the extract from uncaria tomentosa polymorphic transitions in single crystals: a new molecular dynamics method hplc-pda method for quinovic acid glycosides assay in cat's claw (uncaria tomentosa) associated with uplc/q-tof-ms analysis chemical composition variability in the uncaria tomentosa (cat's claw) wild population in silico assessment of adme properties: advances in caco- cell monolayer permeability modeling drug targets for corona virus: a systematic review permeability for intestinal absorption: caco- assay and related issues immunomodulating and antiviral activities of uncaria tomentosa on human monocytes infected with dengue virus- new developments in molecular orbital theory cat's claw inhibits tnfa production and scavenges free radicals: role in cytoprotection general atomic and molecular electronic structure system an approach to computing electrostatic charges for molecules the amazon rain forest plant uncaria tomentosa (cat's claw) and its specific proanthocyanidin constituents are potent inhibitors and reducers of both brain plaques and tangles inhibition of herpes simplex type and type infections by oximacrov r , a cranberry extract with a high content of a-type proanthocyanidins (pacs-a) autodock vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading molecular properties that influence the oral bioavailability of drug candidates monoterpenoid indole alkaloids and phenols are required antioxidants in glutathione depleted uncaria tomentosa root cultures structure, function, and antigenicity of the sars-cov- spike glycoprotein a novel coronavirus outbreak of global health concern automatic atom type and bond type perception in molecular mechanical calculations 
annex who guidelines on good herbal processing practices for herbal medicines coronavirus disease (covid- ): weekly epidemiological update. world health organization severe acute respiratory syndrome coronavirus : from gene structure to pathogenic mechanisms and potential therapy interaction between gut microbiota and ethnomedicine constituents structural basis for the recognition of sars-cov- by full-length human ace studies on drug-human serum albumin binding: the current state of the matter key: cord- -vcxalkeo authors: graham, n. r.; whitaker, a. n.; strother, c. a.; miles, a. k.; grier, d.; mcelvany, b. d.; bruce, e. a.; poynter, m. e.; pierce, k. k.; kirkpatrick, b. d.; stapleton, r. d.; an, g.; botten, j. w.; crothers, j. w.; diehl, s. a. title: kinetics and isotype assessment of antibodies targeting the spike protein receptor binding domain of sars-cov- in covid- patients as a function of age and biological sex. date: - - journal: medrxiv : the preprint server for health sciences doi: . / . . . sha: doc_id: cord_uid: vcxalkeo sars-cov- is the newly emerged virus responsible for the global covid- pandemic. there is an incomplete understanding of the host humoral immune response to sars-cov- during acute infection. host factors such as age and sex as well the kinetics and functionality of antibody responses are important factors to consider as vaccine development proceeds. the receptor-binding domain of the cov spike (rbd-s) protein is important in host cell recognition and infection and antibodies targeting this domain are often neutralizing. in a cross-sectional study of anti-rbd-s antibodies in covid- patients we found equivalent levels in male and female patients and no age-related deficiencies even out to years of age. the anti-rbd-s response was evident as little as days after onset of symptoms and for at least weeks after symptom onset. 
anti-rbd-s igg, igm, and iga responses were simultaneously induced within days after onset, but isotype-specific kinetics differed such that anti-rbd-s igg was most sustained over a -week period. the kinetics and magnitude of neutralizing antibody formation strongly correlated with that seen for anti-rbd-s antibodies. our results suggest age- and sex- related disparities in covid- fatalities are not explained by anti-rbd-s responses. the multi-isotype anti-rbd-s response induced by live virus infection could serve as a potential marker by which to monitor vaccine-induced responses. human pathogenic coronaviruses (cov) such as severe acute respiratory syndrome (sars)- cov- , middle east respiratory syndrome (mers)-cov, and sars-cov- (all b-covs) have resulted from zoonoses and utilize cellular receptors to bind and access host cells for productive infection ( - ). 
cov spike (s) proteins are large (> kda) glycosylated trimeric structures that protrude from viral particles and enable binding of cov to cellular receptors. sars-cov- interacts with angiotensin converting enzyme- (ace ) via a flexible receptor-binding domain (rbd) located on the distal tip of the s protein ( - ). after binding, several proteases act upon s, priming it to adopt large conformational shifts that facilitate entry into host cells ( ). first the s domain (which contains rbd) is cleaved from the c-terminal s domain. for sars-cov- this process may involve furin in the host cell membrane due to a novel furin-recognition site in the s /s region ( - ). the s domain is further processed by other serine and cysteine proteases such as trypsin, cathepsin, and tmprss to facilitate viral entry into the host cell ( , ). neutralizing antibodies to sars-cov- have been isolated and were found to target rbd-s ( ). one of these mabs, cr , was also found to bind sars-cov- rbd-s ( ). at the polyclonal level, the quantity of anti-rbd-s igg antibodies against sars-cov- correlates well with neutralizing activity ( - ). cross-neutralization amongst sars viruses by rbd-s-targeting antibodies can occur ( - ). however, sequence homology for rbd-s is low for non-sars b-covs (such as mers) and for a-covs such as nl , oc , e, and hku ( , ). for these reasons serology for sars-cov- rbd-s is being used to help identify recovered covid- patients as plasma donors for passive immunotherapy ( ). there are several risk factors for covid- mortality but whether two of these, age and biological sex, are associated with the sars-cov- rbd-s immune response has to our knowledge not been addressed in the peer-reviewed literature. furthermore, most serology studies have been done in the setting of severe covid- disease and, save for one study ( ), without the benefit of detailed kinetics. 
herein we tracked the kinetics and magnitude of neutralizing and anti-sars-cov- s and rbd-s antibodies in a cross-sectional cohort of pcr- confirmed covid- patients. all rights reserved. no reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medrxiv a license to display the preprint in perpetuity. we chose a two-step elisa-based rbd-s-focused approach to serology in our study population. reagents and pre-print protocols were available in mid-march , which indicated that rbd-s screening and full-s confirmation could identify specific and functional antibodies and be quickly operationalized. using the established protocol ( ) we confirmed the expected protein size of mammalian-expressed rbd-s ( figure a ) and trimerized spike ( figure b ) produced from dna plasmids (gift from florian krammer, mt sinai school of medicine). rbd-s antibodies were specific and correlated with neutralization ( ), findings that have been validated using similar rbd-s-focused assays( , ). we confirmed rbd-s and s protein conformation by binding of cr human igg (figure c, d) . cr was isolated as a sars-s domain-binding single chain antibody fragment by phage display and is neutralizing as an igg ( ). cr binds adjacent to rbd-s in trimeric s of sars-cov- in a glycosylation-sensitive manner( ). mammalian expression of appropriate size proteins and recognition by cr together confirm that our protein preparations exhibited the expected characteristics. we first piloted our antigen preps for the rbd-s igg screening assay using serum samples from a pcr-confirmed severe covid- patient (defined as admission to the intensive care unit, icu) who was admitted to the hospital days following symptom onset and based on an early report suggesting that sars-cov- could trigger antibody responses in this timeframe ( ). 
we compared igg reactivity in this sample to decreasing amounts of our rbd-s antigen preparations against a fixed, recommended amount of commercially produced rbd-s protein derived from the protocol we used ( ). we found that a wide range of locally produced rbd-s antigen yielded igg reactivity equivalent to ng of commercial antigen in an acute serum sample from this covid- -positive patient ( figure e) . no signal was observed in a pre- serum sample or in the absence of serum ( figure e ). using the standard ng amount hereafter, we found that rbd-s-binding igm and igg were present at - days after symptom onset. we did not detect any rbd-s-binding in healthy pre- sera (figure f) , in agreement with extensive testing of this assay in pre-covid- serum performed elsewhere ( ). due to different secondary antibodies for igm and igg detection we cannot conclude whether absolute levels of rbd-s igg were higher than rbd-s igm. total igg and igm were readily detected in both covid- and in healthy non-covid- serum ( figure g) . for a cross-sectional covid- serological survey we collected serum samples from patients that tested covid- positive by nasopharyngeal swab rt-qpcr testing. all patients had been admitted to the hospital and / ( %) were admitted to the icu. twenty-five patients were subsequently discharged and died. one to five serum samples were collected from each patient with the first sample being taken within approximately days after diagnosis, in which diagnosis occurred around days after symptom onset ( table ) . there was a %: % male: female distribution and patients were on average ± years of age (range - years) ( table ) . 
a male bias in covid- mortality was reported early during the pandemic ( - ) and has been confirmed worldwide in a recent meta-analysis ( ). one hypothesis to explain this is a difference in adaptive immunity between males and females. although the mean serum rbd-s igg reactivity level appeared higher in male samples (o.d. = . , n = ) versus female samples (o.d. = . , n = ) this difference was not significant and the same maximum reactivity values were found in males and females (figure a) . although not absolute, it appears that irrespective of comorbidities, there is a higher risk of covid- mortality and morbidity in older individuals ( years of age and over) ( - ). we therefore assessed rbd-s igg antibodies by age. there was a broad range of rbd-s igg responses that did not differ as a function of age as assessed by correlation analysis (r < . , figure b ). notably, one of the highest rbd-s igg responses was from a -year old patient. a serum sample from a -year old covid- patient was negative for rbd-s igg, but this sample was taken just three days after symptom onset, which may be too early for induction of robust igg responses. taken together, we did not find evidence of biological sex- or age-related deficiencies in rbd-s igg responses in covid- patients. rbd-s-reactive serum igg was detected in of ( %) samples that were taken within days of symptom onset ( figure c ). after day of symptoms, % of samples were positive for rbd igg ( figure c ). there were small variations in positive threshold for rbd by assay date ( figure s ). 
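the age analysis above rests on a rank correlation between age and rbd-s igg reactivity. the following is a minimal sketch of how such a coefficient can be computed, assuming spearman's rho (the statistic named in the figure legends); the function names and the example values are hypothetical and are not study data.

```python
from statistics import mean

def average_ranks(values):
    """Rank values from 1..n; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # extend j over the run of values tied with position i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of the 1-based ranks i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def spearman_rho(x, y):
    """Spearman's rho, computed as the Pearson correlation of the ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5

# hypothetical ages (years) and O.D. readings, for illustration only
ages = [34, 51, 62, 70, 88]
ods = [1.2, 0.4, 2.1, 0.9, 1.8]
rho = spearman_rho(ages, ods)
```

a rho near zero, as reported in the text, would indicate no monotonic trend of igg reactivity with age; working on ranks makes the statistic insensitive to the scale of the o.d. readings.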
we therefore confirmed each sample (whether rbd-positive or not) with an endpoint titration and area under the curve calculation for reactivity against the full spike ectodomain trimer ( ). samples that were rbd-s-negative were also low for spike total reactivity (auc) and titer (figures d, e) . furthermore, we found a very strong correlation between rbd and spike igg ( figure in the patient-specific rbd igg data ( figure s a ) we found several patterns: initial seroconversion (e.g. patients , and ), rapid increases (e.g. patients , , , , , occurring between days - ), and plateaued responses (e.g. patients and , occurring mainly after day ). these responses were concordant with temporal patient-specific s igg titers ( figure s b ). anti-s titers in patients with a negative rbd-s test were generally low and in rbd-positive samples, followed the same trends as rbd-reactivity, providing further confirmation of robust serological responses to sars-cov- during acute covid- . at the patient level, neutralizing activity was observed after as few as five days after symptom onset and throughout the study period and was predominantly found in those samples with positive rbd-s igg ( figure s ) . to assess antibody isotype dynamics during acute sars-cov- we followed rbd-s and full spike-specific igm and iga levels in the same samples for which rbd-s and spike igg was determined. at the patient level we found robust co-occurrence of igm, igg, and iga antibodies reactive to rbd-s in most samples, particularly in post-day samples (figure s ) . pooling all the data revealed that all pre-day rbd-s responses for all isotypes were low. 
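the endpoint titration and area under the curve calculation mentioned above can be sketched as follows. this is an illustration under stated assumptions, not the authors' protocol: the source does not give the dilution series, the cutoff, or the auc convention, so the trapezoidal rule over log10 dilution, the function names, and the example numbers are all hypothetical.

```python
import math

def endpoint_titer(dilutions, ods, cutoff):
    """Largest reciprocal dilution whose O.D. is still at or above the cutoff (0 if none)."""
    positive = [d for d, od in zip(dilutions, ods) if od >= cutoff]
    return max(positive) if positive else 0

def titration_auc(dilutions, ods, cutoff=0.0):
    """Trapezoidal area under O.D. versus log10(reciprocal dilution), counting only signal above the cutoff."""
    pts = sorted((math.log10(d), max(od - cutoff, 0.0))
                 for d, od in zip(dilutions, ods))
    return sum((x1 - x0) * (y0 + y1) / 2
               for (x0, y0), (x1, y1) in zip(pts, pts[1:]))

# hypothetical three-point series (reciprocal dilutions and O.D. readings)
dilutions = [100, 1000, 10000]
ods = [2.0, 1.0, 0.0]
titer = endpoint_titer(dilutions, ods, cutoff=0.5)
auc = titration_auc(dilutions, ods)
```

the titer reports only the last positive dilution, whereas the auc also reflects the height of the curve, which is one reason the two are often reported together.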
around day , igm targeting rbd-s as well as the switched isotypes igg and iga simultaneously rose. while rbd-reactive igm and iga responses tapered after weeks post-onset (though remained higher than baseline), those for igg continued to rise to a plateau that was sustained up to weeks after symptom onset (the most protracted timepoint measured, figure a ). similar patterns were obtained for full spike-reactive antibodies ( figure b ). these results suggest that during acute infection covid- patients undergo a seroconversion across isotypes to sars-cov- rather than an expansion of pre-existing anti-cov antibodies. lastly, we assessed anti-rbd-igg responses by clinical severity. all the patients in this study were hospitalized and % of were admitted to the intensive care unit. when we stratified by icu admission and compared rbd-s igg levels, we found a trend towards higher levels in those requiring icu-level care (p = . ) ( figure a) . additionally, we observed a significant association between rbd-s igg and duration of icu admission ( figure b ). lastly of ( %) patients succumbed to covid- . while a significant difference in the median rbd-s igg was not observed between survivors and decedents, a smaller range trending towards higher rbd-s reactivity was observed in those patients that died ( figure c ). although we did not have continuous monitoring of viral load in these patients during hospitalizations it is possible that rbd-s igg levels reflect ongoing viral replication during more severe disease and in conjunction with other factors may allow for recovery. taken together, our results provide the first comprehensive survey of sars-cov- spike rbd antibodies that accounts for two key risk factors for covid- . neither rbd-s nor s antibodies were significantly different as a function of biological sex. anti-rbd-s and spike igg 
responses were induced across decades of age with robust responses found in several samples from patients ≥ years old. these results also extend kinetic analyses and confirm the paucity of anti-sars-cov- anti-spike responses in very early blood samples taken prior to day after symptoms onset ( , ). we also assessed protective anti-spike rbd responses as a function of level of hospital care and disease severity and found that duration of icu-level care was associated with higher responses, possibly due to an extended period of sars-cov- replication during severe disease. a limitation of our study is that we only followed symptomatic patients admitted to hospital; it is unclear whether antibody responses differ in asymptomatic or mildly symptomatic patients. we also did not directly assess whether the rbd-specific antibodies we studied were neutralizing at the clonal level, though we did observe a strong association with polyclonal rbd-s igg responses and sars-cov- neutralizing activity. this is in agreement with other reports which confirm that rbd-s igg levels correlate with neutralizing activity and that the rbd of sars-cov- is a potent target for neutralizing antibodies ( - , , , ). it will be important to determine whether anti-rbd iga or even igm antibodies contribute to blocking activity. 
was also used as a positive control during assay set up and this reagent was produced in hek t cells under hhsn c and obtained through bei resources, niaid, nih: wuhan-hu- , recombinant from nr- . cr is a sars-cov s-specific antibody originally isolated by single chain variable region phage display and then cloned as an igg /kappa monoclonal human igg /k ( ). we received cr heavy chain (hc) and light chain (lc) cloned into pfusess-chig-hg and pfuse ss- clig-hk, respectively (invivogen) from florian krammer spotted on filter paper. we graphics and statistical testing. all statistics and graphics were performed using r version . . using standard packages or graphpad prism . . . non-parametric loess (local regression) was used for smoothing. 
detection of serum igg from a covid- patient (left), but not from pre- serum (center) or and % confidence intervals are shown for each isotype. spearman's rho coefficient (r ), % confidence interval, and p-value are shown. (c) rbd-s igg in patients that were deceased or discharged were analyzed by student's t-test and p-value is shown. boxplots show the median, % confidence level, and all individual samples. 
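the methods state that non-parametric loess (local regression) was used to smooth the kinetic curves. the sketch below shows the idea in plain python, assuming a locally weighted linear fit with tricube weights; it is a simplified stand-in for the r routine the authors used (which, as far as we know, defaults to a local quadratic fit), and the function name, the `frac` parameter, and the example data are hypothetical.

```python
def loess_at(xs, ys, x0, frac=0.75):
    """Locally weighted linear fit evaluated at x0, using the nearest frac of the points."""
    n = len(xs)
    k = max(2, min(n, round(frac * n)))
    # bandwidth = distance from x0 to its k-th nearest neighbour
    h = sorted(abs(x - x0) for x in xs)[k - 1]
    if h == 0:
        h = 1e-12
    # tricube weights: (1 - (d/h)^3)^3 inside the window, 0 at or beyond its edge
    w = [max(0.0, 1 - (abs(x - x0) / h) ** 3) ** 3 for x in xs]
    sw = sum(w)
    if sw == 0:
        return sum(ys) / n  # degenerate window: fall back to the plain mean
    mx = sum(wi * x for wi, x in zip(w, xs)) / sw
    my = sum(wi * y for wi, y in zip(w, ys)) / sw
    sxx = sum(wi * (x - mx) ** 2 for wi, x in zip(w, xs))
    sxy = sum(wi * (x - mx) * (y - my) for wi, x, y in zip(w, xs, ys))
    slope = sxy / sxx if sxx > 0 else 0.0
    return my + slope * (x0 - mx)

# hypothetical days after symptom onset and O.D. readings, for illustration only
days = [2, 4, 6, 8, 10, 14, 21, 28]
od = [0.1, 0.1, 0.3, 0.9, 1.6, 2.0, 2.2, 2.1]
smoothed = [loess_at(days, od, d) for d in days]
```

a smaller `frac` follows the data more closely, while a larger one gives a smoother curve; the choice matters most where samples are sparse, such as the late timepoints.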
origin and evolution of pathogenic coronaviruses genomic characterisation and epidemiology of novel coronavirus: implications for virus origins and receptor binding a pneumonia outbreak associated with a new coronavirus of probable bat origin sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor receptor recognition by the novel coronavirus from wuhan: an analysis based on decade-long structural studies of sars coronavirus site-specific glycan analysis of the sars-cov- spike cryo-em structure of the -ncov spike in the prefusion conformation coronavirus spike protein and tropism changes cleavage site in the spike protein of sars-cov- is essential for infection of human lung cells phylogenetic analysis and structural modeling of sars-cov- spike protein reveals an evolutionary distinct and proteolytically sensitive activation loop proteolytic cleavage of the sars-cov- spike protein and the role of the novel s /s site. 
iscience characterization of spike glycoprotein of sars-cov- on virus entry and its immune cross-reactivity with sars-cov a highly conserved cryptic epitope in the receptor binding domains of sars-cov- and sars-cov a serological assay to detect sars-cov- seroconversion in humans the receptor binding domain of the viral spike protein is an immunodominant and highly specific target of antibodies in sars-cov- patients rapid generation of neutralizing antibody responses in covid- patients human neutralizing antibodies elicited by sars-cov- infection antigenicity of the sars-cov- spike glycoprotein broad neutralization of sars-related viruses by human monoclonal antibodies convalescent plasma treatment of severe covid- : a matched control study temporal profiles of viral load in posterior oropharyngeal saliva samples and serum antibody responses during infection by sars-cov- : an observational cohort study characteristics of and important lessons from the coronavirus disease (covid- ) outbreak in china: summary of a report of cases from the chinese center for disease control and prevention epidemiological and clinical characteristics of cases of novel coronavirus pneumonia in wuhan, china: a descriptive study clinical characteristics of coronavirus disease in china considering how biological sex impacts immune responses and covid- outcomes estimates of the severity of coronavirus disease : a model-based analysis age and multimorbidity predict death among covid- patients: results of the sars-ras study of the italian society of hypertension disease and healthcare burden of covid- in the united states serological assays for emerging coronaviruses: challenges and pitfalls : , ) and plotted against days of symptoms. (e) spike igg endpoint titer or (f) auc is plotted against rbd-igg reactivity. (g) sars-cov- microneutralization titers are plotted against rbd-s igg reactivity. cutoff values 
key: cord- -qffg r authors: wong, alan h. m.; tomlinson, aidan c. a.; zhou, dongxia; satkunarajah, malathy; chen, kevin; sharon, chetna; desforges, marc; talbot, pierre j.; rini, james m. title: receptor-binding loops in alphacoronavirus adaptation and evolution date: - - journal: nat commun doi: . /s - - -x sha: doc_id: cord_uid: qffg r rna viruses are characterized by a high mutation rate, a buffer against environmental change. nevertheless, the means by which random mutation improves viral fitness is not well characterized. here we report the x-ray crystal structure of the receptor-binding domain (rbd) of the human coronavirus, hcov- e, in complex with the ectodomain of its receptor, aminopeptidase n (apn). three extended loops are solely responsible for receptor binding and the evolution of hcov- e and its close relatives is accompanied by changing loop–receptor interactions. phylogenetic analysis shows that the natural hcov- e receptor-binding loop variation observed defines six rbd classes whose viruses have successively replaced each other in the human population over the past years. these rbd classes differ in their affinity for apn and their ability to bind an hcov- e neutralizing antibody. together, our results provide a model for alphacoronavirus adaptation and evolution based on the use of extended loops for receptor binding. coronaviruses are enveloped, positive-stranded rna viruses that cause a number of respiratory, gastrointestinal, and neurological diseases in birds and mammals , . the coronaviruses all possess a common ancestor and four different genera (alpha, beta, gamma, and delta) that collectively use at least four different glycoproteins and acetylated sialic acids as host receptors or attachment factors have evolved [ ] [ ] [ ] . 
four coronaviruses, hcov- e, hcov-nl , hcov-oc , and hcov-hku circulate in the human population and collectively they are responsible for a significant percentage of the common cold as well as more severe respiratory disease in vulnerable populations , . hcov- e and hcov-nl are both alphacoronaviruses and although closely related, they have evolved to use two different receptors, aminopeptidase n (apn) and angiotensin converting enzyme (ace ), respectively , . the more distantly related betacoronaviruses, hcov-oc and hcov-hku , are less well characterized and although hcov-oc uses -o-acetylsialic acid as its receptor , the receptor for hcov-hku has not yet been determined [ ] [ ] [ ] . recent zoonotic transmission of betacoronaviruses from bats is responsible for sars and mers, and in these cases infection is associated with much more serious disease and high rates of mortality [ ] [ ] [ ] . like hcov-nl , sars-cov uses ace as its receptor and the observation that mers-cov uses dipeptidyl peptidase highlights the fact that coronaviruses with new receptor specificities continue to arise. the coronavirus spike protein (s-protein) is a trimeric single-pass membrane protein that mediates receptor binding and fusion of the viral and host cell membranes . it is a type- viral fusion protein possessing two regions, the s region that contains the receptor-binding domain (rbd) and the s region that contains the fusion peptide and heptad repeats involved in membrane fusion [ ] [ ] [ ] [ ] [ ] [ ] . the coronavirus s-protein is also a major target of neutralizing antibodies and one outcome of host-induced neutralizing antibodies is the selection of viral variants capable of evading them, a process known to drive variation [ ] [ ] [ ] . as shown by both in vivo and in vitro studies, changes in host, host cell type, cross-species transmission, receptor expression levels, serial passage, and tissue culture conditions can also drive viral variation [ ] [ ] [ ] [ ] [ ] . 
rna viruses are characterized by a high mutation rate, a property serving as a buffer against environmental change . a host-elicited immune response, the introduction of antiviral drugs, and the transmission to a new species provide important examples of environmental change . nevertheless, the means by which random mutations lead to viral variants with increased fitness and enhanced survival in the new environment are not well characterized. given their wide host range, diverse receptor usage and ongoing zoonotic transmission to humans, the coronaviruses provide an important system for studying rna virus adaptation and evolution. the alphacoronavirus, hcov- e, is particularly valuable as it circulates in the human population and a sequence database of natural variants isolated over the past fifty years is available. moreover, changes in sequence and serology have suggested that hcov- e is changing over time in the human population [ ] [ ] [ ] . reported here is the x-ray structure of the hcov- e rbd in complex with human apn (hapn). the structure shows that receptor binding is mediated solely by three extended loops, a feature shared by hcov-nl and the closely related porcine respiratory coronavirus, prcov. it also shows that the hcov- e rbd binds at a site on hapn that differs from the site where the prcov rbd binds on porcine apn (papn), evidence of an ability of the rbd to acquire novel receptor interactions. remarkably, we find that the natural hcov- e sequence variation observed over the past fifty years is highly skewed to the receptor-binding loops. moreover, we find that the loop variation defines six rbd classes (classes i-vi) whose viruses have successively replaced each other in the human population. these rbd classes differ in their affinity for hapn and their ability to be bound by a neutralizing antibody elicited by the hcov- e reference strain (class i). 
taken together, our results provide a model for alphacoronavirus adaptation and evolution stemming from the use of extended loops for receptor binding. characterization of the hcov- e rbd interaction with hapn. to define the limits of the hcov- e rbd, we expressed a series of soluble s-protein fragments and measured their affinity to a soluble fragment (residues - ) of hapn, the hcov- e receptor. the smallest s-protein fragment made (residues - ) bound hapn with an affinity (k d of . ± . µm) similar to that of the entire s region (residues - ) ( table , supplementary fig. a , b) and this fragment was used in the structure determination. to confirm the importance of the hcov- e rbd-hapn interaction for viral infection, we showed that both the rbd and the hapn ectodomain inhibited viral infection in a cell-based assay (fig. a, b, c) . table: analysis of the hapn ectodomain (residues - , wt and mutants) interaction with fragments of the hcov- e s-protein (wt and mutants) using surface plasmon resonance. crystals of the hcov- e rbd-hapn complex were obtained by co-crystallization of the complex after size exclusion chromatography. the crystallographic data collection and refinement statistics are shown in table . the asymmetric unit contains one hapn dimer (and associated rbds) and one hapn monomer (and associated rbd) that is related to its dimeric mate by a crystallographic two-fold rotation axis. both dimers (noncrystallographic and crystallographic) are found in the closed conformation and are essentially identical to that which we previously reported for hapn in its apo form (rmsd over all cα atoms of . Å). each apn monomer is bound to one rbd as shown in fig. a . the hcov- e rbd-hapn interaction buries Å of surface area on the rbd and Å on hapn. the hcov- e rbd is an elongated six-stranded β-structural domain with three extended loops (loop : residues - , loop : residues - , loop : residues - ) at one end that exclusively mediate the interaction with hapn (fig. b ). 
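the affinities quoted above are dissociation constants obtained by surface plasmon resonance. the source does not say which binding model was fitted, so the following is only a sketch under the assumption of a 1:1 steady-state model, r = rmax * c / (kd + c); the function name, the grid search, and the concentrations are hypothetical and the responses are generated from the model, not measured.

```python
def fit_kd(concs, responses, kd_grid):
    """Grid search for the Kd (and matching Rmax) minimising squared error of R = Rmax*C/(Kd+C)."""
    best = None
    for kd in kd_grid:
        # fractional occupancy predicted at each analyte concentration
        occ = [c / (kd + c) for c in concs]
        # for a fixed Kd the least-squares Rmax has a closed form
        rmax = sum(o * r for o, r in zip(occ, responses)) / sum(o * o for o in occ)
        sse = sum((r - rmax * o) ** 2 for o, r in zip(occ, responses))
        if best is None or sse < best[0]:
            best = (sse, kd, rmax)
    return best[1], best[2]

# synthetic example: concentrations in µM, responses from Kd = 1.0 µM and Rmax = 100
concs = [0.1, 0.3, 1.0, 3.0, 10.0]
responses = [100 * c / (1.0 + c) for c in concs]
kd_grid = [0.1 * k for k in range(1, 101)]  # 0.1 .. 10 µM in 0.1 µM steps
kd, rmax = fit_kd(concs, responses, kd_grid)
```

under this model the kd is the concentration giving half-maximal response, so a mutant that shows no binding up to the highest concentration tested can only be assigned a lower bound on its kd.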
loop is the longest and it contributes ~ % of the rbd surface buried on complex formation (figs. c and g). within loop , residues cys and cys form a disulfide bond that makes a stacking interaction with the side chains of hapn residues tyr and glu (fig. c) . the c s/c s rbd double mutant showed no binding to hapn at concentrations up to μm (table , supplementary fig. d , and supplementary table ), evidence of the importance of the stacking interaction and a likely role for the disulfide bond in defining the conformation of loop . notably, loop contains three tandemly repeated glycine residues (residues - ) whose nh groups donate hydrogen bonds to the side chain of asp and the carbonyl oxygen of phe of hapn (fig. c) ; mutation of hapn residue asp to alanine leads to a ~ -fold reduction in affinity ( (fig. c) ; the importance of trp of loop is evidenced by the fact that mutating it also ablates binding (table , supplementary fig. f , and supplementary table ). hcov- e and prcov bind at different sites on apn. as with hcov- e, the porcine respiratory alphacoronavirus, prcov, also uses apn as its receptor . as our complex shows, hcov- e binds at a site on hapn (h-site) that differs from the site on papn (p-site) used by prcov (fig. a, b) . glu in hapn, a residue in the hapn-rbd interface, is an n-glycosylated asparagine (asn ) in papn and attempts to dock the hcov- e rbd at the h-site on papn lead to a steric clash with the n-glycan ( supplementary fig. a ). consistent with this observation, the hcov- e rbd cannot bind to a mutant form of hapn (e n/k e/q t) that possesses an n-glycan at position , as we have shown ( . across species, the sequence identity at the h- and p-sites is only ~ % ( fig. c and supplementary fig. c ) and the receptor-binding loops of these viruses must be accommodating the remaining apn structural differences on receptors from species that they do not infect. 
together these results provide evidence that the extended receptor-binding loops of these alphacoronaviruses possess conformational plasticity. the observation that hcov- e and prcov bind to different sites on apn has important consequences. among species, apn is found in open/intermediate and closed conformations and conversion between them is thought to be important for the catalysis of its substrates , . the hcov- e rbd binds to hapn in its closed conformation and structural comparison shows that the h-site does not differ between the open and closed conformations. this is to be contrasted with the p-site of papn that differs in the open and closed conformations. indeed, the prcov rbd has recently been shown to bind to papn in the open conformation as a result of p-site interactions made possible in the open form . these differences in binding and receptor conformation are reflected in the fact that enzyme inhibitors that promote the closed conformation of apn block tgev infection , but not hcov- e infection , and the fact that the prcov s-protein , but not hcov- e , inhibits apn catalytic activity. the receptor-binding loops of hcov- e vary extensively. sequence data from viruses isolated over the past years provides a wealth of data on the natural variation shown by hcov- e ( supplementary fig. ). with reference to the hcov- e rbd-hapn complex reported here, we now show that % of the amino acids in the receptor-binding loops and supporting residues vary among the sequences analyzed ( sequences in total), while only % of the rbd surface residues outside of the receptor-binding loops show variation (fig. a, b) . moreover, for the eight variants where full genome sequences were reported, the receptor-binding loops represent the location at which the greatest variation in the entire genome is observed (fig. c) . 
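The variation tallies reported above rest on per-column comparisons of aligned sequences. The following is a minimal Python sketch of that kind of counting, assuming the sequences have already been aligned to equal length. The function names and the toy sequences are hypothetical illustrations and are not the authors' pipeline or data.

```python
from typing import Iterable, List


def variable_positions(aligned: List[str]) -> List[int]:
    """Return 0-based column indices at which the aligned sequences are not all identical."""
    if not aligned:
        return []
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("sequences must be aligned to the same length")
    return [i for i in range(length) if len({seq[i] for seq in aligned}) > 1]


def fraction_variable(aligned: List[str], region: Iterable[int]) -> float:
    """Fraction of the columns in `region` (0-based indices) that vary among the sequences."""
    columns = list(region)
    if not columns:
        return 0.0
    variable = set(variable_positions(aligned))
    return sum(1 for i in columns if i in variable) / len(columns)


def binned_differences(aligned: List[str], bin_size: int) -> List[int]:
    """Count variable columns in consecutive bins of `bin_size` columns."""
    if bin_size <= 0:
        raise ValueError("bin_size must be positive")
    if not aligned:
        return []
    n_bins = (len(aligned[0]) + bin_size - 1) // bin_size
    counts = [0] * n_bins
    for i in variable_positions(aligned):
        counts[i // bin_size] += 1
    return counts


# Hypothetical toy alignment (not real hcov sequences).
toy = ["ACDGGGW", "ACDGVGW", "ACEGPGW"]
loop_columns = range(2, 6)  # hypothetical "loop" region

print(variable_positions(toy))
print(fraction_variable(toy, loop_columns))
print(binned_differences(toy, 4))
```

Comparing `fraction_variable` for loop columns against the remaining surface columns gives the kind of contrast described in the text, and `binned_differences` mirrors the per-bin sums plotted across the protein-coding region.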
analysis of the hcov- e rbd-hapn interface further shows that of the rbd surface residues that are fully or partially buried on complex formation, of them vary in at least one of the sequences analyzed and a pairwise comparison of the sequences suggests that many of these positions can vary simultaneously ( supplementary fig. ). finally, we show that the six invariant interface residues on the rbd (gly , gly , cys , cys , asn , and arg ) constitute only % of the viral surface area buried, the very region expected to be the most highly conserved from a receptor-binding standpoint. the remaining % (i.e., Å ) of the viral surface area buried is made up of residues that differ in their variability and the role they play in complex formation (supplementary table ).

fig. naturally occurring hcov- e sequence variation. a color-coded amino-acid sequence conservation index (chimera) mapped onto a ribbon representation of the hcov- e rbd. blue represents a high percentage sequence identity and red represents a low percentage sequence identity among the viral isolates analyzed. b surface representation in the same orientation as in (a, left), and rotated ° (right). the asn-glcnac moiety of the n-glycans are shown in stick representation. color coding as in a. c amino-acid sequence variation shown by the eight viral isolates whose entire genome sequences have been reported. the entire protein coding region of the viral genome was treated as a continuous amino acid string ( residues in total). amino acid differences among the eight sequences were analyzed in residue bins and for each bin the sum was plotted. green-colored bins correspond to residues in the s-protein and purple-colored bins correspond to residues in the rbd. the horizontal dotted line denotes the average number of amino-acid differences per bin across the protein-coding region of the whole viral genome. d alignment of the sequences selected for each of the six classes. the "|" symbol demarcates every residues in the alignment. e representative images showing hcov- e infection of l- cells in the presence of: pbs, monoclonal antibody . .e at two different concentrations, and monoclonal antibody . h at two different concentrations (anti-hcov-oc antibody). the nucleus is stained blue and green staining indicates viral infection. magnification (× ) and scale bar = µm. f statistical quantification of the monoclonal antibody inhibition experiment. error bars correspond to standard deviations obtained from three independent experiments.

loop variation leads to phylogenetic classes. phylogenetic analysis of the hcov- e rbd sequences found in the database showed that they segregate into six classes ( supplementary fig. ). class i contains the atcc- reference strain (originally isolated in and deposited in ) and related lab strains, while classes ii-vi represent clinical isolates that have successively replaced each other in the human population over time since the s. to characterize these classes, a representative sequence from each was selected; for class i, the rbd of the reference strain, also used in our structural analysis, was selected. to simplify characterization, the rbds of the other five classes were synthesized with the class i sequence in all but the loop regions (fig. d) . as observed for class i, the other rbds do not bind to the hapn mutant that introduces an n-glycan at glu (supplementary fig. d) , an observation suggesting that they all bind at the same site on hapn. the rbds bound hapn with a ~ -fold range in affinity (k d from~ to~ nm). these differences in affinity are largely a result of differences in k off with little difference in k on (table and supplementary fig. ) . table shows the identity of the loop residues that have shown variation. of those buried in the rbd-hapn interface, residues , , and are particularly noteworthy as they undergo considerable variation in amino-acid character.
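The statement that affinity differences track k off rather than k on follows from the 1:1 binding relation K_D = k_off / k_on. The sketch below illustrates that relation in Python. All rate constants in it are hypothetical placeholders chosen for illustration, not values from this study, and the function names are assumptions.

```python
import math


def dissociation_constant(k_on: float, k_off: float) -> float:
    """K_D in M for a 1:1 interaction, from k_on (M^-1 s^-1) and k_off (s^-1)."""
    if k_on <= 0 or k_off < 0:
        raise ValueError("k_on must be positive and k_off non-negative")
    return k_off / k_on


def complex_half_life(k_off: float) -> float:
    """Half-life in seconds of the bound complex; it depends only on k_off."""
    if k_off <= 0:
        raise ValueError("k_off must be positive")
    return math.log(2) / k_off


# Hypothetical variants sharing one association rate but differing in dissociation rate.
k_on_shared = 1.0e5      # M^-1 s^-1 (assumed)
k_off_fast = 1.0e-2      # s^-1 (assumed)
k_off_slow = 1.0e-3      # s^-1 (assumed)

kd_fast = dissociation_constant(k_on_shared, k_off_fast)
kd_slow = dissociation_constant(k_on_shared, k_off_slow)

# With k_on held constant, the fold change in K_D equals the fold change in k_off.
print(kd_fast / kd_slow)
print(complex_half_life(k_off_fast), complex_half_life(k_off_slow))
```

Under these assumptions a tenfold slower dissociation produces a tenfold tighter K_D and a tenfold longer-lived complex, which is the pattern the text attributes to the rbd classes.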
residue , for example, accounts for % of the total buried surface area on complex formation and changes from gly to val to pro in the transition from classes i to vi. variation of this sort provides insight into how changes in receptor-binding affinity might be mediated during the process of viral adaptation. each of the six rbd classes were also characterized using a neutralizing mouse monoclonal antibody ( . e ) that we generated against the hcov- e reference strain (class i). as shown in fig. e, f, . e inhibits hcov- e infection of the l cell-line. this antibody binds to the class i rbd with a k d of nm (k on = . × m − s − , k off = . s − ) and as shown by a competition binding experiment, it blocks the rbd-hapn interaction ( supplementary fig. a, b) . in contrast, . e shows no binding to the other five rbd classes at a concentration of μm (supplementary fig. c ), strong evidence that the receptorbinding loops of the class i rbd are important for antibody binding and that loop variation can abrogate antibody binding. consistent with this observation, non-conserved amino-acid changes both within and outside of the rbd-hapn interface are observed across all classes (supplementary table ). correlating structure and function with natural sequence data is a powerful means of studying viral adaptation and evolution. to this end, we have delimited the hcov- e rbd and determined its x-ray structure in complex with the ectodomain of its receptor, hapn. we found that three extended loops on the rbd are solely responsible for receptor binding, and that these loops are highly variable among viruses isolated over the past years. a phylogenetic analysis also showed that the rbds of these viruses define six rbd classes whose viruses have successively replaced each other in the human population. the six rbds differ in their receptor-binding affinity and their ability to be bound by a neutralizing antibody ( . 
e ) and taken together, our findings suggest that the hcov- e sequence variation observed arose through adaptation and selection. antibodies that block receptor binding are a common route to viral neutralization and exposed loops are known to be particularly immunogenic . loop-binding neutralizing antibodies are elicited by the alphacoronavirus tgev , and the receptorbinding loops of hcov- e mediate the binding of the neutralizing antibody, . e . as shown by the sequences of the viral isolates analyzed, the rbds differ almost exclusively in their receptor-binding loops. . e blocks the hapn-rbd interaction and it can only bind to the rbd (class i) found in the virus that elicited it. this observation shows that loop variability can abrogate neutralizing antibody binding. indeed, the successive replacement or ladder-like phylogeny observed, when the sequence of the hcov- e rbd is analyzed, is characteristic of immune escape as shown by the influenza virus , . taken together, our results suggest that immune evasion contributes to if not explains the extensive receptor-binding loop variation shown by hcov- e over the past years. hcov- e infection in humans does not provide protection against different isolates , and viruses that contain a new rbd class that cannot be bound by the existing repertoire of loop-binding neutralizing antibodies provide an explanation for this observation. neutralizing antibodies that block receptor binding can also be thwarted by an increase in the affinity/avidity between the virus and its host receptor. increased receptor-binding affinity/avidity allows the virus to more effectively compete with receptor blocking neutralizing antibodies, a mechanism thought to be important for evading a polyclonal antibody response . in addition, an optimal receptor binding affinity is thought to exist in a given environment. 
as such, adaptation in a new species, changes in tissue tropism, and differences in receptor expression levels can all lead to changes in receptor binding affinity , , . recent cryoem analysis has shown that the receptor-binding sites of hcov-nl , sars-cov, mers-cov, and by inference hcov- e, are inaccessible in some conformations of the prefusion s-protein trimer [ ] [ ] [ ] [ ] [ ] . although the ramifications of this structural arrangement are not yet clear, restricting access to the binding site has been proposed to provide a means of limiting b-cell receptor interactions against the receptor-binding site . how this might work in mechanistic terms is also not clear given the need to bind receptor. however, in a simple model, the inaccessible s-protein conformation(s) would be in equilibrium with a less stable (higher energy) but accessible s-protein conformation(s). the energy difference between these conformations is a barrier to binding that decreases equally the intrinsic free energy of binding of both the viral receptor and the b-cell receptor and relative binding energies may be the key. both soluble hapn and antibody . e can inhibit hcov- e infection in a cell-based assay, an indication that their binding energies (k d of and nm, respectively) are sufficient to efficiently overcome the barrier to binding. however, b-cell receptors bind their antigens relatively weakly prior to affinity maturation and they would be much less able to do so. the dynamics of the interconversion between accessible and inaccessible conformations may also be a factor in the recognition of inaccessible antibody epitopes , , and further work will be required to establish if and how restricting access to the receptor binding site enhances coronavirus fitness.

table footnote: values after ± correspond to the residual standard deviation reported by scrubber . two experiments were performed.

nature communications | doi: . /s - - -x article
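The simple equilibrium model described in this passage can be made concrete with a two-state sketch in which a ligand binds only the accessible conformation. In that case the apparent dissociation constant is the intrinsic one multiplied by (1 + exp(ΔG_conf / RT)), where ΔG_conf is the free energy cost of reaching the accessible state. The Python below is an illustration under that assumption only. The energy gap, concentrations, and intrinsic affinities are hypothetical and are not taken from the study.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal mol^-1 K^-1


def apparent_kd(kd_intrinsic: float, dg_conf_kcal: float, temperature_k: float = 298.15) -> float:
    """Apparent K_D when binding requires the accessible state.

    dg_conf_kcal is G(accessible) - G(inaccessible); positive values mean the
    accessible conformation is the less stable one.
    """
    rt = R_KCAL * temperature_k
    return kd_intrinsic * (1.0 + math.exp(dg_conf_kcal / rt))


def fraction_bound(ligand_conc: float, kd: float) -> float:
    """Equilibrium fraction of sites occupied for simple 1:1 binding."""
    return ligand_conc / (ligand_conc + kd)


# Hypothetical inputs for illustration only.
dg_conf = 2.0            # kcal/mol cost of opening (assumed)
ligand = 1.0e-7          # M, same concentration for both binders (assumed)
kd_tight = 1.0e-9        # M, a high-affinity binder (assumed)
kd_weak = 1.0e-5         # M, a weak binder such as a non-matured receptor (assumed)

# The multiplicative penalty is identical for both binders...
penalty_tight = apparent_kd(kd_tight, dg_conf) / kd_tight
penalty_weak = apparent_kd(kd_weak, dg_conf) / kd_weak

# ...but its consequence for occupancy differs greatly.
bound_tight = fraction_bound(ligand, apparent_kd(kd_tight, dg_conf))
bound_weak = fraction_bound(ligand, apparent_kd(kd_weak, dg_conf))
print(penalty_tight, penalty_weak)
print(bound_tight, bound_weak)
```

In this sketch the same energetic barrier leaves the tight binder largely bound while the weak binder is almost entirely unbound, which is one way to read the suggestion that relative binding energies may be the key.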
the cryoem structures also show that the receptor-binding loops make intra-and inter-subunit contacts in the inaccessible prefusion trimer. this suggests the intriguing possibility that the magnitude of the energy barrier, or the dynamics of the interconversion between accessible and inaccessible conformations, might be modulated by loop variation during viral adaption. immune evasion and cross-species transmission involve viral adaptation and we posit that the use of extended loops for receptor binding represents a strategy employed by hcov- e and the alphacoronaviruses to mediate the process. such loops can tolerate insertions, deletions, and amino acid substitutions relatively free of the energetic penalties associated with the mutation of other protein structural elements. indeed, our analysis of the six rbd classes shows that the receptor-binding loops possess a remarkable ability to both accommodate and accumulate mutational change while maintaining receptor binding. among the six classes, % of the loop residues show change and only % of the receptor interface buried on receptor binding has been conserved. as we have shown, variation in the receptorbinding loops can abrogate neutralizing antibody binding and it will also increase the likelihood of acquiring new receptor interactions by chance. in this way, the selection of viral variants capable of immune evasion and/or cross-species transmission will be facilitated , , [ ] [ ] [ ] . cross-species transmission involves the acquisition of either a conserved (i.e., a similar interaction with a homologous receptor) or a non-conserved receptor interaction (i.e., an interaction with a non-homologous receptor, or an interaction at a new site on a homologous receptor) in the new host. hcov- e binds to a site on hapn that differs from the site where prcov binds to papn (fig. a, b) , and hcov-nl is known to bind the nonhomologous receptor, ace . 
clearly, conserved receptor interactions have not accompanied the evolution of these alphacoronaviruses ( fig. d-g) . in mechanistic terms, receptor-binding loop variability and plasticity would facilitate the acquisition of both conserved and non-conserved receptor interactions. however, compared to conserved receptor interactions, the successful acquisition of non-conserved interactions would be expected to be relatively infrequent and more likely to require viral replication and mutation in the new host to optimize receptor-binding affinity. many coronaviruses have originated in bats , and it is tempting to speculate that viral transmission between bats has facilitated the emergence of non-conserved receptor interactions. bats account for~ % of all mammalian species and they possess a unique ecology/biology that facilitates viral spread between them , . moreover, the barriers to viral replication in a new host are lower among closely related species , . it follows that the viral replication required to optimize non-conserved receptor interactions in the new host would be facilitated by transmission between closely related bat species. by a similar reasoning, the use of conserved receptor interactions requiring little optimization would facilitate large species jumps. several bat coronaviruses showing a high degree of sequence similarity with hcov- e have recently been identified , and an analysis of how they interact with bat apn will inform this discussion. predicting the emergence of new viral threats is an important aspect of public health planning and our work suggests that rna viruses that use loops to bind their receptors should be viewed as a particular risk. rna viruses are best described as populations , and extended loops-inherently capable of accommodating and accumulating mutational change-will enable populations with loop diversity. 
such populations will provide routes to escaping receptor loop-binding neutralizing antibodies, optimizing receptor-binding affinity, and acquiring new receptor interactions, interrelated processes that drive viral evolution and the emergence of new viral threats. protein expression and purification. the soluble ectodomain of hapn (residues - ) was expressed and purified from stably transfected hek s gnt -cells (atcc crl- ) as described previously . the various soluble forms of the hcov- e s-protein were expressed and purified from stably transfected hek s gnt -cells for x-ray crystallography, and from hek t (atcc crl- ) and/or hek f (invitrogen - ) cells for cell-based and biochemical characterization, as described previously . point mutations were generated using the infusion hd site-directed mutagenesis protocol (clontech). in all cases, the target proteins were secreted as n-terminal protein-a fusion proteins with a tobacco etch virus (tev) protease cleavage site following the protein-a tag. harvested media was concentrated -fold and purified by igg affinity chromatography (igg sepharose, ge). the bound proteins were liberated by on-column tev protease cleavage and further purified by anion exchange chromatography (hitrap q hp, ge). protein crystallization. the rbd of the s-protein of hcov- e (residues - ) and the soluble ectodomain of hapn (residues - ) were mixed in a ratio of . : (rbd:hapn) and the complex was purified by superdex (ge) gel filtration chromatography in mm hepes, mm nacl, ph . . the complex was concentrated in gel filtration buffer to mg/ml for crystallization trials. crystals were obtained by the hanging drop method using a : mixture of stock protein and well solution containing % peg , mm gssg, mm gsh, % glycerol, µg/ml endo-β-n-acetylglucosaminidase a and mm mes, ph . at k. crystals were typically harvested after days and flash-frozen with well solution supplemented with . % glycerol as cryoprotectant. 
data collection and structure determination. diffraction data were collected at the canadian light source, saskatoon, saskatchewan (beamline cmcf- id- ) at a wavelength of . Å. data were merged, processed, and scaled using hkl ; % of the data set was used for the calculation of r free . phases were obtained by molecular replacement using the human apn structure as a search model (pdb id: fyq) using phaser in phenix . manual building of the hcov- e rbd was performed using coot . alternate rounds of manual rebuilding and automated refinement using phenix were performed. secondary structural restraints and torsion-angle non-crystallographic symmetry restraints between the three monomers in the asymmetric unit were employed. ramachandran analysis showed that % of the residues are in the most favored region, with % in the additionally allowed region. data collection and refinement statistics are found in table . a stereo image of a portion of the electron density map in the hcov- e-hapn interface is shown in supplementary fig. . figures were generated using the program chimera . buried surface calculations were performed using the pisa server. surface plasmon resonance binding assays. surface plasmon resonance (biacore) assays were performed on cm- dextran chips (ge) covalently coupled to the ligand via amine coupling. the running and injection buffers were matched and consisted of mm nacl, . % tween- , . mg/ml bsa, and mm hepes at ph . . response unit (ru) values were measured as a function of analyte concentration at k. kinetic analysis was performed using the global fitting feature of scrubber (biologic software) assuming a : binding model. for experiments using hapn as a ligand, between and ru were coupled to the cm- dextran chips. for experiments using . e , ru was immobilized. viral inhibition assay.
hcov- e was originally obtained from the american type culture collection (atcc vr- ) and was produced in the human l cell line (atcc ccl ) which was grown in minimum essential medium alpha (mem-α) supplemented with % (v/v) fbs (paa). the l ( × ) cells were seeded on coverslips and grown overnight in mem-α supplemented with % (v/v) fbs. for inhibition assays in the presence of soluble hapn, wild-type hcov- e ( . tcid ) was pre-incubated with the fragment (residues - ) diluted in pbs for one hour at °c before being added to cells for h at °c. for inhibition assays in the presence of the soluble sprotein fragments, the different fragments, diluted in pbs, were added to cells and kept at °c on ice for h. medium was then removed and cells were inoculated with wild-type hcov- e ( tcid ) for h at °c. for both inhibition assays, after the -h incubation period, medium was replaced and cells were incubated at °c with fresh mem-α supplemented with % (v/v) fbs for h before being analyzed by an immunofluorescence assay (ifa). cells on the coverslips were directly fixed with % paraformaldehyde (pfa %) in pbs for min at room temperature and then transferred to pbs. cells were permeabilized in cold methanol (− °c) for min and then washed with pbs for viral antigen detection. the s-protein-specific monoclonal antibody, - h. , raised against hcov- e (igg , produced in our laboratory by standard hybridoma technology), was used in conjunction with an alexafluor- -labeled mouse-specific goat antibody (life technologies a- ), for viral antigen detection . after three washes with pbs, cells were incubated for min with dapi (sigma-aldrich) at µg/ml to stain the nuclear dna. to determine the percentage of l- cells positive for the viral s-protein, fields containing a total of - cells were counted, at a magnification of × using a nikon eclipse e microscope, for each condition tested in three independent experiments. 
green fluorescent cells were counted as s-protein positive and expressed as a percentage of the total number of cells. statistical significance was estimated by the analysis of variance (anova) test and tukey's test post hoc. monoclonal antibodies (igg , produced in our laboratory by standard hybridoma technology) raised against hcov- e ( . e ) or hcov-oc ( . h , negative control) that were found to be s-protein specific were tested in an infectivity neutralization assay. wild-type hcov- e ( . tcid ) was preincubated with the antibodies ( / of hybridoma supernatant) for h at °c before being added to l- cells for h at °c. cells were washed with pbs and incubated at °c with fresh mem-α supplemented with % fbs (v/v) for h before being analyzed by an immunofluorescence assay (ifa). statistical significance was estimated by an anova test, followed by post hoc dunnett (twosided) analysis. comparative sequence analysis of hcov- e viral isolates. the protein sequence of the hcov- e p e isolate rbd (residues - ) was used to perform a search of the non-redundant protein sequence database using blastp. and the residue-specific sequence conservation index was mapped onto the surface of the rbd using the "render by conservation" tool in chimera . percentage identity is mapped using a color scale with blue indicating % identity and red indicating % identity. the protein-coding regions of the eight sequences for which the entire genome were reported (genbank identifier numbers: nc_ . , jx . , jx . , kf . , kf . , kf . , af . , and ku . ) were aligned using muscle. the entire protein-coding region of the viral genome was treated as a continuous amino-acid string ( residues in total). protein residues that were not identical among the eight sequences were counted as a difference and plotted in residue bins. the sequence aak . was chosen as the representative of class i and the loop sequences of abb . , abb . , abb . , abb . , and afr . 
were combined with the non-loop sequences of aak . to generate the rbds of classes (ii-vi), respectively. data availability. coordinates and structure factors for the hcov- e rbd in complex with human apn were deposited in the protein data bank with pdb id: atk. the authors declare that all other data supporting the findings of this study are available within the article and its supplementary information files, or are available from the authors upon request. received: may accepted: october

the work was supported by cihr operating grants to j.m.r. and p.j.t. and a canada research chair to p.j.t. the canadian light source is acknowledged for synchrotron data collection. supplementary information accompanies this paper at doi: . 
/s - - -x. competing interests: the authors declare no competing financial interests.

key: cord- -eeyqh nc authors: zhou, yusen; yang, yang; huang, jingwei; jiang, shibo; du, lanying title: advances in mers-cov vaccines and therapeutics based on the receptor-binding domain date: - - journal: viruses doi: . /v sha: doc_id: cord_uid: eeyqh nc middle east respiratory syndrome (mers) coronavirus (mers-cov) is an infectious virus that was first reported in . the mers-cov genome encodes four major structural proteins, among which the spike (s) protein has a key role in viral infection and pathogenesis. the receptor-binding domain (rbd) of the s protein contains a critical neutralizing domain and is an important target for development of mers vaccines and therapeutics. 
in this review, we describe the relevant features of the mers-cov s-protein rbd, summarize recent advances in the development of mers-cov rbd-based vaccines and therapeutic antibodies, and illustrate potential challenges and strategies to further improve their efficacy.
middle east respiratory syndrome (mers) coronavirus (cov) is an infectious virus that was first reported in june [ ]. mers-cov may infect people of any age, but older age, underlying comorbidity (such as diabetes mellitus, renal disease, respiratory disease, heart disease, and hypertension), and delayed confirmation or late diagnosis are all factors that affect mers disease outcomes and mortality [ ] [ ] [ ] [ ] [ ] [ ]. sex could be a factor in mers epidemiology, as more males seem to be affected than females [ ] [ ] [ ]. mers-cov infection of women during pregnancy has adverse outcomes, with fetal mortality of ~ %; however, only a limited number of pediatric mers-cov infections occur [ ] [ ] [ ] [ ]. by the end of december , , laboratory-confirmed mers infections had been reported globally (in countries), leading to deaths, and a mortality of . %. among these infections, , ( . %) were reported in saudi arabia, with mortality in individuals ( . %) (http://www.emro.who.int/health-topics/mers-cov/mers-outbreaks.html). the largest mers outbreak outside saudi arabia occurred in south korea in , with cases and deaths [ , , ]. the most recent mers cases were reported in in south korea, the united kingdom, and malaysia, in addition to saudi arabia, the united arab emirates, and oman (http://www.who.int/emergencies/mers-cov/en/).
mers-cov is thought to have originated in bats [ ] [ ] [ ] [ ]. mers-like viruses that use the same receptor for cell entry as mers-cov isolated from humans, although at lower efficiency, have been isolated from bats [ ] [ ] [ ]. dromedary camels are potential intermediates for long-term evolution of mers-cov and seasonal zoonotic transfer of virus to humans [ ] [ ] [ ] [ ].
mers-cov s protein has an important role in viral pathogenesis, determining host tropism and entry into host cells [ , , ]. the s protein contains an s subunit at the n terminus and an s subunit at the c terminus. the s subunit is composed of the n-terminal domain (ntd) and rbd [ , , ]. the rbd has a key role in mediating the binding of mers-cov to cells expressing the dipeptidyl peptidase (dpp ) receptor, enabling the virus to enter target cells by fusing with cell membranes through the formation of a fusion core (figure c) [ ] [ ] [ ] [ ]. the s protein requires host cellular proteases for its activity in viral entry; although evidence initially indicated that cellular furin activates s protein, subsequent results have shown no evidence for the involvement of furin during viral entry [ , ]. the dpp receptor varies among different host species, and mers-cov is thought to use multiple pathways to enable rapid adaptation to species-specific variations [ ] [ ] [ ]. in addition to dpp , mers-cov can bind to sialic acid via the s subunit of s protein, or utilize the membrane-associated kda glucose-regulated protein (grp ) to attach to target cells, suggesting that these molecules may also have roles in virion attachment [ , ].
the structures of mers-cov rbd alone and complexed with dpp have been determined (figure ) [ , , ]. the rbd has a fold-rich tertiary structure, which consists of a core and a receptor-binding motif (rbm), with stabilization provided by four disulfide bonds and two glycans [ ]. a number of rbd residues are located at the dpp -binding interface, and they have a critical role in rbd-dpp binding [ , , ]. structural analysis of mers-cov trimeric s protein has identified specific features of the rbd and its complex with dpp . notably, in the prefusion conformation of the s trimer, individual rbds are either buried (lying state) or exposed (standing state), and this flexibility presumably facilitates recognition by dpp [ ]. other structural studies have revealed four s-trimer conformational states, in which each rbd is either tightly packed at the membrane-distal apex or rotated into a receptor-accessible conformation, suggesting fusion initiation through sequential rbd events [ ]. in configurations with one, two, or three rbds rotated out, rbd determinants are exposed at the apex of the rbd-dpp complex, and they are accessible for interaction with dpp (figure ) [ ].
the mers-cov rbd core is colored in blue, the rbm is colored in red, and dpp is colored in green. the rbm residues directly involved in dpp binding are shown as sticks. dpp , dipeptidyl peptidase ; rbd, receptor-binding domain; rbm, receptor-binding motif; s, spike protein.
the function and structure of the s-protein rbd demonstrate that it is an important target for development of vaccines and therapeutic agents against mers-cov. a number of mers vaccines have been developed based on viral rbd, including nanoparticles, virus-like particles (vlps), and recombinant proteins, and their protective efficacy has been evaluated in animal models, including mice with adenovirus (ad )-directed expression of human dpp (ad /hdpp ), hdpp -transgenic (hdpp -tg) mice, and non-human primates (nhps) [ ] [ ] [ ] [ ] [ ] [ ] [ ]. features of these rbd-based vaccines, in terms of functionality, antigenicity, immunogenicity, and protective ability, are shown in table .
a soluble nanoparticle vaccine formed in escherichia coli by the rna-mediated folding of an rbd-ferritin (fr) hybrid elicits robust rbd-specific antibody and cellular immune responses in mice, producing antisera that effectively block the binding of rbd to hdpp in vitro [ ]. the adjuvants alum and the squalene-based mf significantly augment the antibody titers and t-cell responses induced by rbd-fr nanoparticle vaccines engineered with or without an ssg linker [ ]. similarly, a chimeric, spherical vlp (svlp) vaccine expressing mers-cov rbd induces specific antibody and cellular immune responses in mice, preventing pseudotyped mers-cov entry into susceptible cells [ ]. the protective efficacy of these two types of mers vaccine does not yet seem to have been investigated in a viral-challenge animal model.
recombinant vaccines involving rbd subunits have been extensively studied for protection against mers-cov infection in mers-cov-susceptible animal models [ , [ ] [ ] [ ] , ]. a recombinant rbd (rrbd) fragment (residues - ) expressed in insect cells elicits an antibody response and the production of neutralizing antibodies in mice and nhps [ , ]. this fragment gives incomplete protection in mers-cov-challenged nhps, alleviating pneumonia and clinical manifestations and reducing viral load in lung, trachea, and oropharyngeal swabs [ ]. a mers-cov s-protein rbd fragment containing residues - has been identified as a critical neutralizing domain [ ]. an immunization regimen of two doses of a fusion of this fragment and the fc region of human igg (s - -fc), given four weeks apart, induces strong, long-term antibody responses (including production of neutralizing antibodies) in mice [ ]. these responses are significantly greater than those with a single dose or two doses at intervals of one, two, or three weeks [ ].
rrbds with single or multiple mutations corresponding to s-protein sequences of mers-cov strains isolated from humans or camels from to have also been studied [ ]. all these rrbds bind rbd-specific neutralizing monoclonal antibodies (mabs) and dpp , and are highly immunogenic, eliciting the production of s -specific antibodies in mice that cross-neutralize multiple mers pseudoviruses and live mers-cov [ ]. a trimeric rbd-fd protein, formed by fusing a mers-cov rbd fragment (residues - ) to the foldon trimerization motif, binds strongly to dpp , elicits robust and long-term responses with the production of mers-cov s -specific antibodies and neutralizing antibodies in mice, and protects hdpp -tg mice against mers-cov infection [ ].
the protection provided by existing subunit vaccines based on wild-type mers-cov rbd is not complete, with survival rates in hdpp -tg mice after a mers-cov challenge of ~ % for s - -fc and % for rbd-fd [ , ]. however, a variant rbd (t n) vaccine, produced by masking a non-neutralizing epitope at residue with a glycan probe, retains both functionality in binding dpp and antigenicity in binding four potent mers-cov rbd-specific neutralizing mabs (hhs- , m , m , and m ) [ ]. the t n vaccine has significantly greater efficacy than the wild-type rbd vaccine, and it fully protects against a lethal mers-cov challenge in immunized hdpp -tg mice [ ], demonstrating the possibility of developing rbd-based mers-cov vaccines with high efficacy.
rbd-targeting antibodies [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] generally have greater neutralizing activity against mers-cov infection than non-rbd s -based or s -based antibodies [ , , , ]. the prophylactic and therapeutic efficacies of rbd-targeting antibodies have been tested in ad /hdpp mice, hdpp -tg mice, and nhps [ , , [ ] [ ] [ ].
in an earlier review, we described the antiviral mechanisms, in vivo protection, and crystal structures of previously reported mers-cov rbd-specific mabs, including mouse mabs mersmab , e , c , f , and d , and human mabs lca , mers- , mers- , regn , regn , e , f , a , b , b , b -n, c , m d , m , m , m , hms- , and c h [ ]. in this review, we focus on newly reported antibodies targeting mers-cov s-protein rbd, or on newly identified features of existing mabs that were not described previously (table ) [ , [ ] [ ] [ ] [ ].
rbd-targeting human mabs have been extensively reported. most of these mabs can neutralize pseudotyped or live mers-cov in vitro, and some have shown protection against mers-cov infection in animal models in vivo [ , [ ] [ ] [ ] [ ]. the structures of several of these mabs, as antigen-binding fragments (fabs) or single-chain variable fragments (scfvs) complexed with rbd, are known (figure ) [ , [ ] [ ] [ ] [ ]. binding of these mabs to rbd involves two major recognition modes: binding to rbd residues that contact dpp or overlap with its binding site (as is the case for gd- , mca , and cdc -c ), or binding to rbd residues outside of the dpp -binding interface (as seen with mers- ) (table ).
the human mabs mers-gd and mers-gd each recognize distinct regions of the rbd [ ]. these mabs have a synergistic effect in the neutralization of pseudotyped mers-cov in vitro, with a much lower half-maximal inhibitory concentration (ic ) when used in combination than when used separately [ ]. an analysis of crystal structures has indicated that mers-gd binds rbd at the dpp -binding site, and that the neutralization and recognition epitopes almost completely overlap this site, as seen previously for mers-cov rbd-targeting neutralizing mabs, such as m [ , ]. the mers-gd mab protects hdpp -tg mice from mers-cov challenge, both preventively and therapeutically, with significantly lower lung virus titers and rna copy numbers at day post-challenge, and higher survival rates ( % for pre-challenge vaccination and % for post-challenge vaccination) relative to control mice treated with an irrelevant mab [ ].
the human mab mca was isolated from a mers survivor via the construction of a phage-display antibody library from peripheral b cells [ ]. crystal structure analysis indicates that mca binds mers-cov s-protein rbd at residues involved in receptor binding, thus interfering with rbd binding to hdpp (figure a) [ ]. this mab prophylactically and therapeutically inhibits mers-cov replication in common marmosets, resulting in significantly improved outcomes and reduced lung disease, compared with unvaccinated controls, and undetectable virus titers days post-challenge [ ].
a probe-based single-b-cell cloning strategy has been used for the isolation of cdc -c and cdc -c mabs from a patient convalescing from mers, as well as for the isolation of jc - and jc - mabs from nhps immunized with mers-cov full-length s dna and protein [ ]. all these antibodies have neutralizing activities against both pseudotyped and live mers-cov. among them, cdc -c is the most potent against pseudotyped mers-cov strains, with neutralization ic values ranging from . µg/ml to . µg/ml [ ]. crystal-structure analysis of the cdc -c and jc - fab-rbd complexes indicates that both mabs bind rbd in the "out" (exposed) position, with the cdc -c rbd binding overlapping with the dpp -contacting residues (figure b,c) [ ]. in addition, cdc -c prophylactically protects hdpp -tg mice from mers-cov infection, resulting in no detectable viral replication in the lungs three days post-challenge, and no fatalities over days of observation [ ].
the human mab mers- also neutralizes pseudotyped mers-cov and, notably, displays synergistic neutralization in combination with the mers-cov s-protein rbd-targeting mers- and m mabs [ , ], as well as the s-protein ntd-targeting f mab, in each case with dramatic reduction of the ic compared with individual mabs [ ]. structural analysis of a mers- -fab-rbd complex revealed that mers- binds the rbd from outside the dpp -binding interface, rather than competing with dpp (figure d). unlike mers- , which binds rbd regardless of its conformational state within the s trimer, mers- binds rbd in the "standing" position where its epitopes are readily exposed and accessible [ ]. thus, mers- displays unique epitope specificity, and an unusual mechanism of action involving indirect interference with dpp binding through conformational changes, which may explain the observation of synergistic neutralization in combination with other mabs [ ].
single-domain antibody fragments (vhhs), or nanobodies, are the antigen-recognition regions of camelid heavy-chain-only antibodies (hcabs), which do not contain light chains. vhhs are easily expressed with high yield and have intrinsic stability, strong binding affinity, and specificity for target antigens; they have therefore been developed as important therapeutic tools against viral infections, including mers-cov infection [ , , [ ] [ ] [ ] [ ] [ ]. four vhhs (vhh- , vhh- , vhh- , and vhh- ) have been identified from bone marrow cells of dromedary camels immunized with modified vaccinia virus (mva) expressing mers-cov s protein, and challenged with mers-cov [ ]. these vhhs bind mers-cov s protein with low kd values ( . - nm), recognize an epitope at residue d of rbd, and neutralize mers-cov (prnt , . - . µg/ml) [ ].
these four monomeric vhhs have each been fused with a c-terminal human igg tag to generate four hcabs (hcab- , hcab- , hcab- , and hcab- ), with a higher binding affinity and a longer half-life than the free vhhs [ ]. studies of protective efficacy show that hdpp -tg mice (k ) injected with monomeric vhh- ( or µg per mouse) lose weight and die within seven days post-infection, possibly because of the short half-life of the vhh. however, when the mice are injected with hcab- ( µg per mouse), which has an extended half-life (~ . days), protection against mers-cov is complete, with no viral titers or pathological changes in the lungs of virally challenged mice [ ].
by immunizing llamas with a recombinant rbd fragment (residues - ) fused to a c-terminal human igg fc tag (s - -fc), we constructed a vhh library, and we used it to generate a monomeric vhh, nbms , and a human fc-fused vhh, nbms -fc [ ]. both vhhs can be expressed in a yeast expression system to high purity, and bind rbd with high affinity, recognizing a conformational epitope (residue ) at the rbd-dpp interface, and blocking the binding of rbd to dpp . these vhhs, particularly nbms -fc, potently cross-neutralize pseudotyped mers-cov strains isolated from different countries, hosts, and time periods [ ]. importantly, nbms -fc has a significantly longer serum half-life than nbms , and a single-dose treatment of hdpp -tg mice with this agent completely protects them against lethal mers-cov challenge [ ]. these single-domain vhhs demonstrate the feasibility of developing cost-effective, potent, and broad-spectrum therapeutic antibodies against mers-cov infection.
compared with vaccines based on mers-cov full-length s protein, which have the potential to attenuate neutralizing activity or enhance immune pathology, vaccines developed from mers-cov s-protein rbd are safer, and they do not cause immunological toxicity or eosinophilic immune enhancement [ , , , ].
moreover, rbd-based therapeutic antibodies are generally more potent than non-rbd s -based or s -based antibodies [ , , ]. hence, rbd-based vaccines and therapeutic antibodies have the potential for further development as effective tools to prevent and treat mers-cov infection.
despite their acknowledged advantages, there are some issues associated with rbd-based interventions that need to be addressed. for example, rbd is under strong positive selection pressure, and mutations occur in the rbd-dpp binding interface that might reduce the efficacy of these treatments [ , [ ] [ ] [ ]. one possible way to avoid this effect, and to delay the emergence of escape mutants, is to combine rbd-targeting therapeutics with those targeting other regions of the s protein, or to combine antibodies recognizing distinct epitopes within the rbd [ , ]. such combinatorial strategies could also dramatically reduce antibody neutralization doses, providing feasible means to combat the continual threat of mers-cov.
some recent advances have been made in the structure-guided design of anti-mers-cov interventions. structurally designed inhibitors of the cl protease have demonstrated potency against mers-cov [ ]. also, a structurally designed s-protein trimer in the optimal prefusion conformation has been shown to elicit production of high titers of anti-mers-cov neutralizing antibodies [ ]. indeed, based on previous studies on the structural design of mers-cov rbd, non-neutralizing epitopes in the rbd can be masked to refocus the immunogenicity of the rbd on the neutralizing epitopes, and thus to enhance its ability to confer immune protection [ ]. results from these structure-based studies will help to inform the design of innovative rbd-based anti-mers-cov vaccines and therapeutics with improved efficacy.
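the dose-sparing argument for antibody combinations made above can be illustrated with a small calculation. the following is a minimal sketch, assuming that synergy is scored with a loewe-additivity combination index at the 50% neutralization level; the review does not state how synergy was quantified in the cited studies, so the method, the function names `combination_index` and `classify`, the 10% tolerance band, and all concentrations are assumptions chosen for illustration only.

```python
def combination_index(ic50_a, ic50_b, dose_a, dose_b):
    """Loewe-additivity combination index at the 50% neutralization level.

    ic50_a, ic50_b: IC50 of each antibody when used alone.
    dose_a, dose_b: concentration of each antibody in the mixture that,
                    together, gives 50% neutralization.
    All values must share the same unit (for example ug/ml).
    """
    if ic50_a <= 0 or ic50_b <= 0:
        raise ValueError("single-agent IC50 values must be positive")
    if dose_a < 0 or dose_b < 0:
        raise ValueError("doses in the mixture cannot be negative")
    # Each term is the fraction of the single-agent IC50 used in the mixture.
    return dose_a / ic50_a + dose_b / ic50_b


def classify(ci, tolerance=0.1):
    """Label a combination index; the tolerance band is an arbitrary assumption."""
    if ci < 1.0 - tolerance:
        return "synergy"
    if ci > 1.0 + tolerance:
        return "antagonism"
    return "additive"


if __name__ == "__main__":
    # Hypothetical concentrations, not taken from any cited study.
    ci = combination_index(ic50_a=1.0, ic50_b=2.0, dose_a=0.1, dose_b=0.2)
    print(f"combination index = {ci:.2f} ({classify(ci)})")
```

under this convention, an index well below 1 means that the mixture reaches half-maximal neutralization with less of each antibody than additivity would predict, which is the sense in which a combination can reduce the neutralization dose.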
isolation of a novel coronavirus from a man with pneumonia in saudi arabia
fatality risks for nosocomial outbreaks of middle east respiratory syndrome coronavirus in the middle east and south korea
risks of death and severe disease in patients with middle east respiratory syndrome coronavirus
impact of comorbidity on fatality rate of patients with middle east respiratory syndrome
clinical determinants of the severity of middle east respiratory syndrome (mers): a systematic review and meta-analysis
prevalence of comorbidities in the middle east respiratory syndrome coronavirus (mers-cov): a systematic review and meta-analysis
epidemiological, demographic, and clinical characteristics of cases of middle east respiratory syndrome coronavirus disease from saudi arabia: a descriptive study
sex matters - a preliminary analysis of middle east respiratory syndrome in the republic of korea
middle east respiratory syndrome coronavirus (mers-cov) outbreak in south korea, : epidemiology, characteristics and public health implications
hospital outbreak of middle east respiratory syndrome coronavirus
middle east respiratory syndrome coronavirus (mers-cov) infection during pregnancy: report of two cases & review of the literature
mers-cov infection in a pregnant woman in korea
middle east respiratory syndrome coronavirus in pediatrics: a report of seven cases from saudi arabia
middle east respiratory syndrome coronavirus infection during pregnancy: a report of cases from saudi arabia
an outbreak of middle east respiratory syndrome coronavirus infection in south korea
probable transmission chains of middle east respiratory syndrome coronavirus and the multiple generations of secondary infection in south korea
further evidence for bats as the evolutionary source of middle east respiratory syndrome coronavirus
bat origins of mers-cov supported by bat coronavirus hku usage of human receptor cd
receptor usage and cell entry of bat coronavirus hku provide insight into bat-to-human transmission of mers coronavirus
replication and shedding of mers-cov in jamaican fruit bats (artibeus jamaicensis)
discovery of novel bat coronaviruses in south china that use the same receptor as middle east respiratory syndrome coronavirus
rapid detection of mers coronavirus-like viruses in bats: potential for tracking mers coronavirus transmission and animal origin
receptor usage of a novel bat lineage c betacoronavirus reveals evolution of middle east respiratory syndrome-related coronavirus spike proteins for human dipeptidyl peptidase binding
mers-cov spillover at the camel-human interface
prevalence of middle east respiratory syndrome coronavirus (mers-cov) in dromedary camels in abu dhabi emirate
middle east respiratory syndrome coronavirus in dromedary camels: an outbreak investigation
high prevalence of mers-cov infection in camel workers in saudi arabia
the prevalence of middle east respiratory syndrome coronavirus (mers-cov) antibodies in dromedary camels in israel
serologic evidence for mers-cov infection in dromedary camels
sero-prevalence of middle east respiratory syndrome coronavirus (mers-cov) specific antibodies in dromedary camels in tabuk, saudi arabia
dromedary camels in northern mali have high seropositivity to mers-cov
cross-sectional surveillance of middle east respiratory syndrome coronavirus (mers-cov) in dromedary camels and other mammals in egypt
serological evidence of mers-cov antibodies in dromedary camels (camelus dromedaries) in laikipia county
reported direct and indirect contact with dromedary camels among laboratory-confirmed mers-cov cases
middle east respiratory syndrome coronavirus: risk factors and determinants of primary, household, and nosocomial transmission
unusual presentation of middle east respiratory syndrome coronavirus leading to a large outbreak in riyadh during
outbreaks of middle east respiratory syndrome in two hospitals initiated by a single patient in daejeon
mers-cov outbreak following a single patient exposure in an emergency room in south korea: an epidemiological outbreak study
outbreak of middle east respiratory syndrome at tertiary care hospital
transmission of middle east respiratory syndrome coronavirus infections in healthcare settings
hospital-associated middle east respiratory syndrome coronavirus infections
hospital-associated middle east respiratory syndrome coronavirus infections
family cluster of middle east respiratory syndrome coronavirus infections
healthcare-associated infections: the hallmark of middle east respiratory syndrome coronavirus with review of the literature
clinical features and viral diagnosis of two cases of infection with middle east respiratory syndrome coronavirus: a report of nosocomial transmission
clinical course and outcomes of critically ill patients with middle east respiratory syndrome coronavirus infection
human intestinal tract serves as an alternative infection route for middle east respiratory syndrome coronavirus
persistence of antibodies against middle east respiratory syndrome coronavirus
presence of middle east respiratory syndrome coronavirus antibodies in saudi arabia: a nationwide, cross-sectional, serological study
feasibility of using convalescent plasma immunotherapy for mers-cov infection, saudi arabia
feasibility, safety, clinical, and laboratory effects of convalescent plasma therapy for patients with middle east respiratory syndrome coronavirus infection: a study protocol
challenges of convalescent plasma infusion therapy in middle east respiratory coronavirus infection: a single centre experience
safety and tolerability of a novel, polyclonal human anti-mers coronavirus antibody produced from transchromosomic cattle: a phase randomised, double-blind, single-dose-escalation study
prospects for a mers-cov spike vaccine
current advancements and potential strategies in the development of mers-cov vaccines
is the discovery of the novel human betacoronavirus c emc/ (hcov-emc) the beginning of another sars-like pandemic?
cov spike protein: a key target for antivirals
genomic characterization of a newly discovered coronavirus associated with acute respiratory distress syndrome in humans
engineering a replication-competent, propagation-defective middle east respiratory syndrome coronavirus as a vaccine candidate
reverse genetics with a full-length infectious cdna of the middle east respiratory syndrome coronavirus
the endonucleolytic rna cleavage function of nsp of middle east respiratory syndrome coronavirus promotes the production of infectious virus particles in specific human cell lines
mers coronavirus nsp participates in an efficient propagation through a specific interaction with viral rna
middle east respiratory syndrome coronavirus nonstructural protein is necessary for interferon resistance and viral pathogenesis
structural and biochemical characterization of endoribonuclease nsp encoded by middle east respiratory syndrome coronavirus
structural insights into the middle east respiratory syndrome coronavirus a protein and its dsrna binding mechanism
middle east respiratory coronavirus accessory protein a inhibits pkr-mediated antiviral stress responses
inhibition of stress granule formation by middle east respiratory syndrome coronavirus a accessory protein facilitates viral translation, leading to efficient virus replication
mers-cov b protein interferes with the nf-kappab-dependent innate immune response during infection
proteolytic processing of middle east respiratory syndrome coronavirus spikes expands virus tropism
host cell entry of middle east respiratory syndrome coronavirus after two-step, furin-mediated activation of the spike protein
structure, function, and evolution of coronavirus spike proteins
mers-cov spike protein: targets for vaccines and therapeutics
dipeptidyl peptidase is a functional receptor for the emerging human coronavirus-emc
structure of the fusion core and inhibition of fusion by a heptad repeat peptide derived from the s protein of middle east respiratory syndrome coronavirus
structure-based discovery of middle east respiratory syndrome coronavirus fusion inhibitor
crystal structure of the receptor-binding domain from newly emerged middle east respiratory syndrome coronavirus
middle east respiratory syndrome coronavirus spike protein is not activated directly by cellular furin during viral entry into target cells
receptor variation and susceptibility to middle east respiratory syndrome coronavirus infection
host species restriction of middle east respiratory syndrome coronavirus through its receptor, dipeptidyl peptidase
adaptive evolution of mers-cov to species variation in dpp
middle east respiratory syndrome coronavirus and bat coronavirus hku both can utilize grp for attachment onto host cells
identification of sialic acid-binding function for the middle east respiratory syndrome coronavirus spike glycoprotein
structure of mers-cov spike receptor-binding domain complexed with human receptor dpp
molecular basis of binding between novel human coronavirus mers-cov and its receptor cd
cryo-em structures of mers-cov and sars-cov spike glycoproteins reveal the dynamic receptor binding domains
immunogenicity and structures of a rationally designed prefusion mers-cov spike antigen
tailoring subunit vaccine immunity with adjuvant combinations and delivery routes using the middle east respiratory coronavirus (mers-cov) receptor-binding domain as an antigen
chaperna-mediated assembly of ferritin-based middle east respiratory syndrome-coronavirus nanoparticles
novel chimeric virus-like particles vaccine displaying mers-cov receptor-binding domain induce specific humoral and cellular immune response in mice
recombinant receptor binding domain protein induces partial protective immunity in rhesus macaques against middle east respiratory syndrome coronavirus challenge
identification of an ideal adjuvant for receptor-binding domain-based subunit vaccines against middle east respiratory syndrome coronavirus
introduction of neutralizing immunogenicity index to the rational design of mers coronavirus subunit vaccines
a recombinant receptor-binding domain of mers-cov in trimeric form protects human dipeptidyl peptidase (hdpp ) transgenic mice from mers-cov infection
searching for an ideal vaccine candidate among different mers coronavirus receptor-binding fragments-the importance of immunofocusing in subunit vaccine design
receptor-binding domain-based subunit vaccines against mers-cov
optimization of antigen dose for a receptor-binding domain-based subunit vaccine against mers coronavirus
receptor-binding domain of mers-cov with optimal immunogen dosage and immunization interval protects human transgenic mice from mers-cov infection
engineering a stable cho cell line for the expression of a mers-coronavirus vaccine antigen
recombinant receptor-binding domains of multiple middle east respiratory syndrome coronaviruses (mers-covs) induce cross-neutralizing antibodies against divergent human and camel mers-covs and antibody escape mutants
intranasal vaccination with recombinant receptor-binding domain of mers-cov spike protein induces much stronger local mucosal immune responses than subcutaneous immunization: implication for designing novel mucosal mers vaccines
importance of neutralizing monoclonal antibodies targeting multiple antigenic sites on mers-cov spike to avoid neutralization escape
a humanized neutralizing antibody against mers-cov targeting the receptor-binding domain of the spike protein
prophylactic and postexposure efficacy of a potent human monoclonal antibody against mers coronavirus
pre- and postexposure efficacy of fully human antibodies against spike protein in a novel humanized mouse model of mers-cov infection
junctional and allele-specific residues are critical for mers-cov neutralization by an exceptionally potent germline-like antibody
single-dose treatment with a humanized neutralizing antibody affords full protection of a human transgenic mouse model from lethal middle east respiratory syndrome (mers)-coronavirus infection
a conformation-dependent neutralizing monoclonal antibody specifically targeting receptor-binding domain in middle east respiratory syndrome coronavirus spike protein
exceptionally potent neutralization of middle east respiratory syndrome coronavirus by human monoclonal antibodies
middle east respiratory syndrome: current status and future prospects for vaccine development
evaluation of candidate vaccine approaches for mers-cov
a novel human mab (mers-gd ) provides prophylactic and postexposure efficacy in mers-cov susceptible mice
ultrapotent human neutralizing antibody repertoires against middle east respiratory syndrome coronavirus from a recovered patient
human neutralizing monoclonal antibody inhibition of middle east respiratory syndrome coronavirus replication in the common marmoset
structural definition of a unique neutralization epitope on the receptor-binding domain of mers-cov spike glycoprotein
chimeric camel/human heavy-chain antibodies protect against mers-cov infection
a novel nanobody targeting middle east respiratory syndrome coronavirus (mers-cov) receptor-binding domain has potent cross-neutralizing activity and protective efficacy against mers-cov
structural basis for the neutralization of mers-cov by a human
monoclonal antibody mers- application of camelid heavy-chain variable domains (vhhs) in prevention and treatment of bacterial and viral infections nanobodies(r) as inhaled biotherapeutics for lung diseases generation and characterization of alx- , a potent novel therapeutic nanobody for the treatment of respiratory syncytial virus infection nanobodies as therapeutics: big opportunities for small antibodies nanobodies: natural single-domain antibodies vaccines for the prevention against the threat of mers-cov evolutionary dynamics of mers-cov: potential recombination, positive selection and transmission spread of mutant middle east respiratory syndrome coronavirus with reduced affinity to human cd during the south korean outbreak mutations in the spike protein of middle east respiratory syndrome coronavirus transmitted in korea increase resistance to antibody-mediated neutralization combining a fusion inhibitory peptide targeting the mers-cov s protein hr domain and a neutralizing antibody specific for the s protein receptor-binding domain (rbd) showed potent synergism against pseudotyped mers-cov with or without mutations in rbd structure-guided design of potent and permeable inhibitors of mers coronavirus cl protease that utilize a piperidine moiety as a novel design element this article is an open access article distributed under the terms and conditions of the creative commons attribution (cc by) license acknowledgments: this study was supported by the nsfc grant , and the nih grants r ai , r ai , and r ai . the funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication. the authors declare no competing interests. 
key: cord- - ku jc s authors: kraus, aurora; casadei, elisa; huertas, mar; ye, chunyan; bradfute, steven; boudinot, pierre; levraud, jean-pierre; salinas, irene title: a zebrafish model for covid- recapitulates olfactory and cardiovascular pathophysiologies caused by sars-cov- date: - - journal: biorxiv doi: . / . . . sha: doc_id: cord_uid: ku jc s

the covid- pandemic has prompted the search for animal models that recapitulate the pathophysiology observed in humans infected with sars-cov- and allow rapid and high throughput testing of drugs and vaccines. exposure of larvae to sars-cov- spike (s) receptor binding domain (rbd) recombinant protein was sufficient to elevate larval heart rate, and treatment with captopril, an ace inhibitor, reversed this effect. intranasal administration of sars-cov- s rbd recombinant protein in adult zebrafish caused severe olfactory and mild renal histopathology. zebrafish intranasally treated with sars-cov- s rbd became hyposmic within minutes and completely anosmic by day to a broad spectrum of odorants including bile acids and food. single cell rna-seq of the adult zebrafish olfactory organ indicated widespread loss of expression of olfactory receptors as well as inflammatory responses in sustentacular, endothelial, and myeloid cell clusters. exposure of wildtype zebrafish larvae to sars-cov- in water did not support active viral replication but caused a sustained inhibition of ace expression, triggered type cytokine responses and inhibited type cytokine responses. combined, our results establish adult and larval zebrafish as useful models to investigate pathophysiological effects of sars-cov- and perform pre-clinical drug testing and validation in an inexpensive, high throughput vertebrate model.

species that have been reported as naturally susceptible to sars-cov- include rhesus and cynomolgus macaques (munster et al., ; rockx et al., ), ferret (kim et al., ), cat (shi et al., ), and syrian hamster (chan et al., ).
mice, by contrast, are not spontaneously permissive to the virus, but mice expressing the human ace receptor provide a useful animal model (bao et al., ; jia et al., ; lutz et al., ) . all these mammalian models have unique advantages and disadvantages for the study of immune responses to sars- cov- and other host-pathogen interactions but do not allow rapid, whole organismal, high throughput, and low-cost preclinical testing of drugs and immunotherapies. as model vertebrates, zebrafish are permissive to human viral pathogens including influenza a (gabor et al., ) , herpes simplex virus type (burgos et al., ) , chikungunya virus (palha et al., ) and human noroviruses gi and gii (van dycke et al., ). zebrafish offer many advantages over other animal models due to their high reproductive ability, rapid development, low maintenance costs, and small transparent bodies. importantly, zebrafish olfactory, immune, and cardiovascular physiology share a significant degree of conservation with humans (postlethwait et al., ; saraiva et al.) . genetically, more than % of disease related genes have a zebrafish orthologue (howe et al., ) . the zebrafish innate immune system is already developed in the transparent larval stages and members of all major groups of mammalian cytokines have been identified in the zebrafish genome (gomes and mostowy, ; zou and secombes, ) . academic laboratories and the pharmaceutical industry use zebrafish larvae in preclinical studies for assessing efficacy and toxicity of candidate drugs for several diseases (taylor et al., ) . zebrafish is a proposed model for covid- and has recently been used in one vaccination study (galindo-villegas, ; ventura-fernandes et al., ). this study aims to elucidate the physiopathology of wildtype zebrafish in response to sars- cov- . sars-cov- infection causes a wide litany of symptoms, ranging from asymptomatic to mild or severe disease (menni et al., ) . 
apart from respiratory symptoms, multi-organ pathologies are often reported with heterogeneous symptoms such as olfactory and taste loss, cardiac dysfunction, renal pathologies, neurological damage, muscle and joint pain, gastrointestinal symptoms, clotting disorders and others (tabata et mudd et al., ), and elevated type cytokine levels (lucas et al., ). importantly, this cytokine pattern is in sharp contrast to that found in patients experiencing mild or moderate symptoms, who are able to control exacerbated type and type cytokine responses (lucas et al., ). sars-cov- enters human host cells when the sars-cov- spike (s) protein receptor binding domain (rbd) binds to angiotensin-converting enzyme (ace ) on a permissive host cell; a serine protease, such as tmprss , then cleaves the spike protein at the s /s site to facilitate fusion of the virion with the host cell membrane (hoffmann et al., a, b). ace is expressed in many different cell types across many organs in the human body including lung, olfactory sustentacular cells, enterocytes, and endothelial cells (albini et ace , the use of ace inhibitors is being considered as a therapeutic intervention in covid- patients (lopes et al., ). importantly, drugs currently used to treat covid- can be pro-arrhythmic, and therefore there is a need to incorporate cardiovascular damage into the list of targets of therapeutic interventions in covid- and for models that replicate human cardiac physiology (kochi et al., ). a hallmark of sars-cov- infection is acute loss of smell (cooper et al., ). viral-induced anosmia is not unique to sars-cov- infections, since viruses such as rhinoviruses, influenza, parainfluenza and coronaviruses are known to be the main cause of olfactory deficits in humans (suzuki et al., ; imam et al., ).
in mice and humans, ace expression is detected in sustentacular cells, olfactory stem cells known as horizontal and globose basal cells in the olfactory epithelium, and vascular cells (pericytes) in the olfactory bulb (brann et al., the present study reports for the first time that zebrafish larvae exposed to sars-cov- appear to mount innate immune responses that resemble cytokine responses of mild covid- patients. recombinant sars-cov- s rbd is sufficient to cause olfactory, renal and cardiovascular pathologies in larvae and adult zebrafish. we also identify potential mechanisms of sars-cov- induced anosmia by scrna-seq. our findings support the use of zebrafish as a novel vertebrate model to elucidate sars- cov- pathophysiology and to screen drugs and other therapies targeting covid- . results phylogenetic analyses of ace molecules in vertebrates comparative analysis of ace molecules in vertebrates indicated that ace molecules are well conserved in vertebrates with a %- % similarity and . %- % identity between zebrafish ace and human ace , respectively (table s ). examination of ace amino acid motifs in the region involved in binding sars-cov- s protein revealed zebrafish ace has %/ % sequence similarity with the corresponding human ace region compared to %/ % in macaques ace or %/ % in ferret ace ( systemic injection of recombinant sars-cov- protein into adult zebrafish has been shown to induce some toxicity (ventura-fernandes et al., ). in order to determine whether recombinant sars-cov- s rbd protein causes inflammatory responses in zebrafish larvae, we exposed dpf larvae to sars-cov- s rbd recombinant protein for hours (h) and measured cytokine responses by qpcr. as shown in figure a , h immersion with sars-cov- s rbd protein induced a significant downregulation in ifnphi expression and significant increase in expression of ccl a. , a pro-inflammatory chemokine. no changes in ace , tnfα, il b and il a/f expression were observed ( figure a ). 
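qpcr comparisons of this kind are commonly summarized as a fold change in expression relative to vehicle-treated controls. the source does not state which quantification method was used, so the following is only a minimal sketch assuming the widely used 2^-ΔΔct approach with a single reference gene and roughly 100% amplification efficiency; the function name and all ct values are hypothetical and are not data from the study.

```python
def fold_change(ct_target_treated: float, ct_ref_treated: float,
                ct_target_control: float, ct_ref_control: float) -> float:
    """Relative expression of a target gene by the 2^-ddCt method (assumed here).

    Each argument is a mean threshold cycle (Ct). A value above 1 means higher
    expression in treated larvae than in controls; below 1 means lower.
    """
    # Normalize the target gene to the reference gene within each group.
    delta_ct_treated = ct_target_treated - ct_ref_treated
    delta_ct_control = ct_target_control - ct_ref_control
    # Compare the normalized values between groups.
    delta_delta_ct = delta_ct_treated - delta_ct_control
    return 2.0 ** (-delta_delta_ct)


# Hypothetical Ct values, for illustration only.
upregulated = fold_change(24.0, 18.0, 26.0, 18.0)    # target crosses threshold 2 cycles earlier
downregulated = fold_change(27.0, 18.0, 26.0, 18.0)  # target crosses threshold 1 cycle later
print(upregulated, downregulated)
```

a lower ct means more starting template, which is why the sign of ΔΔct is inverted in the exponent.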
these results indicate that rapid immune responses occur in zebrafish larvae exposed to sars-cov- s rbd. we next evaluated the effects of sars-cov- s rbd on zebrafish larval heart function to validate zebrafish larvae as a model for covid- cardiac manifestations. we immersed - and -days post fertilization (dpf) zebrafish larvae in sars-cov- s rbd, or in vehicle, and measured heart rate after h. as shown in figure b, dpf and dpf zebrafish treated with sars-cov- s rbd had significantly higher heart rates compared to vehicle-treated controls. as a positive control for the recombinant protein, we used animals treated with the same dose of recombinant infectious hematopoietic necrosis virus (ihnv) glycoprotein (r-ihnvg); ihnv is a rhabdovirus known to cause severe endothelial damage in zebrafish (ludwig et al., ). r-ihnvg caused a severe decrease in larval zebrafish heart rate compared to control-treated animals (figure b-c). to determine if the increased heart rate induced by sars-cov- s rbd was dependent on ace binding, we co-incubated dpf larvae with captopril, which reversed sars-cov- s rbd-induced heart dysfunction. captopril had no effect on r-ihnvg-induced bradycardia (figure b). ventricular trace analyses showed marked differences in rhythm patterns in each treatment group (figure d). importantly, the captopril and sars-cov- s rbd treated animals, despite having heart rates similar to those of the vehicle-treated controls, displayed a unique ventricular trace pattern, warranting future studies regarding the potential cardioprotective role of captopril in combined, these results indicate that zebrafish exposed to sars-cov- s rbd protein experience tachycardia and suggest that zebrafish larvae constitute a valuable pre-clinical model to test the effects of drugs for covid- on cardiac activity in vivo. zebrafish anosmia is one of the earliest manifestations of sars-cov- infection in humans (cooper et al., ).
we have previously shown that ihnv glycoprotein is sufficient to induce rapid nasal immune responses as well as neuronal activation in teleost fish (sepahi et (figure d). loss of the epithelial mosaic structure characteristic of the teleost olfactory epithelium was observed on days , and post-treatment (figure e). by day , loss of entire apical lamellar areas due to severe necrosis was observed in the olfactory lamellae of all treated animals compared to controls (figure f). significant loss of olfactory cilia was recorded in all animals treated with sars-cov- s rbd at all time points (figure h). these results indicate that sars-cov- s rbd is sufficient to cause inflammation, edema, hemorrhages, ciliary loss, and necrosis in the olfactory organ of zebrafish. hence, olfactory damage can be caused by indirect mechanisms and in the absence of active sars-cov- replication in this tissue. toxic effects of intranasal sars-cov- s rbd delivery were also evaluated in distant tissues such as the kidney, a target organ of sars-cov- . acute kidney injury (aki) incidence varies from . % to % in covid- patients (su et al., ). renal damage, especially aki, is also common in patients with ras dysfunction such as diabetic patients who suffer from hypertension (ribeiro-oliveira et al., ; advani, ). a recent study in zebrafish injected with the n-terminal part of sars-cov- s protein reported inflammation and damage in several tissues of adult zebrafish including kidney and d post-injection (ventura-fernandes et al., ). in the present study, histological examination of the head-kidney of zebrafish that received sars-cov- s rbd intranasally revealed renal tubule pathology characteristic of aki h post-treatment (figure s ). pathology was not as severe at later time points, but vacuolation of the renal tubule epithelium was still visible days post-treatment (figure s ). we did not observe signs of glomerulopathology in treated animals compared to controls.
together, these results indicate that intranasal delivery of sars-cov- s rbd is sufficient to cause nephropathy in adult zebrafish but that pathology is not as severe as when the protein is delivered by injection.

intranasal delivery of sars-cov- s rbd causes anosmia in adult zebrafish

adult zebrafish exposed to sars-cov- s rbd had a significant reduction of olfactory responses to food extracts of ~ % of preexposure olfaction within minutes as measured by electro-olfactogram (eog), indicating an immediate effect of the protein on olfactory function (fig. a). the reduction of olfaction was sustained for at least one hour of recording, but the olfactory organ remained semi-functional. zebrafish treated with pbs did not lose olfaction at any time point. to further quantify the degree of olfactory reduction due to sars-cov- s rbd, we took advantage of the two easily accessible and isolated olfactory chambers present in zebrafish. we exposed one naris to the sars-cov- s rbd protein and the other naris, from the same animal, to pbs and waited h or d before measuring olfaction by eog. at h we observed a - % reduction in food and bile olfactory responses between the treated and untreated naris and a complete loss of olfactory function to both odorants d post-treatment (fig. b). the reduction of olfactory sensitivity for food extract was smaller than that found for bile, probably due to the lower number of osns involved in bile acid detection compared to detection of amino acids found in food (hansen et al., ). our results indicate that sars-cov- s rbd-induced anosmia is not specific for a subset of osns, since both food extract and bile olfactory signals were suppressed in sars-cov- s rbd treated zebrafish. this fact, together with the d time to develop complete anosmia and disrupt olfactory epithelial structure, supports the hypothesis that sars-cov- s rbd damage may occur first on sustentacular cells, with subsequent impacts on osn viability and function.
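the between-naris comparison above amounts to expressing the eog response of the treated naris as a fraction of the response of the pbs-exposed naris in the same animal. a minimal sketch of that arithmetic follows, assuming peak eog amplitudes (in mv) as input; the function names and the amplitude values are hypothetical illustrations and are not taken from the study.

```python
from statistics import mean


def percent_reduction(control_mv: float, treated_mv: float) -> float:
    """Percent loss of EOG amplitude in the treated naris relative to the
    PBS-exposed (control) naris of the same animal."""
    if control_mv <= 0:
        raise ValueError("control amplitude must be positive")
    return (1.0 - treated_mv / control_mv) * 100.0


def mean_percent_reduction(pairs):
    """Average the within-animal reductions over (control_mv, treated_mv) pairs."""
    return mean(percent_reduction(control, treated) for control, treated in pairs)


# Hypothetical paired amplitudes (control naris, treated naris), one pair per fish.
example_pairs = [(2.0, 0.5), (4.0, 1.0), (1.0, 0.0)]
for control, treated in example_pairs:
    print(f"{percent_reduction(control, treated):.1f}% reduction")
print(f"mean: {mean_percent_reduction(example_pairs):.1f}%")
```

pairing the two nares of one animal means each fish serves as its own control, so differences in baseline sensitivity between individuals cancel out of the ratio. a treated amplitude of zero corresponds to the complete anosmia described in the text.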
single-cell analysis of the zebrafish olfactory organ

to understand the impact of sars-cov- s rbd on zebrafish oo, we performed single cell clusters, endothelial cell (ec) clusters and leucocyte (lymphoid and myeloid) clusters (figure a-b). of the ncs, neuron and neuron corresponded to mature osns. neuron expressed markers of ciliated osns (ompa and ompb) in addition to several olfactory receptor (or) genes (buiakova et al., ). neuron , on the other hand, expressed markers of microvillus osns (trpc b, s z, and gnao) and many vomeronasal receptors (vr) such as v rh as well as or gene (figure s ) (kraemer et al., neuron and neuron expressed cell cycle and early neuronal progenitor markers as well as tmprss b and tmprss a; however, the majority of cluster-identifying genes in these two clusters are undescribed (figure s ). scs are supporting cells that exist in the neuroepithelium around osns, and in humans they express ace (bryche et al., ). scs in the olfactory epithelium can directly arise from horizontal basal cells (hbcs) (yu and wu, ). we found clusters of scs in our datasets. subpopulation closely related to the sustentacular cluster (figure s ). we identified three clusters of ecs that all express tmprss and tmprss . while we did not detect ace expression in any cell clusters, we detected ace mrna in the adult zebrafish olfactory organ, and at low levels in the olfactory bulb (figure s ). ace expression levels have previously been shown to be low in neuronal tissues and therefore may be hard to detect by scrna-seq (song et al., ). endothelial and clusters expressed the endothelial markers sox and tmp a (yao et al., ). all three clusters broadly expressed genes associated with tight junctions (tjp , jupa, ppl, cldne, and cgnl ) as well as many keratin genes (figure s ). interestingly, endothelial cluster also expressed the calcium channel trpv and a slew of non-annotated genes.
there are copious amounts of immune cells in the teleost olfactory organ ( intranasal delivery of sars-cov- s rbd induces inflammatory responses and widespread loss of olfactory receptor expression in adult zebrafish olfactory organ the cellular landscape of the zebrafish olfactory epithelium was affected by sars-cov- s rbd treatment and time ( figure a -d). this was especially evident in the proportions of neuronal cell types d post-treatment when the proportion of mature, omp + ciliated osn was much lower compared to controls and the h treated group. in contrast, neuronal progenitors expressing cell cycle markers (aubk, ecrg , and mki ) and neuronal differentiation and plasticity markers (neurod , neurod , gap , sox , and sox ) were expanded d post-treatment ( figure b -c). further, we detected a noticeable decrease in the proportion of cells belonging to the lymphocyte cluster, a cluster that expressed markers of treg cells (foxp b) h post intranasal delivery of recombinant sars-cov- s rbd but this change was not noticeable at day ( figure c ). at d, we observed a third lymphocyte cluster, not detected at h, highly expressing the tcr subunit zap as well as plac onzin related protein (ponzr ), a molecule that has immunoregulatory roles during th type immune responses in mammals ( figure s in olfactory neuronal clusters were enriched in processes such as neuron differentiation, sensory system development, and sensory organ morphogenesis, whereas downregulated genes belonged to sensory perception of smell, detection of chemical stimulus and gpcr signaling pathway ( figure f ). functional enrichment analyses in metascape showed that the top non-redundant enriched clusters in both upregulated and downregulated genes in zebrafish osns h post- treatment was sensory perception of smell ( figure g ). 
the same was true days post-treatment, but processes such as regeneration, neuron development, neuron fate commitment and the p signaling pathway were also enriched within the upregulated genes (figure h). combined, these results suggest that the presence of sars-cov- s rbd in the olfactory organ instigates harmful effects on osns within hours and that the magnitude of the osn damage increases by days post-treatment. further, these analyses indicate that neuronal regeneration and differentiation processes were initiated by day in order to begin repair of olfactory damage. our study allowed us to dissect how each cell type in the zebrafish olfactory organ responds to sars-cov- s rbd. our results indicated unique responses by sc clusters and ec clusters to treatment (figures and ). at h, we detected increased expression of apoeb, of the transcription factors foxq a and id b, the transcriptional regulator nfil - , two tumor necrosis factor receptor superfamily members (tnfrsf b and tnfrsf a) as well as tcima (transcriptional and immune response regulator), whose mammalian ortholog pcim regulates immune responses as well as endothelial cell activation and expression of inflammatory genes (figure a) (kim et al., ). further, at h we observed downregulation of the pro-inflammatory cytokine il af/ as well as glutathione peroxidase gpx b, the transcription factor notch b, basal cell adhesion molecule bcam, guanine nucleotide-binding protein subunit gamma gng , and the calcium binding s z in sc and ec from sars-cov- s rbd treated olfactory organs relative to vehicle-treated controls. at d post-treatment, we observed significantly increased expression of the gene that encodes brain natriuretic peptide (nppc), a vasodilating hormone, the pro-inflammatory chemokine ccl a. , the m macrophage marker arg , the transcription factors foxq a and sox a, tubulin beta tubb , and the epithelial mitogen epgn, among others (figure b). downregulated genes at d post-treatment included hsp .
, apoeb, the osteoblast specific factor b postb, the desmosomal component periplakin (ppl), the vasoconstricting endothelin (edn ), the heparin binding molecule latexin (ltx) involved in pain and inflammation, and cd b, a part of the mhc-ii complex (figure b). combined, these data indicated immune regulatory responses in sc and ec clusters early after sars-cov- s rbd treatment, followed by transcriptional changes with potential vasoactive effects by day . go and enrichment set analyses indicated that sc and ec clusters initially undergo transcriptional changes enriched in metabolic responses, response to stress, and cell differentiation (figure c). later on, at day , sc and ec responses were enriched in genes involved not only in the stress response but also in immune responses and responses to wounding (figure d). similar results were identified using metascape, which showed that the inflammatory response to wounding was moderately enriched in the downregulated genes at h, whereas by day , response to wounding became the top enriched set among the upregulated genes (figure e-f).

exposure of wildtype zebrafish larvae to sars-cov- does not support viral replication

zebrafish larvae have been used as models to investigate several human viruses. infecting zebrafish larvae in a bsl- laboratory by immersion in contaminated water is comparable to infecting a cell line. we first checked the stability of sars-cov- in zebrafish water over time, in the absence of any animals. we found that sars-cov- viral loads in the water remained stable throughout the experiment (figure a-b). we exposed wildtype ab zebrafish larvae to live sars-cov- and examined viral mrna abundance over time to determine if zebrafish larvae can support viral replication. we detected no increases in the viral n copy numbers over time and a steady decline in e gene copy numbers both in water from wells containing larvae and virus and in the larval tissue (figure c-f).
these results indicate that wild-type zebrafish larvae cannot support efficient sars-cov- replication as suggested by the in silico comparative sequence analyses of the zebrafish ace molecule. exposure of zebrafish larvae to sars-cov- decreases ace expression and triggers pro- inflammatory cytokine responses in order to determine whether exposure of zebrafish larvae with live sars-cov- causes changes in ai, we measured ace mrna levels in control and sars-cov- exposed larvae over time. ace expression was significantly downregulated as early as h post-infection. ace expression inhibition was sustained over the time course of the experiment with the greatest decrease occurring days post-infection (dpi) ( figure a ). we next evaluated changes in expression of cytokine and chemokine genes to establish whether zebrafish mount inflammatory responses that resemble the patterns of mild or severe sars-cov- infection. il β expression was significantly upregulated at h, dpi ( - fold) and dpi ( fold) and significantly downregulated at dpi ( figure b ). we detected a significant increase in tnfa expression in sars-cov- exposed larvae dpi ( figure c ). ifnphi and ifnphi are the two main type i ifn genes involved in larval zebrafish antiviral responses (levraud et al., ) . we detected a significant up-regulation of ifnphi at and dpi, whereas expression was inhibited at dpi. interestingly, ifnphi expression followed a very different pattern compared to ifnphi , which was significantly downregulated dpi but significantly upregulated at dpi ( figure d -e). mxa expression was significantly downregulated at all time points ( figure f ). il af/ expression was significantly elevated , and dpi ( figure g ). expression levels of il , a member of the il family, were downregulated hpi and dpi ( figure h ) whereas the type ii cytokine il /il b was downregulated at hpi, dpi and dpi ( figure i ). further, a significant increase in the expression of the chemokine ccl a. 
was detected in infected larvae at and dpi compared to controls (figure j). a moderate increase in ccl a. expression was observed at dpi followed by a strong down-regulation ( fold) at dpi (figure k). taken together, these data indicate that exposure to sars-cov- induces a significant antiviral and pro-inflammatory immune response in wildtype zebrafish larvae. this response involved type i ifn, tnfa, il b, il and ccl , reminiscent of covid- patients with mild disease.

the current covid- pandemic has propelled the investigation of sars-cov- and the development of animal models that help identify therapeutic interventions and vaccines for covid- . thus far, all animal models reported are mammals, and therefore breeding, genetic manipulation, and animal housing in bsl- laboratories make these models costly and not readily available in large numbers. zebrafish can overcome many of the limitations of mammalian models thanks to their transparent bodies, short life-span, low maintenance costs and production of large numbers of embryos. we therefore performed the simplest infection procedure, where sars-cov- was added to the water of zebrafish larvae. in this manner, bsl- trained personnel with no experience in zebrafish microinjection can readily expose larvae to sars-cov- without the need for animal protocols, in a fashion similar to in vitro cell culture infections. exposure of wildtype zebrafish larvae to sars-cov- in the water did not, however, result in any detectable viral replication. downregulated in response to sars-cov- exposure. combined, these data suggest that sars-cov- induces some type i ifn responses in zebrafish larvae while inhibiting others. future studies are clearly needed to ascertain the role of teleost type i ifn in the anti-sars-cov- immune response. s protein is a structural protein of sars-cov- and therefore the target of several vaccine trials.
therefore, we exposed zebrafish larvae to sars-cov- s rbd protein and investigated transcriptional and physiological responses. rapid changes in gene expression were detected in treated larvae, including up-regulation of the chemokine ccl a. and the down-regulation of ifnphi . the ccl /ccl axis appears to be critical in teleost antiviral innate responses, as previous studies have shown very rapid responses in larvae exposed to the rhabdovirus svcv (sepahi et al., ) . this change was also detected in the larvae that were exposed to the live sars-cov- virus in the present study. we further detected a significant down-regulation of type i ifn ifnphi gene in larvae exposed to sars-cov- s rbd protein. examination of ace transcriptional changes in zebrafish larvae exposed to sars-cov- revealed a consistent down-regulation in expression throughout the course of infection. interestingly, we did not observe any changes in ace expression after h immersion with sars- cov- s rbd protein. recently, enterocytes were found to be the main cell type expressing ace in dpf-old zebrafish larvae (postlethwait et al., ); and therefore it is possible that the down-regulation in ace expression observed in our experiments was the result of enterocyte responses to sars-cov- . however, an olfactory epithelial cell cluster was not identified in this dataset, probably because these cells constitute too small a fraction of the cells of an entire larva. importantly, we exposed larvae to the virus at dpf, when the olfactory pit is already sampling the surrounding water, while the gut fully opens only at dpf. thus, changes in ace expression levels in the olfactory pit of the zebrafish larvae cannot be ruled out at this point. previous work has shown that ace knockdown in mice protects from sars-cov infection (kuba et al., ) . thus, down-regulation of zebrafish ace expression may have protected larvae from sars-cov- infection in our experiments. 
our data agree with studies in mouse lungs, where suppression of ace gene expression was consistently observed following sars-cov- infection (chen et al., ). interestingly, changes in ace levels can occur in response to viruses that do not require ace for host entry (chen et al., ). thus, although further studies are warranted, our data suggest that ace is involved in antiviral responses to sars-cov- in zebrafish. we took advantage of the zebrafish model, which allows for easy live imaging of heart beats in transparent larvae. we detected in vivo cardiac responses, characterized by tachycardia, in larval zebrafish exposed to sars-cov- s rbd protein. cardiac arrhythmia is a common symptom among covid- patients and current research efforts aim to understand how sars-cov- infection impacts cardiovascular function (libby, ). our findings underscore that sars-cov- s rbd is able to cause tachycardia in the zebrafish larval model and that this model can be used for rapid evaluation of drug treatments for covid- . as a proof of concept, we used captopril, an ace inhibitor currently being evaluated in human clinical trials (nct ). captopril treatment ameliorated tachycardia in zebrafish larvae exposed to sars-cov- s rbd recombinant protein. our studies therefore suggest that captopril may be beneficial in covid- patients with cardiac arrhythmia, but clearly further studies are required to fully translate these findings to the clinic and to determine the duration and timing of captopril treatment in covid- patients. a recent report in zebrafish adults indicated that injection of recombinant sars-cov- s n terminal protein caused histopathology of several tissues including the liver, kidney, brain and ovary (ventura-fernandes et al., ). additionally, some animals succumbed to injection with the recombinant protein.
we did not detect any mortalities in either larvae or adults in any of our experiments, perhaps suggesting that the mortalities in that report were due to the injection procedure rather than the protein treatment itself. of note, the dose used in the present study was considerably lower than the dose delivered in the ventura-fernandes study, perhaps explaining the differences in toxicity between the two studies. we observed histological damage following a single intranasal delivery of sars-cov- protein, specifically at the site of delivery, the olfactory organ, whereas more transient and moderate damage was detected in the renal tubules. renal damage may have occurred by direct uptake of sars-cov- s rbd by the kidney once the protein reached the bloodstream following intranasal administration or, alternatively, by activation of ras or inflammatory cascades at the olfactory organ. thus, toxicity of sars-cov- s protein in adult zebrafish may be less severe when delivered intranasally than by injection, and future studies should evaluate whether current vaccine candidates also exert similar effects and whether different administration routes cause the same side-effects. this is particularly important as the intranasal route appears promising for some vaccine candidates study did not determine when zebrafish recover olfactory function following sars-cov- s rbd intranasal treatment, but based on our histopathological observations, the olfactory organ was still severely damaged days after treatment, suggesting that recovery of olfactory function may take several weeks in our model. our findings therefore indicate that, similar to humans, zebrafish suffer from olfactory pathology and loss of smell in response to sars-cov- s rbd protein. thus, olfactory pathophysiology appears to occur even in the absence of viral replication, raising the possibility that nasal vaccines for covid- may also cause transient anosmia in humans.
in conclusion, the present study reports that both adult and larval wild-type zebrafish can be useful models to advance our understanding of covid zebrafish larvae in response to sars-cov- s rbd. animals were exposed to sars-cov- s rbd protein (r-spike, ng/ml) for h at . °c or vehicle. changes in gene expression were measured by rt-qpcr using rps as the house-keeping gene. each data point represents a pool of larvae/well. data are expressed as fold-change compared to vehicle controls using the pfaffl method. (b) average heart beats per minute of dpf (n= ) zebrafish larvae after h of incubation with vehicle, ng/ml r-spike, or ng/ml r-ihnvg. (c) average heart beats per minute in dpf zebrafish larvae (n= ) after h of incubation with vehicle, ng/ml r-spike, or ng/ml r-ihnvg with and without treatment with mm of captopril. heart beats were recorded for min at under a nikon ti microscope and (c-d) mean viral loads quantified as log of sars-cov- n gene and sars-cov- e gene copy numbers in control supernatants from wells with larvae not exposed to virus, and supernatants from wells with larvae that were exposed to pfu of sars-cov- for h, d, d and d. each sample represents the supernatant of one well containing larvae. (e-f) mean viral loads quantified as log of sars-cov- n gene and sars-cov- e gene copy numbers in control larvae and larvae exposed to pfu of sars-cov- for h, d, d and d. each sample point represents one well containing larvae. larval infections began at dpf after mechanical dechorionation at dpf. * p-value< . ; ** p-value< . ; *** p-value< . . results are representative of two independent experiments.
for all experiments, wild type ab zebrafish were obtained from zirc (oregon, usa). for the intranasal delivery of sars-cov- s rbd protein into adult zebrafish, female and male adult zebrafish were obtained from dr. wong's laboratory at the university of nebraska due to lockdown of zirc during the pandemic.
all fish were maintained in a filtered aquarium system at ℃ with a h light and h dark cycle at the university of new mexico aquatics animal facility. all experiments with adults utilized a mix of male and female animals, and the larvae sex is indeterminable. animals were fed ad libitum gemma complete nutrition (skretting). ab larvae were obtained by batch-crossing ab adults allowing for natural fertilization. the morning of fertilization, larvae were collected at n= per petri dish and kept in e medium containing . % methylene blue. in the afternoon, larvae were placed in fresh e medium without methylene blue and non-surviving embryos were removed. larvae were maintained at . ℃ in e medium until dpf when they are slowly changed to system water. sars-cov- the sars-cov- isolate, a cdc isolate from a us patient (usa-wa / ), was obtained from bei resources. the strain was grown at a low moi to minimize generation of noninfectious particles and low passaged virus was used in all experiments described. the virus was propagated on vero e cells and viral loads quantitated by rt-qpcr and by plaque forming assays as we have described previously (bradfute et al, ). intranasal delivery of sars-cov- s rbd recombinant protein to adult zebrafish adult zebrafish were anesthetized for min in . mg/ml tricaine-s (syndel) solution and then moved to an absorbent boat where their gills were still covered with anesthetic solution for administration of solutions to nares. using a microloader tip (eppendorf, ), μl of ng/μl sars-cov- s rbd (kindly provided by dr. f. krammer) was directly pipetted into each naris, while μl of sterile pbs was applied in control fish. after inoculation, animals were recovered in a separate tank supplemented with o before returning to their rearing tank until the end of the experiment. 
euthanasia was performed on ice to ensure rapid death without perturbing the combined supernatants were centrifuged for min at g in supplemented neurobasal medium and cells were counted with a hemocytometer. viability was estimated by trypan blue staining. cells were then strained twice through flowmi μm strainers and loaded onto the chromium controller with a viability of > %. cell libraries were generated according to x genomics protocols at the university of new mexico cancer center genomics core facility and sequenced on an illumina novaseq at the university of colorado genomics and microarray core facility. sequencing depth and statistics of the scrna-seq run are shown in figure s . sras for this project can be found on ncbi under bioproject #prjna . fastqs were run through the cell ranger v . pipeline with default settings using the grcz zebrafish genome. output matrices were loaded to r (v . . ) as a seurat object (package seurat v . . ). first, cells with less than or greater than features, and greater than % mitochondrial features were removed. after counts were normalized using the "lognormalize" method and a scale factor of , variable genes were selected using the 'vst" method. data was scaled, and pca dimensional reduction was run. jackstraw analysis determined the vehicle control to have significant principal components (pcs) and the treated samples to have significant pcs which were used for clustering analysis. sars-cov- s rbd treated samples were integrated with the vehicle treated sample and clustered together using significant pcs and a resolution of . . cluster markers were identified with "findallmarkers" in seurat and exported for cluster identification. differential expression analysis was done with seurat "findmarkers" in default settings for each cluster and exported for gene ontology analysis. gene ontology (go) analysis was done with web-based guis metascape and shinygo v . 
which draw on multiple currently maintained databases (ensembl, entrez, kegg among others). biological process webs were created using biological process output from shinygo v . in graphpad prism. biological process bar graphs were produced by metascape.
electro-olfactogram recordings
adult ab zebrafish were anesthetized and received µl of recombinant sars-cov- s rbd protein ( ng/µl) in pbs or pbs alone. after h or day, zebrafish were anesthetized ( . g ms /l), placed in a v-shape stand and supplied with aerated water containing ms anesthetic ( . g/l). the nasal flap was removed with sterile fine forceps to expose the olfactory rosettes to a continuous tank water source. olfactory responses to zebrafish food extract or goldfish bile were measured by electrical recordings as detailed in sepahi et al., . the food extract was prepared as a filtered solution of l tap water and . g of dry food pellets. food extracts were divided into ml aliquots and kept frozen until the recording day. a . ml mix of bile fluid from adult goldfish was divided into µl aliquots and kept frozen until the day of the recording. before each recording, bile aliquots were diluted : in water from the eog system. there were no significant differences in olfactory responses between males and females, hence responses of both sexes were averaged together. the percentage reduction in olfactory activity was calculated by dividing the amplitude of the olfactory signal at time x by the amplitude of the olfactory signal at time * . the percentage of olfactory signal reduction between control and treated naris was calculated as follows: ((amplitude response to odorant in control naris (mv) - amplitude response to odorant in treated naris (mv)) / amplitude response to odorant in control naris (mv)) * . (sigma) at ℃, stabilized for min at rt and then imaged to record heart-beat activity.
zebrafish larvae heart-beat recordings and analysis
as the agarose solidified, animals were adjusted to the microscope stage (approx.
min) then hearts were recorded using brightfield avi for min at . frames/s at rt. avi images were then opened in imagej with the time series analyzer v plugin. circular rois were drawn in either the atrium or ventricle and average intensity was extracted. the maximum average intensity peaks were identified and counted per s as bpm. data were analyzed by one-way anova with tukey's post hoc. infection of zebrafish fish larvae with sars-cov- ten animals were placed in each well in -well plates containing ml of tank water and transferred to bsl facility the day before infection. gene expression analyses by rt-qpcr whole larvae rna was extracted using trizol. for tissue homogenization bead beater tubes are preloaded with . g . mm dia zirconia beads, . g . mm zirconia beads and µl trizol. samples were loaded into the tubes and bead beat at rpm for s. tubes were then centrifuged at , rpm for min. the homogenate/lysates were transferred to clean . ml microfuge tubes and spun at , rpm for min to pellet debris. supernatants were then processed to extract the total rna using a standard chloroform/phenol extraction protocol. rna was quantified by nanodrop and samples were normalized and µg of rna was used to synthesize cdna using the superscript iii first strand system (thermofisher, ). qpcr was performed using ssoadvanced supermix (biorad, ) and primers listed in table s (supplemental methods). 
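the fold-change values reported for the rt-qpcr data follow the pfaffl relative quantification model, in which the ratio is the target-gene amplification efficiency raised to its control-minus-sample ct difference, divided by the same quantity for the reference gene. a minimal python sketch of that calculation is given here. the function name, argument names and ct values are illustrative assumptions and are not taken from the study; efficiencies are expressed as amplification factors per cycle (2.0 corresponds to perfect doubling).

```python
def pfaffl_ratio(e_target, e_ref,
                 ct_target_control, ct_target_sample,
                 ct_ref_control, ct_ref_sample):
    """Relative expression ratio of a target gene under the Pfaffl model.

    e_target, e_ref: amplification efficiencies as per-cycle factors
    (2.0 means the product doubles every cycle).
    ct_*: threshold cycles for the target and reference (house-keeping)
    genes in the control (vehicle) and treated sample.
    """
    # Delta-Ct is defined as control minus sample for each gene.
    delta_ct_target = ct_target_control - ct_target_sample
    delta_ct_ref = ct_ref_control - ct_ref_sample
    return (e_target ** delta_ct_target) / (e_ref ** delta_ct_ref)


# Hypothetical Ct values for illustration only (not data from the study):
# the target gene crosses threshold 3 cycles earlier in the treated sample,
# while the reference gene is unchanged.
ratio = pfaffl_ratio(2.0, 2.0, 25.0, 22.0, 18.0, 18.0)
print(ratio)  # 8.0
```

when both efficiencies equal 2.0 the expression reduces to the familiar 2^(-ΔΔct) calculation; the pfaffl form matters when primer efficiencies differ between the target and house-keeping genes.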
gene expression changes were quantified using the pfaffl method (pfaffl, ).
[figure residue: gene ontology biological process terms from a figure panel: glycerophospholipid biosynthesis; apoptosis; microtubule polymerization or depolymerization; gpcr signaling, coupled to cyclic nucleotide nd messenger; negative regulation of cell differentiation; iron uptake and transport; circadian regulation of gene expression; sensory perception of smell; developmental process; anatomical structure development; response to wounding; response to stress; immune system response; cell chemotaxis; multicellular organism development]
acute kidney injury: a bona fide complication of diabetes the sars-cov- receptor, ace- , is expressed on many different cell types: implications for ace-inhibitor-and angiotensin ii receptor blocker-based cardiovascular therapies myocardial injury and covid- : possible mechanisms savalan the pathogenicity of sars-cov- in hace transgenic mice the establishment of neuronal properties is controlled by sox and sox imbalanced host response to sars-cov- drives development of covid- macrophage-mediated neuroprotection and neurogenesis in the olfactory epithelium cov- entry genes in the olfactory system suggests mechanisms underlying covid- - associated anosmia massive transient damage of the olfactory epithelium associated with infection of sustentacular cells by sars-cov- in golden syrian hamsters olfactory marker protein (omp) gene deletion causes altered physiological activity of olfactory sensory neurons zebrafish as a new model for herpes simplex virus type infection integrating single-cell transcriptomic data across different conditions, technologies, and species simulation of the clinical and pathological manifestations of coronavirus disease (covid- ) in golden syrian hamster model: implications for disease pathogenesis and transmissibility individual variation of the sars-cov- receptor ace gene expression and regulation acute inflammation regulates neuroregeneration through the nf-κb pathway in olfactory epithelium chronic inflammation directs
an olfactory stem cell functional switch from neuroregeneration to immune defense covid- and the chemical senses: supporting players take center stage keiland broad host range of sars-cov- predicted by comparative and structural analysis of ace in vertebrates smell and taste disorders during covid- outbreak: cross-sectional study on patients an inflammatory cytokine signature predicts covid- severity and survival promoter transgenes direct macrophage-lineage expression in zebrafish expressions and significances of the angiotensin-converting enzyme gene, the receptor of sars-cov- for covid- influenza a virus infection in zebrafish recapitulates mammalian infection and sensitivity to anti-influenza drug treatment covid- : real-time dissemination of scientific information to fight a public health emergency of international concern a mechanistic model and therapeutic interventions for covid- involving a ras-mediated bradykinin storm shinygo: a graphical gene-set enrichment tool for animals and plants epigenetic contribution of high- mobility group a proteins to stem cell properties differential downregulation of ace by the spike proteins of severe acute respiratory syndrome coronavirus and human coronavirus nl the case for modeling human infection in zebrafish covid- and the cardiovascular system: implications for risk assessment, diagnosis, and treatment options impaired type i interferon activity and exacerbated inflammatory responses in severe covid- patients crystal structure of zebrafish interferons i and ii reveals conservation of type i interferon structure in vertebrates correlation between olfactory receptor cell type and function in the channel catfish a multibasic cleavage site in the spike protein of sars-cov- is essential for infection of human lung cells sars-cov- cell entry depends on ace and tmprss and is blocked by a clinically proven protease inhibitor the zebrafish reference genome sequence and its relationship to the human genome 
angiotensin-converting enzyme protects from severe acute lung failure is sars-cov- (covid- ) postviral olfactory dysfunction (pvod) different from other pvod? ace mouse models: a toolbox for cardiovascular and pulmonary research tc (c orf ) is a novel endothelial inflammatory regulator enhancing nf-κb activity infection and rapid transmission of sars-cov- in ferrets cardiac and arrhythmic complications in patients with covid- structural and functional diversification in the teleost s family of calcium-binding proteins intranasal vaccination with a lentiviral vector strongly protects against sars-cov- in mouse and golden hamster preclinical models a crucial role of angiotensin converting enzyme (ace ) in sars coronavirus-induced lung injury arrhythmias and sudden cardiac death in the covid- pandemic sars-cov- productively infects human gut enterocytes genes in zebrafish and humans define an ancient arsenal of antiviral immunity angiopoietin-like increases pulmonary tissue leakiness and damage during influenza pneumonia the heart in covid- : primary target or secondary bystander? 
jacc basic to composition and divergence of coronavirus spike proteins and host ace receptors predict potential intermediate hosts of sars-cov- continuing versus suspending angiotensin-converting enzyme inhibitors and angiotensin receptor blockers : impact on adverse outcomes in hospitalized patients with severe acute respiratory syndrome coronavirus ( sars-cov- ) -the brace corona trial longitudinal analyses reveal immunological misfiring in severe covid- whole-body analysis of a viral infection: vascular endothelium is a primary target of infectious hematopoietic necrosis virus in zebrafish larvae covid- preclinical models: human angiotensin-converting enzyme transgenic mice characterization of the immune barrier in human olfactory mucosa real-time tracking of self- reported symptoms to predict potential covid- targeted immunosuppression distinguishes covid- from influenza in moderate and severe disease respiratory disease in rhesus macaques inoculated with sars-cov- real-time whole-body visualization of chikungunya virus infection and host interferon response in zebrafish cc chemokine receptor expression defines a subset of peripheral blood lymphocytes with mucosal t cell phenotype and th or t- regulatory cytokine profile type i and type iii interferons -induction evasion, and application to combat covid- angiotensin ii induced proteolytic cleavage of myocardial ace is mediated by tace/adam- : a positive feedback mechanism in the activating a reserve neural stem cell population in vitro a new mathematical model for relative quantification in real-time rt- pcr an intestinal cell type in zebrafish is the nexus for the sars-cov- receptor and the renin angiotensin-aldosterone system that contributes to covid- comorbidities the renin-angiotensin system and diabetes: an update. vasc. 
health risk manag comparative pathogenesis of covid- , mers, and sars in a nonhuman primate model innate immune signaling in the olfactory epithelium reduces odorant receptor levels: modeling transient smell loss in covid- patients interplay between sars-cov- and the type i interferon response molecular and neuronal homology between the olfactory systems of zebrafish and mouse. sci. rep tissue microenvironments in the nasal epithelium of rainbow trout ( oncorhynchus mykiss two distinct cd α + cell populations and establish regional immunity olfactory sensory neurons mediate ultrarapid antiviral immune responses in a trka-dependent manner coronavirus placenta-specific limits ifnγ production by cd t cells in vitro and promotes establishment of influenza-specific cd t cells in vivo neuroinvasion of sars-cov- in human and mouse brain alterations in smell or taste in mildly symptomatic outpatients with sars- cov- infection renal histopathological analysis of postmortem findings of patients with covid- in china identification of viruses in patients with postviral olfactory dysfunction clinical characteristics of covid- in people with sars-cov- infection on the diamond princess cruise ship: a retrospective analysis small molecule screening in zebrafish: an in vivo approach to identifying new chemical tools and drug leads proinflammatory cytokines in the olfactory mucosa result in covid- induced anosmia a robust human norovirus replication model in zebrafish larvae zebrafish studies on the vaccine candidate to covid- , the spike protein: production of antibody and adverse reaction a rampage through the body neuronal wiskott-aldrich syndrome protein regulates tgf-β -mediated lung vascular permeability receptor recognition by the novel coronavirus from wuhan: an analysis based on decade-long structural studies of sars coronavirus clinical characteristics of hospitalized patients with coronavirus-infected pneumonia in wuhan, china remdesivir and chloroquine effectively 
inhibit the recently emerged novel coronavirus ( -ncov) in vitro the zebrafish activating immune receptor nitr signals via dap sars and mers: recent insights into emerging coronaviruses sox transcription factors in endothelial differentiation and endothelial-mesenchymal transitions spatiotemporal photolabeling of neutrophil trafficking during inflammation in live zebrafish regeneration and rewiring of rodent olfactory sensory neurons severe acute respiratory syndrome coronavirus infects and damages the mature and immature olfactory sensory neurons of hamsters angiomotin-like protein controls endothelial polarity and junction stability during sprouting angiogenesis metascape provides a biologist-oriented resource for the analysis of systems-level datasets clinical characteristics of covid- patients: a meta-analysis sars-cov- receptor ace is an interferon-stimulated gene in human airway epithelial cells and is detected in specific cell subsets across tissues the function of fish cytokines single-cell rna-seq data analysis on the receptor ace expression reveals the potential risk of different human organs vulnerable to -ncov infection key: cord- - sk wc authors: wu, jianping; mok, chee-keng; chow, vincent tak kwong; yuan, y. adam; tan, yee-joo title: biochemical and structural characterization of the interface mediating interaction between the influenza a virus non-structural protein- and a monoclonal antibody date: - - journal: sci rep doi: . /srep sha: doc_id: cord_uid: sk wc we have previously shown that a non-structural protein (ns )-binding monoclonal antibody, termed as h , can significantly reduce influenza a virus (iav) replication when expressed intracellularly. in this study, we further showed that h binds stronger to the ns of h n than a/puerto rico/ / (h n ) because of an amino acid difference at residue . 
a crystal structure of h fragment antigen-binding (fab) has also been solved and docked onto the ns structure to reveal the contacts between specific residues at the interface of antibody-antigen complex. in one of the models, the predicted molecular contacts between residues in ns and h -fab correlate well with biochemical results. taken together, residues n and t in h n ns act cooperatively to maintain a strong interaction with mab h by forming hydrogen bonds with residues found in the heavy chain of the antibody. interestingly, the pandemic h n - and the majority of seasonal h n circulating in humans since has n in ns , suggesting that mab h could bind to most of the currently circulating seasonal influenza a virus strains. consistent with the involvement of residue t , which is well-conserved, in rna binding, mab h was also found to inhibit the interaction between ns and double-stranded rna. residues - in h n -ns are sufficient for its interaction with mab h . as described previously , the deletion of residues - of ns abolished its interaction with mab h . these residues lie in the helix α (residues - ) of h n -ns (rbd) and are well conserved between h n , h n and h n viruses (fig. a) . this is consistent with the ability of mab h to bind to both avian h n and seasonal iavs . in order to determine if the helix α of h n -ns (rbd) is sufficient for the interaction with mab h , enzyme-linked immunosorbent assay (elisa) was performed by using a synthetic peptide h n -ns - mer corresponding to helix α . as shown in fig. b , mab h bound to h n -ns - mer in a dose dependent manner, indicating that the helix α of ns (rbd) is sufficient for its interaction with mab h . in contrast, there was no binding between mab h and an irrelevant control peptide of similar molecular weight. within helix α , there is only one amino acid difference between h n and h n -pr , namely n in h n -ns and s in h n -pr -ns (fig. a) . 
to determine if residue in ns is involved in its interaction with mab h , recombinant h n -ns (rbd) and h n -pr -ns (rbd) proteins were bacterially expressed and purified for elisa. the elisa readings were similar for h n -ns (rbd) and h n -pr -ns (rbd) when high concentrations of proteins were coated on the plate (fig. c). however, the readings were significantly higher for h n -ns (rbd) at lower protein concentrations, indicating that the single amino acid difference between helix α of h n -ns (rbd) and h n -pr -ns (rbd) affects their interactions with mab h (fig. c). residue in ns is critical for its interaction with mab h . to further define the contribution of residue in h n -ns to the interaction with mab h , two mutant proteins, in which n was mutated to a and s respectively, were generated. comparative elisa showed that mab h bound to ns (rbd)-wild-type (wt) more strongly than to ns (rbd)-n s when different amounts of protein were coated on the plate and analyzed with μg/ml of mab h (fig. a). similarly, when a fixed amount of protein ( μg/ml) was coated on the plate and analyzed with different concentrations of mab h , the binding to ns (rbd)-n s was significantly lower than to ns (rbd)-wt (fig. b). furthermore, when n was substituted with a at residue , the interaction between mab h and ns (rbd) was totally abolished in all the conditions tested (fig. a,b). this result suggests that the difference at residue is the main reason for the stronger binding of mab h to h n -ns (rbd) when compared to h n -pr -ns (rbd) (fig. ). since both n and s are polar amino acids while a has only a small nonpolar side chain that cannot form hydrogen bonds, it is probable that the formation of hydrogen bonds is important for the interaction between mab h and ns . mab h binds differently to h n -pr virus when compared with mutant viruses carrying substitution at residue in ns .
since mab h binds to bacterially-expressed ns protein of h n and h n -pr with different affinities, it is important to investigate whether this holds true for ns expressed in infected cells. thus, recombinant pr (rgpr ) viruses expressing the ns protein containing a single amino acid substitution at residue (rgpr -ns -s a and rgpr -ns -s n) were generated using a reverse genetics system. subsequently, a cells were infected with each virus at a low multiplicity of infection (moi) of . and a plaque assay was used to determine the amount of virus secreted at different time points post-infection. as shown in fig. a, although the viral titer was slightly lower in the case of rgpr -ns -s n infection, the overall growth rates of wt and mutant viruses were similar from to hours post-infection (h.p.i.). this is consistent with a previous report showing that substitution of residue in ns does not affect viral replication in vitro . next, t cells were infected with moi of viruses and cell lysates were collected at and h.p.i. to determine the rate of viral protein synthesis. consistent with the virus growth kinetics, the expression levels of structural proteins np and m were similar for all viruses at both time-points (fig. b). the level of ns , as determined by using a rabbit anti-ns polyclonal antibody, was also comparable for all viruses. however, mab h bound to rgpr -ns -s n virus significantly more strongly than to rgpr -ns -wt virus containing s residue in ns , supporting the above results that n in ns is preferred over s for the interaction with mab h . as expected, mab h did not bind to rgpr -ns -s a virus.
[figure legend fragment] mismatches are shown in white letters. ns (rbd) is composed of α -helices as shown above the sequence. the region corresponding to helix α (residues - ) is boxed. (b) peptide elisa was performed to determine the region of ns sufficient for binding to mab h . wells were coated with serially diluted h n -ns - mer or a negative control peptide and probed with μg/ml mab h . * indicates statistically significant difference of p < . when compared with control peptide. (c) comparative elisa was performed to determine the ability of ns (rbd) of h n -pr and h n to bind to mab h . wells were coated with serially diluted proteins and probed with μg/ml mab h . data shown represent results from three independent experiments, and error bars represent the standard deviation (sd) of experiments carried out in duplicate. * indicates statistically significant difference of p < . when compared with h n -pr -ns (rbd).
to further define the interaction interface between mab h and ns , attempts were made to co-crystallize the complex, but these failed. however, the h -fab alone was found to crystallize in space group p ( ) and the structure was solved by molecular replacement using the structure of bl - (pdbid: q q) as the starting model and refined to the crystallographic r-factor of . % (deposited in the pdb under accession code b m). the refinement statistics are shown in table . previously, we demonstrated that the intracellular expression of h -scfv in mammalian cells reduced the replication of pr virus. since the three dimensional structure of h -fab has been solved, a commercially available lipodin-ab reagent (abbiotec), which is a protein transfection reagent dedicated to the transport of antibodies into living cells, was used to deliver h -fab into a cells. the cells were then subjected to infection so as to determine if h -fab has an impact on viral replication. when either h -fab or a -fab (which is derived from a negative-control antibody binding to the spike glycoprotein of sars coronavirus) was transfected into a cells using lipodin-ab reagent, intracellular accumulation of fab was observed even up to h post transfection (fig. a). the intracellular accumulation of fab was also observed as early as h post transfection (data not shown).
the transfection efficiency was about % and most of the fab molecules were evenly distributed in the cytoplasm, but there was some punctate staining, which could be due to aggregation of fab inside a cells. in contrast, no intracellular accumulation of fab was observed when a -fab or h -fab was added to the cells without lipodin-ab reagent. as shown in fig. b, the delivery of fab did not have any significant effect on cell viability at either or h post transfection. upon successful delivery of fab, its biological function was assessed in influenza a virus infected cells. as mab h binds strongly to rgpr -ns -s n virus (fig. ), a cells were infected with rgpr -ns -s n virus after transfection of h -fab. cell lysates were collected at , and h.p.i. to determine the rate of viral protein synthesis. as shown in fig. c,d, the level of viral m protein at h.p.i. was significantly reduced in h -fab transfected cells when compared to a -fab transfected cells. at h.p.i., the average reduction in normalized m expression in h -fab transfected cells was % when compared to a -fab transfected cells. at h.p.i., a slight reduction in m expression (~ %) was also observed in h -fab transfected cells but it was not statistically significant. this result suggests that the successful delivery of h -fab into living cells could reduce viral replication by affecting certain function(s) of ns in the infected cells. next, computational modelling was used to study the complex between this high resolution h -fab structure and the published structure of h n -ns (rbd) of h n /a/crow/kyoto/t / strain (pdb id: z a). this was conducted by using haddock on water-refined models, including an analysis of energy contributions from van der waals interactions, electrostatic interactions, restraints violation and buried surface area .
as comparative elisa in this and previous studies showed that residues n and t in ns (rbd) are important for the interaction with mab h , they were defined as active residues involved in the binding interaction to generate a series of models of the ns (rbd) and h -fab complex. all the models of ns (rbd) and h -fab complex were found to cluster into groups, in which there were at least two conformations of the ensemble showing backbone root-mean-square deviations at the interface of less than . Å. as additional elisa results showed that residues, namely s , r , and g , in ns are not involved in interaction with mab h ( figure s ), this information was used to distinguish between the models in these clusters. of all energetically best models generated, predicted models were grouped into cluster . the average buried surface area was . ± . Å , and rmsd from the overall lowest-energy structure was . ± . Å. among them, the best predicted model from this cluster showed good agreement with our comparative elisa data, since only residues n and t were predicted to be involved in the interaction, while the side chains of s , r and g were either distal from the interface (s and r ) or inaccessible (g ) to the binding partner of h -fab (fig. ) . by analyzing the polarity of the amino acids and distance between them in this model, it is predicted that residues n and n in the variable domain of heavy chain (vh)-complementarity determining region (cdr ) of h -fab could form hydrogen bonds with the side chain of n in ns (rbd). in addition, the side chain of t in ns (rbd) could form hydrogen bonds with residue r in vh-cdr . in contrast, vh-cdr , vh-cdr and all the cdrs in the variable domain of light chain (vl) are unlikely to be involved in the interaction as they are distal to helix α of ns (rbd). on the other hand, predicted models were grouped into cluster . for these refined structures analyzed, the average buried surface area was . ± . 
Å , and rmsd from the overall lowest-energy structure was . ± . Å. based on the best predicted model from this cluster ( figure s ), it was predicted that t of ns (rbd) could form the hydrogen bond with n of vh-cdr while n of ns (rbd) could form the hydrogen bond with n of vl-cdr . these predictions are in agreement with the results shown in fig. and in our previous publication . however, this model also predicted that r of ns (rbd) could be involved in the interaction with h -fab because it was in close proximity to two residues in vl-cdr . in this model, the distance between r of ns (rbd) and y of vl-cdr was . Å while the distance between r of ns (rbd) and s of vl-cdr was . Å. thus, this model does not agree with the results from comparative elisa which showed that substitution of r of ns (rbd) with k did not affect its interaction with mab h ( figure s ). lastly, another predicted models were grouped into cluster . the average buried surface area was . ± . Å , and rmsd from the overall lowest-energy structure was . ± . Å. based on the best predicted model from this cluster ( figure s ), the contacts between n and t with residues in h -fab were the same as described above for the model from cluster . however, this model also predicted that r of ns (rbd) could be involved in the interaction with h -fab because it was in close proximity to three residues in vl-cdr . in this model, y , s and y of vl-cdr were found to be at . Å, . Å and . Å from r of ns (rbd) respectively. thus, this model does not agree with the results from comparative elisa which showed that substitution of r of ns (rbd) with k did not affect its interaction with mab h ( figure s ). overall, the predicted model from cluster is consistent with our comparative elisa data and suggests that residues n and t are important for the binding between ns (rbd) and h -fab because their side-chains could make hydrogen bonds with residues in the vh-cdr of the fab. 
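the model-selection step described above (accept the cluster whose predicted interface contains the residues that elisa showed to be required, and reject clusters that place an elisa-insensitive residue in contact with the fab) can be written as a simple set filter. the following is only a minimal python sketch, not the authors' workflow: the residue labels, cluster names, cluster contents and the function name are hypothetical placeholders, because the residue numbers are not recoverable from this text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DockingCluster:
    """Best model of one docking cluster (hypothetical summary data)."""
    name: str
    antigen_interface: frozenset  # antigen residues predicted to contact the Fab


def consistent_with_mutagenesis(cluster, required, excluded):
    """True if the interface contains every residue that binding assays show
    to be required and none that the assays show to be dispensable."""
    has_required = required <= cluster.antigen_interface
    has_excluded = bool(excluded & cluster.antigen_interface)
    return has_required and not has_excluded


# Placeholder labels only; the real residue numbers are not given here.
required = frozenset({"N_x", "T_y"})         # substitution weakens or abolishes binding
excluded = frozenset({"S_a", "R_b", "G_c"})  # substitution leaves binding unchanged

clusters = [
    DockingCluster("cluster_A", frozenset({"N_x", "T_y"})),
    DockingCluster("cluster_B", frozenset({"N_x", "T_y", "R_b"})),
    DockingCluster("cluster_C", frozenset({"N_x", "T_y", "R_b"})),
]

accepted = [c.name for c in clusters
            if consistent_with_mutagenesis(c, required, excluded)]
print(accepted)  # ['cluster_A']
```

in practice the interface sets would be derived from inter-residue distances in each model with a chosen cutoff; that cutoff is a further assumption and is left out of the sketch.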
in addition, r of ns (rbd) was distal from the antibody-antigen interface, which is consistent with the results from comparative elisa (figure s ) showing that substitution of r of ns (rbd) with k did not affect its interaction with mab h . in contrast, the predicted models from cluster and cluster do not agree with the results from comparative elisa.

mab h disrupts ns and dsrna interaction. ns (rbd) forms a symmetric six-helical homodimer, which binds to dsrna. the key residues in ns involved in the interaction with dsrna are t , d , d , r , r , r , l , s and t , most of which are positively charged residues and mainly clustered in the middle of helices α /α ′ of the rbd , . to investigate whether mab h hampers dsrna-ns interaction in vitro, an alphascreen assay was carried out. in this experiment, glutathione s-transferase (gst)-tagged h n -ns (rbd) protein, which was produced in e. coli, was incubated with synthetic -nucleotide sirna ( nt-sirna), followed by addition of streptavidin-coated donor beads and anti-gst-conjugated acceptor beads, which recognize biotinylated rna and gst-tagged protein respectively. if the interaction between ns and nt-sirna brings both beads into close proximity, transfer of excitation energy from donor beads to acceptor beads will yield a luminescent signal (fig. a). when ns (rbd) was pre-incubated with different concentrations of mab h followed by addition of nt-sirna, acceptor and donor beads, the luminescent signal decreased at high concentrations of mab h (fig. b). the luminescent signal was reduced by ~ % and ~ % at mab h concentrations of and μ m respectively (fig. b), suggesting that the binding of mab h to ns (rbd) can block the interaction of ns with dsrna. on the other hand, the negative control mab a did not reduce the luminescent signal at any of the concentrations tested. furthermore, when mab h , ns and nt-sirna were mixed simultaneously, μ m of mab h could reduce the signal by about % (fig.
c), which suggests that mab h also directly competes with dsrna for binding to ns .

the ns protein of influenza virus is a multi-functional protein that is involved in key aspects of the virus replication cycle . the ns (rbd) consists of the first amino acids and contains residues critical for dsrna binding . the dimerization of the rbd is a prerequisite for its rna binding activity . since the ns protein is an intracellularly expressed viral protein, it is subject to less selective pressure from the host immune response and thus has a lower mutation rate, so antibodies targeting this protein could be helpful for the development of therapeutic treatments for influenza a infection. indeed, our previous study showed that mab h binds to the highly conserved t residue in ns and reduces viral replication of h n -pr in mammalian cell lines . in this study, comparative elisa showed that helix α of ns (rbd) is sufficient for its interaction with mab h (fig. ). while mab h binds to ns (rbd) of both h n and h n -pr , the binding affinity to the homologous h n viral protein is higher. interestingly, a single amino acid difference at residue in helix α of ns (rbd) of h n and h n -pr was found to be critical for the interaction with mab h (figs and ). by using either purified ns protein or ns expressed in infected cells, our results showed that the interaction of mab h with ns is stronger when residue is n than when it is s, and is abolished when the residue is a. to understand the pattern of polymorphism at residue in ns , sequences were retrieved from the niaid influenza research database (http://www.fludb.org) and analyzed (table ). sequence analysis of avian h n isolates revealed that % of them have n and % have s, suggesting that mab h has the ability to bind to the majority of avian h n isolates.
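the polymorphism analysis described above amounts to tallying which amino acid occupies one position across a set of aligned sequences. the sketch below shows that tally in python; it is an illustration under stated assumptions, not the procedure used with the database: the function name is hypothetical, the input is assumed to be pre-aligned, and the toy peptides and the position are made up.

```python
from collections import Counter


def residue_frequencies(sequences, position):
    """Percentage of each amino acid at a 1-based position.

    Assumes the sequences are already aligned. Sequences that are too
    short, or that carry a gap ('-') or unknown ('X') at the position,
    are skipped.
    """
    counts = Counter(
        seq[position - 1]
        for seq in sequences
        if len(seq) >= position and seq[position - 1] not in "-X"
    )
    total = sum(counts.values())
    if total == 0:
        return {}
    return {aa: 100.0 * n / total for aa, n in counts.items()}


# Made-up toy fragments (not real isolates); position 3 is the variable site.
toy_sequences = ["GRNTL", "GRNTL", "GRSTL", "GRNTL", "GR-TL"]
freqs = residue_frequencies(toy_sequences, position=3)
print(freqs)  # {'N': 75.0, 'S': 25.0}
```

running the same tally separately per subtype and per isolation period would give the kind of breakdown summarized in the table.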
interestingly, a recent computational study compared the viral proteins of high- and low-pathogenicity h viruses and identified the s to n substitution as a potential marker of pathogenicity of avian influenza virus subtype h . as for human isolates, the majority ( %) of seasonal h n isolated before have s in ns like h n -pr . however, almost all the pandemic h n (pdmh n ) isolates have n in ns . as for seasonal h n , % of viruses isolated before have n and this percentage increased to % for those isolated from to . thus, mab h is expected to bind to the majority of circulating seasonal influenza viruses. by solving a crystal structure of h -fab and docking it onto the ns (rbd) of h n with the haddock program, molecular contacts made by residues t and n at the interface of the antibody-antigen complex were predicted (fig. ). based on one of the energetically best models generated, the side chain of residue n in ns (rbd) could form hydrogen bonds with the side chains of residues n and n of vh-cdr of h -fab. meanwhile, residue t in ns (rbd) seems to interact with residue r in vh-cdr . these predictions are consistent with the results of binding assays performed using substitution mutants of ns as described above and in our previous study . while the predicted d model of the antibody-antigen complex gives us some clues as to how residues and in ns (rbd) interact with h -fab, it may not be able to accurately reveal the contributions of all molecular contacts between ns (rbd) and h -fab. hence, further studies could focus on the use of other epitope mapping methodologies , besides crystallography, to obtain a more precise map of the molecular contacts at the antigen-antibody interface. furthermore, an alphascreen assay showed that mab h could inhibit the interaction between ns and dsrna (fig. ). while both residues and are located in helix α of the ns (rbd), it has been shown that t , but not s , is one of the key residues involved in dsrna binding .
thus, the interaction between t and h -fab predicted in the docked model is consistent with the ability of mab h to inhibit the interaction between ns and dsrna. although a previous study has demonstrated that mutation at residue did not affect the affinity of ns (rbd) for dsrna , our study suggests that the side chain of n makes crucial contacts with mab h to stabilize the antibody-antigen complex. as such, when the side chain is not present in the case of the n a substitution mutant, interaction with mab h is completely abolished (figs and ). this indicates that the interaction between the side chains of residue t in ns and residue r in h -fab is not sufficient to maintain the antibody-antigen interface.

figure legend (alphascreen assay): gst-tagged ns protein was incubated with biotinylated dsrna and the interaction between ns and dsrna could bring streptavidin-coated donor bead and anti-gst-conjugated acceptor bead close to each other. when excited at nm, singlet oxygen molecules ( o ) are produced from donor beads, which react with acceptor beads to produce light emission measured at - nm. (b) inhibition activity of mab h was determined by pre-incubating nm gst-tagged ns protein with serially diluted mab h or a negative control mab a . following that, luminescent signal was measured after the addition of nm biotinylated dsrna, acceptor and donor beads. (c) inhibition activity of mab h was determined by incubating nm gst-tagged ns protein, nm biotinylated-dsrna and serially diluted mab h or a simultaneously. following that, luminescent signal was measured after the addition of acceptor and donor beads. all readings obtained were normalized against that of samples in the absence of antibody. error bars represent sd of the experiment carried out in triplicates. *indicates statistically significant difference of p < . when compared to mab a .
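the alphascreen figure legend states that readings were normalized against samples without antibody and that error bars are the sd of triplicates. a minimal python sketch of that arithmetic follows; the function name and all luminescence counts are made up for illustration, and expressing the result as a percentage of the control is an assumption about presentation.

```python
from statistics import mean, stdev


def normalize_to_control(readings, control_readings):
    """Express each raw reading as a percentage of the mean no-antibody control."""
    control_mean = mean(control_readings)
    if control_mean == 0:
        raise ValueError("control mean is zero; cannot normalize")
    return [100.0 * r / control_mean for r in readings]


# Made-up luminescence counts, for illustration only.
no_antibody = [2000.0, 2100.0, 1900.0]    # triplicate wells without antibody
with_antibody = [1000.0, 1100.0, 900.0]   # triplicate wells at one antibody concentration

normalized = normalize_to_control(with_antibody, no_antibody)
print(f"{mean(normalized):.1f} ± {stdev(normalized):.1f} % of control")
# 50.0 ± 5.0 % of control
```

`stdev` is the sample standard deviation (n - 1 in the denominator), which is the usual choice for a small number of replicates.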
on the other hand, the t v and t a substitution mutants still retained some binding to mab h (figure s ), presumably because of the hydrogen bonds between residue n and the heavy chain of the antibody. based on the alphascreen assay, a high concentration of mab h (~ μ m) was required to reduce the interaction between ns (rbd) and dsrna by > % (fig. ). on the other hand, the concentration of mab h required for binding ns (rbd) in elisa was in the nanomolar range (fig. ). this discrepancy could be due to the differences between the two assays, but it also suggests that the binding of mab h to ns could have other effects on ns besides disrupting its interaction with dsrna. indeed, our previous gel filtration and dynamic light scattering results suggest that the complex between h -fab and ns (rbd) is multimeric in nature and each oligomer could consist of molecules of ns (rbd) and molecules of h -fab . in contrast, ns (rbd) eluted from the gel filtration column as a dimer, as would be expected, in the absence of h -fab . in addition, the delivery of h -fab into a cells also caused a reduction in the replication of rgpr -ns -s n recombinant virus (fig. ). while it is difficult to precisely define the biologically relevant quaternary structures of ns , several studies have shown that ns has conformational plasticity and that dynamic changes in the quaternary structure of ns are likely to be important for the different functions of ns in infected cells . hence, it is possible that the binding of h -fab to the ns expressed in infected a cells could affect the conformational plasticity of ns and/or its ability to interact with certain cellular factors, thus resulting in a reduction in viral replication. in future studies, advanced fluorescence microscopy techniques could be used to determine the effect of h -fab on the dynamics of ns inside infected cells. in summary, we have used biochemical and structural methods to characterize the interaction between ns and mab h .
our results showed that helix α in ns (rbd) is sufficient for interacting with mab h and that residues n and t in this helix are likely to make hydrogen bonds with the cdr of the antibody heavy chain. helix α is highly conserved and this is consistent with the ability of mab h to bind to different subtypes of iav. after solving a high resolution crystal structure of h -fab, a haddock-derived model of the antibody-antigen complex was obtained, and the molecular contacts predicted from this model are in agreement with results obtained from comparative elisa performed using ns mutants. this model may be used in antibody engineering experiments to increase the affinity of interaction between ns and mab h so as to increase the potency of mab h in viral inhibition. in addition, it may be useful in structure-based rational drug design to identify small molecule inhibitors of ns .

cells. a , t and mdck cells were purchased from american type culture collection (manassas, va, usa). a cells were cultured in minimum essential medium (mem) (gibco). t and mdck cells were cultured in dulbecco's modified eagle's medium (invitrogen). both media were supplemented with % fetal bovine serum (hyclone) and penicillin ( , units/ml)-streptomycin ( mg/ml) solution (sigma aldrich). all cell lines were maintained at °c with % co .

ascites and rabbit polyclonal antibodies production. ascites were produced by injecting hybridoma cells into peritoneal cavities of pristane-primed balb/c mice. the protocol was approved by the institutional animal care and use committee (iacuc) of the biological resource center, a*star, singapore (protocol number: ). in order to generate rabbit anti-h n -ns polyclonal antibody, gst-fusion ns protein was purified using the method previously described . new zealand white rabbits were then immunized with this protein and bled as previously described . the protocol was approved by iacuc of the biological resource center, a*star, singapore (protocol number: ).
all the animal procedures were performed in strict compliance with the recommendations of the naclar guideline in singapore. all efforts were made to minimize suffering, and euthanasia was performed using carbon dioxide.

protein expression and purification. ns (rbd) (residues - ) of a/chicken/hatay/ (h n ) was pcr amplified from a full-length ns gene (accession no.: caj . ) and the ns (rbd) (residues - ) of a/puerto rico/ / (h n ) was pcr amplified from a full-length ns gene (accession no.: abd . ). the expression constructs were generated by inserting the pcr product into pet sumo expression vector (invitrogen). ns mutant expression constructs were generated by overlap pcr as described previously . all proteins were expressed and purified as previously described . the purified proteins were dialyzed against dialysis buffer ( mm tris-hcl, ph . , mm nacl) and concentrated to mg/ml in a centriprep- (amicon) for subsequent assays.

crystallization of h -fab. mab h was purified from the ascites by using affinity chromatography and h -fab was obtained by papain cleavage as described previously . the purified h -fab was dialyzed against dialysis buffer and concentrated to mg/ml for subsequent crystallization. crystallizations were performed with the hanging drop vapor diffusion method at °c by mixing μ l of h -fab with μ l of reservoir solution and the mixture was equilibrated against μ l of reservoir solution. crystals of h -fab were grown against crystallization buffer containing % peg , . m nacl and . m bis-tris, ph . . these crystals grew to a maximum size of . mm × . mm × . mm over the course of days. single crystals were obtained by dissecting them from multiple crystals. crystals were flash frozen ( k) in the above reservoir solution supplemented with % glycerol. a total of frames of a native data set with oscillation at . Å wavelength were collected for h -fab. all data sets were processed by hkl .
most of the crystals diffracted rather weakly and the scaled data sets were anisotropic with strong ice rings and high mosaicities. nevertheless, one of the data sets displaying weak ice rings could be scaled to . Å at a mosaicity of . °. this data set was used for structure determination and refinement, and the statistics are listed in table .

of h -fab were predicted using the online igblast tool (http://www.ncbi.nlm.nih.gov/igblast/). as the three-dimensional structure of ns (rbd) of a/chicken/hatay/ (h n ) has not been solved, the structure of ns (rbd) of a/crow/kyoto/t / (h n ) (pdb id: z a) was used instead. as shown in fig. a, there are amino acid differences between the ns (rbd) of these two strains but the sequence of helix α in the ns (rbd) is % identical. mutagenesis data from comparative elisa were used to generate a series of models for the complex between h -fab and ns (rbd) by using version . of the haddock webserver , . along with the available individual structures, haddock utilizes experimentally derived data to predict the structure of the complex. to achieve this, ns (rbd) was docked to h -fab using the easy interface of the haddock webserver, where both residues n and t of ns (rbd) and amino acids in the cdrs of h -fab were defined as active residues involved in the interaction. in this docking experiment, only the variable domains of h -fab were used.

enzyme-linked immunosorbent assay (elisa). elisa was performed as described previously . briefly, a synthetic peptide corresponding to residues - in h n -ns (apfldrlrrdqkslrgrgntlgld, chemically synthesized by gl biochem) or purified wt and mutant ns (rbd) proteins were serially diluted into . m carbonate-bicarbonate buffer, ph . . a -mer peptide corresponding to a fragment of the hepatitis c virus core protein (rpswgpidprrrsknlgkvidtltcgfap, chemically synthesized by genway biotech) was used as a negative control.
proteins or peptides ( μ l) were then coated onto -well elisa plates (nunc) overnight at °c. the wells were blocked in % milk in pbs with . % tween (pbst) for h at °c followed by addition of μ l of mab h as primary antibody to each well and incubated at °c for - h. the wells were then washed in pbst followed by the addition of goat anti-mouse horse-radish peroxidase (hrp)-conjugated antibody (pierce) as secondary antibody and incubated at °c for h. tetramethylbenzidine substrate (pierce) was added and reaction was stopped using . m sulfuric acid. absorbance at nm was recorded using an absorbance reader (tecan infinite m ). recombinant viruses were generated with phw reverse genetic system as described previously . residue in ns was changed from s to a or n by pcr mutagenesis and the resulting dna were inserted into phw vector. this plasmid was co-transfected with another seven phw plasmids containing the other pr genomic dnas into t cells. days post transfection, culture supernatant was collected and used to infect mdck cells. when cytopathic effect (cpe) was visibly detected, culture supernatant was collected and used to infect naïve mdck cells. individual plaque was then amplified and viral titer was determined by plaque assay. virus infection and western blot analysis. % confluent t cells were infected with moi of wt or mutant viruses at °c for h. the medium was discarded and cells were rinsed with pbs. cell lysates were harvested at and h.p.i. in ripa buffer ( mm tris-hcl ph . , mm nacl, . % np , . % sodium deoxycholate, . % sds). then, μ g of total lysate were resolved using electrophoresis on an sds-polyacrylamide gel and transferred to a nitrocellulose membrane (bio-rad). antibodies against gapdh (santa cruz), np (millipore), and m (as described previously) were used. mab h and rabbit anti-h n -ns polyclonal antibody (as described above) were also used. 
after washing, the membrane was incubated with a hrp-conjugated secondary antibody (pierce) at room temperature for h. the membranes were then washed and detected with enhanced chemiluminescence substrate (pierce) using chemidoc ™ mp imaging system (bio-rad). multiple-cycle growth kinetics of recombinant virus. plaque assay was applied to determine the growth kinetics of rgpr -ns -wt, rgpr -ns -s a and rgpr -ns -s n recombinant viruses. % confluent monolayers of a cells were rinsed with pbs and subsequently adsorbed with . moi of wt and mutant viruses respectively for h at °c. the medium was discarded and the cells were rinsed using pbs and cultured in mem without serum at °c. supernatant containing virus was collected at , , , and h.p.i. respectively and subjected to plaque assay to determine viral titer. plaque assay in mdck cells. % confluent mdck cells were adsorbed with serially diluted supernatants containing viruses for h at °c. the medium was discarded and the cells were rinsed using pbs. the cells were overlaid with ml of dmem supplemented by . % agar and μ g/ml tpck-trypsin (thermo scientific). after incubation at °c for days, the cells were fixed using % formalin for h and stained using . % crystal violet solution. delivery of a -fab and h -fab into a cells. a cells were cultured in -well plate. the a -fab and h -fab were then transfected into % confluent cells by using lipodin-ab reagent (abbiotec) according to manufacturer's protocol with slight modification. briefly, μ l of fab solution ( μ g) was mixed thoroughly with μ l lipodin-ab solution and incubated for min at room temperature. μ l of serum-free mem medium was added to fab/lipodin-ab solution and then immediately added to the cells. the cells were washed with pbs once before the fab/lipodin-ab solution was added. the cells were incubated at °c in % co for h, washed once with pbs and subjected to viral infection and western blot analysis as described above. cell viability assay. 
cell viability was determined using wst- reagent (roche) according to the manufacturer's protocol. briefly, μ l of wst- reagent was diluted in μ l of culture media, added to a cells cultured in a transparent -well microplate and incubated for h, followed by measuring absorbance at nm.

immunofluorescence assay. a cells grown on coverslips were transfected with fab as described above. approximately h after transfection, the medium was aspirated, and the cells were washed once with pbs, placed in serum-free mem medium and incubated at °c in % co for another h. the cells were then washed with pbs once, fixed with % paraformaldehyde for min and permeabilized with . % tritonx- for min, followed by blocking with % bsa in pbs for min. the cells were incubated with alexa fluor -conjugated goat anti-mouse igg antibody (invitrogen) for h. after washing, cells were stained with dapi before mounting. images were captured using an olympus fluoview fv laser-scanning confocal microscope.

alphascreen-based dsrna binding inhibition assay. a nt-sirna previously reported to form a complex with ns (rbd) was purchased from thermo scientific dharmacon (dharmacon, lafayette, co). the sirna was biotinylated at the ′ end of the sense strand and the sirna sequences were as follows: ′ -biotin-agacaccauuaugcugucuuu- ′ (sense) and ′ -agacagcauaauggugucuuu- ′ (antisense). the lyophilized sirnas were reconstituted in rnase-free water to a final concentration of nm. to express gst-tagged ns (rbd), the ns (rbd) of a/chicken/hatay/ (h n ) was pcr amplified from a full-length ns gene (accession no.: caj . ) and cloned into pgex- p- vector (ge healthcare). the expression and purification were performed as described previously . the purified gst-fusion protein was dialysed against pbs and the final concentration was determined using the coomassie plus protein assay reagent (thermo scientific). the in vitro rna binding inhibition assay was carried out in a -well proxiplate by using the alphascreen anti-gst kit (perkinelmer).
in the first experiment, μ l of nm gst-tagged proteins were mixed with same volume of serially diluted mab h and incubated at room temperature for h. then, μ l of nm biotinylated nt-sirna was added into the binding mixture and incubated at room temperature for h before the addition of μ l of detection mixture containing . μ l of anti-gst (glutathione s-transferase) acceptor beads and . μ l of streptavidin-coated donor beads (perkinelmer). after another incubation at room temperature for h, luminescent signal was measured using an enspire multimode plate reader (perkinelmer) . in the second experiment, μ l serially diluted mab h was mixed with μ l of nm gst-tagged proteins and μ l of nm biotinylated nt-sirna simultaneously. after incubation at room temperature for h, detection mixture was added and measurement was made similarly. statistical analysis. two-tailed student's t test was applied to evaluate the statistical significance of differences measured from the data sets obtained in independent experiments. p < . was considered statistically significant. 
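the statistical analysis above names a two-tailed student's t test. the sketch below shows how such a test can be computed in python with the standard library only. several points are assumptions made for illustration rather than details taken from the methods: the pooled-variance (equal variance) form of the test, the use of numerical integration for the p-value, the function names, the made-up sample values, and the conventional 0.05 threshold in the example.

```python
import math
from statistics import mean, variance


def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    coef = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return coef * (1 + x * x / df) ** (-(df + 1) / 2)


def two_tailed_p(t, df, steps=20000):
    """Two-tailed p-value: 1 - 2 * integral of the density from 0 to |t|.

    The integral uses Simpson's rule, so `steps` must be even.
    """
    upper = abs(t)
    h = upper / steps
    area = t_pdf(0.0, df) + t_pdf(upper, df)
    for i in range(1, steps):
        area += (4 if i % 2 else 2) * t_pdf(i * h, df)
    return max(0.0, 1.0 - 2.0 * area * h / 3.0)


def student_t_test(a, b):
    """Pooled-variance two-sample t test; returns (t, df, two-tailed p)."""
    na, nb = len(a), len(b)
    df = na + nb - 2
    pooled = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / df
    t = (mean(a) - mean(b)) / math.sqrt(pooled * (1 / na + 1 / nb))
    return t, df, two_tailed_p(t, df)


# Made-up normalized signals (% of control), three replicates per group.
treated = [50.0, 55.0, 45.0]
control = [100.0, 105.0, 95.0]
t, df, p = student_t_test(treated, control)
print(f"t = {t:.2f}, df = {df}, p = {p:.4g}, below 0.05: {p < 0.05}")
```

where scipy is available, `scipy.stats.ttest_ind` performs the same pooled-variance test by default and is the more practical choice; the hand-written integration here only keeps the sketch dependency-free.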
influenza-who cares
orthomyxoviridae: the viruses and their replication
new world bats harbor diverse influenza a viruses
mixed infections of pandemic h n and seasonal h n viruses in outbreak
advances in the development of influenza virus vaccines
toward a universal influenza virus vaccine: prospects and challenges
evasion of influenza a viruses from innate and adaptive immune responses
viral m ion channel protein: a promising target for anti-influenza drug discovery
influenza neuraminidase inhibitors: antiviral action and mechanisms of resistance
avian flu: isolation of drug-resistant h n virus
influenza a virus strains that circulate in humans differ in the ability of their ns proteins to block the activation of irf and interferon-beta transcription
virulence of h n avian influenza virus enhanced by a -nucleotide deletion in the viral nonstructural gene
rna binding by the novel helical domain of the influenza virus ns protein requires its dimer structure and a small number of specific basic amino acids
binding of influenza a virus ns protein to dsrna in vitro
the influenza virus ns protein is a poly(a)-binding protein that inhibits nuclear export of mrnas containing poly(a)
the influenza virus ns protein binds to a specific region in human u snrna and inhibits u -u and u -u snrna interactions during splicing
the primary function of rna binding by the influenza a virus ns protein in infected cells: inhibiting the ′ - ′ oligo (a) synthetase/rnase l pathway
influenza a virus ns targets the ubiquitin ligase trim to evade recognition by the host viral rna sensor rig-i
structural basis for a novel interaction between the ns protein derived from the influenza virus and rig-i
binding of the influenza a virus ns protein to pkr mediates the inhibition of its activation by either pact or doublestranded rna
loss of function of the influenza a virus ns protein promotes apoptosis but this is not due to a failure to activate phosphatidylinositol -kinase (pi k)
influenza a virus ns protein activates the pi k/akt pathway to mediate antiapoptotic signaling responses
identification of influenza virus inhibitors targeting ns a utilizing fluorescence polarization-based high-throughput assay
synthesis and evaluation of quinoxaline derivatives as potential influenza ns a protein inhibitors
novel influenza virus ns antagonists block replication and restore innate immune function
antiviral activity of baicalin against influenza virus h n -pdm is due to modulation of ns -mediated cellular innate immune responses
small interfering rna targeting the nonstructural gene transcript inhibits influenza a virus replication in experimental mice
a new panel of ns antibodies for easy detection and titration of influenza a virus
a monoclonal antibody binds to threonine in the non-structural protein of influenza a virus and interferes with its ability to modulate viral replication
roles of the phosphorylation of specific serines and threonines in the ns protein of human influenza a viruses
monoclonal antibodies targeting the hr domain and the region immediately upstream of the hr of the s protein neutralize in vitro infection of severe acute respiratory syndrome coronavirus
haddock: a protein-protein docking approach based on biochemical or biophysical information
conserved surface features form the double-stranded rna binding site of non-structural protein (ns ) from influenza a and b viruses
structural basis for dsrna recognition by ns protein of influenza a virus
assay optimization and screening of rna-protein interactions by alphascreen
the multifunctional ns protein of influenza a viruses
a complete map of potential pathogenicity markers of avian influenza virus subtype h predicted from expressed proteins
current approaches to fine mapping of antigen-antibody interactions
conformational plasticity of the influenza a virus ns protein
comparing the antibody responses against recombinant hemagglutinin proteins of avian influenza a (h n ) virus expressed in insect cells and bacteria
simple and efficient site-directed mutagenesis using two single-primer reactions in parallel to generate mutants for protein structure-function studies
haddock versus haddock: new features and performance of haddock . on the capri targets
a dna transfection system for generation of influenza a virus from eight plasmids
the pymol molecular graphics system, version

we are grateful to r.g. webster for the pr based reverse genetics system. we thank members of the monoclonal antibody unit at institute of molecular and cell biology for technical assistance. this work was supported by the singapore ministry of health's national medical research council under its nmrc-cbrg scheme [grant no. nmrc/cbrg/ / ].

key: cord- -v xs ej authors: vadlamani, bhaskar s.; uppal, timsy; verma, subhash c.; misra, mano title: functionalized tio( ) nanotube-based electrochemical biosensor for rapid detection of sars-cov- date: - - journal: sensors (basel) doi: . /s sha: doc_id: cord_uid: v xs ej

the coronavirus disease (covid- ) is a newly emerging viral disease caused by the severe acute respiratory syndrome coronavirus (sars-cov- ). rapid increase in the number of covid- cases worldwide led the who to declare a pandemic within a few months after the first case of infection. due to the lack of a prophylactic measure to control the virus infection and spread, early diagnosis and quarantining of infected as well as the asymptomatic individuals are necessary for the containment of this pandemic. however, the current methods for sars-cov- diagnosis are expensive and time consuming, although some promising and inexpensive technologies are becoming available for emergency use. in this work, we report the synthesis of a cheap, yet highly sensitive, cobalt-functionalized tio( ) nanotubes (co-tnts)-based electrochemical sensor for rapid detection of sars-cov- through sensing the spike (receptor binding domain (rbd)) present on the surface of the virus.
a simple, low-cost, and one-step electrochemical anodization route was used for synthesizing tnts, followed by an incipient wetting method for cobalt functionalization of the tnts platform, which was connected to a potentiostat for data collection. this sensor specifically detected the s-rbd protein of sars-cov- even at very low concentration (range of to nm (nano molar)). additionally, our sensor showed a linear response in the detection of viral protein over the concentration range. thus, our co-tnt sensor is highly effective in detecting sars-cov- s-rbd protein in approximately s, which can be explored for developing a point of care diagnostics for rapid detection of sars-cov- in nasal secretions and saliva samples. the current outbreak of novel coronavirus (ncov- or sars-cov- ), was first detected in wuhan, china in december , but quickly spread to other parts of china as well as to the entire world, causing a pandemic [ ] . according to the who, as of august, , around , , people are infected, and , people have died due to sars-cov- infection [ ] . sars-cov- infection causes a variety of symptoms including fever, cough and respiratory distress, which are collectively called the coronavirus disease or covid- [ ] . the spreading of sars-cov- primarily occurs from person-to-person transmission through close contact or via small droplets produced during coughing, sneezing, and talking [ , ] . the incubation period for sars-cov- is around - days, with no noticeable symptoms; however, the viral transmission from an infected person to a non-infected person is still possible during this asymptomatic period [ ] . under the current scenario, with no vaccines in the market, global lockdown regulations are in place in order to minimize the viral spread. consequently, this pandemic has caused a severe socio-economic impact on the world economy and raised fears of a global recession [ ] . 
currently, the real-time reverse-transcriptase polymerase chain reaction (rt-pcr) technique is the most common and reliable laboratory testing method for qualitative/quantitative sars-cov- detection [ , ] followed by serum virus neutralization assay (svna) for the determination of antibody neutralization [ ] and enzyme-linked immunoassays (elisa) for the detection of antibodies against sars-cov- [ ] . however, the major limitations of these laboratory-based diagnostic tests are the invasive nature of the tests that often require trained personal for nasopharyngeal sample collection, along with the requirement of highly sophisticated machines, cross-reactivity with other viruses, and longer duration of testing. in order to contain the viral spread, surveillance of even asymptomatic individuals is needed, which is feasible only after the development of a simple, portable and rapid point-of-use sensor for the detection of sars-cov- . sars-cov- has a positive-sense, single-stranded rna (~ k bp) genome with orfs that encode for structural, replication and non-structural proteins [ ] . similar to its genetic cousin, human sars-cov, sars-cov- consists of four structural proteins viz. spike (s), envelope (e), membrane (m), and nucleocapsid (n). coronaviruses are named for the crown like spike glycoprotein, s (composed of two subunits: the s subunit and the s subunit) on the surface/envelope [ ] . the s subunit of the s protein consists of a receptor binding domain (rbd) that has a high binding affinity towards the host angiotensin-converting enzyme ii (ace ) receptor present on the human cells; the s subunit mediates virus-host cell fusion and entry [ ] . importantly, the s protein is highly immunogenic and induces immune response to produce neutralizing antibodies as well as t-cell responses in sars-cov- infected individuals [ ] . functionally, binding of s-rbd to the hace receptor is crucial for the entry of sars-cov- into human cells. 
interestingly, sars-cov- s-rbd shares only % sequence identity with sars-cov s-rbd, which has been evaluated for vaccines and therapeutic drug development [ ]. hence, the s-rbd of sars-cov- is an excellent target for diagnostic and therapeutic interventions. electrochemical biosensors are advantageous for sensing biomolecules because of their ability to detect biomarkers with accuracy, specificity and high sensitivity [ ]. electrochemical biosensors have been successfully used in medical diagnostics for the detection of viruses such as the middle east respiratory syndrome coronavirus (mers-cov) [ ], the human enterovirus (ev ) [ ], the human influenza a virus h n [ ], and the avian influenza virus (aiv) h n [ ]. lahyquah et al. [ ] used an array of carbon electrodes modified with gold nanoparticles for the detection of mers-cov. very recently, a biosensor using gold nanoparticle decorated fto glass immobilized with ncovid- monoclonal antibody was reported for the detection of sars-cov- [ ]. the functionality of the electrochemical biosensor can be further improved by nanostructuring the electrode, which raises the electrode surface area to volume ratio and hence the electrode surface area available per unit volume of analyte fluid, thereby increasing the electrochemical reaction rate. in the work by chin et al. on the encephalitis virus, it was found that nanostructuring of carbon electrodes with carbon nanoparticles increased the current response by % due to enhanced electron charge transfer kinetics [ ]. similarly, we have reported that co functionalized tio nanotubes (co-tnts) with higher surface-to-volume ratio can detect the biomarkers associated with tuberculosis [ , ]. the proposed sensing mechanism involves the formation of a complex between co and the biomarker at a specific bias voltage, due to the reduction of co ions and oxidation of the biomarker.
similarly, we hypothesized that the s-rbd of sars-cov- can be detected through complexing of functionalized nanoparticles with the s-rbd protein. a schematic of viral detection directly from a patient sample is shown in figure . in the current work, we have determined the potential of co-functionalized tio nanotubes (co-tnts) for the electrochemical detection of s-rbd protein of sars-cov- . tnts were synthesized by a simple, cost-effective, one-step electrochemical anodization route, and co functionalization was carried out using the incipient wetting method. our data show that cobalt functionalized tnts can selectively detect the s-rbd protein of sars-cov- using the amperometry electrochemical technique in ~ s.
tnts were synthesized by electrochemical anodization of the ti sheet. a ti sheet of size . × . cm, with a tab mm in width, was cut out of a g grade ti sheet (thickness . mm). one side of the coupon was polished with grit polishing paper for min to remove any surface metal oxide layer. the coupon was ultrasonicated in a : solution of ethanol and acetone for min. the unpolished side was masked with kapton tape to avoid any exposure to electrolyte during anodization. the electrochemical anodization was performed in a standard two-electrode configuration, using ti foil as a working electrode and platinum foil as a counter electrode with a cm gap between them. the anodization was carried out using an electrolyte of composition . ml (ch oh) , ml di h o, and . g nh f, in a teflon beaker. the electrolyte was maintained at a subzero temperature and was continuously stirred using a magnetic stirrer at a speed of rpm. the anodization was carried out by maintaining a constant voltage of v across both the electrodes for min. after anodization, the sample was rinsed in di h o and baked in an oven at °c for h. the kapton tape was removed from the sample after baking, and the sample was annealed in a tube furnace at °c for h in a continuous flow of oxygen. the annealed tnts obtained from the furnace were functionalized with cobalt using an incipient wetting method, i.e., a wet ion exchange process. the same side of the sample that was masked earlier was again masked with kapton tape. the sample was ultrasonicated in a solution containing . g of cocl . h o in ml ethanol for min. the sample was baked in an oven at °c for h to obtain cobalt functionalized tnts.
the morphology of the tnts and co-tnts was examined using dual beam scanning electron microscopy (sem, thermofisher scientific). the cobalt content in the co-tnt sample was analyzed using the eds detector attached to the sem. the sem micrographs were analyzed using imagej software.
the pcaggs vector containing sars-cov- wuhan-hu- spike glycoprotein receptor binding domain (rbd) with a c-terminal hexa-histidine tag was obtained from bei resources (niaid, nih, nr- ). his -tagged s-rbd containing pcaggs plasmid was expressed in hek t (human embryonic kidney) cells obtained from the american type culture collection (atcc) and maintained in dulbecco's modified eagle medium (dmem), supplemented with % fetal bovine serum (fbs, atlanta biologicals), mm l-glutamine, u/ml penicillin, and µg/ml streptomycin. cells were grown at °c in a humidified chamber supplemented with % co . for his -tagged s-rbd protein generation, hek t cells were transfected with recombinant plasmid using the neon transfection system (thermo scientific) according to the manufacturer's instructions. supernatants from transfected cells were harvested on day post-transfection and the cell debris was removed by centrifugation ( rpm, min at °c). supernatants were then incubated with ml of ni-nta agarose (qiagen) for every ml of supernatant, for h at °c with rotation.
for s-rbd purification, gravity flow columns were used to load the ni-nta agarose bound his -tagged spike-rbd protein, followed by washing with wash buffer ( mm sodium phosphate, mm nacl, m urea, mm imidazole, ph . ) and eluting with elution buffer ( mm sodium phosphate, mm nacl, m urea, mm imidazole, ph . ). the eluted protein was concentrated using protein concentrators (thermo scientific, , and ), quantified using bradford assay and nanodrop (thermo scientific) and further analyzed by sds-page. the electrochemical sensing of s-rbd protein was carried out using a custom-built co-tnt packaged printed circuit board setup. the sensor response was measured with the help of gamry reference + potentiostat attached to the printed circuit board. the custom-made printed circuit board consists of a copper clamp that holds the co-tnt grown over the ti sheet. the upward-facing co-tnt side acts as a working electrode, and the bottom-facing ti side acts as a counter electrode, to which electrical connections were made via copper lines running on the top and bottom of the custom-built chip, respectively. the detailed schematic of the printed circuit board was reported in our earlier work [ ] . the schematic of the whole sensing set up along with the detection methodology is shown in figure . amperometry is an electrochemical technique where a constant voltage is applied across the electrodes and response current is monitored as a function of time [ ] . the technique uses response current to determine the concentration of the analyte in the electrolyte solution between the electrodes. the s-rbd protein in the elution buffer ( mm sodium phosphate, mm nacl, m urea, mm imidazole, ph . ) was transferred onto the surface of co-tnt using a micropipette. the sensor response with various s-rbd protein in concentrations was determined using the amperometry technique, at a bias voltage of − . v. 
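the amperometric readout described above (constant bias, current recorded against time) can be reduced to a few summary numbers per run. the following is a minimal sketch, not the authors' analysis code: the function name `analyze_trace`, its arguments, and the sample trace are all hypothetical, and it assumes that the response appears as an increase in current after the moment of exposure and that the response time is counted from that moment.

```python
from statistics import mean


def analyze_trace(times_s, currents, t_exposure_s):
    """Summarize one amperometric trace recorded at a fixed bias voltage.

    Hypothetical helper (not from the source). Returns a tuple
    (baseline, peak, response_time_s):
      baseline        - mean current sampled before the exposure time
      peak            - largest current sampled at or after exposure
      response_time_s - delay between exposure and that peak
    """
    if len(times_s) != len(currents):
        raise ValueError("times_s and currents must have the same length")
    before = [i for t, i in zip(times_s, currents) if t < t_exposure_s]
    after = [(t, i) for t, i in zip(times_s, currents) if t >= t_exposure_s]
    if not before or not after:
        raise ValueError("need samples both before and after exposure")
    baseline = mean(before)
    # max() returns the earliest sample if several share the peak value.
    t_peak, peak = max(after, key=lambda sample: sample[1])
    return baseline, peak, t_peak - t_exposure_s


# Made-up trace in arbitrary units, used only to exercise the function.
times = [0, 1, 2, 3, 4, 5, 6]
currents = [1.0, 1.0, 1.0, 5.0, 9.0, 7.0, 6.0]
baseline, peak, response_time = analyze_trace(times, currents, t_exposure_s=3)
print(baseline, peak, response_time)
```

taking the baseline as the mean of the pre-exposure samples is one simple choice; a real trace may need drift correction or smoothing before the peak is located.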
the bias voltage was determined by conducting cyclic voltammetry experiments in the voltage window − to + v. all the experiments were carried out at room temperature. the scanning electron microscopy (sem) micrographs of the tnts, prepared by electrochemical anodization, are shown in figure a. the inset shows the side view of the tnts (figure a). the outer diameter and wall thickness of tnts were ~ and ~ nm, respectively. the average length of tnts was found to be ~ . µm. in our earlier work, tnts synthesized under similar conditions were found to show the crystalline anatase phase predominantly [ ]. the surface morphology of the co-tnts examined under sem is shown in figure b. the sem micrograph reveals the presence of precipitates on top of the tnt surface. eds analysis confirmed the uniform distribution of co on top of tnts (figure c), and the co content was found to be ~ wt% (figure d). we have previously shown using detailed xps studies that co exists in the co + state ( p / peak at . ev) and that co(oh) is the predominant phase present on the surface of co-tnts [ ]. therefore, the tnts can be visualized as having a very large surface area, uniformly decorated with co + ions.
the receptor binding domain of the spike glycoprotein (s-rbd), present as a crown on the surface of the virus, is an easily accessible target for the detection of sars-cov- . the rbd comprises amino acids - , which form a ~ kda protein with potential n-glycosylation sites. as shown in figure a,b, the sds-page gel of his -tagged s-rbd protein, either stained with simplyblue safestain (figure a) or immunologically detected with mouse anti-his monoclonal antibody (figure b), showed the presence of specific protein in our viral protein preparation. immunoblot detected sars-cov- s-rbd protein at approximately kda, as expected, but also at kda, representing the dimeric form of s-rbd protein (figure b). detection of a slightly higher molecular weight (~ kda) s-rbd protein as compared to the calculated size was possibly because of post-translational modifications, including glycosylation of the protein. importantly, the s-rbd protein purified from the human embryonic kidney cells was of high purity; it was used for quantitation and detection on co-tnt sensors.
the ability of co-tnt to sense the s-rbd protein of sars-cov- was determined by performing an amperometry experiment at a bias voltage of − . v. the amperometry curves obtained at various concentrations of protein are shown in figure . the sensor was exposed to protein s after the beginning of the experiment (marked by an arrow). the sensor response current increased sharply and rapidly as the sensor was exposed to the protein. at a protein concentration of nm (nano molar), the peak sensor current output was found to be ~ . µa (micro ampere). the peak current decreased to ~ . µa at a protein concentration of nm and further decreased to ~ . µa at a protein concentration of nm. the sensor detection time was ~ s over the concentration range of to nm. it is hypothesized that the rapid increase in sensor response current could be attributed to the electrochemically triggered unfolding of protein that exposes its interior [ ] [ ] [ ] and subsequent complex formation between co and the protein [ , ]. each s-rbd protein monomeric unit contains tyrosine, tryptophan and cysteine amino acid residues [ ], and all of them were reported to undergo electrochemical oxidation under application of potential [ , ]. the electrochemical oxidation process involves deprotonation, where the -oh functional group in the protein is converted to -o − . we envisage that the complex formation occurs between the co + ion in co-tnt and the -o − radical in the protein.
a very similar mechanism was reported earlier, where methyl nicotinate biomarker was exposed to co-tnt [ ].
the average sensor response time, which is defined as the time taken to reach the peak current, was found to be ~ s. it is very short compared to our earlier studies on the sensor for colorectal cancer, where a sensor response time of ~ s was documented [ ]. the shorter sensor response time indicates faster kinetics of the reaction between co-tnt and the protein molecules. at all the protein concentrations, it was observed that the sensor current did not recover to the initial baseline current within the experimental timeframe. therefore, the sensor recovery time, defined as the time the sensor takes to recover to the initial baseline current value, could not be reported. this incomplete recovery could be due to a change in the surface chemistry of co-tnt after interaction with the protein. the sensor response (sr) was calculated at various protein concentrations based on the following equation:
$$\mathrm{SR} = \frac{I_{\max,\mathrm{protein}} - I_{\max,\mathrm{baseline}}}{I_{\max,\mathrm{baseline}}}$$
where $I_{\max,\mathrm{protein}}$ is the maximum current obtained when the sensor is exposed to sars-cov- s-rbd protein and $I_{\max,\mathrm{baseline}}$ is the maximum current obtained when the sensor is not exposed to the protein. the value of $I_{\max,\mathrm{baseline}}$ was found to be ~ pa (figure ). the sensor responses measured at different protein concentrations are shown in figure . the sensor response was found to increase with an increase in the concentration of protein. moreover, the sensor response exhibited excellent linearity over the concentration range to nm with a correlation coefficient of r = . . a linear calibration curve for the sensor response was obtained by regression, with sr as the sensor response and c as the concentration of protein in nm. according to statistical analysis [ ], the detection limit of measurements using the sensor was determined to be . nm. the limit of detection can be further improved by the use of (i) co-tnt synthesized by an in-situ anodization technique and (ii) co-tnts of even higher length.
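the sensor response and linear calibration described above can be sketched in a few lines. this is an illustrative sketch only: the function names and the numbers in the demo are hypothetical, the fit is an ordinary least-squares line, and the detection-limit helper uses the common k·σ/slope convention (with k = 3.3) as an assumption, since the text cites a statistical method without spelling it out.

```python
def sensor_response(i_max_protein, i_max_baseline):
    """Relative sensor response: (I_max,protein - I_max,baseline) / I_max,baseline."""
    if i_max_baseline == 0:
        raise ValueError("baseline current must be non-zero")
    return (i_max_protein - i_max_baseline) / i_max_baseline


def fit_line(x, y):
    """Ordinary least-squares fit y = slope * x + intercept.

    Returns (slope, intercept, r), where r is the Pearson correlation
    coefficient between x and y.
    """
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    if sxx == 0 or syy == 0:
        raise ValueError("x and y must each contain at least two distinct values")
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r = sxy / (sxx * syy) ** 0.5
    return slope, intercept, r


def detection_limit(sigma, slope, k=3.3):
    """Assumed convention: LOD = k * sigma / slope (sigma = noise in the response)."""
    return k * sigma / slope


# Made-up calibration data (concentration vs. sensor response), not from the source.
concentrations = [1.0, 2.0, 3.0, 4.0]
responses = [3.0, 5.0, 7.0, 9.0]
slope, intercept, r = fit_line(concentrations, responses)
print(slope, intercept, r, detection_limit(sigma=0.1, slope=slope))
```

because the response is normalized by the baseline current, it is dimensionless, so the slope of the calibration line carries units of inverse concentration.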
previously, we found that co-tnt synthesized by in-situ anodization showed higher sensor sensitivity than co-tnt synthesized by the incipient wetting route in the detection of tuberculosis biomarkers [ ]. a higher sensor sensitivity corresponds to a better limit of detection and sensitivity of quantitation. the increased sensitivity was attributed to the presence of co(oh) precipitate sites in direct contact with the parent tio , due to which direct conduction is possible. the sensor sensitivity can also be improved by using longer co-tnts, as higher surface area results in a higher reaction rate; thereby, higher sensor response current can be obtained even at lower protein concentrations.
in this study, we developed a co-metal functionalized tnt as a sensing material for electrochemical detection of sars-cov- infection through the detection of the receptor binding domain (rbd) of spike glycoprotein. we confirmed the biosensor's potential for clinical application by analyzing the rbd of the spike glycoprotein on our sensor.
amperometry electrochemical studies indicated that the sensor could detect the protein in the concentration range to nm. the relationship between sensor response and protein concentration was found to be linear, with a limit of detection as low as ~ . nm. importantly, our sensor detected sars-cov- s-rbd protein in a very short time (~ s), supporting its use in developing a rapid diagnostic assay. our report thereby demonstrates the development of a simple, inexpensive, rapid and non-invasive diagnostic platform that has the potential of detecting sars-cov- in clinical specimens, including nasal and nasopharyngeal swabs or saliva. moreover, the developed approach has the potential for diagnosis of other respiratory viral diseases by identifying appropriate metallic elements to functionalize tnts. author contributions: formal analysis, methodology, writing-original draft preparation, writing-review and editing, b.s.v.; methodology, writing-original draft preparation, writing-review and editing, t.u.; conceptualization, methodology, project administration, funding, writing-review and editing, m.m.; conceptualization, methodology, project administration, funding, writing-review and editing, s.c.v. all authors have read and agreed to the published version of the manuscript. funding: this work was supported by the departmental and institutional funds. who. novel coronavirus ( -ncov) situation report- . available online who.
coronavirus disease (covid- ): situation report- the species severe acute respiratory syndrome-related coronavirus: classifying -ncov and naming it sars-cov- cluster of sars among medical students exposed to single patient identification of severe acute respiratory syndrome in canada the sars-cov- outbreak: what we know the socio-economic implications of the coronavirus pandemic (covid- ): a review molecular diagnosis of a novel coronavirus ( -ncov) causing an outbreak of pneumonia positive rt-pcr test results in patients recovered from covid- a sars-cov- surrogate virus neutralization test based on antibody-mediated blockage of ace -spike protein-protein interaction diagnostic performance of seven rapid igg/igm antibody tests and the euroimmun iga/igg elisa in covid- patients coronavirus infections and immune responses structural and functional properties of sars-cov- spike protein: potential antivirus drug development for covid- characterization of the receptor-binding domain (rbd) of novel coronavirus: implication for development of rbd protein as a viral attachment inhibitor and vaccine satchi-fainaro, r. 
immune-mediated approaches against covid- identification of sars-cov rbd-targeting monoclonal antibodies with cross-reactive or neutralizing activity against sars-cov- electrochemical biosensors for pathogen detection an electrochemical immunosensor for the corona virus associated with the middle east respiratory syndrome using an array of gold nanoparticle-modified carbon electrodes a colorimetric and electrochemical immunosensor for point-of-care detection of enterovirus electrochemical detection of influenza virus h n based on both immunomagnetic extraction and gold catalysis using an immobilization-free screen printed carbon microelectrode an impedance immunosensor based on low-cost microelectrodes and specific monoclonal antibodies for rapid detection of avian influenza virus h n in chicken swabs ecovsens-ultrasensitive novel in-house built printed circuit board based electrochemical device for rapid detection of ncovid- antigen, a spike protein domain of sars-cov- carbon nanoparticle modified screen printed carbon electrode as a disposable electrochemical immunosensor strip for the detection of japanese encephalitis virus titania nanotube array sensor for electrochemical detection of four predominate tuberculosis volatile biomarkers anodic functionalization of titania nanotube arrays for the electrochemical detection of tuberculosis biomarker vapors detection of food decay products using functionalized one-dimensional titania nanotubular arrays electrochemical methods: fundamentals and applications electrochemical detection of methyl nicotinate biomarker using functionalized anodized titania nanotube arrays electrochemical detection of methyl nicotinate biomarker using functionalized anodized titania nanotube arrays analysis of redox activity of proteins on the carbon screen printed electrodes chemical-induced unfolding of cofactor-free protein monitored by electrochemistry dominant forces in protein folding protein electrochemistry: application in
medicine. a review detection of four distinct volatile indicators of colorectal cancer using functionalized titania nanotubular arrays determination of the lower limit of detection the s-rbd expression vector was obtained through bei resources, niaid, nih: vector pcaggs containing the sars-cov- , wuhan-hu- spike glycoprotein gene rbd with c-terminal hexa-histidine tag, nr- . the authors declare no conflict of interest. key: cord- - voi y authors: han, hui-ju; liu, jian-wei; yu, hao; yu, xue-jie title: neutralizing monoclonal antibodies as promising therapeutics against middle east respiratory syndrome coronavirus infection date: - - journal: viruses doi: . /v sha: doc_id: cord_uid: voi y since emerging in , middle east respiratory syndrome coronavirus (mers-cov) has been a global public health threat with a high fatality rate and worldwide distribution. there are no approved vaccines or therapies for mers until now. passive immunotherapy with neutralizing monoclonal antibodies (mabs) is an effective prophylactic and therapeutic reagent against emerging viruses. in this article, we review current advances in neutralizing mabs against mers-cov. the receptor-binding domain (rbd) in the spike protein of mers-cov is a major target, and mouse, camel, or human-derived neutralizing mabs targeting rbd have been developed. a major problem with neutralizing mab therapy is mutant escape under selective pressure, which can be solved by combination of neutralizing mabs targeting different epitopes. neutralizing mabs are currently under preclinical evaluation, and they are promising candidate therapeutic agents against mers-cov infection. middle east respiratory syndrome (mers) emerged in in saudi arabia with the death of a man with pneumonia; the causative agent was subsequently identified as mers-cov, which belonged to lineage c betacoronaviruses [ ] . 
with dromedary camels (camelus dromedarius, also known as arabian camel) as direct sources and bats as potential reservoirs [ ] , mers-cov has been frequently introduced into human populations. once mers-cov is introduced into a person, person-to-person transmission might occur, and is responsible for approximately % of mers cases globally [ ] . mers-cov has been a consistent threat to humans. as of october , mers-cov has caused laboratory-confirmed human cases, including deaths in countries, with the fatality rate as high as % (http://www.who.int/emergencies/mers-cov/en/). although mers cases are primarily reported in the middle east, facilitated by international travelling, mers-cov can also be a worldwide threat, which is well illustrated by the mers outbreak in south korea in [ ] . given the potential risk of causing worldwide public health emergencies and the absence of licensed vaccines and antiviral therapeutics, the world health organization has listed mers-cov in the "list of blueprint priority diseases" (http://www.who.int/blueprint/priority-diseases/en/). vaccines are the most important approach against viral infections, but usually take a long time to develop. they are also unable to provide either immediate prophylactic protection or treat ongoing viral infections. neutralizing monoclonal antibodies (mabs) have recently emerged as a powerful tool to provide prophylactic and therapeutic protection against emerging viruses [ ] . potent neutralizing mabs can be achieved by various technologies, such as hybridoma technology, humanized mouse, phage or yeast display, and single b cell isolation [ ] . mers-cov is a single, positive-stranded rna virus of about kb, which encodes four major viral structural proteins-including spike (s), envelope (e), membrane (m) and nucleocapsid (n)-as well as several accessory proteins [ ] . 
the s protein ( aa) plays an important role in virus infection and consists of a receptor-binding subunit s (aa - ) and a membrane-fusion subunit s (aa - ). s mediates viral attachment to host cells and s mediates virus-cell membrane fusion [ ] . the s subunit contains a receptor-binding domain (rbd) (aa - ) [ ] that can bind to the cell receptor dipeptidyl peptidase (dpp , also known as cd ), and mediates viral attachment to target cells [ ] . the rbd consists of a core subdomain and a receptor-binding motif (rbm) (aa - ). the schematic representation of mers-cov s protein is shown in figure a . neutralizing mabs binding to the s protein of mers-cov can prevent viral attachment to the cell receptor and inhibit viral entry [ ] . the s protein of mers-cov is a key target for antivirals, and rbd is the most popular focus. in this article, we review the current knowledge on neutralizing mabs targeting the rbd of mers-cov. stable hybridoma cell lines were generated by fusing myeloma cells with splenocytes of mice that were immunized with mers-rbd protein. two neutralizing mabs, c and e , had high affinity for the rbd of mers-cov and blocked both pseudovirus and live mers-cov entry into cells with high efficacy [ ] . humanized c showed similar neutralizing activity in cell entry tests. in vivo tests indicated that c could significantly reduce the virus titers in the lungs of ad -hcd -transduced mice infected with mers-cov, highlighting its potential application in humans not only for preventing but also for treating mers-cov infection. crystallization of the c fab/mers-rbd complex showed that c recognized conformational epitopes (y -n , k , l -k , p , v -s , w -e , and d -q ), which partially overlapped the receptor-binding footprint in the rbd of mers-cov. the c complex interfered with mers-cov binding to dpp by both steric hindrance and interface-residue competition.
e competed with c to bind to mers-rbd, indicating that they recognized proximate or overlapping epitopes [ ] . neutralizing mab mersmab was obtained by fusing myeloma cells with splenocytes of a mouse that was immunized with recombinant mers-cov s [ ] . mersmab effectively blocked the entry of pseudovirus and live mers-cov into cells. structural analysis showed that mersmab bound to the rbd of mers-cov through recognizing conformational epitopes, and all of the residues critical for mersmab binding were located on the left ridge of rbm. mersmab neutralized mers-cov by competitively blocking the binding of mers-cov rbd to dpp . based on escape mutant analysis of the key residues on the rbd, it was found that residue l , d , r , e , and w were critical for mersmab binding to the rbd, while mutation of e , d , or e did not affect the interaction of mersmab and the rbd at all [ ] . an ultra-large nonimmune human antibody-phage display library was constructed with b cells of unimmunized donors. with a unique spanning strategy, seven human neutralizing mabs with varying neutralization efficacy to mers-cov were identified [ ] . binding detection demonstrated that the epitopes of these mabs lay within aa - of the s protein, which overlapped a large part of the rbd of mers-cov. binding competition assays showed that these mabs recognized at least three distinct epitope groups, which was further confirmed by escape studies. with no cross-epitope resistance, these mabs neutralized mers-cov by competitively blocking the binding of the rbd of mers-cov to dpp . escape mutant assays showed that five residues were critical for neutralization of these mabs, namely l , t , y , r , and p . of the seven mabs, b exhibited the best neutralization activity against both pseudovirus and live mers-cov infectivity in cells. moreover, under the selective pressure of these mabs, the igg form of b was superior, since it did not induce neutralization escape [ ] . 
in vivo tests demonstrated that b reduced lung pathology in rhesus monkeys infected with mers-cov [ ] . with its high neutralizing activity and suppression of mutant escape, b in the igg form is a promising therapeutic mab against mers-cov. three human mabs (m , m , and m ) were identified from a large naïve human phage display antibody library, which was constructed with peripheral blood mononuclear cells from healthy volunteers [ ] . the binding sites of the three mabs were within the rbd of mers-cov (aa - ); therefore, they neutralized mers-cov by competing with dpp for binding to the rbd. the three mabs also competed with each other to bind to the rbd of mers-cov, and mutant analysis showed that the three mabs possessed overlapping but distinct epitopes. of the three mabs, m neutralized both pseudovirus and live mers-cov infectivity in cells with exceptional potency (m inhibited % mers-cov pseudovirus infection at a concentration of . g/ml, and neutralized live mers-cov with ic of g/ml and ic of . g/ml). residues in the rbd crucial for m binding were l , d , e , d , w , and v [ ] . an in vivo study demonstrated that prophylaxis with m reduced virus titers in the lung of rabbits infected with mers-cov [ ] , and m also provided transgenic mice expressing human dpp with full prophylactic and therapeutic protection from mers-cov [ ] . however, another study with a non-human primate, the common marmoset, showed that m could only alleviate the severity of the disease, and did not provide complete protection against mers-cov [ ] . igg+ memory b cells were isolated from a mers patient, and were subsequently immortalized with epstein-barr virus. a neutralizing mab, lca , was identified, and was the first fully human neutralizing mab with naïve heavy and light chain pairs [ ] . lca efficiently neutralized mers-cov infectivity in cells.
in vivo study showed that lca provided balb/c mice transduced with adenoviral vectors expressing human dpp (hdpp ) with both prophylactic and postexposure protection against mers-cov. furthermore, the neutralizing efficacy of lca was evaluated in ifn-α/β receptor-knockout mice that were more stringent models of mers-cov infection. after transducing with hdpp , these mice showed more profound clinical symptoms when challenged with mers-cov [ ] ; administration of lca reduced mers-cov titer in the lungs of these mice more effectively (lung viral titer reduced by three logs in one day for ifn-α/β receptor-knockout mice vs. three days for balb/c mice) [ ] . with naïve heavy and light chain pairs, lca was more potent than b and comparable to m . cross-competition experiment demonstrated that lca competed with b to bind to the rbd. lca interacted with rbd residues around k , and the lca footprint on the rbd was partially overlapped with that of dpp . four residues in the rbd affected the binding of lca -namely t , k , e , and e -which were conserved in all mers-cov isolates. moreover, compared with dpp , the binding affinity of lca to rbd was significantly higher (~ -fold). therefore, one major neutralization mechanism of lca was to competitively inhibit the interaction of the rbd with dpp . interestingly, virus escape studies demonstrated that under the selective pressure of lca , a mutant variant (v a) in the n-terminal domain (ntd) of mers-cov s subunit was also generated [ ] . a gmp-approved cell line (lca . . ) that expresses lca in high concentrations has been established, highlighting its application as promising therapeutics against mers-cov infection [ ] . hybridoma b cells producing neutralizing mabs against the s protein of mers-cov were generated by immunizing humanized transgenic mice (velocimmune mice) with dna encoding the mers-cov s protein. two fully human neutralizing mabs, regn and regn , were obtained [ ] . 
the two mabs bound with high affinity to distinct epitopes on the rbd of mers-cov, which were conserved during the natural evolution of mers-cov. mutation as a result of selective pressure by one mab should not affect the binding of the other mab. regn neutralized a broad range of mers-cov isolates, the prototype emc/ strain and all clinical mutants including a p, s g, s f, a v, l f, d g, and v a. with the exception of v a variant, regn achieved similar neutralizing activity. in vivo study demonstrated that regn and regn reduced mers-cov replication in humanized dpp mice in both prophylactic and therapeutic settings [ ] . when evaluated in the common marmoset, both mabs seemed to be more effective for prophylaxis rather than for treatment of mers-cov infection [ ] . an anti-mers-cov phage display antibody library was constructed with the peripheral b cells of a mers survivor, and a human neutralizing mab against mers-cov, mca , was identified [ ] . mca showed potent neutralizing activity against mers-cov in cell entry tests. in vivo, mca completely inhibited the replication of mers-cov in common marmosets when administrated prophylactically or therapeutically. structure analysis of the mca fab-rbd complex showed that mca formed direct contacts with the receptor-binding site (rbs) subdomain on the rbd. epitopes on the rbs critical for mca binding were d , w , e , d , y , r , and q . superimposed structure analysis of mca -rbd and hdpp -rbd complexes showed that the binding interface of mca was largely overlapped with that of hdpp . therefore, the neutralizing mechanism of mca was achieved by competing with dpp for binding to the rbd [ ] . two potent human neutralizing mabs, mers- and mers- , were derived from a nonimmune human yeast display antibody library, which was constructed with spleen and lymph node polyadenylated rna from normal humans [ ] . [ ] . 
further structural analysis showed that mers- bound to unique epitopes and caused conformational changes in the rbd interface critical for accommodating dpp , thereby indirectly disrupting the interaction between the two. moreover, mers- also demonstrated synergistic effects with m and f (an ntd-specific mab). the special neutralizing mechanism made mers- a valuable addition for the combined use of mabs against mers-cov infection [ ] . thirteen ultrapotent neutralizing mabs, which all targeted the rbd of mers-cov, were generated following a protocol for the rapid production of antigen-specific human mabs [ ] . briefly, antibody-secreting b cells were isolated from the whole blood of a mers patient, and the antibody genes were amplified and cloned into vectors to transfect human cell lines for mab production. of the mabs, mers-gd and mers-gd exhibited the strongest neutralizing activity against both pseudovirus and live mers-cov in cell infection tests. mers-gd directly competed with dpp to bind to the rbd, and the crystal structure of mers-gd showed that its epitopes almost completely overlapped with the dpp -binding sites. mers-gd and mers-gd recognized distinct epitopes on the rbd, and had a low level of competing activity. the combined use of the two mabs demonstrated synergistic effects in neutralization against pseudotyped mers-cov. mutant analysis demonstrated that residues l , d , v , e , and a on rbd were important for the neutralizing activity of mers-gd , and residue r was critical for mers-gd [ ] . moreover, an in vivo study found that mers-gd could provide both prophylactic and therapeutic protection for hdpp -transgenic mice against mers-cov infection [ ] . dromedary camels exposed to mers-cov showed mild clinical signs but developed exceptionally potent neutralizing antibodies.
camelid species naturally produced heavy chain-only antibodies (hcabs) [ ] , which are dimeric and devoid of light chains, and their antigen recognition region is solely formed by the variable heavy chains (vhhs) (also called nanobodies, nbs). vhhs or nbs have long complementarity-determining region (cdr ) loops and are capable of binding to unique epitopes not accessible to conventional antibodies [ ] . notably, camelid vhhs are relatively stable and can be produced with high yields in prokaryotic systems [ ] . because of their small size; good tissue permeability; and cost-effective production, storage, and transportation [ ] [ ] [ ] , vhhs or nbs have been gaining acceptance as antiviral agents. a vhh complementary dna library was constructed with the bone marrow of dromedary camels infected with mers-cov. four vhhs (vhh- , vhh- , vhh- , and vhh- ) with high neutralizing activity were identified by direct cloning and screening of the phage display antibody library [ ] . the four vhhs competed for a single epitope that partially overlapped with the rbd-dpp interface. mutant analysis showed that the four vhhs did not bind to the d n variant, which was a critical residue on the rbd for dpp binding [ , ] . therefore, these vhhs most likely neutralized mers-cov by blocking its binding to dpp . of the vhhs, vhh- showed the best neutralizing activity and epitope recognition. vhh- efficiently blocked the entry of mers-cov into cells, and it also prophylactically protected k transgenic mouse expressing hdpp from mers-cov infection. to extend the half-life of vhh- in serum, it was linked to a human fc domain lacking the ch exon to construct the chimeric camel/human hcab- , which showed similar neutralizing activity as vhh- . the chimeric camel/human hcab- was highly stable in mice and provided k mice with fully prophylactic protection against mers-cov infection [ ] . 
alpacas were immunized with recombinant mers-cov rbd containing a c-terminal human igg fc tag, and vhhs were amplified from their peripheral blood mononuclear cells to construct a vhh phage display library. a neutralizing nb, nbms , which bound with high affinity to the rbd of mers-cov and blocked the binding of rbd to dpp , was identified [ ] . to extend its in vivo half-life, the human-fc-fused version, nbms -fc, was constructed. nbms competed with dpp to bind to rbd, indicating that the binding site of nbms on rbd overlapped with that of dpp . the binding site of nbms on the rbd was mapped to be around residue d , which is part of a highly conserved conformational epitope at the receptor-binding interface in almost all the natural mers-cov isolates published to date. nbms did not neutralize pseudotyped mers-cov bearing a mutation in d , confirming that residue d was critical for nbms binding. nbms efficiently neutralized the cell entry of live mers-cov. moreover, nbms showed potent prophylactic and therapeutic efficacy in protecting hdpp -transgenic mice against mers-cov infection [ ] . given their exceptionally high neutralization activity in vitro and in vivo, these newly identified neutralizing mabs are promising candidate therapeutics against mers-cov infection. however, the use of a single neutralizing antibody bears the risk of selecting escape mutants, which has been observed for lca and other described antibodies [ , , ] . notably, the majority of these escape mutations had little impact on viral fitness and the interaction of dpp with the rbd [ ] . moreover, mutants of mers-cov during natural infection have also been reported [ ] . escape from neutralization is a major concern with therapeutic neutralizing mabs; however, this potential problem can be solved by combining mabs that target distinct epitopes and show different neutralizing mechanisms [ ] .
this strategy can take advantage of the synergistic effects while decreasing the possibility of viral escape. currently, most of the mers-cov neutralizing mabs compete with dpp for binding to the rbd, and residues on the rbd critical for mab neutralization are identified by mutant analysis. almost all of the residues identified as critical for mab neutralization are located in the rbm, and overlap with those critical for dpp binding ( figure b) . as crystal structures of mab fab-rbd complexes become available, the neutralization mechanisms of these mabs will be better illustrated. based on the crystal structure of rbd-dpp , it was found that several conserved residues in the rbd are critical for the interaction of the rbd with dpp (y , l , d , e , e , d , d , y , r , w , and v ) [ , ] . development of therapeutic neutralizing mabs targeting those critically conserved residues might be important for combating mers-cov. moreover, a study found that a mouse-derived neutralizing mab, f , which bound to a possible linear epitope in the ntd of the mers-cov s subunit, exhibited efficient neutralizing activity against pseudovirus and live mers-cov in cell entry tests. this study highlighted the important role of the ntd during the infection process of mers-cov. the ntd might have significant implications for the development of prophylactic and therapeutic mabs against mers-cov infection [ ] . although the in vitro neutralizing potency of f was approximately -fold lower than that of the rbd-targeting neutralizing mabs [ ] , it may provide an alternative for immunotherapy against mers-cov once the virus mutates and is no longer susceptible to rbd-specific mabs. so far, there is a lack of appropriate animal models to mimic the pathology of mers-cov in humans. commonly used laboratory animals, such as wild-type mouse, ferret, hamster, and guinea pig, are not susceptible to mers-cov infection due to differences in critical amino acids in the s-binding domain of their dpp [ ] [ ] [ ] .
new zealand rabbits, hdpp -transduced/transgenic mice, camelids and non-human primates (rhesus macaque and common marmoset) are susceptible to mers-cov infection. however, rabbits showed asymptomatic infection [ ] ; dromedary camels displayed clinical manifestations different from those of humans [ ] ; rhesus macaque only showed transient lower respiratory infection [ ] , while common marmoset developed progressive pneumonia [ ] ; hdpp -transgenic mouse expressed hdpp extensively, which resulted in multiple organ damage [ ] ; hdpp -transduced mouse only exhibited mild transient clinical diseases [ ] . with robust animal models, the protective effects of these neutralizing mabs will be better evaluated. furthermore, ongoing efforts on developing therapeutic neutralizing mabs against mers-cov should also consider the different target populations (dromedary camels and humans) and their protective efficacy. isolation of a novel coronavirus from a man with pneumonia in saudi arabia evidence for zoonotic origins of middle east respiratory syndrome coronavirus middle east respiratory syndrome coronavirus: risk factors and determinants of primary, household, and nosocomial transmission probable transmission chains of middle east respiratory syndrome coronavirus and the multiple generations of secondary infection in south korea human monoclonal antibodies as candidate therapeutics against emerging viruses.
front genomic characterization of a newly discovered coronavirus associated with acute respiratory distress syndrome in humans mers-cov spike protein: a key target for antivirals structure of mers-cov spike receptor-binding domain complexed with human receptor dpp dipeptidyl peptidase is a functional receptor for the emerging human coronavirus-emc a humanized neutralizing antibody against mers-cov targeting the receptor-binding domain of the spike protein a conformation-dependent neutralizing monoclonal antibody specifically targeting receptor-binding domain in middle east respiratory syndrome coronavirus spike protein identification of human neutralizing antibodies against mers-cov and their role in virus adaptive evolution b -n, a monoclonal antibody against mers-cov, reduces lung pathology in rhesus monkeys following intratracheal inoculation of mers-cov jordan-n / . virology exceptionally potent neutralization of middle east respiratory syndrome coronavirus by human monoclonal antibodies prophylaxis with a middle east respiratory syndrome coronavirus (mers-cov)-specific human monoclonal antibody protects rabbits from mers-cov infection passive transfer of a germline-like neutralizing human monoclonal antibody protects transgenic mice against lethal middle east respiratory syndrome coronavirus infection efficacy of antibody-based therapies against middle east respiratory syndrome coronavirus (mers-cov) in common marmosets prophylactic and postexposure efficacy of a potent human monoclonal antibody against mers coronavirus rapid generation of a mouse model for middle east respiratory syndrome rapid generation of a human monoclonal antibody to combat middle east respiratory syndrome pre-and postexposure efficacy of fully human antibodies against spike protein in a novel humanized mouse model of mers-cov infection prophylactic and therapeutic efficacy of mab treatment against mers-cov in common marmosets human neutralizing monoclonal antibody inhibition of middle 
east respiratory syndrome coronavirus replication in the common marmoset potent neutralization of mers-cov by human neutralizing monoclonal antibodies to the viral spike glycoprotein structural definition of a unique neutralization epitope on the receptor-binding domain of mers-cov spike glycoprotein ultrapotent human neutralizing antibody repertoires against middle east respiratory syndrome coronavirus from a recovered patient a novel human mab (mers-gd ) provides prophylactic and postexposure efficacy in mers-cov susceptible mice naturally-occurring antibodies devoid of light-chains molecular basis for the preferential cleft recognition by dromedary heavy-chain antibodies nanobodies: natural single-domain antibodies application of camelid heavy-chain variable domains (vhhs) in prevention and treatment of bacterial and viral infections nanobodies(r) as inhaled biotherapeutics for lung diseases generation and characterization of alx- , a potent novel therapeutic nanobody for the treatment of respiratory syncytial virus infection chimeric camel/human heavy-chain antibodies protect against mers-cov infection molecular basis of binding between novel human coronavirus mers-cov and its receptor cd a novel nanobody targeting middle east respiratory syndrome coronavirus (mers-cov) receptor-binding domain has potent cross-neutralizing activity and protective efficacy against mers-cov importance of neutralizing monoclonal antibodies targeting multiple antigenic sites on mers-cov spike to avoid neutralization escape severe respiratory illness caused by a novel coronavirus identification of residues on human receptor dpp critical for mers-cov binding and entry a novel neutralizing monoclonal antibody targeting the n-terminal domain of the mers-cov spike protein wild-type and innate immune-deficient mice are not susceptible to the middle east respiratory syndrome coronavirus the middle east respiratory syndrome coronavirus (mers-cov) does not replicate in syrian hamsters 
adenosine deaminase acts as a natural antagonist for dipeptidyl peptidase -mediated entry of the middle east respiratory syndrome coronavirus asymptomatic middle east respiratory syndrome coronavirus infection in rabbits experimental infection of dromedaries with middle east respiratory syndrome-coronavirus is accompanied by massive ciliary loss and depletion of the cell surface receptor dipeptidyl peptidase an animal model of mers produced by infection of rhesus macaques with mers coronavirus infection with mers-cov causes lethal pneumonia in the common marmoset multi-organ damage in human dipeptidyl peptidase transgenic mice infected with middle east respiratory syndrome-coronavirus funding: this study was supported by a grant from national natural science funds of china (nos. ). the authors declare that they have no conflict of interest. key: cord- - qzo v authors: wang, yunfei; wang, lichun; cao, han; liu, cunbao title: sars‐cov‐ s is superior to the rbd as a covid‐ subunit vaccine antigen date: - - journal: j med virol doi: . /jmv. sha: doc_id: cord_uid: qzo v since its emergence in december , severe acute respiratory syndrome coronavirus (sars‐cov‐ ) has developed into a global pandemic within a matter of months. while subunit vaccines are one of the prominent options for combating coronavirus disease (covid‐ ), the immunogenicity of spike protein‐based antigens remains unknown. when immunized in mice, the s domain induced much higher igg and iga antibody levels than the rbd and more efficiently neutralized sars‐cov‐ when adjuvanted with alum. it is inferred that a large proportion of these neutralization epitopes are located in the s domain but outside the rbd and that some of these are spatial epitopes. this finding indicates that expression systems with posttranslational modification abilities are important to maintain the natural configurations of recombinant spike protein antigens and are critical for effective covid‐ vaccines. 
further, adjuvants prone to a th response should be considered for s ‐based subunit covid‐ vaccines to reduce the potential risk of antibody‐dependent enhancement (ade) of infection. this article is protected by copyright. all rights reserved. biology, chinese academy of medical sciences & peking union medical college (imb, cams). animals were randomly divided into groups with mice in each group (n= ). antigens were diluted to μg/mouse/dose in μl of pbs and mixed with the same volume of alum adjuvant (thermofisher scientific) prior to immunization. thus, μl of immunogens were administered intramuscularly into the thigh muscle three times at week intervals. weeks after the final immunization, mice were anesthetized with ketamine, and blood was collected via cardiac puncture. after clotting at °c overnight, serum was collected by centrifugation at rpm for min and pooled by group. all experiments were performed in compliance with the guiding principles for the care and use of laboratory animals of the animal ethics committee of the imb, cams (permit number: scxk (dian) k - ). ninety-six-well plates were coated with μg/ml hek k cell-expressed recombinant sars-cov- s or rbd proteins overnight at °c. plates were washed once with wash buffer (pbs containing . % (v/v) polysorbate ) and then blocked with % (w/v) skim milk dissolved in wash buffer for hour at °c. plates were then washed times and incubated with serially diluted mouse sera for hour at °c. next, plates were washed times and incubated with goat anti-mouse igg/iga/igg /igg a hrp-conjugated secondary antibodies (thermofisher scientific) for hour at °c. following additional washes, , ', , '-tetramethylbenzidine (tmb, bd bioscience) substrate was added. the plate was incubated at room temperature in the dark for min, and reactions were stopped by the addition of m sulfuric acid.
absorbance ( nm) was detected using a microplate reader (bio-tek instruments, inc). antibody titers were defined by end-point dilution with a cut-off signal of od = . . serum samples that did not produce an od> . at : were determined as . the igg -to-igg a titer ratio was calculated to evaluate th -th balance , . for neutralization, mouse sera were diluted with dmem in a two-fold series. then, μl of sars-cov- diluted with dmem to . lg ccid was added to μl of diluted serum, incubated at °c for hour, and then added to μl of co for days, the neutralization titer was reported as the serum dilution at which sars-cov- infection was inhibited by %. all sars-cov- manipulations were carried out in a biosafety level (bsl- ) laboratory at imb, cams. data are shown as the mean and standard deviation. graphpad prism . (san diego, ca, usa) was used for statistical analyses. here, we fused s and rbd to the carboxyl terminus of the norovirus shell domain, which has been reported to present recombinant expressed proteins on the surface of virus-like particles to enhance the immunity of recombinant proteins , . while both rbd and s were expressed well with the norovirus shell domain (s-rbd and s-s , respectively), as confirmed by the corresponding bands (~ kda for s-rbd and ~ kda for s-s ) by sds-page ( figure a ) and western blot ( figure b ), both were expressed as inclusion bodies. following sonication, washing and dialysis, s-rbd showed quite high purity (lane s-rbd in figure a ), whereas s-s showed only approximately % purity by sds-page (lane s-s in figure a ). transmission electron microscopy showed that after dialysis, only a small portion of the s-rbd and s-s fusion proteins formed similar but not identical virus-like particles with diameters of approximately - nm (shown by arrows in figure c & d) , while the majority of these recombinant proteins formed irregular aggregates ( figure c & d) .
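the end-point titer and titer-ratio calculations described in the methods can be sketched as follows. this is a minimal illustration, not the authors' analysis script: it assumes the common convention that the end-point titer is the highest reciprocal dilution whose od still exceeds the cut-off, and every name and number in it (`twofold_series`, `endpoint_titer`, `titer_ratio`, `floor_titer`, the od readings and the cut-off) is a hypothetical placeholder, since the actual cut-off, starting dilution and floor value are not recoverable from the text.

```python
from typing import Optional, Sequence


def twofold_series(start: float, steps: int) -> list:
    """Reciprocal dilutions of a two-fold series, e.g. 100, 200, 400, ..."""
    return [start * 2 ** i for i in range(steps)]


def endpoint_titer(
    dilutions: Sequence[float],
    ods: Sequence[float],
    cutoff: float,
    floor_titer: Optional[float] = None,
) -> Optional[float]:
    """Highest reciprocal dilution whose OD exceeds the cut-off.

    If no dilution exceeds the cut-off, return floor_titer, a hypothetical
    value assigned to non-responding sera.
    """
    if len(dilutions) != len(ods):
        raise ValueError("dilutions and ods must have the same length")
    positive = [d for d, od in zip(dilutions, ods) if od > cutoff]
    return max(positive) if positive else floor_titer


def titer_ratio(titer_a: float, titer_b: float) -> float:
    """Ratio of two subclass titers, used as a rough Th-balance indicator."""
    if titer_b <= 0:
        raise ValueError("denominator titer must be positive")
    return titer_a / titer_b


# Illustrative values only; none of these numbers come from the paper.
dilutions = twofold_series(100, 5)
readings = [1.2, 0.8, 0.4, 0.1, 0.05]
titer = endpoint_titer(dilutions, readings, cutoff=0.15, floor_titer=50)
```

a serum whose readings never exceed the cut-off receives `floor_titer`, mirroring the rule that non-responding samples are assigned a fixed value; the ratio of two subclass titers computed this way is what the results compare between immunization groups.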
sars-cov- s induces higher igg and iga titers than rbd weeks after the third intramuscular immunization (figure a) , both s -specific (s -coated plate in figure ) and rbd-specific (rbd-coated plate in figure ) antibodies were analyzed. hek k cell-expressed recombinant s (s immunized) and e. coli-expressed norovirus shell domain-s fusion protein (s-s immunized) induced similar s -specific igg titers ( ) and similar rbd-specific titers ( ) ( figure b ). hek k cell-expressed recombinant rbd (rbd immunized) induced low s -specific igg titers ( ) and rbd-specific igg titers ( ), implying low immunogenicity of the rbd alone. unlike s (similar igg titers between s and s-s may be attributed to low purity and thus low s content in s-s ), fusion of the rbd with the norovirus shell domain (s-rbd immunized) elevated both rbd-specific igg titers (from to ) and s -specific igg titers (from to ). as sars-cov- is a respiratory virus, mucosal immunity is important to fight infection and we therefore measured iga titers ( figure c ). while both hek k cell-expressed recombinant s (s immunized) and e. coli-expressed norovirus shell domain-s fusion protein (s-s immunized) induced equivalent levels of s -specific iga titers and igg titers ( ), both hek k cell-expressed recombinant rbd (rbd immunized) and e. coli-expressed norovirus shell domain-rbd fusion proteins (s-rbd immunized) induced half the level of s -specific iga titers as igg titers ( vs for rbd and vs for s-rbd, respectively). while rbd-specific iga titers were the lowest of all the igg and iga titers tested, the norovirus shell domain tended to elevate rbd-specific iga titers (rbd coated, s-rbd immunized vs rbd immunized in figure c ), as was observed for the s -specific iga titers (s -coated, s-rbd immunized vs rbd immunized in figure c ).
sars-cov- s induced more balanced th -th responses than the rbd similar to s -specific total igg titers ( figure b , s -coated), both hek k cell-expressed recombinant s (s immunized) and e. coli-expressed norovirus shell domain-s fusion protein (s-s immunized) induced the highest s -specific igg ( figure a , s coated) and igg a ( figure b , s coated) titers and comparably low rbd-specific igg ( figure a , rbd coated) and igg a ( figure b , rbd coated) titers. hek k cell-expressed recombinant rbd (rbd immunized) and e. coli-expressed norovirus shell domain-rbd fusion proteins (s-rbd immunized) induced low levels of igg and igg a titers specific to both s (s coated in figure a&b ) and the rbd (rbd coated in figure a&b ). notably, the igg titers in each group in figure a are considerably higher than the igg a titers in figure b . to enable direct comparisons between groups, we compared the igg /igg a ratios in each group induced by their own antigens ( figure c ). while both hek k cell-expressed recombinant s (s immunized) and e. coli-expressed norovirus shell domain-s fusion proteins (s-s immunized) induced an igg /igg a ratio of , hek k-expressed recombinant rbd (rbd immunized) induced an igg /igg a ratio as high as . this ratio could be lowered by ligation to the e. coli-expressed norovirus shell domain (the s-rbd immunized igg /igg a ratio was ) but was still higher than that in the s and s-s immunized groups. higher igg -to-igg a ratios, including those for the s and s -rbd groups, imply a th -biased immune response for these antigens. igg a subtype antibodies, which are present in neutralizing sera . this result is consistent with the observed eosinophilic infiltration following vaccination and virus exposure, a typical characteristic of th immune responses with elevated igg /igg a proportions , .
unfortunately, both sars-cov- s and rbd showed a th -like immune response with high proportions of igg when immunized with alum as adjuvant (figure ) , implying a similar immunopathological risk to those reported for other coronaviruses. though no ade has been reported in animal models re-exposed to sars-cov- or exposed following vaccine immunization, special attention should be paid to th -biased adjuvants, as reported in the development of sars-cov- and mers-cov vaccines , . in conclusion, the sars-cov- s domain is more immunogenic than the rbd domain, inducing higher igg and iga antibodies and also efficient virus neutralization antibodies. we infer that a large proportion of these neutralization epitopes exist within the s domain but outside of the rbd and that some of these are spatial epitopes. while s induced a more balanced th /th response than the rbd when adjuvanted with alum, increased levels of igg antibodies still indicate a potential risk of ade, and adjuvants prone to a th response should be considered for s subunit-based covid- vaccines. no potential conflicts of interest. 
genomic characterisation and epidemiology of novel coronavirus: implications for virus origins and receptor binding a new coronavirus associated with human respiratory disease in covid- in remdesivir in adults with severe covid- : a randomised, double-blind, placebo-controlled, multicentre trial clinical benefit of remdesivir in rhesus macaques infected with sars-cov- vaccine development and therapeutic design for -ncov/sars-cov- : challenges and chances the early landscape of covid- vaccine development in the uk and rest of the world molecular basis of coronavirus virulence and vaccine development sars vaccine development prospects for a mers-cov spike vaccine sars immunity and vaccination antigenic and immunogenic characterization of recombinant baculovirus-expressed severe acute respiratory syndrome coronavirus spike protein: implication for vaccine design effects of toll-like receptor stimulation on eosinophilic infiltration in lungs of balb/c mice immunized with uv-inactivated severe acute respiratory syndrome-related coronavirus vaccine immunization with sars coronavirus vaccines leads to pulmonary immunopathology on challenge with the sars virus immunization with inactivated middle east respiratory syndrome coronavirus vaccine leads to lung immunopathology on challenge with live virus gold nanoparticle-adjuvanted s protein induces a strong antigen-specific igg response against severe acute respiratory syndrome-related coronavirus infection, but fails to induce protective antibodies and limit eosinophilic infiltration in lungs the spike protein of sars-cov--a target for vaccine and therapeutic development mers-cov spike protein: a key target for antivirals severe acute respiratory syndrome (sars) coronavirus: application of monoclonal antibodies and development of an effective vaccine receptor-binding domain of sars-cov spike protein induces long-term protective immunity in an animal model recombinant receptor binding domain protein induces partial protective 
immunity in rhesus macaques against middle east respiratory syndrome coronavirus challenge receptor-binding domain of mers-cov with optimal immunogen dosage and immunization interval protects human transgenic mice from mers-cov infection

key: cord- -uea kwge authors: shehata, mahmoud m.; mostafa, ahmed; teubner, lisa; mahmoud, sara h.; kandeil, ahmed; elshesheny, rabeh; boubak, thamer a.; frantz, renate; pietra, luigi la; pleschka, stephan; osman, ahmed; kayali, ghazi; chakraborty, trinad; ali, mohamed a.; mraheil, mobarak abu title: bacterial outer membrane vesicles (omvs)-based dual vaccine for influenza a h n virus and mers-cov date: - - journal: vaccines (basel) doi: . /vaccines sha: doc_id: cord_uid: uea kwge

vaccination is the most functional medical intervention to prophylactically control severe diseases caused by human-to-human or animal-to-human transmissible viral pathogens. annually, seasonal influenza epidemics attack human populations leading to – thousand deaths/year worldwide. recently, a novel middle east respiratory syndrome coronavirus emerged. together, those two viruses present a significant public health burden in areas where they circulate. herein, we generated a bacterial outer membrane vesicles (omvs)-based vaccine presenting the antigenically stable chimeric fusion protein of the h -type haemagglutinin (ha) of the pandemic influenza a virus (h n ) strain from (h n pdm ) and the receptor binding domain (rbd) of the middle east respiratory syndrome coronavirus (mers-cov) (omvs-h /rbd). our results showed that the chimeric antigen could induce specific neutralizing antibodies against both strains, leading to protection of immunized mice against h n pdm and efficient neutralization of mers-cov. this study demonstrates that omvs-based vaccines presenting viral antigens provide a safe and reliable approach to protect against two different viral infections.
acute respiratory infections are among the leading causes of disease and mortality in developing and developed countries [ , ] . the severity of these acute infections is usually potentiated following the dissemination of the infection throughout the lower respiratory tract, leading to millions of human deaths worldwide each year [ ] . annually, seasonal influenza epidemics attack - % of the human population leading to - thousand deaths/year worldwide [ ] . beside these epidemics, the world is confronted every - years with antigenically distinct pandemic influenza virus strains of wide geographical distribution and considerable human-to-human transmissibility resulting in high mortality rates [ ] . in recent years, the world has been challenged with newly emerging influenza a virus (iav) infections, which have the potential to cause sporadic fatalities in the human population within limited epidemics. for instance, highly pathogenic avian influenza viruses (hpaiv) of the h n -subtype and low pathogenic avian influenza viruses (lpaiv) of the h n -, h n -and h n -subtypes [ ] [ ] [ ] [ ] have caused sporadic human infections. in addition, other iavs caused global pandemic outbreaks, such as the swine-origin h n influenza virus (h n pdm ) [ , ] . in , a novel middle east respiratory syndrome coronavirus (mers-cov) emerged. by february , a total of laboratory-confirmed human cases, including associated deaths, were reported globally in countries (case-fatality rate: . %) [ ] . the majority of these cases were reported from the arabian peninsula, specifically saudi arabia (cases = ; deaths = ; case-fatality rate = . %) [ ] . to combat iav and mers-cov infections, vaccination represents an affordable and a facile way to protect against devastating epidemics and occasional pandemics. however, despite significant efforts to develop a safe and effective vaccine [ ] , there are no approved vaccines for mers-cov till now. 
recent reports have also demonstrated that replication of recombinant iav vaccine strains in either embryonated eggs or in cell-culture systems allows viral adaptation, which may affect the antigenicity of the vaccine [ ] [ ] [ ] . therefore, genetically and phenotypically stable vaccines represent a promising alternative to control iav and mers-cov infections [ ] . outer membrane vesicles (omvs) are natural, spherical nanoparticles ( - nm) derived from gram-negative bacteria. omvs are released from both pathogenic and non-pathogenic bacteria and are highly immunogenic due to their components, including lipopolysaccharides (lps), bacterial outer membrane (om) proteins, lipids, immunogenic toxins, dna/rna and other periplasmatic and cytoplasmatic proteins [ , ] . omvs from pathogenic bacteria have been commercially used to induce specific antibodies against different bacterial strains, including neisseria meningitidis serogroup b [ ] . the composition of the omvs can be adapted and used as a vaccine platform via incorporation of heterologous antigens into the vesicles [ ] . this engineering approach is advantageous because (i) it retains the antigens in their native conformation, (ii) it enables the omvs to target specific immune responses, and (iii) it provides multiple and commensurate protein antigens in a single production process [ ] . however, bacteria-based vaccines are not well explored to deliver viral antigens. therefore, we engineered a stable omvs-based dual vaccine against h n pdm and mers-cov by producing omvs with a chimeric hemagglutinin (ha) comprising of both ha and ha from the h n pdm and the receptor binding domain (rbd) of mers-cov. mers-cov strain was isolated and grown in vero-e cells. the two viruses were used for preparation of the omvs-based dual vaccine and inactivated vaccines were used as positive control. 
to construct the pmp-h /rbd plasmid, three pcr fragments (f , f , and f ) encompassing (i) the '-ncr and signal peptide of ha from cal-h n pdm , (ii) the rbd of mers-cov and a -amino acid peptide linker (gsagsag), and (iii) the coding sequence and '-ncr of ha were amplified with sequence-specific primers (table ) and phusion high-fidelity pcr master mix with hf buffer (invitrogen, carlsbad, ca) and then simultaneously ligated into the linearized pmpccdb vector [ ]. briefly, for the pcr amplification of each fragment, µl of × phusion master mix, . µl of forward and reverse primers ( µm/µl), and ng of the corresponding template dna were mixed, and the reaction was then brought to a total volume of µl using rnase-/dnase-free ddh o. the plasmid pmp-ha-gi [ ] encoding the ha of cal-h n pdm was used as template dna for f and f , while the plasmid pcdna . -spike-mers-cov encoding the spike protein from the isolate mers-cov/camel/egypt/hku-nrce- / was used as a template for f . the pcr settings were: °c for min then steps of cycles ( °c for s, °c for s, and °c for min), with a final extension step at °c for min. the three amplified pcr fragments were then loaded onto a % agarose gel for electrophoresis. the three specific fragments were separated and purified using the qiaquick gel purification kit according to the manufacturer's instructions (qiagen, germany). after purification, the three fragments were digested with the corresponding restriction enzymes shown in table . ligation of the three fragments and the linearized vector was performed using t dna ligase (promega, madison, wi, usa) by adding µl of each purified fragment ( ng/µl) to µl × buffer, µl t dna ligase, and µl linearized vector ( ng/µl). the mixture was then incubated overnight at °c. transformation of escherichia coli dh -α competent cells was performed by mixing µl of the ligation reaction with µl bacterial suspension (invitrogen, ca, usa) and subsequent incubation on ice for min.
the bacterial cells were then subjected to heat shock at • c for s in a water bath and were then chilled on ice for min before adding µl of soc media (invitrogen, ca, usa). the reaction tubes were then rotated at rpm in a shaking incubator at • c for h. after an incubation time of h, µl of the transformed bacterial suspension was spread on ampicillin containing luria-bertani (lb) agar plate and incubated for h at • c. single colonies were then selected and incubated in ml liquid lb for h for subsequent plasmid isolation and the correct e. coli dh ß competent bacteria (invitrogen, ca, usa) were transformed with pmp-h /rbd according to manufacturer's instructions as described above with ng of pmp-h /rbd plasmid. individual colonies from ampicillin containing lb agar plate were picked and incubated in ml liquid lb for h. these cultures were then used to inoculate the large-volume cultures in the next step. omvs are typically purified from supernatants of transformed e. coli dh ß cells. to this point, liters ( × ml) of dh ß cultures (inoculated with ml of the starter culture) were grown in lb broth at • c in an orbital shaking incubator at rpm until reaching the exponential phase (od nm . ). the grown bacteria were pelleted at × g for min and the supernatant was sterile-filtered (millipore express plus membrane filter, pes, . µm) to remove residual bacteria. afterwards, the bacteria free supernatant was concentrated by ultrafiltration using krosflo research ii tff and a kda hollow fiber membrane (spectrum labs, germany) to a final volume of ml. the resulting filtrate ( ml) was subjected to further ultracentrifugation at , × g for h and • c in a sw ti rotor (beckman, ga, usa) to separate the omvs fraction. subsequently, the omvs containing pellet was resuspended in µl pbs (dulbecco's phosphate buffered saline, biochrom gmbh), sterile filtered (millex-gv syringe filter unit, pvdf, . µm) and stored at − • c until use. 
the amount of isolated omvs was quantified by protein concentration measurement using bradford protein assay. enriched omvs ( µg) from cultured dh ß, either transformed with empty vector (control) or with pmp-h /rbd, were mixed with µl × sds sample buffer ( % glycerol, mm tris/hcl (ph . ), % sds, . % bromophenol blue, and % β-mercaptoethanol) and incubated for min at • c. the omvs samples were then separated on precast gradient nupage ® novex ® - % bis-tris protein gels (invitrogen, usa) and subsequently transferred onto immobilon-fl polyvinylidene fluoride (pvdf) membranes (merck millipore). following protein transfer, the pvdf membrane was blocked using blocking buffer ( × tbs ( mm tris-hcl, ph . , mm nacl) containing % non-fat dry milk) for h at room temperature (rt). the membrane was washed once for min using washing buffer ( × tbs-tween ( mm tris-hcl, ph . , mm nacl, . % tween )). afterwards, detection of the viral ha protein was achieved using rabbit monoclonal antibodies recognizing influenza a virus hemagglutinin (abcam), diluted : in blocking buffer. h later, the membrane was washed three times for min with washing buffer. next, the membranes were incubated in the dark for h with the corresponding goat anti-rabbit irdye (li-cor, nebraska, usa), diluted : , in blocking buffer containing a : dilution of % sds. after three washing steps ( min each), twice with washing buffer and once with × tbs, the proteins were visualized using an odyssey infrared imaging system and application software package (li-cor, nebraska, usa). the propagated virus was inactivated using . % formaldehyde in • c for h. to ensure that there are no active viral particles following inactivation process of the inactivated control vaccines, mdck and vero e cells were inoculated with µl of the inactivated strains. about h post-inoculation, the cell-culture supernatant was then tested using either ha assay (for h n ) or plaque assay (for mers-cov). 
a volume of ml of the inactivated viral harvest was then carefully layered with ml of % sucrose in an ultra-centrifugation tube and centrifuged in a sorvall mtx ultracentrifuge (thermo scientific, ca, usa) at , rpm for h at • c. the pellets were further resuspended in µl × pbs. the required amounts of viral antigen (µg) of each virus were mixed with imject alum adjuvant (invitrogen, ca, usa) in a ratio of : (v/v). the final antigen/adjuvant combination was continuously mixed for min under cooling conditions to effectively adsorb the antigen into the surface of the adjuvant and generate optimal vaccine formulation. female balb/c mice ( - week-old) were reared and supplied from the animal house at the national research centre (nrc), egypt. mice were divided into groups ( mice/group). two groups of mice were intramuscularly injected with µg of omvs-h /rbd and omvs-empty. three other groups were used as controls including negative control group that was injected with sterile pbs and two positive control groups that were injected either with inactivated h n pdm or inactivated mers-cov. all animals received booster immunizations after weeks. serum samples were collected at , , , , and weeks after prime immunization. all mice sera were separated and stored at − • c until used. sera collected from immunized/control mice were treated with receptor-destroying enzyme (rde) from vibrio cholerae (denka seiken, tokyo, japan) and kept overnight at • c. the rde was then inactivated by incubation at • c for h. diluted sera were incubated with four ha units of h n pdm and a . % suspension of chicken red blood cells, incubated for h at rt. hai titer is the reciprocal value of the dilution at which no agglutination was observed. titers < : were considered as negative. plaque-reduction neutralization test (prnt) assay was performed to determine the efficacy of stimulated antibodies in sera from vaccinated/control balb/c mice to neutralize mers-cov. 
briefly, sera were inactivated by heating at °c in a water bath for min. sera were serially diluted two-fold from : to : in µl of dmem/ % fbs. an equal amount of plaque forming units in µl dmem/ % fbs was added to the serum dilutions. the serum/virus mixtures were then incubated at °c for h in a humidified incubator with % co . afterwards, µl of each dilution was inoculated into individual wells of -well tissue culture plates with confluent vero-e cell monolayers and incubated at °c for h. the plates were gently rocked every min to avoid cell drying. after h of virus adsorption, the inoculum was removed gently from the infected cell monolayers, which were washed with × pbs and covered with an overlay containing × mem media, % agar, and % penicillin/streptomycin (pen/strep). the plates were left to solidify and incubated upside down at °c with % co until viral plaques were visible ( days). the cell monolayers were then fixed with . % formaldehyde solution for h at rt, stained with % crystal violet solution (in % methanol) for min at rt, and washed with water to visualize the plaques. the percentage of plaque reduction was calculated as follows:

\[ \%\ \text{plaque reduction} = \frac{\text{virus control plaque count} - \text{sample plaque count}}{\text{virus control plaque count}} \times 100 \]

the prnt is defined as the reciprocal of the antibody dilution required to reduce the number of mers-cov plaques in vero-e cells by % relative to the control wells.

eight weeks after prime immunization, mice were anesthetized by intra-peritoneal injection of ketamine solution, with doses adjusted to individual body weight ( µg/g). an infectious dose of . tcid of influenza virus a/california/ / (h n ) wild type was administered intra-nasally to all vaccinated and control groups. to ensure full separation between the groups and the absence of natural infection, an additional control group of five mice was added and not subjected to infection.
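the plaque-reduction calculation described in the prnt procedure above can be sketched in a few lines of python. this is a minimal illustration, not the authors' analysis script: the function names, the example plaque counts, and the `cutoff_percent` parameter are assumptions introduced here (the cutoff is left as a parameter because the percentage used in the text did not survive extraction), and the titer is taken as the highest reciprocal dilution that still meets the cutoff, which is one common reading of the definition.

```python
def percent_plaque_reduction(control_count, sample_count):
    """Percent plaque reduction relative to the virus-only control wells."""
    if control_count <= 0:
        raise ValueError("control plaque count must be positive")
    return 100.0 * (control_count - sample_count) / control_count


def prnt_titer(plaques_by_dilution, control_count, cutoff_percent):
    """Highest reciprocal serum dilution whose plaque reduction meets the cutoff.

    plaques_by_dilution maps a reciprocal dilution (e.g. 20 for 1:20) to the
    plaque count observed at that dilution. Returns None if no dilution
    reaches the cutoff.
    """
    passing = [
        reciprocal
        for reciprocal, count in plaques_by_dilution.items()
        if percent_plaque_reduction(control_count, count) >= cutoff_percent
    ]
    return max(passing) if passing else None


if __name__ == "__main__":
    # Hypothetical plaque counts, for illustration only (not data from the study).
    counts = {10: 2, 20: 10, 40: 30, 80: 45}
    print(percent_plaque_reduction(50, 10))
    print(prnt_titer(counts, control_count=50, cutoff_percent=50.0))
```

keeping the cutoff as an explicit argument lets the same helper serve for different reduction thresholds without changing the calculation itself.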
body weight was monitored daily, and mice showing a weight loss of more than % of their initial body weight were euthanized and recorded as dead. mice were kept under specific pathogen free (spf) conditions at the national research centre facility unit, egypt. all animal trials were conducted in accordance with the recommendations and guidelines of the egyptian animal welfare legislation. the ethics committee of the national research centre, egypt, approved the animal trial in mice (approval code: ).

all experiments with infectious virus were performed according to egyptian regulations for the propagation of influenza viruses. all experiments involving low pathogenic and highly pathogenic avian influenza a viruses were performed in biosafety level and (bsl , ) containment cabinets, respectively, approved for such use by the local authorities.

as schematically represented in figure a, the pmp-h /rbd construct comprises the '-non-coding region (ncr) and the signal peptide (sp) of the ha gene from a/giessen/ / (h n pdm ), followed by the receptor-binding domain (rbd) of the mers-cov spike gene (rbd: - nt = - amino acids (aa)), a -aa flexible linker peptide (lp: gsagsag, [ ]) and the coding sequence plus the '-ncr of the ha gene. the pmp-h /rbd is designed to link the rbd and ha fragments into a single polypeptide chain. the final fusion protein contains amino acid (aa) residues - of the rbd, aa residues - of the lp and aa residues - of the ha .

to produce outer-membrane vesicles (omvs) comprising an expressed mers-cov rbd and h n pdm ha (omvs-h /rbd) hybrid protein, the e. coli strain dh ß was transformed with the pmp-h /rbd plasmid. the presence of the rbd from mers-cov and the h -ha from h n pdm in the omvs particles was examined by immunoblot analysis using µg of purified omvs-h /rbd. the blotting pattern confirmed the presence of rbd-linked ha viral protein, corresponding to ha -rbd ( kda + kda equal kda). in contrast, omvs isolated from untransformed dh ß (omvs-empty) did not show any cross-reacting proteins (figure b).
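the construct description names a seven-residue flexible linker (gsagsag), and the figure legend for the construct gives its coding sequence as ggtagcgccggtagcgccgga. as a quick consistency check, the sketch below translates that nucleotide sequence with the standard genetic code. the function name and the deliberately partial codon table are choices made here for illustration; only the four codons that occur in the linker are included, so this is not a general-purpose translator.

```python
# Subset of the standard genetic code: only the codons present in the linker.
CODON_TABLE = {
    "ggt": "G",  # glycine
    "gga": "G",  # glycine
    "agc": "S",  # serine
    "gcc": "A",  # alanine
}


def translate(nucleotides):
    """Translate an in-frame coding sequence using the partial table above."""
    seq = nucleotides.lower()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


if __name__ == "__main__":
    linker_nt = "ggtagcgccggtagcgccgga"  # as given in the figure legend
    print(translate(linker_nt))  # prints GSAGSAG
```

the 21-nucleotide sequence reads in frame as seven codons, which matches the stated linker peptide.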
ncr: non-coding region, sp: signal peptide, rbd: receptor binding domain (mers-cov), l: nucleotide sequence of peptide linker (ggtagcgccggtagcgccgga), ha : hemagglutinin , and ha : hemagglutinin . (b) immunoblotting pattern of omvs, extracted either from various control/non-transformed omvs (c , c , and c ) (empty omvs) or pmp-h /rbd-transformed (omvs-h /rbd) dh ß (s -s ), against antiserum of swine ha antibody.

to assess the immunogenicity of the bivalent omvs-h /rbd preparations, female balb/c mice were immunized with µg/mouse of omvs-h /rbd and omvs-empty in comparison with inactivated h n pdm and inactivated mers-cov as positive controls and pbs as a negative control. mice received a booster dose three weeks after prime immunization (figure a). sera were collected every two weeks from week two to eight after prime immunization.

at two weeks post-vaccination, mice vaccinated with inactivated h n pdm virus or omvs-h /rbd revealed a - × log increase in hai antibody titer as compared to the control and omvs-empty groups (figure b). four weeks after vaccination we observed a drop in the hai titers in these mice due to the booster vaccination at week three.
interestingly, the two groups vaccinated with omvs-h /rbd or inactivated h n pdm showed a significant increase in geometric mean hai antibody titers to . ( . log ) and ( . log ) at week six, and at week eight, the geometric mean hai antibody titers decreased to . ( . log ) and ( . log ), respectively. these results revealed that the vaccinated mice had developed a strong immunogenic response against the h n pdm virus. in the negative pbs control group and in the omvs-empty group hai titers remained low, reflecting that all animals were indeed housed under influenza-free conditions (no natural infection).
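the hai results above are reported as geometric mean titers together with their log values. a minimal sketch of that summary statistic follows, assuming the logarithm is base 2 (the base did not survive extraction, but two-fold serial dilutions make log2 the usual choice). the function name and the example titers are hypothetical and are not data from the study.

```python
import math


def geometric_mean_titer(titers):
    """Return (geometric mean titer, mean log2 titer) for reciprocal titers."""
    if not titers:
        raise ValueError("need at least one titer")
    if any(t <= 0 for t in titers):
        raise ValueError("titers must be positive reciprocal dilutions")
    # Averaging on the log2 scale and back-transforming gives the geometric mean.
    mean_log2 = sum(math.log2(t) for t in titers) / len(titers)
    return 2 ** mean_log2, mean_log2


if __name__ == "__main__":
    # Hypothetical reciprocal HAI titers for one group of mice.
    gmt, gmt_log2 = geometric_mean_titer([40, 80, 160, 80, 40])
    print(f"GMT = {gmt:.1f} ({gmt_log2:.2f} log2)")
```

averaging on the log scale keeps a single high responder from dominating the group summary, which is why titers are usually reported this way rather than as arithmetic means.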
in addition, the omvs-h /rbd vaccinated mice showed a significant increase of neutralizing antibodies against the mers-cov strain hku-nrce- at week and reached the highest neutralizing titer ( . log ) at week eight compared to the control group (p < . ) (figures c and a). control ( × pbs), inactivated h n pdm and omvs-empty groups showed no neutralizing antibodies against mers-cov during the eight weeks of infection (figure b-d). on the other hand, a plaque reduction neutralization test (prnt ) using sera from mice vaccinated with inactivated mers-cov showed complete neutralization (prnt titer, approximately > : ) eight weeks after the first immunization with inactivated mers-cov (figure e).

to investigate the protection level of vaccinated mice against h n pdm infection, vaccinated and control groups of balb/c mice ( weeks post-vaccination) (figure a) were infected with wild type cal-h n pdm virus. to ensure full separation between the groups and the absence of natural infection, an additional control group of five mice was added and not subjected to infection. both groups, vaccinated either with inactivated h n pdm or omvs-h /rbd, showed no weight loss until days p.i. (post infection) in comparison to the infected pbs control group. these results showed that vaccination with omvs-h /rbd or inactivated h n pdm virus protected all mice from body weight loss (bwl) (figure a) and mortality up to days post challenge infection (figure b). in contrast, the control group of pbs-treated mice infected with cal-h n pdm % exhibited a bwl of more than % from day four to six p.i. resulting in euthanasia (figure a).
the mortality rate in this control group was % four days p.i. and increased gradually to % at day p.i., and % at days p.i. (figure b). mortality in the omvs-empty group reached % at days p.i. and resulted in euthanasia of mice (bwl ≥ %) (figure b).

the continuous evolution of h n pdm in swine and human populations, and the recent emergence of mers-cov infections with a high mortality rate in humans, have raised awareness of both viruses as serious emergent global health topics [ , ]. since vaccination is the most important strategy to combat emerging human viral infections, an effective vaccine remains a necessity, particularly for mers-cov. the mers-cov spike (s) protein plays an essential role during virus entry through the binding of its antigenic rbd region to the dpp host cell receptor [ ]. the rbd is recognized as a major antigenic glycoprotein fragment for inducing potent humoral and cellular immune responses, including neutralizing antibodies (nabs) [ ] [ ] [ ] [ ] [ ] [ ]. traditionally, influenza vaccines are produced by generating a natural or recombinant reassortant iav expressing the immunogenic ha antigen [ , ]. the ha of iav comprises two subunits, ha and ha , hosting the antigenic sites to which specific and neutralizing antibodies are elicited to combat iav strains during vaccination or natural infection [ , ]. however, it was reported that the specificity of the vaccine produced in cell culture and embryonated eggs is occasionally impaired by amino acid (aa) changes, due to seed strain adaptation, with a drastic impact on vaccine effectiveness [ ] [ ] [ ]. the vaccine platform presented in this study depends on plasmid-based bacterial expression of recombinant viral antigen(s). these plasmids, encoding viral antigen(s), can be easily and quickly modified to insert non-synonymous changes in the encoding region of the antigen(s). additionally, bacterial expression has lower mutation rates than expression in eukaryotes [ ].
this platform can also serve as a base for incorporating combinations of different viral antigens to address additional vaccines needed to combat seasonal h n , h n and influenza b viruses. omvs have been introduced as part of novel vaccine formulations carrying antigenic proteins from diverse microorganisms, such as neisseria meningitidis b, vibrio cholerae, salmonella typhimurium, pseudomonas aeruginosa, gallibacterium anatis, acinetobacter baumannii, chlamydia trachomatis, shigella spp., and mycobacterium tuberculosis, that elicit protective responses in animal models [ ] [ ] [ ] [ ] [ ] [ ] [ ]. lps in the outer surface of omvs acts as a self-adjuvant that induces humoral and cellular immunity. therefore, omv vaccines may be used without extra adjuvant to increase immunogenicity and produce antiviral innate immune responses against various influenza virus infections via activation of macrophages [ ] [ ] [ ]. although the exact role of lps in the context of omv vaccines requires further investigation, high amounts of lps could be a drawback due to its known endotoxicity and ability to induce excessive secretion of pro-inflammatory cytokines [ ]. therefore, several ongoing investigations aim to produce genetically detoxified and less reactogenic lps to improve omv safety [ , , ]. additionally, modified bacterial strains such as clearcoli™ bl (de ), which do not trigger an lps-related immune response, can be applied for omv production [ ].
based on these observations, we engineered an antigenically stable and immunogenic omv-based bivalent vaccine that elicits protective antibodies (abs) following immunization to control infections with h n pdm and mers-cov. a recombinant construct comprising the ha of h n pdm fused to the rbd of the mers-cov s protein was expressed in an e. coli strain. the expressed bivalent antigens were incorporated within the released omvs (omvs-h /rbd). this novel chimeric omvs-h /rbd elicited high neutralizing ab titers against influenza h n virus at weeks post immunization. the stimulated neutralizing abs (humoral immunity), together with lps-induced cellular immunity, could fully protect immunized mice after challenge with h n pdm without significant loss in body weight. surprisingly, the non-specific cellular immunity induced by omvs-empty could partially protect the mice. this emphasizes the synergistic effect of the humoral and cellular immunity elicited upon vaccination with the chimeric omvs-h /rbd formulation. serum transfer experiments could further elucidate the role of humoral immunity independent of cellular immunity [ ].
additionally, omvs-h /rbd-vaccinated mice demonstrated a significant increase in the neutralizing ab titer against mers-cov ( : ) at week in comparison to the control group (figure c). these findings indicate that the omv vaccination platform can provide protection by efficient neutralization of invading h n pdm and mers-cov. the data presented in this study are consistent with recent reports describing the potential of omvs as biologically active, stable and highly immunogenic vaccines to protect against iavs. newly developed recombinant omvs bearing the conserved m e protein (omvs-m e) from iavs efficiently protected mice from h n - and h n -type iav challenge [ , ]. our study extends these studies and suggests the generation of omvs that incorporate combinations of different viral antigens to produce safe and efficient vaccines for animal husbandry and for humans. in summary, the results show that the generated omvs-h /rbd-based vaccine, which presents the antigenically stable chimeric fusion protein of the h -type ha of the pandemic influenza a virus (h n ) strain and the rbd of mers-cov, induces specific neutralizing antibodies against h n pdm and mers-cov, leading to protection of immunized mice against both viruses. these results demonstrate that omv-based vaccines presenting viral antigens have the potential to be a vaccine platform that provides simultaneous protection against two different viral infections.
bacterial and viral pathogen spectra of acute respiratory infections in under- children in hospital settings in dhaka city
estimates of the global, regional, and national morbidity, mortality, and aetiologies of lower respiratory tract infections in countries: a systematic analysis for the global burden of disease study
a history of influenza
h n avian flu infects humans for the first time
characterization of an avian influenza virus h n egyptian isolate
clinical and epidemiological characteristics of a fatal case of avian influenza a h n virus infection: a descriptive study
origin and molecular characterization of the human-infecting h n influenza virus in taiwan
outbreak of pandemic influenza a (h n ) at a new york city school
emergence and pandemic potential of swine-origin h n influenza virus
understanding the latest human coronavirus threat. viruses
influenza h n vaccines: recent challenges
contemporary h n influenza viruses have a glycosylation site that alters binding of antibodies elicited by egg-adapted vaccine strains
a structural explanation for the low effectiveness of the seasonal influenza h n vaccine
bioengineering bacterial outer membrane vesicles as vaccine platform
bacterial outer membrane vesicles: new insights and applications
meningococcal serogroup b vaccines: will they live up to expectations?
antibody-mediated immunity induced by engineered escherichia coli omvs carrying heterologous antigens in their lumen
improved dual promotor-driven reverse genetics system for influenza viruses
design of an ha -based escherichia coli expressed influenza immunogen that protects mice from pathogenic challenge
high prevalence of mers-cov infection in camel workers in saudi arabia
cross-sectional surveillance of middle east respiratory syndrome coronavirus (mers-cov) in dromedary camels and other mammals in egypt
dipeptidyl peptidase is a functional receptor for the emerging human coronavirus-emc
structural definition of a unique neutralization epitope on the receptor-binding domain of mers-cov spike glycoprotein
novel chimeric virus-like particles vaccine displaying mers-cov receptor-binding domain induce specific humoral and cellular immune response in mice
receptor-binding domain of mers-cov with optimal immunogen dosage and immunization interval protects human transgenic mice from mers-cov infection
a truncated receptor-binding domain of mers-cov spike protein potently inhibits mers-cov infection and induces strong neutralizing antibody responses: implication for developing therapeutics and vaccines
a novel nanobody targeting middle east respiratory syndrome coronavirus (mers-cov) receptor-binding domain has potent cross-neutralizing activity and protective efficacy against mers-cov
middle east respiratory syndrome coronavirus: a comprehensive review
the pb segment of an influenza a virus h n pdm isolate enhances the replication efficiency of specific influenza vaccine strains in cell culture and embryonated eggs
protective immunity based on the conserved hemagglutinin stalk domain and its prospects for universal influenza vaccine development
evolution of the mutation rate
comparison of intranasal outer membrane vesicles with cholera toxin and injected mf c. as adjuvants for malaria transmission blocking antigens anapn and pfs /
immunogenicity of vibrio cholerae outer membrane vesicles secreted at various environmental conditions
overexpression of mica induces production of ompc-enriched outer membrane vesicles that protect against salmonella challenge
immunogenicity and protective efficacy of vibrio cholerae outer membrane vesicles in rabbit model
evaluation of intranasal and subcutaneous route of immunization in neonatal mice using dodab-bf as adjuvant with outer membrane vesicles of neisseria meningitis b
vaccination with outer membrane vesicles and the fimbrial protein flfa offers improved protection against lesions following challenge with gallibacterium anatis
immunization with pseudomonas aeruginosa outer membrane vesicles stimulates protective immunity in mice
outer membrane vesicles as platform vaccine technology
engineered outer membrane vesicle is potent to elicit hpv e -specific cellular immunity in a mouse model of tc- graft tumor
bacterial outer membrane vesicles provide broad-spectrum protection against influenza virus infection via recruitment and activation of macrophages
lipopolysaccharide endotoxins
next-generation outer membrane vesicle vaccines against neisseria meningitidis based on nontoxic lps mutants
detoxifying escherichia coli for endotoxin-free production of recombinant proteins. microbial
endotoxin-free protein production-clearcoli™ technology (application note)
outer-membrane-vesicle-associated o antigen, a crucial component for protecting against bordetella parapertussis infection
recombinant m e outer membrane vesicle vaccines protect against lethal influenza a challenge in balb/c mice
safe recombinant outer membrane vesicles that display m e elicit heterologous influenza protection