key: cord-0873491-3i06ogc8 authors: Ni, Xiaomin; Schröder, Martin; Olieric, Vincent; Sharpe, May E.; Olmos, Victor; Proschak, Ewgenij; Merk, Daniel; Knapp, Stefan; Chaikuad, Apirat title: Structural insights into plasticity and discovery of remdesivir metabolite GS-441524 binding in SARS-CoV-2 macrodomain date: 2021-03-04 journal: bioRxiv DOI: 10.1101/2021.03.04.433966 sha: 0461ab323541987d6fbbcef4c08f2ba69e8e1b05 doc_id: 873491 cord_uid: 3i06ogc8 The nsP3 macrodomain is a conserved protein interaction module that plays essential regulatory roles in host immune response by recognizing and removing posttranslational ADP-ribosylation sites during SARS-CoV-2 infection. Thus, targeting this protein domain may offer a therapeutic strategy to combat the current and future virus pandemics. To assist inhibitor development efforts, we report here a comprehensive set of macrodomain crystal structures complexed with diverse naturally-occurring nucleotides, small molecules as well as nucleotide analogues including GS-441524 and its phosphorylated analogue, active metabolites of remdesivir. The presented data strengthen our understanding of the SARS-CoV-2 macrodomain structural plasticity and it provides chemical starting points for future inhibitor development. to previous report 20 , we observed diverse conformations of Phe132 and Ile131 located in the phosphate binding site ( Figure 1B ). The flexibility of the phenylalanine and isoleucine was rather interesting as it potentially determines the availability of the pocket to diverse ligands. This role was supported by conformational changes observed in the complexes with MES and HEPES molecules that were used as buffers in crystallization reagents and occupied the ribose-1-phosphate binding site ( Figure 1C and 1D ). Both buffer molecules bound within the phosphate binding side similarly to previous observation 20 , and caused conformational changes of loop β6-α5 residues, in essence an out-swing of Phe132 and a rotation of Ile131 side chain. In comparison, slight difference was observed for the binding of MES that induced also Ala129-Gly130 main chain flip, enabling two hydrogen bonds which were absent in the HEPES complex ( Figure 1C and 1D) . Overall, the structural changes seen in the apo state and both complexes with these weakly interacting ligands suggested by their high B-factors revealed therefore potential intrinsic plasticity of the macrodomain especially within its ribose-phosphate binding pocket, which may form a determinant factor for the binding of ligands. Figure 1 . Plasticity of the ADP-ribose binding pocket of SARS-CoV-2 macrodomain. A) Superimposition of eight molecules from the asymmetric units of two distinct apo crystal structures (pdb id 6ywk and 6ywm). B) Closed up of the ADP-ribose binding pocket of all eight molecules reveals flexibility of the residues lining the pocket, notably Phe132 and Ile131. C, D) Binding of HEPES (pdb id 6ywk) and MES (pdb id 6ywm) used as crystallization reagents within the ribose-phosphate biding site requires conformational changes of Phe132 and Ile131 necessary for an accommodation of the sulfate moieties of the ligands. To further understand the conformational changes necessary for the binding of ADP-ribose, we performed soaking of the ligand into the apo crystals, and compared the ligand-bound form (2.50-Å resolution) with the apo state. Interestingly, a number of significant side chain alterations were observed primarily at two regions: loop β6-α5 as well as loop β3-α2 that constructed the ribose-phosphate binding site (Figure 2A ). Substantial alterations were noted for the former structural motif, which included an ~56° outswing of Phe132 vacating the phosphate binding pocket in addition to a rotation of Ile131 side chain to pack on top of the terminal ribose and a main chain carbonyl flip of A129 to enable hydrogen bond interactions between Gly130 amine and the phosphate moiety of the ADP-ribose. At the opposing site of the binding pocket, small alterations of the glycine-rich loop β3-α2 were also evident, involving the rearrangement of Gly46-47 main chains bringing the loop closer to the ligand. Nevertheless, two configurations of this glycine-rich loop were observed for the ADP-ribose complex; one had Gly47 carbonyl atom flipped 'in' (two of five monomers) and the other conformation with an 'out' oriented carbonyl group (in the other three monomers; Figure 2B and 2C). These structural changes provided further evidence for high flexibility of this glycine-rich loop, which contains several catalytic residues important for enzymatic function 16 . Nonetheless, the two conformations of the loop did not alter the contacts between the terminal ribose and the protein, which were also highly conserved compared to similar complexes that have been reported previously 16, 20 . Plasticity of the ADP-ribose binding pocket of the macrodomain prompted us to question whether it might be able to accommodate other nucleosides and nucleotides. Thus, we attempted to co-crystallize the protein with selected naturally-occurring ligands, including AMP, GDP-glucose, ribose-1-phosphate, β-NAD and β-NADP, and all of which except ribose-1phosphate led to successful determination of complexed structures at 1.55-2.05-Å resolution. Examination of the electron density suggested however modification of the bound ligands (Figure 3 ). In the case of AMP, only the adenosine moiety, the product of autohydrolysis lacking the phosphate group, was presented in the structure. Interestingly, the adenosine ligand exerted two binding modes; one resembled that of the adenosine moiety in the ADP-ribose complex, whereas a slight tilt of the adenine ring in the second binding mode positioned the ribose group outside in the solvent exposed region ( Figure 3A and 3B). Although the hydrogen bond between the adenine amine group and Asp22 was still maintained, different water-mediated contacts between the ribose moiety and the protein were evident. For GDP-glucose, the electron density only allowed modelling of GMP, a product of hydrolysis. Nonetheless, the binding of GMP in the ribose-1-phosphate binding pocket of the macrodomain was intriguing ( Figure 3C ). The guanine ring surprisingly occupied the cavity reserved for the terminal ribose moiety of ADP-ribose, not the AMP-binding site likely due to a poor fit between the guanine 6-hydroxyl and 2-amine groups and Asp22 and the backbone amide of Val155 and Phe156 of loop β7-α6. The bound GMP was stabilized mainly through interactions of the guanine ring, which was sandwiched between Ile131 and the glycine-rich loop β3-α2 forming a network of hydrogen bonds to Lys44, Val49, Ala50 and Ala38. No contribution to binding was observed from the ribose and phosphate groups that were positioned outside the binding pocket in the solvent region. For β-NAD and β-NADP, the observed electron density suggested that the entities of the bound ligands were likely nicotinamide-cleaved products, resulting in the complex of ADPribose and ADP-ribose-2'-phosphate (ADPRP), respectively. Although hydrolase activity catalyzing NAD hydrolysis 25 has been documented for some macrodomains, we assumed that these cleavage products were likely a consequence of autohydrolysis of the compounds since no significant increase in ADP-ribose or ADPRP traces was observed in a HPLC assay upon an incubation of β-NAD or β-NADP with SARS-CoV-2 macrodomain (data not shown). Structural comparison demonstrated that both the NAD-cleaved product ADP-ribose and the NADPcleaved product ADPRP assumed the binding mode highly identical to ADP-ribose (Figure 2 and 3D). Nevertheless, a difference was noted for ADPRP that the 2'-phosphate group, which was located in the solvent region, could engage additional magnesium-mediated contacts with an inward side chain of Asp157 ( Figure 3D ). The flexibility of the ADP-ribose binding pocket with its ability to accept various ligands prompted us to speculate potential binding of anti-viral nucleoside analogues. We therefore selected a set of diverse clinically used antivirals, including abacavir, entecavir, acyclovir, gemcitabine, remdesivir and remdesivir metabolite GS-441524, and tested binding of these drugs using co-crystallization. Among the set, examination of electron density maps revealed that the remdesivir metabolite GS-441524 was the only ligand that showed binding in the crystal structures (2.15-Å resolution; Figure 4A and 4B). The interaction of this compound was rather unexpected as this metabolite and its active tri-phosphorylated form have been designed to target RNA-dependent RNA polymerase (RdRp) of several viruses, including Ebola virus, MERS-CoV, SARS-CoV and potentially SARS-CoV-2 [26] [27] [28] [29] . In addition, recent study has further demonstrated that the direct use of GS-441524 can potently inhibit the replication of SAR-CoV-2 in mouse model 30 . Based on our crystal structure, structural comparison demonstrated that the binding mode of GS-441524 in the macrodomain highly resembled that of the adenosine moiety of ADP-ribose. The amine group of the pyrrolotriazine ring, which was sandwiched between Val49 and Phe159, formed a hydrogen bond to Asp22 while the hydroxyl groups of the ribose moiety engaged two contacts with the backbone carbonyl atoms of Leu126 and Val155 ( Figure 4C ). The 1'-nitrile group was accommodated in the space adjacent to loop β7-α6. However, with a distance of ~3.3 Å direct contacts to the backbone amine atoms of Phe156 and Asp157 might be weak, yet charge compatibility with this binding pocket was provided by the negative electrostatic potential of the nitrile group. In comparison with the adenosine substrate, GS-441524 showed improved shape complementarity with the binding pocket ( Figure 3A and 4C) . To assess this, we performed isothermal calorimetry (ITC) to determine the affinities of this ligand, adenosine, AMP and ADP-ribose. Unfortunately, adenosine and AMP did not yield interpretable ITC binding isotherms. Nonetheless, we observed that the KDs of GS-441524 and ADP-ribose were remarkably comparable (10.8 and 7.3 μM, respectively). This was a surprising result considering that the two phosphate moieties and the terminal ribose, are missing in GS-441524. Thus, we synthesized the 5' monophosphate derivative of this compound (GS-441524 monophosphate), and characterized the binding. Interestingly, an installation of 5'-phosphate led to a slight improved affinity with similar binding strength than ADP-ribose (KD of 8.6 μM), suggesting a contribution of the phosphate group for binding. Nevertheless, of particular note was different thermodynamics of the three ligands. The presence of phosphate group in GS-441524 monophosphate and ADPribose as well as additional terminal ribose-1'-phosphate moiety in the latter resulted in large negative binding enthalpy changes, which were counteracted by highly unfavorable entropy changes (TΔS) likely attributable to the constrained ligand and potentially pocket in the bound state. To conclude, targeting the macrodomain offers an attractive target for the development of antiviral agents against SARS-CoV-2 and other viruses. To assist the inhibitor discovery efforts, we report a collective set of crystal structures extending our structural knowledge of this protein. Structural comparison demonstrated that the ADP-ribose binding site, essentially the pocket for ribose-1-phosphate, of the macrodomain possessed high structural plasticity. This was demonstrated by its adaptability to diverse naturally-occurring nucleosides and nucleotides including ADP-ribose-phosphate (ADPRP). This flexibility offers opportunities for rational targeting this protein module by small molecule inhibitors. Indeed, recent study has reported a number of potential small molecule binders discovered through crystallographic fragment screening 31 . In line with this, our unprecedented discovery of the binding of GS-441524, an active metabolite of remdesivir, supported this hypothesis that the ADP-ribose pocket of the macrodomain indeed represents a druggable site. The low micromolar affinity of GS-441524 and its small molecular weight offers a versatile starting point for ligand design. Protein production: The DNA encoding SARS-CoV-2 macrodomain was commercially synthesized, and subcloned into pET-28a(+) vector (Genscript, supplementary table s1 ). Expression of the recombinant protein harbouring an N-terminal His6 tags was carried out in E. coli Rosetta which was cultured in TB media. The culture was grown at 37 °C until reaching an OD600 of 1.6 prior to cooling to 18 °C. At an OD600 of 2.6-2.8, the protein expression was induced by adding 0.5 μM IPTG and the expression was continued overnight. Cells were harvested by centrifugation, and resuspended in buffer containing 50 mM Tris, pH 8.0, 500 mM NaCl, 10 mM imidazole, pH 7, 5% glycerol and 1 mM TCEP. Lysis was performed by sonication, and supernatant was clarified by centrifugation. The recombinant protein was initially purified using Ni 2+ -affinity chromatography, and subsequent cleavage of the histidine tag was performed by TEV protease treatment. The cleaved protein was passed through Ni 2+ -NTA column, and was further purified by size exclusion chromatography using Superdex S75. The pure protein was stored at -80 °C in buffer 25 mM Tris pH 8.0, 150mM NaCl and 5% glycerol. The macrodomain was buffer exchanged into 25 mM Tris pH 8.0, 150mM NaCl and concentrated to ~8.5-10 mg/ml. Crystallization was performed using sitting drop vapor diffusion method at 20 °C and conditions listed in supplementary table s2. Macroseeding was employed in all crystallization experiment using initial crystals of the macrodomain, which were stored in 30% broad-molecular-weight PEG smears 32 , 0.1 M MgCl2, 0.1 M tris pH 7.0. For co-crystallization, the protein was mixed with 10-fold molar excess of the ligands. In the case of ADP-ribose, soaking was performed overnight using the ligand at 10 mM concentration. Viable crystals were cryo-protected with mother liquor supplemented with 20% ethylene glycol prior to flash cooling in liquid nitrogen. Diffraction data were collected at Swiss Light Source, and were processed and scaled with XDS 33 and aimless 34 , respectively. Initial structure solution was obtained with molecular replacement method using Phaser 35 and the coordinate of the macrodomain (pdb id: 6wen). Manual model rebuilding alternated with refinement was performed in COOT 36 and REFMAC5 37 . The geometry of the final models was verified using MOLPROBITY 38 . Omitted electron density maps for the bound ligands are shown in supplementary figure s1. The summary of data collection and refinement statistics are in supplementary table s2. Chemical synthetic procedure is described in supplementary information. Isothermal calorimetry: Isothermal titration calorimetry (ITC) experiment was performed using NanoITC instrument (TA Instrument) at 25 °C in the buffer containing 20 mM Tris pH 8.0, 100 mM NaCl, 0.5 mM TCEP, 5% glycerol. For ADP-ribose, the ligand at 500 µM in syringe was titrated into the reaction cell containing the protein at 106 µM, whereas for GS-441524 and GS-441524 monophosphate the protein at 250 µM was titrated into the reaction cell containing the compound at 22 μM. Data analyses were performed with NanoAnalyze software (TA Instrument) using an independent binding model from which the affinities and thermodynamic parameters were calculated. A new coronavirus associated with human respiratory disease in China The species Severe acute respiratory syndrome-related coronavirus: classifying 2019-nCoV and naming it SARS-CoV-2 Genomic characterization of the 2019 novel human-pathogenic coronavirus isolated from a patient with atypical pneumonia after visiting Wuhan Molecular evolution of the SARS coronavirus during the course of the SARS epidemic in China Nsp3 of coronaviruses: Structures and functions of a large multi-domain protein The macro domain is an ADP-ribose binding module Macrodomains: Structure, Function, Evolution, and Catalytic Activities The Viral Macrodomain Counters Host Antiviral ADP-Ribosylation The impact of PARPs and ADP-ribosylation on inflammation and host-pathogen interactions ADP-Ribosylation, a Multifaceted Posttranslational Modification Involved in the Control of Cell Physiology in Health and Disease Viral Macro Domains Reverse Protein ADP-Ribosylation The conserved macrodomains of the non-structural proteins of Chikungunya virus and other pathogenic positive strand RNA viruses function as mono-ADP-ribosylhydrolases Reversible ADP-ribosylation of RNA Viral macrodomains: a structural and evolutionary assessment of the pharmacological potential Structural, Biophysical, and Biochemical Elucidation of the SARS-CoV-2 Nonstructural Protein 3 Macro Domain ADP-ribose-1"-monophosphatase: a conserved coronavirus enzyme that is dispensable for viral replication in tissue culture Viral Macrodomains: Unique Mediators of Viral Replication and Pathogenesis The ADP-ribose-1''-monophosphatase domains of severe acute respiratory syndrome coronavirus and human coronavirus 229E mediate resistance to antiviral interferon responses Crystal structures of SARS-CoV-2 ADP-ribose phosphatase: from the apo form to ligand complexes The SARS-CoV-2 Conserved Macrodomain Is a Mono-ADP-Ribosylhydrolase Molecular Basis for ADP-Ribose Binding to the Mac1 Domain of SARS-CoV-2 nsp3 Structural basis of severe acute respiratory syndrome coronavirus ADP-ribose-1''-phosphate dephosphorylation by a conserved domain of nsP3 Evolutionary and structural analysis of SARS-CoV-2 specific evasion of host immunity The ARH and Macrodomain Families of alpha-ADP-ribose-acceptor Hydrolases Catalyze alpha-NAD(+) Hydrolysis Remdesivir: A Review of Its Discovery and Development Leading to Emergency Use Authorization for Treatment of COVID-19 Comparative therapeutic efficacy of remdesivir and combination lopinavir, ritonavir, and interferon beta against MERS-CoV Remdesivir Metabolite GS-441524 Effectively Inhibits SARS-CoV-2 Infection in Mouse Models Defined PEG smears as an alternative approach to enhance the search for crystallization conditions and crystal-quality improvement in reduced screens How good are my data and what is the resolution? Phaser crystallographic software Overview of refinement procedures within REFMAC5: utilizing data from different sources MolProbity: More and better reference data for improved all-atom structure validation