key: cord-0876719-hymksifb authors: Rothan, Hussin A.; Teoh, Teow Chong title: Cell-Based High-Throughput Screening Protocol for Discovering Antiviral Inhibitors Against SARS-COV-2 Main Protease (3CLpro) date: 2021-01-19 journal: Mol Biotechnol DOI: 10.1007/s12033-021-00299-7 sha: 44ff1fe8df3d430017feec9fba4ee0ded61087ff doc_id: 876719 cord_uid: hymksifb The global public health has been compromised since the severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) emerged in late December 2019. There are no specific antiviral drugs available to combat SARS-CoV-2 infection. Besides the rapid dissemination of SARS-CoV-2, several variants have been identified with a potential epidemiologic and pathogenic variation. This fact has forced antiviral drug development strategies to stay innovative, including new drug discovery protocols, combining drugs, and establishing new drug classes. Thus, developing novel screening methods and direct-targeting viral enzymes could be an attractive strategy to combat SARS-CoV-2 infection. In this study, we designed, optimized, and validated a cell-based assay protocol for high-throughput screening (HTS) antiviral drug inhibitors against main viral protease (3CLpro). We applied the split-GFP complementation to develop GFP-split-3CLpro HTS system. The system consists of GFP-based reporters that become fluorescent upon cleavage by SARS-CoV-2 protease 3CLpro. We generated a stable GFP-split-3CLpro HTS system valid to screen large drug libraries for inhibitors to SARS-CoV-2 main protease in the bio-safety level 2 laboratory, providing real-time antiviral activity of the tested compounds. Using this assay, we identified a new class of viral protease inhibitors derived from quinazoline compounds that worth further in vitro and in vivo validation. COVID-19 is a pandemic disease caused by SARS-CoV-2, a highly contagious coronavirus causing significant healthcare and economic burden. SARS-CoV-2 is causing a spectrum of disease from asymptomatic to severe complications, including pneumonia, acute respiratory distress syndrome (ARDS), acute lung injury (ALI), cytokine storm syndrome (CSS), and death [1] [2] [3] [4] . There are no specific antiviral drugs or vaccines with proven clinical efficacy for treating or preventing infection with SARS-CoV-2, except a few non-specific repurposing drugs [5] [6] [7] . Furthermore, several variants of SARS-CoV-2 have been identified with potential epidemiologic and pathogenic variation [8] [9] [10] [11] [12] [13] . As such, the development of novel antiviral screening methods and direct-targeting of viral enzymes could be an attractive strategy to combat SARS-CoV-2 infection. SARS-CoV-2 polyproteins are processed by two viral proteases, papain-like protease (PLpro) and 3C-like protease (3CLpro), which are excellent targets for the development of therapeutic antivirals [14, 15] . Because of its highly conserved sequence, 3CLpro and PLpro have been considered as potential targets for antiviral drugs against SARS, MERS, and COVID-19 [16, 17] . Further, 3CLpro is responsible for virus-induced apoptotic signal [18] , and PLpro for stripping ubiquitin and ISG15 from host-cell proteins to aid coronaviruses in their evasion of the host innate immune responses [14] . Therefore, targeting 3CLpro and PLpro may have advantages in inhibiting viral replication and dysregulation of signaling cascades in infected cells. Viral 3CLpro [also called main protease (M pro )] cleaves viral polyproteins at 11 sites compared to 3 sites of PLpro [19] . As such, we concentrated our efforts on identifying antiviral candidates against viral 3CLpro. This protease has an identical sequence among coronaviruses and has no human homolog [20, 21] . In this study, we developed a protocol for high-throughput screening (HTS) to identify inhibitors against SARS-CoV-2 proteases based on the split-GFP complementation method. Our previous published data showed a practical implementation of split-GFP complementation assay to measure protein translocation from ERto-cytosol [22] . This cell-based-screening protocol is very significant in enhancing the safety, throughput, and reproducibility of antiviral screening. It can be used in biosafety level two laboratory, providing a real-time activity of tested compounds of large drug libraries, and also provide insight on compounds cytotoxicity. We designed a cell-based assay using GFP-split complementation to screen drug libraries and identify inhibitors against SARS-CoV-2 main protease 3CLpro. The GFP-split complementation assay was previously designed to measure caspase activity in the apoptotic cells in vitro and in vivo [23, 24] . We previously used the GFP-split complementation to establish cell lines stably expressing a dislocation-induced reconstituted GFP reporter to monitor and quantify protein translocation from the endoplasmic reticulum to the cytosol [22] . In this study, we utilized this technology to develop and optimize a protocol for high-throughput screening (HTS) to identify inhibitors against SARS-CoV-2 protease by screen small molecules library. We found that this assay is a simple and practical strategy to screen large drug libraries for protease inhibitors. The assay principle depends on splitting GFP into two units (GFP β1-9 and β10-11), resulting in losing its fluorescent capacity. β10-11 has a high affinity to bind to the β1-9 and rapidly develops green fluorescence [25] . Thus, split-GFP protease assay depends on preventing GFP units' assembly and making the triggering GFP assembly under protease activity. GFP gains the green fluorescence when β10 and β11 in anti-parallel position bind to β1-9 (Fig. 1a) . Using E5/K5 heterodimer to flip β10 and β11 in parallel form prevents self-assembly of the split GFP (Fig. 1b) . Upon protease cleavage, β11 flips back, forming an anti-parallel structure with β10, which enables self-assembly with β1-9 and leads to gain of green fluorescence (Fig. 1c) . Insertion of the 3CLpro cleavage site between E5/K5 heterodimer and β11 allows the 3CLpro to release β11 and to resume the antiparallel structure with β10. We named this reporter assay a GFP-split-3CLpro assay. We designed and generated an expression cassette consists of two plasmids. The first plasmid harbors GFP construct, and the second plasmid harbors 3CLpro to co-express in HEK293 cells (Fig. 2) . We used standard PCR to amplify the two GFP fragments, β-strand 1-9 and β-strand 10-11. The ten amino acid linker and cleavage sites of 3CLpro and E5/K5 amino acid sequences were inserted into the GFP construct using overlap-extension PCR as we described previously [26] (Fig. 2a) . GFP expression was detected at 24 h, and high GFP expression was observed at 48 and 72 h post-transfection. No considerable GFP expression was observed before the transfection of HEK239T cells with 3CLpro plasmid. The cells transfected only with the GFP construct but not 3CLpro did not show fluorescence activity five days after transfection. The GFP fluorescence dramatically increased at 48 and 72 h, as in Fig. 2b , leading to an increase in the signal-to-noise ratio. We further investigated the expression levels of GFP construct and 3CLpro by immunoblotting. We found higher expression of GFP construct than 3CLpro (Data not shown) that needs extensive plasmid ratio optimization. Increased 3CLpro expression resulted in rapid cleavage into the GFP Fig. 1 GFP-split complementation method. This assay was developed as previously described [23, 24] . Split GFP into β1-9 and β10-11 resulted in losing its fluorescent capacity. β10-11 has a high affinity to bind to the β1-9 and rapidly develop green fluorescence [25] . a GFP gains the green fluorescence when β10 and β11 in anti-parallel position bind to β1-9. b E5/K5 heterodimer was used to flip β10 and β11 in parallel form prevents self-assembly of the split GFP. c Upon protease cleavage, β11 flips back, forming an anti-parallel structure with β10, which enables self-assembly with β1-9 and gains green fluorescence. The 3CLpro cleaves between E5/K5 heterodimer and β11 allowing β11 to form the anti-parallel structure with β10 (Color figure online) construct and producing high GFP fluorescence. Instead of optimizing plasmid ratios and establishing a stable fluorescent signal, we sought to link the expression levels of GFP to the 3CLpro by generating the expression cassette 2 (Fig. 2c) . We joined the β-strand 10-11 construct to the C-terminus of 3CLpro and inserted another 3CLpro cleavage site to facilitate the release of the β-strand 10-11. GFP assembly was initiated only when the β-strand 10-11 released from 3CLpro by enzyme auto-cleavage (Fig. 3) . A more stable increase in the GFP fluorescence and optimal signal-to-noise ratio were observed for expression cassette two as compared to cassette one. We evaluated the enzyme activity over time by measuring the GFP fluorescence intensity. In Fig. 3 , the X axis plots the time and the Y axis plots the GFP intensity. We used the sigmoidal curve Fig. 2 The expression cassettes of GFP-split-3CLpro assay. The HEK293T cells were co-transfected with the recombinant pcDNA3.1 plasmids, and the GFP fluorescence intensities were measured at 24, 48, and 72 h. a The expression cassette one consists of two plasmids, the first plasmid harbors GFP construct, and the second plasmid harbors SARS-CoV-2 main protease (3CLpro). b GFP fluorescence noticeably increased at 48 and 72 h, leading to an increase in the signal-to-noise ratio. c The expression cassette two consists of two plasmids, the first plasmid expresses the GFP β-strand 1-9, and the second plasmid harbors a construct of β-strand 10-11 that was joined to the C-terminus of 3CLpro by another 3CLpro cleavage site to facilitate auto-cleavage of the β-strand 10-11. dA stable increase in the GFP fluorescent activity initiated when the β-strand 10-11 released from 3CLpro by enzyme auto-cleavage activity and binds to β-strand 1-9 Developing an HTS assay for screening thousands of compounds requires extensive optimization and evaluation to verify that the assay is performing acceptably. We evaluated our GFP-split-3CLpro protocol by screening an in-house library of 50 small molecule compounds (Fig. 4a) . Most of the compounds possess potential antiviral protease activity based on our previous studies of testing these compounds against NS3-4A protease of the Hepatitis C virus [27] , NS2B-NS3 protease of dengue virus [28] [29] [30] [31] [32] [33] [34] [35] [36] , and NS2B-NS3 protease Zika virus [37, 38] . In this study, boceprevir was used as a reference compound that showed a significant reduction in GFP fluorescent activity, suggesting potent inhibition against SARS-CoV-2 main protease (Fig. 4b) . This finding agrees with a previous study that showed potent inhibition of boceprevir against SARS-CoV-2 in vitro [39] . Surprisingly, the quinazoline derivatives caused a slight to a significant reduction in GFP fluorescence intensity (Fig. 4c-4g) . The compounds QZ1, QZ2, QZ3, and QZ5, showed 20-40% inhibition of viral protease activity, as presented in Fig. 4c , d, e, g. Among quinazoline derivatives, the compound QZ4 caused a considerable reduction in the GFP fluorescence intensity (Fig. 4f) . We sought to calculate the EC 50 of this compound by applying increasing concentrations of the inhibitor for 48 h. The results revealed that the QZ4 compound, [3-(5-methoxy-2-hydroxy benzylidene amino)-2(5-methoxy-2-hydroxyphenyl)-2,3-dihydro quinazoline-4(1H)one] reduced the GFP fluorescence intensity at an EC 50 of 6.5 µM, suggesting potential inhibition against 3CLpro activity with 50% cytotoxic concentration (CC 50 ) more than 100 µM at 48 h (Fig. 5a, b) . Furthermore, we measured the EC 50 value of the reference compound boceprevir, which is approximately 5.2 µM at 48 h (Fig. 5d) . The EC 50 value of boceprevir is close to the EC 50 of this compound measured by FRET-based enzymatic assay (4.1 µM) [39] , taking into account compound bioavailability in the cell-based assay. We further performed a molecular docking study to illustrate the binding of the QZ4 compound to SARS-CoV-2 main protease 3CLpro (Fig. 6) . The X-ray crystal structure of 3CLpro (PDB ID: 6w63.pdb) was downloaded from Protein Databank, and we obtained the 3D structure of Fig. 4 Validation of GFP-split-3CLpro screening protocol using a small molecule in-house library. a GFP-split-3CLpro protocol was evaluated by screening an in-house library of 50 small molecule compounds at 10 µM concentration. b Boceprevir showed an 80% reduction in GFP fluorescent activity at 40 µM, suggesting potent inhibi-tion against SARS-CoV-2 main protease. Quinazoline derivatives: c QZ1; d QZ2; e QZ3; g QZ5 showed 20-50% inhibition to viral protease while f QZ4 showed approximately 80% inhibition at 40 µM (n = 5 replicates) the QZ4 compound from PubChem. The results showed that the compound binds to the 3CLpro catalytic site with docking energy − 10.8 kcal/mol. The QZ4 binds to the 3CLpro by three hydrogen bonds with Gln 189, Gln192, and Arg 188.. We described previously the synthesis of the bioactive quinazoline derivatives [40] . Quinazoline compounds showed potent kinase inhibition via phenyl group that allows favorable interaction with the target into the kinase domain41. Furthermore, some kinase inhibitors have antiviral potential due to their activity against host cell kinases required for virus replication, and therefore, it was considered for drug repurposing against SARS-CoV-2 replication [42] . In this study, the quinazoline scaffold structure contains two active groups (R1 and R2) that contribute to its biological activity. Substitution of the two methoxyphenyl groups at R1 sites resulted in a considerable reduction in inhibition potencies of QZ4 compound to virus replication (Table 1) . This study suggests a new class of viral protease inhibitors derived from quinazoline compounds that worth further investigation. We do not expect the host cell proteases cleave the 3CLpro cleavage site because 3CLpro cleaves after a glutamine residue, and no human host-cell proteases are known with this substrate specificity [43, 44] , reducing the non-specific cleavage of our GFP-designed construct. Our published data showed a practical implementation of split-GFP complementation assay to measure protein translocation from ER-to-cytosol [22] . Besides, viral 3CLpro and PLpro are conserved among coronaviruses, and the protease inhibitors could have a broad spectrum against different coronaviruses [45] . Our published data also showed that protease assays are viable in identifying protease inhibitors virus replication such as HCV [27] , DENV [28] [29] [30] [31] [32] [33] [34] [35] [36] , and ZIKV [37, 38] . In this study, we developed a novel cell lines model for highthroughput screening (HTS) to identify inhibitors against SARS-CoV-2 protease based on split-GFP complementation. This method significantly enhances the safety, throughput, and reproducibility of antiviral screening. Besides, it helps shorten the time of screening drugs and provides the real-time activity of tested compounds and insight into compound toxicity. Most recently published studies of designing antiviral therapeutics against 3CLpro and PLpro depend on the in silico design and virtual screening without actual validation. Our cell-based protease assay will help validate the in silico designed inhibitors without using the infectious virus. Using this assay, we identified a new class of viral protease inhibitors derived from quinazoline compounds that worth further in vitro and in vivo validation. Standard PCR was used to amplify the two GFP fragments, β-strand 1-9 and β-strand 10-11. The ten amino acids linker and cleavage sites of 3CLpro and E5/K5 amino Fig. 5 The inhibition potential of quinazoline derivatives QZ4 to SARS-CoV-2 main protease 3CLpro. a QZ4 compound, [3-(5-methoxy-2-hydroxy benzylidene amino)-2(5-methoxy-2-hydroxyphenyl)-2,3-dihydro quinazoline-4(1H)-one]. b The increasing concentrations of QZ4 showed considerable reduction in the GFP fluorescence intensity over the time c The reduction in GFP intensities indicates that the EC 50 of QZ4 is approximately 6.5 µM suggesting potential inhibition against 3CLpro activity with 50% cytotoxic concentration (CC 50 ) more than 100 µM at 48 h. d EC 50 of boceprevir is approximately 5.2 µM at 48 h (n = 3 replicates) acid sequences were inserted into the GFP construct using overlap-extension PCR, as we described previously [26] . The purified DNA of GFP constructs and 3CLpro (Gen-Bank code: MN908947.3, ORF1ab polyprotein residues 3264-3569,) were sequenced for any mutations, and the intact DNA fragments were cloned in pGEM®-T cloning vector (Cat.# A1360, Promega), amplified in the E. coli, retrieved and sub-cloned in pcDNA3.1 mammalian expression vector (Cat.# V79020, ThermoFisher). The HEK293T cells (ATCC) were co-transfected with the recombinant pcDNA3.1 plasmids using Lipofectamine 2000 (Cat.# 11668-019, Invitrogen). The cell-based assay was optimized for a 96-well plate. Assay conditions were optimized including, seeding cell density (2 × 10 4 cell/well), FBS concentration (1.5%) in Dulbecco's modified Eagle medium (DMEM, Gibco) containing 1× GlutaMAX (Gibco), 1 mM sodium pyruvate (Gibco), and 1× antibiotic-antimycotic solution (Gibco). Cells tolerance to the solvent used for the compounds (DMSO 0.1%), in presence and absence of potential 3CLpro inhibitors. The HEK293T cells co-expressing split-GFP construct and SARS-CoV-2 3CLpro were cultured in 96-well black and clear-bottom plates (Cat.# 07-200-625, fisher scientific) at 2 × 10 4 cells/well overnight. GFP fluorescence intensities were measured using Tecan F200 Pro multimode microplate reader at excitation 488 nm and emission 525 nm at different time points, and GFP expression in HEK293T cells was acquired by the fluorescence microscopy. A small in-house library was screened by treating cells with the compounds prepared in DMSO at 10 µM concentration for 48 h. We used boceprevir (Cas # 394730600, Sigma) as a positive control that showed activity against SARS-COV-2 3CLpro [39] . The experiments were run on three occasions to identify the assay's hit rate, the reproducibility of the assay, and the false negative and false positive hit rates in the assay. At the endpoint of the split-GFP-3CLpro assay, we measure the compound cytotoxicity by applying 20/well of the MTT solution (CellTiter 96® AQueous, Promega) and incubating the cells for 4 h at 37 °C and 5% CO 2 . The absorbance was measured at 490 nm using a 96-well plate reader. The X-ray crystal structure of 3CLpro (PDB ID: 6w63. pdb) was downloaded from Protein Databank and minimized using the steepest descent algorithm in the GROMACS version 2020.1 with OPLSAA force field to the convergence of force less than 1000.0 kJ/mol/nm. The 3D structure of QZ4 ligand was downloaded from PubChem and optimized using Gaussian09 software applying the density functional theory (DFT) calculations with B3LYP 6-31G(d) basis set. Molecular docking between the optimized QZ4 and the 3CLpro receptor was performed using AutoDock Vina Version 2.0. The molecular rendering was performed using PyMOL 1.3 (TM) 2010 Schrodinger, LLC, and the 2-D diagram, which was computed using Discovery Studio 4.5 Client to analyze the molecular interactions of docked conformations. The MTT assay was used to measure compounds cytotoxicity by applying 20/well of the MTT solution (CellTiter 96® AQueous, Promega) and incubating the cells for 4 h at 37 °C and 5% CO 2 . The absorbance was measured at 490 nm using a 96-well plate reader. The epidemiology and pathogenesis of coronavirus disease (COVID-19) outbreak Molecular aspects of COVID-19 differential pathogenesis The pathogenesis of coronavirus disease 2019 (COVID-19): Evaluation and prevention Simulation of the clinical and pathological manifestations of coronavirus disease 2019 (COVID-19) in golden Syrian hamster model: Implications for disease pathogenesis and transmissibility The FDA-approved gold drug auranofin inhibits novel coronavirus (SARS-COV-2) replication and attenuates inflammation in human cells Repurposing existing drugs for the treatment of COVID-19. Annals of the COVID-19, immune system response, hyperinflammation and repurposing antirheumatic drugs Tracking changes in SARS-CoV-2 spike: Evidence that D614G increases infectivity of the covid-19 virus Isolation and phylogenetic analysis of SARS-CoV-2 variants collected in Russia during COVID-19 outbreak Escape from neutralizing antibodies by SARS-CoV-2 spike protein variants Genomic analysis of early SARS-CoV-2 variants introduced in Mexico Genetic variants and source of introduction of SARS-CoV-2 in South America Attenuated SARS-CoV-2 variants with deletions at the S1/S2 junction The SARS-coronavirus papain-like protease: Structure, function and inhibition by designed antiviral compounds Molecular docking and dynamics study of natural compound for potential inhibition of main protease of SARS-CoV-2 Inhibition of SARS-CoV 3CL protease by flavonoids Identification, synthesis and evaluation of SARS-CoV and MERS-CoV 3C-like protease inhibitors Severe acute respiratory syndrome coronavirus 3C-like proteaseinduced apoptosis Evaluating the 3C-like protease activity of SARScoronavirus: Recommendations for standardized assays for drug discovery Structure of coronavirus main proteinase reveals combination of a chymotrypsin fold with an extra alpha-helical domain Conservation of substrate specificities among coronavirus main proteases A small molecule inhibitor of ER-to-cytosol protein dislocation exhibits anti-dengue and anti-Zika virus activity Designing a green fluorogenic protease reporter by flipping a beta strand of GFP for imaging apoptosis in animals Rational design of a GFP-based fluorogeniccaspase reporter for imaging apoptosis in vivo A new protein-protein interaction sensor based on tripartite split-GFP association A comparative study on the expression, purification and functional characterization of human adiponectin in Pichia pastoris and Escherichia coli Identification of peptide leads to inhibit hepatitis C virus: Inhibitory effect of plectasin peptide against hepatitis C serine protease Synthetic peptide optimization improves the inhibition of dengue NS2B-NS3 protease and dengue replication in vitro Screening of antiviral activities in medicinal plants extracts against dengue virus using dengue NS2B-NS3 protease assay Inhibitory effect of doxycycline against dengue virus replication in vitro Identification of natural antimicrobial agents to treat dengue infection: In vitro analysis of latarcin peptide activity against dengue virus Fusion of protegrin-1 and plectasin to MAP30 shows significant inhibition activity against dengue virus replication Antiviral cationic peptides as a strategy for innovation in global health therapeutics for dengue virus: High yield production of the biologically active recombinant plectasin peptide Study the antiviral activity of some derivatives of tetracycline and non-steroid anti inflammatory drugs towards dengue virus Inhibition of dengue NS2B-NS3 protease and viral replication in Vero cells by recombinant retrocyclin-1 Protegrin-1 inhibits dengue NS2B-NS3 serine protease and viral replication in MK2 cells Carnosine exhibits significant antiviral activity against Dengue and Zika virus Novel peptides inhibit zika NS2B-NS3 serine protease and virus replication in human hepatic cell line Boceprevir, GC-376, and calpain inhibitors II, XII inhibit SARS-CoV-2 viral replication by targeting the viral main protease NMDA receptor antagonism with novel indolyl, 2-(1,1-Dimethyl-1,3-dihydro-benzo[e]indol-2-ylidene)-malonaldehyde, reduces seizures duration in a rat model of epilepsy Quinazoline-based multi-tyrosine kinase inhibitors: Synthesis, modeling, antitumor and antiangiogenic properties Repurposing of kinase inhibitors for treatment of COVID-19 Alpha-ketoamides as broad-spectrum inhibitors of coronavirus and enterovirus replication: Structure-based design, synthesis, and activity assessment From SARS to MERS: Crystallographic studies on coronaviral proteases enable antiviral drug design Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation Publisher's Note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations Acknowledgments This study was supported by the Institut Pengurusan dan Pemantauan Penyelidikan, University of Malaya, TRGS Grant (TR001B-2014B).