key: cord-0715177-xmq97qvh authors: Wu, W.; Wang, S.; Zhang, H.; Guo, W.; Lu, H.; Xu, H.; Zhan, R.; Fidan, O.; Sun, L. title: Biosynthesis of Novel Naphthoquinone Derivatives in the Commonly-used Chassis Cells Saccharomyces cerevisiae and Escherichia coli date: 2021-12-23 journal: Appl Biochem Microbiol DOI: 10.1134/s0003683821100124 sha: a1ffa2b7cf049e3565b065322e74c69b1db6ca67 doc_id: 715177 cord_uid: xmq97qvh Naphthoquinones harboring 1,4-naphthoquinone pharmacophore are considered as privileged structures in medicinal chemistry. In pharmaceutical industry and fundamental research, polyketide naphthoquinones were widely produced by heterologous expression of polyketide synthases in microbial chassis cells, such as Saccharomyces cerevisiae and Escherichia coli. Nevertheless, these cell factories still remain, to a great degree, black boxes that often exceed engineers’ expectations. In this work, the biotransformation of juglone or 1,4-naphthoquinone was conducted to generate novel derivatives and it was revealed that these two naphthoquinones can indeed be modified by the chassis cells. Seventeen derivatives, including 6 novel compounds, were isolated and their structural characterizations indicated the attachment of certain metabolites of chassis cells to naphthoquinones. Some of these biosynthesized derivatives were reported as potent antimicrobial agents with reduced cytotoxic activities. Additionally, molecular docking as simple and quick in silico approach was performed to screen the biosynthesized compounds for their potential antiviral activity. It was found that compound 11 and 17 showed the most promising binding affinities against Nsp9 of SARS-CoV-2, demonstrating their potential antiviral activities. Overall, this work provides a new approach to generate novel molecules in the commonly used chassis cells, which would expand the chemical diversity for the drug development pipeline. It also reveals a novel insight into the potential of the catalytic power of the most widely used chassis cells. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1134/S0003683821100124. against disease-causing microorganisms and noted for its antitumor activity [4] . Lapachol, isolated from the bark and wood of Tabebuia sp. and Tecoma sp., was reported as a potential therapeutic agent with various bioactivities such as anti-inflammatory, analgesic, antimalarial, antischistosomal, antiviral, antitrypanosomal, leishmanicide, antifungal, anticancer and antiulcerative activities [5] . Additionally, in a recent study, lapachol and its derivatives were screened against severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) using an in silico approach and some of lapachol derivatives exhibited strong binding affinity to non-structural protein 9 (Nsp9) of SARS-CoV-2, which is one of the potential drug target proteins [6] . Another natural naphthoquinone is shikonin, which is isolated from Lithospermum erythrorhizon. It was previously announced to be an effective necroptotic inducer for cancer cells [7] . Shikonin was also reported to possess pharmaceutical value in inflammatory and infectious diseases [8] . As a secondary metabolite generated by the walnut tree (Juglandaceae), juglone has several biological activities such as anticancer, antioxidant, antiviral, antibacterial and antifungal effects [9] . Plumbagin, a plant-derived naphthoquinone, is a yellow compound extracted from the root of Plumbago zeylanica L. and has a wide range of biological activities including anticancer, anti-fungal, anti-bacterial, anti-malarial, anti-inflammatory, and antimutagenic properties [10] . Vitamin K is a class of isoprenoid molecules existing in multiple dietary forms, which are essential to normal coagulation as an enzyme cofactor for γ-carboxylation of peptide-bound Glu residues. All vitamin K forms are 2-methyl-1,4-naphthoquinone derivatives, term as menaquinones and have important roles in human biology, such as bone, cardiovascular, and metabolic health [11] . As menaquinone is the essential electron-delivery vector of the respiratory chain in various microbial cells, bacteria are the main producers of menaquinone in industry [12] . 1,4-naphthoquinone is the critical intermediate in the biosynthesis of menaquinones in bacteria [13] . Vitamin K and its derivatives are primarily synthesized by Grampositive bacteria and found in much lower amounts in the food supply in meat, dairy or fermented food products [11] . Engineering microbial systems for the efficient synthesis of natural products has been noted as an important application of synthetic biology. The heterologous reconstitution of the biosynthetic pathways for natural products in microbial chassis cells has attracted increasing attention [14] . Since microbial chassis cells demonstrated high catalytic power of enzymes and a large panoply of biosynthesized products, they have been shown to be profitable in many applications. Some model microorganisms, such as Escherichia coli and Saccharomyces cerevisiae, have been considered as ideal cell factories, thanks to the high growth rate, availability of synthetic biology tools and relatively well-studied metabolic networks [14, 15] . Nevertheless, these cell factories still remain, to a great degree, black boxes that often exceed engineers' expectations. In industrial bioprocesses, different from scientific research, no surprises are desired, and total control over a fully comprehensible and specially designed chassis cell is the ultimate goal. Due to aforementioned bioactivities of naphthoquinones, they are being investigated with respect to their use in therapeutic applications [1] [2] [3] . This huge potential can be enhanced and expanded by rational modifications to the core structure of naphthoquinones. Besides, compared to the complicated ones, simple naphthoquinones are easy to be modified and optimized by the enzymes of microbial chassis cells [1] . In this work, biotransformation of 1,4-naphthoquinone and juglone in the commonly-used microbial chassis cells, S. cerevisiae and E. coli, led to the biosynthesis of a series of "unexpected" naphthoquinone derivatives ( Fig. 1 and Table 1 ). This study not only provided a novel insight into the potential of the catalytic power of these chassis cells, but also expanded the chemical space with the biosynthesis of novel naphthoquinone derivatives. The molecular docking experiments were also conducted to analyze the potential antiviral activities of the biosynthesized compounds and revealed that compound 11 and 17 are the most promising antiviral candidates with strong binding affinities to the Nsp9 of SARS-CoV-2. General methods. The metabolites were analyzed and purified on Shimadzu Essentia LC-16 HPLC instrument (Japan) with SPD-16 UV-Vis dual wavelength detector. All UV-spectra were shown in Fig. S1 . High-resolution electrospray ionization mass spectroscopy (HRESIMS) measurements were obtained from 1260-6538 Q-TOF system with ZORBAX Eclipse Plus C18 (2.1 × 100 mm; 1.8 μm) column (Agilent Technologies, USA). Linear gradient from 10 to 90% (vol/vol) acetonitrile-water (containing 0.1% formic acid) was used for 30 min at a flow rate of 0.3 mL/min. NMR spectra were recorded at room temperature on Bruker AVANCE III HD 400 Digital NMR Spectrometer (USA) at 400 MHz for 1 H and Fig. S2-Fig. S55 . Strains and culture conditions. As chassis cells, S. cerevisiae INVSc1 and E. coli BL21(DE3) were purchased from Invitrogen (Thermo Fisher Scientific, USA). S. cerevisiae INVSc1 was grown in 150 mL of YPD (1% yeast extract, 2% peptone, 2% dextrose) medium at 30°C with shaking at 225 rpm for 24 h. E. coli BL21(DE3) was grown in Luria-Bertani (LB) medium containing (g/L): tryptone-10.0, NaCl-10.0, yeast extract-5.0, pH 7.0, at 37°C for 24 h before harvested. Biosynthesis and characterization of naphthoquinones. 1,4-naphthoquinone (97%) and juglone (98%) were purchased from Bide Pharmatech Ltd. (China) and Shanghai Macklin Biochemical Co. Ltd. (China), respectively. L-Glu (99%), L-Cys (99%), L-Ser (99%) and L-Gly (99%) were obtained from Shanghai Yuanye Bio-Technology Co. Ltd., while L-Lys monohydrochloride (98%) was purchased from Sigma-Aldrich (USA). The chassis cells were harvested by centrifugation at 3500 g for 10 min, washed thrice with 100 mM phosphate buffer (PB, pH 7.4) and transferred to 50 mL of PB buffer, followed by the supplementation of 20 mg of 1,4-naphthoquinone or 3 mg of juglone. Additionally, 20 mg of corresponding amino acids were added when necessary. To investigate the bioconversion of 6, 16 and 17, chassis cells were resuspended in 1 mL PB buffer, and supplemented with 0.2 mg of substrate. Biotransformation experiment was carried out for 7 days under the same incubation conditions as above. Control groups set as microorganism control (microorganisms in substrate-free PB buffer) and substrate control (substrate in microorganism-free PB) were incubated under the same conditions. After fermentation, the broths were extracted thrice with an equal volume of ethyl acetate for 60 min. The extracts were concentrated to dryness under reduced pressure using rotary evaporator (Eyela, Singapore) and the residues were dissolved in methanol. These samples were subsequently analyzed on a Shimadzu Essentia LC-16 instrument (Japan) equipped with column ZORBAX Eclipse Plus C18 (5 μm, 4.6 × 250 mm) (Agilent Technologies, USA). Linear gradient from 10 to 90% (vol/vol) acetonitrilewater (containing 0.1% formic acid) was used for 30 min at a flow rate of 1 mL/min, except for the samples in Fig. 2a , which were analyzed using linear gradient of 20-40% (vol/vol) for 30 min and 40-98% gradient for 10 min at a flow rate of 0.9 mL/min. Purification and structural characterization of 1-17. To isolate the biosynthesized compounds (1-15) the fermentation broths were scaled up to 5 L. After fermentation, the broths were extracted as described above. The metabolites were separated from the extract by a typical purification procedure given below. After fermentation, the broth was extracted as described above and separated by silica gel column chromatography. The fractions containing the target compounds were further separated by reverse-phase HPLC. The details of purification and structural characterization for the known compounds (1-6 and 8) are provided in the supplementary material. Compound 9 was found to be in fraction 9 after silica gel column chromatography. It was further purified on HPLC with 17% acetonitrile-water. The peak at 45 min was collected to yield 4.7 mg of 9 in pure form. HRESIMS m/z was 304.0819 [M + H] + (calculated for C 15 H 14 NO 6 , 304.0821). Compound 10 was found to be in fraction 10 after silica gel column chromatography. It was further purified on HPLC with 23% acetonitrile-water. The peak at 16 min was collected to yield 5.0 mg of 10 in pure form. HRESIMS [M+H] + ion signal for 10 was at m/z 333.0542, which was consistent with the calculated exact mass for C 15 H 13 N 2 O 5 S, 333.0545. 1 H, 13 C and 2D NMR data are respectively listed in Table 2 and Fig. S22-26 . For the purification of 7 and 11, 1,4-naphthoquinone was incubated with E. coli BL21(DE3). Purification and structural characterization of 7 are provided in the supplementary materials with details. Compound 11 was found to be in fraction 7 after silica gel column chromatography. It was washed with water and methanol after silica gel column chromatography to yield 2.8 mg in pure form. 1 H, 13 C and 2D NMR data are listed in Table 2 To isolate compound 12, juglone was incubated with S. cerevisiae INVSc1. After separation on the silica gel column, 12 was found to be in fraction 10. It was subjected to further purification on HPLC, eluted with 31% acetonitrile-water. The peak at 20 min was collected, yielding 3.3 mg of 12 in pure form. 1 H, 13 C and 2D NMR data are given in Table 2 To isolate 16 and 17, 1,4-naphthoquinone and amino acid were incubated with microorganism. After fermentation, they were extracted as described above. Then the crude extract was fractionated on silica gel column, eluted with a gradient of methanolchloroform (0 : 100, 1 : 99, 2 : 98, 3 : 97, 4 : 96, 5 : 95, vol/vol, each 250 mL, containing 0.1% glacial acetic acid) to get 6 fractions. To isolate 16, 1,4-naphthoquinone and L-Ser were incubated with S. cerevisiae INVSc1. The broth was scaled up to 300 mL. After separation on the silica gel column, 16 was found to be in fraction 6, which was further separated by HPLC with 20% acetonitrilewater (containing 0.1% formic acid) at a flow rate of 1 mL/min. The peak at 11.13 min was collected to yield 1.6 mg of 16 in pure form. 1 H, 13 C and 2D NMR data are found in Table 2 and Fig. S46-50 ; HRESIMS [M + H] + ion signal was found at m/z 262.0711, which matched with the calculated exact mass for C 13 H 12 NO 5 , 262.0715. To isolate 17, 1,4-naphthoquinone and L-Lys monohydrochloride were incubated with E. coli BL21(DE3). The broth was scaled up to 2 L. After separation on the silica gel column, 17 was found to be in fraction 6, which was washed with water and methanol after dryness under vacuum. Twenty mg of 17 in pure form were obtained. 1 H, 13 C and 2D NMR data are found in Table 2 [16] . The ligands are first drawn in ChemDraw Professional 16.0, followed by MM2 minimization using Chem3D and saved as SDF files. Subsequently, the SDF files were converted PDB files for each ligand using Open Babel GUI software. Both protein and the ligands were prepared for the molecular docking experiments using AutoDock Tools (ADT) 1.5.6 [17] . In the protein preparation process, water molecules were deleted, polar hydrogen atoms and Kollmans charges were added. For ligands, Gasteiger charges were added, aromatic carbon and rotatable bonds were detected, torsion number was automatically set, and non-polar hydrogens were merged. Then, protein and ligand files were saved as PDBQT files. Druggable pockets of the protein was predicted using online Cav-ityPlus platform (http://www.pkumdl.cn:8000/cavityplus/computation.php). Based on the predicted most druggable pockets, the grid boxes were determined using ADT software. The grid box for pocket 1 was centered at x = 39.82, y = - 19 Our previous research has revealed that supplement of different nutrients into the fermentation broth of engineered E. coli resulted into the biosynthesis of unexpected by-products [19] , suggesting the enzymes of host microorganisms may modify the structures of final products. To investigate the potential catalytic power of chassis cells, 2 naphthoquinones, 1,4-naphthoquinone and juglone, were added, respectively, into the PB buffer containing the most widely used chassis cells, S. cerevisiae or E. coli. After 7 days of incubation, the broths were extracted and subjected to HPLC analysis. In the broth containing S. cerevisiae INVSc1 and 1,4-naphthoquinone as substrate, the extract revealed the presence of serial new product peaks 1-10 compared to empty control (Fig. 2a) . 1,4-naphthoquinone incubated with E. coli BL21(DE3) was bioconverted to compounds 1, 7 and 11 (Fig. 2b) . Although juglone, noted as 5-hydroxy-1,4-naphthoquinone, shared a very similar structure with 1,4-naphthoquinone, 12 was the only major product of the biotransformation of S. cerevisiae INVSc1 (Fig. 2c) . When juglone incubated with E. coli BL21(DE3), compounds 13-15 were detected (Fig. 2d) . These results suggested that a series of naphthoquinone derivatives were generated when the naphthoquinones incubated with chassis cells. Structural characterization of compounds 1-15. (Fig. 3 and S24) . Consistently, the S18 APPLIED BIOCHEMISTRY AND MICROBIOLOGY Vol. 57 Suppl. 1 2021 WU et al. 13 C NMR spectrum of 9 exhibited signals attributable to a naphthoquinone, 2 carboxyl groups (δ 174.54 and 172.51), a methine group (δ 54.73) and 2 methylene groups (δ 30.63 and 26.26) (Fig. S23 ). Although the carbon signal at δ 172.51 was weak, we identified it through the heteronuclear multiple-bond correlation (HMBC) spectrum (Fig. S26) . These findings suggested that Glu group was introduced to the quinone ring of 1,4-naphthoquinone at C-2. This was confirmed by the HMBC spectrum. As shown in Fig. 3 , the HMBC correlations of NH-1' at δ 7.32 to C-1, C-3, C-2', C-3' and C-4', H-2' at δ 4.05-4.20 to C-4', H-4' at δ 2.09 to C-2', C-3', C-5' and C-6', and H-5' at δ 2.35 to C-2', C-4' and C-6' confirmed the connection of the Glu group with 1,4-naphthoquinone at C-2. Thus, compound 9 was identified as (1,4-dioxo-1,4dihydronaphthalen-2-yl)glutamic acid. The assigned carbon and proton signals are listed in Table 2 . Compound 10 was isolated as a purple powder with determined the molecular formula as C 15 H 12 N 2 O 5 S according to its HRESIMS data [M + H] + , m/z 333.0542. This was 57 mass units larger than compound 6, which possessed identical UV-vis spectra with it (Fig. S1 ). Similar to that of 6, the 1 H NMR and 1 H -1 H COSY spectra of 10 (Fig. S13 , S15, S27 and S29) also showed the presence of a naphthoquinone core (δ 7.98, 7.90, 7.78, and 7.73) and Cys moiety bridging C-2 and C-3 of the quinone ring (δ 7.93, 4.54, 2.79, and 3.23), excepting 3 additional proton signals, including NH triplet at δ 8.08 and 2 vicinal CH doublet of doublets at δ 3.55 and δ 3.78, respectively. Besides, 2 additional carbon signals were found in the 13 C NMR spectrum, δ 42.04 and 171.41 ( Fig. S14 and S28 ). According to the heteronuclear single quantum coherence (HSQC) spectrum, both of the two CH protons (δ 3.55 and 3.78) were directly attached to the carbon at δ 42.04 (Fig. S30 ). All of these data suggested that 10 was a derivative of 6 and may contain an additional Gly residue connected to the Cys via an amide bond. It was confirmed by the HMBC correlations shown in Fig. 3 . The HMBC correlations of NH-1' at δ 7.93 to C-1, C-3, C-2' and C-4', H-2' at δ 4.54 to C-2, C-3' and C-4', and H-4' at δ 2.79 and 3.23 to C-3, C-2' and C-3' confirmed the connection of the Cys residue group with 1,4-naphthoquinone at C-2 and C-3. The linkage of Gly residue to Cys group and the peptide bond were determined based on the HMBC correlations of NH-5' at δ 8.08 to C-3' and C-6', and H-6' at δ 3.55 and 3.78 to C-3' and C-7'. Therefore, 10 was identified as (5,10-dioxo-3,4,5,10-tetrahydro-2H-naphtho[2,3-b] [1, 4] thiazine-3-carbonyl)glycine. All the carbon and proton signals were assigned and are listed in Table 2 . Compound 11 was isolated as an orange powder, exhibiting the molecular formula C 25 It was clearly a symmetric structure as judged by its 1 H and 13 C NMR spectra ( Fig. S32 and S33) , which showed 11 protons and 13 carbons. In the 1 H NMR .73, and 7.62), a NH multiplet (δ 5.86-5.93) and a CH singlet (δ 5.73) observed in the low field showed the presence of a naphthoquinone core and the substitution of H-2 by an amino group. Consistently, the 13 C NMR spectrum revealed that there were 2 ketone groups (δ 182.99 and 181.90), suggesting that the naphthoquinone moiety is still present. Three additional aliphatic proton signals were found in the high field (δ 3.23, 1.77, and 1.48-1.59), which belonged to H-2', H-3' and H-4'. The 1 H-1 H COSY spectrum confirmed the connections among them, and between NH-1' and H-2' (Fig. 3) . The position of the aliphatic chain was determined by the HMBC correlation of NH-1' to C-1, C-3 and C-2', and H-2' to C-2 (Fig. 3) . Thus, the structure of compound 11 was elucidated as 2,2'-(pentane-1,5-diylbis(azanediyl))bis(naphthalene-1,4-dione). The assigned carbon and proton signals are listed in Table 2 . Compound 12 was obtained as a purple powder, and its molecular formula was determined to be C 13 H 9 NO 5 S by HRESIMS, which gave a m/z 292.0276 for [M + H] + . The 1 H NMR spectrum showed 3 aromatic protons (δ 7.58, 7.51, and 7.25) and a chelating phenolic hydroxyl group (δ 12.61), while the 13 C NMR spectrum exhibited 2 ketone groups (δ 184.20 and 177.39) and an aromatic quaternary carbon signal belonged to C-5 at δ 160.61 ( Fig. S37 and S38 ). They established the 5-hydroxy-1,4-naphthoquinone core of 12 by a comparison of the NMR data with the substrate juglone (Fig. S4) , [20] and it was further supported by the HMBC correlations shown in Fig. 3 . The other additional proton signals were similar to those of 6 and 10, including a broad singlet at δ 8.08 belonged to NH-1', a CH multiplet at δ 4.44-4.61 belonged to H-2', and 2 vicinal CH doublet of doublets both belonged to H-4' (δ 3.18 and 3.05). The 1 H-1 H COSY correlations among them were shown in Fig. 3 . Besides, similar to the 1 H NMR spectrum of 6 and 10, no signal for H-2 or H-3 was found. Those indicated that there was Cys moiety bridging C-2 and C-3 of the quinone ring. It was obvious that 12 was an additional product of Cys to juglone. The position of the Cys moiety was determined by the HMBC correlation of NH-1' to C-1, and H-4' to C-3 (Fig. 3) . Thus, 12 was identified as 9hydroxy-5,10-dioxo-3,4,5,10-tetrahydro-2H-naphtho[2,3-b] [1, 4] thiazine-3-carboxylic acid. All the carbon and proton signals were assigned and listed in Table 2 . Biosynthesis of naphthoquinone derivatives supplemented with amino acids. Analyzing the structures of 1-15, most of the products seem to be the direct additions of amino acids to naphthoquinones. To confirm this, we added L-Glu into the fermentation broth of S. cerevisiae INVSc1 with supplemented 1,4-naphthoquinone to observe whether the yield of 9 increased. As a result, the content of 9 significantly increased, but we also detected it in substrate control (Fig. 4a) . These suggested that these compounds were the additional products of amino acids with naphthoquinones. However, this fact alone was not enough to explain the formation of compound 6, 7, 10, 11, 12, 14 and 15. Among these compounds, 6, 10 and 12 had similar structures and 6 was the simplest one. The directed biosynthesis of 6 was carried out to explore the form of the dihydro-thiazine ring in their structure. Therefore, L-Cys was fed into the fermentation broth of S. cerevisiae INVSc1 with supplemented 1,4-naphthoquinone. After 7 days, the cultures were extracted and subjected to HPLC analysis. As shown in Fig. 4b , the production of 6 significantly increased as expected, compared to the fermentation broth without L-Cys (trace i and ii). However, 6 was also detected in substrate control (trace iii). This result explained the formation of dihydro-thiazine ring in 6, 10 and 12. Compounds 7, 14 and 15 may be the decarboxylation products after Ser molecule attached to 1,4-naphthoquinone or juglone. Considering 7 is the simplest compound among them and produced both by S. cerevisiae INVSc1 and E. coli BL21(DE3), we next tested whether supplement of L-Ser in the fermentation broth with 1,4naphthoquinone can increase the production of 7. Meanwhile, 11 is most likely biosynthesized by decarboxylation after Lys attached to two 1,4-naphthoquinones, so we also added L-Lys monohydrochloride into the fermentation broth of E. coli BL21(DE3) with supplemented 1,4-naphthoquinone to confirm this. The experiments were carried out for 7 days and the results were shown in Figs. 4c-4e. Unexpectedly, the productions of 7 and 11 not significantly increased as that of 6. Instead, 2 additional products, 16 and 17, were produced in significant amount. Besides, we also detected them in substrate control. Thus, 16 and 17 might be Ser and Lys attached to 1,4-naphthoquinone on the C-2 position, respectively. To confirm this, we purified them for structural characterization. Compound 16 was isolated as an orange powder, exhibiting the molecular formula C 13 H 11 NO 5 as established by HRESIMS [M + H] + , m/z 262.0711, which is one CHO 2 unit longer than 7. Consistently, the 13 C NMR spectrum of 16 revealed that there was an additional carboxyl carbon signal at δ 171.44 (Fig. S47) . These indicated that 16 may be Ser attached to 1,4-naphthoquinone on the C-2 positions. The 1 H NMR spectrum of 16 showed several proton signals in the low field, including 4 aromatic proton signals (δ 8.03, 7.95, 7.85, and 7.76), NH doublet (δ 6.98) and CH singlet (δ 5.74) (Fig. S46) , which indicated the presence of a naphthoquinone moiety with NH at C-2. CH multiplet at δ 4. 18-4.26 and CH2 doublet at δ 3.83 that belong to H-2' and H-4′ were observed in the high field. The 1 H-1 H COSY spectrum confirmed the connectivities among them and NH-1' (Fig. 3) . Only one proton signal for H-2' was found, suggesting the substitution of a proton by a carboxyl group at this posi-S20 APPLIED BIOCHEMISTRY AND MICROBIOLOGY Vol. 57 Suppl. 1 2021 tion, which was determined by the HMBC correlation of NH-1' to C-1, C-3 and C-3' (Fig. 3) . Thus, the structure of 16 was elucidated as 2-(1,4-dioxo-1,4dihydronaphthalen-2-ylamino)-3-hydroxypropanoic acid. All the carbon and proton signals were assigned and are listed in Table 2 . Compound 17 was obtained as a red powder, determined the molecular formula as C 26 (Fig. S51) , indicating the presence of a carboxyl group and 2 amino naphthoquinone moieties. In the high field of 1 H NMR spectrum, a methine multiplet (δ 4.11-4.19) and 4 methylene groups (δ 3.19, 1.96, 1.63 and 1.42) that belong to H-2', H-7', H-4', H-6' and H-5' were observed. The 1 H -1 H COSY correlations among them and 2 NH groups were shown in Fig. 3 , suggesting that a long side chain is connected to 2 naphthoquinones via 2 NH groups. Consistently, the 13 C NMR and HSQC spectra of 17 exhibited signals attributable to 2 naphthoquinone moieties, a carboxyl group (δ 173.04), a methine carbon (δ 55.13) and 4 methylene groups (δ 42.06, 30.40, 27.25, and 23.31) (Fig. S52 and S54 ). The carboxyl group was located at C-3′ based on the HMBC correlations of NH-1', H-2' and H-4' to C-3'. The linkages of the long side chain to 2 naphthoquinones were determined based on the HMBC correlations of NH-1' at δ 7.25 to C-1, C-3, C-2', C-3' and C-4', H-2' at δ 4.11 -4.19 to C-2, H-7' at δ 3.19 to C-12, and NH-8' at δ 7.56 to C-11, C-13, C-6' and C-7' (Fig. 3) . Therefore, 17 was identified as 2,6-bis(1,4-dioxo-1,4-dihydronaphthalen-2-ylamino)hexanoic acid. The assigned carbon and proton signals are listed in Table 2 . In order to confirm that 10 was biosynthesized from 6 and Gly, 7 was the decarboxylation product of 16, and 11 was the decarboxylation product of 17, we added the products as substrates into the fermentation broths of microorganisms. Since 10 was a derivative of 6 that contains an additional Gly residue connected to the Cys via an amide bond, 6 and Gly were fed into the fermentation broth of S. cerevisiae INVSc1 to produce 10. However, the peak for 10 was undetectable (Fig. 5a, iii) . Compound 7 can be detected in both fermentation broths of S. cerevisiae INVSc1 and E. coli BL21(DE3), thus, we added 16 into these fermentation broths. The result was shown in Figs. 5b, 5c. As a result, 7 was produced, but the production was low. This result indicated that 7 was indeed converted from 16, but the low titer suggested that 7 might be mainly produced through other pathways. In addition, we also observed 1 in the blank control of 16, suggesting that 1 may be the degradation product of these amino acid naphthoquinones. Compound 11 similarly seemed to be biosynthesized by the decarboxylation of 17, and next we tested the biotransformation of 17 by E. coli BL21(DE3). As shown in Fig. 5d, no peak for 11 was detected, indicating that 11 was not the decarboxylation product of 17. Proposed biosynthetic pathway for the biosynthesis of naphthoquinone derivatives. In this work, we studied the modification of 1,4-naphthoquinone and juglone with the metabolites generated by S. cerevisiae or E. coli chassis cells. Seventeen products, including 6 novel compounds, were identified. As demonstrated in Fig. 7 , we proposed the synthetic pathways for products 1-17. Some compounds, including 2-5, 8 and 9, were derived from naphthoquinones and contained an amino acid residue connected to C-2 of the quinone ring. 2 was formed from Val, 3 from β-Ala, 4 from γ-aminobutyric acid, 5 from Ala, 8 from Gly and 9 from Glu. It was confirmed that compound 9 was generated by the incubation of Glu with 1,4-naphthoquinone. Compounds 6, 10 and 12 share a similar structure, which have Cys moiety bridging C-2 and C-3 of the quinone ring through the amino and sulfhydryl groups to form a dihydro-thiazine ring. The incubation of 6 with Gly did not give 10, yet it was proposed that 10 was derived from the Cys-Gly dipeptide attached to 1,4-naphthoquinone on C-2 position because Cys-Gly dipeptide is generated from the glutathione S-transferase degradation pathway endogenous in S. cerevisiae [21] . Ethanolamine can be generated from the decarboxylation of Ser [22] . Compound 7 is derived from the attachment of ethanolamine to the C-2 position of 1,4-naphthoquinone, whereas 14 and 15 were generated by the attachment of ethanolamine to the C-2 or C-3 position of juglone. The incubation of Ser with 1,4-naphthoquinone gave 16, which was further decarboxylated into 7 by both S. cerevisiae INVSc1 and E. coli BL21(DE3) chassis cells. 16 was nonenzymatically converted to 1, suggesting that 13 was also the degradation product of juglone derivatives, such as 14 or 15. Compound 17 was generated by 2 units of 1,4-naphthoquinone attached to the amino groups of Lys. Incubation of 17 with E. coli BL21(DE3) did not give 11, indicating that 11 was not the decarboxylation product of 17. Free Lys in E. coli may have been directly converted to cadaverine by lysine decarboxylase before synthesizing 11 [23] . Thus, two units of 1,4-naphthoquinone were connected by cadaverine on the amino groups. Biological activities of naphthoquinones and the investigation of potential antiviral activities of 1-17 using molecular docking. 1,4-naphthoquinone pharmacophore is known to impart pronounced biological effects on its derivatives, leading to antitumor, antimalarial, antiviral, molluscidal, antileishmanial, antiproliferative, antibacterial and antifungal activities [2] . Among 17 compounds identified, 9-12, 16 and 17 are novel compounds reported firstly in this study, which have potential to be bioactive agents. Most of the known compounds obtained in this work have been proven to be biologically active compounds. For instance, 1 has various antimicrobial, anticancer and antimalarial activities. As an antimicrobial agent, it showed strong activity against Gram-negative bacteria (Salmonella typhimurium, Acinetobacter baumanii, Citrobacter freundii and Proteus mirabilis), Gram-positive bacteria (Staphylococcus aureus), and the yeast (Candida albicans) [24] . The anticancer activity of naphthoquinone derivatives was tested on pancreatic cancer MIA PaCa-2 cells and some derivatives were reported to be promising anticancer agents [25] . 2 was also demonstrated antimicrobial activity, but it was less active than 1,4-naphthoquinone [26] . Anticancer activities of 3-5 and 8 were previously reported. Importantly, 5 and 8 showed high cytotoxic activity and selectivity against 8 human cancer cell lines [5] . In addition, 8 was also reported to have antimycobacterial activities, but the specificity was low [27] . 7 possessed cytotoxic activity against lung cancer cell line H460. [28] 14 exhibited significant cytotoxicity towards cultured MDA-MB231, HepG2 and SNU638 cells [29] . The antiviral activities of naphthoquinones were also reported. For example, 2 novel naphthoquinones, isolated from a medicinal plant Rhinacanthus nasutus (L.) Kurz (Acanthaceae), exhibited inhibitory activity against cytomegalovirus, with EC50 values of 0.02 and 0.22 μg/mL [30] . Another new naphthoquinone extracted from the same medicinal plant was tested against coxsackievirus B3, human rhinovirus 1B and influenza PR8 virus and exhibited strong antiviral activity with the IC50 value ranging from 0.03 to 23.7 μM [31] . Taldon et al. also investigated the antiviral activities of novel 1,4-naphthoquinone derivatives, 2 of which showed strong antiviral activity against influenza-A and herpes simplex viruses [32] . More interestingly, lapachol and its derivatives, which are naphthoquinone derivatives, were recently tested against SARS-CoV-2 using in silico approaches [6] . In this study, GOLD program was used for molecular docking against Nsp9 of SARS-CoV-2 and lapachol derivative IX showed the highest docking score. Therefore, it was expected for the compounds biosynthesized in this study to possess similar bioactivities. To justify this, in silico approach was simply conducted to screen the potential antiviral activities of compounds 1-17 against Nsp9 of SARS-CoV-2. These compounds were subjected to molecular docking. The druggability and pocket prediction were performed using an online protein binding cavity analyzer (http:// www.pkumdl.cn:8000/cavityplus/computation.php). The most druggable 2 pockets were targeted for molecular docking. The pocket 1 covered the Nsp9specific α-helical GxxxG dimerization interface [16] . The pocket 2 covered the amino acid residues that are actively involved in the RNA binding site of Nsp9. Both pockets are important targets for the inhibition of Nsp9. GxxxG motif is considered important for viral replication and its disruption of Nsp9 of porcine delta coronavirus blocked nucleotide binding capacity [33] . RNA binding site of Nsp9 was reported as a promising drug target to discover potential antiviral compounds. With this approach, potential compounds were identified against dengue virus, Chikungunya virus and SARS-CoV [6] . After specifying the grid boxes based on predicted druggable pockets, protein and ligand preparations were completed and 6W4B were docked against 1-17 using Autodock Vina. Binding energies (kcal/mol) of ligands docked against both pockets are listed in Table 3 . Compound 11 had the highest binding affinity for the pocket 1 of Nsp9, while 17 showed the highest binding affinity for pocket 2. The interaction of these two compounds with their corresponding pockets of Nsp9 was illustrated in Fig. 6 . 11 formed 2 hydrogen bonds to Asn34-B and Asn99-B (-A: on chain A and -B: on chain B) as well as several hydrophobic (Leu5-B, Val8-B, Phe41-B, Phe76-A, Met102-B, Leu104-A, Leu107-A and Ala108-A) and van der Waals interactions (Val8-B, Thr36-B, Cys74-A, Leu89-A, Leu98-B and Leu113-A) (Fig. 6a) . On the other hand, 17 had 3 hydrogen bonds to Arg40-A, Val42-A and Ser60-A and formed a salt bridge to Arg40-A. Several hydrophobic (Arg40-A, Val42-A, Phe57-A, Lys59-A, Ile66-A, Thr68-A and Ile92-A) and van der Waals interactions (Met13-A, Gly39-A Phe48-A and Pro58-A) were also observed for 17. Additionally, Gly39-A formed a carbon-hydrogen bond with 17 at the quinone subunit of symmetrical 1,4-naphtoquinone (Fig. 6b) . To compare our results with the literature, the molecular docking experiments for the lapachol IX derivative was performed and resulted in binding affinities of -6.5 and -7.7 to pocket 1 and 2, respectively. The best docking scores of our compounds for each pocket outperformed this compound. Thus, it is expected that 11 and 17 hold promise to be the antiviral compounds for the treatment of SARS-CoV-2. However, this preliminary finding needs to be confirmed by molecular dynamics simulations, in vitro and in vivo experiments to draw a solid and more reliable conclusion. Our work demonstrated that the metabolites of S. cerevisiae and E. coli chassis cells can modify the polyketides such as naphthoquinones. Hu et al. iso- O lated 2 shunt products in the synthesis of the perylenequinone elsinochrome A by Aspergillus nidulans, they were the 1,2-naphthoquinones adduct of Cys and Cys-Gly dipeptide [34] . Lebeau et al. detected 9 nitrogen-containing azaphilone red pigments from a wild type of Talaromyces sp. 30570 marine-derived fungal strain, 6 among which were occurred only when complex organic nitrogen sources were present in the culture medium. These structures contained an amino acid residues [35] . Sun et al. also demonstrated the influences of exogenous glucose and pyruvate on the polyketide product flaviolin in an engineered E. coli BL21(DE3) [19] . Therefore, as heterologous expression host, some certain intracellular metabolites of S. cerevisiae and E. coli chassis cells may modify the target products to create chemical diversity. This not only provides a new approach for the biosynthesis of novel molecules in the commonly used chassis cells, but also expands the chemical repertoire for the discovery of novel and promising agents with various biological activities such as anticancer and anti-SARS-CoV-2. Molecular Pharming: Applications, Challenges, and Emerging Areas