key: cord-0880748-p6dm39p1 authors: Lee, Soo Hyun; Lee, Yeon Kyung; Lee, Sang-Hun; Kwak, Jisung; Song, Hyun Seok; Seo, Minah title: Detection and discrimination of SARS-CoV-2 spike protein-derived peptides using THz metamaterials date: 2022-01-14 journal: Biosens Bioelectron DOI: 10.1016/j.bios.2022.113981 sha: 8ee5c8180760a50ccb8913570a1d466441ebea7a doc_id: 880748 cord_uid: p6dm39p1 The development of effective assay techniques for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has recently received research attention due to its rapid worldwide spread and considerable risk to human health. The receptor-binding domain (RBD) of the spike (S) protein in SARS-CoV-2, a key component for viral entry that has a unique sequence compared to other structural proteins, has been considered an important diagnostic target. In this respect, low-frequency vibrational modes have the advantage of providing information about compositional and structural dependencies at the peptide level. In this study, the sensitive and selective detection of peptides derived from the RBD in SARS-CoV-2 and SARS-CoV was investigated using metamaterial-based sensing chips with a terahertz time-domain spectroscopy (THz-TDS) system. Based on their RBD sequences, two pairs of peptides with 20 residues each were prepared. The sensitivity, specificity, and reproducibility of the proposed system were examined via quantitative analysis using THz metamaterials at three resonance frequencies, and it was found that the species could be discriminated based on their sequences. The THz signals were analyzed with regard to the major amino acid components of the peptides, and the molecular distributions were also investigated based on the hydropathy and net charge of the peptides. The novel coronavirus disease 2019 , caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and first reported in Wuhan, China, in December 2019, remains a significant threat to public health globally Wang et al., 2020; Sun et al., 2020) . Within three months of its emergence, the World Health Organization declared the COVID-19 to be a pandemic. The rapid global spread of COVID-19 is due to several factors. In particular, SARS-CoV-2 predominantly spreads through human-to-human transmission via the inhalation of contaminated saliva, nasal, or aerosols Sun et al., 2020) and presents cold-like symptoms in the early stages of infection, with a long incubation period of around two weeks. Although many strategies have been developed to prevent the transmission of SARS-CoV-2, prompt isolation of infected patients is still regarded as the most effective method (Bhadoria et al., 2020) . However, the exclusion of asymptomatic viral carriers from diagnosis targets has contributed to its continued spread Sun et al., 2020; Zhao et al., 2020) . Currently, SARS-CoV-2 detection primarily relies on nucleic acid amplification and serologybased antigen-binding assays Beig Parikhani et al., 2021; Ji et al., 2020) . However, the reverse transcriptase-polymerase chain reaction requires a relatively long time (i.e., a few hours) for complementary deoxyribonucleic acid (DNA) amplification including pre-/post-treatment, and the antigen-antibody interaction for SARS-CoV-2 has a low accuracy due to the high structural similarity of antigens within the same genus (Morales-Narváez et al., 2020; Niemz et al., 2011; Tai et al., 2020) . Therefore, the development of rapid diagnostic techniques based on sophisticated molecular structures is urgently required. In particular, several strategies for specific viral detection have been recently reported, for examples, the rolling circle replication (Yan et al., 2021; Yue et al., 2021) , DNA methylation (Ganesh et al., 2020) , electrochemical CRISPR (Bruch et al., 2019; Li et al., 2021) , and chemiluminescence resonance energy transfer . For the case of SARS-CoV and SARS-CoV-2, it J o u r n a l P r e -p r o o f is well-known that both take advantage of the angiotensin-converting enzyme 2 (ACE2) receptor for host entry using spike (S) proteins in their membranes Lan et al., 2020) . Differences in their S protein sequences, especially in their receptorbinding domain (RBD), determine their structure, viral transmission, and neutralizing efficiency (Lan et al., 2020; Hoffmann et al., 2020; Krammer et al., 2020; Kim et al., 2021) . For this reason, the S protein represents an ideal target for viral discrimination and subsequent vaccine and therapeutic development. Metamaterial-based terahertz time-domain spectroscopy (THz-TDS) has recently attracted increasing attention, particularly in chemical and biomedical sensing applications (Fan et al., 2014; Choi et al., 2011; Yang et al., 2016; Gong et al., 2020) . Due to its low photon energy irradiation (1 THz, which is equivalent to 4 meV), THz-TDS offers insight into the structural and dynamical characteristics of organic compounds and does not lead to ionization, radiative thermal excitation, or damage (Fan et al., 2014; Choi et al., 2011; Yang et al., 2016) . It scans unique spectral features in the ultrabroad spectrum corresponding to vibrational/rotational modes (referred to as "fingerprints"), overall molecular motion, and phonon vibrations, especially in low frequencies (≤ 100 cm -1 ). These low-frequency vibrations contain information about hydrogen bonds, van der Waals forces, torsion, and hydrophobic interactions, which allow a deeper understanding of biological targets (e.g., proteins, nucleosides, peptides, amino acids (AAs), and DNA) (Yang et al., 2016; Gong et al., 2020; Born et al., 2009; Janek et al., 2014) . Furthermore, metamaterial-based sensing chips with elaborately designed structures at a subwavelength scale (λ/10−λ/10,000) have been developed to enhance the sensitivity and obtain the higher signal-to-noise (S/N) ratio in response to the intrinsically weak molecular absorption within the THz regime (Choi et al., 2011; Lee et al., 2020; Srivastava et al., 2019) . These chips produce a specific resonance frequency (fres) that is tunable by adjusting the geometric parameters of the antenna-type J o u r n a l P r e -p r o o f nanoslot structures. These advantages allow THz-TDS to effectively and rapidly probe and identify biomolecules with high structural similarity even at extremely low concentrations. In this work, we developed nanoscale metamaterial-based THz-TDS for the detection and discrimination of peptides. The THz signals responsible for the low vibrational frequencies can be employed to reveal the properties of individual components. Peptides derived from the S protein in SARS-CoV and SARS-CoV-2 were selected as viral model analytes. The selectivity and specificity of the proposed approach were experimentally and quantitatively examined using THz signals at fres= 1.16, 1.64, and 2.07 THz that were strongly enhanced by the metamaterials. The effects of hydropathy and the net charge of the peptides on the THz features were also explored. In our THz-TDS system, data could be acquired within a few minutes with a sufficiently stable S/N ratio of 70 dB, thus it represents a new type of rapid and accurate sensing platform. Given the sequence similarity of SARS-CoV and SARS-CoV-2, two pairs of peptides from the RBD of their S proteins were individually synthesized (Peptron, Daejeon, South Korea). for human SARS-CoV-2 and AY278741 for human SARS-CoV). The structures of the SARS-CoV-based peptides (i.e., SCRBD-1 and SCRBD-2) and those of the SARS-CoV-2based peptides (i.e., S protein, SC2RBD-1, and SC2RBD-2) were described using 6NB6 and 6XEY from the Protein Date Bank (PDB), respectively. J o u r n a l P r e -p r o o f Nanoslot arrays with a high aspect ratio to enhance the THz signal were fabricated based on a conventional semiconductor process. A six in un-doped silicon (Si) wafer with a high resistivity of > 10 4 Ω•cm was cleaned using a piranha solution (H2SO4:H2O2 = 3:1) and a diluted hydrofluoric acid solution (HF:H2O = 1:100). In order to improve the compatibility with biomolecules, a 300 nm-thick Si dioxide (SiO2) layer was thermally deposited onto the wafer using a furnace. The nanoslot patterns were defined using photolithography, which was followed by metallization (Ti/Au; 10/150 nm) and a lift-off process. A THz-TDS system in transmittance mode was designed based on the Ti:sapphire femtosecond laser with a center λ of 800 nm, a pulse width of 100 fs, and a repetition rate of 80 MHz. To take the measurements, a chamber environment containing a full THz beam path was sealed with purged dried air (relative humidity of less than 1%) to prevent the loss of the THz signal due to water vapor absorption. The THz electric (E)-field was derived from the 30-times-averaged time-domain waveforms, which were then converted into frequencydomain data using the fast Fourier transform. The transmittance (T(ω)) was defined as, where Esample(ω) and Esub(ω) are the E-field transmitted through the THz metamaterials with and without the peptides, respectively. To prepare the samples, each peptide was dissolved in de-ionized (DI) water at different concentrations (0.1, 0.5, and 1.0 mg/mL). DI water was chosen as the solvent to avoid a spectral shift occurring due to the competition between ions and the proteins. A 2 μl aliquot of the solution was dropped onto the surface of the THz metamaterials and then dried for 15 min in a vacuum. J o u r n a l P r e -p r o o f The E-field profiles of the nanoslot metamaterials were theoretically investigated based on the finite element method (FEM) using an electromagnetic (EM) wave module in the commercial software package COMSOL Multiphysics. Details of the simulation parameters are provided in Table S1 . The complex permittivity of the Au layer was extracted from the standard Drude model. Given the regular arrangement of the nanoslots, a periodic boundary condition (BC) was applied to the x-and y-directions. An active port BC at the top of the air domain was used to generate the incident THz wave while a passive port was employed at the bottom of the metamaterial model. A perfectly matched layer with a scattering BC that simulated the infinite free space was employed to pass light out of the domain without reflection. The strategy for sensing and discriminating the peptides from SARS-CoV and SARS-CoV-2 based on the THz metamaterial is illustrated in Fig. 1a . Both belong to the betacoronaviruses (β-CoV) within the coronaviridae, which is a family of enveloped, positive-sense, and singlestranded RNA viruses (i.e., Baltimore class IV). They have significant structural similarity, with an 82% viral genome identity (Naqvi et al., 2020; Lu et al., 2020) . Their RNA genome primarily encodes four proteins: the S protein, envelope protein, membrane protein, and nucleocapsid protein (Beig Parikhani et al., 2021; Ji et al., 2020; Tai et al., 2020; Krammer et al., 2020; Kim et al., 2021; Naqvi et al., 2020) . The S protein, which is highly glycosylated trimeric membrane protein, consists of S1 and S2 subinits, mediating viral binding and membrane fusion, respectively. (Beig Parikhani et al., 2021; Tai et al., 2020; Lan et al., 2020; Kim et al., 2021) . When the virus enters the host, the RBD in the S1 subunit plays a key role in the interaction with the ACE2 receptor. For their RBD, the highly similar sequences (approximately 73%) and secondary structures are observed ( Fig. 1b and 1c ) (Beig J o u r n a l P r e -p r o o f Parikhani et al., 2021; Ji et al., 2020; Wan et al., 2020) . To determine their unique characteristics derived from their AA composition, two pairs of 20 AAs were selected from the RBD of SARS-CoV and SARS-CoV-2. In the comparison, the peptides were paired as SCRBD-1/SC2RBD-1 (yellow highlights) and the SCRBD-2/SC2RBD-2 (blue highlights). The fundamental mechanisms of the nanoslot structure are presented in Fig. 2a . For THz metamaterials, the geometric parameters of the nanoslots are crucial to the overall performance of the resulting sensing device. The length (l) of the slot is related to fres based on the following equation (Lee et al., 2020) : where c0 is the speed of light and n is the refractive index of the substrate. In this study, l was 32, 40, and 60 μm, corresponding to a fres of 2.07, 1.64, and 1.16 THz, respectively. The width (w) of the slot is associated with the trade-off relationship between sensitivity (i.e., E-field enhancement) and the molecule-capturing volume. As w becomes narrower, THz field enhancement increases due to the funneling effect, while the volume containing the target molecules (i.e., the optical hot-spot and the active area) is reduced. To optimize this trade-off, w was set at 500 nm, as demonstrated by our previous studies (Lee et al., 2020; Seo et al., 2009; Lee et al., 2021) . The distance between each slot was set at 40 and 10 μm along the transverse and longitudinal directions, respectively, to obtain highly resolute spectra smaller than the first Rayleigh minimum. To prevent the direct transmission of incident THz waves through the metal film, the thickness of the Au layer was set at 150 nm, which was much greater than its skin depth (δ~ 73, 61, and 55 nm at 1.16, 1.64, and 2.07 THz, respectively). The highly resistive substrate maximized the S/N ratio due to an unaltered n (nSiO2≈ 1.96 and Their THz spectra for a concentration of 1.0 mg/mL averaged from multiple chips are presented in Fig. 2b and 2c. The differences in ∆f and ∆T/T0 for SCRBD-1 and SC2RBD-1 were very small at fres= 1.16 THz but they increased noticeably at a higher fres (Table 1) . These results indicate that peptides with similar secondary backbone structures can be classified using their unique THz fingerprints. At cryogenic temperatures, biosamples generally produce absorption features (α) that reflect the characteristics of the individual molecular components (Neu et al., 2019; Laman et al., 2008; Pan et al., 2017) . With increasing environmental temperature, the transition from the thermally occupied states broadens and slightly shifts the molecular spectral features, leading to information of a lower resolution. In addition, large samples including proteins and peptides with relatively long segments demonstrate no significant optical features (Lee et al., 2017) . In contrast, relatively short segments were employed in the present study, so the influence of major AA components on the THz spectra was able to be considered. For SCRBD-1 and SC2RBD-1, the leucine (Leu) and asparagine (Asn) were analyzed due to their prominent α features in the transmitted THz signals from a pellet mixed with high-density J o u r n a l P r e -p r o o f polyethylene, which is almost transparent in the THz regime (Supplementary Information). Prominent peaks were observed at 0.64, 0.82, 1.43, 1.64, and 2.11 THz for Leu and 1.63 and 2.20 THz for Asn, which is consistent with other reports (Nishizawa et al., 2006; Wang et al., 2009 ). Based on the major AA components, at 1.64 and 2.07 THz, the intrinsic α and corresponding ∆T/T0 of SC2RBD-1 may be higher than those of SCRBD-1. Thus, THz-TDS is able to discriminate between extremely small biomarkers (λ/10,000−λ/100,000) with high structural similarity, thus demonstrating its potential for use in practical bioassays. For effective and reliable early diagnosis, high sensitivity, a low limit of detection (LOD), and high reproducibility are required. To investigate those key parameters, quantitative analysis (0.1, 0.5, and 1.0 mg/mL) of SCRBD-1 and SC2RBD-1 was conducted (Fig. 3a-f ). The average concentration-dependent ∆f and ∆T/T0 for each fres are summarized in Fig. 3g . For all concentrations, similar trends were observed. It was thus possible to distinguish SCRBD-1 and SC2RBD-1, which share similar secondary structures ( Fig. 1b; part of the α-helix and βstrand), even at low concentrations. A highly linear relationship was observed with the concentration with correlation coefficients (R 2 ) of > 98.4% and > 98.6% for SCRBD-1 and SC2RBD-1, respectively, thus indicating the excellent quantitative sensitivity of THz-TDS. Accordingly, the LOD was estimated to be 0.1 mg/mL (i.e., 41.7 μM). The reproducibility of the THz signals was estimated using the standard deviation (SD) of multiple runs. SC2RBD-1 had a relatively large SD compared to that of SCRBD-1, which was related to their molecular distribution. One factor determining the SD is the isoelectric point (pI), which determines the net charge of peptides in relation to the pH of a solvent. Based on data from Thermofisher Scientific (https://www.thermofisher.com/kr/en/home.html), the pI was theoretically expected to be 9.8 and 10.2 for SCRBD-1 and SC2RBD-1, respectively, which is much higher than the pH of the solvent used in the present study (DI water; ~6.8), indicating that the peptides are positively charged (pI > pH). Thus, an attractive force between the peptides and the Au layer was induced. Consistent with this phenomenon, during the drying process, the surface of the peptide films was uniformly polymerized without notable deformations (Supplementary Information). Another factor related to the SD is hydropathy. Hydrophilicity leads to more beneficial molecular solubility, dispersion, and distribution. Among biomolecules, the most typical hydrophilic interactions are hydrogen bonds and salt bridges (Lan et al., 2020) . Polar and amphipathic AAs readily form hydrogen bonds with other polar groups and charged AAs often participate in salt interactions (Lan et al., 2020 , Mermut et al., 2006 Fernández-Carneado et al., 2004) . It has been reported that there are 13 hydrogen bonds and 2 salt bridges at the SC2RBD/ACE2 interface and 13 hydrogen bonds and 3 salt bridges at the SCRBD/ACE2 interface (Lan et al., 2020; Lu et al., 2020; Wan et al., 2020) . In particular, the highly hydrophilic Y and N are observed for SCRBD-1 (N427, N435, Y436, N437, Y438, Y440, and Y442) and SC2RBD-1 (N440, Y449, N450, Y451, N452, and Y453). To compare their wettability properties, the grand average of hydropathy (GRAVY) index, evaluated using the averaged Kyte-Doolittle scale, was calculated. A positive GRAVY value indicates hydrophobicity, while a negative value indicates hydrophilicity. The GRAVY index of SCRBD-1 and SC2RBD-1 was calculated to be -1.41 and -0.94, respectively. Due to the surface tension across the slots, more hydrophilic components penetrate more efficiently into narrow regions, resulting in a lower SD. To investigate the effect of AA composition on the THz signal, 20 additional AAs with similar secondary structures were analyzed. The 20 AAs with the sequences of 447−466 (SCRBD-2) and 460−479 (SC2RBD-2) were selected, respectively, representing the κ-helix (also known as polyproline II, PPII). The dominant AAs within the κ-helix are proline, which J o u r n a l P r e -p r o o f is characterized by the absence of H-donor atoms in its cyclic side chain, and glycine (Fernández-Carneado et al., 2004) . These residues are major AA components in both peptides. For SCRBD-2 and SC2RBD-2, the fres and concentration-dependent THz signals were characterized ( Fig. 4a-f ) and their averages are summarized in Fig. 4g . At all fres, high linearity was observed in the relationship with the concentration, with an R 2 of >0.98 and >0.94 for SCRBD-2 and SC2RBD-2, respectively. For all fres and at 1 mg/mL, prominent increases THz spectra were observed (∆f ≥ 0.05 THz and ∆T/T0 ≥ 15%) for SCRBD-2, while a dramatic decrease was observed for SC2RBD-2 (∆f < 0.03 THz and ∆T/T0 < 13%), despite its similar κ-helix structure. Thus, SCRBD-2 and SC2RBD-2 were successfully discriminated using the THz spectra, though the mechanisms somewhat differ from the case for SCRBD-1 and SC2RBD-1. This large discrepancy can be explained based on the probability of the molecules being captured by the active regions, as illustrated by the following equation Park et al., 2014) : where Ns is the number of AAs in the nanoslot gap and εAA, εair, and εeff are the permittivity of the AA, the air, and the gap without AAs, respectively. In Eq. (2), Ns is the critical variable determining ∆f; thus, the results indicate that significantly fewer SC2RBD-2 molecules were trapped by the slots compared to SCRBD-2. This can be ascribed to the electrostatic repulsive force between SC2RBD-2 and the Au layer. The theoretically expected pI value was about 9.9 and 4.4 for SCRBD-2 and SC2RBD-2, respectively. In this case, both the Au layer filled with numerous free electrons and SC2RBD-2 dissolved in DI water (pI < pH) were negatively charged, and consequently, they repelled one another. This interaction hindered the efficient molecular adsorption and the sensing performance of the metamaterial. For this reason, it is speculated that an inhomogeneous SC2RBD-2 distribution was created when the droplet was J o u r n a l P r e -p r o o f 13 placed on the metamaterial. Once it had completely dried, the formation of an irregular film with crack lines was observed (Supplementary Information). This non-uniform distribution of SC2RBD-2 was also consistent with its relatively large SD when compared to that of SCRBD-2. We successfully compared and analyzed the peptides in the RBD of SARS-CoV-2 and SARS-CoV, which share highly similar secondary structures. Using THz metamaterials (fres= 1.16, 1.64, and 2.07 THz), the spectral fingerprints at low frequencies of the peptides were measured. These fingerprints originated from a variety of structural properties, including the peptides themselves, individual AAs, and hydrogen bonds. The variation and SD of the THz signals (i.e., ∆f and ∆T/T0) were also analyzed based on the hydropathy and net charge of the peptides. For SCRBD-1 and SC2RBD-1, which share similar secondary structures (part of the α-helix and β-strand), the hydrophilic and positively charged peptides led to more effective molecular capture and higher film uniformity. The spectral differences at fres= 1.64 and 2.07 THz were attributed to the number of major AAs (i.e., Leu and Asn), which have strong intrinsic absorption peaks at the measured frequencies. On the other hand, for SCRBD-2 and SC2RBD-2, which share the κ-helix, significant differences in the signals were observed. The repulsive forces between the negatively charged SC2RBD-2 and the metamaterial disrupted effective molecular detection. Based on quantitative analysis, the LOD for all of the peptides was estimated to be 0.1 mg/mL (41.7 μM). Taken together, these results demonstrate the validity of the proposed approach to the detection and classification of biomolecules, even for those within the same genus, at the peptide level. Moreover, the THz system with fast acquisition and high S/N ratio contributed to the rapid trace of molecules within a few minute. Therefore, metamaterial-based THz-TDS analytic tools show great promise for point-of-care and label-free diagnosis in practical applications. A Novel Coronavirus from Patients with Pneumonia in China A novel coronavirus outbreak of global health concern COVID-19: Epidemiology, Evolution, and Cross-Disciplinary Perspectives COVID-19: asymptomatic carrier transmission is an underestimated problem Institutional versus home isolation to curb the COVID-19 outbreak The Inclusive Review on SARS-CoV-2 Biology, Epidemiology, Diagnosis, and Potential Management Options Detection of COVID-19: A review of the current literature and future perspectives The impact of biosensing in a pandemic outbreak: COVID-19 Point-of-care nucleic acid testing for infectious diseases Characterization of the receptor-binding domain (RBD) of 2019 novel coronavirus: implication for development of RBD protein as a viral attachment inhibitor and vaccine Versatile electrochemical biosensor based on bi-enzyme cascade biocatalysis spatially regulated by DNA architecture Rolling Circle Replication for Biosensing, Bioimaging, and Biomedicine Quantum scale organic semiconductors for SERS detection of DNA methylation and gene expression CRISPR/Cas13a-Powered Electrochemical Microfluidic Biosensor for Nucleic Acid Amplification-Free miRNA Diagnostics Electric field-enhanced electrochemical CRISPR biosensor for DNA detection Chemiluminescence resonance energy transfer: From mechanisms to analytical applications Structural basis for the recognition of SARS-CoV-2 by full-length human ACE2 Structure of the SARS-CoV-2 spike receptor-binding domain bound to the ACE2 receptor SARS-CoV-2 Cell Entry Depends on ACE2 and TMPRSS2 and Is Blocked by a Clinically Proven Protease Inhibitor SARS-CoV-2 vaccines in development Neutralizing Human Antibodies against Severe Acute Respiratory Syndrome Coronavirus 2 Isolated from a Human Synthetic Fab Phage Display Library The growth of biomedical terahertz research A terahertz metamaterial with unnaturally high refractive index Biomedical Applications of Terahertz Spectroscopy and Imaging Biomedical applications of terahertz technology Solvation Dynamics of Model Peptides Probed by Terahertz Spectroscopy. Observation of the Onset of Collective Network Motions Terahertz time-domain spectroscopy response of amines and amino acids intercalated smectites in far-infrared region Label-free brain tissue imaging using large-area terahertz metamaterials Terahertz sensing of 7 nm dielectric film with bound states in the continuum metasurfaces Insights into SARS-CoV-2 genome, structure, evolution, pathogenesis and therapies: Structural genomics approach. BBA-Mol Genomic epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding Receptor Recognition by the Novel Coronavirus from Wuhan: an Analysis Based on Decade-Long Structural Studies of SARS Coronavirus Terahertz field enhancement by a metallic nano slit operating beyond the skin-depth limit Direct comparison with terahertz metamaterials and surface-enhanced Raman scattering in a molecular-specific sensing performance Detection of microorganisms using terahertz metamaterials Terahertz Spectroscopy of Tetrameric Peptides High-Resolution Waveguide THz Spectroscopy of Biological Molecules Terahertz spectra of L-phenylalanine and its monohydrate Frequency-Tunable Terahertz-Wave Generation from GaP using Cr: Forsterite Lasers Correlations between Terahertz Spectra and Molecular Structures of 20 Standard α-Amino Acids Situ Adsorption Studies of a 14-Amino Acid Leucine-Lysine Peptide onto Hydrophobic Polystyrene and Hydrophilic Silica Surfaces Using Quartz Crystal Microbalance, Atomic Force Microscopy, and Sum Frequency Generation Vibrational Spectroscopy Potential Peptide Carriers: Amphipathic Proline-Rich Peptides Derived from the N-Terminal Domain of γ-Zein This work was supported in part by KIST