key: cord-0737254-tgh2aclk authors: Kim, Jihyun; Novakovic, Mihajlo; Jayanthi, Sundaresan; Lupulescu, Adonis; Kupče, Ēriks; Grün, J. Tassilo; Mertinkus, Klara; Oxenfarth, Andreas; Schwalbe, Harald; Frydman, Lucio title: The Extended Hadamard Transform: Sensitivity‐Enhanced NMR Experiments Among Labile and Non‐Labile (1)Hs of SARS‐CoV‐2‐derived RNAs date: 2022-01-18 journal: Chemphyschem DOI: 10.1002/cphc.202100704 sha: 9a330be9db327dd4dedb79b3471ce4946f7de1d9 doc_id: 737254 cord_uid: tgh2aclk Hadamard encoded saturation transfer can significantly improve the efficiency of NOE‐based NMR correlations from labile protons in proteins, glycans and RNAs, increasing the sensitivity of cross‐peaks by an order of magnitude and shortening experimental times by ≥100‐fold. These schemes, however, fail when tackling correlations within a pool of labile protons – for instance imino‐imino correlations in RNAs or amide‐amide correlations in proteins. Here we analyze the origin of the artifacts appearing in these experiments and propose a way to obtain artifact‐free correlations both within the labile pool as well as between labile and non‐labile (1)Hs, while still enjoying the gains arising from Hadamard encoding and solvent repolarizations. The principles required for implementing what we define as the extended Hadamard scheme are derived, and its clean, artifact‐free, sensitivity‐enhancing performance is demonstrated on RNA fragments derived from the SARS‐CoV‐2 genome. Sensitivity gains per unit time approaching an order of magnitude are then achieved in both imino‐imino and imino‐amino/aromatic protons 2D correlations; similar artifact‐free sensitivity gains can be observed when carrying out extended Hadamard encodings of 3D NOESY/HSQC‐type experiments. The resulting spectra reveal significantly more correlations than their conventionally acquired counterparts, which can support the spectral assignment and secondary structure determination of structured RNA elements. Multidimensional NMR correlations play an essential role in structural and dynamic elucidations of organic and biological molecules. [1, 2] These correlations occur when the polarization is transferred among spins via stochastic processes like chemical exchange or the nuclear Overhauser effect (NOEs), [3] [4] [5] [6] or through coherent J-couplings as in Total Correlation Spectro-scopY (TOCSY) [7, 8] or heteronuclear (HSQC, HMQC) correlations. [9, 10] However, detecting these transfers -particularly homonuclear NOESY cross-peaks -can often be challenged by the relatively low efficiencies of the polarization transfers. This limitation becomes even more acute when detecting crosspeaks involving labile protons. In these cases, inter-site correlations involving the labile protons can be lost by chemical exchanges with the solvent, resulting in weak or no cross-peaks to/from such sites. To alleviate this problem measurements on hydroxyl, amino, imino or even amide protons often need to be performed at low temperatures where solvent exchanges occur more slowly, or signal averaged over long periods of time. Neither of these options is optimal for studying complex biomolecules under physiological conditions. We have recently demonstrated a number of avenues whereby solvent exchanges can be used to enhance correlations involving labile protons. [11] [12] [13] [14] For instance, instead of a single t 1 encoding followed by a mixing time, the peaks of interest can be encoded and cross-correlated by looping multiple encoding/mixing times. [11] During each mixing time, labile protons are then repolarized through exchange processes with an unperturbed water 1 H magnetization, acting as a source for a more efficient polarization transfer. As a result, a sensitivity enhancement is imparted on cross-peaks between labile and non-labile sites, facilitating 2D NOESY and TOCSY correlations. Alternatively, one can target the labile peaks of interest one-byone, with a frequency-selective irradiation. If carried out with continuous irradiations this may be considered as a special case of the selective looped manipulations, as labile repolarizations by solvent exchanges will occur continuously rather than discretely during the inter-loop delays, leading for instance to selective magnetization transfer (SMT) experiments. [14] Although lacking Fourier multiplexing these experiments can, in analogy to Saturation Transfer Experiments (STD), [15] deliver NOE crosspeaks showing an enhanced sensitivity compared to conven-tional methods. Further enhancements of such correlations can be achieved by incorporating into SMT a Hadamard scheme, [16] [17] [18] whereby monochromatic frequency-selective pulses are replaced by polychromatic saturation or polychromatic inversion pulse blocks, perturbing multiple labile peaks in what will become the F 1 frequency domain. As this perturbation is encoded with a Hadamard matrix, pure correlations from each peak are revealed by a suitable Hadamard reconstruction of the data. By replacing the t 1 time-domain Fourier encoding with a Hadamard frequency-domain encoding, the efficiency of the 2D NMR measurement on a sparse system is remarkably increased: in favorable cases, the ensuing the Hadamard magnetization transfer (HMT) scheme can lead to NOE crosspeaks whose signal-to-noise (SNR) per unit time is enhanced by over two orders of magnitude, compared to their conventionally-collected counterparts. Despite the advantages of HMT, the method has a major drawback: it only works reliably when used to correlate nuclei comprising spectrally distinct spin pools. This was illustrated in ref. [14] , which showed that although HMT could provide reliable cross-peaks between different spin pools, it will in general fail when connectivities are sought within the same pool of spins -e. g., when seeking imino-imino correlations within nucleic acids. This problem originates from Hadamard assumption of a linear independence between the weightings imparted to the various peaks during its frequency encoding. In other words, Hadamard encoding assumes that when addressing multiple sites, the effects imparted by the RF on the signal of one site, will not interact with the effects it imparts on another site. This can be more clearly appreciated if considering a basic four-site, four-scans scenario, involving a Hadamardbased saturation of the sites. In such scenario, the Hadamardencoded experiment will be defined by a 4 × 4 matrix c lm f g, where 1 � m � n p indexes the various peaks, and 1 � l � n S indexes the various scans. We write the elements of such matrix as where À 1 means saturation of a given peak m in scan l; + 1 means no saturation (note that the + 1 and À 1 in the Hadamard matrix do not indicate states of magnetization). If no site cross-relaxes with any of the other sites being irradiated, it is straightforward to evaluate the outcome of each measurement: no peaks will be detected in experiment I, peaks A and B will arise in measurement II, and so on. That is To decode the contribution from individual sites a Hadamard reconstruction is then applied, involving summations according to the columns employed in the encoding of the sites. That is: the contribution from each site will arise from combining the signals {S l } in experiments 1 � l � 4, weighted by the � 1 coefficients in the columns of the c lm f g matrix. For instance and so on. Notice the factor of 2 in front of each site, corresponding to the p #scans enhancement that Hadamard multiplexing brings on top of a site-by-site irradiation of the kind happening in SMT. Consider now the situation that arises if setting up the same Hadamard scheme, but among mutually cross-relaxing sites. For example, assume a situation where sites A, B and C cross-relax with one another, while site D does not cross-relax with any other sites. The execution of the Hadamard-encoded experiment described above, will then lead to where the red arrows denote the cross-relaxing sites, and the ΔX Y terms represent a cross-relaxation into site X arising because of the saturation of site Y. The same Hadamard reconstruction procedure mentioned earlier, will now lead to: and so on. Notice that this is not what one would want, as the actual NOE spectrum should only exhibit contributions from the directly cross-relaxing sites; i. e., etc. The terms highlighted in red in Eq. (2) -also shown in Figure 1 -are therefore artifacts arising from cross-talks between the saturated sites. Notice that such artifacts only arise from Hadamard encoding when more than one proton within a cross-relaxing proton network is perturbed (i. e. saturated or inverted) per scan; no artifacts arise in SMT, as in this experiment sites are addressed one by one. [14] As mentioned, however, SMT will not be as efficient as an ideal HMT, which enjoys the advantages of both solvent repolarization effects and of a Hadamard multiplexing. In addition to the continuous saturation approach just analyzed, the Hadamard encoding scheme was also demonstrated on the basis of selective, polychromatic inversions. [12] As in the continuous saturation irradiation case, this inversionbased strategy will involve selective pulses applied at the frequencies of the peaks to be encoded, changing from scan to scan on the basis of a Hadamard encoding. In order to enjoy from the solvent-exchange enhancements arising in the saturation experiments this pulsed inversion approach is repeated a number of times, with a mixing time introduced between inversions so as to fully replenish the original labile 1 H polarization via exchanges with the water 1 Hs (Figure 2a ). It is enlightening to consider whether such inversion-based experiments will also suffer from the above-mentioned artifacts. To do so, we assume again a four-site system, with sites A, B and C cross-relaxing with each other, subject to the aforementioned c lm f g 1�m�n p ;1�l�n S Hadamard encoding matrix. In this case, however, c lm ¼ À 1 will mean an inversion and + 1 means no inversion. The spectral traces that will arise after the Hadamard reconstruction, will then be A Hadamard reconstruction procedure will now lead to: and so on. Notice that in such pulsed inversion experiments (i) sensitivity is doubled vis-à-vis the saturation version as a result of taking differences between positive and negative peaks (instead of between positive and zeroed ones), and (ii) one can now obtain artifact-free cross-peaks from all resonances. The spectrum obtained from a reconstruction based on an HMT encoding. Depending on the crossrelaxation between A, B and C, the latter can provide cross-peak contributions marked in red, that do not involve the pairs of sites being correlated. These contributions can be positive, zero, or negative, and while generally smaller than the real cross-peaks between the two targeted sites, their magnitudes cannot be ascertained or disentangled from the latter. While these calculations suggest that multiple selective inversions enable a full separation of NOEs via Hadamard encoding without cross-talk artifacts, this only holds in the limit of complete spin re-equilibration between scans. Should this not be the case, partial saturations driven by multiple looped inversions may also accumulate, and spread out to unselected (i. e., passive) protons. This will then lead to a situation as described above for Hadamard encoded continuous multiplesite saturations, and hence to cross-talk artifacts. This point is clearly visible in Figure 3 , which shows a series of SMT and HMT experiments carried out on the imino resonances of 5_SL5b + c -an RNA fragment taken from the SARS-CoV-2 viral genome. While artifact-free cross-relaxation traces arise from both SMT or from pulsed HMT encoding experiments carried without looping ( Figure 3 , grey and blue spectra), artifacts in the form of negative cross-peaks are evident in both the continuous saturation and looped inversion versions of the HMT acquisition ( Figure 3 , black and red traces). The reason for this is common and lies in the fact that, as highlighted by the red terms in Eq. (2), Hadamard-encoded saturation is not compatible with a clean Hadamard-decoded separation. Thus, when multiple inversions are applied on sites that do not exchange with the solvent fast enough, polarizations that were inverted may not be fully recovered by the start of the next loop. That's why the narrower resonances in this RNA ( Figure 3 , grey region), corresponding to sites that are in slower exchange with the water, lead to the strongest spurious cross-peaks. By contrast, broader RNA imino resonances corresponding to sites that are exchanging more rapidly with the solvent, lead to artifact-free reconstructions after the full Hadamard procedure (Figure 3b); these sites managed to replenish their magnetizations inbetween every inversion pulse. It follows that, as heretofore presented, neither saturation nor inversion procedures offer a foolproof way of exploiting Hadamard multiplexing in MT experiments within a group of cross-relaxing, exchanging sites. The present study proposes a solution to this problem, that exploits the enhancements arising from exchanges with the solvents, benefits from the multiplexing advantages of Hadamard encoding both for the intralabile (e. g., imino-imino) and labile/non-labile (e. g., iminoamino/aromatic) NOE peak correlations, and does not suffer from the spurious artifacts just described regardless of whether solvent exchanges are fast or slow. The arguments above explain why two distinct MT approaches were previously introduced to enhance sensitivity for crossrelaxation experiments on labile protons in biomolecules: a less . Note that as in HETMAT the imino peaks are selected based on the frequencies of both the 15 N and the 1 H, pseudo-3D 15 NÀ 1 H-resolved NOESY correlations will arise. In all cases, cross-relaxation occurs during these MT processes, while water 1 Hs repolarize the targeted iminos resulting in sensitivity enhanced cross-peaks. Further enhancements are achieved through the Hadamard processing. "dec" refers to GARP4 [19] or adiabatic [20] decoupling used during the encoding and the acquisition for labeled samples; water suppression was achieved using excitation sculpting [21] or WATERGATE-3919. [22] [23] [24] The water suppression block can be replaced with a selective spin-echo if detecting solely correlations among the imino protons. sensitive but generalizable and artifact-free SMT scheme, was proposed for intra-set NOE correlations (as between RNA's imino protons); while a more sensitive but artifact-prone HMT scheme with multiply frequency-selective excitation (saturation or inversion) and Hadamard encoding, was proposed for interset NOE correlations (as between RNA's imino and amino protons). As derived above, however, artifacts may occur whenever multiple protons within a cross-relaxing proton network are perturbed simultaneously, leading to inseparable non-linear responses when using conventional Hadamard encoding to tackle, for instance, imino-imino correlations. To fully disentangle such complex responses, we here introduce an "extended Hadamard" encoding scheme that allows recording artifact-free cross-relaxation experiments on both like and unlike proton groups, with the combined sensitivity enhancement from multiplexing (Hadamard editing) and continuous repolarization via water exchange. Even if other biomolecular scenarios are conceivable, we shall consider for concreteness the imino!imino and imino! amino correlations encountered in the RNA-based examples here presented. We thus consider a system with n I imino protons of initial polarization I, that can have cross peaks with amino protons of initial polarization A. If the i th imino proton is saturated the polarization A of an amino proton 'a' that crossrelaxes with it changes as A!AÀ ΔA i , and if A cross-relaxes with two or more imino protons that are simultaneously saturated we assume that its change in polarization is linear; i. e., if one saturates i and j iminos cross-relaxing with A, then A! AÀ ΔA i À ΔA j . To disentangle the correlations of the n I iminos a series of n S (n S � n I ) scans will be recorded, where each involves the saturation of some, none, or all imino protons. If this saturation proceeds according to Hadamard encoding strategy as described in the Introduction, the experiment will be defined by an All experiments focused on the 5_SL5b + c RNA fragment of SARS-CoV-2, and were carried out at 1 GHz. SMT and regular HMT-saturation spectra (gray and black) were acquired using 800 ms saturation pulses (w 1 =2p = 8 Hz; see Experimental Section). HMT experiments without looping (blue) inverted the selected imino peaks with a sinc pulse (50 Hz bandwidth) followed by 175 ms mixing; in HMT with looping, polychromatic inversion pulses and a 125 ms mixing time were repeated 7 times. Note that artifacts do not appear in the looped pulsed HMT for rapidly exchanging (broad) peaks, but appear when targeting slowly exchanging (narrow) peaks. Notice as well that: (i) despite having equal number of scans the various experiments have remarkably different sensitivities: (ii) in agreement with the original Kupče & Freeman's original Hadamard-encoded NOESY proposition, [17] no artifacts are observed in inversion-based Hadamard without looping. l th scan the m th proton is saturated, and c lm ¼ þ1 otherwise. The signal of the amino proton 'a' observed in scan l will be The DA i contributions arising from the iminos are then disentangled according to the Hadamard strategy in the Introduction, whereby the desired cross peak with amino site 'a' associated to a particular imino proton i, C a i , is obtained as The fact that the numbers of + 1 and À 1 in each column of a standard Hadamard matrix are equal, implies that P n s l¼1 c li ¼ 0 for any column i; i. e., that no relaxed A signal (first term on Eq. (6b)) will contribute to the final spectrum. The orthogonality between any two columns of a Hadamard matrix also implies that P n s l¼1 c li c lm ¼ 0 for m ¼ 6 i, meaning that out of all the terms in the double summation of Eq. (6b) the sole surviving products are those involving m = i. As there are n S such terms it follows that which is as expected from the arguments in the Introduction: only the contribution of the i th imino proton that we assumed cross-relaxing with A shows up, amplified by half the number of scans as a result from the Hadamard multiplexing. Consider now a similar scenario but aimed at establishing the imino-imino cross peaks. To do so we assume a linearity in the site's cross-talks: i. e., we assume that if the q th imino proton is saturated, the polarization of a p th cross-relaxing imino peak I p will change as I p ! I p À ΔI pq ; and that if two (or more) iminos q and r are saturated and cross-relax with proton p, then the change in the latter will be I p ! I p À ΔI pq À ΔI pr . The signal corresponding to the p th imino proton obtained in scan l can then be written as If applying on these signals the same C p q ¼ P l c lq S p l Hadamard decoding strategy as above, the cross-peak at imino site p associated to imino site q becomes As before, the P n s l¼1 c lq ¼ 0 and P n s l¼1 c lq c lp ¼ 0; p6 ¼q conditions imply that all contributions of the initial polarization I p will cancel out, as desired. The inter-imino cross-peaks, however, now involve a new product between three encoding coefficients X n S l¼1 c lq ðc lp þ 1Þðc lm À 1Þ ¼ 0 ; q6 ¼p6 ¼m: Linear and quadratic combinations of these coefficients will cancel out upon summation for every combination of different spin pairs, as desired. To remove all but the pairwise crossrelaxation contributions, one now also demand the additional request a condition that is new, and did not arise when considering imino!amino cross-peaks. Eq. (11) poses a demand that is not fulfilled by Hadamard matrices in their standard form. We therefore sought out a matrix for which this third condition is satisfied for a given n I number of imino sites, while still retaining Hadamard original matrix orthogonality condition. Suitable solutions fulfilling all these three demands are provided by what we denote as extended Hadamard matrices E, which we built from smaller Hadamard encodings by an extension of the original Hadamard matrices, involving a negation of all experiments involved. Thus, for example for n I � 4 sites, an 8 � 4 extended Hadamard matrix E which satisfies all expectations involved by both imino!amino and imino!imino HMT experiments, arises from then performing an extended Hadamard transform will require a 16 �8 matrix where if n I < 8, the encoding strategy calls for using the first n I rows of E 8 . When utilizing any of these extended forms, one can deduce the change in polarization of an amino proton a due to cross-relaxation by an imino proton i, by implementing the sums in Eqs. (5-6) along all of E columns: this is just as in the original Hadamard processing. Similarly, as the conditions P l c lq c lp c lm ¼ 0; q6 ¼p6 ¼m are now also satisfied, the reconstructed cross-peak intensity expected at imino site p due to cross-relaxation with an imino proton q, will be this is only half as efficient as the imino!amino encoding (Eq. (16)), but will now arise without undesirable contaminations. It is clear to envision how a similar strategy, may allow extensions of this approach to the encoding of n I � 9 sites. Notice that by contrast to conventional Hadamard encoding, this extended Hadamard mode calls for recording at least twice the number of experiments (n S ) as sites are present in the correlations. If -as is almost invariably the case -the NOE correlations being sought are a sensitivity-limiting factor and more than the minimum n S = n I number of scans needs to be collected anyhow, then, in practice, this will not be a penalty. This, provided that the additional number of scans demanded by the extended HMT (eHMT) procedure, still enjoys Hadamard multiplexing advantages. For the amino imino correlations this will clearly be the case, as the cross-peak strength after n S scans in Eqs. (7) (HMT) and (16) (eHMT), are the same. As HMT did not work for imino imino correlations, the point of comparison for the latter is the SMT case -which is an on/off magnetization transfer difference experiment carried out on each of the 1 � i � n I imino protons. Since the number of scans in the standard SMT experiment is 2n I (akin to STD, SMT is a difference experiments, and hence requires at least two scans per addressed site), while extended HMT (which, unlike SMT, does not require taking differences against a reference scan) requires n S > n I scans, it is easiest to arrive at a signal-to-noise per scan comparison, by assuming that both experiments are carried out for the same total number of scans 2n I n S -meaning that in the SMT case, n S scans were averaged for each site. Denoting as N 1 the noise in one scan, the SNR of a cross peak at imino site p due to imino site q in the SMT experiment will then be It follows from here that eHMT imino imino enhancement in SNR per unit time (scan), e I , will be It is thus seen that there will be an SNR advantage, e I > 1, for n I > 4. SMT, HMT, eHMT and conventional 2D NOESY experiments were implemented on the 5_SL5b + c RNA fragment derived from the SARS-CoV-2, seeking NOE correlations between the imino protons and other nearby protons at 1 GHz and 283 K. This fragment derives from the 5'-terminal untranslated region (5'-UTR) of the SARS-CoV-2 RNA genome, known to be highly structured and conserved among Betacoronaviruses. [25] The regulatory function of 5'-UTR has been linked to the maintenance of viral replication, balanced transcription of subgenomic mRNAs and translation of viral proteins, and it is being evaluated as potential targets of low molecular weight drugs. [26] As 1 H assignments for this RNA fragment have been reported, [27] experiments were carried out mainly to examine the presence of artifacts and relative sensitivities. As already demonstrated in Ref [14] the most basic of these experiments, SMT, will significantly enhance cross-relaxation peaks when compared to its conventional counterpart; under the assayed conditions (grey vs black traces in Figure 4 ), ca. 4-and 6-fold SNR enhancements per unit time (SNRT) were observed for iminoimino and imino-amino proton correlations, respectively. [28] Regular HMT shows even larger SNRT enhancements for iminoamino cross-peaks, yet clear artifacts are also visible, particularly for the imino-imino cross-peaks. eHMT can solve this problem, providing clean, artifact-free spectra with genuine correlations (red spectra in Figure 4 ). As this RNA fragment contains a total of 12 imino peaks, two sets of E 8 (Eq. [15] ) or three sets of E 4 (Eq. [12] ) eHMT matrices could be used for these acquisitions, encoding 6 or 4 peaks, respectively. For the latter case, the required number of scans (n s ) becomes 3 × 8 = 24; i. e., it requires more scans than a regular HMT acquisition for which n s = 16. In accordance to the derivations above eHMT shows then a lower sensitivity per unit time (or per scan) than regular HMT ( Figure 2 , is replaced with a selective spin-echo optimized to excite/refocus signals solely the imino spectral region. Although correlations on the amino protons' spectral region are then lost, a shorter interscan delay can be used, further increasing sensitivity during the same experimental time. Figure S1 (Supporting Information) compares eHMT imino-imino NOE correlation spectra measured with selective spin-echo (SE) and WATERGATE (WG) pulse Table 1. sequences. Note that the SE-based experiment shows � 2 × higher SNR, though requiring only half of the experimental time than WG-based experiments. Averaged SNR and SNRT enhancements for these experiments are also quantified in Table 1 . These SE eHMT experiments could be particularly useful for revealing sequential connectivity in DNAs and RNAs from imino-imino NOE correlations, which is a first step in determining secondary structures. The eHMT scheme can also be used as starting point for encoding more complex experiments. To illustrate this, the extended Hadamard scheme was adapted to HETMAT NMR, [13] a pseudo-3D acquisition that also relies on magnetization transfers. Unlike SMT/HMT experiments, imino protons of interest are here selected for saturation in a 2D 1 H/ 15 N plane, using a frequency-specific 2D cross-polarization (CP) saturation module (Figure 2b) -leading essentially to a 3D version of HSQC-NOESY. When extending this site-specific saturation scheme to a Hadamard version where multiple peaks are perturbed simultaneously, an additional source of artifact (besides that mentioned above) can arise, stemming from unintended crosstalks between the sites. During the course of these tests we found that the site selectivity afforded by a low-power CP executed with w 1 =2p = 50 Hz fields -which was sufficient in the site-by-site HETMAT experiments we described at 1 GHz [13] became insufficient to provide clean spectra after Hadamard reconstruction ( Figure S2) . Therefore, all extended Hadamard HETMAT (eHETMAT) experiments were performed with CP w 1 =2p = 20 Hz; although this narrower CP would be associated with a sensitivity loss, it might be compensated by the SNR enhancements achieved from the Hadamard scheme. In the original HETMAT, every second scan was a reference scan where CP was applied at a far off-resonance 15 N chemical shift. Only the targeted peak was inverted by subtracting onand off-resonance scans, while the effects arising from the peaks overlapping along either 1 H or 15 N chemical shift were canceled out. By contrast in the Hadamard version of the experiment subtraction occurs between spectra encoded by the Hadamard matrix -without the benefits of an independent, reference scan. If the various peaks being encoded have different 1 H chemical shifts, genuine cross-peaks are easily identified by comparing the resulting spectra, as illustrated in Figure S3 . However, if two peaks to be correlated happen to be overlapping in the 1 H dimension, it is not simple to distinguish among their cross-peaks in the absence of a reference scan. As HETMAT-based tests of the extended Hadamard ideas we therefore decided to exclude overlapping 1 H peaks out of the current discussion, and grouped peaks in the different Hadamard experiments with as little overlap as possible. Supporting Figures S4 and S5 show how these peaks were grouped for the 5_SL5b + c and the larger 5_SL8 RNA fragments of SARS-CoV-2, by marking them with the same color in their 2D HSQC spectra. A full spectrum can then be reconstructed simply by adding the Hadamard sub-spectra. The performance of eHETMAT NOESY experiments is illustrated in Figures 5 and 6 , when targeting 5_SL5b + c and 5_SL8 RNA fragments, respectively. For 5_SL5b + c, a total of 9 peaks were encoded with either three sets of extended Hadamard matrices E 4 (Eq. [12] ), or two sets of E 8 matrices (Eq. [15] ). Using reported assignments, [27] all the diagonal and cross-peaks for 5_SL5b + c could be identified and no artifacts were noticed ( Figure 5 ). These data emphasizes the sensitivity enhancement that extended Hadamard HETMAT NOESY provides over its original proposition; Table 1 lists the average SNR and SNRT performances for both experiments, showing that the addition of Hadamard encoding can enhance SNR by 2-3.5 × , in measurements that have ca. half the acquisition times. This is clearly visible in the 2D contour plot, and in the comparisons among 1D slices obtain with and without the extended Hadamard encoding shown in Figure 5a and in Supporting Figure S6 . These reveal that several imino-imino correlations that are detected in eHETMAT, are invisible in both conventional NOESY/HSQC and in non-Hadamard HETMAT experiments (green labels in Figure 5a ). For the larger 5_SL8 RNA fragment, a total of 12 peaks were selected and encoded into two sets of extended Hadamard HETMAT E 8 matrices. As shown in Figure 6 , all diagonal and cross-peaks were identified by the eHETMAT NOESY experiment without noticeable artifacts, while showcasing superior sensitivity and shorter experimental times than non-Hadamard counterparts. Several additional correlations for peaks that are still lacking an unambiguous assignment (labeled as GA, GB, GC, GD, UA, UB, and UC in Figure 6 ), could also be detected. However, the spectral denseness arising in this relatively large RNA fragment, compromises the resolution of even this pseudo-3D HSQC/NOESY-type acquisition. Further evaluation of structure and dynamics for 5_SL8 will require the even higher dimensional experiments, which we are developing while still using the benefits from the presented eHETMAT approach. An extended Hadamard scheme was introduced to obtain sensitivity-enhanced NOE correlations between labile sites and between labile and non-labile sites, and demonstrated by targeting the imino protons in SARS-CoV-2 RNA fragments. The need to develop these schemes arose from artifacts created by the original HMT, when trying to obtain clean correlations between perturbated labile sites. The extended Hadamard method presented here provides a solution to this problem, allowing one to enjoy both Hadamard multiplexing advantage as well as the CEST-like cross-peak enhancements arising from solvent exchanges. This was achieved by extending the regular Hadamard matrix to fulfill a new condition, P l c lq c lp c lm ¼ 0; q6 ¼p6 ¼m that is not required for detecting other types of Hadamard-encoded correlations. Though requiring at least twice as many experiments as the regular Hadamard Table 1 . Comparisons between these spectra and spectra from 2D HMQC NOESY and regular Hadamard HETMAT NOESY, are shown in Supporting Information ( Figure S6 ). (b) Secondary structure of 5_SL5b + c; dashed lines denote the new correlations observed in extended Hadamard HETMAT between base pairs. encoding, this method also provided substantial SNR/unit_time gains relative to the conventional counterparts. The extended Hadamard scheme was compatible with 1 H-1 H and 1 H-15 N-1 H HETMAT experiments, leading to pseudo-2D NOESY and pseudo-3D HSQC-NOESY experiments. The resulting sensitivity gains revealed new imino-imino correlations that were not visible in conventional or in SMT-based NOESY experiments. Not only imino-imino correlations in RNAs but also correlations between amide protons on proteins could be examined using this method, which would be useful to elucidate their structures. While correlations between hydroxyl protons such as in sugar moieties would not be detectable due to fast exchanging with solvent protons, cross-peaks arising from these protons to other types of protons (CH or NH) would be detectable, expecting the similar SNR enhancements as illus-trated here. The extended Hadamard concepts introduced here could also be included in higher dimensional experiments, providing additional resolution as pseudo-3D or pseudo-4D experiments involving OH or NH protons. This option is currently being explored. Despite the advantages stated above, the extended Hadamard method has drawbacks and limitations. Unlike SMT or HETMAT experiments where saturation/inversion fields, mixing time, and the number of loops to be chosen can be tailored peak-by-peak, Hadamard multiplexing as we have implemented here uses single, common parameters for addressing all selected sites. This may result in a loss of efficiency for transferring polarization through NOE, as equal parameters are applied for both narrow and broad peaks. This problem could simply be solved by separating the eHMT encoding into blocks, Figure 6 . Idem as in Figure 5 , but for the larger 5_SL8 fragment shown in (b). Assignments reported for diagonal peaks are annotated in both panels in black and blue, with the latter excluded from the extended Hadamard HETMAT experiments due to overlap along the 1 H dimension. Unassigned peaks are labeled GA, GB, GC, GD, UA, UB, and UC. The right-hand panels show in green cross-peaks observed by eHETMAT but not in the non-Hadamard experiment. For the HETMAT, RF CP fields ω 1 /2π = 75 Hz with 17 loops and a t NOE = 50 ms were used for addressing the broader peaks, while 50 Hz with 10 loops with t NOE = 80 ms mixing was used for the rest of the peaks. In the Hadamard experiments, RF CP fields ω 1 /2π = 20 Hz with 10 loops and a t NOE = 80 ms were used. 1D slices at selected 15 N chemical shift are shown on top. Note that intensities in 2D contour plot and 1D slices of HETMAT NOESY (black) are multiplied by a factor of 2. The averaged enhancements of SNR and SNRT are summarized in the Table 1 . Comparisons between these spectra and spectra from 2D HMQC NOESY and regular Hadamard HETMAT are shown in Supporting Information Figure S7 . addressing peaks based on their common rates of solvent exchange (i. e., on their linewidths). Two such sets of different parameters should be sufficient to detect cross-peaks with full sensitivity. Otherwise, fine-tuning of the irradiation scheme to have multiple frequencies with distinct strengths, could also be incorporated into the saturation experiment. For the inversion cases -whether dealing with HMT-or HETMAT-based inversions -the application of this scheme might be less useful as it would have to involve tailoring the number of loops for each site, rather than the strength of the irradiation. Still, some of these features are currently being explored. As other frequency-selective experiments also eHMT, like SMT and HETMAT, benefits from performance at the highest possible magnetic fields -where a maximum site resolution is available to separate peaks and to prevent cross-talks, and were larger fields can lead to better inversions/saturations. For instance, a 20 Hz Hartmann-Hahn match shows sufficient selectivity to avoid cross-talks in HETMAT experiments at 1 GHz, but not at 500 MHz ( Figure S8 ). Moreover, dealing with large number of peaks decreases the relative advantage of all these frequency-encoded experiments vs their time-domain counterparts. Solutions to these remaining complications are being sought. Sample Preparation 15 N-labeled 5_SL5b + c and 5_SL8 RNA samples were produced by T7 polymerase-based in-vitro transcription, as described in Ref [27] . RNA purity was verified by denaturing PAA gel electrophoresis and homogenous folding was monitored by native PAA gel electrophoresis. The final concentration of the 5_SL5b + c and 5_SL8 samples in the NMR tubes were 0.7 and 0.8 mM, respectively. NMR experiments were run on a 1 GHz, 23.5 T Bruker Avance Neo spectrometer equipped with a TCI cryoprobe. All pseudo-2D SMT and pseudo-3D HETMAT NOESY experiments were performed following previously described procedures. [13, 14] In SMT, the duration of the saturation pulse was set to 800 ms, with 8 Hz and 30 Hz w 1 =2p fields to saturate narrow and broader imino resonances, respectively. For the HMT experiments, all peaks were saturated with pulses of the same duration but 8 Hz nutation fields. The nutation frequencies and CP duration of the HETMAT experiments were chosen as 50 Hz/14.5 ms or 75 Hz/10.4 ms, depending on the broadness of peaks; 20 Hz/46 ms were used in the eHETMAT experiments. The mixing time and number of loops were optimized depending on the sample, to achieve a maximum NOE: NOESY mixing times ranging from 150-200 ms were utilized in the conventional experiments for both homonuclear 2D NOESY and 2D HMQC-NOESY experiments. Spectra were acquired with 512 scans for SMT and HETMAT; 64 or 128 scans for the Hadamard-based experiments. Each spectrum was apodized with a QSINE window function and Fourier transformed using Topspin software (Bruker Biospin). Importing the peak list into the Hadamard experiment was performed using au-programs "had_pl" and "had_plx" for HMT and Hadamard HETMAT experiments, respectively. All pulses were generating using WaveMaker via the "wvm -a" that is based on the regular Hadamard matrix. For using extended Hadamard matrice, "wvm_x" was used, that generates pulses based on the userdefined Hadamard matrix, saved in a "wvm.had" file by the experiment. Spectra were processed using custom-written auprogram "proc_had" for HMT and "proc_hadx" for Hadamard HETMAT directly in TopSpin 4.0.9. SMT and HETMAT spectra were processed in similar ways, after zero-filling to 512 F 1 points. Supporting Table S1 summarizes the main parameters used in all types of experiments shown in this study. Additional experimental details, including experimental set-up and pulse sequences implementation, can be downloaded from https://www.weizmann.ac.il/ chembiophys/Frydman_group/software. NMR of Proteins and Nucleic Acids Protein NMR Spectroscopy: Principles and Practice We rely in this study on SNR/unit_time rather than on the more customary SNR/sqrt(unit_time) criterion used in Fourier-transform NMR, due to the hybrid Fourier/non-Fourier nature of the various experiments here compared. In any case, Table 1 provides all the information needed to compute either metrics Accepted manuscript online: December 30, 2021 Version of record online This work was supported by the EU Horizon 2020 program ( The authors declare no conflict of interest.Keywords: NMR spectroscopy · Hadamard encoding · NOESY · nucleic acids · SARS-CoV-2 genome