The present invention relates to processing 1H-NMR spectral data.
1H (or proton)-Nuclear Magnetic Resonance (NMR) spectroscopy is a well-established technique used throughout clinical, population-scale, pharmaceutical and agricultural product research for qualitative and quantitative analyses of small molecules (SMs) in complex samples. It is also increasingly used to measure the abundance of larger structures such as lipoprotein species in blood plasma and serum and indirectly estimate NMR-invisible components of biofluids. All these types of measurements are captured in the single most common experiment in metabolomics and clinical research applications, the 1H-NMR one-dimensional general profile experiment with solvent signal suppression (e.g., one dimensional nuclear Overhauser effect spectroscopy, 1DNOESY pulse sequence). However, broad baseline signals arising from macromolecular content (e.g., proteins and lipids abundant in serum/plasma, urine from subjects with pathological conditions, cerebrospinal fluid, saliva, tissues, cell lysates, pancreatic fluid, and food matrices such as milk, oil etc.) can overlap with and even hide the SM signals in a spectrum, inhibiting the efficiency of their deconvolution for annotation and quantification. More importantly, variability in macromolecule concentrations among samples results in baseline fluctuations which hinder the robust determination of SM contribution to diagnostic phenotypic signatures and/or fingerprinting by multivariate analysis (MVA).
These issues may be addressed by physically removing the macromolecules, for example, by ultra-centrifugation with filtering, but the time and cost required for sample processing, potential for introducing procedural variability, and negative impact to the integrity of the sample itself all undermine the key strengths of NMR as a high-throughput, intrinsically precise and non-destructive technique. Instead, the more practical and routinely applied approach is to suppress resonances from macromolecular-derived signals on instrument. This is accomplished by performing an ancillary “spin-echo” experiment such as the Carr-Purcell-Meiboom-Gill (CPMG) pulse sequence which filters macromolecular signals via transverse relaxation times (T2), generating a 1D spectrum of slow relaxing signals, mainly belonging to SMs. The approach is sufficiently reproducible although imperfect in its suppression of broad resonances given the time limit for large cohort studies (e.g., metabolomics) and unsuitable for direct absolute quantification, as the signal integral is modulated by the high variability of T2 values for each proton spin system from each SM. It is also time consuming, contributing substantially to the acquisition time required by standard profiling workflows. The approach is therefore costly, especially at the scale required for the routine analysis of samples from epidemiology cohorts, food industry quality control, and other large-scale applications.
According to a first aspect of the present invention there is provided a computer-implemented method of processing 1H-NMR spectral data. The method comprises receiving 1H-NMR spectral data for a sample, performing a Fourier transform of the 1H-NMR spectral data to obtain Fourier-transformed spectral data, first differentiating the imaginary part of the Fourier-transformed spectral data or processed Fourier-transformed spectral data obtained from the Fourier-transformed spectral data to obtain a first derivative, and storing the first derivative in storage.
This process can be used to obtain an NMR spectrum or set of spectra in which signals from small molecules in the sample are enhanced by suppressing baseline confounding broad 1H-NMR signals arising from macromolecules (such as, for example, proteins, lipids and/or polysaccharides).
The method may comprise processing 1H-NMR spectral data for a set of samples, each 1H-NMR spectral data provided for a respective sample.
The 1H-NMR spectral data (or “1H-NMR spectrum”) may take the form of a free induction decay or other time-dependent signal (or “time-domain signal”). The 1H-NMR spectral data may be expressed in terms of relative intensity as a function of time.
The Fourier-transformed spectral data is a frequency-dependent signal (or “frequency-domain signal”) and may be expressed in terms of amplitude as a function of chemical shift (usually in units of ppm). The first derivative is a frequency-dependent signal.
The imaginary part of the Fourier-transformed spectral data or processed Fourier-transformed spectral data may comprise a set of datapoints and first differentiating the imaginary part of the Fourier-transformed spectral data or processed Fourier-transformed spectral data may comprise, at each of said datapoints, determining a value of a first derivative of the imaginary part of the Fourier-transformed spectral data or processed Fourier-transformed spectral data.
First differentiating the imaginary part of the Fourier-transformed spectral data or the processed Fourier-transformed spectral data may also be referred to as “taking the first derivative of the imaginary part of the Fourier-transformed spectral data or the processed Fourier-transformed spectral data”.
Performing the Fourier transform may comprise performing a one-dimensional Fourier transform of the 1H-NMR spectral data. Performing the Fourier transform may comprise performing a numerical Fourier transform of the 1H-NMR spectral data.
The 1H-NMR spectral data may comprise or consist at least 65,000 (or 216) datapoints.
The Fourier-transformed spectral data may have a spectra width of up to 20 ppm.
The method may further comprise apodizing (or “windowing”) the 1H-NMR spectral data prior to performing the Fourier transform using an apodization function. This can help improve signal-to-noise.
The apodization function may be, for example, an exponential multiplication, Gaussian multiplication or other suitable function for enhancing signal-to-noise ratio. The exponential multiplication may take the form exp [-kt], where k is a line broadening factor and t is time. The Gaussian multiplication may take the form exp [-(kt)2], where k is a line broadening factor and t is time. The line broadening factor k may take a value between 0.1 and 1 Hz.
The method may further comprise processing the Fourier-transformed spectral data to generate processed Fourier-transformed spectral data.
Processing the Fourier-transformed spectral data may comprise performing phase correction on the Fourier-transformed spectral data. Performing phase correction may comprise multiplying the real part of the Fourier-transformed spectral data by sine function and the imaginary part of the Fourier-transformed spectral data by cosine function.
The method may further comprise denoising the first derivative. This can help improve signal-to-noise. Thus, the first derivative stored in storage is a denoised first derivative.
Denoising the first derivative may comprise applying a low-pass filter. The low-pass filter may have coefficients equal to the reciprocal of window span, i.e., the span of the first derivative.
Denoising the first derivative may comprise applying local regression. Applying the local regression may comprise applying local regression using weighted linear least squares and a first-degree polynomial model and/or applying local regression using weighted linear least squares and a second-degree polynomial model.
Denoising the first derivative may comprise applying a Savitzky-Golay filter. Applying a Savitzky-Golay filter generally involves a moving average with filter coefficients determined by an unweighted linear least-squares regression and a polynomial model of specified degree.
The method may further comprise displaying the first derivative (which may be de-noised). The method may comprise displaying a 1H-NMR spectral plot for comparing with first derivative.
According to a second aspect of the present invention there is provided a computer program product comprising a non-transitory computer readable storing or carrying a computer program which comprises instructions which, when executed by at least one processor, causes the at least one processor to perform the method of the first aspect.
According to a third aspect of the present invention there is provided apparatus comprising at least one processor and memory, wherein the at least one processor is configured to receive 1H-NMR spectral data for a sample, to perform a Fourier transform of the 1H-NMR spectral data to obtain Fourier-transformed spectral data, to first differentiate the imaginary part of the Fourier-transformed spectral data or of processed Fourier-transformed spectral data obtained from the Fourier-transformed spectral data to obtain a first derivative and to store the first derivative in memory or storage.
According to a fourth aspect of the present invention there is provided a system comprising storage (for example, a hard-disk drive or solid-state drive) storing one or more sets of 1H-
NMR spectral data, and apparatus for processing 1H-NMR spectral data, the apparatus configured to retrieve the one or more sets of 1H-NMR spectral data from storage and to perform the method of the first aspect on each respective set of 1H-NMR spectral data.
According to a fifth aspect of the present invention there is provided a system comprising a 1H-NMR measurement system for generating 1H-NMR spectral data for a sample and apparatus for processing 1H-NMR the spectral data, the apparatus configured to perform the method of the first aspect using 1H-NMR spectral data from the 1H-NMR measurement system.
Certain embodiments of the present invention will now be described, by way of example, with reference to the accompanying drawings, in which:
A computer-implemented method of (or “approach to”) processing 1H-NMR spectra, herein referred to as “SMolESY” (Small Molecule Enhancement SpectroscopY), is described. According to the SMolESY process, a first derivative of the imaginary part Fourier-transformed 1H-NMR is obtained. The first derivative is herein referred to as SMolESY data or simply SMolESY. The SMolESY process can reliably increase resolution and deplete macromolecular signals directly from the 1H 1D-NMR spectrum with no intensity modulation. The approach relies on mathematical differentiation. By calculating the first partial derivative of the imaginary data of the NMR spectrum, SMolESY yields a profile of small molecules (SMs) free from large molecule signal baseline interference and sample-to-sample fluctuation. As the approach does not rely on T2 or j-coupling constant modulation, the inherent quantitative quality of the conventional 1H-NMR spectrum is preserved. Furthermore, the resolution of SMs derived signals is enhanced by as much as three-fold, enabling the annotation of otherwise overlapping signals and further facilitating their quantification. However, it is also commonly understood that derivatives are prone to instability when applied to signals of very low intensity, and therefore the practical effects of a reduced signal-to-noise ratio (s/n) required evaluation. Herein, we demonstrate that despite the lower s/n, for the case of SMs of biologically relevant complex mixtures, the signal’s limit of detection (LOD) is not functionally affected. To our knowledge, the application of our approach (even in its simplest form of differentiation without combined with any traditional or modern signal denoising filters) to biofluids or complex matrices of large cohort studies, with the view to suppressing signals of macromolecules across entire spectra in a systematic way, has never been reported or tested. Based upon our findings, the SMolESY approach may be used to functionally replace and additionally improve upon several weaknesses of traditionally used spin-echo experiments, particularly in the NMR-based metabolomics field. Before describing a system for processing 1H-NMR data, experiments will be described which may be useful for understanding SMolESY approach.
The approach was applied to 1H-NMR 1D-NOESY spectra from various datasets of varying sample matrix complexity in order to systematically evaluate SMolESY performance.
The first set of 1D-NOESY and CPMG spectra were generated from a series of pure human serum albumin solutions at concentrations designed to span and exceed those found in normal human blood (
Validation of SMolESY signal integrity and intra-metabolite reproducibility: application to free and very low macromolecular content matrices. The second dataset was generated from synthetic mixtures of metabolites in varying concentrations (hereinafter described in the section Artificial mixtures preparation in Materials and Methods) designed to enable assessment of SMolESY fidelity across a comprehensive set of 1H-NMR peak shapes and multiplicities (e.g. triplet, quartet etc.). The relationship between 1D-NOESY and SMolESY peak integrals (
For the third dataset, SMolESY spectra were produced from a collection of 3020 1D-NOESY profiles of human plasma samples from two different cohorts (2026 plasma-heparin and 994 plasma-EDTA samples) (hereinafter described in Plasma - Urine spectra employed for the present study in the Experimental section) so as to increase sample content variability and compared to their corresponding CPMG spectra. Pearson correlation between the SMolESY and CPMG spectra showed that 73% of transformed peaks were highly correlated with r > 0.90 (
such as urea, and very low abundance unknown metabolites (1.5 < signal-to-noise ratio < 2.2,
(
There are two other characteristics for helping successful implementation of SMolESY to -omics and analytical studies are spectral binning and absolute quantification. These were addressed using NMR experiments where 17 common biological metabolites in various known concentrations were spiked-in to a real plasma matrix to provide a SM profile against a constant macromolecular background (described hereinafter in Artificial mixtures preparation-Spiking experiments in the Experimental section).
Comparison between spectral bins of SMolESY and CPMG spectral bins indicated a strong linear correlation for all spiked metabolites (R2 > 0.98) (
To facilitate the implementation of SMolESY in both targeted (direct metabolite signal integration), untargeted (profiling/fingerprinting) and quantitative NMR (qNMR) applications, a “SMolESY_platform” can be used for producing and processing SMolESY data from raw NMR spectra. SMolESY_platform is hereinafter described with reference to
The compromising effect that common macromolecules (proteins, lipids and polysaccharides) exhibit on individual quantitative SM measurements and on the broader SM profile has yet to be adequately addressed. Consequently, modern standard protocols for biofluid, cell extracts, food and other rich in macromolecules complex mixtures analysis rely on a sequence of experiments, each of which is individually flawed in application to the most common of biofluids (e.g. blood products). Whereas 1D-NOESY cannot be used for MVA, CPMG cannot be used for accurate and reliable quantification. Here we demonstrate that the computational transformation of the standard 1D 1H-NMR experiment yields both high fidelity spectral SM profiles and data from which quantitative chemical measurements can be extracted.
Systematic evaluation of SMolESY clearly demonstrates its ability to cleanly suppress macromolecular signals in synthetic test cases (albumin titration), common agricultural products (milk and oil), and human plasma. In all cases, the suppression of macromolecular signals resulted in the enhancement of SM-derived information from the SMolESY’s ability to reproduce the SM-derived information captured by the 1D 1H-NMR with high fidelity, ensuring the transformation is not detrimental to SM signals. SMolESY implementation both on a large cohort of more than 3000 individuals’ plasma samples and > 100 urine samples showed an outstanding reproducibility with virtually no loss of metabolic information. Although the approach does risk decreasing the s/n of very broad signals such as those from highly exchangeable and/or interacting protons of small molecules (e.g. urea), generally such signals are of low fidelity in 1H-NMR analyses unless specific experiments or sample preparation procedures are employed. This risk can be further mitigated by applying smoothing algorithms such as traditional or advanced approaches for signal denoising on the SMolESY data acquired by our toolbox.
Additionally, our method helped to the recovery of some signals in crowded regions of the spectrum and areas where macromolecule signals appear. This, combined with the expected enhancement to spectral resolution when calculating signal derivatives, facilitates the chemical assignment of SMs by increasing the analytical specificity. Furthermore, the linear mathematical transformation preserves the quantitative aspects of the data, given the appropriate calibration and reference signal from reference compounds or electronically produced by the PULCON experiments. SMolESY can therefore be used directly for absolute quantification without the need for complex and computationally expensive deconvolution algorithms typically applied to 1D experiments and unlike spin-echo pulse sequence experiments altogether. Importantly, these improvements can also be realized post hoc by retrospective application of SMolESY to existing 1D 1H-NMR raw spectra. This could be of major importance for NMR analysis of sample types with low physiological macromolecular content (e.g. urine) for which spin-echo experiments are not routinely acquired, yet which occasionally or in pathological Moreover, SMolESY is also readily applicable to historical datasets increasing its value and making them comparable with new processed datasets. Its application only requires high resolution 1H-NMR data (> 65 k data points) input which is the established norm within modern high-quality metabolomics and analytical studies.
SMolESY is well suited for the enhancement of small molecule profiles in NMR spectra derived from complex sample types exhibiting broad and confounding macromolecular signals. In its simplest form as a partial derivative, 1D 1H-NMR spectra are transformed yielding effective suppression of macromolecular signals and enhanced clarity and resolution of small molecule signals. The quantitative capacity of the original data is preserved and, despite variable reductions in the s/n measured across the spectrum, the total chemical information recovered from SMolESY is greater than that from CPMG (demonstrated in human plasma and serum as major biological matrices of interest). Thus, the validation set presented herein and applications to various sample types establish SMolESY as a functional in silico replacement for the routine CPMG experiment (or other spin echo variants).
The approach may further enable higher throughput sample preparation procedures by precluding the removal of macromolecules from sample types where such preparation is routine practice (e.g. for the NMR study of various food matrices). SMolESY is therefore of major significance in biomedical research, food industry, environmental sciences and indeed any other applications where 1H-NMR is applied to chemically complex samples with abundant macromolecules. The approach is particularly pertinent for large cohort studies where up to 30% acquisition time could be saved compared to the conventional NMR-metabolomics pipeline. Since 1H-NMR is emerging as the dominant technique for large scale application to biofluid analysis (e.g. supporting molecular epidemiology and biobanking efforts) and increasingly used for routine quality control assessment of agricultural products, we believe the time and cost savings provided by SMolESY will support the future application of NMR in these contexts.
Herein, the numerical differentiation (first derivative) of spectral data was calculated by the “gradient” function integrated in MATLAB programming suite (MathWorks, version R2019b). In general, the first derivative of a signal is the rate of change of y (i.e. intensity data) with x (ppm data), dy/dx, which in practice is the slope of the tangent to the signal at each point across the ppm axis. It has been shown that the maximum intensity of a signal in the derivative spectrum is inversely proportional to its linewidth to the half-height (Δv½), and therefore very broad signals are significantly suppressed while the Δv½ of sharp signals are further narrowed (see below). Importantly, and in contrast to differentiation of the real data, the 1st derivative of the imaginary data yields positive peak intensities (> o baseline, see below) owing to the gradient of all signals described by the imaginary data. Derivative spectroscopy could enhance the resolution of a signal, whereas a broad signal could be completely attenuated, which could be easily described in the following equations.
Assuming a Fourier transformed Lorentzian signal f(x) across a specific frequency region equals to:
where Iδ is the maximum intensity of the signal at a specific chemical shift (δ) and Δv½ is the linewidth at the half-height of the signal, the 1st derivative of f(x) is:
From Eq.2, it can be seen that signals with large Δv½ (e.g. broad signals of macromolecules) are highly suppressed to zero (Iδ ~ o), whereas sharp signals (i.e. small Δv½ values) are extra sharpened, thus enhancing spectral resolution.
The 1st numerical derivative of the real data from an NMR spectrum (after Fourier transform and phase correction), produces an antisymmetric signal (positive on one side and negative on the other) (
All reagents employed for the artificial mixtures of metabolites, spiking experiments and buffers composition were purchased from Sigma Aldrich.
All scripts for the correlation analyses were coded in the MATLAB programming suite (MathWorks, version R2019b). The linear regression analyses, statistical comparisons between slopes and intercepts (i.e. F-tests), as well as their plotting, were performed by Prism 8 (GraphPad Software, Inc, 2019). Multivariate statistics was performed using the MATLAB based PLS Toolbox (Eigenvector Research, Inc., Manson, WA, USA 98831, version 8.7.1 (2019) software available at http://www.eigenvector.com).
The albumin concentrations in the NMR samples were o, 7.5, 15.0, 37.5, 75.0, 150.0 and 225.0 mM. The selected metabolites and their concentration for the initial artificial mixture were: cytidine (1 mM), benzoic acid (1 mM), citric acid (0.25 mM), caprylic acid (1 mM), L-isoleucine (0.375 mM), creatinine (0.5 mM), L-glutamic acid (0.5 mM), L-glutamine (0.625 mM), hippuric acid (0.625 mM), L-phenylalanine (0.8 mM), and L-tryptophan (0.375 mM). Artificial mixtures of small MW metabolites contained 50% plasma buffer (see below the plasma buffer composition) and 50% of the aqueous mixture of metabolites in different concentrations. After the initial mixture, 8 sequential (and equal) dilutions were performed, resulting in 9 different samples. Among all the metabolites we depict (
The spiked 17 metabolites in a real plasma sample along with their 11 different concentrations are summarized in Table I below.
The total number of plasma (> 3200) and urine (~ 100) NMR samples were prepared following the established standard operating procedures for metabolomics analyses. Namely, the plasma NMR samples consisted of 50% plasma buffer [75 mM Na2HPO4; 6.2 mM NaN3; 4.6 mM sodium trimethylsilyl [2,2,3,3-d4]propionate (TMSP) in H2O with 20% (v/v) 2H2O; pH 7.4] and 50% of blood plasma and urine NMR samples consisted of 10 % urine buffer [1.5 M KH2PO4 dissolved in 99.9% 2H2O, pH 7.4, 2 mM NaN3 and 5.8 mM 3-(trimethylsilyl) propionic acid-d4 (TSP)] and 90 % of urine. The cow milk sample was prepared following the same protocol used for blood products and additional centrifugation cycle was required in order to remove extra fat content. The olive oil sample was prepared by diluting the sample 10 % in deuterated chloroform.
Solution 1H NMR spectra of all samples were acquired using a Bruker IVDr 600 MHz spectrometer (Bruker BioSpin) operating at 14.1 T and equipped with a 5 mm PATXI H/C/N with 2H-decoupling probe including a z-axis gradient coil, an automatic tuning-matching (ATM) and an automatic refrigerated sample changer (Sample-Jet). Temperature was regulated to 300 ± 0.1 K and 310 ± 0.1 K for urine and plasma samples, respectively. For each blood NMR sample, 3 NMR experiments were acquired in automation: a general profile 1H NMR water presaturation experiment using a one-dimensional pulse sequence where the mixing time of the 1D-NOESY experiment is used to introduce a second presaturation time, a spin echo edited experiment using the Carr-Purcell-Meiboom-Gill (CPMG) pulse sequence which filters out signals from fast T2 relaxing protons from molecules with slow rotational correlation times such as proteins and other macromolecules, and a 2D J-resolved experiment. Each experiment had a total acquisition time of approximately 4 minutes [32 scans were acquired for the 1DNOESY (98,304 data points, spectral width of 18,029 Hz) and the 1D-CPMG (73,728 data points, spectral width of 12,019 Hz) experiments while 2 scans and 40 planes were acquired for the 2D J-resolved experiment].
For each urine sample 2 NMR experiments were acquired as previously published in A. C. Dona et al., Anal. Chem., 2014, 86, 9887-9894 which is incorporated herein by reference. Free induction decays of all 1D-spectra were multiplied by an exponential function equivalent to 0.3 Hz line-broadening before applying Fourier transform. All Fourier transformed spectra were automatically corrected for phase and baseline distortions and referenced to the TSP singlet at 0 ppm.
In the present study, we randomly selected ~3000 plasma 1H-NMR spectra (both 1D-NOESY and CPMG) from the National Phenome Centre repository, previously acquired for various clinical and epidemiology phenotyping studies. Of these, approximately 1000 plasma spectra corresponded to plasma-EDTA samples, and the rest (~2000) to heparin plasma samples. Both heparin and EDTA collected plasma samples were selected for the SMolESY validation in order to highlight the broad applicability of this approach.
Urine 1D-NOESY 1H-NMR spectra were taken from a publicly available study (available at Metabolights, accession number: MTBLS694).
Signal-to-noise ratios (s/n) of selected 1H NMR signals from the CPMG and SMolESY NMR profiles, namely, from L-Alanine, Glucose, L-phenylalanine and formic acid were calculated as the ratio of peak intensity at maximum height to the standard deviation of the noise for each of the 3020 CPMG and SMolESY plasma spectra. Noise was calculated as indicated in A. Rodriguez-Martinez et al., Anal. Chem., 2017, 89, 11405-11412 which is incorporated herein by reference. The selected signals resonate at different spectral areas with variable amount of noise and exhibit different multiplicities. The whole number of “peak-picked” signals was calculated by using the “findpeaks” Matlab function, implementing as threshold the calculated level of noise for the CPMG and SMolESY spectra, respectively.
Principal component analysis (PCA) was performed for the 1DNOESY and the SMolESY 1H-NMR urine datasets, after excluding H2O spectral region (4.7-4.84 ppm) for the spectral width 0.5-10 ppm. Both 1D-NOESY and SMolESY 1H-NMR spectra were calibrated to TSP signal (singlet) at 0 ppm. No binning was applied to the data, all data points for each spectrum were used as variables. For the PCA analysis, NMR data was mean centered and no normalization was applied so as to have intact signals contribution to the PCA analysis.
In order to facilitate the implementation of SMolESY data into NMR metabolomics analyses and NMR-based analytical deconvolution of complex matrices content, we created a graphical user interface toolbox (SMolESY_platform) for the SMolESY data generation, exportation as well as pre-treatment for any metabolomics pipeline. Our software enables the loading of the ordinary biofluids 1D-NMR spectra and their transformation to SMolESY along with the visualization of both 1H-NMR and SMolESY spectra for any comparison. Furthermore, it offers the opportunity for the calibration of SMolESY data to a reference peak (for example, to the anomeric doublet resonance of glucose at ~5.25 ppm for plasma/serum/cerebrospinal fluid/pancreatic juice). In addition, the SMolESY_platform provides: i) a semi-automated alignment and integration of SMolESY signals for absolute quantification (i.e. targeted approach) and ii) a variable shaped binning algorithm for untargeted metabolomics studies (i.e. diseases fingerprinting etc.). Both signals and bin-tables (i.e. bucket tables) integration values can be exported for further statistical. The SMolESY_platform is described in more detail with reference to
To calculate the absolute concentration of each metabolite from the SMolESY spectra, we simply integrated the signals taking the transformed ERETIC signal (i.e. in our case it was the Quantref electronic signal generated by Bruker Biospin) as a reference.
Referring to
The system 1 includes an 1H-NMR measurement system 2 and an 1H-NMR data processing system 3.
The 1H-NMR measurement system 2 can be used to perform 1H-NMR spectroscopy on a sample 3 held in a sample holder 4.
The 1H-NMR measurement system 2 includes a magnet field generator 5 which includes magnetic poles 6 and windings 7 which can be driven by a magnetic field controller 8. The sample 3 subjected to a magnetic field which can be swept. The 1H-NMR measurement system 2 also includes a first coil 9 which can be driven by an rf transmitter 10 so as to subject the sample 3 to rf excitation and a second coil 11 wound around the sample holder 4 to detect an rf signal emitted by the sample 3. The second coil 11 is connected to a receiver 12 which may include an amplifier (not shown) and an ADC (not shown) for decimating the received signal. A controller 13, typically in the form of a computer system, is used to control the magnet field controller 5 and rf transmitter 10, and to collect data from rf receiver 12. An example of a suitable 1H-NMR measurement system 2 is the Bruker IVDr 600 MHz spectrometer (Bruker BioSpin).
The 1H-NMR data processing system 3 can be used to process 1H-NMR spectral data 14 generated by 1H-NMR measurement system 2. In this example, the 1H-NMR measurement system 2 is connected to the 1H-NMR data processing system 3 by a network 15 (for example, a LAN or the Internet). The systems 2, 3, however, need not be connected and data 14 may be transferred in other ways. For example, data may be transferred via removeable storage (not shown).
The 1H-NMR data processing system 3 is a computer system, which may take the form of a workstation, desk-top computer, lap-top computer or other sufficiently powerful computing device.
Referring also to
The computer system 3 includes one or more central processing units (CPUs) 31 having respective memory caches (not shown), system memory 32, a graphics module 33 in the form of a graphics card, which includes a graphics processing unit (GPU) 34 and graphics memory 35 (which may be referred to as “video RAM”), connected to one or more displays 36, and an input/output (I/O) interface 37 operatively connected by a bus system 38.
The I/O interface 37 is operatively connected to bus and/or network interface(s) 39 (such as Ethernet interface or WLAN interface) for receiving the 1H-NMR spectral data 14. The I/O interface 37 can also be connected to and control the 1H-NMR measurement system 2.
The I/O interface 37 is also operatively connected to user input devices 40 (such, as a keyboard, mouse and/or touch screen) and the storage 41, for example, in the form of one or more hard disk drives and/or solid-state drives. Some peripheral devices, such as removable storage, and other computer components are not shown. The computer system 3 may have a different configuration from that shown in
Storage 41 also holds computer software 42 for processing 1H-NMR spectral data 14 using the method herein described and for storing Fourier transform data 43 and SMolESY spectra 44 (herein also referred to as the “first derivative”).
Referring also to
Optionally, the processor(s) 31 may perform apodization (or “windowing”) by applying an apodization (step S2). For example, the apodization function may be an exponential multiplication, Gaussian multiplication or other suitable function for enhancing signal-to-noise ratio. Apodization may be selected by a user.
The processor(s) 31 performs a Fourier transform of the (apodized) 1H-NMR spectral data 14 to obtain Fourier-transformed spectral data 43 (step S3). The Fourier-transformed spectral data 43 includes real and imaginary parts.
Optionally, the processor(s) 31 may perform phase correction (step S4). This may, for example, involve multiplying the real part of the Fourier-transformed spectral data 43 by a sine function and the imaginary part of the Fourier-transformed spectral data by a cosine function. Phase correction may be selected by the user.
The processor(s) 31 extracts the imaginary part of the Fourier-transformed spectral data 43 (step S6) and performs first differentiation of the data 43 to obtain a first derivative 44 (or “SMolESY spectrum”) (step S6).
Optionally, the processor(s) 31 may perform de-noising of the first derivative 44.
Referring to also to
Other forms of denoising can be used such as applying local regression or applying a Savitzky-Golay filter. Applying the local regression may comprise applying local regression using weighted linear least squares and a first-degree polynomial model and/or applying local regression using weighted linear least squares and a second-degree polynomial model. Applying a Savitzky-Golay filter generally involves a moving average with filter coefficients determined by an unweighted linear least-squares regression and a polynomial model of specified degree.
De-noising may be selected by the user.
The SMolESY data 44 (i.e., the (de-noised) first derivative 44) is then stored in storage 41 (step S8). The SMolESY data 44 (can optionally be displayed (step S9). Additionally, or alternatively, the SMolESY data 44 can be transmitted to a remote location via the network interface 39.
Referring to
The user interface 50 includes first, second, third, fourth and fifth input regions 51, 52, 53, 53, 54, 55 for receiving user input and first, second and third output (or “display”) regions 56, 57, 58 for display spectra or portions of spectra.
The first input region 51 (“Directories - Input file - Transformation” region) includes buttons 61, 62 for selecting folders for importing and exporting data, a button 63 for instructing the performing a transformation of free induction decay data set into SMolESY spectrum, a check box 64 for selecting glucose and a button 65 for instructing the system 3 to export SMolESY data.
The second input region 52 (“Plotting Tools”) includes buttons 71, 72 for instructing the system 3 to plot NMR spectra and SMolESY, a check box 73 for selecting linking plots and data cursor controls 74.
The third input region 53 (“Calibration - Integration” region) includes buttons 81, 82 for instructing the system 3 to check selected region and to calibrate signals, a check box 83 for selecting data for bucketing, a field 84 for providing a bucket size, a check box 85 for selecting quantification, a button 86 for instructing the for instructing the system 3 to integrate, buttons 87, 88 for instructing the system to export the integral and to accumulate quantitive integral.
The fourth input region 54 (“peak Peaking tools for calibration - integration (manual version” region) includes a button 91 to instruct the system to peak pick, buttons 92, 93 for instructing the system (and showing whether) peak picking is on or off, input fields 94, 95, a button 96 for instructing the system to reset peak picking data.
The fifth input region 55 (“OPTION (multi-Peaks calibration - Integration) (manual version” region) button 101 for selecting folders for importing an input file and buttons 102, 103 for instructing the system 3 to select previous and next peaks, a button 104 for instructing the system to plot and a button 105 for instructing the system to reset the plot.
The first display region 56 can be used to display the NMR spectra.
The second display region 57 can be used to display the SMolESY spectra.
The third display region 58 can be used to display calibrated peaks from the SMolESY spectra.
It will be appreciated that various modifications may be made to the embodiments hereinbefore described. Such modifications may involve equivalent and other features which are already known in the design, manufacture and use of NMR systems and component parts thereof and which may be used instead of or in addition to features already described herein. Features of one embodiment may be replaced or supplemented by features of another embodiment.
Although claims have been formulated in this application to particular combinations of features, it should be understood that the scope of the disclosure of the present invention also includes any novel features or any novel combination of features disclosed herein either explicitly or implicitly or any generalization thereof, whether or not it relates to the same invention as presently claimed in any claim and whether or not it mitigates any or all of the same technical problems as does the present invention. The applicants hereby give notice that new claims may be formulated to such features and/or combinations of such features during the prosecution of the present application or of any further application derived therefrom.
Number | Date | Country | Kind |
---|---|---|---|
2006494.5 | May 2020 | GB | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/GB2020/053215 | 12/15/2020 | WO |