1. Field of the Invention
This invention relates to the analysis of glycated proteins, and more particularly to a method for qualitative and/or quantitative analysis of one or more glycated proteins in a sample.
2. Description of the Related Art
Glycated proteins are formed by non-enzymatic reactions between reducing carbohydrates (e.g. glucose, fructose, ribose) or derivatives (e.g. ascorbic acid etc.) with terminal amino groups or ε amino groups in lysine and arginine residues. This process must be distinguished from that enzymatically catalysed by glycosyl transferase to synthesise glycoproteins involved in many biological processes. Enzymatic glycosylation is based on the attachment of oligosaccharides to specific protein side chains such as asparagine (N-linked), serine and threonine (O-linked), and the C-termini of cell surface proteins (1). Glycosylation is involved in many biological processes in contrast to glycation which is a completely undesired modification from a clinical point of view.
Due to the crucial role of glucose as an energy source in humans, it is the main circulating sugar and, thus, the most relevant molecule in terms of protein glycation. The mechanisms involved in glycation are illustrated in
The kinetics of the initial glycation process are governed by the formation of the Amadori compound which is a slow step under human physiological conditions (37° C., ˜5 mM blood glucose concentration in healthy subjects) (5). However, this process is enhanced under prolonged hyperglycaemia exposition being one of its pathophysio logical mechanisms of action. In contrast to physiological glucose concentration, chronic supraphysio logical glucose concentration (>10 mM) negatively affects a large number of organs and tissues, such as pancreas, eyes, liver, muscles, adipose tissues, brain, heart, kidneys and nerves. Glucose toxicity is the main cause of diabetic complications, which are often observed only several years after the development of the illness (6, 7). However, chronic hyperglycaemia can also increase the development rate of early diabetic states by affecting the secretion capacity of pancreatic cells, which in turn increase blood glucose concentration. This vicious circle finally leads to the total incapacity of β-cells to secrete insulin (8, 9). That is why glycation has often been related to chronic complications of diabetes mellitus, renal failure and degenerative changes occurring in the course of aging (10-12).
Glycation of proteins is one of the potential mechanisms that are expected to be involved in glucotoxicity due to clinical evidences. Calvo et al. have evaluated the non-enzymatic glycation of high-density lipoprotein (HDL) in type 1 and 2 diabetic patients as compared to that on control healthy subjects. The authors isolated glycated apolipoprotein A-I (ApoA-I) from diabetic patients and compared its lipid binding properties to those of ApoA-I from healthy subjects. They found that ApoA-I glycation promotes a decrease in the stability of the lipid-apolipoprotein interaction and also in its self-association. Therefore, the structural cohesion of HDL molecules is seriously affected by glycation of ApoA-I (13-15). In vivo studies in mice proved that glycated insulin exhibits a reduced ability to stimulate glucose oxidation by the isolated mouse diaphragm muscle. This observation was in concordance with previous studies suggesting that glycation of insulin decreases its potency to stimulate lipogenesis in isolated rat adipocytes. Consistent with such effects, glycated insulin displayed a significantly reduced ability to lower plasma glucose concentrations in mice. These and other studies clearly indicated that glycation results in a significant impairment of insulin action to regulate plasma glucose homeostasis (16).
The glycaemic control of clinical patients is currently assessed indirectly with the conventional test for the analysis of glycated haemoglobin (HbA1c). HbA1c is a long-term indicator of the patient glycaemic state because of the erythrocyte lifespan (˜120 days). HbA1c concentration is a memory effect of blood glucose concentrations over the previous 8-12 weeks (17-20). Other measurements indicative of short-term glucose perturbation are needed in order to understand its potential biological effect, taking into account that any protein could be potentially glycated. Due to the continuous exposure to glucose, the concentration of HbA1c and glycated human serum albumin in plasma from healthy patients has been estimated around 5-7% and 15%, respectively (21, 22). Therefore, the development of methods for identification and quantification of glycated proteins as well as for prediction of new potential targets under different conditions is crucial to elucidate their biological effect.
Glycohaemoglobin (HbA1c) is a long term control indicator in patients with diabetes mellitus. The amount of HbA1c reflects the mean glucose concentration over the previous two to three months (lifetime of red blood cells). Known methods for analysis of HbA1c are based on cation exchange chromatography, affinity chromatography and immune turbidimetry. These methods can be interfered with chemical modification of the haemoglobin, e.g. carbamylation or acetylation. Moreover, many conditions alter HbA1c levels. Any process that shortens erythrocyte lifespan (e.g. kidney disease, liver disease, hemolytic anemia, hemoglobinopathies, and recovery from blood loss) decreases HbA1c as glycation increases with age of the red cell. Also lower HbA1c levels are found in diabetic and nondiabetic pregnant women. Any process that slows erythropoesis such as aplastic anemia will increase HbA1c by causing an older erythrocyte cohort. Current methods for measuring HbA1c levels are therefore not useful for prognosis purposes.
U.S. Pat. No. 7,070,948 discloses a method for assaying glycated protein, comprising treating a glycated protein-containing sample with protease to liberate a glycated peptide from a glycated protein; allowing an oxidase to react with the liberated glycated peptide; and determining the produced hydrogen peroxide.
U.S. Pat. No. 7,183,118 discloses methods for quantitative proteome analysis of glycoproteins, involving immobilizing glycopolypeptides to a solid support; cleaving the immobilized glycopolypeptides, thereby releasing non-glycosylated peptides and retaining immobilized glycopeptides; releasing the glycopeptides from the solid support; and analysing the released glycopeptides. The method can include the step of identifying one or more glycopeptides using mass spectrometry. In one embodiment non-enzymatic glycation in diabetic mice is investigated by labelling serum from normal and diabetic obese mice with light and heavy ICAT (isotope coded affinity tag) reagent.
Recently, Zhang et al. proposed several approaches for the characterization of glycated proteins (23-25). These approaches are based on bottom-up workflows characterized by the implementation of selective and sensitive steps for this application such as an enrichment step for isolation of glycated proteins and/or peptides with boronate affinity chromatography (BAC) and data-dependent mass spectrometry methods. For example, the first of these references describes proteomic profiling of non-enzymatically glycated proteins in human plasma and erythrocyte membranes. Phenylboronate affinity chromatography was used to enrich glycated proteins and glycated tryptic peptides. The enriched peptides were subsequently analysed by liquid chromatography coupled with electron transfer dissociation-tandem mass spectrometry for identification of proteins which have been glycated. Nevertheless, these approaches have been focused on qualitative analysis by identification of glycated proteins and sugar attachment sites. Therefore, it is clear that there is a demand for quantitative methods for analysis of glycated proteins in order to evaluate the glycaemic control of clinical samples or compare patient glycaemic states.
A problem with the above proposals is that they do not enable the qualitative and quantitative analysis of complex mixtures of glycated proteins such as those which occur in the human or animal body. In the present invention we have provided methods which enable such analysis, permitting the identification of glycated proteins, identification of glycation sites within the protein structures, and quantitative assay of the degree of glycation at such sites.
The present invention provides the following:
1. A method for analysis of one or more glycated proteins in a sample, the glycated proteins containing moieties of a natural reducing carbohydrate bound at one or more glycation sites in the proteins, the method comprising:
2. A method for analysis of one or more glycated proteins in a sample, the glycated proteins containing moieties of a natural reducing carbohydrate bound at one or more glycation sites in the proteins, the method comprising:
(a) treating the sample with a stable isotopic form of said carbohydrate which is different in mass from the natural carbohydrate, whereby the isotopic form becomes incorporated by glycation in one or more proteins in the sample;
(b) digesting the proteins in the thus-treated sample to form peptides, at least some of which are glycated by the natural reducing carbohydrate and some by the isotopic form of the carbohydrate at identical glycation sites;
(c) separating the glycated peptides from the non-glycated peptides; and
(d) identifying and/or quantifying the glycated peptides by the difference in mass between the natural carbohydrate and the isotopic form of the carbohydrate at identical glycation sites.
3. A method according to 1 or 2, in which the natural reducing carbohydrate is selected from glucose, fructose, ribose, mannose, ascorbic acid, glyoxal and methylglyoxal.
4. A method according to 3, in which the natural reducing carbohydrate is glucose.
5. A method according to any of 1 to 4, in which the isotopic form of the carbohydrate is the 13C isotope, the 2H isotope or the 18O isotope, preferably the 13C isotope.
6. A method according to 5, in which the natural reducing carbohydrate is 12C6-glucose and the isotopic form is 13C6-glucose.
7. A method according to any of 2 to 6, in which the proteins are digested to form peptides by treatment with an endoproteinase, such as Glu-C, trypsin, Asp-N or Arg-C, or with CNBr.
8. A method according to 7, in which the digestion step is carried out with endoproteinase Glu-C.
9. A method according to any of 2 to 8, in which the glycated peptides are separated from the non-glycated peptides by boronate affinity chromatography, cationic exchange chromatography, isoelectric focusing or reverse phase hplc, preferably by boronate affinity chromatography.
10. A method according to any of 1 to 9, in which the glycated proteins or peptides are identified and/or quantified by mass spectrometry, wherein doublet signals are obtained, corresponding to each glycation site, with a mass shift corresponding to the difference in mass between the natural carbohydrate and the isotopic carbohydrate.
11. A method according to 10, in which the glycated proteins or peptides are quantified by measuring the signal intensity corresponding to glycation with the natural carbohydrate and comparing it to the signal intensity corresponding to glycation with a predetermined quantity of the isotopic form of the carbohydrate at the same glycation site.
12. A method according to 10 or 11, in which tandem mass spectrometry is carried out to identify and/or quantify the glycated peptides, and hence identify proteins from which the glycated peptides have been derived and the glycation sites for a specific protein in the sample, and optionally to quantify the degree of glycation at such sites.
13. A method according to any of 10 to 12, in which the glycated peptides are fractionated by reversed-phase liquid chromatography prior to analysis by mass spectrometry.
Non-enzymatic glycation of proteins is a post-translational modification produced by a reaction between reducing sugars and amino groups located in lysine and arginine residues or in N-terminal position. This modification plays a relevant role in medicine and food industry. In the clinical field, this undesired role is directly linked to blood glucose concentration and, therefore, to pathological conditions derived from hyperglycaemia (>11 mM glucose) such as diabetes mellitus or renal failure. An approach for qualitative and quantitative analysis of glycated proteins is here described to achieve the three information levels for their complete characterization. These are identification of glycated proteins, elucidation of sugar attachment sites and quantitative analysis to compare between glycaemic states. In embodiments of the invention, qualitative analysis can be carried out by tandem mass spectrometry after endoproteinase Glu-C digestion and boronate affinity chromatography for isolation of glycated peptides. For this purpose, two MS operational modes can be used: HCD-MS2 and CID-MS3 by neutral loss scan monitoring of two selective neutral losses (162.05 and 84.04 Da for the glucose cleavage and an intermediate rearrangement of the glucose moiety). On the other hand, quantitative analysis can be based on labelling of proteins with 13C6-glucose incubation in order to evaluate the native glycated proteins labelled with 12C6-glucose. As glycation is chemo-selective, it is exclusively occurring in potential targets for in vivo modifications. This approach, named Glycation Isotopic Labelling (GIL), implemented on a bottom-up workflow enabled to differentiate glycated peptides labelled with both isotopic forms resulting from enzymatic digestion by mass spectrometry (6 Da mass shift/glycation site). The strategy was then applied to a reference plasma sample that allowed detection of 50 glycated proteins and 161 sugar attachment positions with identification of preferential glycation sites for each protein. A predictive approach was also tested to detect potential glycation sites under high glucose concentration.
An innovative method for quantitative analysis of glycated proteins is here presented. In one embodiment, this method is based on differential labelling of proteins with isotopic [13C]-sugars, named Glycation Isotopic Labelling (GIL). The labelling step is performed by natural incubation under physiological conditions mimicking the in vivo glycation process. By this procedure, only potential glycation targets are labelled due to the chemo-selectivity of this process. After labelling, this approach can be implemented to any proteomics workflow based on MS detection as the two isotopic forms of a glycated protein can be discriminated. In this research, the approach has been implemented in a bottom-up workflow for analysis of non-enzymatic glycation of the human plasma proteome. The data analysis can be fully automated and has been performed combining Phenyx MS2 identification (26ref) and SuperHirn MS1 quantification tools (27ref).
The present invention provides a qualitative and quantitative method to assess glycemic control at short and long-term exposure to high glucose concentrations. In contrast to HbA1c analysis methods our alternative is focused on the analysis of the full proteome of the target sample (blood, plasma/serum or other biological fluids). We thus provide an analytical platform to achieve different information levels which are not only quantitative but also qualitative. Our method involves the chemical incorporation of stable isotopes (not affecting amino acids).
As an embodiment of the invention, a semiquantitative approach to analysis of glycated proteins is described. The glycation in this embodiment is with glucose as the natural reducing carbohydrate. Two aliquots of a target sample are incubated for equal or different times with equal or different quantities of, respectively, “light” glucose (in which all six carbon atoms are 12C, also referred to as 12Glu6) and “heavy” glucose (in which all six carbon atoms are 13C, also referred to as 13Glu6). The incubated samples are pooled and digested with a suitable enzyme such as endoproteinase Glu-C. The resulting peptides are separated into non-glycated peptides and glycated peptides by boronate affinity chromatography. Reversed phase fractionation of the peptides is carried out, followed by tandem mass spectrometry and data analysis. The MS data is plotted as abundance (%) against mass/charge ratio (m/z). Glycated peptides are identified by doublet signals separated by a 6 Dalton mass shift per glycation site (or fraction of 6 if the peptide is more than singly charged). This procedure achieves the following:
An absolute quantitative approach can then be adopted as follows. Once target glycated peptides have been identified, there are two strategies to be followed for the assessment of the glycemic control by absolute quantification:
1) spiking a control sample (red blood cells) incubated with 13Glu6 to a test sample at different ratios;
2) spiking target glycated peptides labeled with 13Glu6 (obtained by absolute quantification synthesis, AQUA) to a test sample at different ratios.
As the mass of the different peptides is defined, this analysis can be carried out by LC-MS/MS by multiple reaction monitoring (MRM) which enables the highly selective and sensitive determination of target glycated proteins. Glycated peptides with 13Glu6 act as internal standards. This methodology is useful for clinical prognosis of irregularities in glycemic control caused by exposure to high glucose concentrations. An absolute quantification approach is for the first time proposed for analysis of glycated proteins.
The following Examples illustrate the invention.
This example describes the strategy for purification of glycated peptides, relative quantitation by labelling with stable isotopes and identification of glycated peptides.
A scheme of this quantitative approach is illustrated in
Four conventional proteins (myoglobin, β-lactoglobulin, insulin and lysozyme) were dissolved (100 nanomols of each one) in a buffer containing 100 mM NaH2PO4, pH 7.5 (incubation buffer). The standard was split into two aliquots which were incubated separately with 90 mM “light” and “heavy” glucose for 24 hours at 37° C. in order to simulate physiological conditions. Then, both sets were pooled to subject the resulting mixture to the rest of the analytical workflow. This is initiated by desalting and isolation of proteins using centrifugal filtration units (Microcon with cut-off 3 kDa as nominal molecular weight limit). Proteins were reconstituted in 0.5 M triethylammonium hydrogen carbonate buffer (TEAB, digestion buffer), pH 8.5 and split into five sub-samples (400 μL each one) for enzymatic digestion. The protocol for digestion of one sub-sample started with the addition of 20 μL of 50 mM tris-(2-carboxyethyl) phosphine hydrochloride (TCEP) for 60 min at 60° C. in order to reduce disulfide bonds. Then, iodoacetamide (IAA) at a concentration of 400 mM was added (10 μL) to alkylate thiol groups. The mixture was reacted for 30 min in the dark at room temperature. Freshly prepared endoproteinase Glu-C (1 μg/μL) was added (100 μL to obtain a 1:10 w/w ratio), and the digestion was performed overnight at 37° C. After protein digestion, the resulting solution was dried by evaporation in a speed-vac concentrator and reconstituted in 100 μL of a solution 50 mM MgCl2/200 mM NH4CH3COO at pH 8.1 (adjusted with diluted NaOH). At this moment, this solution contains a mixture of non-glycated and glycated peptides (labelled with “light” and “heavy” glucose).
Following with the proposed workflow, the next step is the separation of glycated peptides from the non-glycated ones. This can be selectively carried out by boronate affinity chromatography (see
With this chromatographic method, selective separation of glycated and non-glycated peptides can be achieved. Both peptide fractions are collected and dried in a speed-vac concentrator. The resulting residue is reconstituted in 1 mL 0.1% trifluoroacetic acid (TFA)/5% CH3CN (v/v) in water for desalting of peptides by using solid-phase extraction cartridges (SPE, Waters Oasis HLB 10 mg cartridges). The SPE protocol consisted of the following steps: (1) Wash of cartridges with 1 ml 0.1% TFA/95% CH3CN in water (twice); (2) Equilibration of cartridges with 1 ml 0.1% TFA/5% CH3CN in water (twice); (3) Addition of sample solution; (4) Wash of cartridges with 1 ml 0.1% TFA/5% CH3CN in water (twice); and (5) Elution of peptides from cartridges with 1 ml 0.1% TFA/50% CH3CN in water. With this protocol, desalting of peptides is ensured as well as removal of polar compounds. The eluted solution is dried and peptides reconstituted with 0.1% TFA (aq) for subsequent analysis by RPLC-MS/MS with an electrospray interface (ESI) as ionization source. The separation was run for 60 min using a gradient from 0.1% TFA/3% CH3CN in water (mobile phase A) to 0.085% TFA/95% CH3CN in water (mobile phase B). The gradient was run as follows: 0-10 min 100% A, then to 90% A and 10% B at 12 min, 50% A and B at 55 min, and 98% B at 60 min at 400 nL/min flow rate.
The labelling with both isotopic glucoses enables the detection of glycated peptides by mass spectrometry as they provide a doublet signal with a mass shift of +6 Da (for singly charged peptides, while it would be +3 Da and +2 Da for doubly and triply charged peptides according to the mass/charge ratio, m/z) per glycation site.
There is no sense in the development of quantitative approaches if this is not supported by qualitative tools focused on the identification of glycated peptides. Tandem mass spectrometry (after RPLC) is an effective tool in this task together with the use of MS/MS fingerprinting identification software (such as Phenyx from Genebio or Mascot from Matrix Science). These software are powerful search engines that use MS and MS/MS data for identification of peptides and proteins from primary sequence databases. In the case of glycation with glucose, the identification is carried out by search of peptides with the addition of 162.0528 mass units (in the mass of the peptide, but also by analysis of its fragmentation pattern in MS/MS). The figure of 162.0528 is derived from the mass of the glucose molecule less the mass of the water molecule which is lost in binding. The identification was carried out by application of two MS/MS methodologies: MS/MS in high collision dissociation energy (MS2) and MS/MS/MS by neutral loss scan (MS3).
An additional level of information is achieved by identification of the glycation sites. The elucidation of the position where glycation takes place is of particular interest to interpret its biological effect on the protein function or turnover. It is possible to identify the glycation site with both MS/MS methodologies. Thus, in MS2 mode the spectra obtained by fragmentation of glycated peptides contain this information that is processed with the fingerprinting software. Concerning the MS3 mode, the data treated for sequencing of the peptides are those obtained in the second fragmentation step. As the glucose moiety is removed in the first MS/MS step, it would not be possible to identify the glycation site. However, if a second neutral loss is simultaneously monitored (loss of 84 mass units by intermediate fragmentation of the glucose moiety), it is possible to know where the glucose was attached. Table 1 shows the glycated peptides together with the attachment sites that were identified with both MS/MS methodologies by analysis of the four proteins standard.
F
VNQHLCGSHLVE
F
VNQHLCGSHLVE
K
: Glycated lysine;
F
: N-terminal glycation;
M
: Oxidized methionin;
C
: Carbamidomethylated cysteine.
Alternatively, this approach can be applied with other reducing carbohydrates (fructose, ribose, mannose, . . . ) or derivatives (ascorbic acid, glyoxal, methylglyoxal, . . . ) by using their “light” and “heavy” forms. In addition the whole analysis could be performed without digestion of the full protein.
This example describes the strategy for purification of glycated peptides, relative quantitation by labelling with stable isotopes and identification of glycated peptides.
This example describes the strategy for discovering and measuring the level of new glycated proteins by spiking a reference protein material (red blood cell lysate, plasma and others) labeled with 13Glu6 to the corresponding patient sample.
The different steps of the analytical workflow to be followed are: (1) Incubation of a reference protein material such as plasma or red blood cell lysate with “heavy” isotopic glucose (defining “heavy” glucose where all carbons correspond to the isotope 13C); (2) spiking the sample of interest from patients (red blood cell lysate, plasma, others) with the corresponding heavy labelled reference protein material; (3) In-solution enzymatic digestion for peptides generation; (4) Separation of glycated and non-glycated peptides by boronate affinity chromatography; (5) As described in Example 1, analysis of the glycated fraction by reversed-phase liquid chromatography (RPLC), tandem mass spectrometry, and, finally, analysis of the data sets by a suite of software tools.
The level of glycohemoglobin is increased in the red blood cells of persons with poorly controlled diabetes mellitus. Since the glucose stays attached to hemoglobin for the life of the red blood cell (normally about 120 days), the level of glycohemoglobin reflects the average blood glucose level over the past 3 months.
This example describes the strategy for measuring the level of Glycohaemoglobin (HbA1c) levels by spiking the N-terminal peptide of Haemoglobin B chain (obtained by absolute quantification synthesis, AQUA) labeled with 13Glu6.
The different steps of the analytical workflow to be followed are: (1) Incubation of the chemically-synthesised N-terminal peptide of Haemoglobin B chain with “heavy” isotopic glucose (defining “heavy” glucose where all carbons correspond to the isotope 13C); (2) spiking the sample of interest (red blood cell lysate, plasma, others) with the heavy labelled N-terminal peptide of Haemoglobin B chain; (3) In-solution enzymatic digestion for peptides generation; (4) Separation of glycated and non-glycated peptides by boronate affinity chromatography; (5) Analysis of the glycated fraction by reversed-phase liquid chromatography (RPLC); (6) Tandem mass spectrometry analysis including multiple reaction monitoring (MRM) for selective quantitation; and, finally, (7) Analysis of the data sets by a suite of software tools.
The level of glycated proteins is increased in the red blood cells and plasma of persons with poorly controlled diabetes mellitus. Since the glucose stays attached to proteins for their life, the level of glycated proteins reflects the average blood glucose level over the past days, weeks and months according to the half-life of each of the proteins.
This example describes the strategy for measuring the level of newly discovered glycated proteins from Example 2 by spiking their glycated peptide (obtained by absolute quantification synthesis, AQUA) labeled with 13Glu6.
The different steps of the analytical workflow to be followed are: (1) Incubation of the chemically-synthesised peptides of the newly discovered glycated proteins with “heavy” isotopic glucose (defining “heavy” glucose where all carbons correspond to the isotope 13C); (2) spiking the sample of interest (red blood cell lysate, plasma, others) with the heavy labelled peptides of the newly discovered glycated proteins; (3) In-solution enzymatic digestion for peptides generation; (4) Separation of glycated and non-glycated peptides by boronate affinity chromatography; (5) Analysis of the glycated fraction by reversed-phase liquid chromatography (RPLC); (6) Tandem mass spectrometry analysis including multiple reaction monitoring (MRM) for selective quantitation of each spiked peptide; and, finally, (7) Analysis of the data sets by a suite of software tools.
Chemicals—Disodium hydrogen phosphate, sodium hydroxide, ammonium acetate, acetic acid, [12C6]-glucose (≧99.5%) and [13C6]-glucose (99 atom % 13C) were purchased from Sigma. Myoglobin from horse heart (≧90%), β-lactoglobulin from bovine milk (˜90%) and insulin from bovine pancreas (powder cell culture tested) were provided by Sigma. Lysozyme from hen egg white (10 500 units mg-1) was from Fluka. These four proteins were used to prepare a multistandard mix in 0.1 M phosphate buffer pH 7.5. Human reference plasma containing 3.8% trisodium citrate as anticoagulant was purchased from Sigma. Plasma was tested and found negative for antibody to HIV-1/HIV-2, antibody to HCV and HbSAg. According to manufacturer, whole blood was collected with anticoagulants (9:1), pooled and centrifuged. The resulting plasma was filtered (0.45 μm) and lyophilized. Triethylammonium hydrogen carbonate buffer (TEAB, 1 M pH 8.5), iodoacetamide (IAA, ≧99%), tris-(2-carboxyethyl) phosphine hydrochloride (TCEP, 0.5 M) and sodium phosphate were from Sigma-Aldrich. Endoproteinase Glu-C from Staphylococcus aureus V8 was from Fluka. Water for chromatography LiChrosolv and acetonitrile Chromasolv for HPLC (≧99.9%) were, respectively, from Merck and Sigma. Superpure ULC-MS formic acid (≧99.9%) was purchased from Biosolve Chemicals (Valkenswaard, the Netherlands) as ionizing agent for LC-MS analysis.
Glucose labelling of a proteins multistandard—Two aliquots of the multistandard of four model proteins (0.125 mg of each protein) in 0.5 ml phosphate buffer were independently incubated with 30 mM [12C6]-glucose and [13C6]-glucose for 24 h at 37° C. Glucose and other salts were removed with Microcon ultrafiltration devices that have an Ultracel® YM-3 regenerated cellulose membrane with 3 kDa molecular weight cut-off (Millipore), followed by a buffer exchange to 0.5 M pH 8.5 TEAB in the same unit according to the manufacturer's instructions. Protein concentration was subsequently measured using the Bradford assay with bovine serum albumin as calibration protein.
Glucose labelling of the reference human plasma—Human plasma was reconstituted in 5 ml water according to the recommended manufacturer protocol. Two aliquots of the reconstituted plasma (50 μl each) in 0.5 ml phosphate buffer were independently incubated with 30 mM [12C6]-glucose and [13C6]-glucose for 24 h at 37° C. Then, each aliquot was separately analysed or were pooled in 1:1 ratio, depending on the analytical purpose, for subsequent analysis with a bottom-up approach. In any case, glucose and other salts were similarly removed by Microcon devices in order to isolate the proteins that were reconstituted in 0.5 M pH 8.5 TEAB. Protein concentration was subsequently measured using the Bradford assay with bovine serum albumin as calibration protein.
Endoproteinase Glu-C enzymatic digestion of proteins—Reconstituted proteins in the case of the multistandard (400 μl) and 1 mg plasma proteins according to Bradford assay (diluted to 400 μl TEAB) were enzymatically digested using endoproteinase Glu-C. For this purpose, cysteine groups were reduced with 50 mM TCEP in water (20 μl) by incubation of the reaction mixtures for 60 min at 60° C. Then, cysteine residues were alkylated with 400 mM IAA (10 μl) for 30 min in the dark at room temperature. Freshly prepared endoproteinase Glu-C (1.0 μg/μl) was added (67 μl to obtain a ratio 1:15 w/w), and the digestion was performed overnight at 37° C. Then, digestion mixtures were evaporated under speed-vacuum and reconstituted in 50 μl mobile phase A (0.2 M NH4Ac/50 mM MgCl2 pH 8.1) for isolation of glycated peptides.
Enrichment of glycated peptides by boronate affinity chromatography—Reconstituted peptides were fractioned by boronate affinity chromatography for isolation of the low-concentrated glycated peptides. For this purpose, the target sample (50 μl) was injected in a Waters HPLC equipped with a TSK-Gel boronate affinity column Tosoh Bioscience (7.5 cm×7.5 mm inner diameter; 10 μm particle size) at room temperature. An isocratic chromatographic method was used for affinity separation that consists of: 1) 0-10 min 100% mobile phase A for retention of glycated peptides by interaction between boronate ligands and 1,2-cis diol groups of glucose moieties, with elution of non-glycated peptides; 2) 10-20 min 100% mobile phase B (0.1 M HAc) for elution of glycated peptides; and 3) 20-30 min 100% mobile phase A for the equilibration of the column to the initial conditions. Both the non-glycated and the glycated fractions were collected for subsequent evaporation and reconstitution in 5% ACN/0.1% formic acid. Then, peptides were desalted and preconcentrated prior to LC-MS/MS analysis. This was carried out with C18 microspin columns (Harvard Apparatus, Holliston, Mass., USA) according to the protocol recommended by the manufacturer, which ends with elution of peptides with 400 μl 50% ACN/0.1% formic acid. This solution was evaporated to dryness for reconstitution with 50 μl 5% ACN/0.1% formic acid.
LC-MS/MS analysis of peptides—Peptides were analysed with a nanoflow HPLC using a Waters NanoAcquity HPLC system (Milford, Mass.) coupled to a hybrid linear ion trap-orbitrap mass spectrometer (Thermo Fisher, San Jose, Calif.) with electrospray ionization in positive mode. The HPLC system included a helium degasser (Michrom SA, Auburn, Calif.). Peptides were trapped on a homemade 100 μm inner diameter 18 mm long precolumn packed with 200 Å (5 μm particle size) Magic C18 particles (C18AO: Michrom) for 12 min. Subsequent peptides separation was on a homemade gravity-pulled 75 μm inner diameter 150 mm long analytical column packed with 100 Å (5 μm particle size) Magic C18 particles (C18AQ: Michrom) and directly interfaced to the mass spectrometer.
For each LC-MS/MS analysis, an estimated amount of 0.5 μg of peptides (0.1 μg/μl) was loaded on the precolumn at 3 ml/min in water/ACN (95/5 v/v) with 0.1% (v/v). After retention, peptides were eluted using an ACN gradient flowing at 220 nl/min with: mobile phase A, water, 0.1% formic acid; mobile phase B, ACN, 0.1% formic acid. The gradient program was as follows: 0 min, A (95%), B (5%); 55 min, A (65%), B (35%); 60 min, A (15%), B (85%); 65 min, A (85%), B (15%); 75-90 min, A (95%), B (5%). The electrospray ionization voltage was applied via a liquid junction using a gold wire inserted into a microtee union (Upchurch Scientific, Oak Harbor, Wash.) located in between the precolumn and analytical column. Ion source conditions were optimized using the tuning and calibration solution recommended by the instrument provider.
Two complementary data-dependent tandem mass spectrometry methods were used for analysis of glycated proteins: MS2 with high-energy collisional dissociation (HCD) as activation mode and MS3 by neutral loss scan with CID as activation mode. In data-dependent HCD-MS2 analysis, fragmentation of the three most abundant precursor ions was carried out on the octopole collision cell attached to the C-trap (normalized collision energy 50 eV) while detection was performed with orbitrap accuracy. The precursor ion isolation window was set to 2 m/z units. MS survey scans were acquired at resolution R=60 000 in profile mode while MS2 spectra were acquired at resolution R=7500. Precursor ions of charge state +2 and higher were included for data-dependent selection. In cases where charge state could not be identified, the most abundant ion was selected for HCD. Data-dependent acquisition was then performed over the entire chromatographic cycle. Data-dependent CID-MS3 neutral loss scan was entirely carried out in the linear trap with three steps: 1) first fragmentation of medium collision energy (35 eV) to promote the cleavage of the glucose moiety (−162.05 Da, that correspond to −81.02 and −54.01 Da for doubly and triply charged peptides, respectively) or an intermediate fragmentation of the glucose molecule (−84.04 Da, that correspond to −42.02 and −28.01 Da for doubly and triply charged peptides, respectively); 2) isolation of the ions in which one of the neutral losses is detected; and (3) fragmentation of the isolated peptide with a medium collision energy (35 eV). Similarly, the precursor ion isolation window was set to 2 m/z units and MS survey scans were acquired at resolution R=60 000 in profile mode. In this case, MS2 and MS3 acquisition was carried out with ion trap resolution. Precursor ions of charge state +2 and higher were included for data-dependent selection. In cases where charge state could not be identified, the most abundant ion was selected for CID. Data-dependent acquisition was then performed over the entire chromatographic cycle.
Data analysis—After data-dependent acquisition, a post-acquisition workflow was initiated specifically for each MS operation mode. For HCD-MS2 experiments, the workflow was based on the detection of precursor ions in an accurate way. This workflow consisted of three major steps. First, peak detection was performed over the entire chromatographic elution profile for each precursor ion scan. This step was performed using the feature-detection software Hardklör (28). During this step, a list of potential monoisotopic precursors for each precursor ion scan was created. Second, tandem mass spectral data were converted into peak lists (.dta files) using the instrument vendor's software (extract_msn.exe; Thermo Fisher). During this step, a .dta file was created for every tandem mass spectrum. This simple text file contains the precursor ion MH+ value and charge state (as assumed by the instrument) in the first line, and then a list of fragment ion m/z values and abundance in the remaining lines. If the charge state was not clearly assigned, extract_msn.exe creates one .dta file for a potential +2 charge state ion and one .dta file for a potential +3 charge state ion. In the last step, the measured precursor ion mass and charge given by the instrument (read from the .dta were compared to all possible precursor ions within a given elution time window and precursor ion transmission window. For our system, a peak elution window of ±6 s of the considered tandem mass spectrum and a precursor ion transmission window of ±1.1 m/z units were used. Potential precursor ion peaks detected in more than one MS spectrum were averaged (geometrical mean) if they were observed within a ±5 ppm tolerance. Then, all possible collected precursor ions MH+ and charge state values were ranked according to their summed correlation values over the considered time window. In those situations, up to three peaks (the three peaks with highest summed correlation values) were used as potential candidate precursor ions. In the situation where no peak was detected in the considered survey scan windows, the m/z value contained in the original .dta file was kept, with charge states +2 and +3. This last step was performed using a Pen script which is available at the Goodlett laboratory website, http://goodlett.proteomics.washington.edu. For MS3 neutral loss experiments, the same workflow was used except the first step since detection was not carried out with orbitrap accuracy. Therefore, peak lists were created with extract_msn.exe from tandem mass spectral data in the second fragmentation step after neutral loss step.
The resulting dta files for both MS operation modes were searched against UniProt-Swiss-Prot/TrEMBL database (Swiss-Prot Release 56.6 of Dec. 16, 2008, 287 050 entries and TrEMBL release 39.6 of Dec. 16, 2008, 4 988 379 entries) using Phenyx 2.6 (GeneBio, Geneva, Switzerland) operating on a local server. No taxonomy was used for the model protein mixture and Homo sapiens was specified for plasma database searching experiments. Common amino acid modifications for both MS operation modes were carbamidomethylation of cysteines and oxidized methionine, which were set as fixed and variable modifications, respectively. For HCD-MS2 experiments, glycation of lysine and arginine residues or on N-terminal positions (162.052 and 168.072 Da for glycated peptides with [12C6]- or [13C6]-glucose) was selected as variable modification. For MS3 neutral loss experiments, a variable modification as a consequence of glucose fragmentation after neutral loss of 84.04 Da (78.01 Da for K, R and on N-terminal positions) was additionally specified. Endoproteinase Glu-C was selected as enzyme, with three potential missed cleavages as maximum. The peptide and fragment ion tolerance depended on the MS operation mode. For HCD-MS2, peptide and fragment ion tolerance was tuned at 6 ppm. In contrast, these values were 1.1 and 0.8 Da for precursor and fragment ions in MS3 neutral loss. In both modes two sequential search rounds were used. In the first round, two missed cleavages were allowed in normal mode. This round was selected in “turbo” search mode. In the second round, three missed cleavages were allowed in half-cleaved mode. The minimum peptide length allowed was six for both rounds. The acceptance criteria were slightly lowered in the second round search. These were for HCD-MS2 experiments: AC score 9.7, peptide Z-score 9.7, peptide p value 1 10−7 for round 1; AC score 9.5, peptide Z-score 9.5, peptide p value 1 10−6 for round 2, corresponding to an estimated false positive ratio of less than 1%. For MS3 in neutral loss experiments, these parameters were changed to AC score 7.0, peptide Z-score 7.0, peptide p value 1 10−6 for round 1; AC score 6.5, peptide Z-score 6.5, peptide p value 1 10−5 for round 2, corresponding to an estimated false positive ratio of less than 1%. False positive ratios were estimated using a reverse decoy database. This estimation was performed using separate searches in the reverse database to keep the database size constant. This involved a slight underestimation of the estimated false positive ratio (29). In case of several matching entries, Swiss-Prot entries were preferred to TrEMBL entries. All data were acquired in triplicate (three analytical injections of the same sample) and analysed in an independent manner.
Peptide quantification—Quantitation of glycated proteins was possible as after enzymatic digestion, the resulting glycated peptides (with addition of 162 mass units) provided doublet signals in precursor MS scan (labelling with light and heavy glucose). The mass shift of the doublet signals depended on the peptide charge and the number of glycation sites. Peptide quantification was carried out by calculation of the ratio between peak areas from extracted ion chromatograms corresponding to both isotopic forms of each glycated peptide. Due to the same physicochemical properties of the two isotopic glycated peptides, these were chromatographically co-eluted providing a doublet signal with a mass shift that depends on the peptide charge and the number of glycation sites. The peptide ratios [12C6]-glucose peptide/[13C6]-glucose peptide were obtained from the average values of intra-run triplicates. As shown in
Qualitative Analysis by Tandem Mass Spectrometry—The complete workflow for the analysis of glycated proteins, shown in
Concerning mass spectrometry, electron transfer dissociation (ETD) (23) and CID in data-dependent MS3 and pseudoMS3 approaches (neutral loss scan and multistage activation, respectively) (24) have proved to be efficient activation modes for identification of glycated peptides. Nevertheless, the use of the orbitrap hybrid mass analyser enables the application of an additional ion dissociation mode, which has not been tested yet for glycation analysis. This is the HCD mode that is characterized by its performance in an additional octopole collision cell attached to the C-trap using nitrogen as collision gas. The use of nitrogen results in a more energetic fragmentation than helium-based dissociation occurring in CID. In addition, HCD is a fast activation mode as compared to CID, which enables to reach high vibrational energies per bond before dissociation of the target molecular ion. As a result, high-quality fingerprinting spectra are obtained which enhances the identification of glycated peptides.
Analysis in MS2 was complemented by MS3 in neutral loss scanning
Quantitative analysis based on the GIL approach—As shown in
The doublet signals are 533.31/536.32 m/z and 624.82/627.83 m/z, with a mass shift of 3 Da, which is indicative of doubly charged glycated peptides. The peptide at 533.31 m/z corresponded to a horse myoglobin glycated peptide while that at 624.82 m/z was identified as a bovine insulin glycated peptide. This experiment was obtained by incubation of the standard composed of four reference proteins with “light” and “heavy” glucose and subsequent pooling with a 1:1 ratio. The intensity of MS signals corresponding to the two versions of the peptide labelled with both isotopic glucose forms was practically the same. Particularly, the ratios between peak areas were 0.965±0.010 and 1.018±0.025 for myoglobin and insulin glycated peptides, respectively. These values were obtained by analysis of three technical replicates.
Tests of the optimized protocol to human plasma—After optimization of the glucose labelling principle, the next step was to test it with a relatively complex biofluid as human plasma. For this purpose, two aliquots of plasma (50 μl each) were independently incubated with 30 mM [12C6]-glucose and [13C6]-glucose for 24 h at 37° C. In this case, each aliquot was analysed separately using the workflow exposed in
Concerning the experiment based on incubation with [13C6]-glucose, the same peptides provided doublet signals that favour their identification. In this case, the signals corresponding to peptides labelled with “heavy” sugar are caused by glucose perturbation mimicked with in vitro incubation. On the other hand, the signals provided by glycated peptides with “light” glucose are indicative of the native concentration of them in plasma.
This experiment enables to validate the principle of isotopic sugar labelling as a possibility for quantitation of glycated proteins and points out two significant applications of this quantitative approach that are subsequently exposed.
Assessment of the native level of plasma protein glycation—The application of the optimized protocol to plasma enables to obtain a global view about the glycaemic state of a potential patient. This analysis provides the profile of glycated proteins identified together with information about glycation sites as shown in Table 2 for the reference plasma used in this research. A total of 35 proteins was found to be glycated in the reference plasma sample without any pre-fractionation step at the protein level. The proposed methodology is able to detect 113 different glycation sites, which is of particular interest as each glycation site could have a different impact on the biological function of proteins. For instance, 35 different glycation sites were identified for HSA. As it was previously indicated, previous studies have identified preferential glycation sites for HSA in Lys residues located in positions 549, 257, 264, 468 and 160. This approach enables to compare the efficiency of the sugar attachment on the different glycation sites. For this purpose, values of the ratio between the peak areas of the in vivo and in vitro glycated peptides (labelled with [12C6]- and [13C6]-glucose) are estimated using extracted ion chromatograms.
Prediction of the glycation site state as response to glucose stimuli—In this study, glucose perturbations were assessed by independent incubation of two plasma aliquots with [12C6]- and [13C6]-glucose. A glucose concentration of 30 mM was selected for incubation mimicking a glucotoxicity perturbation. After incubation, both aliquots were pooled at 1:1 ratio for standardization prior to proteomics analysis following the reported protocol. As shown in Table 3, 50 glycated proteins were identified with this analysis. As compared to the analysis based on exclusive incubation with [13C6]-glucose, 20 new glycated proteins were identified. Additionally, a total number of 161 glycation sites were detected. Most of these identifications corresponded to singly glycated peptides. Nevertheless, it is worth emphasizing the detection of peptides containing two different glycation sites, which were undetectable in the analysis of native glycation. For this reason, they could be considered as potential biomarkers to assess glucotoxicity levels in clinical patients. Concerning data treatment, the signals corresponding to peptides labelled with [13C6]-glucose are representative of the 30 mM glucose stimuli. On the other hand, the signals provided by peptides labelled with [12C6]-glucose are contribution of two different sources: native glycated proteins present before incubation (equal contribution from both aliquots) and those generated as a consequence of the [12C6]-glucose stimuli for 24 h. Therefore, this approach enables to differentiate glycated proteins formed as a result of the glucotoxic perturbation in relative terms. For doubly glycated peptides, we can discriminate between: those in vitro labelled with [12C6]- or [13C6]-glucose as a result of the stimuli and, those that were singly labelled with [12C6]-glucose before the stimuli and are secondly labelled due to the stimuli with [12C6]- or [13C6]-glucose.
This prediction approach enables the assessment of the impact of glycaemic disturbances for the different glycation sites. Table 3 also evaluates the effect of the 30 mM glucose stimuli for each glycated peptide (right column) by comparison with the native glycation as reference. This parameter was calculated with the following expression:
As an example, preferential glycation sites in HSA such as Lys549, Lys264 and Lys257 experienced glucotoxic effect between 36.2 and 56.8% in plasma subjected to 30 Mm glucose exposition for 24 h.
This research describes the development of an application for qualitative and quantitative analysis of glycated proteins in human plasma. There are several reasons that have contributed to the lack of methods for identification and quantitation of glycated proteins. Among them, we have to emphasize the modification of enzymatic digestion patterns and the lack of strategies to detect glycated proteins present in humans at low concentrations. Due to the influence of glycation on trypsin enzymatic digestion, the implementation of an alternative protease such as Glu-C has proved to be an effective way to avoid pattern modifications. In this way, enzymatic specificity can be maintained for identification of glycated peptides by minimizing the generation of missed cleavage sites and half-cleaved peptides. The development of selective and sensitive strategies for the detection of glycated proteins has been accomplished by the advances experimented by mass spectrometry in the last years. Electron transfer dissociation has proved to be an efficient activation mode for identification of glycated peptides by tandem mass spectrometry. Nevertheless, ETD instrumentation is less distributed and frequently characterized by a significant decrease of sensitivity as compared to CID, which was the initial activation mode for analysis of glycated peptides. However, CID-based fragmentation tends to dissociate Amadori compounds (see
In the present study, a combination of a MS2 mode with HCD activation and CID-MS3 by neutral loss scan is proposed for qualitative analysis of glycated proteins. The high accuracy in HCD-MS2 mode for precursor and fragment ions is crucial to achieve a high identification level (37, 38) for characterization of glycation, particularly, if Glu-C is used for hydrolysis. This enzyme enables to generate long peptides such as the glycated peptide shown in
The CID-MS3 mode is a complementary approach to HCD-MS2 as the former is particularly useful for identification of glycated peptides with charge states (+2) and (+3). As an example to evaluate this complementary application, both MS modes were compared in terms of identification of glycation sites. Thus, if a total of 113 different glycation sites were identified in the analysis of plasma, 64% of them were detected with HCD-MS2 and 46.9% with neutral loss scan. These results justify the complementary application of both MS modes in order to increase the identification capability.
The optimization of the overall method was completed by tests to validate the quantitative approach based on glycation isotopic labelling using [13C6]-glucose. These tests were carried out with a standard of recombinant proteins to ensure the absence of glycation. The provided results proved that both isotopic glucose forms possess similar glycation efficiency, which is derived from the peak areas of the extracted chromatograms corresponding to the precursor ions of the [12C6]- and [13C6]-glucose labelled peptides. Evidently, this is a critical aspect for the implementation of isotopic glucose labelling as a quantitative approach.
Application to the human plasma glycated proteome—As it was previously indicated, any protein can be glycated. However, the reference method for the assessment of the glycaemic control of a patient is the measurement of HbA1c concentration. In addition to be exclusively focused on one protein, the erythrocyte lifespan (˜120 days) defines HbA1c as a long-term indicator of the patient state (41-43). It is clearly evident that the overall profiles of glycated proteins represent a more complete indicator of the glycaemic state of a particular patient. This information can be achieved with the approach based on incubation with [13C6]-glucose as this provides indirectly a view about the current glycaemic state of a potential patient. As [12C6]-glucose concentration is not modified a profile of glycated proteins that are present in a target sample is obtained.
The ratio between peak areas corresponding to the peptides labelled with [12C6]- and [13C6]-glucose provides additional quantitative information in relative terms. Peptides labelled with “heavy” glucose are considered as internal standards with the particularity that these isotopic forms are generated mimicking physiological conditions. Therefore, in vitro labelling with [13C6]-glucose depends on the sample properties such as proteins content or pathological factors affecting glycation. The application of this approach is useful to estimate relatively the extent of glycation for each potential attachment site. In addition, the isotopic glucose labelling is valid as a quantitative approach to compare between two glycation states for the same or different patients.
Prediction of the glycaemic state as response to glucose stimuli—The mechanism of the glycation process (see
The predictive approach here proposed is based on the differential labelling with [12C6]-glucose and [13C6]-glucose and considers all glycation possibilities. As glucose labelling is performed by incubation under physiological conditions, glycation of proteins is mimicked in natural terms. As it has been proved, this fact can be employed for the evaluation of the impact of glucose concentrations on identified sites. This information is collected in Table 3 for each identified glycation site, which was obtained by comparison to native conditions. This approach also enables the identification of new glycation targets for a certain glucotoxic incidence, which is of valuable interest for search of biomarkers by application to a specific pathological disorder.
It can be concluded that an approach for qualitative and quantitative analysis of glycated proteins has been here developed to characterize this undesired PTM. Qualitative analysis, by HCD-MS2 and CID-MS3 operational modes, enabled the identification of glycated proteins in plasma as well as the elucidation of glycation sites. The latter is crucial in order to know the effect of the sugar attachment on the biological function of the protein. Quantitative analysis was accomplished by partial labelling of proteins with 13C6-glucose to discriminate from native glycated proteins labelled with 12C6-glucose. Labelling was performed by physiological incubation taking into account the chemoselective character of glycation. The resulting method was tested by analysis of native glycated proteins in plasma as well as predictive analysis of glycation sites under high glucose concentrations, which is of great interest in clinical applications.
Number | Date | Country | Kind |
---|---|---|---|
0815576.4 | Aug 2008 | GB | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/GB2009/051047 | 8/21/2009 | WO | 00 | 2/2/2011 |