Specific peptides derived from subsequences of the Epidermal Growth Factor Receptor protein and which will be referred to as EGFR, and which can also be referred to as the receptor tyrosine-protein kinase ErbB-1 protein, are provided. The peptide sequence and fragmentation/transition ions for each peptide are particularly useful in a mass spectrometry-based Selected Reaction Monitoring (SRM), which can also be referred to as a Multiple Reaction Monitoring (MRM) assay, and will be referred to as SRM/MRM. The use of one such peptide for SRM/MRM quantitative analysis of the EGFR protein is described.
This SRM/MRM assay can be used to measure relative or absolute quantitative levels of one or more of the specific peptides from the EGFR protein and therefore provide a means of measuring the amount of the EGFR protein in a given protein preparation obtained from a biological sample by mass spectrometry.
More specifically, the SRM/MRM assay can measure these peptides directly in complex protein lysate samples prepared from cells procured from patient tissue samples, such as formalin fixed cancer patient tissue. Methods of preparing protein samples from formalin-fixed tissue are described in U.S. Pat. No. 7,473,532, the contents of which are hereby incorporated by references in their entirety. The methods described in U.S. Pat. No. 7,473,532 may conveniently be carried out using Liquid Tissue™ reagents and protocol available from Expression Pathology Inc. (Rockville, Md.).
The most widely and advantageously available form of tissues from cancer patients tissue is formalin fixed, paraffin embedded tissue. Formaldehyde/formalin fixation of surgically removed tissue is by far and away the most common method of preserving cancer tissue samples worldwide and is the accepted convention for standard pathology practice. Aqueous solutions of formaldehyde are referred to as formalin. “100%” formalin consists of a saturated solution of formaldehyde (this is about 40% by volume or 37% by mass) in water, with a small amount of stabilizer, usually methanol to limit oxidation and degree of polymerization. The most common way in which tissue is preserved is to soak whole tissue for extended periods of time (8 hours to 48 hours) in aqueous formaldehyde, commonly termed 10% neutral buffered formalin, followed by embedding the fixed whole tissue in paraffin wax for long term storage at room temperature. Thus molecular analytical methods to analyze formalin fixed cancer tissue will be the most accepted and heavily utilized methods for analysis of cancer patient tissue.
Results from the SRM/MRM assay can be used to correlate accurate and precise quantitative levels of the EGFR protein within the specific tissue samples (e.g., cancer tissue sample) of the patient or subject from whom the tissue (biological sample) was collected and preserved. This not only provides diagnostic information about the cancer, but also permits a physician or other medical professional to determine appropriate therapy for the patient. Such an assay that provides diagnostically and therapeutically important information about levels of protein expression in a diseased tissue or other patient sample is termed a companion diagnostic assay. For example, such an assay can be designed to diagnose the stage or degree of a cancer and determine a therapeutic agent to which a patient is most likely to respond.
The assays described herein measure relative or absolute levels of specific unmodified peptides from the EGFR protein and also can measure absolute or relative levels of specific modified peptides from the EGFR protein. Examples of modifications include phosphorylated amino acid residues and glycosylated amino acid residues that are present on the peptides.
Relative quantitative levels of the EGFR protein are determined by the SRM/MRM methodology for example by comparing SRM/MRM signature peak areas (e.g., signature peak area or integrated fragment ion intensity) of an individual EGFR peptide in different samples. Alternatively, it is possible to compare multiple SRM/MRM signature peak areas for multiple EGFR signature peptides, where each peptide has its own specific SRM/MRM signature peak, to determine the relative EGFR protein content in one biological sample with the EGFR protein content in one or more additional or different biological samples. In this way, the amount of a particular peptide, or peptides, from the EGFR protein, and therefore the amount of the EGFR protein, is determined relative to the same EGFR peptide, or peptides, across 2 or more biological samples under the same experimental conditions. In addition, relative quantitation can be determined for a given peptide, or peptides, from the EGFR protein within a single sample by comparing the signature peak area for that peptide by SRM/MRM methodology to the signature peak area for another and different peptide, or peptides, from a different protein, or proteins, within the same protein preparation from the biological sample. In this way, the amount of a particular peptide from the EGFR protein, and therefore the amount of the EGFR protein, is determined relative one to another within the same sample. These approaches generate quantitation of an individual peptide, or peptides, from the EGFR protein to the amount of another peptide, or peptides, between samples and within samples wherein the amounts as determined by signature peak area are relative one to another, regardless of the absolute weight to volume or weight to weight amounts of the EGFR peptide in the protein preparation from the biological sample. Relative quantitative data about individual signature peak areas between different samples are normalized to the amount of protein analyzed per sample. Relative quantitation can be performed across many peptides from multiple proteins and the EGFR protein simultaneously in a single sample and/or across many samples to gain insight into relative protein amounts, one peptide/protein with respect to other peptides/proteins.
Absolute quantitative levels of the EGFR protein are determined by, for example, the SRM/MRM methodology whereby the SRM/MRM signature peak area of an individual peptide from the EGFR protein in one biological sample is compared to the SRM/MRM signature peak area of a spiked internal standard. In one embodiment, the internal standard is a synthetic version of the same exact EGFR peptide that contains one or more amino acid residues labeled with one or more heavy isotopes. Such isotope labeled internal standards are synthesized so that when analyzed by mass spectrometry it generates a predictable and consistent SRM/MRM signature peak that is different and distinct from the native EGFR peptide signature peak and which can be used as a comparator peak. Thus when the internal standard is spiked into a protein preparation from a biological sample in known amounts and analyzed by mass spectrometry, the SRM/MRM signature peak area of the native peptide is compared to the SRM/MRM signature peak area of the internal standard peptide, and this numerical comparison indicates either the absolute molarity and/or absolute weight of the native peptide present in the original protein preparation from the biological sample. Absolute quantitative data for fragment peptides are displayed according to the amount of protein analyzed per sample. Absolute quantitation can be performed across many peptides, and thus proteins, simultaneously in a single sample and/or across many samples to gain insight into absolute protein amounts in individual biological samples and in entire cohorts of individual samples.
The SRM/MRM assay method can be used to aid diagnosis of the stage of cancer, for example, directly in patient-derived tissue, such as formalin fixed tissue, and to aid in determining which therapeutic agent would be most advantageous for use in treating that patient. Cancer tissue that is removed from a patient either through surgery, such as for therapeutic removal of partial or entire tumors, or through biopsy procedures conducted to determine the presence or absence of suspected disease, is analyzed to determine whether or not a specific protein, or proteins, and which forms of proteins, are present in that patient tissue. Moreover, the expression level of a protein, or multiple proteins, can be determined and compared to a “normal” or reference level found in healthy tissue. Normal or reference levels of proteins found in healthy tissue may be derived from, for example, the relevant tissues of one or more individuals that do not have cancer. Alternatively, normal or reference levels may be obtained for individuals with cancer by analysis of relevant tissues not affected by the cancer. Assays of protein levels (e.g., EGFR levels) can also be used to diagnose the stage of cancer in a patient or subject diagnosed with cancer by employing the EGFR levels. Levels or amounts of proteins or peptides can be defined as the quantity expressed in moles, mass or weight of a protein or peptide determined by the SRM/MRM assay. The level or amount may be normalized to total the level or amount of protein or another component in the lysate analyzed (e.g., expressed in micromoles/microgram of protein or micrograms/microgram of protein). In addition, the level or amount of a protein or peptide may be determined on volume basis, expressed, for example, in micromolar or nanograms/microliter. The level or amount of protein or peptide as determined by the SRM/MRM assay can also be normalized to the number of cells analyzed. Information regarding EGFR can thus be used to aid in determining stage or grade of a cancer by correlating the level of the EGFR protein (or fragment peptides of the EGFR protein) with levels observed in normal tissues. Once the stage and/or grade, and/or EGFR protein expression characteristics of the cancer has been determined, that information can be matched to a list of therapeutic agents (chemical and biological) developed to specifically treat cancer tissue that is characterized by, for example, abnormal expression of the protein or protein(s) (e.g., EGFR) that were assayed. Matching information from an EGFR protein assay to a list of therapeutic agents that specifically targets, for example, the EGFR protein or cells/tissue expressing the protein, defines what has been termed a personalized medicine approach to treating disease. The assay methods described herein form the foundation of a personalized medicine approach by using analysis of proteins from the patient's own tissue as a source for diagnostic and treatment decisions.
In principle, any predicted peptide derived from EGFR protein, prepared for example by digesting with a protease of known specificity (e.g. trypsin), can be used as a surrogate reporter to determine the abundance of EGFR protein in a sample using a mass spectrometry-based SRM/MRM assay. Similarly, any predicted peptide sequence containing an amino acid residue at a site that is known to be potentially modified in EGFR protein also might potentially be used to assay the extent of modification of EGFR protein in a sample.
EGFR fragment peptides may be generated by a variety of means including by the use of the Liquid Tissue™ protocol provided in U.S. Pat. No. 7,473,532. The Liquid Tissue™ protocol and reagents are capable of producing peptide samples suitable for mass spectroscopic analysis from formalin fixed paraffin embedded tissue by proteolytic digestion of the proteins in the tissue/biological sample. In the Liquid Tissue™ protocol the tissue/biological is heated in a buffer for an extended period of time (e.g., from about 80° C. to about 100° C. for a period of time from about 10 minutes to about 4 hours) to reverse or release protein cross-linking. The buffer employed is a neutral buffer, (e.g., a Tris-based buffer, or a buffer containing a detergent). Following heat treatment the tissue/biological sample is treated with one or more proteases, including but not limited to trypsin, chymotrypsin, pepsin, and endoproteinase Lys-C for a time sufficient to disrupt the tissue and cellular structure of said biological sample and to liquefy said sample (e.g., a period of time from 30 minutes to 24 hours at a temperature from 37° C. to 65° C.). The result of the heating and proteolysis is a liquid, soluble, dilutable biomolecule lysate.
Surprisingly, it was found that many potential peptide sequences from the EGFR protein are unsuitable or ineffective for use in mass spectrometry-based SRM/MRM assays for reasons that are not immediately evident. As it was not possible to predict the most suitable peptides for MRM/SRM assay, it was necessary to experimentally identify modified and unmodified peptides in actual Liquid Tissue™ lysates to develop a reliable and accurate SRM/MRM assay for the EGFR protein. While not wishing to be bound by any theory, it is believed that some peptides might, for example, be difficult to detect by mass spectrometry as they do not ionize well or produce fragments distinct from other proteins, peptides may also fail to resolve well in separation (e.g., liquid chromatography), or adhere to glass or plastic ware.
EGFR peptides found in various embodiments of this disclosure (e.g., Tables 1 and 2) were derived from the EGFR protein by protease digestion of all the proteins within a complex Liquid Tissue™ lysate prepared from cells procured from formalin fixed cancer tissue. Unless noted otherwise, in each instance the protease was trypsin. The Liquid Tissue™ lysate was then analyzed by mass spectrometry to determine those peptides derived from the EGFR protein that are detected and analyzed by mass spectrometry. Identification of a specific preferred subset of peptides for mass-spectrometric analysis is based on; 1) experimental determination of which peptide or peptides from a protein ionize in mass spectrometry analyses of Liquid Tissue™ lysates, and 2) the ability of the peptide to survive the protocol and experimental conditions used in preparing a Liquid Tissue™ lysate. This latter property extends not only to the amino acid sequence of the peptide but also to the ability of a modified amino acid residue within a peptide to survive in modified form during the sample preparation.
Protein lysates from cells procured directly from formalin (formaldehyde) fixed tissue were prepared using the Liquid Tissue™ reagents and protocol that entails collecting cells into a sample tube via tissue microdissection followed by heating the cells in the Liquid Tissue™ buffer for an extended period of time. Once the formalin-induced cross linking has been negatively affected, the tissue/cells are then digested to completion in a predictable manner using a protease, as for example including but not limited to the protease trypsin. Each protein lysate is turned into a collection of peptides by digestion of intact polypeptides with the protease. Each Liquid Tissue™ lysate was analyzed (e.g., by ion trap mass spectrometry) to perform multiple global proteomic surveys of the peptides where the data was presented as identification of as many peptides as could be identified by mass spectrometry from all cellular proteins present in each protein lysate. An ion trap mass spectrometer or another form of a mass spectrometer that is capable of performing global profiling for identification of as many peptides as possible from a single complex protein/peptide lysate is employed. Ion trap mass spectrometers however may be the best type of mass spectrometer for conducting global profiling of peptides. Although SRM/MRM assay can be developed and performed on any type of mass spectrometer, including a MALDI, ion trap, or triple quadrupole, the most advantageous instrument platform for SRM/MRM assay is often considered to be a triple quadrupole instrument platform.
Once as many peptides as possible were identified in a single MS analysis of a single lysate under the conditions employed, then that list of peptides was collated and used to determine the proteins that were detected in that lysate. That process was repeated for multiple Liquid Tissue™ lysates, and the very large list of peptides was collated into a single dataset. That type of dataset can be considered to represent the peptides that can be detected in the type of biological sample that was analyzed (after protease digestion), and specifically in a Liquid Tissue™ lysate of the biological sample, and thus includes the peptides for specific proteins, such as for example the EGFR protein.
In one embodiment, the EGFR tryptic peptides identified as useful in the determination of absolute or relative amounts of the EGFR receptor include one or more, two or more, three or more, four or more, five or more, six or more, eight or more, or ten or more of the peptides of SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ 1D NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:38, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID NO:72, SEQ ID NO:73, and SEQ ID NO:74, each of which are listed in Table 1. Each of those peptides was detected by mass spectrometry in Liquid Tissue™ lysates prepared from formalin fixed, paraffin embedded tissue. Thus, each of the peptides in Table 1, or any combination of those peptides (e.g., one or more, two or more, three or more, four or more, five or more, six or more, eight or more, or ten or more of those peptides recited in Table 1, and particularly combinations with one or more of the peptides found in table 2) are candidates for use in quantitative SRM/MRM assay for the EGRF protein in human biological samples, including directly in formalin fixed patient tissue.
The EGFR tryptic peptides listed in Table 1 include those detected from multiple Liquid Tissue™ lysates of multiple different formalin fixed tissues of different human organs including prostate, colon, and breast. Each of those peptides is considered useful for quantitative SRM/MRM assay of the EGFR protein in formalin fixed tissue. Further data analysis of these experiments indicated no preference is observed for any specific peptides from any specific organ site. Thus, each of these peptides is believed to be suitable for conducting SRM/MRM assays of the EGFR protein on a Liquid Tissue™ lysate from any formalin fixed tissue originating from any biological sample or from any organ site in the body.
In one embodiment the peptides in Table 1, or any combination of those peptides (e.g., one or more, two or more, three or more, four or more, five or more, six or more, eight or more, or nine or more of those peptides recited in Table 1, and particularly combinations with the peptides also found in Table 2) are assayed by methods that do not rely upon mass spectroscopy, including, but not limited to, immunological methods (e.g., Western blotting or ELISA). Regardless of how information directed to the amount of the peptide(s) (absolute or relative) is obtained, the information may be employed in any of the methods described herein, including indicating (diagnosing) the presence of cancer in a subject, determining the stage/grade/status of the cancer, providing a prognosis, or determining the therapeutics or treatment regimen for a subject/patient.
Embodiments of the present disclosure includes compositions comprising one or more, two or more, three or more, four or more, five or more, six or more, eight or more, or ten or more of the peptides in Table 1. In some embodiments, the compositions comprise one or more, two or more, three or more, four or more, five or more, six or more, or seven or more of the peptides in Table 2. Compositions comprising peptides may include one or more, two or more, three or more, four or more, five or more, six or more, eight or more, or ten or more peptides that are isotopically labeled. Each of the peptides may be labeled with one or more isotopes selected independently from the group consisting of: 18O, 17O, 34S, 15N, 13C, 2H or combinations thereof. Compositions comprising peptides from the EGFR protein, whether isotope labeled or not, do not need to contain all of the peptides from that protein (e.g., a complete set of tryptic peptides). In some embodiments the compositions do not contain one or more, two or more, three or more, four or more, five or more, six or more, eight or more, or ten or more peptides from EGFR, and particularly peptides appearing in Table 1 or Table 2. Compositions comprising peptides may be in the form of dried or lyophized materials, liquid (e.g., aqueous) solutions or suspensions, arrays, or blots.
An important consideration for conducting an SRM/MRM assay is the type of instrument that may be employed in the analysis of the peptides. Although SRM/MRM assays can be developed and performed on any type of mass spectrometer, including a MALDI, ion trap, or triple quadrupole, the most advantageous instrument platform for SRM/MRM assay is often considered to be a triple quadrupole instrument platform. That type of a mass spectrometer may be considered to be the most suitable instrument for analyzing a single isolated target peptide within a very complex protein lysate that may consist of hundreds of thousands to millions of individual peptides from all the proteins contained within a cell.
In order to most efficiently implement SRM/MRM assay for each peptide derived from the EGFR protein it is desirable to utilize information in addition to the peptide sequence in the analysis. That additional information may be used in directing and instructing the mass spectrometer (e.g. a triple quadrupole mass spectrometer), to perform the correct and focused analysis of specific targeted peptide(s), such that the assay may be effectively performed.
The additional information about target peptides in general, and about specific EGFR peptides, may include one or more of the mono isotopic mass of the peptide, its precursor charge state, the precursor m/z value, the m/z transition ions, and the ion type of each transition ion. Additional peptide information that may be used to develop an SRM/MRM assay for the EGFR protein is shown by example for eight (8) of the EGFR peptides from the list in Table 1 and is shown in Table 2. Similar additional information described for the eight (8) EGFR peptides shown by example in Table 2 may be prepared, obtained, and applied to the analysis of the other peptides contained in Table 1.
The method described below was used to: 1) identify candidate peptides from the EGFR protein that can be used for a mass spectrometry-based SRM/MRM assay for the EGFR protein, 2) develop individual SRM/MRM assay, or assays, for target peptides from the EGFR protein in order to correlate and 3) apply quantitative assays to cancer diagnosis and/or choice of optimal therapy.
As shown in
The method for measuring the level of the EGFR protein in a biological sample described herein (or fragment peptides as surrogates thereof) may be used as a diagnostic indicator of cancer in a patient or subject. In one embodiment, the results from measurements of the level of the EGFR protein may be employed to determine the diagnostic stage/grade/status of a cancer by correlating (e.g., comparing) the level of EGFR receptor found in a tissue with the level of that protein found in normal and/or cancerous or precancerous tissues.
This application claims the benefit of U.S. Provisional Application 61/289,384, filed Dec. 22, 2009, entitled “Epidermal Growth Factor Receptor (EGFR) Protein SRM/MRM Assay” naming as an inventor David B. Krizman, the entirety of which is incorporated by reference.
Number | Date | Country | |
---|---|---|---|
61289384 | Dec 2009 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 13529897 | Jun 2012 | US |
Child | 15444310 | US | |
Parent | PCT/US2010/061916 | Dec 2010 | US |
Child | 13529897 | US |