The invention relates to methods for producing and using protein or peptide fingerprints, and to protein or peptide biomarkers. These methods and biomarkers may be used in the identification, evaluation, study or monitoring of conditions or diseases, for example to aid the discovery, development or use of drugs to treat those conditions or diseases.
Proteins
Proteins are the biologically active products of genes. Proteins occur naturally within cells, as components of cellular structures and as components of natural biological fluids such as blood, urine, saliva, tears, lymph and sweat. Proteins result from a process of synthesis in which specific sequences of genes encoded by DNA have been translated into mRNA which then acts with rRNA and tRNA and amino acids to form a protein molecule. Each type of protein molecule has a separate and unique entity. This is due to the specific linear arrangement of the 20 common amino acids in various combinations and orders. Each of the 20 amino acids shows both common and specific chemical structures or so called groups including an amino group, one side structure, a hydrogen atom, and a carboxyl group. The singular amino acid structures are joined together in a biochemical reaction that allows electrons to be shared between various atoms projected from the two adjacent amino acids. There is a preferred interaction between the amino group of one amino acid and the carboxyl group of a second amino acid. This chemical fusion reaction creates a so-called peptide bond joining the amino acids into a singular structure. Proteins have been identified that contain very few such amino acids bound together or as many as hundreds of such joined amino acids. The peptide bonds are formed during the de novo synthesis of the protein as each amino acid is added. A linear array of singular amino acids joined together by peptide bonding is called by several names including polypeptide, oligopeptide, or just by the name peptide. By convention, the name peptide typically refers to linear arrays of amino acids which are less than 30 amino acid residues in total size. However in some circles of discussion, even much longer strings of amino acids are still referred to as peptides. It has been estimated that 10's of millions of different protein structures composed of differing combinations of and numbers of the 20 common amino acids exist in nature. Proteins differ from one another by a number of physical characteristics including their primary, secondary, tertiary and quaternary structures, their molecular weight, their intrinsic charge, their degree of solubility, hydrophobicity, and hydrophilicity, and the presence of side groups and modifications. However there are similarities in certain portions of differing protein molecules that have been identified using laboratory methods that can measure many of these physical characteristics including the precise sequence identity of individual amino acids within the parent protein molecule.
Example of Biologically Important Proteins: Elastin and Elastic Fibers
One of the principal components of connective tissue is elastic filaments and fibers. These fibers function both in the support and the mechanical features of elasticity and resilience. The primary sites in which elastic fibers are found include the walls of blood vessels, the walls enclosing the alveolar spaces in the lung, ligaments, tendons, and the skin. The elastic fibers are composite units produced by the assemblage of segments of the protein tropoelastin, surrounded by a number of smaller molecular components such as fibrillin, MAGP-1, MAGP-2, fibulin, decoran, biglycan, and vitronectin.[1]. Elastin fibers form a tight macromolecular complex with cross-linkage between elastin strands by bonding through desmosine and isodesdomosine bridges [2].
The actual structure of elastic fibers differs from tissue to tissue and amongst different organisms. The important point for this application is 1) the assemblage of proteins into functional units, 2) the degradation of this functional unit creates structures which can be further degraded to produce peptides that are characteristic for each component protein, and 3) peptide digests of the protein elastin can be identified which are characteristic of elastin.
Generation of Peptides from Proteins
Proteins can be rendered into smaller unit form by breaking the peptide bond joining two adjacent amino acids. This can be accomplished either by chemical hydrolysis, such as by heating the protein in acid, or by enzymatic cleavage by certain other proteins that are capable of interacting and dissolving peptide bonds on other proteins. Such proteins with so-called enzymatic activity against certain protein substrates are also referred to as proteases. A number of such classes or families of proteases have been identified and characterized including those defined by chemical substrates: i.e. metallo proteinases, cysteine proteinases, serine proteases, or those defined by biological substrates: collagenases, elastases, etc.
Elastases as Proteases
Elastin fibers can be degraded by both chemical and biochemical processes into smaller unit forms called peptides. One established method for breaking up elastin is by acid hydrolysis in which the native elastin protein is boiled in hydrochloric acid for 6 hours. Certain enzymes can preferably degrade the elastin as a substrate. These enzymes including metalloproteinases and serine proteinases show differing specificities of recognition and efficiencies in degrading native elastin under normal biological conditions
Some human diseases have been identified in which the established and agreed mechanism of cause of disease is due to the enzymatic destruction of elastin within different organs such as the lungs, liver, vascular system and skin.
Biomarkers
Various biological markers, known as biomarkers, have been identified and studied through the application of biochemistry and molecular biology to medical and toxicological states. A biomarker can be described as “a characteristic that is objectively measured and evaluated as an indicator of normal biologic processes, pathogenic processes, or pharmacologic responses to a therapeutic intervention”. A biomarker is any identifiable and measurable indicator associated with a particular condition or disease where there is a correlation between the presence or level of the biomarker and some aspect of the condition or disease (including the presence of, the level or changing level of, the type of, the stage of, the susceptibility to the condition or disease, or the responsiveness to a drug used for treating the condition or disease). The correlation may be qualitative, quantitative, or both qualitative and quantitative. Typically a biomarker is a compound, compound fragment or group of compounds. Such compounds may be any compounds found in or produced by an organism, including proteins (and peptides), nucleic acids and other compounds.
Biomarkers may have a predictive power, and as such may be used to predict or detect the presence, level, type or stage of particular conditions or diseases (including the presence or level of particular microorganisms or toxins), the susceptibility (including genetic susceptibility) to particular conditions or diseases, or the response to particular treatments (including drug treatments). It is thought that biomarkers will play an increasingly important role in the future of drug discovery and development, by improving the efficiency of research and development programmes. Biomarkers can be used as diagnostic agents, monitors of disease progression, monitors of treatment and predictors of clinical outcome. For example, various biomarker research projects are attempting to identify markers of specific cancers and of specific cardiovascular and immunological diseases.
Proteomics (including peptidomics) technologies have been developed to analyse proteins (including peptides). These technologies are applied in a high-throughput mode, generating an enormous amount of data that is analysed using computer systems. Proteins from a biological sample are isolated and separated at a high resolution, for example by chromatographic separations. The set of proteins is then characterised using qualitative and quantitative techniques such as mass spectrometry. The result is a protein (or peptide) fingerprint (a constant, reproducible set of proteins or peptides). Selected proteins/peptides or groups of proteins/peptides may be analysed further to generate protein/peptide profiles. . Proteomics is now viewed as the large-scale analysis of the function of genes and is becoming a central field in functional genomics.
Separation of proteins is commonly achieved using gel-based techniques. 2D-PAGE (polyacrylamide gel electrophoresis) is currently the principal analytical method for studying the cellular expression of proteins. Instrumental platforms allow almost fully automated operations of 2D-gel analysis. The 2D-gel methods have good sensitivity and resolution for a large fraction of expressed proteins, typically those within a mass range of 10-120 kDa. However the methods have significant limitations in the identification of low abundance/low molecular weight proteins, some of which are present at concentrations as low as a few molecules per cell. Problems of sample loss and/or insufficient recovery have confounded the isolation of low abundance/low molecular weight proteins by 2D-PAGE. In addition, the presence of these proteins can be masked by the higher abundance protein spots. Other classes of proteins that are problematic for 2D-PAGE include acidic, basic, hydrophobic and high molecular weight proteins.
Quantitative and Qualitative Measurements of Peptides
Multidimensional HPLC (High Performance Liquid Chromatography) can be used as a good alternative for separating proteins or peptides unsuited to 2D-PAGE. The protein or peptide mixture is passed through a succession of chromatographic stationary phases or dimensions which gives a higher resolving power. HPLC is flexible for many experimental approaches and various stationary and mobile phases can be selected for their suitability in resolving specific protein or peptide classes of interest and for compatibility with each other and with downstream mass spectrometric methods of detection and identification. On-line configurations of these types of multi-mechanism separation platforms are known [4-7]. Mass spectrometry (MS) is also an essential element of the proteomics field. In fact MS is the major tool used to study and characterise purified proteins in this field [8-10].
The interface link in proteomics and MS, displaying hundreds or thousands of proteins, is made by gel technology where high resolution can be reached on a single gel. Researchers are successfully harnessing the power of MS to supersede the two-dimensional gels that originally gave proteomics its impetus.
The application and development of mass spectrometry (MS) to identify proteins or peptides separated via liquid phase separation techniques and/or gel-based separation techniques have led to significant technological advance in protein and peptide expression analysis. There are two main methods for the mass spectrometric characterization of proteins and peptides: matrix-assisted laser desorption ionization (MALDI) and electrospray ionization (ESI). Using various approaches, MALDI and ESI ion sources can be combined with time-of-flight (TOF) or other types of mass spectrometric analyzers to determine the masses or the sequences of peptides.
In MALDI, peptides are co-crystallized with the matrix, and pulsed with lasers. This treatment vaporizes and ionizes the peptides. The molecular weights (masses) of the charged peptides are then determined in a TOF analyzer. In this device, an electric field accelerates the charged molecules toward a detector, and the differences in the length of time it takes ionized peptides to reach the detector (their time-of-flight) reveal the molecular weights of the peptides; smaller peptides reach the detector more quickly. This method generates mass profiles of the peptide mixtures—that is, profiles of the molecular weights and amounts of peptides in the mixture. These profiles can then be used to identify known proteins from protein sequence databases.
In ESI and a technique called liquid chromatography (LC)/MS/MS, a voltage is applied to a very fine needle that contains a peptide mixture, generating peptide sequences, eluting from the LC-column. The needle then sprays droplets into a mass spectrometric analyzer where the droplets evaporate and peptide ions are released. In LC/MS/MS, researchers use microcapilliary LC devices to initially separate peptides.
Mass spectrometry (MS) is a valuable analytical technique because it measures an intrinsic property of a bio-molecule, its mass, with very high sensitivity. MS can therefore be used to measure a wide range of molecule types (proteins, peptide, or any other bio-molecules) and a wide range of sample types/biological materials. Correct sample preparation is known to be crucial for the MS signal generation and spectra resolution and sensitivity. Sample preparation is therefore a crucial area for overall feasibility and sensitivity of analysis.
Disease Associated Proteins
Proteomics are being used in drug discovery and development, for example to detect protein significantly altered in patients with particular conditions or diseases. Some of these disease-associated proteins may be identified as novel drug targets and some may be useful as biomarkers of disease progression. Such biomarkers may be used to improve clinical development of a new drug or to develop new diagnostics for the particular disease.
Detection of disease-associated proteins may be achieved by the following method. Protein samples are taken from both diseased subjects and healthy subjects. These samples may be cells, tissues, or biological fluids that are processed to extract and enrich protein and/or peptide constituents. Typically the process entails partitioning into solution phase but may also include the establishment of protein and/or peptide components attached to solid matrixes. After high-throughput separation and analysis (proteomics, peptidonomics), protein expression fingerprints are produced for either diseased or healthy subjects by qualitative and quantitative measurement. These fingerprints may be used as unique identifiers to distinguish individuals and/or establish and/or track certain natural or disease processes. These prototype fingerprints are established for each individual sample/subject and are recorded as numerical values in a computer database. The fingerprints are then analysed using bioinformatic tools to identify and select the proteins or peptides that are present in the prototype forms and whose expression may or may not be differentially present in the samples derived from the healthy and diseased subject samples. These proteins/peptides are then further characterised and detailed profiles are produced which identify the characteristic masses and physical properties of the proteins or peptides. Either a singular protein/peptide or groups of proteins/peptides may be determined to be significantly associated with certain natural or diseased processes.
Various disease-associated proteins are known, and some of these are enzymes whose activity increases or decreases at some stage in the development of a particular condition or disease. Such enzymes may be suitable drug targets, leading to a search for pharmaceutically-active compounds (drugs) that could be used to inhibit or stimulate the enzyme and thus prevent or treat the condition or disease. Other disease-associated proteins may be degradation products of particular enzymes, or proteins that are made more abundantly in the presence of the disease.
Examples of disease-associated proteins include the serine proteases, a superfamily of proteinases (enzymes) believed to be important in a plethora of physiological disease processes. Modulation of the activity of one or more serine proteases may well be of benefit in these diseases or conditions. A number of serine proteases inhibitors are known. Examples of disease-associated proteins include those enzymes that have been implicated in the onset and/or progression of Chronic Obstructive Pulmonary Disease (COPD), as discussed below, such as the serine protease named neutrophil elastase (HNE, EC 3.4.21.37.) or also known as leukocyte elastase or EL2.
HNE has a natural substrate, elastin, the insoluble, elastic protein of high tensile strength found in intercellular spaces of the connective tissues of large arteries, trachea, bronchi and ligaments within the pulmonary tract.
Potential Sites of Disease in Lung Tissue
The walls of the alveolar pulmonary bed provide the ventilation and perfusion structures necessary for gas exchange. A very large surface area for these processes is provided by the millions of individual small sacs, aka alveoli, that encompass the lung parenchyma. In simple terms each air space is surrounded by a thin tissue wall composed of cells, connective tissue, matrix components, microcapilliaries, and elastic fibrils. Elastin is the principal component of elastic fibres constituting a main part of the lung's extracellular matrix. Emphysema is a medical condition which results from the destruction and loss of the tissue and connective components of alveolar wall structures.
As the airway walls are degraded, the airway spaces become fused together and overall an over enlargement of the airway spaces occurs. The specific loss of elastic fibers in the walls due to destructive processes, is thought to be a chief determinant of emphysema. HNE activity is a chief cause of this destruction of the elastin matrix. Emphysema can be diagnosed clinically by spirometry testing of lung function as well as by CT and HRCT imaging. In advanced emphysema cases large areas of the lung appear as open spaces completely devoid of tissue. In some emphysema patients, a metabolic abnormality of genetic origin has been identified which results in the destruction of elastin. These patients suffer from a deficiency in the production of the naturally occurring inhibitor of HNE, alpha-1-anti-trypsin (ATA). Normally plasma proteinase inhibitors, especially α1-antitrypsin (α1-AT), prevent proteolytic enzymes from digesting structural proteins of the lung. According to the proteinase-antiproteinase hypothesis, emphysema results from an increase of proteinase release in the lungs, a reduction in the antiproteinase defence, or a combination. Studies show that individuals who are homozygous for α1-AT deficiency have an increased susceptibility for developing pulmonary emphysema, especially if they also smoke. Other causes of emphysema include the long-term exposure to cigarette smoke, and the exposure to occupational agents and noxious substances. Even here it is generally believed that the effect of smoking is to increase the levels of local protease production and release, leading to tissue destruction.
COPD
COPD, which is mainly caused by cigarette smoking, is expected to be the third leading cause of death worldwide by the year 2020. COPD is characterised by reduced maximum expiratory flow and slow forced emptying of the lungs. These airflow limitations are mainly due to chronic bronchitis, involving hypertrophy of mucous glands, and emphysema produced by destruction of alveolar walls. The latter leads to enlargement of the air spaces distal to the terminal bronchiole, with consequent collapse of small airways, limitations of the airflow, destruction of parts of the capillary bed, and loss of the elastic recoil of the lung. This loss of elastic recoil and the enlargement of the air spaces in the lungs of COPD patients lead to reduced values of forced expiratory volume (FEV), and increased values of forced vital capacity (FVC). Disease severity is determined as the degree of lung function impairment, which is measured with a spirometer. The presence of a postbronchodilator FEV1<80% of the predicted value in combination with an FEV1/FVC<70% confirms the presence of airflow limitation that is not fully reversible. The chronic exposure to cigarette smoke causes an inflammatory response in the lung, leading to changes in the airway epithelial surface and to activation and an increased number of several inflammatory cells.
Inflammation Causing Disease
Inflammation by neutrophils and macrophages, and the protease-antiprotease imbalance have long been proposed to act as downstream effectors of the lung destruction following chronic cigarette smoking. Histological studies have demonstrated increased numbers of macrophages and T-lymphocytes in the airways of smokers, and also an increase of neutrophils in the airways of smokers and COPD patients, which related to the severity of the airway obstruction. Alveolar macrophages are long-lived phagocytes, and are the most abundant defence cells in the lung both under normal conditions and during chronic inflammation. By sending out chemotactic factors they then recruit neutrophils and lymphocytes by activating adhesion molecule expression on pulmonary microvascular endothelial cells at the site of infection. The neutrophil and macrophage inflammatory cells invading the smoker's lung produce mediators locally, such as cytokines, serine- and metalloproteases, and oxidants. These mediators, which likely play an important role in the development of COPD, can act to further activate the inflammatory response, and also to degrade the components of the extracellular matrix.
Elastin Derived Peptides
Elastin derived peptides (EDP) have been reported to be released into the airways of individuals and appear in elevated concentrations in bronchial alveolar lavage samples (BAL) in smokers compared to non-smokers [11]. Elastin derived peptides (EDP) can be measured using immuno-assays such as ELISA. In these assays, antibodies specific for elastin or EDP are utilized to either bind directly to their products or as instruments for capturing elastin or EDP on solid surfaces. [12]. These types of assays have been reported for measuring elastin constituents in plasma, sera, BAL both in cancer and non-malignant conditions such as asymptomatic smoking, COPD, emphysema, cystic fibrosis. The major immunoreactive constituent of human lung elastin digested by NE is approximately 70,00 daltons in molecular size [12].
The current use of proteomics or peptidomics in drug discovery and development (particularly for the disease COPD) is limited by various factors, including for example:
We have now developed a new methodology for producing and using protein/peptide fingerprints, allowing us to identify and investigate disease-associated proteins/peptides that can be linked to specific drug targets (such as Neutrophil Elastase), or to specific drug target combinations. We demonstrate and claim the identities of certain peptide products of the digestion of elastin by HNE.
Table 1 shows the MS atomic mass unit identities of 195 peptides resulting from digestion of human elastin by human Neutrophil Elastase and separation by multi-dimensional chromatography.
We hereby provide a new method, composed of multiple linked steps, for detecting and quantifying both naturally occurring and laboratory prepared products of protein expression and protein catabolism. This method may be used for biomedical evaluation and biomedical characterization, as well as for biomarker discovery.
This methodology may be applied to naturally occurring substances or mixtures of naturally occurring substances and synthetic substances. The method results in the identification of specific protein substituents, also known as peptides, as well as biochemically-modified variants thereof, present as separate entities or present within complex mixtures of proteins and peptides. Each peptide may be defined by a specific sequence of amino acids in alignment, that can be selectively identified by either its precise mass, or its unique immuno-affinity binding properties to a given immunoreagent.
The method may quantitatively and qualitatively measure the differential expression of peptides and proteins within complex biological and clinical samples. The method may be applied for the biomedical study of the relationships between the expression and function of proteolytic enzymes and the status of protein degradation products, hereafter referred to as unique peptides.
The method allows the identification of some or all peptides which are proteolytic breakdown products of a given enzyme with a given substrate and which are measurable for example using Mass Spectrometry identification, and/or complementary immunoaffinity technology.
The method combines several key steps together, which results in the specific sample preparation, separation, isolation, and identification of unique peptides present in biological material. The unique peptides are the constituent units of protein molecules identifiable for example by MS or other methodologies.
The method may be applied to human clinical samples. The method may also be applied to samples derived from non-human animals.
We provide a multi-step method for identifying:
1) the unique peptide identity presented, for example, as atomic mass units of entities resulting from the proteolytic interaction of a given proteolytic-enzyme with its given substrate;
2) the preparation, separation, and identification of peptides derived from given proteolytic enzymes with given substrates either occurring naturally or produced in a laboratory setting;
3) the presence and/or absence of these same exact peptides in biological samples, determined by multi-dimensional chromatography separation and mass spectrometry within for example human blood, or urine, or tissue;
4) the statistical method for determining the identification of either naturally occurring peptides partitioned into fractions by the separation methodology referred to in 1) to 3) above;
5) the statistical method for defining the presence and/or absence of these same peptides in multiple human subjects, collected and grouped by, for example clinical disease status.
The method may be used to determine whether certain proteolytic enzyme processes are occurring or have occurred in human subjects, or in non-human animals by analyzing the global expression patterns of peptides present. This may allow association of the presence and/or absence of certain products of proteolytic digestion (for example single peptide markers, or peptide fingerprints where the number of peptides may vary, for example, between 2 and a thousand) in certain persons, or persons with known diseases, or persons with known stages or phases of disease. The method may allow measurement of the presence or absence or quantity of specific peptide fingerprints within human clinical samples such as for example urine or blood. These fingerprints may also be collected in groups as assigned annotations from each patient and sample type.
The method may allow monitoring of the effect of certain and/or all medicines or substances which effect the expression or function of proteolytic enzymes. The method may allow us to measure the presence or absence or quantity of specific peptide fingerprints within human clinical samples such as for example urine or blood as a result of medical and or pharmacological intervention.
The method first identifies all or some of the peptides produced by a given enzyme with a given substrate in a controlled laboratory setting. This step optimizes the chances for producing all likely candidate peptide fragments from a given enzyme-substrate reaction. This includes attention to Michaelis Minten kinetics for maximizing the ratio of reactants, the pH level and salt concentrations used in the reactant solutions, the temperature of the reaction, the time of the reaction, for example. This may result in the production of stable end form unit length entities of unique peptides which are present in the reactant solution [Reaction product 1]. The net result of the optimised laboratory controlled reaction of the given enzyme with the given substrate is a signature peptide profile for that reaction. This collection of unit length peptides resulting from proteolytic digestion is also referred to as a peptide fingerprint. Scheme A (below) illustrates this step (generation and identification of peptide entities, peptide fingerprint, and unique peptide atomic mass identification as products of laboratory controlled digestion of human protein substrates with given protein enzyme):
The peptide fingerprint is then subjected to a series of biochemical separation steps (described below) to fractionate the individual unique peptides by their intrinsic chemical and bio-physical properties, for example charge, size, and hydrophobicity properties. The individual fractions of the unique peptide fingerprint are then identified using MALDI MS to determine precise atomic mass measurements for each unique peptide entity. The net result of this fractionation and identification process is a quantitative and qualitative list of all peptide fragments produced and comprising the peptide fingerprint [Reaction product 2]. This list of atomic mass identities is then used in further steps of the method according to the invention.
In a first aspect of the invention, we provide a method to generate a peptide fingerprint of the degradation products of a disease-associated enzyme X, wherein enzyme X is associated with disease Y, which comprises:
The peptide fingerprint produced by the method of the invention may be used in the diagnosis or study of disease Y, for example to aid the discovery and development and administration of drugs to treat disease Y, particularly drugs wherein enzyme X is the drug target.
Disease Y is any condition or disease affecting humans or non-human animals. In particular, disease Y is any condition or disease affecting humans. For example, disease Y may be a condition or disease affecting the respiratory tract (such as COPD), the cardiovascular system, the gastrointestinal tract, the neurological system, the endocrinological system, or the immunological system. In addition, disease Y may be an allergic condition or disease, an infectious condition or disease, or an oncological condition or disease.
The disease-associated enzyme X is any enzyme that shows increased activity during the onset or progression of any condition or disease affecting humans and non-human animals (particularly humans). This increased activity causes or contributes to disease onset or progression. The disease-associated enzyme X may be a drug target.
The degradation products in the mixture will be peptides generated by breakdown of the substrate by the enzyme. The mixture is analysed by peptidomics technologies to produce a peptide fmgerprint which we define as a constant, reproducible set of the degradation products. The peptide fingerprint and/or selected peptides or groups of peptides may be useful as biomarkers relating to disease Y (including its presence, its development and/or its treatment).
In order to determine the entities within the peptide fingerprint and produce Reaction Product 2 (see scheme A above), the peptides produced as Reaction Product 1 are first separated in a number of consecutive steps (for example, by chromatographic separation, liquid phase separations) utilising mechanisms such as:
After separation, the peptides are profiled by ascertaining their physiochemical properties plus accurate masses (the peptide index, comprising the size, polarity/charge and hydrophobicity of the peptides). This is optionally followed by sequencing of the peptides.
Scheme B below illustrates the technology platform for analysis of unique peptide fingerprints resulting in individual mass identities of peptide entities:
The method according to the first aspect of the invention may be used to identify biomarkers for a particular disease Y that is known to be associated with a particular drug target (enzyme X). The peptide fingerprint of the degradation products is used as a biomarker for disease Y.
In one embodiment of the methods according to the first aspect of the invention: enzyme X HNE or any one of MMP2, MMP3, MMP7, MMP9, MMP12 MMP13, and MMP14; the natural substrate is elastin; disease Y is COPD. In a further embodiment of the methods according to the second aspect of the invention, enzyme X is HNE, (most preferably human HNE), the natural substrate is elastin (most preferably human elastin) and disease Y is COPD.
As an example of a method according to the first aspect of the invention, neutrophil elastase (comprised partly or wholly of HNE) is mixed with human elastin. In this method, the disease-associated enzyme X is HNE which has the natural substrate elastin and is associated with COPD. Conditions are optimised to ensure high HNE activity and good degradation of elastin. Michaelis-Menten kinetics are used to determine the preferred stoichiometry of reactants, and the substrate type and amount are chosen to give a favourable equilibrium constant for the progress of the reaction.
A particular method according to the first aspect of the invention is a method to generate a specific peptide fingerprint composed of certain identified peptide products resulting from the degradation of a substrate by the catalytic activity of enzyme X, wherein enzyme X is associated with a clinical condition known as disease Y, which comprises:
(c) separating the individual components derived from the biochemical interaction of enzyme X with the substrate, and any groupings of the components selected in the steps (a) and (b) using chromatography procedures;
The identities of peptides detected by the above method are peptide fingerprints of the substrate, and may be used either as isolated peptides or as collections of peptides. Such fingerprints may be used to identify, measure, monitor, and compare the activities of enzyme X with the substrate. In one embodiment, enzyme X is human HNE, the substrate is human elastin, and the clinical condition known as disease Y is COPD.
We herein provide a peptide fingerprint comprising peptide products resulting from the degradation of elastin by the enzyme HNE, wherein the peptide fingerprint comprises one or more of the peptides identified in Table 1. We also provide a peptide fingerprint comprising peptide products resulting from the degradation of elastin by the enzyme HNE, wherein the peptide fingerprint comprises at least twenty of the peptides identified in Table 1. We further provide a peptide fingerprint comprising peptide products resulting from the degradation of elastin by the enzyme HNE, wherein the peptide fingerprint comprises at least ninety of the peptides identified in Table 1. We also provide a peptide fingerprint comprising peptide products resulting from the degradation of elastin by the enzyme HNE, wherein the peptide fingerprint comprises at least one hundred and fifty of the peptides identified in Table 1. In one embodiment, a peptide fingerprint comprises all the peptides identified in Table 1.
Any and all combinations of the peptides identified by mass in Table 1 may be used in applications which measure or monitor the presence and/or absence of the protein elastin in any diagnostic setting of a clinical condition. Such clinical conditions include systemic inflammation, vascular inflammation, pulmonary inflammation, hepatic inflammation, cardiac inflammation, or other diseases which can be linked to the long term use of cigarettes or to the exposure to cigarette smoke. In particular, such clinical conditions include COPD. Any and all combinations of the peptides identified by mass in Table 1 may be used in any modified form including chemical modifications of constituent moieties, cross linking to moieties, labelling with moieties including radionuclides, fluorochromes, or like reagents. Any and all combinations of the peptides identified by mass in Table 1 may be used in diagnostic test kits which measure or monitor the presence or absence of the protein elastin.
In a second aspect of the invention, we provide a method to determine if or confirm that an enzyme X is associated with disease Y which comprises:
In a biofluid or tissue sample, enzyme X will have a defined selectivity for the substrate under the conditions used, in relationship to other enzymes in the sample. These other enzymes may or may not have degradative activity against the specific natural substrate used in steps f-h of the above method. In a biofluid or tissue sample, the peptide fingerprint which results from the enzymatic cleavage of the natural substrate in the presence of enzyme X and these other enzymes will be distinct from the peptide singletons or groups of peptides resulting from the reaction in step h of the above method.
In a method according to the second aspect of the invention, human clinical material is analysed using the methodology described below. The quality of the human clinical material is an important factor in obtaining accurate measurements. The methods for accurately determining, identifying, or measuring unique peptide entities in human clinical material are directly dependent upon certain criteria. Quality is defined in relation to human clinical material as follows. Clinical samples should be obtained using methods which preserve the integrity of proteins in a natural state, and minimize the effects of denaturation, and destruction. This includes careful sample preparation, and storage under conditions which preserve protein structure and function. Human clinical material should be well documented in the features of clinical presentation which these samples represent. Information which relates the sample to specific aspects of the disease such as the clinical presentation of disease, may be for example stages or phases of disease, or noted impairments of structure and function characteristic or not of these diseases. Samples from subjects should be identifiable for example as being free or not free from obvious diseases. When possible the best practice should be the linkage of disease with the individual samples, and with other subject samples with similar linkages to disease. When possible the best practice should be the linkage of a particular site or location of disease with the individual samples.
When possible the best practice is to obtain as much information regarding the sample, the history of the sample, and the medical classification of the sample as possible. It is also important to obtain as much information as possible regarding the phases of disease reflected in individual samples.
Scheme C below illustrates the preferred description of quality human clinical samples:
The biofluid or tissue sample may be derived from any part of the human or non-animal body (including cells grown in vitro), preferably from any part of the human body. For example, the sample may be derived from urine, blood, sputum, saliva, nasal secretions, exhaled breath condensate, bronchoalveolar fluid, bronchial fluid or any other biological fluid or tissue. A tissue sample is defined as a sample comprising one or more cells and their constituent parts in any infinite division. A biofluid is defined as any sample of clinical material in solution form (preferably human clinical material). This may include blood, serum, plasma, saliva, lavages, tears, urine, seminal fluid, joint fluid, aqueous humor, washings of cavities or sinuses, the soluble form of tissue preparations, the soluble form of organ preparations, or sweat. The samples may be derived from singular subjects or pools of singular samples from multiple subjects.
As defined herein, healthy biofluid or tissue samples are samples from individuals without recognised clinical disease or symptoms of disease. Healthy biofluid or tissue samples may represent the average or normal variation of expression of gene products in the human population that do not show any signs of disease onset or progression. As defined herein, diseased biofluid or tissue samples are samples from specific identified individuals that have been clinically evaluated and diagnosed for specific disease processes, or who show symptoms of clinical disease which are not yet categorised clinically as a specific disease. Diseased biofluid or tissue samples may express qualitatively and/or quantitatively different sets of peptides/proteins/endogenous products from healthy biofluid or tissue samples. Such differences include changes in the steady state, changes in destructive processes present in resident and non-resident cells, changes in differentiation states and changes in repair processes. Differentiation states are defined as stages of maturity in cell function and/or phenotype.
The biofluid or tissue samples are obtained by acquiring, processing and preparing the biological material. The methods according to the invention may be used for both small scale and large scale clinical investigation, for example with prototype subject/patient groups of 10-20 patients or more in each study group. The clinical study material needs to be of high quality and this is ensured by optimised sampling, sample handling, and sample storage protocols. These sample protocols ensure the minimum degradation of the naturally occurring proteins and peptides present within these samples.
The biofluid or tissue samples and the enzyme/substrate mixture are analysed by peptidomics technologies to produce peptide fingerprints. These peptide fingerprints are obtained as explained above for the method according to the first aspect of the invention (peptide separation by various mechanisms followed by determination of physiochemical properties plus accurate masses, optionally followed by sequencing of the peptides).
In step (e) the healthy peptide fingerprint is compared to the diseased peptide fingerprint. This allows the identification of the set of peptides found exclusively in the diseased fingerprint. When comparing the diseased set of peptides identified in step (e) with the peptide fingerprint of the degradation products produced in step (h), it is necessary to determine if there are statistically significant similarities or differences between them.
The diseased peptide profile may show individual variation based on sample type, the clinical development of disease, and the individual variations in metabolism by the enzymes being measured within the diseased biofluid or tissue sample. It is possible to generate peptide profiles from prototype samples showing a singleton peptide or number of peptides comprising the peptide fingerprints characteristic of early stage disease, mild disease, moderate disease or severe disease. Thus pathological and histological presentation of a disease may be linked to peptide profiling.
The diseased peptide profile may also differ depending on the source of the diseased biofluid or tissue sample. It is possible to generate peptide profiles from samples taken from different compartments of the human or non-animal body.
The diseased peptide profile may also differ depending on the individual human or non-human animal from which the sample was derived, or on the particular grouping of clinical phenotype to which the human or non-human animal belongs.
In a variation of the method according to the second aspect of the invention, multiple biofluid or tissue samples are used wherein each diseased sample has the same disease.
In another variation of the method according to the second aspect of the invention, multiple disease sets are used (by using more than one sample each having a different disease, or by using one sample having more than one disease).
Combined analysis of diseased peptide profiles may be used to reduce a multifactorial disease process to its component parts. Each part and its relation to other parts may be analysed.
In one embodiment of the methods according to the second aspect of the invention: enzyme X is HNE or any one of MMP2, MMP3, MMP7, MMP9, MMP12, MMP 13, and MMP 14; the natural substrate is elastin; disease Y is COPD. In a further embodiment of the methods according to the second aspect of the invention, enzyme X is HNE (most preferably human HNE), the natural substrate is elastin (most preferably human elastin) and disease Y is COPD.
In one embodiment, a method according to the second aspect of the invention is a method to determine if or confirm that the enzyme HNE is associated with disease Y which comprises:
Preferably disease Y is COPD. The peptide fingerprint comprising peptide products resulting from the degradation of elastin by the enzyme HNE preferably comprises at least twenty of the peptides identified in Table 1, or at least ninety of the peptides identified in Table 1, or at least one hundred and fifty of the peptides identified in Table 1.
A particular method according to the second aspect of the invention is a method to determine if or confirm that an enzyme X is associated with a clinical condition known as disease Y which comprises:
In a further embodiment, the enzyme X is HNE, the substrate is elastin, and the substrate fingerprint comprises one or more of the peptides identified in Table 1. In particular the substrate fingerprint comprises at least twenty of the peptides identified in Table 1. More particularly the substrate fingerprint comprises at least ninety of the peptides identified in Table 1. Most particularly the substrate fingerprint comprises at least one hundred and fifty of the peptides identified in Table 1. In one embodiment, a substrate fingerprint comprises all the peptides identified in Table 1.
In a third aspect of the invention, we provide a method to determine the presence of the peptide fingerprint in clinical samples which comprises:
Statistically significant similarities may be detected and registered as singular peptide identities or multiple-peptide identities. Determining statistically significant similarities involves using a prototype product peptide fingerprint characteristic of the reaction products resulting from the catalytic activity of a matrix digesting enzyme with its natural substrate (for example, an HNE/elastin degradative product peptide fingerprint). Determining statistically significant similarities involves using prototype subject samples in analyses described above to quantitatively and qualitatively measure peptides present within fractions resulting from the analyses procedure. Determining statistically significant similarities involves using prototype samples to establish peptide fingerprint profiles in groups of designated subjects representing characteristic clinical groupings and the establishment of comparative fmgerprints within biofluid or tissue samples.
In one embodiment of methods according to the third aspect of the invention: enzyme X is HNE or any one of MMP2, MMP3, MMP7, MMP9, MMP12, MMP 13 and MMP14; the natural substrate is elastin; disease Y is COPD. In a further embodiment of methods according to the third aspect of the invention, enzyme X is HNE (most preferably human HNE), the natural substrate is elastin (most preferably human elastin) and disease Y is COPD.
The method according to the third aspect of the invention may be used to determine the presence of a disease Y in humans or in non-human animals. For example, the method may be used during clinical trials involving individual humans or in pre-clinical trials involving non-human animal models. The humans or non-human animals may appear to be healthy or may appear to be diseased. Those that appear to be healthy may be healthy or may be clinically asymptomatic subjects.
A particular method according to the third aspect of the invention is a method to determine the presence of a clinical condition known as disease Y which comprises:
In one embodiment, the enzyme X is HNE, the substrate is elastin, and disease Y is COPD, and the peptide fingerprint of the degradation products comprises one or more of the peptides identified in Table 1. In particular the peptide fingerprint of the degradation products comprises at least twenty of the peptides identified in Table 1. More particularly the peptide fingerprint of the degradation products comprises at least ninety of the peptides identified in Table 1. Most particularly the peptide fingerprint of the degradation products comprises at least one hundred and fifty of the peptides identified in Table 1. In one embodiment, a peptide fingerprint of the degradation products comprises all the peptides identified in Table 1.
In a further embodiment, a method to determine the presence of a clinical condition known as disease Y comprises:
Preferably disease Y is COPD. The peptide fingerprint comprising peptide products resulting from the degradation of elastin by the enzyme HNE preferably comprises at least twenty of the peptides identified in Table 1, or at least ninety of the peptides identified in Table 1, or at least one hundred and fifty of the peptides identified in Table 1.
In a fourth aspect of the invention we provide a diagnostic test kit for determining the presence of a disease Y which comprises means to compare the peptide fingerprint of a biofluid sample or the peptide fingerprint of a tissue sample with the peptide fingerprint of the degradation products in a mixture of enzyme X with its natural substrate, wherein enzyme X is associated with the clinical condition known as disease Y.
In one embodiment of diagnostic test kits according to the fourth aspect of the invention: enzyme X is HNE or any one of MMP2, MMP3, MMP7, MMP9, MMP12, MMP 13, and MMP14; the natural substrate is elastin; disease Y is COPD. In a further embodiment of kits according to the fourth aspect of the invention, enzyme X is HNE (most preferably human HNE), the natural substrate is elastin (most preferably human elastin), disease Y is COPD and the peptide fingerprint of the degradation products comprises one or more of the peptides identified in Table 1. In particular the peptide fingerprint of the degradation products comprises at least twenty of the peptides identified in Table 1. More particularly the peptide fingerprint of the degradation products comprises at least ninety of the peptides identified in Table 1. Most particularly the peptide fingerprint of the degradation products comprises at least one hundred and fifty of the peptides identified in Table 1. In one embodiment, a peptide fingerprint of the degradation products comprises all the peptides identified in Table 1.
In a further embodiment, a diagnostic test kit for determining the presence of a disease Y comprises means to compare the peptide fingerprint of a biofluid sample or the peptide fingerprint of a tissue sample with a substrate fingerprint comprising peptide products resulting from the degradation of elastin by the enzyme HNE, wherein the substrate fingerprint comprises one or more of the peptides identified in Table 1. Preferably disease Y is COPD. In particular the substrate fingerprint comprises at least twenty of the peptides identified in Table 1. More particularly the substrate fingerprint comprises at least ninety of the peptides identified in Table 1. Most particularly the substrate fingerprint comprises at least one hundred and fifty of the peptides identified in Table 1. In one embodiment, a substrate fingerprint comprises all the peptides identified in Table 1.
In a fifth aspect of the invention we provide a method to analyse the effect of a drug Z on enzyme X, wherein enzyme X is associated with disease Y, which comprises:
The method according to the fifth aspect of the invention may be used during drug discovery and development to ascertain whether the correct drug target is being affected when treating with a particular drug Z. Drug Z may be a drug or a candidate drug compound. The method allows direct study of the effect of drug Z on enzyme X, including the effect of different levels of drug Z. The peptide fingerprint of the degradation products in a mixture of enzyme X with its natural substrate is a biomarker.
In one embodiment of methods according to the fifth aspect of the invention: enzyme X is HNE or any one of MMP2, MMP3, MMP7, MMP9, MMP12, MMP 13 and MMP 14; the natural substrate is elastin; disease Y is COPD. In a further embodiment of methods according to the fifth aspect of the invention, enzyme X is HNE (most preferably human HNE), the natural substrate is elastin (most preferably human elastin) and disease Y is COPD.
A particular method according to a fifth aspect of the invention is a method to analyse the effect of a drug Z on enzyme X, wherein enzyme X is associated with a clinical condition known as disease Y, which comprises:
In one embodiment of the method, the enzyme X is HNE2, the substrate is elastin, and disease Y is COPD, and the peptide fingerprint of the degradation products comprises one or more of the peptides identified in Table 1. In particular the peptide fingerprint of the degradation products comprises at least twenty of the peptides identified in Table 1. More particularly the peptide fingerprint of the degradation products comprises at least ninety of the peptides identified in Table 1. Most particularly the peptide fingerprint of the degradation products comprises at least one hundred and fifty of the peptides identified in Table 1. In a further embodiment, a peptide fingerprint of the degradation products comprises all the peptides identified in Table 1.
In one embodiment, is a method to analyse the effect of a drug Z on the enzyme HNE which comprises:
Preferably disease Y is COPD. The peptide fingerprint comprising peptide products resulting from the degradation of elastin by the enzyme HNE preferably comprises at least twenty of the peptides identified in Table 1, or at least ninety of the peptides identified in Table 1, or at least one hundred and fifty of the peptides identified in Table 1.
In a sixth aspect of the invention we provide a diagnostic test kit for analysing the effect of a drug Z on enzyme X which comprises means to compare the peptide fingerprint of a biofluid sample or the peptide fingerprint of a tissue sample with the peptide fingerprint of the degradation products in a mixture of enzyme X with its natural substrate, wherein the sample has been obtained from a human or non-human animal that has been or is being treated with the drug Z.
In one embodiment of diagnostic test kits according to the sixth aspect of the invention: enzyme X is HNE or any one of HNE, MMP3, MMP7, MMP9, MMP 12, MMP13, and MMP14; the natural substrate is elastin; disease Y is COPD. In a further embodiment of a diagnostic test according to the sixth aspect of the invention, enzyme X is HNE (most preferably human HNE), the natural substrate is elastin (most preferably human elastin), disease Y is COPD and the peptide fingerprint of the degradation products comprises one or more of the peptides identified in Table 1.
More particularly the peptide fingerprint of the degradation products comprises at least ninety of the peptides identified in Table 1. Most particularly the peptide fingerprint of the degradation products comprises at least one hundred and fifty of the peptides identified in Table 1.
In one embodiment, a diagnostic test kit for analysing the effect of a drug Z on the enzyme HNE comprises means to compare the peptide fingerprint of a biofluid sample or the peptide fingerprint of a tissue sample with a substrate fingerprint comprising peptide products resulting from the degradation of elastin by the enzyme HNE, wherein the substrate fingerprint comprises one or more of the peptides identified in Table 1 and wherein the sample has been obtained from a human or non-human animal that has been or is being treated with the drug Z. In particular the substrate fingerprint comprises at least twenty of the peptides identified in Table 1. More particularly the substrate fingerprint comprises at least ninety of the peptides identified in Table 1. Most particularly the substrate fingerprint comprises at least one hundred and fifty of the peptides identified in Table 1. In one embodiment, a substrate fingerprint comprises all the peptides identified in Table 1.
From the methods according to the invention, it is possible to generate a disease model (a predictive indicator of disease development) which encompasses the presence/absence, relative abundance, and qualitative/quantitative characteristics of singleton peptides/proteins or groupings of peptides/proteins within each fingerprint. By analysing biofluid or tissue samples over a dynamic time period in relation to specific protein/peptide fingerprints, an association correlation between specific settings of clinical disease and certain specific peptide fingerprints can be established.
In the methods according to the invention, it is preferable to use the following methodology to generate the protein/peptide fingerprints. This methodology provides and results in measurements with optimal resolution and sensitivity.
In one embodiment, the methodology is an automated multidimensional liquid phase separation platform technology. The entire platform is operated automatically in a closed operation system, where the multidimensional separating mechanisms are performed in liquid separation phases on chromatographic columns. The interconnections of these separation steps are performed on-line with transfer steps between the columns within the workstation. The interfacing between the separation mechanisms is provided by chromatographic conditions that allow the analytes to be transferred from one dimension to the next without losses. This is accomplished by the liquid-liquid transfer between the dimensions. An operational description of the methodology is given below.
Biofluid samples are introduced into the liquid phase peptide profiling platform and kept at 4° C. to ensure stability of the samples over time. The sample is then injected into the first dimensional separation from the autoinjector (column 1). The mechanism in this step is based upon size separation (the separation packing material, of polymer or silica origin, has highly defined pores). In the first dimension, larger sized proteins and biopolymers will be excluded from entering the pores of the beads of the separation material. The analytes of interest, such as peptide analytes, diffuse into the pores and bind to the functionality within the pores. This functionality can be electrostatic charged surfaces or hydrophobic surfaces onto which the peptides are bound. In this way selective enrichment of the peptides occurs as simultaneously the larger sized proteins and biopolymers are excluded and eluted to waste. The column material is then washed a few times with varying eluents in order to exclude interfering components from the sample that has bound to the outer surface as well as to filters and exposed surfaces of the chromatographic system. In this way, the enriched peptide fraction in the pores of the column material is isolated with a high purity.
After the washing steps, a strong eluent is introduced into the next dimension, column 2. In the second dimension, the elution from column 1 is transferred into column 2 on-line and adsorbed on top of the support of column 2. Column 2 is a bead with charged functionality where the separation is performed by electrostatic mechanisms. A gradient elution is used, and the corresponding peptides are separated in eluted fractions. These fractions are next separated in the third dimension by hydrophobicity, whereby the salt is eliminated from the peptide fractions and concentrated using a washing step of aqueous media, followed by elution onto a target plate surface from where the peptide mass sequences is determined.
The fourth dimension of the system utilizes mass spectrometry where the mass, intensity (quantity) of each and every peptide component of all the fractions of the sample is analyzed.
The data generated from the Mass Spectrometer is multi-factorial and representative of exact individual samplings at specific time frames of the process steps. The characteristic physical properties of the peptides and proteins which include size, mass, charge, and hydrophobicity constants result in individual signature profiles for each peptide or protein. The mass spectrophotometer instrument detects and records these characteristics as files is with three headings: Fraction, mass/z and Intensity.
The mass values are typically recorded to the 4th decimal point, however, in practice for presentation, these are rounded off to the nearest digital value.
The process for establishing statistical significance to the associations of peptides with given fingerprints, or between individual fingerprints or groups of fingerprints is based upon constants of a data matrix which is constructed from the Fraction, mass/z and intensity values of each peptide. The data matrix is derived by the combinations of fractions and masses present in each subject sample, and where the intensities are summed for each such mass and fraction combination. The data matrix then becomes:
where I1,1 etc are the summed intensities. Subjects are normalised by equating the total sum of intensities per subject.
A Java code calculates a regularized t-statistic that minimizes the false positive and false negative rates, following the theory in Broberg [13]. In practice one starts out with a top list size or a number of practical top list sizes, and the task is to find an optimal size in the range given and to populate that list with as many true positives as possible. The test statistic used has the form pioneered by Tusher et at [14]
where diff is an effect estimate, e.g. a group mean difference, S is a standard error, and S0 is a regularizing constant. In the two sample case putting S0=0 will yield the equal variance t-test. Using estimates of the false positive and false negative rates an optimisation procedure minimises the criterion C=√(FP2+FN2) over a lattice of possible values of S0 (given by percentiles in the distribution of S) and the length of the top list.
The output includes group means, p-values for the comparison, the false positive rate and false negative rate that would arise from including the current Fraction×Mass and all with smaller p-values. The cut-off is chosen so as to minimise the false positive and false negative rates.
The resulting mass spectra data, peptide mass, peptide fraction and peptide identity is generated by statistical comparisons between for example COPD and healthy subjects. The digital mass unit is grouped into bins with +/−0.5 mass units on either side of the detected mass, and combined with a given peptide fraction. Next the bin intensities are summed to produce an extrapolated identity for each and every fragment. The total bin numbers used in the statistical analysis were typically between 10.000-20.000. The mass fragments are then compared by subject groupings such as for example COPD or healthy subjects. The statistical analysis is based on 40-50 fractions collected from each subject. The cycle time generating the 40-50 peptide fractions is less than 5 hours.
Integrated process steps for biomarker identification is essential, containing the following four process defining corner stones; 1/ high quality biomedical clinical material; 2/ technology platform for qualitative and quantitative determination of peptides; 3/ in vitro assay where qualitative and quantitative analysis of for example, peptide/peptides products resulting from the reaction with HNE enzyme, with human Elastin as the substrate; 4/ a statistical method for relating multiple sets of peak identities to prototype fingerprints and to differential expression of these peptides and peptide fingerprints within and between designated subject groups. This is a preferred way of analysing the data, allowing analysis of the biomarker peptides that are likely to be associated with COPD patient urine (Scheme D).
Each of the four process corner stones are interdependent and required in order to determine biomarker peptides by differential quantitation between healthy and COPD subjects, relating it to for example, the ENE enzyme function/activity derived peptides from Elastin. In one embodiment, the method may also be used to determine and detect the natural elastin breakdown products resulting from the cleavage of elastin with naturally or designed elastolytic specific enzymes including MMP2, MMP3, MMP7, MMP9, and MMP14.
The Invention is Illustrated by the Following Non-Limiting Examples.
Table 1 provides the identities of landmark peptides which can be used as reference points for discovering and or identifying peptides with similar or nearly similar physical-chemical properties in other complex mixtures of proteins. The Table provides the identities of landmark peptides which can be used as reference points for discovering and or identifying peptides of similar characteristic that are present in human clinical samples. This further provides the identities of landmark peptides which can be used as reference points for discovering and or identifying the exact amino acid sequence identity of these same peptide entities. This provides the identities of landmark peptides which can be used as reference points for discovering and or identifying the presence, absence, and relative abundance of individual peptides or any groupings of these peptides in clinical samples from healthy subjects or subjects with clinical conditions which are associated with the breakdown of human elastin, and the subsequent groupings of subjects based upon these fingerprints or any combination of the peptides present in these samples.
Experimental Procedure
An HNE in vitro assay has been developed in order to make peptide annotations that are directly assigned to the HNE activity. These annotations represent peptide masses that are specific to degrading of human elastin by human HNE proteolytic activity and are used to establish signatures of peptide fingerprints that can be measured and matched with signature peptide profiles present within biofluid samples sampled from both healthy subjects and subjects with clinical conditions such as COPD. Other elastin specific proteolytic enzymes such as MMP2, MMP3, MMP7, MMP9, MMP 12, MMP 13 and MMP14 will produce separate and distinct peptide products from the elastin substrate. The assay is run as follows; human lung elastin is used as the substrate in the in vitro assay reaction with human HNE. The insoluble human elastin is washed using a 100 mM TRIS-HCL buffer pH 7.5 containing 0.1 M NaCl and 10 mM CaCl2, and then centrifuged in-between repeated washing steps. Next, 1.2 mg elastin is re-suspended in the assay buffer, 100 mM TRIS-HCL buffer pH 7.5 containing 0.1 M NaCl and 10 mM CaCl2, and 60 μg human HNE. Incubation was made at 37° C. for 7 hours where the elastin was degraded by the enzyme HNE. The proteolysis process was stopped by the addition of iodoacetamide.
After digestion, the samples were analysed directly or kept at −80° C. in the freezer.
The samples were analysed by thawing of samples at room temperature, and a sample preparation step was performed using a reversed phase preparation step. The sample was then eluted from the preparation by an acetonitrile elution step onto the MALDI-TOF target plate. Cyano-4-hydroxycinnamic acid (ACHA) was added as the matrix for crystal formation and run on the MALDI-TOF mass spectrometer where a peptide fingerprint of the HNE/elastin degradation products was identified and annotated.
The specific experimental conditions used in order to generate differentially displayed peptides in human urine samples from healthy subjects and COPD patients is made as follows.
The urine biofluid is obtained and collected from subjects by normal urination and thereafter aliquoted and frozen at −80° C. The frozen urine is thawed at room temperature, pH adjusted to 2.5 with ortophosphoric acid and processed for HPLC separations. The urine samples are introduced into a two-dimensional chromatography system in which the first separation mechanism utilised is size exclusion chromatography. The cut-off of the column material is designated as 15 kDa. The fractionations resulting from 1) the size exclusion column separation step are transferred on-line to a 2) cation—exchange column step where the peptides/proteins are separated based upon charge. These fractions are then transferred to a 3) reversed phase separation column step where all interfering matrix components present in the sample are eliminated. The third dimension fractions are spotted down onto a MALDI target plate by a robotic feeder that added the MALDI matrix to the peptide/protein sample spots. Next the MALDI sample plate are inserted into the MALDI-TOF mass spectrometer instrument and irradiated to produce peptide fragments which are then analysed according to the exact mass, quantity and isotope resolution of each individual MALDI-TOF peptide spectrum.
The elastin peptide fingerprint generated by HNE digestion under laboratory conditions, and described above, is used as a reference landmark for finding identical or nearly identical homologous peptides within clinical biofluids.
Elastin peptide fragments or the elastin peptide fingerprint are identified in the urine of a patient with Chronic Obstructive Pulmonary Disease (COPD). This patient will have been previously identified within a clinical setting as having COPD. In this example the patient will show abnormal respiratory function tests, as revealed by a low FEV1 score. This patient may further show evidence of pulmonary alveolar hyper-inflation and emphysema using CT imaging. This patient may further show evidence of elastin protein destruction within the parenchyma of the lung by histology examination of the lung, and by using a method for identifying elastin protein within lung tissue by immunohistochemistry with an antibody specific for human elastin, and specific for a hexamer epitope of human elastin and tropoelastin. This patient may further show histological evidence of alveolitis and alveolar macrophage accumulation in areas of lung, located adjacent to elastin expression and elastin degradation. This patient may further show histological evidence that the same alveolar located macrophages were activated to express a marker of activation, CD-68, within tissue by immunohistochemistry of sections of lung tissue with an antibody specific for CD-68. This patient may further show histological evidence that the same alveolar located macrophages were activated to express human HNE within tissue by immunohistochemistry of sections of lung tissue with an antibody specific for human HNE. This patient is a member of a group of 20 patients under study.
The Following Method is Described:
1) A healthy human bio fluid sample is obtained. We define healthy in this example as a living adult person aged 20-80, without symptoms of disease, without current clinical condition, not being treated for a clinical condition with medication or prescribed drugs, without at risk behaviour for developing disease such as smoking, drug abuse, alcoholism, obesity, or without a diagnosed genetic disposition for disease in later onset of life for example. We define biofluid here as any sample of human clinical material in solution form. This may include blood, serum, plasma, saliva, lavages, tears, urine, seminal fluid, joint fluid, aqueous humor, washings of cavities or sinuses, the soluble form of tissue preparations, the soluble form of organ preparations, or sweat, for example. The samples may be derived from singular subjects or pools of singular samples from multiple subjects. In this example the healthy biofluid is urine.
2) The healthy urine sample is analysed to produce a healthy peptide fingerprint. We define analysed for example, as any combination of the steps of sample selection, preparation, separation, identification, annotation, retrieval of stored data, and comparisons of data from the body of this application, the examples provided in this application, the claims of this application. We define fingerprint in this example as an identifiable singular peptide constituent of a native protein, and/or, which may be combined with other identifiable singular peptide constituent(s) of a native protein, and/or the sum total of all identifiable singular peptides which can be grouped together so that as such that grouping becomes an entity itself. We define identifiable as the fraction, mass, and intensity of singular peptide entities. We further define mass as the unique MS mass assignment of an identifiable singular peptide, the derived mass of collected identifiable singular peptide entities, and or the derived mass of collected identifiable singular peptide entities in combination.
3) A diseased human bio fluid sample is obtained from a diseased individual. Diseased in this example is defined as an adult person aged 20-80, with clinical symptoms or presentation of disease, and/or with a clinical at risk behaviour for developing disease, such as smoking, drug abuse, alcoholism, obesity, with or without a diagnosed genetic disposition for disease in later onset of life. For example, patients with a clinical diagnosis of COPD, or patients at risk for developing (COPD). We define at risk for disease for example, as a current smoker of tobacco or other medicinal herb, or a person who has ever smoked, or as a person who has smoked and quit smoking, irrespective of time frame in relation to the sampling of these same patients for study. We further define at risk for disease as any person with deficiencies in the expression or the regulation of expression of alpha-1-anti-trypsin or related naturally occurring biochemical molecules, or any biological entity related to the expression or function of alpha-1-anti-trypsin or related naturally occurring biochemical molecules. The diseased individual in this example is a subject who shows signs of the onset or progression of Chronic Obstructive Pulmonary Disease (COPD). We further may characterize COPD patients as subjects which show elastin breakdown using histological analysis, or immunohistochemistry analysis of pulmonary tissue samples. We further may characterize COPD patients as subjects which show alveolitis, airway hyperinflation, or emphysema using X-ray imaging (HRCT,CT), histological analysis, or immuno-histochemistry analysis of pulmonary tissue samples. We further may characterize COPD patients as subjects which show evidence of activated macrophages within pulmonary tissue samples using histology and immunohistochemistry with antibodies specific for the detection of products of genes expressed by activated macrophages. In this example the patient shows histological evidence of pulmonary airway space enlargement, emphysema, and destruction of pulmonary elastin integrity. The patient further shows evidence of activated alveolar macrophages near sites of pulmonary elastin destruction.
We define diseased biofluid here as any sample of human clinical material in solution form taken from patients who fulfil all or parts of the criteria of the definition of disease as above. This may include blood, serum, plasma, saliva, lavages, tears, urine, seminal fluid, joint fluid, aqueous humor, washings of cavities or sinuses, the soluble form of tissue preparations, the soluble form of organ preparations, or sweat, for example. The samples may be derived from singular patients or pools of singular samples from multiple patients. In this example the biofluid was urine.
4) The diseased sample is analysed to produce a diseased peptide fingerprint. We define fingerprint in this example as an identifiable singular peptide constituent of a native protein, and/or, which may be or not combined with other identifiable singular peptide constituent(s) of a native protein, and or the sum total of all identifiable singular peptides which can be grouped together and as such that grouping becomes an entity itself We define identifiable as the fraction, mass, and intensity of singular peptide entities. We further define mass as the unique MS mass assignment of an identifiable singular peptide, the derived mass of collected identifiable singular peptide entities, and or the derived mass of collected identifiable singular peptide entities in combination.
Summary of Experimental Procedure
Description of the Experimental Procedure
Specific experimental conditions which can be used in order to generate differentially displayed peptides in human urine samples from healthy subjects and COPD patients are as follows;
The biofluid is sampled from patients and thereafter aliquoted and frozen at −80° C.
The frozen urine is thawed at room temperature, pH adjusted to 2.5 with orthophosphoric acid and processed for HPLC separations. The urine samples are introduced into a three-dimensional chromatography system where the first separation mechanism utilised is size exclusion chromatography. The cut-off of the column material is approximately 15 kDa. The fractionations resulting from the size exclusion separation step is next transferred on-line to a cation—exchange column chromatography step where the peptides/proteins are separated based upon charge. These fractions are then transferred to a reversed phase separation step where all interfering matrix components present in the sample are eliminated, this is the third dimensional separation. The third dimension fractions are spotted down onto a MALDI target plate by a robotic feeder that added the MALDI matrix to the peptide/protein sample spots. Next the MALDI sample plate is inserted into the MALDI-TOF mass spectrometer instrument and irradiated to produce fragments which were then analysed according to the exact mass, quantity and isotope resolution of each individual MALDI-TOF peptide spectrum.
[1]. Kielty C M, Sherratt M J, Shuttleworth C A. J Cell Sci. 2002 Jul. 15; 115 (Pt 14):2817-28.
[2]. Akagawa M, Suyama K. Connect Tissue Res. 2000; 41(2):131-41
[3]. Mecham R P, Broekelmann T J, Fliszar C J, Shapiro S D, Welgus H G, Senior R M. J Biol Chem. 1997 Jul. 18; 272(29):18071-6.
[4]. Aebersold, R. & Goodlett, D. R. Chem. Rev. 2001, 101, 269-295
[5]. Mann, M., Hendrickson, R. C., Pandey, A. Annu. Rev. Biochem, 2001. 70, 437-473.
[6]. Wolters, D. A., Washburn, M. P., & Yates, J. R., III Anal. Chem. (2001). 73, 5683-5690
[7]. Washburn, M. P., Wolters, D., Yates, J. R., III Nat. Biotechnol. (2001). 19, 242-247
[8]. Aebersold, R. & Mann, M. Nature (2003). 422, 198-207
[9] Steen, H. and M. Mann (2004). Nat Rev Mol Cell Biol 5(9): 699-711.
[10]. Olsen, J. V. and M. Mann Proc Natl Acad Sci U S A (2004). 101(37): 13417-22.
[11]. Betsuyaku T, Nishimura M, Yoshioka A, Takeyabu K, Miyamoto K, Kawakami Y. Am J Respir Crit Care Med. 1996 September; 154 (3 Pt 1):720
[12]. Kucich U, Rosenbloom J, Kimbel P, Weinbaum G, Abrams W R. Am Rev Respir Dis. 1991 February; 143(2): 279-83.).
[13]. Broberg Genome Biology, (2003, 4(6): R41).
[14] Tusher V G, Tibshirani R, Chu G. Proc. Natl. Acad. Sci. USA, (2001), 98: 5116-5121)
Number | Date | Country | Kind |
---|---|---|---|
0500653-1 | Mar 2005 | SE | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/SE2006/000346 | 3/20/2006 | WO | 00 | 4/27/2010 |