DIFFERENTIALLY EXPRESSED GENES RELATED TO CORONARY ARTERY DISEASE

Abstract
Disclosed are genes and peptides whose expression is correlated to the prevalence of coronary artery disease. Also provided are methods of prognosis, diagnosis and methods of monitoring coronary artery disease based on the measurement of gene expression.
Description
FIELD OF THE INVENTION

This invention relates to genes whose expression is correlated to the prevalence of coronary artery disease. In particular, the invention relates to methods of identifying, predicting and monitoring coronary artery disease in a subject based on measurement of gene expression.


BACKGROUND OF THE INVENTION

Coronary artery disease is the principal cause of death in the United States, Europe and most of Asia. As a multigenic disease, understanding patterns of gene expression may help to explain the individual differences in susceptibility to the disease.


According to the present invention, coronary artery disease is defined as is a narrowing of the coronary arteries that supply blood and oxygen to the heart. Coronary disease usually results from the build up of fatty material and plaques (atherosclerosis). As a result of coronary artery stenosis, the flow of blood to the heart can slow or stop. The disease can be characterized by symptoms, including but not limited to, chest pain (stable angina), shortness of breath, atherosclerosis, ischemia/reperfusion, hypertension, restenosis and arterial inflammation.


In spite of the revolution in genomic knowledge, definitive and reproducible insights into how genetic variants relate to coronary disease are lacking. Defining how gene expression products, in the form of an entire population of e.g. mRNA, cDNA or proteins, relate to disease state provides an opportunity to gain new insights into disease.


Systemic and local inflammation plays a prominent pathogenetic role in atherosclerotic coronary artery disease (CAD), but the relationship of phenotypic changes in a sample isolated from a subject such blood, plasma or circulating leukocytes and the extent of CAD remains unclear. Thus, there is a need in the art for an understanding of whether gene expression patterns in such sample are associated with presence and extent of CAD.


SUMMARY OF THE INVENTION

The present invention relates to genes which are differentially expressed in subjects with coronary artery disease relative to their expression in normal or non-disease states. As such they can be used as biomarkers for coronary artery disease. Further, these identified genes may act via their gene expression products with other genes involved in coronary artery disease. Further, the invention relates to proteins whose abundance is altered in subjects with CAD.


Methods are provided for identifying coronary artery disease in a subject, for identifying subjects who may be predisposed to coronary artery disease and for monitoring the treatment and progression of coronary artery disease. Further provided are methods of screening agents for use in the treatment of coronary artery disease. The present invention also provides a kit that can be used for identifying and monitoring coronary artery disease. Measurement of the biomarkers of the present invention can provide information that may correlate with a diagnosis of coronary artery disease.


While these biomarkers are identified from blood as described in the examples, the sample from which they may be detected is not limited to blood but may be detected in other types of samples such as serum, plasma, lymph, urine, tear, saliva, cerebrospinal fluid, or tissue.


A systematic and comprehensive approach has identified a large number of gene expression products such as mRNA or proteins that are differentially displayed in populations with and without coronary disease. These gene expression products include inflammatory mediators and defense mechanism proteins and mRNAs encoding them.


The simultaneous expression pattern of eight genes (Table 7), of 19 genes (Table 9), of 15 genes (Table 10), and/or the abundance of 11 Disease>Control and Predominant in Disease peptides and/or the abundance of 4 Control>Disease and Predominant in Control peptides (Table 11) is highly predictive for coronary artery disease (CAD). Blood, plasma or peripheral leukocyte gene expression pattern is a thus non-invasive biomarker for coronary artery disease and leads to new pathophysiologic insights.


In one aspect of the invention, a method of identifying or predicting the predisposition of coronary artery disease in a subject is provided, comprising:

    • (i) determining the level of gene expression of at least one gene selected from Table 6 in a subject to provide a first value,
    • (ii) determining the level of gene expression of said at least one gene selected from Table 6 in a control or reference standard to provide a second value and
    • (iii) comparing whether there is a difference between said first value and second value.


In another embodiment of the invention, a method of identifying or predicting the predisposition of coronary artery disease in a subject is provided, comprising the steps of

    • (i) determining the level of gene expression of at least one gene selected from Table 7 and/or Table 9 and/or Table 10 in a subject to provide a first value,
    • (ii) determining the level of gene expression of said at least one gene selected from Table 7 and/or Table 9 and/or Table 10 in a control or reference standard to provide a second value and
    • (iii) comparing whether there is a difference between said first value and second value.


Preferably, the control or reference standard is determined from a subject or group of subjects without coronary artery disease, wherein if the level of gene expression in the subject being tested (first value) is higher than that of the control or reference standard (second value) it is indicative of the presence or prediction of coronary artery disease.


Another aspect of the invention provides a method of identifying or predicting coronary artery disease (CAD) in a subject wherein said method comprises the steps of

    • (a) determining the level of one or more peptide selected from Table 11 in a subject to provide a first value,
    • (b) determining the level of said one or more peptide selected from Table 11 in a control or reference standard to provide a second value and
    • (c) comparing whether there is a difference between said first value and second value.


In one embodiment, wherein the first value is increased for the one or more Disease>Control peptide and/or Predominant in Disease peptide of Table 11, and/or wherein the first value is decreased for the one or more Control>Disease and/or the Predominant in Control peptides of Table 11 it is an identification or prediction of coronary artery disease.


In another embodiment of the invention a method of identifying or predicting coronary artery disease (CAD) in a subject is provided comprising step (a) determining the level of one or more peptides selected from Table 11 and the level of gene expression of at least one gene selected from Table 6 and/or Table 7 and/or Table 9 and/or Table 10 in a subject to provide a first value, step (b) determining the level of said one or more peptide selected from Table 11 and the level of gene expression of said at least one gene selected from Table 6 and/or Table 7 and/or Table 9 and/or Table 10 in a control or reference standard to provide a second value and step (c) determining whether there is a difference between said first value and second value, wherein an elevated peptide level in the first value for the one or more Disease>Control and/or Predominant in Disease peptide of Table 11 and/or wherein a decrease of peptide level in the first value for the one or more Control>Disease and/or Predominant in Control peptide of Table 11 and wherein an elevated level of gene expression of the at least one gene selected from Table 6 and/or Table 7 and/or Table 9 and/or Table 10 in the first value is an identification or prediction of coronary artery disease.


In other embodiments of the invention the prediction of the presence of coronary artery disease has a probability of at least 50%, at least 60%, at least 75%, at least 90% or at least 95% chance.


In another embodiment of the invention, the level of gene expression and/or the level of a peptide may be expressed either as an absolute amount (e.g. μg/ml) or a relative amount (e.g. relative intensity of signals) where there may be a fold increase of level of gene expression and/or peptide level in one sample compared to another sample. In preferred embodiments of the invention, there is at least 20% greater difference in the first value compared to the second value.


In other preferred embodiments of the invention, the level of gene expression of at least one gene selected from the group of genes:

    • (i) with the accession codes: BG537190, L37033, AL581768, AF055000, NM025241, AF151074, AF279372 and BF432478 (Table 6) or
    • (ii) with sequence numbers: SEQ ID No. 1, SEQ ID No. 2, SEQ ID No. 3, SEQ ID No. 4, SEQ ID No. 5, SEQ ID No. 6, SEQ ID No.7 and SEQ ID No. 8 (Table 7) is determined.


In a further preferred embodiment the level of gene expression of a plurality of genes selected from Table 7 is determined. In another preferred embodiment the level of gene expression of at least the gene corresponding to SEQ ID No. 1 from Table 7 is determined. In yet another preferred embodiment the level of gene expression of at least two to eight genes selected from Table 7 is determined. Preferably, the levels of gene expression of 2, 3, 4, 5, 6, and/or 7 genes of Table 7 are determined. In the most preferred embodiment the level of gene expression of all eight genes selected from Table 7 is determined.


In another embodiment of the invention the level of gene expression of at least one gene selected from the group of genes consisting of: PMS2L5 (SEQ ID NO. 9), RXRA (SEQ ID NO. 10), GCN5L1 (SEQ ID NO. 11), CABIN1 (SEQ ID NO. 12), LGALS9 (SEQ ID NO. 13), CEBPA (SEQ ID NO. 14), LRRN4 (SEQ ID NO. 15), STXBP2 (SEQ ID NO. 16), SH3BP2 (SEQ ID NO. 17), RNF24 (SEQ ID NO. 18), PLAUR (SEQ ID NO. 19), RIS1 (SEQ ID NO. 20), ADD1 (SEQ ID NO. 21), GPSM3 (SEQ ID NO. 22), BC002942 (SEQ ID NO. 23), TNFRSF5 (SEQ ID NO. 24), N4BP1 (SEQ ID NO. 25), FLJ12438 (SEQ ID NO. 26) and MMP24 (SEQ ID NO. 27) of Table 9 is determined. In a preferred embodiment, the level of gene expression of the at least one gene corresponding to PMS2L5 (SEQ ID NO. 9) and/or RNF24 (SEQ ID NO. 18) from Table 9 is determined. In yet another preferred embodiment, the levels of gene expression of at least two to nineteen genes selected from Table 9 are determined. Preferably, the levels of gene expression of 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17 and/or 18 genes of Table 9 are determined. In a most preferred embodiment, the levels of gene expression of a plurality of genes selected from Table 9 are determined. In an even more preferred embodiment the levels of gene expression of all nineteen genes selected from Table 9 are determined.


In a further embodiment the level of gene expression of at least one gene selected from the group of genes consisting of: PTP4A1 (SEQ ID NO. 28), PAFAH1B1 (SEQ ID NO. 29), SOX4 (SEQ ID NO. 30), ASNA1 (SEQ ID NO. 31), MAN2A2 (SEQ ID NO. 32), NFYC (SEQ ID NO. 33), NOTCH2 (SEQ ID NO. 34), HDAC5 (SEQ ID NO. 35), HCFC1 (SEQ ID NO. 36), NFX1 (SEQ ID NO. 37), CRSP2 (SEQ ID NO. 38), ICAM1 (SEQ ID NO. 39), PSG3 (SEQ ID NO. 40), STC2 (SEQ ID NO. 41) and SEMA3C (SEQ ID NO. 42) of Table 10 is determined. Preferably, the level of gene expression of the gene corresponding to PTP4A1 (SEQ ID NO. 28) and/or MAN2A2 (SEQ ID NO. 32) from Table 10 is determined. More preferred, the levels of gene expression of at least two to fifteen genes selected from Table 10 are determined. Preferably, the levels of gene expression of 2, 3, 4, 5, 6, 7, 8, 9, 10,11, 12, 13 and/or 14 genes of Table 10 are determined. In a most preferred embodiment, the levels of gene expression of a plurality of genes selected from Table 10 are determined. In an even more preferred embodiment the levels of gene expression of all fifteen genes selected from Table 10 are determined.


According to another embodiment of the invention the levels of gene expression of at least one gene selected from the genes of Table 7 and/or of at least one gene of Table 9 and/or of at least one gene selected of and/or Table 10 are determined. In another embodiment levels of gene expression of at least one gene selected from the genes of Table 9 and of at least one gene selected from Table 10 are determined. Preferably, the levels of gene expression of 2, 3, 4, 5, 6 or 7 genes of Table 7 and/or the levels of gene expression of 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17 or 18 genes of Table 9 and/or the levels of gene expression of 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13 or 14 genes of Table 10 are determined. In a most preferred embodiment, the levels of gene expression of a plurality of genes selected from Table 7 and/or Table 9 and/or Table 10 are determined. In an even more preferred embodiment the levels of gene expression of all genes selected from Table 7 and/or Table 9 and/or Table 10 are determined. The determination of the levels of gene expression of two or more genes may be performed separately or sequentially.


In other preferred embodiments of the invention, the level of 2, 3, 4, 5, 6, 7, 8 and/or 9 Disease>Control peptides and/or the level of 2 Predominant in Disease peptides and/or the level of 2 or 3 Control>Disease peptides and/or the Predominant in Control peptide from Table 11 are determined. Most preferably, the levels of a plurality or all Disease>Control peptides and/or all Predominant in Disease peptides and/or all Control>Disease peptides and/or all Predominant in Control peptide from Table 11 are determined. In most preferred embodiments said peptide levels are measured in a blood, plasma or serum sample. The determination of the levels of gene expression of two or more genes and/or of peptide levels may be performed separately or sequentially.


In another preferred embodiment of the invention the levels of gene expression of at least 5, 10, 15, 20, 25, 30, 40, 50, 60, 70, or at least 80, 90, 100, or at least 110, 120,140 or 150 genes selected from the genes of Table 6 are determined. In another embodiment the level of gene expression for a plurality of genes selected from Table 6 are determined and in a further embodiment the level of expression of all 160 genes of Table 6 is determined.


According to the present invention, the determination of the level of gene expression comprises measuring the protein expression product and detection of said protein expression product may be made by using an antibody, antibody derivative or antibody fragment which specifically binds to the protein.


In another embodiment of the invention, the determination of the level of gene expression or of the peptide level comprises measuring the gene expression of a transcribed polynucleotide of the gene encoding a peptide wherein the transcribed polynucleotide may be mRNA or cDNA. Accordingly, the level of expression may be detected by microarray analysis, Northern blot analysis, reverse transcription PCR or RT-PCR.


In a further aspect of the invention, the level of gene expression and/or the peptide level may be measured ex vivo in a sample selected from the group of: blood, serum, plasma, lymph, urine, tear, saliva, cerebrospinal fluid, leukocyte sample or tissue sample.


In yet another embodiment of the invention, CAD-Index may be measured wherein a CAD Index between 23-100 is indicative of the probability of the presence of coronary artery disease.


In another aspect of the invention, a method of monitoring a subject identified as having coronary artery disease before and after treatment is provided, comprising:

    • (i) determining the level of gene expression of at least one gene from Table 6 or Table 7 in said subject prior to treatment providing a first value,
    • (ii) determining the level of gene expression of the same gene after treatment providing a second value and
    • (iii) comparing the difference in the level of gene expression of said subject before treatment and after treatment.


Another aspect of the invention provides a method of monitoring a subject identified as having coronary artery disease before and during and/or after treatment comprising step (i) determining the level of gene expression of at least one gene from Table 9 or Table 10 in said subject prior to treatment providing a first value, step (ii) determining the level of gene expression of the same at least one gene as in (i) during and/or after treatment providing a second value and (iii) comparing the first value with the second value. Another embodiment provides a method of monitoring a subject identified as having coronary artery disease before and during/after treatment based on the analysis of expression of at least one gene selected of Table 7 and/or Table 9 and/or Table 10. A further embodiment of the invention relates to the use of determining the level of one or more peptides selected from Table 11 in said method. And a still further embodiment of the invention provides such method based on determining the level of one or more peptides selected from Table 11 and the level of gene expression of at least one gene from Table 6 or Table 7 and/or Table 9 and/or Table 10.


In one embodiment of the invention, the method of monitoring a subject further comprises:

    • (iv) determining that a difference in the level of gene expression corresponds to the efficacy of the treatment of coronary artery disease in said subject.


In a preferred embodiment of the invention, a positive response to the treatment is measured when level of gene expression decreases. In another preferred embodiment a decrease in peptide level in the second value for the one or more Disease>Control and/or Predominant in Disease peptide of Table 11; and/or an increase in peptide level in the second value for the one or more Control>Disease and/or Predominant in Control peptide of Table 11, and/or a decrease of level of gene expression of the at least one gene selected from Table 6 or Table 7 and/or Table 9 and/or Table 10 in the second value is indicative for a positive response to the treatment.


In a preferred embodiment of the invention, the level of gene expression of at least SEQ ID No. 1 is determined. In another preferred embodiment of the invention, the level of gene expression of a plurality of genes in Table 7 is determined. In the most preferred embodiment of the invention, the level of gene expression of all 8 genes listed in Table 7 is determined. In other preferred embodiments of the invention the variety of combinations of the one or more peptides of Table 11 and/or of the at least one gene of Table 6 and/or Table 7 and/or Table 9 and/or Table 10 are determined as described further above.


In another aspect of the invention, a method of monitoring the progression or severity of coronary artery disease is provided, comprising:

    • (i) determining the level of gene expression of at least one gene from Table 6 or Table 7 at an initial time point providing a first value,
    • (ii) determining the level of gene expression of at least one gene from Table 6 or Table 7 at a time point after the initial time point providing a second value,
    • (iii) comparing the difference in the level of gene expression of the first value to the second value wherein a higher level of gene expression in the second value is indicative of an increase in severity of coronary artery disease.


Another embodiment of the invention provides a method of monitoring the progression or severity of coronary artery disease comprising the steps of (i) determining the level of gene expression of at least one gene from Table 9 or Table 10 at an initial time point providing a first value, (ii) determining the level of gene expression of the same at least one gene as in (i) at a time point after the initial time point providing a second value, and (iii) determining the difference in the level of gene expression of the first value to the second value wherein a higher level of gene expression in the second value is indicative of an increase in severity of coronary artery disease. In a further embodiment said method is based on the determining the level of gene expression of at least one gene from Table 7 and/or Table 9 and/or Table 10. In preferred embodiments of the invention, a lower level of gene expression in the second value is indicative of a decrease in severity of coronary artery disease. A still further embodiment of the invention provides a method of monitoring the progression or severity of coronary artery disease comprising step (i) determining the level of one or more peptide selected from Table 11 and/or the level of gene expression of at least one gene from Table 6 or Table 7 and/or Table 9 and/or Table 10 at an initial time point providing a first value, step (ii) determining the level of the one or more peptide and the level of gene expression of the same genes as in (i) at a time point after the initial time point providing a second value, and step (iii) determining the difference in the level of the one or more peptides and gene expression of the first value to the second value wherein a higher level of the one or more Disease>Control and/or Predominant in Disease peptide or a lower level of the Control>Disease peptides or the Predominant in Control peptides of Table 11 in the second value, and/or a higher level of expression of the at least one gene from Table 6 or Table 7 and/or Table 9 and/or Table 10 in the second value is indicative of an increase in severity of coronary artery disease.


In yet another aspect of the invention, a method of screening candidate agents for use in treatment of coronary artery disease is provided, comprising:

    • (i) contacting a cell capable of expressing a gene selected from Table 6 or Table 7 with a candidate agent ex vivo,
    • (ii) determining the level of gene expression of said at least one gene from Table 6 or Table 7 to provide a first value,
    • (iii) determining the level of gene expression of the same at least one gene from Table 6 or Table 7 in a sample in the absence of the candidate agent to provide a second value, and
    • (iv) comparing the first value with the second value wherein a difference in level of gene expression is indicative of an agent potentially capable of being used for the treatment of coronary artery disease.


Another aspect of the invention provides a method of screening candidate agents for use in treatment of coronary artery disease said method comprises the steps of

    • (i) contacting a cell or sample of cells capable of expressing a gene selected from Table 7 and/or Table 9 and/or Table 10 with a candidate agent ex vivo,
    • (ii) determining the level of gene expression of said at least one gene from Table 7 and/or Table 9 and/or Table 10 to provide a first value,
    • (iii) determining the level of gene expression of the same at least one gene from Table 7 and/or Table 9 and/or Table 10 in a cell or sample of cells in the absence of the candidate agent to provide a second value, and
    • (iv) comparing the first value with the second value wherein a difference in level of gene expression is indicative of an agent potentially capable of being used for the treatment of coronary artery disease.


Another embodiment of the invention provides the following steps of

    • (i) contacting a cell or sample of cells capable of producing at least one peptide selected from Table 11 and/or capable of expressing at least one gene selected from Table 6 or Table 7 and/or Table 9 and/or Table 10 with a candidate agent ex vivo,
    • (ii) determining the level of the one or more peptide of (i) and/or the level of gene expression of the at least one gene of (i) to provide a first value,
    • (iii) determining the level of the one or more peptide of (i) and/or the level of gene expression of the at least one gene of (i) in a cell or sample of cells in the absence of the candidate agent to provide a second value, and
    • (iv) comparing the first value with the second value wherein a difference in the level of the one or more peptide and/or in the level of gene expression is indicative of an agent potentially capable of being used for the treatment of coronary artery disease.


Preferably, a decrease in the first value of the at least one Disease>Control peptide and/or Predominant in Disease peptide and/or an increase in the first value of the at least one Control>Disease and/or Predominant in Control peptide from Table 11 and/or a decrease in the first value of the at least one gene selected from Table 6 or Table 7 and/or Table 9 and/or Table 10 is indicative of an agent potentially capable of being used for the treatment of coronary artery disease.


In a preferred embodiment of the invention, there is decrease in the level of gene expression in the presence of a candidate agent. In yet another preferred embodiment, a decrease in the level of gene expression of at least SEQ ID No. 1 may be measured. In a further preferred embodiment, a decrease in the level of gene expression of a plurality of genes selected from Table 7 can be measured. In a most preferred embodiment, a decrease in the level of gene expression of all eight genes selected from Table 7 may be measured. In another preferred embodiment the level of gene expression of at least SEQ ID No. 9 of Table 9 and/or of at least SEQ ID No. 28 of Table 10 the is determined. Most preferably, the level of a plurality of peptides of Table 11 and/or the levels of gene expression of a plurality of genes selected from Table 7 and/or Table 9 and/or Table 10 are determined.


In another aspect of the invention, a method of treating or preventing coronary artery disease is provided, comprising administering to a subject an effective amount of an agent that can induce a decrease in the level of gene expression, synthesis, or activity of at least one gene or gene expression products from Table 6 or Table 7. In another aspect of the invention a method of treating or preventing coronary artery disease is provided said method comprises administering to a subject an effective amount of an agent that can induce a decrease in the level of gene expression, synthesis, or activity of at least one gene or gene expression products from Table 6 or Table 7 and/or Table 9 and/or Table 10. Further is provided, a method of treating or preventing coronary artery disease comprising administering to a subject an effective amount of an agent that can induce a decrease in the level of at least one Disease>Control and/or Predominant in Disease peptide and/or an increase in the level of at least one Control>Disease and/or Predominant in Control peptide from Table 11 and/or a decrease in gene expression, synthesis, or activity of at least one gene or gene expression products from Table 6 or Table 7 and/or Table 9 and/or Table 10. In a preferred embodiment said agent is selected from the group consisting of antisense oligonucleotides, double stranded RNA, ribozyme, small molecule, antibody or antibody fragment.


In yet another aspect of the invention, a method of manufacture of a medicament for the treatment or prevention of coronary artery disease is provided, comprising an effective amount of an agent that can induce a decrease in the level of gene expression, synthesis, or activity of at least one gene or gene expression products from Table 6 or Table 7.


Another aspect of the invention relates to the use of a substance comprising an effective amount of an agent that can induce a decrease in the level of gene expression, synthesis, or activity of at least one gene or gene expression products from Table 6 or Table 7 and/or Table 9 and/or Table 10 in the manufacture of a medicament for the treatment or prevention of coronary artery disease. Another aspect provides the use of a substance comprising an effective amount of an agent that can induce a decrease in the level of at least one Disease>Control and/or Predominant in Disease peptide and/or induce an increase in the level of at least one Control>Disease and/or Predominant in Control peptide from Table 11 and/or a decrease in gene expression, synthesis, or activity of at least one gene or gene expression products from Table 6 or Table 7 and/or Table 9 and/or Table 10 in the manufacture of a medicament for the treatment or prevention of coronary artery disease.


In other preferred embodiments of the invention the level and the variety/combination of peptides of Table 11 and/or the level of gene expression and the variety of the genes of Table 6 and/or Table 7 and/or Table 9 and/or Table 10 are determined.


In the methods of screening candidate agents, the methods of monitoring the progression or severity of coronary artery disease, the methods of monitoring a subject identified as having coronary artery disease before and after treatment, the methods of treating or preventing coronary artery disease or the methods of manufacture of a medicament for the treatment or prevention of coronary artery disease provided by the invention, the level of peptides or gene expression may be determined for a variety/combination of peptides or genes as described further above for the method of identifying or predicting the predisposition of coronary artery disease. Accordingly, the levels of gene expression of 2, 3, 4, 5, 6 or 7 genes of Table 7 and/or the levels of gene expression of 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17 or 18 genes of Table 9 and/or the levels of gene expression of 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13 or 14 genes of Table 10 are determined. In a most preferred embodiment, the levels of gene expression of a plurality of genes selected from Table 6 or Table 7 and/or Table 9 and/or Table 10 are determined. In an even more preferred embodiment the levels of gene expression of all genes selected from Table 6 or Table 7 and/or Table 9 and/or 10 are determined. In other preferred embodiments of the invention, the level of 2, 3, 4, 5, 6, 7, 8 or 9 Disease>Control peptides and/or the level of 2 Predominant in Disease peptides and/or the level of 2 or 3 Control>Disease peptides and/or the Predominant in Control peptide from Table 11 are determined. Most preferably, the levels of all Disease>Control peptides and/or all Predominant in Disease peptides and/or all Control>Disease peptides and/or all Predominant in Control peptide from Table 11 are determined. Further embodiments provide that the level of peptide or gene expression of a plurality or all of said peptides of Table 11 and/or genes of Table 6 or Table 7 and/or Table 9 and/or Table 10 are determined.


In a further aspect of the invention, a kit is provided for the identifying or predicting the predisposition coronary artery disease in a subject comprising (i) instructions for determining the level of gene expression of at least one gene from Table 6 or Table 7 and (ii) control or reference standard level of gene expression from a normal subject or subjects without coronary artery disease for at least one gene in Table 6 or Table 7. In another aspect a kit for the identifying or predicting the predisposition coronary artery disease in a subject is provided, said kit comprising (i) instructions for determining the level of gene expression of at least one gene selected from Table 6 or Table 7 and/or Table 9 and/or Table 10, (ii) control or reference standard level of gene expression from a normal subject or subjects without coronary artery disease for the genes selected from Table 6 or Table 7 and/or Table 9 and/or Table 10 of (i). In one embodiment, the kit additionally contains antibodies, antibody derivatives or antibody fragments capable of binding to a polypeptide encoded by of at least one gene from Table 6 and/or Table 7, and/or Table 9 and/or Table 10. Further, the invention relates to a kit for the identifying or predicting coronary artery disease in a subject comprising (a) instructions for determining the peptide level of at least one peptide from Table 11 and (b) control or reference standard peptide level from a normal subject or subjects without coronary artery disease for at least one peptide in Table 11. In a preferred embodiment the kit further comprises (c) an antibody that binds to said at least one peptide from Table 11. Another embodiment of the invention provides a kit for the identifying or predicting coronary artery disease in a subject which comprises (a) instructions for determining the peptide level of at least one peptide from Table 11 and for determining the level of gene expression of at least one gene selected from Table 6 or Table 7 and/or Table 9 and/or Table 10 and (b) control or reference standard peptide level from a normal subject or subjects without coronary artery disease for at least one peptide in Table 11 and for determining the level of gene expression of at least one gene of Table 6 or Table 7 and/or Table 9 and/or Table 10, and optionally also (c) an antibody that binds to said at least one peptide from Table 11 and additionally antibodies, antibody derivatives or antibody fragments capable of binding to a polypeptide encoded by the at least one gene from Table 6 or Table 7, and/or Table 9 and/or Table 10 are provided. In further embodiments of the invention kits may be used in any one of the methods of the invention.




BRIEF DESCRIPTION OF FIGURES


FIG. 1 illustrates the second degree polynomial regression analysis of the resulting t-scores versus CAD-Index resulted in the prediction model including 95% confidence range of the regression and the 95% prediction interval with r2=0.764 (p<0.001) (A), and illustrates the predicted CAD-index versus the CAD-index of model 1 involving 8 predictor genes (B).



FIG. 2 illustrates the Variable Importance in the Projection (VIP) of each gene for the separate PLS analyses of the three different cohorts compared to the PLS analysis including all subjects. Displayed are the 24 genes with the highest VIP. The curve shows a steep decrease for the first 8 genes; thereafter, the decrease is rather flat and almost linear.



FIG. 3 illustrates the final PLS analysis with the eight most important predictor genes applied to all 222 subjects involved in this study. The subjects are ordered by their CAD index and have a CAD Index between 23-100 depending on the severity of stenosis. In order to better demonstrate the predictive power of the model, the CAD Index is superimposed on the t-scores.



FIG. 4 illustrates the predicted CAD-index versus the CAD-Index of model 2 involving 19 predictor genes (A) and of model 3 involving 15 predictor genes (B).




DETAILED DESCRIPTION OF THE INVENTION

Unless defined otherwise, all technical and scientific terms used herein have the meaning commonly understood by a person skilled in the art to which this invention belongs. The following references provide one of skill with a general definition of many of the terms used in this invention: Singleton et al., Dictionary of Microbiology and Molecular Biology (2nd ed. 1994); The Cambridge Dictionary of Science and Technology (Walker ed., 1988); The Glossary of Genetics, 5th Ed., R. Rieger et al. (eds.), Springer Verlag (1991); and Hale & Marham, The Harper Collins Dictionary of Biology (1991). As used herein, the following terms have the meanings ascribed to them unless specified otherwise.


“Biomarker” in the context of the present invention refers to genes and gene expression products (i.e. proteins or polypeptides, mRNA) which are differentially expressed in a sample taken from subjects having coronary artery disease as compared to a comparable sample taken from control subjects (e.g., a person with a negative diagnosis or undetectable coronary artery disease, normal or healthy subject).


“Proteins or polypeptides or peptides” of the present invention are contemplated to include any fragments thereof, in particular, immunologically detectable fragments. One of skill in the art would recognize that proteins which are released by cell in the heart which become damaged during vascular injury could become degraded or cleaved into such fragments. Additionally, certain proteins or polypeptides are synthesized in an inactive form, which may be subsequently activated by proteolysis. Such fragments of a particular protein may be detected as a surrogate for the protein itself.


The term “sample” as used herein refers to a sample from a subject obtained for the purpose of identification, diagnosis, prediction, or monitoring. In certain aspects of the invention such a sample may be obtained for the purpose of determining the outcome of an ongoing condition or the effect of a treatment regimen on a coronary artery disease. Preferred test samples include blood, serum, plasma, lymph, urine, tear, saliva, cerebrospinal fluid, leukocyte or tissue samples. In addition, one of skill in the art would realize that some test samples would be more readily analyzed following a fractionation or purification procedure, for example, separation of whole blood into serum or plasma components.


A difference in the “level of gene expression” or in the “peptide level” is a relative difference. For example, it may be a difference in the level of gene expression a sample taken from a subject having coronary artery disease as compared to control subjects or a reference standard. A comparison can be made between the level of gene expression in a subject at risk of coronary artery disease to a subject known to be free of a given condition, i.e. “normal” or “control”. Alternatively, a comparison can be made to a “reference standard” known to be associated with a good outcome (e.g. the absence of coronary artery disease) such as an average level found in a population of normal individuals not suffering from coronary artery disease. According to the present invention, a comparison can be made between the level of gene expression and the identification or predisposition of a subject to develop coronary artery disease.


The level of gene expression or the level of proteins/peptides present in a sample being tested can be either in absolute amount (e.g. μg/ml) or a relative amount (e.g. relative intensity of signals).


A difference is present between the two samples if the amount of gene expression is statistically significantly different from the amount of the polypeptide in the other sample. For example, there is a difference in gene expression or in the level of proteins/peptides between the two samples if the amount of polypeptide is present in at least about 20%, at least about 30%, at least about 50%, at least about 80%, at least about 100%, at least about 200%, at least about 400%, at least about 600%, at least about 800%, or at least about 1000% greater than it is present in the other sample.


Identifying or predicting the predisposition of coronary artery disease may be considered as a diagnostic technique. Diagnostic methods differ in their sensitivity and specificity. The skilled artisan often makes a diagnosis, for example, on the basis of one or more diagnostic indicators, in the present invention these are the expression levels of the genes from Table 6 and/or Table 7 and/or Table 9 and/or Table 10, and/or the peptide levels of Table 11. The presence, absence, or amount of which is indicative of the presence, severity, or absence of the coronary artery disease.


Multiple determination of the gene expression of one or more genes can be made of Table 6 and/or Table 7 and/or Table 9 and/or Table 10, and/or of the peptide levels of Table 11 can be made as well as determination of a temporal change in gene expression or peptide abundancy which can be used to monitor the progress of the disease or a treatment of the disease. For example, gene expression/peptide abundancy may be determined at an initial time, and again at a second time. In such aspects, an increase in the gene expression and/or peptide level from the initial time to the second time may be diagnostic of coronary artery disease. Likewise, a decrease in the gene expression and/or peptide level from the initial time to the second time may be indicative of a responsiveness of a subject to a particular type of treatment of coronary artery disease. Furthermore, the change in gene expression of one or more genes may be related to the severity of coronary artery disease and future adverse events.


In one embodiment of the invention, the level of gene expression as least one gene from Table 6 is determined. In a preferred embodiment, the level of gene expression of at least SEQ ID. No. 1 (Table 7) is determined. In another preferred embodiment, the level of gene expression of a plurality of genes from Table 7 is determined. In a most preferred embodiment, the level of gene expression all eight genes of Table 7 are determined. In other preferred embodiments, the level of gene expression of at least SEQ ID. No. 9 (Table 9) and/or SEQ ID. No. 28 (Table 10) is determined. In yet other preferred embodiments the levels of gene expression of 2, 3, 4, 5, 6, 7 or 8 genes of Table 7 and/or the levels of gene expression of 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18 or 19 genes of Table 9 and/or the levels of gene expression of 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14 or 15 genes of Table 10 are determined. In a most preferred embodiment, the levels of gene expression of a plurality of genes selected from Table 6 or Table 7 and/or Table 9 and/or Table 10 are determined. In an even more preferred embodiment the levels of gene expression of all genes selected from Table 6 or Table 7 and/or Table 9 and/or 10 are determined. In other preferred embodiments of the invention, the level of 2, 3, 4, 5, 6, 7, 8 or 9 Disease>Control peptides and/or the level of 2 Predominant in Disease peptides and/or the level of 2 or 3 Control>Disease peptides and/or the Predominant in Control peptide from Table 11 are determined. Most preferably, the levels of all Disease>Control peptides and/or all Predominant in Disease peptides and/or all Control>Disease peptides and/or all Predominant in Control peptide from Table 11 are determined. The level of peptides may optionally be determined together, simultaneously or sequentially, with the level of gene expression of the genes of Table 6 or Table 7 and/or Table 9 and/or Table 10. Further embodiments provide that the level of peptide or gene expression of a plurality or all of said peptides of Table 11 and/or genes of Table 6 or Table 7 and/or Table 9 and/or Table 10 are determined.


The skilled artisan will understand that, while in certain aspects comparative measurements of gene expression are made of the same gene at multiple time points, one could also measure a given gene at one time point, and a second gene at a second time point, and a comparison of the gene expression of these genes may provide diagnostic information or monitor the progress of the disease.


In a preferred aspect of the invention, gene expression of one or more genes from Table 7 and/or Table 9 and/or Table 10 and/or level of peptides of Table 11 may be comparatively measured at different time points.


The phrase “probability of the presence of coronary artery disease” as used herein refers to methods by which the skilled artisan can predict the condition in a subject. It does not refer to the ability to predict the coronary artery disease with 100% accuracy. Instead, the skilled artisan will understand that it refers to an increased probability that a coronary artery disease is present or will develop. For example, coronary artery disease is more likely to occur in a subject having high levels of expression of genes of Table 6 and/or Table 7 and/or Table 9 and/or Table 10 and/or increased levels for Disease>Control peptide and/or Predominant in Disease peptide of Table 11, and/or decreased levels of Control>Disease and/or the Predominant in Control peptides of Table 11 when compared to a control or reference standard such as a subject not being affected by or having a predisposition for CAD. In one aspect of the invention, the probability of the presence of coronary artery disease is about a 50% chance, about a 60% chance, about a 75% chance, about a 90% chance, and about a 95% chance. The term “about” in this context refers to +/−1%.


The skilled artisan will understand that associating a particular gene with a predisposition to coronary artery disease is a statistical analysis. Additionally, a change in gene expression and/or peptide level from baseline levels may be reflective of patient prognosis, and the degree of change in gene expression may be related to the severity of adverse events. Statistical significance is often determined by comparing two or more populations, and determining a confidence interval and/or a p value. Preferred confidence intervals of the invention are 90%, 95%, 97.5%, 98%, 99%, 99.5%, 99.9% and 99.99%, while preferred p values are 0.1, 0.05, 0.025, 0.02, 0.01, 0.005, 0.001, and 0.0001.


In a further aspect, the invention relates to kits for identification of coronary artery disease in a subject. These kits comprise devices and reagents for measuring gene expression and/or determining peptide levels in a subject's sample and instructions for performing the assay and interpreting the results. Such kits preferably contain sufficient reagents to perform one or more such determinations.


The “sensitivity” of an assay according to the present invention is the percentage of diseased individuals (those with coronary artery disease) who test positive (percent of “true positives”). Diseased individuals not detected by the assay are “false negatives”. Subjects who are not diseased and who test negative in the assay, are termed “true negatives.” The “specificity” of a diagnostic assay is 1 minus the false positive rate, where the “false positive rate” is defined as the proportion of those without the disease who test positive. While a particular diagnostic method may not provide a definitive diagnosis of a condition, it suffices if the method provides a positive indication that aids in diagnosis.


Measurement of Gene Expression


Numerous methods and devices are well known to the skilled artisan for the detection and analysis of the gene expression and measurement of peptide levels of the present invention. The term “gene expression” refers to the presence or amount of a specific gene including, but not limited to, mRNA, cDNA or the polypeptide, peptide or protein expression product of a specific gene. In a preferred aspect of the invention, the gene expression of genes from Table 6 and/or Table 7 and/or Table 9 and/or Table 10 and/or the level of peptides of Table 11 are determined.


In one embodiment of the invention, the gene expression is determined by measuring RNA levels. Gene expression may be detected using a PCR-based assay. Or in other aspects of the invention, reverse-transcriptase PCR (RT-PCR) is used to detect the expression of RNA. In RT-PCR, RNA is enzymatically converted to cDNA using a reverse-transcriptase enzyme. The cDNA is then used as a template for a PCR reaction. PCR products can be detected by any suitable method including, but not limited to, gel electrophoresis and staining with a DNA-specific stain or hybridization to a labeled probe. In yet another aspect of the invention, the quantitative RT-PCR with standardized mixtures of competitive templates can be utilized.


In another embodiment of the present invention, gene expression is detected using a hybridization assay. In a hybridization assay, the presence or absence of biomarker is determined based on the ability of the nucleic acid from the sample to hybridize to a complementary nucleic acid molecule, e.g., an oligonucleotide probe. A variety of hybridization assays are available. In some embodiments of the invention, hybridization of a probe to the sequence of interest is detected directly by visualizing a bound probe, e.g., a Northern or Southern assay. In these assays, DNA (Southern) or RNA (Northern) is isolated. The DNA or RNA is then cleaved with a series of restriction enzymes that cleave infrequently in the genome and not near any of the markers being assayed. The DNA or RNA is then separated, e.g., on an agarose gel, and transferred to a membrane. A labeled probe or probes, e.g., by incorporating a radionucleotide, is allowed to contact the membrane under low-, medium- or high-stringency conditions. Unbound probe is removed and the presence of binding is detected by visualizing the labeled probe. In another embodiment of the invention, the gene expression is determined for the genes of Table 6 or Table 7 and/or Table 9 and/or Table 10 and/or for the genes encoding the peptides of Table 11.


In yet another embodiment of the invention, the gene expression is determined by measuring polypeptide gene expression products. In a preferred aspect of the invention, gene expression is measured by identifying the amount of one or more polypeptides encoded by one of the genes in Table 6 or Table 7 and/or Table 9 and/or Table 10 and/or the amount of peptides of Table 11. The present invention is not limited by the method in which gene expression is detected or measured.


A protein or polypeptide or peptide expression product encoded by one of the genes in Table 6 or Table 7 and/or Table 9 and/or Table 10 and/or a peptide of Table 11 may be detected by a suitable method. With regard to peptides, polypeptides or proteins in samples, immunoassay devices and methods are often used. These devices and methods can utilize labeled molecules in various sandwich, competitive, or non-competitive assay formats, to generate a signal that is related to the presence or amount of an analyte of interest. Additionally, certain methods and devices, such as biosensors and optical immunoassays, may be employed to determine the presence or amount of analytes without the need for a labeled molecule.


The presence or amount of a protein or polypeptide or peptides is generally determined using specific antibodies and detecting specific binding. Any suitable immunoassay may be utilized, for example, enzyme-linked immunoassays (ELISA), radioimmunoassays (RIAs), competitive binding assays, and the like. Specific immunological binding of the antibody to the protein or polypeptide can be detected directly or indirectly. Direct labels include fluorescent or luminescent tags, metals, dyes, radionuclides, and the like, attached to the antibody. Indirect labels include various enzymes well known in the art, such as alkaline phosphatase, horseradish peroxidase and the like.


The use of immobilized antibodies specific for the proteins or polypeptides is also contemplated by the present invention. The antibodies can be immobilized onto a variety of solid supports, such as magnetic or chromatographic matrix particles, the surface of an assay place (such as microtiter wells), pieces of a solid substrate material (such as plastic, nylon, paper), and the like. An assay strip can be prepared by coating the antibody or a plurality of antibodies in an array on solid support. This strip can then be dipped into the test sample and then processed quickly through washes and detection steps to generate a measurable signal, such as a colored spot.


The analysis of a plurality of genes and/or peptides of the present invention may be carried out separately or simultaneously with one test sample. In addition, one skilled in the art would recognize the value of testing multiple samples (for example, at successive time points) from the same individual. Such testing of serial samples allows the identification of changes in gene expression and/or peptide levels over time. Increases or decreases in gene expression levels, as well as the absence of change in gene expression and/or peptide levels, can provide useful information about the disease status that includes, but is not limited to identifying the approximate time from onset of the event, the presence and amount of salvagable sample, the appropriateness of drug therapies, the effectiveness of various therapies as indicated by reperfusion or resolution of symptoms, differentiation of the various types of coronary artery disease, identification of the severity of the event, identification of the disease severity, and identification of the patient's outcome, including risk of future events.


A panel comprising of the genes referenced above may be constructed to provide relevant information related to the diagnosis or prognosis of coronary artery disease and management of subjects with coronary artery disease. Such a panel can be constructed preferably using the sequences of Table 7 and/or Table 9 and/or Table 10 and/or the peptides of Table 11. The analysis of a single genes or subsets of genes comprising a larger panel of genes alone or in combination with the analysis of a single peptide or a subset of peptides can be carried out by one skilled in the art to optimize sensitivity or specificity.


The analysis of gene expression and/or determination of peptide levels can be carried out in a variety of physical formats as well. For example, the use of microtiter plates or automation could be used to facilitate the processing of large numbers of test samples in a high through-put manner.


In another aspect of the invention, an array is provided to which probes that correspond in sequence to gene products, e.g., cDNAs, mRNAs, cRNAs, polypeptides and fragments thereof, can be specifically hybridized or bound at a known position. In one embodiment of the invention, the array is a matrix in which each position represents a discrete binding site for a product encoded by a gene (e.g., a protein or RNA) preferably of the genes listed in Table 7 and/or Table 9 and/or Table 10 and/or of the peptides listed in Table 11. In another aspect of the invention, the “binding site”, hereinafter “site”, is a nucleic acid or nucleic acid analogue to which a particular cognate cDNA can specifically hybridize. The nucleic acid or analogue of the binding site can be, e.g., a synthetic oligomer, a full-length cDNA, a less than full-length cDNA or a gene fragment.


In another aspect, the present invention provides a kit for the analysis of gene expression and/or peptide levels. Such a kit preferably comprises devices and reagents for the analysis of at least one test sample and instructions for performing the assay. Optionally the kits may contain one or more means for converting gene expression and/or amounts of peptides to a diagnosis or prognosis of coronary artery disease in a subject. Comparison of the subject's gene expression pattern, with the controls or reference standards, would indicate whether the subject has coronary artery disease.


In one embodiment of the invention, the kits contain antibodies specific for at least one gene, preferably from Table 7 and/or Table 9 and/or Table 10 and/or for at least one peptide of Table 11. In other embodiments, the kits contain reagents specific for the detection of nucleic acid, e.g., oligonucleotide probes or primers. In preferred embodiments, the kits contain all of the components necessary to perform a detection assay, including all controls and instructions for performing assays and for analysis of results. In one embodiment of the invention, the kits contain instructions including a statement of intended use as required by the Environmental Protection Agency or U.S. Food and Drug Administration (FDA) for the labeling of in vitro diagnostic assays and/or of pharmaceutical or food products.


In another aspect of the present invention, a method of screening agents for use in the treatment of coronary artery disease is provided. In particular agents that can induce a decrease in the level of gene expression, synthesis or activity of at least one gene or gene expression product from Table 6 or Table 7 and/or Table 9 and/or Table 10 and/or induce a decrease in the level of at least one Disease>Control and/or Predominant in Disease peptide and/or induce an increase in the level of at least one Control>Disease and/or Predominant in Control peptide from Table 11.


For example, in one embodiment one would first treat a test subject known to have coronary artery disease with a test agent and then analyze a representative sample of the subject for the level of expression of the genes or sequences which change in expression in response to coronary artery disease and/or for the level of peptide. One then compares the analysis of the sample with a control known to have coronary artery disease but not given the test compound and thereby identifies test compounds that are capable of modifying the gene expression.


In another embodiment of the present invention, one would base a therapy on the sequences of the genes disclosed in Table 7 and/or Table 9 and/or Table 10 and/or Table 11. In general, one would try to decrease the expression of genes identified herein as over-expressed in coronary artery disease, and to induce a decrease in the level of Disease>Control and/or Predominant in Disease peptide and to induce an increase in the level of Control>Disease and/or Predominant in Control identified herein.


Methods of decreasing the expression of said genes would be known to one of skill in the art. Examples for supplementation of expression would include supplying subject with additional copies of the gene. A preferred example for decreasing expression would include RNA antisense technologies or pharmaceutical intervention. The genes or peptides disclosed in Table 6 or Table 7 and/or Table 9 and/or Table 10 would be appropriate drug development targets, preferably those in Table 7 and 9 and 10 and/or Table 11.


The details of one or more embodiments of the invention are set forth in the accompanying description above. Although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, the preferred methods and materials are now described. Other features, objects, and advantages of the invention will be apparent from the description and from the claims. In the specification and the appended claims, the singular forms include plural referents unless the context clearly dictates otherwise. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. All patents and publications cited in this specification are incorporated by reference.


EXAMPLES

The following EXAMPLES are presented in order to more fully illustrate the preferred embodiments of the invention. These examples should in no way be construed as limiting the scope of the invention, as defined by the appended claims. Illustrated below are a sample study protocol and the identification of biomarkers that have positive or negative correlations with coronary artery disease. While the sample protocol has been developed with human blood plasma samples, the same general experimental set-up may be used for other suitable biological samples to detect biomarkers. All documents mentioned herein are fully incorporated by reference.


Example 1

Patients and control subjects are from the “Duke Databank for Cardiovascular Disease and the Duke Cardiac Catheterization Laboratory” of the Duke Clinical Research Institute (DCRI). After the subjects provide informed consent, additional clinical data is collected to supplement the clinical database. Patients with coronary artery disease are recruited at the time of their procedure while in the cardiac catheterization laboratory and the control population are recruited both from the cardiac catheterization laboratory and retrospectively within two years of cardiac catheterization.


Populations are defined in order to minimize differences in plasma proteins unrelated to the presence or absence of coronary artery disease. Three different cohorts of subjects and controls are enrolled:

    • (i) “pooled males” are matched for age and ethnic group. Major diseases known to be associated with differences in proteins such as diabetes mellitus and inflammatory conditions such as recent acute myocardial infarction or active malignancy are excluded.
    • (ii) “other males” do not need to comply with the matching criteria. It is the intention to include a relatively young population of patients with coronary artery disease in order to enrich the sample for factors related to coronary disease that would be expected to more prominent in a younger population.
    • (iii) “females” are enrolled as the third cohort to be able to define characteristics of the overall population and to assess gene polymorphisms of interest in the female population.


Inclusion criteria for the coronary disease patient population are: age between 40 and 65 and coronary artery stenosis of >50% in at least one major coronary artery. Exclusion criteria are acute myocardial infarction within one month, diabetes mellitus, uncontrolled hypertension (systolic blood pressure >180 mmHg or diastolic blood pressure >100 mmHg), and/or with end-organ damage, renal insufficiency (serum creatinine >2.0 mg/dL and/or BUN >40 mg/dL), active malignancy, significant valvular heart disease, NYHA Class III or IV heart failure, cigarette smoking >2 packs per day, total cholesterol >300 mg/dL or triglyceride >400 mg/dL, any other disease or condition expected to cause major alteration in plasma protein composition, anemia (hemoglobin <12.5 g/dL for females or <13.5 g/dL for males), and hypotension (systolic blood pressure <90 mmHg and diastolic blood pressure <50 mmHg).


Inclusion criteria for the “control” population are: age between 40 and 65 for pooled males cohort only, no coronary artery stenosis of >25% on cardiac catheterization within two years, and normal left ventricular ejection fraction and normal regional wall motion. Exclusion criteria are typical signals of angina, or any evidence of myocardial ischemia on stress testing, myocardial infarction or unstable angina, any history of peripheral arterial or cerebrovascular disease including stroke, transient ischemic attack, or significant vascular stenosis on noninvasive imaging or angiography, diabetes, uncontrolled hypertension (systolic blood pressure >180 mmHg or diastolic blood pressure >100 mmHg), and/or with end-organ damage, renal insufficiency (serum creatinine >2.0 mg/dL and/or BUN >40 mg/dL), active malignancy, significant valvular heart disease, NYHA Class III or IV heart failure, cigarette smoking >2 packs per day, total cholesterol >300 mg/dL or triglyceride >400 mg/dL, any symptomatic heart failure, significant valvular disease, any other disease or condition expected to cause major alteration in plasma protein composition, anemia (hemoglobin <12.5 g/dL for females or <13.5 g/dL for males), and hypotension (systolic blood pressure <90 mmHg and diastolic blood pressure <50 mmHg).


Assessment of Clinical Parameters


Demographic information of the subjects include age at time of study (years), age at last catheterization (years), gender, race, smoking behavior, systolic and diastolic blood pressure (mmHg), and severity of coronary artery disease (CAD-Index). Assessment of medical history includes angina, diabetes mellitus including end organ damage, hypertension, myocardial infarction, PCI, CABG, peripheral vascular disease, cerebro-vascular disease, congestive heart failure, severity of congestive heart failure, and renal insufficiency.


Regarding coronary disease following information is collected: LV ejection fraction (%), number of significantly obstructed vessels, maximal percent stenosis of the left main coronary system, maximal percent stenosis of the left anterior descending artery, maximal percent stenosis of the right coronary artery, maximal percent stenosis of the left circumflex artery system, mitral valve stenosis, mitral insufficiency grade, valvular disease, aortic stenosis, severity of coronary artery disease, follow-up MI, PCI, and CABG, indication for catheterization and post-catheterization diagnosis, if different.


Assessment of medication includes ACE inhibitors, amiodarone, angiotensin receptor blocker (ARB), aspirin products, beta-blocker, calcium blocker, central acting agents, digoxin, diuretics, dofetalide, fibrate, hormoone replacement therapy, niacin compound, nitrate, other antiplatelet, sotanol, statin, type 1 agents, vasodilatators, and warfarin.


Clinical laboratory parameters assessed at Duke laboratory included total cholesterol, triglycerides, LDL, HDL, HbA1C, Hct, creatinine, PT, PTT, WBC, phosphorus, sodium, potassium, chloride, calcium, and carbon dioxide.


Clinical laboratory parameters include sodium, potassium, chloride, calcium, magnesium, phosphorus, BUN, creatinine, protein total, CK total, CK-MB, LDH, γ-GT, ASAT, ALAT, alkaline phosphatase, bilirubin total, bilirubin conjugated, C-reactive protein, cholesterol, triglicerides, HCL, albumin, osmolarity, LDL calc, Ca calc, troponin I, TSH, Homocysteine, HIV1-2, HCV3, HBsAg, and HBclg.


Severity of coronary artery disease (CAD-Index) is scored according to following cross-reference table:

TABLE 1Coding of the CAD-Index, vd stands for significantlyobstructed vesselsCAD-IndexCriteria0No CAD >=50%191 vd (50-74%)23>1 vd (50-74%)231 vd (>=75%)321 vd severe (>=95%)372 vd422 vd, 2 severe481 vd, severe proximal LAD482 vd, severe LAD562 vd, severe proximal LAD563 vd633 vd, >=1 severe673 vd, proximal LAD disease743 vd, severe proximal LAD82Left main (75%)100Left main severe


Blood Sampling


One aliquot of 100 mL of blood is collected from male disease patients, and 200 ml (2×100 mL aliquots) of blood is collected from male control subjects, in 20 mL tubes containing citrate, phosphate and dextrose. Females have 2×10 mL tubes of blood drawn. The fall in temperature to ambient is accelerated by placing the tubes in a water bath at 25° C., from the beginning of the collection process. After the end of the collection, the tubes are cooled, allowing the temperature progressively lowered to 20-22° C. in one hour. White blood cell and platelet reduction (less than 1×106/unit) is systematically performed by filtration at room temperature, by passage of the blood on standard gravitation filters. The bags are centrifuged in a standard fashion.


Plasma is separated from red cells using a press and collected in separate bags. Protease inhibitors are added. The tubes are frozen to −70° C., and stored prior to bulk shipment on dry ice.


Microarray Analysis


The blood samples (2.5 mL) are collected directly into PAXgene™ Blood RNA tubes containing a proprietary blend of reagents that bring about immediate stabilization of RNA (PreAnalytiX, Qiagen). The RNA is then isolated using silica-gel-membrane technology supplied in the PAXgene Blood RNA Kit. The resulting RNA accurately represents the expression profile in vivo and is suitable for use in a range of downstream applications. The RNA isolation begins with a centrifugation step to pellet nucleic acids in the PAXgene Blood RNA Tube. The pellet is washed, and Proteinase K is added to digest proteins. Alcohol is added to adjust binding conditions, and the sample is applied to a PAXgene RNA spin column. During a brief centrifugation, RNA is selectively bound to the PAXgene silica-gel membrane as contaminants pass through. Following washing steps, RNA is eluted in an optimized buffer. Total RNA is quantified by the absorbance at λ=260 nm (A260 nm) and the purity is estimated by the ratio A260 nm/A280 nm. Integrity of the RNA molecules is confirmed by non-denaturing agarose gel electrophoresis. RNA was stored at approximately −80° C. until analysis.


DNA Micro-Array Experiment


All GeneChip experiments are conducted in the Genomics Factory EU, as recommended by the manufacturer of the GeneChip system (Affymetrix, Santa Clara, Calif.; Expression analysis technical manual: put into references). Genome U133A expression probe array set (Affymetrix, Inc., San Diego, Calif., USA) is used.


GeneChip Experiment


Double stranded cDNA is synthesized with a starting amount of approximately 1 to 10 μg full-length total RNA using the Superscript Choice System (Invitrogen Life Technologies) in the presence of a T7-(dT) 24 DNA oligonucleotide primer. Following synthesis, the cDNA is cDNA purified on an affinity resin (QiaQuick, Qiagen). The purified cDNA is then transcribed in vitro using the BioArray® High Yield RNA Transcript Labeling Kit (ENZO) in the presence of biotinylated ribonucleotides form biotin labeled cRNA. The labeled cRNA is then purified on an affinity resin (RNeasy, Qiagen), quantified and fragmented. An amount of exactly 10 μg labeled cRNA is hybridized for approximately 16 hours at 45° C. to an expression probe array. The array is then washed and stained twice with streptavidin-phycoerythrin (Molecular Probes) using the GeneChip Fluidics Workstation 400 (Affymetrix). The array is then scanned twice using a confocal laser scanner (GeneArray Scanner, Agilent) resulting in one scanned image, “dat-file”.


Clinical Data


Clinical data are entered into a Clintrial database by DCRI, and data verification is done by double data entry. After data entry completion and lock of the database data are sent to the sponsor in an encrypted SAS format. At the sponsors site, clinical data are loaded into an MSAccess database. Descriptive statistical parameters are calculated for each of the six sub-populations; i.e. pooled male cases and controls, other male cases and controls, and female cases and controls. Descriptive statistics include sample size, frequencies, arithmetic mean, standard deviation, median, minimum and maximum values. Individual clinical data from in total 241 subjects: 121 diseased subjects and 120 controls. The pooled males cohort contains 53 diseased subjects and 53 controls, the other males cohort contain 44 diseased subjects and 38 controls, and the females cohort contains 24 diseased subjects and 29 controls.


Gene Expression Data


This resulting “.dat-file” is processed using the MAS5 program (Affymetrix) into a “.celfile”. The “.cel file” is captured and loaded into the Affymetrix GeneChip Laboratory Information Management System (LIMS). The LIMS database is connected to a UNIX Sun Solaris server through a network filing system that allows for the average intensities for all probes cells (CEL file) to be downloaded into an Oracle database (NPGN). Raw data is converted to expression levels using a “target intensity” of 150. The numerical values displayed are weighted averages of the signal intensities of the probe-pairs comprised in a probe-set for a given transcript sequence (AvgDiff value). Individual gene expression data are obtained from 222 out of 241 patients.


PLS by Partial Least Square


PLS stands for Projections to Latent Structures by means of partial least squares. PLS finds the linear or polynomial relationship between a matrix Y containing the dependent response variables and a matrix X containing the predictor variables. PLS modeling consists of simultaneous projections of both the X and Y spaces on low dimensional hyper planes. The coordinates of the points on these hyper planes constitute the elements of the matrices T and U which are referred to as t-scores and u-scores in the graphical representation of the PLS results below. The PLS analysis has the objectives to well approximate the X and Y spaces and to maximize the correlation between X and Y; i.e. very similar objectives as the canonical correlation analysis. Regarding the X (predictors) and Y (dependent variables) matrices, the objective of orthogonal signal correction —OSC—is to remove all information from X that is unrelated (orthogonal) to Y which does not contribute information of interest. SIMCA-P Version 10.0 (Umetrics, Sweden) was used for predictive modeling by PLS.


Data Integration


Clinical data, gene expression data are linked by a bar-code, that uniquely identified a subject across all data sets


Study Population


The study population consisted of three cohorts. In the first cohort—referred to as “pooled males”—were 53 male controls and 53 male cases. These individuals were matched according to their age and race. The second cohort termed “other males” contained 38 male controls and 44 male cases and the third cohort termed “females” was made of 29 female controls and 24 cases. In total, 241 individuals participated in this clinical investigation with 120 controls and 121 cases. 188 out of 241 individuals were male and 53 female.


Demographic Information


The average age of the study population was between 51.1 and 57.6 years with the smallest difference between cases and controls in the “pooled males” cohort. Body weight of males was in the range of 93.9-100.6 kg and 83.9-89.5 kg for females. In general, controls had a higher body weight than the cases. Frequency of smoking was 29/53 in the pooled male cases and 26/53 in pooled male controls, 34/44 in other male cases and 20/38 in other male controls, and 13/24 female cases and 9/29 female controls indicating that there were more smokers in the cases compared to controls. Systolic and diastolic blood pressure of the individuals was on average in the normal range. However, there is a tendency for higher blood pressure in the controls compared to cases. By far most of the study population belonged to the Caucasian race, followed by Afro-Americans and a few Asians and Hispanics.


Medical History


Angina pectoris was the most prominent clinical event in each of the cohorts with no obvious difference between cases and controls. However, history of hypertension was more reported by the male cases compared to controls. There was no difference in the female cohort.


Myocardial infarction was significantly higher in the cases of all cohort as well as coronary artery by-pass graft (CABG) and peripheral coronary intervention (PCI). A few of the cases reported coronary heart failure (CHF), but none of the controls. Very few of the cases suffered from peripheral or cerebral vascular disease, but none of the control. Diabetes mellitus was not reported and renal insufficiency in two subjects only.


Medication


Most of the patients took at least one medication (NSAIDs) is the class of medication that is taken by most of the subjects and there is no difference between cases and controls or between the three cohorts. Blood pressure lowering agents such as beta-blockers, ACE-inhibitors, calcium channel blockers, diuretics, and angiotensin receptor blockers were in general more frequently taken by the cases compared to the controls. The same holds true for statins. Less frequent medication included amioderone, fibrate, digoxin, nitrate, niacin, and anticoagulants which were taken more often by cases than controls. Some of the females took hormone replacement and only a few patients took centrally acting medication.


Severity of Coronary Artery Disease—CAD-Index


Amongst the cases, however, there was a wide distribution; 52% of the cases had CAD-Indices below 42; 79% of the cases had CAD-Indices below 56; and the remaining 21% of cases had CAD-Indices between 63 and 100%. Most of the controls are found in the CAD-Index range between 15 and 22 but all CAD-values below 23 were by default set to zero since CAD-Indices of zero have a tremendously high leverage effect on all types of regression analyses.

TABLE 2Study populationCohortSample SizePooled Male53 controls53 casesOther Males38 controls44 casesFemales29 controls24 casesTotal241120 controls, 121 cases188 males, 53 females









TABLE 3










Mean demographic data











Pooled Males
Other Males
Females














Cases
Controls
Cases
Controls
Cases
Controls

















Age (yrs)
52.7
52.3
57.6
51.1
54.2
51.9


Weight (kg)
93.9
96.8
94.3
100.6
83.9
89.5


Smoking
29
26
34
20
13
9


SBP (mmHg)
134
139
142
145
138
145


DBP (mmHg)
76
80
82
82
70
77


Race


Caucasian
50
50
39
23
18
21


Afro-american
3
3
4
13
4
5


Native


1


2


american


Asian



1
1


Hispanic



1

1


Others




1
















TABLE 4










Summary of Medical History (frequencies)











Pooled Males
Other Males
Females














Cases
Controls
Cases
Controls
Cases
Controls

















Angina
48
47
42
35
23
25


Pectoris


Hypertension
27
16
27
15
12
13


Myocardial.
29
0
21
2
10
1


Infarction


CABG
15
0
15
0
7
0


PCI
15
0
13
0
4
0


CHF
6
0
4
0
2
0


Peripheral
2
0
5
0
3
0


Vascular


Disease


Cerebral
1
0
4
0
3
0


Vascular


Disease


Diabetes
0
0
0
0
0
0


mellitus


Renal
0
1
1
0
0
0


Insufficiency
















TABLE 5










Summary of Medication (frequencies)











Pooled Males
Other Males
Females














Cases
Controls
Cases
Controls
Cases
Controls

















NSAIDs
47
18
42
10
21
6


Beta-Blocker
43
13
37
6
20
6


ACE Inhibitor
40
5
26
7
13
4


Calcium
10
4
9
5
4
3


Blocker


Diuretic
8
6
12
5
7
7


Angiotensin
0
3
5
2
5
0


RB


Statins
38
10
29
4
15
5


Amiodarone
1
0
0
0
0
0


Fibrate
4
2
8
0
1
0


Digoxin
2
0
0
0
1
0


Nitrate
12
0
6
1
5
0


Niancin
3
0
1
0
1
0


Antiplatelet
8
1
1
0
4
1


Hormon
0
0
1
0
9
5


Replacemt


Central Acting
1
0
1
1
0
0










Correlation of Gene Expression and Laboratory Values Inc. CAD-Index


A univariate correlation analysis by parametric and non-parametric methods was applied to all laboratory values and all genes. The CAD-Index was included in the set of laboratory parameters. The gene filtering of all subsequent univariate and multivariate methods was based on this correlation analysis; i.e. only genes were included in the analysis that exhibited an absolute correlation coefficient with CAD Index greater an arbitrary level, abs(rho)>0.2 for the partial least squares analysis.


Partial Least Squares (PLS) projection to latent structures is an alternative modeling approach to modeling multivariate response data. Due to its optimized algorithm it is able to cope with short and fat matrices; i.e. the number of variables can be much greater than the number of observations.


160 genes correlated with CAD-Index with absolute correlation coefficients of rho >0.2 (Table 6); these genes were included in the PLS analysis. Before performing the LPS the gene expression data were subjected to Orthogonal Signal Correction (OSC) with CAD-Index as the only response variable.

TABLE 6Means and p values of each of the 160 genesworst/bestall(n = 15 per group)(n = 221)SigLogt-testAccession#Unigene#ProbesetGeneVIPCorrRhoRatiop-valueNM_000146.2Hs.433670212788_x_atFTL10.630.240.241.180.008NM_012181.2Hs.17346440850_atFKBP83.690.220.251.390.0795NM_006082.1Hs.446608212639_x_atK-ALPHA-12.920.220.281.220.0196NM_020376.2Hs.11846339854_r_atTTS-2.22.490.250.241.450.0192NM_025241.1Hs.435255220757_s_atUBXD12.430.250.211.510.0155NM_016496.3Hs.331308210075_atLOC512571.20.220.241.440.0023NM_014216.3Hs.408429210740_s_atITPK10.990.290.31.250.0021NM_032409.1Hs.439600209018_s_atPINK10.880.220.221.470.001NM_007219.2Hs.30524210706_s_atRNF240.820.220.281.120.0946NM_006690.2Hs.21258178047_s_atMMP240.820.230.21.290.0125NM_006949.1Hs.379204209367_atSTXBP20.780.220.251.190.0375NM_002957.3Hs.20084202426_s_atRXRA0.740.250.241.340.012NM_174930.2Hs.397073179_atPMS2L50.680.260.241.280.0243NM_022107.1Hs.288316214847_s_atGPSM30.680.250.271.280.0052NM_004364.2Hs.76171204039_atCEBPA0.570.30.351.30.0279Hs.32371248612_atN4BP10.560.210.211.190.0149NM_012295.2Hs.435798202624_s_atCABIN10.510.220.211.280.009NM_001119.3,Hs.183706214726_x_atADD10.480.230.211.260.0208NM_001119.3,NM_176801.1,NM_014190.2NM_021933.1Hs.859548659_atFLJ124380.460.20.221.150.076NM_003023.2Hs.167679209370_s_atSH3BP20.430.210.231.210.0156NM_001250.3,Hs.50481635150_atTNFRSF50.430.230.221.190.0592NM_001250.3NM_015444.1Hs.35861213338_atRIS10.420.220.283.130.0007NM_002659.1Hs.179657211924_s_atPLAUR0.40.220.21.360.0052NM_002308.2,Hs.81337203236_s_atLGALS90.390.20.251.340.0087NM_002308.2NM_002319.2Hs.125742204692_atLRRN40.380.230.261.20.0523NM_033200.1Hs.15054031837_atBC0029420.380.220.281.150.0994NM_001487.1Hs.94672202592_atGCN5L10.370.210.231.110.0955NM_006122.1,Hs.116459202032_s_atMAN2A20.360.240.281.410.0001NM_006122.1NM_005474.3,Hs.9028202455_atHDAC50.350.220.281.090.2789NM_005474.3NM_004317.1Hs.165439202024_atASNA10.30.220.211.240.0049NM_000201.1Hs.386467202637_s_atICAM10.30.20.211.260.02NM_014223.2Hs.285133202215_s_atNFYC0.260.230.31.10.1915NM_002504.3,Hs.413074202585_s_atNFX10.190.280.261.290.0351NM_002504.3,NM_147133.1NM_000430.2Hs.77318200815_s_atPAFAH1B10.170.230.271.290.009NM_024408.2Hs.502564202445_s_atNOTCH20.160.240.21.680.0133NM_003714.1Hs.155223203439_s_atSTC20.140.20.241.310.1213NM_004229.2Hs.407604202612_s_atCRSP20.130.260.242.340.0118NM_006379.2Hs.171921203788_s_atSEMA3C0.110.210.281.170.244NM_003107.2Hs.357901201416_atSOX40.10.210.21.740.018NM_003463.2Hs.227777200730_s_atPTP4A10.090.230.271.320.0552NM_021016.2Hs.438687203399_x_atPSG30.080.220.221.380.3666NM_005334.1Hs.83634202473_x_atHCFC10.030.220.231.240.1219NM_006411.2,Hs.33213832836_atAGPAT10.350.250.291.150.0444NM_006411.2NM_000086.1Hs.194660209275_s_atCLN30.330.20.271.120.2928222302_at0.320.230.221.530.0319NM_004140.2Hs.95659206123_atLLGL10.30.230.221.850.0032NM_007283.4Hs.409826211026_s_atMGLL0.30.230.271.280.0076NM_001666.2Hs.3109204425_atARHGAP40.290.210.231.090.4942NM_014001.2,Hs.87726211815_s_atGGA30.280.260.261.260.0536NM_014001.2NM_018310.2Hs.17270218955_atBRF20.280.230.231.380.0001NM_016531.2Hs.145754219657_s_atKLF30.280.250.211.380NM_018986.2Hs.61053219256_s_atSH3TC10.270.290.281.310.006NM_024009.1Hs.488738215243_s_atGJB30.260.20.241.250.0536NM_000508.2,Hs.351593205650_s_atFGA0.250.210.261.640.024NM_000508.2NM_000964.1Hs.361071211605_s_atRARA0.250.220.261.430.0187NM_003805.2Hs.155566209833_atCRADD0.240.260.251.420.0038NM_012241.2,Hs.282331221010_s_atSIRT50.230.220.211.780.0258NM_012241.2NM_017883.3Hs.12142222138_s_atWDR130.230.210.241.320.0224NM_001625.2,Hs.294008212174_atAK20.220.270.281.10.3327NM_001625.2,NM_172199.1NM_000136.1Hs.253236205189_s_atFANCC0.210.210.231.390.0115NM_005955.1Hs.211581205323_s_atMTF10.210.210.251.090.2811NM_006693.1Hs.434994206688_s_atCPSF40.210.210.221.120.1921NM_004245.1,Hs.129719207911_s_atTGM50.210.220.241.520.1156NM_004245.1NM_014922.3,Hs.104305211822_s_atNALP10.210.210.261.20.052NM_014922.3,NM_021621,NM_021730,NM_033004.2,NM_033005,NM_033006.2XM_093895.6Hs.411317212960_atKIAA08820.210.240.221.580.0032NM_002652.1Hs.99949206509_atPIP0.20.270.232.230.0016NM_002049.2Hs.765210446_atGATA10.20.230.251.690.0517NM_005203.3,Hs.211933211809_x_atCOL13A10.20.240.252.230.0017NM_005203.3,NM_080804.2,NM_080813.2,NM_080812.2,NM_080811.2,NM_080810.2,NM_080809.2,NM_080808.2,NM_080807.2,NM_080806.2,NM_080805.2,NM_080815.2,NM_080814.2,NM_080803.2,NM_080798.2,NM_080800.2,NM_080801.2,NM_080802.2NM_006244.2Hs.75199635_s_atPPP2R5B0.190.210.21.450.0993NM_002588.2,Hs.283794205717_x_atPCDHGC30.180.240.21.540.1409NM_002588.2,NM_032402.1NM_024608.1Hs.512732219396_s_atNEIL10.180.230.311.220.1956NM_015478.4,Hs.300863206822_s_atL3MBTL0.170.330.322.440.005NM_015478.4NM_004711.3,Hs.414343213854_atSYNGR10.170.230.221.940.003NM_004711.3,NM_145731.2NM_002586.3Hs.93728211097_s_atPBX20.160.20.251.240.3167NM_025188.1Hs.301526219923_atTRIM450.160.260.231.490.0017NM_004055.3Hs.248153205166_atCAPN50.150.210.251.490.0165NM_002186.2,Hs.406228208164_s_atIL9R0.150.240.242.670.0017NM_002186.2NM_003547.2Hs.519634208551_atHIST1H4G0.150.240.271.450.2141NM_004137.2Hs.93841209948_atKCNMB10.150.20.221.260.0723NM_032336.1Hs.333166211767_atMGC147990.150.30.271.530.1624NM_015277.2Hs.249798212445_s_atNEDD4L0.150.210.221.320.0629NM_002735.1Hs.478057212559_atPRKAR1B0.150.250.251.740.0163NM_001171.2Hs.442182214033_atABCC60.150.230.261.520.0505215906_at0.150.210.282.130.0052NM_006093.2Hs.251386220811_atPRG30.150.210.242.290.0121NM_001218.2,Hs.279916203963_atCA120.140.220.262.950NM_001218.2NM_000228.1Hs.436983209270_atLAMB30.140.260.252.580.0007NM_173834.2Hs.82719212340_atMGC214160.140.220.271.380.0379NM_006034.2Hs.385634214667_s_atTP53I110.140.270.312.010.0094215971_at0.140.220.211.530.1513NM_004476.1Hs.1915217487_x_atFOLH10.140.220.221.550.0358NM_017715.1,Hs.435302219605_atZNF30.140.260.221.880.0096NM_017715.1NM_024735.2Hs.371923219784_atMGC154190.140.220.261.370.1724NM_025117.1Hs.288727220915_s_atFLJ118710.140.270.242.340.0155NM_006365.1Hs.380027222301_atCROC40.140.20.221.270.1667NM_003914.2Hs.417050205899_atCCNA10.130.210.21.160.5073NM_002640.3,Hs.368077206034_atSERPINB80.130.210.231.440.0271NM_002640.3NM_005622.2,Hs.512678210377_atSAH0.130.260.31.840.0035NM_005622.2NM_007033.2Hs.40500213114_atRER10.130.220.261.80.0476NM_015896.2Hs.167380216663_s_atZMYND100.130.220.252.190.0109NM_024893.1Hs.233634219310_atC20orf390.130.230.212.160.0048NM_016931.2Hs.371036219773_atNOX40.130.260.232.010.1031NM_003299.1Hs.192374216450_x_atTRA10.120.20.21.220.4391217212_s_at0.120.210.21.390.3622NM_016158.1Hs.104671220752_atLOC511450.120.270.231.60.1176NM_006862.2Hs.144439221053_s_atTDRKH0.120.240.261.560.1105NM_014393.1Hs.511992204226_atSTAU20.110.210.261.280.024NM_000240.2Hs.183109204388_s_atMAOA0.110.230.281.330.3663NM_001941.2,Hs.41690206032_atDSC30.110.240.21.520.1093NM_001941.2Hs.134816206507_atZNF3050.110.240.211.460.1793NM_017721.2Hs.269592207083_s_atFLJ202410.110.290.243.190.0002XM_372810.1Hs.300622207290_atPLXNA20.110.220.211.810.0418NM_001623.3,Hs.76364207823_s_atAIF10.110.230.211.730.0957NM_001623.3,NM_004847.2NM_023037.1Hs.390874214319_at13CDNA730.110.250.242.380.0004215763_at0.110.270.261.810.1446NM_020374.2Hs.296198218374_s_atC12orf40.110.250.241.30.0548NM_016260.1Hs.278963220567_atZNFN1A20.110.210.232.120.0201221137_at0.110.240.221.120.6265NM_001503.2,Hs.512001206265_s_atGPLD10.10.290.211.530.3202NM_001503.2NM_000818.1Hs.231829216651_s_atGAD20.10.220.212.070.0085NM_018286.1Hs.173233219230_atFLJ109700.10.250.212.230.0213NM_014332.1Hs.86492219772_s_atSMPX0.10.250.22.240.0193NM_005554.2Hs.367762209125_atKRT6A0.090.240.281.740.0848NM_000268.2,Hs.902211092_s_atNF20.090.260.231.70.0358NM_000268.2,NM_016418.4,NM_181825.1,NM_181826.1,NM_181827.1,NM_181828.1,NM_181829.1,NM_181830.1,NM_181831.1,NM_181832.1,NM_181833.1,NM_181834.1NM_021189.2Hs.365689213948_x_atIGSF4B0.090.220.251.420.2795NM_022112.1Hs.160953220403_s_atP53AIP10.090.220.252.480.01NM_018423.1Hs.24979221696_s_atDKFZp761P10100.090.210.221.620.0547NM_005903.4Hs.167700205187_atMADH50.080.250.221.960.0856NM_000745.2Hs.1614206533_atCHRNA50.080.20.211.460.1336NM_001980.2,Hs.99865207346_atEPIM0.080.230.221.710.0514NM_001980.2NM_002399.2,Hs.362805207480_s_atMEIS20.080.20.21.20.4323NM_002399.2,NM_020149.2,NM_170674.2,NM_170675.2,NM_170676.2,NM_170677.2,NM_172315.1NM_004432.1Hs.166109208427_s_atELAVL20.080.220.2530.0028NM_006203.2Hs.28482210837_s_atPDE4D0.080.220.221.510.1544NM_001791.2,Hs.355832214230_atCDC420.080.20.231.840.0408NM_001791.2NM_031456.2Hs.158313215999_atC17orf1A0.080.210.232.010.0104NM_018144.2Hs.368481219499_atSEC61A20.080.210.21.490.1369NM_000908.1Hs.237028219789_atNPR30.080.210.211.580.1612NM_002429.2,Hs.154057204575_s_atMMP190.070.20.222.040.0192NM_002429.2,NM_022790.1,NM_022791.1NM_002849.2,Hs.198288206084_atPTPRR0.070.240.231.730.1892NM_002849.2NM_000811.1Hs.90791207182_atGABRA60.070.210.232.120.0372NM_015594.1Hs.241421208008_atDKFZP434O0470.070.20.251.530.1304XM_371887.1,Hs.134792212475_atKIAA02410.070.210.281.680.0231XM_371887.1NM_015341.2Hs.308045212949_atBRRN10.070.250.321.370.0983NM_002026.1,Hs.418138214702_atFN10.070.230.211.780.2234NM_002026.1NM_004993.2,Hs.419756216657_atMJD0.070.20.21.410.2134NM_004993.2NM_019086.2Hs.333157220137_atFLJ206740.070.220.241.50.1801NM_018165.2,Hs.173220221212_x_atPB10.070.260.253.580.0075NM_018165.2,NM_018313.2,NM_181041.1NM_002845.2Hs.154151207487_atPTPRM0.060.240.241.880.0884NM_000668.3Hs.4209613_s_atADH1B0.060.210.222.330.0342NM_001797.2,Hs.443435207173_x_atCDH110.050.20.242.430.0033NM_001797.2NM_003628.2Hs.277132214874_atPKP40.050.210.221.360.3519NM_018342.2Hs.176227219750_atFLJ111550.050.210.232.070.0622NM_000586.2Hs.89679207849_atIL20.040.20.221.570.2808NM_025210.1Hs.127689207377_atI-40.030.220.211.830.009NM_000411.3Hs.371350207833_s_atHLCS0.030.20.211.270.2197NM_018052.3Hs.445061216501_atFLJ103050.030.210.251.620.0438Hs.512631220787_atPRO25330.030.210.231.990.0796222308_x_at0.030.220.241.320.4963NM_004789.3Hs.1569206140_atLHX20.020.230.211.630.0702NM_004442.4,Hs.125124211165_x_atEPHB20.020.210.221.420.0743NM_004442.4


Second degree polynomial regression analysis of the resulting t-scores versus CAD-Index resulted in the prediction model including 95% confidence range of the regression and the 95% prediction interval with r2=0.764 (p<0.001). The rather wide prediction interval is due to both, variability in CAD-Index assessment which is only semi-quantitative and variability in gene expression.


In order to test for stability of the model, the PLS analysis was performed separately for each of the three cohorts; i.e. “Pooled Males”, “Other Males”, and “Females”. While the controls remain quite stable in the range of −2 standard deviations, the t1-scores of the cases were located mainly in the +2 standard deviation range and increase with increasing CAD-Index. This relationship is present in each cohort with the lowest variation in the “Other Males”.


Variable Importance in the Projection (VIP) of each gene for the separate PLS analyses of the three different cohorts compared to the PLS analysis including all subjects are shown in text FIG. 5. Displayed are the 24 genes with the highest VIP. The curve shows a steep decrease for the first 8 genes; thereafter, the decrease becomes rather flat and is almost linear. Apart from these eight genes all other genes contribute only marginally to projection. The VIP of the first 24 genes shows only little variation between the different cohorts pointing to a rather high stability of the prediction model.


The identification of the eight highly predictive genes that are able to predict the CAD-Index is given in Table 7.

TABLE 7Identification of the 8 genes that contribute mostly to the prediction model (Model 1)FoldProbesetSymbolSEQ ID NO.Gene Namechange212788_x_atFerritinSEQ ID NO. 1ESTs, Highly similar to1.2FRIL_HUMAN Ferritin light chain(Ferritin L subunit) [H. sapiens], ironion homeostasis40850_atFKBP8SEQ ID NO. 2FK506 binding protein 8, 38 kDa1.4212639_x_atTUBA3SEQ ID NO. 3tubulin, alpha, ubiquitous1.239854_r_atTTS-2.2SEQ ID NO. 4transport-secretion protein 2.21.5220757_s_atUBXD1SEQ ID NO. 5UBX domain-containing 11.5210075_atLOC51257SEQ ID NO. 6Hypothetical protein LOC512571.4210740_s_aITRPK1SEQ ID NO. 7Inositol 1,3,4-triphosphate 5/61.3kinase209018_s_atPINK1SEQ ID NO. 8PTEN induced putative kinase 11.5


A final PLS analysis involving the eight most predictive genes was applied to all 222 subjects involved in this study. The t-scores of the eight genes are able to predict the CAD-Index very accurately. Taking into account that the CAD-Index is just a semi-quantitative estimate of stenosis which naturally implies variation across subjects even with the same extent of stenosis, the prediction based on expression pattern of the eight most predictive genes is convincing.


The differential expression pattern of the 8 predictor genes (Table 7) is confirmed be RT-PCR. The following primers were used for RT-PCR for these genes:

TABLE 8PrimersPrimer NameSEQ.ID No.Probe Sequence (5′-3′)212788_x_atSEQ ID. No. 1TCTGGAAGGCGTGAGCCACTTCTTCSEQ ID. No. 1GCTACGAGCGTCTCCTGAAGATGCASEQ ID. No. 1AAACCCCAGACGCCATGAAAGCTGCSEQ ID. No. 1TGAAAGCTGCCATGGCCCTGGAGAASEQ ID. No. 1CTCTGTGACTTCCTGGAGACTCACTSEQ ID. No. 1GGCTGGGCGAGTATCTCTTCGAAAGSEQ ID. No. 1TCGAAAGGCTCACTCTCAAGCACGASEQ ID. No. 1AGCACGACTAAGAGCCTTCTGAGCCSEQ ID. No. 1GAGCCCAGCGACTTCTGAAGGGCCCSEQ ID. No. 1TCCCTCCAGCCAATAGGCAGCTTTCSEQ ID. No. 1GCAGCTTTCTTAACTATCCTAACAA40850_atSEQ ID No. 2AGACCGCCTTGTACCGGAAAATGCTSEQ ID No. 2GCAAGGGTGCCTGGTCCATCCCATGSEQ ID No. 2GTGCCTGGTCCATCCCATGGAAGTGSEQ ID No. 2CCATCCCATGGAAGTGGCTGTTTGGSEQ ID No. 2TGTTTGGGGCGACTGCTGTTGCCTTSEQ ID No. 2ACTGAGGCCCTCTAGGAGGAAAGCCSEQ ID No. 2CTGAGGCCCTCTAGGAGGAAAGCCCSEQ ID No. 2GAGGCCCTCTAGGAGGAAAGCCCAGSEQ ID No. 2AGGCCCTCTAGGAGGAAAGCCCAGASEQ ID No. 2GGCCCTCTAGGAGGAAAGCCCAGAGSEQ ID No. 2GCCCTCTAGGAGGAAAGCCCAGAGGSEQ ID No. 2CCTCTAGGAGGAAAGCCCAGAGGGASEQ ID No. 2TAGGTCTCCGCCAGGGCTGGCCTCASEQ ID No. 2AGGGCTGGCCTCAGTTTCTCCTCAASEQ ID No. 2GGCTGGCCTCAGTTTCTCCTCAACASEQ ID No. 2AGTTTCTCCTCAACAGGCCTGGGGG212639_x_atSEQ ID No. 3GATCACCAATGCTTGCTTTGAGCCASEQ ID No. 3GCTTTGAGCCAGCCAACCAGATGGTSEQ ID No. 3AAATGTGACCCTCGCCATGGTAAATSEQ ID No. 3CCGTGGTGACGTGGTTCCCAAAGATSEQ ID No. 3ATGTCAATGCTGCCATTGCCACCATSEQ ID No. 3AGTTTGTGGATTGGTGCCCCACTGGSEQ ID No. 3CTCCCACTGTGGTGCCTGGTGGAGASEQ ID No. 3GAGAGCTGTGTGCATGCTGAGCAACSEQ ID No. 3GCCTTTGTTCACTGGTACGTGGGTGSEQ ID No. 3GGCCCGTGAAGATATGGCTGCCCTTSEQ ID No. 3CTAATTATCCATTCCTTTTGGCCCT39854_r_atSEQ ID No. 4ATGCGCAACAACCTCTCGCTGGGGGSEQ ID No. 4CCGAAGCTCTGCGCATGCGCGCACCSEQ ID No. 4CCCCGCGGACCCAGCATCCCCGCAGSEQ ID No. 4CCGCGGACCCAGCATCCCCGCAGCASEQ ID No. 4CCTGCTCCCGAGGCCCGGCCCGTGASEQ ID No. 4GGAACCCTGCCTGAGACGCCTCCATSEQ ID No. 4GAGACGCCTCCATTACCACTGCGCASEQ ID No. 4ACGCCTCCATTACCACTGCGCAGTGSEQ ID No. 4CCACTGCGCAGTGAGATGAGGGGACSEQ ID No. 4AGGGGACTCACAGTTGCCAAGAGGGSEQ ID No. 4ACTCACAGTTGCCAAGAGGGGTCTTSEQ ID No. 4CCTCCCCTGGGCCGCTGAGGCCCCGSEQ ID No. 4GTGCTGCCCGAGCACCTCCCCCGCCSEQ ID No. 4GAACTTTGCAGCTGCCCTTCCCTCCSEQ ID No. 4TTTGCAGCTGCCCTTCCCTCCCCGTSEQ ID No. 4AGAATTATTTATTTTCGCCAAAGCA220757_s_atSEQ ID No. 5GGGCTGCGCAAGTACAACTACACGCSEQ ID No. 5GCACTTTCTACGCTCGGGAGCGGCTSEQ ID No. 5TGCAGAGCGACTGGCTGCCTTTTGASEQ ID No. 5GGCCTCGGGAGGGCAGAAGCTGTCCSEQ ID No. 5TGTCCGAGGACGAGAACCTGGCCTTSEQ ID No. 5ACCTGGCCTTGAACGAGTGCGGGCTSEQ ID No. 5AGCTCCTGTCAGCCATCGAGAAGCTSEQ ID No. 5AAAAGCAGGGTTGGCCTCAGCCCTGSEQ ID No. 5ACCTCTGGAAATACTTGGCTCTGCCSEQ ID No. 5GCCCCATGGGCACGGGAGGGGCGCCSEQ ID No. 5AGCCGTGGAGCTGTGGAATTGGGCC210075_atSEQ ID No. 6CAGTATGAATGCTGGGCTCTCCGGASEQ ID No. 6AGAGGTAGCTGGTGATACCCTGTCCSEQ ID No. 6GGAAGGACTTCCACTTCAACACTTCSEQ ID No. 6GCACGGCCTGAACGCTTCTTAGGCCSEQ ID No. 6TTAGGCCAAGAGACACCATGCGGAGSEQ ID No. 6CATGCGGAGCCTAGTCTGTGATCCTSEQ ID No. 6GACATGGTCCTGAGCTCTGGACGGASEQ ID No. 6TGTGGCCGGTGTATCAAGGGCGCCCSEQ ID No. 6TTCCAGCAAGCTTCTTGCGCTTCTCSEQ ID No. 6CTGGCACCCTCGACTTTATATAAAASEQ ID No. 6TGCACTGCGTTTCAAAAACCCACCC209018_s_atSEQ ID No. 7GGCGGAAACGGCTGTCTGATGGCCCSEQ ID No. 7GGCTGATGCCTGGGCAGTGGGAGCCSEQ ID No. 7GAGCCATCGCCTATGAAATCTTCGGSEQ ID No. 7AGCCGCAGCTACCAAGAGGCTCAGCSEQ ID No. 7CTACCTGCACTGCCCGAGTCAGTGCSEQ ID No. 7TCAGTGCCTCCAGACGTGAGACAGTSEQ ID No. 7TCTGCCCGAGTAGCCGCAAATGTGCSEQ ID No. 7AATGTGCTTCATCTAAGCCTCTGGGSEQ ID No. 7CCAACAATCGGCCGCCACTTTGTTGSEQ ID No. 7ATGCTCTTTCTGGCTAACCTGGAGTSEQ ID No. 7GATGTCCCTGCATGGAGCTGGTGAA210740_s_atSEQ ID No. 8CCCATCACCTTGGCAGCAAAGCACTSEQ ID No. 8TGCTGGGTGAGAGGCATCAGCCCCCSEQ ID No. 8ATCAGCCCCCACAAGTATGTTTTTGSEQ ID No. 8AAGTGCTGAGTGTCCCGAGAGAGGCSEQ ID No. 8CAGCTGGGCTGCAGGATGCCCACTTSEQ ID No. 8CCATCAGAACTGCCCGGCTTTTTTGSEQ ID No. 8ACTGAGGACCCAACAACTAACCACGSEQ ID No. 8CACGACTTGAGTTTTGAACCCCGATSEQ ID No. 8ATTAATGTCTGTACGTCACCTTTCCSEQ ID No. 8AACAGGAAAGCGTGGCTGGCCTCTTSEQ ID No. 8TCTTGCACTGCTTTGTCTCCAAAAT


In summary, it is investigated whether gene expression patterns in circulating leukocytes are associated with presence and extent of CAD. Patients undergoing coronary angiography were selected according to their Duke CAD index (CADi), a validated angiographical measure of the extent of coronary atherosclerosis that correlates with outcome. RNA was extracted from 120 patients with CAD (CADi>23) and from 121 partially matched controls without CAD (CADi=0). Gene expression was assessed using Affymetrix U1333A chips. Genes correlating with CAD were identified using a Spearman test, and predictive gene expression patterns were identified using a partial least squares (PLS) regression analysis.


160 individual genes were found to significantly correlate with CADi (rho>0.2, P<0.0027, n=222), although changes in individual gene expression were relatively small (1.2 to 1.5 fold). Using these 160 genes, the PLS multivariate regression model resulted in a highly predictive model (r2=0.764, P<0.001). Subsequent analysis showed that most of the predictive model was carried by only 8 genes (r2=0.752) (TABLE 7).


In conclusion, simultaneous expression pattern of eight genes is highly predictive for CAD. Peripheral leukocyte gene expression pattern is thus a non-invasive biomarker for CAD and leads to new pathophysiologic insights.


Example 2

Extension of the Modeling (Model 2)


The modelling procedure is repeated with 152 candidate genes, i.e. the 160 candidate genes of Table 6 minus the eight predictive genes of Table 7 of Example 1 by using exactly the PLS methods as for model 1 (Example 1). The CAD-index as predicted by gene expression pattern of 19 genes (Table 9) versus the actual CAD-index as assessed by the clinicians is displayed in FIG. 4. r2 of model 2 is only marginally less than that of model 1 (Example 1) (0.75 versus 0.72) and the 95% prediction confidence bands are comparable.

TABLE 9The 19 best predictor genes of model 2SymbolAccession #Gene NameMedian*rhoSEQ ID NOPMS2L5NM_174930.2postmeiotic4960.24SEQ ID NO. 9segregation increased2-like 5RXRANM_002957retinoid X receptor,4660.24SEQ ID NO. 10alphaGCN5L1NM_001487GCN5 general control5400.23SEQ ID NO. 11of amino-acidsynthesis 5-like 1(yeast)CABIN1NM_012295calcineurin binding4820.21SEQ ID NO. 12protein 1LGALS9NM_002308.2,lectin, galactoside-3910.25SEQ ID NO. 13NM_002308.2binding, soluble, 9(galectin 9)CEBPANM_004364CCAAT/enhancer3700.35SEQ ID NO. 14binding protein(C/EBP), alphaLRRN4NM_002319.2leucine rich repeat4140.26SEQ ID NO. 15neuronal 4STXBP2NM_006949.1syntaxin binding8130.25SEQ ID NO. 16protein 2SH3BP2NM_003023.2SH3-domain binding4790.23SEQ ID NO. 17protein 2RNF24NM_007219.2ring finger protein 249180.28SEQ ID NO. 18PLAURNM_002659.1plasminogen2750.20SEQ ID NO. 19activator, urokinasereceptorRIS1NM_015444.1Ras-induced840.28SEQ ID NO. 20senescence 1ADD1NM_001119.3,adducin 1 (alpha)4640.21SEQ ID NO. 21NM_001119.3,NM_176801.1,NM_014190.2GPSM3NM_022107.1G-protein signalling5510.27SEQ ID NO. 22modulator 3 (AGS3-like, C. elegans)BC002942NM_033200.1hypothetical protein3810.28SEQ ID NO. 23BC002942TNFRSF5NM_001250.3,tumor necrosis factor4080.22SEQ ID NO. 24NM_001250.3receptor superfamily,member 5N4BP1Hs.323712Nedd4 binding protein 18330.21SEQ ID NO. 25(Unigene #)FLJ12438NM_021933.1hypothetical protein5550.22SEQ ID NO. 26FLJ12438MMP24NM_006690.2matrix8200.20SEQ ID NO. 27metalloproteinase 24(membrane-inserted)
*median refers to the median intensity of the signal and rho is Spearman's rank correlation of the respective gene with CAD-Index.


Example 3

Extension of the Modeling (Model 3)


In a third modelling approach the remaining 133 genes, i.e. the 160 candidate genes (Table 6) minus the eight predictive genes (Table 7; Example 1) of model 1, minus the nineteen predictor genes (Table 9; Example 2) of model 2 are subjected to partial least square regression as described in Example 1. The result is depicted in FIG. 4 and the corresponding 15 best predictor genes are compiled in Table 10.

TABLE 10The 15 best predictor genes of model 3SymbolAccession #Gene NameMedian*rhoSEQ ID NOPTP4A1NM_003463.2protein tyrosine620.27SEQ ID NO. 28phosphatase typeIVA, member 1PAFAH1B1NM_000430.2platelet-activating2040.27SEQ ID NO. 29factoracetylhydrolase,isoform lb, alphasubunitSOX4NM_003107.2SRY (sex determining540.20SEQ ID NO. 30region Y)-box 4ASNA1NM_004317.1arsA arsenite3600.21SEQ ID NO. 31transporter, ATP-binding, homolog 1(bacterial)MAN2A2NM_006122.1,mannosidase, alpha,3580.28SEQ ID NO. 32NM_006122.1class 2A, member 2NFYCNM_014223.2nuclear transcription2380.30SEQ ID NO. 33factor Y, gammaNOTCH2NM_024408.2Notch homolog 2870.20SEQ ID NO. 34(Drosophila)HDAC5NM_005474.3,histone deacetylase 54000.28SEQ ID NO. 35NM_005474.3HCFC1NM_005334.1host cell factor C1150.23SEQ ID NO. 36(VP16-accessoryprotein)NFX1NM_002504.3,nuclear transcription820.26SEQ ID NO. 37NM_002504.3,factor, X-box binding 1NM_147133.1CRSP2NM_004229.2cofactor required for350.24SEQ ID NO. 38Sp1 transcriptionalactivation, subunit 2aICAM1NM_000201.1intercellular adhesion3070.21SEQ ID NO. 39molecule 1 (CD54),human rhinovirusreceptorPSG3NM_021016.2pregnancy specific170.22SEQ ID NO. 40beta-1-glycoprotein 3STC2NM_003714.1stanniocalcin 2800.24SEQ ID NO. 41SEMA3CNM_006379.2sema domain,710.28SEQ ID NO. 42immunoglobulindomain (Ig),(semaphorin) 3C
*median refers to the median intensity of the signal and rho is Spearman's rank correlation of the respective gene with CAD-Index


In conclusion, due to the high information content of the 160 candidate genes it is possible to generate three models that may predict the extent of coronary artery disease based on gene expression pattern. By applying the three models for example in parallel to new unknown samples the likelihood of correct prediction may increase dramatically. Furthermore, the robustness of the predictive model may be increased; i.e. if for some technical or biological reason the genes being involved in one model are of poor quality there are still two more predictive models that can be used for prediction.


Further information on the genes of Tables 7 and/or Table 9 and/or Table 10 such as Unigene #, Probeset # and/or Accession # is listed in Table 6.


Example 4

Proteomic Discovery of Coronary Artery Disease Proteins Using Industrial Scale Analysis of Pooled Plasma


Established are male populations of 53 patients with angiographic coronary artery disease (defined as at least one lesion with > or = to 50% stenosis) and 53 patients with no angiographic coronary disease from the Duke Databank for Cardiovascular Disease. For a description of the Duke Databank for Cardiovascular Disease, see Allen LaPointe, Nancy M. et al., Journal of the American College of Cardiology 41 (Suppl A):517A (2003). These patients were matched for age and race and extremes of risk factors and major plasma protein abnormalities were removed in prescreening. Plasma samples of each group were pooled to make large volumes (6 liters each) to identify low abundance proteins, in the picomolar range. After specific removal of albumin and immunoglobulins, and enrichment of smaller proteins (<20-40 kDa), samples were separated into 12,960 fractions by liquid chromatography, and analyzed by mass spectrometry (LC-ESI MS/MS and MALDI-TOF), before and after enzymatic digestion. See, Rose, Keith et al., Proteomics (2004) (DOI 10.1002/pmic.200300718).


731 plasma proteins or fragments were identified, including low abundance moieties such as leptin and bradykinin. 17 were well detected and strongly differentially displayed according to disease state. The proteins in categories are summarized in the following Table 11 with their parent accession number:

TABLE 11Accession #SEQ ID NO.Disease > ControlFibrinogen Gamma ChainP02679SEQ ID NO. 43Mature Form of Collagen alpha 3(VI)P12111SEQ ID NO. 44chainMature Form of Complement C1sP09871SEQ ID NO. 45(C1 esterase) CD59P13987SEQ ID NO. 46Insulin like growth factor bindingP35858SEQ ID NO. 47complex acid labile chain (ALS)Defensin 5Q01523SEQ ID NO. 48Proline Rich Acidic ProteinQ96NZ9SEQ ID NO. 49Emilin-3Q9H8L6SEQ ID NO. 50CA11 ProteinQ9NS71SEQ ID NO. 51Predominant in DiseaseC5A AnaphylotoxinP01031SEQ ID NO. 52Nonsecretory ribonuclease isoformP10153SEQ ID NO. 53(three moieties)Control > DiseaseGlutathione transferase omega 1P78417SEQ ID NO. 54Complement factor H-related protein 1Q03591SEQ ID NO. 55(FHR-1)Secreted phosphoprotein 24 (SPP-24)Q13103SEQ ID NO. 56Predominant in ControlsMature form of chitotriosidaseQ13231SEQ ID NO. 57


Additional information about peptide Q96NZ9 is provided in PCT patent applications WO 02/00690 and WO 02/08284 (PRO1195-SEQ ID 212). Additional information about peptide Q9NS71 is provided in PCT patent applications WO02/00690 and WO02/08284 (PRO1005-SEQ ID 140).


In conclusion, this systematic and comprehensive approach has identified a large number of proteins that are differentially displayed in populations with and without coronary disease. These proteins include inflammatory mediators and defense mechanism proteins and now comprise a group of candidates for additional validation tests to identify novel markers for disease.


Example 5

Gene Expression Profile in Circulating Leukocytes Identifies Patients with Coronary Artery Disease


We have investigated whether gene expression patterns in circulating leukocytes are associated with presence and extent of CAD. Patients undergoing coronary angiography were selected according to their Duke CAD index (CADi), a validated angiographical measure of the extent of coronary atherosclerosis that correlates with outcome. RNA was extracted from 120 patients with CAD (CADi>23) and from 121 partially matched controls without CAD (CADi=0). Gene expression was assessed using Affymetrix U1333A chips. Genes correlating with CAD were identified using a Spearman test, and predictive gene expression patterns were identified using a partial least squares (PLS) regression analysis.


160 individual genes were found to significantly correlate with CADi (rho>0.2, P<0.0027, n=222), although changes in individual gene expression were relatively small (1.2 to 1.5 fold). Using these 160 genes, the PLS multivariate regression model resulted in a highly predictive model (r2=0.764, P<0.001). Subsequent analysis showed that most of the predictive model was carried by only 8 genes (r2=0.752) (Table 12).

TABLE 12GeneFold changeP-valuerhoFerritin light chain1.20.0080.24FK506 binding protein 81.40.0790.25Tubulin alpha1.20.0200.28Transport-secretion protein 2.21.50.0190.24UBX domain-containing 11.50.0150.21Hypothetical protein LOC512571.40.0020.24Inositol 1,3,4-triphosphate 5/6 kinase1.30.0020.30PTEN induced putative kinase 11.50.0010.22


In conclusion, simultaneous expression pattern of eight genes is highly predictive for CAD. Peripheral leukocyte gene expression pattern is thus a non-invasive biomarker for CAD and leads to new pathophysiologic insights.


The foregoing description has been presented only for the purposes of illustration and is not intended to limit the invention to the precise form disclosed, but by the claims appended hereto.


While the foregoing has described what are considered to be the best mode and/or other preferred embodiments, it is understood that various modifications may be made therein and that the invention or inventions may be implemented in various forms and embodiments, and that they may be applied in numerous applications, only some of which have been described herein. As used herein, the terms “includes” and “including” mean without limitation. It is intended by the following claims to claim any and all modifications and variations that fall within the true scope of the inventive concepts.

Claims
  • 1. A method of identifying or predicting the predisposition of coronary artery disease in a subject comprising the steps of: (i) determining the level of gene expression of at least one gene in a subject to provide a first value, wherein the at least one gene is selected from the group of genes consisting of: (a) genes from Table 6 with the accession codes: BG537190, L37033, AL581768, AF055000, NM025241, AF151074, AF279372 and BF432478; (b) genes from Table 7 having sequence numbers: SEQ ID NO. 1, SEQ ID. NO. 2, SEQ ID NO. 3, SEQ ID NO. 4, SEQ ID NO. 5, SEQ ID NO. 6, SEQ ID NO. 7 and SEQ ID. NO. 8; (c) genes from Table 9: PMS2L5 (SEQ ID NO. 9), RXRA (SEQ ID NO. 10), GCN5L1 (SEQ ID NO. 11), CABIN1 (SEQ ID NO. 12), LGALS9 (SEQ ID NO. 13), CEBPA (SEQ ID NO. 14), LRRN4 (SEQ ID NO. 15), STXBP2 (SEQ ID NO. 16), SH3BP2 (SEQ ID NO. 17), RNF24 (SEQ ID NO. 18), PLAUR (SEQ ID NO. 19), RIS1 (SEQ ID NO. 20), ADD1 (SEQ ID NO. 21), GPSM3 (SEQ ID NO. 22), BC002942 (SEQ ID NO. 23), TNFRSF5 (SEQ ID NO. 24), N4BP1 (SEQ ID NO. 25), FLJ12438 (SEQ ID NO. 26) and MMP24 (SEQ ID NO. 27); and (d) genes from Table 10: PTP4A1 (SEQ ID NO. 28), PAFAH1B1 (SEQ ID NO. 29), SOX4 (SEQ ID NO. 30), ASNA1 (SEQ ID NO. 31), MAN2A2 (SEQ ID NO. 32), NFYC (SEQ ID NO. 33), NOTCH2 (SEQ ID NO. 34), HDAC5 (SEQ ID NO. 35), HCFC1 (SEQ ID NO. 36), NFX1 (SEQ ID NO. 37), CRSP2 (SEQ ID NO. 38), ICAM1 (SEQ ID NO. 39), PSG3 (SEQ ID NO. 40), STC2 (SEQ ID NO. 41) and SEMA3C (SEQ ID NO. 42); (ii) determining the level of gene expression of said at least one gene in a control or reference standard to provide a second value and (iii) comparing the difference between said first value and second value, wherein a first value greater than the second value is indicative of the presence or prediction of coronary artery disease.
  • 2. The method according to claim 1, wherein said control or reference standard is determined from a subject or group of subjects without coronary artery disease.
  • 3. The method according to claim 1, wherein the prediction of the presence of coronary artery disease has a probability of at least 50%.
  • 4. The method according to claim 1, wherein the first value is at least 20% greater than the second value.
  • 5. The method according to claim 1, wherein the level of expression is detected by microarray analysis, Northern blot analysis, reverse transcription PCR or RT-PCR.
  • 6. The method according to claim 1, wherein said level of gene expression and/or the level of peptide is measured ex vivo in a sample selected from the group of: blood, serum, plasma, lymph, urine, tear, saliva, cerebrospinal fluid, leukocyte sample or tissue sample.
  • 7. The method according to claim 1, wherein said method further comprises the measurement of CAD-Index.
  • 8. The method according to claim 1, wherein a CAD Index between 23-100 is indicative of the probability of the presence or predisposition of coronary artery disease.
  • 9. The method according to claim 1, wherein the level of gene expression of a plurality of said genes is determined.
  • 10. The method according to claim 1, wherein the level of gene expression of a plurality of genes selected from Table 12 is determined.
  • 11. The method according to claim 1, wherein the level of gene expression of all the genes of Table 12 are determined.
  • 12. A method of monitoring a subject identified as having coronary artery disease before and after treatment comprising the steps of: (i) determining the level of gene expression of at least one gene in a subject to provide a first value, wherein the at least one gene is selected from the group of genes consisting of: (a) genes from Table 6 with the accession codes: BG537190, L37033, AL581768, AF055000, NM025241, AF151074, AF279372 and BF432478; (b) genes from Table 7 having sequence numbers: SEQ ID NO. 1, SEQ ID. NO. 2, SEQ ID NO. 3, SEQ ID NO. 4, SEQ ID NO. 5, SEQ ID NO. 6, SEQ ID NO. 7 and SEQ ID. NO. 8; (c) genes from Table 9: PMS2L5 (SEQ ID NO. 9), RXRA (SEQ ID NO. 10), GCN5L1 (SEQ ID NO. 11), CABIN1 (SEQ ID NO. 12), LGALS9 (SEQ ID NO. 13), CEBPA (SEQ ID NO. 14), LRRN4 (SEQ ID NO. 15), STXBP2 (SEQ ID NO. 16), SH3BP2 (SEQ ID NO. 17), RNF24 (SEQ ID NO. 18), PLAUR (SEQ ID NO. 19), RIS1 (SEQ ID NO. 20), ADD1 (SEQ ID NO. 21), GPSM3 (SEQ ID NO. 22), BC002942 (SEQ ID NO. 23), TNFRSF5 (SEQ ID NO. 24), N4BP1 (SEQ ID NO. 25), FLJ12438 (SEQ ID NO. 26) and MMP24 (SEQ ID NO. 27); and (d) genes from Table 10: PTP4A1 (SEQ ID NO. 28), PAFAH1B1 (SEQ ID NO. 29), SOX4 (SEQ ID NO. 30), ASNAL (SEQ ID NO. 31), MAN2A2 (SEQ ID NO. 32), NFYC (SEQ ID NO. 33), NOTCH2 (SEQ ID NO. 34), HDAC5 (SEQ ID NO. 35), HCFC1 (SEQ ID NO. 36), NFX1 (SEQ ID NO. 37), CRSP2 (SEQ ID NO. 38), ICAM1 (SEQ ID NO. 39), PSG3 (SEQ ID NO. 40), STC2 (SEQ ID NO. 41) and SEMA3C (SEQ ID NO. 42); (ii) determining the level of gene expression of said at least one gene in a control or reference standard to provide a second value and (iii) comparing the difference between said first value and second value, wherein a first value greater than the second value is indicative of the presence or prediction of coronary artery disease; wherein said steps are performed before treatment and after treatment for coronary artery disease to determine that a difference in the level of gene expression corresponds to the efficacy of the treatment of coronary artery disease in said subject.
  • 13. A method of identifying or predicting coronary artery disease (CAD) in a subject comprising: (a) determining the level of one or more peptides selected from Table 11 in a subject to provide a first value, (b) determining the level of said one or more peptides selected from Table 11 in a control or reference standard to provide a second value and (c) comparing whether there is a difference between said first value and second value, wherein a first value different from the second value is indicative of the presence or prediction of coronary artery disease.
  • 14. The method according to claim 13, wherein said peptide level is measured in a blood, plasma or serum sample.
  • 15. A method of monitoring a subject identified as having coronary artery disease before and after treatment comprising the steps of: (a) determining the level of one or more peptides selected from Table 11 in a subject to provide a first value, (b) determining the level of said one or more peptides selected from Table 11 in a control or reference standard to provide a second value and (c) comparing whether there is a difference between said first value and second value, wherein a first value different from the second value is indicative of the presence or prediction of coronary artery disease; wherein said steps are performed before treatment and after treatment for coronary artery disease to determine that a difference in the level of peptide corresponds to the efficacy of the treatment of coronary artery disease in said subject.
CLAIM OF PRIORITY

This patent application is a Continuation-in-Part of U.S. patent application U.S. Ser. No. 10/575814, filed Apr. 14, 2006, which is the national stage of PCT patent application PCT/EP2004/011651, filed Oct. 15, 2004, which claims priority to U.S. provisional patent applications U.S. Ser. No. 60/511784, filed Oct. 16, 2003, and U.S. Ser. No. 60/574818, filed May 27, 2004.

Provisional Applications (2)
Number Date Country
60574818 May 2004 US
60511784 Oct 2003 US
Continuation in Parts (1)
Number Date Country
Parent 10575814 Jun 2007 US
Child 11766915 Jun 2007 US