The present invention is in the field of vascular disease diagnosis and therapy. In particular, the present invention relates to specific single nucleotide polymorphisms (SNPs) in the human genome, and their association with vascular disease and related pathologies, in particular, coronary artery disease (CAD) such as coronary stenosis.
Cardiovascular disorders are a cause of significant morbidity and mortality in the United States. Among the more common cardiovascular disorders are coronary artery diseases (CADs). CADs, sometimes designated coronary heart diseases or ischemic heart diseases, are characterized by insufficiency in blood supply to cardiac muscle. CADs can be manifested as acute cardiac ischemia (e.g., angina pectoris or myocardial infarction) or chronic cardiac ischemia (e.g., coronary arteriosclerosis or coronary atherosclerosis). CADs are a common cause of cardiac failure, cardiac arrhythmias, and sudden death. In patients afflicted with CADs, the cardiac muscle is not sufficiently supplied with oxygen. Severe cardiac ischemia can be manifested as severe pain or cardiac damage. Less severe ischemia can damage cardiac muscle and cause changes to cardiac tissues over the long term that impair cardiac function.
Many disorders, including CADs, develop over time and could be delayed, inhibited, lessened in severity, or prevented altogether by making lifestyle changes or through pharmaceutical treatment. For cardiovascular disorders such as CAD, such changes include increasing exercise, adjusting diet, consuming nutritional or pharmaceutical products known to be effective against cardiovascular disorders, and undergoing heightened medical monitoring. These changes are often not made, due to the expense or inconvenience of the changes to an individual and on her subjective belief that she is not at high risk for cardiovascular disorders. Improved monitoring of cardiovascular health can help to identify individuals at risk for developing cardiovascular disorders, including CAD, and permit for more informed decisions as to whether lifestyle changes are justified.
One way to identify subjects at high risk for developing CAD is by identifying genetic elements that predispose an individual to develop CAD. Polymorphisms conferring higher risks to non-cardiovascular diseases have been identified which aid in their diagnosis. Apolipoprotein E genetic screening aids in identifying genetic carriers of the apoE4 polymorphism in dementia patients for the differential diagnosis of Alzheimer's disease. Factor V Leiden polymorphisms signals a predisposition to deep venous thrombosis. The identification of polymorphisms in disease-associated genes also aids in designing an effective treatment plan for the disorder. For example, in the treatment of cancer, diagnosis of genetic variants in tumor cells is used for the selection of the most appropriate treatment regimen for the individual patient. In breast cancer, genetic variation in estrogen receptor expression or heregulin type 2 (Her2) receptor tyrosine kinase expression determine if anti-estrogenic drugs (e.g. tamoxifen) or anti-Her2 antibody (e.g. Herceptin) will be incorporated into the treatment plan. In chronic myeloid leukemia (CML) diagnosis of the Philadelphia chromosome genetic translocation fusing the genes encoding the Bcr and Abl receptor tyrosine kinases indicates that Gleevec (ST1571), a specific inhibitor of the Bcr-Abl kinase should be used for treatment of the cancer. For CML patients with such a genetic alteration, inhibition of the Bcr-Abl kinase leads to rapid elimination of the tumor cells and remission from leukemia.
Therefore, a need remains for the identification of genomic polymorphisms that predispose an individual to develop cardiovascular diseases such as CAD and that aid in their treatment. The invention provides such CAD-determinative genes and polymorphisms, and related assays, satisfying this need.
The invention broadly relates to estimating, and aiding to estimate, the likelihood that a subject will be afflicted with cardiovascular disease, and to identifying subjects with an elevated risk of developing cardiovascular disease and to related kits and reagents. In one embodiment, the cardiovascular disease is coronary artery disease (CAD). The invention also relates, in part, to methods and reagents for identifying, or aiding in the identification of, subjects at high risk of developing CAD or other cardiovascular diseases.
Another aspect of the invention provides a method for identifying an individual who has an altered risk for developing CAD, comprising detecting the presence of a single nucleotide polymorphism (SNP) in said individual's nucleic acids, wherein the presence of the SNP is correlated with an altered risk for coronary stenosis in said individual. In one embodiment, the SNP is selected from SNPs set forth in Tables 1-5. In one embodiment, the SNP is represented by a SEQ ID NOs: selected from 1-575. In one embodiment, the altered risk is an increased risk. In one embodiment, the detection is carried out by a process selected from the group consisting of: allele-specific probe hybridization, allele-specific primer extension, allele-specific amplification, sequencing, 5′ nuclease digestion, molecular beacon assay, oligonucleotide ligation assay, size analysis, and single-stranded conformation polymorphism.
Assessments of genomic polymorphism content in two or more of the CAD-determinative genes can be combined to determine the risk of a subject in developing cardiovascular disease. This assessment of cardiovascular health can be used to predict the likelihood that the human will develop CAD or other cardiovascular disorders such as myocardial infarction and hypertension. Identification of high-risk subjects allows for the early intervention to prevent, delay, or ameliorate the onset of cardiovascular disease.
Another aspect of the invention provides an isolated nucleic acid molecule comprising at least 10, 15, 20, 21 or more contiguous nucleotides, wherein one of the nucleotides is a single nucleotide polymorphism (SNP) selected from any one of the nucleotide sequences in SEQ ID NOS: 1-575, or a complement thereof.
One aspect of the present invention relates to an isolated nucleic acid molecule comprising a nucleotide sequence in which at least one nucleotide is a SNP disclosed in Tables 1-4. In an alternative embodiment, a nucleic acid of the invention is an amplified polynucleotide, which is produced by amplification of a SNP-containing nucleic acid template. In another embodiment, the invention provides for a variant protein which is encoded by a nucleic acid molecule containing a SNP disclosed herein. In yet another embodiment of the invention, a reagent for detecting a SNP in the context of its naturally-occurring flanking nucleotide sequences (which can be, e.g., either DNA or mRNA) is provided. In particular, such a reagent may be in the form of, for example, a hybridization probe or an amplification primer that is useful in the specific detection of a SNP of interest. In an alternative embodiment, a protein detection reagent is used to detect a variant protein which is encoded by a nucleic acid molecule containing a SNP disclosed herein. A preferred embodiment of a protein detection reagent is an antibody or an antigen-reactive antibody fragment.
Another aspect of the invention provides kits comprising SNP detection reagents, and methods for detecting the SNPs disclosed herein by employing detection reagents. In a specific embodiment, the present invention provides for a method of identifying an individual having an increased or decreased risk of developing coronary artery disease by detecting the presence or absence of one or more SNP alleles disclosed herein. In another embodiment, a method for diagnosis of coronary artery disease by detecting the presence or absence of one or more SNP alleles disclosed herein is provided.
The nucleic acid molecules of the invention can be inserted in an expression vector, such as to produce a variant protein in a host cell. Thus, the present invention also provides for a vector comprising a SNP-containing nucleic acid molecule, genetically-engineered host cells containing the vector, and methods for expressing a recombinant variant protein using such host cells. In another specific embodiment, the host cells, SNP-containing nucleic acid molecules, and/or variant proteins can be used as targets in a method for screening and identifying therapeutic agents or pharmaceutical compounds useful in the treatment of coronary artery disease.
Another aspect of the invention provides a method for treating coronary artery disease in a human subject wherein said human subject harbors a SNP, gene, transcript, and/or encoded protein identified in Tables 1-4, which method comprises administering to said human subject a therapeutically or prophylactically effective amount of one or more agents counteracting the effects of the disease, such as by inhibiting (or stimulating) the activity of the gene, transcript, and/or encoded protein identified in Tables 1-4.
Another aspect of this invention provides a method for treating coronary artery disease in a human subject, which method comprises: (i) determining that said human subject harbors a SNP, gene, transcript, and/or encoded protein identified in Tables 1-4, and (ii) administering to said subject a therapeutically or prophylactically effective amount of one or more agents counteracting the effects of the disease.
Another aspect of this invention provides a method for identifying an agent useful in therapeutically or prophylactically treating coronary artery disease in a human subject wherein said human subject harbors a SNP, gene, transcript, and/or encoded protein identified in Tables 1-2, which method comprises contacting the gene, transcript, or encoded protein with a candidate agent under conditions suitable to allow formation of a binding complex between the gene, transcript, or encoded protein and the candidate agent and detecting the formation of the binding complex, wherein the presence of the complex identifies said agent.
Another aspect of the invention provides a method for stratifying a patient population for treatment of coronary artery disease, wherein said population has an altered risk for developing coronary artery disease due to the presence of a single nucleotide polymorphism (SNP) in any one of the nucleotide sequences of SEQ ID NOS: 1-575 in an individual's nucleic acids from said population, comprising detecting the SNP, wherein the presence of the SNP is correlated with an altered risk for coronary artery disease in said individual thereby indicating said individual should receive treatment for coronary artery disease.
The methods of SNP genotyping provided by the invention are useful for numerous practical applications. Examples of such applications include, but are not limited to, disease predisposition screening, disease diagnosis, disease prognosis, disease progression monitoring, determining therapeutic strategies based on an individual's genotype (“pharmacogenomics”), developing therapeutic agents based on SNP genotypes associated with a disease or likelihood of responding to a drug, stratifying a patient population for clinical trial for a treatment regimen, predicting the likelihood that an individual will experience toxic side effects from a therapeutic agent, and human identification applications such as forensics.
The invention provides, in part, novel methods of determining the risk that an individual will develop a cardiovascular disease. The invention also provides methods of identifying subjects having an elevated risk of developing a cardiovascular disease, such as CAD. The invention is based, in part, on the unexpected findings by applicants that polymorphisms in several genes are highly correlated with the susceptibility of the subject to develop CAD.
The methods and compositions described herein can be used in determining the susceptibility to prognosis of various forms of coronary artery disease. Moreover, the methods and compositions of the present invention can also be used to facilitate the prevention of cardiovascular disease in an individuals found to be at an elevated risk for developing the disease.
One aspect of the invention relates to specific single nucleotide polymorphisms (SNPs) in the human genome, and their association with vascular disease and related pathologies, in particular, coronary artery disease (CAD) such as coronary stenosis. Based on differences in allele frequencies in the vascular disease patient population relative to normal individuals, the naturally-occurring SNPs disclosed herein can be used as targets for the design of diagnostic reagents and the development of therapeutic agents, as well as for disease association and linkage analysis. In particular, the SNPs of the present invention are useful for identifying an individual who is at an increased or decreased risk of developing vascular disease and for early detection of the disease, for providing clinically important information for the prevention and/or treatment of vascular disease, and for screening and selecting therapeutic agents. The SNPs disclosed herein are also useful for human identification applications. Methods, assays, kits, and reagents for detecting the presence of these polymorphisms and their encoded products are provided.
The present invention provides novel SNPs associated with coronary artery disease, as well as some SNPs that were previously known in the art, but were not previously known to be associated with coronary stenosis. Accordingly, the present invention provides novel compositions and methods based on the novel SNPs disclosed herein, and also provides novel methods of using the known, but previously unassociated, SNPs in methods relating to coronary stenosis (e.g., for diagnosing coronary stenosis, etc.).
One specific aspect of the invention provides methods of predicting the risk of developing CAD. One aspect of the invention provides a method of diagnosing premature CAD in an individual, including previously undiagnosed individuals or individuals without any type of cardiovascular disease. In one embodiment, the method comprises obtaining a DNA sample from the individual and determining the presence of one or more polymorphisms in at least one CAD-determinative gene. The presence of one or more polymorphisms is an indication that the individual is at high risk of developing a cardiovascular disease, such as CAD. Preferred polymorphisms are listed on Tables 1, 2 and 3. In one embodiment, the polymorphism is a polymorphism from Table 1 showing a p value of less than 0.05, 0.04, 0.03. 0.02. 0.01, 0.05, 0.02, 0.01, 0.005, 0.002 or 0.001. In some embodiments, the polymorphic change is at the same location along the genome as the polymorphisms found in Tables 1, 2 or 3. As an illustrative embodiment, if a given polymorphism in Table 1 consisted of a G to A nucleotide change at a given position on the genome, some embodiments would include screening for the change of G to C or G to T. Accordingly, in some embodiments, the presence of a polymorphism at the genomic position, regardless of the nature of the nucleotide change(s), indicates that the subject is at a higher risk of developing a cardiovascular disease. In one embodiment, the absence of the wild-type sequence in a polymorphic region is indicative of a higher likelihood of developing CAD.
The methods of the present invention may be used with a variety of contexts and maybe be used to assess the status of a variety of individuals. For example, the methods may be used to assess the status of individuals with no previous diagnosis of coronary artery disease, or with no significant cardiovascular risk factors. Cardiovascular risk factors include, but are not limited to, cholesterol, HDL cholesterol, systolic blood pressure, cigarette smoking, exercise, alcohol, race, obesity, family history of premature coronary artery disease, and medication use, including aspirin, statins, B-blockers and hormone replacement therapy in women.
Other indicia predictive of CAD can be detected or monitored in the subject in conjunction with the detection of polymorphisms in CAD-determinative genes. This may be useful to increase the predictive power of the methods described herein. Preferred indicia include the detection of additional CAD-determinative polymorphisms in genes not listed in Tables 1, 2 or 3, medical examination of the subject's cardiovascular system, and detection of gene products or other metabolites in a sample from a patient, such as a blood sample. In some embodiments, additional factors that may be monitored may be administration of pharmaceuticals known or suspected of having cardiovascular effects, such as increasing blood pressure, preferably in at least 5% or 10% of subjects who are administered the pharmaceuticals. In addition, the presence of cardiovascular risk factors, such as those listed in the preceding paragraph, may be also be weighed when assessing the risk of a subject for developing the cardiovascular disease.
A “coronary artery disease” (“CAD”) is a pathological state characterized by insufficiency of oxygen delivery to cardiac muscle, wherein the condition is associated with some dysfunction of coronary blood vessels. As used in this disclosure, CADs include both disorders in which symptomatic and/or asymptomatic cardiac ischemia occurs (e.g., angina pectoris and myocardial infarction) and disorders that gradually lead to chronic or acute cardiac ischemia, even at the stage of the disorder at which such ischemia is not yet evident (e.g., coronary arteriosclerosis and atherosclerosis).
An “increased risk” refers to a statistically higher frequency of occurrence of the disease or condition in an individual carrying a particular polymorphic allele in comparison to the frequency of occurrence of the disease or condition in a member of a population that does not carry the particular polymorphic allele.
A “treatment plan” refers to at least one intervention undertaken to modify the effect of a risk factor upon a patient. A treatment plan for a cardiovascular disorder or disease can address those risk factors that pertain to cardiovascular disorders or diseases. A treatment plan can include an intervention that focuses on changing patient behavior, such as stopping smoking. A treatment plan can include an intervention whereby a therapeutic agent is administered to a patient. As examples, cholesterol levels can be lowered with proper medication, and diabetes can be controlled with insulin. Nicotine addiction can be treated by withdrawal medications. A treatment plan can include an intervention that is diagnostic. The presence of the risk factor of hypertension, for example, can give rise to a diagnostic intervention whereby the etiology of the hypertension is determined. After the reason for the hypertension is identified, further treatments may be administered.
The phrase “predicting the likelihood of developing” as used herein refers to methods by which the skilled artisan can predict onset of a cardiovascular condition in an individual. The term “predicting” does not refer to the ability to predict the outcome with 100% accuracy. Instead, the skilled artisan will understand that the term “predicting” refers to forecast of an increased or a decreased probability that a certain outcome will occur; that is, that an outcome is more likely to occur in an individual having one or more CAD-determinative polymorphisms.
A subject at higher risk of developing a cardiovascular disease refers to a subject having at least a 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, 125%, 150%, 200%, 300%, 400%, 500%, 600%, 7000/, 800%, 900% or 1000% greater probability of developing the condition, relative to the general population. In one embodiment, the comparison is not to a general population but rather to a population matched by one or more factors such as age, sex, race, ethnicity, etc. In one embodiment, the population is one existing within a time frame of 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45 or 50 years from the time of testing.
The term “polymorphism”, as used herein, refers to a difference in the nucleotide sequence of a given region, such as a region in a chromosome, as compared to a nucleotide sequence in a homologous region of another individual, in particular, a difference in the nucleotide of a given region which differs between individuals of the same species. A polymorphism is generally defined in relation to a reference sequence, usually referred to as the “wild-type” sequence. Polymorphisms include single nucleotide differences, differences in sequence of more than one nucleotide, and single or multiple nucleotide insertions, inversions and deletions. In certain embodiments, the polymorphism is within a non-coding region or in a translated region. In certain embodiments, the polymorphism is a silent polymorphism within a translated region. In some embodiments, the polymorphism results in an amino acid substitution. Where a polymorphic site is a single nucleotide in length, the site is referred to as a single nucleotide polymorphism (“SNP”). For example, if at a particular chromosomal location, one member of a population has an adenine and another member of the population has a thymine at the same position, then this position is a polymorphic site, and, more specifically, the polymorphic site is a SNP. Each version of the sequence with respect to the polymorphic site is referred to herein as an “allele” of the polymorphic site. Thus, in the previous example, the SNP allows for both an adenine allele and a thymine allele.
A “haplotype,” as described herein, refers to a combination of genetic markers (“alleles”), such as the SNPs set forth in Tables 1 and 2 and 3.
The nucleotide designation “R” refers to A or G nucleotides, while designation ‘N’ refers to G or A or T or C nucleotides, in accordance with IUPAC designations.
The present invention is based, at least in part, on the identification of alleles, in multiple genes, that are associated (to a statistically-significant extent) with the development of CAD in humans. Detection of these alleles in a subject indicates that the subject is predisposed to the development of a cardiovascular disease and in particular CAD. The identification of individuals predisposed to developing CAD, as identified using the methods described here, may prove useful in allowing the implementation of preventive treatment plans to delay or reduce the incidence of CAD.
Those skilled in the art will readily recognize that nucleic acid molecules may be double-stranded molecules and that reference to a particular site on one strand refers, as well, to the corresponding site on a complementary strand. In defining a SNP position, SNP allele, or nucleotide sequence, reference to an adenine, a thymine (uridine), a cytosine, or a guanine at a particular site on one strand of a nucleic acid molecule also defines the thymine (uridine), adenine, guanine, or cytosine (respectively) at the corresponding site on a complementary strand of the nucleic acid molecule. Thus, reference may be made to either strand in order to refer to a particular SNP position, SNP allele, or nucleotide sequence. Probes and primers, may be designed to hybridize to either strand and SNP genotyping methods disclosed herein may generally target either strand. Throughout the specification, in identifying a SNP position, reference is generally made to the protein-encoding strand, only for the purpose of convenience One aspect of the invention provides a method of estimating, or aiding in the estimation of, the risk of developing a cardiovascular disease, such as CAD, in a subject, the method comprising (i) providing a nucleic acid sample from the subject; (ii) detecting the presence of one or more single nucleotide polymorphisms (SNPs) in a CAD-determinative gene in the nucleic acid sample, wherein the presence of one or more SNPs reflects a higher risk of developing the cardiovascular disease. A related aspect of the invention provides a method of identifying a subject having an elevated risk of developing a cardiovascular disease, such as CAD, the method comprising (i) providing a nucleic acid sample from the subject; (ii) detecting the presence of one or more single nucleotide polymorphisms (SNPs) in a CAD-determinative gene in the genomic sample, wherein a subject having one or more SNPs is identified as a subject having an elevated risk of developing cardiovascular disease. To better characterize the subject's genetic content, occurrence of polymorphisms that are not associated with a disorder can also be assessed, so that one can determine whether the human is 1) homozygous for the CAD-determinative polymorphism at a genomic site, 2) heterozygous for a CAD-determinative and disorder-non-associated polymorphisms at the genomic site, or 3) homozygous for a CAD-non-associated polymorphisms at the site. In one embodiment, both the presence of a SNP polymorphism and of the wild-type sequence is determined.
Tables 1-5 provide a variety of information about SNPs of the present invention that are associated with coronary artery disease. Tables 4 (SEQ ID NOs:1-575) and Table 5 (SEQ ID NOs: 576-1050) disclose genomic SNP sequences. The sequences on Table 4 correspond to genomic sequences containing the SNP, while those on Table 5 have the corresponding genomic sequences without the SNP. Table 3 provides additional information for these sequences, including the chromosome position of the SNP, the gene locus in which the SNP is found, the Genbank accession number (which provides another way of naming the gene locus), a probe number and a genomic location within the chromosomes. Table 3 also provides the SEQ ID NOs for the SNP sequence and the nonSNP sequence for cross-reference with Tables 4-5.
In one embodiment, the CAD-determinative gene containing the SNP is one of the genes listed in Table 1. Table 1 includes the following genes: A1M1L, PLA2G7, OR7E29P, PLN, PTPN6, C1ORF38, GATA2, IL7R, MYLK, ANPEP, PIK3R4, RPLP2, OLR1, PNPLA2, TCF4, ACP5, SELP, BAX, CPNE4, TAL1, KLF15, ABCB1, LHFPL2, ITGAX, LOC389142, PLXNC1, SLA, ELL, NPY, IGSF11, ITPK1, ASB1, SELB, LOC131873, PCCA, HAPIP, PLAUR, SIDT1, RPN1, BPAG1, ROR2, MMP12, GAP43, FSTL1, MAP4, ZNF217, ALOX5, NPHP3, GPNMB, SPP1, ZNF80, MGP, C3ORF15, NEK11, POLQ, ADFP, UBXD1, 38413, FLJ46299, ZBTB20, HLA-DQA2, ZXDC, GRN, PSCD1, GYS1, C14ORF132, CD80, CDGAP, LMOD1, SLC41A3, HOXD1, STAT5A, OPRM1, ITPR2, HIF1A, PKD2, STEAP, AGTR1, NDUFB4, GLRA3, MEF2A, STXBP5L, APOBEC3D, FMNL1, PLXND1, ATP2C1, RUVBL1, CASR, PTPRR, SMPDL3A, APOD, APG3L, FLJ35880, TMCC1, CD96, C1QB, CTSD, FLI1, MMP9, TCIRG1, ITGB5, FLJ25414, NR1H3, HSPBAP1, APOC1, THPO, FTL, HADHSC, ALOX5AP, LAIR1, UPP1, LAPTM5, CSTA, ADCY5, PHLDB2, GM2A, NUDT16, ACSL1, VAMP5, ACP2, HLA-DPA1, TUBA3, MMP7, H41, NR112, FGFR2, OBA, CHAF1A, GSK3B, DOCK2, URB, HCLS1, CD200R1, SLCO2B1, B4GALT4, PLCXD2, FABP7, CAMKK2, FCGR1A, SELL, SELE, HNRPM, MGC45840, F5, SMTN, RAI3, HLA-DRA, CSTB, FLJ12592 and TAGLN3.
In one embodiment, the SNP is one of those listed in Tables 1-4. In another embodiment, the SNP is one that is highly-statistically associated (p<0.1, p<0.05 or p<0.01) with the development of CAD. In another embodiment, the SNP is a SNP in linkage disequilibrium with one of the aforementioned SNPs. The third and fourth columns in Table 1 indicate the chromosome and the location within chromosome where the polymorphism in located.
In one embodiment, the method of estimating the risk of developing coronary artery disease (CAD) in a subject comprises determining the presence of more than one SNP from Tables 1-4 in the genomic sample from the subject, which may be from one gene of from two or more genes.
In addition to the SNPs described in Tables 1-4, one of skill in the art can readily identify other alleles (including polymorphisms and mutations) that are in linkage disequilibrium with one of the SNPs described herein. For example, a nucleic acid sample from a first group of subjects without CAD can be collected, as well as DNA from a second group of subjects with CAD. The nucleic acid sample can then be compared to identify those alleles that are over-represented in the second group as compared with the first group, wherein such alleles are presumably associated with CAD. Alternatively, alleles that are in linkage disequilibrium with a CAD associated-allele can be identified, for example, by genotyping a large population and performing statistical analysis to determine which alleles appear more commonly together than expected.
Preferably the group is chosen to be comprised of genetically-related individuals. Genetically-related individuals include individuals from the same race, the same ethnic group, or even the same family. As the degree of genetic relatedness between a control group and a test group increases, so does the predictive value of polymorphic alleles which are ever more distantly linked to a disease-causing allele. This is because less evolutionary time has passed to allow polymorphisms which are linked along a chromosome in a founder population to redistribute through genetic cross-over events. Thus race-specific, ethnic-specific, and even family-specific diagnostic genotyping assays can be developed to allow for the detection of disease alleles which arose at ever more recent times in human evolution, e.g., after divergence of the major human races, after the separation of human populations into distinct ethnic groups, and even within the recent history of a particular family line.
Appropriate probes may be designed to hybridize to one of the alleles listed in Tables 1-3. Alternatively, these probes may incorporate other regions of the relevant genomic locus, including intergenic sequences. Yet other polymorphisms available for use with the immediate invention are obtainable from various public sources. For example, the human genome database collects intragenic SNPs, is searchable by sequence (http://hgbase.interactiva.de). Also available is a human polymorphism database maintained by NCBI (http://www.ncbi.nim.nih.gov/projects/SNP/). From such sources SNPs as well as other human polymorphisms may be found.
Many methods are available for detecting specific alleles at human polymorphic loci. The preferred method for detecting a specific polymorphic allele will depend, in part, upon the molecular nature of the polymorphism. SNPs are most frequently biallelic-occurring in only two different forms (although up to four different forms of an SNP, corresponding to the four different nucleotide bases occurring in DNA, are theoretically possible). Because SNPs typically have only two alleles, they can be genotyped by a simple plus/minus assay rather than a length measurement, making them more amenable to automation.
A variety of methods are available for detecting the presence of a particular single nucleotide polymorphic allele in an individual. Advancements in this field have provided accurate, easy, and inexpensive large-scale SNP genotyping. Most recently, for example, several new techniques have been described including dynamic allele-specific hybridization (DASH), microplate array diagonal gel electrophoresis (MADGE), pyrosequencing, oligonucleotide-specific ligation, the TaqMan system as well as various DNA “chip” technologies such as the Affymetrix SNP chips. These methods require amplification of the target genetic region, typically by PCR. Still other newly developed methods, based on the generation of small signal molecules by invasive cleavage followed by mass spectrometry or immobilized padlock probes and rolling-circle amplification, might eventually eliminate the need for PCR. Several of the methods known in the art for detecting specific single nucleotide polymorphisms are summarized below. The method of the present invention is understood to include all available methods.
Any cell type or tissue may be utilized to obtain nucleic acid samples for use in the diagnostics described herein. In a preferred embodiment, the DNA sample is obtained from a bodily fluid, e.g., blood, obtained by known techniques (e.g. venipuncture), or saliva. Alternatively, nucleic acid tests can be performed on dry samples (e.g. hair or skin). When using RNA or protein, the cells or tissues that may be utilized must express a CAD-determinative gene. In one embodiment, biological samples such as blood, bone, hair, saliva, or semen may be used.
In one embodiment, the single base polymorphism can be detected by using a specialized exonuclease-resistant nucleotide, as disclosed, e.g., in Mundy, C. R. (U.S. Pat. No. 4,656,127). According to the method, a primer complementary to the allelic sequence immediately 3′ to the polymorphic site is permitted to hybridize to a target molecule obtained from a particular animal or human. If the polymorphic site on the target molecule contains a nucleotide that is complementary to the particular exonuclease-resistant nucleotide derivative present, then that derivative will be incorporated onto the end of the hybridized primer. Such incorporation renders the primer resistant to exonuclease, and thereby permits its detection. Since the identity of the exonuclease-resistant derivative of the sample is known, a finding that the primer has become resistant to exonucleases reveals that the nucleotide present in the polymorphic site of the target molecule was complementary to that of the nucleotide derivative used in the reaction. This method has the advantage that it does not require the determination of large amounts of extraneous sequence data.
In another embodiment of the invention, a solution-based method is used for determining the identity of the nucleotide of a polymorphic site. Cohen, D. et al. (French Patent 2,650,840; PCT Appln. No. WO91/02087). As in the Mundy method of U.S. Pat. No. 4,656,127, a primer is employed that is complementary to allelic sequences immediately 3′ to a polymorphic site. The method determines the identity of the nucleotide of that site using labeled dideoxynucleotide derivatives, which, if complementary to the nucleotide of the polymorphic site will become incorporated onto the terminus of the primer.
An alternative method, known as Genetic Bit Analysis or GBA™ is described by Goelet, P. et al. (PCT Appln. No. 92/15712). The method of Goelet, P. et al. uses mixtures of labeled terminators and a primer that is complementary to the sequence 3′ to a polymorphic site. The labeled terminator that is incorporated is thus determined by, and complementary to, the nucleotide present in the polymorphic site of the target molecule being evaluated. In contrast to the method of Cohen et al. (French Patent 2,650,840; PCT Appln. No. WO91/02087) the method of Goelet, P. et al. is preferably a heterogeneous phase assay, in which the primer or the target molecule is immobilized to a solid phase.
Recently, several primer-guided nucleotide incorporation procedures for assaying polymorphic sites in DNA have been described (Komher, J. S. et al., Nucl. Acids. Res. 17:7779-7784 (1989); Sokolov, B. P., Nucl. Acids Res. 18:3671 (1990); Syvanen, A.-C., et al., Genomics 8:684-692 (1990); Kuppuswamy, M. N. et al., Proc. Natl. Acad. Sci. (U.S.A.) 88:1143-1147 (1991); Prezant, T. R. et al., Hum. Mutat. 1:159-164 (1992); Ugozzoli, L. et al., GATA 9:107-112 (1992); Nyren, P. et al., Anal. Biochem. 208:171-175 (1993)). These methods differ from GBA™ in that they all rely on the incorporation of labeled deoxynucleotides to discriminate between bases at a polymorphic site. In such a format, since the signal is proportional to the number of deoxynucleotides incorporated, polymorphisms that occur in runs of the same nucleotide can result in signals that are proportional to the length of the run (Syvanen, A.-C., et al., Amer. J. Hum. Genet. 52:46-59 (1993)).
For SNPs that produce premature termination of protein translation, the protein truncation test (PTT) offers an efficient diagnostic approach (Roest, et. al., (1993) Hum. Mol. Genet. 2:1719-21; van der Luijt, et. al., (1994) Genomics 20:14). For PTT, RNA is initially isolated from available tissue and reverse-transcribed, and the segment of interest is amplified by PCR. The products of reverse transcription PCR are then used as a template for nested PCR amplification with a primer that contains an RNA polymerase promoter and a sequence for initiating eukaryotic translation. After amplification of the region of interest, the unique motifs incorporated into the primer permit sequential in vitro transcription and translation of the PCR products. Upon sodium dodecyl sulfate-polyacrylamide gel electrophoresis of translation products, the appearance of truncated polypeptides signals the presence of a mutation that causes premature termination of translation. In a variation of this technique, DNA (as opposed to RNA) is used as a PCR template when the target region of interest is derived from a single exon.
Diagnostic procedures may also be performed in situ directly upon tissue sections (fixed and/or frozen) of subject tissue obtained from biopsies or resections, such that no nucleic acid purification is necessary. Nucleic acid reagents may be used as probes and/or primers for such in situ procedures (see, for example, Nuovo, G. J., 1992, PCR in situ hybridization: protocols and applications, Raven Press, N.Y.).
In one preferred detection method is allele specific hybridization using probes overlapping a region of at least one allele of a CAD-determinative gene having about 5, 10, 20, 25, or 30 nucleotides around the mutation or polymorphic region. In one embodiment of the invention, several probes capable of hybridizing specifically to other allelic variants involved in CAD are attached to a solid phase support, e.g., a “chip” (which can hold up to about 250,000 oligonucleotides). Oligonucleotides can be bound to a solid support by a variety of processes, including lithography. Mutation detection analysis using these chips comprising oligonucleotides, also termed “DNA probe arrays” is described e.g., in Cronin et al. (1996) Human Mutation 7:244. In one embodiment, a chip comprises all the allelic variants of at least one polymorphic region of a CAD-determinative gene. The solid phase support is then contacted with a test nucleic acid and hybridization to the specific probes is detected. Accordingly, the identity of numerous allelic variants of one or more genes can be identified in a simple hybridization experiment. The design and use of allele-specific probes for analyzing polymorphisms is known in the art (see, e.g., Dattagupta, EP 235,726, Saiki, WO 89/11548). WO 95/11995 describes subarrays that are optimized for detection of variant forms of a pre-characterized polymorphism.
These techniques may also comprise the step of amplifying the nucleic acid before analysis. Amplification techniques are known to those of skill in the art and include, but are not limited to cloning, polymerase chain reaction (PCR), polymerase chain reaction of specific alleles (ASA), ligase chain reaction (LCR), nested polymerase chain reaction, self-sustained sequence replication (Guatelli, J. C. et al., 1990, Proc. Natl. Acad. Sci. USA 87:1874-1878), transcriptional amplification system (Kwoh, D. Y. et al., 1989, Proc. Natl. Acad. Sci. USA 86:1173-1177), and Q-Beta Replicase (Lizardi, P. M. et al., 1988, Bio/Technology 6:1197). PCR-based detection means can include multiplex amplification of a plurality of markers simultaneously. For example, it is well known in the art to select PCR primers to generate PCR products that do not overlap in size and can be analyzed simultaneously. Alternatively, it is possible to amplify different markers with primers that are differentially labeled and thus can each be differentially detected. Of course, hybridization based detection means allow the differential detection of multiple PCR products in a sample. Other techniques are known in the art to allow multiplex analyses of a plurality of markers. Amplification products may be assayed in a variety of ways, including size analysis, restriction digestion followed by size analysis, detecting specific tagged oligonucleotide primers in the reaction products, allele-specific oligonucleotide (ASO) hybridization, allele specific 5′ exonuclease detection, sequencing, hybridization, and the like.
A merely illustrative embodiment of a method using PCR-amplification includes the steps of (i) collecting a sample of cells from a subject, (ii) isolating nucleic acid (e.g., genomic, mRNA or both) from the cells of the sample, (iii) contacting the nucleic acid sample with one or more primers which specifically hybridize 5′ and 3′ to at least one CAD-determinative gene under conditions such that hybridization and amplification of the allele occurs, and (iv) detecting the amplification product. These detection schemes are especially useful for the detection of nucleic acid molecules if such molecules are present in very low numbers.
In a preferred embodiment of the subject assay, the allele of an CAD-determinative gene is identified by alterations in restriction enzyme cleavage patterns. For example, sample and control DNA is isolated, amplified (optionally), digested with one or more restriction endonucleases, and fragment length sizes are determined by gel electrophoresis.
Alternatively, allele-specific amplification technology which depends on selective PCR amplification may be used in conjunction with the instant invention. Oligonucleotides used as primers for specific amplification may carry the mutation or polymorphic region of interest in the center of the molecule (so that amplification depends on differential hybridization) (Gibbs et al (1989) Nucleic Acids Res. 17:2437-2448) or at the extreme 3′ end of one primer where, under appropriate conditions, mismatch can prevent, or reduce polymerase extension (Prossner (1993) Tibtech 11:238; WO 93/22456). In addition it may be desirable to introduce a novel restriction site in the region of the mutation to create cleavage-based detection (Gasparini et al (1992) Mol. Cell Probes 6:1). It is anticipated that in certain embodiments amplification may also be performed using Taq ligase for amplification (Barany (1991) Proc. Natl. Acad. Sci USA 88:189). In such cases, ligation will occur only if there is a perfect match at the 3′ end of the 5′ sequence making it possible to detect the presence of a known mutation at a specific site by looking for the presence or absence of amplification.
In yet another embodiment, any of a variety of sequencing reactions known in the art can be used to directly sequence the allele. Exemplary sequencing reactions include those based on techniques developed by Maxim and Gilbert ((1977) Proc. Natl Acad Sci USA 74:560) or Sanger (Sanger et al (1977) Proc. Nat. Acad. Sci. USA 74:5463). It is also contemplated that any of a variety of automated sequencing procedures may be utilized when performing the subject assays (see, for example Biotechniques (1995) 19:448), including sequencing by mass spectrometry (see, for example PCT publication WO 94/16101; Cohen et al. (1996) Adv Chromatogr 36:127-162; and Griffin et al. (1993) Appl Biochem Biotechnol 38:147-159). It will be evident to one of skill in the art that, for certain embodiments, the occurrence of only one, two or three of the nucleic acid bases need be determined in the sequencing reaction. For instance, A-track or the like, e.g., where only one nucleic acid is detected, can be carried out.
In a further embodiment, protection from cleavage agents (such as a nuclease, hydroxylamine or osmium tetraoxide and with piperidine) can be used to detect mismatched bases in RNA/RNA or RNA/DNA or DNA/DNA heteroduplexes (Myers, et al. (1985) Science 230:1242). In general, the art technique of “mismatch cleavage” starts by providing heteroduplexes formed by hybridizing (labeled) RNA or DNA containing the wild-type allele with the sample. The double-stranded duplexes are treated with an agent which cleaves single-stranded regions of the duplex such as which will exist due to base pair mismatches between the control and sample strands. For instance, RNA/DNA duplexes can be treated with RNase and DNA/DNA hybrids treated with S1 nuclease to enzymatically digest the mismatched regions. In other embodiments, either DNA/DNA or RNA/DNA duplexes can be treated with hydroxylamine or osmium tetroxide and with piperidine in order to digest mismatched regions. After digestion of the mismatched regions, the resulting material is then separated by size on denaturing polyacrylamide gels to determine the site of mutation. See, for example, Cotton et al (1988) Proc. Natl Acad Sci USA 85:4397; and Saleeba et al (1992) Methods Enzymol. 217:286295. In a preferred embodiment, the control DNA or RNA can be labeled for detection.
In still another embodiment, the mismatch cleavage reaction employs one or more proteins that recognize mismatched base pairs in double-stranded DNA (so called “DNA mismatch repair” enzymes). For example, the mutY enzyme of E. coli cleaves A at G/A mismatches and the thymidine DNA glycosylase from HeLa cells cleaves T at G/T mismatches (Hsu et al. (1994) Carcinogenesis 15:1657-1662). According to an exemplary embodiment, a probe based on an allele of a CAD-determinative gene locus haplotype is hybridized to a cDNA or other DNA product from a test cell(s). The duplex is treated with a DNA mismatch repair enzyme, and the cleavage products, if any, can be detected from electrophoresis protocols or the like. See, for example, U.S. Pat. No. 5,459,039.
In other embodiments, alterations in electrophoretic mobility will be used to identify a CAD-determinative allele. For example, single strand conformation polymorphism (SSCP) may be used to detect differences in electrophoretic mobility between mutant and wild type nucleic acids (Orita et al. (1989) Proc Natl. Acad. Sci. USA 86:2766, see also Cotton (1993) Mutat Res 285:125-144; and Hayashi (1992) Genet Anal Tech Appl 9:73-79). Single-stranded DNA fragments of sample and control CAD-terminative alleles are denatured and allowed to renature. The secondary structure of single-stranded nucleic acids varies according to sequence, the resulting alteration in electrophoretic mobility enables the detection of even a single base change. The DNA fragments may be labeled or detected with labeled probes. The sensitivity of the assay may be enhanced by using RNA (rather than DNA), in which the secondary structure is more sensitive to a change in sequence. In a preferred embodiment, the subject method utilizes heteroduplex analysis to separate double stranded heteroduplex molecules on the basis of changes in electrophoretic mobility (Keen et al. (1991) Trends Genet. 7:5). In yet another embodiment, the movement of alleles in polyacrylamide gels containing a gradient of denaturant is assayed using denaturing gradient gel electrophoresis (DGGE) (Myers et al. (1985) Nature 313:495). When DGGE is used as the method of analysis, DNA will be modified to insure that it does not completely denature, for example by adding a GC clamp of approximately 40 bp of high-melting GC-rich DNA by PCR. In a further embodiment, a temperature gradient is used in place of a denaturing agent gradient to identify differences in the mobility of control and sample DNA (Rosenbaum and Reissner (1987) Biophys Chem 265:12753).
In another embodiment, identification of the allelic variant is carried out using an oligonucleotide ligation assay (OLA), as described, e.g., in U.S. Pat. No. 4,998,617 and in Landegren, U. et al. ((1988) Science 241:1077-1080). The OLA protocol uses two oligonucleotides which are designed to be capable of hybridizing to abutting sequences of a single strand of a target. One of the oligonucleotides is linked to a separation marker, e.g., biotinylated, and the other is detectably labeled. If the precise complementary sequence is found in a target molecule, the oligonucleotides will hybridize such that their termini abut, and create a ligation substrate. Ligation then permits the labeled oligonucleotide to be recovered using avidin, or another biotin ligand. Nickerson, D. A. et al. have described a nucleic acid detection assay that combines attributes of PCR and OLA (Nickerson, D. A. et al. (1990) Proc. Natl. Acad. Sci. USA 87:8923-27). In this method, PCR is used to achieve the exponential amplification of target DNA, which is then detected using OLA.
Several techniques based on this OLA method have been developed and can be used to detect alleles of an CAD-determinative haplotype. For example, U.S. Pat. No. 5,593,826 discloses an OLA using an oligonucleotide having 3′-amino group and a 5′-phosphorylated oligonucleotide to form a conjugate having a phosphoramidate linkage. In another variation of OLA described in Tobe et al. ((1996) Nucleic Acids Res 24: 3728), OLA combined with PCR permits typing of two alleles in a single microtiter well. By marking each of the allele-specific primers with a unique hapten, i.e. digoxigenin and fluorescein, each OLA reaction can be detected by using hapten specific antibodies that are labeled with different enzyme reporters, alkaline phosphatase or horseradish peroxidase. This system permits the detection of the two alleles using a high throughput format that leads to the production of two different colors.
Examples of other techniques for detecting alleles include, but are not limited to, selective oligonucleotide hybridization, selective amplification, or selective primer extension. For example, oligonucleotide primers may be prepared in which the known mutation or nucleotide difference (e.g., in allelic variants) is placed centrally and then hybridized to target DNA under conditions which permit hybridization only if a perfect match is found (Saiki et al. (1986) Nature 324:163); Saiki et al (1989) Proc. Natl Acad. Sci. USA 86:6230). Such allele specific oligonucleotide hybridization techniques may be used to test one mutation or polymorphic region per reaction when oligonucleotides are hybridized to PCR amplified target DNA or a number of different mutations or polymorphic regions when the oligonucleotides are attached to the hybridizing membrane and hybridized with labeled target DNA. Other methods of detecting polymorphisms, e.g., SNPs, are known, e.g., as described in U.S. Pat. Nos. 6,410,231; 6,361,947; 6,322,980; 6,316,196; 6,258,539; and U.S. Publication Nos. 2004/0137464 and 2004/0072156.
The subjects to be tested for characterizing its risk of CAD in the foregoing methods may be any human or other animal, preferably a mammal. In certain embodiments, the subject does not otherwise have an elevated risk of cardiovascular disease according to the traditional risk factors. Subjects having an elevated risk of cardiovascular disease include those with a family history of cardiovascular disease, elevated lipids, smokers, prior acute cardiovascular event, etc. (See, e.g., Harrison's Principles of Experimental Medicine, 15th Edition, McGraw-Hill, Inc., N.Y.—hereinafter “Harrison's”).
In certain embodiments the subject is an apparently healthy nonsmoker. “Apparently healthy”, as used herein, means individuals who have not previously being diagnosed as having any signs or symptoms indicating the presence of atherosclerosis, such as angina pectoris, history of an acute adverse cardiovascular event such as a myocardial infarction or stroke, evidence of atherosclerosis by diagnostic imaging methods including, but not limited to coronary angiography. Apparently healthy individuals also do not otherwise exhibit symptoms of disease. In other words, such individuals, if examined by a medical professional, would be characterized as healthy and free of symptoms of disease. “Nonsmoker” means an individual who, at the time of the evaluation, is not a smoker. This includes individuals who have never smoked as well as individuals who in the past have smoked but presently no longer smoke.
In certain embodiments, the test subjects are apparently healthy subjects otherwise free of current need for treatment for a cardiovascular disease. In some embodiments, the subject is otherwise free of symptoms calling for treatment with any one of any combination of or all of the foregoing categories of agents. For example, with respect to anti-inflammatory agents, the subject is free of symptoms of rheumatoid arthritis, chronic back pain, autoimmune diseases, vascular diseases, viral diseases, malignancies, and the like. In another embodiment, the subject is not at an elevated risk of an adverse cardiovascular event (e.g., subject with no family history of such events, subjects who are nonsmokers, subjects who are nonhyperlipidemic, subjects who do not have elevated levels of a systemic inflammatory marker), other than having an elevated level of one or more oxidized apoA-I related biomolecules.
In some embodiments, the subject is a nonhyperlipidemic subject. A “nonhyperlipidemic” is a subject that is a nonhypercholesterolemic and/or a nonhypertriglyceridemic subject. A “nonhypercholesterolemic” subject is one that does not fit the current criteria established for a hypercholesterolemic subject. A nonhypertriglyceridemic subject is one that does not fit the current criteria established for a hypertriglyceridemic subject (See, e.g., Harrison's Principles of Experimental Medicine, 15th Edition, McGraw-Hill, Inc., N.Y.—hereinafter “Harrison's”). Hypercholesterolemic subjects and hypertriglyceridemic subjects are associated with increased incidence of premature coronary heart disease. A hypercholesterolemic subject has an LDL level of >160 mg/dL, or >130 mg/dL and at least two risk factors selected from the group consisting of male gender, family history of premature coronary heart disease, cigarette smoking (more than 10 per day), hypertension, low HDL (<35 mg/dL), diabetes mellitus, hyperinsulinemia, abdominal obesity, high lipoprotein (a), and personal history of cerebrovascular disease or occlusive peripheral vascular disease. A hypertriglyceridemic subject has a triglyceride (TO) level of >250 mg/dL. Thus, a nonhyperlipidemic subject is defined as one whose cholesterol and triglyceride levels are below the limits set as described above for both the hypercholesterolemic and hypertriglyceridemic subjects.
Knowledge of CAD-determinative alleles, such as those described in Tables 1-4, alone or in conjunction with information on other genetic defects contributing to CAD, al lows customization of a therapy to the individual's genetic profile. For example, subjects having an CAD-determinative allele of AIM1L, PLA2G7, OR7E29P, PLN, PTPN6, C1ORF38, GATA2, IL7R or MYLK, or any polymorphic nucleic acid sequence in linkage disequilibrium with any of these alleles, may be predisposed to developing CAD and may respond better to particular therapeutics that address the particular molecular basis of the disease in the subject. Thus, comparison of an individual's CAD-determinative allele profile to the population profile for CAD, permits the selection or design of drugs or other therapeutic regimens that are expected to be safe and efficacious for a particular subject or subject population (i.e., a group of subjects having the same genetic alteration).
In addition, the ability to target populations expected to show the highest clinical benefit, based on genetic profile can enable: 1) the repositioning of marketed drugs with disappointing market results; 2) the rescue of drug candidates whose clinical development has been discontinued as a result of safety or efficacy limitations, which are subject subgroup-specific; and 3) an accelerated and less costly development for drug candidates and more optimal drug labeling (e.g. since measuring the effect of various doses of an agent on a CAD causative mutation is useful for optimizing effective dose).
The treatment of an individual with a particular therapeutic can be monitored by determining protein, mRNA and/or transcriptional level of a CAD-determinative gene. Depending on the level detected, the therapeutic regimen can then be maintained or adjusted (increased or decreased in dose). In a preferred embodiment, the effectiveness of treating a subject with an agent comprises the steps of: (i) obtaining a preadministration sample from a subject prior to administration of the agent; (ii) detecting the level or amount of a protein, mRNA or genomic DNA in the preadministration sample of a CAD-determinative gene; (iii) obtaining one or more post-administration samples from the subject; (iv) detecting the level of expression or activity of the protein, mRNA or genomic DNA in the post-administration sample of the CAD-determinative gene; (v) comparing the level of expression or activity of the protein, mRNA or genomic DNA of the CAD-determinative gene in the preadministration sample with the corresponding one in the postadministration sample, respectively; and (vi) altering the administration of the agent to the subject accordingly.
Cells of a subject may also be obtained before and after administration of a therapeutic to detect the level of expression of genes other than an CAD-determinative gene to verify that the therapeutic does not increase or decrease the expression of genes which could be deleterious. This can be done, e.g., by using the method of transcriptional profiling. Thus, mRNA from cells exposed in vivo to a therapeutic and mRNA from the same type of cells that were not exposed to the therapeutic could be reverse transcribed and hybridized to a chip containing DNA from numerous genes, to thereby compare the expression of genes in cells treated and not treated with the therapeutic.
In still another aspect, the invention relates to a method of selecting a dose of a cardiovascular protective agent for administration to a subject. The method comprises assessing occurrence in the human's genome of a CAD-determinative allele. Occurrence of any of the polymorphisms is an indication that a greater dose of the agent should be administered to the human. The dose of the agent can be selected based on occurrence of the polymorphisms. A greater number of CAD-determinative polymorphisms indicates a greater dosage.
In certain embodiments, assessment of one or more markers are combined to increase the predictive value of the analysis in comparison to that obtained from the identification of polymorphisms in CAD-determinative allele(s) alone. Such markers may be assessed, for example, by detecting genetic changes in the genes (e.g. mutations or polymorphisms) or by detecting the level of gene products, metabolites or other molecules level in a biological sample obtained from the subject, such as a serum or blood sample. In one embodiment, the levels of one or more markers for myocardial injury, coagulation, or atherosclerotic plaque rupture are measured from a sample from the subject to increase the predictive value of the described methods
In one embodiment, assessment of one or more additional markers indicative of atherosclerotic plaque rupture is combined with detection of polymorphism(s) in CAD-determinative gene(s). Markers of atherosclerotic plaque rupture that may be useful include human neutrophil elastase, inducible nitric oxide synthase, lysophosphatidic acid, malondialdehyde-modified low-density lipoprotein, matrix metalloproteinase-1, matrix metalloproteinase-2, matrix metalloproteinase-3, and matrix metalloproteinase-9. In one embodiment, assessment of one or more additional markers indicative of coagulation is combined with detection of polymorphism(s) in CAD-determinative gene(s). Coagulation markers include β-thromboglobulin, D-dimer, fibrinopeptide A, platelet-derived growth factor, plasmin-α-2-anti-plasmin complex, platelet factor 4, prothrombin fragment 1+2, P-selectin, thrombin-antithrombin III complex, thrombus precursor protein, tissue factor and von Willebrand factor.
In one embodiment, the marker(s) that may be tested in conjunction with the detection of polymorphism(s) in CAD-determinative gene(s) includes soluble tumor necrosis factor-α receptor-2, interleukin-6, lipoprotein-associated phospholipase A2, C-reactive protein (CRP), Creatine Kinase with Muscle and/or Brain subunits (CKMB), thrombin anti-thrombin (TAT), soluble fibrin monomer (SFM), fibrin peptide A (FPA), myoglobin, thrombin precursor protein (TPP), platelet monocyte aggregate (PMA) troponin and homocysteine. In another embodiment, the additional markers can be Annexin V, B-type natriuretic peptide (BNP) which is also called brain-type natriuretic peptide, enolase, Troponin I (TnI), cardiac-troponin T, Creatine kinase (CK), Glycogen phosphorylase (GP), Heart-type fatty acid binding protein (H-FABP), Phosphoglyceric acid mutase (PGAM) and S-100.
In embodiments where one or more markers are used in combination with detection of polymorphism(s) in CAD-determinative gene(s) to increase the predictive value of the analysis, the patient sample from which the level of the additional marker(s) is to be measured may be the same or different from one used to detect polymorphism(s) in CAD-determinative gene(s). In one embodiment, the biological sample from which the level of additional marker is determined is whole blood. Whole blood may be obtained from the subject using standard clinical procedures. In another embodiment, the biological sample is plasma. Plasma may be obtained from whole blood samples by centrifugation of anti-coagulated blood. Such process provides a buffy coat of white cell components and a supernatant of the plasma. In another embodiment, the biological sample is serum. Serum may be obtained by centrifugation of whole blood samples that have been collected in tubes that are free of anti-coagulant. The blood is permitted to clot prior to centrifugation. The yellowish-reddish fluid that is obtained by centrifugation is the serum. The sample may be pretreated as necessary by dilution in an appropriate buffer solution, heparinized, concentrated if desired, or fractionated by any number of methods including but not limited to ultracentrifugation, fractionation by fast performance liquid chromatography (FPLC), or precipitation of apolipoprotein B containing proteins with dextran sulfate or other methods. Any of a number of standard aqueous buffer solutions, employing one of a variety of buffers, such as phosphate, Tris, or the like, at physiological pH can be used.
In certain embodiments, the subject's risk profile for CAD is determined by combining a first risk value, which is obtained by determining the presence of one or more CAD-determinative polymorphisms, with one or more additional risk values to provide a final risk value. Such additional risk values may be obtained by procedures including, but not limited to, determining the subject's blood pressure, assessing the subject's response to a stress test, determining levels of myeloperoxidase, C-reactive protein, low density lipoprotein, or cholesterol in a bodily sample from the subject, or assessing the subject's atherosclerotic plaque burden.
In some embodiments, genetic variations in additional marker genes are combined with detection of polymorphism(s) in a gene not listed in Tables 1 or 2. In specific embodiments, the additional marker gene is selected from apolipoprotein B, apolipoprotein E, paraoxonase 1, type I angiotensin II receptor, cytochrome b-245(alpha), prothrombin, coagulation factor VII, platelet glycoprotein 1b alpha, platelet glycoprotein IIIa, endothelial nitric oxide synthase, 5,10-methylene tetrahydrofolate reductase, angiotensinogen, plasminogen activator inhibitor 1, coagulation factor V, alpha adducin I, cytochrome P450, G-protein beta, polypeptide 3, methionine synthase reductase, endothelial adhesion molecule 1 and cholesteryl ester transferase. Polymorphisms in these genes are described, for example, in U.S. Patent Publication No. 2004/0005566.
In one embodiment, the methods to assess the test subject's risk of developing CAD comprise performing a medical examination of the subject's cardiovascular systems. Such examinations may be useful to increase the predictive power of the methods. Types of medical examinations include, for example, coronary angiography, coronary intravascular ultrasound (IVUS), stress testing (with and without imaging), assessment of carotid intimal medial thickening, carotid ultrasound studies with or without implementation of techniques of virtual histology, coronary artery electron beam computer tomography (EBTC), cardiac computerized tomography (CT) scan, CT angiography, cardiac magnetic resonance imaging (MRI), and magnetic resonance angiography (MRA).
The present invention provides isolated polynucleotides comprising one or more CAD-determinative polymorphic nucleic acid sequences. In some embodiments, the polymorphism is one that is described in
An isolated polymorphic nucleic acid molecule comprises one or more polymorphisms listed in Tables 1-5. Preferred polymorphism are those found in any one of the following genes: A1M1L, PLA2G7, OR7E29P, PLN, PTPN6, C1ORF38, GATA2, IL7R, MYLK, ANPEP, PIK3R4, RPLP2, OLR1, PNPLA2, TCF4, ACP5, SELP, BAX, CPNE4, TALI, KLF15, ABCB1, LHFPL2, ITGAX, LOC389142, PLXNC1, SLA, ELL, NPY, IGSF11, ITPK1, ASB1, SELB, LOC131873, PCCA, HAPIP, PLAUR, SIDT1, RPN1, BPAG1, ROR2, MMP12, GAP43, FSTL1, MAP4, ZNF217, ALOX5, NPHP3, GPNMB, SPP1, ZNF80, MGP, C3ORF15, NEK11, POLQ, ADFP, UBXD1, 38413, FLJ46299, ZBTB20, HLA-DQA2, ZXDC, GRN, PSCD1, GYS1, C14ORF132, CD80, CDGAP, LMOD1, SLC41A3, HOXD1, STAT5A, OPRM1, 1TPR2, HIF1A, PKD2, STEAP, AGTR1, NDUFB4, GLRA3, MEF2A, STXBP5L, APOBEC3D, FMNL1, PLXND1, ATP2Cl, RUVBL1, CASR, PTPRR, SMPDL3A, APOD, APG3L, FLJ35880, TMCC1, CD96, C1QB, CTSD, FLI1, MMP9, TCIRG1, ITGB5, FLJ25414, NR1H3, HSPBAP1, APOC1, THPO, FTL, HADHSC, ALOX5AP, LAIR1, UPP1, LAPTM5, CSTA, ADCY5, PHLDB2, GM2A, NUDT16, ACSL1, VAMP5, ACP2, HLA-DPA1, TUBA3, MMP7, H41, NR112, FGFR2, GBA, CHAF1A, GSK3B, DOCK2, URB, HCLS1, CD200R1, SLCO2B1, B4GALT4, PLCXD2, FABP7, CAMKK2, FCGR1A, SELL, SELE, HNRPM, MGC45840, F5, SMTN, RAI3, HLA-DRA, CSTB, FLJ2592 and TAGLN3.
In a preferred embodiment, the polymorphism is from A1MIL, PLA2G7, OR7E29P, PLN, PTPN6, C1ORF38, GATA2, IL7R or MYLK. For some uses, e.g., in screening assays, CAD-determinative polymorphic nucleic acid molecules will be of at least about 15 nucleotides (nt), at least about 18 nt, at least about 20 nt, or at least about 25 nt in length, and often at least about 50 nt. Such small DNA fragments are useful as primers for polymerase chain reaction (PCR), hybridization screening, etc. Larger polynucleotide fragments, e.g., at least about 50 nt, at least about 100 nt, at least about 200 nt, at least about 300 nt, at least about 500 nt, at least about 1000 nt, at least about 1500 nt, up to the entire coding region, or up to the entire coding region plus up to about 1000 nt 5′ and/or up to about 1000 nt 3′ flanking sequences from a CAD-determinative gene, are useful for production of the encoded polypeptide, promoter motifs, etc. For use in amplification reactions, such as PCR, a pair of primers will be used. The exact composition of primer sequences is not critical to the invention, but for most applications the primers will hybridize to the subject sequence under stringent conditions, as known in the art.
The present invention also provides isolated nucleic acid molecules that contain one or more SNPs disclosed in Tables 1-4, and in preferred embodiments from Table 4. Preferred isolated nucleic acid molecules contain one or more SNPs identified in Tables 1-1. Isolated nucleic acid molecules containing one or more SNPs disclosed in at least one of Tables 1-4 may be interchangeably referred to throughout the present text as “SNP-containing nucleic, acid molecules.” Isolated nucleic acid molecules may optionally encode a full-length variant protein or fragment thereof. The isolated nucleic acid molecules of the present invention also include probes and primers, which may be used for assaying the disclosed SNPs, and isolated full-length genes, transcripts cDNA molecules, and fragments thereof, which may be used for such purposes as expressing an encoded protein.
As used herein, an “isolated nucleic acid molecule” generally is one that contains a SNP of the present invention or a complement thereof and is separated from most other nucleic acids present in the natural source of the nucleic acid molecule. Moreover, an “isolated” nucleic acid molecule, such as a cDNA molecule containing a SNP of the present invention, can be substantially free of other cellular material, or culture medium when produced by recombinant techniques, or chemical precursors or other chemicals when chemically synthesized. A nucleic acid molecule can be fused to other coding or regulatory sequences and still be considered “isolated”. Examples of “isolated” DNA molecules include recombinant DNA molecules maintained in heterologous host cells, and purified (partially or substantially) DNA molecules in solution. Isolated RNA molecules include in vivo or in vitro RNA transcripts of the isolated SNP-containing DNA molecules of the present invention. Isolated nucleic acid molecules according to the present invention further include such molecules produced synthetically.
Generally, an isolated SNP-containing nucleic acid molecule comprises one or more SNP positions disclosed by the present invention with flanking nucleotide sequences on either side of the SNP positions. A flanking sequence can include nucleotide residues that are naturally associated with the SNP site and/or heterologous nucleotide sequences. Preferably the flanking sequence is up to about 500, 300, 100, 60, 50, 30, 25, 20, 15, 10, 8, or 4 nucleotides (or any other length in-between) on either side of a SNP position, or as long as the full-length gene or entire protein-coding sequence (or any portion thereof such as an exon), especially if the SNP-containing nucleic acid molecule is to be used to produce a protein or protein fragment.
Table 4 shows SNP-containing nucleic acid molecules having 20 nucleotides flanking the SNP site. In one embodiment, the invention provides an isolated SNP-containing nucleic acid molecule comprises the nucleotide sequence of any one of SEQ ID NOs: 1-575. In another embodiment, the SNP-containing nucleic acid molecule provided by the invention comprises a nucleotide sequence identical to 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 27, 28, 29, 30, 31, 32, 34, 35, 36, 37, 38, 39, or 40 contiguous nucleotides of any one of SEQ ID NOs: 1-575. In another embodiment, the SNP-containing nucleic acid molecule provided by the invention comprises a nucleotide sequence identical to 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 27, 28, 29, 30, 31, 32, 34, 35, 36, 37, 38, 39, or 40 contiguous nucleotides of any one of SEQ ID NOs: 1-575 wherein the contiguous nucleotides contain the SNP site (shown in brackets, i.e. “[ ]” in Table 4).
For full-length genes and entire protein-coding sequences, a SNP flanking sequence can be, for example, up to about 5 Kb, 4 Kb, 3 Kb, 2 Kb, 1 Kb on either side of the SNP. Furthermore, in such instances, the isolated nucleic acid molecule comprises exonic sequences (including protein-coding and/or non-coding exonic sequences), but may also include intronic sequences. Thus, any protein coding sequence may be either contiguous or separated by introns. The important point is that the nucleic acid is isolated from remote and unimportant flanking sequences and is of appropriate length such that it can be subjected to the specific manipulations or uses described herein such as recombinant protein expression, preparation of probes and primers for assaying the SNP position, and other uses specific to the SNP-containing nucleic acid sequences.
An isolated nucleic acid molecule of the present invention further encompasses a SNP-containing polynucleotide that is the product of any one of a variety of nucleic acid amplification methods, which are used to increase the copy numbers of a polynucleotide of interest in a nucleic acid sample. Such amplification methods are well known in the art, and they include but are not limited to, polymerase chain reaction (PCR) (U.S. Pat. Nos. 4,683,195; and 4,683,202; PCR Technology: Principles and Applications for DNA Amplification, ed. H. A. Erlich, Freeman Press, NY, N.Y., 1992), ligase chain reaction (LCR) (Wu and Wallace, Genomics 4:560, 1989; Landegren et al., Science 241:1077, 1988), strand displacement amplification (SDA) (U.S. Pat. Nos. 5,270,184; and 5,422,252), transcription-mediated amplification (TMA) (U.S. Pat. No. 5,399,491), linked linear amplification (LLA) (U.S. Pat. No. 6,027,923), and the like, and isothermal amplification methods such as nucleic acid sequence based amplification (NASBA), and self-sustained sequence replication (Guatelli et al., Proc. Natl. Acad. Sci. USA 87: 1874, 1990). Based on such methodologies, a person skilled in the art can readily design primers in any suitable regions 5′ and 3′ to a SNP disclosed herein. Such primers may be used to amplify DNA of any length so long as it contains the SNP of interest in its sequence.
As used herein, an “amplified polynucleotide” of the invention is a SNP-containing nucleic acid molecule whose amount has been increased at least two fold by any nucleic acid amplification method performed in vitro as compared to its starting amount in a test sample. In other preferred embodiments, an amplified polynucleotide is the result of at least ten fold, fifty fold, one hundred fold, one thousand fold, or even ten thousand fold increase as compared to its starting amount in a test sample. In a typical PCR amplification, a polynucleotide of interest is often amplified at least fifty thousand fold in amount over the unamplified genomic DNA, but the precise amount of amplification needed for an assay depends on the sensitivity of the subsequent detection method used.
Generally, an amplified polynucleotide is at least about 16 nucleotides in length. More typically, an amplified polynucleotide is at least about 20 nucleotides in length. In a preferred embodiment of the invention, an amplified polynucleotide is at least about 30 nucleotides in length. In a more preferred embodiment of the invention, an amplified polynucleotide is at least about 32, 40, 45, 50, or, 60 nucleotides in length. In yet another preferred embodiment of the invention, an amplified polynucleotide is at least about 100, 200, 300, 400, or 500 nucleotides in length. While the total length of an amplified polynucleotide of the invention can be as long as an exon, an intron or the entire gene where the SNP of interest resides, an amplified product is typically up to about 1,000 nucleotides in length (although certain amplification methods may generate amplified products greater than 1000 nucleotides in length). More preferably, an amplified polynucleotide is not greater than about 600-700 nucleotides in length. It is understood that irrespective of the length of an amplified polynucleotide, a SNP of interest may be located anywhere along its sequence.
In a specific embodiment of the invention, the amplified product is at least about 21 nucleotides in length, comprises one of the transcript-based context sequences or the genomic-based context sequences shown in Tables 1-4. Such a product may have additional sequences on its 5′ end or 3′ end or both. In another embodiment, the amplified product is about 21 nucleotides in length, and it contains a SNP disclosed herein. Preferably, the SNP is located at the middle of the amplified product (e.g., at position 11 in an amplified product that is 21 nucleotides in length, or at position 51 in an amplified product that is 101 nucleotides in length), or within 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 15, or 20 nucleotides from the middle of the amplified product, (however, as indicated above, the SNP of interest may be located anywhere along the length of the amplified product).
The present invention provides isolated nucleic acid molecules that comprise, consist of, or consist essentially of one or more polynucleotide sequences that contain one or more SNPs disclosed herein, complements thereof, and SNP-containing fragments thereof.
The isolated nucleic acid molecules can encode mature proteins plus additional amino or carboxyl-terminal amino acids or both, or amino acids interior to the mature peptide (when the mature form has more than one peptide chain, for instance). Such sequences may play a role in processing of a protein from precursor to a mature form, facilitate protein trafficking, prolong or shorten protein half-life, or facilitate manipulation of a protein for assay or production. As generally is the case in situ, the additional amino acids may be processed away from the mature protein by cellular enzymes.
Thus, the isolated nucleic acid molecules include, but are not limited to, nucleic acid molecules having a sequence encoding a peptide alone, a sequence encoding a mature peptide and additional coding sequences such as a leader or secretory sequence (e.g., a pre-pro or pro-protein sequence), a sequence encoding a mature peptide with or without additional coding sequences, plus additional non-coding sequences, for example introns and non-coding 5′ and 3′ sequences such as transcribed but untranslated sequences that play a role in, for example, transcription, mRNA processing (including splicing and polyadenylation signals), ribosome binding, and/or stability of mRNA. In addition, the nucleic acid molecules may be fused to heterologous marker sequences encoding, for example, a peptide that facilitates purification.
Isolated nucleic acid molecules can be in the form of RNA, such as mRNA, or in the form DNA, including cDNA and genomic DNA, which may be obtained, for example, by molecular cloning or produced by chemical synthetic techniques or by a combination thereof: (Sambrook and Russell, 2000, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press, NY). Furthermore, isolated nucleic acid molecules, particularly SNP detection reagents such as probes and primers, can also be partially or completely in the form of one or more types of nucleic acid analogs, such as peptide nucleic acid (PNA) (U.S. Pat. Nos. 5,539,082; 5,527,675; 5,623,049; 5,714,331). The nucleic acid, especially DNA, can be double-stranded or single-stranded. Single-stranded nucleic acid can be the coding strand (sense strand) or the complementary non-coding; from fragments of the human genome (in the case of DNA or RNA) or single nucleotides, short oligonucleotide linkers, or from a series of oligonucleotides, to provide a synthetic nucleic acid molecule. Nucleic acid molecules can be readily synthesized using the sequences provided herein as a reference; oligonucleotide and PNA oligomer synthesis techniques are well-known in the art (see, e.g., Corey, “Peptide nucleic acids: expanding the scope of nucleic acid recognition”, Trends Biotechnol. June 1997; 15(6):224-9, and Hyrup et al., “Peptide nucleic acids (PNA): synthesis, properties and potential applications”, Bioorg Med. Chem. January 1996; 4(1):5-23). Furthermore, large-scale automated oligonucleotide/PNA synthesis (including synthesis on an array or bead surface or other solid support) can readily be accomplished using commercially available nucleic acid synthesizers, such as the Applied Biosystems (Foster City, Calif.) 3900 High-Throughput DNA Synthesizer or Expedite 8909 Nucleic Acid Synthesis System, and the sequence information provided herein.
The present invention encompasses nucleic acid analogs that contain modified, synthetic, or non-naturally occurring nucleotides or structural elements or other alternative/modified nucleic acid chemistries known in the art. Such nucleic acid analogs are useful, for example, as detection reagents (e.g., primers/probes) for detecting one or more SNPs identified in Tables 1-4. Furthermore, kits/systems (such as beads, arrays, etc.) that include these analogs are also encompassed by the present invention. For example, PNA oligomers that are based on the polymorphic sequences of the present invention are specifically contemplated. PNA oligomers are analogs of DNA in which the phosphate backbone is replaced with a peptide-like backbone (Lagriffoul et al., Bioorganic & Medicinal Chemistry Letters, 4: 1081-1082 (1994), Petersen et al., Bioorganic & Medicinal Chemistry Letters, 6: 793-796 (1996), Kumar et al., Organic Letters 3(9): 1269-1272:(2001), WO96/04000). PNA hybridizes to complementary RNA or DNA with higher affinity and specificity than conventional oligonucleotides and oligonucleotide analogs. The properties of PNA enable novel molecular biology and biochemistry applications unachievable with traditional oligonucleotides and peptides.
Additional examples of nucleic acid modifications that improve the binding properties and/or stability of a nucleic acid include the use of base analogs such as U.S. Pat. No. 5,801,115). Thus, references herein to nucleic acid molecules, SNP-containing nucleic acid molecules, SNP detection reagents (e.g., probes and primers), oligonucleotides/polynucleotides include PNA oligomers and other nucleic acid analogs. Other examples of nucleic acid analogs and alternative/modified nucleic acid chemistries known in the art are described in Current Protocols in Nucleic Acid Chemistry, John Wiley & Sons, N.Y. (2002).
The present invention further provides nucleic acid molecules that encode fragments of the variant polypeptides disclosed herein as well as nucleic acid molecules that encode obvious variants of such variant polypeptides. Such nucleic acid molecules may be naturally occurring, such as paralogs (different locus) and orthologs (different organism), or may be constructed by recombinant DNA methods or by chemical synthesis. Non-naturally occurring variants may be made by mutagenesis techniques, including those applied to nucleic acid molecules, cells, or organisms. Accordingly, the variants can contain nucleotide substitutions, deletions, inversions and insertions (in addition to the SNPs disclosed in Tables 1-4). Variation can occur in either or both the coding and non-coding regions. The variations can produce conservative and/or non-conservative amino acid substitutions.
The nucleic acid molecules of the invention may be used as probes. When used as a probe, an isolated polymorphic CAD-determinative nucleic acid molecule may comprise non-CAD-determinative nucleotide sequences, as long as the additional non-CAD-determinative nucleotide sequences do not interfere with the detection assay. A probe may comprise an isolated polymorphic CAD-determinative sequence, and any number of non-CAD-determinative nucleotide sequences, e.g., from about 1 bp to about 1 kb or more.
For screening purposes, hybridization probes of the polymorphic sequences may be used where both forms are present, either in separate reactions, spatially separated on a solid phase matrix, or labeled such that they can be distinguished from each other. Assays (described below) may utilize nucleic acids that hybridize to one or more of the described polymorphisms. Isolated polymorphic CAD-determinative nucleic acid molecules of the invention may be coupled (e.g., chemically conjugated), directly or indirectly (e.g., through a linker molecule) to a solid substrate. Solid substrates may be any known in the art including, but not limited to, beads, e.g., polystyrene beads; chips, e.g., glass, SiO2, and the like; plastic surfaces, e.g., polystyrene, polycarbonate plastic multi-well plates; and the like.
Additional CAD-determinative gene polymorphisms may be identified using any of a variety of methods known in the art, including, but not limited to SSCP, denaturing HPLC, and sequencing. SSCP may be used to identify additional CAD-determinative gene polymorphisms. In general, PCR primers and restriction enzymes are chosen so as to generate products in a size range of from about 25 bp to about 500 bp, or from about 100 bp to about 250 bp, or any intermediate or overlapping range therein.
The invention further relates to a kit for assessing relative susceptibility of a human to developing CAD. The kit comprises reagents for assessing occurrence in the human's genome of a CAD-determinative polymorphism in at least one, two, three, four or five or more of the CAD-determinative genes. Another aspect of the invention provides kits for detecting a predisposition for developing a CAD.
The kits may contain one or more oligonucleotides, including 5′ and 3′ oligonucleotides that hybridize 5′ and 3′ to at least one allele of a CAD-determinative locus haplotype, such as to any of the SNPs listed in Tables 1 and 2. PCR-amplification oligonucleotides should hybridize between 25 and 2500 base pairs apart, preferably between about 100 and about 500 bases apart, in order to produce a PCR product of convenient size for subsequent analysis.
The design of oligonucleotides for use in the amplification and detection of CAD-determinative polymorphic alleles by the method of the invention is facilitated by the availability of public genomic data for the CAD-determinative genes. Suitable primers for the detection of a human polymorphism in these genes can be readily designed using this sequence information and standard techniques known in the art for the design and optimization of primers sequences. Optimal design of such primer sequences can be achieved, for example, by the use of commercially available primer selection programs such as Primer 2.1, Primer 3 or GeneFisher.
For use in a kit, oligonucleotides may be any of a variety of natural and/or synthetic compositions such as synthetic oligonucleotides, restriction fragments, cDNAs, synthetic peptide nucleic acids (PNAs), and the like. The assay kit and method may also employ labeled oligonucleotides to allow ease of identification in the assays. Examples of labels which may be employed include radio-labels, enzymes, fluorescent compounds, streptavidin, avidin, biotin, magnetic moieties, metal binding moieties, antigen or antibody moieties, and the like.
The kit may, optionally, also include DNA sampling means. DNA sampling means are well known to one of skill in the art and can include, but not be limited to substrates, such as filter papers, the AmpliCard™ (University of Sheffield, Sheffield, England S10 2JF; Tarlow, J W, et al., J. of Invest. Dematol. 103:387-389 (1994)) and the like; DNA purification reagents such as Nucleon™ kits, lysis buffers, proteinase solutions and the like; PCR reagents, such as 10× reaction buffers, thermostable polymerase, dNTPs, and the like; and allele detection means such as the HinfI restriction enzyme, allele specific oligonucleotides, degenerate oligonucleotide primers for nested PCR from dried blood.
A person skilled in the art will recognize that, based on the SNP and associated sequence information disclosed herein, detection reagents can be developed and used to assay any SNP of the present invention individually or in combination, and such detection reagents can be readily incorporated into one of the established kit or system formats which are well known in the art. The terms “kits” and “systems”, as used herein in the context of SNP detection reagents, are intended to refer to such things as combinations of multiple SNP detection reagents, or one or more SNP detection reagents in combination with one or more other types of elements or components (e.g., other types of biochemical reagents, containers, packages such as packaging intended for commercial sale, substrates to which SNP detection reagents are attached, electronic hardware components, etc.). Accordingly, the present invention further provides SNP detection kits and systems, including but not limited to, packaged probe and primer sets (e.g., TaqMan probe/primer sets), arrays/microarrays of nucleic acid molecules, and beads that contain one or more probes, primers, or other detection reagents for detecting one or more SNPs of the present invention. The kits/systems can optionally include various electronic hardware components; for example, arrays (“DNA chips”) and microfluidic systems (“lab-on-a-chip” systems) provided by various manufacturers typically comprise hardware components. Other kits/systems (e.g., probe/primer sets) may not include electronic hardware components, but may be comprised of, for example, one or more SNP detection reagents (along with, optionally, other biochemical reagents) packaged in one or more containers.
In some embodiments, a SNP detection kit typically contains one or more detection reagents and other components (e.g., a buffer, enzymes such as DNA polymerases or ligases, chain extension nucleotides such as deoxynucleotide triphosphates, and in the case of Sanger-type DNA sequencing reactions, chain terminating nucleotides, positive control sequences, negative control sequences, and the like) necessary to carry out an assay or reaction, such as amplification and/or detection of a SNP-containing nucleic acid molecule. A kit may further contain means for determining the amount of a target nucleic acid, and means for comparing the amount with a standard, and can comprise instructions for using the kit to detect the SNP-containing nucleic acid molecule of interest. In one embodiment of the present invention, kits are provided which contain the necessary reagents to carry out one or more assays to detect one or more SNPs disclosed herein. In a preferred embodiment of the present invention, SNP detection kits/systems are in the form of nucleic acid arrays, or compartmentalized kits, including microfluidic/lab-on-a-chip systems.
One aspect of the invention provides DNA microarrays containing one or more SNP nucleic acid molecules. In one embodiment, the microarray includes 1, 2, 3, 4, 5 or more polymorphic CAD-determinative nucleic acid molecules e.g., probes or primers described herein, that are capable of detecting (e.g., hybridizing to) a polymorphic CAD-determinative nucleic acid molecules. Isolated polymorphic CAD-determinative nucleic acid molecules can be obtained by chemical or biochemical synthesis, by recombinant DNA techniques, or by isolating the nucleic acids from a biological source, or a combination of any of the foregoing. For example, the nucleic acid may be synthesized using solid phase synthesis techniques, as are known in the art. Oligonucleotide synthesis is also described in Edge et al. (1981) Nature 292:756; Duckworth et al. (1981) Nucleic Acids Res. 9:1691 and Beaucage and Caruthers (1981) Tet. Letters 22:1859. Following preparation of the nucleic acid, the nucleic acid is then ligated to other members of the expression system to produce an expression cassette or system comprising a nucleic acid encoding the subject product in operational combination with transcriptional initiation and termination regions, which provide for expression of the nucleic acid into the subject polypeptide products under suitable conditions.
SNP detection kits/systems may contain, for example, one or more probes, or pairs of probes, that hybridize to a nucleic acid molecule at or near each target SNP position. Multiple pairs of allele-specific probes may be included in the kit/system to simultaneously assay large numbers of SNPs, at least one of which is a SNP of the present invention. In some kits/systems, the allele-specific probes are immobilized to a substrate such as an array or bead. For example, the same substrate can comprise allele-specific probes for detecting at least 1; 10; 100; 1000; 10,000; 100,000 (or any other number in-between) or substantially all of the SNPs shown in Tables 1-5.
The terms “arrays”, “microarrays”, and “DNA chips” are used herein interchangeably to refer to an array of distinct polynucleotides affixed to a substrate, such as glass, plastic, paper, nylon or other type of membrane, filter, chip, or any other suitable solid support. The polynucleotides can be synthesized directly on the substrate, or synthesized separate from the substrate and then affixed to the substrate. In one embodiment, the microarray is prepared and used according to the methods described in U.S. Pat. No. 5,837,832, Chee et al., PCT application WO95/11995 (Chee et al.), Lockhart, D. J. et al. (1996; Nat. Biotech. 14: 1675-1680) and Schena, M. et al. (1996; Proc. Natl. Acad. Sci. 93: 10614-10619), all of which are incorporated herein in their entirety by reference. In other embodiments, such arrays are produced by the methods described by Brown et al., U.S. Pat. No. 5,807,522.
Nucleic acid arrays are reviewed in the following references: Zammatteo et al., “New chips for molecular biology and diagnostics”, Biotechnol Annu Rev. 2002; 8:85-101; Sosnowski et al., “Active microelectronic array system for DNA hybridization, genotyping and pharmacogenomic applications”, Psychiatr Genet. December 2002; 12(4): 181-92; Heller, “DNA microarray technology: devices, systems, and applications”; Annu Rev Biomed Eng. 2002; 4: 129-53. Epub Mar. 22, 2002; Kolchirisky et al., “Analysis of SNPs and other genomic variations using gel-based chips”, Hum Mutat. April 2002; 19(4):343-60; and McGall et al., “High-density genechip oligonucleotide probe arrays”, Adv Biochem Eng Biotechnol. 2002; 77:21-42.
Any number of probes, such as allele-specific probes, may be implemented in an array, and each probe or pair of probes can hybridize to a different SNP position. In the case of polynucleotide probes, they can be synthesized at designated areas (or synthesized separately and then affixed to designated areas) on a substrate using a tight-directed chemical process. Each DNA chip can contain, for example, thousands to millions of individual synthetic polynucleotide probes arranged in a grid-like pattern and miniaturized (e.g., to the size of a dime). Preferably, probes are attached to a solid support in an ordered, addressable array.
A microarray can be composed of a large number of unique, single-stranded polynucleotides, usually either synthetic antisense polynucleotides or fragments of cDNAs, fixed to a solid support. Typical polynucleotides are preferably about 6-60 nucleotides in length, more preferably about 15-30 nucleotides in length, and most preferably about 18-25 nucleotides in length. For certain types of microarrays or other detection kits/systems, it may be preferable to use oligonucleotides that are only about 7-20 nucleotides in length. In other types of arrays, such as arrays used in conjunction with chemiluminescent detection technology, preferred probe lengths can be, for example, about 15-80 nucleotides in length, preferably about 50-70-nucleotides in length, more preferably about 5565 nucleotides in length, and most preferably about 60 nucleotides in length. The microarray or detection kit can contain polynucleotides that cover the known 5′ or 3′ sequence of a gene/transcript or target SNP site, sequential polynucleotides that cover the full-length sequence of a gene/transcript; or unique polynucleotides selected from particular areas along the length of a target gene/transcript sequence, particularly areas corresponding to one or more SNPs disclosed in Table 1 and/or Table 2. Polynucleotides used in the microarray or detection kit can be specific to a SNP or SNPs of interest (e.g., specific to a particular SNP allele at a target SNP site, or specific to particular SNP alleles at multiple different SNP sites), or specific to a polymorphic gene/transcript or genes/transcripts of interest.
Hybridization assays based on polynucleotide arrays rely on the differences in hybridization stability of the probes to perfectly matched and mismatched target sequence variants. For SNP genotyping, it is generally preferable that stringency conditions used in hybridization assays are high enough such that nucleic acid molecules that differ from one another at as little as a single SNP position can be differentiated (e.g., typical SNP hybridization assays are designed so that hybridization will occur only if one particular nucleotide is present at a SNP position, but will not occur if an alternative nucleotide is present at that SNP position). Such high stringency conditions may be preferable when using, for example, nucleic acid arrays of allele-specific probes for SNP detection. Such high stringency conditions are described in the preceding section, and are well known to those skilled in the art and can be found in, for example, Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6.
In other embodiments, the arrays are used in conjunction with chemiluminescent detection technology. The following patents and patent applications, which are all hereby incorporated by reference, provide additional information pertaining to chemiluminescent detection: U.S. patent application Ser. Nos. 10/620,332 and 10/620,333 describe chemiluminescent approaches for microarray detection; U.S. Pat. Nos. 6,124,478, 6,107,024, 5,994,073, 5,981,768, 5,871,938, 5,843,681, 5,800,999, and 5,773,628 describe methods and compositions of dioxetane for performing chemiluminescent detection; and U.S. Published application US2002/0110828 discloses methods and compositions for microarray controls.
In one embodiment of the invention, a nucleic acid array can comprise an array of probes of about 15-25 nucleotides in length. In further embodiments, a nucleic acid array can comprise any number of probes, in which at least one probe is capable of detecting one or more SNPs disclosed in Tables 1-4, and/or at least one probe comprises a fragment of one of the sequences selected from the group consisting of those disclosed in Table 1-4, the Sequence Listing, and sequences complementary thereto, said fragment comprising at least about 8 consecutive nucleotides, preferably 10, 12, 15, 16, 18, 20, more preferably 22, 25, 30, 40, 47, 50, 55, 60, 65, 70, 80, 90, 100, or more consecutive nucleotides (or any other number in-between) and containing (or being complementary to) a novel SNP allele disclosed in Table 1-4. In some embodiments, the nucleotide complementary to the SNP site is within 5, 4, 3, 2, or 1 nucleotide from the center of the probe, more preferably at the center of said probe.
A polynucleotide probe can be synthesized on the surface of the substrate by using a chemical coupling procedure and an ink jet application apparatus, as described in PCT application WO95/251116 (Baldeschweiler et al.) which is incorporated herein in its entirety by reference. In another aspect, a “gridded” array analogous to a dot (or slot) blot may be used to arrange and link cDNA fragments or oligonucleotides to the surface of a substrate using a vacuum system, thermal, UV, mechanical or chemical bonding procedures. An array, such as those described above, may be produced by hand or by using available devices (slot blot or dot blot apparatus), materials (any suitable solid support), and machines (including robotic instruments), and may contain 8, 24, 96, 384, 1536, 6144 or more polynucleotides, or any other, number which lends itself to the efficient use of commercially available instrumentation.
Using such arrays or other kits/systems, the present invention provides methods of identifying the SNPs disclosed herein in a test sample. Such methods typically involve incubating a test sample of nucleic acids with an array comprising one or more probes corresponding to at least one SNP position of the present invention, and assaying for binding of a nucleic acid from the test sample with one or more of the probes. Conditions for incubating a SNP detection reagent (or a kit/system that employs one or more such SNP detection reagents) with a test sample vary. Incubation conditions depend on such factors as the format employed in the assay, the detection methods employed, and the type and nature of the detection reagents used in the assay. One skilled in the art will recognize that any one of the commonly available hybridization, amplification and array assay formats can readily be adapted to detect the SNPs disclosed herein.
A SNP detection kit/system of the present invention may include components that are used to prepare nucleic acids from a test sample for the subsequent amplification and/or detection of a SNP-containing nucleic acid molecule. Such sample preparation components can be used to produce nucleic acid extracts (including DNA and/or RNA), proteins or membrane extracts from any bodily fluids (such as blood, serum, plasma, urine, saliva, phlegm, gastric juices, semen, tears, sweat, etc.), skin, hair, cells (especially nucleated cells), biopsies, buccal swabs or tissue specimens. The test samples used in the above-described methods will vary based on such factors as the assay format, nature of the detection method, and the specific tissues, cells or extracts used as the test sample to be assayed. Methods of preparing nucleic acids, proteins, and cell extracts are well known in the art and can be readily adapted to obtain a sample that is compatible with the system utilized. Automated sample preparation systems for extracting nucleic acids from a test sample are commercially available, and examples are Qiagen's BioRobot 9600, Applied Biosystems' PRISM™ 6700 sample preparation system, and Roche Molecular Systems' COBAS AmpliPrep System.
Another form of kit contemplated by the present invention is a compartmentalized kit. A compartmentalized kit includes any kit in which reagents are contained in separate containers. Such containers include, for example, small glass containers, plastic containers, strips of plastic, glass or paper, or arraying material such as silica. Such containers allow one to efficiently transfer reagents from one compartment to another compartment such that the test samples and reagents are not cross-contaminated, or from one container to another vessel not included in the kit, and the agents or solutions of each container can be added in a quantitative fashion from one compartment to another or to another vessel. Such containers may include, for example, one or more containers which will accept the test sample, one or more containers which contain at least one probe or other SNP detection reagent for detecting one or more SNPs of the present invention, one or more containers which contain wash reagents (such as phosphate buffered saline, Tris-buffers, etc.), and one or more containers which contain the reagents used to reveal the presence of the bound probe or other SNP detection reagents. The kit can optionally further comprise compartments and/or reagents for, for example, nucleic acid amplification or other enzymatic reactions such as primer extension reactions, hybridization, ligation, electrophoresis (preferably capillary electrophoresis), mass spectrometry, and/or laser-induced fluorescent detection. The kit may also include instructions for using the kit. Exemplary compartmentalized kits include microfluidic devices known in the art (see, e.g., Weigl et al., “Lab-on-a-chip for drug development”, Adv Drug Deliv Rev. Feb. 24, 2003; 55(3):349-77). In such microfluidic devices, the containers may be referred to as, for example, microfluidic “compartments”, “chambers”, or “channels”.
Microfluidic devices, which may also be referred to as “lab-on-a-chip” systems, biomedical micro-electro-mechanical systems (bioMEMs), or multicomponent integrated systems, are exemplary kits/systems of the present invention for analyzing SNPs. Such systems miniaturize and compartmentalize processes such as probe/target hybridization, nucleic acid amplification, and capillary electrophoresis reactions in a single functional device. Such microfluidic devices typically utilize detection reagents in at least one aspect of the system, and such detection reagents may be used to detect one or more SNPs of the present invention. One example of a microfluidic system is disclosed in U.S. Pat. No. 5,589,136, which describes the integration of PCR amplification and capillary electrophoresis in chips. Exemplary microfluidic systems comprise a pattern of microchannels designed onto a glass, silicon, quartz, or plastic wafer included on a microchip. The movements of the samples may be controlled by electric, electroosmotic or hydrostatic forces applied across different areas of the microchip to create functional microscopic valves and pumps with no moving parts. Varying the voltage can be used as a means to control the liquid flow at intersections between the micro-machined channels and to change the liquid flow rate for pumping across different sections of the microchip. See, for example, U.S. Pat. Nos. 6,153,073, Dubrow et al., and U.S. Pat. No. 6,156,181, Parce et al.
For genotyping SNPs, an exemplary microfluidic system may integrate, for example, nucleic acid amplification, primer extension, capillary electrophoresis, and a detection method such as laser induced fluorescence detection. In a first step of an exemplary process for using such an exemplary system, nucleic acid samples are amplified, preferably by PCR. Then, the amplification products are subjected to automated primer extension reactions using ddNTPs (specific fluorescence for each ddNTP) and the appropriate oligonucleotide primers to carry out primer extension reactions which hybridize just upstream of the targeted SNP. Once the extension at the 3′ end is completed, the primers are separated from the unincorporated fluorescent ddNTPs by capillary electrophoresis. The separation medium used in capillary electrophoresis can be, for example, polyacrylamide, polyethyleneglycol or dextran. The incorporated ddNTPs in the single nucleotide primer extension products are identified by laser-induced fluorescence detection. Such an exemplary microchip can be used to process, for example, at least 96 to 384 samples, or more, in parallel.
In another aspect, the invention features methods of treating a subject, e.g., a human, at risk of developing a cardiovascular disease, such as coronary artery disease (CAD). The methods include: identifying a subject having, or at risk of developing, CAD, and administering to the subject an agent that decreases CAD-determinative gene signaling (e.g., decreases CAD-determinative gene expression, levels or activity).
The present invention also relates to methods of treating a subject to reduce the risk of developing CAD or a complication from CAD. In one embodiment, the method comprises determining the presence of one or more CAD-determinative polymorphisms in the subject, and for subjects with one, two, three, four, five or more such polymorphisms, administering an agent expected to reduce the onset of cardiovascular disease. In one embodiment, the agent is selected from an anti-inflammatory agent, an antithrombotic agent, an anti-platelet agent, a fibrinolytic agent, a lipid reducing agent, a direct thrombin inhibitor, a glycoprotein Ilb/IIIa receptor inhibitor, a calcium channel blocker, a beta-adrenergic receptor blocker, a cyclooxygenase-2 inhibitor, an angiotensin system inhibitor, and/or combinations thereof. The agent is administered in an amount effective to lower the risk of the subject developing a the cardiovascular disease.
Anti-inflammatory agents include but are not limited to, Aldlofenac; Aldlometasone Dipropionate; Algestone Acetonide; Alpha Amylase; Amcinafal; Amcinafide; Amfenac Sodium; Amiprilose Hydrochloride; Anakinra; Anirolac; Anitrazafen; Apazone; Balsalazide Disodium; Bendazac; Benoxaprofen; Benzydamine Hydrochloride; Bromelains; Broperamole; Budesonide; Carprofen; Cicloprofen; Cintazone; Cliprofen; Clobetasol Propionate; Clobetasone Butyrate; Clopirac; Cloticasone Propionate, Cornethasone Acetate; Cortodoxone; Deflazacort; Desonide; Desoximetasone; Dexamethasone Dipropionate; Diclofenac Potassium; Diclofenac Sodium; Diflorasone Diacetate; Diflumidone Sodium; Diflunisal; Difluprednate; Diftalone; Dimethyl Sulfoxide; Drocinonide; Endrysone; Enlimomab; Enolicam Sodium; Epirizole; Etodolac; Etofenamate; Felbinac; Fenamole; Fenbufen; Fenclofenac; Fenclorac; Fendosal; Fenpipalone; Fentiazac; Flazalone; Fluazacort; Flufenamic Acid; Flumizole; Flunisolide Acetate; Flunixin; Flunixin Meglumine; Fluocortin Butyl; Fluorometholone Acetate; Fluquazone; Flurbiprofen; Fluretofen; Fluticasone Propionate; Furaprofen; Furobufen; Halcinonide; Halobetasol Propionate; Halopredone Acetate; Ibufenac; Ibuprofen; Ibuprofen Aluminum; Ibuprofen Piconol; Ilonidap; Indomethacin; Indomethacin Sodium; Indoprofen; Indoxole; Intrazole; Isoflupredone Acetate; Isoxepac; Isoxicam; Ketoprofen; Lofemizole Hydrochloride; Lomoxicam; Loteprednol Etabonate; Meclofenamate Sodium; Meclofenamic Acid; Meclorisone Dibutyrate; Mefenamic Acid; Mesalamine; Meseclazone; Methylprednisolone Suleptanate; Morniflumate; Nabumetone; Naproxen; Naproxen Sodium; Naproxol; Nimazone; Olsalazine Sodium; Orgotein; Orpanoxin; Oxaprozin; Oxyphenbutazone; Paranyline Hydrochloride; Pentosan Polysulfate Sodium; Phenbutazone Sodium Glycerate; Pirfenidone; Piroxicam; Piroxicam Cinnamate; Piroxicam Olamine; Pirprofen; Prednazate; Prifelone; Prodolic Acid; Proquazone; Proxazole; Proxazole Citrate; Rimexolone; Romazarit; Salcolex; Salnacedin; Salsalate; Salycilates; Sanguinarium Chloride; Seclazone; Sermetacin; Sudoxicam; Sulindac; Suprofen; Talmetacin; Talniflumate; Talosalate; Tebufelone; Tenidap; Tenidap Sodium; Tenoxicam; Tesicam; Tesimide; Tetrydamine; Tiopinac; Tixocortol Pivalate; Tolmetin; Tolmetin Sodium; Triclonide; Triflumidate; Zidometacin; Glucocorticoids; Zomepirac Sodium.
Anti-thrombotic and/or fibrinolytic agents include but are not limited to, Plasminogen (to plasmin via interactions of prekallikrein, kininogens, Factors XII, XIIIa, plasminogen proactivator, and tissue plasminogen activator[TPA]) Streptokinase; Urokinase: Anisoylated Plasminogen-Streptokinase Activator Complex; Pro-Urokinase; (Pro-UK); rTPA (alteplase or activase; r denotes recombinant); rPro-UK; Abbokinase; Eminase; Sreptase Anagrelide Hydrochloride; Bivalirudin; Dalteparin Sodium; Danaparoid Sodium; Dazoxiben Hydrochloride; Efegatran Sulfate; Enoxaparin Sodium; Ifetroban; Ifetroban Sodium; Tinzaparin Sodium; retaplase; Trifenagrel; Warfarin; Dextrans.
Anti-platelet agents include but are not limited to, Clopridogrel; Sulfinpyrazone; Aspirin; Dipyridamole; Clofibrate; Pyridinol Carbamate; PGE; Glucagon; Antiserotonin drugs; Caffeine; Theophyllin Pentoxifyllin; Ticlopidine; Anagrelide.
Lipid-reducing agents include but are not limited to, gemfibrozil, cholystyramine, colestipol, nicotinic acid, probucol lovastatin, fluvastatin, simvastatin, atorvastatin, pravastatin, cerivastatin, and other HMG-CoA reductase inhibitors.
Direct thrombin inhibitors include but are not limited to, hirudin, hirugen, hirulog, agatroban, PPACK, thrombin aptamers.
Glycoprotein IIb/IIIa receptor inhibitors are both antibodies and non-antibodies, and include but are not limited to ReoPro (abcixamab), lamifiban, tirofiban.
Calcium channel blockers are a chemically diverse class of compounds having important therapeutic value in the control of a variety of diseases including several cardiovascular disorders, such as hypertension, angina, and cardiac arrhythmias (Fleckenstein, Cir. Res. v. 52, (suppl. 1), p. 13-16 (1983); Fleckenstein, Experimental Facts and Therapeutic Prospects, John Wiley, New York (1983); McCall, D., Curr Pract Cardiol, v. 10, p. 1-11 (1985)). Calcium channel blockers are a heterogenous group of drugs that prevent or slow the entry of calcium into cells by regulating cellular calcium channels. (Remington, The Science and Practice of Pharmacy, Nineteenth Edition, Mack Publishing Company, Eaton, Pa., p. 963 (1995)). Most of the currently available calcium channel blockers, and useful according to the present invention, belong to one of three major chemical groups of drugs, the dihydropyridines, such as nifedipine, the phenyl alkyl amines, such as verapamil, and the benzothiazepines, such as diltiazem. Other calcium channel blockers useful according to the invention, include, but are not limited to, anrinone, amlodipine, bencyclane, felodipine, fendiline, flunarizine, isradipine, nicardipine, nimodipine, perhexylene, gallopamil, tiapamil and tiapamil analogues (such as 1993RO-11-2933), phenyloin, barbiturates, and the peptides dynorphin, omega-conotoxin, and omega-agatoxin, and the like and/or pharmaceutically acceptable salts thereof.
Beta-adrenergic receptor blocking agents are a class of drugs that antagonize the cardiovascular effects of catecholamines in angina pectoris, hypertension, and cardiac arrhythmias. Beta-adrenergic receptor blockers include, but are not limited to, atenolol, acebutolol, alprenolol, beftunolol, betaxolol, bunitrolol, carteolol, celiprolol, hydroxalol, indenolol, labetalol, levobunolol, mepindolol, methypranol, metindol, metoprolol, metrizoranolol, oxprenolol, pindolol, propranolol, practolol, practolol, sotalolnadolol, tiprenolol, tomalolol, timolol, bupranolol, penbutolol, trimepranol, 2-(3-(1,1-dimethylethyl)-amino-2-hyd-roxypropoxy)-3-pyridenecarbonitrilHCl, 1-butylamino-3-(2,5-dichlorophenoxy-)-2-propanol, 1-isopropylamino-3-(4-(2-cyclopropylmethoxyethyl)phenoxy)-2-propanol, 3-isopropylamino-1-(7-methylindan-4-yloxy)-2-butanol, 2-(3-t-butylamino-2-hydroxy-propylthio)-4-(5-carbamoyl-2-thienyl)thiazol, 7-(2-hydroxy-3-t-butylaminpropoxy)phthalide. The above-identified compounds can be used as isomeric mixtures, or in their respective levorotating or dextrorotating form.
Suitable COX-2 inhibitors include, but are not limited to, COX-2 inhibitors described in U.S. Pat. No. 5,474,995 Phenyl heterocycles as cox-2 inhibitors; U.S. Pat. No. 5,521,213 Diaryl bicyclic heterocycles as inhibitors of cyclooxygenase-2; U.S. Pat. No. 5,536,752 Phenyl heterocycles as COX-2 inhibitors; U.S. Pat. No. 5,550,142 Phenyl heterocycles as COX-2 inhibitors; U.S. Pat. No. 5,552,422 Aryl substituted 5,5 fused aromatic nitrogen compounds as anti-inflammatory agents; U.S. Pat. No. 5,604,253 N-benzylindol-3-yl propanoic acid derivatives as cyclooxygenase inhibitors; U.S. Pat. No. 5,604,260 5-methanesulfonamido-1-indanones as an inhibitor of cyclooxygenase-2; U.S. Pat. No. 5,639,780 N-benzyl indol-3-yl butanoic acid derivatives as cyclooxygenase inhibitors; U.S. Pat. No. 5,677,318 Diphenyl-1, 2-3-thiadiazoles as anti-inflammatory agents; U.S. Pat. No. 5,691,374 Diaryl-5-oxygenated-2-(SH)-furanones as COX-2 inhibitors; U.S. Pat. No. 5,698,584 3,4-diaryl-2-hydroxy-2,5-d-ihydrofurans as prodrugs to COX-2 inhibitors; U.S. Pat. No. 5,710,140 Phenyl heterocycles as COX-2 inhibitors; U.S. Pat. No. 5,733,909 Diphenyl stilbenes as prodrugs to COX-2 inhibitors; U.S. Pat. No. 5,789,413 Alkylated styrenes as prodrugs to COX-2 inhibitors; U.S. Pat. No. 5,817,700 Bisaryl cyclobutenes derivatives as cyclooxygenase inhibitors; U.S. Pat. No. 5,849,943 Stilbene derivatives useful as cyclooxygenase-2 inhibitors; U.S. Pat. No. 5,861,419 Substituted pyridines as selective cyclooxygenase-2 inhibitors; U.S. Pat. No. 5,922,742 Pyridinyl-2-cyclopenten-1-ones as selective cyclooxygenase-2 inhibitors; U.S. Pat. No. 5,925,631 Alkylated styrenes as prodrugs to COX-2 inhibitors; all of which are commonly assigned to Merck Frost Canada, Inc. (Kirkland, Calif.). Additional COX-2 inhibitors are also described in U.S. Pat. No. 5,643,933, assigned to G. D. Searle & Co. (Skokie, Ill.), entitled: Substituted sulfonylphenylheterocycles as cyclooxygenase-2 and 5-lipoxygenase inhibitors.
An angiotensin system inhibitor is an agent that interferes with the function, synthesis or catabolism of angiotensin II. These agents include, but are not limited to, angiotensin-converting enzyme (ACE) inhibitors, angiotensin II antagonists, angiotensin II receptor antagonists, agents that activate the catabolism of angiotensin II, and agents that prevent the synthesis of angiotensin I from which angiotensin II is ultimately derived. The renin-angiotensin system is involved in the regulation of hemodynamics and water and electrolyte balance. Factors that lower blood volume, renal perfusion pressure, or the concentration of Na+ in plasma tend to activate the system, while factors that increase these parameters tend to suppress its function.
Angiotensin (renin-angiotensin) system inhibitors are compounds that act to interfere with the production of angiotensin II from angiotensinogen or angiotensin I or interfere with the activity of angiotensin II. Such inhibitors are well known to those of ordinary skill in the art and include compounds that act to inhibit the enzymes involved in the ultimate production of angiotensin II, including renin and ACE. They also include compounds that interfere with the activity of angiotensin II, once produced. Examples of classes of such compounds include antibodies (e.g., to renin), amino acids and analogs thereof (including those conjugated to larger molecules), peptides (including peptide analogs of angiotensin and angiotensin 1), pro-renin related analogs, etc. Among the most potent and useful renin-angiotensin system inhibitors are renin inhibitors, ACE inhibitors, and angiotensin II antagonists.
Examples of angiotensin II antagonists include: peptidic compounds (e.g., saralasin, [(San1)(Val5)(Ala8)] angiotensin-(1-8) octapeptide and related analogs); N-substituted imidazole-2-one (U.S. Pat. No. 5,087,634); imidazole acetate derivatives including 2-N-butyl-4-chloro-1-(2-chlorobenzile) imidazole-5-acetic acid (see Long et al., J. Pharmacol. Exp. Ther. 247(1), 1-7 (1988)); 4,5,6,7-tetrahydro-1H-imidazo[4,5-c]pyridine-6-carboxylic acid and analog derivatives (U.S. Pat. No. 4,816,463); N2-tetrazole beta-glucuronide analogs (U.S. Pat. No. 5,085,992); substituted pyrroles, pyrazoles, and tryazoles (U.S. Pat. No. 5,081,127); phenol and heterocyclic derivatives such as 1,3-imidazoles (U.S. Pat. No. 5,073,566); imidazo-fused 7-member ring heterocycles (U.S. Pat. No. 5,064,825); peptides (e.g., U.S. Pat. No. 4,772,684); antibodies to angiotensin II (e.g., U.S. Pat. No. 4,302,386); and aralkyl imidazole compounds such as biphenyl-methyl substituted imidazoles (e.g., EP Number 253,310, Jan. 20, 1988); ES8891 (N-morpholinoacetyl-(-1-naphthyl)-L-alany-1-(4, thiazolyl)-L-alanyl (35, 45)-4-amino-3-hydroxy-5-cyclo-hexapentanoyl-1-N-hexylamide, Sankyo Company, Ltd., Tokyo, Japan); SKF108566 (E-alpha-2-[2-butyl-1-(carboxy phenyl)methyl]1H-imidazole-5-yl[methyl-ane]-2-thiophenepropanoic acid, Smith Kline Beecham Pharmaceuticals, Pa.); Losartan (DUP7531MK954, DuPont Merck Pharmaceutical Company); Remikirin (RO42-5892, F. Hoffman LaRoche AG); A2 agonists (Marion Merrill Dow) and certain non-peptide heterocycles (G. D. Searle and Company). Classes of compounds known to be useful as ACE inhibitors include acylmercapto and mercaptoalkanoyl prolines such as captopril (U.S. Pat. No. 4,105,776) and zofenopril (U.S. Pat. No. 4,316,906), carboxyalkyl dipeptides such as enalapril (U.S. Pat. No. 4,374,829), lisinopril (U.S. Pat. No. 4,374,829), quinapril (U.S. Pat. No. 4,344,949), ramipril (U.S. Pat. No. 4,587,258), and perindopril (U.S. Pat. No. 4,508,729), carboxyalkyl dipeptide mimics such as cilazapril (U.S. Pat. No. 4,512,924) and benazapril (U.S. Pat. No. 4,410,520), phosphinylalkanoyl prolines such as fosinopril (U.S. Pat. No. 4,337,201) and trandolopril.
Examples of renin inhibitors that are the subject of United States patents are as follows: urea derivatives of peptides (U.S. Pat. No. 5,116,835); amino acids connected by nonpeptide bonds (U.S. Pat. No. 5,114,937); di and tri peptide derivatives (U.S. Pat. No. 5,106,835); amino acids and derivatives thereof (U.S. Pat. Nos. 5,104,869 and 5,095,119); diol sulfonamides and sulfinyls (U.S. Pat. No. 5,098,924); modified peptides (U.S. Pat. No. 5,095,006); peptidyl beta-aminoacyl aminodiol carbamates (U.S. Pat. No. 5,089,471); pyrolimidazolones (U.S. Pat. No. 5,075,451); fluorine and chlorine statine or statone containing peptides (U.S. Pat. No. 5,066,643); peptidyl amino diols (U.S. Pat. Nos. 5,063,208 and 4,845,079); N-morpholino derivatives (U.S. Pat. No. 5,055,466); pepstatin derivatives (U.S. Pat. No. 4,980,283); N-heterocyclic alcohols (U.S. Pat. No. 4,885,292); monoclonal antibodies to renin (U.S. Pat. No. 4,780,401); and a variety of other peptides and analogs thereof (U.S. Pat. Nos. 5,071,837, 5,064,965, 5,063,207, 5,036,054, 5,036,053, 5,034,512, and 4,894,437).
Information on association/correlation between genotypes and disease-related phenotypes can be exploited in several ways. For example, in the case of a highly-statistically significant association between one or more SNPs with predisposition to a disease for which treatment is available, detection of such a genotype pattern in an individual may justify immediate administration of treatment, or at least the institution of regular monitoring of the individual. Even if detection of one of the SNPs of the invention did not call for immediate therapeutic intervention or monitoring in a particular individual, the subject can nevertheless be motivated to begin simple life-style changes (e.g., diet, exercise) that can be accomplished at little or no cost to the individual but would confer potential benefits in reducing the risk of developing conditions for which that individual may have an increased risk by virtue of having the CAD-susceptibility allele(s).
The SNPs of the invention may contribute to coronary artery disease in an individual in different ways. Some polymorphisms occur within a protein coding sequence and contribute to disease phenotype by affecting protein structure. Other polymorphisms occur in noncoding regions but may exert phenotypic effects indirectly via influence on, for example, replication, transcription, and/or translation. A single SNP may affect more than one phenotypic trait. Likewise, a single phenotypic trait may be affected by multiple SNPs in different genes.
As used herein, the terms “diagnose”, “diagnosis”, and “diagnostics” include, but are not limited to any of the following: detection of coronary artery disease that an individual may presently have, predisposition/susceptibility screening (i.e., determining the increased risk of an individual in developing coronary artery disease in the future, or determining whether an individual has a decreased risk of developing coronary artery disease in the future), determining a particular type or subclass of coronary artery disease in an individual known to have coronary artery disease, confirming or reinforcing a previously made diagnosis of artery disease, pharmacogenomic evaluation of an individual to determine which therapeutic strategy that individual is most likely to positively respond to or to predict whether a patient is likely to respond to a particular treatment, predicting whether a patient is likely to experience toxic effects from a particular treatment or therapeutic compound, and evaluating the future prognosis of an individual having coronary artery disease. Such diagnostic uses are based on the SNPs individually or in a unique combination or SNP haplotypes of the present invention.
Haplotypes are particularly useful in that, for example, fewer SNPs can be genotyped to determine if a particular genomic region harbors a locus that influences a particular phenotype, such as in linkage disequilibrium-based SNP association analysis.
Linkage disequilibrium (LD) refers to the co-inheritance of alleles (e.g., alternative nucleotides) at two or more different SNP sites at frequencies greater than would be expected from the separate frequencies of occurrence of each allele in a given population. The expected frequency of co-occurrence of two alleles that are inherited independently is the frequency of the first allele multiplied by the frequency of the second allele. Alleles that co-occur at expected frequencies are said to be in “linkage equilibrium”. In contrast, LD refers to any non-random genetic association between allele(s) at two or more different SNP sites, which is generally due to the physical proximity of the two loci along a chromosome. LD can occur when two or more SNPs sites are in close physical proximity to each other on a given chromosome and therefore alleles at these SNP sites will tend to remain unseparated for multiple generations with the consequence that a particular nucleotide (allele) at one SNP site will show a non-random association with a particular nucleotide (allele) at a different SNP-site located nearby. Hence, genotyping one of the SNP sites will give almost the same information as genotyping the other SNP site that is in LD.
Various degrees of LD can be encountered between two or more SNPs with the result being that some SNPs are more closely associated (i.e., in stronger LD) than others. Furthermore, the physical distance over which LD extends along a chromosome differs between different regions of the genome, and therefore the degree of physical separation between two or more SNP sites necessary for LD to occur can differ between different regions of the genome.
For diagnostic purposes and similar uses, if a particular SNP site is found to be useful for diagnosing coronary artery disease (e.g., has a significant statistical association with the condition and/or is recognized as a causative polymorphism for the condition), then the skilled artisan would recognize that other SNP sites which are in LD with this SNP site would also be useful for diagnosing the condition. Thus, polymorphisms (e.g., SNPs and/or haplotypes) that are not the actual disease-causing (causative) polymorphisms, but are in LD with such causative polymorphisms, are also useful. In such instances, the genotype of the polymorphism(s) that is/are in LD with the causative polymorphism is, predictive of the genotype of the causative polymorphism and, consequently, predictive of the phenotype (e.g., coronary artery disease) that is influenced by the causative SNP(s). Therefore, polymorphic markers that are in LD with causative polymorphisms are useful as diagnostic markers, and are particularly useful when the actual causative polymorphism(s) is/are unknown.
Examples of polymorphisms that can be in LD with one or more causative polymorphisms (and/or in LD with one or more polymorphisms that have a significant statistical association with a condition) and therefore useful for diagnosing the same condition that the causative/associated SNP(s) is used to diagnose, include, for example, other SNPs in the same gene, protein-coding, or mRNA transcript-coding region as the causative/associated SNP, other SNPs in the same exon or same intron as the causative/associated SNP, other SNPs in the same haplotype block as the causative/associated SNP, other SNPs in the same intergenic region as the causative/associated SNP, SNPs that are outside but near a gene (e.g., within 6 kb on either side, 5′ or 3′, of a gene boundary) that harbors a causative/associated SNP, etc.
Linkage disequilibrium in the human genome is reviewed in: Wall et al., “Haplotype blocks and linkage disequilibrium in the human genome”, Nat Rev Genet August 2003; 4(8):587-97; Garner et al., “On selecting markers for association studies: patterns of linkage disequilibrium between two and three diallelic loci”, Genet Epidemiol. January 2003; 24(1):57-67; Ardlie et al., “Patterns of linkage disequilibrium in the human genome”, Nat Rev Genet. April 2002; 3(4):299-309 (erratum in Nat Rev Genet July 2002; 3(7):566); and Remm et al., “High-density genotyping and linkage disequilibrium in the human genome using chromosome 22 as a model”; Curr Opin Chem. Biol. February 2002; 6(1):24-30.
The contribution or association of particular SNP and/or SNP haplotype with disease phenotypes, such as coronary artery disease, enables the SNPs of the present invention to be used to develop superior diagnostic tests capable of identifying individuals who express a detectable trait, such as coronary artery disease, as the result of a specific genotype, or individuals whose genotype places them at an increased or decreased risk of developing a detectable trait at a subsequent time as compared to individuals who do not have that genotype. As described herein, diagnostics may be based on a single SNP or a group of SNPs. Combined detection of a plurality of SNPs (for example, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 24, 25, 30, 32, 48, 50, 64, 96, 100, or any other number in-between, or more, of the SNPs provided in Tables 1-4) typically increases the probability of an accurate diagnosis. For example, the presence of a single SNP known to correlate with coronary artery disease might indicate a probability of 20% that an individual has or is at risk of developing coronary artery disease, whereas detection of five SNPs, each of which correlates with coronary artery disease, might indicate a probability of 80% that an individual has or is at risk of developing coronary artery disease. To further increase the accuracy of diagnosis or predisposition screening, analysis of the SNPs of the present invention can be combined with that of other polymorphisms or other risk factors of coronary artery disease, such as disease symptoms, pathological characteristics, family history, diet, environmental factors or lifestyle factors.
It will, of course, be understood by practitioners skilled in the treatment or diagnosis of coronary artery disease that the present invention generally does not intend to provide an absolute identification of individuals who are at risk (or less at risk) of developing coronary artery disease, and/or pathologies related to coronary artery disease, but rather to indicate a certain increased (or decreased) degree or likelihood of developing the disease based on statistically significant association results. However, this information is extremely valuable as it can be used to, for example, initiate preventive treatments or to allow an individual carrying one or more significant SNPs or SNP haplotypes to foresee warning signs such as minor clinical symptoms, or to have regularly scheduled physical exams to monitor for appearance of a condition in order to identify and begin treatment of the condition at an early stage. Particularly with diseases that are extremely debilitating or fatal if not treated on time, the knowledge of a potential predisposition, manner to treatment efficacy.
The diagnostic techniques of the present invention may employ a variety of methodologies to determine whether a test subject has a SNP or a SNP pattern associated with an increased or decreased risk of developing a detectable trait or whether the individual suffers from a detectable trait as a result of a particular polymorphism/mutation, including, for example, methods which enable the analysis of individual chromosomes for haplotyping, family studies, single sperm DNA analysis, or somatic hybrids. The trait analyzed using the diagnostics of the invention may be any detectable trait that is commonly observed in pathologies and disorders related to coronary artery disease.
Another aspect of the present invention relates to a method of determining whether an individual is at risk (or less at risk) of developing one or more traits or whether an individual expresses one or more traits as a consequence of possessing a particular trait-causing or trait-influencing allele. These methods generally involve obtaining a nucleic acid sample from an individual and assaying the nucleic acid sample to determine which nucleotide(s) is/are present at one or more SNP positions, wherein the assayed nucleotide(s) is/are indicative of an increased or decreased risk of developing the trait or indicative that the individual expresses the trait as a result of possessing a particular trait-causing or trait-influencing allele.
The invention now being generally described, it will be more readily understood by reference to the following examples, which are included merely for purposes of illustration of certain aspects and embodiments of the present invention and are not intended to be limiting in any way.
The contents of any patents, patent applications, patent publications, or scientific articles referenced anywhere in this application are herein incorporated by reference in their entirety.
Cardiovascular disease (CVD) is the leading cause of morbidity and mortality in the United States. Among the risk factors for cardiovascular disease are behavioral (e.g.) smoking, sedentary lifestyle, or poor diet), age and health-related (e.g. diabetes, hyperlipidemia or hypertension), and genetic factors. Family history as a general marker for genetic risk is one of the most consistently identified risk factors for CVD, yet there are no examples of genes known to increase risk in even a fraction of individuals with CVD. One of the reasons that these genes are so difficult to find is that the genetic effects of any given gene are likely to be small and are likely to interact with other genes. In addition, these effects are likely to manifest themselves at different ages and stages along the CVD continuum.
The Approaches for Genomic Discovery in Atherosclerosis (AGENDA) study was initiated to discover genes for CVD among a large number of genes implicated in a study of gene expression in human aortas. The goal of the human disease association components of the AGENDA study is to evaluate these genes in a clinic-based sample of individuals presenting to the Duke Diagnostic Catheterization Laboratory (DDLC). Patients presenting to the DDCL have been offered the opportunity to contribute to the CATHGEN study blood bank, which houses blood, plasma and RNA samples. These samples are later matched to the diagnostic and outcome information stored in the DISSC database maintained at the Duke Clinical Research Institute. The CATHGEN subjects have consented and the samples have been collected under the appropriate authorizations from the Duke University Medical Center IRB.
Two sets of samples have been obtained from the CATHGEN study for analysis in the AGENDA study. These samples have been selected on the basis of CAD index (CADi, an angiographically-defined measure of disease risk) and age. The first set of samples includes 468 young affected (YA) subjects (age ≦55, CADi>32), 260 older affected (OA) subjects (age >55, CADi>74) and 320 unaffected elderly (ON) subjects (age >60, CADi<23). The OA vs. ON and YA vs. ON comparisons are performed to identify genetic polymorphisms that increase susceptibility to CVD per se. The OA vs. YA comparison is performed to identify genetic polymorphisms that modify risk resulting in disease that presents at a young age, under the assumption that all individuals are at risk for CVD.
Over 1050 single nucleotide polymorphisms in 275 genes have been genotyped. These genes have been selected on the basis of location in the genome relative to a genetic linkage analysis of early onset coronary artery disease in families (the GENECARD study), ability to predict aortic atherosclerosis using gene expression in the human aorta, ability to predict aortic atherosclerosis in APO-E knockout mice, and published reports of genes identified through linkage analysis of CAD.
SNP candidates were selected using an algorithm to identify high-quality SNPs from public resources.
The statistical analysis of these variants was performed in a two-step process. First the genotypes were analyzed to evaluate the quality of the genotyping experiment. The CHG quality control protocol includes error analysis of duplicated samples arranged throughout the SNP analysis plates, evaluation of genotyping efficiency, analysis of allele frequencies and consistency with Hardy-Weinberg equilibrium. Once the SNPs were shown to meet error rate and consistency standards, the second part of the analysis was performed to evaluate association of SNP alleles and genotypes with disease status. Logistic regression was performed of diseased vs. normal or young vs. old disease adjusting for ethnicity and gender. Indicators for SNP alleles or SNP genotypes were included in the model. SNPs with model coefficients providing p-values less than 0.10 were considered interesting and worthy of additional analysis.
Table 1 provides an overview of the lowest p-values for each SNP. The x-axis represents location in the genome and the y-axis shows the negative log(base 10) of the lowest p-value for that SNP. Thus log p-values greater than 1.3 represent p-values less than 0.05 and log p-values greater than 2 represent p-values less than 0.01. Abbreviated gene names are included on the plot for all significant SNPs. Detailed results of this analysis are shown in Table 1:
0.05
0.021
0.045
0.036
0.018
0.008
0.002
0.004
0.003
0.04
0.021
0.021
0.007
0.013
0.005
0.017
0.028
0.016
0.036
0.034
0.008
0.008
0.008
0.016
0.025
0.014
0.037
0.043
0.026
0.024
0.049
0.042
0.005
0.023
0.01
0.003
0.002
0.005
0.004
0.009
3E−04
0.007
0.048
0.011
0.033
0.048
0.026
0.039
0.047
0.029
0.035
0.044
0.042
0.013
4E−04
0.009
0.015
0.005
0.002
0.002
0.004
0.004
0.045
0.02
0.012
0.019
0.018
0.004
0.026
0.001
0.039
0.031
0.025
0.024
0.027
0.03
0.038
0.007
0.044
0.025
0.037
0.025
0.018
0.03
0.024
0.023
0.037
0.042
0.029
0.022
0.02
0.039
0.03
0.034
0.04
0.048
0.017
0.006
0.003
0.007
0.029
0.042
0.015
0.025
0.027
0.046
1E−04
0.001
0.033
0.046
0.014
0.026
0.023
0.029
0.019
0.023
0.015
0.021
0.027
0.015
0.047
0.047
0.017
0.043
0.021
0.034
0.021
0.009
0.003
0.049
0.032
0.038
0.039
0.03
0.017
0.042
0.02
0.018
0.007
0.031
0.039
0.013
0.046
0.023
0.026
0.014
0.028
0.049
0.029
0.024
0.036
0.05
0.031
0.017
0.021
0.027
0.038
0.037
0.006
0.031
0.02
0.041
0.011
0.048
0.024
0.4881
A detailed list of the top genes and SNPs in those genes ranked by p-value for each gene is included as Table 2 below. Genes identified having at least 1 SNP with a p-value less than 0.10 are shown in bold. Genes were identified from logistic regression analysis of three models only: OA vs. ON, YA vs. ON and YA vs. OA. The column headers represent the following: GENE: Gene name (HUGO ID); Gene alias: Non-HUGO ID gene aliases or previous gene names; Meta Rank: Gene rank from the David Seo/Mike West microarray expression study [PMID:15297278]; Pval Rank: Gene rank based on lowest Cathgen p-value for any SNP/model in that gene (lowest p-value has rank of 1); Startloc: Gene's base pair start location from NCBI build 35; Chr: Chromosome; # SNPS: Number of SNPs in that gene genotyped in Cathgen individuals; Lowest p-value: Lowest p-value for any SNP/model in that gene from logistic regression analysis of groups C1+C2 (1037 individuals); adjusted for sex and ethnicity; Model: SNP model with the lowest p-value (responsible for that gene's Top Gene p-value ranking); Other models <0.10: All other SNPs, models in that gene with a p-value <0.10. Abbreviations are as follows: A, Allele test; G, Genotype test; YVN, Young Affected v. Old Normal; OVN, Old Affected v. Old Normal; YVO, Young Affected v. Old Affected
While all candidates listed in Table 2 must be considered very strong candidates, the results of these analyses very strongly implicate several genes in the development of atherosclerosis as measured by CADi. The following is a description of the genes in Table 2: AIM1 L: Absent in melanoma 1-like; PLA2G7: Platelet-activating factor acetylhydrolase precursor (EC 3.1.1.47) (PAF acetylhydrolase) (PAF 2-acylhydrolase) (LDL-associated phospholipase A2) (LDL-PLA(2)) (2-acetyl-1-alkylglycerophosphocholine esterase) (1-alkyl-2-acetylglycerophosphocholine esterase); OR7E29P: olfactory receptor, family 7, subfamily E, member 29 pseudogene; PLN: Cardiac phospholamban (PLB); PTPN6: Protein-tyrosine phosphatase, non-receptor type 6 (EC 3.1.3.48) (Protein-tyrosine phosphatase 1C) (PTP-1C) (Hematopoietic cell protein-tyrosine phosphatase) (SH-PTP1) (Protein-tyrosine phosphatase SHP-1); C1ORF38: ICB-1beta (Clorf38 protein); GATA2: Endothelial transcription factor GATA-2; IL7R: Interleukin-7 receptor alpha chain precursor (IL-7R-alpha) (CDw127) (CD127 antigen); MYLK: Myosin light chain kinase, smooth muscle and non-muscle isozymes (EC 2.7.1.117) (MLCK) [Contains: Telokin (Kinase related protein) (KRP)]; ANPEP: Aminopeptidase N (EC 3.4.11.2) (hAPN) (Alanyl aminopeptidase) (Microsomal aminopeptidase) (Aminopeptidase M) (gp150) (Myeloid plasma membrane glycoprotein CD13); PIK3R4: phosphoinositide-3-kinase, regulatory subunit 4, pISO; RPLP2: 60S acidic ribosomal protein P2; OLR1: OXIDISED LOW DENSITY LIPOPROTEIN (LECTIN-LIKE) RECEPTOR 1; SCAVENGER RECEPTOR CLASS E, MEMBER 1; PNPLA2: patatin-like phospholipase domain containing 2; TCF4: Transcription factor 4 (Immunoglobulin transcription factor 2) (ITF-2) (SL3-3 enhancer factor 2) (SEF-2); ACP5: TARTRATE RESISTANT ACID PHOSPHATASE TYPE 5 PRECURSOR (EC 3.1.3.2) (TR-AP) (TARTRATE-RESISTANT ACID ATPASE) (TRATPASE); SELP: P-selectin precursor (Granule membrane protein 140) (GMP-140) (PADGEM) (CD62P) (Leukocyte-endothelial cell adhesion molecule 3) (LECAM3); BAX: BAX protein, cytoplasmic isoform delta; CPNE4: Copine-4 (Copine IV) (Copine-8); TALI: T-cell acute lymphocytic leukemia-1 protein (TAL-1 protein) (Stem cell protein) (T-cell leukemia/lymphoma-5 protein); KLF15: Krueppel-like factor 15 (Kidney-enriched kruppel-like factor); ABCB1: Multidrug resistance protein 1 (P-glycoprotein 1) (CD243 antigen); LHFPL2: Homo sapiens lipoma HMG1C fusion partner-like 2 (LHFPL2), mRNA; ITGAX: Integrin alpha-X precursor (Leukocyte adhesion glycoprotein p150,95 alpha chain) (Leukocyte adhesion receptor p150,95) (CD11c) (Leu M5); LOC389142: hypothetical LOC389142; PLXNC1: Homo sapiens plexin C1 (PLXNC1), mRNA; SLA: SRC-like-adapter (Src-like-adapter protein 1) (hSLAP); ELL: RNA polymerase II elongation factor ELL (Eleven-nineteen lysine-rich leukemia protein); NPY: Neuropeptide Y precursor [Contains: Neuropeptide Y (Neuropeptide tyrosine) (NPY); C-flanking peptide of NPY (CPON)]; IGSF11: Brain and testis-specific immunoglobin superfamily protein; ITPK1: Homo sapiens inositol 1,3,4-triphosphate 5/6 kinase (ITPK1), mRNA; ASB1: Ankyrin repeat and SOCS box containing protein 1 (ASB-1); SELB: Selenocysteine-specific elongation factor (Elongation factor sec); LOC131873: hypothetical protein LOC131873; PCCA: Propionyl-CoA carboxylase alpha chain, mitochondrial precursor (EC 6.4.1.3) (PCCase alpha subunit) (Propanoyl-CoA:carbon dioxide ligase alpha subunit); HAPIP: Huntingtin-associated protein-interacting protein (Duo protein); PLAUR: Urokinase plasminogen activator surface receptor precursor (uPAR) (U-PAR) (Monocyte activation antigen Mo3) (CD87 antigen); SIDTI: SIDI transmembrane family, member 1; RPN1: Dolichyl-diphosphooligosaccharide—protein glycosyltransferase 67 kDa subunit precursor (EC 2.4.1.119) (Ribophorin I) (RPN-I); BPAG1: Bullous pemphigoid antigen 1 isofomms 1/2/3/4/5/8 (230 kDa bullous pemphigoid antigen) (BPA) (Hemidesmosomal plaque protein) (Dystonia musculorum protein) (Fragment); ROR2: TYROSINE-PROTEIN KINASE TRANSMEMBRANE RECEPTOR ROR2-PRECURSOR (EC 2.7.1.112) (NEUROTROPHIC TYROSINE KINASE, RECEPTOR-RELATED 2); MMP12: MACROPHAGE METALLOELASTASE PRECURSOR (EC 3.4.24.65) (HME) (MATRIX METALLOPROTEINASE-12) (MMP-12) (MACROPHAGE ELASTASE) (ME); GAP43: Neuromodulin (Axonal membrane protein GAP-43) (Growth associated protein 43) (PP46) (Neural phosphoprotein B-50); FSTL11: Follistatin-related protein 1 precursor (Follistatin-like 1); MAP4: Microtubule-associated protein 4 (MAP 4); ZNF217: Zinc finger protein 217; ALOX5: ARACHIDONATE 5-LIPOXYGENASE (EC 1.13.11.34) (5-LIPOXYGENASE) (5-LO); NPHP3: nephronophthisis 3; GPNMB: Putative transmembrane protein NMB precursor (Transmembrane glycoprotein HGFIN); SPP1: Osteopontin precursor (Bone sialoprotein 1) (Urinary stone protein) (Secreted phosphoprotein 1) (SPP-1) (Nephropontin) (Uropontin); ZNF80: Zinc finger protein 80 (ZNFPT17); MGP: Matrix Gla-protein precursor (MGP); C30RF15:; NEK11: NIMA (never in mitosis gene a)—related kinase 11; POLQ: polymerase (DNA directed), theta; ADFP: ADIPOPHILIN (ADIPOSE DIFFERENTIATION-RELATED PROTEIN) (ADRP); UBXD1: UBX domain-containing protein 1; 38413: membrane-associated ring finger (C3HC4) 2; FLJ46299:; ZBTB20: Zinc finger and BTB domain containing protein 20 (Zinc finger protein 288) (Dendritic-derived BTB/POZ zinc finger protein); HLA-DQA2: HLA class II histocompatibility antigen, DQ(6) alpha chain precursor (DX alpha chain) (HLA-DQA1); ZXDC: ZXD family zinc finger C; GRN: Granulins precursor (Acrogranin) (Proepithelin) (PEPI) [Contains: Paragranulin; Granulin 1 (Granulin G); Granulin 2 (Granulin F); Granulin 3 (Granulin B); Granulin 4 (Granulin A); Granulin 5 (Granulin C); Granulin 6 (Granulin D); Granulin 7 (Granulin E)]; PSCD1: CYTOHESIN 1 (SEC7 HOMOLOG B2-1).; GYS1: Glycogen [starch] synthase, muscle (EC 2.4.1.11); C14ORF132: NA; CD80: T lymphocyte activation antigen CD80 precursor (Activation B7-1 antigen) (CTLA-4 counter-receptor B7.1) (B7) (BB1); CDGAP: Cdc42 GTPase-activating protein; LMOD1: Leiomodin 1 (Leiomodin, muscle form) (64 kDa autoantigen D1) (64 kDa autoantigen 1D) (64 kDa autoantigen 1D3) (Thyroid-associated opthalmopathy autoantigen) (Smooth muscle leiomodin) (SM-Lmod); SLC41A3: solute carrier family 41, member 3; HOXD1: Homeobox protein Hox-D1; STAT5A: SIGNAL TRANSDUCER AND ACTIVATOR OF TRANSCRIPTION 5A; OPRM1: Mu-type opioid receptor (MOR-1); ITPR2: INOSITOL 1,4,5-TRISPHOSPHATE RECEPTOR TYPE 2 (TYPE 2 INOSITOL 1,4,5-TRISPHOSPHATE RECEPTOR) (TYPE 2 INSP3 RECEPTOR) (IP3 RECEPTOR ISOFORM 2) (INSP3R2); HIF1A: HYPOXIA-INDUCIBLE FACTOR 1 ALPHA (HIF-1 ALPHA) (ARNT INTERACTING PROTEIN) (MEMBER OF PAS PROTEIN 1) (MOP1) (HIF1 ALPHA); PKD2: Polycystin 2 (Autosomal dominant polycystic kidney disease type II protein) (Polycystwin) (R48321); STEAP: Six transmembrane epithelial antigen of prostate; AGTR1: Type-1 angiotensin II receptor (AT1) (AT1AR); NDUFB4: NADH dehydrogenase (ubiquinone) 1 beta subcomplex; GLRA3: Glycine receptor alpha-3 chain precursor; MEF2A: Myocyte-specific enhancer factor 2A (Serum response factor-like protein 1); STX3BP5L: syntaxin binding protein 5-like; APOBEC3D: NA; FMNL1: FORMIN-LIKE PROTEIN (PROTEIN C17ORF1); PLXND1: Homo sapiens plexin D1 (PLXND1), mRNA; ATP2C1: Calcium-transporting ATPase type 2C, member 1 (ATPase 2Cl) (ATP-dependent Ca(2+) pump PMR1); RUVBL1: RuvB-like 1 (EC 3.6.1.-) (49-kDa TATA box-binding protein-interacting protein) (49 kDa TBP-interacting protein) (TIP49a) (Pontin 52) (Nuclear matrix protein 238) (NMP 238) (54 kDa erythrocyte cytosolic protein) (ECP-54) (TIP60-associated protein 54-alpha) (TAP54-alpha); CASR: Extracellular calcium-sensing receptor precursor (CaSR) (Parathyroid Cell calcium-sensing receptor); PTPRR: PROTEIN-TYROSINE PHOSPHATASE R PRECURSOR (EC 3.1.3.48) (PROTEIN-TYROSINE PHOSPHATASE PCPTP1) (NC-PTPCOM1) (CH-1PTPASE); SMPDL3A: Acid sphingomyelinase-like phosphodiesterase 3a precursor (EC 3.1.4.-) (ASM-like phosphodiesterase 3a); APOD: Apolipoprotein D precursor (Apo-D) (ApoD); APG3L: APG3 autophagy 3-like (S. cerevisiae); FLJ35880: FLJ35880: hypothetical protein FLJ35880; TMCCl: transmembrane and coiled-coil domains 1; CD96: T-cell surface protein tactile precursor (CD96 antigen); C1QB: Complement Clq subcomponent, B chain precursor; CTSD: Cathepsin D precursor (EC 3.4.23.5); FLI1: FRIEND LEUKEMIA INTEGRATION 1 TRANSCRIPTION FACTOR (FLI-1 PROTO-ONCOGENE) (ERGB TRANSCRIPTION FACTOR).; MMP9: 92 kDa type IV collagenase precursor (EC 3.4.24.35) (92 kDa gelatinase) (Matrix metalloproteinase-9) (MMP-9) (Gelatinase B) (GELB); TCIRG1: Vacuolar proton translocating ATPase 116 kDa subunit a isoform 3 (V-ATPase 116-kDa isoform a3) (Osteoclastic proton pump 116 kDa subunit) (OC-116 KDa) (OC116) (T-cell immune regulator 1) (T cell immune response cDNA7 protein) (TIRC7); ITGB5: Integrin beta-5 precursor; FLJ25414: NA; NR1H3: OXYSTEROLS RECEPTOR LXR-ALPHA (LIVER X RECEPTOR ALPHA) (NUCLEAR ORPHAN RECEPTOR LXR-ALPHA).; HSPBAP1: HSPB (eat shock 27 kDa) associated protein 1; APOC1: Apolipoprotein C-1 precursor (Apo-CI); THPO: Thrombopoietin precursor (Megakaryocyte colony stimulating factor) (Myeloproliferative leukemia virus oncogene ligand) (C-mpl ligand) (ML) (Megakaryocyte growth and development factor) (MGDF); FTL: Ferritin light chain (Ferritin L subunit); HADHSC: Short chain 3-hydroxyacyl-CoA dehydrogenase, mitochondrial precursor (EC 1.1.1.35) (HCDH) (Medium and short chain L-3-hydroxyacyl-coenzyme A dehydrogenase); ALOX5AP: 5-lipoxygenase activating protein (FLAP) (MK-886-binding protein); LAIR1: Homo sapiens leukocyte-associated Ig-like receptor 1 (LAIR1), transcript variant a, mRNA; UPP1: Uridine phosphorylase 1 (EC 2.4.2.3) (UrdPase 1) (UPase 1); LAPTM5: Lysosomal-associated multitransmembrane protein (Retinoic acid-inducible E3 protein) (HA 1520); CSTA: cystatin A (stefin A); ADCY5: adenylate cyclase 5; PHLDB2: pleckstrin homology-like domain, family B, member 2; LL5 beta [Homo sapiens]; GM2A: Ganglioside GM2 activator precursor (GM2-AP) (Cerebroside sulfate activator protein) (Shingolipid activator protein 3) (SAP-3); NUDT16: nudix-type motif 16; ACSL1: Long-chain-fatty-acid—CoA ligase 1 (EC 6.2.1.3) (Long-chain acyl-CoA synthetase 1) (LACS 1) (Palmitoyl-CoA ligase 1) (Long-chain fatty acid CoA ligase 2) (Long-chain acyl-CoA synthetase 2) (LACS 2) (Acyl-CoA synthetase 1) (ACS1) (Palmitoyl-CoA ligase 2); VAMP5: Vesicule-associated membrane protein 5 (VAMP-5) (Myobrevin) (HSPC191); ACP2: LYSOSOMAL ACID PHOSPHATASE PRECURSOR (EC 3.1.3.2) (LAP); HLA-DPA1: HLA class II histocompatibility antigen, DP alpha chain precursor (HLA-SB alpha chain) (MHC class II DP3-alpha) (DP(W3)) (DP(W4)); TUBA3: tubulin, alpha 3; MMP7: MATRILYSIN PRECURSOR (EC 3.4.24.23) (PUMP-1 PROTEASE) (UTERINE METALLOPROTEINASE) (MATRIX METALLOPROTEINASE-7) (MMP-7) (MATRIN); H41: hypothetical protein H41; NR12: nuclear receptor subfamily 1, group 1, member 2; FGFR2: FIBROBLAST GROWTH FACTOR RECEPTOR 2 PRECURSOR (EC 2.7.1.112) (FGFR-2) (KERATINOCYTE GROWTH FACTOR RECEPTOR 2).; GBA: Glucosylceramidase precursor (EC 3.2.1.45) (Beta-glucocerebrosidase) (Acid beta-glucosidase) (D-glucosyl-N-acylsphingosine glucohydrolase) (Alglucerase) (Imiglucerase); CHAF1A: Chromatin assembly factor 1 subunit A (CAF-1 subunit A) (Chromatin assembly factor 1 p150 subunit) (CAF-1 150 kDa subunit) (CAF-1p150); GSK3B: glycogen synthase kinase 3 beta; DOCK2: Dedicator of cytokinesis protein 2; URB: steroid sensitive gene 1; HCLS1: Hematopoietic lineage cell specific protein (Hematopoietic cell-specific LYN substrate 1) (LCKBP1); CD200R1: CD200 receptor 1; SLCO2B1: SOLUTE CARRIER FAMILY 21 MEMBER 9 (ORGANIC ANION TRANSPORTER B) (OATP-B) (ORGANIC ANION TRANSPORTER POLYPEPTIDE-RELATED PROTEIN 2) (OATP-RP2) (OATPRP2); B4GALT4: Beta-1,4-galactosyltransferase 4 (EC 2.4.1.-) (b4Gal-T4) [Includes: N-acetyllactosamine synthase (EC 2.4.1.90) (NaI synthetase); Beta-N-acetylglucosaminyl-glycolipid beta-1,4 galactosyltransferase (EC 2.4.1.-)]; PLCXD2: phosphatidylinositol-specific phospholipase C, X domain containing 2; FABP7: Fatty acid-binding protein, brain (B-FABP) (Brain lipid-binding protein) (BLBP) (Mammary derived growth inhibitor related); CAMKK2: Homo sapiens calcium/calmodulin-dependent protein kinase kinase 2, beta (CAMKK2), transcript variant 1, mRNA; FCGR1A: High affinity immunoglobulin gamma Fc receptor I precursor (Fc-gamma R1) (FcRI) (IgG Fc receptor I) (CD64 antigen); SELL: L-selectin precursor (Lymph node homing receptor) (Leukocyte adhesion molecule-1) (LAM-1) (Leukocyte surface antigen Leu-8) (TQ1) (gp90-MEL) (Leukocyte-endothelial cell adhesion molecule 1) (LECAM1) (CD62L); SELE: Homo sapiens selectin E (endothelial adhesion molecule 1) (SELE), mRNA; HNRPM: Heterogeneous nuclear ribonucleoprotein M (hnRNP M); MGC45840: hypothetical protein MGC45840; F5: Coagulation factor V precursor (Activated protein C cofactor); SMTN: Smoothelin; RA13: Homo sapiens retinoic acid induced 3 (RA13), mRNA; HLA-DRA: HLA class II histocompatibility antigen, DR alpha chain precursor (MHC class II antigen DRA); CSTB: Cystatin B (Liver thiol proteinase inhibitor) (CPI-B) (Stefin B); FLJ12592: N/A; TAGLN3: Neuronal protein NP25 (Neuronal protein 22) (NP22).
CATHGEN subjects were recruited sequentially through the cardiac catheterization laboratories at Duke University Hospital (Durham, N.C.) with approval from the Duke Institutional Review Board. All subjects undergoing catheterization were offered participation in the study and signed informed consent. Medical history and clinical data were collected and stored in the Duke Information System for Cardiovascular Care database maintained at the Duke Clinical Research Institute [1].
Controls and cases were chosen on the basis of extent of coronary artery disease as measured by the CAD index (CADi). CADi is a numerical summary of coronary angiographic data that incorporates the extent and anatomical distribution of coronary disease [2]. CADi has been shown to be a better predictor of clinical outcome than the extent of CAD [3]. Affected status was determined by the presence of significant CAD defined as a CADi≧32 [4]. For patients older than 55 years of age, a higher CADi threshold (CADi≧74) was used to adjust for the higher baseline extent of CAD in this group. Medical records were reviewed to determine the age-of-onset (AOO) of CAD, i.e. the age at first documented surgical or percutaneous coronary revascularization procedure, myocardial infarction (MI), or cardiac catheterization meeting the above defined CADi thresholds. The CATHGEN cases were stratified into a young affected group (AOO ≦55 years), which provides a consistent comparison group for the GENECARD family study. Controls were defined as subjects ≧60 years of age, with no CAD as demonstrated by coronary angiography and no documented history of cerebrovascular or peripheral vascular
A set of at least 5 SNPs with a minor allele frequency (MAF) of >10% [5] was selected for genotyping in each gene CATHOEN samples using the SNPselector program [6]. Genomic DNA for CATHGEN samples was extracted from whole blood using the PureGene system (Gentra Systems, Minneapolis, Minn.). Genotyping was performed using the ABI 7900HT Taqman SNP genotyping system (Applied Biosystems, Foster City, Calif.), which incorporates a standard PCR-based, dual fluor, allelic discrimination assay in 384 well plate format with a dual laser scanner. Allelic discrimination assays were purchased through Applied Biosystems or, in cases in which the assays were not available, primer and probe sets were designed and purchased through Integrated DNA Technologies (IDT, Coralville, Iowa). A total of 15 quality control samples, composed of 6 reference genotype controls in duplicate, two Centre d'Etude du Polymorphisme Humain (CEPH) pedigree individuals and one no-template sample, were included in each quadrant of the 384 well plate. Genotyping was also performed using the Illumina BeadStation 500G SNP genotyping system (Illumina, San Diego, Calif.). Each Sentrix Array generates 1536 genotypes for 96 individuals; within each individual array experiment four quality control samples were included, two CEPH pedigree individuals and two identical in-plate controls. Results of the CEPH and quality control samples were compared to identify possible sample plating errors and genotype calling inconsistencies. SNPs that showed mismatches on quality control samples were reviewed by an independent genotyping supervisor for potential genotyping errors. All SNPs examined were successfully genotyped for 95% or more of the individuals in the study. Error rate estimates for SNPs meeting the quality control benchmarks were determined to be less than 0.2%.
All SNPs were tested for deviations from Hardy-Weinberg equilibrium (HWE) in the affected and unaffected race stratified groups. No such deviations were observed.
Additionally, linkage disequilibrium between pairs of SNPs was assessed using the Graphical Overview of Linkage Disequilibrium (GOLD) package [7] and displayed using Haploview[8]. Allelic association in CATHGEN was examined using multivariable logistic regression modeling adjusted for race and sex, and also for race, sex, and known CAD risk factors (history of hypertension, history of diabetes mellitus, body mass index, history of dyslipidemia, and smoking history) as covariates. These adjustments could hypothetically allow us to control for competing genetic pathways that are independent risk factors for CAD, therefore allowing us to detect a separate CAD genetic effect. SAS 9.1 (SAS Institute, Cary, N.C.) was used for statistical analysis. The haplo.stats package was used to identify and test for association of haplotypes in CATHGEN. Haplo.stats expands on the likelihood approach to account for ambiguity in case-control studies by using a generalized linear model (GLM) to test for haplotype association which allows for adjustment of non-genetic covariates [9]. This method derives a score statistic to test the null hypothesis of no association of the trait with the genotype. In addition to the global statistic, haplo.stats computes score statistics for the components of the genetic vectors, such as individual haplotypes.
Results from these experiments are shown in Tables 3-5. The SNP represented by SEQ ID NO:188 contains a five-base pair deletion relative to the wild-type sequence. As used herein, the term SNP also includes this polymorphism having the five-nucleotide deletion. “RK” indicates rank in predicting CAD, with the most predictive genes having a lower number; “CH” indicates the chromosome in which the gene locus resides in the human genome.
This application claims the benefit of the filing date of U.S. Application No. 60/735,694, filed Nov. 10, 2005, entitled “METHODS OF DETERMINING THE RISK OF DEVELOPING CORONARY ARTERY DISEASE.” The entire teachings of the referenced application are herein incorporated by reference.
The invention described herein was supported, in whole or in part, by the National Institute of Health Grant Nos. P01-HL73042 and R01-HL073389. The United States government has certain rights in the invention.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US06/43534 | 11/10/2006 | WO | 00 | 3/23/2009 |
Number | Date | Country | |
---|---|---|---|
60735694 | Nov 2005 | US |