Method for detecting a risk of cardiovascular disease

Information

  • Patent Application
  • 20070065863
  • Publication Number
    20070065863
  • Date Filed
    September 21, 2006
    18 years ago
  • Date Published
    March 22, 2007
    17 years ago
Abstract
Single nucleotide polymorphisms associated with serum C-reactive protein levels are disclosed. Also disclosed are methods of using these markers to predict the propensity for cardiovascular diseases and the time to first myocardial infarction.
Description
BACKGROUND OF THE INVENTION

Serum C-reactive protein (CRP) level has emerged as a prognostic marker that is functionally linked to cardiovascular disease, in particular coronary artery disease (CAD) and myocardial infarction (MI) (Ridker PM, et al., N Engl J Med 2000;342(12):836-43; Buffon A, et al., J Am Coll Cardiol 1999;34(5):1512-21; Haverkate F, et al., Lancet 1997;349(9050):462-6; Ridker PM, et al., Circulation 1998;98(8):731-3; Ridker PM, et al., N Engl J Med 1997;336(14):973-9; and Koenig W, et al., Circulation 2004;109(11):1349-53). In general, a higher serum CRP level indicates a greater risk of cardiovascular disease. A significant portion of the inter-individual variability in serum CRP is determined by genetic factors (Pankow JS, et al., Atherosclerosis 2001;154(3):681-9.; Vickers MA, et al., Cardiovasc Res 2002;53(4):1029-34). Most recently, Ridker, et al. demonstrated that reduction of CRP levels to below 2 mg/L through the use of statins results in clinically significant improved event-free survival (Ridker P, et al., New Eng J Med 2005;352(1):20-8).


Using a genome wide scan we identified linkage of serum CRP levels to chromosomes 5 in a data-set of 513 Caucasian families with MI. In the present invention, we sought to identify polymorphisms which affect serum CRP levels and evaluate the impact with regard to clinical phenotypes.


SUMMARY OF THE INVENTION

In one aspect, the present invention relates to a method of screening a human subject for propensity to develop a cardiovascular disease. The method involves (a) determining the status of a marker selected from single nucleotide polymorphism (SNP) marker IL44135 or another SNP marker in linkage disequilibrium with IL44135 in the genome of the human subject and (b) correlating the result from step (a) to the subject's propensity for developing a cardiovascular disease wherein on average subjects who carry the minor allele of IL44135 are less likely to develop a cardiovascular disease than subjects who do not carry the minor allele of IL44135.


In another aspect, the present invention relates to a method of correlating a human subject's serum or plasma C-reactive protein (CRP) level to the subject's genetic composition. The method involves (a) determining the status of a marker selected from single nucleotide polymorphism (SNP) marker IL44135 or another SNP marker in linkage disequilibrium with IL44135 in the genome of the human subject and (b) correlating the result from step (a) to the subject's serum or plasma CRP level wherein on average subjects who carry the minor allele of IL44135 have a lower serum or plasma CRP level than subjects who do not carry the minor allele of IL44135.


In another aspect, the present invention relates to a method of screening a human subject for propensity to develop a cardiovascular disease. The method involves (a) genotyping the genome of the human subject for SNP marker IL44135 or another SNP marker that is in linkage disequilibrium with IL44135, (b) genotyping the genome of the human subject for SNP marker CRP2667 or another SNP marker that is in linkage disequilibrium with CRP2667, and (c) correlating the result from steps (a) and (b) to the subject's propensity for developing a cardiovascular disease wherein individuals who are homozygous for the common allele of both IL44135 and CRP2667 are more likely to develop a cardiovascular disease than individuals who are heterozygous for one and homozygous for the other of IL44135 and CRP2667, who are in turn more likely to develop a cardiovascular disease than individuals who are homozygous for the minor allele of both IL44135 and CRP2667.


In another aspect, the present invention relates to a method of correlating a human subject's serum or plasma CRP level to the subject's genetic composition. The method involves (a) genotyping the genome of the human subject for SNP marker IL44135 or another SNP marker that is in linkage disequilibrium with IL44135, (b) genotyping the genome of the human subject for SNP marker CRP2667 or another SNP marker that is in linkage disequilibrium with CRP2667, and (c) correlating the result from steps (a) and (b) to the subject's serum or plasma CRP level wherein on average individuals who are homozygous for the common allele of both IL44135 and CRP2667 have a higher serum or plasma CRP level than individuals who are heterozygous for one and homozygous for the other of IL44135 and CRP2667, who are in turn have a higher serum or plasma CRP level than individuals who are homozygous for the minor allele of both IL44135 and CRP2667.


In still another aspect, the present invention relates to a method of screening a human subject for predicting time to first myocardial infarction (MI). The method involves (a) determining the status of a marker selected from SNP marker IL41916 or another SNP marker in linkage disequilibrium with IL41916 in the genome of the human subject and (b) correlating the result from step (a) to time to first myocardial infarction wherein subjects who carry the minor allele of IL41916 are likely to have myocardial infarction for the first time at an older age than subjects who do not carry the minor allele of IL41916.




BRIEF DESCRIPTION OF THE DRAWINGS


FIGS. 1
a-1n (SEQ ID NO:3) show the annotated DNA sequence for the IL4 region with SNPs. The amino acid sequence in FIGS. 1a-1n is provided in the sequence listing as SEQ ID NO:7.



FIGS. 2
a-2k (SEQ ID NO:4) show SNPs in the IL4 region with flanking sequences, including a detailed description of the base pair changes using the IUPAC letters (i.e., R=A or G, Y=C or T, M=C or A, K=T or G, W=T or A, and S=G or C).



FIG. 3 shows that multipoint linkage analysis identified a region suggestive for linkage on chromosome 5 with a peak LOD score of 2.23 (p<0.001). Top panel: Ln CRP multipoint LOD plot for chromosome 5. Dashed line represent original linkage signal. Solid line represents LOD scores from conditional linkage analysis. Middle panel: Haploview plot for 45 SNPs in the candidate linkage region. Lower panel: P-values in full family set for selected SNPs with positive association.



FIG. 4 shows mean (±SD) InCRP levels by genotype (IL44135 and CRP2667) in affected individuals.



FIG. 5A shows the difference in relative luciferase activity (RLU) for IL44135 in both HepG2 and Jurkat T cells. RLU is adjusted for transfection efficiency. Significant higher RLU was detected for the minor/rare allele in both cell lines.



FIG. 5B shows the difference in relative luciferase activity (RLU) for IL41916 in both HepG2 and Jurkat T cells. RLU is adjusted for transfection efficiency. Significant higher RLU was detected for the minor/rare allele in Jurkat T cells while there was no significant difference in HepG2 cells. This is consistent with published reports showing that the TFBS located at the IL41916 plays a role in lymphocyte development.



FIG. 6 shows SNP2667 within the CRP gene.




DETAILED DESCRIPTION OF THE INVENTION

Using a positional candidate gene approach, we evaluated single-nucleotide polymorphisms (SNPs) in candidate genes and intergenic regions within a region of human chromosome 5q31. The functional effects of these SNPs were evaluated using reporter-gene luciferase assays. Cox Proportional hazards modeling was used to demonstrate the ability of the SNPs to predict a clinical relevant phenotype, such as time to first myocardial infarction (MI).


In the present invention we disclose significant associations of SNPs within a haplotype block located in the IL4-IL13 intergenic region of human chromosome 5 with serum C-reactive protein (CRP) levels. These SNPs are IL44135, IL41916, IL41158, IL42243, and IL43900. Given that serum CRP level is a prognostic marker that is functionally linked to cardiovascular diseases (Ridker PM, et al., N Engl J Med 2000;342(12):836-43; Buffon A, et al., J Am Coll Cardiol 1999;34(5):1512-21; Haverkate F, et al., Lancet 1997;349(9050):462-6; Ridker PM, et al., Circulation 1998;98(8):731-3; Ridker PM, et al., N Engl J Med 1997;336(14):973-9; and Koenig W, et al., Circulation 2004;109(11):1349-53), SNPs identified here are useful markers for assessing the risk of cardiovascular diseases such as stroke, coronary artery disease (e.g., myocardial infarction or heart attack), end stage renal disease, and peripheral artery disease. The identified SNPs can also serve as potential targets for drug development and diagnostic tools to investigate why patients respond differently to drugs affecting CRP levels.


In particular, we demonstrate that individuals who carry the minor allele of IL44135 have on average a lower serum CRP level and thus a lower likelihood of developing a cardiovascular disease than individuals who do not carry the minor allele of IL44135. By the minor allele (used interchangeably with the term “rare allele”) of IL44135, we mean that the nucleotide at nucleotide position 4135 of the IL4-IL13 intergenic region of human chromosome 5 is “A” (Table 1). The corresponding position for the common allele of IL44135 has a “G” (Table 1).

TABLE 1GAGTGCTTGG TGAGTGGGAG GAAGATGCTG GCCATGGGGC CCAGGGCGGG1900var (1916):[A:0.03]GAGCCCCTTG GCACCT/ACGGG AACCCCAGCC CAGGAGGTTT CACTGGAAGA1950GAGGCTGGGC TTGAGTGAGA AGTGAGACAC ACGCGAGTTT CCGGTGAACT2000CGGCACCAAA CACCTCAGTT TGCTGCTCAG ATGAGGTGTC AGAAAATGCT2050(SEQ ID NO:1)GGGGCTG CACAGCAGGG AGAGTGCTGT GTTATGCGAGvar (4135):[A:0.14]GAGGTTGGAG AAATCCTCCC CATGAGATAA GATGG/AGAACA GAGATCGGGA4150CGAAACAAGG GGAGGGGACA(SEQ ID NO:2)
The bolded numbers to the right indicate the nucleotide number within the IL4-IL13 haplotype block. The bolded letters indicate the nucleotide variation. The bracketed numbers indicate the frequency of the variation.



FIGS. 1
a-1n and 2a-2k are supplied to further illustrate the SNPs. FIGS. 1a-1n (SEQ ID NO:3) show the annotated DNA sequence for the IL4 region with SNPs. FIGS. 2a-ak (SEQ ID NO:4) show SNPs in the IL4 region with flanking sequences, including a detailed description of the base pair changes using the IUPAC letters (i.e., R=A or G, Y=C or T, M=C or A, K=T or G, W=T or A, and S=G or C).


We also demonstrate an interaction between IL44135 and SNP CRP2667 in the CRP gene. In particular, individuals who are homozygous for the common allele of both IL44135 and CRP2667 have the highest mean serum level of CRP while those who are homozygous for the minor allele of both have the lowest levels. Individuals who are heterozygous for one and homozygous for the common allele of the other of IL44135 and CRP2667 have intermediate serum CRP levels. SNP CRP2667 is known in the art. For example, it is known that the common allele of CRP2667 has a “G” at position 2667 of the CRP gene and the minor allele has a “C” at that position.


We further demonstrate here that individuals who carry the minor allele of IL41916 develop myocardial infarction (MI) for the first time at an older average age than individuals who do not carry the minor allele of IL41916. By the minor allele of IL41916, we mean that the nucleotide at nucleotide position 1916 of the IL4-IL13 intergenic region of human chromosome 5 is “A” (Table 1 and FIGS. 1a-1n and 2a-2k). The corresponding position for the common allele of IL41916 has a “T” (Table 1 and FIGS. 1a-1n and 2a-2k).


IL44135 and IL41916 are located in transcription factor binding sites and we demonstrate the functional importance of these two SNPs using in vitro transfection assays which show that the two SNPs manifest allele-specific transcription factor binding site activities.


In one aspect, the present invention relates to a method of correlating a human subject's serum or plasma CRP level to the subject's genetic composition. The method involves (a) determining the status of a marker selected from single nucleotide polymorphism (SNP) marker IL44135 or another SNP marker in linkage disequilibrium with IL44135 in the genome of the human subject and (b) correlating the result from step (a) to the subject's serum or plasma CRP level wherein on average subjects who carry the minor allele of IL44135 have a lower serum or plasma CRP level than subjects who do not carry the minor allele of IL44135. The serum or plasma CRP level of the subject can be provided or measured before, at the same time, or after the status of said SNP marker or markers are determined. By “determining the status of a SNP,” we mean one or more of the following: (i) determining whether an individual carries an allele (e.g., a minor or the common allele) of interest of the SNP, (ii) determining whether an individual is heterozygous or homozygous for a specific allele of interest of the SNP (i.e., genotyping), and (iii) determining which the specific allele or alleles of the SNP that an individual carries.


In another aspect, the present invention relates to a method of screening a human subject for propensity to develop a cardiovascular disease. The method involves (a) determining the status of a marker selected from single nucleotide polymorphism (SNP) marker IL44135 or another SNP marker in linkage disequilibrium with IL44135 in the genome of the human subject; and (b) correlating the result from step (a) to the subject's propensity for developing the cardiovascular disease wherein subjects carrying the minor allele of IL44135 are less likely to develop the cardiovascular disease than subjects who do not carry the minor allele of IL44135.


In another aspect, the present invention relates to a method of correlating a human subject's serum or plasma CRP level to the subject's genetic composition. The method involves (a) genotyping the genome of the human subject for SNP marker IL44135 or another SNP marker that is in linkage disequilibrium with IL44135, (b) genotyping the genome of the human subject for SNP marker CRP2667 or another SNP marker that is in linkage disequilibrium with CRP2667, and (c) correlating the result from step (a) to the subject's serum or plasma CRP level wherein on average individuals who are homozygous for the common allele of both IL44135 and CRP2667 have a higher serum or plasma CRP level than individuals who are heterozygous for one and homozygous for the other of IL44135 and CRP2667, who are in turn have a higher serum or plasma CRP level than individuals who are homozygous for the minor allele of both IL44135 and CRP2667.


In another aspect, the present invention relates to a method of screening a human subject for propensity to develop a cardiovascular disease. The method involves (a) genotyping the genome of the human subject for SNP marker IL44135 or another SNP marker that is in linkage disequilibrium with IL44135, (b) genotyping the genome of the human subject for SNP marker CRP2667 or another SNP marker that is in linkage disequilibrium with CRP2667, and (c) correlating the result from step (a) to the subject's propensity for developing the cardiovascular disease wherein individuals who are homozygous for the common allele of both IL44135 and CRP2667 are more likely to develop the cardiovascular disease than individuals who are heterozygous for one and homozygous for the other of IL44135 and CRP2667, who are in turn more likely to develop the cardiovascular disease than individuals who are homozygous for the minor allele of both IL44135 and CRP2667. The serum or plasma CRP level of the subject can be provided or measured before, at the same time, or after the status of said SNP markers are genotyped.


In still another aspect, the present invention relates to a method of screening a human subject for predicting time to first myocardial infarction (MI). The method involves (a) determining the status of a marker selected from SNP marker IL41916 or another SNP marker in linkage disequilibrium with IL41916 in the genome of the human subject, and (b) correlating the result from step (a) to time to first myocardial infarction wherein subjects carrying the minor allele of IL41916 are likely to have myocardial infarction for the first time at an older age than subjects who do not carry the minor allele of IL41916.


There are many methods to analyze SNPs. For example, one can obtain a nucleic acid sample such as a DNA sample for an individual and determine the presence of the SNPs with any the following methods: The SNPs can be assayed using standard DNA sequencing technology such as ABI dye terminator chemistry on an ABI sequencer. The SNPs are also amenable to detection using any standard SNP genotyping platform such as TaqMan (ABI), mass spectroscopy (Sequenome), various single base pair extension assays, and chip based genotyping platforms (Affymetrix) or bead array platforms (Ilumina).


When we indicate that one would determine the status of a SNP, we mean that one could determine the status of the SNP directly by examining the nucleotide position of the SNP itself or indirectly by determining the status of one or more markers (e.g., SNP markers) that are in linkage disequilibrium with the SNP.


One of skill in the art would understand that there are many ways to evaluate the linkage between two or more markers such as SNPs. Suitable metrics are described in Hedrick, P.W., Genetics 117(2):331-341,1987. For the purposes of the present invention, markers are in a suitable linkage disequilibrium if D′ is greater than 0.8.


The invention will be more fully understood upon consideration of the following non-limiting example.


EXAMPLE
Functional Polymorphisms in the IL4-IL13 Intergenic Region Influence CRP Levels and Time to First MI

In this example, we demonstrate the positional identification of genetic variation within the IL4-IL13 intergenic region influencing serum CRP levels. Using linkage analysis followed by positional candidate gene association analysis, we identified five single nucleotide polymorphisms within a single haplotype block in the IL4-IL13 intergenic region, being associated with CRP levels in a large myocardial infarction family-set. We further demonstrate a joint effect on high sensitivity serum CRP (hsCRP) levels between SNP IL44135 and previously identified SNPs in the CRP gene. Both the association and the combined effect of both polymorphisms were replicated in a second independent population-based study sample. In addition, we demonstrate that the identified SNPs attenuate our initial CRP linkage signal, indicating that these polymorphisms are the underlying correlate of the initial QTL. We also demonstrate that these SNPs alter reporter-gene expression levels. Lastly, we demonstrate that one of the SNPs, together with other established risk factors, significantly affects the age of first MI in the MI family set. While the studies presented here were conducted with Caucasian families, it is expected that the observations are applicable to other populations such as the human population in general given that the functions of CRP and its connection to cardiovascular diseases are conserved across the human population in general.


Methods


A. Subjects and Phenotyping.


All study participants gave written informed consent and the study was approved by the ethics committee at the Medical College of Wisconsin and University of Regensburg, Germany.


B. MI Family Set.


Subject ascertainment and phenotyping, MI/CAD: An in-depth description of the patient ascertainment strategy and the clinical characteristics of the study population have been described in (Broeckel U, et al., Nat Genet 2002;30(2):210-4). Briefly, Western European families were included in the study if probands had suffered from an MI (as documented by criteria chosen according to the published definitions of the MONICA (Monitoring Trends and Determinants in Cardiovascular Disease) investigators of the World Health Organization (WHO)) before the age of 60 and affected siblings had an MI or had undergone percutaneous transluminal coronary angioplasty (PTCA) or bypass grafting (CABG). Time to the first MI was determined based on medical records.


High sensitivity serum CRP (hsCRP) measurements: Blood was obtained from individuals without signs of overt infection or unstable angina. The time between an acute MI and blood collection was at least 2 months. Serum was stored at −80° C. and thawed only once for analysis. hsCRP was quantified by means of particle-enhanced immunonephelometry according to the instructions of the manufacturer (Dade Behring Nephelometer Systems). The lower limit of detection was 0.2 mg/L. The intra- and inter-assay variabilities were 3.1 and 2.5%.

TABLE 2Clinical Characteristics of MI Family Data-set.UnaffectedMI PatientsAffected SibsSibsN = 513N = 618N = 238Characteristic(μ ± SD)(μ ± SD)(μ ± SD)Age at Ascertainment60.0 ± 0.4 62.4 ± 0.3 59.6 ± 0.5MI/PTCA/CABG (N)513/216/267408/235/353Age at First MI51.6 ± 0.4 55.1 ± 0.5Gender (% male)80.978.641.8InCRP Level0.61 ± 1.2 0.79 ± 1.18 0.63 ± 1.13Diabetes Mellitus (%)14.416.38.4BMI27.1 ± 0.2 27.2 ± 0.1 27.2 ± 0.2Cigarette Smoking73.170.248.9(%)Systolic BP (mmHg)141 ± 1 144 ± 1148 ± 1Diastolic BR (mmHg)83 ± 1 84 ± 1 86 ± 1Total Cholesterol228 ± 2 235 ± 2256 ± 3(mg dl−1)LDL Cholesterol159 ± 2 163 ± 2183 ± 3(mg dl−1)HDL Cholesterol48 ± 1 50 ± 1 54 ± 1(mg dl−1)Lipid-lowering57.753.812.2Medication (%)Anti-hypertensive82.882.142.6Medication (%)


C. Genome Scan and Linkage Analysis.


DNA was extracted from peripheral blood lymphocytes (Gentra Puregene DNA extraction kit). Genotyping for the genome scan analysis was performed by the Mammalian Genotyping Service, Marshfield Medical Clinic with a total of 394 microsatellite markers covering all autosomal chromosomes with an average distance of 10 cM (screening set 10). Variance component linkage analysis was performed using the SOLAR genetic analysis program (Almasy L, et al., Am J Hum Genet 1998;62(5):1198-211). The initial analyses screened for general covariables including age, gender, diabetes, cigarette smoking, BMI, LDL, systolic and diastolic hypertension, MI, aspirin use, statin use, blood pressure medication, and hormone replacement therapy. Only significant covariables (p<0.1) were retained for the subsequent linkage analysis. Diabetes mellitus was defined by the use of anti-diabetic medication (oral anti-diabetics or insulin injections) or by a concentration of glycosylated hemoglobin (HbA1c) ≧6.5%. Smoking was categorized as either current/former smoker or never smoker at the time of MI. We defined systolic hypertension as systolic blood pressure ≧140 mmHg and diastolic hypertension as ≧90 mmHg irrespective of the intake of antihypertensive medication. This conservative approach in defining hypertension reduces the probability that an individual is falsely categorized as hypertensive. To account partially for the non-random sampling, we conditioned the likelihood of a family on the MI phenotype of the initial proband. P-values were estimated empirically by using simulation methods incorporated into SOLAR. The heritability of hsCRP (high sensitivity serum CRP) was estimated at 0.31 ±0.07 (p=0.0000015) under a model adjusted for the significant covariables age, gender, diabetes, body mass index (BMI), and smoking.


D. SNP Identification and Genotyping.


Haplotype-tagging SNPs with a minor allele frequency (MAF) ≧5% for each of the candidate genes were selected based on the CEPH data from the SeattleSNPs Program for Genomic Applications (SeattleSNPs. NHLBI Program for Genomic Applications, UW-FHCRC, Seattle, Wash. (URL: http:/lpga.gs. washington.edu). The tested genes included IL-3, IL-4, IL-5, IL-9, IL12B, IL-13, IL17B, CRP, and Colony Stimulating Factor-2 (CSF2).) SeattleSNPs uses a unique algorithm to determine haplotype-tagging SNPs within a gene (Carlson CS, et al., Nat Genet 2003;33(4):518-21). Briefly, haplotype-tagging SNPs are determined based on a binning algorithm which identifies the single SNP that exceeds a threshold level of linkage disequilibrium (as determined by the measure of r2) with the maximum number of other SNPs. This group of SNPs is set as a bin. Each SNP within a bin is then analyzed to determine whether it exceeds the threshold level of LD with all other SNPs in that bin. All SNPs within a bin that meet this criterion are designated as TagSNPs. SNPs are referred to by their sequence coordinate based on SeattleSNPs' sequencing data. Genotyping was performed using the Applied Biosystems' (ABI, Foster City, Calif.) TaqMan technology. Initial SNPs within each of the five conserved non-coding sequence (CNS) regions, as described by Loots et al. (Science 2000;288(5463):136-40), were identified in 24 unrelated Caucasian individuals (12 cases with MI and 12 controls without MI) from the German population using direct fluorescence-based sequencing incorporating Big Dye Terminator chemistry (ABI, Foster City, Calif.). A Polyphred quality score of 20 was used in SNP identification. Identified SNPs with a MAF≧5% were then genotyped in the families using the TaqMan platform.


E. Statistical Analysis.


Linkage Disequilibrium Estimation: The analysis program Haploview (version 3.11) was used to calculate and visualize linkage disequilibrium and haplotype-block patterns of the genotyped SNPs (Barrett J C, et al., Bioinformatics 2005;21 (2):263-5). Haplotype blocks were determined based on the confidence interval method, as implemented in Haploview.


Association Analysis: hsCRP levels were log transformed to account for the non-normal distribution. The QTDT program was used for the quantitative family-based single SNP association analysis (Abecasis G, et al., Am J Hum Genet 2000;66:279-92). The initial association analysis was performed in a subset of the families which contributed to the LOD score based on the calculation of family-specific LOD scores for the peak position of the initial linkage signal as implemented in the genetic analysis software package SOLAR (Almasy L, et al., Am J Hum Genet 1998;62(5):1198-211). SNPs which showed a significant association (p<0.05) were subsequently typed in the full family set. P-values for single SNP association analysis were adjusted for age, gender, body-mass index (BMI), presence or absence of diabetes, smoking status, presence or absence of CAD/MI, lipid medication, total cholesterol:HDL ratio, and the three SNPs within the CRP gene demonstrating significant associations with CRP levels (CRP-2667, CRP-3872, CRP-6192). The PBAT software with offsets determined by genetic effect size was used to examine interaction between the chr. 5 IL4-IL1 3 SNPs and the chr. 1 CRP SNPs (Lange C, et al., Am J Hum Genet 2004;74(2):367-9). The difference in mean InCRP levels among the different genotypic categories of the IL4-IL13 intergenic region SNPs and CRP SNPs was tested with an One-way ANOVA test as implemented in SigmaStat v. 2.03.


Conditional Linkage Analysis/Quantitative Trait Nucleotide Analysis: SNPs demonstrating a positive association with CRP levels in the full dataset were subsequently incorporated into our original linkage model to test if they could account for the initial linkage signal observed on chr. 5 using the measured genotype approach. The measured genotype approach utilizes the variance not accounted for in modeling of genetic effects (QTL effects and residual additive genetic effects) and measured covariates (e.g. age, sex, race etc.) to test whether a particular polymorphism accounts for an observed QTL. We specified an additive model in which the heterozygote mean is intermediate to the two homozygote means. If the measured genotype (e.g. the SNP) is a variant influencing the trait, the IBD allele sharing will provide no additional information and the LOD score will be reduced (Almasy L, et al., Behav Genet 2004;34(2):173-7). These analyses were performed using the program package SOLAR, in which the SNP genotypes were added as covariates to the original linkage model.


Cox-Proportional Hazards Modeling: In order to evaluate the influence of the significant SNPs on time to first MI, Cox proportional hazard regression models were performed with adjustments for possible dependencies within families using a frailty model. In a frailty model, each member of the family shares a common unobservable random effect that modifies their hazard rate by a common multiplicative factor and this random effect was modeled by a gamma distribution (Hougaard P. New York: Springer-Verlag; 2000; Neale MC, et al., Dordrecht: Kluwer Academic Press; 1992).


Our approach was to fit a series of regression models, each with an appropriate frailty. For each possible variable to be entered in the regression model, we determined proportional hazards and checked model fit by using appropriate residual plots (Klein JP, et al., New York: Springer-Verlag; 1997).


The first set of models includes the identified risk factors measured on the subject (e.g., gender, smoking history, diabetes history, etc.), which were adjusted for in subsequent models. Next, we fitted models that included CRP information with indicators of the possible genetic predictors of CRP. At each step we examined not only the significance of the risk coefficients, but also the strength of association between family members.


Variables with p-values of less than 0.05 were considered statistically significant. Analysis was carried out with SAS version 9 and S-PLUS version 7.0 statistical software.


F. Functional Analyses of IL4-IL13 Intergenic Region SNPS.


Promoter-Variant Vector Construction: PCR primers were designed with unique SacI (forward) and KpnI/XhoI (reverse) sites up and downstream of the SNPs to be amplified. Following amplification, PCR products of approximately 300-1000bp 5′ and 3′ of each SNP were cloned into SacI/KpnI/XhoI digested pGL3-promoter and enhancer luciferase vectors (Promega, Madison, Wis.). All constructs were sequence-verified for the presence of each allele respectively using direct fluorescence-based sequencing on an ABI 3730 sequencer.


Cell Culture: HepG2 cells were grown in 0.1 micron- filtered DMEM including high glucose, L-glutamine, pyridoxine hydrochloride and no sodium pyruvate. 15% FBS, 1% L-glutamine, and 1% antimycotic/antibiotic were added and filter sterilized. Cells were grown to 90-95% confluency in 250 mL polystyrene Falcon culture flasks. To split, cells were washed with 1× Phosphate Buffered Solution and then incubated at 37° C. in 0.25% Trypsin solution for 2-5 minutes. Serum-containing growth medium was then added and cells were split into appropriate containers. Jurkat T-cells were grown in 75cm2 plug seal culture flask (Fisher Scientific) with RPMI media (Invitrogen, Carlsbad, Calif.) supplemented with 10% FBS, 1% antibiotic-antimycotic, 1% Sodium Pyruvate, and 1% 1 M HEPES buffer filter sterilized. Cells were split at 80-95% confluency.


Transient transfection luciferase assays: HepG2 cells were seeded in 12-well polystyrene culture plates to 90-95% confluency. Jurkat T-cells were seeded in 12-well plates at a concentration of 1.2×106/mL in 1 mL of cell growth media containing serum and antibiotics. HepG2 cells were transfected with Lipofectamine 2000 (Invitrogen, Carlsbad, Calif.) at a ratio of 1:3 DNA to transfection reagent with equal amounts of pGL3 experimental and phRG-TK renilla plasmids (to control for transfection efficiency). Jurkat T-cells were transfected following the Qiagen Superfect protocol using a DNA to transfection reagent ratio of 1:4 with equal amounts of experimental and renilla plasmids. Cells were incubated for 48 hours (Jurkat T-cells) or 72 hours (HepG2 cells) and then assayed for reporter gene expression using the Dual-Glo Luciferase Assay System (Promega, Madison, Wis.). All experiments were performed each in triplicate in at least two independent determinations.


Results


A. Linkage and Association Analysis.


Multipoint linkage analysis identified a region suggestive for linkage on chromosome 5 flanked by the markers D5S1505 and D5S1456 with a peak LOD score of 2.23 (p<0.001) close to D5S1480 (FIG. 3). Following this initial lead, we typed in this linkage region forty haplotype-tagging SNPs with a minor allele frequency (MAF) ≧5% across eight inflammatory candidate genes and five conserved non-coding regions in a subset of 224 families comprised of 712 individuals contributing to the LOD score. Single SNP analysis identified two SNPs located within the IL4-IL13 intergenic region which demonstrated significant associations (p<0.05) with CRP levels. Nine additional markers in strong linkage disequilibrium (LD) with the two positive SNPs were typed, resulting in the identification of three additional significant markers. These five SNPs fall into one LD block comprising half of the IL4-IL13 intergenic region (FIG. 3), including the conserved non-coding sequence element-1 (CNS-1) (Loots GG, et al., Science 2000;288(5463): 136-40).


CNS-1 has previously been shown to act as an enhancer element coordinately regulating interleukins 4, 5, and 13 in mice (Loots GG, et al., Science 2000;288(5463):136-40). In order to identify SNPs within this enhancer element, we re-sequenced CNS-1 in the above subset of contributing individuals (N=712). However, only one rare variant with a minor allele frequency (MAF) of 0.5% was found. Given this extremely low allele frequency, it is unlikely that this SNP plays a functional role within our population.


Subsequent single-locus analysis of the five positive SNPs within the full 513 families revealed significant evidence of association with plasma CRP levels (IL41158, p=0.0008; IL41916, p=0.0058; IL42243, p=0.0015; IL43900, p=0.0033; and IL44135, p=0.0013). Table 3 summarizes the results of the subset and full dataset single SNP analysis.

TABLE 3Significant Results (p-values) of Association Analysis in the IL4-IL13Intergenic Region in a Subset of Informative Families and the Full Dataset(No. of families) using QTDT.SubsetFull DatasetSubsetFull Dataset(N = 224)(N = 513)(N = 224)(N = 513)Crude Analysis*Fully AdjustedSNP(p-values)Model (p-valuesIL4_11580.00920.00500.00160.0008IL4_19160.00850.00790.01050.0058IL4_22430.00500.01040.00140.0015IL4_39000.01780.03180.00220.0033IL4_41350.00970.00970.00190.0013
*The fully adjusted model includes age, gender, smoking, diabetes, BMI, CAD/MI, TC:HDL ratio, lipid medication, and the three significant CRP gene SNPs (CRP_2667, CRP_3872, CRP_6192) with which we previously detected an association for CRP levels.


B. Interaction with CRP SNPs.


Since polymorphisms in the CRP gene have been previously associated with hsCRP levels, we aimed to determine if these SNPs also play a role in this population. We detected a positive association signal for three of the six haplotype tagging SNPs in the CRP gene (CRP2667, rs1800947, p=0.0006; CRP3872, rs1205, p=0.0109; and CRP6192, rs3093075, p=0.0169; CRP969, CRP1440, CRP1919, n.s.).


Consequently, we next examined a possible interaction between these three CRP SNPs and the newly identified significant SNPs in the IL4-IL13 intergenic region. Using a multivariate extension of the score-based FBAT statistic, as incorporated in the software package PBAT (Lange C, et al., Am J Hum Genet 2004;74(2):367-9), we identified in our full family set a significant interaction between the intergenic region IL44135 SNP and the CRP gene SNPs 2667 and 6192 (p-value for FBAT-I=0.0318; adjusted for age, gender, diabetes, smoking, BMI). In addition, we examined mean InCRP levels by genotype in unrelated affected siblings from our family set (FIG. 4). Overall, we detected a significant difference between the four groups (One Way ANOVA, p=0.007). Individuals homozygous for the common allele of both the CRP2667 and IL44135 SNPs have the highest mean level of InCRP while those who are compound heterozygotes have the lowest levels. The minor/rare allele of IL44135 reduces mean InCRP levels by approximately 45% in homozygotes of CRP2667 compared to individuals homozygous for both SNPs (p<0.05). Furthermore, being heterozygote for CRP2667 and homozygous for IL44135 reduces InCRP values 66% compared to those homozygous for both SNPs (p<0.05) (FIG. 4).


C. Conditional Linkage Analysis.


The next logical step was to determine if these SNPs could in fact account for our initial linkage and could therefore explain the underlying genetic variance. We incorporated all significant SNPs into our original linkage model and then re-evaluated the evidence for linkage, as a model incorporating the causal marker(s) should significantly reduce the initial LOD score. This included also the SNPs in the CRP gene given the combined effect between IL4 and CRP polymorphisms. The largest reduction in the initial LOD score of 2.24 was achieved with the addition of two of the IL4-IL13 SNPs, IL41916 and IL4-4135, and the three CRP SNPs to the conditional linkage model, resulting in a LOD score of 0.68 (FIG. 3: conditional linkage LOD scores in solid line compared to initial plot).


D. In silico and In vitro Functional Analyses.


Prediction of transcription factor binding sites: Since these significant SNPs lie in an intergenic region previously associated with immune regulatory function mediated by various transcription factors, we used transcription factor binding site (TFBS) prediction analysis, as incorporated into Matlnspector (Cartharius K, et al., Bioinformatics 2005;21(13):2933-42), to test whether these SNPs reside within TFBS. Based on this analysis, IL41916 lies within two overlapping predicted binding sites, an NF-KappaB p65 site and an Ikaros-1 site. The crucial role of NF-kappaB in inflammation, immune response, cell proliferation, and differentiation is well-established (Siebenlist U, et al., Nat Rev Immunol 2005;5(6):435-45) and Ikaros proteins are critical factors in T lymphocyte development (Georgopoulos K. Nat Rev Immunol 2002;2(3):162-74). IL44135 resides within a predicted RBP-Jkappa/CBF-1 site and RBP-Jkappa has recently been shown to regulate IL4 gene transcription as part of the Notch signaling pathway (Amsen D, et al., Cell 2004;1 17(4):515-26). The other SNPS in the haplotype block which also showed a positive association signal did not fall into predicted TFBS. Therefore, we performed the subsequent functional analysis only with these two most likely SNPs.


Molecular analysis of SNPs: In order to complement the computational prediction for TFBS, we employed reporter-gene luciferase assays to determine the allele-specific functional relevance of the SNPs via transient transfection assays in both HepG2 and JurkatT- cells. Each of the two alleles for the two significant SNPs (IL41916 and IL44315) was cloned into a pGL3 luciferase-reporter plasmid. Consistent with the results of our previous analyses, allele-specific differences in reporter gene expression were demonstrated for both markers. The minor/rare allele of IL44135 showed a significantly higher luciferase activity in an SV40 promoter vector in comparison to the common allele in HepG2, as well as Jurkat T-cells (FIG. 5A). The minor/rare allele of IL41916 also showed an increased luciferase activity using an enhancer vector in Jurkat T-cells, while we did not detect a difference in HepG2 cells (FIG. 5B).


E. Clinical Relevance of Identified SNPs—Time to first MI Analysis.


Given that individuals with increased CRP levels may have an accelerated time to symptom onset and cardiovascular events, we tested the effect of these SNPs on time to first MI in our family set. Using Cox-Proportional Hazards modeling employing a frailty model to adjust for familial correlation, IL41916 was shown to significantly predict the time to first MI independent of traditional risk factors, including gender, hyperlipidemia, diabetes and smoking (p=0.018). The minor/rare allele of IL41916 exerts a protective effect, conferring a significant 28% reduction in the hazard ratio for MI (HR=0.716, 95% C.I. 0.544-0.944) after adjusting for significant risk factors. Individuals carrying the minor/rare allele of IL41916 have a mean age at first MI of 59 years, while those with the common allele have a significantly younger mean age at first MI of 56 years (p=0.017).


Although the invention has been described in connection with specific examples, it is understood that the invention is not limited to such specific examples but encompasses all such modifications and variations apparent to a skilled artisan that fall within the scope of the appended claims.

Claims
  • 1. A method of screening a human subject for propensity to develop a cardiovascular disease comprising the steps of: (a) determining the status of a marker selected from single nucleotide polymorphism (SNP) marker IL4—4135 or another SNP marker in linkage disequilibrium with IL4—4135 in the genome of the human subject; and (b) correlating the result from step (a) to the subject's propensity for developing a cardiovascular disease wherein subjects who carry the minor allele of IL4—4135 are less likely to develop a cardiovascular disease than subjects who do not carry the minor allele of IL4—4135.
  • 2. The method of claim 1, wherein the cardiovascular disease is selected from stroke, coronary artery disease, end stage renal disease, and peripheral artery disease.
  • 3. The method of claim 2, wherein the coronary artery disease is myocardial infarction.
  • 4. A method of correlating a human subject's serum or plasma C-reactive protein (CRP) level to the subject's genetic composition comprising the steps of: (a) determining the status of a marker selected from single nucleotide polymorphism (SNP) marker IL4—4135 or another SNP marker in linkage disequilibrium with IL4—4135 in the genome of the human subject; and (b) correlating the result from step (a) to the subject's serum or plasma CRP level wherein subjects who carry the minor allele of IL4—4135 have a lower average serum or plasma CRP level than subjects who do not carry the minor allele of IL4—4135.
  • 5. A method of screening a human subject for propensity to develop a cardiovascular disease comprising the steps of: (a) genotyping the genome of the human subject for SNP marker IL4—4135 or another SNP marker that is in linkage disequilibrium with IL4—4135; (b) genotyping the genome of the human subject for SNP marker CRP—2667 or another SNP marker that is in linkage disequilibrium with CRP—2667; and (c) correlating the results from steps (a) and (b) to the subject's propensity for developing a cardiovascular disease wherein individuals who are homozygous for the common allele of both IL4—4135 and CRP—2667 are more likely to develop a cardiovascular disease than individuals who are heterozygous for one and homozygous for the other of IL4—4135 and CRP—2667, who are in turn more likely to develop a cardiovascular disease than individuals who are homozygous for the minor allele of both IL4—4135 and CRP—2667.
  • 6. The method of claim 5, wherein the cardiovascular disease is selected from stroke, coronary artery disease, end stage renal disease, and peripheral artery disease.
  • 7. The method of claim 6, wherein the coronary artery disease is myocardial infarction.
  • 8. A method of correlating a human subject's serum or plasma C-reactive protein (CRP) level to the subject's genetic composition comprising the steps of: (a) genotyping the genome of the human subject for SNP marker IL4—4135 or another SNP marker that is in linkage disequilibrium with IL4—4135; (b) genotyping the genome of the human subject for SNP marker CRP—2667 or another SNP marker that is in linkage disequilibrium with CRP—2667; and (c) correlating the results from steps (a) and (b) to the subject's serum or plasma CRP level wherein individuals who are homozygous for the common allele of both IL4—4135 and CRP—2667 have a higher average CRP level than individuals who are heterozygous for one and homozygous for the other of IL4—4135 and CRP—2667, who are in turn have a higher average CRP level than individuals who are homozygous for the minor allele of both IL4—4135 and CRP—2667.
  • 9. A method of screening a human subject for predicting time to first myocardial infarction comprising the steps of: (a) determining the status of a marker selected from SNP marker IL4—1916 or another SNP marker in linkage disequilibrium with IL4—1916 in the genome of the human subject; and (b) correlating the result from step (a) to time to first myocardial infarction wherein subjects who carry the minor allele of IL4—1916 are likely to have myocardial infarction for the first time at an older age than subjects who do not carry the minor allele of IL4—1916.
CROSS-REFERENCE TO RELATED APPLICATION

This application claims the benefit of U.S. patent application Ser. No. 60/719,727, filed on Sep. 22, 2005.

Government Interests

This invention was made with United States government support from the NHLBI Grant No. HL-R01 074321. The United States government has certain rights in this invention.

Provisional Applications (1)
Number Date Country
60719727 Sep 2005 US