Genetic loci indicative of propensity for longevity and methods for identifying propensity for age-related disease

Information

  • Patent Application
  • 20040185471
  • Publication Number
    20040185471
  • Date Filed
    December 15, 2003
    20 years ago
  • Date Published
    September 23, 2004
    20 years ago
Abstract
This invention relates to genetic loci and methods for using polymorphic markers indicative of the propensity for longevity and the resistance to age-related disease. In particular, the invention defines a specified region of human chromosome 4 associated with exceptional longevity.
Description


FIELD OF THE INVENTION

[0003] This invention relates to polymorphic markers indicative of the propensity for longevity and methods for using such markers, and more particularly to genetic loci on a specified region of human chromosome 4 associated with exceptional longevity.



BACKGROUND OF THE INVENTION

[0004] It has long been suspected that a genetic predisposition exists for longevity in humans. For example, many studies have documented the inheritance of human longevity. McGue et al., J. Gerontol. Biol. Sci., 48: B237-44 (1993); Ljungquist et al., J. Gerontol. Biol Sci., 53: M441-46 (1998). Studies have also documented that centenarians (individuals who live for 100 years or more) are more likely than non-centenarians to have siblings who are long-lived. In particular, one study has shown that the siblings of centenarians have an approximately four-fold greater probability of survival to age 91 than siblings of non-centenarians. Perls et al., Lancet, 351: 560 (1998). A more recent study has shown that female siblings of centenarians have over an eight-fold greater probability of survival to age 100 than siblings of non-centenarians and male siblings of centenarians have over an eleven-fold greater probability of survival to age 97 than siblings of non-centenarians.


[0005] Individuals who achieve exceptional longevity, such as centenarians, tend to live the majority of their lives in excellent health, demonstrating a rapid decline only at the end of their lives. Hitt et al., Lancet, 354: 652 (1999). A relative lack of polymorphic variants associated with diseases of aging may be one prerequisite to achieving exceptional longevity. For example, the absence of genetic polymorphisms among centenarians is exemplified by the rarity of the apolipoprotein E ε-4 allele that has been associated with Alzheimer's disease and cardiovascular disease. Schächter et al., Nat. Genet., 6: 29-32 (1994); Rebeck et al., Neurology, 44: 1513-16 (1994). Another prerequisite to achieving exceptional longevity may be the ability to modulate the rate of the aging process, which also appears to have a genetic component. For example, one study has shown that the offspring of centenarians had more favorable lipid profile characteristics compared to ethnically matched and unmatched controls. Barzilai et al., J. Am. Geriatr. Soc., 49: 76-79 (2001). These and other observations indicate that there may be one or more genetic loci that influence longevity.


[0006] Genetic studies in other species including mammals indicate that specific genetic polymorphisms have powerful influences upon life span (defined by the age of the oldest member of the species). A number of studies on non-human species indicate that a relatively few genetic polymorphisms have a powerful influence upon the ability to achieve exceptional longevity. Many of those polymorphisms appear to play roles in basic mechanisms of metabolism and aging.


[0007] Only recently have researchers begun to investigate a genetic link to exceptional longevity. One approach to determining the significance of genes having an influence on longevity is to screen for polymorphisms in human homologs and to determine the allelic frequencies among specific human phenotypes, such as centenarians, and compare them to ethnically matched younger controls. Alternatively, case-control studies can be performed on a priori-selected candidate genes chosen because of their hypothesized roles in fundamental mechanisms of aging. These basic mechanisms might include modulation of genomic instability, for example by DNA repair and anti-oxidant defenses, gene expression, cell proliferation and senescence, maintenance of differential function, and signal transduction. The drawbacks of such studies include the improbability of picking the right gene to study out of the myriad known and unknown genes effecting the process of interest.


[0008] Knowledge of longevity-associated polymorphisms would not only provide the benefit of predicting individual longevity, but would also provide the ability to predict the likelihood of age-associated diseases. Making such predictions would allow early prophylaxis that would reduce the severity of or would eliminate the occurrence of such diseases. Accordingly, there is a need in the art for genetic loci associated with longevity and methods of using polymorphic markers indicative of the propensity for longevity. Such genetic loci and methods are provided herein.



SUMMARY OF THE INVENTION

[0009] This invention provides a genetic locus associated with extreme longevity. According to the invention, a specified region of human chromosome 4 is linked to the propensity for old age. In particular, the invention provides a longevity locus with a linkage at the D4S1564 marker (also referred to as AFM248zg9, 248zg9, and Z23817) on human chromosome 4. According to the invention, the presence of a familial linkage to the D4S1564 marker on human chromosome 4 is indicative of a polymorphic variant associated with increased likelihood for longevity. Detection of an inherited variant at the D4S1564 marker, or a polymorphism within the longevity locus containing the D4S1564 marker, is indicative of the propensity for extreme old age.


[0010] In a preferred embodiment, the invention provides a polymorphic marker contained in an approximately 10-20 cM region surrounding the D4S1564 marker. In a further embodiment, the invention provides a marker within 20 cM of position 112.6 on human chromosome 4. Also in a preferred embodiment, the invention comprises methods for detecting a longevity marker in a biological sample. Preferably, such methods comprise amplifying DNA in the region of human chromosome 4 comprising the D4S1564 marker. The amplification product comprises a region of human chromosome 4 that contains the locus associated with longevity. The common presence of a variant of the D4S1564 marker among related individuals, at least one of whom has lived to old age, is evidence of the presence of a polymorphic variant associated with the propensity for old age.


[0011] Inherited variants of the D4S1564 marker in association with extreme longevity is indicative of the propensity for old age and the likelihood of avoiding disease of old age. For example, the invention provides the ability to predict the propensity for diseases such as heart disease, cardiovascular disease, stroke, Alzheimer's disease, cancer, ocular disease, and numerous others associated with the aging process. Accordingly, methods of the invention are useful not only to predict the likelihood of longevity, but are also useful to indicate possible early therapeutic intervention to prevent or to lessen the effects of diseases associated with aging. Further details of the use of the invention are provided below in the detailed description thereof.







DESCRIPTION OF THE DRAWINGS

[0012] The invention is pointed out with particularity in the appended claims. The advantages of the invention described above, as well as further advantages of the invention, may be better understood by reference to the following detailed description taken in conjunction with the accompanying drawings, in which:


[0013]
FIG. 1 shows multipoint LOD scores for a genome scan on human chromosome 1.


[0014]
FIG. 2 shows multipoint LOD scores for a genome scan on human chromosome 2.


[0015]
FIG. 3 shows multipoint LOD scores for a genome scan on human chromosome 3.


[0016]
FIG. 4 shows multipoint LOD scores for a genome scan on human chromosome 4.


[0017]
FIG. 5 shows multipoint LOD scores for a genome scan on human chromosome 5.


[0018]
FIG. 6 shows multipoint LOD scores for a genome scan on human chromosome 6.


[0019]
FIG. 7 shows multipoint LOD scores for a genome scan on human chromosome 7.


[0020]
FIG. 8 shows multipoint LOD scores for a genome scan on human chromosome 8.


[0021]
FIG. 9 shows multipoint LOD scores for a genome scan on human chromosome 9.


[0022]
FIG. 10 shows multipoint LOD scores for a genome scan on human chromosome 10.


[0023]
FIG. 12 shows multipoint LOD scores for a genome scan on human chromosome 11.


[0024]
FIG. 12 shows multipoint LOD scores for a genome scan on human chromosome 12.


[0025]
FIG. 13 shows multipoint LOD scores for a genome scan on human chromosome 13.


[0026]
FIG. 14 shows multipoint LOD scores for a genome scan on human chromosome 14.


[0027]
FIG. 15 shows multipoint LOD scores for a genome scan on human chromosome 15.


[0028]
FIG. 16 shows multipoint LOD scores for a genome scan on human chromosome 16.


[0029]
FIG. 17 shows multipoint LOD scores for a genome scan on human chromosome 17.


[0030]
FIG. 18 shows multipoint LOD scores for a genome scan on human chromosome 18.


[0031]
FIG. 19 shows multipoint LOD scores for a genome scan on human chromosome 19.


[0032]
FIG. 20 shows multipoint LOD scores for a genome scan on human chromosome 20.


[0033]
FIG. 21 shows multipoint LOD scores for a genome scan on human chromosome 21.


[0034]
FIG. 22 shows multipoint LOD scores for a genome scan on human chromosome 22.


[0035]
FIG. 23 shows multipoint LOD scores for a genome scan on human chromosome 23.


[0036]
FIG. 24 shows Kaplan-Meier curves depicting the probability of surviving to each age (conditional on surviving to at least age 20) of male and female siblings (n=2,092) of centenarians (n=444) to males and females from the same birth cohort (data from the U.S. Social Security Administration's 1900 cohort life table).







DETAILED DESCRIPTION OF THE INVENTION

[0037] Compositions and methods of the invention are useful for predicting the propensity for exceptional longevity in humans. Additionally, compositions and methods of the invention are useful for predicting the propensity for age-related diseases including, but not limited to heart disease, cardiovascular disease, stroke, Alzheimer's disease, cancer, and ocular disease. Additionally, compositions and methods of the invention are useful for indicating possible early therapeutic intervention to prevent or to lessen the effects of diseases associated with aging.


[0038] The present invention provides a polymorphic locus with significant influence upon longevity in humans. In particular, the invention provides a longevity locus with a linkage to the D4S1564 marker on human chromosome 4. The D4S1564 marker is also referred to as AFM248zg9, 248zg9, and Z23817. The D4S1564 marker is a DNA segment that is approximately 331 base pairs in length and contains a C-A repeat. Gyapay et al., Nat. Genet., 7(2): 246-339 (1994), which is incorporated by reference herein. The D4S1564 marker has 12 alleles with an allele size range of 220-242 base pairs and is located at approximately the 112.6 cM position on human chromosome 4. The D4S1564 marker is available on numerous Internet databases (see, e.g., Genbank, http://www.ncbi.nlm.nih.gov/; Genethon, http://www.genethon.fr/eng/indeng.html; Stanford Genome Center, http://www-shgc.stanford.edu/; Whitehead Institute Genome Center, http://www-genome.wi.mit.edu/; each of which is incorporated by reference herein).


[0039] The following examples provide further details of the polymorphic markers and methods according to the invention. Accordingly, while exemplified in the following manner, the invention is not so limited and the skilled artisan will appreciate its wide range of application upon consideration thereof.



EXAMPLE 1

[0040] Linkage to Exceptional Longevity on Human Chromosome 4 at Marker D4S1564


[0041] To investigate genetic markers that confer a substantial survival advantage in humans, genome-wide scans were performed on 308 individuals belonging to 137 sibships in which at least one sibling was 98 years old and at least one other sibling was at least 95 years old (female) or 91 years old (male). The genome-wide scans were performed using a linkage mapping set obtained from Applied Biosystems (Foster City, Calif.). DNA samples from each individual were initially genotyped with 400 tandem repeat markers with an average heterozygosity of 0.70 and an average marker density of 10 centiMorgans (cM). Size standards and alleles were determined using an allele calling program.


[0042] Loci suggestive of linkage were genotyped with additional markers. Sibships in which the observed genotype data was 105-fold more likely to have arisen if the individuals were unrelated or half-siblings were excluded from the genetic analysis. In addition, those families with aberrantly high numbers of Mendelian inheritance errors were excluded. In suggestively-linked regions with higher density mapping, genotypes that implied double-recombinants were identified and the raw data were reexamined or retyped.


[0043] An affecteds-only multipoint non-parametric analysis was performed. This analysis tested for excess sharing of chromosomal regions that were identical by descent in the sibships. The non-parametric analysis (MLS) was performed using the GENEHUNTER 2.0 program (Falling Rain Genomics, Inc., Lincoln, Mass.).


[0044]
FIGS. 1-23 show multipoint LOD scores for a genome-wide scan of 308 individuals belonging to 137 families. The multipoint LOD scores in FIGS. 1-23 are plotted as a function of specific markers on each chromosome. The horizontal threshold lines on each graph represent a MLS score of 2.0, a score slightly higher than the average maximum score expected by chance once in a genome scan.


[0045] The genome screen of the 308 individuals identified a region on chromosome 4 as highly suggestive of linkage (MLS=2.67), as shown in FIG. 4. Fine mapping at an average of 1 marker every 3 cM was performed in a 20 cM region around this peak resulting in an increased MLS=3.65 at marker D4S1564, as shown in Table 1 below. A drop off of 1.5 in the MLS score on either side of the peak MLS defines with 95% confidence the area in which the gene resides. A drop off in the MLS of 2 on either side of the peak is observed in a 20 cM region encompassed by D4S414 and D4S1611. Table 1 below shows the MLS scores in this region and the HLod scores under the dominant model:
1TABLE 1MarkerPositionMLSHloddomD4S153495.00.570.54D4S414100.81.531.30D4S2986105.32.782.26D4S1572108.03.072.57D4S411109.03.072.60D4S1564112.63.653.26D4S406117.12.552.15D4S1611121.61.701.56D4S402124.51.391.06D4S2975126.70.940.89


[0046] In order to estimate the significance of the observation of linkage to chromosome 4, simulations under the hypothesis of no linkage were designed which match the family structure and marker heterozygosity as described above. One thousand genome scans were simulated with a genome-wide map density of 1 marker every three 3 cM (matching the fine mapping density performed in the region of maximum linkage) and in only 44 of those 1000 genome scans was the observed MLS of 3.65 exceeded (p=0.044).


[0047] Previous studies indicate that parametric linkage analyses using both a dominant and recessive inheritance model may be more powerful in some cases than non-parametric studies for detecting genes contributing to complex traits. In addition, non-parametric methods do not easily allow for the presence of locus heterogeneity. Therefore, multipoint genome-wide affecteds-only parametric linkage analyses were conducted under both a dominant and recessive model with reduced-penetrance and allowing for heterogeneity.


[0048] The finely mapped region of chromosome 4 provided the strongest results, with a maximum HLOD of 3.3 under the dominant model (with 45% of families linked) and a maximum HLOD of 2.9 under the recessive model (with 57% of families linked), both occurring at marker D4S1564. Empiric p-values were determined by genome-wide simulations under the dominant model. Of 1000 replicates, 30 had a LOD score greater than 3.3 under the dominant model (p=0.03) and 78 had a LOD score greater than 2.9 under the recessive model (p=0.08). A LOD score of 3.0 or higher is indicative of linkage. Because two models were evaluated, these parametric results are appropriately summarized and corrected by using the more significant result by multiplying the associated p-value by 2, thus producing a final parametric p-value of 0.06. For this dataset and trait, the parametric analyses provide additional support for the findings of the non-parametric analyses, but do not provide greater statistical significance.


[0049] These analyses provide significant evidence for heterogeneity, with odds in favor of heterogeneity of 1800:1. The significant linkage based upon a sibling-pair study of modest sample size indicates that there is at least one gene of significant importance in conferring exceptional longevity.



EXAMPLE 2

[0050] Propensity for Siblings of Centenarians of Achieving Exceptional Longevity


[0051] To investigate the propensity of siblings achieving exceptional longevity, pedigree data from the families of centenarians was compiled and analyzed. The pedigrees of 444 centenarian subjects revealed 2,092 siblings. Ages of death or for those still alive, ages when last censored, were tabulated. If there were multiple centenarians in a family, the oldest living centenarian was considered the proband. There were 356 female probands (80% of the centenarian subjects) with a median age of 106 years with a 95% confidence interval (CI) from 105 to 107 years. There were 88 male probands (20% of the centenarian subjects) with a median age of 107 years with a 95% confidence interval (CI) from 104 to 108 years. The probands had a median of 5 siblings with a range of 1-15 siblings.


[0052] The survival experience of the siblings of centenarians, segregated by gender, was estimated according the Kaplan-Meier product-limit method. Kaplan et. al., J. Am. Statist. Assoc., 53: 457-481 (1958). Standard errors of the survival probabilities were calculated using Greenwood's formula. Kalbfleisch et al., Statistical Analysis of Failure Time Data (1980). Standard errors were calculated assuming binomial probabilities at each age interval and using the number of centenarian siblings at risk of surviving each age interval as the effective sample size for that interval. The estimated median survival of the 1,075 female siblings (51% of the sample) was 86 years with a 95% confidence interval (CI) from 85 to 88 years. The estimated median survival of the 1,017 male siblings (49% of the sample) was 78 years with a 95% confidence interval (CI) from 76 to 80 years. FIG. 24 shows Kaplan-Meier curves for both gender groups among the individuals who achieved at least the age of twenty years.


[0053] The propensity (λs) of siblings surviving to specific ages was estimated for centenarian siblings relative to the U.S. Social Security Administration's cohort life table of 100,000 individuals from the year. The sibling propensity (λs) was defined as the probability that a centenarian sibling survived to age x given that the sibling survived to age 20, divided by the probability that an individual born in 1900 survived to age x, given that the individual survived to age 20, where x is greater than 21 years of age. Pointwise 95% confidence intervals (CI) for sibling propenisty (λs) was calculated according to the natural log scale and exponentiated. Data analysis was performed using the SAS v8.0 program (SAS Institute Inc., Cary, N.C.).


[0054] Table 2 below compares the two groups for those individuals having achieved at least the age of 20 years, wherein the sibling propensity (λs) according to age and gender is shown. The sibling propensity (λs) determination was limited to age 97 for males and age 100 for females due to sample size. Beyond these ages, the lower 95% confidence interval (CI) dropped below 1.
2TABLE 2SurvivalLowerUpperλsLowerUpperto Age (yrs)λs Male95% Cl95% ClFemale95% Cl95% Cl211.000.951.071.000.951.06301.010.951.071.020.961.08351.010.951.081.030.971.09401.030.961.101.040.981.11451.040.971.121.060.991.13501.081.001.161.071.011.15551.111.021.201.091.021.17601.181.091.281.121.041.20651.291.171.421.161.081.25701.481.321.651.241.141.35751.691.461.951.361.241.49802.031.672.471.541.371.72852.702.003.651.841.582.14904.092.317.232.572.033.25914.552.338.902.772.133.61925.362.4011.973.092.284.18935.362.4011.973.452.444.89945.362.4011.973.742.485.65958.371.9236.374.282.597.09969.671.5161.824.762.568.889711.311.02125.45.472.4812.09986.772.4218.94998.012.1130.331008.641.3256.48


[0055] The survival curves remained approximately the same for both groups through young adulthood (λs˜1). As shown in both FIG. 24 and Table 2, the survival experiences of the siblings of centenarians compared to the survival of the 1900 birth cohort began to diverge at ages 40 to 50 and became increasingly divergent at age 90 and beyond. For example, the sibling propensity (λs) for men by age 97 was 11.3 and the sibling propensity (λs) for women by age 100 was 8.6. Thus, compared to the general population, the siblings of centenarians appear to have a markedly increased propensity for achieving exceptional longevity, suggesting a significant genetic contribution.


[0056] While the invention has been shown and described with reference to specific preferred embodiments, it should be understood by those skilled in the art that various changes in form and detail may be made therein without departing from the spirit and scope of the invention as defined by the following claims.


Claims
  • 1. A method for gathering data related to genetic factors that influence longevity, the method comprising: amplifying a segment of DNA from a subject in a region between the genetic markers D4S1564 and D4S1572 on human chromosome 4; and comparing the segment to a reference sequence.
  • 2. A method for gathering genetic information related to factors that influence longevity, the method comprising: genotyping a segment of DNA from a subject in a region between the genetic markers D4S1564 and D4S1572 on human chromosome 4; and comparing the segment to a reference sequence.
  • 3. The method of claim 1 or 2 wherein the reference sequence is from a reference subject who has attained a reference age.
  • 4. The method of claim 3 wherein the reference age is old age.
  • 5. The method of claim 3 wherein the reference subject has attained at least 91 years of age.
  • 6. The method of claim 5 wherein the reference subject has attained at least 95 years of age.
  • 7. The method of claim 6 wherein the reference subject has attained at least 98 years of age.
  • 8. A method of evaluating a subject, the method comprising: amplifying a segment of DNA from a subject, wherein the segment is located between the genetic markers D4S1564 and D4S1572 on human chromosome 4; and comparing the amplified segment to a corresponding segment amplified from a relative of the subject who has attained a reference age.
  • 9. The method of claim 8 wherein the reference age is old age.
  • 10. The method of claim 8 wherein the relative of the subject has attained at least 91 years of age.
  • 11. The method of claim 10 wherein the relative of the subject has attained at least 95 years of age.
  • 12. The method of claim 11 wherein the relative of the subject has attained at least 98 years of age.
  • 13. The method of claim 8 wherein the relative is a sibling.
  • 14. The method of claim 1, 2, or 8 further comprising evaluating the subject for Alzheimer's disease, cancer, or ocular disease.
  • 15. The method of claim 1, 2, or 8 further comprising evaluating the subject for a heart disease, cardiovascular disease, or stroke.
  • 16. A method of evaluating a subject, the method comprising: genotyping a segment of DNA from a subject, wherein the segment is located between the genetic markers D4S1564 and D4S1572 on human chromosome 4; and evaluating the subject for Alzheimer's disease, cancer, or ocular disease.
  • 17. The method of claim 1, 2, 8, or 16 wherein the segment is in a region between 108.0 to 112.6 cM on human chromosome 4.
  • 18. The method of claim 17 wherein the segment is in a region between 108.0 to 109.0 cM on human chromosome 4.
  • 19. The method of claim 17 wherein the segment is in a region between 109.0 to 112.6 cM on human chromosome 4.
  • 20. A method for gathering data related to genetic factors that influence longevity, the method comprising: identifying a subject as a subject who has attained a reference age; amplifying a segment of DNA from a subject in a region between the genetic markers D4S1564 and D4S1572 on human chromosome 4; and recording genetic information about the amplified segment.
  • 21. A method for gathering data related to genetic factors that influence longevity, the method comprising: identifying a plurality of human subjects who have attained a reference age; amplifying a segment of DNA from each of the subjects, the segment comprising the genetic markers D4S1564 on human chromosome 4; and recording genetic information about the amplified segment from each of the subjects.
  • 22. A method of mapping a genetic locus that affects longevity, the method comprising: identifying a plurality of sibships in which at least two human siblings have attained a reference age; genotyping DNA samples for a plurality of markers from each individual of the sibship; and determining a LOD score for each marker of the plurality of markers.
  • 23. The method of claim 20, 21, or 22 wherein the reference age is an old age.
  • 24. The method of claim 22 wherein at least one sibling is at least 95 years old if female and 91 years old if male.
  • 25. The method of claim 22 wherein at least one sibling has attained 98 years of age.
  • 26. The method of claim 24 wherein at least another sibling is at least 95 years old if female and 91 years old if male.
  • 27. The method of claim 22 wherein the genotyping comprises genotyping using a plurality of markers on each chromosome.
  • 28. The method of claim 22 wherein the method comprises a multipoint non-parametric analysis.
  • 29. The method of claim 22 wherein the method comprises evaluating one or more of the individuals of the plurality of sibships for Alzheimer's disease, cancer, or ocular disease.
  • 30. The method of claim 22 wherein the method comprises evaluating one or more of the individuals of the plurality of sibships for heart disease, cardiovascular disease, or stroke.
  • 31. A method of mapping a genetic locus that affects longevity, the method comprising: providing nucleic acid samples from a population of human individuals who have attained a predetermined age; and performing an association study that evaluates association between one or more genetic markers with an affected state of individuals in the population.
  • 32. The method of claim 31 wherein the one or more markers are located on human chromosome four.
  • 33. The method of claim 31 wherein the one or more markers are located on human chromosome four between D4S1564 and D4S1572.
  • 34. The method of claim 31 wherein the predetermined age is 91 years if male and 97 years if female.
  • 35. The method of claim 31 wherein the affected state is ability to live to the predetermined age.
  • 36. The method of claim 31 wherein the affected state is Alzheimer's disease.
  • 37. The method of claim 31 wherein the affected state is ocular disease.
  • 38. The method of claim 31 wherein the affected state is cancer.
  • 39. The method of claim 31 wherein the affected state is heart disease, cardiovascular disease, or stroke.
RELATED APPLICATIONS

[0001] This application is a continuation application of U.S. application Ser. No. 09/928,102, filed Aug. 10, 2001, which claims the benefit of U.S. Application Nos. 60/224,643 and 60/249,921, the disclosures of each of which is incorporated by reference herein.

GOVERNMENT SUPPORT

[0002] This invention was made with Government support under Grant No. AG18721-2 awarded by the National Institutes of Health. The Government may have certain rights in this invention.

Provisional Applications (2)
Number Date Country
60224643 Aug 2000 US
60249921 Nov 2000 US
Continuations (1)
Number Date Country
Parent 09928102 Aug 2001 US
Child 10736471 Dec 2003 US