This invention relates to the field of genomics and pharmacogenetics. More specifically, this invention relates to variants of the gene for apolipoprotein A-IV (APOA4) and their use as predictors of an individual's progression of Alzheimer's Disease (hereinafter, “AD”).
AD is a fatal, progressive, degenerative disorder of the central nervous system. During the course of AD, cognitive, mood, and motor system deficits appear and progressively worsen. In the earliest stages, AD may manifest as Mild Cognitive Impairment (hereinafter, “MCI”), characterized by memory complaints without general cognitive deficits or dementia (Morris et al., Arch. Neurol. 58:397-405 (2001)). Cognitive deficits in AD include difficulty learning and recalling new information, language disorder, disturbances of visuospatial skills and deficits in executive function, all of which increase in severity over the course of the illness. Early in the illness, apathy is apparent and as the illness progresses, agitation becomes increasingly common. In the later stages of the disease, motor system abnormalities manifest (reviewed in Cummings et al., JAMA 287:2335-8 (2002)). AD patients usually survive for 7-10 years after the onset of symptoms (Bracco et al., Arch. Neurol. 51:1213-9 (1994)).
In the United States, the prevalence of AD is estimated at 2.3 million, with a doubling in the prevalence every 5 years after the age of 60 (Brookmeyer et al. Am. J. Public Health 88:1337-42 (1998)). In 1998, the annual cost in the United States for the care of patients with AD was about $40,000 per patient and it is estimated that there will be 14 million AD patients in the United States by the year 2050 (Petersen et al., Neurology 56:1133-42 (2001)). A pharmacological treatment that slows the progression of AD by as little as a year could result in huge cost savings and provide affected individuals with additional time to plan for their future while their decision-making capacity is only minimally affected.
To assess whether a pharmacological treatment is effective in slowing the progression of AD, it is essential to evaluate and detect an alteration in the course of the disease. An evaluation that predicts individuals who are susceptible to a more rapid progression of AD could also be utilized by clinicians to identify patients who may benefit from more aggressive treatment intervention. Furthermore, a method to predict progression of AD may also provide clues to direct the development of new therapeutic agents.
A number of factors have been associated with progression of AD, when considered as the time to institutionalization or the length of survival. Age, gender, marital status (Heyman et al., Neurology 48:1304-9 (1997)), severity of dementia (Heyman et al., supra (1997); Knopman et al., Neurology 52:714-8 (1999)), agitation (Knopman et al., Neurology 52:714-8 (1999)), extrapyramidal signs (Stern et al., Neurology 44:2300-7 (1994)), and higher scores on psychiatric rating scales (Stelle et al., Am. J. Psych. 147:1049-51 (1990) are associated with time to institutionalization. Age (Burns et al., Psychol Med. 21:363-70 (1991); Heyman et al., Neurology 46:656-60 (1996)), gender (Burns et al., supra; Heyman et al., Neurology 46:656-60 (1996)), age of onset, severity of dementia (Kaszniak et al., Ann. Neurol. 3:246-52 (1978); Diesfeldt et al., Acta. Psychiatr. Scand. 73:366-71 (1986); Burns et al., supra; Heyman et al., supra (1996)), severity of behavioral symptoms (Diesfeldt et al., supra), extrapyramidal signs (Stern et al., supra), and comorbidities (Burns et al., supra) are associated with survival.
In addition to the demographic, symptomatic and comorbid factors associated with AD progression, genetics is thought to play an important role and may account for the large inter-individual variability in disease progression (Farrer et al., Arch. Neurol., 52:918-23 (1995)). Early-onset, dominantly inherited AD may have a more rapid course than late-onset, sporadic AD (Swearer et al. J. Geriatr. Psychiatry Neurol. 9:22-5 (1996)). Interestingly, APOE4, an allele that carries an increased risk for developing AD, does not affect disease progression (Corder et al., Neurology 45:1323-8 (1995); Dal Forno et al., Arch. Neurology 53:345-50 (1996); Koivisto et al., Neuroepidemiology 19:327-32 (2000); Kurz et al., Neurology 47:440-3 (1996)).
A protein that may be involved in the progression of AD is apolipoprotein A-IV (APOA4). The APOA4 gene consists of three exons and has been mapped to chromosome 11q23 (Elshourbagy et al., J. Biol. Chem. 262:7973-81 (1987)). A component of high density lipoproteins, APOA4 is likely to have a role in reverse cholesterol transport, moving cholesterol from tissues to the liver for elimination (Durverger et al., Science 273:966-8 (1996)). APOA4 may have an indirect effect on risk and disease progression of AD via its effect on cholesterol levels, which have been associated with AD in numerous epidemiological and clinical studies (Jarvik et al., Neurology 45:1092-96 (1995); Sparks et al., Microscopy Res. Technique 50:287-90 (2000); Kalminjn et al., Ann. Neurol. 42:776-82 (1997); Notkola et al., Neuroepidemiology 17:14-20 (1998)). In vitro and in vivo, cholesterol has also been shown to be indirectly involved in AD pathology. In in vitro studies, cholesterol modulates the proteolytic processing of the amyloid precursor protein (Bodovitz et al., J. Biol. Chem 271:4436-40 (1996)) and increased cellular cholesterol is associated with increased amyloid beta (Wolozin, Proc. Nat'l. Acad. Sci. USA 98:5371-3 (1991)). Amyloid beta is a main component of senile plaques, a pathological hallmark of AD (Maccioni et al., Ciencia Hoje 20:669-77 (1998)). Thus, the pathology of AD and possibly the risk and progression of this disease, may be indirectly affected by APOA4 through its effect on cholesterol levels. Indeed, APOA4 has been shown to be associated with risk of developing AD (Csaszar et al., Neurosci. Lett. 230:151-4 (1997)).
Because of the possible involvement of APOA4 in progression of AD, it would be useful to assess the degree of variation in the APOA4 gene in patients with AD and to determine if any variants of this gene are associated with rate of AD progression.
Accordingly, the inventors herein have discovered a set of haplotypes in the APOA4 gene that are associated with the progression of AD. The inventors have also discovered that the copy number of each of these APOA4 haplotypes affects the progression of AD. The APOA4 haplotypes are shown in Table 1 below.
1The absence of a PS entry for a haplotype indicates that the PS is not part of the marker.
If an individual has two copies of any of haplotypes (1)-(18) in Table 1, then that individual is defined as having a “progression marker I” and is more likely to exhibit a slower progression of AD than an individual having zero copies or one copy of any of haplotypes (1)-(18) in Table 1, such individual being defined as having a “progression marker II.” Information about the composition of each of haplotypes (1)-(18), namely the location in the APOA4 gene of each of the polymorphic sites (PSs), and the identity of the reference and variant allele at each PS, can be found in Table 2, shown below.
1The Poly ID is a unique identifier assigned to the indicated PS by Genaissance Pharmaceuticals, Inc., New Haven, CT.
In addition, as described in more detail below, the inventors believe that additional haplotypes may readily be identified based on linkage disequilibrium between any of the above APOA4 haplotypes and another haplotype located in the APOA4 gene or another gene, or between an allele at one or more of the PSs in the above haplotypes and an allele at another PS located in the APOA4 gene or another gene. In particular, such haplotypes include haplotypes that are in linkage disequilibrium with any of haplotypes (1)-(18) in Table 1, hereinafter referred to as “linked haplotypes,” as well as “substitute haplotypes” for any of haplotypes (1)-(18) in Table 1 in which one or more of the polymorphic sites (PSs) in the original haplotype is substituted with another PS, wherein the allele at the substituted PS is in linkage disequilibrium with the allele at the substituting PS.
In one aspect, the invention provides methods and kits for determining whether an individual has a progression marker I or a progression marker II.
In one embodiment, a method is provided for determining whether an individual has a progression marker I or a progression marker II comprising determining whether the individual has two copies, or one or zero copies of any of (a) haplotypes (1)-(18) in Table 1, (b) a linked haplotype for any of haplotypes (1)-(18) in Table 1, and (c) a substitute haplotype for any of haplotypes (1)-(18) in Table 1.
In another embodiment of the invention, a method is provided for assigning an individual to a first or second progression marker group comprising determining whether the individual has two copies, or one or zero copies of any of (a) haplotypes (1)-(18) in Table 1, (b) a linked haplotype for any of haplotypes (1)-(18) in Table 1, and (c) a substitute haplotype for any of haplotypes (1)-(18) in Table 1, and assigning the individual to a progression marker group based on the copy number of that haplotype. The individual is assigned to the first progression marker group if the individual has two copies of any of (a) haplotypes (1)-(18) in Table 1, (b) a linked haplotype for any of haplotypes (1)-(18) in Table 1, and (c) a substitute haplotype for any of haplotypes (1)-(18) in Table 1, and is assigned to the second progression marker group if the individual has one or zero copies of any of (a) haplotypes (1)-(18) in Table 1, (b) a linked haplotype for any of haplotypes (1)-(18) in Table 1, and (c) a substitute haplotype for any of haplotypes (1)-(18) in Table 1.
One embodiment of a kit for determining whether an individual has a progression marker I or a progression marker II comprises a set of oligonucleotides designed for identifying at least one of the alleles present at each PS in a set of one or more PSs. The set of one or more PSs comprises the set of one or more PSs for any of the haplotypes in Table 1, the set of one or more PSs for a linked haplotype, or the set of one or more PSs for a substitute haplotype. In a further embodiment, the kit comprises a manual with instructions for performing one or more reactions on a human nucleic acid sample to identify the allele(s) present in the individual at each PS in the set and determining if the individual has a progression marker I or a progression marker II based on the identified allele(s).
In yet another embodiment, the invention provides a method for predicting an individual's progression of AD. The method comprises determining whether the individual has a progression marker I or a progression marker II and making a prediction based on the results of the determining step. If the individual is determined to have a progression marker I, then the prediction is that the individual will exhibit a slower progression of AD than an individual not having a progression marker I, and if the individual is determined to have a progression marker II, then the prediction is that the individual will exhibit a faster progression of AD than an individual not having a progression marker II.
In the context of this disclosure, the terms below shall be defined as follows unless otherwise indicated:
Allele—A particular form of a genetic locus, distinguished from other forms by its particular nucleotide sequence, or one of the alternative polymorphisms found at a polymorphic site.
Gene—A segment of DNA that contains the coding sequence for a protein, wherein the segment may include promoters, exons, introns, and other untranslated regions that control expression.
Genotype—An unphased 5′ to 3′ sequence of nucleotide pair(s) found at a set of one or more polymorphic sites in a locus on a pair of homologous chromosomes in an individual. As used herein, genotype includes a full-genotype and/or a sub-genotype as described below.
Genotyping—A process for determining a genotype of an individual.
Haplotype—A 5′ to 3′ sequence of nucleotides found at a set of one or more polymorphic sites in a locus on a single chromosome from a single individual.
Haplotype pair—The two haplotypes found for a locus in a single individual.
Haplotyping—A process for determining one or more haplotypes in an individual and includes use of family pedigrees, molecular techniques and/or statistical inference.
Haplotype data—Information concerning one or more of the following for a specific gene: a listing of the haplotype pairs in an individual or in each individual in a population; a listing of the different haplotypes in a population; frequency of each haplotype in that or other populations, and any known associations between one or more haplotypes and a trait.
Isolated—As applied to a biological molecule such as RNA, DNA, oligonucleotide, or protein, isolated means the molecule is substantially free of other biological molecules such as nucleic acids, proteins, lipids, carbohydrates, or other material such as cellular debris and growth media. Generally, the term “isolated” is not intended to refer to a complete absence of such material or to absence of water, buffers, or salts, unless they are present in amounts that substantially interfere with the methods of the present invention.
Locus—A location on a chromosome or DNA molecule corresponding to a gene or a physical or phenotypic feature, where physical features include polymorphic sites.
Nucleotide pair—The nucleotides found at a polymorphic site on the two copies of a chromosome from an individual.
Phased—As applied to a sequence of nucleotide pairs for two or more polymorphic sites in a locus, phased means the combination of nucleotides present at those polymorphic sites on a single copy of the locus is known.
Polymorphic site (PS)—A position on a chromosome or DNA molecule at which at least two alternative sequences are found in a population.
Polymorphism—The sequence variation observed in an individual at a polymorphic site. Polymorphisms include nucleotide substitutions, insertions, deletions and microsatellites and may, but need not, result in detectable differences in gene expression or protein function.
Polynucleotide—A nucleic acid molecule comprised of single-stranded RNA or DNA or comprised of complementary, double-stranded DNA.
Population Group—A group of individuals sharing a common ethnogeographic origin.
Reference Population—A group of subjects or individuals who are predicted to be representative of the genetic variation found in the general population. Typically, the reference population represents the genetic variation in the population at a certainty level of at least 85%, preferably at least 90%, more preferably at least 95% and even more preferably at least 99%.
Single Nucleotide Polymorphism (SNP)—Typically, the specific pair of nucleotides observed at a single polymorphic site. In rare cases, three or four nucleotides may be found.
Subject—A human individual whose genotypes or haplotypes or response to treatment or disease state are to be determined.
Treatment—A stimulus administered internally or externally to a subject.
Unphased—As applied to a sequence of nucleotide pairs for two or more polymorphic sites in a locus, unphased means the combination of nucleotides present at those polymorphic sites on a single copy of the locus is not known.
Each disease progression marker of the invention is a combination of a particular haplotype and the copy number for that haplotype. Preferably, the haplotype is one of the haplotypes shown in Table 1. The PS or PSs in these haplotypes are referred to herein as PS1, PS2, PS3, PS4, PS5, PS6, PS7, PS8, and PS9, and are located in the APOA4 gene at positions corresponding to those identified in
As described in more detail in the examples below, the disease progression markers of the invention are based on the discovery by the inventors of associations between certain haplotypes in the APOA4 gene and progression of AD in a cohort of individuals diagnosed with AD.
In particular, the inventors herein discovered that a haplotype comprising adenine at PS2 and cytosine at PS6 (haplotype (10) in Table 1) affected the progression of AD of the patients participating in the study. The group of patients having two copies of this haplotype exhibited a slower progression of AD than the patient group having one or zero copies of the haplotype. As used herein, the term “progression” is intended to refer to the rate of decrease in an individual's cognitive function, preferably as measured by the rate of change in his/her scores on the cognitive subscale of the Alzheimer's Disease Assessment (ADAS-cog) (Rosen et al., Am. J. Psychiatry 141:1356-64 (1984); Rockwood et al., J. Neurol. Neurosurg. Psychiatry 71:589-95 (2001); Tariot et al., Neurology 54:2269-76 (2000); Wilcock et al., BMJ 321:1-7 (2000)) administered at two different times. The ADAS-cog measures cognitive function, including spoken language ability, comprehension of spoken language, recall of test instructions, word-finding difficulty in spontaneous speech, following commands, naming objects and fingers, constructional praxis, ideational praxis, orientation, word-recall task and word-recognition task (Alzheimer's Insights Online, Vol. 3, No. 1, 1997). Additionally, an individual's progression of AD may be measured by other scientifically accepted rating scales for cognitive function, including, but not limited to, Behavioral Pathology in Alzheimer's Disease Rating Scale (BEHAVE-AD), Blessed Test, CANTAB (CAmbridge Neuropsychological Test Automated Battery), CERAD (The Consortium to Establish a Registry for Alzheimer's Disease) Clinical and Neuropsychological Tests, Clock Draw Test, Cornell Scale for Depression in Dementia (CSDD), Geriatric Depression Scale (GDS), Mini Mental State Exam (MMSE), Neuropsychiatric Inventory (NPI), and The 7 Minute Screen.
Moreover, as shown in Table 10 below, the different effect of copy number of haplotype (10) on progression of AD is statistically significant. Therefore, this haplotype, in combination with the haplotype copy number, can be used to differentiate the progression of AD that might be observed in an individual having AD. Consequently, two copies of haplotype (10) in Table 1 is referred to herein as a progression marker I, while one or zero copies of haplotype (10) in Table 1 is referred to herein as a progression marker II.
In addition, the skilled artisan would expect that there might be additional PSs in the APOA4 gene or elsewhere on chromosome 11, wherein an allele at that PS is in high linkage disequilibrium (LD) with an allele at one or more of the PSs in the haplotypes comprising a progression marker I or a progression marker II. Two particular alleles at different PSs are said to be in LD if the presence of the allele at one of the sites tends to predict the presence of the allele at the other site on the same chromosome (Stevens, Mol. Diag. 4:309-17 (1999)). One of the most frequently used measures of linkage disequilibrium is Δ2, which is calculated using the formula described by Devlin et al. (Genomics 29(2):311-22 (1995)). Δ2 is the measure of how well an allele X at a first PS predicts the occurrence of an allele Y at a second PS on the same chromosome. The measure only reaches 1.0 when the prediction is perfect (e.g., X if and only if Y).
Thus, the skilled artisan would expect that all of the embodiments of the invention described herein may frequently be practiced by substituting any (or all) of the specifically identified APOA4 PSs in a progression marker with another PS, wherein an allele at the substituted PS is in LD with an allele at the “substituting” PS. This “substituting” PS may be one that is currently known or subsequently discovered and may be present in the APOA4 gene, in a genomic region of about 100 kilobases spanning the APOA4 gene, or elsewhere on chromosome 11.
Further, the inventors contemplate that there will be other haplotypes in the APOA4 gene or elsewhere on chromosome 11 that are in LD with one or more of the haplotypes in Table 1 that would therefore also be predictive of progression of AD. Preferably, the linked haplotype is present in the APOA4 gene or in a genomic region of about 100 kilobases spanning the APOA4 gene. The linkage disequilibrium between the haplotypes in Table 1 and such linked haplotypes can also be measured using Δ2.
In preferred embodiments, the linkage disequilibrium between an allele at a polymorphic site in any of the haplotypes in Table 1 and an allele at a “substituting” polymorphic site, or between any of the haplotypes in Table 1 and a linked haplotype, has a Δ2 value, as measured in a suitable reference population, of at least 0.75, more preferably at least 0.80, even more preferably at least 0.85 or at least 0.90, yet more preferably at least 0.95, and most preferably 1.0. A suitable reference population for this Δ2 measurement is preferably a population for which the distribution of its members reflects that of the population of patients having AD. The reference population may be the general population, a population having AD or AD risk factors, or the like.
LD patterns in genomic regions are readily determined empirically in appropriately chosen samples using various techniques known in the art for determining whether any two alleles (either those occurring at two different PSs or two haplotypes for two different multi-site loci) are in linkage disequilibrium (GENETIC DATA ANALYSIS II, Weir, Sinauer Associates, Inc. Publishers, Sunderland, Mass., 1996). The skilled artisan may readily select which method of determining LD will be best suited for a particular sample size and genomic region.
As described above and in the examples below, the progression markers of the invention are associated with changes in the cognitive subscale of the Alzheimer's Disease Assessment Scale (ADAS-cog) administered at two different times. Thus, the invention provides a method and kit for determining whether an individual has a progression marker I or a progression marker II. A progression marker I is two copies of any of (a) haplotypes (1)-(18) in Table 1, (b) a linked haplotype for any of haplotypes (1)-(18) in Table 1, and (c) a substitute haplotype for any of haplotypes (1)-(18) in Table 1. A progression marker II is one or zero copies of any of (a) haplotypes (1)-(18) in Table 1, (b) a linked haplotype for any of haplotypes (1)-(18) in Table 1, and (c) a substitute haplotype for any of haplotypes (1)-(18) in Table 1.
In one embodiment, the invention provides a method for determining whether an individual has a progression marker I or a progression marker II. The method comprises determining whether the individual has two copies, or one or zero copies of any of (a) haplotypes (1)-(18) in Table 1, (b) a linked haplotype for any of haplotypes (1)-(18) in Table 1, and (c) a substitute haplotype for any of haplotypes (1)-(18) in Table 1.
In some embodiments, the individual is Caucasian and may be diagnosed with a cognitive disorder, such as mild to moderate dementia of the Alzheimer's type, dementia associated with Parkinson's Disease, MCI, a vascular dementia, and Lewy body dementia, or may have risk factors associated with a cognitive disorder.
In another embodiment, the invention provides a method for assigning an individual to a first or second progression marker group. The method comprises determining whether the individual has two copies, or one or zero copies of any of (a) haplotypes (1)-(18) in Table 1, (b) a linked haplotype for any of haplotypes (1)-(18) in Table 1, and (c) a substitute haplotype for any of haplotypes (1)-(18) in Table 1, and assigning the individual to the first progression marker group if the individual has two copies of any of (a) haplotypes (1)-(18) in Table 1, (b) a linked haplotype for any of haplotypes (1)-(18) in Table 1, and (c) a substitute haplotype for any of haplotypes (1)-(18) in Table 1, and assigning the individual to the second progression marker group if the individual has one or zero copies of any of (a) haplotypes (1)-(18) in Table 1, (b) a linked haplotype for any of haplotypes (1)-(18) in Table 1, and (c) a substitute haplotype for any of haplotypes (1)-(18) in Table 1.
In some embodiments, the individual is Caucasian and may be diagnosed with a cognitive disorder, such as mild to moderate dementia of the Alzheimer's type, dementia associated with Parkinson's Disease, MCI, a vascular dementia, and Lewy body dementia, or may have risk factors associated with a cognitive disorder.
The presence in an individual of a progression marker I or a progression marker II may be determined by a variety of indirect or direct methods well known in the art for determining haplotypes or haplotype pairs for a set of one or more PSs in one or both copies of the individual's genome, including those discussed below. The genotype for a PS in an individual may be determined by methods known in the art or as described below.
One indirect method for determining whether zero copies, one copy, or two copies of a haplotype is present in an individual is by prediction based on the individual's genotype determined at one or more of the PSs comprising the haplotype and using the determined genotype at each site to determine the haplotypes present in the individual. The presence of zero copies, one copy, or two copies of a haplotype of interest can be determined by visual inspection of the alleles at the PS that comprise the haplotype. The haplotype pair is assigned by comparing the individual's genotype with the genotypes at the same set of PS corresponding to the haplotype pairs known to exist in the general population or in a specific population group or to the haplotype pairs that are theoretically possible based on the alternative alleles possible at each PS, and determining which haplotype pair is most likely to exist in the individual.
In a related indirect haplotyping method, the presence in an individual of zero copies, one copy, or two copies of a haplotype is predicted from the individual's genotype for a set of PSs comprising the selected haplotype using information on haplotype pairs known to exist in a reference population. In one embodiment, this haplotype pair prediction method comprises identifying a genotype for the individual at the set of PSs comprising the selected haplotype, accessing data containing haplotype pairs identified in a reference population for a set of PSs comprising the PSs of the selected haplotype, and assigning to the individual a haplotype pair that is consistent with the individual's genotype. Whether the individual has a disease progression marker I or a disease progression marker II can be subsequently determined based on the assigned haplotype pair. The haplotype pair can be assigned by comparing the individual's genotype with the genotypes corresponding to the haplotype pairs known to exist in the general population or in a specific population group, and determining which haplotype pair is consistent with the genotype of the individual. In some embodiments, the comparing step may be performed by visual inspection. When the genotype of the individual is consistent with more than one haplotype pair, frequency data may be used to determine which of these haplotype pairs is most likely to be present in the individual. If a particular haplotype pair consistent with the genotype of the individual is more frequent in the reference population than other pairs consistent with the genotype, then that haplotype pair with the highest frequency is the most likely to be present in the individual. The haplotype pair frequency data used in this determination is preferably for a reference population coimprising the same ethnogeographic group as the individual. This determination may also be performed in some embodiments by visual inspection. In other embodiments, the comparison may be made by a computer-implemented algorithm with the genotype of the individual and the reference haplotype data stored in computer-readable formats. For example, as described in WO 01/80156, one computer-implemented algorithm to perform this comparison entails enumerating all possible haplotype pairs which are consistent with the genotype, accessing data containing haplotype pairs frequency data determined in a reference population to determine a probability that the individual has a possible haplotype pair, and analyzing the determined probabilities to assign a haplotype pair to the individual.
Typically, the reference population is composed of randomly selected individuals representing the major ethnogeographic groups of the world. A preferred reference population for use in the methods of the present invention consists of Caucasian individuals, the number of which is chosen based on how rare a haplotype is that one wants to be guaranteed to see. For example, if one wants to have a q % chance of not missing a haplotype that exists in the population at a p % frequency of occurring in the reference population, the number of individuals (n) who must be sampled is given by 2n=log(1−q)/log(1−p) where p and q are expressed as fractions. A preferred reference population allows the detection of any haplotype whose frequency is at least 10% with about 99% certainty. A particularly preferred reference population includes a 3-generation Caucasian family to serve as a control for checking quality of haplotyping procedures.
If the reference population comprises more than one ethnogeographic group, the frequency data for each group is examined to determine whether it is consistent with Hardy-Weinberg equilibrium. Hardy-Weinberg equilibrium (PRINCIPLES OF POPULATION GENOMICS, 3rd ed., Hartl, Sinauer Associates, Sunderland, Mass., 1997) postulates that the frequency of finding the haplotype pair H1/H2 is equal to pH-W(H1/H2)=2p(H1)p(H2) if H1≠H2 and pH-W(H1/H2)=p(H1)p(H2) if H1=H2. A statistically significant difference between the observed and expected haplotype frequencies could be due to one or more factors including significant inbreeding in the population group, strong selective pressure on the gene, sampling bias, and/or errors in the genotyping process. If large deviations from Hardy-Weinberg equilibrium are observed in an ethnogeographic group, the number of individuals in that group can be increased to see if the deviation is due to a sampling bias. If a larger sample size does not reduce the difference between observed and expected haplotype pair frequencies, then one may wish to consider haplotyping the individual using a direct haplotyping method such as, for example, CLASPER System™ technology ((U.S. Pat. No. 5,866,404), single molecule dilution, or allele-specific long-range PCR (Michalotos-Beloin et al., Nucleic Acids Res. 24:4841-3 (1996)).
In one embodiment of this method for predicting a haplotype pair for an individual, the assigning step involves performing the following analysis. First, each of the possible haplotype pairs is compared to the haplotype pairs in the reference population. Generally, only one of the haplotype pairs in the reference population matches a possible haplotype pair and that pair is assigned to the individual. Occasionally, only one haplotype represented in the reference haplotype pairs is consistent with a possible haplotype pair for an individual, and in such cases the individual is assigned a haplotype pair containing this known haplotype and a new haplotype derived by subtracting the known haplotype from the possible haplotype pair. Alternatively, the haplotype pair in an individual may be predicted from the individual's genotype for that gene using reported methods (e.g., Clark et al., Mol. Biol. Evol. 7:111-22 (1990) or WO 01/80156) or through a commercial haplotyping service such as offered by Genaissance Pharmaceuticals, Inc. (New Haven, Conn.). In rare cases, either no haplotypes in the reference population are consistent with the possible haplotype pairs, or alternatively, multiple reference haplotype pairs are consistent with the possible haplotype pairs. In such cases, the individual is preferably haplotyped using a direct molecular haplotyping method such as, for example, CLASPER System™ technology (U.S. Pat. No. 5,866,404), SMD, or allele-specific long-range PCR (Michalotos-Beloin et al., supra).
Determination of the number of haplotypes present in the individual from the genotypes is illustrated here for haplotype (10) in Table 1. Table 3 below shows the 9 (3n, where each of n bi-allelic polymorphic sites may have one of 3 different genotypes present) genotypes that may be detected at PS2 and PS6, using both chromosomal copies from an individual. 8 of the 9 possible genotypes for the two sites allow unambiguous determination of the number of copies of the haplotype (10) in Table 1 present in the individual. However, an individual with the A/G C/T genotype could possess one of the following genotype pairs: AC/GT, AT/GC, GC/AT, and GT/AC, and thus could have either one copy of haplotype (10) in Table 1 (AC/GT, GT/AC), or zero copies (AT/GC, GC/AT) of haplotype (10) in Table 1. For instances where there is ambiguity in the haplotype pair underlying the determined genotype (i.e., when two or more PSs are included in the haplotype), frequency information may be used to determine the most probable haplotype pair and therefore the most likely number of copies of the haplotype in the individual. If a particular haplotype pair consistent with the genotype of the individual is more frequent in the reference population than other pairs consistent with the genotype, then that haplotype pair with the highest frequency is the most likely to be present in the individual. The copy number of the haplotype of interest in this haplotype pair can then be determined by visual inspection of the alleles at the PS that comprise the response marker for each haplotype in the pair.
Alternatively, for the ambiguous genotypes, genotyping of one or more additional sites in APOA4 may be performed to eliminate the ambiguity in deconvoluting the haplotype pairs underlying the genotype at the particular PSs. The skilled artisan would recognize that alleles at these one or more additional sites would need to have sufficient linkage with the alleles in at least one of the possible haplotypes in the pair to permit unambiguous assignment of the haplotype pair. Although this illustration has been directed to the particular instance of determining the number of copies of haplotype (10) in Table 1 present in an individual, the process would be analogous for the other haplotypes shown in Table 1, or for the linked haplotypes or substitute haplotypes for any of the haplotypes in Table 1.
The individual's genotype for the desired set of PS may be determined using a variety of methods well-known in the art. Such methods typically include isolating from the individual a genomic DNA sample comprising both copies of the gene or locus of interest, amplifying from the sample one or more target regions containing the polymorphic sites to be genotyped, and detecting the nucleotide pair present at each PS of interest in the amplified target region(s). It is not necessary to use the same procedure to determine the genotype for each PS of interest.
In addition, the identity of the allele(s) present at any of the novel PSs described herein may be indirectly determined by haplotyping or genotyping another PS having an allele that is in linkage disequilibrium with an allele of the PS that is of interest. PSs having an allele in linkage disequilibrium with an allele of the presently disclosed PSs may be located in regions of the gene or in other genomic regions not examined herein. Detection of the allele(s) present at a PS, wherein the allele is in linkage disequilibrium with an allele of the novel PSs described herein may be performed by, but is not limited to, any of the above-mentioned methods for detecting the identity of the allele at a PS.
Alternatively, the presence in an individual of a haplotype or haplotype pair for a set of PSs comprising a response marker may be determined by directly haplotyping at least one of the copies of the individual's genomic region of interest, or suitable fragment thereof, using methods known in the art. Such direct haplotyping methods typically involve treating a genomic nucleic acid sample isolated from the individual in a manner that produces a hemizygous DNA sample that only has one of the two “copies” of the individual's genomic region which, as readily understood by the skilled artisan, may be the same allele or different alleles, amplifying from the sample one or more target regions containing the PSs to be genotyped, and detecting the nucleotide present at each PS of interest in the amplified target region(s). The nucleic acid sample may be obtained using a variety of methods known in the art for preparing hemizygous DNA samples, which include: targeted in vivo cloning (TIVC) in yeast as described in WO 98/01573, U.S. Pat. No. 5,866,404, and U.S. Pat. No. 5,972,614; generating hemizygous DNA targets using an allele specific oligonucleotide in combination with primer extension and exonuclease degradation as described in U.S. Pat. No. 5,972,614; single molecule dilution (SMD) as described in Ruaño et al., Proc. Natl. Acad. Sci. 87:6296-300 (1990); and allele specific PCR (Ruaño et al., Nucl. Acids Res. 17:8392 (1989); Ruaño et al., Nucl. Acids Res. 19:6877-82 (1991); Michalatos-Beloin et al., supra).
As will be readily appreciated by those skilled in the art, any individual clone will typically only provide haplotype information on one of the two genomic copies present in an individual. If haplotype information is desired for the individual's other copy, additional clones will usually need to be examined. Typically, at least five clones should be examined to have more than a 90% probability of haplotyping both copies of the genomic locus in an individual. In some cases, however, once the haplotype for one genomic allele is directly determined, the haplotype for the other allele may be inferred if the individual has a known genotype for the PSs of interest or if the haplotype frequency or haplotype pair frequency for the individual's population group is known.
While direct haplotyping of both copies of the gene is preferably performed with each copy of the gene being placed in separate containers, it is also envisioned that direct haplotyping could be performed in the same container if the two copies are labeled with different tags, or are otherwise separately distinguishable or identifiable. For example, if first and second copies of the gene are labeled with different first and second fluorescent dyes, respectively, and an allele-specific oligonucleotide labeled with yet a third different fluorescent dye is used to assay the PS(s), then detecting a combination of the first and third dyes would identify the polymorphism in the first gene copy while detecting a combination of the second and third dyes would identify the polymorphism in the second gene copy.
The nucleic acid sample used in the above indirect and direct haplotyping methods is typically isolated from a biological sample taken from the individual, such as a blood sample or tissue sample. Suitable tissue samples include whole blood, saliva, tears, urine, skin and hair.
The target region(s) containing the PS of interest may be amplified using any oligonucleotide-directed amplification method, including but not limited to polymerase chain reaction (PCR) (U.S. Pat. No. 4,965,188), ligase chain reaction (LCR) (Barany et al., Proc. Natl. Acad. Sci. USA 88:189-93 (1991); WO 90/01069), and oligonucleotide ligation assay (OLA) (Landegren et al., Science 241:1077-80 (1988)). Other known nucleic acid amplification procedures may be used to amplify the target region(s) including transcription-based amplification systems (U.S. Pat. No. 5,130,238; European Patent No. EP 329,822; U.S. Pat. No. 5,169,766; WO 89/06700) and isothermal methods (Walker et al., Proc. Natl. Acad. Sci. USA 89:392-6 (1992)).
In both the direct and indirect haplotyping methods, the identity of a nucleotide (or nucleotide pair) at a PS(s) in the amplified target region may be determined by sequencing the amplified region(s) using conventional methods. If both copies of the gene are represented in the amplified target, it will be readily appreciated by the skilled artisan that only one nucleotide will be detected at a PS in individuals who are homozygous at that site, while two different nucleotides will be detected if the individual is heterozygous for that site. The polymorphism may be identified directly, known as positive-type identification, or by inference, referred to as negative-type identification. For example, where a polymorphism is known to be guanine and cytosine in a reference population, a site may be positively determined to be either guanine or cytosine for an individual homozygous at that site, or both guanine and cytosine, if the individual is heterozygous at that site. Alternatively, the site may be negatively determined to be not guanine (and thus cytosine/cytosine) or not cytosine (and thus guanine/guanine).
A PS in the target region may also be assayed before or after amplification using one of several hybridization-based methods known in the art. Typically, allele-specific oligonucleotides are utilized in performing such methods. The allele-specific oligonucleotides may be used as differently labeled probe pairs, with one member of the pair showing a perfect match to one variant of a target sequence and the other member showing a perfect match to a different variant. In some embodiments, more than one PS may be detected at once using a set of allele-specific oligonucleotides or oligonucleotide pairs. Preferably, the members of the set have melting temperatures within 5° C., and more preferably within 2° C., of each other when hybridizing to each of the polymorphic sites being detected.
Hybridization of an allele-specific oligonucleotide to a target polynucleotide may be performed with both entities in solution, or such hybridization may be performed when either the oligonucleotide or the target polynucleotide is covalently or noncovalently affixed to a solid support. Attachment may be mediated, for example, by antibody-antigen interactions, poly-L-Lys, streptavidin or avidin-biotin, salt bridges, hydrophobic interactions, chemical linkages, UV cross-linking baking, etc. Allele-specific oligonucleotides may be synthesized directly on the solid support or attached to the solid support subsequent to synthesis. Solid-supports suitable for use in detection methods of the invention include substrates made of silicon, glass, plastic, paper and the like, which may be formed, for example, into wells (as in 96-well plates), slides, sheets, membranes, fibers, chips, dishes, and beads. The solid support may be treated, coated or derivatized to facilitate the immobilization of the allele-specific oligonucleotide or target nucleic acid.
Detecting the nucleotide or nucleotide pair at a PS of interest may also be determined using a mismatch detection technique, including but not limited to the RNase protection method using riboprobes (Winter et al., Proc. Natl. Acad. Sci. USA 82:7575 (1985); Meyers et al., Science 230:1242 (1985)) and proteins which recognize nucleotide mismatches, such as the E. coli mutS protein (Modrich, Ann. Rev. Genet. 25:229-53 (1991)). Alternatively, variant alleles can be identified by single strand conformation polymorphism (SSCP) analysis (Orita et al., Genomics 5:874-9 (1989); Humphries et al., in MOLECULAR DIAGNOSIS OF GENETIC DISEASES, Elles, ed., pp. 321-340, 1996) or denaturing gradient gel electrophoresis (DGGE) (Wartell et al., Nucl. Acids Res. 18:2699-706 (1990); Sheffield et al., Proc. Natl. Acad. Sci. USA 86:232-6 (1989)).
A polymerase-mediated primer extension method may also be used to identify the polymorphism(s). Several such methods have been described in the patent and scientific literature and include the “Genetic Bit Analysis” method (WO 92/15712) and the ligase/polymerase mediated genetic bit analysis (U.S. Pat. No. 5,679,524. Related methods are disclosed in WO 91/02087, WO 90/09455, WO 95/17676, and U.S. Pat. Nos. 5,302,509 and 5,945,283. Extended primers containing the complement of the polymorphism may be detected by mass spectrometry as described in U.S. Pat. No. 5,605,798. Another primer extension method is allele-specific PCR (Ruaño et al., 1989, supra; Ruaño et al., 1991, supra; WO 93/22456; Turki et al., J. Clin. Invest. 95:1635-41 (1995)). In addition, multiple PSs may be investigated by simultaneously amplifying multiple regions of the nucleic acid using sets of allele-specific primers as described in WO 89/10414.
The genotype or haplotype for the APOA4 gene of an individual may also be determined by hybridization of a nucleic acid sample containing one or both copies of the gene, mRNA, cDNA or fragment(s) thereof, to nucleic acid arrays and subarrays such as described in WO 95/11995. The arrays would contain a battery of allele-specific oligonucleotides representing each of the PSs to be included in the genotype or haplotype.
The invention also provides a kit for determining whether an individual has a progression marker I or a progression marker II. The kit comprises a set of one or more oligonucleotides designed for identifying at least one of the alleles at each PS in a set of one or more PSs, wherein the set of one or more PSs comprises (a) PS2, PS3, PS6, and PS7; (b) PS2, PS3, and PS6; (c) PS2, PS3, PS4, and PS6; (d) PS2, PS3, PS5, and PS6; (e) PS3, PS4, PS5, and PS6; (f) PS2, PS6, and PS7; (g) PS2, PS5, PS6, and PS7; (h) PS4, PS5, PS6, and PS7; (i) PS2, PS4, PS6, and PS7; (j) PS2 and PS6; (k) PS2, PS4, PS5, and PS6; (l) PS2, PS4, and PS6; (m) PS2, PS5, and PS6; (n) PS4, PS5, and PS6; (o) PS3, PS5, PS6, and PS7; (p) PS3, PS5, and PS6; (q) PS1, PS2, PS3, and PS6; (r) PS5, PS6, and PS7; (s) a set of one or more PSs in a linked haplotype for any of haplotypes (1)-(18) in Table 1; or (t) a set of one or more PSs in a substitute haplotype for any of haplotypes (1)-(18) in Table 1. Preferably, the kit comprises a set of one or more oligonucleotides designed for identifying at least one of the alleles at each PS in a set of one or more PSs, wherein the set of one or more PSs is any of (a) PS2, PS3, PS6, and PS7; (b) PS2, PS3, and PS6; (c) PS2, PS3, PS4, and PS6; (d) PS2, PS3, PS5, and PS6; (e) PS3, PS4, PS5, and PS6; (f) PS2, PS6, and PS7; (g) PS2, PS5, PS6, and PS7; (h) PS4, PS5, PS6, and PS7; (i) PS2, PS4, PS6, and PS7; (j) PS2 and PS6; (k) PS2, PS4, PS5, and PS6; (1) PS2, PS4, and PS6; (m) PS2, PS5, and PS6; (n) PS4, PS5, and PS6; (o) PS3, PS5, PS6, and PS7; (p) PS3, PS5, and PS6; (q) PS1, PS2, PS3, and PS6; (r) PS5, PS6, and PS7; (s) a set of one or more PSs in a linked haplotype for any of haplotypes (1)-(18) in Table 1; and (t) a set of one or more PSs in a substitute haplotype for any of haplotypes (1)-(18) in Table 1.
In a preferred embodiment of the kit of the invention, the set of one or more oligonucleotides is designed for identifying both alleles at each PS in the set of one or more PSs. In another preferred embodiment, the individual is Caucasian. In another preferred embodiment, the kit further comprises a manual with instructions for (a) performing one or more reactions on a human nucleic acid sample to identify the allele or alleles present in the individual at each PS in the set of one or more PSs, and (b) determining if the individual has a progression marker I or a progression marker II based on the identified allele or alleles. In another preferred embodiment, the linkage disequilibrium between a linked haplotype for any of haplotypes (1)-(18) in Table 1 and any of haplotypes (1)-(18) in Table 1 has a delta squared value selected from the group consisting of at least 0.75, at least 0.80, at least 0.85, at least 0.90, at least 0.95, and 1.0. In yet another preferred embodiment, the linkage disequilibrium between an allele at a substituting PS and an allele at a substituted PS for any of haplotypes (1)-(18) in Table 1 has a delta squared value selected from the group consisting of at least 0.75, at least 0.80, at least 0.85, at least 0.90, at least 0.95, and 1.0.
As used herein, an “oligonucleotide” is a probe or primer capable of hybridizing to a target region that contains, or that is located close to, a PS of interest. Preferably, the oligonucleotide has less than about 100 nucleotides. More preferably, the oligonucleotide is 10 to 35 nucleotides long. Even more preferably, the oligonucleotide is between 15 and 30, and most preferably, between 20 and 25 nucleotides in length. The exact length of the oligonucleotide will depend on the nature of the genomic region containing the PS as well as the genotyping assay to be performed and is readily determined by the skilled artisan.
The oligonucleotides used to practice the invention may be comprised of any phosphorylation state of ribonucleotides, deoxyribonucleotides, and acyclic nucleotide derivatives, and other functionally equivalent derivatives. Alternatively, oligonucleotides may have a phosphate-free backbone, which may be comprised of linkages such as carboxymethyl, acetamidate, carbamate, polyamide (peptide nucleic acid (PNA)) and the like (Varma, in MOLECULAR BIOLOGY AND BIOTECHNOLOGY, A COMPREHENSIVE DESK REFERENCE, Meyers, ed., pp. 617-20, VCH Publishers, Inc., 1995). Oligonucleotides of the invention may be prepared by chemical synthesis using any suitable methodology known in the art, or may be derived from a biological sample, for example, by restriction digestion. The oligonucleotides may be labeled, according to any technique known in the art, including use of radiolabels, fluorescent labels, enzymatic labels, proteins, haptens, antibodies, sequence tags and the like.
Oligonucleotides of the invention must be capable of specifically hybridizing to a target region of a polynucleotide containing a desired locus. As used herein, specific hybridization means the oligonucleotide forms an anti-parallel double-stranded structure with the target region under certain hybridizing conditions, while failing to form such a structure when incubated with another region in the polynucleotide or with a polynucleotide lacking the desired locus under the same hybridizing conditions. Preferably, the oligonucleotide specifically hybridizes to the target region under conventional high stringency conditions.
A nucleic acid molecule such as an oligonucleotide or polynucleotide is said to be a “perfect” or “complete” complement of another nucleic acid molecule if every nucleotide of one of the molecules is complementary to the nucleotide at the corresponding position of the other molecule. A nucleic acid molecule is “substantially complementary” to another molecule if it hybridizes to that molecule with sufficient stability to remain in a duplex form under conventional low-stringency conditions. Conventional hybridization conditions are described, for example, in MOLECULAR CLONING, A LABORATORY MANUAL, 2nd ed., Sambrook et al., Cold Spring Harbor Press, Cold Spring Harbor, N.Y., 1989, and in NUCLEIC ACID HYBRIDIZATION, A PRACTICAL APPROACH, Haymes et al., IRL Press, Washington, D.C., 1985. While perfectly complementary oligonucleotides are preferred for detecting polymorphisms, departures from complete complementarity are contemplated where such departures do not prevent the molecule from specifically hybridizing to the target region. For example, an oligonucleotide primer may have a non-complementary fragment at its 5′ end, with the remainder of the primer being complementary to the target region. Alternatively, non-complementary nucleotides may be interspersed into the probe or primer as long as the resulting probe or primer is still capable of specifically hybridizing to the target region.
Preferred oligonucleotides of the invention, useful in determining if an individual has a progression marker I or progression marker II, are allele-specific oligonucleotides. As used herein, the term allele-specific oligonucleotide (ASO) means an oligonucleotide that is able, under sufficiently stringent conditions, to hybridize specifically to one allele of a gene, or other locus, at a target region containing a PS while not hybridizing to the corresponding region in another allele(s). As understood by the skilled artisan, allele-specificity will depend upon a variety of readily optimized stringency conditions, including salt and formamide concentrations, as well as temperatures for both the hybridization and washing steps. Examples of hybridization and washing conditions typically used for ASO probes are found in Kogan et al., “Genetic Prediction of Hemophilia A” in PCR PROTOCOLS, A GUIDE TO METHODS AND APPLICATIONS, Academic Press, 1990, and Ruaño et al., Proc. Natl. Acad. Sci. USA 87:6296-300 (1990). Typically, an ASO will be perfectly complementary to one allele while containing a single mismatch for another allele.
Allele-specific oligonucleotides of the invention include ASO probes and ASO primers. ASO probes which usually provide good discrimination between different alleles are those in which a central position of the oligonucleotide probe aligns with the polymorphic site in the target region (e.g., approximately the 7th or 8th position in a 15mer, the 8th or 9th position in a 16mer, and the 10th or 11th position in a 20mer). An ASO primer of the invention has a 3′ terminal nucleotide, or preferably a 3′ penultimate nucleotide, that is complementary to only one of the nucleotide alleles of a particular SNP, thereby acting as a primer for polymerase-mediated extension only if that nucleotide allele is present at the PS in the sample being genotyped. ASO probes and primers hybridizing to either the coding or noncoding strand are contemplated by the invention. ASO probes and primers listed below use the appropriate nucleotide symbol (R=G or A, Y=T or C, M=A or C, K=G or T/U, S=G or C, and W=A or T/U; WIPO standard ST.25) at the position of the PS to represent that the ASO contains either of the two alternative allelic variants observed at that PS.
A preferred ASO probe for detecting the alleles at each of PS1, PS2, PS3, PS4, PS5, PS6, and PS7 is listed in Table 4. Additionally, detection of the alleles at each of PS1, PS2, PS3, PS4, PS5, PS6, and PS7 could be accomplished by utilization of the complement of these ASO probes.
A preferred ASO forward and reverse primer for detecting the alleles at each of PS1, PS2, PS3, PS4, PS5, PS6, and PS7 is listed in Table 4.
1These ASO probes and primers include the appropriate nucleotide symbol, Y = T or C, R = G or A, M = A or C, K = G or T/U, and S = G or C (World Intellectual Property Organization Handbook on Industrial Property Information and Documentation IPO Standard ST.25 (1998), Appendix 2, Table 1), at the position of the PS to represent that the ASO contains one of the two alternative polymorphisms observed at that position.
Other oligonucleotides useful in practicing the invention hybridize to a target region located one to several nucleotides downstream of a PS in a response marker. Such oligonucleotides are useful in polymerase-mediated primer-extension methods for detecting an allele at one of the PSs in the markers described herein and therefore such oligonucleotides are referred to herein as “primer-extension oligonucleotides.” In a preferred embodiment, the 3′-terminus of a primer-extension oligonucleotide is a deoxynucleotide complementary to the nucleotide located immediately adjacent to the PS. A particularly preferred forward and reverse primer-extension oligonucleotide for detecting the alleles at each of PS1, PS2, PS3, PS4, PS5, PS6, and PS7 is listed in Table 5. Termination mixes are chosen to terminate extension of the oligonucleotide at the PS of interest, or one base thereafter, depending on the alternative nucleotides present at the PS.
In some embodiments, the oligonucleotides in a kit of the invention have different labels to allow probing of the identity of nucleotides or nucleotide pairs at two or more PSs simultaneously.
The oligonucleotides in a kit of the invention may also be immobilized on or synthesized on a solid surface such as a microchip, bead, or glass slide (see, e.g., WO 98/20020 and WO 98/20019). Such immobilized oligonucleotides may be used in a variety of polymorphism detection assays, including but not limited to probe hybridization and polymerase extension assays. Immobilized oligonucleotides useful in practicing the invention may comprise an ordered array of oligonucleotides designed to rapidly screen a nucleic acid sample for polymorphisms in multiple genes at the same time.
Kits of the invention may also contain other components such as hybridization buffer (e.g., where the oligonucleotides are to be used as allele-fic specific probes) or dideoxynucleotide triphosphates (ddNTPs; e.g., where the alleles at the polymorphic sites are to be detected by primer extension). In a preferred embodiment, the set of oligonucleotides consists of primer-extension oligonucleotides. The kit may also contain a polymerase and a reaction buffer optimized for primer-extension mediated by the polymerase. Preferred kits may also include detection reagents, such as biotin- or fluorescent-tagged oligonucleotides or ddNTPs and/or an enzyme-labeled antibody and one or more substrates that generate a detectable signal when acted on by the enzyme. It will be understood by the skilled artisan that the set of oligonucleotides and reagents for performing the genotyping or haplotyping assay will be provided in separate receptacles placed in the container if appropriate to preserve biological or chemical activity and enable proper use in the assay.
In a particularly preferred embodiment, each of the oligonucleotides and all other reagents in the kit have been quality tested for optimal performance in an assay for determining the alleles at a set of PSs comprising a progression marker I or progression marker II.
The methods and kits of the invention are useful for helping physicians make decisions about how to treat an individual. They can be used to predict the progression of AD in an individual having AD, thereby permitting the individual's physician to prescribe an appropriate treatment regimen.
Thus, the invention provides a method for predicting the progression of AD in an individual having AD. The method comprises determining whether the individual has a progression marker I or a progression marker II, and making a prediction based on the results of the determining step. The determination of the progression marker present in an individual can be made using one of the direct or indirect methods described herein. In some preferred embodiments, the determining step comprises identifying for one or both copies of the genomic locus present in the individual the identity of the nucleotide or nucleotide pair at the set of PSs comprising the selected response marker. Alternatively, the determining step may comprise consulting a data repository that states the individual's copy number for the haplotypes comprising one of the progression markers I or progression markers II. The data repository may be the individual's medical records or a medical data card. In preferred embodiments, the individual is Caucasian.
In some embodiments, if the individual is determined to have a progression marker I, then the prediction is that the individual will exhibit a slower progression of AD than an individual not having a progression marker I, and if the individual is determined to have a progression marker II, then the prediction is that the individual will exhibit a faster progression of AD than an individual not having a progression marker II.
Further, in performing any of the methods described herein which require information on the haplotype content of the individual (i.e., the haplotypes and haplotype copy number present in the individual for the polymorphic sites in haplotypes comprising a progression marker I or a progression marker II) or which require knowing if a progression marker I or a progression marker II is present in the individual, the individual's APOA4 haplotype content or response marker may be determined by consulting a data repository such as the individual's patient records, a medical data card, a file (e.g., a flat ASCII file) accessible by a computer or other electronic or non-electronic media on which information about the individual's APOA4 haplotype content or response marker can be stored. As used herein, a medical data card is a portable storage device such as a magnetic data card, a smart card, which has an on-board processing unit and which is sold by vendors such as Siemens of Munich Germany, or a flash-memory card. The medical data card may be, but does not have to be, credit-card sized so that it easily fits into pocketbooks, wallets and other such objects carried by the individual. The medical data card may be swiped through a device designed to access information stored on the data card. In an alternative embodiment, portable data storage devices other than data cards can be used. For example, a touch-memory device, such as the “i-button” produced by Dallas Semiconductor of Dallas, Tex. can store information about an individual's APOA4 haplotype content or response marker, and this device can be incorporated into objects such as jewelry. The data storage device may be implemented so that it can wirelessly communicate with routing/intelligence devices through IEEE 802.11 wireless networking technology or through other methods well known to the skilled artisan. Further, as stated above, information about an individual's haplotype content or response marker can also be stored in a file accessible by a computer; such files may be located on various media, including: a server, a client, a hard disk, a CD, a DVD, a personal digital assistant such as a Palm Pilot, a tape, a zip disk, the computer's internal ROM (read-only-memory) or the internet or worldwide web. Other media for the storage of files accessible by a computer will be obvious to one skilled in the art.
Any or all analytical and mathematical operations involved in practicing the methods of the present invention may be implemented by a computer. For example, the computer may execute a program that assigns APOA4 haplotype pairs and/or a progression marker I or a progression marker II to individuals based on genotype data inputted by a laboratory technician or treating physician. In addition, the computer may output the predicted progression of AD following input of the individual's APOA4 haplotype content or progression marker, which was either determined by the computer program or input by the technician or physician. Data on which progression markers were detected in an individual may be stored as part of a relational database (e.g., an instance of an Oracle database or a set of ASCII flat files) containing other clinical and/or haplotype data for the individual. These data may be stored on the computer's hard drive or may, for example, be stored on a CD ROM or on one or more other storage devices accessible by the computer. For example, the data may be stored on one or more databases in communication with the computer via a network.
It is also contemplated that the above described methods and compositions of the invention may be utilized in combination with identifying genotype(s) and/or haplotype(s) for other genomic regions.
Preferred embodiments of the invention are described in the following examples. Other embodiments within the scope of the claims herein will be apparent to one skilled in the art from consideration of the specification or practice of the invention as disclosed herein. It is intended that the specification, together with the examples, be considered exemplary only, with the scope and spirit of the invention being indicated by the claims that follow the examples.
The Examples herein are meant to exemplify the various aspects of carrying out the invention and are not intended to limit the scope of the invention in any way. The Examples do not include detailed descriptions for conventional methods employed, such as in the synthesis of oligonucleotides or polymerase chain reaction. Such methods are well known to those skilled in the art and are described in numerous publications, for example, MOLECULAR CLONING: A LABORATORY MANUAL, 2nd ed., supra.
This example illustrates the clinical and biochemical characterization of selected individuals in a cohort of 449 Caucasian patients diagnosed with Alzheimer's Disease.
The patient cohort was selected from patients participating in three clinical trials of galantamine (GAL-INT2, GAL-USA10, and GAL-INT1), and from patients participating in a non-galantamine clinical trial, but with a similar disease population as the galantamine trials (SAB-USA-25) (Rockwood et al., supra; Tariot et al., supra; Wilcock et al., supra). In brief, the trials were carried out by delivering to patients drug or placebo at daily dosages of 8 mg, 16 mg, 24 mg, or 32 mg depending on the trial. Following 3, 5, 6 or 12 months of treatment in the GAL-INT2, GAL-USA10, GAL-INT1 and SAB-USA25 trials, respectively, the severity of symptoms in patients were evaluated using the cognitive subscale of the Alzheimer's Disease Assessment Scale (ADAS-cog) (Rosen et al., supra; Rockwood et al., supra; Tariot et al., supra; Wilcock et al., supra). The ADAS-cog measures cognitive function, including spoken language ability, comprehension of spoken language, recall of test instructions, word-finding difficulty in spontaneous speech, following commands, naming objects and fingers, constructional praxis, ideational praxis, orientation, word-recall task and word-recognition task (Alzheimer's Insights Online, supra).
For the clinical association study described in Example 2 below, 141 placebo patients were selected and used to populate two groups in a tailed sampling strategy, intended to enrich alleles correlating with disease progression in the population. This population consisted of 89 placebo “responders” and 52 placebo “non-responders.” Patients were assigned to responder and non-responder groups based on having a change in ADAS-cog score (□ADAS-cog) that met a cut-off value that was chosen based on the differences in treatment times in the four clinical trials described above. This can be seen below in Table 6. Table 7 below shows the number of placebo patients from each of the four clinical trials that were placed in each of the clinical association analyses groups.
This example illustrates genotyping of the patient cohort for the nine APOA4 polymorphic sites selected by the inventors herein for analysis.
Genomic DNA samples were isolated from blood samples obtained from each member of the cohort and genotyped at each of PS1-PS9 (Table 2) using the MassARRAY technology licensed from Sequenom (San Diego, Calif.). In brief, this genotyping technology involves performing a homogeneous MassEXTEND assay (hME), in which an initial polymerase chain reaction is followed by an allele-specific oligonucleotide extension reaction in the same tube or plate well, and then detecting the extended oligonucleotide by MALDI-TOF mass spectrometry.
For each of the nine APOA4 polymorphic sites of interest, a genomic DNA sample was amplified in a 8.0 μL multiplexed PCR reaction consisting of 2.5 ng genomic DNA (0.3 ng/μL), 0.85 μL 10× reaction buffer, 0.32 units Taq Polymerase, up to five sets of 0.4 pmol each of forward PCR primer (5′ to 3′) and reverse PCR primer (3′ to 5′) and 1.6 nmol each of dATP, dCTP, dGTP and dTTP. A total of six reactions were performed comprising the following polymorphic site groups: (1) PS1; (2) PS2; (3) PS3; (4) PS4 and PS5; (5) PS6 and PS9; and (6) PS7 and PS8. Forward and Reverse PCR primers used for each of the nine APOA4 polymorphic sites consisted of a 10 base universal tag (5′-AGCGGATAAC-3′; SEQ ID NO:37) followed by one of the APOA4-specific sequences shown in Tables 8A and 8B below:
PCR thermocycling conditions were: initial denaturation of 95° C. for 15 minutes followed by 45 cycles of 94° C. for 20 seconds, 56° C. for 30 seconds and 72° C. for 1 minute followed by a final extension of 72° C. for 3 minutes. Following the final extension, unincorporated deoxynucleotides were degraded by adding 0.48 units of Shrimp Alkaline Phosphatase (SAP) to the PCR reactions and incubation for 20 minutes at 37° C. followed by 5 minutes at 85° C. to inactivate the SAP.
Template-dependent primer extension reactions were then performed on the multiplexed PCR products by adding a 2.0 μL volume of an hME cocktail consisting of 720 pmol each of three dideoxynucleotides and 720 pmol of one deoxynucleotide, 8.6 pmol of an extension primer, 0.2 μL of 5× Thermosequenase Reaction Buffer, and NanoPure grade water. The thermocycling conditions for the mass extension reaction were: initial denaturation for 2 minutes at 94° C. followed by 40 cycles of 94° C. for 5 seconds, 40° C. for 5 seconds and 72° C. for 5 seconds. Extension primers used to genotype each of the nine APOA4 polymorphic sites are shown in Table 9 below:
The extension products were desalted prior to analysis by mass spectrometry by mixing them with AG50X8 NH4OAc cation exchange resin. The desalted multiplexed extension products were applied onto a SpectroCHIP™ using the SpectroPOINT™ 24 pin applicator tool as per manufacturer's instructions (Sequenom Industrial Genomics, Inc. San Diego, Calif.). The SpectroChip™ was loaded into a Bruker Biflex III™ linear time-of flight mass spectrometer equipped with a SCOUT 384 ion source and data was acquired using XACQ 4.0, MOCTL 2.1, AutoXecute 4.2 and XMASS/XTOF 5.0.1 software on an Ultra 5™ work station (Sun Microsystems, Palo Alto Calif.). Mass spectrometry data was subsequently analyzed on a PC running Windows NT 4.0 (Microsoft, Seattle Wash.) with SpectroTYPER™ genotype calling software (Sequenom Industrial Genomics, Inc. San Diego, Calif.).
This example illustrates the deduction of haplotypes from the APOA4 genotyping data generated in Example 2.
Haplotypes were estimated from the unphased genotypes using a computer-implemented algorithm for assigning haplotypes to unrelated individuals in a population sample, essentially as described in WO 01/80156 (Genaissance Pharmaceuticals, Inc., New Haven, Conn.). In this method, haplotypes are assigned directly from individuals who are homozygous at all sites or heterozygous at no more than one of the variable sites. This list of haplotypes is then used to deconvolute the unphased genotypes in the remaining (multiply heterozygous) individuals.
A quality control analysis was performed on the deduced haplotypes, which included analysis of the frequencies of the haplotypes and individual SNPs therein for compliance with principles of Hardy-Weinberg equilibrium.
This example illustrates analysis of the APOA4 haplotypes in Table 1 for association with individuals' progression of AD.
The statistical analyses compared ΔADAS-cog in patients with one or zero copies vs. two copies (within a patient's genome) of a particular allele, using a logistic regression analysis on two-degrees of freedom to associate progression of AD with a particular haplotype. The following covariates were also included: age, gender, history, smoking, ADAS-cog baseline, dose (BID), body mass index, and CYP2D6. The logistic regression included assessment of associations between the haplotypes and the binary outcome of progression of AD.
For the results obtained on the analyses, adjustments were made for multiple comparisons, using a permutation test (MULTIVARIATE PERMUTATION TESTS: WITH APPLICATIONS IN BIOSTATISTICS, Pesarin, John Wiley and Sons, New York, 2001). In this test, a sub-haplotype's data for each observation were kept constant, while all the remaining variables (outcome and covariates) were randomly permuted so that covariates always stayed with the same outcome. The permutation model was fitted for each of the several haplotypes, and the lowest p-value was kept. In total, 1000 permutations were done. Eighteen APOA4 haplotypes of at least one polymorphism were identified that show a correlation with an individual's progression of AD. These APOA4 haplotypes are shown above in Table 1, and the unadjusted (“raw”) and adjusted (“perm.”) p-values for 18 four haplotypes are shown below in Table 10.
As seen in Table 10, each of the 18 haplotypes shows a correlation with an individual's progression of AD. When p-values were adjusted for multiple comparisons, haplotype (1) showed the strongest correlation. The odds ratio (O.R.) column indicates the likelihood that an individual with two copies of a particular haplotype will exhibit a slower progression of AD as compared to an individual with one copy or zero copies of that haplotype. An O.R. greater than 1 indicates that an individual with two copies is less likely to exhibit a slower progression of AD than an individual with one copy or zero copies, and an O.R. less than 1 indicates that an individual with two copies is more likely to exhibit a slower progression of AD than an individual with one copy or zero copies.
In view of the above, it will be seen that the several advantages of the invention are achieved and other advantageous results attained. As various changes could be made in the above methods and compositions without departing from the scope of the invention, it is intended that all matter contained in the above description and shown in the accompanying drawings shall be interpreted as illustrative and not in a limiting sense.
All references cited in this specification, including patents and patent applications, are hereby incorporated in their entirety by reference. The discussion of references herein is intended merely to summarize the assertions made by their authors and no admission is made that any reference constitutes prior art. Applicants reserve the right to challenge the accuracy and pertinence of the cited references.
Number | Date | Country | |
---|---|---|---|
60524467 | Nov 2003 | US |