The present invention relates to single nucleotide polymorphisms in nucleic acids involved in encoding enzymes in the testosterone biosynthetic pathway and to methods for detecting such polymorphisms. The invention has utility in the diagnosis, prognosis, prevention and treatment of disease, particularly those relating to prostate cancer and breast cancer.
Prostate cancer is the most common non-skin cancer in males all over the world. Currently, there are no means to predict how aggressive an individual's cancer will be. Thus, many patients are given unnecessary drastic treatment with severe side effects and possibly others do not receive treatment effective enough.
Incidence of prostate cancer shows strong age dependence, being a disease of old men, and strong race dependence, being almost twice as common in African Americans as in Caucasians, while Asian populations have the lowest risk (Cook et al. (1999) J Urol 161, 152-155; Hsing et al. (2000) Int J Cancer 85, 60-67). The third well-known risk factor is having a family history of prostate cancer (Cerhan et al. (1999) Cancer Epidemiol Biomarkers Prev 8, 53-60; Kalish et al. (2000) Urology 56, 803-806), and several studies have supported the presence of predisposing genetic factors.
Genome wide linkage analyses have pointed multiple chromosomal regions showing linkage in prostate cancer families and several prostate cancer candidate loci have been suggested; HPC1 in 1q24 (Smith et al. (1996) Science 274, 1371-1374), HPCX in Xq27 (Xu et al. (1998) Nat Genet 20, 175-179), PCAP in 1q42.2 (Berthon et al. (1998) Am J Hum Genet 62, 1416-1424), CABP in 1p36 (Gibbs et al. (1999) Am J Hum Gen 64, 776-787), and HPC2/ELAC2 in 17p (Tavtigian et al. (2001) Nat Genet 27, 172-180). Recently, a candidate cancer-susceptibility gene, RNASEL, was cloned at the HPC1 loci (Carpten et al. (2002) Nat Genet 30, 181-184) and two possibly deleterious germline mutations segregating in prostate cancer families were discovered.
The growth of prostate cells is dependent on active testosterone (Ekman (1995) J Urol 101, 22-25) and strikingly, prostate adenocarcinomas can be created by testosterone administration in rats (Gupta et al. (1999) Cancer Res 59, 2115-2120). Testosterone seems to be a strong tumour promoter for the rat prostate, even at doses that do not measurably increase circulating testosterone (Bosland et al. (1991) Princess Takamatsu Symp 22,109-123). Consequently, genes involved in the testosterone biosynthetic pathway, e.g., CYP17, CYP3A4, and SRD5A2 (
Approximately 55 different Cytochrome P450 genes are present in the human genome and are classified into different families and subfamilies on the basis of sequence homology. Members of the CYP3A subfamily catalyze the oxidative, peroxidative and reductive metabolism of different endobiotics, drugs, and protoxic or procarcinogenic molecules. As an example, CYP3A4 is responsible for the oxidative metabolism of an estimated 60% of all clinically used drugs. Up to 30-fold interindividual differences in expression has been detected, causing variation in oral bioavailability and systemic clearance of CYP3A substrates, such as HIV protease inhibitors, several calcium channel blockers and some cholesterol-lowering drugs. Variation in CYP3A expression is particularly important in substrates with narrow therapeutic indices, such as cancer chemotherapeutics and immunosuppressants. Variation in CYP3A expression can result in clinically significant differences in drug toxicities and response.
As with prostate cancer, breast cancer also shows age-dependency indicating a possible hormonal influence on the disease risk. Endogenous oestradiol synthesis takes place in the ovarian theca cells of pre-menopausal women, in the stromal adipose cells of the breast of post-menopausal women, and in minor quantities in peripheral tissue. These cells, as well as breast cancer tissue, express all the necessary enzymes for this synthesis, including CYP17, and enzymes that further hydroxylate oestradiol, such as CYP3A4 (Kristensen et al. (2000) Mutat Res 462, 323-333). Thus, polymorphisms in these enzymes may also be associated with the risk of breast cancer (Kristensen et al. (2000) Mutat Res 462, 323-333). Furthermore, CYP3A4 is also involved in the activation of many mammary carcinogens, such as the polycyclic aromatic hydrocarbons and heterocyclic amines (Guengerich et al. (1991) Chem Res Toxicol. 4, 168-179). According to a recent study (Zheng et al. (2001) Cancer Epidemiol Biomarkers Prev 10, 237-242), high CYP3A4 activity may be a risk factor for breast cancer risk.
Single nucleotide polymorphisms (SNPs) are the most common type of genetic variation in the human genome, and are expected to be helpful in identifying human disease genes. In addition to occurring frequently, on average every 500-2,000 bp (Li & Sadler (1991) Genetics 129, 513-523; Chakravarti (1998) Nat Genet 19, 216-217; Cargill et al. (1999) Nat Genet 22, 231-238; Halushka et al. (1999) Nat Genet 22, 239-247), SNPs have a low mutation rate when compared to microsatellite markers, both of which are characteristics that may have particular advantages for association analysis. The utility of SNPs is not only in their use as markers for discovering additional functional variants and for the general evaluation of a specific gene in the context of a given clinical phenotype but also in their potential functional relevance. However, rather than finding a single SNP with drastic effect on the phenotype, more likely it will be multiple SNPs in relevant genes, either linked (i.e., grouped as a haplotype) or independent (perhaps on different chromosomes), that contribute to the phenotype.
Recently, several studies have shown the utility of haplotypes, i.e., a combination of SNPs with alleles physically assigned to a chromosome, in association analysis (Daly et al. (2001) Nat Genet 29, 229-232). Studying haplotypes might give the analysis more power but traditionally demands either samples from multiple generations or tedious molecular haplotyping. Alternatively, several algorithms have been developed for inferring haplotypes from genotype data (Clark (1990) Mol Biol Evol 7, 111-122; Excoffier & Slatkin s (1995) Mol Biol Evol 12, 921-927; Stephens et al. (2001) Am J Hum Genet 68, 978-989). These algorithms have been proven to work with a very low error rate (Drysdale et al. (2000) PNAS 97, 10483-10488). In a sense, haplotyping is equivalent to performing a study in a family or other select group of people. It helps to get back the power of linkage, and can be regarded as a crucial step in association studies using random individuals.
WO02/055735 discloses specific nucleic acids useful for identifying, diagnosing, monitoring, staging, imaging and treating prostate cancer and breast cancer. Similar compositions comprising prostate specific nucleic acids are described by the same applicant (Diadexus Inc.) in related applications (WO02/42776, WO02/42499, WO02/42463, WO02/42329, WO02/39431, WO02/239431, WO02/38810, WO02/38810, WO02/236808 and WO0224718).
Diadexus Inc. have also disclosed a method of diagnosing, monitoring, staging, imaging and treating prostate and breast cancer by means of specific nucleic acids, in a series of related applications (WO01/39798 & WO00/23111 & WO00/23108).
WO01/53537 (DZ Genes Inc.) describes isolated polynucleotides containing at least one polymorphism useful for the diagnosis of disease, particularly prostate and breast cancer.
Single nucleotide polymorphisms associated with prostate cancer are disclosed in WO01/83828, as are methods for using these SNPs to determine susceptibility to this disease.
In order to improve the lives of prostate and breast cancer patients it is essential to develop prognostic markers for cancer as well as markers allowing general assessment of disease risk. Patients need to be categorized into those needing immediate, extensive treatment, and those who just need watchful waiting. As a result, prostate and breast cancer mortality could be reduced and unnecessary side effects caused by invasive treatments could be avoided. There is therefore a need for prognostic molecular markers for aggressive breast and prostate cancer to aid predicting, diagnosing and monitoring these diseases in individuals. Furthermore, there is a continued need for improved methods of treatment of both conditions in patients. The present invention addresses these needs and provides improvements over the prior art in the form of novel and specific nucleic acids, microarrays and kits useful for the diagnosis of breast and prostate cancer.
According to the first aspect of the present invention, there is provided an isolated polynucleotide selected from the group consisting of a nucleotide sequence comprising one or more polymorphic sequences of SEQ ID NOS 1-34. Suitably, a fragment of the isolated polynucleotide comprises a polymorphic site in the polymorphic sequence.
In a second aspect of the present invention, there is provided an isolated polynucleotide comprising a sequence complementary to one or more of the polymorphic sequences of SEQ ID NOS 1-34. Suitably, a fragment of the complementary nucleotide sequence comprises a polymorphic site in the polymorphic sequence.
Preferably, the polynucleotides of the first and second aspect comprise DNA, RNA, cDNA, or mRNA
Preferably, at least one single nucleotide polymorphism of the isolated polynucleotide is at a position selected from the group consisting of position [CYP3A4_IVS9 +187] of SEQ ID No. 1, position [CYP3A4, 1639 base pairs after the stop codon] of SEQ ID No. 2, position [CYP3A4, 945 base pairs after the stop codon] of SEQ ID No. 3, position [CYP3A4—5′ region −747] of SEQ ID No. 4, position [CYP3A4_IVS7 −202] of SEQ ID No. 5, position [CYP3A4, 2204 base pairs after the stop codon] of SEQ ID No. 6, position [CYP3A4_IVS2 −132] of SEQ ID No. 7, position [CYP3A4_IVS1 −868] of SEQ ID No. 8, position [CYP3A4—5′ region −847] of SEQ ID No. 9, position [CYP3A4, 766 base pairs after the stop codon] of SEQ ID No. 10, position [CYP3A4, 1454 base pairs after the stop codon] of SEQ ID No. 11, position [CYP3A4_IVS3 +1992] of SEQ ID No. 12, position [CYP3A4_IVS9 +841] of SEQ ID No. 13, position [CYP3A4_IVS12 −473 of SEQ ID No. 14, position [CYP3A4_IVS12 +581] of SEQ ID No. 15, position [CYP3A4_IVS12 +586] of SEQ ID No. 16, position [CYP3A4_IVS12 +646] of SEQ ID No. 17, position [CYP3A4_IVS3 −734] of SEQ ID No. 18, position [CYP17_IVS1 −271] of SEQ ID No. 19, position [CYP17_IVS5 +75] of SEQ ID No. 20, position [CYP17_IVS1 +426] of SEQ ID No. 21, position [CYP17_IVS1 −99] of SEQ ID No. 22, position [CYP17_IVS1 −700] of SEQ ID No. 23, position [CYP17_IVS1 −565] of SEQ ID No. 24, position [CYP17_IVS3 +141] of SEQ ID No. 25, position [CYP17—5′ region −1488] of SEQ ID No. 26, position (CYP17—5′ region −1204] of SEQ ID No. 27, position [CYP17_IVS1 +466] of SEQ ID No. 28, position [CYP17, 712 base pairs after the stop codon] of SEQ ID No. 29, position [SRD5A2, 1356 base pairs after the stop codon (3′ UTR)] of SEQ ID No. 30, position [SRD5A2, 849 base pairs after the stop codon (3′ UTR)] of SEQ ID No. 31, position [SRD5A2—5′ region −870] of SEQ ID No. 32, position [SRD5A2—5′ region between −2036 and −2030] of SEQ ID No. 33, and position [SRD5A2, 545 base pairs after the stop codon (3′ UTR)] of SEQ ID No. 34.
More preferably, at least one single nucleotide polymorphism is selected from the group consisting of [CYP3A4_IVS9 +187C>G] of SEQ ID No. 1, [CYP3A4, 1639 base pairs after the stop codon, A>T] of SEQ ID No. 2, [CYP3A4, 945 base pairs after the stop codon, A>T] of SEQ ID No. 3, [CYP3A4—5′ region −747C>G] of SEQ ID No. 4, [CYP3A4_IVS7 −202C>T] of SEQ ID No. 5, [CYP3A4, 2204 base pairs after the stop codon, G>C] of SEQ ID No. 6, [CYP3A4_IVS2 −132C>T] of SEQ ID No. 7, [CYP3A4_IVS1 −868C>T] of SEQ ID No. 8, [CYP3A4—5′ region −847A>T] of SEQ ID No. 9, [CYP3A4, 766 base pairs after the stop codon, delT] of SEQ ID No. 10, [CYP3A4, 1454 base pairs after the stop codon, C>T] of SEQ ID No. 11, [CYP3A4_IVS3 +1992T>C] of SEQ ID No. 12, [CYP3A4_IVS9 +841T>G] of SEQ ID No. 13, [CYP3A4_IVS12 −473T>G] of SEQ ID No. 14, [CYP3A4_IVS12 +581C>T] of SEQ ID No. 15, [CYP3A4_IVS12 +586G>A] of SEQ ID No. 16, [CYP3A4_IVS12 +646C>A] of SEQ ID No. 17, [CYP3A4_IVS3 −734G>A] of SEQ ID No. 18, [CYP17_IVS1 −271A>C] of SEQ ID No. 19, [CYP17_IVS5 +75C>G] of SEQ ID No. 20, [CYP17_IVS1 +426G>A] of SEQ ID No. 21, [CYP17_IVS1 −99C>T] of SEQ ID No. 22, [CYP17_IVS1 −700C>G] of SEQ ID No. 23, [CYP17_IVS1 −565G>A] of SEQ ID No. 24, [CYP17_IVS3 +141A>T] of SEQ ID No. 25, [CYP17—5′ region −1488C>G] of SEQ ID No. 26, [CYP17—5′ region −1204C>T] of SEQ ID No. 27, [CYP17_IVS1 +466G>A] of SEQ ID No. 28, [CYP17, 712 base pairs after the stop codon, G>A] of SEQ ID No. 29, [SRD5A2, 1356 base pairs after the stop codon (3′ UTR), A>C] of SEQ ID No. 30, [SRD5A2, 849 base pairs after the stop codon (3′ UTR), A>G] of SEQ ID No. 31, [SRD5A2—5′ region −870G>A] of SEQ ID No. 32, [SRD5A2—5′ region −2036(A)7-8] of SEQ ID No. 33, and [SRD5A2, 545 base pairs after the stop codon (3′ UTR), T>C] of SEQ ID No. 34.
Optionally, the polynucleotide is the complement of any of the isolated polynucleotides hereinbefore described.
In one aspect, the polynucleotide comprises part of the CYP17 gene, the CYP3A4 gene or the SRD5A2 gene.
Preferably, the isolated polynucleotide further comprises a detectable label. More preferably, the detectable label is selected from the group consisting of fluorophore, radionuclide, peptide, enzyme, antibody and antigen. In a preferred embodiment, the fluorophore is a fluorescent compound selected from the group consisting of Hoechst 33342, Cy2, Cy3, Cy5, CypHer, coumarin, FITC, DAPI, Alexa 633, DRAQ5 and Alexa 488.
In a third aspect of the present invention, there is provided a method for diagnosing a genetic susceptibility for a disease, condition or disorder related to prostate or breast cancer in a subject, the method comprising analysing a biological sample containing nucleic acid obtained from the subject to detect the presence or absence of one or more single nucleotide polymorphisms at a position selected from the group consisting of position [CYP3A4_IVS9 +187] of SEQ ID No. 1, position [CYP3A4, 1639 base pairs after the stop codon] of SEQ ID No. 2, position [CYP3A4, 945 base pairs after the stop codon] of SEQ ID No. 3, position [CYP3A4—5′ region −747] of SEQ ID No. 4, position [CYP3A4_IVS7 −202] of SEQ ID No. 5, position [CYP3A4, 2204 base pairs after the stop codon] of SEQ ID No. 6, position [CYP3A4_IVS2 −132] of SEQ ID No. 7, position [CYP3A4_IVS1 −868] of SEQ ID No. 8, position [CYP3A4—5′ region −847] of SEQ ID No. 9, position [CYP3A4, 766 base pairs after the stop codon] of SEQ ID No. 10, position [CYP3A4, 1454 base pairs after the stop codon] of SEQ ID No. 11, position [CYP3A4_IVS3 +1992] of SEQ ID No. 12, position [CYP3A4_IVS9 +841] of SEQ ID No. 13, position [CYP3A4_IVS12 −473] of SEQ ID No. 14, position [CYP3A4_IVS12 +581] of SEQ ID No. 15, position [CYP3A4_IVS12 +586] of SEQ ID No. 16, position [CYP3A4_IVS12 +646] of SEQ ID No. 17, position [CYP3A4_IVS3 −734] of SEQ ID No. 18, position [CYP17_IVS1 −271] of SEQ ID No. 19, position [CYP17_IVS5 +75] of SEQ ID No. 20, position [CYP17_IVS1 +426] of SEQ ID No. 21, position [CYP17_IVS1 −99] of SEQ ID No. 22, position [CYP17_IVS1 −700] of SEQ ID No. 23, position [CYP17_IVS1 −565] of SEQ ID No. 24, position [CYP17_IVS3 +141] of SEQ ID No. 25, position [CYP17—5′ region −1488] of SEQ ID No. 26, position [CYP17—5′ region −1204] of SEQ ID No. 27, position [CYP17_IVS1 +466] of SEQ ID No. 28, position [CYP17, 712 base pairs after the stop codon] of SEQ ID No. 29, position [SRD5A2, 1356 base pairs after the stop codon (3′ UTR)] of SEQ ID No. 30, position [SRD5A2, 849 base pairs after the stop codon (3′ UTR)] of SEQ ID No. 31, position [SRD5A2—5′ region −870] of SEQ ID No. 32, position [SRD5A2—5′ region between −2036 and −2030] of SEQ ID No. 33, position [SRD5A2, 545 base pairs after the stop codon (3′ UTR)] of SEQ ID No. 34, position [SRD5A2_IVS2 +626] of SEQ ID No. 35, position [SRD5A2—5′ region −8029] of SEQ ID No. 36, position [CYP3A4_IVS7 +34] of SEQ ID No. 42, position [CYP3A4—5′ region −1232] of SEQ ID No. 43, position [SRD5A2—5′ region −3001] of SEQ ID No. 44, and position [SRD5A2, 1552 base pairs after the stop codon] of SEQ ID No. 45.
Suitably, the nucleic acid is DNA, RNA, cDNA or mRNA.
Preferably, the single nucleotide polymorphism is selected from the group consisting of [CYP3A4_IVS9 +187C>G] of SEQ ID No. 1, [CYP3A4, 1639 base pairs after the stop codon, A>T] of SEQ ID No. 2, [CYP3A4, 945 base pairs after the stop codon, A>T] of SEQ ID No. 3, [CYP3A4—5′ region −747C>G] of SEQ ID No. 4, [CYP3A4_IVS7 −202C>T] of SEQ ID No. 5, [CYP3A4, 2204 base pairs after the stop codon, G>C] of SEQ ID No. 6, [CYP3A4_IVS2 −132C>T] of SEQ ID No. 7, [CYP3A4_IVS1 −868C>T] of SEQ ID No. 8, [CYP3A4—5′ region −847A>T] of SEQ ID No. 9, [CYP3A4, 766 base pairs after the stop codon, delT] of SEQ ID No. 10, [CYP3A4, 1454 base pairs after the stop codon, C>T] of SEQ ID No. 11, [CYP3A4_IVS3 +1992T>C] of SEQ ID No. 12, [CYP3A4_IVS9 +841T>G] of SEQ ID No. 13, [CYP3A4_IVS12 −473T>G] of SEQ ID No. 14, [CYP3A4_IVS12 +581C>T] of SEQ ID No. 15, [CYP3A4_IVS12 +586G>A] of SEQ ID No. 16, [CYP3A4_IVS12 +646C>A] of SEQ ID No. 17, [CYP3A4_IVS3 −734G>A] of SEQ ID No. 18, [CYP17_IVS1 −271A>C] of SEQ ID No. 19, [CYP17_IVS5 +75C>G] of SEQ ID No. 20, [CYP17_IVS1 +426G>A] of SEQ ID No. 21, [CYP17_IVS1 −99C>T] of SEQ ID No. 22, [CYP17_IVS1 −700C>G] of SEQ ID No. 23, [CYP17_IVS1 −565G>A] of SEQ ID No. 24, [CYP17_IVS3 +141A>T] of SEQ ID No. 25, [CYP17—5′ region −1488C>G] of SEQ ID No. 26, [CYP17—5′ region −1204C>T] of SEQ ID No. 27, [CYP17_IVS1 +466G>A] of SEQ ID No. 28, [CYP17, 712 base pairs after the stop codon, G>A] of SEQ ID No. 29, [SRD5A2, 1356 base pairs after the stop codon (3′ UTR), A>C] of SEQ ID No. 30, [SRD5A2, 849 base pairs after the stop codon (3′ UTR), A>G] of SEQ ID No. 31, [SRD5A2—5′ region −870G>A] of SEQ ID No. 32, [SRD5A2—5′ region −2036(A)7-8] of SEQ ID No. 33, [SRD5A2, 545 base pairs after the stop codon (3′ UTR), T>C] of SEQ ID No. 34, [SRD5A2_IVS2 +626C>T] of SEQ ID No. 35, [SRD5A2—5′ region −8029C>T] of SEQ ID No. 36, [CYP3A4_IVS7 +34T>G] of SEQ ID No. 42, [CYP3A4—5′ region −1232C>T] of SEQ ID No. 43, [SRD5A2—5′ region −3001G>A] of SEQ ID No. 44, and [SRD5A2, 1552 base pairs after the stop codon, G>A] of SEQ ID No. 45.
Optionally, the single nucleotide polymorphism is selected from the complement of any of the single nucleotide polymorphisms described hereinbefore.
Suitably, the analysis is accomplished by sequencing, genotyping, fragment analysis, hybridisation, restriction fragment analysis, oligonucleotide ligation or allele specific PCR. Preferably, the analysis is accomplished by hybridisation, the method comprising the steps of
In a fourth aspect of the present invention, there is provided a method for diagnosing a genetic susceptibility for a disease, condition or disorder related to prostate or breast cancer in a subject, or predicting an individual's response to a drug, the method comprising adding an antibody to a polypeptide present in a biological sample obtained from the subject which polypeptide is encoded by a polynucleotide selected from the group consisting of SEQ ID NOS 1-36 and SEQ ID NOS 42-45, or the complement thereof, and detecting specific binding of the antibody to the polypeptide.
In a fifth aspect of the present invention, there is provided a kit comprising at least one isolated polynucleotide of at least 5 contiguous nucleotides of SEQ ID NOS: 1-36 or 42-45, or the complement thereof, and containing at least one single nucleotide polymorphic site associated with a disease, condition or disorder related to prostate or breast cancer together with instructions for the use thereof for detecting the presence or the absence of said at least single nucleotide polymorphism in said nucleic acid.
In a sixth aspect of the present invention, there is provided an oligonucleotide array comprising at least one oligonucleotide capable of hybridising to a first polynucleotide at a polymorphic site encompassed therein, wherein the first polynucleotide comprises a nucleotide sequence comprising one or more polymorphic sequences of SEQ ID NOS: 1-36 and SEQ ID NOS: 42-45.
Suitably, the first polynucleotide comprises a fragment of any of the nucleotide sequences, the fragment comprising a polymorphic site in the polymorphic sequence.
Suitably, the first polynucleotide is a complementary nucleotide sequence comprising a sequence complementary to one or more polymorphic sequences of SEQ ID NOS: 1-36 and SEQ ID NOS: 42-45.
Suitably, the first polynucleotide comprises a fragment of said complementary sequence, the fragment comprising a polymorphic site in the polymorphic sequence.
Suitably, the position of the polymorphic site in the kit or the microarray as hereinbefore described is at a position selected from the group consisting of position [CYP3A4_IVS9 +187] of SEQ ID No. 1, position [CYP3A4, 1639 base pairs after the stop codon] of SEQ ID No. 2, position [CYP3A4, 945 base pairs after the stop codon] of SEQ ID No. 3, position [CYP3A4—5′ region −747] of SEQ ID No. 4, position [CYP3A4_IVS7 −202] of SEQ ID No. 5, position [CYP3A4, 2204 base pairs after the stop codon] of SEQ ID No. 6, position [CYP3A4_IVS2 −132] of SEQ ID No. 7, position [CYP3A4_IVS1 −868] of SEQ ID No. 8, position [CYP3A4—5′ region −847] of SEQ ID No. 9, position [CYP3A4, 766 base pairs after the stop codon] of SEQ ID No. 10, position [CYP3A4, 1454 base pairs after the stop codon] of SEQ ID No. 11, position [CYP3A4_IVS3 +1992] of SEQ ID No. 12, position [CYP3A4_IVS9 +841] of SEQ ID No. 13, position [CYP3A4_IVS12 −473] of SEQ ID No. 14, position [CYP3A4_IVS12 +581] of SEQ ID No. 15, position [CYP3A4_IVS12 +586] of SEQ ID No. 16, position [CYP3A4_IVS12 +646] of SEQ ID No. 17, position [CYP3A4_IVS3 −734] of SEQ ID No. 18, position [CYP17_IVS1 −271] of SEQ ID No. 19, position [CYP17_IVS5 +75] of SEQ ID No. 20, position [CYP17_IVS1 +426] of SEQ ID No. 21, position [CYP17_IVS1 −99] of SEQ ID No. 22, position [CYP17_IVS1 −700] of SEQ ID No. 23, position [CYP17_IVS1 −565] of SEQ ID No. 24, position [CYP17_IVS3 +141] of SEQ ID No. 25, position [CYP17—5′ region −1488] of SEQ ID No. 26, position [CYP17—5′ region −1204] of SEQ ID No. 27, position [CYP17_IVS1 +466] of SEQ ID No. 28, position [CYP17, 712 base pairs after the stop codon] of SEQ ID No. 29, position [SRD5A2, 1356 base pairs after the stop codon (3′ UTR)] of SEQ ID No. 30, position [SRD5A2, 849 base pairs after the stop codon (3′ UTR)] of SEQ ID No. 31, position [SRD5A2—5′ region −870] of SEQ ID No. 32, position [SRD5A2—5′ region between −2036 and −2030] of SEQ ID No. 33, position [SRD5A2, 545 base pairs after the stop codon (3′ UTR)] of SEQ ID No. 34.position [SRD5A2_IVS2 +626] of SEQ ID No. 35, position [SRD5A2—5′ region −8029] of SEQ ID No. 36, position [CYP3A4_IVS7 +34] of SEQ ID No. 42, position [CYP3A4—5′ region −1232] of SEQ ID No. 43, position [SRD5A2—5′ region −3001] of SEQ ID No. 44 and position [SRD5A2, 1552 base pairs after the stop codon] of SEQ ID No. 45.
Preferably, at least one single nucleotide polymorphism is selected from the group consisting of [CYP3A4_IVS9 +187C>G] of SEQ ID No. 1, [CYP3A4, 1639 base pairs after the stop codon, A>T] of SEQ ID No. 2, [CYP3A4, 945 base pairs after the stop codon, A>T] of SEQ ID No. 3, [CYP3A4—5′ region −747C>G] of SEQ ID No. 4, [CYP3A4_IVS7 −202C>T] of SEQ ID No. 5, [CYP3A4, 2204 base pairs after the stop codon, G>C] of SEQ ID No. 6, [CYP3A4_IVS2 −132C>T] of SEQ ID No. 7, [CYP3A4_IVS1 −868C>T] of SEQ ID No. 8, [CYP3A4—5′ region −847A>T] of SEQ ID No. 9, [CYP3A4, 766 base pairs after the stop codon, delT] of SEQ ID No. 10, [CYP3A4, 1454 base pairs after the stop codon, C>T] of SEQ ID No. 11, [CYP3A4_IVS3 +1992T>C] of SEQ ID No. 12, [CYP3A4_IVS9 +841T>G] of SEQ ID No. 13, [CYP3A4_IVS12 −473T>G] of SEQ ID No. 14, [CYP3A4_IVS12 +581C>T] of SEQ ID No. 15, [CYP3A4_IVS12 +586G>A] of SEQ ID No. 16, [CYP3A4_IVS12 +646C>A] of SEQ ID No. 17, [CYP3A4_IVS3 −734G>A] of SEQ ID No. 18, [CYP17_IVS1 −271A>C] of SEQ ID No. 19, [CYP17_IVS5 +75C>G] of SEQ ID No. 20, [CYP17_IVS1 +426G>A] of SEQ ID No. 21, [CYP17_IVS1 −99C>T] of SEQ ID No. 22, [CYP17_IVS1 −700C>G] of SEQ ID No. 23, [CYP17_IVS1 −565G>A] of SEQ ID No. 24, [CYP17_IVS3 +141A>T] of SEQ ID No. 25, [CYP17—5′ region −1488C>G] of SEQ ID No. 26, [CYP17—5′ region −1204C>T] of SEQ ID No. 27, [CYP17_IVS1 +466G>A] of SEQ ID No. 28, [CYP17, 712 base pairs after the stop codon, G>A] of SEQ ID No. 29, [SRD5A2, 1356 base pairs after the stop codon (3′ UTR), A>C] of SEQ ID No. 30, [SRD5A2, 849 base pairs after the stop codon (3′ UTR), A>G] of SEQ ID No. 31, [SRD5A2—5′ region −870G>A] of SEQ ID No. 32, [SRD5A2—5′ region −2036(A)7-8] of SEQ ID No. 33, [SRD5A2, 545 base pairs after the stop codon (3′ UTR), T>C] of SEQ ID No. 34, [SRD5A2_IVS2 +626C>T] of SEQ ID No. 35, [SRD5A2—5′ region −8029C>T] of SEQ ID No. 36, [CYP3A4_IVS7 +34T>G] of SEQ ID No. 42, [CYP3A4—5′ region −1232C>T] of SEQ ID No. 43, [SRD5A2—5′ region −3001 G>A] of SEQ ID No. 44, and [SRD5A2, 1552 base pairs after the stop codon, G>A] of SEQ ID No. 45.
Optionally, at least one single nucleotide polymorphism is the complement of any of the single nucleotide polymorphisms as hereinbefore described.
Suitably, the oligonucleotide further comprises a detectable label. Preferably, the label is selected from the group consisting of fluorophore, radionuclide, peptide, enzyme, antibody or antigen. More preferably, the fluorophore is a fluorescent compound selected from the group consisting of Hoechst 33342, Cy2, Cy3, Cy5, CypHer, coumarin, FITC, DAPI, Alexa 633 DRAQ5 and Alexa 488.
In a seventh aspect of the present invention, there is provided a method of treatment or prophylaxis of a subject comprising the steps of
Treatment may take a variety of forms depending upon the nature of the cancer. Hormonal therapy is a widely used treatment for patients with metastatic carcinoma of the prostate (Goethuys et al. (1997) Am J Clin Oncol. 20, 40-45). Such treatment may, for example, involve androgen deprivation by surgical (e.g. orchiectomy) or androgen suppressive agents such as estrogens, (e.g. diethylstilbestrol), antiandrogens (e.g. flutamide) and luteinising hormone-releasing hormone agonists (e.g. leuprolide). Radiotherapy using radionuclides, is such as 32Phosphorus or 89Strontium, can be an effective treatment for the disease. There is also growing interest in the development of vaccines (Slovin (2001) Hematol. Oncol. Clinic N. Am, 15, 477-496) or the use of gene therapeutic methods (Ferrer & Rodriguez (2001) Hematol Oncol Clinic of N. Am 15, 497-508) for the treatment of prostate cancer.
Suitably, the nucleic acid is selected from the group consisting of DNA, RNA and mRNA.
Preferably, the sample is analysed to detect the presence or absence of at least one single nucleotide polymorphism at a position selected from the group consisting of position [CYP3A4_IVS9 +187] of SEQ ID No. 1, position [CYP3A4, 1639 base pairs after the stop codon] of SEQ ID No. 2, position [CYP3A4, 945 base pairs after the stop codon] of SEQ ID No. 3, position [CYP3A4—5′ region 31 747] of SEQ ID No. 4, position [CYP3A4_IVS7 −202] of SEQ ID No. 5, position [CYP3A4, 2204 base pairs after the stop codon] of SEQ ID No. 6, position [CYP3A4_IVS2 −132] of SEQ ID No. 7, position [CYP3A4_IVS1 −868] of SEQ ID No. 8, position [CYP3A4—5′ region −847] of SEQ ID No. 9, position [CYP3A4, 766 base pairs after the stop codon] of SEQ ID No. 10, position [CYP3A4, 1454 base pairs after the stop codon] of SEQ ID No. 11, position [CYP3A4_IVS3 +1992] of SEQ ID No. 12, position [CYP3A4_IVS9 +841] of SEQ ID No. 13, position [CYP3A4_IVS12 −473] of SEQ ID No. 14, position [CYP3A4_IVS12 +581] of SEQ ID No. 15, position [CYP3A4_IVS12 +586] of SEQ ID No. 16, position [CYP3A4_IVS12 +646] of SEQ ID No. 17, position [CYP3A4_IVS3 −734] of SEQ ID No. 18, position [CYP17_IVS1 −271] of SEQ ID No. 19, position [CYP17_IVS5 +75] of SEQ ID No. 20, position [CYP17_IVS1 +426] of SEQ ID No. 21, position [CYP17_IVS1 −99] of SEQ ID No. 22, position [CYP17_IVS1 −700] of SEQ ID No. 23, position [CYP17_IVS1 −565] of SEQ ID No. 24, position [CYP17_IVS3 +141] of SEQ ID No. 25, position [CYP17—5′ region −1488] of SEQ ID No. 26, position [CYP17—5′ region −1204] of SEQ ID No. 27, position [CYP17_IVS1 +466] of SEQ ID No. 28, position [CYP17, 712 base pairs after the stop codon] of SEQ ID No. 29, position [SRD5A2, 1356 base pairs after the stop codon (3′ UTR)] of SEQ ID No. 30, position [SRD5A2, 849 base pairs after the stop codon (3′ UTR)] of SEQ ID No. 31, position [SRD5A2—5′ region −870] of SEQ ID No. 32, position [SRD5A2—5′ region between −2036 and −2030] of SEQ ID No. 33, position [SRD5A2, 545 base pairs after the stop codon (3′ UTR)] of SEQ ID No. 34.position [SRD5A2_IVS2 +626] of SEQ ID No. 35, position [SRD5A2—5′ region −8029] of SEQ ID No. 36, position [CYP3A4_IVS7 +34] of SEQ ID No. 42, position [CYP3A4—5′ region −1232] of SEQ ID No. 43, position [SRD5A2—5′ region −3001] of SEQ ID No. 44, and position [SRD5A2, 1552 base pairs after the stop codon] of SEQ ID No. 45.
More preferably, at least one single nucleotide polymorphism is selected from the group consisting of [CYP3A4_IVS9 +187C>G] of SEQ ID No. 1, [CYP3A4, 1639 base pairs after the stop codon, A>T] of SEQ ID No. 2, [CYP3A4, 945 base pairs after the stop codon, A>T] of SEQ ID No. 3, [CYP3A4—5′ region −747C>G] of SEQ ID No. 4, [CYP3A4_IVS7 −202C>T] of SEQ ID No. 5, [CYP3A4, 2204 base pairs after the stop codon, G>C] of SEQ ID No. 6, [CYP3A4_IVS2 −132C>T] of SEQ ID No. 7, [CYP3A4_IVS1 −868C>T] of SEQ ID No. 8, [CYP3A4—5′ region −847A>T] of SEQ ID No. 9, [CYP3A4, 766 base pairs after the stop codon, delT] of SEQ ID No. 10, [CYP3A4, 1454 base pairs after the stop codon, C>T] of SEQ ID No. 11, [CYP3A4_IVS3 +1992T>C] of SEQ ID No. 12, [CYP3A4_IVS9 +841T>G] of SEQ ID No. 13, [CYP3A4_IVS12 −473T>G] of SEQ ID No. 14, [CYP3A4_IVS12 +581C>T] of SEQ ID No. 15, [CYP3A4_IVS12 +586G>A] of SEQ ID No. 16, [CYP3A4_IVS12 +646C>A] of SEQ ID No. 17, [CYP3A4_IVS3 −734G>A] of SEQ ID No. 18, [CYP17_IVS1 −271A>C] of SEQ ID No. 19, [CYP17_IVS5 +75C>G] of SEQ ID No. 20, [CYP17_IVS1 +426G>A] of SEQ ID No. 21, [CYP17_IVS1 −99C>T] of SEQ ID No. 22, [CYP17_IVS1 −700C>G] of SEQ ID No. 23, [CYP17_IVS1 −565G>A] of SEQ ID No. 24, [CYP17_IVS3 +141A>T] of SEQ ID No. 25, [CYP17—5′ region −1488C>G] of SEQ ID No. 26, [CYP17—5′ region −1204C>T] of SEQ ID No. 27, [CYP17_IVS1 +466G>A] of SEQ ID No. 28, [CYP17, 712 base pairs after the stop codon, G>A] of SEQ ID No. 29, [SRD5A2, 1356 base pairs after the stop codon (3′ UTR), A>C] of SEQ ID No. 30, [SRD5A2, 849 base pairs after the stop codon (3′ UTR), A>G] of SEQ ID No. 31, [SRD5A2—5′ region −870G>A] of SEQ ID No. 32, [SRD5A2—5′ region −2036(A)7-8] of SEQ ID No. 33, [SRD5A2, 545 base pairs after the stop codon (3′ UTR), T>C] of SEQ ID No. 34, [SRD5A2_IVS2 +626C>T] of SEQ ID No. 35, [SRD5A2—5′ region −8029C>T] of SEQ ID No. 36, [CYP3A4_IVS7 +34T>G] of SEQ ID No. 42, [CYP3A4—5′ region −1232C>T] of SEQ ID No. 43, [SRD5A2—5′ region −3001G>A] of SEQ ID No. 44, and [SRD5A2, 1552 base pairs after the stop codon, G>A] of SEQ ID No. 45.
Optionally, at least one single nucleotide polymorphism is the complement of any of the single nucleotide polymorphisms hereinbefore described.
Suitably, the method counteracts the effect of at least one single nucleotide polymorphism detected.
In a first embodiment of the seventh aspect, the method comprises treatment with a polynucleotide selected from the group consisting of polymorphic sequences SEQ ID NOS 1-36 or SEQ ID NOS 42-45, or their complement, provided that the polymorphic sequence, or the complement, does not contain at least one single nucleotide polymorphism at a position selected from the group consisting of position [CYP3A4_IVS9 +187] of SEQ ID No. 1, position [CYP3A4, 1639 base pairs after the stop codon] of SEQ ID No. 2, position [CYP3A4, 945 base pairs after the stop codon,] of SEQ ID No. 3, position [CYP3A4—5′ region −747] of SEQ ID No. 4, position [CYP3A4_IVS7 −202] of SEQ ID No. 5, position [CYP3A4, 2204 base pairs after the stop codon,] of SEQ ID No. 6, position [CYP3A4_IVS2 −132] of SEQ ID No. 7, position [CYP3A4_IVS1 −868] of SEQ ID No. 8, position [CYP3A4—5′ region −847] of SEQ ID No. 9, position [CYP3A4, 766 base pairs after the stop codon] of SEQ ID No. 10, position [CYP3A4, 1454 base pairs after the stop codon] of SEQ ID No. 11, position [CYP3A4_IVS3 +1992] of SEQ ID No. 12, position [CYP3A4_IVS9 +841] of SEQ ID No. 13, position [CYP3A4_IVS12 −473] of SEQ ID No. 14, position [CYP3A4_IVS12 +581] of SEQ ID No. 15, position [CYP3A4_IVS12 +586] of SEQ ID No. 16, position [CYP3A4_IVS12 +646] of SEQ ID No. 17, position [CYP3A4_IVS3 −734] of SEQ ID No. 18, position [CYP17_IVS1 −271] of SEQ ID No. 19, position [CYP17_IVS5 +75] of SEQ ID No. 20, position [CYP17_IVS1 +426] of SEQ ID No. 21, position [CYP17_IVS1 −99] of SEQ ID No. 22, position [CYP17_IVS1 −700] of SEQ ID No. 23, position [CYP17_IVS1 −565] of SEQ ID No. 24, position [CYP17_IVS3 +141] of SEQ ID No. 25, position [CYP17—5′ region −1488] of SEQ ID No. 26, position [CYP17—5′ region −1204] of SEQ ID No. 27, position [CYP17_IVS1 +466] of SEQ ID No. 28, position [CYP17, 712 base pairs after the stop codon] of SEQ ID No. 29, position [SRD5A2, 1356 base pairs after the stop codon (3′ UTR)] of SEQ ID No. 30, position [SRD5A2, 849 base pairs after the stop codon (3′ UTR)] of SEQ ID No. 31, position [SRD5A2—5′ region −870] of SEQ ID No. 32, position [SRD5A2—5′ region between −2036 and −2030] of SEQ ID No. 33, position [SRD5A2, 545 base pairs after the stop codon (3′ UTR)] of SEQ ID No. 34position [SRD5A2_IVS2 +626] of SEQ ID No. 35, position [SRD5A2—5′ region −8029] of SEQ ID No. 36, position [CYP3A4_IVS7 +34] of SEQ ID No. 42, position [CYP3A4—5′ region −1232] of SEQ ID No. 43, position [SRD5A2—5′ region −3001] of SEQ ID No. 44, and position [SRD5A2, 1552 base pairs after the stop codon] of SEQ ID No. 45.
Preferably, the polymorphic sequence does not contain at least one single nucleotide polymorphism selected from the group consisting of [CYP3A4_IVS9 +187C>G] of SEQ ID No. 1, [CYP3A4, 1639 base pairs after the stop codon, A>T] of SEQ ID No. 2, [CYP3A4, 945 base pairs after the stop codon, A>T] of SEQ ID No. 3, [CYP3A4—5′ region −747C>G] of SEQ ID No. 4, [CYP3A4_IVS7 −202C>T] of SEQ ID No. 5, [CYP3A4, 2204 base pairs after the stop codon, G>C] of SEQ ID No. 6, [CYP3A4_IVS2 −132C>T] of SEQ ID No. 7, [CYP3A4_IVS1 −868C>T] of SEQ ID No. 8, [CYP3A4—5′ region −847A>T] of SEQ ID No. 9, [CYP3A4, 766 base pairs after the stop codon, delT] of SEQ ID No. 10, [CYP3A4, 1454 base pairs after the stop codon, C>T] of SEQ ID No. 11, [CYP3A4_IVS3 +1992T>C] of SEQ ID No. 12, [CYP3A4_IVS9 +841T>G] of SEQ ID No. 13, [CYP3A4_IVS12 −473T>G] of SEQ ID No. 14, [CYP3A4_IVS12 +581C>T] of SEQ ID No. 15, [CYP3A4_IVS12 +586G>A] of SEQ ID No. 16, [CYP3A4_IVS12 +646C>A] of SEQ ID No. 17, [CYP3A4_IVS3 −734G>A] of SEQ ID No. 18, [CYP17_IVS1 −271A>C] of SEQ ID No. 19, [CYP17_IVS5 +75C>G] of SEQ ID No. 20, [CYP17_IVS1 +426G>A] of SEQ ID No. 21, [CYP17_IVS1 −99C>T] of SEQ ID No. 22, [CYP17_IVS1 −700C>G] of SEQ ID No. 23, [CYP17_IVS1 −565G>A] of SEQ ID No. 24, [CYP17_IVS3 +141A>T] of SEQ ID No. 25, [CYP17—5′ region −1488C>G] of SEQ ID No. 26, [CYP17—5′ region −1204C>T] of SEQ ID No. 27, [CYP17_IVS1 +466G>A] of SEQ ID No. 28, [CYP17, 712 base pairs after the stop codon, G>A] of SEQ ID No. 29, [SRD5A2, 1356 base pairs after the stop codon (3′ UTR), A>C] of SEQ ID No. 30, [SRD5A2, 849 base pairs after the stop codon (3′ UTR), A>G] of SEQ ID No. 31, [SRD5A2—5′ region −870G>A] of SEQ ID No. 32, [SRD5A2—5′ region −2036(A)7-8] of SEQ ID No. 33, [SRD5A2, 545 base pairs after the stop codon (3′ UTR), T>C] of SEQ ID No. 34, [SRD5A2_IVS2 +626C>T] of SEQ ID No. 35, [SRD5A2—5′ region −8029C>T] of SEQ ID No. 36, [CYP3A4_IVS7 +34T>G] of SEQ ID No. 42, [CYP3A4—5′ region −1232C>T] of SEQ ID No. 43, [SRD5A2—5′ region −3001G>A] of SEQ ID No. 44, and [SRD5A2, 1552 base pairs after the stop codon, G>A] of SEQ ID No. 45.
Preferably, the polymorphic sequence does not contain at least one single nucleotide polymorphism which is the complement of any of the single nucleotide polymorphisms hereinbefore described.
In a second embodiment of the seventh aspect, the method comprises treatment with a polypeptide which is encoded by a polynucleotide selected from the group consisting of polymorphic sequences SEQ ID NOS 1-36 and SEQ ID NOS 42-45 or their complement, provided that the polymorphic sequence, or the complement, does not contain at least one single nucleotide polymorphism at a position selected from the group consisting of position [CYP3A4_IVS9 +187] of SEQ ID No. 1, position [CYP3A4, 1639 base pairs after the stop codon] of SEQ ID No. 2, position [CYP3A4, 945 base pairs after the stop codon] of SEQ ID No. 3, position [CYP3A4—5′ region −747] of SEQ ID No. 4, position [CYP3A4_IVS7 −202] of SEQ ID No. 5, position [CYP3A4, 2204 base pairs after the stop codon] of SEQ ID No. 6, position [CYP3A4_IVS2 −132] of SEQ ID No. 7, position [CYP3A4_IVS1 −868] of SEQ ID No. 8, position [CYP3A4—5′ region −847] of SEQ ID No. 9, position [CYP3A4, 766 base pairs after the stop codon] of SEQ ID No. 10, position [CYP3A4, 1454 base pairs after the stop codon] of SEQ ID No. 11, position [CYP3A4_IVS3 +1992] of SEQ ID No. 12, position [CYP3A4_IVS9 +841] of SEQ ID No. 13, position [CYP3A4_IVS12 −473] of SEQ ID No. 14, position [CYP3A4_IVS12 +581] of SEQ ID No. 15, position [CYP3A4_IVS12 +586] of SEQ ID No. 16, position [CYP3A4_IVS12 +646] of SEQ ID No. 17, position [CYP3A4_IVS3 −734] of SEQ ID No. 18, position [CYP17_IVS1 −271] of SEQ ID No. 19, position [CYP17_IVS5 +75] of SEQ ID No. 20, position [CYP17_IVS1 +426] of SEQ ID No. 21, position [CYP17_IVS1 −99] of SEQ ID No. 22, position [CYP17_IVS1 −700] of SEQ ID No. 23, position [CYP17_IVS1 −565] of SEQ ID No. 24, position [CYP17_IVS3 +141] of SEQ ID No. 25, position [CYP17—5′ region −1488] of SEQ ID No. 26, position [CYP17—5′ region −1204] of SEQ ID No. 27, position [CYP17_IVS1 +466] of SEQ ID No. 28, position [CYP17, 712 base pairs after the stop codon] of SEQ ID No. 29, position [SRD5A2, 1356 base pairs after the stop codon (3′ UTR)] of SEQ ID No. 30, position [SRD5A2, 849 base pairs after the stop codon (3′ UTR)] of SEQ ID No. 31, position [SRD5A2—5′ region −870] of SEQ ID No. 32, position [SRD5A2—5′ region between −2036 and −2030] of SEQ ID No. 33, position [SRD5A2, 545 base pairs after the stop codon (3′ UTR)] of SEQ ID No. 34, position [SRD5A2_IVS2 +626] of SEQ ID No. 35, position [SRD5A2—5′ region −8029] of SEQ ID No. 36, position [CYP3A4_IVS7 +34] of SEQ ID No. 42, position [CYP3A4—5′ region −1232] of SEQ ID No. 43, position [SRD5A2—5′ region −3001] of SEQ ID No. 44 and position [SRD5A2, 1552 base pairs after the stop codon] of SEQ ID No. 45.
Preferably, the polymorphic sequence does not contain at least one single nucleotide polymorphism selected from the group consisting of [CYP3A4_IVS9 +187C>G] of SEQ ID No. 1, [CYP3A4, 1639 base pairs after the stop codon, A>T] of SEQ ID No. 2, [CYP3A4, 945 base pairs after the stop codon, A>T] of SEQ ID No. 3, [CYP3A4—5′ region −747C>G] of SEQ ID No. 4, [CYP3A4_IVS7 −202C>T] of SEQ ID No. 5, [CYP3A4, 2204 base pairs after the stop codon, G>C] of SEQ ID No. 6, [CYP3A4_IVS2 −132C>T] of SEQ ID No. 7, [CYP3A4_IVS1 −868C>T] of SEQ ID No. 8, [CYP3A4—5′ region −847A>T] of SEQ ID No. 9[CYP3A4, 766 base pairs after the stop codon, delT] of SEQ ID No. 10, [CYP3A4, 1454 base pairs after the stop codon, C>T] of SEQ ID No. 11, [CYP3A4_IVS3 +1992T>C] of SEQ ID No. 12, [CYP3A4_IVS9 +841T>G] of SEQ ID No. 13, [CYP3A4_IVS12 −473T>G] of SEQ ID No. 14, [CYP3A4_IVS12 +581C>T] of SEQ ID No. 15, [CYP3A4_IVS12 +586G>A] of SEQ ID No. 16, [CYP3A4_IVS12 +646C>A] of SEQ ID No. 17, [CYP3A4_IVS3 −734G>A] of SEQ ID No. 18, [CYP17_IVS1 −271A>C] of SEQ ID No. 19, [CYP17_IVS5 +75C>G] of SEQ ID No. 20, [CYP17_IVS1 +426G>A] of SEQ ID No. 21, [CYP17_IVS1 −99C>T] of SEQ ID No. 22, [CYP17_IVS1 −700C>G] of SEQ ID No. 23, [CYP17_IVS1 −565G>A] of SEQ ID No. 24, [CYP17_IVS3 +141A>T] of SEQ ID No. 25, [CYP17—5′ region −1488C>G] of SEQ ID No. 26, [CYP17—5′ region −1204C>T] of SEQ ID No. 27, [CYP17_IVS1 +466G>A] of SEQ ID No. 28, [CYP17, 712 base pairs after the stop codon, G>A] of SEQ ID No. 29, [SRD5A2, 1356 base pairs after the stop codon (3′ UTR), A>C] of SEQ ID No. 30, [SRD5A2, 849 base pairs after the stop codon (3′ UTR), A>G] of SEQ ID No. 31, [SRD5A2—5′ region −870G>A] of SEQ ID No. 32, [SRD5A2—5′ region −2036(A)7-8] of SEQ ID No. 33, [SRD5A2, 545 base pairs after the stop codon (3′ UTR), T>C] of SEQ ID No. 34, [SRD5A2_IVS2 +626C>T] of SEQ ID No. 35, [SRD5A2—5′ region −8029C>T] of SEQ ID No. 36, [CYP3A4_IVS7 +34T>G] of SEQ ID No. 42, [CYP3A4—5′ region −1232C>T] of SEQ ID No. 43, [SRD5A2—5′ region −3001G>A] of SEQ ID No. 44, and [SRD5A2, 1552 base pairs after the stop codon, G>A] of SEQ ID No. 45.
Suitably, the polymorphic sequence does not contain at least one single nucleotide which is the complement of any of the single nucleotide polymorphisms as hereinbefore described.
In a third embodiment of the seventh aspect, the method comprises treatment with an antibody that binds specifically with a polypeptide encoded by a polynucleotide selected from the group consisting of SEQ ID NOS 1-34, or SEQ ID NOS 42-45, or the complement thereof.
According to an eighth aspect of the present invention, there is provided a method for predicting the genetic ability of a subject or an organism to metabolise a chemical, the method comprising analysing a biological sample containing nucleic acid obtained from the subject or organism to detect the presence or absence of one or more single nucleotide polymorphisms at a position selected from the group consisting of position [CYP3A4_IVS9 +187] of SEQ ID No. 1, position [CYP3A4, 1639 base pairs after the stop codon] of SEQ ID No. 2, position [CYP3A4, 945 base pairs after the stop codon] of SEQ ID No. 3, position [CYP3A4—5′ region −747] of SEQ ID No. 4, position [CYP3A4_IVS7 −202] of SEQ ID No. 5, position [CYP3A4, 2204 base pairs after the stop codon] of SEQ ID No. 6, position [CYP3A4_IVS2 −132] of SEQ ID No. 7, position [CYP3A4_IVS1 −868] of SEQ ID No. 8, position [CYP3A4—5′ region −847] of SEQ ID No. 9, position [CYP3A4, 766 base pairs after the stop codon] of SEQ ID No. 10, position [CYP3A4, 1454 base pairs after the stop codon] of SEQ ID No. 11, position [CYP3A4_IVS3 +1992] of SEQ ID No. 12, position [CYP3A4_IVS9 +841] of SEQ ID No. 13, position [CYP3A4_IVS12 −473] of SEQ ID No. 14, position [CYP3A4_IVS12 +581] of SEQ ID No. 15, position [CYP3A4_IVS12 +586] of SEQ ID No. 16, position [CYP3A4_IVS12 +646] of SEQ ID No. 17, position [CYP3A4_IVS3 −734] of SEQ ID No. 18, position [CYP17_IVS1 −271] of SEQ ID No. 19, position [CYP17_IVS5 +75] of SEQ ID No. 20, position [CYP17_IVS1 +426] of SEQ ID No. 21, position [CYP17_IVS1 −99] of SEQ ID No. 22, position is [CYP17_IVS1 −700] of SEQ ID No. 23, position [CYP17_IVS1 −565] of SEQ ID No. 24, position [CYP17_IVS3 +141] of SEQ ID No. 25, position [CYP17—5′ region −1488] of SEQ ID No. 26, position [CYP17—5′ region −1204] of SEQ ID No. 27, position [CYP17_IVS1 +466] of SEQ ID No. 28, position [CYP17, 712 base pairs after the stop codon] of SEQ ID No. 29, position [SRD5A2, 1356 base pairs after the stop codon (3′ UTR)] of SEQ ID No. 30, position [SRD5A2, 849 base pairs after the stop codon (3′ UTR)] of SEQ ID No. 31, position [SRD5A2—5′ region −870] of SEQ ID No. 32, position [SRD5A2—5′ region between −2036 and −2030] of SEQ ID No. 33, position [SRD5A2, 545 base pairs after the stop codon (3′ UTR)] of SEQ ID No. 34, position [SRD5A2_IVS2 +626] of SEQ ID No. 35, position [SRD5A2—5′ region −8029] of SEQ ID No. 36, position [CYP3A4_IVS7 +34] of SEQ ID No. 42, position [CYP3A4—5′ region −1232] of SEQ ID No. 43, position [SRD5A2—5′ region −3001] of SEQ ID No. 44, and position [SRD5A2, 1552 base pairs after the stop codon] of SEQ ID No. 45.
Wherein the presence of a polymorphism at one or more of the positions is indicative of the subject's or organism's ability or inability to metabolise the chemical.
Preferably, the analysis comprises detecting the presence or absence of one or more single nucleotide polymorphisms selected from the group consisting of [CYP3A4_IVS9 +187C>G] of SEQ ID No. 1, [CYP3A4, 1639 base pairs after the stop codon, A>T] of SEQ ID No. 2, [CYP3A4, 945 base pairs after the stop codon, A>T] of SEQ ID No. 3, [CYP3A4—5′ region −747C>G] of SEQ ID No. 4, [CYP3A4_IVS7 −202C>T] of SEQ ID No. 5, [CYP3A4, 2204 base pairs after the stop codon, G>C] of SEQ ID No. 6, [CYP3A4_IVS2 −132C>T] of SEQ ID No. 7, [CYP3A4_IVS1 −868C>T] of SEQ ID No. 8, [CYP3A4—5′ region −847A>T] of SEQ ID No. 9, [CYP3A4, 766 base pairs after the stop codon, delT] of SEQ ID No. 10, [CYP3A4, 1454 base pairs after the stop codon, C>T] of SEQ ID No. 11, [CYP3A4_IVS3 +1992T>C] of SEQ ID No. 12, [CYP3A4_IVS9 +841T>G] of SEQ ID No. 13, [CYP3A4_IVS12 −473T>G] of SEQ ID No. 14, [CYP3A4_IVS12 +581C>T] of SEQ ID No. 15, [CYP3A4_IVS12 +586G>A] of SEQ ID No. 16, [CYP3A4_IVS12 +646C>A] of SEQ ID No. 17, [CYP3A4_IVS3 −734G>A] of SEQ ID No. 18, [CYP17_IVS1 −271A>C] of SEQ ID No. 19, [CYP17_IVS5 +75C>G] of SEQ ID No. 20, [CYP17_IVS1 +426G>A] of SEQ ID No. 21, [CYP17_IVS1 −99C>T] of SEQ ID No. 22, [CYP17_IVS1 −700C>G] of SEQ ID No. 23, [CYP17_IVS1 −565G>A] of SEQ ID No. 24, [CYP17_IVS3 +141A>T] of SEQ ID No. 25, [CYP17—5′ region −1488C>G] of SEQ ID No. 26, [CYP17—5′ region −1204C>T] of SEQ ID No. 27, [CYP17_IVS1 +466G>A] of SEQ ID No. 28, [CYP17, 712 base pairs after the stop codon, G>A] of SEQ ID No. 29, [SRD5A2 1356 base pairs after the stop codon (3′ UTR), A>C] of SEQ ID No. 30, [SRD5A2, 849 base pairs after the stop codon (3′ UTR), A>G] of SEQ ID No. 31, [SRD5A2—5′ region −870G>A] of SEQ ID No. 32, [SRD5A2—5′ region −2036(A)7-8] of SEQ ID No. 33, [SRD5A2, 545 base pairs after the stop codon (3′ UTR), T>C] of SEQ ID No. 34, [SRD5A2_IVS2 +626C>T] of SEQ ID No. 35, [SRD5A2—5′ region −8029C>T] of SEQ ID No. 36, [CYP3A4_IVS7 +34T>G] of SEQ ID No. 42, [CYP3A4—5′ region −1232C>T] of SEQ ID No. 43, [SRD5A2—5′ region −3001G>A] of SEQ ID No. 44, and [SRD5A2, 1552 base pairs after the stop codon, G>A] of SEQ ID No. 45.
Preferably, the method further comprises predicting the response of the subject or the organism to the chemical by their ability or inability to metabolise the chemical.
Suitably, the chemical is a drug or a xenobiotic.
Suitably, the organism is selected from the group consisting of bacterium, fungus, protozoa, alga, insect, nematode, amphibian, plant, fish and mammal.
In a ninth aspect of the present invention, there is provided a vector comprising a polynucleotide selected from the group consisting of a nucleotide sequence comprising one or more polymorphic sequences of SEQ ID NOS 1-34 or SEQ ID NOS 42-45.
In a tenth aspect of the present invention, there is provided a host cell transformed with the vector hereinbefore described.
Preferably, the host cell is selected from the group consisting of, bacterium, fungus, protozoa, alga, insect, nematode, amphibian, plant, fish and mammal. More preferably the mammalian cell is a human cell.
In an eleventh aspect of the present invention, there is provided a method of metabolising a chemical using the host cell as hereinbefore described.
In a twelfth aspect of the present invention, there is provided a method for making a host cell resistant to a chemical, the method comprising transforming a cell with any of the polynucleotides or with any of the vectors as hereinbefore described.
In a thirteenth aspect of the present invention, there is provided an isolated haplotype selected from the group consisting of CYP3A4_Hap4 and SRD52_Hap3.
Preferably, the isolated CYP3A4_Hap4 haplotype consists of Allele T at [CYP3A4—5′ region −1232C>T], Allele C at [CYP3A4—5′ region −747C>G], Allele G at [CYP3A4—5′ region −392A>G], Allele G at [CYP3A4_IVS7 +34T>G], Allele T at [CYP3A4_IVS7 −202C>T], Allele G at [CYP3A4_stop +766T>G], Allele C at [CYP3A4_stop +1454C>T], Allele T at [CYP3A4_stop +1639A>T] and Allele C at [CYP3A4 stop +2204G>C].
Preferably, the isolated SRD52_Hap3 haplotype consists of Allele C at [SRD5A2—5′ region −8029C>T], Allele G at [SRD5A2—5′ region −3001 G>A], Allele G at [SRD5A2—145G>A], Allele G at [SRD5A2—265G>C], Allele T at [SRD5A2_IVS2 +626C>T], Allele G at [SRD5A2_stop +1552G>A], Allele G at [SRD5A2_stop +3059G>A] and Allele G at [SRD5A2_stop +9301 G>C].
In a fourteenth aspect of the present invention, there is provided a method for diagnosing a genetic susceptibility for a disease, condition or disorder related to prostate or breast cancer in a subject, the method comprising analysing a biological sample obtained from the subject to detect the presence or absence of a haplotype as hereinbefore described.
In a fifteenth aspect of the present invention, there is provided a method of diagnosing a genetic susceptibility for a disease, condition or disorder related to prostate or breast cancer in a subject, the method comprising adding an antibody to a polypeptide present in a sample obtained from the subject, which polypeptide is encoded by a haplotype as hereinbefore described, or the complement thereof, and detecting specific binding of the antibody to the polypeptide.
In a sixteenth aspect of the present invention, there is provided a method of treatment or prophylaxis of a subject comprising the steps of
Preferably, the method comprises treatment with a portion of the isolated CYP3A4_Hap4 haplotype as hereinbefore described wherein the portion of the haplotype does not consist of at least one allele from the group consisting of Allele T at [CYP3A4—5′ region −1232C>T], Allele C at [CYP3A4—5′ region −747C>G], Allele G at [CYP3A4—5′ region −392A>G], Allele G at [CYP3A4_IVS7 +34T>G], Allele T at [CYP3A4_IVS7 −202C>T], Allele G at [CYP3A4_stop +766T>G], Allele C at [CYP3A4_stop +1454C>T], Allele T at [CYP3A4_stop +1639A>T] and Allele C at [CYP3A4_stop +2204G>C].
Optionally, the method comprises treatment with a portion of the the isolated SRD5A2_Hap3 haplotype as hereinbefore described wherein the portion of the haplotype does not comprise of at least one allele from the group consisting of Allele C at [SRD5A2—5′ region −8029C>T], Allele G at [SRD5A2—5′ region −3001G>A], Allele G at [SRD5A2—145G>A], Allele G at [SRD5A2—265G>C], Allele T at [SRD5A2_IVS2 +626C>T], Allele G at [SRD5A2_stop +1552G>A], Allele G at [SRD5A2_stop +3059G>A] and Allele G at [SRD5A2_stop +9301 G>C].
Approach
A two-phase study was undertaken of CYP17, CYP3A4, and SRD5A2, to evaluate the relationship between their genotypes/haplotypes and prostate cancer. Phase I of the study first searched for single nucleotide polymorphisms (SNPs) in these genes by re-sequencing 24 individuals from Coriell Polymorphism Discovery Resource (Coriell Cell Repositories, Camden, N.J.), approximately 100 men from prostate cancer case-control sibships, and by leveraging public databases. Eighty-seven SNPs were discovered and genotyped in 276 men from case-control sibships. Those SNPs exhibiting preliminary case-control allele frequency differences, or distinguishing (i.e., ‘tagging’) common haplotypes across the genes, were identified for further study (24 SNPs total). In Phase II of the study, the 24 SNPs were genotyped in an additional 841 men from case-control sibships. Finally, associations between genotypes/haplotypes in CYP17, CYP3A4, and SRD5A2 and prostate cancer were evaluated in the total case-control sample of 1,117 brothers.
Subjects
A family-based association study population of 1,117 men (637 cases, 480 controls) was recruited between January 1998 and January 2001 from the major medical institutions in the greater Cleveland area and from the Henry Ford Health System in Detroit. The study was approved by the collaborating institution's Review Boards, and informed consent was obtained from all participating men. Characteristics of the study population have been described (Casey et al. (2002) Nat Genet 32, 581-583).
Men diagnosed with histologically confirmed prostate cancer at age 73 or younger were invited to join the study if they had a living unaffected brother who was either older than the proband, or at most eight years younger than the age at diagnosis of the proband. This age restriction was selected in an attempt to increase the potential for genetic factors affecting disease, and to help make certain that the controls were not unaffected due simply to being of a younger age. To help confirm that the controls were not diseased, the prostate specific antigen (PSA) levels in their blood was tested. Individuals in the study with PSA levels above 4 ng/ml were retained as ‘controls’ unless a subsequent diagnosis of prostate cancer was made, at which time they were reclassified as cases. Keeping them in the study was important because automatically excluding men with elevated PSA levels regardless of their ultimate prostate cancer status can lead to biased estimates of association (Lubin & Hartge (1984) Am J Epidemiol 120, 791-793; Poole (1999) Am J Epidemiol 150, 547-551). Information on the cases' Gleason score (a measure of prostate cancer cellular differentiation) and tumor stage (TNM, tumor-node-metastasis stage) was determined from their medical records. The study population was comprised of 90% Caucasians (European Americans), and the remainder primarily African American (9%).
Polymorphism Discovery
Polymorphisms were discovered by sequencing individuals from prostate cancer sibships (67 cases and 43 controls for CYP17 and CYP3A4, and 51 cases and 41 controls for SRD5A2). Of the 110 individuals sequenced for CYP17 and CYP3A4, 106 were Caucasian, 2 were Hispanic, and 2 were African-American. Of the 92 individuals sequenced for SRD5A2, 84 were Caucasian and 8 were African American. In addition, the 24 individuals from the Coriell Cell Repository Polymorphism Discovery Resource (Collins et al. (1998) Genome Res 8, 1229-1231) were sequenced against the three genes.
PCR primers covering coding regions, splice sites, 5′ and 3′ regions, and parts of introns of CYP3A4 (reference sequence No. 39), CYP17 (reference sequence No. 40), and SRD5A2 (reference sequence No. 41), were designed using the Primer3 program (http://www.genome.wi.mit.edu/cgi-bin/primer/primer3.cgi). PCR products were sequenced using energy transfer dye terminators on the Amersham Bioscience's MegaBACE1000 (Amersham Biosciences, Sunnyvale, Calif.) using standard protocols. Sequence analysis was performed by assigning quality values (Phred; University of Washington, Seattle, Wash.), assembling contigs (Phrap; University of Washington), automated identification of candidate heterozygote SNPs (PolyPhred, University of Washington), automated identification of candidate homozygote SNPs (High is Quality Mismatch, Amersham Biosciences, Sunnyvale, Calif.) and by operator confirmation (Consed, University of Washington). All polymorphisms were confirmed by Single Nucleotide Primer Extension (SNuPE) assay (Amersham Biosciences, Sunnyvale, Calif.)
In addition to novel polymorphisms discovered in this study, several publicly available SNPs from the dbSNP (http://www.ncbi.nlm.nih.gov/SNP/), Utah Genome Center (UGC) (http://www.genome.utah.edu/genesnps/genes/), the Human Cytochrome P450 Allele Nomenclature Committee (HCANC) (http://www.imm.ki.se/CYPalleles/), the Human Gene Mutation Database (HGMD) (http://archive.uwcm.ac.uk/uwcm/mg/hgmd0.html) and the Human Genic Bi-Allelic SEquences (HGBASE) Release 8 (http://hgbase.interactiva.de/) were searched for CYP17, CYP3A4, and SRD5A2. For the Androgen Receptor gene, several publicly available SNPs from dbSNP, HGBASE and the Androgen Receptor Mutation Database (ARMD) (http://ww2.mcgill.ca/androgendb/) were included.
Genotyping
In Phase I, 276 individuals from prostate cancer sibships were genotyped for 29 SNPs (11 novel, 18 known) in CYP17, 33 SNPs (18 novel, 15 known) in CYP3A4, and 25 SNPs (5 novel, 20 known) in SRD5A2. The individuals included 153 cases and 123 brother controls, 70% European Americans and 30% African Americans. The information from the 276 men was then used to determine initial case-control frequency differences and haplotype tagging. The results were then used to determine which SNPs should be genotyped in the remainder of the study population (i.e. in Phase II of the study).
In Phase II, a total of 24 SNPs were genotyped in 841 individuals, giving information on a total of 1117 individuals for Phase II.
Genotyping was performed utilizing the Single Nucleotide Primer Extension (SNuPE) assay on the MegaBACE1000 (Amersham Biosciences, Sunnyvale Calif.) capillary electrophoresis platform (Amersham Biosciences). The Primer3 program (http://www.genome.wi.mit.edu/cgi-bin/primer/primer3.cgi) was used to design PCR primers to amplify regions containing the SNPs of interest. PCR fragments were purified with 0.5 U of Shrimp Alkaline Phosphatase (Amersham Biosciences) and 10 U of Exonuclease I (Amersham Biosciences) by incubating at 37° C. for 40 min and at 85° C. for 15 min. The single base extension (SBE) reaction was set with 1 pmol of HPLC purified SBE primer, 2-4 μl of SNuPe Premix (Amersham Biosciences), 2-4 μl of sterile water, and 1 μof purified PCR fragment, and incubated at 25 cycles of 96° C. for 10 sec, 50° C. for 5 sec, and 60° C. for 10 sec. For phase I of the study, SNuPe reactions were set in 96-well plates at 10 μl volume and purified with AutoSeq™96 Plates (Amersham Biosciences) prior to injecting into the MegaBACE1000 system. For phase II of the study, SNuPe reactions were set in 384-well plates at 5-6 μl volume, diluted with 3-4 μl of sterile water and purified with 1 U of Shrimp Alkaline Phosphatase (Amersham Biosciences) by incubating at 37° C. for 45 min and at 85° C. for 15 min prior to injecting into the MegaBACE4000 system. In cases where low signal was anticipated (due to faint PCR), SNuPe reactions were desalted using a custom 384-well filter plate incorporating modified size-exclusion technology (Millipore Corporation, Billerica, Mass.). The Scierra Genotyping LWS™ (Amersham Biosciences) system was utilized for the tracking and management of samples and laboratory activity for Phase II of the study.
Specific software (SNPriDe) was developed for the automated design of SNuPE primers. Using a purified PCR fragment containing the SNP of interest as a template, a third, internal primer was designed so that the 3' end anneals adjacent to the polymorphic base-pair, and during the SNUPE reaction a fluorescently labeled dideoxynucleotide (terminator) was added onto the primer. A separate software package has been developed (SNP Profiler™, Amersham Biociences) that automatically processes the signal data and outputs the maximum likelihood SNP genotypes. The system includes a user interface for editing and verification.
Three SNPs, SRD5A2_SNP20 (V89L), SRD5A2_SNP22 (A49T) and CYP17-_SNP29(−34>C) were analysed by restriction enzyme digestion (Cicek et al., unpublished data).
Proofreading Genotype Data
A large number of haplotypes inferred during initial rounds of haplotyping implied erroneous genotype data. A phylogenetic study of inferred haplotypes was performed to reveal the relationships between different haplotypes. All haplotypes differing from another haplotype by only one SNP, and being represented by only one individual, were subject to inspection. Genotype data for the individual at stake were reanalysed by SNP Profiler™ (Amersham Biosciences) to exclude the possibility of an incorrect genotype. Rounds of phylogenetic study of haplotypes, followed by reanalysing suspicious genotypes and inferring new haplotypes were applied until no more incorrect genotypes could be found. Three to six rounds were applied for each of the genes.
Haplotyping
Alleles within each of the three candidate genes were in strong linkage disequilibrium with one another. Thus, for each gene, haplotypes were estimated using the resulting genotypes, by disease status and within major ethnic groups using the software PHASE. This program uses Markov chain Monte Carlo to estimate haplotypes, imputes information for missing genotypes, and incorporates a statistical model for the distribution of unresolved haplotypes based on coalescent theory (Stephens et al. (2001) Am J Hum Genet 68, 978-989).
Haplotypes and haplotype tagging SNPs were first determined among the 276 men genotyped for Phase I of the study, where tagging SNPs was necessary to define the most common haplotypes (e.g., >5%). After completing genotyping on the entire study population (Phase II of the study), the resulting data were used to estimate haplotypes.
Association Analysis
Case versus control allele frequencies were first compared within major ethnic groups. Then the association between the resulting genotypes/haplotypes and prostate cancer risk was evaluated by calculating odds ratios (OR, estimates of relative risk) and 95% confidence intervals from conditional logistic regression with family as the matching variable, using a robust variance estimator that incorporates familial correlations. This is a standard approach for analyzing sibling matched case-control data, although sibling sets without any controls do not contribute any information (197 cases total here) (Breslow and Day (1980) IARC Sci Publ 32, 335-338). In the analyses of CYP17, CYP3A4, and SRD5A2 a log-additive coding was used which treats the most common polymorphism (or haplotype) as the null-risk referent group and assumes that the relative risk of carrying one polymorphism (or haplotype) is the square-root of the risk of carrying two. Since haplotypes were estimated for these three genes, the probabilities of observed haplotypes were used in the analyses (Schaid et al. (2002) Am J Hum Genet 70, 425A434).
To control for potential confounding, age was adjusted for in all regression models. In addition to looking at the main effects of each SNP or haplotype, the analyses were also stratified by the case's disease aggressiveness, where high aggressiveness was defined by TNM stage≧T2B or Gleason score≧7; and low aggressiveness by TNM stage<T2B and Gleason score<7. All statistical analyses were undertaken with the S+software (version 6.0, Insightful Corp, 2001).
Polymorphism Discovery (Phase I)
A total of 34 novel SNPs were detected: 11 in CYP17, 18 in CYP3A4, and 5 in SRD5A2 (Table 2). In addition, 11 SNPs were “rediscovered” from the public databases. Including these 11 SNPs, 53 SNPs were selected in total from the databases: 18 in CYP17, 15 in CYP3A4, and 20 in SRD5A2. These were chosen based on the intention to obtain an even distribution of SNPs across the genes and the availability in the databases at that time (January-April 2001). Twenty-one SNPs were chosen from dbSNP, 27 from GeneSNPs, 12 from HGMD, 8 from HGVbase, and 2 from HCANC (the total number of SNPs listed here exceeds 53 as several SNPs were present in multiple databases). Table 3 lists all 87 SNPs (34 novel, 53 from databases), with their origins, exact locations and allele frequencies.
Among the 34 novel SNPs, 26 (76%) were discovered in both the Coriell and case-control populations. Three SNPs were only observed in the Coriell data, and the remaining five were found only in the prostate cancer sibships. Of these five, three were relatively rare (allele frequencies 0.2-1.5%), suggesting that they may not have been discovered in the Coriell population simply due to its small sample size (n=24). Nevertheless, the other two SNPs that were only found in the prostate cancer sibships (CYP3A4_SNP12 and CYP17_SNP42) showed higher allele frequencies (7.5% and 21.8%, respectively), suggesting that they might be specific to the prostate cancer case-control population.
Genotypying and Haplotyping
Phase I
The 87 SNPs were geneotyped in a total of 276 males from prostate cancer sibships (29 in CYP17, 33 in CYP3A4, and 25 in SRD5A2). Eleven SNPs gave ambiguous genotyping results. This might have been due to unoptimized genotyping reactions or primer self-priming due to secondary structures and unspecificity of PCR and/or SNuPe primers, especially within the Cytochrome P450 gene family. Of the remaining 76 SNPs, a similar percentage of those novel (41%, or 12/29) and known (38%, or 18/47) had allele frequencies>10%. However, 19/47 (40%) of the known SNPs were found to be monoallelic in the 276 men, suggesting that they are either extremely rare, population specific, or artifacts.
In light of these results, the 11 SNPs with ambiguous genotype results, the 19 SNPs that appeared monoallelic in all samples tested, and an additional four that were seen only in the Coriell Diversity Set but not in the prostate cancer sibships were excluded. Also excluded was one SNP because >15% of data was missing (due to a low success rate for PCR and SNuPe reaction). Finally, 12 SNPs were excluded because their minor allele frequencies were less than 5% in all of the following four subgroups: European Americans, African Americans, cases, and controls (Table 3). Following these exclusions, a total of 40 SNPs remained for consideration in the Phase II association study (14 in CYP17, 16 in CYP3A4, and 10 in SRD5A2) (Table 3).
Using the preliminary genotype information, haplotypes estimated with a frequency ≧5% in at least one of the four major subgroups (i.e., European American, African American, cases, or controls) were identified. Each gene had a single “common” haplotype, with a frequency ranging between 42 and 51 percent (not shown). Haplotype tagging SNPs were identified and used as a basis for inclusion in Phase II of the study. In addition, non-tagging SNPs exhibiting suggestive case versus control allele frequencies were considered (Table 3). Altogether 24 SNPs were selected for Phase II.
Phase II
The 24 tagging and suggestive SNPs were genotyped in an additional 841 men, giving information on a total of 1117 individuals for Phase II. Case versus control allele frequency differences by ethnic group are presented in Table 3. Haplotypes estimated with a frequency ≧3% in at least one of the four major subgroups of the study population were identified. The major haplotypes for CYP17, CYP3A4, and SRD5A2 along with their frequencies are presented in
Association Analyses
In the association analyses, no associations between CYP17 genotypes/haplotypes and prostate cancer were detected. When looking at CYP3A4, SNP1 was found to be associated with an approximately 50% reduction in risk (OR=0.53, 95% CI=0.29-0.99; p-value=0.05) (Table 4A). Furthermore, the haplotype analysis revealed an association with an approximately 55% decrease in prostate cancer risk and CYP3A4_Hap4 (OR=0.46, 95% CI=0.21-1.02; p-value=0.05) (Table 5A). Two SNPs in SRD5A2 were also found to be associated with an approximately 50% increase in prostate cancer risk: SRD5A2_SNP26 (OR=1.57, 95% CI=1.08-2.30; p-value=0.02), and SRD5A2_SNP20 (V89L) (OR=1.56, 95% CI=1.08-2.25; p-value=0.02) (Table 4A). These SNPs, however, 5 were in almost complete linkage disequilibrium.
When the study population was stratified by high and low aggressiveness of prostate cancer, several interesting associations emerged (see Table 4B and 5B). First, five SNPs in CYP3A4 showed statistically significant associations with low aggressiveness: CYP3A4_SNP11 (CYP3A4*1B) (OR=0.20, 95% CI=0.06-0.67; p-value=0.009), CYP3A4_SNP47 (OR=0.19, 95% CI=0.06-0.62; p-value=0.006), CYP3A4_SNP1 (OR=0.21, 95% CI=0.05-0.86; p-value=0.03), CYP3A4_SNP25 (OR=6.54, 95% CI=0.99-43.10; p-value=0.05) and CYP3A4_SNP15 (OR=0.41, 95% CI=0.22-0.79; p-value=0.007). Second, an association was observed between CYP3A4_Hap4 and low aggressiveness (OR=0.06, 95% CI=0.008-0.50; p-value=0.009) (Table 5B). Finally, an inverse association was observed between SRD5A2_Hap3 and high aggressiveness (OR=0.52, 95% CI=0.29-0.91; p-value=0.02) (Table 5B).
Table 6 provides annotation of CYP3A4, CYP17 and SRD5A2 genomic sequences.
All of the SNPs disclosed in the present invention have utility in the prognosis and diagnosis of prostate and breast cancer.
Although this invention has been described in terms of certain preferred embodiments, other embodiments which will be apparent to those of ordinary skill in the art in view of the disclosure herein are also within the scope of this invention. Accordingly, the scope of the invention is intended to be defined only by reference to the appended claims. All documents cited herein are incorporated herein by reference in their entirety.
#SNP was discovered in the Coriell Diversity Set and was not present in the 276 individuals from prostate cancer sibships (still obviously a real SNP since it's seen in the Diversity Set)
@ambiguous genotyping results; SNP was excluded from all further analyses. However, most likely real SNPs
The numbering system for the location of SNPs is according to the common mutation nomenclature (den Dunnen and Antonarakis (2000) Human Mut 15, 7-12; http://www.dmd.nl/mutnomen.html#DNA).
c
d
aExplanations: (*), SNP did not show up in our study population; (R), rediscovered; (+), we had sequence coverage but did not rediscover the SNP; (+<), we had sequence coverage but did not rediscover the SNP, most likely due to the low minor allele frequency; (−), we did not have sequence coverage explaining why we did not rediscover the SNP; (CDS), novel SNP discovered originally in the
bUnderlined bases indicate the allele for which frequencies are given
cExcluded from haplotyping in Phase I and from consideration for Phase II based on (A) being monoallelic in the prostate cancer sibships, (B) yielding ambiguous genotyping results, (C) low success rate, (D) allele frequency <5%. Included in Phase II association analyses based on (1) being a haplotype tagging SNP, (2) case-control difference in Phase I, (3) previous publications supporting association, (4) SNP conveniently
dI, allele frequencies based on 276 samples; II, allele frequencies based on 1117 samples
eNA, data not available
aFrom conditional logistic regression, with matching on family, and a variance estimator that incorporates sibling correlations.
bAll results are from dominant models that compare homozygous and heterozygous carriers of variant versus the homozygous wildtype (OR = 1.0).
cNA, data not available
aFrom conditional logistic regression, with matching on family, and a variance estimator that incorporates sibling correlation.
bNA, data not available
aFrom conditional logistic regression, with matching on family, and a variance estimator that incorporates sibling correlation.
bNA, data not available
aFrom conditional logistic regression, with matching on family, and a variance estimator that incorporates sibling correlation.
bNA, data not available
This application claims priority to U.S. provisional patent application Nos. 60/413,583 filed Sep. 25, 2002, and 60/491,842 filed Aug. 1, 2003; the disclosures of which are incorporated herein by reference in their entirety.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US03/30359 | 9/25/2003 | WO | 3/24/2005 |
Number | Date | Country | |
---|---|---|---|
60413583 | Sep 2002 | US | |
60491842 | Aug 2003 | US |