Method of diagnosing type II diabetes mellitus using multilocus marker, polynucleotide including marker associated with type II diabetes mellitus, and microarray and diagnostic kit including the polynucleotide

Information

  • Patent Application
  • 20070048751
  • Publication Number
    20070048751
  • Date Filed
    February 10, 2006
    19 years ago
  • Date Published
    March 01, 2007
    17 years ago
Abstract
Provided are a polynucleotide including a marker associated with type II diabetes mellitus and a method of diagnosing type II diabetes mellitus in an individual, which includes determining a nucleotide of a polymorphic site of at least one polynucleotide of Table 1 in the specification.
Description
TECHNICAL FIELD

The present invention relates to a method of diagnosing type II diabetes mellitus using a multilocus marker, a polynucleotide including a marker associated with type II diabetes mellitus, and a microarray and a diagnostic kit including the polynucleotide.


BACKGROUND ART

The genomes of all organisms undergo spontaneous mutation in the course of their continuing evolution, generating variant forms of progenitor nucleic acid sequences (Gusella, Ann. Rev. Biochem. 55, 831-854 (1986)). The variant forms may confer an evolutionary advantage or disadvantage, relative to a progenitor form, or may be neutral. In some instances, a variant form confers a lethal disadvantage and is not transmitted to subsequent generations of the organism. In other instances, a variant form confers an evolutionary advantage to the species and is eventually incorporated into the DNA of most members of the species and effectively becomes the progenitor form. In many instances, both progenitor and variant form(s) survive and co-exist in a species population. The coexistence of multiple forms of a sequence gives rise to polymorphisms.


Several different types of polymorphisms are known, including restriction fragment length polymorphisms (RFLPs), short tandem repeats (STRs), and single-nucleotide polymorphisms (SNPs). Among them, SNPs take the form of single-nucleotide variations between individuals of the same species. When SNPs occur in protein coding sequences, any one of the polymorphic forms may give rise to the expression of a defective or a variant protein. On the other hand, when SNPs occur in non-coding sequences, some of these polymorphisms may result in the expression of defective or variant proteins (e.g., as a result of defective splicing). Other SNPs have no phenotypic effects.


It is known that human SNPs occur at a frequency of 1 in about 300-1,000 bp. When such SNPs induce a phenotypic expression such as a disease, polynucleotides containing the SNPs can be used as primers or probes for diagnosis of a disease. Currently, research into the nucleotide sequences and functions of SNPs is being conducted by many research institutes. The nucleotide sequences and other experimental results of the identified human SNPs have been made into a database to be easily accessible.


Even though findings available to date show that specific SNPs exist on human genomes or cDNAs, phenotypic effects of such SNPs have not been revealed. Functions of most SNPs have not been disclosed yet.


It is known that 90-95% of total diabetes patients suffer type II diabetes mellitus. Type II diabetes mellitus is a disorder which is developed in persons who abnormally produce insulin or have low sensitivity to insulin, thereby resulting in large change in blood glucose level. When disorder of insulin secretion leads to the condition of type II diabetes mellitus, blood glucose cannot be transferred to body cells, which renders the conversion of food into energy difficult. It is known that genetic causes play a role in type II diabetes mellitus. Other risk factors of type II diabetes mellitus include age over 45, familial history of diabetes mellitus, obesity, hypertension, and high cholesterol level. Currently, diabetes mellitus is mainly diagnosed by measuring a pathological phenotypic change, i.e., blood glucose level, using fasting blood glucose (FSB) test, oral glucose tolerance test (OGTT), and the like [National Institute of Diabetes and Digestive and Kidney Diseases of the National Institutes of Health, http://www.niddk.nih.gov, 2003]. When diagnosis of type II diabetes mellitus is made, type II diabetes mellitus can be prevented or its onset can be delayed by exercise, special diet, body weight control, drug therapy, and the like. In this regard, it can be said that type II diabetes mellitus is a disease in which early diagnosis is highly desirable. Millenium Pharmaceuticals Inc. reported that diagnosis and prognosis of type II diabetes mellitus could be made based on genotypic variations present on HNF1 gene [PR newswire, Sep. 1, 1998]. Sequenom Inc. reported that FOXA2 (HNF3β) gene was highly associated with type II diabetes mellitus [PR Newswire, Oct. 28, 2003]. Even though there are reports about some genes associated with type II diabetes mellitus, research into the incidence of type II diabetes mellitus has been focused on specific genes of some chromosomes in specific populations. For this reason, research results may vary according to human species. Furthermore, all causative genes responsible for type II diabetes mellitus have not yet been identified. Diagnosis of type II diabetes mellitus by such a molecular biological technique is now uncommon. In addition, early diagnosis before incidence of type II diabetes mellitus is currently not feasible. Therefore, there is an increasing need to find new SNPs highly associated with type II diabetes mellitus and their related genes that are found in whole human genomes and to make an early diagnosis of type II diabetes mellitus using the SNPs and the related genes.


DISCLOSURE OF THE INVENTION

The present invention provides a method of diagnosing type II diabetes mellitus using a multilocus marker.


The present invention also provides a polynucleotide including a marker associated with type II diabetes mellitus.


The present invention also provides a microarray including a substrate immobilized with the polynucleotide.


The present invention also provides a diagnostic kit for the detection of type II diabetes mellitus including the polynucleotide.


BEST MODE FOR CARRYING OUT THE INVENTION

The present invention provides a method of diagnosing type II diabetes mellitus in an individual, which includes determining a nucleotide of a polymorphic site of at least one polynucleotide selected from polynucleotides identified by NCBI GenBank accession numbers in Table 1 below.

TABLE 1NCBI GenBankAccession No.Polymorphic siters502612position 101 of SEQ ID NO: 1rs1394720position 101 of SEQ ID NO: 2rs488115position 101 of SEQ ID NO: 3rs2051672position 101 of SEQ ID NO: 4rs1038308position 101 of SEQ ID NO: 5rs1943317position 101 of SEQ ID NO: 6rs929476position 101 of SEQ ID NO: 7rs1984388position 101 of SEQ ID NO: 8rs752139position 101 of SEQ ID NO: 9rs2058501position 101 of SEQ ID NO: 10rs1059033position 101 of SEQ ID NO: 11rs492220position 101 of SEQ ID NO: 12rs1461986position 101 of SEQ ID NO: 13rs607209position 101 of SEQ ID NO: 14rs197367position 101 of SEQ ID NO: 15rs1340266position 101 of SEQ ID NO: 16rs1316909position 101 of SEQ ID NO: 17rs1377188position 101 of SEQ ID NO: 18


The polynucleotides of SEQ ID NOS: 1-18 are 201-bp nucleic acid fragments containing nucleotides of polymorphic sites (position 101) of rs502612, rs1394720, rs488115, rs2051672, rs1038308, rs1943317, rs929476, rs1984388, rs752139, rs2058501, rs1059033, rs492220, rs1461986, rs607209, rs197367, rs1340266, rs1316909, and rs1377188, respectively. The nucleotide sequences of SEQ ID NOS: 1-18 and characteristics of single-nucleotide polymorphisms (SNPs) present in the nucleotide sequences are summarized in Tables 2 and 3 below.


The polynucleotides of SEQ ID NOS: 1-18 are polymorphic sequences. A polymorphic sequence refers to a nucleotide sequence containing a polymorphic site at which SNP occurs. A polymorphic site refers to a position of a polymorphic sequence at which SNP occurs. The polynucleotides of SEQ ID NOS: 1-18 may be DNAs or RNAs.


An embodiment of the present invention provides a method of diagnosing type II diabetes mellitus using a multilocus marker including polymorphic sites (position 101) of two or more polymorphic sequences selected from polymorphic sequences of SEQ ID NOS: 1-18 associated with type II diabetes mellitus. The multilocus marker was identified by DNA sequence analysis of blood samples obtained from type II diabetes mellitus patients and normal persons. Characteristics of the polymorphic sequences of SEQ ID NOS: 1-18 are summarized in Tables 2-3.

TABLE 2Allele frequencyNumber of GenotypeSNPSNP sequencecasconcascascasconconconASSAY_IDA1A2(SEQ ID NO.)A2A2DeltaA1A1A1A2A2A2A1A1A1A2A2A2DMX_001CT10.5920.4920.1541361097715172DMX_009TG20.6640.7370.0733113812919119161DMX_011AG30.8660.9310.065766225139258DMX_029CA40.0570.1040.047268283241525DMX_030CT50.0770.1290.052251412221703DMX_032TA60.7180.5930.1252611715751142107DMX_033TC70.8160.90.0841089198451239DMX_044AT80.8460.7870.0597782131593181DMX_056AG90.3620.2730.0891231374016011624DMX_062CT100.4210.5080.087106133597214677DMX_069TC110.440.4980.05896143606616465DMX_104TC120.2740.2040.07158115241849512DMX_116TC130.60.6680.0684115710129139129DMX_152AC140.5620.640.078621369941129123DMX_154AG150.2690.1990.07153131151871009DMX_058AG160.3150.3820.0671381312811114441DMX_101AT170.380.3160.0641181364613813328DMX_131AT180.4410.3760.065971396211813644df = 2ChiOdds ratioHWE statusSample call ratesquareChi_exact(multiple model)concascasconvalue_p-ValueORClHWHWcall_ratecall_rate12.3842.05E−030.67(0.53, 0.838).027, HWE1.195, HWE117.8142.01E−021.42(1.106, 1.82).195, HWE.424, HWE0.99113.6981.06E−032.10(1.414, 3.115).026, HWE.948, HWE0.990.999.1311.04E−021.93(1.247, 2.975)1.514, HWE13.034, HWD10.999.6837.89E−031.79(1.215, 2.64).51, HWE1.004, HWE0.980.98204.54E−050.57(0.449, 0.728).148, HWE.582, HWE1116.7182.34E−042.02(1.434, 2.831)2.023, HWE.005, HWE0.990.986.6873.53E−020.68(0.501, 0.91).452, HWE.013, HWE0.990.9610.5815.04E−030.66(0.52, 0.848).283, HWE.041, HWE119.4688.79E−031.42(1.131, 1.788).034, HWE2.43, HWE0.990.987.1652.78E−021.27(1.007, 1.59)3.708, HWE.364, HWE10.987.8212.00E−020.68(0.519, 0.891).011, HWE.284, HWE0.990.976.5543.77E−021.34(1.059, 1.7).838, HWE2.473, HWE10.997.0342.97E−021.38(1.095, 1.748).774, HWE1.715, HWE0.990.989.0451.09E−020.68(0.515, 0.886).768, HWE3.616, HWE10.995.995.00E−021.34(1.057, 1.708)0.308, HWE0.112, HWE0.990.995.9735.05E−020.75(0.594, 0.957)0.166, HWE0.465, HWE115.147.65E−020.76(0.605, 0.961)0.194, HWE0.946, HWE0.990.99












TABLE 3













SNP




sequence













SNP
(SEQ ID
Chromosome
Chromosome
















ASSAY_ID
rs
A1
A2
NO)
#
position
Band
Gene





DMX_001
rs502612
C
T
1
1
167373461
1q24.2
PRRX1


DMX_009
rs1394720
T
G
2
11
4533242
11p15.4
intergenic


DMX_011
rs488115
A
G
3
11
74409538
11q13.4
intergenic


DMX_029
rs2051672
C
A
4
17
5847149
17p13.2
intergenic


DMX_030
rs1038308
C
T
5
18
44538585
18q21.1
KIAA0427


DMX_032
rs1943317
T
A
6
18
62419479
18q22.1
intergenic


DMX_033
rs929476
T
C
7
19
33499519
19q12
intergenic


DMX_044
rs1984388
A
T
8
22
30658575
22q12.3
intergenic


DMX_056
rs752139
A
G
9
5
175943870
5q35.2
PC-LKC


DMX_062
rs2058501
C
T
10
7
120274187
7q31.31
FLJ21986


DMX_069
rs1059033
T
C
11
9
77736025
9q21.2
GNAQ


DMX_104
rs492220
T
C
12
1
94254590
1p22.1
ABCA4


DMX_116
rs1461986
T
C
13
13
75506683
13q22.2
intergenic


DMX_152
rs607209
A
C
14
4
16808165
4p15.32
intergenic


DMX_154
rs197367
A
G
15
7
36219096
7p14.2
ANLN


DMX_058
rs1340266
A
G
16
6
102381236
6q16.3
GRIK2:










GRIK2


DMX_101
rs1316909
A
T
17
1
156770438
1q23.2


DMX_131
rs1377188
A
T
18
18
29732602
18q12.1
NOL4:










NOL4


















Amino acid



ASSAY_ID
Description
SNP function
change







DMX_001
Paired related homeobox 1
intron
No change



DMX_009

intergenic
No change



DMX_011

vgenic
No change



DMX_029

vgenic
No change



DMX_030
KIAA0427
coding-synon
No change



DMX_032

vgenic
No change



DMX_033

intergenic
No change



DMX_044

intergenic
No change



DMX_056
protocadherin LKC
intron
No change



DMX_062
hypothetical protein
intron
No change



DMX_069
guanine nucleotide binding protein
intron
No change




(G protein), q polypeptide



DMX_104
ATP45; binding cassette, sub45;
intron
No change




family A (ABC1), member 4



DMX_116

intergenic
No change



DMX_152

intergenic
No change



DMX_154
aniline, actin binding protein
coding-nonsynon
K→R




(scraps homolog, Drosophila




glutamate receptor)



DMX_058
ionotropic, kainate 2
intron
No change



DMX_101




DMX_131
nucleolar protein 4
intron
No change










In Tables 2 and 3, the contents in columns are as defined below.

    • Assay_ID represents a marker name.
    • SNP is a polymorphic base of a SNP polymorphic site. Here, A1 and A2 represent respectively a low mass allele and a high mass allele as a result of sequence analysis according to a homogeneous MassExtension (hME) technique (Sequenom) and are optionally designated for convenience of experiments.
    • SNP sequence represents a sequence containing a SNP site, i.e., a sequence containing allele A1 or A2 at position 101.
    • In the allele frequency column, cas_A2, con_A2, and Delta respectively represent allele A2 frequency of a case group, allele A2 frequency of a normal group, and the absolute value of the difference between cas_A2 and con_A2. Here, cas_A2 is (genotype A2A2 frequency×2+genotype A1A2 frequency)/(the number of samples×2) in the case group and con_A2 is (genotype A2A2 frequency×2+genotype A1A2 frequency)/(the number of samples×2) in the normal group.
    • Genotype frequency represents the frequency of each genotype. Here, cas_A1A1, cas_A1A2, and cas_A2A2 are the number of persons with genotypes A1A1, A1A2, and A2A2, respectively, in the case group, and con_A1A1, con_A1A2, and con_A2A2 are the number of persons with genotypes A1A1, A1A2, and A2A2, respectively, in the normal group.
    • df=2 represents a chi-squared value with two degree of freedom. Chi-value represents a chi-squared value and p-value is determined based on the chi-value. Chi_exact_p-value represents p-value of Fisher's exact test of chi-square test. When the number of genotypes is less than 5, results of the chi-square test may be inaccurate. In this respect, determination of more accurate statistical significance (p-value) using the Fisher's exact test is required. The chi_exact p-value is a variable used in the Fisher's exact test. In the present invention, when the p-value≦0.05, it is considered that the genotype of the case group is different from that of the normal group, i.e., there is a significant difference between the case group and the normal group.
    • Odds ratio represents the ratio of the probability of allele A1 in the case group to the probability of allele A1 in the normal group. In the present invention, the Mantel-Haenszel odds ratio method was used. CI represents a 95% confidence interval for the odds ratio and is represented by (lower limit of the confidence interval, upper limit of the confidence interval). When 1 falls under the confidence interval, it is considered that there is insignificant association of allele A1 with disease.
    • HWE represents Hardy-Weinberg Equilibrium. Here, con_HWE and cas_HWE represent degree of deviation from the Hardy-Weinberg Equilibrium in the normal group and the case group, respectively. Based on chi_value=6.63 (p-value=0.01, df=1) in a chi-square (df=1) test, a value larger than 6.63 was regarded as Hardy-Weinberg Disequilibrium (HWD) and a value smaller than 6.63 was regarded as Hardy-Weinberg Equilibrium (HWE).
    • Sample call rate represents the number of genotype-interpretable samples to the total number of samples used in experiments. Here, cas_call_rate and con_call_rate represent the ratio of the number of genotype-interpretable samples to the total number (300 persons) of samples used in the case group and the normal group, respectively.


Tables 2 and 3 present characteristics of SNP markers based on the NCBI build 123.


In an embodiment of the method of the present invention, when nucleotides of polymorphic sites of rs502612, rs1394720, rs488115, rs2051672, rs1038308, rs1943317, rs929476, rs1984388, rs752139, rs2058501, rs1059033, rs492220, rs1461986, rs607209, rs197367, rs1340266, rs1316909, and rs1377188 satisfy at least one of multilocus markers (1) through (7) below, it may be determined that the individual has a higher likelihood of being diagnosed as a type II diabetes mellitus patient or as at risk of developing type II diabetes mellitus:


(1) the genotype of a polymorphic site of rs488115 is AA or AG and the genotype of a polymorphic site of rs1984388 is TT;


(2) the genotype of a polymorphic site of rs2051672 is CC, the genotype of a polymorphic site of rs1943317 is AA, and the genotype of a polymorphic site of rs752139 is AG or GG;


(3) the genotype of a polymorphic site of rs1943317 is TA or AA, the genotype of a polymorphic site of rs929476 is TT or TC, and the genotype of a polymorphic site of rs1377188 is AT or TT;


(4) the genotype of a polymorphic site of rs502612 is TT, the genotype of a polymorphic site of rs2051672 is CC, the genotype of a polymorphic site of rs2058501 is CC or CT, and the genotype of a polymorphic site of rs1461986 is TT or TC;


(5) the genotype of a polymorphic site of rs1394720 is TT or TG, the genotype of a polymorphic site of rs1316909 is AT or TT, and the genotype of a polymorphic site of rs607209 is AG or GG;


(6) the genotype of a polymorphic site of rs2051672 is CC, the genotype of a polymorphic site of rs1340266 is AA, and the genotype of a polymorphic site of rs492220 is TC or CC; and


(7) the genotype of a polymorphic site of rs1038308 is CC, the genotype of a polymorphic site of rs1059033 is TT, and the genotype of a polymorphic site of rs607209 is AA or AC.


As a result of the comparison of occurrence frequencies of the genotype patterns of the multilocus markers (1) through (7) in a patient group and a normal group, it was determined that the genotype patterns of the multilocus markers (1) through (7) were significantly associated with type II diabetes mellitus. Occurrence frequencies of the multilocus markers (1) through (7) are presented in Table 4 below.

TABLE 4OccurrenceOccurrence95%Markerfrequency infrequency inOddsconfidencenameGenotype patternpatient groupnormal groupratiointerval1DMX_011 = AA or AG59193.62(2.1, 6.24)and DMX_044 = TT2DMX_029 = CC,94313.96(2.54, 6.18)DMX_032 = AA, andDMX_056 = AG or GG3DMX_032 = TA or AA,70233.67(2.22, 6.06)DMX_033 = TT or TC, andDMX_131 = AT or TT4DMX_001 = TT,63193.93(2.29, 6.76)DMX_029 = CC,DMX_062 = CC or CT, andDMX_116 = TT or TC5DMX_009 = TT or TG62174.34(2.47, 7.62)DMX_101 = AT or TT, andDMX_154 = AG or GG6DMX_029 = CC,71233.73(2.26, 6.17)DMX_058 = AA, andDMX_104 = TC or CC7DMX_030 = CC,63193.93(2.29, 6.76)DMX_069 = TT, andDMX_152 = AA or AC


NCBI GenBank accession numbers corresponding to the marker names in Table 4 are as presented in Table 3. Table 4 shows occurrence frequencies of the genotype patterns of the multilocus markers (1) through (7) in 300 type II diabetes mellitus patients and 300 normal persons. 82% (247/300) of the patients satisfied at least one of the genotype patterns of the multilocus markers (1) through (7). In Table 4, the odds ratio represents the ratio of the probability of a multilocus genotype pattern in the patient group to the probability of the multilocus genotype pattern in the normal group. As shown in Table 4, all odds ratios were greater than 3.5. This reveals that occurrence frequencies of the genotype patterns of the multilocus markers (1) through (7) are closely positively associated with type II diabetes mellitus.


The method of diagnosing type II diabetes mellitus according to the present invention may include isolating a nucleic acid sample from an individual; and determining a nucleotide of at least one polymorphic site (position 101) of polynucleotides of SEQ ID NOS: 1-18 or complementary polynucleotides thereof.


The operation of isolating the nucleic acid sample from the individual may be carried out using a common DNA isolation method. For example, the nucleic acid sample can be obtained by amplifying a target nucleic acid by polymerase chain reaction (PCR) followed by purification. In addition to PCR, there may be used ligase chain reaction (LCR) (Wu and Wallace, Genomics 4, 560 (1989), Landegren et al., Science 241, 1077 (1988)), transcription amplification (Kwoh et al., Proc. Natl. Acad. Sci. USA 86, 1173 (1989)), self-sustained sequence replication (Guatelli et al., Proc. Natl. Acad. Sci. USA 87, 1874 (1990)), or nucleic acid sequence based amplification (NASBA). The last two methods are related to isothermal reaction based on isothermal transcription and produce 30 or 100-fold RNA single strands and DNA double strands as amplification products.


The operation of determining the nucleotide of the at least one polymorphic site may be carried out using any method known in the art. For example, the dideoxy method for direct nucleotide sequence determination or the hybridization method for indirect nucleotide sequence determination may be used. For the latter, various methods may be used. For example, a nucleic acid microarray may be used. That is, the operation of determining the nucleotide of the at least one polymorphic site may include hybridizing the nucleic acid sample onto a microarray immobilized with one or more polynucleotides for the diagnosis or treatment of type II diabetes mellitus, each of which includes at least 10 contiguous nucleotides derived from the group consisting of nucleotide sequences of SEQ ID NOS: 1-18 and includes a nucleotide of the position 101, or complementary polynucleotides thereof; and detecting the hybridization result.


A microarray and a method of manufacturing a microarray by immobilizing a probe polynucleotide on a substrate are well known in the art. Immobilization of a probe polynucleotide associated with type II diabetes mellitus of the present invention on a substrate can be easily performed using a conventional technique. Hybridization of nucleic acids on a microarray and detection of the hybridization result are also well known in the art. For example, the detection of the hybridization result can be performed by labeling a nucleic acid sample with a labeling material generating a detectable signal, such as a fluorescent material (e.g., Cy3 and Cy5), hybridizing the labeled nucleic acid sample onto a microarray, and detecting a signal generated from the labeling material.


The present invention also provides a polynucleotide including at least 10 contiguous nucleotides of at least one nucleotide sequence selected from the group consisting of polymorphic sequences of Table 5 below and including a nucleotide of a polymorphic site (position 101) of the at least one nucleotide sequence, or a complementary polynucleotide thereof.

TABLE 5NCBI GenBankAccession No.Polymorphic sitePolymorphic basers502612position 101 of SEQ ID NO: 1C or Trs1394720position 101 of SEQ ID NO: 2T or Grs488115position 101 of SEQ ID NO: 3A or Grs2051672position 101 of SEQ ID NO: 4C or Ars1038308position 101 of SEQ ID NO: 5C or Trs1943317position 101 of SEQ ID NO: 6T or Ars929476position 101 of SEQ ID NO: 7T or Crs1984388position 101 of SEQ ID NO: 8A or Trs752139position 101 of SEQ ID NO: 9A or Grs2058501position 101 of SEQ ID NO: 10C or Trs1059033position 101 of SEQ ID NO: 11T or Crs492220position 101 of SEQ ID NO: 12T or Crs1461986position 101 of SEQ ID NO: 13T or Crs607209position 101 of SEQ ID NO: 14A or Crs197367position 101 of SEQ ID NO: 15A or Grs1340266position 101 of SEQ ID NO: 16A or Grs1316909position 101 of SEQ ID NO: 17A or Trs1377188position 101 of SEQ ID NO: 18A or T


The polynucleotide may be at least one polynucleotide set selected from the group consisting of polynucleotide sets (1) through (7) below:


(1) rs488115 and rs1984388;


(2) rs2051672, rs1943317, and rs752139;


(3) rs1943317, rs929476, and rs1377188;


(4) rs502612, rs2051672, rs2058501, and rs1461986;


(5) rs1394720, rs1316909, and rs197367;


(6) rs2051672, rs1340266, and rs492220; and


(7) rs1038308, rs1059033, and rs607209.


The polynucleotide of the present invention can be used as a primer or a probe. The polynucleotide can be immobilized onto a solid substrate, i.e., a microarray, as well as in a solution. Since the polynucleotide of the present invention is a type II diabetes mellitus-specific nucleotide sequence, it can be used for type II diabetes mellitus-related applications such as diagnosis or treatment of type II diabetes mellitus.


The present invention also provides a microarray immobilized with the polynucleotide of the present invention. The polynucleotide and the microarray are as described above.


The present invention also provides a diagnostic kit for the detection of type II diabetes mellitus including the polynucleotide of the present invention. Preferably, the diagnostic kit includes at least one multilocus marker polynucleotide.


In the diagnostic kit of the present invention, the polynucleotide contained in the diagnostic kit is as described above. The diagnostic kit of the present invention may include the manufacturer's specification stating a method, materials, etc. to an extent that can be understood by those of ordinary skill in the art. For example, the diagnostic kit can be used in identifying a predetermined allele at a polymorphic site by hybridizing a nucleic acid sample obtained from an individual onto the polynucleotide of the present invention used as a probe and measuring the degree of hybridization using a signal generated from the resultant hybrids. Based on the identification of predetermined allele or genotype, it can be determined if an individual has a likelihood of being diagnosed as at risk of developing type II diabetes mellitus or as a type II diabetes mellitus patient.


Hereinafter, the present invention will be described more specifically by Examples. However, the following Examples are provided for illustrative purposes only and are not intended to limit the scope of the present invention.







EXAMPLES
Example 1

In Example 1, DNA samples were extracted from blood of a patient group consisting of 300 Korean persons that had been identified as type II diabetes mellitus patients and had been undergoing treatment and a normal group consisting of 300 persons free from symptoms of type II diabetes mellitus and being of the same age as the patient group, and occurrence frequencies of specific SNPs were evaluated. The SNPs used in this Example were selected from a known database (NCBI dbSNP: http://www.ncbi.nlm.nih.gov/SNP/) or (Sequenom: http://www.realsnp.com/). Primers hybridizing with sequences around the selected SNPs were used to assay the nucleotide sequences of SNPs in the DNA samples.


1. Preparation of DNA Samples


DNA samples were extracted from blood of type II diabetes mellitus patients and normal persons. The DNA extraction was performed according to a known extraction method (Molecular cloning: A Laboratory Manual, p 392, Sambrook, Fritsch and Maniatis, 2nd edition, Cold Spring Harbor Press, 1989) and the specification of a commercial kit manufactured by Centra system. Among extracted DNA samples, only DNA samples having a purity (A260/A280 nm) of at least 1.7 were used.


2. Amplification of Target DNAs


Target DNAs, which were predetermined DNA regions containing SNPs to be analyzed, were amplified by PCR. The PCR was performed using a common method under the following conditions. First, 2.5 ng/ml of target genomic DNAs were prepared. Then, the following PCR mixture was prepared.

TABLEWater (HPLC grade)2.24 μl10× buffer (15 mM MgCl2, 25 mM MgCl2) 0.5 μldNTP Mix (GIBCO) (25 mM for each)0.04 μlTaq pol (HotStar) (5U/μl)0.02 μlForward/reverse primer Mix (1μ M for each)0.02 μlDNA1.00 μlTotal volume5.00 μl


Here, the forward and reverse primers were designed based on upstream and downstream sequences of SNPs of a known database. These primers are listed in Table 6 below.


The thermal cycles of PCR were as follows: incubation at 95° C. for 15 minutes; 45 cycles at 95° C. for 30 seconds, at 56° C. for 30 seconds, and at 72° C. for 1 minute; and incubation at 72° C. for 3 minutes and storage at 4° C. As a result, amplified DNA fragments which were 200 or less nucleotides in length were obtained.


3. Analysis of SNPs in Amplified Target DNA Fragments


Analysis of SNPs in the amplified target DNA fragments was performed using a homogeneous MassEXTEND (hME) technique available from Sequenom. The principle of the MassEXTEND technique is as follows. First, primers (also called “extension primers”) ending immediately before SNPs within the target DNA fragments were designed. Then, the primers were hybridized with the target DNA fragments and DNA polymerization was performed. At this time, a polymerization solution contained a reagent (e.g., ddTTP) terminating the polymerization immediately after the incorporation of a nucleotide complementary to a first allelic nucleotide (e.g., A allele). In this regard, when the first allele (e.g., A allele) exists in the target DNA fragments, products in which only a nucleotide (e.g., T nucleotide) complementary to the first allele is extended from the primers will be obtained. On the other hand, when a second allele (e.g., G allele) exists in the target DNA fragments, a nucleotide (e.g., C nucleotide) complementary to the second allele is added to the 3′-ends of the primers and then the primers are extended until a nucleotide complementary to the closest first allele nucleotide (e.g., A nucleotide) is added. The lengths of products extended from the primers were determined by mass spectrometry. Therefore, alleles present in the target DNA fragments could be identified. Illustrative experimental conditions were as follows.


First, unreacted dNTPs were removed from the PCR products. For this, 1.53 μl of deionized water, 0.17 μl of hME buffer, and 0.30 μl of shrimp alkaline phosphatase (SAP) were added and mixed in 1.5 ml tubes to prepare SAP enzyme solutions. The tubes were centrifuged at 5,000 rpm for 10 seconds. Thereafter, the PCR products were added to the SAP solution tubes, sealed, incubated at 37° C. for 20 minutes and then at 85° C. for 5 minutes, and stored at 4° C.


Next, homogeneous extension was performed using the amplified target DNA fragments as templates. The compositions of the reaction solutions for the extension were as follows.

TABLEWater (nanoscale deionized water)1.728 μlhME extension mix (10× buffer containing 2.25 mM d/ddNTPs)0.200 μlExtension primers (1μ M for each)0.054 μlThermosequenase (32U/μl)0.018 μlTotal volume 2.00 μl


The reaction solutions were thoroughly stirred and subjected to spin-down centrifugation. Tubes or plates containing the resultant solutions were compactly sealed and incubated at 94° C. for 2 minutes, followed by 40 thermal cycles at 94° C. for 5 seconds, at 52° C. for 5 seconds, and at 72° C. for 5 seconds, and storage at 4° C. The homogeneous extension products thus obtained were washed with a resin (SpectroCLEAN™). Extension primers used in the extension are listed in Table 6 below.

TABLE 6Primers for amplification and extension primersfor homogeneous extension for target DNAsAmplification primer (SEQ ID NO)Extension primerMarkerForward primerReverse primer(SEQ ID NO)DMX_001192021DMX_009222324DMX_011252627DMX_029282930DMX_030313233DMX_032343536DMX_033373839DMX_044404142DMX_056434445DMX_062464748DMX_069495051DMX_104525354DMX_116555657DMX_152585960DMX_154616263DMX_058646566DMX_101676869DMX_131707172


Nucleotides of polymorphic sites in the extension products were assayed using mass spectrometry, MALDI-TOF (Matrix Assisted Laser Desorption and Ionization-Time of Flight). The MALDI-TOF is operated according to the following principle. When an analyte is exposed to a laser beam, it flies toward a detector positioned at the opposite side in a vacuum state, together with an ionized matrix. At this time, the time taken for the analyte to reach the detector is calculated. A material with a smaller mass reaches the detector more rapidly. The nucleotides of SNPs in the target DNA fragments are determined based on a difference in mass between the DNA fragments and known SNP sequences.


The results for the determination of polymorphic sequences of the target DNAs using the MALDI-TOF are shown in Tables 2 and 3. Each allele may exist in the form of homozygote or heterozygote in an individual. According to Mendel's Law of inheritance and Hardy-Weinberg Law, a genetic makeup of alleles constituting a population is maintained at a constant frequency. When the genetic makeup is statistically significant, it can be considered to be biologically meaningful. The SNPs according to the present invention occur in type II diabetes mellitus patients at a statistically significant level, as shown in Tables 2 and 3, and thus, can be efficiently used in diagnosis of type II diabetes mellitus.


As shown in Table 4, genotype patterns based on the combination of the nucleotides of the polymorphic sites of Tables 2 and 3, i.e., multilocus genotype patterns are highly associated with type II diabetes mellitus.


INDUSTRIAL APPLICABILITY

According to a method of diagnosing type II diabetes mellitus of the present invention, the presence or a risk of type II diabetes mellitus can be effectively detected.

Claims
  • 1. A method of diagnosing type II diabetes mellitus in an individual, which comprises determining a nucleotide of a polymorphic site of at least one polynucleotide selected from polynucleotides identified by NCBI GenBank accession numbers in a table below.
  • 2. The method of claim 1, wherein when the nucleotides of the polymorphic sites of SEQ ID NOS: 1-18 satisfy at least one of multilocus markers (1) through (7) below, it is determined that the individual has a higher likelihood of being diagnosed as a type II diabetes mellitus patient or as at risk of developing type II diabetes mellitus: (1) the genotype of a polymorphic site of rs488115 is AA or AG and the genotype of a polymorphic site of rs1984388 is TT; (2) the genotype of a polymorphic site of rs2051672 is CC, the genotype of a polymorphic site of rs1943317 is AA, and the genotype of a polymorphic site of rs752139 is AG or GG; (3) the genotype of a polymorphic site of rs1943317 is TA or AA, the genotype of a polymorphic site of rs929476 is TT or TC, and the genotype of a polymorphic site of rs1377188 is AT or TT; (4) the genotype of a polymorphic site of rs502612 is TT, the genotype of a polymorphic site of rs2051672 is CC, the genotype of a polymorphic site of rs2058501 is CC or CT, and the genotype of a polymorphic site of rs1461986 is TT or TC; (5) the genotype of a polymorphic site of rs1394720 is TT or TG, the genotype of a polymorphic site of rs1316909 is AT or TT, and the genotype of a polymorphic site of rs197367 is AG or GG; (6) the genotype of a polymorphic site of rs2051672 is CC, the genotype of a polymorphic site of rs1340266 is AA, and the genotype of a polymorphic site of rs492220 is TC or CC; and (7) the genotype of a polymorphic site of rs1038308 is CC, the genotype of a polymorphic site of rs1059033 is TT, and the genotype of a polymorphic site of rs607209 is AA or AC.
  • 3. The method of claim 1, wherein the operation of determining the nucleotide of the polymorphic site is carried out by direct nucleotide sequence analysis or hybridization.
  • 4. The method of claim 3, wherein the operation of determining the nucleotide of the polymorphic site comprises: hybridizing a nucleic acid sample obtained from the individual onto a microarray on which a probe polynucleotide including a polymorphic site of at least one polynucleotide of SEQ ID NOS: 1-18 or a complementary probe polynucleotide thereof is immobilized; and detecting a hybridization result.
  • 5. A polynucleotide comprising at least 10 contiguous nucleotides of at least one nucleotide sequence selected from the group consisting of polymorphic sequences of a table below and comprising a nucleotide of a polymorphic site (position 101) of the at least one nucleotide sequence, or a complementary polynucleotide thereof.
  • 6. The polynucleotide of claim 5, wherein the polynucleotide is at least one polynucleotide set selected from the group consisting of polynucleotide sets (1) through (7) below: (1) rs488115 and rs1984388; (2) rs2051672, rs1943317, and rs752139; (3) rs1943317, rs929476, and rs1377188; (4) rs502612, rs2051672, rs2058501, and rs1461986; (5) rs1394720, rs1316909, and rs197367; (6) rs2051672, rs1340266, and rs492220; and (7) rs1038308, rs1059033, and rs607209.
  • 7. A microarray comprising the polynucleotide of claim 5.
  • 8. A diagnostic kit for the detection of type II diabetes mellitus comprising the polynucleotide of claim 5.
  • 9. A microarray comprising the polynucleotide of claim 6.
  • 10. A diagnostic kit for the detection of type II diabetes mellitus comprising the polynucleotide of claim.
Priority Claims (2)
Number Date Country Kind
10-2005-0012418 Feb 2005 KR national
10-2005-0032739 Apr 2005 KR national