FIELD OF THE INVENTION
This invention relates to genes for diagnosing colorectal cancer, particularly provided a method of clinical diagnosis for colorectal cancer which enables the effects of early diagnosis, specificity, highly sensitivity and safety.
BACKGROUND OF THE INVENTION
Colorectal cancer is one of the most common malignant tumors of the world, it is the second most frequent cause of malignant tumor related mortality in developed countries. In developed countries, mortality rate caused by colorectal cancer seems have a descending tendency progressively in previous 20 years, the main causes for early diagnosis is provided and the improvement of methods of therapy and medicines. But in Taiwan the reason of changing in diet habit is occidental habit input and the rapid changing in environment, the rate of suffering for colorectal cancer is rising constantly, furthermore, also showing an age-descending tendency.
According to top ten related cancer of Taiwanese of 2002, colorectal cancer (CRC) is the third leading cause of cancer-related death for male and female, which is announced by The Department of Health (DOH), highest level of the executive branch, Taiwan. About 6681 new cases of colorectal cancer diagnosed per year such as statistical data by DOH of 1999, for 3649 patients dead in the colorectal cancer per year such as statistical data by DOH of 2002 In Taiwan. The average age of colorectal cancer patient is lower than other countries. In other words, twenty-year-old or thirty-year-old people suffer from the colorectal cancer in Taiwan. Therefore, we can't ignore the possibility of the colorectal cancer caused by young person.
Although methods of diagnosis and surgical operation treatment are improved for colorectal cancer patients, if make a comparison between early diagnosis with later period diagnosis by surgical operation respectively, the treatment is able to probably overcome the colorectal cancer in early diagnosis, but is not able to absolutely overcome the colorectal cancer in the later period diagnosis. The far metastasis are main problem of the treatment for the colorectal cancer, therefore, if a method with highly sensitivity, highly specificity and easily diagnosis which is able to detect early and potentially curable CRC, We believe that is a novel target for CRC diagnosis and therapy.
The present invention is to provide functional genetic method, for diagnostic genes of colorectal cancer consist of 71 types of genes, that can be applied for early diagnosing possibility of recurrence and metastasis for colorectal patients. Simultaneously, tracing 100 colorectal cancer cases, found that 92% genes variation in colorectal tissue. In the process of tracing for 100 colorectal cancer cases simultaneously, mutation of genes is found in 92% colorectal cancer tissues. In the tracing process, although CEA of 16 patients still in normal value range, that detect early tumor cells in blood by using genes variation testing.
In WO0055351, ROSEN CRAIG A et. al., “Human Colon Cancer Associated Gene Sequences And Polypeptides”, disclose colon cancer related polynucleotides and the polypeptides encoded by the polynucleotides herein collectively known as “colon cancer antigens”, screening methods for identifying agonists and antagonists of colon cancer antigens of the invention, But, the present invention is to provide SSH and cDNA microarray technology to identify candidate marker genes which are overexpressed continuously from colorectal proliferous polypus to colorectal oncogene, detecting overexpressed genes are selected from up regulation genes which related intently in colorectal cancer oncogene, and down regulation genes which related in colorectal cancer oncogene. The total 71 genes are used to diagnosing early colorectal cancer.
SUMMARY OF THE INVENTION
Therefore, the main purpose according to this present invention is to provide the methods of clinical diagnosis for colorectal cancer for early diagnosis, specificity, highly sensitivity and safety.
For the purpose stated above, the gene sequences comprise the steps of: (1) deriving epithelium cells from normal intestines, polypus of intestines and colorectal cancer tissue; (2) collecting genes with highly differential gene expression by Suppression Subtractive Hybridization (SSH), and building library; (3) deriving colonies with relatively high signal intensities from cancer tissue; (4) collecting more clinically cancer tissues by Northern Hybridization, real-time Polymerase Chain Reaction (PCR) combined with analysis of bioinformation to affirm variation between differential gene expression; and (5) selecting the most suitable genes from said library. Moreover, the reagent uses the gene sequence as method of clinical diagnosis for colorectal cancer to the early diagnosis.
BRIEF DESCRIPTION OF THE DRAWINGS
The present invention will be better understood from the following detailed description of preferred embodiments of the invention, taken in conjunction with the accompanying drawings, in which.
Table 1 is a table showing the result of clinical examination of colorectal cancer biochip;
FIG. 1 is a view showing the procedure of deriving genes according to the present invention;
FIG. 2
a and FIG. 2b are views showing the primary screening according to the present invention;
FIG. 3
a and FIG. 3b are views showing affirmation to genes using Northern Blotting method according to the present invention;
FIGS. 4
a and 4b are views showing quantity expression of cancer tissue according to the present invention; and
FIG. 5 is a diagram showing second preferred embodiments according to the present invention.
DESCRIPTION OF THE PREFERRED EMBODIMENTS
The following descriptions of the preferred embodiments are provided to understand the methods and the procedures of the present invention. Please refer to FIG. 1, showing the procedure of searching genes according to the present invention. Said procedure comprise the steps of: (1) deriving epithelium cells from normal intestines, polypus of intestines and colorectal cancer tissue; (2) collecting genes with highly differential gene expression by Suppression Subtractive Hybridization (SSH), and building library; (3) deriving colonies with relatively high signal intensities from cancer tissue; (4) collecting more clinically cancer tissues by Northern Hybridization, real-time Polymerase Chain Reaction (PCR) combined with analysis of bioinformation to affirm variation between differential gene expression; and (5) selecting the most suitable genes from said library. Moreover, by using the gene sequence as a reagent, this enables clinical diagnosis for colorectal cancer to the effects of early diagnosis, specificity, highly sensitivity and safety.
The genes for diagnosing colorectal cancer, the specific oligonucleotides sequence are selected from the group consisting of:
|
|
NoHs IDACC NoDiscriptionDefinitionOligo sequence
|
|
1Hs.107213BC027178FNBP3Homo sapiens,CATCATAGGAA
(SEQUENCEForminformin bindingACGTTCCCGCT
LISTING 72)bindingprotein 3, cloneCTCGATCGGGG
protein 3MGC: 16979TCAGATTCAGAT
IMAGE: 4343048,GATGATG
mRNA,(SEQUENCE
complete cdsLISTING 1)
|
2Hs.123107NM_002257KLK1Homo sapiensGCCTTCTGTCG
(SEQUENCEKallikrein 1,kallikrein 1,CCGTCAGAGTG
LISTING 73)renal/pancreas/renal/pancreas/CTGTCTTATGTG
salivarysalivary (KLK1),AAGTGGATCGA
mRNA.GGACA
(SEQUENCE
LISTING 2)
|
3Hs.1369NM_000574DAF DecayHomo sapiensGGGCAGTCAAT
(SEQUENCEacceleratingdecayGGTCAGATATT
LISTING 74)factor foracceleratingGAAGAGTTCTG
complementfactor forCAATCGTAGCT
(CD55,complementGCGAGGTG
Cromer blood(CD55, Cromer(SEQUENCE
groupblood groupLISTING 3)
system)system) (DAF),
mRNA
|
4Hs.151254NM_005046KLK7Homo sapiensTGGAACCACCT
(SEQUENCEKallikrein 7kallikrein 7GTACTGTCTCC
LISTING 75)(chymotryptic,(chymotryptic,GGCTGGGGCAC
stratumstratumTACCACGA
corneum)corneum)(SEQUENCE
(KLK7),LISTING 4)
transcript variant
1, mRNA.
|
5Hs.1526NM_001681ATP2A2Homo sapiensCATCGGCATCT
(SEQUENCEATPase,ATPase, Ca++TCGGGCAGGAT
LISTING 76)Ca++transporting,GAGGACGTGAC
transporting,cardiac muscle,GTCAAAAGCTTT
cardiacslow twitch 2CACAG
muscle, slow(ATP2A2),(SEQUENCE
twitch 2mRNALISTING 5)
|
6Hs.184270NM_006135CAPZA1Homo sapiensTGACCACTTAC
(SEQUENCECappingcapping proteinGGAAAGAAGCA
LISTING 77)protein actin)(actin filament)AGTGACCCCCA
filament)muscle Z-line,GCCAGAAGAAG
muscle Z-alpha 1CAGATG
line, alpha 1(CAPZA1),(SEQUENCE
mRNA.LISTING 6)
|
7Hs.2043NM_001151SLC25A4Homo sapiensAGATCTTCAAGT
(SEQUENCESolute carriersolute carrierCTGATGGCCTG
LISTING 78)family 25family 25AGGGGGCTCTA
(mitochondrial(mitochondrialCCAGGGTTTCA
carriercarrier; adenineACGTC
adeninenucleotide(SEQUENCE
nucleotidetranslocator),LISTING 7)
translocator),member 4
member 4(SLC25A4),
nuclear gene
encoding
mitochondrial
protein, mRNA.
|
8Hs.267871NM_005177ATP6V0A1Homo sapiensGGACAGAAAGG
(SEQUENCEATPase, H+ATPase, H+AATTCAGTGTTT
LISTING 79)transporting,transporting,CCTGGTAGTGG
lysosomal V0lysosomal V0TTGCACTACTGT
subunit asubunit aGTGTACCTTGG
isoform 1isoform 1(SEQUENCE
(ATP6V0A1),LISTING 8)
mRNA.
|
9Hs.4935D79998KIAA0176Human mRNAGGAAAGGATAC
(SEQUENCEKIAA0176for KIAA0176GGGACAATGAG
LISTING 80)proteingene, partial cdsAACAGAACTTCA
CAAGGCCCCGT
GAAGC
(SEQUENCE
LISTING 9)
|
10Hs.5509NM_006495EVI2BHomo sapiensGCCCCTGCCAC
(SEQUENCEEcotropicecotropic viralCAGTAGATTTTA
LISTING 81)viralintegration siteTGAAAAACCAA
integration2B (EVI2B),GAAGATTCCAA
site 2BmRNA.CCTTGAGATCC
AGTGTC
(SEQUENCE
LISTING 10)
|
11Hs.5662NM_006098GNB2L1Homo sapiensATGACTGAGCA
(SEQUENCEGuanineguanineGATGACCCTTC
LISTING 82)nucleotidenucleotideGTGGCACCCTC
bindingbinding proteinAAGGGCCACAA
protein (G(G protein), betaC
protein), betapolypeptide 2- (SEQUENCE
polypeptidelike 1 (GNB2L1),LISTING 11)
2-like 1mRNA.
|
12Hs.75990NM_005143HPHomo sapiensAGGCTGTTGGA
(SEQUENCEHaptoglobinhaptoglobinGATAAACTTCCT
LISTING 83)(HP), mRNA.GAATGTGAAGC
AGATGACGGCT
GCCCG
(SEQUENCE
LISTING 12)
|
13Hs.83384NM_006272S100B S100Homo sapiensCCGAACTCAAG
(SEQUENCEcalciumS100 calciumGAGCTCATCAA
LISTING 84)bindingbinding protein,CAATGAGCTTTC
protein, betabeta (neural)CCATTTCTTAGA
(neural)(S100B), mRNAGGAAATCAAAG
AGCAGGAG
(SEQUENCE
LISTING 13)
|
14Hs.10029NM_001814CTSCHomo sapiensCACCGGAAAGA
(SEQUENCECathepsin Ccathepsin CAGGTGGGAACT
LISTING 85)(CTSC), mRNAGCCTCTGAGAA
TGTGTATGTCAA
CACAGC
(SEQUENCE
LISTING 14)
|
15Hs. 103982NM_005409SCYB11Homo sapiensGGGCATGGCTA
(SEQUENCESmallsmall inducibleTAGCCTTGGCT
LISTING 86)induciblecytokineGTGATATTGTGT
cytokinesubfamily BGCTACAGTTGTT
subfamily B(Cys-X-Cys),CAAGGC
(Cys-X-Cys),member 11(SEQUENCE
member 11(SCYB11),LISTING 15)
mRNA.
|
16Hs.12314AL049397HomoHomo sapiensCAACACCACAG
(SEQUENCEsapiensmRNA; cDNAACAGCTGCAGG
LISTING 87)mRNA; cDNADKFZp586C1019ACTCGATATCCA
DKFZp586C1019(from cloneTGGCTTCTTTCC
(fromDKFZp586C1019)ATCAC
clone(SEQUENCE
DKFZp586C1019)LISTING 16)
|
17Hs.150557NM_001206BTEB1 BasicHomo sapiensTTCCACCCCAG
(SEQUENCEtranscriptionbasicCATGATCAAGC
LISTING 88)elementtranscriptionGATCGAAAAAG
bindingelement bindingGCGCTGGCCAA
protein 1protein 1CGCTTT
(BTEB1),(SEQUENCE
mRNA.LISTING 17)
|
18Hs.169266NM_000909NPY1RHomo sapiensCCGGTCTCGGG
(SEQUENCENeuropeptideneuropeptide YATGATGATTATG
LISTING 89)Y receptor Y1receptor Y1AAACAATAGCC
(NPY1R),ATGTCCACGAT
mRNA.GCACACAG
(SEQUENCE
LISTING 18)
|
19Hs.1827NM_002507NGFR NerveHomo sapiensCAAGCGGGAGG
(SEQUENCEgrowth factornerve growthAGGTGGAGAAG
LISTING 90)receptorfactor receptorCTTCTCAACGG
(TNFR(TNFRCTCTGCG
superfamily,superfamily,(SEQUENCE
member 16)member 16)LISTING 19)
(NGFR), mRNA.
|
20Hs.1869NM_002633PGM1Homo sapiensGCCAACGGGAT
(SEQUENCEPhosphoglucomu-phosphoglucomu-CGGTCGCTTGG
LISTING 91)tase 1tase 1 (PGM1),TTATCGGACAG
mRNA.AATGGAATCCT
CTCCA
(SEQUENCE
LISTING 20)
|
21Hs.194148NM_005433YES1 V-yes-Homo sapiensCAAGTGTGAGC
(SEQUENCE1 Yamaguchiv-yes-1CATTATGGAGC
LISTING 92)sarcoma viralYamaguchiAGAACCCACTA
oncogenesarcoma viralCAGTGTCACCA
homolog 1oncogeneTGTCCG
homolog(SEQUENCE
1(YES1), mRNALISTING 21)
|
22Hs.2352X74210ADCY2H. sapiensTCGTCTGCTTTG
(SEQUENCEAdenylatemRNA forCTGGACAGCTT
LISTING 93)cyclase 2adenylyl cyclaseCTGCAATGCAG
(brain)CAAAAAAGCCT
CTCCC
(SEQUENCE
LiSTING 22)
|
23Hs.246885NM_017958FLJ20783Homo sapiensCCAAGATTCTA
(SEQUENCEHypotheticalhypotheticalGGACAAACACA
LISTING 94)proteinproteinGCGTATGTGGG
FLJ20783FLJ20783CTCTGCAGTCA
(FLJ20783),TGACCG
mRNA.(SEQUENCE
LISTING 23)
|
24Hs.29665NM_014944CLSTN1Homo sapiensCACGAGCCCTT
(SEQUENCECalsyntenin 1calsyntenin 1CTCTGTGACTG
LISTING 95)(CLSTN1),AGGATTACCCG
mRNA.CTCCATCCATC
CAAGAT
(SEQUENCE
LISTING 24)
|
25Hs.3235NM_002272KRT4 KeratinHomo sapiensTTCAGCTGTGG
(SEQUENCE4keratin 4CTCGGCCATTG
LISTING 96)(KRT4), mRNATAGGCGGTGGC
AAGAGAGGT
(SEQUENCE
LISTING 25)
|
26Hs.55209AF327354HomoHomo sapiensTAAAGTGGGCT
(SEQUENCEsapiens DMRDMR proteinCATTGTCATCCC
LISTING 97)proteinmRNA,CAAGCCAGGCC
mRNA,complete cdsAGTTCTCCAGG
complete cdsTGGAA
(SEQUENCE
LISTING 26)
|
27Hs.585NM_000384APOBHomo sapiensGCCCAAGGCCA
(SEQUENCEApolipoproteinapolipoprotein BCAGGGGTCCTT
LISTING 98)B (including(including Ag(x)TATGATTATGTC
Ag(x)antigen)AACAAGTACCA
antigen)(APOB), mRNACTGGG
(SEQUENCE
LISTING 27)
|
28Hs.62187AF022913PIGKHomo sapiensTCTTGTCCTTCG
(SEQUENCEPhosphati-GPIGCAGCGTGGCC
LISTING 99)dylinositoltransamidaseGCTAGTCATATC
glyan, classmRNA,GAGGATCAAGC
Kcomplete cdsAGAA
(SEQUENCE
LISTING 28)
|
29Hs.63290NM_012260HPCL2 2-Homo sapienCATGAACTGCT
(SEQUENCEhydroxyphy-2-GGCCCTTGCTT
LISTING 100)tanoyl-CoAhydroxyphy-GTGATTGGTGG
lyasetanoyl-CoA lyaseTTCCTCTGAAAG
(HPCL2), mRNAAAACCAAG
(SEQUENCE
LISTING 29)
|
30Hs.699NM_000942PPIBHomo sapiensAGCCGGGATAA
(SEQUENCEPeptidylprolylpeptidylprolylACCCCTGAAGG
LISTING 101)isomerase Bisomerase BATGTGATCATC
(cyclophilin(cyclophilin B)GCAGACTGCGG
B)(PPIB), mRNACAAGAT
(SEQUENCE
LISTING 30)
|
31Hs.74111NM_007367RALY RNAHomo sapiensAGCGAGGAAGA
(SEQUENCEbindingRNA bindingGCTGGAACACA
LISTING 102)proteinproteinGCCAGGACACA
(autoantigen-(autoantigenic,GACGCGGATGA
ic, hnRNP-hnRNP-T
associatedassociated with(SEQUENCE
with lethallethal yellow)LISTING 31)
yellow)(RALY)
transcript variant
2, mRNA
|
32Hs.75103NM_003406YWHAZHomo sapiensCGGAAGGTGCT
(SEQUENCETyrosine 3-tyrosine 3-GAGAAAAAACA
LISTING 103)monooxygen-monooxygenase/GCAGATGGCTC
ase/tryptophan/tryptophan 5-GAGAATACAGA
5-monooxygenaseGAGAAAATTGA
monooxygen-activationGACGG
ase activationprotein, zeta(SEQUENCE
protein, zetapolypeptideLISTING 32)
polypeptide(YWHAZ),
mRNA
|
33Hs.75117NM_004515ILF2Homo sapiensTGACTTCTATTT
(SEQUENCEInterleukininterleukinGTGTGAAATGG
LISTING 104)enhancerenhancerCCTTTCCCCGG
binding factorbinding factor 2,GTCAAGCCAGC
2, 45 kD45 kD (ILF2),ACCTG
mRNA(SEQUENCE
LISTING 33)
|
34Hs.75236NM_021952ELAVL4Homo sapiensGCACCATGGAG
(SEQUENCEELAVELAVCCTCAGGTGTC
LISTING 105)(embryonic(embryonicAAATGGTCCGA
lethal,lethal, abnormalCATCCAATACAA
abnormalvision,GCAATG
vision,Drosophila)-like(SEQUENCE
Drosophila)-4 (Hu antigen D)LISTING 34)
like 4 (Hu(ELAVL4),
antigen D)mRNA
|
35Hs.75258NM_004893H2AFY H2AHomo sapiensCACCGAAGCCA
(SEQUENCEhistoneH2A histoneGGAAGCCCCGT
LISTING 106)family,family, memberTTGTAAGCGTG
member YY (H2AFY),TGTTGTGGTGC
transcript variantTTTATT
2, mRNA(SEQUENCE
LISTING 35)
|
36Hs.75498NM_004591SCYA20Homo sapiensGCTACTCCACC
(SEQUENCESmallsmall inducibleTCTGCGGCGAA
LISTING 107)induciblecytokineTCAGAAGCAGC
cytokinesubfamily AAAGCAACTTTGA
subfamily A(Cys—Cys),CTGCT
(Cys—Cys),member 20(SEQUENCE
member 20(SCYA20),LISTING 36)
mRNA
|
37Hs.76913NM_002790PSMA5Homo sapiensGTTTCTTACCCG
(SEQUENCEProteasomeproteasomeGTCTGAGTACG
LISTING 108)(prosome,(prosome,ACAGGGGCGTG
macropain)macropain)AATACTTTTTCT
subunit,subunit, alphaCCCG
alpha type, 5type, 5(SEQUENCE
(PSMA5),LISTING 37)
mRNA
|
38Hs.79889NM_012329MMDHomo sapiensGCTATGAACAT
(SEQUENCEMonocyte tomonocyte toGCTGCTAACTG
LISTING 109)macrophagemacrophageTTACACACACG
differentiation-differentiation-CATTCCTCATTG
associatedassociatedTTCCGGCC
(MMD), mRNA(SEQUENCE
LiSTING 38)
|
39Hs.82173NM_005655TIEG TGFBHomo sapiensTTTGTGGTACC
(SEQUENCEinducibleTGFB inducibleCCAGCCCGTTG
LISTING 110)early growthearly growthTGCAGAGTTCA
responseresponseAAGCCTCCGGT
(TIEG) MrnaG
(SEQUENCE
LISTING 39)
|
40Hs.84072NM_004616TM4SF3Homo sapiensGCAATGACTCT
(SEQUENCETransmembranetransmembraneCAAGCAATTTTT
LISTING 111)44 superfamilyGGTTCTGAAGA
superfamilymember 3TGTAGGCTCTA
member 3(TM4SF3),GCTCCTACGTT
mRNAGCTGTG
(SEQUENCE
LISTING 40)
|
41Hs.85146NM_005239ETS2 V-etsHomo sapiensCTCATGACTCC
(SEQUENCEerythroblastosisv-etsGCCAACTGTGA
LISTING 112)virus E26erythroblastosisATTGCCTTTGTT
oncogenevirus E26AACCCCGTGCA
homolog 2oncogeneGCAAG
(avian)homolog 2(SEQUENCE
(avian) (ETS2),LISTING 41)
mRNA
|
42Hs.85844NM_002529NTRK1Homo sapiensTTCATGGACAA
(SEQUENCENeurotrophicneurotrophicCCCTTTCGAGTT
LISTING 113)tyrosinetyrosine kinase,CAACCCCGAGG
kinase,receptor, type 1ACCCCATCCCT
receptor, type(NTRK1), mRNAGTCT
1(SEQUENCE
LISTING 42)
|
43Hs.88219NM_003454ZNF200 ZincHomo sapiensCCCAGTCAGAA
(SEQUENCEfinger proteinzinc fingerAGTCAAGGAGA
LISTING 114) 200protein 200CCTTGGTTATTA
(ZNF200),TGAAAGATGTG
mRNAAGCTCAAGCCT
TCAGAACAG
(SEQUENCE
LISTING 43)
|
44Hs.9914NM_006350FSTHomo sapiensCCCTGACAGTA
(SEQUENCEFollistatinfollistatin (FST),AGTCGGATGAG
LISTING 115)transcript variantCCTGTCTGTGC
FST317, mRNACAGTGACAATG
CCACTT
(SEQUENCE
LISTING 44)
|
45Hs.169319NM_003419ZNF345 ZincHomo sapiensCAGGGATCTCA
(SEQUENCEfinger proteinzinc fingerGGAAGGACATT
LISTING 116)345protein 345TCAGTGAAATG
(ZNF345),ATATTTACTCCT
mRNAGAAGACATGCC
CACTTTCAG
(SEQUENCE
LISTING 45)
|
46Hs.72805NM_030921DC42Homo sapiensGGCATGGCAGC
(SEQUENCEHypotheticalhypotheticalAAATGCCAACAT
LISTING 117)protein DC42protein DC42TTTGTGGAATAG
(DC42), mRNACAGCAAATCTA
CAAGAGACCCT
GG
(SEQUENCE
LISTING 46)
|
47HS.108301NM_003297NR2C1Homo sapiensGACACCTACAG
(SEQUENCENuclearnuclear receptorGTTATCCAGACT
LlSTING 118)receptorsubfamily 2,ACTACTCAGATT
subfamily 2,group C,GCCAGCTTTAA
group C,member 1GACTGATGAAT
member 1(NR2C1), mRNAGCTACCATC
(SEQUENCE
LISTING 47)
|
48Hs.177926NM_030941LOC81691Homo sapiensCCCAGTGACGA
(SEQUENCEExonucteaseexonucleaseCCAAACTCAAA
LISTING 119)NEF-spNEF-spGATGTACAGAG
(LOC81691),GCAGTTAAAAG
mRNACACTGCTTCCT
C
(SEQUENCE
LISTING 48)
|
49Hs.194746NM_018896CACNA1GHomo sapiensACGTCAGAGAT
(SEQUENCECalciumcalciumTGTGTCTGAAC
LISTING 120)channel,channel,CGTCCTGCTCT
voltage-voltage-CTAGCTCTGAC
dependent,dependent,GGATGA
alpha 1Galpha 1G(SEQUENCE
subunitsubunitLISTING 49)
(CACNA1G),
mRNA
|
50Hs.209061NM_003831SUDD SudDHomo sapiensTCACGGCCTGG
(SEQUENCEsuppressor ofsudDAGTTCTTGTTCC
LISTING 121)bimD6suppressor ofGGGACTGCAGG
homolog (A.bimD6 homologAATGTCTCGCA
nidulans)(A. nidulans)GTT
(SUDD), mRNA(SEQUENCE
LISTING 50)
|
51Hs.25087NM_006070TFG TRK-Homo sapiensTAATCCTTATGC
(SEQUENCEfused geneTRK-fused geneGCGTAACCGTC
LISTING 122)(TFG), mRNACTCCCTTTGGT
CAGGGCTATAC
CCAAC
(SEQUENCE
LISTING 51)
|
52Hs.3017NM_003284TNP1Homo sapiensGATCAAAGCCA
(SEQUENCETransitiontransition proteinGAGAGGAGCCT
LISTING 123)protein 11 (duringATGGAATGTGG
(duringhistone toATCAAATGCCA
histone toprotamineGTTGTGACG
protaminereplacement)(SEQUENCE
replacement)(TNP1), mRNALISTING 52)
|
53Hs.283664NM_032466ASPHHomo sapiensGAACCACAACA
(SEQUENCEAspartateaspartate beta-AGAGGATGATG
LISTING 124)beta-hydroxylaseAGTTTCTTATGG
hydroxylase(ASPH),CGACTGATGTA
transcript variantGATGATAGATTT
3, mRNAGAGACCCTGG
(SEQUENCE
LISTING 53)
|
54Hs.283664NM_032467ASPHHomo sapiensCTCAGGGAGAT
(SEQUENCEAspartateaspartate beta-GGATTTGCTCG
LISTING 125)beta-hydroxylaseTTGTTTTCTTCC
hydroxylase(ASPH),CTCCTTCCCCTT
transcript variantCCTG
4, mRNA(SEQUENCE
LISTING 54)
|
55Hs.171992NM_002843PTPRJHomo sapiensCCGTGGATGTG
(SEQUENCEProteinprotein tyrosineTATGGGATTGT
LISTING 126)tyrosinephosphatase,GTATGACCTTC
phosphatase,receptor type, JGAATGCATAGG
receptor type,(PTPRJ), mRNACCTTTAATGGTG
JC
(SEQUENCE
LISTING 55)
|
56Hs.155172NM_003664AP3B1adaptor-relatedGCCCAGCTTAT
(SEQUENCEprotein complexCATAAACACTGA
LISTING 127)3, beta 1 subunitGAAAACTGTGA
TTGGCTCTGTTC
TGCTGCGGG
(SEQUENCE
LISTING 56)
|
57Hs.183418M37712CDC2L2cell dividionCGAGAAAATGA
(SEQUENCEcycle2-like2AAACCACCTCTT
LISTING 128)GGTTGTTCCAG
AGTCACGGTTC
GACCGAG
(SEQUENCE
LISTING 57)
|
58Hs.244473NM_031900AGXT2alanine-TCCGGGATTGT
(SEQUENCEglyoxylateTACTGTCAGTGT
LISTING 129)aminotransferaseTGGCCATTGCC
2ACCCAAAGGTG
AATGC
(SEQUENCE
LISTING 58)
|
59Hs.12835NM_004842AKAP7A kinaseGAGCCCGATGA
(SEQUENCE(PRKA) anchorCGCTGAACTAG
LISTING 130)protein 7TAAGGCTCAGT
AAGAGGCTGGT
GGAGAA
(SEQUENCE
LISTING 59)
|
60Hs.1650NM_000111SLC26A3solute carrierTCAGCCCCCTA
(SEQUENCEfamily 26,TTACACCTGAC
LISTING 131)member 3GTGGAGACTTT
CCAAAACACCG
TAGGAG
(SEQUENCE
LISTING 60)
|
61Hs.29981NM_000112SLC26A2solute carrierCAGCAGGGATC
(SEQUENCEfamily 26CACACACTGAA
LISTING 132)(sulfateAGAAGTTCGCA
transporter),GAGATTATGAA
member 2GCCATTGGAAT
CC
(SEQUENCE
LISTING 61)
|
62Hs.2246NM_001308CPN1carboxypeptidaseTCAAGTAAGCC
(SEQUENCEN, polypeptideCTGTGAGGAGA
LISTING 133)1, 50 kDGCTCCCAGCAG
AAGGCACGGAG
T
(SEQUENCE
LISTING 62)
|
63Hs.267871NM_005177ATP6V0A1ATPase, H+AAATGCTTGATT
(SEQUENCEtransporting,GCAGAGGTCTG
LISTING 134)lysosomal V0GTGCCCTGTCA
subunit aCCGACCTTGAC
isoform 1TCCAT
(SEQUENCE
LISTING 63)
|
64Hs.75445NM_004684SPARCL1SPARC-like 1CTGCGAGCATC
(SEQUENCE(mast9, hevin)TCTGGTGCCCA
LISTING 135)TGGAACACTGC
ATAACCCGTTTC
TTTGA
(SEQUENCE
LISTING 64)
|
65Hs.39957NM_016445PLEK2pleckstrin 2TGGCGTTCCCA
(SEQUENCE(mouse)CTGGGGTTAAA
LISTING 136)homologGGGAATGTCCA
GGGAAACCTCT
TCAAAG
(SEQUENCE
LISTING 65)
|
66Hs.65029NM_002048GAS1growth arrest-CGACTACTACG
(SEQUENCEspecific 1ATGAGGACTAC
LISTING 137)GATGACGAGCA
GCGCACCGG
(SEQUENCE
LISTING 66)
|
67Hs.239926NM_006745SC4MOLsterol-C4-methylGCTGGTTCTCG
(SEQUENCEoxidase-likeGCATCATGATTT
LISTING 138)CCACCACATGA
ACTTCATTGGAA
ACTATGCTTCAA
C
(SEQUENCE
LISTING 67)
|
68Hs.59271NM_006758U2AF1U2(RNU2) smallTCTGTGACAAC
(SEQUENCEnuclear RNACTGGGAGACCA
LISTING 139)auxillary factor 1CCTGGTGGGGA
ACGTGTACGTC
AAGTTT
(SEQUENCE
LISTING 68)
|
69Hs.8867NM_001554CYR61cysteine-rich,CAAAACGCAGC
(SEQUENCEangiogenicCCTGCGACCAC
LISTING 140)inducer, 61ACCAAGGGGCT
GGAATGCAACT
T
(SEQUENCE
LISTING 69)
|
70Hs.50123NM_003452ZNF189zinc fingerCAACAGCGCAG
(SEQUENCEprotein 189TCTTGTCAACCA
LISTING 141)TCAGATGATCC
ATGCAGAGGTG
AAAACCC
(SEQUENCE
LISTING 70)
|
71Hs.82071NM_006079CITED2Cbp/p300-CACCAGATGAA
(SEQUENCEinteractingCGGGACAAACC
LISTING 142)transactivator,AGCACTTCCGA
with Glu/Asp-GATTGCAACCC
rich carboxy-CAAGCA
terminal domain,(SEQUENCE
2LISTING 71)
|
From the above table, the HS ID of the 71 genes comprises:
Hs. 107213 Hs. 123107 Hs. 1369 Hs. 151254 Hs. 1526Hs. 184270 Hs. 2043 Hs. 267871 Hs. 4935 Hs. 5509 Hs. 5662 Hs. 75990 Hs. 83384 Hs. 10029 Hs. 103982 Hs. 12314 Hs. 150557 Hs. 169266 Hs. 1827 Hs. 1869 Hs. 194148 Hs. 2352 Hs. 246885 Hs. 29665 Hs. 3235 Hs. 55209 Hs. 585 Hs. 62187Hs. 63290 Hs. 699 Hs. 74111 Hs. 75103 Hs. 75117 Hs. 75236 Hs. 75258 Hs. 75498 Hs. 76913 Hs. 79889 Hs. 82173 Hs. 84072 Hs. 85146 Hs. 85844 Hs. 88219 Hs. 9914 Hs. 169319 Hs. 72805 Hs. 108301 Hs. 177926 Hs. 194746 Hs. 209061 Hs. 25087 Hs. 3017 Hs. 283664 Hs. 283664 Hs. 171992 Hs. 155172 Hs. 183418 Hs. 244473 Hs. 12835 Hs. 1650 Hs. 29981 Hs. 2246 Hs. 267871 Hs. 75445 Hs. 39957 Hs. 65029 Hs. 239926 Hs. 59271 Hs. 8867 Hs. 50123 Hs. 82071 etc.
We obtain said specific oligonucleotides sequences by using analysis of OMP (Oligonucleotide Modeling Platform, DNA Software, Inc., Ann Arbor, Mich.) DNA software, Said gene sequences can act as a reagent, a biochip and a medicine for detecting colorectal cancer shown in table 1.
According to the present invention, FIG. 2a and FIG. 2b are views showing the primary screening. FIG. 3a and FIG. 3b are views showing affirmation to genes using Northern Blotting method. FIGS. 4a and 4b are views showing quantity expression of cancer tissue we search over progressive distinctive new genes among the carcinoma process of colorectal cancer by using SSH method to build up CRA libraries and CRC libraries which make the comparison between adenoma, adenocarcinoma and normal tissue, that obtain over 5000 clones in per library; then randomly select about 3000 clones of cDNA from per library to dot on nylon membrane as pre-screen by using Colony Hybridization shown in FIG. 2a and FIG. 2b. The high expression colonies in colorectal cancer and adenoma are selected by the Colony Hybridization and then the nucleic acid of cDNA after purification spot on glass chip by using microarray testing.
The expression profiles of the cDNA chips were derived from a set of cDNA probes including adenoma, adenocarcinoma and the corresponding normal tissue from the same patient. Genes exhibiting at least three-fold greater intensities in the adenocarcinoma or adenoma than in corresponding normal tissue samples were considered significant. The significant up-regulated genes were then further confirmed by Northern blot (FIG. 3a and FIG. 3b) and subsequently sequenced. Northern analysis of each set of cDNA genes on the chip revealed that 36 genes were detected as up-regulated in adenoma compared to normal, and 54 genes were detected as up-regulated in carcinoma as compared to the normal control. A set of 23 genes with serial increase of genes expression from adenoma to carcinoma was identified.
Further, comparison is made by using EMBL/GenBank libraries of NCBI/BLAST program, there are 3 unknown functional genes among 23 identified genes including ectopic viral integration site 2B (Genbank accession no.NM-006495) Homo sapiens chromosome 21q22.1 anonymous mRNA sequence (Genebank accession no.AF003738) and Homo sapiens DMR protein mRNA (Genbank accession no.AF327354), and another 20 functional genes. Among these 20 functional genes, 6 genes are CRC-related (such as TM4SF3), 14 genes are CRC-unrelated (such as ATP2A2). Moreover, we obtain cDNAs of three patients who suffer from adenoma and adenocarcinama simultaneously and four colorectal cancer patients to affirm variation of 23 identified genes, result shown that were at least 3-fold higher in mRNA expression level in the adenocarcinoma tissues compared with normal samples, and the level gradually increased from colorectal adenomas to adenocarcinomas shown in FIG. 4a and FIG. 4b.
Now, methods of clinical diagnosis for detecting colorectal cancer are fecal occult blood test, image test, tumor label and colonoscopy. In each of these methods, we can generalize purpose of the present invention according to disadvantage of these methods.
1. Early Diagnosis
If patient undergo colorectal cancer before tumor cells spread out, five-year survival rate can be achieved over 90%. A certain number of tumor cells are needed for traditional detection by using tumor label method. In the case of image test, normally, correctly affirmation can be made easier when tumor become large. It is high invasion and price to make low acceptance for the patient in the colonoscopy that can not suitable for early diagnosis. Because of the process of circulating of tumor cells, different expression certainly happen among the genes. In the process of proliferation of early tumor cells, the dying cells cause molecule of ribonucleic acid to release into blood circulation. And, early diagnosis can be offered by the detection of using the constructed oligonucleotide biochip which is discharged from small number of tumor cells in the peripheral blood.
2. Specificity and Sensitivity
Fecal occult blood test has shortcomings for high false positives and false negatives to low specificity and sensitivity of the method, therefore the method is merely a first screening tool and the tumor label method is also not high specificity and sensitivity. But, we use these genes to detect peripheral blood of 100 CRC patients, peripheral blood of 50 healthy people and 40 other cancer-related patients as controls shown in FIG. 1, these genes can detect 88 colorectal cancer patients for remarkable sensitivity of 88% ( 88/100) and specificity of 90% ( 90/100) in the clinical analysis.
3. Safety
The colonoscopy has high invasion and price to make low acceptance for patient in the mass screening tool of early diagnosis. Because sample collection is convenience and low invasion, Peripheral blood test of patient is a diagnosis method of genes, that is suitable to mass screening clinical application.
Please refer to FIG. 5, showing another preferred embodiment according to the present invention. We choose genes of colorectal cancer and vector that express simultaneously in eukaryotic and prokaryotic to form recombination genes, and then form eukaryotic transformant cell by using and further form prokaryotic transfectant cell, and then obtain secreted protein by using extract of genes having said recombination genes, and obtain antibody from said secreted protein immune animals for making of protein testing reagent, colorectal vaccine and colorectal protein medicine for colorectal cancer.
The present invention may be embodied in other specific forms without departing from the spirit of the essential attributes thereof; therefore, the illustrated embodiment should be considered in all respects as illustrative and not restrictive, reference being made to the appended claims rather than to the foregoing description to indicate the scope of the invention.
TABLE 1
|
|
Assessment of clinical testing result for
colorectal cancer biochip
No.AgeSexdiagnosisStageresult
|
143FColon cancerC1P
235FColon cancerB2P
368MColon adenomaP
456FColon cancerC2P
532FColon cancerB1N
665MColon cancerB2P
746MColon cancerC1P
867MColon cancerC1P
958FColon cancerC1P
1045MColon cancerB2P
1162FColon adenomaP
1264FColon cancerC2P
1358FColon cancerAN
1476MColon cancerC1P
1538MColon cancerB2P
1667MColon cancerC1P
1786FColon adenomaP
1847FColon cancerC2P
1956MColon cancerB1P
2067FColon cancerB2P
2143FColon adenomaP
2265MColon cancerAP
2343FColon cancerC2P
2454MColon cancerB1P
2534FColon cancerB2P
2676FColon adenomaP
2766MColon cancerB2P
2878FColon cancerB1P
2957MColon cancerB2P
3074MColon adenomaP
3165FColon cancerB1P
3264FColon cancerB2P
3362MColon cancerB1P
3446MColon cancerB2P
3554FColon cancerB1P
3658FColon cancerB1P
3764FColon adenomaP
3856MColon cancerB1P
3967MColon cancerB2P
4048FColon cancerB1P
4155MColon cancerB1P
4264FColon adenomaP
4358FColon cancerC1P
4465MColon cancerC1P
4566MColon cancerC2P
4643MColon cancerC1P
4726FColon cancerC2P
4854MColon cancerC1P
4959FColon cancerC2P
5071FColon adenomaN
5137MColon cancerC1P
5247FColon cancerC1P
5362MColon cancerC2P
5447MColon adenomaP
5555FColon cancerB2P
5648MColon cancerB1P
5766FColon cancerB2P
5864MColon cancerB1N
5930MColon cancerB1P
6056FColon cancerB2P
6146MColon cancerB1P
6267FColon cancerB2P
6335MColon cancerB1P
6445FColon cancerB1P
6586FColon cancerB2P
6654MColon cancerB1P
6757MColon cancerC1P
6876FColon cancerC2P
6946MColon cancerC1P
7068MColon cancerB2P
7145FColon cancerB1P
7287MColon cancerB1P
7353MColon cancerC1P
7458FColon cancerAP
7554MColon cancerB1P
7667FColon cancerC2P
7756FColon cancerAN
7835MColon adenomaP
7979FColon cancerB2P
8082MColon cancerC2P
8176MColon cancerC2P
8254FColon cancerC1P
8342MColon cancerB1P
8468MColon cancerB1P
8527MColon cancerB2P
8667FColon cancerB2P
8746MColon adenomaN
8876FColon cancerB1P
8944MFColon cancerB1P
9056FColon cancerB2P
9165MColon cancerC2P
9257FColon cancerC1P
9367MColon cancerB1P
9478FColon adenomaP
9556FColon cancerC1P
9656MColon cancerC1P
9745FColon cancerB1P
9863FColon cancerB2P
9962MColon cancerC2P
10054FColon cancerC1P
|
|
|
Controls
|
NO
Age
Sex
Diagnosis
result
|
|
1
76
F
Breast cancer
N
|
2
35
F
Breast cancer
N
|
3
74
F
Breast cancer
P
|
4
57
F
Gastric cancer
N
|
5
87
F
Breast cancer
N
|
6
55
M
Gastic cancer
N
|
7
35
M
NPC
N
|
8
78
F
Breast cancer
N
|
9
65
M
NPC
N
|
10
55
F
Breast cancer
N
|
11
54
M
NPC
N
|
12
67
F
normal
N
|
13
86
M
Gastic cancer
P
|
14
53
F
NPC
N
|
15
58
F
normal
N
|
16
78
F
Breast cancer
N
|
17
45
M
normal
N
|
18
78
F
normal
N
|
19
87
F
normal
N
|
20
45
M
normal
N
|
|