Claims
- 1. A method of assembling genomic maps of an organism's DNA or portions thereof comprising:
providing a library of an organism's DNA, wherein individual genomic segments or sequences are found on more than one clone in the library; creating representations of the genome; generating nucleic acid sequence information from the representations; analyzing the sequence information to determine clone overlap from a representation; and combining clone overlap and sequence information from different representations to assemble a genomic map of the organism.
- 2. A method according to claim 1, wherein said creating representations of the genome comprises:
creating a representation of the genomic segments in individual clones by selecting a subpopulation of genomic segments out of a larger set of the genomic segments in that clone.
- 3. A method according to claim 2, wherein said selecting a subpopulation of genomic segments comprises:
subjecting an individual clone to a first restriction endonuclease under conditions effective to cleave DNA from the individual clone so that a degenerate overhang is created in the clone and adding non-palindromic complementary linker adapters to the overhangs in the presence of ligase and the first restriction endonuclease to select or amplify particular fragments from the first restriction endonuclease digested clone as a representation, whereby sufficient linker-genomic fragment products are formed to allow determination of a DNA sequence adjacent the overhang.
- 4. A method according to claim 3, wherein the first restriction endonuclease creates 2 base degenerate overhangs in the clone and 1 to 12 non-palindromic linker adapters, which contain single stranded overhangs of the formula NN/N′N′ where NN/N′N′ is selected from the group consisting of AA/TT, AC/GT, AG/CT, CA/TG, GA/TC, and GG/CC, are used.
- 5. A method according to claim 4, wherein 4 to 6 non-palindromic adapters are used.
- 6. A method according to claim 3, wherein the first restriction endonuclease creates 3 base degenerate overhangs in the clone and 1 to 16 non-palindromic complementary linker adapters, which contain single stranded overhangs of the formulae NAA, NAC, NAG, NAT, NCA, NCC, NCG, NCT, NGA, NGC, NGG, NGT, NTA, NTC, NTG, and NTT, with N being any nucleotide, are used.
- 7. A method according to claim 6, wherein 5 to 9 non-palindromic linker adapters are used.
- 8. A method according to claim 3, wherein the first restriction endonuclease is selected from the group consisting of DrdI, BglI, DraIII, AlwNI, PflMI, AccI, BsiHKAI, SanDI, SexAI, PpuI, AvaII, EcoO109, Bsu36I, BsrDI, BsgI, BpmI, SapI, and isoschizomers thereof.
- 9. A method according to claim 3, wherein said generating nucleic acid sequence information from the representations comprising:
using sequencing primers to obtain sequence information from the ends of a subpopulation of genomic segments out of a larger set of genomic segments.
- 10. A method according to claim 9, wherein the sequencing primers have a 5′ sequence that is complementary to the adapter primers and have a 3′ sequence that is complementary to two or more bases in the degenerate overhang and/or adjacent to the restriction site recognition sequence to obtain sequencing information adjacent to the restriction site.
- 11. A method according to claim 10, wherein 1 to 12 sequencing primers are used with a 3′ end from the set which end in NN, with N being any nucleotide, and/or it's complement N′N′.
- 12. A method according to claim 3, wherein 1 to 16 sequencing primers are used with a 3′ end from the set which end in NAA, NAC, NAG, NAT, NCA, NCC, NCG, NCT, NGA, NGC, NGG, NGT, NTA, NTC, NTG, and NTT, with N being any nucleotide.
- 13. A method according to claim 2, wherein said selecting a subpopulation of genomic segments comprises:
subjecting an individual clone to a first restriction endonuclease under conditions effective to cleave DNA from the individual clone so that a palindromic overhang is created in the clone; and adding complementary linker adapters to the overhangs in the presence of ligase and the first restriction endonuclease to amplify particular fragments from the first restriction endonuclease digested clone as a representation whereby sufficient linker-genomic fragment products are formed to allow determination of a DNA sequence adjacent the overhang.
- 14. A method according to claim 13, wherein the first restriction endonuclease is BamHI, AvrII, NheI, SpeI, XbaI, KpnI, SphI, AatII, AgeI, XmaI, NgoMI, BspEI, MluI, SacII, BsiWI, PstI, ApaLI, or isoschizomers thereof.
- 15. A method according to claim 13, wherein said generating nucleic acid sequence information from the representations comprising:
using sequencing primers to obtain sequence information from the ends of a subpopulation of genomic segments out of a larger set of genomic segments.
- 16. A method according to claim 15, wherein the sequencing primers have a 5′ sequence that is complementary to the adapter primers and have a 3′ sequence that is complementary to two or more bases adjacent to a restriction site recognition sequence to obtain sequencing information adjacent to the restriction site.
- 17. A method according to claim 2, wherein said selecting a subpopulation of genomic segments comprises:
subjecting an individual clone to a first restriction endonuclease under conditions effective to cleave DNA from the individual clone so that a first non-palindromic overhang is created in the clone; subjecting an individual clone to one or more second restriction endonuclease under conditions effective to cleave DNA from the individual clone so that a second overhang different from the first overhang is created in the clone; adding complementary linker adapters to the first and second overhangs in the presence of ligase, the first restriction endonuclease, and the one or more second restriction endonuclease to amplify particular fragments from the restriction endonuclease digest as a representation, whereby sufficient linker-genomic fragment products are formed to allow determination of DNA sequences adjacent to the overhangs.
- 18. A method according to claim 17, wherein the first restriction endonuclease creates 2 base degenerate overhangs in the clone and 1 to 12 non-palindromic linker adapters, which contain single stranded overhangs of the formula NN/N′N′ where NN/N′N′ is selected from the group consisting of AA/TT, AC/GT, AG/CT, CA/TG, GA/TC, and GG/CC, are used.
- 19. A method according to claim 18, wherein 4 to 6 non-palindromic adapters are used.
- 20. A method according to claim 17, wherein the first restriction endonuclease creates 3 base degenerate overhangs in the clone and 1 to 16 non-palindromic complementary linker adapters, which contain single stranded overhangs of the formulae NAA, NAC, NAG, NAT, NCA, NCC, NCG, NCT, NGA, NGC, NGG, NGT, NTA, NTC, NTG, and NTT, with N being any nucleotide, are used.
- 21. A method according to claim 20, wherein 5 to 9 non-palindromic linker adapters are used.
- 22. A method according to claim 17, wherein the first restriction endonuclease is selected from the group consisting of DrdI, BglI, DraIII, AlwNI, PflMI, SanDI, SexAI, PpuI, AvaII, EcoO109, Bsu36I, BsrDI, BsgI, BpmI, SapI, and an isoschizomer thereof and the one or more second restriction endonuclease is MaeII, MspI, BfaI, HhaI, HinP1I, Csp6I, TaqI, MseI, or an isoschizomer thereof.
- 23. A method according to claim 17, wherein said generating nucleic acid sequence information from the representations comprising:
using sequencing primers to obtain sequence information from the ends of a subpopulation of genomic segments out of a larger set of genomic segments.
- 24. A method according to claim 23, wherein (1) the sequencing primers have a 5′ sequence that is complementary to the adapter primers of the first restriction site and have a 3′ sequence that is complementary to two or more bases in the degenerate overhang and/or adjacent to a first restriction site recognition sequence to obtain sequencing information adjacent to the first restriction site and/or (2) the sequencing primers have a 5′ sequence that is complementary to the adapter primers of one or more second restriction site and have a 3′ sequence that is complementary to two or more bases adjacent to the one or more second restriction site recognition sequence to obtain sequencing information adjacent to the one or more second restriction site.
- 25. A method according to claim 17, wherein 1 to 12 sequencing primers are used to obtain sequence information adjacent to the first restriction endonuclease site with the sequencing primers having a 3′ end from the set which end in NN, with N being any nucleotide, and/or it's complement N′N′.
- 26. A method according to claim 17, wherein 1 to 16 sequencing primers are used to obtain sequence information adjacent to the first restriction endonuclease site with the sequencing primers having a 3′ end from the set which ends in NAA, NAC, NAG, NAT, NCA, NCC, NCG, NCT, NGA, NGC, NGG, NGT, NTA, NTC, NTG, and NTT, with N being any nucleotide.
- 27. A method according to claim 1, wherein said generating nucleic acid sequence information from the representations comprising:
using sequencing primers to obtain sequence information from the ends of a subpopulation of genomic segments out of a larger set of genomic segments.
- 28. A method according to claim 27, wherein 1 to 16 sequencing primers are used to obtain sequence information adjacent to the first restriction endonuclease site with the sequencing primers having a 3′ end from the set which ends in NAA, NAC, NAG, NAT, NCA, NCC, NCG, NCT, NGA, NGC, NGG, NGT, NTA, NTC, NTG, and NTT, with N being any nucleotide.
- 29. A method according to claim 27, wherein unique sequencing data is generated for a unique target known as a singlet sequencing run.
- 30. A method according to claim 27, wherein two overlapping sequences are generated for two targets known as a doublet sequencing run.
- 31. A method according to claim 27, wherein three overlapping sequences are generated for three targets known as a triplet sequencing run.
- 32. A method according to claim 27, wherein the sequencing primer has one or two additional bases on its 3′ end to obtain unique singlet sequence information from two or more overlapping sequences.
- 33. A method according to claim 1, wherein said analyzing sequence information comprises:
analyzing sequencing data generated from representations to deconvolute singlet, doublet and triplet sequencing runs and to determine clone overlap.
- 34. A method according to claim 33, wherein two singlet sequencing runs in the same representation set from separate genomic clones are compared, said method further comprising:
evaluating the two singlet sequencing runs for clone overlap by aligning the sequencing runs and scoring identity in at least 8 bases beyond the endonuclease recognition site with less than 3 discordant positions.
- 35. A method according to claim 33, wherein a singlet and a doublet sequencing run in the same representation set from separate genomic clones are compared, said method further comprising:
evaluating the singlet and doublet sequencing runs for clone overlap by aligning the sequencing runs and either scoring identity in at least 8 bases beyond the endonuclease recognition site which are identical in the doublet sequence with the singlet sequence or, alternatively, by scoring at least 16 cases beyond the endonuclease recognition site where the singlet sequence is consistent with either of the bases in the doublet sequence at that position, with less than 3 discordant positions.
- 36. A method according to claim 33, wherein a singlet and a triplet sequencing run in the same representation set from separate genomic clones are compared, said method further comprising:
evaluating whether the clones overlap by aligning the sequencing runs, considering only those positions in the triplet run where two or less bases are read, and either scoring identity in at least 8 bases beyond the endonuclease recognition site which are identical in the triplet sequence with the singlet sequence or, alternatively, by scoring at least 16 cases beyond the endonuclease recognition site where the singlet sequence is consistent with either of the bases in the triplet sequence at that position, with less than 3 discordant positions.
- 37. A method according to claim 33, wherein a doublet and a doublet sequencing run in the same representation set from separate genomic clones are compared, said method further comprising:
evaluating whether the clones overlap by aligning the sequencing runs and scoring identity in at least 16 cases beyond the endonuclease recognition site which are either cases where either doublet sequence has an identical base which is consistent with one or the other of the two bases represented in the other doublet sequence, or cases where both doublet sequences have the same two bases at that position, with less than 3 discordant positions.
- 38. A method according to claim 33, wherein a doublet and a triplet sequencing run in the same representation set from separate genomic clones are compared, said method further comprising:
evaluating whether the clones overlap by aligning the sequencing runs, considering only those positions where two or less bases are read, and scoring identity in at least 16 cases beyond the endonuclease recognition site which are either cases where either doublet or triplet sequence has an identical base which is consistent with one or the other of the two bases represented in the other sequence, or cases where the doublet and triplet sequences have the same two bases at that position, with less than 3 discordant positions.
- 39. A method according to claim 33, wherein two sequencing runs from separate genomic clones in the same representation are compared with either run being a singlet, doublet, or triplet, said method further comprising:
evaluating whether the clones are likely not to overlap by aligning the sequencing runs and scoring discordance in at least 3 positions.
- 40. A method according to claim 1, wherein said combining clone overlap and sequence information comprises:
comparing sequence information in a second representation, present on clones which mark ends of contiguous portions of a first representation, to extend and overlap contigs between representations.
- 41. A method according to claim 1, wherein said combining clone overlap and sequence information comprises:
generating sequence information using a different restriction endonuclease representation on clones which mark ends of contiguous portions in a first representation.
- 42. A method according to claim 1, wherein said combining clone overlap and sequence information comprises:
using singlet sequences in the representations and end sequences for each clone to provide additional sequence information for aligning contiguous portions with the known databases for that organism.
- 43. A method according to claim 1, wherein said combining clone overlap and sequence information comprises:
obtaining unique singlet sequence information from overlapping doublet and triplet sequences, to provide additional sequence information for aligning contiguous portions with the known databases for that organism.
- 44. A method of identifying single nucleotide polymorphisms in genomic DNA comprising:
creating representations of the genomes of multiple individuals; creating a representational library from the representation; generating nucleic acid sequence information from individual clones of the representational library; and analyzing the sequence information to identify single nucleotide polymorphisms among the multiple individuals.
- 45. A method according to claim 44, wherein said creating representations of the genomes of multiple individuals comprises:
subjecting the genomes of multiple individuals to a first restriction endonuclease under conditions effective to cleave DNA so that a first non-palindromic overhang is created in the genomes of multiple individuals; subjecting the genomes of multiple individuals to a one of more second restriction endonuclease under conditions effective to cleave DNA so that a second overhang is created in the genomes of multiple individuals; adding complementary linker adapters to the first and second overhangs in the presence of ligase, the first restriction endonuclease, and the one or more second restriction endonuclease; and adding PCR primers to amplify fragments from the restriction endonuclease digest as a representation.
- 46. A method according to claim 45, wherein the first restriction endonuclease creates 2 base degenerate overhangs in the genomes of multiple individuals and 1 to 12 non-palindromic linker adapters, which contain single stranded overhangs of the formula NN/N′N′ where NN/N′N′ is selected from the group consisting of AA/TT, AC/GT, AG/CT, CA/TG, GA/TC, and GG/CC, are used.
- 47. A method according to claim 45, wherein the first restriction endonuclease creates 3 base degenerate overhangs in the genomes of multiple individuals and 1 to 16 non-palindromic complementary linker adapters, which contain single stranded overhangs of the formula NAA, NAC, NAG, NAT, NCA, NCC, NCG, NCT, NGA, NGC, NGG, NGT, NTA, NTC, NTG, and NTT, with N being any nucleotide, are used.
- 48. A method according to claim 45, wherein the first restriction endonuclease is selected from the group consisting of DrdI, BglI, DraIII, AlwNI, PflMI, SanDI, SexAI, PpuI, AvaII, EcoO109, Bsu36I, BsrDI, BsgI, BpmI, SapI, and an isoschizomer thereof and the one or more second restriction endonuclease is MaeII, MspI, BfaI, HhaI, HinP1I, Csp6I, TaqI, MseI, or an isoschizomer thereof.
- 49. A method according to claim 44, wherein said creating representations of the genomes of multiple individuals comprises;
subjecting the genomes of multiple individuals to a first restriction endonuclease under conditions effective to cleave DNA so that a palindromic overhang is created in the genomes of multiple individuals; and adding complementary linker adapters to the overhangs in the presence of ligase and the first restriction endonuclease; and adding PCR primers to amplify fragments from the restriction endonuclease digest as a representation.
- 50. A method according to claim 49, wherein the first restriction endonuclease is BamHI, AvrII, NheI, SpeI, XbaI, KpnI, SphI, AatII, AgeI, XmaI, NgoMI, BspEI, MluI, SacII, BsiWI, PstI, ApaLI, or isoschizomers thereof.
- 51. A method according to claim 45, wherein PCR primers amplify fragments from the restriction endonuclease digest as a representation and a single linker-adapter primer is used to select fragments containing only one of the degenerate overhangs and the representation of the genome contains approximately 35,500 fragments.
- 52. A method according to claim 51, wherein a size selection of approximately 200 to 1,000 bp is applied prior to amplification, and the representation of the genome contains approximately 19,700 fragments.
- 53. A method according to claim 51, wherein a size selection of approximately 200 to 2,000 bp is applied prior to amplification, and the representation of the genome contains approximately 25,000 fragments.
- 54. A method according to claim 45, wherein PCR primers amplify fragments from the restriction endonuclease digest as a representation and more than one linker-adapter primer is used to select fragments containing some of the degenerate overhangs, a size selection of approximately 200 to 1,000 bp is applied prior to amplification, and the representation of the genome contains approximately 40,000 fragments.
- 55. A method according to claim 45, wherein PCR primers amplify fragments from the restriction endonuclease digest as a representation and more than one linker-adapter primer is used to select fragments containing some of the degenerate overhangs, a size selection of approximately 200 to 1,000 bp is applied prior to amplification, and the representation of the genome contains approximately 120,000 fragments.
- 56. A method according to claim 45, wherein PCR primers amplify fragments from the restriction endonuclease digest as a representation and a single linker-adapter primer is used to select fragments containing only one of the degenerate overhangs, a size selection of approximately 200 to 1,000 bp is applied prior to amplification, a PCR primer with one or two selective bases on the 3′ end is used during the PCR amplification step, and the representation of the genome contains approximately 5,000 fragments.
- 57. A method according to claim 44, wherein a representational library is created from the representation and the linker-adapters used to generate the representation are methylated and PCR primers used to amplify the representation are unmethylated, such that the PCR amplified fragments may be cleaved in both primers to allow for directional cloning of fragments into a cloning vector.
- 58. A method for large scale detection of single nucleotide polymorphisms on a DNA array comprising:
creating a representation of the genome from a clinical sample; providing a plurality of oligonucleotide probe sets, each set characterized by (a) a first oligonucleotide probe, having a target-specific portion and an addressable array-specific portion, and (b) a second oligonucleotide probe, having a target-specific portion and a detectable reporter label, wherein the oligonucleotide probes in a particular set are suitable for ligation together when hybridized adjacent to one another on a corresponding target nucleotide sequence, but have a mismatch which interferes with such ligation when hybridized to any other nucleotide sequence present in the representation of the sample; providing a ligase, blending the sample, the plurality of oligonucleotide probe sets, and the ligase to form a mixture; subjecting the mixture to one or more ligase detection reaction cycles comprising a denaturation treatment, wherein any hybridized oligonucleotides are separated from the target nucleotide sequences, and a hybridization treatment, wherein the oligonucleotide probe sets hybridize at adjacent positions in a base-specific manner to their respective target nucleotide sequences, if present in the sample, and ligate to one another to form a ligated product sequence containing (a) the addressable array-specific portion, (b) the target-specific portions connected together, and (c) the detectable reporter label, and, wherein the oligonucleotide probe sets may hybridize to nucleotide sequences in the sample other than their respective target nucleotide sequences but do not ligate together due to a presence of one or more mismatches and individually separate during the denaturation treatment; providing a support with different capture oligonucleotides immobilized at particular sites, wherein the capture oligonucleotides have nucleotide sequences complementary to the addressable array-specific portions; contacting the mixture, after said subjecting, with the support under conditions effective to hybridize the addressable array-specific portions to the capture oligonucleotides in a base-specific manner, thereby capturing the addressable array-specific portions on the support at the site with the complementary capture oligonucleotide; and detecting the reporter labels of ligated product sequences captured on the support at particular sites, thereby indicating the presence of single nucleotide polymorphisms.
- 59. A method according to claim 58, wherein the oligonucleotide probes in a set are suitable for ligation together at a ligation junction when hybridized adjacent to one another on a corresponding target nucleotide sequence due to perfect complementarity at the ligation junction, but, when the oligonucleotide probes in the set are hybridized to any other nucleotide sequence present in the sample, have a mismatch at a base at the ligation junction which interferes with such ligation.
- 60. A method according to claim 59, wherein the mismatch is at the 3′ base at the ligation junction.
- 61. A method according to claim 58, wherein said creating a representation of the genome from a clinical sample comprises:
subjecting the clinical sample to a first restriction endonuclease under conditions effective to cleave DNA so that a first non-palindromic overhang is created in the clinical sample; subjecting the clinical sample to a one of more second restriction endonuclease under conditions effective to cleave DNA so that a second overhang is created in the clinical sample; adding complementary linker adapters to the first and second overhangs in the presence of ligase, the first restriction endonuclease, and the one or more second restriction endonuclease; and adding PCR primers to amplify fragments from the restriction endonuclease digest as a representation.
- 62. A method according to claim 61, wherein the first restriction endonuclease creates 2 base degenerate overhangs in the clinical sample and 1 to 12 non-palindromic linker adapters, which contain single stranded overhangs of the formula NN/N′N′ where NN/N′N′ is selected from the group consisting of AA/TT, AC/GT, AG/CT, CA/TG, GA/TC, and GG/CC, are used.
- 63. A method according to claim 61, wherein the first restriction endonuclease creates 3 base degenerate overhangs in the clinical sample and 1 to 16 non-palindromic complementary linker adapters, which contain single stranded overhangs of the formula NAA, NAC, NAG, NAT, NCA, NCC, NCG, NCT, NGA, NGC, NGG, NGT, NTA, NTC, NTG, and NTT, with N being any nucleotide, are used.
- 64. A method according to claim 61, wherein the first restriction endonuclease is selected from the group consisting of DrdI, BglI, DraIII, AlwNI, PflMI, SanDI, SexAI, PpuI, AvaII, EcoO109, Bsu36I, BsrDI, BsgI, BpmI, SapI, and an isoschizomer thereof and the one or more second restriction endonuclease is MaeII, MspI, BfaI, HhaI, HinP1I, Csp6I, TaqI, MseI, or an isoschizomer thereof.
- 65. A method according to claim 58, wherein said creating representations of the genomes of a clinical sample comprises:
subjecting the clinical sample to a first restriction endonuclease under conditions effective to cleave DNA so that a palindromic overhang is created in the clinical sample; adding complementary linker adapters to the overhangs in the presence of ligase and the first restriction endonuclease; and adding PCR primers to amplify fragments from the restriction endonuclease digest as a representation.
- 66. A method according to claim 58, wherein the first restriction endonuclease is BamHI, AvrII, NheI, SpeI, XbaI, KpnI, SphI, AatII, AgeI, XmaI, NgoMI, BspEI, MluI, SacII, BsiWI, PstI, ApaLI, or isoschizomers thereof.
- 67. A method according to claim 61, wherein PCR primers amplify fragments from the restriction endonuclease digest as a representation and a size selection of approximately 200 to 2,000 bp is applied prior to amplification, improving the yield of fragments in the representation.
- 68. A method according to claim 61, wherein a single linker-adapter primer is used to select fragments containing only one of the degenerate overhangs and a PCR primer complementary to this linker adapter with one additional selective base on the 3′ end is used during the PCR amplification step.
- 69. A method according to claim 61, wherein more than one linker-adapter primers are used to select fragments containing some of the degenerate overhangs and PCR primers complementary to the more than one linker adapter with one additional selective base on the 3′ end are used.
- 70. A method according to claim 61, wherein PCR primers amplify fragments from the restriction endonuclease digest as a representation and a single linker-adapter primer is used to select fragments containing only one of the degenerate overhangs and PCR primers complementary to this linker adapter with one additional selective base on the 3′ end are used.
- 71. A method according to claim 61, wherein PCR primers amplify fragments from the restriction endonuclease digest as a representation and a single linker-adapter primer is used to select fragments containing only one of the degenerate overhangs and PCR primers complementary to this linker adapter with two additional selective bases on the 3′ end is used during the PCR amplification step.
- 72. A method according to claim 61, wherein PCR primers amplify fragments from the restriction endonuclease digest as a representation and a single linker-adapter primer is used to select fragments containing only one of the degenerate overhangs and a PCR primer complementary to this linker adapter with two additional selective bases on the 3′ end is used during the PCR amplification step.
- 73. A method according to claim 58, wherein said plurality of oligonucleotide probe sets comprises:
(a) a first oligonucleotide probe, having a target-specific portion complementary to a first allele and a first addressable array-specific portion, (b) a second oligonucleotide probe, having a target-specific portion complementary to a second allele and a second addressable array-specific portion and (c) a third oligonucleotide probe, having a target-specific portion and a detectable reporter label, wherein the first and third oligonucleotide probes set are suitable for ligation together when hybridized adjacent to one another on a corresponding first allele target nucleotide sequence, wherein the second and third oligonucleotide probes set are suitable for ligation together when hybridized adjacent to one another on a corresponding second allele target nucleotide sequence, but each set has a mismatch which interferes with such ligation when hybridized to any other nucleotide sequence present in the representation of the sample and, wherein the reporter labels of ligation product sequences captured to the support at particular sites during said detecting where the presence of reporter label at the complement of the first addressable array-specific portion indicates the presence of the first allele, while presence of reporter label at the complement of the second addressable array-specific portion indicates the presence of the second allele, for each set, thereby indicating allele differences.
- 74. A method according to claim 73, wherein the oligonucleotide probes in a set are suitable for ligation together at a ligation junction when hybridized adjacent to one another on a corresponding target nucleotide sequence due to perfect complementarity at the ligation junction, but, when the oligonucleotide probes in the set are hybridized to any other nucleotide sequence present in the sample, have a mismatch at a base at the ligation junction which interferes with such ligation.
- 75. A method according to claim 73, wherein the mismatch is at the 3′ base at the ligation junction.
- 76. A method according to claim 73, wherein the first and second alleles differ by a single nucleotide.
- 77. A method according to claim 73, wherein said method is used to quantify an allele imbalance between first and second alleles and the different capture oligonucleotides immobilized at particular sites are substantially the same for both the first allele target nucleotide sequence and the second allele target nucleotide sequence, wherein the oligonucleotide probe sets have either of two reporter labels which can be detected and distinguished independently so that ligation product sequences for the first allele target nucleotide sequence and the second allele target nucleotide sequence are captured on the support with the ratio of the first reporter label to the second reporter label at the complement of the first addressable array-specific portion divided by the ratio of the first reporter label to the second reporter label at the complement of the second addressable array-specific portion reflecting an initial allele ratio for each test and normal allele position and the relative imbalance of the first and second alleles in a test sample is determined by dividing the initial allele ratio for the test sample by the initial allele ratio for a normal sample, whereby (1) a ratio of >1 indicates that the first allele is in that number-fold greater in quantity than the second allele, (2) a ratio of <1 indicates that the second allele is in the inverse number-fold greater in quantity than the first allele, and (3) a ratio of about 1 indicates the first and second allele are present in about the same quantity.
- 78. A method according to claim 77, wherein said method is for quantifying loss of heterozygosity (LOH) or gene amplification in a tumor sample containing up to 50% stromal contamination by comparing allele imbalance at a tumor gene locus with allele balance at a control gene locus among a tumor and normal sample from the same individual and the different capture oligonucleotides immobilized at particular sites are substantially the same for both the first allele target nucleotide sequence and the second allele target nucleotide sequence, the two alleles being heterozygous at both the tumor gene locus and the control gene locus with the ratio of the first reporter label to the second reporter label at the complement of the first addressable array-specific portion for the tumor gene locus divided by the ratio of the first reporter label to the second reporter label at the complement of the first addressable array-specific portion for the control gene locus reflecting an initial tumor to control first allele ratio, wherein for both test and normal samples where the ratio of the first reporter label to the second reporter label at the complement of the second addressable array-specific portion for the tumor gene locus divided by the ratio of the first reporter label to the second reporter label at the complement of the second addressable array-specific portion for the control gene locus reflects an initial tumor to control second allele ratio and a presence of gene amplification or LOH of the first and second tumor alleles in the tumor sample is determined by dividing the initial tumor to control level for a given allele ratio for the tumor sample by the initial tumor to control level for a given allele ratio for the normal sample where (1) a ratio of >2 for a first tumor gene allele indicates the first tumor gene allele is amplified in the tumor sample, compared with the normal sample, (2) a ratio of >2 for a second tumor gene allele indicates the second tumor gene allele is amplified in the tumor sample, compared with the normal sample, (3) a ratio of <0.5 for a first tumor gene allele shows that the first tumor gene allele underwent LOH in the tumor sample, compared with the normal sample, (4) a ratio of <0.5 for a second tumor gene allele shows that the second tumor gene allele underwent LOH in the tumor sample, compared with the normal sample, and (5) a ratio of about 1 indicates a given tumor allele did not undergo LOH or amplification, compared with the normal sample.
- 79. A method according claim according to claim 73, wherein the method is utilized for quantifying an allele imbalance between a test sample and a normal sample with each set characterized by both first and second oligonucleotide probes, a percentage of each have a second distinct detectable reporter label, wherein the two reporter labels may be detected and distinguished independently such that the ratio of the first reporter label to the second reporter label at the complement of the first addressable array-specific portion divided by the ratio of the first reporter label to the second reporter label at the complement of the second addressable array-specific portion reflects an initial allele ratio for each test and normal allele position and a relative imbalance of the first and second alleles in the test sample is determined by dividing the initial allele ratio for the test sample by the initial allele ratio for the normal sample, wherein (1) a ratio of >1 indicates that the first allele is in that number-fold greater in quantity than the second allele, (2) a ratio of <1 indicates that the second allele is in the inverse number-fold greater in quantity than the first allele, and (3) a ratio of about 1 indicates that the first and second allele are present in about the same quantity, indicating there is no allele imbalance compared with the normal sample.
- 80. A method according to claim 79, wherein said method is carried out for quantifying loss of heterozygosity (LOH) or gene amplification in a tumor sample containing up to 50% stromal contamination by comparing allele imbalance at a tumor gene locus with allele balance at a control gene locus among a tumor and normal sample from the same individual with the two alleles being heterozygous at both the tumor gene locus and the control gene locus and the ratio of the first reporter label to the second reporter label at the complement of the first addressable array-specific portion for the tumor gene locus divided by the ratio of the first reporter label to the second reporter label at the complement of the first addressable array-specific portion for the control gene locus reflecting an initial tumor to control first allele ratio, such that for both test and normal samples, the ratio of the first reporter label to the second reporter label at the complement of the second addressable array-specific portion for the tumor gene locus divided by the ratio of the first reporter label to the second reporter label at the complement of the second addressable array-specific portion for the control gene locus reflects an initial tumor to control second allele ratio and the presence of gene amplification or LOH of the first and second tumor alleles in the tumor sample is determined by dividing the initial tumor to control for a given allele ratio for the tumor sample by the initial tumor to control for a given allele ratio for the normal sample, wherein (1) a ratio of >2 for a first tumor gene allele indicates the first tumor gene allele is amplified in the tumor sample, compared with the normal sample, (2) a ratio of >2 for a second tumor gene allele indicates the second tumor gene allele is amplified in the tumor sample, compared with the normal sample, (3) a ratio of <0.5 for a first tumor gene allele indicates the first tumor gene allele underwent LOH in the tumor sample, compared with the normal sample, (4) a ratio of <0.5 for a second tumor gene allele indicates the second tumor gene allele underwent LOH in the tumor sample, compared with the normal sample, and (5) a ratio of about 1 indicates a given tumor allele did not undergo LOH or amplification, compared with the normal sample.
- 81. A method according to claim 58, wherein said providing a plurality of oligonucleotide probe sets with each set characterized by (a) a first oligonucleotide probe, having a target-specific portion complementary to a first allele and a first detectable reporter label, (b) a second oligonucleotide probe, having a target-specific portion complementary to a second allele and a second distinct detectable reporter label and (c) a third oligonucleotide probe, having a target-specific portion and a addressable array-specific portion, wherein the first and third oligonucleotide probes set are suitable for ligation together when hybridized adjacent to one another on a corresponding first allele target nucleotide sequence, wherein the second and third oligonucleotide probes set are suitable for ligation together when hybridized adjacent to one another on a corresponding second allele target nucleotide sequence, but each set has a mismatch which interferes with such ligation when hybridized to any other nucleotide sequence present in the representation of the sample with the two reporter labels being detected and distinguished independently such that detection of the first reporter label at the complement of the addressable array-specific portion indicates a presence of the first allele, while detection of the second reporter label at the complement of the addressable array-specific portion indicates a presence of the second allele, for each set.
- 82. A method according to claim 81, wherein the mismatch is at a 3′ base at the ligation junction.
- 83. A method according to claim 81, wherein the first and second alleles differ by a single nucleotide.
- 84. A method for according to claim 81, wherein said method is used to quantify an allele imbalance between first and second alleles and the different capture oligonucleotides immobilized at particular sites are substantially the same for both the first allele target nucleotide sequence and the second allele target nucleotide sequence, wherein the oligonucleotide probe sets have either of two reporter labels which can be detected and distinguished independently so that ligation product sequences for the first allele target nucleotide sequence and the second allele target nucleotide sequence are captured on the support at particular sites with the ratio of the first reporter label to the second reporter label at the complement of the first addressable array-specific portion divided by the ratio of the first reporter label to the second reporter label at the complement of the second addressable array-specific portion reflecting an initial allele ratio for each test and normal allele position and the relative imbalance of the first and second alleles in the test sample is determined by dividing the initial allele ratio for the test sample by the initial allele ratio for the normal sample, whereby (1) a ratio of >1 indicates that the first allele is in that number-fold greater quantity over the second allele, (2) a ratio of <1 indicates that the second allele is in the inverse number-fold greater quantity over the first allele, and (3) a ratio of about 1 determines the first and second allele are present in about the same quantity.
- 85. A method according to claim 81, wherein said method is for quantifying loss of heterozygosity (LOH) or gene amplification in a tumor sample containing up to 50% stromal contamination by comparing allele imbalance at a tumor gene locus with allele balance at a control gene locus among a tumor and normal sample from the same individual and the different capture oligonucleotides immobilized at particular sites are substantially the same for both the first allele target nucleotide sequence and the second allele target nucleotide sequence, the two alleles being heterozygous at both the tumor gene locus and the control gene locus with the ratio of the first reporter label to the second reporter label at the complement of the first addressable array-specific portion for the tumor gene locus divided by the ratio of the first reporter label to the second reporter label at the complement of the first addressable array-specific portion for the control gene locus reflecting an initial tumor to control first allele ratio, wherein for both test and normal sample where the ratio of the first reporter label to the second reporter label at the complement of the second addressable array-specific portion for the tumor gene locus divided by the ratio of the first reporter label to the second reporter label at the complement of the second addressable array-specific portion for the control gene locus reflects an initial tumor to control second allele ratio and a presence of gene amplification or LOH of the first and second tumor alleles in the tumor sample is determined by dividing the initial tumor to control level for a given allele ratio for the tumor sample by the initial tumor to control level for a given allele ratio for the normal sample where (1) a ratio of >2 for a first tumor gene allele indicates the first tumor gene allele is amplified in the tumor sample, compared with the normal sample, (2) a ratio of >2 for a second tumor gene allele indicates the second tumor gene allele is amplified in the tumor sample, compared with the normal sample, (3) a ratio of <0.5 for a first tumor gene allele determines the first tumor gene allele underwent LOH in the tumor sample, compared with the normal sample, (4) a ratio of <0.5 for a second tumor gene allele determines the second tumor gene allele underwent LOH in the tumor sample, compared with the normal sample, (5) a ratio of about 1 determines a given tumor allele did not undergo LOH or amplification, compared with the normal sample.
- 86. A method according claim according to claim 81, wherein the method is utilized for quantifying an allele imbalance between a test sample and a normal sample with each set characterized by both first and second oligonucleotide probes, a percentage of each have a second distinct detectable reporter label, wherein the two reporter labels may be detected and distinguished independently such that the ratio of the first reporter label to the second reporter label at the complement of the first addressable array-specific portion divided by the ratio of the first reporter label to the second reporter label at the complement of the second addressable array-specific portion reflects an initial allele ratio for each test and normal allele position and the relative imbalance of the first and second alleles in the test sample is determined by dividing the initial allele ratio for the test sample by the initial allele ratio for the normal sample, wherein (1) a ratio of >1 indicates that the first allele is in that number-fold greater quantity over the second allele, (2) a ratio of <1 indicates that the second allele is in the inverse number-fold greater quantity over the first allele, and (3) a ratio of about 1 indicates that the first and second allele are present in about the same quantity, indicating there is no allele imbalance compared with the normal sample.
- 87. A method according to claim 81, wherein said method is carried out for quantifying loss of heterozygosity (LOH) or gene amplification in a tumor sample containing up to 50% stromal contamination by comparing allele imbalance at a tumor gene locus with allele balance at a control gene locus among a tumor and normal sample from the same individual with the two alleles being heterozygous at both the tumor gene locus and the control gene locus and the ratio of the first reporter label to the second reporter label at the complement of the first addressable array-specific portion for the tumor gene locus divided by the ratio of the first reporter label to the second reporter label at the complement of the first addressable array-specific portion for the control gene locus reflecting an initial tumor to control first allele ratio, such that for both test and normal sample, the ratio of the first reporter label to the second reporter label at the complement of the second addressable array-specific portion for the tumor gene locus divided by the ratio of the first reporter label to the second reporter label at the complement of the second addressable array-specific portion for the control gene locus reflects an initial tumor to control second allele ratio and the presence of gene amplification or LOH of the first and second tumor alleles in the tumor sample is determined by dividing the initial tumor to control for a given allele ratio for the tumor sample by the initial tumor to control for a given allele ratio for the normal sample, wherein (1) a ratio of >2 for a first tumor gene allele indicates the first tumor gene allele is amplified in the tumor sample, compared with the normal sample, (2) a ratio of >2 for a second tumor gene allele indicates the second tumor gene allele is amplified in the tumor sample, compared with the normal sample, (3) a ratio of <0.5 for a first tumor gene allele indicates the first tumor gene allele underwent LOH in the tumor sample, compared with the normal sample, (4) a ratio of <0.5 for a second tumor gene allele indicates the second tumor gene allele underwent LOH in the tumor sample, compared with the normal sample, and (5) a ratio of about 1 indicates a given tumor allele did not undergo LOH or amplification, compared with the normal sample.
- 88. A method to sequence directly from a PCR amplified nucleic acid molecule without primer interference comprising:
amplifying a nucleic acid molecule using PCR primers containing alternative nucleoside bases under conditions effective to produce PCR amplification products and cleaving the PCR primers both incorporated and unincorporated in the PCR amplification products under conditions which leave the PCR amplification products intact.
- 89. A method according to claim 88, wherein the PCR primers contain dUTP and starting primers and incorporated primers are cleaved using uracil-N-glycosylase (ung) prior to DNA sequencing.
- 90. A method according to claim 88, wherein the PCR primers contain ribonucleosides and starting primers and incorporated primers are cleaved with a base (0.1N NaOH) followed by neutralization with a buffer prior to DNA sequencing.
Parent Case Info
[0001] This application is a continuation-in-part of U.S. patent application Ser. No. 09/478,189, filed Jan. 5, 2000, and claims the benefit of U.S. Provisional Patent Application Serial No. 60/114,881, filed Jan. 6, 1999, which are hereby incorporated by reference in their entirety.
Government Interests
[0002] The present invention was made with funding from National Institutes of Health Grant No. GM38839. The United Stated Government may have certain rights in this invention.
Provisional Applications (1)
|
Number |
Date |
Country |
|
60114881 |
Jan 1999 |
US |
Continuation in Parts (1)
|
Number |
Date |
Country |
Parent |
09478189 |
Jan 2000 |
US |
Child |
10198235 |
Jul 2002 |
US |