Claims
- 1. An array of oligonucleotide probes for detecting a polymorphism in a target nucleic acid sequence using Principal Component Analysis, said array comprising at least one detection block of probes, said detection block including a first group of probes that are complementary to said target nucleic acid sequence except that the group of probes includes all possible monosubstitutions of positions in said sequence that are within n bases of a base in said sequence that is complementary to said polymorphism, wherein n is from 0 to 5, and a second and third group of probes complementary to marker-specific regions upstream and downstream of the target nucleic acid sequence, wherein the third group of probes differs from the second set of probes at single bases corresponding to known mismatch positions.
- 2. The array of claim 1, wherein the polymorphism is identified as a result of Principal Component Analysis of hybridization intensities of the array of probes.
- 3. The array of claim 1, wherein at least two alleles of the polymorphism are known.
- 4. The array of claim 1, wherein said first group of probes comprises a plurality of different probes that are complementary to overlapping portions of said target nucleic acid sequence.
- 5. The array of claim 1, wherein the monosubstitutions occur at a plurality of distances from a 3′ end of said probes.
- 6. The array of claim 1, wherein said detection block includes between about 8 and 88 different probes.
- 7. The array of claim 1, comprising between 1 and 1,000 different detection blocks, each of said detection blocks including probes complementary to different polymorphisms in said target nucleic acid sequence.
- 8. A method of identifying whether a target nucleic acid sequence includes a polymorphic variant using principal component analysis, comprising:
hybridizing said target nucleic acid sequence to said array comprising at least one detection block of probes, said detection block including a first group of probes that are complementary to said target nucleic acid sequence except that the group of probes includes all possible monosubstitutions of positions in said sequence that are within n bases of a base in said sequence that is complementary to said polymorphism, wherein n is from 0 to 5, and a second and third group of probes complementary to marker-specific regions upstream and downstream of the target nucleic acid sequence, wherein the third group of probes differs from the second set of probes at single bases corresponding to known mismatch positions; and determining hybridization intensities of the target nucleic acid and the marker-specific regions to identify said polymorphic variant.
- 9. The method of claim 8, wherein said target nucleic acid comprises a detectable label.
- 10. The method of claim 9, wherein said detectable label is a fluorescent group.
- 11. The method of claim 9, wherein said label is a binding group.
- 12. The method of claim 11, wherein said binding group is selected from biotin, avidin and streptavidin.
- 13. The method of claim 8, wherein the polymorphism is identified as a result of Principal Component Analysis of hybridization intensities of the array of probes.
- 14. The method of claim 8, wherein at least two alleles of the polymorphism are known.
- 15. The method of claim 8, wherein said step of determining comprises:
a) calculating the control difference between the average of the hybridization intensities of the second group of probes, the hybridization intensities comprising control perfect matches (PM), minus the average of the hybridization intensities, the hybridization intensities comprising control single-base mismatches (MM); b) calculating the possible perfect match intensity and a heteromismatch intensity from the hybridization intensities for each position of monosubstitutions of the first group of probes; c) calculating the difference between the possible perfect match intensity and the heteromismatch intensity for each position of monosubstitutions of the first group of probes; d) calculating a normalized difference(ND) by dividing the difference of step (c) by the control difference; (e) using principal component analysis, identifying a polymorphism by comparing normalized differences between individuals in a population.
- 16. The method of claim 15, wherein an ND is calculated for at least two allele of a polymorphism.
- 17. The method of claim 16, wherein homozygotes and heterozygotes are detected by combining principal component analysis for two alleles.
Parent Case Info
[0001] This application is a continuation-in-part of Ser. No. 08/563,762, filed Nov. 29, 1995, and and claims the benefit of U.S. provisional application 60/017,260, filed May 10, 1996, the disclosures of which are incorporated by reference in their entirety for all purposes.
Provisional Applications (1)
|
Number |
Date |
Country |
|
60017260 |
May 1996 |
US |
Continuations (1)
|
Number |
Date |
Country |
Parent |
08853370 |
May 1997 |
US |
Child |
09939119 |
Aug 2001 |
US |
Continuation in Parts (1)
|
Number |
Date |
Country |
Parent |
08563762 |
Nov 1995 |
US |
Child |
08853370 |
May 1997 |
US |