Primers and methods for detecting mutations in the procollagen II gene (COL2A1) that indicate a genetic predisposition for a COL2A1-associated disease

Information

  • Patent Grant
  • 5948611
  • Patent Number
    5,948,611
  • Date Filed
    Friday, February 3, 1995
    29 years ago
  • Date Issued
    Tuesday, September 7, 1999
    25 years ago
Abstract
The invention provides probes and primers for amplifying certain regions of genes for structural proteins of cartilage and methods for detecting mutations in these genes isolated from the nucleic acid of cells suspected of exhibiting mutant structural protein gene expression or having mutant structural protein genes. The invention also provides methods for determining a genetic predisposition for a disease that alters the structure or function of cartilage because of a mutation in a gene for a structural protein of cartilage in a mammal.
Description

FIELD OF INVENTION
The present invention relates to the field of compounds and methods for detecting genetic diseases linked to anomalies of genes for collagens and other structural proteins found in cartilage and joints.
BACKGROUND OF THE INVENTION
Osteoarthritis is a progressive disease of joints that is a cause of serious disability in large numbers of people. The disease is defined as a progressive degeneration of the cartilaginous surfaces of joints that leads to stiffness, pain, and loss of mobility. Degeneration of the cartilaginous surface of joints seen in osteoarthritis can have a number of causes. For example, severe trauma to a joint or a bacterial infection in a joint can produce degeneration of the joint that is either immediate or slowly progressive over many years. A number of metabolic disturbances are also know to produce degeneration of joints.
Cartilage and membranes that line joints are complex structures. A major source of the strength of cartilage is the fibrils of type II collagen. The fibrils of type II collagen are stretched into three-dimensional arcades primarily by the presence of another group of macromolecules called proteoglycans. Proteoglycans are highly charged and, therefore, absorb water and salts and thereby extend the arcades of type II collagen fibrils. As a result, a highly resilient structure is formed that can withstand the intermittent pounding and pressures that joints must undergo. In addition to proteoglycans and type II collagen, cartilage is known to contain at least four other kinds of collagens (types VI, IX, X and XI) in lesser amounts than type II collagen. It is very likely that additional collagens will be discovered in cartilage in the future. In addition, it is clear that the matrix of cartilage also contains a number of other proteins that are still poorly characterized and that may contribute to the structure and function of the tissue.
Collagens, proteoglycans and other proteins found in the matrix of cartilage are synthesized by cells embedded within the matrix. The matrix is actively synthesized during embryonic development of certain tissues and during periods of growth. The rates of synthesis and degradation of the matrix are less during adult life. However, throughout life, a continual slow synthesis and degradation of cartilage occurs, particularly in response to the pressures associated with physical activity.
Cartilage itself has several different functions in the body. During embryonic development, transient tracks of type II collagen and probably other components of cartilage are formed in many structures. The tracks appear to serve as a guide for cell migration and a template for formatting of skeleton and associated structures. in addition, cartilage serves as a precursor structure for many bones. During the development of long bones such as those of the arms and legs, cartilage is part of the growth plate in which cell growth occurs. More specifically, the cartilage grows away from the midpoint of the long bone and is continually degraded and gradually replaced by bone itself. An additional function of cartilage is to give shape and form to tissues such as the nose and ears. Many of the macromolecules found in cartilage are also present in the vitreous gel of the eye and account for the high viscosity of the vitreous. Still another major function of cartilage is to provide strength and resilience to structures such as the intervertebral disc of the spine. In joints, it provides not only strength and resilience, but also the smooth surfaces for motion under heavy loads.
The degeneration of joint cartilages that occurs in osteoarthritis is caused by a failure of the cartilage to maintain its structural integrity. In this process, the cartilage surface is eroded by physical pressures and is not adequately replaced by the new synthesis of cartilage. Instead of adequate repair of cartilage, secondary changes occur in the joint surface and in the joint. These changes include, for example, inflammatory responses characterized by invasion of white cells and macrophages, abnormal deposition of mineral in the form of calcium and phosphate within the joint space and in the cartilage itself, deposition of fibers of type I and other collagens that are not normally part of cartilage or the joint, abnormal growth of cartilage cells and matrix at locations adjacent to the joint surface and abnormal calcifications of the joints and associated structures. As part of the complex changes that occur, the cells of the cartilage or the invading cells from the blood stream begin to secrete degradative enzymes that further contribute to the degradation of the joint structures.
In the more severe diseases of cartilage known as chondrodysplasias, serious defects in the formation of cartilage are apparent early in life and there is a failure of joints to develop their normal size and shape. There is also a secondary failure of bone growth seen in these diseases. Moreover, there can be a failure of normal development of many tissues such as failure to achieve closure of normal partitions between oral and nasal passages, known as cleft palate, and improper development of the vitreous gel of the eye that causes severe myopia and retinal detachment.
Research has demonstrated that some forms of osteoarthritis and related conditions are caused by mutations in the genes that code for and, therefore, determine the structure of the collagens that are the major source of the strength of cartilage. Mutations of collagen that have been defined include, for example, mutations in the gene for type II collagen and its precursor type II procollagen. These mutations are of two general kinds. One kind of mutation decreases the synthesis of type II procollagen. The second kind of mutation leads to the synthesis of a defective form of type II procollagen. As a result of these mutations, there is either a decrease in the normal level of type II collagen in cartilage and other tissues that contain the protein or the formation of abnormal type II collagen fibrils that do not have the strength of normal type II collagen fibrils and, therefore, cause the cartilage of joints to be degraded by normal wear and tear. The two kinds of mutations can also produce drastic effects during normal growth and development. As a result, some individuals who inherit some of the mutated genes develop severe chondrodysplasias and die in utero or shortly after birth. Alternatively, such individuals can have serious deformities such as dwarfism which shows severe malformation of joints and may be associated with conditions of severe myopia, myopia with retinal detachment and blindness, cataracts, cleft palate, and unusual facial appearance. Other similar mutations in the same genes may produce much milder effects and cause progressive generalized osteoarthritis in which affected individuals are apparently normal until middle age when they develop progressive stiffness, pain and then immobility of many joints. Mutations of the gene for type II procollagen and collagen have been shown to cause these disorders. Research suggests that some of the conditions are caused by similar mutations in other genes that code for other structural macromolecules found in cartilage which contribute to its normal resistance to wear and tear. Methods and compounds are therefore desired for the analysis and detection of mutations in genes, both for type II procollagen and for a series of known collagens that are components of cartilage (types VI, IX, X and XI), as well as for still undiscovered collagens and other structural proteins that contribute to the normal strength and function of cartilage.
SUMMARY OF THE INVENTION
The invention provides probes and primers complementary to certain regions of collagen genes. Also provided are methods using such primers and probes for detecting mutations in a collagen gene sequence isolated from cells from individuals suspected of exhibiting mutant collagen gene expression or containing mutant collagen genes. Also included are methods to detect mutations in the collagen genes of members of the individual's family.
More specifically, the invention provides a method whereby an individual who has developed osteoarthritis or a related condition, or is suspected of developing osteoarthritis or a related condition, is tested to see if the individual has a mutation in the DNA of a gene for a structural protein that is a normal component of cartilage. This method provides that the DNA sequence of a collagen gene is examined in an individual with osteoarthritis or a related condition. The DNA sequence is also compared with corresponding regions of a standard DNA from a series of individuals known not to have the disease in question. Any difference in the base sequence from the DNA of the individual tested as compared to the standard sequence is then evaluated in terms of whether or not it indicates an increased likelihood of the individual suffering from osteoarthritis or related disorder. For the first member of a family tested, all or a substantial portion of DNA coding for the gene is sequenced and compared to the standard sequence.
The invention also provides that once the location of the gene mutation causing the disease is known, it can be looked for in members of the first individual's family. For each genetically predisposed individual family member, the mutation in the gene is expected to appear in the same position in the collagen gene tested. A difference in base sequence of the DNA from the individual tested as compared with the standard sequence can be evaluated to determine how the mutation will effect expression of the gene. The potential for the difference in DNA sequence to produce the disease can also be evaluated in terms of whether it changes an amino acid sequence that is critical for the normal functioning of the protein.
Methods further provide that DNA derived from the cells of the test sample is analyzed to determine whether or not the collagen gene contains a mutation. If a mutation is found in the gene, a rapid test can be devised for other members of the patient's family to determine whether or not they have the same mutation.
The methods of the invention are particularly useful in detecting mutations in human collagen genes. It is believed that these methods will also be useful in detecting mutant collagen genes in mammals.
Intronic sequences provided by the invention are useful for developing oligonucleotide primers to amplify and sequence genomic DNA from patients suspected of having mutations in the gene for the pro.alpha.I (II) chain of type II procollagen which cause various disorders.
Methods of the present invention for detecting mutations in the gene for type II procollagen can readily be applied to detection of mutations in genes coding for other structural proteins found in cartilage and associated tissues. For example, the nucleotide sequences of these genes can be used to design oligonucleotide primers to amplify genomic DNA or cDNA for the gene. The products obtained using PCR can then be used to define the base sequences of genomic DNA or cDNA. Mutations in the genes for these other collagens and structural proteins in matrix that cause osteoarthritis and related conditions can be detected in the same manner as mutations in the gene for type II procollagen.





BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 shows a gene map of type II procollagen (COL2A1) with introns and exons designated by lines and crosshatched boxes respectively.
FIG. 2 shows a schematic for PCR amplification of COL2A1 exon 9 including the region of the gene comprising polymorphism 225 and 252 and agarose gels containing the amplification products.
FIG. 3 shows a schematic for PCR amplification of COL2A1 exons 27 and 28 including the region of the gene comprising an intron 26 polymorphism and agarose gels containing the amplification products.





DETAILED DESCRIPTION OF THE INVENTION
The present invention concerns methods of diagnosing specific kinds of osteoarthritis in which there is systemic degeneration of many joints without any apparent external cause. In particular, the invention relates to progressive generalized osteoarthritis that produces degeneration of the cartilage of many joints. It also concerns diseases in which there is malformation of joints that is apparent at birth or during childhood and that leads to progressive degeneration of joints, as well as other symptoms such as dwarfism and severe malformation of the skeleton and other cartilaginous tissues. More particularly, the invention is directed to diagnosing diseases characterized by degeneration of joints in which the presence of the disease in several members of the same family or similarity between the disease of a given patient and diseases seen in families indicates that the disease has a genetic origin. Therefore, the invention concerns not only progressive generalized osteoarthritis, but also a group of related disorders that are generally defined as chondrodysplasias. Skeletal dysplasia or related disease involving abnormalities of growth and of cartilaginous structures and of tissues containing the same structural proteins as cartilage including chrondrodysplasias, epiphyseal dysplasia, metaphyseal dysplasia, spondyloepiphyseal dysplasia, spondylometaphyseal dysplasia and arthro-ophthalmopathy (the Wagner-Stickler syndrome) may be diagnosed using the methods of the invention. Skeletal disorders such as scoliosis and related conditions involving abnormalities in the cartilage and other components of the vertebrae column and back may also be diagnosed in accordance with the invention.
More specifically, the present invention provides methods for gene analysis whereby one can definitively establish the cause of osteoarthritis and of chondrodysplasias in individuals and in members of the individuals' families. Such methods will make it possible to identify individuals in families who are predisposed to develop these diseases. In the case of severe chondrodysplasias that produce crippling deformities and that may even be lethal, such methods will be useful for prenatal diagnosis. In the case of milder chondrodysplasias and osteoarthritis, the results of the gene analyses may be used to counsel individuals predisposed to develop the diseases concerning preventive exercise programs, life styles, choice of careers, and family planning. Also, the information generated by the gene analyses using the methods of the present invention may be used to develop new rational therapies for diseases of collagen. Moreover, the information can be used to develop animal models for such diseases. For example, information may be useful to develop transgenic mice that will provide new means of testing new agents to cure and prevent the diseases.
Methods of the invention provide that an individual who has developed osteoarthritis or a related condition or is suspected of developing osteoarthritis or a related condition, is tested to see a if mutation in the DNA of a gene for a structural protein for example, type II procollagen, is present. The methods of the invention further provide that after a mutation causing the disease or diseases in one individual is found, it can be sought in members of the first individual's family.
The gene COL2A1 which encodes the pro.alpha.I (II) chain of type II procollagen is provided and examined as a demonstration of the methods of the present invention. This demonstration in no way limits the scope of the claimed invention. In the first stage of the methods of the invention, the DNA sequence of the COL2A1 gene is examined in an individual with a cartilage and joint disease such as osteoarthritis. This DNA sequence is compared with corresponding regions of a standard DNA from a series of individuals known not to have the disease in question. It is believed that this strategy will be useful for any disease of collagen which exhibits at least one mutation in a collagen gene of the diseased mammal. The DNA sequences of the genes tested can be genomic DNA or cDNA prepared from RNA derived from a sample of cells or tissues taken from the individual. DNA may also be extracted from bodily fluids containing lysed cells. The standard DNA sequence and structure of the COL2A1 gene can be obtained by reference to known sequences or those set forth herein in FIG. 1 and listed in the literature or computerized data banks. Sequences of other collagen genes used in the methods of the present invention may also be obtained from databases or may be sequenced employing commonly used methods. See, for example, Sanger et al., DNA Sequencing With Chain-Terminating Inhibitors, Proc. Natl. Acad. Sci. USA 1977, 74, 5463-5467. Any difference in the base sequence from the DNA of the individual tested as compared to the standard sequence is then evaluated to determine whether it indicates an increased likelihood of the individual suffering from osteoarthritis or a related disorder. For the first member of a family tested, all or a substantial portion of DNA coding for the gene is sequenced and compared to the standard sequence. Sequencing of the first family member's DNA may be achieved by DNA sequencing techniques known in the art. These methods of the present invention are useful for all collagen genes.
As used herein, the term "collagen" includes procollagens and collagens types I to XVI, still undiscovered collagens similar to types I to XVI, and genes encoding proteins associated or comprising collagen polymers in tissue matrices.
The term "family member", as used herein, means individuals, including humans and other mammals, genetically related to one another in any degree, such as, for example, parent-child, siblings, cousins, etc. Such genetic relatedness can be determined using standard methods known in the art including, for example, pedigree analysis or DNA "fingerprinting".
Moreover, the term "individual", as used herein, denotes a mammalian individual of any species, including humans.
As illustrated by the demonstration using type II collagen, methods of detection of mutations in a collagen gene comprise several steps. One step involves selecting cells suspected of comprising a mutated collagen gene. Another step includes isolating genomic DNAs from selected cells or preparing cDNAs from selected cells. After the genomic DNA or cDNA is isolated, larger amounts of the gene sequences of interest are prepared by amplifying the DNA with the polymerase chain reaction (PCR). The DNA produced by the PCR are then analyzed for the presence of a disease-causing mutation. The preferred strategy of analysis is to first screen the PCR products with a relatively rapid technique such as denaturing gradient gel electrophoresis (DGGE) that enables one to decide whether or not a specific PCR product from one region of the gene does or does not have a mutation such as a single base difference between the two alleles of the gene. PCR products detected as probably having a mutation by such a technique are then analyzed further by a technique such as dideoxynucleotide sequencing that provides the detailed base sequence that defines the mutation. The methods comprise the aforementioned steps and further comprise comparing the sequence of the collagen gene or other genes for structural proteins of cartilage containing the mutation to corresponding regions of a family member's structural protein genes and determining if the mutation is present in the family member's genes.
It is believed that all of the methods of the present invention are useful to detect any mutation in all procollagen and collagen genes, including, for example, procollagens and collagens I to XVI, still undiscovered procollagens and collagens similar to types I to XVI, as well as other genes associated with collagen structure, such as proteoglycans. It is also believed that these methods are also useful in mammals other than humans.
A second step of the invention provides that once the location of the mutation in a gene causing the disease is known, it can be sought in members of the first individual's family. For each genetically predisposed individual family member, the mutation in the gene is expected to appear in the same position in the structural protein gene tested. For example, in Family A, the genetic mutation may be at position 30; and for Family B, the genetic mutation may be at position 505. In accordance with the methods of the invention, testing the family members can be done by comparing corresponding regions of family member's genes and determining if the mutation is present in the family member. This evaluation of a difference in base sequence of the DNA from the individual tested as compared with the standard sequence can be evaluated in terms of whether it is a disease-causing mutation by determining whether the mutation changes the level of expression of the gene in terms of the rate at which the gene is transcribed into RNA, the rate at which the initial RNA is processed into mRNA and the rate at which the mRNA can be effectively used to synthesize mature collagen matrices, such as pro.alpha.I (II) chains of type II procollagen. The potential for the difference in DNA sequence to produce the disease can also be evaluated in terms of whether it changes an amino acid sequence that is critical for the normal functioning of the protein by strategies used by those familiar in the art. The strategies include demonstration that the same gene mutation is not present in individuals unaffected by the disease in the same family of the general population, detailed linkage analysis of co-inheritance of the mutation with the disease phenotype in large families or a series of families with the same mutation, and structure-function studies on the mutated protein obtained either by isolation of the protein from tissues of affected individuals or expression of the mutated gene in a recombinant system.
In accordance with the methods of the invention, DNA is extracted from a test sample of cells of the family member to be tested by conventional techniques, such as lysis of the cells with sodium dodecyl sulfate (SDS) and digestion of protein with proteinase K, followed by extraction with phenol and chloroform, and ethanol precipitation as described by Maniatis et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., 1982, pp 280-281. A sample of cells can be taken from any type of tissues; for example, a piece of skin, a sample of blood, or by scraping of the interior of the mouth. Alternatively, mRNA can be extracted from the test sample and cDNA synthesized with reverse transcriptase, and the resulting cDNA used for analysis. Although cartilage cells cannot regularly be obtained from patients, many other cells including white blood cells have been shown to contain small amounts of the mRNAs for protein synthesized by cartilage and the mRNAs can be analyzed after conversion to cDNAs as reported by Chan and Cole, Journal of Biological Chemistry 1991, 266, 12487.
Following extraction, DNA derived from the cells of the test sample is analyzed to determine whether the structural protein gene contains a mutation. If a mutation is found in the gene, a rapid test can be devised for other members of the patient's family to determine whether they have the same mutation. DNA and cDNA from other structural protein genes can also be used in this method of the invention.
Although the methods of the invention have been demonstrated in the first instance in human beings, it is expected that they will be useful in other mammalian species, particularly commercially important species and in laboratory animals used as models of human disease. For example, it is believed that these methods will be particularly useful in detecting mutant collagen genes and transcripts in transgenic animals comprising mutant collagen genes or transcripts.
Some of the sequences for the normal COL2A1 gene can be found in the literature. For example, in Strom and Upholt, Nucl. Acids Res. 1984, 12, 1025-1038; Cheah et al., Proc. Natl. Acad Sci. USA 1985, 87, 2555-2559; Sangiorgi et al., Nucl. Acids Res. 1985, 13, 2207-2225; Nunez et al., Gene 1986, 44, 11-16; Su et al., Genomics 1989, 4, 483-441; Vikkula and Peltonen, FEBS Lett. 1989, 250, 171-174; Upholt, Collagen Vol. 4, CRC Press, Baton Rouge, Fla., 1989, pp 31-49; Ala-Kokko and Prockop, Genomics 1990, 8, 454-460. However, the sequences for a major part of the introns used for the synthetic oligonucleotide primers employed for the analyses described here are provided as part of the present invention. Those skilled in the art recognize that rapid analysis of a gene by these procedures requires a series of oligonucleotides that are specifically targeted to regions of the gene under analysis. However, many base sequences are not efficient targets for oligonucleotide primers. In the case of procollagen and collagen genes, the number of efficient target sites is limited by the high GC content and repetitive nature of the coding sequences. Also, most of the genes contain a large number of introns, and many of the introns are too short to offer many potential target sites (see FIG. 1). Therefore, the present invention includes efficient oligonucleotide primers for amplification of the type II procollagen gene by PCR, efficient oligonucleotide primers for analysis of PCR products of the type II procollagen gene by DGGE, and efficient oligonucleotide primers for dideoxynucleotide sequencing of PCR products of the type II procollagen. Table I presents specific oligonucleotide primers for amplification of the human type II procollagen gene by PCR and for dideoxynucleotide sequencing of the PCR products. Table II presents a series of neutral sequence variants detected in the type II procollagen gene that are important to avoid as target sites for primers, for defining haplotypes of the gene, and evaluating putative disease-causing mutations in the gene. Table III presents oligonucleotide primers that are efficient for amplifying regions of the gene by PCR in a form that makes the PCR products suitable for screening for mutations by the technique of DGGE.
The analytical methods that are part of the invention involve the PCR. One skilled in the art would recognize that there are many commonly employed schemes for amplifying nucleic acid sequences using PCR. See, for example, PCR Protocols, A Guide to Methods and Applications, Innis et al, Eds., Academic Press, New York, 1990, and Current Communications, Polymerase Chain Reaction, Ehrlich et al., Eds., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989. The PCR methods of the invention for amplifying nucleic containing at least one mutation in a collagen gene comprise several steps. The steps enumerated herein are merely exemplary of the steps commonly employed by one skilled in the art performing PCR. Other steps may be added to achieve DNA amplification. Included in the invention are steps for selecting cells suspected of comprising a mutated collagen gene and isolating nucleic acid from the cells. Initial PCR steps comprise contacting the nucleic acid with a first primer and a second primer, extending the first primer to create an extension product, contacting the extension product with a second primer, and extending the second primer to create another extension product. The extension products of the first round of synthesis may be longer than the extension products of the second round of synthesis. Extension products of the first round of synthesis are contacted with the primers comprising complementary sequences. The primers used in this step may be the same primers used in the first round of synthesis. These primers are extended to form "secondary" extension products. These "secondary" extension products may be shorter than the initial extension products from which they were synthesized. Amplification steps comprise amplifying the "secondary" extension products using PCR. The initial runoff DNA extension products will be diluted during amplification. The amplification steps will favor synthesis of the "secondary" extension product DNAs having a length determined by the probes used. Once the PCR steps are completed, steps are provided for detecting the presence or absence of said mutation in at least one extension product. It is believed that these methods will be useful for analyzing cartilage matrix protein genes, especially in humans. It is also believed that these methods are useful to analyze the matrix protein genes of other proteins found in collagen matrices, such as non-collagenous structural protein of cartilage. Further, the PCR methods of the invention are useful for determining if a mammal has a genetic predisposition for a disease exhibiting a mutant collagen gene. These methods include further steps of comparing the sequence of the collagen gene containing the mutation to corresponding regions of a family member's collagen genes and determining if the mutation is present in the family member's collagen genes. It is believed that these methods will also be useful in other mammals.
Methods of the present invention for detecting mutations in the gene for type II procollagen can readily be applied to detection of mutations in genes coding for other structural proteins found in cartilage and associated tissues. For example, the nucleotide sequences of the genes for types VI, IX, X and XI collagens can be used to design oligonucleotide primers to amplify genomic DNA or cDNA for the gene using PCR. The products obtained using PCR can then be used to define the base sequences of genomic DNA or cDNA. Therefore, mutations in the genes for these other collagens and structural proteins in matrix that cause osteoarthritis and related conditions can be detected in the same manner as mutations in the gene for type II procollagen.
The following examples are illustrative of the invention. It is understood that this invention is not limited by these illustrative examples but solely by the claims appended hereto.
EXAMPLES
Example 1
A series of procedures were developed for amplifying important regions of the gene for type II procollagen (COL2A1) by the PCR so that adequate amounts of DNA were generated for analysis. A further series of procedures made it possible to directly analyze the DNA produced by the PCR and define its base sequences.
FIG. 1 presents a diagram of the human COL2A1 gene indicating the location of the 54 exons of the gene. The figure also indicates, using short horizontal lines, the regions of the genes that were amplified with the use of appropriate oligonucleotide primers and conditioned for amplification of the gene by PCR. Table I presents the sequencing and location of the oligonucleotide primers defined by the specific base sequences of the gene beginning with number 1 at the 5'-end of the gene. Ala-Kokko and Prockop, Genomics 1990, 8, 454-460, disclosed all the coding sequences of the COL2A1 gene but no more than 40 intronic bases immediately flanking most of the introns.
Examples of primers effective for amplification of sequences by PCR and sequencing of the PCR products presented in Table I are within regions of the introns which have not been disclosed previously. These sequences and primers are included in the present invention. The steps whereby genomic DNA from an individual from sources such as white blood cells or any other cells in the body can be used in the procedure to detect variations in sequences are as follows. The DNA template is amplified using PCR. The PCR products can be loaded directly on an agarose gel and electrophoresed and analyzed or they can be diluted and subjected to a second round of PCR amplification. PCR products obtained from the first PCR reaction in which the regions of the gene symmetrically amplified were examined by electrophoresis in an agarose gel stained with ethidium bromide. The conditions of the experiment were adequate to generate an intense single band of DNA, an observation indicating that the target region was selectively amplified. If a second PCR amplification is carried out, the products are purified and sequenced. The second PCR in which the product of the first PCR is asymmetrically amplified generates a single-stranded DNA which can be directly used for sequencing. The results indicate the presence of two major bands of DNA, one double-stranded DNA and the second single-stranded DNA appropriate for sequencing. Analysis of the base sequences of PCR products from the second asymmetric PCR were analyzed by dideoxynucleotide sequencing using methods known in the art. The sequencing autoradiograms were of high enough quality to be able to detect single-base variations in which one allele of the gene has one base and the same position of the second allele from the same gene has a different base. For example, patients 1 and 5 had an A in one allele and a T in the other allele at position +5 of intron 9 of the COL2A1 gene, whereas patients 2, 3, and 4 had only a T in this position. Moreover, patient 4 had an A and a G in position +45 of intron 9, whereas the other patients had only a G. Similarly, DNA from one patient showed both a C and a T at position -47 of intron 26 of the COL2A1 gene, whereas a different patient had a C at this position and the patient in the right-hand four lanes had a T in this position. These results demonstrate that the procedures outlined here are adequate to detect single-base variations in a single allele.
TABLE I__________________________________________________________________________Synthetic Oligonucleotide Primers for Amplification of the Type IIProcollagen Gene by PCR and Sequencing ofthe PCR ProductsPrimer Alt. Region/ Primer PCR/SEQ PrimerName/# Code Exon type direction Position Sequence SEQ ID NO__________________________________________________________________________ 1 PCR sense 1 PCR antisenseNA46 1 Seq senseNA36 2B/5B PCR senseNA10B 2B/5B PCR antisenseNA37 2B PCR antisenseNA39 2B Seq senseNA24 3/4 PCR senseNA23 3/4 PCR antisenseNA28 3 Seq antisenseNA25A 4 Seq senseNA9 5A/5B PCR senseCW-2 5B PCR antisense 7688 tgcctaatatgtgactcttc (20) 1NA44 5A Seq antisenseNA33 5 Seq senseNA11A 5A Seq senseNA14 6 PCR senseNA15 6 PCR antisenseNA19 6 Seq antisenseNA31 7 PCR senseNA32 7 PCR antisenseNA20 7 Seq antisenseNA40 7 Seq senseNA17 8 PCR/Seq antisenseNA18 8 PCR antisenseNA21 8 Seq antisense23 DH-15 9 PCR sense 281 agcctgtgctatctgctgcaat (22) 222 DH-14 9 PCR sense 300 aatcccactatgatctctgc (20) 324 DH-16 9 Seq sense 395 tccattgcttaggtgt (16) 425 DH-17 9 PCR antisense 629 cagtcttgctcctcaagatac (21) 51 10 PCR sense 900 caggatgtctacaaaggatgc (21) 62 10 Seq sense 1081 ttgcccatggcgtatg (16) 799 10 Seq sense 1169 catgagtgagccggtacagaag 8 (22)3 10 PCR antisense 1703 caaagtggaggtgttcagag (20) 94 11/12 PCR sense 1424 gtcacttctgagatgaaacgcc 10 (22)5 11 Seq sense 1583 agctgtccaagtgtg (15) 11100 11 Seq sense 1590 caagtgtggggattcgagacaac (23) 12CW-14 11 Seq sense 1640 cctcctgcagccagggca (18) 13102 12 Seq sense 1770 gtatcacgggtgagaag (17) 1412 12 Seq sense 1771 tatcacgggtgagaag (16) 15101 12 Seq sense 1824 ctttggggtgcgtgcatttc (20) 166 12 Seq sense 1849 acttgggtttcccag (15) 17CW-11 11 PCR antisense 1875 tcaatcagacttctgggaaacc (22) 187 11/12 PCR antisense 2166 tcagctcgcactgacacaaac (21) 198&8A 13/14 PCR sense 2142 tgaagtttgtgtcagtgcgagc (22) 20103 13 Seq sense 2165 gagatgaccagggcttttg (19) 219 13 Seq sense 2168 atgaccagggcttttg (16) 22104 14 Seq sense 2689 catcaggaggtccttg (16) 23105 14 Seq sense 2725 ctccctctcctctggtatc (19) 2410 14 Seq sense 2794 ctcatgcttaggctg (15) 2511 13/14 PCR antisense 2927 ctaaagtgctcggcaaatggtg (22) 2613 14 Seq sense ? atcaggaggtccttg (15) 2726 15 PCR sense 5780 tcgcacagacaccaaaactgca (22) 2827 15 Seq sense 5843 caggcacagtgtgtccttcgt (21) 2928 15 PCR antisense 6070 atacaccctcgagactgccttg (22) 3029 15 PCR antisense 6110 ctttccagtagacatcagagtg (22) 3130 DH-22 16 PCR sense 6241 cttctcaccacagatgtagtca (22) 3232 DH-24 16 PCR sense 6354 gatatggagtgaaatcagtac (21) 3331 DH-23 16 PCR sense 6361 agtgaaatcagtacaga (17) 3433 DH-25 16 PCR antisense 6609 gttgttgagggagcaatgagcaag (24) 3534 DH-26 16 PCR antisense 6704 caggtgagactgcgagtgtctg (22) 3635 17 PCR sense 7879 aactgtgtgtgaaccgacatgttc (24) 3736 17 Seq sense 7920 catgtgcataatttagtgctgt (22) 38CW-3 17/19 PCR sense 7934 agtgctgtcgttgcagctgg (20) 3937 17 PCR antisense 8217 cacaactgtcagagcaaagtac (22) 4038 DH-30 17 PCR antisense 8244 cagaatgaaggtttggtggttg (22) 4139 DH-31 18/19 PCR sense 8296 cttgaaacacatagtgggaatgtc (24) 4240 DH-32 18 Seq sense 8321 ctgaaatggacagcacctatg (21) 43CW-5 18 Seq sense 8351 tggatctggatcctggag (18) 4441 DH-33 19 Seq sense 8485 cgtggactttgctac (15) 45CW-1 19 Seq sense 8505 gagagcccagtcctgcct (18) 4642 DH-34 18/19 PCR antisense 8710 gactccagagatgtcagtggaac (23) 4743 DH-38 18/19 PCR antisense 8838 caggtcctcacaccagattctctc (24) 4844 DH-36 20 CPR sense 8688 gttccactgacatctctggagtca (24) 4945 DH-37 20 Seq sense 8725 ctctttcccatgctctc (17) 5046 DH-38 20 PCR antisense 8961 ctgtgcctcatagaacagcag (21) 5147 DH-39 20 PCR antisense 9033 cataatctgaaaggacccagattg (24) 5248 DH-40 20 PCR antisense ? gttaagtctcctccaggcataatc (24) 5349 DH-41 20 PCR antisense 9212 gaagctgtatctgggccttctca (23) 5450 DH-42 21 PCR sense 9238 gtgaacagttggatctttag (20) 5551 DH-43 21 PCR sense 9294 ctttatggcctctcgtcctcaag (23) 56CW-5 21/24 PCR sense 9330 ctgaaacagttgccaaggctac (22) 5752 DH-44 21 Seq sense 9336 cagttgccaaggctacttc (19) 5853 DH-45 21 Seq sense 9355 cttcatactctagatc (16) 59CW-8 21 Seq sense 9380 tccaaggccaggtgaagg (18) 6054 DH-46 21 PCR antisense 9617 cagaacacggaccacaaggact (22) 6155 DH-47 21 PCR antisense 9660 gagaaagaggaggatgacatg (21) 62IH-1 24 PCR sense 9729 gtctgagctccttcccaggaa (21) 63106 22/24 PCR sense 9763 cagaagttaacctctgagaatc (22) 64 22 Seq senseCW-9 22 Seq sense 9768 gttaacctctgagaatcctg (20) 65107 22 Seq sense 9817 gttggtgggttagtgggatg (20) 66CW-10 23 Seq sense 9929 caggtcaagatggtctgg (18) 67108 23 Seq sense 9960 gagtgggagaagaggggctg (20) 68109 24 Seq sense 10165 cttggcttcagaccctcag (19) 69IH-2B 24 Seq sense 10199 ctccttccagccctgcactg (20) 70CW-7 21/24 PCR antisense 10350 ctcagaggatagacttac (18) 71110 24 PCR antisense 10404 catctctcttttcccttgcttc (22) 72111112IH-3 24 PCR antisense 10423 gcctccctaacccaaactccatct (24) 73IH-7 25/26 PCR sense 10651 tagatgctgagcatgtgtgg (20) 74IH-8A 25 Seq sense 10703 cttagtggatgttgggtggat (21) 75IH-8B 26 Seq sense 11157 ttggctgtccatcaggatgt (20) 76IH-9 25/26 PCR antisense 11434 gatcaacactcaatactgagg (21) 77PB10 27/28 PCR sense 11428 gttgatctctgtggctagac (20) 78PB10A 27/28 PCR sense 11532 gcttccatgctgagaacagc (20) 79IH-11A * 27 Seq sense 11588 gtgtggaaatggagctcagc (20) 80PB4 27 Seq antisense 11900 tctggtgtatcagctcagcc (20) 811H-11B * 28 Seq sense 12005 tgagtggtgcaggaagacgc (20) 821H-12 28 Seq antisense 12241 accgatagtgccaagaaagctgc 83 (23)PB12A 27/28 PCR antisense 12295 gctcgatgcctggacactgc (20) 84PB12-A1 27/28 PCR antisense 12413 cgaagtgaccaagcgttagca (21) 85IH-10 * 27/28 PCR sense 7 cctcagtattgagtgttgatc (21) 8691 29/31 PCR sense 12233 ctggacagcagcaggcactatc 87 (22)92 29 Seq sense 12266 cacacctcttgcagtgtccag (21) 88CW-12 29/31 PCR sense 12313 ctgtcactgctgctgcttcc (20) 8917 29/31 PCR sense 12341 ggtctgccctatactgt (17) 9093 29 Seq sense 12341 ggtctgccctatactgtg (18) 9119 29 Seq sense 12375 ggcagcaaactcactc (16) 9218 29/31 PCR antisense ? gactccaggctaccacgaa (19) 9395 30 Seq sense 12624 catggaggagtgatattc (18) 9494 30 Seq sense 12646 ctgctgtggagaattgttc (19) 9520 30 Seq sense 12647 tgctgtggagaattgttc (18) 9697 31 Seq sense 12779 caatgcgggctgcctccttg (20) 97 31/32 PCR sense14 31 PCR sense 12781 aatgcgggctgcctcctt (18) 9896 31 Seq sense 12824 tgctcctttccccacctc (18) 9916 31 Seq sense 12842 tgctcctttccccacct (17) 10021 31 Seq sense 12842 ctgcttctccctggacct (18) 10198 29/31 PCR antisense 13054 gactccaggctaccacgaag (20) 102CW-13 20/31 PCR antisense 13306 ccaggcattccctgaagacc (20) 10315 31 PCR antisense 13390 agccacagctttggtga (17) 104IH-16-1 32/33 PCR sense 13076 cgggctcaggaggaatgaag (20) 105IH-16 * 32/33 PCR sense 13089 aggaatgaagaagaacagaagtg (23) 106IH-17-A * 32 Seq sense 13163 ctggttacccaggctccatg (20) 107IH-17-B-1 33 Seq sense 13442 tgatgaaggtttctgttagc (20) 108PB-17-B-2 32 Seq antisense 13447 tcatcaccaggtgccataag (20) 109IH-18 32/33 PCR antisense 13786 gatcctaatgcccagcagt (19) 110IH-19 34/35 PCR sense 13774 gctgggcattaggatccagc (20) 111IH-19-C 34/35 PCR sense 13801 gtctgggcagtctgccactg (20) 112PB-20-A * 34 Seq sense 13852 gcaactgcagggacttctct (20) 113PB-20-B 35 Seq sense 14298 gctgcacagtaacacaggct (20) 114IH-21 * 34/35 PCR antisense 14557 actgactccctggctctctg (20) 115IH-21-C 34/35 PCR antisense 14724 gcaggcagaggctctgttaa (20) 116PB-22 36/37 PCR sense 14663 gcacgtcactcccatcatgt (20) 117IH-22 * 36/37 PCR sense 14705 ttaacagagcctctgcctgc (20) 118IH-23-A 36 Seq sense 14805 agcagaagcaggtccaggcag (20) 119PB-23-B 37 Seq sense 15079 cgcagatactcacagagtct (20) 120IH-24 36/37 PCR antisense 15375 ctgcgaaccatcctctgcgc (20) 121PB-24 36/37 PCR antisense 15505 caggagatcagcagcttggt (20) 122PB-25 38/39 PCR sense 15483 accaagctgctgatctcctg (20) 123IH-25 38/39 PCR sense 15653 tttagtgccaagaaagctgc (20) 124IH-26-A 38 Seq sense 15708 acagaagcccaccgtcttcc (20) 12556 39 PCR sense 16081 acagcctgtgcctgcttctatg (22) 12657 39 PCR sense 16086 ctgtgcctgcttctatgaccaga (23) 12758 39 PCR sense 16165 ctttccataccaggctctgaga (22) 128PB-26-B 39 Seq sense 16168 tccataccaggctctgagac (20) 12959 39 PCR sense 16170 cataccaggctctgaga (17) 13060 39 PCR antisense 16430 gaacggactcagaggagtgaag (22) 131PB-27 38/39 PCR antisense 16457 tcagttagctactcctccag (20) 132IH-27 38/39 PCR antisense 16700 ccagtgagttcatcaccact (20) 13361 DH-53 40/41 PCR sense 16880 tgtctcacatggtgagaaggttg (23) 13462 DH-54 40/41 PCR sense 16969 cagagaggaaactgctgtcact (22) 13563 40 Seq sense 16990 tgaggccacagtgactttg (19) 13664 41 Seq sense 17283 cttctgagctcacaga (16) 13765 41 Seq sense 17354 gacagagctgtgctgaga (18) 13866 DH-58 40/41 PCR antisense 17599 tgagggaggtagaagccttg (20) 13967 DH-59 40/41 PCR antisense 17638 cttcaggagagggcagacaag (21) 14068 DH-60 42/43 PCR sense 17580 caaggcttctacctccctca (20) 14169 DH-61 42 Seq sense 17606 tcaggaactgtccccttgt (19) 14270 DH-62 42/43 PCR sense 17618 cttgtctgccctctcctgaag (21) 14371 DH-63 43 Seq sense 18046 caaagtgtgagtgagttg (18) 14472 42/43 PCR antisense 18326 tctaggctgagagagactttgttc (24) 14573 42/43 PCR antisense 18328 cttctaggctgagatgagacttg (23) 14674 44/45 PCR sense 18299 gttggaacaagtctcatctca (21) 14776 44 Seq sense 18303 gaacaagtctcatctca (17) 14875 44 Seq sense 18313 catctcagcctagaag (16) 14977 45 Seq sense 18572 cagtctctggactaag (16) 15078 45 Seq sense 18588 gagcagtggcctcagatg (18) 15181 DH-73 44/45 PCR antisense 18840 gtcccacccaagctgaggaatc (22) 15280 44/45 PCR antisense 18889 gacagacaccgattgagtcaggtca (25) 15379 44/45 PCR antisense 18895 gaacaagacagacaccgattg (21) 154PB37 46/47 PCR sense 18571 tccgcagtctctggactaag (20) 155IH-37-1 46 Seq sense 18720 tgagtatccaagtgtcctgc (20) 156IH-38-B 47 Seq sense 19206 atatgtctgtgctgaccgtg (20) 157IH-39 46/47 PCR antisense 19526 tctgtctgacagcggaggca (20) 158IH-38-A * 46 Seq sense ? gggattcctcagcctgggtg (20) 159IH-40 48/49 PCR sense 19622 actgagcatgtgaagaactg (20) 160IH-41-A 48 Seq sense 19661 tatcaggacagccacctacc (20) 161IH-41-B 49 Seq sense 21027 acactctagtacattctagc (20) 162IH-42-A 48/49 PCR antisense 20628 aaccctctggcggaaacttc (20) 16382 DH-74 49 PCR sense 20042 ctaaggaagaaatagacatg (20) 16483 DH-75 49 PCR sense 20047 gaagaaatagacatggtgctgt (22) 16584 DH-76 49 Seq sense 20126 gacactctagtacattc (17) 16685 DH-77 49 Seq sense 20130 ctctagtacattctag (16) 16786 DH-78 49 Seq sense 20135 gtacattctagcaaatg (17) 16887 DH-79 49 PCR antisense 20579 catctcctcccttgctgctag (21) 169PB42 48/49 PCR antisense 20720 atggagtccaccctgaggtc (20) 170IH-43-A 50/51 PCR sense 20762 cttagggctggacttagctc (20) 171IH-43 * 50/51 PCR sense 20849 atcccttgtccctgtaggcc (20) 172IH-44-A 50 Seq sense 20883 caggcctgggtctctcaagc (20) 173IH-44-B 51 Seq sense 21395 ccttgaaccatgaactcttg (20) 174IH-45-A 50/51 PCR antisense 21892 tctctcacctgtcactcagc (20) 175IH-45 * 50/51 PCR antisense 21895 tctctcacctgtcactcagc (20) 176IH-46 52 PCR sense 22098 tggagaggcctttggcaagc (20) 177IH-47 52 Seq sense 22158 ttgtgggctctgatgctcgc (20) 178IH-48 52 PCR antisense 22514 tcaggtcagccattcagtgc (20) 179__________________________________________________________________________
According to some embodiments:
SEQ ID NO:3 and SEQ ID NO:5 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:4 as the primer sequence.
SEQ ID NO:6 and SEQ ID NO:9 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:8 as the primer sequence.
SEQ ID NO:10 and SEQ ID NO:19 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:12 as the primer sequence.
SEQ ID NO:20 and SEQ ID NO:26 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:12 as the primer sequence.
SEQ ID NO:10 and SEQ ID NO:19 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:16 as the primer sequence.
SEQ ID NO:20 and SEQ ID NO:26 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:16 as the primer sequence.
SEQ ID NO:20 and SEQ ID NO:26 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:24 as the primer sequence.
SEQ ID NO:20 and SEQ ID NO:26 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:23 as the primer sequence.
SEQ ID NO:28 and SEQ ID NO:30 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:29 as the primer sequence.
SEQ ID NO:32 and SEQ ID NO:35 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:34 as the primer sequence.
SEQ ID NO:37 and SEQ ID NO:41 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:38 as the primer sequence.
SEQ ID NO:39 and SEQ ID NO:47 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:44 as the primer sequence.
SEQ ID NO:42 and SEQ ID NO:51 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:43 as the primer sequence.
SEQ ID NO:42 and SEQ ID NO:51 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:45 as the primer sequence.
SEQ ID NO:49 and SEQ ID NO:51 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:50 as the primer sequence.
SEQ ID NO:55 and SEQ ID NO:61 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:58 as the primer sequence.
SEQ ID NO:55 and SEQ ID NO:73 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:60 as the primer sequence.
SEQ ID NO:57 and SEQ ID NO:71 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:60 as the primer sequence.
SEQ ID NO:55 and SEQ ID NO:73 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:65 as the primer sequence.
SEQ ID NO:57 and SEQ ID NO:71 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:65 as the primer sequence.
SEQ ID NO:64 and SEQ ID NO:72 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:69 as the primer sequence.
SEQ ID NO:64 and SEQ ID NO:72 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:68 as the primer sequence.
SEQ ID NO:63 and SEQ ID NO:73 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:70 as the primer sequence.
SEQ ID NO:68 and SEQ ID NO:72 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:70 as the primer sequence.
SEQ ID NO:74 and SEQ ID NO:77 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:75 as the primer sequence.
SEQ ID NO:74 and SEQ ID NO:77 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:76 as the primer sequence.
SEQ ID NO:78 and SEQ ID NO:85 are used as PCR primers. The resulting PCR fragment is antisense sequenced using SEQ ID:81 as the primer sequence.
SEQ ID NO:79 and SEQ ID NO:84 are used as PCR primers. The resulting PCR fragment is antisense sequenced using SEQ ID:81 as the primer sequence.
SEQ ID NO:78 and SEQ ID NO:85 are used as PCR primers. The resulting PCR fragment is antisense sequenced using SEQ ID:83 as the primer sequence.
SEQ ID NO:79 and SEQ ID NO:84 are used as PCR primers. The resulting PCR fragment is antisense sequenced using SEQ ID:83 as the primer sequence.
SEQ ID NO:87 and SEQ ID NO:93 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:90 as the primer sequence.
SEQ ID NO:87 and SEQ ID NO:93 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:94 as the primer sequence.
SEQ ID NO:87 and SEQ ID NO:93 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:97 as the primer sequence.
SEQ ID NO:87 and SEQ ID NO:93 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:95 as the primer sequence.
SEQ ID NO:98 and SEQ ID NO:104 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:101 as the primer sequence.
SEQ ID NO:98 and SEQ ID NO:104 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:97 as the primer sequence.
SEQ ID NO:105 and SEQ ID NO:110 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:108 as the primer sequence.
SEQ ID NO:105 and SEQ ID NO:110 are used as PCR primers. The resulting PCR fragment is antisense sequenced using SEQ ID:109 as the primer sequence.
SEQ ID NO:112 and SEQ ID NO:116 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:113 as the primer sequence.
SEQ ID NO:112 and SEQ ID NO:116 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:114 as the primer sequence.
SEQ ID NO:117 and SEQ ID NO:122 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:119 as the primer sequence.
SEQ ID NO:117 and SEQ ID NO:121 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:120 as the primer sequence.
SEQ ID NO:117 and SEQ ID NO:122 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:119 as the primer sequence.
SEQ ID NO:117 and SEQ ID NO:121 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:120 as the primer sequence.
SEQ ID NO:123 and SEQ ID NO:132 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:125 as the primer sequence.
SEQ ID NO:123 and SEQ ID NO:132 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:129 as the primer sequence.
SEQ ID NO:134 and SEQ ID NO:140 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:137 as the primer sequence.
SEQ ID NO:135 and SEQ ID NO:139 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:137 as the primer sequence.
SEQ ID NO:134 and SEQ ID NO:140 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:136 as the primer sequence.
SEQ ID NO:134 and SEQ ID NO:140 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:138 as the primer sequence.
SEQ ID NO:135 and SEQ ID NO:139 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:136 as the primer sequence.
SEQ ID NO:135 and SEQ ID NO:139 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:138 as the primer sequence.
SEQ ID NO:141 and SEQ ID NO:145 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:143 as the primer sequence.
SEQ ID NO:143 and SEQ ID NO:146 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:143 as the primer sequence.
SEQ ID NO:141 and SEQ ID NO:145 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:144 as the primer sequence.
SEQ ID NO:143 and SEQ ID NO:146 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:144 as the primer sequence.
SEQ ID NO:147 and SEQ ID NO:153 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:148 as the primer sequence.
SEQ ID NO:147 and SEQ ID NO:154 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:148 as the primer sequence.
SEQ ID NO:147 and SEQ ID NO:152 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:148 as the primer sequence.
SEQ ID NO:147 and SEQ ID NO:153 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:151 as the primer sequence.
SEQ ID NO:147 and SEQ ID NO:154 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:151 as the primer sequence.
SEQ ID NO:147 and SEQ ID NO:152 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:151 as the primer sequence.
SEQ ID NO:155 and SEQ ID NO:158 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:156 as the primer sequence.
SEQ ID NO:155 and SEQ ID NO:158 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:157 as the primer sequence.
SEQ ID NO:160 and SEQ ID NO:163 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:161 as the primer sequence.
SEQ ID NO:160 and SEQ ID NO:170 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:162 as the primer sequence.
SEQ ID NO:160 and SEQ ID NO:163 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:4 as the primer sequence.
SEQ ID NO:160 and SEQ ID NO:170 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:4 as the primer sequence.
SEQ ID NO:171 and SEQ ID NO:176 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:174 as the primer sequence.
SEQ ID NO:171 and SEQ ID NO:176 are used as PCR primers. The resulting PCR fragment is sequenced as an antisense using SEQ ID:176 as the primer sequence.
SEQ ID NO:171 and SEQ ID NO:176 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:173 as the primer sequence.
SEQ ID NO:177 and SEQ ID NO:179 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:178 as the primer sequence.
Other examples of primers effective for amplification of sequences by PCR and sequencing of the PCR products presented in Table IA are within regions of the introns which have not been disclosed previously. These sequences and primers are included in the present invention. The steps whereby genomic DNA from an individual from sources such as white blood cells or any other cells in the body can be used in the procedure to detect variations in sequences are as follows. The DNA template is amplified using PCR. The PCR products can be loaded directly on an agarose gel and electrophoresed and analyzed or they can be diluted and subjected to a second round of PCR amplification. PCR products obtained from the first PCR reaction in which the regions of the gene symmetrically amplified were examined by electrophoresis in an agarose gel stained with ethidium bromide. The conditions of the experiment were adequate to generate an intense single band of DNA, an observation indicating that the target region was selectively amplified. If a second PCR amplification is carried out, the products are purified and sequenced. The second PCR in which the product of the first PCR is asymmetrically amplified generates a single-stranded DNA which can be directly used for sequencing. The results indicate the presence of two major bands of DNA, one double-stranded DNA and the second single-stranded DNA appropriate for sequencing. Analysis of the base sequences of PCR products from the second asymmetric PCR were analyzed by dideoxynucleotide sequencing using methods known in the art. The sequencing autoradiograms were of high enough quality to be able to detect single-base variations in which one allele of the gene has one base and the same position of the second allele from the same gene has a different base.
TABLE IA__________________________________________________________________________Region/ 5'-Primer 3'-Primer Productexon (sense) (antisense) size__________________________________________________________________________1.sup.c caactccggcagaactccg tatatctgtgcgcaaacttc 880 SEQ ID NO:262 SEQ ID NO:2631 agaagacgcagagcgctgct acagagtcttgattggcag 437 SEQ ID NO:264 SEQ ID NO:2652A ctcagcgtgcttcataggattc tctagtgtctcaggctcattcact 957 SEQ ID NO:266 SEQ ID NO:2672B ctgattagatcttgagctct ctggattaacatagcattgc 398 SEQ ID NO:268 SEQ ID NO:2693 and 4 gcaatgctatgttaatccag TCACCTTTGTCACCACGATC 478 SEQ ID NO:270 SEQ ID NO:2715A and 5B gctctctctcacagATTGTA gtaagtgcaagcagcaatt 548 SEQ ID NO:272 SEQ ID NO:2736 attgctgcttgcacttacctg agcaagaggttgtcagactct 1082 SEQ ID NO:274 SEQ ID NO:2757 gcagacagcagctactgttct atgaggataccaggtcaatc 484 SEQ ID NO:276 SEQ ID NO:2778 catcgtgtcgcagatgattc acagcagctgcgtcctagt 210 SEQ ID NO:278 SEQ ID NO:2797 and 8 gcagacagcagctactgttct acagcagctgcgtcctagt 623 SEQ ID NO:280 SEQ ID NO:281__________________________________________________________________________Exon PCR PrimerNo. Region Sequence Comments__________________________________________________________________________1 1 gcttcctcctcctgctccaag Antisense sequencing SEQ ID NO:2822A.sup.c 2A gagaccaggatatctga SEQ ID NO:2832B 2B gttcagtggagtcacagga SEQ ID NO:2843 3 and 4 agttagaggagagcacaagga Antisense sequencing SEQ ID NO:2854 3 and 4 tcgacgtcgtcgtcttaa SEQ ID NO:2865A 5A and 5B caagttactgatctgcagct SEQ ID NO:2875B 5A and 5B gacagatggctcttggagaa SEQ ID NO:.2886 6 cagactctctggctctactaag Antisense sequencing SEQ ID NO:2897 7 gcagctctgacttggcacat SEQ ID NO:290 atcatctgcgacacgatg Antisense sequencing SEQ ID NO:2918 8 gtggtctatcattagaggct Antisense sequencing SEQ ID NO:2928 7 and 8 catcgtgtcgcagatgattc SEQ ID NO:293__________________________________________________________________________
According to some embodiments:
SEQ ID NO:262 and SEQ ID NO:263 are used as PCR primers. The resulting PCR fragment is antisense sequenced using SEQ ID:282 as the primer sequence.
SEQ ID NO:264 and SEQ ID NO:265 are used as PCR primers. The resulting PCR fragment is antisense sequenced using SEQ ID:282 as the primer sequence.
SEQ ID NO:266 and SEQ ID NO:267 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:283 as the primer sequence.
SEQ ID NO:268 and SEQ ID NO:269 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:284 as the primer sequence.
SEQ ID NO:270 and SEQ ID NO:271 are used as PCR primers. The resulting PCR fragment is antisense sequenced using SEQ ID:285 as the primer sequence.
SEQ ID NO:270 and SEQ ID NO:271 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:286 as the primer sequence.
SEQ ID NO:272 and SEQ ID NO:273 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:287 as the primer sequence.
SEQ ID NO:272 and SEQ ID NO:273 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:288 as the primer sequence.
SEQ ID NO:274 and SEQ ID NO:275 are used as PCR primers. The resulting PCR fragment is antisense sequenced using SEQ ID:289 as the primer sequence.
SEQ ID NO:276 and SEQ ID NO:277 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:290 as the primer sequence.
SEQ ID NO:276 and SEQ ID NO:277 are used as PCR primers. The resulting PCR fragment is antisense sequenced using SEQ ID:291 as the primer sequence.
SEQ ID NO:278 and SEQ ID NO:279 are used as PCR primers. The resulting PCR fragment is antisense sequenced using SEQ ID:292 as the primer sequence.
SEQ ID NO:280 and SEQ ID NO:281 are used as PCR primers. The resulting PCR fragment is sequenced using SEQ ID:293 as the primer sequence.
FIG. 2 demonstrates that the presence of single-base changes can in some instances be confirmed by digestion of the PCR product with restriction endonucleases. As shown in FIG. 2, the presence of variations at locations +15 (position 225 in FIG. 2) and +42 (position 252 in FIG. 2) of intron 9 can be distinguished by degeneration of one or two bands when the PCR products were digested either with restriction endonuclease Acc I or restriction endonuclease Dra I. As indicated in FIG. 3, the same approach can be used to confirm the presence of a base variation in intron 26 of the gene.
Table II summarizes the series of 21 sequence variations detected in analysis of alleles for COL2A1 from different individuals. None of the variations change coding sequences or other important sites in the genes such as consensus sites for RNA splicing. Therefore, they are probably all normal variations in the structure of the gene and not mutations that cause diseases.
TABLE II__________________________________________________________________________TYPE II PROCOLLAGEN NEUTRAL SEQUENCE VARIANTS POSITION ALLELES OBSERVEDREGION * TYPE Major Minor FREQUENCY VERIFICATION__________________________________________________________________________Exon 5B 75 base substitution C A 0.25 R.E.Intron 9 +15 base substitution G A 0.48 R.E.Intron 9 +45 base deletion G 0.16 R.E.Exon 19 21 base substitution T C 0.02 R.EExon 24 3 base substitution T G 0.01 R.E.Exon 26 3 base substitution T C 0.1 PCR-1 R.E.Intron 26 -24 base substitution C A 0.1 R.E.Intron 26 -47 base substitution C T 0.4 PCR-1 R.E.Exon 30 30 base substitution C T 0.03 RSSIntron 30 +7 base substitution A C 0.1 RSSIntron 30 +37 base substitutian G T 0.1 RSSIntron 31 +7 base substitution G A 0.03 RSSIntron 31 +56 base substitution C T 0.1 R.E.Intron 31 +101 base substitution G T/A nd RSSIntron 31 +128 base deletion G nd RSSIntron 31 +183 base substitution C T 0.4 RSSIntron 31 -55 base substitution T G 0.4 RSSIntron 31 -56 base deletion G 0.4 RSSExon 32 102 base substitution T C nd R.E.Intron 32 -22 base substitution G A 0.04 R.E.Intron 32 -32 base substitution T C 0.4 R.E.__________________________________________________________________________ nd = not determined; RSS = reversed strand sequencing; R.E. = restriction enzyme analysis; PCR1 R.E. = PCRintroduced restriction enzyme site analysis. *Position for exon is designated without a .+-. sign. Position in intron is designated with a "-" sign if the sequence variant is located 5' to th next exon and with a "+" sign if the sequence variant is located 3' to th preceding exon.
Example 2
As a more rapid procedure for detecting sequence variations in the COL2A1 gene, a series of procedures were developed whereby the technique known as denaturing gradient gel electrophoresis (DGGE) could be used to analyze regions of the COL2A1 gene that were amplified by PCR. The technique makes it possible to determine which PCR amplified products from the two alleles have a single-base mutation and which do not. Therefore, the technique reduces the number of PCR products that must be analyzed by the more tedious procedure of nucleotide sequencing in order to determine whether or not a gene has a disease-causing mutation.
The technique of denaturing gradient gel electrophoresis was employed as described by Fischer and Lerman, PNAS 1983, 80, 1579; Myers et al., Methods in Enzymology Vol. 155, R. Wu, Ed., Academic Press, San Diego, 1987, pp 501-527; and Abrams et al., Genomics 1990, 7, 463-475. Procedures that are part of the present invention, however, involve the design and proof of efficacy of a series of primers that provide PCR products of sufficient quality for analysis by denaturing gradient gel electrophoresis. Also, the present invention defines conditions whereby the procedure is carried out so that sequence variations can be detected.
Table III presents the sequences and locations of the oligonucleotide primers used to produce PCR products adequate for analysis. Within each pair of primers, one primer contains a GC-rich sequence that serves as a "clamp" for denaturation and, thereby, greatly facilitates detection of a mutation in hybrid PCR products containing base sequences from both alleles.
TABLE III__________________________________________________________________________PRIMERS AND CONDITIONS FOR PCR AMPLIFICATIONOF THE COL2A1 AND ANALYSIS BY DGGE product DGGE timeEXON PRIMERS size (bp) (h) SEQ ID NO__________________________________________________________________________ 5b 5':GC- 246 13 180 clamp + TCTTGGAGAAACACTGCTTCCCATTGATGC.sup.a 181 3':AAAAGCCACATTTCTGGAGGGACAGCCTGA 6 5' :GC- 239 20 182 clamp + CTAGTGCCTTTCAACCTCCTAACGTTG 183 3':AGGTTGTCAGACTCTCTGGCTCTACTAAG7 + 8 5':GC- 184 349 clamp + GTAAACCCCTCATTTTCTGTTCCGATGC 185 3':CAGCTGCGTCCTAGTGGTCTATCATTAG 9 5':GTCCATTGCTTAGGTGTCTTCCCACTA 186 3':GC- 187 clamp + GGAGGCAGCTCCTCATTTGTCTACTC10 5':GC- 245 18 188 clamp + CACTATGCTACGCGTCTCTGAGGAAGCT 189 3':CTCAGAAGTGACCTCATTGAACTGGATGC11 5':GC- 190 248 clamp + CTCCTGCAGCCAGGGCAGCTTTCCACT 191 3':GGAAGAAATGCACGCACCCCAAAGTGC12 5':GC- 192 189 clamp + CCAACTTGGGTTTCCCAGAAGTCTGA 193 3':TTGTCTCCCTCCTCCCCATCCCATTGTAC13 5':GC- 194 270 clamp + ACCACTGAAGTTTGTGTCAGTGCGAGCT 195 3':TTTGCAGCCATCTGATAGTCTGAAGAGTC14 5':GC- 196 248 clamp + CACCCTGAGCACCGTAAAGCCAACTCATG 197 3':CTATGCCTCACAGGTTTGTTTCGAGGGTCA15 5':TGCCTTCTGGCCACCCACTCGCACAGA 198 5':GC- 199 clamp + CCACCCACTCGCACAGACACCAAAACTGCA 200 3': GTGGTTGCACCCAATACACCCTCGAGACTG16 5':GC- 201 340 clamp + TTGCTTTGCCTTCTGAAGCCAGGCAAAGCT 202 3':AGCTTCTCGGCACCCAGAAGTTCCTGACT17 5':GC- 203 298 clamp + TATTGCCCACCCACTAGAGGTCTGTGTC 204 3':CCCACAACTGTCAGAGCAAAGTACAGAGTC18 5':GC- 177 14 205 clamp + GGTGGTTGGGGTTCATTCTTTGCTGCT 206 3':GGGCTCTCCTGGGGTAGCAAAGTCCAC19 5':GC- 207 317 clamp + GGATCTGCTGTGAGTGTTGCCCGTGGACT 208 3':AGAGCATGGGAAAGAGGGGTGATG20 5':GC- 209 425 clamp + TCCCTGGAGAGAATCTGGTGTGAGGACCT 210 3':CTCAGTCCCTGTTAAGTCTCCTCCAGGCAT21 5' :GGCTACTTCCTTCATACTCTAGATCGA.sup.b 305 9 211 3':GC- 212 clamp + CGGACCACAAGGACTCCACTTCCCTCTCGA.sup.b22 + 5':GC- 213 366 clamp + CTGGCTGGGTTGGGCTGTTCTCACTCACTG 214 3':CTGAAGCCAAGGGCAACAGCAGCTCTGCTA24 5':GC- 215 260 clamp + TAGCAGAGCTGCTGTTGCCCTTGGCTTCAG 216 3':ACCCTCCTAGCAGCCCTCAGAGGATAGACT25 3':CACTGTCCCTGGTTAAACTCTACTCAG 217 5':GC- 218 clamp + GTCAATCCTAGATGCTGAGCATGTGTG26 5':GC- 219 264 clamp + CATCAGGATGTGGCCCCAGGCTCAGTC 220 3': GGCCGTTCCCCTGTCCTCCCTGCAGAT27 5':GGAAACTCTGGGCCAGAAGTACCTTTG.sup.C 232 14 221 3': ATCAGCTCAGCCCACATTCACATCTCTCAG 222 3':GC- 223 clamp + CCCAGCCCCCAGGGCACCTGGAGGCTG28 5':GC- 224 284 clamp + AGTGCAGGGAGGCATGCATGCACTGTCTGA 225 3': TATGGCAAAGGACTGACACAGAGAGCCTG29 5':GC-clamp + TTCCCTGCACCCCTGGCTGTCACT 226 3':TGGAGGCTGGGACATGGGTCCAGA 22730 5':CATGTCCCAGCCTCCACAGATGACACAAT 228 3':GC- 229 clamp + GGCCCAAGGAGGCAGCCCGCATTGGCCAACAG31 5':GC- 230 226 clamp + CTTCTCCCCCACTGCTGTTGGTTGATCA 231 3':TACCACGAAGACCCCTACTGGATGCA32 5':GC- 232 325 clamp + TTCAGGGAGAGGTGCTGTCCACTACAGACT 233 3':GCTCCTCAAAAAGGGCTAACAGAAACCTTCA33 5':GC- 234 308 clamp + ACAGCAAATTCCTCTTGGGCAGGGACTG.sup.d 235 3':CAGTGGCAGACTGCCCAGACCCTCTCT34 5':GC- 236 316 clamp + GACTTCTCTGTTAAAATGGGGCCAGAG 237 3':ACAGAACCCCTTTGGCAGGAGATAAGA35 5':AGGGTGCGGGTATGGGCTGCACAGTAA 238 3':GC- 239 clamp + CTGCACTGACTCCCTGGCTCTCTGGTT36 5':GC- 240 327 clamp + TCAGGGTGAGGGCTTTTGGGTTAACAGAG 241 3':TGGATGTGGAACTGGCCTGAGTGGAGGTA37 5':AGGACACACACGCAGATACTCACAGAGT 242 3':GC- 243 clamp + CAGGAGCCCTTCCTTGAGGGAACAATTC38 5':CCTCTTCAGGCTGGGTTTTTAGTGCCA 244 3':GC- 245 clamp + GGGGCCAGGCCTCTTTGTGAGGTGCAG39 5':CCCTTTCCATACCAGGCTCTGAGACCAC 246 3':GC- 247 clamp + AGTGAAGGCCAGCCTGGAGCTCTCCAGA40 5':GC- 248 331 clamp + TGAGGAAGGGTGAGATGAGTCCTCACT 249 3':CTACCCCATGCTCTGTGAGCTCAGAAG41 5':GC- 250 313 clamp + CAGACAGAGCTGTGCTGAGAGGACGAAG 251 3':AGACAAGGGACAGTCCTGAGGGTGCTGA42 5':GCAGGGGTGCTTACCACTTGCACTCAT 274 10.5 252 3':GC- 253 clamp + TCCTTGCTGACCCAGCACAGAGACTCAC43 5':GC- 8 309 254 clamp + GGGCAGAAGAGGAGAGGCCTGGGCTTC 255 3': GTCCTTCTAGGCTGAGATGAGACTTGT44 5':GC- 256 227 clamp + GGAACATTCTTCTCTGAGCCTGAGAC 257 3':CAGGGGAAGGCGGCTTTTACTGAATTC45 + 46 5':GC- 12.5 258 clamp + CTGGACTAAGGAGCAGTGGCCTCAGAT 259 3': CAGCCCTGAGGAAATCCTAGAAACTGC47 5':AATATAGATAGATATGTCTGTGCTGAC 185 10 260 3':GC- 261 clamp + GGCCCCCTCCATCTTCCAACTCCATG__________________________________________________________________________ .sup.a GCclamp:CCGCCCGCCCCGCCCGCCGCCCGCCCCGCCCGCCGCCCGC .sup.b 40 pmols of primer. Twenty pmols of all other primers. .sup.c Primers I and II were used for first PCR amplification and primer I and III were used for the second. .sup.d Annealing temperature was 60.degree. C. instead of 54.degree. C.
The present invention also provides the running times for denaturing gradient gel electrophoresis that are critical for successful analysis of each of the PCR products.
Table IV illustrates the sequence variations detected by the technique. As indicated, five single-base differences were observed within introns of the COL2A1 gene and five within coding exons of the gene. The presence of the sequence variations was confirmed by dideoxynucleotide sequencing of the PCR products as described in Example 1.
TABLE IV______________________________________SEQUENCE VARIANTS DETECTED IN COL2A1 GENE______________________________________INTRON POSITION VARIATION______________________________________19 -11 C/T24 -24 C/T26 -24 C/A32 +69 .sup. C/T.sup.a32 -22 G/A______________________________________AMINO ACIDexon codon Variation______________________________________5B Gly .sup. C/A.sup.a26 Gly 412 C/T31 Arg .fwdarw. Cys 519 C/T32 Gly 565 C/T34 Asn 600 C/T______________________________________ .sup.a New sequence variations detected here.
Example 3
Methods of the present invention were used to analyze the COL2A1 gene in a proband with arthro-ophthalmopathy or the Wagner-Stickler syndrome.
The proband was a male who at the age of three had severe myopia and incipient posterior cataracts. He had mild mid-facial hypoplasia. His knees, elbows and ankle joints were slightly hyperextensible. By the age of eight years, the vision in the left eye had deteriorated because of a cataract. The cataract was removed by suction. Two weeks later he had retinal ablation of the left eye. The other retina was then cryocoagulated. The proband's mother and proband's maternal uncle had similar problems. The proband's maternal grandfather also appeared to have manifestations of the disease since he had retinal ablation in one eye after a fall at the age of 18 and ophthalmologic examination at the age of 23 demonstrated he had a cataract in one eye and very limited vision in both eyes.
Methods of the present invention were used to amplify 15 of the 54 exons of the COL2A1 gene and then to analyze the PCR products by denaturing gradient gel electrophoresis. The results suggested the presence of a mutation in a PCR product containing base sequences from exon 10.
Dideoxynucleotide sequencing of the PCR products from exon 10 demonstrated the proband had both a G and an A in the second base for the codon at position .alpha.1-67 of the .alpha.1(II) chain of type II collagen. The G makes the codon -GTT- for glycine, the normal amino acid present at this position. The A makes the codon -GAT-, a codon for aspartate, an amino acid that is never found in this, the third position in the repeating -Gly-X-Y- sequence of the .alpha. chain domain of a collagen. Extensive analysis of genes of fibrillar collagens in a large number of people have demonstrated that the third position in the repeating -Gly-X-Y- sequence of collagen is always glycine.
Examination of exon 10 from the proband's mother and maternal uncle demonstrated that they also had the same mutation. Unaffected members of the family did not have the mutation. Also, examination of exon 10 from 46 unrelated individuals demonstrated that they did not have the mutation at amino acid position .alpha.1-67. The results, therefore, establish that the mutation at position .alpha.1-67 was the cause of arthro-ophthalmopathy in the proband. The discovery of the mutation makes it possible to offer a prenatal diagnostic test to the family.
__________________________________________________________________________# SEQUENCE LISTING- (1) GENERAL INFORMATION:- (iii) NUMBER OF SEQUENCES: 293- (2) INFORMATION FOR SEQ ID NO: 1:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SINGLE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 1:# 20 CTTC- (2) INFORMATION FOR SEQ ID NO: 2:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SINGLE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 2:# 22GCA AT- (2) INFORMATION FOR SEQ ID NO: 3:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SINGLE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 3:# 20 CTGC- (2) INFORMATION FOR SEQ ID NO: 4:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 16 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SINGLE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 4:# 16- (2) INFORMATION FOR SEQ ID NO: 5:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SINGLE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 5:#21 GATA C- (2) INFORMATION FOR SEQ ID NO: 6:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SINGLE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 6:#21 GATG C- (2) INFORMATION FOR SEQ ID NO: 7:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 16 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SINGLE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 7:# 16- (2) INFORMATION FOR SEQ ID NO: 8:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SINGLE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 8:# 22AGA AG- (2) INFORMATION FOR SEQ ID NO: 9:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SINGLE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 9:# 20 AGAG- (2) INFORMATION FOR SEQ ID NO: 10:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SINGLE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 10:# 22ACG CC- (2) INFORMATION FOR SEQ ID NO: 11:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 15 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SINGLE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 11:# 15- (2) INFORMATION FOR SEQ ID NO: 12:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 23 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 12:# 23AGAC AAC- (2) INFORMATION FOR SEQ ID NO: 13:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 13:# 18 CA- (2) INFORMATION FOR SEQ ID NO: 14:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 17 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 14:# 17 G- (2) INFORMATION FOR SEQ ID NO: 15:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 16 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 15:# 16- (2) INFORMATION FOR SEQ ID NO: 16:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 16:# 20 TTTC- (2) INFORMATION FOR SEQ ID NO: 17:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 15 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 17:# 15- (2) INFORMATION FOR SEQ ID NO: 18:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 18:# 22AAA CC- (2) INFORMATION FOR SEQ ID NO: 19:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 19:#21 CAAA C- (2) INFORMATION FOR SEQ ID NO: 20:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 20:# 22CGA GC- (2) INFORMATION FOR SEQ ID NO: 21:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 21:# 19 TTG- (2) INFORMATION FOR SEQ ID NO: 22:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 16 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 22:# 16- (2) INFORMATION FOR SEQ ID NO: 23:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 16 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 23:# 16- (2) INFORMATION FOR SEQ ID NO: 24:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 24:# 19 ATC- (2) INFORMATION FOR SEQ ID NO: 25:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 15 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 25:# 15- (2) INFORMATION FOR SEQ ID NO: 26:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 26:# 22TGG TG- (2) INFORMATION FOR SEQ ID NO: 27:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 15 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 27:# 15- (2) INFORMATION FOR SEQ ID NO: 28:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 28:# 22CTG CA- (2) INFORMATION FOR SEQ ID NO: 29:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 29:#21 TTCG T- (2) INFORMATION FOR SEQ ID NO: 30:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 30:# 22CCT TG- (2) INFORMATION FOR SEQ ID NO: 31:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 31:# 22GAG TG- (2) INFORMATION FOR SEQ ID NO: 32:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 32:# 22AGT CA- (2) INFORMATION FOR SEQ ID NO: 33:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 33:#21 AGTA C- (2) INFORMATION FOR SEQ ID NO: 34:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 17 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 34:# 17 A- (2) INFORMATION FOR SEQ ID NO: 35:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 35:# 24TGAG CAAG- (2) INFORMATION FOR SEQ ID NO: 36:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 36:# 22GTC TG- (2) INFORMATION FOR SEQ ID NO: 37:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 37:# 24ACAT GTTC- (2) INFORMATION FOR SEQ ID NO: 38:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 38:# 22GCT GT- (2) INFORMATION FOR SEQ ID NO: 39:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 39:# 20 CTGG- (2) INFORMATION FOR SEQ ID NO: 40:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 40:# 22AGT AC- (2) INFORMATION FOR SEQ ID NO: 41:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 41:# 22GGT TG- (2) INFORMATION FOR SEQ ID NO: 42:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 42:# 24GGAA TGTC- (2) INFORMATION FOR SEQ ID NO: 43:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 43:#21 CTAT G- (2) INFORMATION FOR SEQ ID NO: 44:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 44:# 18 AG- (2) INFORMATION FOR SEQ ID NO: 45:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 15 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 45:# 15- (2) INFORMATION FOR SEQ ID NO: 46:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 46:# 18 CT- (2) INFORMATION FOR SEQ ID NO: 47:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 23 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 47:# 23GTGG AAC- (2) INFORMATION FOR SEQ ID NO: 48:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 48:# 24ATTC TCTC- (2) INFORMATION FOR SEQ ID NO: 49:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 49:# 24TGGA GTCA- (2) INFORMATION FOR SEQ ID NO: 50:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 17 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 50:# 17 C- (2) INFORMATION FOR SEQ ID NO: 51:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 51:#21 AGCA G- (2) INFORMATION FOR SEQ ID NO: 52:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 52:# 24CCAG ATTG- (2) INFORMATION FOR SEQ ID NO: 53:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 53:# 24GCAT AATC- (2) INFORMATION FOR SEQ ID NO: 54:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 23 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 54:# 23CTTC TCA- (2) INFORMATION FOR SEQ ID NO: 55:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 55:# 20 TTAG- (2) INFORMATION FOR SEQ ID NO: 56:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 23 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 56:# 23CCTC AAG- (2) INFORMATION FOR SEQ ID NO: 57:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 57:# 22GCT AC- (2) INFORMATION FOR SEQ ID NO: 58:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 58:# 19 TTC- (2) INFORMATION FOR SEQ ID NO: 59:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 16 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 59:# 16- (2) INFORMATION FOR SEQ ID NO: 60:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 60:# 18 GG- (2) INFORMATION FOR SEQ ID NO: 61:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 61:# 22GGA CT- (2) INFORMATION FOR SEQ ID NO: 62:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 62:#21 ACAT G- (2) INFORMATION FOR SEQ ID NO: 63:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 63:#21 AGGA A- (2) INFORMATION FOR SEQ ID NO: 64:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 64:# 22GAA TC- (2) INFORMATION FOR SEQ ID NO: 65:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 65:# 20 CCTG- (2) INFORMATION FOR SEQ ID NO: 66:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 66:# 20 GATG- (2) INFORMATION FOR SEQ ID NO: 67:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 67:# 18 GG- (2) INFORMATION FOR SEQ ID NO: 68:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 68:# 20 GCTG- (2) INFORMATION FOR SEQ ID NO: 69:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 69:# 19 TCAG- (2) INFORMATION FOR SEQ ID NO: 70:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 70:# 20 ACTG- (2) INFORMATION FOR SEQ ID NO: 71:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 71:# 18 AC- (2) INFORMATION FOR SEQ ID NO: 72:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 72:# 22TTGCT TC- (2) INFORMATION FOR SEQ ID NO: 73:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 73:# 24CTCC ATCT- (2) INFORMATION FOR SEQ ID NO: 74:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 74:# 20 GTGG- (2) INFORMATION FOR SEQ ID NO: 75:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 75:#21 TGGA T- (2) INFORMATION FOR SEQ ID NO: 76:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 76:# 20 ATGT- (2) INFORMATION FOR SEQ ID NO: 77:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 77:#21 TGAG G- (2) INFORMATION FOR SEQ ID NO: 78:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 78:# 20 AGAC- (2) INFORMATION FOR SEQ ID NO: 79:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 79:# 20 CAGC- (2) INFORMATION FOR SEQ ID NO: 80:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 80:# 20 CAGC- (2) INFORMATION FOR SEQ ID NO: 81:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 81:# 20 AGCC- (2) INFORMATION FOR SEQ ID NO: 82:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 82:# 20 ACGC- (2) INFORMATION FOR SEQ ID NO: 83:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 23 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 83:# 23AAGC TGC- (2) INFORMATION FOR SEQ ID NO: 84:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 84:# 20 CTGC- (2) INFORMATION FOR SEQ ID NO: 85:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 85:#21 TAGC A- (2) INFORMATION FOR SEQ ID NO: 86:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 86:#21 TGAT C- (2) INFORMATION FOR SEQ ID NO: 87:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 87:# 22CTA TC- (2) INFORMATION FOR SEQ ID NO: 88:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 88:#21 TCCA G- (2) INFORMATION FOR SEQ ID NO: 89:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 89:# 20 TTCC- (2) INFORMATION FOR SEQ ID NO: 90:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 17 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 90:# 17 T- (2) INFORMATION FOR SEQ ID NO: 91:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 91:# 18 TG- (2) INFORMATION FOR SEQ ID NO: 92:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 16 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 92:# 16- (2) INFORMATION FOR SEQ ID NO: 93:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 93:# 19 GAA- (2) INFORMATION FOR SEQ ID NO: 94:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 94:# 18 TC- (2) INFORMATION FOR SEQ ID NO: 95:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 95:# 19 TTC- (2) INFORMATION FOR SEQ ID NO: 96:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 96:# 18 TC- (2) INFORMATION FOR SEQ ID NO: 97:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 97:# 20 CTTG- (2) INFORMATION FOR SEQ ID NO: 98:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 98:# 18 TT- (2) INFORMATION FOR SEQ ID NO: 99:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 99:# 18 TC- (2) INFORMATION FOR SEQ ID NO: 100:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 17 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 100:# 17 T- (2) INFORMATION FOR SEQ ID NO: 101:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 101:# 18 CT- (2) INFORMATION FOR SEQ ID NO: 102:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 102:# 20 GAAG- (2) INFORMATION FOR SEQ ID NO: 103:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 103:# 20 GACC- (2) INFORMATION FOR SEQ ID NO: 104:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 17 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 104:# 17 A- (2) INFORMATION FOR SEQ ID NO: 105:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 105:# 20 GAAG- (2) INFORMATION FOR SEQ ID NO: 106:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 23 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 106:# 23AGAA GTG- (2) INFORMATION FOR SEQ ID NO: 107:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 107:# 20 CATG- (2) INFORMATION FOR SEQ ID NO: 108:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 108:# 20 TAGC- (2) INFORMATION FOR SEQ ID NO: 109:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 109:# 20 TAAG- (2) INFORMATION FOR SEQ ID NO: 110:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 110:# 19 AGT- (2) INFORMATION FOR SEQ ID NO: 111:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 111:# 20 CAGC- (2) INFORMATION FOR SEQ ID NO: 112:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 112:# 20 ACTG- (2) INFORMATION FOR SEQ ID NO: 113:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 113:# 20 CTCT- (2) INFORMATION FOR SEQ ID NO: 114:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 114:#AACACAGGCT 20GCTGCACAGT- (2) INFORMATION FOR SEQ ID NO: 115:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 115:# 20 TCTG- (2) INFORMATION FOR SEQ ID NO: 116:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 116:# 20 TTAA- (2) INFORMATION FOR SEQ ID NO: 117:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 117:# 20 ATGT- (2) INFORMATION FOR SEQ ID NO: 118:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 118:# 20 CTGC- (2) INFORMATION FOR SEQ ID NO: 119:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 119:#21 GGCA G- (2) INFORMATION FOR SEQ ID NO: 120:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 120:# 20 GTCT- (2) INFORMATION FOR SEQ ID NO: 121:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 121:# 20 GCGC- (2) INFORMATION FOR SEQ ID NO: 122:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 122:# 20 TGGT- (2) INFORMATION FOR SEQ ID NO: 123:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 123:# 20 CCTG- (2) INFORMATION FOR SEQ ID NO: 124:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 124:# 20 CTGC- (2) INFORMATION FOR SEQ ID NO: 125:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 125:# 20 TTCC- (2) INFORMATION FOR SEQ ID NO: 126:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 126:# 22CTA TG- (2) INFORMATION FOR SEQ ID NO: 127:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 23 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 127:# 23GACC AGA- (2) INFORMATION FOR SEQ ID NO: 128:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 128:# 22TGA GA- (2) INFORMATION FOR SEQ ID NO: 129:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 129:# 20 AGAC- (2) INFORMATION FOR SEQ ID NO: 130:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 17 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 130:# 17 A- (2) INFORMATION FOR SEQ ID NO: 131:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 131:# 22TGA AG- (2) INFORMATION FOR SEQ ID NO: 132:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 132:# 20 CCAG- (2) INFORMATION FOR SEQ ID NO: 133:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 133:# 20 CACT- (2) INFORMATION FOR SEQ ID NO: 134:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 23 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 134:# 23AAGG TTG- (2) INFORMATION FOR SEQ ID NO: 135:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 135:# 22TCA CT- (2) INFORMATION FOR SEQ ID NO: 136:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 136:# 19 TTG- (2) INFORMATION FOR SEQ ID NO: 137:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 16 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 137:# 16- (2) INFORMATION FOR SEQ ID NO: 138:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 138:# 18 GA- (2) INFORMATION FOR SEQ ID NO: 139:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 139:# 20 CTTG- (2) INFORMATION FOR SEQ ID NO: 140:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 140:#21 ACAA G- (2) INFORMATION FOR SEQ ID NO: 141:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 141:# 20 CTCA- (2) INFORMATION FOR SEQ ID NO: 142:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 142:# 19 TGT- (2) INFORMATION FOR SEQ ID NO: 143:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 143:#21 TGAA G- (2) INFORMATION FOR SEQ ID NO: 144:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 144:# 18 TG- (2) INFORMATION FOR SEQ ID NO: 145:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 145:# 24CTTT GTTC- (2) INFORMATION FOR SEQ ID NO: 146:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 23 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 146:# 23AGAC TTG- (2) INFORMATION FOR SEQ ID NO: 147:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 147:#21 TCTC A- (2) INFORMATION FOR SEQ ID NO: 148:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 17 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 148:# 17 A- (2) INFORMATION FOR SEQ ID NO: 149:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 16 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 149:# 16- (2) INFORMATION FOR SEQ ID NO: 150:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 16 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 150:# 16- (2) INFORMATION FOR SEQ ID NO: 151:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 151:# 18 TG- (2) INFORMATION FOR SEQ ID NO: 152:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 152:# 22GAA TC- (2) INFORMATION FOR SEQ ID NO: 153:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 25 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 153:# 25 GTCA GGTCA- (2) INFORMATION FOR SEQ ID NO: 154:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 154:#21 GATT G- (2) INFORMATION FOR SEQ ID NO: 155:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 155:# 20 TAAG- (2) INFORMATION FOR SEQ ID NO: 156:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 156:# 20 CTGC- (2) INFORMATION FOR SEQ ID NO: 157:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 157:# 20 CGTG- (2) INFORMATION FOR SEQ ID NO: 158:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 158:# 20 GGCA- (2) INFORMATION FOR SEQ ID NO: 159:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 159:# 20 GGTG- (2) INFORMATION FOR SEQ ID NO: 160:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 160:# 20 ACTG- (2) INFORMATION FOR SEQ ID NO: 161:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 161:# 20 TACC- (2) INFORMATION FOR SEQ ID NO: 162:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 162:# 20 TAGC- (2) INFORMATION FOR SEQ ID NO: 163:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 163:# 20 CTTC- (2) INFORMATION FOR SEQ ID NO: 164:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 164:# 20 CATG- (2) INFORMATION FOR SEQ ID NO: 165:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 165:# 22GCT GT- (2) INFORMATION FOR SEQ ID NO: 166:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 17 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 166:# 17 C- (2) INFORMATION FOR SEQ ID NO: 167:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 16 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 167:# 16- (2) INFORMATION FOR SEQ ID NO: 168:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 17 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 168:# 17 G- (2) INFORMATION FOR SEQ ID NO: 169:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 169:#21 GCTA G- (2) INFORMATION FOR SEQ ID NO: 170:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SINGLE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 170:# 20 GGTC- (2) INFORMATION FOR SEQ ID NO: 171:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SINGLE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 171:# 20 GCTC- (2) INFORMATION FOR SEQ ID NO: 172:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SINGLE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 172:# 20 GGCC- (2) INFORMATION FOR SEQ ID NO: 173:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SINGLE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 173:# 20 AAGC- (2) INFORMATION FOR SEQ ID NO: 174:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SINGLE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 174:# 20 CTTG- (2) INFORMATION FOR SEQ ID NO: 175:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SINGLE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 175:# 20 CAGC- (2) INFORMATION FOR SEQ ID NO: 176:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SINGLE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 176:# 20 CAGC- (2) INFORMATION FOR SEQ ID NO: 177:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SINGLE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 177:# 20 AAGC- (2) INFORMATION FOR SEQ ID NO: 178:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SINGLE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 178:# 20 TCGC- (2) INFORMATION FOR SEQ ID NO: 179:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SINGLE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: YES- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 179:# 20 GTGC- (2) INFORMATION FOR SEQ ID NO: 180:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 70 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 180:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC TCTTGGAGAA# 70 ATGC- (2) INFORMATION FOR SEQ ID NO: 181:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 30 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 181:# 30 GAGG GACAGCCTGA- (2) INFORMATION FOR SEQ ID NO: 182:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 67 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 182:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC CTAGTGCCTT# 67 G- (2) INFORMATION FOR SEQ ID NO: 183:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 29 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 183:# 29 TGGC TCTACTAAG- (2) INFORMATION FOR SEQ ID NO: 184:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 68 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 184:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC GTAAACCCCT# 68 GC- (2) INFORMATION FOR SEQ ID NO: 185:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 28 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 185:# 28 GTCT ATCATTAG- (2) INFORMATION FOR SEQ ID NO: 186:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 27 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 186:# 27 TCTT CCCACTA- (2) INFORMATION FOR SEQ ID NO: 187:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 66 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 187:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC GGAGGCAGCT# 66- (2) INFORMATION FOR SEQ ID NO: 188:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 68 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 188:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC CACTATGCTA# 68 CT- (2) INFORMATION FOR SEQ ID NO: 189:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 29 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 189:# 29 TTGA ACTGGATGC- (2) INFORMATION FOR SEQ ID NO: 190:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 67 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 190:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC CTCCTGCAGC# 67 T- (2) INFORMATION FOR SEQ ID NO: 191:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 27 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 191:# 27 CCCC AAAGTGC- (2) INFORMATION FOR SEQ ID NO: 192:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 66 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 192:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC CCAACTTGGG# 66- (2) INFORMATION FOR SEQ ID NO: 193:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 29 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 193:# 29 CATC CCATTGTAC- (2) INFORMATION FOR SEQ ID NO: 194:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 68 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 194:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC ACCACTGAAG# 68 CT- (2) INFORMATION FOR SEQ ID NO: 195:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 29 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 195:# 29 AGTC TGAAGAGTC- (2) INFORMATION FOR SEQ ID NO: 196:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 69 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 196:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC CACCCTGAGC# 69 ATG- (2) INFORMATION FOR SEQ ID NO: 197:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 30 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 197:# 30 TGTT TCGAGGGTCA- (2) INFORMATION FOR SEQ ID NO: 198:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 27 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 198:# 27 ACTC GCACAGA- (2) INFORMATION FOR SEQ ID NO: 199:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 70 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 199:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC CCACCCACTC# 70 TGCA- (2) INFORMATION FOR SEQ ID NO: 200:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 30 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 200:# 30 CACC CTCGAGACTG- (2) INFORMATION FOR SEQ ID NO: 201:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 70 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 201:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC TTGCTTTGCC# 70 AGCT- (2) INFORMATION FOR SEQ ID NO: 202:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 29 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 202:# 29 GAAG TTCCTGACT- (2) INFORMATION FOR SEQ ID NO: 203:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 68 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 203:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC TATTGCCCAC# 68 TC- (2) INFORMATION FOR SEQ ID NO: 204:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 30 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 204:# 30 CAAA GTACAGAGTC- (2) INFORMATION FOR SEQ ID NO: 205:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 67 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 205:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC GGTGGTTGGG# 67 T- (2) INFORMATION FOR SEQ ID NO: 206:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 27 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 206:# 27 GCAA AGTCCAC- (2) INFORMATION FOR SEQ ID NO: 207:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 69 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 207:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC GGATCTGCTG# 69 ACT- (2) INFORMATION FOR SEQ ID NO: 208:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 208:# 24GGGT GATG- (2) INFORMATION FOR SEQ ID NO: 209:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 69 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 209:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC TCCCTGGAGA# 69 CCT- (2) INFORMATION FOR SEQ ID NO: 210:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 30 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 210:# 30 TCTC CTCCAGGCAT- (2) INFORMATION FOR SEQ ID NO: 211:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 27 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 211:# 27 CTCT AGATCGA- (2) INFORMATION FOR SEQ ID NO: 212:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 70 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 212:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC CGGACCACAA# 70 TCGA- (2) INFORMATION FOR SEQ ID NO: 213:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 70 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 213:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC CTGGCTGGGT# 70 ACTG- (2) INFORMATION FOR SEQ ID NO: 214:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 30 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 214:# 30 CAGC AGCTCTGCTA- (2) INFORMATION FOR SEQ ID NO: 215:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 70 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 215:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC TAGCAGAGCT# 70 TCAG- (2) INFORMATION FOR SEQ ID NO: 216:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 30 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 216:# 30 TCAG AGGATAGACT- (2) INFORMATION FOR SEQ ID NO: 217:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 27 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 217:# 27 ACTC TACTCAG- (2) INFORMATION FOR SEQ ID NO: 218:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 67 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 218:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC GTCAATCCTA# 67 G- (2) INFORMATION FOR SEQ ID NO: 219:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 67 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 219:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC CATCAGGATG# 67 C- (2) INFORMATION FOR SEQ ID NO: 220:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 27 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 220:# 27 TCCC TGCAGAT- (2) INFORMATION FOR SEQ ID NO: 221:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 27 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 221:# 27 AAGT ACCTTTG- (2) INFORMATION FOR SEQ ID NO: 222:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 30 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 222:# 30 TTCA CATCTCTCAG- (2) INFORMATION FOR SEQ ID NO: 223:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 67 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 223:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC CCCAGCCCCC# 67 G- (2) INFORMATION FOR SEQ ID NO: 224:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 70 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 224:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC AGTGCAGGGA# 70 CTGA- (2) INFORMATION FOR SEQ ID NO: 225:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 29 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 225:# 29 CACA GAGAGCCTG- (2) INFORMATION FOR SEQ ID NO: 226:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 64 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 226:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC TTCCCTGCAC# 64- (2) INFORMATION FOR SEQ ID NO: 227:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 227:# 24GGTC CAGA- (2) INFORMATION FOR SEQ ID NO: 228:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 29 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 228:# 29 CAGA TGACACAAT- (2) INFORMATION FOR SEQ ID NO: 229:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 72 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 229:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC GGCCCAAGGA# 72AAC AG- (2) INFORMATION FOR SEQ ID NO: 230:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 68 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 230:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC CTTCTCCCCC# 68 CA- (2) INFORMATION FOR SEQ ID NO: 231:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 26 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 231:# 26 ACTG GATGCA- (2) INFORMATION FOR SEQ ID NO: 232:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 70 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 232:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC TTCAGGGAGA# 70 GACT- (2) INFORMATION FOR SEQ ID NO: 233:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 31 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 233:# 31 TAAC AGAAACCTTC A- (2) INFORMATION FOR SEQ ID NO: 234:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 68 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 234:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC ACAGCAAATT# 68 TG- (2) INFORMATION FOR SEQ ID NO: 235:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 27 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 235:# 27 AGAC CCTCTCT- (2) INFORMATION FOR SEQ ID NO: 236:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 67 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 236:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC GACTTCTCTG# 67 G- (2) INFORMATION FOR SEQ ID NO: 237:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 27 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 237:# 27 AGGA GATAAGA- (2) INFORMATION FOR SEQ ID NO: 238:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 27 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 238:# 27 CTGC ACAGTAA- (2) INFORMATION FOR SEQ ID NO: 239:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 67 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 239:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC CTGCACTGAC# 67 T- (2) INFORMATION FOR SEQ ID NO: 240:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 69 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 240:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC TCAGGGTGAG# 69 GAG- (2) INFORMATION FOR SEQ ID NO: 241:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 29 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 241:# 29 CTGA GTGGAGGTA- (2) INFORMATION FOR SEQ ID NO: 242:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 28 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 242:# 28 TACT CACAGAGT- (2) INFORMATION FOR SEQ ID NO: 243:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 68 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 243:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC CAGGAGCCCT# 68 TC- (2) INFORMATION FOR SEQ ID NO: 244:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 27 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 244:# 27 TTTT AGTGCCA- (2) INFORMATION FOR SEQ ID NO: 245:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 67 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 245:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC GGGGCCAGGC# 67 G- (2) INFORMATION FOR SEQ ID NO: 246:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 28 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 246:# 28 CTCT GAGACCAC- (2) INFORMATION FOR SEQ ID NO: 247:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 68 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 247:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC AGTGAAGGCC# 68 GA- (2) INFORMATION FOR SEQ ID NO: 248:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 67 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 248:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC TGAGGAAGGG# 67 T- (2) INFORMATION FOR SEQ ID NO: 249:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 27 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 249:# 27 GAGC TCAGAAG- (2) INFORMATION FOR SEQ ID NO: 250:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 68 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 250:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC CAGACAGAGC# 68 AG- (2) INFORMATION FOR SEQ ID NO: 251:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 28 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 251:# 28 TGAG GGTGCTGA- (2) INFORMATION FOR SEQ ID NO: 252:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 27 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 252:# 27 CTTG CACTCAT- (2) INFORMATION FOR SEQ ID NO: 253:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 68 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 253:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC TCCTTGCTGA# 68 AC- (2) INFORMATION FOR SEQ ID NO: 254:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 66 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 254:# 50CGCGCC CGCCCCGCCC GCCGCCCGCG GGCAGAAGAG# 66- (2) INFORMATION FOR SEQ ID NO: 255:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 27 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 255:# 27 ATGA GACTTGT- (2) INFORMATION FOR SEQ ID NO: 256:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 66 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 256:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC GGAACATTCT# 66- (2) INFORMATION FOR SEQ ID NO: 257:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 27 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 257:# 27 TTAC TGAATTC- (2) INFORMATION FOR SEQ ID NO: 258:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 67 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 258:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC CTGGACTAAG# 67 T- (2) INFORMATION FOR SEQ ID NO: 259:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 27 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 259:# 27 CTAG AAACTGC- (2) INFORMATION FOR SEQ ID NO: 260:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 27 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 260:# 27 TCTG TGCTGAC- (2) INFORMATION FOR SEQ ID NO: 261:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 66 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SINGLE (D) TOPOLOGY: LINEAR- (iv) ANTI-SENSE: NO- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 261:# 50CGCCGC CCGCCCCGCC CGCCGCCCGC GGCCCCCTCC# 66- (2) INFORMATION FOR SEQ ID NO: 262:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 262:# 19 CCG- (2) INFORMATION FOR SEQ ID NO: 263:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 263:# 20 CTTC- (2) INFORMATION FOR SEQ ID NO: 264:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 264:# 20 TGCT- (2) INFORMATION FOR SEQ ID NO: 265:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 265:# 19 CAG- (2) INFORMATION FOR SEQ ID NO: 266:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 266:# 22GAT TC- (2) INFORMATION FOR SEQ ID NO: 267:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 267:# 24CATT CACT- (2) INFORMATION FOR SEQ ID NO: 268:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 268:# 20 CTCT- (2) INFORMATION FOR SEQ ID NO: 269:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 269:# 20 TTGC- (2) INFORMATION FOR SEQ ID NO: 270:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 270:# 20 CCAG- (2) INFORMATION FOR SEQ ID NO: 271:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 271:# 20 GATC- (2) INFORMATION FOR SEQ ID NO: 272:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 272:# 20 TGTA- (2) INFORMATION FOR SEQ ID NO: 273:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 273:# 19 ATT- (2) INFORMATION FOR SEQ ID NO: 274:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 274:#21 ACCT G- (2) INFORMATION FOR SEQ ID NO: 275:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 275:#21 ACTC T- (2) INFORMATION FOR SEQ ID NO: 276:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 276:#21 GTTC T- (2) INFORMATION FOR SEQ ID NO: 277:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 277:# 20 AATC- (2) INFORMATION FOR SEQ ID NO: 278:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 278:# 20 ATTC- (2) INFORMATION FOR SEQ ID NO: 279:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 279:# 19 AGT- (2) INFORMATION FOR SEQ ID NO: 280:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 280:#21 GTTC T- (2) INFORMATION FOR SEQ ID NO: 281:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 281:# 19 AGT- (2) INFORMATION FOR SEQ ID NO: 282:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 282:#21 CCAA G- (2) INFORMATION FOR SEQ ID NO: 283:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 17 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 283:# 17 A- (2) INFORMATION FOR SEQ ID NO: 284:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 284:# 19 GGA- (2) INFORMATION FOR SEQ ID NO: 285:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 285:#21 AAGG A- (2) INFORMATION FOR SEQ ID NO: 286:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 286:# 18 AA- (2) INFORMATION FOR SEQ ID NO: 287:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 287:# 20 AGCT- (2) INFORMATION FOR SEQ ID NO: 288:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 288:# 20 AGAA- (2) INFORMATION FOR SEQ ID NO: 289:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 289:# 22CTA AG- (2) INFORMATION FOR SEQ ID NO: 290:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 290:# 20 ACAT- (2) INFORMATION FOR SEQ ID NO: 291:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 291:# 18 TG- (2) INFORMATION FOR SEQ ID NO: 292:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 292:# 20 GGCT- (2) INFORMATION FOR SEQ ID NO: 293:- (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 (B) TYPE: NUCLEIC ACID (C) STRANDEDNESS: SING - #LE (D) TOPOLOGY: LINEAR- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 293:# 20 ATTC__________________________________________________________________________
Claims
  • 1. A method of determining a genetic predisposition for a disease in a human individual caused by a mutation in a region of COL2A1, said method comprising:
  • obtaining a cell from said individual;
  • isolating a nucleic acid from said cell, wherein said nucleic acid comprises said region;
  • amplifying said region using a first primer, wherein said first primer has a nucleotide sequence selected from the group consisting of SEQ ID NOs 28-44, 47-66, 68-80, 83, 85, 86, 112-143, 145, and 158; and
  • detecting the presence or absence of said mutation in said region, whereby the presence of said mutation in said region indicates a genetic predisposition for said disease.
  • 2. The method of claim 1, wherein said disease is selected from the group consisting of osteoarthritis, chondrodysplasia, dwarfism, malformation of joints, myopia, retinal detachment, blindness, cataracts, and cleft palate.
  • 3. The method of claim 2, wherein said disease is osteoarthritis.
  • 4. The method of claim 1, wherein said first primer has a nucleotide sequence selected from the group consisting of SEQ ID NOs: 28-44, 47-59, 61-66, 69, 70, 72-80, 83, 86, 112-143, 145, and 158.
  • 5. The method of claim 1, wherein said first primer has a nucleotide sequence selected from the group consisting of SEQ ID NOs: 28-44, 48-58, 61-66, 70, 72-80, 83, 86, 112-118, 120-124, 126-138, 140, and 143.
  • 6. The method of claim 1, wherein said first primer has a nucleotide sequence selected from the group consisting of SEQ ID NOs: 28, 30-37, 39-42, 47-49, 51-57, 61-64, 72-74, 77-79, 86, 112, 115-118, 121-124, 126-128, 130-135, 139-141, 143, 145, and 158.
  • 7. The method of claim 1, wherein said first primer has a nucleotide sequence selected from the group consisting of SEQ ID NOs: 28, 30-37, 39-42, 48, 49, 51-57, 61-64, 72-74, 77-79, 112, 115-118, 121-124, 126-128, 130-135, 140, and 143.
  • 8. The method of claim 1, wherein detecting the presence or absence of said mutation in said region comprises determining the nucleotide sequence of said region using a sequencing primer, wherein said sequencing primer has a nucleotide sequence selected from the group consisting of SEQ ID NOs: 4, 7, 8, 11-17, 21-25, 27, 29, 38, 43-46, 50, 58-60, 64-70, 75, 76, 80-83, 88, 91, 92, 94-97, 99-101, 107-109, 113, 114, 119, 120, 125, 129, 136-138, 142, 144, 148-151, 156, 157, 159, 161, 162, 166-168, 173, 174, and 178.
  • 9. The method of claim 8, wherein said sequencing primer has a nucleotide sequence selected from the group consisting of SEQ ID NOs: 29, 38, 43-44, 50, 58-60, 64-66, 68-70, 75, 76, 80, 83, 113, 114, 119, 120, 125, 129, 136-138, and 142.
  • 10. A method of determining a genetic predisposition for a disease in a human individual caused by a mutation in a region of COL2A1, said method comprising:
  • obtaining a cell from said individual;
  • isolating a nucleic acid from said cell, wherein said nucleic acid comprises said region;
  • amplifying said region using a first primer and a second primer, wherein said first primer and said second primer are selected such that when said first primer has a nucleotide sequence listed in Column 1 of the following table, said second primer has the nucleotide sequence listed on the same row in Column 2 of the table:
  • ______________________________________Column 1 Column 2______________________________________SEQ ID NO: 2 SEQ ID NO: 5SEQ ID NO: 3 SEQ ID NO: 5SEQ ID NO: 6 SEQ ID NO: 9SEQ ID NO: 10 SEQ ID NO: 18SEQ ID NO: 10 SEQ ID NO: 19SEQ ID NO: 20 SEQ ID NO: 26SEQ ID NO: 28 SEQ ID NO: 30SEQ ID NO: 28 SEQ ID NO: 31SEQ ID NO: 32 SEQ ID NO: 35SEQ ID NO: 33 SEQ ID NO: 35SEQ ID NO: 34 SEQ ID NO: 35SEQ ID NO: 32 SEQ ID NO: 36SEQ ID NO: 33 SEQ ID NO: 36SEQ ID NO: 34 SEQ ID NO: 36SEQ ID NO: 37 SEQ ID NO: 40SEQ ID NO: 39 SEQ ID NO: 40SEQ ID NO: 37 SEQ ID NO: 41SEQ ID NO: 39 SEQ ID NO: 41SEQ ID NO: 39 SEQ ID NO: 47SEQ ID NO: 39 SEQ ID NO: 48SEQ ID NO: 42 SEQ ID NO: 47SEQ ID NO: 42 SEQ ID NO: 48SEQ ID NO: 49 SEQ ID NO: 51SEQ ID NO: 49 SEQ ID NO: 52SEQ ID NO: 49 SEQ ID NO: 53SEQ ID NO: 49 SEQ ID NO: 54SEQ ID NO: 55 SEQ ID NO: 61SEQ ID NO: 55 SEQ ID NO: 62SEQ ID NO: 55 SEQ ID NO: 71SEQ ID NO: 55 SEQ ID NO: 72SEQ ID NO: 55 SEQ ID NO: 73SEQ ID NO: 56 SEQ ID NO: 61SEQ ID NO: 56 SEQ ID NO: 62SEQ ID NO: 56 SEQ ID NO: 71SEQ ID NO: 56 SEQ ID NO: 72SEQ ID NO: 56 SEQ ID NO: 73SEQ ID NO: 57 SEQ ID NO: 61SEQ ID NO: 57 SEQ ID NO: 62SEQ ID NO: 57 SEQ ID NO: 71SEQ ID NO: 57 SEQ ID NO: 72SEQ ID NO: 57 SEQ ID NO: 73SEQ ID NO: 63 SEQ ID NO: 61SEQ ID NO: 63 SEQ ID NO: 62SEQ ID NO: 63 SEQ ID NO: 71SEQ ID NO: 63 SEQ ID NO: 72SEQ ID NO: 63 SEQ ID NO: 73SEQ ID NO: 64 SEQ ID NO: 61SEQ ID NO: 64 SEQ ID NO: 62SEQ ID NO: 64 SEQ ID NO: 71SEQ ID NO: 64 SEQ ID NO: 72SEQ ID NO: 64 SEQ ID NO: 73SEQ ID NO: 74 SEQ ID NO: 77SEQ ID NO: 78 SEQ ID NO: 84SEQ ID NO: 78 SEQ ID NO: 85SEQ ID NO: 79 SEQ ID NO: 84SEQ ID NO: 79 SEQ ID NO: 85SEQ ID NO: 86 SEQ ID NO: 84SEQ ID NO: 86 SEQ ID NO: 85SEQ ID NO: 87 SEQ ID NO: 93SEQ ID NO: 89 SEQ ID NO: 93SEQ ID NO: 90 SEQ ID NO: 93SEQ ID NO: 87 SEQ ID NO: 102SEQ ID NO: 89 SEQ ID NO: 102SEQ ID NO: 90 SEQ ID NO: 102SEQ ID NO: 87 SEQ ID NO: 103SEQ ID NO: 89 SEQ ID NO: 103SEQ ID NO: 90 SEQ ID NO: 103SEQ ID NO: 97 SEQ ID NO: 102SEQ ID NO: 97 SEQ ID NO: 103SEQ ID NO: 97 SEQ ID NO: 104SEQ ID NO: 98 SEQ ID NO: 102SEQ ID NO: 98 SEQ ID NO: 103SEQ ID NO: 98 SEQ ID NO: 104SEQ ID NO: 105 SEQ ID NO: 110SEQ ID NO: 106 SEQ ID NO: 110SEQ ID NO: 111 SEQ ID NO: 115SEQ ID NO: 111 SEQ ID NO: 116SEQ ID NO: 112 SEQ ID NO: 115SEQ ID NO: 112 SEQ ID NO: 116SEQ ID NO: 117 SEQ ID NO: 121SEQ ID NO: 117 SEQ ID NO: 122SEQ ID NO: 118 SEQ ID NO: 121SEQ ID NO: 118 SEQ ID NO: 122SEQ ID NO: 123 SEQ ID NO: 131SEQ ID NO: 123 SEQ ID NO: 132SEQ ID NO: 123 SEQ ID NO: 133SEQ ID NO: 124 SEQ ID NO: 131SEQ ID NO: 124 SEQ ID NO: 132SEQ ID NO: 124 SEQ ID NO: 133SEQ ID NO: 126 SEQ ID NO: 131SEQ ID NO: 126 SEQ ID NO: 132SEQ ID NO: 126 SEQ ID NO: 133SEQ ID NO: 127 SEQ ID NO: 131SEQ ID NO: 127 SEQ ID NO: 132SEQ ID NO: 127 SEQ ID NO: 133SEQ ID NO: 128 SEQ ID NO: 131SEQ ID NO: 128 SEQ ID NO: 132SEQ ID NO: 128 SEQ ID NO: 133SEQ ID NO: 130 SEQ ID NO: 131SEQ ID NO: 130 SEQ ID NO: 132SEQ ID NO: 130 SEQ ID NO: 133SEQ ID NO: 134 SEQ ID NO: 139SEQ ID NO: 134 SEQ ID NO: 140SEQ ID NO: 135 SEQ ID NO: 139SEQ ID NO: 135 SEQ ID NO: 140SEQ ID NO: 141 SEQ ID NO: 145SEQ ID NO: 141 SEQ ID NO: 146SEQ ID NO: 143 SEQ ID NO: 145SEQ ID NO: 143 SEQ ID NO: 146SEQ ID NO: 147 SEQ ID NO: 152SEQ ID NO: 147 SEQ ID NO: 153SEQ ID NO: 147 SEQ ID NO: 154SEQ ID NO: 155 SEQ ID NO: 158SEQ ID NO: 160 SEQ ID NO: 163SEQ ID NO: 160 SEQ ID NO: 169SEQ ID NO: 160 SEQ ID NO: 170SEQ ID NO: 164 SEQ ID NO: 163SEQ ID NO: 164 SEQ ID NO: 169SEQ ID NO: 164 SEQ ID NO: 170SEQ ID NO: 165 SEQ ID NO: 163SEQ ID NO: 165 SEQ ID NO: 169SEQ ID NO: 165 SEQ ID NO: 170SEQ ID NO: 171 SEQ ID NO: 175SEQ ID NO: 171 SEQ ID NO: 176SEQ ID NO: 172 SEQ ID NO: 175SEQ ID NO: 172 SEQ ID NO: 176SEQ ID NO: 177 SEQ ID NO: 179.______________________________________
  • 11. The method of claim 10, wherein said first primer and said second primer are selected such that when said first primer has a nucleotide sequence listed in Column 1 of the following table, said second primer has the nucleotide sequence listed on the same row in Column 2 of the table:
  • ______________________________________Column 1 Column 2______________________________________SEQ ID NO: 28 SEQ ID NO: 30SEQ ID NO: 28 SEQ ID NO: 31SEQ ID NO: 32 SEQ ID NO: 35SEQ ID NO: 33 SEQ ID NO: 35SEQ ID NO: 34 SEQ ID NO: 35SEQ ID NO: 32 SEQ ID NO: 36SEQ ID NO: 33 SEQ ID NO: 36SEQ ID NO: 34 SEQ ID NO: 36SEQ ID NO: 37 SEQ ID NO: 40SEQ ID NO: 39 SEQ ID NO: 40SEQ ID NO: 37 SEQ ID NO: 41SEQ ID NO: 39 SEQ ID NO: 41SEQ ID NO: 39 SEQ ID NO: 47SEQ ID NO: 39 SEQ ID NO: 48SEQ ID NO: 42 SEQ ID NO: 47SEQ ID NO: 42 SEQ ID NO: 48SEQ ID NO: 49 SEQ ID NO: 51SEQ ID NO: 49 SEQ ID NO: 52SEQ ID NO: 49 SEQ ID NO: 53SEQ ID NO: 49 SEQ ID NO: 54SEQ ID NO: 55 SEQ ID NO: 61SEQ ID NO: 55 SEQ ID NO: 62SEQ ID NO: 55 SEQ ID NO: 71SEQ ID NO: 55 SEQ ID NO: 72SEQ ID NO: 55 SEQ ID NO: 73SEQ ID NO: 56 SEQ ID NO: 61SEQ ID NO: 56 SEQ ID NO: 62SEQ ID NO: 56 SEQ ID NO: 71SEQ ID NO: 56 SEQ ID NO: 72SEQ ID NO: 56 SEQ ID NO: 73SEQ ID NO: 57 SEQ ID NO: 61SEQ ID NO: 57 SEQ ID NO: 62SEQ ID NO: 57 SEQ ID NO: 71SEQ ID NO: 57 SEQ ID NO: 72SEQ ID NO: 57 SEQ ID NO: 73SEQ ID NO: 63 SEQ ID NO: 61SEQ ID NO: 63 SEQ ID NO: 62SEQ ID NO: 63 SEQ ID NO: 71SEQ ID NO: 63 SEQ ID NO: 72SEQ ID NO: 63 SEQ ID NO: 73SEQ ID NO: 64 SEQ ID NO: 61SEQ ID NO: 64 SEQ ID NO: 62SEQ ID NO: 64 SEQ ID NO: 71SEQ ID NO: 64 SEQ ID NO: 72SEQ ID NO: 64 SEQ ID NO: 73SEQ ID NO: 74 SEQ ID NO: 77SEQ ID NO: 78 SEQ ID NO: 84SEQ ID NO: 78 SEQ ID NO: 85SEQ ID NO: 79 SEQ ID NO: 84SEQ ID NO: 79 SEQ ID NO: 85SEQ ID NO: 86 SEQ ID NO: 84SEQ ID NO: 86 SEQ ID NO: 85SEQ ID NO: 111 SEQ ID NO: 115SEQ ID NO: 111 SEQ ID NO: 116SEQ ID NO: 112 SEQ ID NO. 115SEQ ID NO: 112 SEQ ID NO: 116SEQ ID NO: 117 SEQ ID NO: 121SEQ ID NO: 117 SEQ ID NO: 122SEQ ID NO: 118 SEQ ID NO: 121SEQ ID NO: 118 SEQ ID NO: 122SEQ ID NO: 123 SEQ ID NO: 131SEQ ID NO: 123 SEQ ID NO: 132SEQ ID NO: 123 SEQ ID NO: 133SEQ ID NO: 124 SEQ ID NO: 131SEQ ID NO: 124 SEQ ID NO: 132SEQ ID NO: 124 SEQ ID NO: 133SEQ ID NO: 126 SEQ ID NO: 131SEQ ID NO: 126 SEQ ID NO: 132SEQ ID NO: 126 SEQ ID NO: 133SEQ ID NO: 127 SEQ ID NO: 131SEQ ID NO: 127 SEQ ID NO: 132SEQ ID NO: 127 SEQ ID NO: 133SEQ ID NO: 128 SEQ ID NO: 131SEQ ID NO: 128 SEQ ID NO: 132SEQ ID NO: 128 SEQ ID NO: 133SEQ ID NO: 130 SEQ ID NO: 131SEQ ID NO: 130 SEQ ID NO: 132SEQ ID NO: 130 SEQ ID NO: 133SEQ ID NO: 134 SEQ ID NO: 139SEQ ID NO: 134 SEQ ID NO: 140SEQ ID NO: 135 SEQ ID NO: 139SEQ ID NO: 135 SEQ ID NO: 140SEQ ID NO: 141 SEQ ID NO: 145SEQ ID NO: 141 SEQ ID NO: 146SEQ ID NO: 143 SEQ ID NO: 145SEQ ID NO: 143 SEQ ID NO: 146SEQ ID NO: 155 SEQ ID NO: 158.______________________________________
  • 12. A primer, wherein said primer is complementary to an intronic sequence of COL2A1 and has a nucleotide sequence selected from the group consisting of SEQ ID NOs: 28-44, 47-66, 68-80, 83, 85, 86, 112-143, 145, and 158.
  • 13. The primer of claim 12, wherein said primer has a nucleotide sequence selected from the group consisting of SEQ ID NOs: 28-44, 47-59, 61-66, 69, 70, 72-80, 83, 86, 112-143, 145, and 158.
  • 14. The primer of claim 12, wherein said primer has a nucleotide sequence selected from the group consisting of SEQ ID NOs: 28-44, 48-58, 61-66, 70, 72-80, 83, 86, 112-118, 120-124, 126-138, 140, and 143.
  • 15. The primer of claim 12, wherein said primer has a nucleotide sequence selected from the group consisting of SEQ ID NOs: 28, 30-37, 39-42, 47-49, 51-57, 61-64, 72-74, 77-79, 86, 112, 115-118, 121-124, 126-128, 130-135, 139-141, 143, 145, and 158.
  • 16. The primer of claim 12, wherein said primer has a nucleotide sequence selected from the group consisting of SEQ ID NOs: 28, 30-37, 39-42, 48, 49, 51-57, 61-64, 72-74, 77-79, 112, 115-118, 121-124, 126-128, 130-135, 140, and 143.
  • 17. The method of claim 1, further comprising amplifying said region using a second primer, wherein said second primer has a nucleotide sequence selected from the group consisting of SEQ ID NOs: 1-179.
  • 18. The primer of claim 12, wherein said primer has a nucleotide sequence selected from the group consisting of SEQ ID NOs: 28, 30-37, 39-42, 48, 49, and 51-57.
  • 19. The primer of claim 12, wherein said primer has a nucleotide sequence selected from the group consisting of SEQ ID NOs: 61-64, 72-74, 77-79, 112, 115-118, and 121-124.
  • 20. The primer of claim 12, wherein said primer has a nucleotide sequence selected from the group consisting of SEQ ID NOs: 126-128, 130-135, 140, and 143.
CROSS REFERENCE TO RELATED APPLICATIONS

This application claims priority pursuant to 35 U.S.C. .sctn.363 to PCT Application No. PCT/US93/10964, filed Nov. 12, 1993 and is a continuation of U.S. application Ser. No. 07/977,284, filed Nov. 13, 1992 (now U.S. Pat. No. 5,558,988).

ACKNOWLEDGEMENT OF GOVERNMENT RIGHTS

Research for this invention was supported in part by the National Institutes of Health Grants AR-381 88 and AR-39740. The government may have certain rights in the invention.

PCT Information
Filing Document Filing Date Country Kind 102e Date 371c Date
PCT/US93/10964 11/12/1993 2/3/1995 2/3/1995
Publishing Document Publishing Date Country Kind
WO94/11532 5/26/1994
US Referenced Citations (1)
Number Name Date Kind
5045449 Ala-Kokko et al. Sep 1991
Non-Patent Literature Citations (23)
Entry
Horton et al., Proc. Natl. Acad. Sci. USA 89, 4583-4587 (1992).
Huang et al., Eur. J. Biochem. 195, 593-600 (1991).
Abrams et al., "Comprehensive Detection of Single Base Changes in Human Genomic DNA Using Denaturing Gradient Gel Electrophoresis and a GC Clamp", Genomics 7:463-475 (1990).
Ahmad et al., "Stop Codon in the Procollagen II Gene (COL2A1) in a Family with the Stickler Syndrome (Arthro-Ophthalmopathy)", Proc. Natl. Acad. Sci. USA 88:6624-6627 (1991).
Ala-Kokko and Prockop, "Completion of the Intron-Exon Structure of the Gene for Human Type II Procollagen (COL2A1): Variations in the Nucleotide Sequences of the Alleles from Three Chromosomes", Genomics 8:454-460 (1990).
Ala-Kokko et al. "Single Base Mutation in the Type II Procollagen Gene (COL2A1) as a Cause of Primary Osteoarthritis Associated with a Mild Chondrodysplasia", Proc. Natl. Acad. Sci. USA 87:6565-6568 (1990).
Chan and Cole, "Low Basal Transcription of Genes for Tissue-Specific Collagens By Fibroblasts and Lymphoblastoid Cells", Journal of Biological Chemistry 266(19):12487-12494 (1991).
Cheah et al., "Identification and Characterization of the Human Type II Collagen Gene (COL2A1)", Proc. Natl. Acad. Sci. USA 82:2555-2559 (1985).
Fischer and Lerman, "DNA Fragments Differing by Single Base-Pair Substitutions are Separated in Denaturing Gradient Gels: Correspondence with Melting Theory", Proc. Natl. Acad. Sci. USA 80:1579-1583 (1983).
Knowlton et al., "Genetic Linkage Analysis of Hereditary Arthro-Ophthalmopathy (Stickler Syndrome) and the Type II Procollagen Gene", Am. J. Hum. Genet. 45:681-688 (1989).
Knowlton et al., "Genetic Linkage of a Polymorphism in the Type II Procollagen Gene (COL2A1) to Primary Osteoarthritis Associated with Mild Chondrodysplasia", New England Journal of Medicine 322:526-530 (1990).
Mon-Li Chu, "Collagen: Gene Structure", Connective Tissue and Its Heritable Disorders, CRC Press, Boca Raton, FL Chapter 3, Part II, pp. 149-165 (1993).
Maniatis et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, pp. 280-281 (1982).
Myers et al., "Detection and Localization of Single Base Changes by Denaturing Gradient Gel Electrophoresis", Methods in Enzymology R. Wu, ed., Academic Press, San Diego, 155:501-527 (1987).
Nunez et al., "Promoter Region of the Human Pro-.alpha.1 (II)-Collagen Gene", Gene 44:11-16 (1986).
Ritvaniemi et al., "A Fourth Example Suggests that Premature Termination Codons in the COL2A1 Gene Are a Common Cause of the Stickler Syndrome: Analysis of the COL2A1 Gene by Denaturing Gradient Gel Electrophoresis", Genomics 17:218-221 (1993).
Sanger et al., "DNA Sequencing With Chain-Terminating Inhibitors", Proc. Natl. Acad. Sci. USA 74(12):5463-5467 (1977).
Sangiorgi et al., "Isolation and Partial Characterization of the Entire Human Pro.alpha.1 (II) Collagen Gene", Nucl. Acids Res. 13(7):2207-2225 (1985).
Strom and Upholt, "Isolation and Characterization of Genomic Clones Corresponding to the Human Type II Procollagen Gene", Nucl. Acids Res. 12(2):1025-1038 (1984).
Su et al., "Organization of the Exons Coding for Pro.alpha.1 (II) Collagen N-Propeptide Confirms a Distinct Evolutionary History of this Domain of the Fibrillar Collagen Genes", Genomics 4:438-441 (1989).
Upholt, "The Type II Collagen Gene", Molecular Biology, CRC Press, Boca Raton, FL, 4:31-49 (1989).
Vikkula and Peltonen, "Structural Analyses of the Polymorphic Area in Type II Collagen Gene", FEBS Lett. 250(2):171-174 (1989).
Williams et al., "Detection of Sequence Variants in the Gene for Human Type II Procollagen (COL2A1) by Direct Sequencing of Polymerase Chain Reaction-Amplified Genomic DNA", Human Mutation 1:403-416 (1992).
Continuations (1)
Number Date Country
Parent 977284 Nov 1992