Molecules for disease detection and treatment

TECHNICAL FIELD

The present invention relates to molecules for disease detection and treatment and to the use of these sequences in the diagnosis, study, prevention, and treatment of diseases associated with, as well as effects of exogenous compounds on, the expression of molecules for disease detection and treatment.

BACKGROUND OF THE INVENTION

The human genome is comprised of thousands of genes, many encoding gene products that function in the maintenance and growth of the various cells and tissues in the body. Aberrant expression or mutations in these genes and their products is the cause of, or is associated with, a variety of human diseases such as cancer and other cell proliferative disorders. The identification of these genes and their products is the basis of an ever-expanding effort to find markers for early detection of diseases, and targets for their prevention and treatment.

For example, cancer represents a type of cell proliferative disorder that affects nearly every tissue in the body. A wide variety of molecules, either aberrantly expressed or mutated, can be the cause of, or involved with, various cancers because tissue growth involves complex and ordered patterns of cell proliferation, cell differentiation, and apoptosis. Cell proliferation must be regulated to maintain both the number of cells and their spatial organization. This regulation depends upon the appropriate expression of proteins which control cell cycle progression in response to extracellular signals such as growth factors and other mitogens, and intracellular cues such as DNA damage or nutrient starvation. Molecules which directly or indirectly modulate cell cycle progression fall into several categories, including growth factors and their receptors, second messenger and signal transduction proteins, oncogene products, tumor-suppressor proteins, and mitosis-promoting factors. Aberrant expression or mutations in any of these gene products can result in cell proliferative disorders such as cancer. Oncogenes are genes generally derived from normal genes that, through abnormal expression or mutation, can effect the transformation of a normal cell to a malignant one (oncogenesis). Oncoproteins, encoded by oncogenes, can affect cell proliferation in a variety of ways and include growth factors, growth factor receptors, intracellular signal transducers, nuclear transcription factors, and cell-cycle control proteins. In contrast, tumor-suppressor genes are involved in inhibiting cell proliferation. Mutations which cause reduced or loss of function in tumor-suppressor genes result in aberrant cell proliferation and cancer. Thus a wide variety of genes and their products have been found that are associated with cell proliferative disorders such as cancer, but many more may exist that are yet to be discovered.

DNA-based arrays can provide a simple way to explore the expression of a single polymorphic gene or a large number of genes. When the expression of a single gene is explored, DNA-based arrays are employed to detect the expression of specific gene variants. For example, a p53 tumor suppressor gene array is used to determine whether individuals are carrying mutations that predispose them to cancer. A cytochrome p450 gene array is useful to determine whether individuals have one of a number of specific mutations that could result in increased drug metabolism, drug resistance or drug toxicity.

DNA-based array technology is especially relevant for the rapid screening of expression of a large number of genes. There is a growing awareness that gene expression is affected in a global fashion. A genetic predisposition, disease or therapeutic treatment may affect, directly or indirectly, the expression of a large number of genes. In some cases the interactions may be expected, such as when the genes are part of the same signaling pathway. In other cases, such as when the genes participate in separate signaling pathways, the interactions may be totally unexpected. Therefore, DNA-based arrays can be used to investigate how genetic predisposition, disease, or therapeutic treatment affects the expression of a large number of genes.

The discovery of new molecules for disease detection and treatment satisfies a need in the art by providing new compositions which are useful in the diagnosis, study, prevention, and treatment of diseases associated with, as well as effects of exogenous compounds on, the expression of molecules for disease detection and treatment

SUMMARY OF THE INVENTION

The present invention relates to human disease detection and treatment molecule polynucleotides (mddt) as presented in the Sequence Listing. The mddt uniquely identify genes encoding structural, functional, and regulatory disease detection and treatment molecules.

The invention provides an isolated polynucleotide comprising a polynucleotide sequence selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of SEQ ID NO:1-45; b) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:1-45; c) a polynucleotide sequence complementary to a); d) a polynucleotide sequence complementary to b); and e) an RNA equivalent of a) through d). In one alternative, the polynucleotide comprises a polynucleotide sequence selected from the group consisting of SEQ ID NO:1-45. In another alternative, the polynucleotide comprises at least 60 contiguous nucleotides of a polynucleotide sequence selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of SEQ ID NO:1-45; b) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:1-45; c) a polynucleotide sequence complementary to a); d) a polynucleotide sequence complementary to b); and e) an RNA equivalent of a) through d). The invention further provides a composition for the detection of expression of disease detection and treatment molecule polynucleotides comprising at least one isolated polynucleotide comprising a polynucleotide sequence selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of SEQ ID NO:1-45; b) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:1-45; c) a polynucleotide sequence complementary to a); d) a polynucleotide sequence complementary to b); and e) an RNA equivalent of a) through d); and a detectable label.

The invention also provides a method for detecting a target polynucleotide in a sample, said target polynucleotide comprising a polynucleotide sequence selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of SEQ ID NO:1-45; b) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:1-45; c) a polynucleotide sequence complementary to a); d) a polynucleotide sequence complementary to b); and e) an RNA equivalent of a) through d). The method comprises a) amplifying said target polynucleotide or a fragment thereof using polymerase chain reaction amplification, and b) detecting the presence or absence of said amplified target polynucleotide or fragment thereof, and, optionally, if present, the amount thereof.

The invention also provides a method for detecting a target polynucleotide in a sample, said target polynucleotide comprising a polynucleotide sequence selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of SEQ ID NO:1-45; b) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:1-45; c) a polynucleotide sequence complementary to a); d) a polynucleotide sequence complementary to b); and e) an RNA equivalent of a) through d). The method comprises a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides comprising a sequence complementary to said target polynucleotide in the sample, and which probe specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization complex is formed between said probe and said target polynucleotide, and b) detecting the presence or absence of said hybridization complex, and, optionally, if present, the amount thereof. In one alternative, the probe comprises at least 30 contiguous nucleotides. In another alternative, the probe comprises at least 60 contiguous nucleotides.

The invention further provides a recombinant polynucleotide comprising a promoter sequence operably linked to an isolated polynucleotide comprising a polynucleotide sequence selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of SEQ ID NO:1-45; b) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:1-45; c) a polynucleotide sequence complementary to a); d) a polynucleotide sequence complementary to b); and e) an RNA equivalent of a) through d). In one alternative, the invention provides a cell transformed with the recombinant polynucleotide. In another alternative, the invention provides a transgenic organism comprising the recombinant polynucleotide. In a further alternative, the invention provides a method for producing a disease detection and treatment molecule polypeptide, the method comprising a) culturing a cell under conditions suitable for expression of the disease detection and treatment molecule polypeptide, wherein said cell is transformed with the recombinant polynucleotide, and b) recovering the disease detection and treatment molecule polypeptide so expressed.

The invention also provides a purified disease detection and treatment molecule polypeptide (MDDT) encoded by at least one polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ ID NO:1-45. Additionally, the invention provides an isolated antibody which specifically binds to the disease detection and treatment molecule polypeptide. The invention further provides a method of identifying a test compound which specifically binds to the disease detection and treatment molecule polypeptide, the method comprising the steps of a) providing a test compound; b) combining the disease detection and treatment molecule polypeptide with the test compound for a sufficient time and under suitable conditions for binding; and c) detecting binding of the disease detection and treatment molecule polypeptide to the test compound, thereby identifying the test compound which specifically binds the disease detection and treatment molecule polypeptide.

The invention further provides a microarray wherein at least one element of the microarray is an isolated polynucleotide comprising at least 60 contiguous nucleotides of a polynucleotide comprising a polynucleotide sequence selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of SEQ ID NO:1-45; b) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:1-45; c) a polynucleotide sequence complementary to a); d) a polynucleotide sequence complementary to b); and e) an RNA equivalent of a) through d). The invention also provides a method for generating a transcript image of a sample which contains polynucleotides. The method comprises a) labeling the polynucleotides of the sample, b) contacting the elements of the microarray with the labeled polynucleotides of the sample under conditions suitable for the formation of a hybridization complex, and c) quantifying the expression of the polynucleotides in the sample.

Additionally, the invention provides a method for screening a compound for effectiveness in altering expression of a target polynucleotide, wherein said target polynucleotide comprises a polynucleotide sequence selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of SEQ ID NO:1-45; b) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:1-45; c) a polynucleotide sequence complementary to a); d) a polynucleotide sequence complementary to b); and e) an RNA equivalent of a) through d). The method comprises a) exposing a sample comprising the target polynucleotide to a compound, and b) detecting altered expression of the target polynucleotide, and c) comparing the expression of the target polynucleotide in the presence of varying amounts of the compound and in the absence of the compound.

The invention further provides a method for assessing toxicity of a test compound, said method comprising a) treating a biological sample containing nucleic acids with the test compound; b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at least 20 contiguous nucleotides of a polynucleotide comprising a polynucleotide sequence selected from the group consisting of i) a polynucleotide sequence selected from the group consisting of SEQ ID NO:1-45; ii) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:1-45; iii) a polynucleotide sequence complementary to i), iv) a polynucleotide sequence complementary to ii), and v) an RNA equivalent of i)-iv). Hybridization occurs under conditions whereby a specific hybridization complex is formed between said probe and a target polynucleotide in the biological sample, said target polynucleotide comprising a polynucleotide sequence selected from the group consisting of i) a polynucleotide sequence selected from the group consisting of SEQ ID NO:1-45; ii) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:1-45; iii) a polynucleotide sequence complementary to i), iv) a polynucleotide sequence complementary to ii), and v) an RNA equivalent of i)-iv), and alternatively, the target polynucleotide comprises a fragment of a polynucleotide sequence selected from the group consisting of i)-v) above; c) quantifying the amount of hybridization complex; and d) comparing the amount of hybridization complex in the treated biological sample with the amount of hybridization complex in an untreated biological sample, wherein a difference in the amount of hybridization complex in the treated biological sample is indicative of toxicity of the test compound.

The invention further provides an isolated polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino acid sequence selected from the group consisting of SEQ ID NO:46-90, b) a naturally occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:46-90, c) a biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:46-90, and d) an immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:46-90. In one alternative, the invention provides an isolated polypeptide comprising the amino acid sequence of SEQ ID NO:46-90.

DESCRIPTION OF THE TABLES

Table 1 shows the sequence identification numbers (SEQ ID NO:s) and template identification numbers (template IDs) corresponding to the polynucleotides of the present invention, along with their GenBank hits (GI Numbers), probability scores, and functional annotations corresponding to the GenBank hits.

Table 2 shows the sequence identification numbers (SEQ ID NO:s) and template identification numbers (template IDs) corresponding to the polynucleotides of the present invention, along with polynucleotide segments of each template sequence as defined by the indicated “start” and “stop” nucleotide positions. The reading frames of the polynucleotide segments and the Pfam hits, Pfam descriptions, and E-values corresponding to the polypeptide domains encoded by the polynucleotide segments are indicated.

Table 3 shows the sequence identification numbers (SEQ ID NO:s) and template identification numbers (template IDs) corresponding to the polynucleotides of the present invention, along with polynucleotide segments of each template sequence as defined by the indicated “start” and “stop” nucleotide positions. The reading frames of the polynucleotide segments are shown, and the polypeptides encoded by the polynucleotide segments constitute either signal peptide (SP) or transmembrane (TM) domains, as indicated. The membrane topology of the encoded polypeptide sequence is indicated, the N-terminus (N) listed as being oriented to either the cytosolic (in) or non-cytosolic (out) side of the cell membrane or organelle.

Table 4 shows the sequence identification numbers (SEQ ID NO:s) corresponding to the polynucleotides of the present invention, along with component sequence identification numbers (component IDs) corresponding to each template. The component sequences, which were used to assemble the template sequences, are defined by the indicated “start” and “stop” nucleotide positions along each template.

Table 5 shows the tissue distribution profiles for the templates of the invention.

Table 6 shows the sequence identification numbers (SEQ ID NO:s) corresponding to the polypeptides of the present invention, along with the reading frames used to obtain the polypeptide segments, the lengths of the polypeptide segments, the “start” and “stop” nucleotide positions of the polynucleotide sequences used to define the encoded polypeptide segments, the GenBank hits (GI Numbers), probability scores, and functional annotations corresponding to the GenBank hits.

Table 7 summarizes the bioinformatics tools which are useful for analysis of the polynucleotides of the present invention. The first column of Table 7 lists analytical tools, programs, and algorithms, the second column provides brief descriptions thereof, the third column presents appropriate references, all of which are incorporated by reference herein in their entirety, and the fourth column presents, where applicable, the scores, probability values, and other parameters used to evaluate the strength of a match between two sequences (the higher the score, the greater the homology between two sequences).

DETAILED DESCRIPTION OF THE INVENTION

Before the nucleic acid sequences and methods are presented, it is to be understood that this invention is not limited to the particular machines, methods, and materials described. Although particular embodiments are described, machines, methods, and materials similar or equivalent to these embodiments may be used to practice the invention. The preferred machines, methods, and materials set forth are not intended to limit the scope of the invention which is limited only by the appended claims.

The singular forms “a”, “an”, and “the” include plural reference unless the context clearly dictates otherwise. All technical and scientific terms have the meanings commonly understood by one of ordinary skill in the art. All publications are incorporated by reference for the purpose of describing and disclosing the cell lines, vectors, and methodologies which are presented and which might be used in connection with the invention. Nothing in the specification is to be construed as an admission that the invention is not entitled to antedate such disclosure by virtue of prior invention.

Definitions

As used herein, the lower case “mddt” refers to a nucleic acid sequence, while the upper case “MDDT” refers to an amino acid sequence encoded by mddt. A “full-length” mddt refers to a nucleic acid sequence containing the entire coding region of a gene endogenously expressed in human tissue.

“Adjuvants” are materials such as Freund's adjuvant, mineral gels (aluminum hydroxide), and surface active substances (lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet hemocyanin, and dinitrophenol) which may be administered to increase a host's immunological response.

“Allele” refers to an alternative form of a nucleic acid sequence. Alleles result from a “mutation,” a change or an alternative reading of the genetic code. Any given gene may have none, one, or many allelic forms. Mutations which give rise to alleles include deletions, additions, or substitutions of nucleotides. Each of these changes may occur alone, or in combination with the others, one or more times in a given nucleic acid sequence. The present invention encompasses allelic mddt.

“Amino acid sequence” refers to a peptide, a polypeptide, or a protein of either natural or synthetic origin. The amino acid sequence is not limited to the complete, endogenous amino acid sequence and may be a fragment, epitope, variant, or derivative of a protein expressed by a nucleic acid sequence.

“Amplification” refers to the production of additional copies of a sequence and is carried out using polymerase chain reaction (PCR) technologies well known in the art.

“Antibody” refers to intact molecules as well as to fragments thereof, such as Fab, F(ab′)₂, and Fv fragments, which are capable of binding the epitopic determinant. Antibodies that bind MDDT polypeptides can be prepared using intact polypeptides or using fragments containing small peptides of interest as the immunizing antigen. The polypeptide or peptide used to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the translation of RNA, or synthesized chemically, and can be conjugated to a carrier protein if desired. Commonly used carriers that are chemically coupled to peptides include bovine serum albumin, thyroglobulin, and keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize the animal.

“Antisense sequence” refers to a sequence capable of specifically hybridizing to a target sequence. The antisense sequence may include DNA, RNA, or any nucleic acid mimic or analog such as peptide nucleic acid (PNA); oligonucleotides having modified backbone linkages such as phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides having modified sugar groups such as 2′-methoxyethyl sugars or 2′-methoxyethoxy sugars; or oligonucleotides having modified bases such as 5-methyl cytosine, 2′-deoxyuracil, or 7-deaza-2′-deoxyguanosine.

“Antisense sequence” refers to a sequence capable of specifically hybridizing to a target sequence. The antisense sequence can be DNA, RNA, or any nucleic acid mimic or analog.

“Antisense technology” refers to any technology which relies on the specific hybridization of an antisense sequence to a target sequence.

A “bin” is a portion of computer memory space used by a computer program for storage of data, and bounded in such a manner that data stored in a bin may be retrieved by the program.

“Biologically active” refers to an amino acid sequence having a structural, regulatory, or biochemical function of a naturally occurring amino acid sequence.

“Clone joining” is a process for combining gene bins based upon the bins' containing sequence information from the same clone. The sequences may assemble into a primary gene transcript as well as one or more splice variants.

“Complementary” describes the relationship between two single-stranded nucleic acid sequences that anneal by base-pairing (5′-A-G-T-3′ pairs with its complement 3′-T-C-A-5′).

A “component sequence” is a nucleic acid sequence selected by a computer program such as PHRED and used to assemble a consensus or template sequence from one or more component sequences.

A “consensus sequence” or “template sequence” is a nucleic acid sequence which has been assembled from overlapping sequences, using a computer program for fragment assembly such as the GELVIEW fragment assembly system (Genetics Computer Group (GCG), Madison, Wis.) or using a relational database management system (RDMS).

“Conservative amino acid substitutions” are those substitutions that, when made, least interfere with the properties of the original protein, i.e., the structure and especially the function of the protein is conserved and not significantly changed by such substitutions. The table below shows amino acids which may be substituted for an original amino acid in a protein and which are regarded as conservative substitutions.

Original ResidueConservative SubstitutionAlaGly, SerArgHis, LysAsnAsp, Gln, HisAspAsn, GluCysAla, SerGlnAsn, Glu, HisGluAsp, Gln, HisGlyAlaHisAsn, Arg, Gln, GluIleLeu, ValLeuIle, ValLysArg, Gln, GluMetLeu, IlePheHis, Met, Leu, Trp, TyrSerCys, ThrThrSer, ValTrpPhe, TyrTyrHis, Phe, TrpValIle, Leu, Thr

Conservative substitutions generally maintain (a) the structure of the polypeptide backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation, (b) the charge or hydrophobicity of the molecule at the target site, or (c) the bulk of the side chain.

“Deletion” refers to a change in either a nucleic or amino acid sequence in which at least one nucleotide or amino acid residue, respectively, is absent.

“Derivative” refers to the chemical modification of a nucleic acid sequence, such as by replacement of hydrogen by an alkyl, acyl, amino, hydroxyl, or other group.

The terms “element” and “array element” refer to a polynucleotide, polypeptide, or other chemical compound having a unique and defined position on a microarray.

“E-value” refers to the statistical probability that a match between two sequences occurred by chance.

A “fragment” is a unique portion of mddt or MDDT which is identical in sequence to but shorter in length than the parent sequence. A fragment may comprise up to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For example, a fragment may comprise from 10 to 1000 contiguous amino acid residues or nucleotides. A fragment used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least 5, 10, 15, 16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous amino acid residues or nucleotides in length. Fragments may be preferentially selected from certain regions of a molecule. For example, a polypeptide fragment may comprise a certain length of contiguous amino acids selected from the first 250 or 500 amino acids (or first 25% or 50%) of a polypeptide as shown in a certain defined sequence. Clearly these lengths are exemplary, and any length that is supported by the specification, including the Sequence Listing and the figures, may be encompassed by the present embodiments.

A fragment of mddt comprises a region of unique polynucleotide sequence that specifically identifies mddt, for example, as distinct from any other sequence in the same genome. A fragment of mddt is useful, for example, in hybridization and amplification technologies and in analogous methods that distinguish mddt from related polynucleotide sequences. The precise length of a fragment of mddt and the region of mddt to which the fragment corresponds are routinely determinable by one of ordinary skill in the art based on the intended purpose for the fragment.

A fragment of MDDT is encoded by a fragment of mddt. A fragment of MDDT comprises a region of unique amino acid sequence that specifically identifies MDDT. For example, a fragment of MDDT is useful as an immunogenic peptide for the development of antibodies that specifically recognize MDDT. The precise length of a fragment of MDDT and the region of MDDT to which the fragment corresponds are routinely deteminable by one of ordinary skill in the art based on the intended purpose for the fragment.

A “full length” nucleotide sequence is one containing at least a start site for translation to a protein sequence, followed by an open reading frame and a stop site, and encoding a “full length” polypeptide.

“Hit” refers to a sequence whose annotation will be used to describe a given template. Criteria for selecting the top hit are as follows: if the template has one or more exact nucleic acid matches, the top hit is the exact match with highest percent identity. If the template has no exact matches but has significant protein hits, the top hit is the protein hit with the lowest E-value. If the template has no significant protein hits, but does have significant non-exact nucleotide hits, the top hit is the nucleotide hit with the lowest E-value.

“Homology” refers to sequence similarity either between a reference nucleic acid sequence and at least a fragment of an mddt or between a reference amino acid sequence and a fragment of an MDDT.

“Hybridization” refers to the process by which a strand of nucleotides anneals with a complementary strand through base pairing. Specific hybridization is an indication that two nucleic acid sequences share a high degree of identity. Specific hybridization complexes form under defined annealing conditions, and remain hybridized after the “washing” step. The defined hybridization conditions include the annealing conditions and the washing step(s), the latter of which is particularly important in determining the stringency of the hybridization process, with more stringent conditions allowing less non-specific binding, i.e., binding between pairs of nucleic acid probes that are not perfectly matched. Permissive conditions for annealing of nucleic acid sequences are routinely determinable and may be consistent among hybridization experiments, whereas wash conditions may be varied among experiments to achieve the desired stringency.

Generally, stringency of hybridization is expressed with reference to the temperature under which the wash step is carried out. Generally, such wash temperatures are selected to be about 5° C. to 20° C. lower than the thermal melting point (T_m) for the specific sequence at a defined ionic strength and pH. The T_mis the temperature (under defined ionic strength and pH) at which 50% of the target sequence hybridizes to a perfectly matched probe. An equation for calculating T_mand conditions for nucleic acid hybridization is well known and can be found in Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual, 2^nded., vol. 1-3, Cold Spring Harbor Press, Plainview, N.Y.; specifically see volume 2, chapter 9.

High stringency conditions for hybridization between polynucleotides of the present invention include wash conditions of 68° C. in the presence of about 0.2×SSC and about 0.1% SDS, for 1 hour. Alternatively, temperatures of about 65° C., 60° C., or 55° C. may be used. SSC concentration may be varied from about 0.2 to 2×SSC, with SDS being present at about 0.1%. Typically, blocking reagents are used to block non-specific hybridization. Such blocking reagents include, for instance, denatured salmon sperm DNA at about 100-200 μg/ml. Useful variations on these conditions will be readily apparent to those skilled in the art. Hybridization, particularly under high stringency conditions, may be suggestive of evolutionary similarity between the nucleotides. Such similarity is strongly indicative of a similar role for the nucleotides and their resultant proteins.

Other parameters, such as temperature, salt concentration, and detergent concentration may be varied to achieve the desired stringency. Denaturants, such as formamide at a concentration of about 35-50% v/v, may also be used under particular circumstances, such as RNA:DNA hybridizations. Appropriate hybridization conditions are routinely determinable by one of ordinary skill in the art.

“Immunogenic” describes the potential for a natural, recombinant, or synthetic peptide, epitope, polypeptide, or protein to induce antibody production in appropriate animals, cells, or cell lines.

“Insertion” or “addition” refers to a change in either a nucleic or amino acid sequence in which at least one nucleotide or residue, respectively, is added to the sequence.

“Labeling” refers to the covalent or noncovalent joining of a polynucleotide, polypeptide, or antibody with a reporter molecule capable of producing a detectable or measurable signal.

“Microarray” is any arrangement of nucleic acids, amino acids, antibodies, etc., on a substrate. The substrate may be a solid support such as beads, glass, paper, nitrocellulose, nylon, or an appropriate membrane.

“Linkers” are short stretches of nucleotide sequence which may be added to a vector or an mddt to create restriction endonuclease sites to facilitate cloning. “Polylinkers” are engineered to incorporate multiple restriction enzyme sites and to provide for the use of enzymes which leave 5′ or 3′ overhangs (e.g., BamHI, EcoRI, and HindIII) and those which provide blunt ends (e.g., EcoRV, SnaBI, and StuI).

“Naturally occurring” refers to an endogenous polynucleotide or polypeptide that may be isolated from viruses or prokaryotic or eukaryotic cells.

“Nucleic acid sequence” refers to the specific order of nucleotides joined by phosphodiester bonds in a linear, polymeric arrangement. Depending on the number of nucleotides, the nucleic acid sequence can be considered an oligomer, oligonucleotide, or polynucleotide. The nucleic acid can be DNA, RNA, or any nucleic acid analog, such as PNA, may be of genomic or synthetic origin, may be either double-stranded or single-stranded, and can represent either the sense or antisense (complementary) strand.

“Oligomer” refers to a nucleic acid sequence of at least about 6 nucleotides and as many as about 60 nucleotides, preferably about 15 to 40 nucleotides, and most preferably between about 20 and 30 nucleotides, that may be used in hybridization or amplification technologies. Oligomers may be used as, e.g., primers for PCR, and are usually chemically synthesized.

“Operably linked” refers to the situation in which a first nucleic acid sequence is placed in a functional relationship with the second nucleic acid sequence. For instance, a promoter is operably linked to a coding sequence if the promoter affects the transcription or expression of the coding sequence. Generally, operably linked DNA sequences may be in close proximity or contiguous and, where necessary to join two protein coding regions, in the same reading frame.

“Peptide nucleic acid” (PNA) refers to a DNA mimic in which nucleotide bases are attached to a pseudopeptide backbone to increase stability. PNAs, also designated antigene agents, can prevent gene expression by targeting complementary messenger RNA.

The phrases “percent identity” and “% identity”, as applied to polynucleotide sequences, refer to the percentage of residue matches between at least two polynucleotide sequences aligned using a standardized algorithm Such an algorithm may insert, in a standardized and reproducible way, gaps in the sequences being compared in order to optimize alignment between two sequences, and therefore achieve a more meaningful comparison of the two sequences.

Percent identity between polynucleotide sequences may be determined using the default parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e sequence alignment program. This program is part of the LASERGENE software package, a suite of molecular biological analysis programs (DNASTAR, Madison, Wis.). CLUSTAL V is described in Higgins, D. G. and Sharp, P. M. (1989) CABIOS 5:151-153 and in Higgins, D. G. et al. (1992) CABIOS 8:189-191. For pairwise alignments of polynucleotide sequences, the default parameters are set as follows: Ktuple=2, gap penalty=5, window=4, and “diagonals saved”=4. The “weighted” residue weight table is selected as the default. Percent identity is reported by CLUSTAL V as the “percent similarity” between aligned polynucleotide sequence pairs.

Alternatively, a suite of commonly used and freely available sequence comparison algorithms is provided by the National Center for Biotechnology Information (NCBI) Basic Local Alignment Search Tool (BLAST) (Altschul, S. F. et al. (1990) J. Mol. Biol. 215:403-410), which is available from several sources, including the NCBI, Bethesda, Md., and on the Internet at http://www.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence analysis programs including “blastn,” that is used to determine alignment between a known polynucleotide sequence and other sequences on a variety of databases. Also available is a tool called “BLAST 2 Sequences” that is used for direct pairwise comparison of two nucleotide sequences. “BLAST 2 Sequences” can be accessed and used interactively at http://www.ncbi.nlm.nih.gov/gorf/b12/. The “BLAST 2 Sequences” tool can be used for both blastn and blastp (discussed below). BLAST programs are commonly used with gap and other parameters set to default settings. For example, to compare two nucleotide sequences, one may use blastn with the “BLAST 2 Sequences” tool Version 2.0.9 (May 7, 1999) set at default parameters. Such default parameters may be, for example:

- Matrix: BLOSUM62
- Reward for match: 1
- Penalty for mismatch: −2
- Open Gap: 5 and Extension Gap: 2 penalties
- Gap×drop-off: 50
- Expect: 10
- Word Size: 11
- Filter: on

Percent identity may be measured over the length of an entire defined sequence, for example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for example, over the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at least 20, at least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous nucleotides. Such lengths are exemplary only, and it is understood that any fragment length supported by the sequences shown herein, in figures or Sequence Listings, may be used to describe a length over which percentage identity may be measured.

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes in nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid sequences that all encode substantially the same protein.

The phrases “percent identity” and “% identity”, as applied to polypeptide sequences, refer to the percentage of residue matches between at least two polypeptide sequences aligned using a standardized algorithm Methods of polypeptide sequence alignment are well-known. Some alignment methods take into account conservative amino acid substitutions. Such conservative substitutions, explained in more detail above, generally preserve the hydrophobicity and acidity of the substituted residue, thus preserving the structure (and therefore function) of the folded polypeptide.

Percent identity between polypeptide sequences may be determined using the default parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e sequence alignment program (described and referenced above). For pairwise alignments of polypeptide sequences using CLUSTAL V, the default parameters are set as follows: Ktuple=1, gap penalty=3, window=5, and “diagonals saved”=5. The PAM250 matrix is selected as the default residue weight table. As with polynucleotide alignments, the percent identity is reported by CLUSTAL V as the “percent similarity” between aligned polypeptide sequence pairs.

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise comparison of two polypeptide sequences, one may use the “BLAST 2 Sequences” tool Version 2.0.9 (May 7, 1999) with blastp set at default parameters. Such default parameters may be, for example:

- Matrix: BLOSUM62
- Open Gap: 11 and Extension Gap: 1 penalty
- Gap×drop-off: 50
- Expect: 10
- Word Size: 3
- Filter: on

Percent identity may be measured over the length of an entire defined polypeptide sequence, for example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for example, over the length of a fragment taken from a larger, defined polypeptide sequence, for instance, a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 150 contiguous residues. Such lengths are exemplary only, and it is understood that any fragment length supported by the sequences shown herein, in figures or Sequence Listings, may be used to describe a length over which percentage identity may be measured.

“Post-translational modification” of an MDDT may involve lipidation, glycosylation, phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in the art. These processes may occur synthetically or biochemically. Biochemical modifications will vary by cell type depending on the enzymatic milieu and the MDDT.

“Probe” refers to mddt or fragments thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are isolated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. Typical labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. “Primers” are short nucleic acids, usually DNA oligonucleotides, which may be annealed to a target polynucleotide by complementary base-pairing. The primer may then be extended along the target DNA strand by a DNA polymerase enzyme. Primer pairs can be used for amplification (and identification) of a nucleic acid sequence, e.g., by the polymerase chain reaction (PCR).

Probes and primers as used in the present invention typically comprise at least 15 contiguous nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also be employed, such as probes and primers that comprise at least 20, 30, 40, 50, 60, 70, 80, 90, 100, or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers may be considerably longer than these examples, and it is understood that any length supported by the specification, including the figures and Sequence Listing, may be used.

Methods for preparing and using probes and primers are described in the references, for example Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual, 2^nded., vol. 1-3, Cold Spring Harbor Press, Plainview, N.Y.; Ausubel et al., 1987, Current Protocols in Molecular Biology, Greene Publ. Assoc. & Wiley-Intersciences, New York, N.Y.; Innis et al., 1990, PCR Protocols, A Guide to Methods and Applications, Academic Press, San Diego, Calif. PCR primer pairs can be derived from a known sequence, for example, by using computer programs intended for that purpose such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge, Mass.).

Oligonucleotides for use as primers are selected using software known in the art for such purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 5,000 nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer selection programs have incorporated additional features for expanded capabilities. For example, the PrimOU primer selection program (available to the public from the Genome Center at University of Texas South West Medical Center, Dallas, Tex.) is capable of choosing specific primers from megabase sequences and is thus useful for designing primers on a genome-wide scope. The Primer3 primer selection program (available to the public from the Whitehead Institute/MIT Center for Genome Research, Cambridge, Mass.) allows the user to input a “mispriming library,” in which sequences to avoid as primer binding sites are user-specified. Primer3 is useful, in particular, for the selection of oligonucleotides for microarrays. (The source code for the latter two primer selection programs may also be obtained from their respective sources and modified to meet the user's specific needs.) The PrimeGen program (available to the public from the UK Human Genome Mapping Project Resource Centre, Cambridge UK) designs primers based on multiple sequence alignments, thereby allowing selection of primers that hybridize to either the most conserved or least conserved regions of aligned nucleic acid sequences. Hence, this program is useful for identification of both unique and conserved oligonucleotides and polynucleotide fragments. The oligonucleotides and polynucleotide fragments identified by any of the above selection methods are useful in hybridization technologies, for example, as PCR or sequencing primers, microarray elements, or specific probes to identify fully or partially complementary polynucleotides in a sample of nucleic acids. Methods of oligonucleotide selection are not limited to those described above.

“Purified” refers to molecules, either polynucleotides or polypeptides that are isolated or separated from their natural environment and are at least 60% free, preferably at least 75% free, and most preferably at least 90% free from other compounds with which they are naturally associated.

A “recombinant nucleic acid” is a sequence that is not naturally occurring or has a sequence that is made by an artificial combination of two or more otherwise separated segments of sequence. This artificial combination is often accomplished by chemical synthesis or, more commonly, by the artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques such as those described in Sambrook, supra. The term recombinant includes nucleic acids that have been altered solely by addition, substitution, or deletion of a portion of the nucleic acid. Frequently, a recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter sequence. Such a recombinant nucleic acid may be part of a vector that is used, for example, to transform a cell.

Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a vaccinia virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is expressed, inducing a protective immunological response in the mammal.

“Regulatory element” refers to a nucleic acid sequence from nontranslated regions of a gene, and includes enhancers, promoters, introns, and 3′ untranslated regions, which interact with host proteins to carry out or regulate transcription or translation.

“Reporter” molecules are chemical or biochemical moieties used for labeling a nucleic acid, an amino acid, or an antibody. They include radionuclides; enzymes; fluorescent, chemiluminescent, or chromogenic agents; substrates; cofactors; inhibitors; magnetic particles; and other moieties known in the art.

An “RNA equivalent,” in reference to a DNA sequence, is composed of the same linear sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of the nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose instead of deoxyribose.

“Sample” is used in its broadest sense. Samples may contain nucleic or amino acids, antibodies, or other materials, and may be derived from any source (e.g., bodily fluids including, but not limited to, saliva, blood, and urine; chromosome(s), organelles, or membranes isolated from a cell; genomic DNA, RNA, or cDNA in solution or bound to a substrate; and cleared cells or tissues or blots or imprints from such cells or tissues).

“Specific binding” or “specifically binding” refers to the interaction between a protein or peptide and its agonist, antibody, antagonist, or other binding partner. The interaction is dependent upon the presence of a particular structure of the protein, e.g., the antigenic determinant or epitope, recognized by the binding molecule. For example, if an antibody is specific for epitope “A,” the presence of a polypeptide containing epitope A, or the presence of free unlabeled A, in a reaction containing free labeled A and the antibody will reduce the amount of labeled A that binds to the antibody.

“Substitution” refers to the replacement of at least one nucleotide or amino acid by a different nucleotide or amino acid.

“Substrate” refers to any suitable rigid or semi-rigid support including, e.g., membranes, filters, chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, microparticles or capillaries. The substrate can have a variety of surface forms, such as wells, trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound.

A “transcript image” refers to the collective pattern of gene expression by a particular tissue or cell type under given conditions at a given time.

“Transformation” refers to a process by which exogenous DNA enters a recipient cell. Transformation may occur under natural or artificial conditions using various methods well known in the art. Transformation may rely on any known method for the insertion of foreign nucleic acid sequences into a prokaryotic or eukaryotic host cell. The method is selected based on the host cell being transformed.

“Transformants” include stably transformed cells in which the inserted DNA is capable of replication either as an autonomously replicating plasmid or as part of the host chromosome, as well as cells which transiently express inserted DNA or RNA.

A “transgenic organism,” as used herein, is any organism, including but not limited to animals and plants, in which one or more of the cells of the organism contains heterologous nucleic acid introduced by way of human intervention, such as by transgenic techniques well known in the art. The nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor of the cell, by way of deliberate genetic manipulation, such as by microinjection or by infection with a recombinant virus. The term genetic manipulation does not include classical cross-breeding, or in vitro fertilization, but rather is directed to the introduction of a recombinant DNA molecule. The transgenic organisms contemplated in accordance with the present invention include bacteria, cyanobacteria, fungi, and plants and animals. The isolated DNA of the present invention can be introduced into the host by methods known in the art, for example infection, transfection, transformation or transconjugation. Techniques for transferring the DNA of the present invention into such organisms are widely known and provided in references such as Sambrook et al. (1989), supra.

A “variant” of a particular nucleic acid sequence is defined as a nucleic acid sequence having at least 25% sequence identity to the particular nucleic acid sequence over a certain length of one of the nucleic acid sequences using blastn with the “BLAST 2 Sequences” tool Version 2.0.9 (May 7, 1999) set at default parameters. Such a pair of nucleic acids may show, for example, at least 30%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95% or even at least 98% or greater sequence identity over a certain defined length. The variant may result in “conservative” amino acid changes which do not affect structural and/or chemical properties. A variant may be described as, for example, an “allelic” (as defined above), “splice,” “species,” or “polymorphic” variant. A splice variant may have significant identity to a reference molecule, but will generally have a greater or lesser number of polynucleotides due to alternate splicing of exons during mRNA processing. The corresponding polypeptide may possess additional functional domains or lack domains that are present in the reference molecule. Species variants are polynucleotide sequences that vary from one species to another. The resulting polypeptides generally will have significant amino acid identity relative to each other. A polymorphic variant is a variation in the polynucleotide sequence of a particular gene between individuals of a given species. Polymorphic variants also may encompass “single nucleotide polymorphisms” (SNPs) in which the polynucleotide sequence varies by one base. The presence of SNPs may be indicative of, for example, a certain population, a disease state, or a propensity for a disease state.

In an alternative, variants of the polynucleotides of the present invention may be generated through recombinant methods. One possible method is a DNA shuffling technique such as MOLECULARBREEDING (Maxygen Inc., Santa Clara, Calif.; described in U.S. Pat. No. 5,837,458; Chang, C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F. C. et al. (1999) Nat. Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat. Biotechnol. 14:315-319) to alter or improve the biological properties of MDDT, such as its biological or enzymatic activity or its ability to bind to other molecules or compounds. DNA shuffling is a process by which a library of gene variants is produced using PCR-mediated recombination of gene fragments. The library is then subjected to selection or screening procedures that identify those gene variants with the desired properties. These preferred variants may then be pooled and further subjected to recursive rounds of DNA shuffling and selection/screening. Thus, genetic diversity is created through “artificial” breeding and rapid molecular evolution. For example, fragments of a single gene containing random point mutations may be recombined, screened, and then reshuffled until the desired properties are optimized. Alternatively, fragments of a given gene may be recombined with fragments of homologous genes in the same gene family, either from the same or different species, thereby maximizing the genetic diversity of multiple naturally occurring genes in a directed and controllable manner.

A “variant” of a particular polypeptide sequence is defined as a polypeptide sequence having at least 40% sequence identity to the particular polypeptide sequence over a certain length of one of the polypeptide sequences using blastp with the “BLAST 2 Sequences” tool Version 2.0.9 (May 7, 1999) set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 98% or greater sequence identity over a certain defined length of one of the polypeptides.

THE INVENTION

In a particular embodiment, cDNA sequences derived from human tissues and cell lines were aligned based on nucleotide sequence identity and assembled into “consensus” or “template” sequences which are designated by the template identification numbers (template IDs) in column 2 of Table 1. The sequence identification numbers (SEQ ID NO:s) corresponding to the template IDs are shown in column 1. The template sequences have similarity to GenBank sequences, or “hits,” as designated by the GI Numbers in column 3. The statistical probability of each GenBank hit is indicated by a probability score in column 4, and the functional annotation corresponding to each GenBank hit is listed in column 5.

The invention incorporates the nucleic acid sequences of these templates as disclosed in the Sequence Listing and the use of these sequences in the diagnosis and treatment of disease states characterized by defects in disease detection and treatment molecules. The invention further utilizes these sequences in hybridization and amplification technologies, and in particular, in technologies which assess gene expression patterns correlated with specific cells or tissues and their responses in vivo or in vitro to pharmaceutical agents, toxins, and other treatments. In this manner, the sequences of the present invention are used to develop a transcript image for a particular cell or tissue.

Derivation of Nucleic Acid Sequences

cDNA was isolated from libraries constructed using RNA derived from normal and diseased human tissues and cell lines. The human tissues and cell lines used for cDNA library construction were selected from a broad range of sources to provide a diverse population of cDNAs representative of gene transcription throughout the human body. Descriptions of the human tissues and cell lines used for cDNA library construction are provided in the LIFESEQ database (Incyte Genomics, Inc. (Incyte), Palo Alto, Calif.). Human tissues were broadly selected from, for example, cardiovascular, dermatologic, endocrine, gastrointestinal, hematopoietic/immune system, musculoskeletal, neural, reproductive, and urologic sources.

Cell lines used for cDNA library construction were derived from, for example, leukemic cells, teratocarcinomas, neuroepitheliomas, cervical carcinoma, lung fibroblasts, and endothelial cells. Such cell lines include, for example, THP-1, Jurkat, HUVEC, hNT2, WI38, HeLa, and other cell lines commonly used and available from public depositories (American Type Culture Collection, Manassas, Va.). Prior to mRNA isolation, cell lines were untreated, treated with a pharmaceutical agent such as 5′-aza-2′-deoxycytidine, treated with an activating agent such as lipopolysaccharide in the case of leukocytic cell lines, or, in the case of endothelial cell lines, subjected to shear stress.

Sequencing of the cDNAs

Methods for DNA sequencing are well known in the art. Conventional enzymatic methods employ the Klenow fragment of DNA polymerase I, SEQUENASE DNA polymerase (U.S. Biochemical Corporation, Cleveland, Ohio), Taq polymerase (Applied Biosystems, Foster City, Calif.), thermostable T7 polymerase (Amersham Pharmacia Biotech, Inc. (Amersham Pharmacia Biotech), Piscataway, N.J.), or combinations of polymerases and proofreading exonucleases such as those found in the ELONGASE amplification system (Life Technologies Inc. (Life Technologies), Gaithersburg, Md.), to extend the nucleic acid sequence from an oligonucleotide primer annealed to the DNA template of interest. Methods have been developed for the use of both single-stranded and double-stranded templates. Chain termination reaction products may be electrophoresed on urea-polyacrylamide gels and detected either by autoradiography (for radioisotope-labeled nucleotides) or by fluorescence (for fluorophore-labeled nucleotides). Automated methods for mechanized reaction preparation, sequencing, and analysis using fluorescence detection methods have been developed Machines used to prepare cDNAs for sequencing can include the MICROLAB 2200 liquid transfer system (Hamilton Company (Hamilton), Reno, Nev.), Peltier thermal cycler (PTC200; MJ Research, Inc. (MJ Research), Watertown, Mass.), and ABI CATALYST 800 thermal cycler (Applied Biosystems). Sequencing can be carried out using, for example, the ABI 373 or 377 (Applied Biosystems) or MEGABACE 1000 (Molecular Dynamics, Inc. (Molecular Dynamics), Sunnyvale, Calif.) DNA sequencing systems, or other automated and manual sequencing systems well known in the art.

The nucleotide sequences of the Sequence Listing have been prepared by current, state-of-the-art, automated methods and, as such, may contain occasional sequencing errors or unidentified nucleotides. Such unidentified nucleotides are designated by an N. These infrequent unidentified bases do not represent a hindrance to practicing the invention for those skilled in the art. Several methods employing standard recombinant techniques may be used to correct errors and complete the missing sequence information. (See, e.g., those described in Ausubel, F. M. et al. (1997) Short Protocols in Molecular Biology, John Wiley & Sons, New York, N.Y.; and Sambrook, J. et al. (1989) Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, Plainview, N.Y.)

Assembly of cDNA Sequences

Human polynucleotide sequences may be assembled using programs or algorithms well known in the art. Sequences to be assembled are related, wholly or in part, and may be derived from a single or many different transcripts. Assembly of the sequences can be performed using such programs as PHRAP (Phils Revised Assembly Program) and the GELVIEW fragment assembly system (GCG), or other methods known in the art.

Alternatively, cDNA sequences are used as “component” sequences that are assembled into “template” or “consensus” sequences as follows. Sequence chromatograms are processed, verified, and quality scores are obtained using PHRED. Raw sequences are edited using an editing pathway known as Block 1 (See, e.g., the LIFESEQ Assembled User Guide, Incyte Genomics, Palo Alto, Calif.). A series of BLAST comparisons is performed and low-information segments and repetitive elements (e.g., dinucleotide repeats, Alu repeats, etc.) are replaced by “n's”, or masked, to prevent spurious matches. Mitochondrial and ribosomal RNA sequences are also removed. The processed sequences are then loaded into a relational database management system (RDMS) which assigns edited sequences to existing templates, if available. When additional sequences are added into the RDMS, a process is initiated which modifies existing templates or creates new templates from works in progress (i.e., nonfinal assembled sequences) containing queued sequences or the sequences themselves. After the new sequences have been assigned to templates, the templates can be merged into bins. If multiple templates exist in one bin, the bin can be split and the templates reannotated.

Once gene bins have been generated based upon sequence alignments, bins are “clone joined” based upon clone information. Clone joining occurs when the 5′ sequence of one clone is present in one bin and the 3′ sequence from the same clone is present in a different bin, indicating that the two bins should be merged into a single bin. Only bins which share at least two different clones are merged.

A resultant template sequence may contain either a partial or a full length open reading frame, or all or part of a genetic regulatory element. This variation is due in part to the fact that the full length cDNAs of many genes are several hundred, and sometimes several thousand, bases in length. With current technology, cDNAs comprising the coding regions of large genes cannot be cloned because of vector limitations, incomplete reverse transcription of the mRNA, or incomplete “second strand” synthesis. Template sequences may be extended to include additional contiguous sequences derived from the parent RNA transcript using a variety of methods known to those of skill in the art. Extension may thus be used to achieve the full length coding sequence of a gene.

Analysis of the cDNA Sequences

The cDNA sequences are analyzed using a variety of programs and algorithms which are well known in the art. (See, e.g., Ausubel, 1997, supra, Chapter 7.7; Meyers, R. A. (Ed.) (1995) Molecular Biology and Biotechnology, Wiley VCH, New York, N.Y., pp. 856-853; and Table 7.) These analyses comprise both reading frame determinations, e.g., based on triplet codon periodicity for particular organisms (Fickett, J. W. (1982) Nucleic Acids Res. 10:5303-5318); analyses of potential start and stop codons; and homology searches.

Computer programs known to those of skill in the art for performing computer-assisted searches for amino acid and nucleic acid sequence similarity, include, for example, Basic Local Alignment Search Tool (BLAST; Altschul, S. F. (1993) J. Mol. Evol. 36:290-300; Altschul, S. F. et al. (1990) J. Mol. Biol. 215:403-410). BLAST is especially useful in determining exact matches and comparing two sequence fragments of arbitrary but equal lengths, whose alignment is locally maximal and for which the alignment score meets or exceeds a threshold or cutoff score set by the user (Karlin, S. et al. (1988) Proc. Natl. Acad. Sci. USA 85:841-845). Using an appropriate search tool (e.g., BLAST or HMM), GenBank, SwissProt, BLOCKS, PFAM and other databases may be searched for sequences containing regions of homology to a query mddt or MDDT of the present invention.

Other approaches to the identification, assembly, storage, and display of nucleotide and polypeptide sequences are provided in “Relational Database for Storing Biomolecule Information,” U.S. Ser. No. 08/947,845, filed Oct. 9, 1997; “Project-Based Full-Length Biomolecular Sequence Database,” U.S. Ser. No. 08/811,758, filed Mar. 6, 1997; and “Relational Database and System for Storing Information Relating to Biomolecular Sequences,” U.S. Ser. No. 09/034,807, filed Mar. 4, 1998, all of which are incorporated by reference herein in their entirety.

Protein hierarchies can be assigned to the putative encoded polypeptide based on, e.g., motif, BLAST, or biological analysis. Methods for assigning these hierarchies are described, for example, in “Database System Employing Protein Function Hierarchies for Viewing Biomolecular Sequence Data,” U.S. Ser. No. 08/812,290, filed Mar. 6, 1997, incorporated herein by reference.

Human Disease Detection and Treatment Molecule Sequences

The mddt of the present invention may be used for a variety of diagnostic and therapeutic purposes. For example, an mddt may be used to diagnose a particular condition, disease, or disorder associated with disease detection and treatment molecules. Such conditions, diseases, and disorders include, but are not limited to, a cell proliferative disorder, such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, and cancers including adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, a cancer of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; and an autoimmune/inflammatory disorder, such as actinic keratosis, acquired immunodeficiency syndrome (AIDS), Addison's disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, arteriosclerosis, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, bronchitis, bursitis, cholecystitis, cirrhosis, contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, eryduroblastosis fetalis, erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's thyroiditis, paroxysmal nocturnal hemoglobinuria, hepatitis, hypereosinophilia, irritable bowel syndrome, episodic lymphopenia with lymphocytotoxins, mixed connective tissue disease (MCTD), multiple sclerosis, myasthenia gravis, myocardial or pericardial inflammation, myelofibrosis, osteoartritis, osteoporosis, pancreatitis, polycythemia vera, polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjögren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, primary thrombocythemia, thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, complications of cancer, hemodialysis, and extracorporeal circulation, trauma, and hematopoietic cancer including lymphoma, leukemia, and myeloma. The mddt can be used to detect the presence of, or to quantify the amount of, an mddt-related polynucleotide in a sample. This information is then compared to information obtained from appropriate reference samples, and a diagnosis is established. Alternatively, a polynucleotide complementary to a given mddt can inhibit or inactivate a therapeutically relevant gene related to the mddt.

Analysis of mddt Expression Patterns

The expression of mddt may be routinely assessed by hybridization-based methods to determine, for example, the tissue-specificity, disease-specificity, or developmental stage-specificity of mddt expression. For example, the level of expression of mddt may be compared among different cell types or tissues, among diseased and normal cell types or tissues, among cell types or tissues at different developmental stages, or among cell types or tissues undergoing various treatments. This type of analysis is useful, for example, to assess the relative levels of mddt expression in fully or partially differentiated cells or tissues, to determine if changes in mddt expression levels are correlated with the development or progression of specific disease states, and to assess the response of a cell or tissue to a specific therapy, for example, in pharmacological or toxicological studies. Methods for the analysis of mddt expression are based on hybridization and amplification technologies and include membrane-based procedures such as northern blot analysis, high-throughput procedures that utilize, for example, microarrays, and PCR-based procedures.

Hybridization and Genetic Analysis

The mddt, their fragments, or complementary sequences, may be used to identify the presence of and/or to determine the degree of similarity between two (or more) nucleic acid sequences. The mddt may be hybridized to naturally occurring or recombinant nucleic acid sequences under appropriately selected temperatures and salt concentrations. Hybridization with a probe based on the nucleic acid sequence of at least one of the mddt allows for the detection of nucleic acid sequences, including genomic sequences, which are identical or related to the mddt of the Sequence Listing. Probes may be selected from non-conserved or unique regions of at least one of the polynucleotides of SEQ ID NO:1-45 and tested for their ability to identify or amplify the target nucleic acid sequence using standard protocols.

Polynucleotide sequences that are capable of hybridizing, in particular, to those shown in SEQ ID NO:1-45 and fragments thereof, can be identified using various conditions of stringency. (See, e.g., Wahl, G. M. and S. L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A. R. (1987) Methods Enzymol. 152:507-511.) Hybridization conditions are discussed in “Definitions.”

A probe for use in Souther or northern hybridization may be derived from a fragment of an mddt sequence, or its complement, that is up to several hundred nucleotides in length and is either single-stranded or double-stranded. Such probes may be hybridized in solution to biological materials such as plasmids, bacterial, yeast, or human artificial chromosomes, cleared or sectioned tissues, or to artificial substrates containing mddt. Microarrays are particularly suitable for identifying the presence of and detecting the level of expression for multiple genes of interest by examining gene expression correlated with, e.g., various stages of development, treatment with a drug or compound, or disease progression. An array analogous to a dot or slot blot may be used to arrange and link polynucleotides to the surface of a substrate using one or more of the following: mechanical (vacuum), chemical, thermal, or UV bonding procedures. Such an array may contain any number of mddt and may be produced by hand or by using available devices, materials, and machines.

Microarrays may be prepared, used, and analyzed using methods known in the art. (See, e.g., Brennan, T. M. et al. (1995) U.S. Pat. No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. Acad. Sci. USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application WO95/251116; Shalon, D. et al. (1995) PCT application WO95/35505; Heller, R. A. et al. (1997) Proc. Natl. Acad. Sci. USA 94:2150-2155; and Heller, M. J. et al. (1997) U.S. Pat. No. 5,605,662.)

Probes may be labeled by either PCR or enzymatic techniques using a variety of commercially available reporter molecules. For example, commercial kits are available for radioactive and chemiluminescent labeling (Amersham Pharmacia Biotech) and for alkaline phosphatase labeling (Life Technologies). Alternatively, mddt may be cloned into commercially available vectors for the production of RNA probes. Such probes may be transcribed in the presence of at least one labeled nucleotide (e.g., ³²P-ATP, Amersham Pharmacia Biotech).

Additionally the polynucleotides of SEQ ID NO:1-45 or suitable fragments thereof can be used to isolate full length cDNA sequences utilizing hybridization and/or amplification procedures well known in the art, e.g., cDNA library screening, PCR amplification, etc. The molecular cloning of such full length cDNA sequences may employ the method of cDNA library screening with probes using the hybridization, stringency, washing, and probing strategies described above and in Ausubel, supra, Chapters 3, 5, and 6. These procedures may also be employed with genomic libraries to isolate genomic sequences of mddt in order to analyze, e.g., regulatory elements.

Genetic Mapping

Gene identification and mapping are important in the investigation and treatment of almost all conditions, diseases, and disorders. Cancer, cardiovascular disease, Alzheimer's disease, arthritis, diabetes, and mental illnesses are of particular interest. Each of these conditions is more complex than the single gene defects of sickle cell anemia or cystic fibrosis, with select groups of genes being predictive of predisposition for a particular condition, disease, or disorder. For example, cardiovascular disease may result from malfunctioning receptor molecules that fail to clear cholesterol from the bloodstream, and diabetes may result when a particular individual's immune system is activated by an infection and attacks the insulin-producing cells of the pancreas. In some studies, Alzheimer's disease has been linked to a gene on chromosome 21; other studies predict a different gene and location. Mapping of disease genes is a complex and reiterative process and generally proceeds from genetic linkage analysis to physical mapping.

As a condition is noted among members of a family, a genetic linkage map traces parts of chromosomes that are inherited in the same pattern as the condition. Statistics link the inheritance of particular conditions to particular regions of chromosomes, as defined by RFLP or other markers. (See, for example, Lander, E. S. and Botstein, D. (1986) Proc. Natl. Acad. Sci. USA 83:7353-7357.) Occasionally, genetic markers and their locations are known from previous studies. More often, however, the markers are simply stretches of DNA that differ among individuals. Examples of genetic linkage maps can be found in various scientific journals or at the Online Mendelian Inheritance in Man (OMIM) World Wide Web site.

In another embodiment of the invention, mddt sequences may be used to generate hybridization probes useful in chromosomal mapping of naturally occurring genomic sequences. Either coding or noncoding sequences of mddt may be used, and in some instances, noncoding sequences may be preferable over coding sequences. For example, conservation of an mddt coding sequence among members of a multi-gene family may potentially cause undesired cross hybridization during chromosomal mapping. The sequences may be mapped to a particular chromosome, to a specific region of a chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes (HACs), yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial P1 constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, J. J. et al. (1997) Nat. Genet. 15:345-355; Price, C. M. (1993) Blood Rev. 7:127-134; and Trask, B. J. (1991) Trends Genet. 7:149-154.)

Fluorescent in situ hybridization (FISH) may be correlated with other physical chromosome mapping techniques and genetic map data. (See, e.g., Meyers, supra, pp. 965-968.) Correlation between the location of mddt on a physical chromosomal map and a specific disorder, or a predisposition to a specific disorder, may help define the region of DNA associated with that disorder. The mddt sequences may also be used to detect polymorphisms that are genetically liked to the inheritance of a particular condition, disease, or disorder.

In situ hybridization of chromosomal preparations and genetic mapping techniques, such as linkage analysis using established chromosomal markers, may be used for extending existing genetic maps. Often the placement of a gene on the chromosome of another mammalian species, such as mouse, may reveal associated markers even if the number or arm of the corresponding human chromosome is not known. These new marker sequences can be mapped to human chromosomes and may provide valuable information to investigators searching for disease genes using positional cloning or other gene discovery techniques. Once a disease or syndrome has been crudely correlated by genetic linkage with a particular genomic region, e.g., ataxia-telangiectasia to 11q22-23, any sequences mapping to that area may represent associated or regulatory genes for further investigation. (See, e.g., Gatti, R. A. et al. (1988) Nature 336:577-580.) The nucleotide sequences of the subject invention may also be used to detect differences in chromosomal architecture due to translocation, inversion, etc., among normal, carrier, or affected individuals.

Once a disease-associated gene is mapped to a chromosomal region, the gene must be cloned in order to identify mutations or other alterations (e.g., translocations or inversions) that may be correlated with disease. This process requires a physical map of the chromosomal region containing the disease-gene of interest along with associated markers. A physical map is necessary for determining the nucleotide sequence of and order of marker genes on a particular chromosomal region. Physical mapping techniques are well known in the art and require the generation of overlapping sets of cloned DNA fragments from a particular organelle, chromosome, or genome. These clones are analyzed to reconstruct and catalog their order. Once the position of a marker is determined, the DNA from that region is obtained by consulting the catalog and selecting clones from that region. The gene of interest is located through positional cloning techniques using hybridization or similar methods.

Diagnostic Uses

The mddt of the present invention may be used to design probes useful in diagnostic assays. Such assays, well known to those skilled in the art, may be used to detect or confirm conditions, disorders, or diseases associated with abnormal levels of mddt expression. Labeled probes developed from mddt sequences are added to a sample under hybridizing conditions of desired stringency. In some instances, mddt, or fragments or oligonucleotides derived from mddt, may be used as primers in amplification steps prior to hybridization. The amount of hybridization complex formed is quantified and compared with standards for that cell or tissue. If mddt expression varies significantly from the standard, the assay indicates the presence of the condition, disorder, or disease. Qualitative or quantitative diagnostic methods may include northern, dot blot, or other membrane or dip-stick based technologies or multiple-sample format technologies such as PCR, enzyme-linked immunosorbent assay (ELISA)-like, pin, or chip-based assays.

The probes described above may also be used to monitor the progress of conditions, disorders, or diseases associated with abnormal levels of mddt expression, or to evaluate the efficacy of a particular therapeutic treatment The candidate probe may be identified from the mddt that are specific to a given human tissue and have not been observed in GenBank or other genome databases. Such a probe may be used in animal studies, preclinical tests, clinical trials, or in monitoring the treatment of an individual patient In a typical process, standard expression is established by methods well known in the art for use as a basis of comparison, samples from patients affected by the disorder or disease are combined with the probe to evaluate any deviation from the standard profile, and a therapeutic agent is administered and effects are monitored to generate a treatment profile. Efficacy is evaluated by determining whether the expression progresses toward or returns to the standard normal pattern. Treatment profiles may be generated over a period of several days or several months. Statistical methods well known to those skilled in the art may be use to determine the significance of such therapeutic agents.

The polynucleotides are also useful for identifying individuals from minute biological samples, for example, by matching the RFLP pattern of a sample's DNA to that of an individual's DNA. The polynucleotides of the present invention can also be used to determine the actual base-by-base DNA sequence of selected portions of an individual's genome. These sequences can be used to prepare PCR primers for amplifying and isolating such selected DNA, which can then be sequenced. Using this technique, an individual can be identified through a unique set of DNA sequences. Once a unique ID database is established for an individual, positive identification of that individual can be made from extremely small tissue samples.

In a particular aspect, oligonucleotide primers derived from the mddt of the invention may be used to detect single nucleotide polymorphisms (SNPs). SNPs are substitutions, insertions and deletions that are a frequent cause of inherited or acquired genetic disease in humans. Methods of SNP detection include, but are not limited to, single-stranded conformation polymorphism (SSCP) and fluorescent SSCP (fSSCP) methods. In SSCP, oligonucleotide primers derived from mddt are used to amplify DNA using the polymerase chain reaction (PCR). The DNA may be derived, for example, from diseased or normal tissue, biopsy samples, bodily fluids, and the like. SNPs in the DNA cause differences in the secondary and tertiary structures of PCR products in single-stranded form, and these differences are detectable using gel electrophoresis in non-denaturing gels. In fSCCP, the oligonucleotide primers are fluorescently labeled, which allows detection of the amplimers in high-throughput equipment such as DNA sequencing machines. Additionally, sequence database analysis methods, termed in silico SNP (isSNP), are capable of identifying polymorphisms by comparing the sequences of individual overlapping DNA fragments which assemble into a common consensus sequence. These computer-based methods filter out sequence variations due to laboratory preparation of DNA and sequencing errors using statistical models and automated analyses of DNA sequence chromatograms. In the alternative, SNPs may be detected and characterized by mass spectrometry using, for example, the high throughput MASSARRAY system (Sequenom, Inc., San Diego, Calif.).

DNA-based identification techniques are critical in forensic technology. DNA sequences taken from very small biological samples such as tissues, e.g., hair or skin, or body fluids, e.g., blood, saliva, semen, etc., can be amplified using, e.g., PCR, to identify individuals. (See, e.g., Erlich, H. (1992) PCR Technology, Freeman and Co., New York, N.Y.). Similarly, polynucleotides of the present invention can be used as polymorphic markers.

There is also a need for reagents capable of identifying the source of a particular tissue. Appropriate reagents can comprise, for example, DNA probes or primers prepared from the sequences of the present invention that are specific for particular tissues. Panels of such reagents can identify tissue by species and/or by organ type. In a similar fashion, these reagents can be used to screen tissue cultures for contamination.

The polynucleotides of the present invention can also be used as molecular weight markers on nucleic acid gels or Southern blots, as diagnostic probes for the presence of a specific mRNA in a particular cell type, in the creation of subtracted cDNA libraries which aid in the discovery of novel polynucleotides, in selection and synthesis of oligomers for attachment to an array or other support, and as an antigen to elicit an immune response.

Disease Model Systems Using mddt

The mddt of the invention or their mammalian homologs may be “knocked out” in an animal model system using homologous recombination in embryonic stem (ES) cells. Such techniques are well known in the art and are useful for the generation of animal models of human disease. (See, e.g., U.S. Pat. No. 5,175,383 and U.S. Pat. No. 5,767,337.) For example, mouse ES cells, such as the mouse 129/SvJ cell line, are derived from the early mouse embryo and grown in culture. The ES cells are transformed with a vector containing the gene of interest disrupted by a marker gene, e.g., the neomycin phosphotransferase gene (neo; Capecchi, M. R. (1989) Science 244:1288-1292). The vector integrates into the corresponding region of the host genome by homologous recombination. Alternatively, homologous recombination takes place using the Cre-loxP system to knockout a gene of interest in a tissue- or developmental stage-specific manner (Marth, J. D. (1996) Clin. Invest. 97:1999-2002; Wagner, K. U. et al. (1997) Nucleic Acids Res. 25:4323-4330). Transformed ES cells are identified and microinjected into mouse cell blastocysts such as those from the C57BL/6 mouse strain. The blastocysts are surgically transferred to pseudopregnant dams, and the resulting chimeric progeny are genotyped and bred to produce heterozygous or homozygous strains. Transgenic animals thus generated may be tested with potential therapeutic or toxic agents.

The mddt of the invention may also be manipulated in vitro in ES cells derived from human blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell lineages including endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J. A. et al. (1998) Science 282:1145-1147).

The mddt of the invention can also be used to create “knockin” humanized animals (pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology, a region of mddt is injected into animal ES cells, and the injected sequence integrates into the animal cell genome. Transformed cells are injected into blastulae, and the blastulae are implanted as described above. Transgenic progeny or inbred lines are studied and treated with potential pharmaceutical agents to obtain information on treatment of a human disease. Alternatively, a mammal inbred to overexpress mddt, resulting, e.g., in the secretion of MDDT in its milk, may also serve as a convenient source of that protein (Janne, J. et al. (1998) Biotechnol. Annu. Rev. 4:55-74).

Screening Assays

MDDT encoded by polynucleotides of the present invention may be used to screen for molecules that bind to or are bound by the encoded polypeptides. The binding of the polypeptide and the molecule may activate (agonist), increase, inhibit (antagonist), or decrease activity of the polypeptide or the bound molecule. Examples of such molecules include antibodies, oligonucleotides, proteins (e.g., receptors), or small molecules.

Preferably, the molecule is closely related to the natural ligand of the polypeptide, e.g., a ligand or fragment thereof, a natural substrate, or a structural or functional mimetic. (See, Coligan et al., (1991) Current Protocols in Immunology 1(2): Chapter 5.) Similarly, the molecule can be closely related to the natural receptor to which the polypeptide binds, or to at least a fragment of the receptor, e.g., the active site. In either case, the molecule can be rationally designed using known techniques. Preferably, the screening for these molecules involves producing appropriate cells which express the polypeptide, either as a secreted protein or on the cell membrane. Preferred cells include cells from mammals, yeast, Drosophila, or E. coli. Cells expressing the polypeptide or cell membrane fractions which contain the expressed polypeptide are then contacted with a test compound and binding, stimulation, or inhibition of activity of either the polypeptide or the molecule is analyzed.

An assay may simply test binding of a candidate compound to the polypeptide, wherein binding is detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label. Alternatively, the assay may assess binding in the presence of a labeled competitor.

Additionally, the assay can be carried out using cell-free preparations, polypeptide/molecule affixed to a solid support, chemical libraries, or natural product mixtures. The assay may also simply comprise the steps of mixing a candidate compound with a solution containing a polypeptide, measuring polypeptide/molecule activity or binding, and comparing the polypeptide/molecule activity or binding to a standard.

Preferably, an ELISA assay using, e.g., a monoclonal or polyclonal antibody, can measure polypeptide level in a sample. The antibody can measure polypeptide level by either binding, directly or indirectly, to the polypeptide or by competing with the polypeptide for a substrate.

All of the above assays can be used in a diagnostic or prognostic context. The molecules discovered using these assays can be used to treat disease or to bring about a particular result in a patient (e.g., blood vessel growth) by activating or inhibiting the polypeptide/molecule. Moreover, the assays can discover agents which may inhibit or enhance the production of the polypeptide from suitably manipulated cells or tissues.

Transcript Imaging and Toxicological Testing

Another embodiment relates to the use of mddt to develop a transcript image of a tissue or cell type. A transcript image represents the global pattern of gene expression by a particular tissue or cell type. Global gene expression patterns are analyzed by quantifying the number of expressed genes and their relative abundance under given conditions and at a given time. (See Seilhamer et al., “Comparative Gene Transcript Analysis,” U.S. Pat. No. 5,840,484, expressly incorporated by reference herein.) Thus a transcript image may be generated by hybridizing the polynucleotides of the present invention or their complements to the totality of transcripts or reverse transcripts of a particular tissue or cell type. In one embodiment, the hybridization takes place in high-throughput format, wherein the polynucleotides of the present invention or their complements comprise a subset of a plurality of elements on a microarray. The resultant transcript image would provide a profile of gene activity pertaining to disease detection and treatment molecules.

Transcript images which profile mddt expression may be generated using transcripts isolated from tissues, cell lines, biopsies, or other biological samples. The transcript image may thus reflect mddt expression in vivo, as in the case of a tissue or biopsy sample, or in vitro, as in the case of a cell line.

Transcript images which profile mddt expression may also be used in conjunction with in vitro model systems and preclinical evaluation of pharmaceuticals, as well as toxicological testing of industrial and naturally-occurring environmental compounds. All compounds induce characteristic gene expression patterns, frequently termed molecular fingerprints or toxicant signatures, which are indicative of mechanisms of action and toxicity (Nuwaysir, E. F. et al. (1999) Mol. Carcinog. 24:153-159; Steiner, S. and Anderson, N. L. (2000) Toxicol. Lett. 112-113:467-71, expressly incorporated by reference herein). If a test compound has a signature similar to that of a compound with known toxicity, it is likely to share those toxic properties. These fingerprints or signatures are most useful and refined when they contain expression information from a large number of genes and gene families. Ideally, a genome-wide measurement of expression provides the highest quality signature. Even genes whose expression is not altered by any tested compounds are important as well, as the levels of expression of these genes are used to normalize the rest of the expression data. The normalization procedure is useful for comparison of expression data after treatment with different compounds. While the assignment of gene function to elements of a toxicant signature aids in interpretation of toxicity mechanisms, knowledge of gene function is not necessary for the statistical matching of signatures which leads to prediction of toxicity. (See, for example, Press Release 00-02 from the National Institute of Environmental Health Sciences, released Feb. 29, 2000, available at http://www.niehs.nih.gov/oc/news/toxchip.htm.) Therefore, it is important and desirable in toxicological screening using toxicant signatures to include all expressed gene sequences.

In one embodiment, the toxicity of a test compound is assessed by treating a biological sample containing nucleic acids with the test compound Nucleic acids that are expressed in the treated biological sample are hybridized with one or more probes specific to the polynucleotides of the present invention, so that transcript levels corresponding to the polynucleotides of the present invention may be quantified. The transcript levels in the treated biological sample are compared with levels in an untreated biological sample. Differences in the transcript levels between the two samples are indicative of a toxic response caused by the test compound in the treated sample.

Another particular embodiment relates to the use of MDDT encoded by polynucleotides of the present invention to analyze the proteome of a tissue or cell type. The term proteome refers to the global pattern of protein expression in a particular tissue or cell type. Each protein component of a proteome can be subjected individually to further analysis. Proteome expression patterns, or profiles, are analyzed by quantifying the number of expressed proteins and their relative abundance under given conditions and at a given time. A profile of a cell's proteome may thus be generated by separating and analyzing the polypeptides of a particular tissue or cell type. In one embodiment, the separation is achieved using two-dimensional gel electrophoresis, in which proteins from a sample are separated by isoelectric focusing in the first dimension, and then according to molecular weight by sodium dodecyl sulfate slab gel electrophoresis in the second dimension (Steiner and Anderson, supra). The proteins are visualized in the gel as discrete and uniquely positioned spots, typically by staining the gel with an agent such as Coomassie Blue or silver or fluorescent stains. The optical density of each protein spot is generally proportional to the level of the protein in the sample. The optical densities of equivalently positioned protein spots from different samples, for example, from biological samples either treated or untreated with a test compound or therapeutic agent, are compared to identify any changes in protein spot density related to the treatment. The proteins in the spots are partially sequenced using, for example, standard methods employing chemical or enzymatic cleavage followed by mass spectrometry. The identity of the protein in a spot may be determined by comparing its partial sequence, preferably of at least 5 contiguous amino acid residues, to the polypeptide sequences of the present invention. In some cases, further sequence data may be obtained for definitive protein identification.

A proteomic profile may also be generated using antibodies specific for MDDT to quantify the levels of MDDT expression. In one embodiment, the antibodies are used as elements on a microarray, and protein expression levels are quantified by exposing the microarray to the sample and detecting the levels of protein bound to each array element (Lueking, A. et al. (1999) Anal. Biochem. 270:103-11; Mendoze, L. G. et al. (1999) Biotechniques 27:778-88). Detection may be performed by a variety of methods known in the art, for example, by reacting the proteins in the sample with a thiol- or amino-reactive fluorescent compound and detecting the amount of fluorescence bound at each array element.

Toxicant signatures at the proteome level are also useful for toxicological screening, and should be analyzed in parallel with toxicant signatures at the transcript level. There is a poor correlation between transcript and protein abundances for some proteins in some tissues (Anderson, N. L. and Seilhamer, J. (1997) Electrophoresis 18:533-537), so proteome toxicant signatures may be useful in the analysis of compounds which do not significantly affect the transcript image, but which alter the proteomic profile. In addition, the analysis of transcripts in body fluids is difficult, due to rapid degradation of mRNA, so proteomic profiling may be more reliable and informative in such cases.

In another embodiment, the toxicity of a test compound is assessed by treating a biological sample containing proteins with the test compound Proteins that are expressed in the treated biological sample are separated so that the amount of each protein can be quantified. The amount of each protein is compared to the amount of the corresponding protein in an untreated biological sample. A difference in the amount of protein between the two samples is indicative of a toxic response to the test compound in the treated sample. Individual proteins are identified by sequencing the amino acid residues of the individual proteins and comparing these partial sequences to the MDDT encoded by polynucleotides of the present invention.

In another embodiment, the toxicity of a test compound is assessed by treating a biological sample containing proteins with the test compound. Proteins from the biological sample are incubated with antibodies specific to the MDDT encoded by polynucleotides of the present invention. The amount of protein recognized by the antibodies is quantified. The amount of protein in the treated biological sample is compared with the amount in an untreated biological sample. A difference in the amount of protein between the two samples is indicative of a toxic response to the test compound in the treated sample.

Transcript images may be used to profile mddt expression in distinct tissue types. This process can be used to determine disease detection and treatment molecule activity in a particular tissue type relative to this activity in a different tissue type. Transcript images may be used to generate a profile of mddt expression characteristic of diseased tissue. Transcript images of tissues before and after treatment may be used for diagnostic purposes, to monitor the progression of disease, and to monitor the efficacy of drug treatments for diseases which affect the activity of disease detection and treatment molecules.

Transcript images of cell lines can be used to assess disease detection and treatment molecule activity and/or to identify cell lines that lack or misregulate this activity. Such cell lines may then be treated with pharmaceutical agents, and a transcript image following treatment may indicate the efficacy of these agents in restoring desired levels of this activity. A similar approach may be used to assess the toxicity of pharmaceutical agents as reflected by undesirable changes in disease detection and treatment molecule activity. Candidate pharmaceutical agents may be evaluated by comparing their associated transcript images with those of pharmaceutical agents of known effectiveness.

Antisense Molecules

The polynucleotides of the present invention are useful in antisense technology. Antisense technology or therapy relies on the modulation of expression of a target protein through the specific binding of an antisense sequence to a target sequence encoding the target protein or directing its expression. (See, e.g., Agrawal, S., ed. (1996) Antisense Therapeutics, Humana Press Inc., Totawa, N.J.; Alama, A. et al. (1997) Pharmacol. Res. 36(3):171-178; Crooke, S. T. (1997) Adv. Pharmacol. 40:1-49; Sharma, H. W. and R. Narayanan (1995) Bioessays 17(12):1055-1063; and Lavrosky, Y. et al. (1997) Biochem. Mol. Med. 62(1):11-22.) An antisense sequence is a polynucleotide sequence capable of specifically hybridizing to at least a portion of the target sequence. Antisense sequences bind to cellular mRNA and/or genomic DNA, affecting translation and/or transcription. Antisense sequences can be DNA, RNA, or nucleic acid mimics and analogs. (See, e.g., Rossi, J. J. et al. (1991) Antisense Res. Dev. 1(3):285-288; Lee, R. et al. (1998) Biochemistry 37(3):900-1010; Pardridge, W. M. et al. (1995) Proc. Natl. Acad. Sci. USA 92(12):5592-5596; and Nielsen, P. E. and Haaima, G. (1997) Chem. Soc. Rev. 96:73-78.) Typically, the binding which results in modulation of expression occurs through hybridization or binding of complementary base pairs. Antisense sequences can also bind to DNA duplexes through specific interactions in the major groove of the double helix.

The polynucleotides of the present invention and fragments thereof can be used as antisense sequences to modify the expression of the polypeptide encoded by mddt The antisense sequences can be produced ex vivo, such as by using any of the ABI nucleic acid synthesizer series (Applied Biosystems) or other automated systems known in the art. Antisense sequences can also be produced biologically, such as by transforming an appropriate host cell with an expression vector containing the sequence of interest. (See, e.g., Agrawal, supra.)

In therapeutic use, any gene delivery system suitable for introduction of the antisense sequences into appropriate target cells can be used. Antisense sequences can be delivered intracellularly in the form of an expression plasmid which, upon transcription, produces a sequence complementary to at least a portion of the cellular sequence encoding the target protein. (See, e.g., Slater, J. E., et al. (1998) J. Allergy Clin. Immunol. 102(3):469-475; and Scanlon, K. J., et al. (1995) 9(13):1288-1296.) Antisense sequences can also be introduced intracellularly through the use of viral vectors, such as retrovirus and adeno-associated virus vectors. (See, e.g., Miller, A. D. (1990) Blood 76:271; Ausubel, F. M. et al. (1995) Current Protocols in Molecular Biology, John Wiley & Sons, New York, N.Y.; Uckert, W. and W. Walther (1994) Pharmacol. Ther. 63(3):323-347.) Other gene delivery mechanisms include liposome-derived systems, artificial viral envelopes, and other systems known in the art. (See, e.g., Rossi, J. J. (1995) Br. Med. Bull. 51(1):217-225; Boado, R. J. et al. (1998) J. Pharm. Sci. 87(11):1308-1315; and Morris, M. C. et al. (1997) Nucleic Acids Res. 25(14):2730-2736.)

Expression

In order to express a biologically active MDDT, the nucleotide sequences encoding MDDT or fragments thereof may be inserted into an appropriate expression vector, i.e., a vector which contains the necessary elements for transcriptional and translational control of the inserted coding sequence in a suitable host. Methods which are well known to those skilled in the art may be used to construct expression vectors containing sequences encoding MDDT and appropriate transcriptional and translational control elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, and in vivo genetic recombination. (See, e.g., Sambrook, supra, Chapters 4, 8, 16, and 17; and Ausubel, supra, Chapters 9, 10, 13, and 16.)

A variety of expression vector/host systems may be utilized to contain and express sequences encoding MDDT. These include, but are not limited to, microorganisms such as bacteria transformed with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with yeast expression vectors; insect cell systems infected with viral expression vectors (e.g., baculovirus); plant cell systems transformed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV, or tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or animal (mammalian) cell systems. (See, e.g., Sambrook, supra; Ausubel, 1995, supra, Van Heeke, G. and S. M. Schuster (1989) J. Biol. Chem. 264:5503-5509; Bitter, G. A. et al. (1987) Methods Enzymol. 153:516-544; Scorer, C. A. et al. (1994) Bio/Technology 12:181-184; Engelhard, E. K. et al. (1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945; Takamatsu, N. (1987) EMBO J. 6:307-311; Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al. (1984) Science 224:838-843; Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105; The McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New York, N.Y., pp. 191-196; Logan, J. and T. Shenk (1984) Proc. Natl. Acad. Sci. USA 81:3655-3659; and Harrington, J. J. et al. (1997) Nat. Genet. 15:345-355.) Expression vectors derived from retroviruses, adenoviruses, or herpes or vaccinia viruses, or from various bacterial plasmids, may be used for delivery of nucleotide sequences to the targeted organ, tissue, or cell population. (See, e.g., Di Nicola, M. et al. (1998) Cancer Gen. Ther. 5(6):350-356; Yu, M. et al., (1993) Proc. Natl. Acad. Sci. USA 90(13):6340-6344; Buller, R. M. et al. (1985) Nature 317(6040):813-815; McGregor, D. P. et al. (1994) Mol. Immunol. 31(3):219-226; and Verma, I. M. and N. Somia (1997) Nature 389:239-242.) The invention is not limited by the host cell employed.

For long term production of recombinant proteins in mammalian systems, stable expression of MDDT in cell lines is preferred. For example, sequences encoding MDDT can be transformed into cell lines using expression vectors which may contain viral origins of replication and/or endogenous expression elements and a selectable marker gene on the same or on a separate vector. Any number of selection systems may be used to recover transformed cell lines. (See, e.g., Wigler, M. et al. (1977) Cell 11:223-232; Lowy, I. et al. (1980) Cell 22:817-823.; Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) J. Mol. Biol. 150:1-14; Hartman, S. C. and R. C. Mulligan (1988) Proc. Natl. Acad. Sci. USA 85:8047-8051; Rhodes, C. A. (1995) Methods Mol. Biol. 55:121-131.)

Therapeutic Uses of mddt

The mddt of the invention may be used for somatic or germline gene therapy. Gene therapy may be performed to (i) correct a genetic deficiency (e.g., in the cases of severe combined immunodeficiency (SCID)-X1 disease characterized by X-linked inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-672), severe combined immunodeficiency syndrome associated with an inherited adenosine deaminase (ADA) deficiency (Blaese, R. M. et al. (1995) Science 270:475-480; Bordignon, C. et al. (1995) Science 270:470-475), cystic fibrosis (Zabner, J. et al. (1993) Cell 75:207-216; Crystal, R. G. et al. (1995) Hum. Gene Therapy 6:643-666; Crystal, R. G. et al. (1995) Hum. Gene Therapy 6:667-703), thalassemias, familial hypercholesterolemia, and hemophilia resulting from Factor VIII or Factor IX deficiencies (Crystal, R. G. (1995) Science 270:404-410; Verma, I. M. and Somia, N. (1997) Nature 389:239-242)), (ii) express a conditionally lethal gene product (e.g., in the case of cancers which result from unregulated cell proliferation), or (iii) express a protein which affords protection against intracellular parasites (e.g., against human retroviruses, such as human immunodeficiency virus (HIV) (Baltimore, D. (1988) Nature 335:395-396; Poeschla, E. et al. (1996) Proc. Natl. Acad. Sci. USA. 93:11395-11399), hepatitis B or C virus (HBV, HCV); fungal parasites, such as Candida albicans and Paracoccidioides brasiliensis; and protozoan parasites such as Plasmodium falcidarum and Trypanosoma cruzi). In the case where a genetic deficiency in mddt expression or regulation causes disease, the expression of mddt from an appropriate population of transduced cells may alleviate the clinical manifestations caused by the genetic deficiency.

In a further embodiment of the invention, diseases or disorders caused by deficiencies in mddt are treated by constructing mammalian expression vectors comprising mddt and introducing these vectors by mechanical means into mddt-deficient cells. Mechanical transfer technologies for use with cells in vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) ballistic gold particle delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene transfer, and (v) the use of DNA transposons (Morgan, R. A. and Anderson, W. F. (1993) Annu. Rev. Biochem. 62:191-217; Ivics, Z. (1997) Cell 91:501-510; Boulay, J-L. and Récipon, H. (1998) Curr. Opin. Biotechnol. 9:445-450).

Expression vectors that may be effective for the expression of mddt include, but are not limited to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX vectors (Invitrogen, Carlsbad, Calif.), PCMV-SCRIPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla, Calif.), and PTET-OFF, PTET-ON, PTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto, Calif.). The mddt of the invention may be expressed using (i) a constitutively active promoter, (e.g., from cytomegalovirus (CMV), Rous sarcoma virus (RSV), SV40 virus, thymidine kinase (TK), or β-actin genes), (ii) an inducible promoter (e.g., the tetracycline-regulated promoter (Gossen, M. and Bujard, H. (1992) Proc. Natl. Acad. Sci. U.S.A. 89:5547-5551; Gossen, M. et al., (1995) Science 268:1766-1769; Rossi, F. M. V. and Blau, H. M. (1998) Curr. Opin. Biotechnol. 9:451-456), commercially available in the T-REX plasmid (Invitrogen); the ecdysone-inducible promoter (available in the plasmids PVGRXR and PIND; Invitrogen); the FK506/rapamycin inducible promoter; or the RU486/mifepristone inducible promoter (Rossi, F. M. V. and Blau, H. M. supra), or (iii) a tissue-specific promoter or the native promoter of the endogenous gene encoding MDDT from a normal individual.

Commercially available liposome transformation kits (e.g., the PERFECT LIPID TRANSFECTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver polynucleotides to target cells in culture and require minimal effort to optimize experimental parameters. In the alternative, transformation is performed using the calcium phosphate method (Graham, F. L. and Eb, A. J. (1973) Virology 52:456-467), or by electroporation (Neumann, E. et al. (1982) EMBO J. 1:841-845). The introduction of DNA to primary cells requires modification of these standardized mammalian transfection protocols.

In another embodiment of the invention, diseases or disorders caused by genetic defects with respect to mddt expression are treated by constructing a retrovirus vector consisting of (i) mddt under the control of an independent promoter or the retrovirus long terminal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-responsive element (RRE) along with additional retrovirus cis-acting RNA sequences and coding sequences required for efficient vector propagation. Retrovirus vectors (e.g., PFB and PFBNEO) are commercially available (Stratagene) and are based on published data (Riviere, I. et al. (1995) Proc. Natl. Acad. Sci. U.S.A. 92:6733-6737), incorporated by reference herein. The vector is propagated in an appropriate vector producing cell line (VPCL) that expresses an envelope gene with a tropism for receptors on the target cells or a promiscuous envelope protein such as VSVg (Armentano, D. et al. (1987) J. Virol. 61:1647-1650; Bender, M. A. et al. (1987) J. Virol. 61:1639-1646; Adam, M. A. and Miller, A. D. (1988) J. Virol. 62:3802-3806; Dull, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey, R. et al. (1998) J. Virol. 72:9873-9880). U.S. Pat. No. 5,910,434 to Rigg (“Method for obtaining retrovirus packaging cell lines producing high transducing efficiency retroviral supernatant”) discloses a method for obtaining retrovirus packaging cell lines and is hereby incorporated by reference. Propagation of retrovirus vectors, transduction of a population of cells (e.g., CD4⁺ T-cells), and the return of transduced cells to a patient are procedures well known to persons skilled in the art of gene therapy and have been well documented (Ranga, U. et al. (1997) J. Virol. 71:7020-7029; Bauer, G. et al. (1997) Blood 89:2259-2267; Bonyhadi, M. L. (1997) J. Virol. 71:4707-4716; Ranga, U. et al. (1998) Proc. Natl. Acad. Sci. U.S.A. 95:1201-1206; Su, L. (1997) Blood 89:2283-2290).

In the alternative, an adenovirus-based gene therapy delivery system is used to deliver mddt to cells which have one or more genetic abnormalities with respect to the expression of mddt. The construction and packaging of adenovirus-based vectors are well known to those with ordinary skill in the art. Replication defective adenovirus vectors have proven to be versatile for importing genes encoding immunoregulatory proteins into intact islets in the pancreas (Csete, M. E. et al. (1995) Transplantation 27:263-268). Potentially useful adenoviral vectors are described in U.S. Pat. No. 5,707,618 to Armentano (“Adenovirus vectors for gene therapy”), hereby incorporated by reference. For adenoviral vectors, see also Antinozzi, P. A. et al. (1999) Annu. Rev. Nutr. 19:511-544 and Verma, I. M. and Somia, N. (1997) Nature 18:389:239-242, both incorporated by reference herein.

In another alternative, a herpes-based, gene therapy delivery system is used to deliver mddt to target cells which have one or more genetic abnormalities with respect to the expression of mddt. The use of herpes simplex virus (HSV)-based vectors may be especially valuable for introducing mddt to cells of the central nervous system, for which HSV has a tropism. The construction and packaging of herpes-based vectors are well known to those with ordinary skill in the art. A replication-competent herpes simplex virus (HSV) type 1-based vector has been used to deliver a reporter gene to the eyes of primates (Liu, X. et al. (1999) Exp. Eye Res.169:385-395). The construction of a HSV-1 virus vector has also been disclosed in detail in U.S. Pat. No. 5,804,413 to DeLuca (“Herpes simplex virus strains for gene transfer”), which is hereby incorporated by reference. U.S. Pat. No. 5,804,413 teaches the use of recombinant HSV d92 which consists of a genome containing at least one exogenous gene to be transferred to a cell under the control of the appropriate promoter for purposes including human gene therapy. Also taught by this patent are the construction and use of recombinant HSV strains deleted for ICP4, ICP27 and ICP22. For HSV vectors, see also Goins, W. F. et al. 1999 J. Virol. 73:519-532 and Xu, H. et al., (1994) Dev. Biol. 163:152-161, hereby incorporated by reference. The manipulation of cloned herpesvirus sequences, the generation of recombinant virus following the transfection of multiple plasmids containing different segments of the large herpesvirus genomes, the growth and propagation of herpesvirus, and the infection of cells with herpesvirus are techniques well known to those of ordinary skill in the art.

In another alternative, an alphavirus (positive, single-stranded RNA virus) vector is used to deliver mddt to target cells. The biology of the prototypic alphavirus, Semliki Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based on the SFV genome (Garoff, H. and Li, K-J. (1998) Curr. Opin. Biotech. 9:464-469). During alphavirus RNA replication, a subgenomic RNA is generated that normally encodes the viral capsid proteins. This subgenomic RNA replicates to higher levels than the full-length genomic RNA, resulting in the overproduction of capsid proteins relative to the viral proteins with enzymatic activity (e.g., protease and polymerase). Similarly, inserting mddt into the alphavirus genome in place of the capsid-coding region results in the production of a large number of mddt RNAs and the synthesis of high levels of MDDT in vector transduced cells. While alphavirus infection is typically associated with cell lysis within a few days, the ability to establish a persistent infection in hamster normal kidney cells (BHK-21) with a variant of Sindbis virus (SIN) indicates that the lytic replication of alphaviruses can be altered to suit the needs of the gene therapy application (Dryga, S. A. et al. (1997) Virology 228:74-83). The wide host range of alphaviruses will allow the introduction of mddt into a variety of cell types. The specific transduction of a subset of cells in a population may require the sorting of cells prior to transduction. The methods of manipulating infectious cDNA clones of alphaviruses, performing alphavirus cDNA and RNA transfections, and performing alphavirus infections, are well known to those with ordinary skill in the art.

Antibodies

Anti-MDDT antibodies may be used to analyze protein expression levels. Such antibodies include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, and Fab fragments. For descriptions of and protocols of antibody technologies, see, e.g., Pound J. D. (1998) Immunochemical Protocols, Humana Press, Totowa, N.J.

The amino acid sequence encoded by the mddt of the Sequence Listing may be analyzed by appropriate software (e.g., LASERGENE NAVIGATOR software, DNASTAR) to determine regions of high immunogenicity. The optimal sequences for immunization are selected from the C-terminus, the N-terminus, and those intervening, hydrophilic regions of the polypeptide which are likely to be exposed to the external environment when the polypeptide is in its natural conformation. Analysis used to select appropriate epitopes is also described by Ausubel (1997, supra, Chapter 11.7). Peptides used for antibody induction do not need to have biological activity; however, they must be antigenic. Peptides used to induce specific antibodies may have an amino acid sequence consisting of at least five amino acids, preferably at least 10 amino acids, and most preferably at least 15 amino acids. A peptide which mimics an antigenic fragment of the natural polypeptide may be fused with another protein such as keyhole hemolimpet cyanin (KLH; Sigma, St. Louis, Mo.) for antibody production. A peptide encompassing an antigenic region may be expressed from an mddt, synthesized as described above, or purified from human cells.

Procedures well known in the art may be used for the production of antibodies. Various hosts including mice, goats, and rabbits, may be immunized by injection with a peptide. Depending on the host species, various adjuvants may be used to increase immunological response.

In one procedure, peptides about 15 residues in length may be synthesized using an ABI 431A peptide synthesizer (Applied Biosystems) using fmoc-chemistry and coupled to KLH (Sigma) by reaction with M-maleimidobenzoyl-N-hydroxysuccimide ester (Ausubel, 1995, supra). Rabbits are immunized with the peptide-KLH complex in complete Freund's adjuvant The resulting antisera are tested for antipeptide activity by binding the peptide to plastic, blocking with 1% bovine serum albumin (BSA), reacting with rabbit antisera, washing, and reacting with radioiodinated goat anti-rabbit IgG. Antisera with antipeptide activity are tested for anti-MDDT activity using protocols well known in the art, including ELISA, radioimmunoassay (RIA), and immunoblotting.

In another procedure, isolated and purified peptide may be used to immunize mice (about 100 μg of peptide) or rabbits (about 1 mg of peptide). Subsequently, the peptide is radioiodinated and used to screen the immunized animals' B-lymphocytes for production of antipeptide antibodies. Positive cells are then used to produce hybridomas using standard techniques. About 20 mg of peptide is sufficient for labeling and screening several thousand clones. Hybridomas of interest are detected by screening with radioiodinated peptide to identify those fusions producing peptide-specific monoclonal antibody. In a typical protocol, wells of a multi-well plate (FAST, Becton-Dickinson, Palo Alto, Calif.) are coated with affinity-purified, specific rabbit-anti-mouse (or suitable anti-species IgG) antibodies at 10 mg/ml. The coated wells are blocked with 1% BSA and washed and exposed to supernatants from hybridomas. After incubation, the wells are exposed to radiolabeled peptide at 1 mg/ml.

Clones producing antibodies bind a quantity of labeled peptide that is detectable above background. Such clones are expanded and subjected to 2 cycles of cloning. Cloned hybridomas are injected into pristane-treated mice to produce ascites, and monoclonal antibody is purified from the ascitic fluid by affinity chromatography on protein A (Amersham Pharmacia Biotech). Several procedures for the production of monoclonal antibodies, including in vitro production, are described in Pound (supra). Monoclonal antibodies with antipeptide activity are tested for anti-MDDT activity using protocols well known in the art, including ELISA, RIA, and immunoblotting.

Antibody fragments containing specific binding sites for an epitope may also be generated. For example, such fragments include, but are not limited to, the F(ab′)2 fragments produced by pepsin digestion of the antibody molecule, and the Fab fragments generated by reducing the disulfide bridges of the F(ab′)2 fragments. Alternatively, construction of Fab expression libraries in filamentous bacteriophage allows rapid and easy identification of monoclonal fragments with desired specificity (Pound, supra, Chaps. 45-47). Antibodies generated against polypeptide encoded by mddt can be used to purify and characterize full-length MDDT protein and its activity, binding partners, etc.

Assays Using Antibodies

Anti-MDDT antibodies may be used in assays to quantify the amount of MDDT found in a particular human cell. Such assays include methods utilizing the antibody and a label to detect expression level under normal or disease conditions. The peptides and antibodies of the invention may be used with or without modification or labeled by joining them, either covalently or noncovalently, with a reporter molecule.

Protocols for detecting and measuring protein expression using either polyclonal or monoclonal antibodies are well known in the art Examples include ELISA, RIA, and fluorescent activated cell sorting (FACS). Such immunoassays typically involve the formation of complexes between the MDDT and its specific antibody and the measurement of such complexes. These and other assays are described in Pound (supra).

Without further elaboration, it is believed that one skilled in the art can, using the preceding description, utilize the present invention to its fullest extent. The following preferred specific embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder of the disclosure in any way whatsoever.

The disclosures of all patents, applications, and publications mentioned above and below, in particular U.S. Ser. No. 60/185,213, U.S. Ser. No. 60/205,285, U.S. Ser. No. 60/205,232, U.S. Ser. No. 60/205,323, U.S. Ser. No. 60/205,287, U.S. Ser. No. 60/205,324, and U.S. Ser. No. 60/205,286, are hereby expressly incorporated by reference.

EXAMPLES

I. Construction of cDNA Libraries

RNA was purchased from CLONTECH Laboratories, Inc. (Palo Alto, Calif.) or isolated from various tissues. Some tissues were homogenized and lysed in guanidinium isothiocyanate, while others were homogenized and lysed in phenol or in a suitable mixture of denaturants, such as TRIZOL (Life Technologies), a monophasic solution of phenol and guanidine isothiocyanate. The resulting lysates were centrifuged over CsCl cushions or extracted with chloroform. RNA was precipitated with either isopropanol or sodium acetate and ethanol, or by other routine methods.

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA purity. In most cases, RNA was treated with DNase. For most libraries, poly(A+) RNA was isolated using oligo d(T)-coupled paramagnetic particles (Promega Corporation (Promega), Madison, Wis.), OLIGOTEX latex particles (QIAGEN, Inc. (QIAGEN), Valencia, Calif.), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, RNA was isolated directly from tissue lysates using other RNA isolation kits, e.g., the POLY(A)PURE mRNA purification kit (Ambion, Inc., Austin, Tex.).

In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNIZAP vector system (Stratagene Cloning Systems, Inc. (Stratagene), La Jolla, Calif.) or SUPERSCRIPT plasmid system (Life Technologies), using the recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 1997, supra, Chapters 5.1 through 6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA was digested with the appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected (300-1000 bp) using SEPHACRYL S1000, SEPHAROSE CL2B, or SEPHAROSE CL4B column chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. cDNAs were ligated into compatible restriction enzyme sites of the polylinker of a suitable plasmid, e.g., PBLUESCRIPT plasmid (Stratagene), PSPORT1 plasmid (Life Technologies), PCDNA2.1 plasmid (Invitrogen, Carlsbad, Calif.), PBK-CMV plasmid (Stratagene), or pINCY (Incyte Genomics, Palo Alto, Calif.), or derivatives thereof. Recombinant plasmids were transformed into competent E. coli cells including XL1-Blue, XL1-BlueMRF, or SOLR from Stratagene or DH5α, DH10B, or ElectroMAX DH10B from Life Technologies.

II. Isolation of cDNA Clones

Plasmids were recovered from host cells by in vivo excision using the UNIZAP vector system (Stratagene) or by cell lysis. Plasmids were purified using at least one of the following: the Magic or WIZARD Minipreps DNA purification system (Promega); the AGTC Miniprep purification kit (Edge BioSystems, Gaithersburg, Md.); and the QIAWELL 8, QIAWELL 8 Plus, and QIAWELL 8 Ultra plasmid purification systems or the R.E.A.L. PREP 96 plasmid purification kit (QIAGEN). Following precipitation, plasmids were resuspended in 0.1 ml of distilled water and stored, with or without lyophilization, at 4° C.

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a high-throughput format. (Rao, V. B. (1994) Anal. Biochem. 216:1-14.) Host cell lysis and thermal cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 384-well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically using PICOGREEN dye (Molecular Probes, Inc. (Molecular Probes), Eugene, Oreg.) and a FLUOROSKAN 11 fluorescence scanner (Labsystems Oy, Helsinki, Finland).

III. Sequencing and Analysis

cDNA sequencing reactions were processed using standard methods or high-throughput instrumentation such as the ABI CATALYST 800 thermal cycler (Applied Biosystems) or the PTC-200 thermal cycler (MJ Research) in conjunction with the HYDRA microdispenser (Robbins Scientific Corp., Sunnyvale, Calif.) or the MICROLAB 2200 liquid transfer system (Hamilton). cDNA sequencing reactions were prepared using reagents provided by Amersham Pharmacia Biotech or supplied in ABI sequencing kits such as the ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems). Electrophoretic separation of cDNA sequencing reactions and detection of labeled polynucleotides were carried out using the MEGABACE 1000 DNA sequencing system (Molecular Dynamics); the ABI PRISM 373 or 377 sequencing system (Applied Biosystems) in conjunction with standard ABI protocols and base calling software; or other sequence analysis systems known in the art. Reading frames within the cDNA sequences were identified using standard methods (reviewed in Ausubel, 1997, supra, Chapter 7.7). Some of the cDNA sequences were selected for extension using the techniques disclosed in Example VIII.

IV. Assembly and Analysis of Sequences

Component sequences from chromatograms were subject to PHRED analysis and assigned a quality score. The sequences having at least a required quality score were subject to various pre-processing editing pathways to eliminate, e.g., low quality 3′ ends, vector and linker sequences, polyA tails, Alu repeats, mitochondrial and ribosomal sequences, bacterial contamination sequences, and sequences smaller than 50 base pairs. In particular, low-information sequences and repetitive elements (e.g., dinucleotide repeats, Alu repeats, etc.) were replaced by “n's”, or masked, to prevent spurious matches.

Processed sequences were then subject to assembly procedures in which the sequences were assigned to gene bins (bins). Each sequence could only belong to one bin. Sequences in each gene bin were assembled to produce consensus sequences (templates). Subsequent new sequences were added to existing bins using BLASTn (v.1.4 WashU) and CROSSMATCH. Candidate pairs were identified as all BLAST hits having a quality score greater than or equal to 150. Alignments of at least 82% local identity were accepted into the bin. The component sequences from each bin were assembled using a version of PHRAP. Bins with several overlapping component sequences were assembled using DEEP PHAP. The orientation (sense or antisense) of each assembled template was determined based on the number and orientation of its component sequences. Template sequences as disclosed in the sequence listing correspond to sense strand sequences (the “forward” reading frames), to the best determination. The complementary (antisense) strands are inherently disclosed herein. The component sequences which were used to assemble each template consensus sequence are listed in Table 4, along with their positions along the template nucleotide sequences.

Bins were compared against each other and those having local similarity of at least 82% were combined and reassembled. Reassembled bins having templates of insufficient overlap (less than 95% local identity) were re-split. Assembled templates were also subject to analysis by STITCHER/EXON MAPPER algorithms which analyze the probabilities of the presence of splice variants, alternatively spliced exons, splice junctions, differential expression of alternative spliced genes across tissue types or disease states, etc. These resulting bins were subject to several rounds of the above assembly procedures.

Once gene bins were generated based upon sequence alignments, bins were clone joined based upon clone information. If the 5′ sequence of one clone was present in one bin and the 3′ sequence from the same clone was present in a different bin, it was likely that the two bins actually belonged together in a single bin. The resulting combined bins underwent assembly procedures to regenerate the consensus sequences.

The final assembled templates were subsequently annotated using the following procedure. Template sequences were analyzed using BLASTn (v2.0, NCBI) versus gbpri (GenBank version 120). “Hits” were defined as an exact match having from 95% local identity over 200 base pairs through 100% local identity over 100 base pairs, or a homolog match having an E-value, i.e. a probability score, of ≦1×10⁻⁸. The hits were subject to frameshift FASTx versus GENPEPT (GenBank version 120). (See Table 7). In this analysis, a homolog match was defined as having an E-value of ≦1×10⁻⁸. The assembly method used above was described in “System and Methods for Analyzing Biomolecular Sequences,” U.S. Ser. No. 09/276,534, filed Mar. 25, 1999, and the LIFESEQ Gold user manual (Incyte) both incorporated by reference herein.

Following assembly, template sequences were subjected to motif, BLAST, and functional analyses, and categorized in protein hierarchies using methods described in, e.g., “Database System Employing Protein Function Hierarchies for Viewing Biomolecular Sequence Data,” U.S. Ser. No. 08/812,290, filed Mar. 6, 1997; “Relational Database for Storing Biomolecule Information,” U.S. Ser. No. 08/947,845, filed Oct. 9, 1997; “Project-Based Full-Length Biomolecular Sequence Database,” U.S. Ser. No. 08/811,758, filed Mar. 6, 1997; and “Relational Database and System for Storing Information Relating to Biomolecular Sequences,” U.S. Ser. No. 09/034,807, filed Mar. 4, 1998, all of which are incorporated by reference herein.

The template sequences were further analyzed by translating each template in all three forward reading frames and searching each translation against the Pfam database of hidden Markov model-based protein families and domains using the HMMER software package (available to the public from Washington University School of Medicine, St Louis, Mo.). Regions of templates which, when translated, contain similarity to Pfam consensus sequences are reported in Table 2, along with descriptions of Pfam protein domains and families. Only those Pfam hits with an E-value of ≦1×10⁻³are reported. (See also World Wide Web site http://pfam.wustl.edu/ for detailed descriptions of Pfam protein domains and families.)

Additionally, the template sequences were translated in all three forward reading frames, and each translation was searched against hidden Markov models for signal peptides using the HMMER software package. Construction of hidden Markov models and their usage in sequence analysis has been described. (See, for example, Eddy, S. R. (1996) Curr. Opin. Str. Biol. 6:361-365.) Only those signal peptide hits with a cutoff score of 11 bits or greater are reported. A cutoff score of 11 bits or greater corresponds to at least about 91-94% true-positives in signal peptide prediction. Template sequences were also translated in all three forward reading frames, and each translation was searched against TMAP, a program that uses weight matrices to delineate transmembrane segments on protein sequences and determine orientation, with respect to the cell cytosol (Persson, B. and P. Argos (1994) J. Mol. Biol. 237:182-192; Persson, B. and P. Argos (1996) Protein Sci. 5:363-371.) Regions of templates which, when translated, contain similarity to signal peptide or transmembrane consensus sequences are reported in Table 3.

The results of HMMER analysis as reported in Tables 2 and 3 may support the results of BLAST analysis as reported in Table 1 or may suggest alternative or additional properties of template-encoded polypeptides not previously uncovered by BLAST or other analyses.

Template sequences are further analyzed using the bioinformatics tools listed in Table 7, or using sequence analysis software known in the art such as MACDNASIS PRO software (Hitachi Software Engineering, South San Francisco, Calif.) and LASERGENE software (DNASTAR). Template sequences may be further queried against public databases such as the GenBank rodent, mammalian, vertebrate, prokaryote, and eukaryote databases.

The template sequences were translated to derive the corresponding longest open reading frame as presented by the polypeptide sequences. Alternatively, a polypeptide of the invention may begin at any of the methionine residues within the full length translated polypeptide. Polypeptide sequences were subsequently analyzed by querying against the GenBank protein database (GENPEPT, (GenBank version 121)). Full length polynucleotide sequences are also analyzed using MACDNASIS PRO software (Hitachi Software Engineering, South San Francisco, Calif.) and LASERGENE software (DNASTAR). Polynucleotide and polypeptide sequence alignments are generated using default parameters specified by the CLUSTAL algorithm as incorporated into the MEGALIGN multisequence alignment program (DNASTAR), which also calculates the percent identity between aligned sequences.

Table 6 shows sequences with homology to the polypeptides of the invention as identified by BLAST analysis against the GenBank protein (GENPEPT) database. Column 1 shows the polypeptide sequence identification number (SEQ ID NO:) for the polypeptide segments of the invention. Column 2 shows the reading frame used in the translation of the polynucleotide sequences encoding the polypeptide segments. Column 3 shows the length of the translated polypeptide segments. Columns 4 and 5 show the start and stop nucleotide positions of the polynucleotide sequences encoding the polypeptide segments. Column 6 shows the GenBank identification number (GI Number) of the nearest GenBank homolog. Column 7 shows the probability score for the match between each polypeptide and its GenBank homolog. Column 8 shows the annotation of the GenBank homolog.

V. Analysis of Polynucleotide Expression

Northern analysis is a laboratory technique used to detect the presence of a transcript of a gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs from a particular cell type or tissue have been bound. (See, e.g., Sambrook, supra, ch. 7; Ausubel, 1995, supra, ch. 4 and 16.)

Analogous computer techniques applying BLAST were used to search for identical or related molecules in cDNA databases such as GenBank or LIFESEQ (Incyte Genomics). This analysis is much faster than multiple membrane-based hybridizations. In addition, the sensitivity of the computer search can be modified to determine whether any particular match is categorized as exact or similar. The basis of the search is the product score, which is defined as:
$\frac{BLAST Score \times Percent Identity}{5 \times minimum {length (Seq . 1), length (Seq . 2)}}$

The product score takes into account both the degree of similarity between two sequences and the length of the sequence match The product score is a normalized value between 0 and 100, and is calculated as follows: the BLAST score is multiplied by the percent nucleotide identity and the product is divided by (5 times the length of the shorter of the two sequences). The BLAST score is calculated by assigning a score of +5 for every base that matches in a high-scoring segment pair (HSP), and −4 for every mismatch. Two sequences may share more than one HSP (separated by gaps). If there is more than one HSP, then the pair with the highest BLAST score is used to calculate the product score. The product score represents a balance between fractional overlap and quality in a BLAST alignment. For example, a product score of 100 is produced only for 100% identity over the entire length of the shorter of the two sequences being compared. A product score of 70 is produced either by 100% identity and 70% overlap at one end, or by 88% identity and 100% overlap at the other. A product score of 50 is produced either by 100% identity and 50% overlap at one end, or 79% identity and 100% overlap.

VI. Tissue Distribution Profiling

A tissue distribution profile is determined for each template by compiling the cDNA library tissue classifications of its component cDNA sequences. Each component sequence, is derived from a cDNA library constructed from a human tissue. Each human tissue is classified into one of the following categories: cardiovascular system; connective tissue; digestive system; embryonic structures; endocrine system; exocrine glands; genitalia, female; genitalia, male; germ cells; hemic and immune system; liver; musculoskeletal system; nervous system; pancreas; respiratory system; sense organs; skin; stomatognathic system; unclassified/mixed; or urinary tract. Template sequences, component sequences, and cDNA library/tissue information are found in the LIFESEQ GOLD database (Incyte Genomics, Palo Alto, Calif.).

Table 5 shows the tissue distribution profile for the templates of the invention. For each template, the three most frequently observed tissue categories are shown in column 3, along with the percentage of component sequences belonging to each category. Only tissue categories with percentage values of ≧10% are shown. A tissue distribution of “widely distributed” in column 3 indicates percentage values of <10% in all tissue categories.

VII. Transcript Image Analysis

Transcript images are generated as described in Seilhamer et al., “Comparative Gene Transcript Analysis,” U.S. Pat. No. 5,840,484, incorporated herein by reference.

VIII. Extension of Polynucleotide Sequences and Isolation of a Full-length cDNA

Oligonucleotide primers designed using an mddt of the Sequence Listing are used to extend the nucleic acid sequence. One primer is synthesized to initiate 5′ extension of the template, and the other primer, to initiate 3′ extension of the template. The initial primers may be designed using OLIGO 4.06 software (National Biosciences, Inc. (National Biosciences), Plymouth, Minn.), or another appropriate program, to be about 22 to 30 nucleotides in length, to have a GC content of about 50% or more, and to anneal to the target sequence at temperatures of about 68° C. to about 72° C. Any stretch of nucleotides which would result in hairpin structures and primer-primer dimerizations are avoided. Selected human cDNA libraries are used to extend the sequence. If more than one extension is necessary or desired, additional or nested sets of primers are designed.

High fidelity amplification is obtained by PCR using methods well known in the art. PCR is performed in 96-well plates using the PTC-200 thermal cycler (MJ Research). The reaction mix contains DNA template, 200 nmol of each primer, reaction buffer containing Me²⁺, (NH₄)₂SO₄, and β-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme (Life Technologies), and Pfu DNA polymerase (Stratagene), with the following parameters for primer pair PCI A and PCI B: Step 1: 94° C., 3 min; Step 2: 94° C., 15 sec; Step 3: 60° C., 1 min; Step 4: 68° C., 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68° C., 5 min; Step 7: storage at 4° C. In the alternative, the parameters for primer pair T7 and SK+ are as follows: Step 1: 94° C., 3 min; Step 2: text missing or illegible when filed to determine which reactions are successful in extending the sequence.

The extended nucleotides are desalted and concentrated, transferred to 384-well plates, digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison, Wis.), and sonicated or sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For shotgun sequencing, the digested nucleotides are separated on low concentration (0.6 to 0.8%) agarose gels, fragments are excised, and agar digested with AGAR ACE (Promega). Extended clones are religated using T4 ligase (New England Biolabs, Inc., Beverly, Mass.) into pUC 18 vector (Amersham Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction site overhangs, and transfected into competent E. coli cells. Transformed cells are selected on antibiotic-containing media, individual colonies are picked and cultured overnight at 37° C. in 384-well plates in LB/2× carbenicillin liquid media.

The cells are lysed, and DNA is amplified by PCR using Taq DNA polymerase (Amersham Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following parameters: Step 1: 94° C., 3 min; Step 2: 94° C., 15 sec; Step 3: 60° C., 1 min; Step 4: 72° C., 2 min; Step 5: steps 2, 3, and 4 repeated 29 times; Step 6: 72° C., 5 min; Step 7: storage at 4° C. DNA is quantified by PICOGREEN reagent (Molecular Probes) as described above. Samples with low DNA recoveries are reamplified using the same conditions as described above. Samples are diluted with 20% dimethysulfoxide (1:2, v/v), and sequenced using DYENAMIC energy transfer sequencing primers and the DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems).

In like manner, the mddt is used to obtain regulatory sequences (promoters, introns, and enhancers) using the procedure above, oligonucleotides designed for such extension, and an appropriate genomic library.

IX. Labeling of Probes and Southern Hybridization Analyses

Hybridization probes derived from the mddt of the Sequence Listing are employed for screening cDNAs, mRNAs, or genomic DNA. The labeling of probe nucleotides between 100 and 1000 nucleotides in length is specifically described, but essentially the same procedure may be used with larger cDNA fragments. Probe sequences are labeled at room temperature for 30 minutes using a T4 polynucleotide kinase, γ²P-ATP, and 0.5× One-Phor-All Plus (Amersham Pharmacia Biotech) buffer and purified using a ProbeQuant G-50 Microcolumn (Amersham Pharmacia Biotech). The probe mixture is diluted to 10⁷dpm/μg/ml hybridization buffer and used in a typical membrane-based hybridization analysis.

The DNA is digested with a restriction endonuclease such as Eco RV and is electrophoresed through a 0.7% agarose gel. The DNA fragments are transferred from the agarose to nylon membrane (NYTRAN Plus, Schleicher & Schuell, Inc., Keene, N.H.) using procedures specified by the manufacturer of the membrane. Prehybridization is carried out for three or more hours at 68° C., and hybridization is carried out overnight at 68° C. To remove non-specific signals, blots are sequentially washed at room temperature under increasingly stringent conditions, up to 0.1× saline sodium citrate (SSC) and 0.5% sodium dodecyl sulfate. After the blots are placed in a PHOSPHORIMAGER cassette (Molecular Dynamics) or are exposed to autoradiography film, hybridization patterns of standard and experimental lanes are compared. Essentially the same procedure is employed when screening RNA.

X. Chromosome Mapping of mddt

The cDNA sequences which were used to assemble SEQ ID NO:1-45 are compared with sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other implementations of the Smith-Waterman algorithm. Sequences from these databases that match SEQ ID NO:1-45 are assembled into clusters of contiguous and overlapping sequences using assembly algorithms such as PHRAP (Table 7). Radiation hybrid and genetic mapping data available from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for Genome Research (WIGR), and Généthon are used to determine if any of the clustered sequences have been previously mapped. Inclusion of a mapped sequence in a cluster will result in the assignment of all sequences of that cluster, including its particular SEQ ID NO:, to that map location. The genetic map locations of SEQ ID NO:1-45 are described as ranges, or intervals, of human chromosomes. The map position of an interval, in centiMorgans, is measured relative to the terminus of the chromosome's p-arm. (The centiMorgan (cM) is a unit of measurement based on recombination frequencies between chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase (Mb) of DNA in humans, although this can vary widely due to hot and cold spots of recombination.) The cM distances are based on genetic markers mapped by Généthon which provide boundaries for radiation hybrid markers whose sequences were included in each of the clusters.

XI. Microarray Analysis

Probe Preparation from Tissue or Cell Samples

Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and polyA⁺ RNA is purified using the oligo (dT) cellulose method. Each polyA⁺ RNA sample is reverse transcribed using MMLV reverse-transcriptase, 0.05 pg/l oligo-T primer (21 mer), 1× first strand buffer, 0.03 units/μl RNase inhibitor, 500 μM DATP, 500 μM dGTP, 500 μM dTTP, 40 μM dCTP, 40 μM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Pharmacia Biotech). The reverse transcription reaction is performed in a 25 ml volume containing 200 ng polyA⁺ RNA with GEMBRIGHT kits (Incyte). Specific control polyA⁺ RNAs are synthesized by in vitro transcription from non-coding yeast genomic DNA (W. Lei, unpublished). As quantitative controls, the control mRNAs at 0.002 ng, 0.02 ng, 0.2 ng, and 2 ng are diluted into reverse transcription reaction at ratios of 1:100,000, 1:10,000, 1:1000, 1:100 (w/w) to sample mRNA respectively. The control mRNAs are diluted into reverse transcription reaction at ratios of 1:3, 3:1, 1:10, 10:1, 1:25, 25:1 (w/w) to sample/mRNA differential expression patterns. After incubation at 37° C. for 2 hr, each reaction sample (one with Cy3 and another with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium hydroxide and incubated for 20 minutes at 85° C. to the stop the reaction and degrade the RNA. Probes are purified using two successive CHROMA SPIN 30 gel filtration spin columns (CLONTECH Laboratories, Inc. (CLONTECH), Palo Alto, Calif.) and after combining, both reaction samples are ethanol precipitated using 1 ml of glycogen (1 mg/ml), 60 ml sodium acetate, and 300 ml of 100% ethanol. The probe is then dried to completion using a SpeedVAC (Savant Instruments Inc., Holbrook, N.Y.) and resuspended in 14 μl 5×SSC/0.2% SDS.

Microarray Preparation

Sequences of the present invention are used to generate array elements. Each array element is amplified from bacterial cells containing vectors with cloned cDNA inserts. PCR amplification uses primers complementary to the vector sequences flanking the cDNA insert Array elements are amplified in thirty cycles of PCR from an initial quantity of 1-2 ng to a final quantity greater than 5 μg. Amplified array elements are then purified using SEPHACRYL-400 (Amersham Pharmacia Biotech).

Purified array elements are immobilized on polymer-coated glass slides. Glass microscope slides (Corning) are cleaned by ultrasound in 0.1% SDS and acetone, with extensive distilled water washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR Scientific Products Corporation (VWR), West Chester, Pa.), washed extensively in distilled water, and coated with 0.05% aminopropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a 110° C. oven.

Array elements are applied to the coated glass substrate using a procedure described in U.S. Pat. No. 5,807,522, incorporated herein by reference. 1 μl of the array element DNA, at an average concentration of 100 ng/l, is loaded into the open capillary printing element by a high-speed robotic apparatus. The apparatus then deposits about 5 nl of array element sample per slide.

Microarrays are UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagene). Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate buffered saline (PBS) (Tropix, Inc., Bedford, Mass.) for 30 minutes at 60° C. followed by washes in 0.2% SDS and distilled water as before.

Hybridization

Hybridization reactions contain 9 μl of probe mixture consisting of 0.2 μg each of Cy3 and Cy5 labeled cDNA synthesis products in 5×SSC, 0.2% SDS hybridization buffer. The probe mixture is heated to 65° C. for 5 minutes and is aliquoted onto the microarray surface and covered with an 1.8 cm²coverslip. The arrays are transferred to a waterproof chamber having a cavity just slightly larger than a microscope slide. The chamber is kept at 100% humidity internally by the addition of 140 μl of 5×SSC in a corner of the chamber. The chamber containing the arrays is incubated for about 6.5 hours at 60° C. The arrays are washed for 10 min at 45° C. in a first wash buffer (1×SSC, 0.1% SDS), three times for 10 minutes each at 45° C. in a second wash buffer (0.1×SSC), and dried.

Detection

Reporter-labeled hybridization complexes are detected with a microscope equipped with an Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara, Calif.) capable of generating spectral lines at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light is focused on the array using a 20× microscope objective (Nikon, Inc., Melville, N.Y.). The slide containing the array is placed on a computer-controlled X-Y stage on the microscope and raster-scanned past the objective. The 1.8 cm×1.8 cm array used in the present example is scanned with a resolution of 20 micrometers.

In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially. Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, Hamamatsu Photonics Systems, Bridgewater, N.J.) corresponding to the two fluorophores. Appropriate filters positioned between the array and the photomultiplier tubes are used to filter the signals. The emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. Each array is typically scanned twice, one scan per fluorophore using the appropriate filters at the laser source, although the apparatus is capable of recording the spectra from both fluorophores simultaneously.

The sensitivity of the scans is typically calibrated using the signal intensity generated by a cDNA control species added to the probe mix at a known concentration A specific location on the array contains a complementary DNA sequence, allowing the intensity of the signal at that location to be correlated with a weight ratio of hybridizing species of 1:100,000. When two probes from different sources (e.g., representing test and control cells), each labeled with a different fluorophore, are hybridized to a single array for the purpose of identifying genes that are differentially expressed, the calibration is done by labeling samples of the calibrating cDNA with the two fluorophores and adding identical amounts of each to the hybridization mixture.

The output of the photomultiplier tube is digitized using a 12-bit RTI-835H analog-to-digital (A/D) conversion board (Analog Devices, Inc., Norwood, Mass.) installed in an IBM-compatible PC computer. The digitized data are displayed as an image where the signal intensity is mapped using a linear 20-color transformation to a pseudocolor scale ranging from blue (low signal) to red (high signal). The data is also analyzed quantitatively. Where two different fluorophores are excited and measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping emission spectra) between the fluorophores using each fluorophore's emission spectrum.

A grid is superimposed over the fluorescence signal image such that the signal from each spot is centered in each element of the grid. The fluorescence signal within each element is then integrated to obtain a numerical value corresponding to the average intensity of the signal. The software used for signal analysis is the GEMTOOLS gene expression analysis program (Incyte).

XII. Complementary Nucleic Acids

Sequences complementary to the mddt are used to detect, decrease, or inhibit expression of the naturally occurring nucleotide. The use of oligonucleotides comprising from about 15 to 30 base pairs is typical in the art. However, smaller or larger sequence fragments can also be used. Appropriate oligonucleotides are designed from the mddt using OLIGO 4.06 software (National Biosciences) or other appropriate programs and are synthesized using methods standard in the art or ordered from a commercial supplier. To inhibit transcription, a complementary oligonucleotide is designed from the most unique 5′ sequence and used to prevent transcription factor binding to the promoter sequence. To inhibit translation, a complementary oligonucleotide is designed to prevent ribosomal binding and processing of the transcript.

XIII. Expression of MDDT

Expression and purification of MDDT is accomplished using bacterial or virus-based expression systems. For expression of MDDT in bacteria, cDNA is subcloned into an appropriate vector containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA transcription. Examples of such promoters include, but are not limited to, the trp-lac (tac) hybrid promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac operator regulatory element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3). Antibiotic resistant bacteria express MDDT upon induction with isopropyl beta-D-thiogalactopyranoside (IPTG). Expression of MDDT in eukaryotic cells is achieved by infecting insect or mammalian cell lines with recombinant Autographica califonica nuclear polyhedrosis virus (AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of baculovirus is replaced with cDNA encoding MDDT by either homologous recombination or bacterial-mediated transposition involving transfer plasmid intermediates. Viral infectivity is maintained and the strong polyhedrin promoter drives high levels of cDNA transcription. Recombinant baculovirus is used to infect Spodoptera frugiperda (Sf9) insect cells in most cases, or human hepatocytes, in some cases. Infection of the latter requires additional genetic modifications to baculovirus. (See e.g., Engelhard, supra; and Sandig, supra.)

In most expression systems, MDDT is synthesized as a fusion protein with, e.g., glutathione S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26-kilodalton enzyme from Schistosoma japonicum, enables the purification of fusion proteins on immobilized glutathione under conditions that maintain protein activity and antigenicity (Amersham Pharmacia Biotech). Following purification, the GST moiety can be proteolytically cleaved from MDDT at specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity purification using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak Company, Rochester, N.Y.). 6-His, a stretch of six consecutive histidine residues, enables purification on metal-chelate resins (QIAGEN). Methods for protein expression and purification are discussed in Ausubel (1995, supra, Chapters 10 and 16). Purified MDDT obtained by these methods can be used directly in the following activity assay.

XIV. Demonstration of MDDT Activity

MDDT, or biologically active fragments thereof, are labeled with ¹²⁵I Bolton-Hunter reagent. (See, e.g., Bolton, A. E. and W. M. Hunter (1973) Biochem. J. 133:529-539.) Candidate molecules previously arrayed in the wells of a multi-well plate are incubated with the labeled MDDT, washed, and any wells with labeled MDDT complex are assayed. Data obtained using different concentrations of MDDT are used to calculate values for the number, affinity, and association of MDDT with the candidate molecules.

Alternatively, molecules interacting with MDDT are analyzed using the yeast two-hybrid system as described in Fields, S. and O. Song (1989) Nature 340:245-246, or using commercially available kits based on the two-hybrid system, such as the MATCHMAKER system (CLONTECH).

MDDT may also be used in the PATHCALLING process (CuraGen Corp., New Haven, Conn.) which employs the yeast two-hybrid system in a high-throughput manner to determine all interactions between the proteins encoded by two large libraries of genes (Nandabalan, K. et al. (2000) U.S. Pat. No. 6,057,101).

XV. Functional Assays

MDDT function is assessed by expressing mddt at physiologically elevated levels in mammalian cell culture systems. cDNA is subcloned into a mammalian expression vector containing a strong promoter that drives high levels of cDNA expression Vectors of choice include pCMV SPORT (Life Technologies) and pCR3.1 (Invitrogen Corporation, Carlsbad, Calif.), both of which contain the cytomegalovirus promoter. 5-10 μg of recombinant vector are transiently transfected into a human cell line, preferably of endothelial or hematopoietic origin, using either liposome formulations or electroporation. 1-2 μg of an additional plasmid containing sequences encoding a marker protein are co-transfected.

Expression of a marker protein provides a means to distinguish transfected cells from nontransfected cells and is a reliable predictor of cDNA expression from the recombinant vector. Marker proteins of choice include, e.g., Green Fluorescent Protein (GFP; CLONTECH), CD64, or a CD64-GFP fusion protein. Flow cytometry (FCM), an automated laser optics-based technique, is used to identify transfected cells expressing GFP or CD64-GFP and to evaluate the apoptotic state of the cells and other cellular properties.

FCM detects and quantifies the uptake of fluorescent molecules that diagnose events preceding or coincident with cell death These events include changes in nuclear DNA content as measured by staining of DNA with propidium iodide; changes in cell size and granularity as measured by forward light scatter and 90 degree side light scatter; down-regulation of DNA synthesis as measured by decrease in bromodeoxyuridine uptake; alterations in expression of cell surface and intracellular proteins as measured by reactivity with specific antibodies; and alterations in plasma membrane composition as measured by the binding of fluorescein-conjugated Annexin V protein to the cell surface. Methods in flow cytometry are discussed in Ormerod, M. G. (1994) Flow Cytometry, Oxford, New York, N.Y.

The influence of MDDT on gene expression can be assessed using highly purified populations of cells transfected with sequences encoding MDDT and either CD64 or CD64-GFP. CD64 and CD64-GFP are expressed on the surface of transfected cells and bind to conserved regions of human immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected cells using magnetic beads coated with either ban IgG or antibody against CD64 (DYNAL, Inc., Lake Success N.Y.). mRNA can be purified from the cells using methods well known by those of skill in the art Expression of mRNA encoding MDDT and other genes of interest can be analyzed by northern analysis or microarray techniques.

XVI. Production of Antibodies

MDDT substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., Harrington, M. G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to immunize rabbits and to produce antibodies using standard protocols.

Alternatively, the MDDT amino acid sequence is analyzed using LASERGENE software (DNASTAR) to determine regions of high immunogenicity, and a corresponding peptide is synthesized and used to raise antibodies by means known to those of skill in the art Methods for selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are well described in the art. (See, e.g., Ausubel, 1995, supra, Chapter 11.)

Typically, peptides 15 residues in length are synthesized using an ABI 431A peptide synthesizer (Applied Biosystems) using fmoc-chemistry and coupled to KLH (Sigma) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to increase immunogenicity. (See, e.g., Ausubel, supra.) Rabbits are immunized with the peptide-KLH complex in complete Freund's adjuvant. Resulting antisera are tested for antipeptide activity by, for example, binding the peptide to plastic, blocking with 1% BSA, reacting with rabbit antisera, washing, and reacting with radii iodinated goat anti-rabbit IgG. Antisera with antipeptide activity are tested for anti-MDDT, activity using protocols well known in the art, including ELISA, RIA, and immunoblotting.

XVII. Purification of Naturally Occurring MDDT Using Specific Antibodies

Naturally occurring or recombinant MDDT is substantially purified by immunoaffinity chromatography using antibodies specific for MDDT. An immunoaffinity column is constructed by covalently coupling anti-MDDT antibody to an activated chromatographic resin, such as CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is blocked and washed according to the manufacturer's instructions.

Media containing MDDT are passed over the immunoaffinity column, and the column is washed under conditions that allow the preferential absorbance of MDDT (e.g., high ionic strength buffers in the presence of detergent). The column is eluted under conditions that disrupt antibody/MDDT binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such as urea or thiocyanate ion), and MDDT is collected.

All publications and patents mentioned in the above specification are herein incorporated by reference. Various modifications and variations of the described method and system of the invention will be apparent to those skilled in the art without departing from the scope and spirit of the invention Although the invention has been described in connection with specific preferred embodiments, it should be understood that the invention as claimed should not be unduly limited to such specific embodiments. Indeed, various modifications of the above-described modes for carrying out the invention which are obvious to those skilled in the field of molecular biology or related fields are intended to be within the scope of the following claims.

TABLE 1SEQ IDProbability NO:Template IDGI NumberScoreAnnotation1LG:977683.1:2000FEB18g107647780phosphoinositol 3-phosphate-binding protein-2 (Homo2LG:893050.1:2000FEB18g66340252.00E−81KIAA0379 protein (Homo sapiens)3LG:980153.1:2000FEB18g72639900dJ93K22.1 (novel protein (contains DKFZP564B116)) (Homosapiens)4LG:350398.1:2000FEB18g38821753.00E−10KIAA0727 protein (Homo sapiens)5LG:475551.1:2000FEB18g8610290SH3 domain binding protein (Mus musculus)6LG:481407.2:2000FEB18g61195461.00E−41hypothetical protein; 114721-113936 (Arabidopsis thaliana)7LI:443580.1:2000FEB01g45895663.00E−34KIAA0961 protein (Homo sapiens)8LI:803015.1:2000FEB01g52625602.00E−35hypothetical protein (Homo sapiens)9LG:027410.3:2000MAY19g104382671.00E−65unnamed protein product (Homo sapiens)10LG:171377.1:2000MAY19g30777031.00E−107mitsugumin29 (Oryctolagus cuniculus)11LG:352559.1:2000MAY19g72432432.00E−43KIAA1431 protein (Homo sapiens)12LG:247384.1:2000MAY19g99450101.00E−118RING-finger protein MURF (Mus musculus)13LG:403872.1:2000MAY19g70203030unnamed protein product (Homo sapiens)14LG:1135213.1:2000MAY19g66926072.00E−65MGA protein (Mus musculus)15LG:474284.2:2000MAY19g14880472.00E−30RING finger protein (Xenopus laevis)16LG:342147.1:2000MAY19g24775113.00E−41Homo sapiens p20 protein (pir B53814)17LG:1097300.1:2000MAY19g20785311.00E−70Mlark (Mus musculus)18LG:444850.9:2000MAY19g1990000interferon-gamma inducible protein (Mus musculus)19LG:402231.6:2000MAY19g70207376.00E−77unnamed protein product (Homo sapiens)20LG:1076157.1:2000MAY19g52625603.00E−65hypothetical protein (Homo sapiens)21LG:1083142.1:2000MAY19g45895663.00E−23KIAA0961 protein (Homo sapiens)22LG:1083264.1:2000MAY19g100472972.00E−25KIAA1611 protein (Homo sapiens)23LG:350793.2:2000MAY19g72429730KIAA1309 protein (Homo sapiens)24LG:408751.3:2000MAY19g88860251.00E−134collapsin response mediator protein-5 (Homo sapiens)25LI:336120.1:2000MAY01g18640851.00E−160glypican-5 (Homo sapiens)26LI:234104.2:2000MAY01g15185051.00E−114G-protein coupled inwardly rectifying K+ channel (Musmusculus)27LI:450887.1:2000MAY01g76299943.00E−3460S RIBOSOMAL PROTEIN L36 homolog (Arabidopsisthaliana)28LI:119992.3:2000MAY01g72430890KIAA1354 protein (Homo sapiens)29LI:197241.2:2000MAY01g72639900dJ93K22.1 (novel protein (contains DKFZP564B116)) (Homosapiens)30LI:406860.20:2000MAY01g104359193.00E−57unnamed protein product (Homo sapiens)31LI:142384.1:2000MAY01g104362901.00E−131unnamed protein product (Homo sapiens)32LI:895427.1:2000MAY01g31842641.00E−106F02569_2 (Homo sapiens)33LI:757439.1:2000MAY01g76703621.00E−116unnamed protein product (Mus musculus)34LI:1144066.1:2000MAY01g38822817.00E−79KIAA0780 protein (Homo sapiens)35LI:243660.4:2000MAY01g42105010BC85722_1 (Homo sapiens)36LI:334386.1:2000MAY01g63306170KIAA1223 protein (Homo sapiens)37LI:347572.1:2000MAY01g98024331.00E−101ACE-related carboxypeptidase ACE2 (Homo sapiens)38LI:817314.1:2000MAY01g58026150transient receptor potential 4 (Homo sapiens)39LI:000290.1:2000MAY01g72429772.00E−51KIAA1311 protein (Homo sapiens)40LI:023518.3:2000MAY01g7367272.00E−7432 kd accessory protein (Bos taurus)41LI:1084246.1:2000MAY01g54570310protocadherin beta 12 (Homo sapiens)42LI:1165828.1:2000MAY01g54570190protocadherin alpha 7 short form protein (Homo sapiens)43LI:007302.1:2000MAY01g50062500TLR6 (Mus musculus)44LI:236386.4:2000MAY01g61646281.00E−63SH3 and PX domain-containing protein SH3PX1 (Homosapiens)45LI:252904.5:2000MAY01g70229712.00E−62unnamed protein product (Homo sapiens)

TABLE 2

SEQ ID NO:
Template ID
Start
Stop
Frame
Pfam Hit
Pfam Description
E-value

1
LG:977683.1:2000FEB18
540
695
forward 3
PH
PH domain
6.70E−11

1
LG:977683.1:2000FEB18
204
293
forward 3
WW
WW domain
7.50E−05

2
LG:893050.1:2000FEB18
211
309
forward 1
ank
Ank repeat
1.60E−05

3
LG:980153.1:2000FEB18
754
852
forward 1
ank
Ank repeat
8.00E−04

3
LG:980153.1:2000FEB18
2131
2565
forward 1
BTB
BTB/POZ domain
6.90E−07

3
LG:980153.1:2000FEB18
1084
1239
forward 1
RCC1
Regulator of chromosome condensation
3.70E−04

4
LG:350398.1:2000FEB18
7
123
forward 1
myosin_head
Myosin head (motor domain)
2.60E−16

5
LG:475551.1:2000FEB18
702
1157
forward 3
RhoGAP
RhoGAP domain
8.10E−71

6
LG:481407.2:2000FEB18
225
440
forward 3
rrm
RNA recognition motif. (a.k.a. RRM, RBC
1.50E−22

6
LG:481407.2:2000FEB18
504
557
forward 3
zf-CCHC
Zinc knuckle
7.00E−04

7
LI:443580.1:2000FEB01
262
450
forward 1
KRAB
KRAB box
1.60E−41

7
LI:443580.1:2000FEB01
625
693
forward 1
zf-C2H2
Zinc finger, C2H2 type
2.20E−06

8
LI:803015.1:2000FEB01
159
299
forward 3
KRAB
KRAB box
2.30E−17

9
LG:027410.3:2000MAY19
177
290
forward 3
WD40
WD domain, G-beta repeat
6.20E−06

10
LG:171377.1:2000MAY19
300
848
forward 3
Synaptophysin
Synaptophysin/synaptoporin
2.10E−20

11
LG:352559.1:2000MAY19
125
313
forward 2
KRAB
KRAB box
1.60E−41

12
LG:247384.1:2000MAY19
182
256
forward 2
zf-C3HC4
Zinc finger, C3HC4 type (RING finger)
1.80E−06

13
LG:403872.1:2000MAY19
717
1187
forward 3
PAP2
PAP2 superfamily
1.80E−09

14
LG:1135213.1:2000MAY19
340
531
forward 1
T-box
T-box
8.80E−27

15
LG:474284.2:2000MAY19
73
195
forward 1
zf-C3HC4
Zinc finger, C3HC4 type (RING finger)
1.20E−13

16
LG:342147.1:2000MAY19
290
469
forward 2
crystallin
Alpha crystallin A chain, N terminal
3.10E−09

16
LG:342147.1:2000MAY19
452
628
forward 2
HSP20
Hsp20/alpha crystallin family
7.20E−12

17
LG:1097300.1:2000MAY19
59
250
forward 2
rrm
RNA recognition motif. (a.k.a. RRM, RBC
4.10E−16

18
LG:444850.9:2000MAY19
190
1290
forward 1
GBP
Guanylate-binding protein
4.20E−247

19
LG:402231.6:2000MAY19
258
380
forward 3
zf-C3HC4
Zinc finger, C3HC4 type (RING finger)
4.30E−05

20
LG:1076157.1:2000MAY19
180
320
forward 3
KRAB
KRAB box
3.40E−18

21
LG:1083142.1:2000MAY19
129
320
forward 3
KRAB
KRAB box
2.00E−42

22
LG:1083264.1:2000MAY19
440
628
forward 2
KRAB
KRAB box
2.30E−33

23
LG:350793.2:2000MAY19
570
722
forward 3
Kelch
Kelch motif
2.70E−11

24
LG:408751.3:2000MAY19
194
1051
forward 2
Dihydrooratase
Dihydroorotase-like
5.50E−07

25
LI:336120.1:2000MAY01
232
1398
forward 1
Glypican
Glypican
9.90E−141

25
LI:336120.1:2000MAY01
1476
1907
forward 3
Glypican
Glypican
8.60E−70

25
LI:336120.1:2000MAY01
503
775
forward 2
Glypican
Glypican
3.50E−46

26
LI:234104.2:2000MAY01
2517
3002
forward 3
IRK
Inward rectifier potassium channel
8.70E−111

26
LI:234104.2:2000MAY01
2965
3507
forward 1
IRK
Inward rectifier potassium channel
9.20E−111

27
LI:450887.1:2000MAY01
48
344
forward 3
Ribosomal_L36e
Ribosomal protein L36e
6.90E−41

28
LI:119992.3:2000MAY01
788
925
forward 2
Kelch
Kelch motif
1.50E−09

29
LI:197241.2:2000MAY01
1243
1407
forward 1
RCC1
Regulator of chromosome condensation
1.60E−04

30
LI:406860.20:2000MAY01
228
407
forward 3
ig
Immunoglobulin domain
1.90E−08

31
LI:142384.1:2000MAY01
318
791
forward 3
UQ_con
Ubiquitin-conjugating enzyme
1.40E−16

32
LI:895427.1:2000MAY01
437
907
forward 2
RhoGAP
RhoGAP domain
1.20E−40

33
LI:757439.1:2000MAY01
1040
1162
forward 2
zf-C3HC4
Zinc finger, C3HC4 type (RING finger)
7.20E−10

34
LI:1144066.1:2000MAY01
222
365
forward 3
jmjN
jmjN domain
2.80E−23

35
LI:243660.4:2000MAY01
316
522
forward 1
HMG_box
HMG (high mobility group) box
8.60E−17

36
LI:334386.1:2000MAY01
272
370
forward 2
ank
Ank repeat
4.90E−08

36
LI:334386.1:2000MAY01
735
833
forward 3
ank
Ank repeat
4.50E−05

37
LI:347572.1:2000MAY01
130
1878
forward 1
Peptidase_M2
Angiotensin-converting enzyme
2.60E−05

38
LI:817314.1:2000MAY01
934
2034
forward 1
Trans_recep
Transient receptor
6.50E−260

38
LI:817314.1:2000MAY01
1929
2321
forward 3
Trans_recep
Transient receptor
2.20E−81

39
LI:000290.1:2000MAY01
960
1040
forward 3
zf-CCCH
Zinc finger C-x8-C-x5-C-x3-H type (and text missing or illegible when filed

7.70E−04

40
LI:023518.3:2000MAY01
195
845
forward 3
vATP-
ATP synthase (C/AC39) subunit
5.30E−38

synt_AC39

41
LI:1084246.1:2000MAY01
1443
1733
forward 3
cadherin
Cadherin domain
2.30E−20

41
LI:1084246.1:2000MAY01
875
1150
forward 2
cadherin
Cadherin domain
6.60E−17

42
LI:1165828.1:2000MAY01
1421
1705
forward 2
cadherin
Cadherin domain
1.30E−19

43
LI:007302.1:2000MAY01
1646
1810
forward 2
LRRCT
Leucine rich repeat C-terminal domain
2.60E−13

43
LI:007302.1:2000MAY01
1991
2455
forward 2
TIR
TIR domain
3.50E−37

44
LI:236386.4:2000MAY01
677
850
forward 2
SH3
SH3 domain
5.20E−07

45
LI:252904.5:2000MAY01
358
495
forward 1
Kelch
Kelch motif
3.80E−07

TABLE 3

Domain

SEQ ID NO:
Template ID
Start
Stop
Frame
Type
Topology

1
LG:977683.1:2000FEB18
373
459
forward 1
TM
N in

1
LG:977683.1:2000FEB18
657
731
forward 3
TM
N out

2
LG:893050.1:2000FEB18
15
101
forward 3
TM
N out

3
LG:980153.1:2000FEB18
313
375
forward 1
TM
N out

3
LG:980153.1:2000FEB18
391
453
forward 1
TM
N out

3
LG:980153.1:2000FEB18
278
364
forward 2
TM
N out

3
LG:980153.1:2000FEB18
416
493
forward 2
TM
N out

3
LG:980153.1:2000FEB18
809
871
forward 2
TM
N out

3
LG:980153.1:2000FEB18
902
964
forward 2
TM
N out

3
LG:980153.1:2000FEB18
1181
1264
forward 2
TM
N out

3
LG:980153.1:2000FEB18
1427
1510
forward 2
TM
N out

3
LG:980153.1:2000FEB18
1733
1798
forward 2
TM
N out

3
LG:980153.1:2000FEB18
1868
1954
forward 2
TM
N out

3
LG:980153.1:2000FEB18
2141
2227
forward 2
TM
N out

3
LG:980153.1:2000FEB18
2261
2308
forward 2
TM
N out

3
LG:980153.1:2000FEB18
60
125
forward 3
TM
N in

3
LG:980153.1:2000FEB18
402
476
forward 3
TM
N in

3
LG:980153.1:2000FEB18
2031
2081
forward 3
TM
N in

3
LG:980153.1:2000FEB18
2142
2213
forward 3
TM
N in

5
LG:475551.1:2000FEB18
2134
2208
forward 1
TM
N in

5
LG:475551.1:2000FEB18
2039
2125
forward 2
TM
N out

5
LG:475551.1:2000FEB18
1167
1217
forward 3
TM
N in

6
LG:481407.2:2000FEB18
874
927
forward 1
TM

6
LG:481407.2:2000FEB18
949
1035
forward 1
TM

6
LG:481407.2:2000FEB18
1081
1161
forward 1
TM

6
LG:481407.2:2000FEB18
1510
1584
forward 1
TM

6
LG:481407.2:2000FEB18
1355
1435
forward 2
TM
N out

6
LG:481407.2:2000FEB18
1439
1525
forward 2
TM
N out

6
LG:481407.2:2000FEB18
1326
1409
forward 3
TM
N in

6
LG:481407.2:2000FEB18
1446
1526
forward 3
TM
N in

6
LG:481407.2:2000FEB18
1545
1616
forward 3
TM
N in

7
LI:443580.1:2000FEB01
488
574
forward 2
TM
N out

10
LG:171377.1:2000MAY19
318
386
forward 3
TM
N in

10
LG:171377.1:2000MAY19
549
635
forward 3
TM
N in

10
LG:171377.1:2000MAY19
669
740
forward 3
TM
N in

12
LG:247384.1:2000MAY19
1381
1461
forward 1
TM
N in

12
LG:247384.1:2000MAY19
1624
1710
forward 1
TM
N in

12
LG:247384.1:2000MAY19
1409
1495
forward 2
TM
N in

12
LG:247384.1:2000MAY19
1395
1481
forward 3
TM
N in

12
LG:247384.1:2000MAY19
1617
1679
forward 3
TM
N in

13
LG:403872.1:2000MAY19
535
621
forward 1
TM
N in

13
LG:403872.1:2000MAY19
1360
1446
forward 1
TM
N in

13
LG:403872.1:2000MAY19
1522
1581
forward 1
TM
N in

13
LG:403872.1:2000MAY19
1828
1902
forward 1
TM
N in

13
LG:403872.1:2000MAY19
1957
2022
forward 1
TM
N in

13
LG:403872.1:2000MAY19
299
349
forward 2
TM
N in

13
LG:403872.1:2000MAY19
1361
1423
forward 2
TM
N in

13
LG:403872.1:2000MAY19
1439
1501
forward 2
TM
N in

13
LG:403872.1:2000MAY19
1553
1627
forward 2
TM
N in

13
LG:403872.1:2000MAY19
1859
1918
forward 2
TM
N in

13
LG:403872.1:2000MAY19
2027
2110
forward 2
TM
N in

13
LG:403872.1:2000MAY19
2117
2203
forward 2
TM
N in

13
LG:403872.1:2000MAY19
369
452
forward 3
TM
N in

13
LG:403872.1:2000MAY19
549
635
forward 3
TM
N in

13
LG:403872.1:2000MAY19
708
785
forward 3
TM
N in

13
LG:403872.1:2000MAY19
1101
1187
forward 3
TM
N in

13
LG:403872.1:2000MAY19
1419
1505
forward 3
TM
N in

13
LG:403872.1:2000MAY19
1575
1661
forward 3
TM
N in

13
LG:403872.1:2000MAY19
2115
2192
forward 3
TM
N in

13
LG:403872.1:2000MAY19
2226
2273
forward 3
TM
N in

14
LG:1135213.1:2000MAY19
41
127
forward 2
TM
N out

14
LG:1135213.1:2000MAY19
215
274
forward 2
TM
N out

14
LG:1135213.1:2000MAY19
293
379
forward 2
TM
N out

14
LG:1135213.1:2000MAY19
389
475
forward 2
TM
N out

16
LG:342147.1:2000MAY19
142
204
forward 1
TM
N out

16
LG:342147.1:2000MAY19
171
251
forward 3
TM
N out

17
LG:1097300.1:2000MAY19
487
564
forward 1
TM

17
LG:1097300.1:2000MAY19
805
891
forward 1
TM

17
LG:1097300.1:2000MAY19
1372
1458
forward 1
TM

17
LG:1097300.1:2000MAY19
668
754
forward 2
TM
N out

17
LG:1097300.1:2000MAY19
803
874
forward 2
TM
N out

17
LG:1097300.1:2000MAY19
1358
1441
forward 2
TM
N out

17
LG:1097300.1:2000MAY19
522
578
forward 3
TM
N in

17
LG:1097300.1:2000MAY19
750
836
forward 3
TM
N in

17
LG:1097300.1:2000MAY19
894
956
forward 3
TM
N in

17
LG:1097300.1:2000MAY19
1068
1145
forward 3
TM
N in

18
LG:444850.9:2000MAY19
253
315
forward 1
TM
N in

19
LG:402231.6:2000MAY19
407
484
forward 2
TM
N in

23
LG:350793.2:2000MAY19
148
222
forward 1
TM
N in

23
LG:350793.2:2000MAY19
316
384
forward 1
TM
N in

23
LG:350793.2:2000MAY19
1144
1215
forward 1
TM
N in

23
LG:350793.2:2000MAY19
1231
1293
forward 1
TM
N in

23
LG:350793.2:2000MAY19
1339
1425
forward 1
TM
N in

23
LG:350793.2:2000MAY19
1459
1521
forward 1
TM
N in

23
LG:350793.2:2000MAY19
1582
1662
forward 1
TM
N in

23
LG:350793.2:2000MAY19
1882
1953
forward 1
TM
N in

23
LG:350793.2:2000MAY19
1514
1600
forward 2
TM

23
LG:350793.2:2000MAY19
2135
2221
forward 2
TM

23
LG:350793.2:2000MAY19
1422
1493
forward 3
TM

23
LG:350793.2:2000MAY19
2268
2354
forward 3
TM

24
LG:408751.3:2000MAY19
1202
1264
forward 2
TM
N out

24
LG:408751.3:2000MAY19
1137
1223
forward 3
TM
N in

25
LI:336120.1:2000MAY01
241
297
forward 1
TM
N in

25
LI:336120.1:2000MAY01
616
702
forward 1
TM
N in

25
LI:336120.1:2000MAY01
1141
1200
forward 1
TM
N in

25
LI:336120.1:2000MAY01
2524
2598
forward 1
TM
N in

25
LI:336120.1:2000MAY01
1163
1213
forward 2
TM
N in

25
LI:336120.1:2000MAY01
1922
1972
forward 2
TM
N in

25
LI:336120.1:2000MAY01
2060
2119
forward 2
TM
N in

25
LI:336120.1:2000MAY01
2510
2596
forward 2
TM
N in

25
LI:336120.1:2000MAY01
663
749
forward 3
TM
N in

25
LI:336120.1:2000MAY01
1380
1445
forward 3
TM
N in

25
LI:336120.1:2000MAY01
1839
1925
forward 3
TM
N in

25
LI:336120.1:2000MAY01
2148
2234
forward 3
TM
N in

25
LI:336120.1:2000MAY01
2418
2471
forward 3
TM
N in

25
LI:336120.1:2000MAY01
2499
2585
forward 3
TM
N in

26
LI:234104.2:2000MAY01
1873
1947
forward 1
TM
N out

26
LI:234104.2:2000MAY01
2155
2241
forward 1
TM
N out

26
LI:234104.2:2000MAY01
3616
3690
forward 1
TM
N out

26
LI:234104.2:2000MAY01
1112
1168
forward 2
TM
N in

26
LI:234104.2:2000MAY01
2216
2302
forward 2
TM
N in

26
LI:234104.2:2000MAY01
3632
3718
forward 2
TM
N in

26
LI:234104.2:2000MAY01
3998
4045
forward 2
TM
N in

26
LI:234104.2:2000MAY01
1314
1400
forward 3
TM
N in

26
LI:234104.2:2000MAY01
2172
2258
forward 3
TM
N in

26
LI:234104.2:2000MAY01
2607
2684
forward 3
TM
N in

26
LI:234104.2:2000MAY01
2739
2798
forward 3
TM
N in

26
LI:234104.2:2000MAY01
2841
2891
forward 3
TM
N in

26
LI:234104.2:2000MAY01
3621
3707
forward 3
TM
N in

26
LI:234104.2:2000MAY01
4080
4145
forward 3
TM
N in

28
LI:119992.3:2000MAY01
22
102
forward 1
TM
N out

28
LI:119992.3:2000MAY01
151
237
forward 1
TM
N out

28
LI:119992.3:2000MAY01
1444
1530
forward 1
TM
N out

28
LI:119992.3:2000MAY01
1603
1683
forward 1
TM
N out

28
LI:119992.3:2000MAY01
1729
1809
forward 1
TM
N out

28
LI:119992.3:2000MAY01
2197
2253
forward 1
TM
N out

28
LI:119992.3:2000MAY01
2269
2355
forward 1
TM
N out

28
LI:119992.3:2000MAY01
2989
3075
forward 1
TM
N out

28
LI:119992.3:2000MAY01
3163
3249
forward 1
TM
N out

28
LI:119992.3:2000MAY01
1247
1333
forward 2
TM
N in

28
LI:119992.3:2000MAY01
1538
1606
forward 2
TM
N in

28
LI:119992.3:2000MAY01
2207
2293
forward 2
TM
N in

28
LI:119992.3:2000MAY01
2756
2812
forward 2
TM
N in

28
LI:119992.3:2000MAY01
3098
3169
forward 2
TM
N in

28
LI:119992.3:2000MAY01
3281
3343
forward 2
TM
N in

28
LI:119992.3:2000MAY01
3356
3418
forward 2
TM
N in

28
LI:119992.3:2000MAY01
120
188
forward 3
TM
N in

28
LI:119992.3:2000MAY01
627
689
forward 3
TM
N in

28
LI:119992.3:2000MAY01
708
770
forward 3
TM
N in

28
LI:119992.3:2000MAY01
1425
1511
forward 3
TM
N in

28
LI:119992.3:2000MAY01
1782
1868
forward 3
TM
N in

28
LI:119992.3:2000MAY01
2223
2306
forward 3
TM
N in

28
LI:119992.3:2000MAY01
2757
2843
forward 3
TM
N in

28
LI:119992.3:2000MAY01
3027
3113
forward 3
TM
N in

28
LI:119992.3:2000MAY01
3213
3275
forward 3
TM
N in

28
LI:119992.3:2000MAY01
3312
3374
forward 3
TM
N in

29
LI:197241.2:2000MAY01
289
369
forward 1
TM
N out

29
LI:197241.2:2000MAY01
430
507
forward 1
TM
N out

29
LI:197241.2:2000MAY01
799
861
forward 1
TM
N out

29
LI:197241.2:2000MAY01
889
951
forward 1
TM
N out

29
LI:197241.2:2000MAY01
1798
1863
forward 1
TM
N out

29
LI:197241.2:2000MAY01
1930
2016
forward 1
TM
N out

29
LI:197241.2:2000MAY01
2101
2148
forward 1
TM
N out

29
LI:197241.2:2000MAY01
2206
2262
forward 1
TM
N out

29
LI:197241.2:2000MAY01
416
499
forward 2
TM
N out

29
LI:197241.2:2000MAY01
812
862
forward 2
TM
N out

29
LI:197241.2:2000MAY01
1226
1309
forward 2
TM
N out

29
LI:197241.2:2000MAY01
1475
1558
forward 2
TM
N out

29
LI:197241.2:2000MAY01
2210
2296
forward 2
TM
N out

29
LI:197241.2:2000MAY01
60
125
forward 3
TM
N in

29
LI:197241.2:2000MAY01
333
395
forward 3
TM
N in

29
LI:197241.2:2000MAY01
441
503
forward 3
TM
N in

29
LI:197241.2:2000MAY01
2223
2300
forward 3
TM
N in

31
LI:142384.1:2000MAY01
367
432
forward 1
TM
N out

31
LI:142384.1:2000MAY01
93
155
forward 3
TM
N out

32
LI:895427.1:2000MAY01
1796
1879
forward 2
TM
N in

32
LI:895427.1:2000MAY01
1656
1724
forward 3
TM
N in

33
LI:757439.1:2000MAY01
253
312
forward 1
TM
N in

33
LI:757439.1:2000MAY01
817
900
forward 1
TM
N in

33
LI:757439.1:2000MAY01
1507
1572
forward 1
TM
N in

33
LI:757439.1:2000MAY01
1615
1677
forward 1
TM
N in

33
LI:757439.1:2000MAY01
1696
1758
forward 1
TM
N in

33
LI:757439.1:2000MAY01
1834
1899
forward 1
TM
N in

33
LI:757439.1:2000MAY01
1969
2043
forward 1
TM
N in

33
LI:757439.1:2000MAY01
2107
2193
forward 1
TM
N in

33
LI:757439.1:2000MAY01
2506
2586
forward 1
TM
N in

33
LI:757439.1:2000MAY01
815
901
forward 2
TM
N out

33
LI:757439.1:2000MAY01
1634
1720
forward 2
TM
N out

33
LI:757439.1:2000MAY01
1796
1882
forward 2
TM
N out

33
LI:757439.1:2000MAY01
1952
2026
forward 2
TM
N out

33
LI:757439.1:2000MAY01
2486
2563
forward 2
TM
N out

33
LI:757439.1:2000MAY01
783
869
forward 3
TM
N in

33
LI:757439.1:2000MAY01
996
1049
forward 3
TM
N in

33
LI:757439.1:2000MAY01
1545
1631
forward 3
TM
N in

33
LI:757439.1:2000MAY01
2115
2174
forward 3
TM
N in

35
LI:243660.4:2000MAY01
1247
1333
forward 2
TM
N in

36
LI:334386.1:2000MAY01
538
621
forward 1
TM

36
LI:334386.1:2000MAY01
922
1008
forward 1
TM

36
LI:334386.1:2000MAY01
1087
1173
forward 1
TM

36
LI:334386.1:2000MAY01
1468
1530
forward 1
TM

36
LI:334386.1:2000MAY01
1570
1632
forward 1
TM

36
LI:334386.1:2000MAY01
2731
2802
forward 1
TM

36
LI:334386.1:2000MAY01
2992
3054
forward 1
TM

36
LI:334386.1:2000MAY01
3325
3387
forward 1
TM

36
LI:334386.1:2000MAY01
3406
3468
forward 1
TM

36
LI:334386.1:2000MAY01
3487
3570
forward 1
TM

36
LI:334386.1:2000MAY01
3766
3852
forward 1
TM

36
LI:334386.1:2000MAY01
4006
4077
forward 1
TM

36
LI:334386.1:2000MAY01
4342
4416
forward 1
TM

36
LI:334386.1:2000MAY01
4615
4686
forward 1
TM

36
LI:334386.1:2000MAY01
4747
4833
forward 1
TM

36
LI:334386.1:2000MAY01
5062
5124
forward 1
TM

36
LI:334386.1:2000MAY01
5140
5202
forward 1
TM

36
LI:334386.1:2000MAY01
5227
5289
forward 1
TM

36
LI:334386.1:2000MAY01
5563
5649
forward 1
TM

36
LI:334386.1:2000MAY01
1235
1321
forward 2
TM
N in

36
LI:334386.1:2000MAY01
2423
2476
forward 2
TM
N in

36
LI:334386.1:2000MAY01
2702
2764
forward 2
TM
N in

36
LI:334386.1:2000MAY01
2792
2854
forward 2
TM
N in

36
LI:334386.1:2000MAY01
3086
3172
forward 2
TM
N in

36
LI:334386.1:2000MAY01
3302
3355
forward 2
TM
N in

36
LI:334386.1:2000MAY01
3452
3517
forward 2
TM
N in

36
LI:334386.1:2000MAY01
3920
4006
forward 2
TM
N in

36
LI:334386.1:2000MAY01
4064
4144
forward 2
TM
N in

36
LI:334386.1:2000MAY01
4250
4318
forward 2
TM
N in

36
LI:334386.1:2000MAY01
4331
4402
forward 2
TM
N in

36
LI:334386.1:2000MAY01
4523
4576
forward 2
TM
N in

36
LI:334386.1:2000MAY01
4586
4669
forward 2
TM
N in

36
LI:334386.1:2000MAY01
4772
4855
forward 2
TM
N in

36
LI:334386.1:2000MAY01
5039
5125
forward 2
TM
N in

36
LI:334386.1:2000MAY01
5498
5584
forward 2
TM
N in

36
LI:334386.1:2000MAY01
30
116
forward 3
TM
N in

36
LI:334386.1:2000MAY01
324
380
forward 3
TM
N in

36
LI:334386.1:2000MAY01
387
470
forward 3
TM
N in

36
LI:334386.1:2000MAY01
531
608
forward 3
TM
N in

36
LI:334386.1:2000MAY01
1362
1448
forward 3
TM
N in

36
LI:334386.1:2000MAY01
1539
1625
forward 3
TM
N in

36
LI:334386.1:2000MAY01
2232
2279
forward 3
TM
N in

36
LI:334386.1:2000MAY01
2580
2651
forward 3
TM
N in

36
LI:334386.1:2000MAY01
2757
2822
forward 3
TM
N in

36
LI:334386.1:2000MAY01
2820
2870
forward 3
TM
N in

36
LI:334386.1:2000MAY01
3282
3368
forward 3
TM
N in

36
LI:334386.1:2000MAY01
3510
3596
forward 3
TM
N in

36
LI:334386.1:2000MAY01
3981
4064
forward 3
TM
N in

36
LI:334386.1:2000MAY01
4356
4427
forward 3
TM
N in

36
LI:334386.1:2000MAY01
4464
4544
forward 3
TM
N in

36
LI:334386.1:2000MAY01
4959
5024
forward 3
TM
N in

36
LI:334386.1:2000MAY01
5601
5687
forward 3
TM
N in

37
LI:347572.1:2000MAY01
790
876
forward 1
TM
N in

37
LI:347572.1:2000MAY01
1354
1434
forward 1
TM
N in

37
LI:347572.1:2000MAY01
2425
2511
forward 1
TM
N in

37
LI:347572.1:2000MAY01
2599
2685
forward 1
TM
N in

37
LI:347572.1:2000MAY01
2686
2757
forward 1
TM
N in

37
LI:347572.1:2000MAY01
3133
3207
forward 1
TM
N in

37
LI:347572.1:2000MAY01
1184
1255
forward 2
TM

37
LI:347572.1:2000MAY01
2264
2350
forward 2
TM

37
LI:347572.1:2000MAY01
2597
2665
forward 2
TM

37
LI:347572.1:2000MAY01
2942
3028
forward 2
TM

37
LI:347572.1:2000MAY01
3137
3199
forward 2
TM

37
LI:347572.1:2000MAY01
3227
3289
forward 2
TM

37
LI:347572.1:2000MAY01
129
215
forward 3
TM
N in

37
LI:347572.1:2000MAY01
969
1046
forward 3
TM
N in

37
LI:347572.1:2000MAY01
1947
2033
forward 3
TM
N in

37
LI:347572.1:2000MAY01
2208
2288
forward 3
TM
N in

37
LI:347572.1:2000MAY01
2412
2477
forward 3
TM
N in

37
LI:347572.1:2000MAY01
2604
2684
forward 3
TM
N in

37
LI:347572.1:2000MAY01
2739
2795
forward 3
TM
N in

38
LI:817314.1:2000MAY01
460
546
forward 1
TM

38
LI:817314.1:2000MAY01
1192
1278
forward 1
TM

38
LI:817314.1:2000MAY01
1318
1386
forward 1
TM

38
LI:817314.1:2000MAY01
1423
1485
forward 1
TM

38
LI:817314.1:2000MAY01
1537
1599
forward 1
TM

38
LI:817314.1:2000MAY01
1630
1692
forward 1
TM

38
LI:817314.1:2000MAY01
1756
1842
forward 1
TM

38
LI:817314.1:2000MAY01
1930
1992
forward 1
TM

38
LI:817314.1:2000MAY01
2032
2094
forward 1
TM

38
LI:817314.1:2000MAY01
2860
2946
forward 1
TM

38
LI:817314.1:2000MAY01
3127
3213
forward 1
TM

38
LI:817314.1:2000MAY01
362
448
forward 2
TM
N in

38
LI:817314.1:2000MAY01
3158
3244
forward 2
TM
N in

38
LI:817314.1:2000MAY01
30
95
forward 3
TM
N out

38
LI:817314.1:2000MAY01
1239
1301
forward 3
TM
N out

38
LI:817314.1:2000MAY01
1785
1865
forward 3
TM
N out

38
LI:817314.1:2000MAY01
1920
2000
forward 3
TM
N out

38
LI:817314.1:2000MAY01
3189
3269
forward 3
TM
N out

39
LI:000290.1:2000MAY01
1003
1065
forward 1
TM
N in

39
LI:000290.1:2000MAY01
1075
1137
forward 1
TM
N in

39
LI:000290.1:2000MAY01
1195
1248
forward 1
TM
N in

39
LI:000290.1:2000MAY01
767
844
forward 2
TM

39
LI:000290.1:2000MAY01
882
932
forward 3
TM
N in

40
LI:023518.3:2000MAY01
28
108
forward 1
TM
N out

40
LI:023518.3:2000MAY01
20
106
forward 2
TM
N in

41
LI:1084246.1:2000MAY01
178
264
forward 1
TM
N out

41
LI:1084246.1:2000MAY01
2686
2760
forward 1
TM
N out

41
LI:1084246.1:2000MAY01
2932
3003
forward 1
TM
N out

41
LI:1084246.1:2000MAY01
3097
3159
forward 1
TM
N out

41
LI:1084246.1:2000MAY01
3184
3246
forward 1
TM
N out

41
LI:1084246.1:2000MAY01
3352
3405
forward 1
TM
N out

41
LI:1084246.1:2000MAY01
3409
3480
forward 1
TM
N out

41
LI:1084246.1:2000MAY01
3526
3609
forward 1
TM
N out

41
LI:1084246.1:2000MAY01
200
253
forward 2
TM
N in

41
LI:1084246.1:2000MAY01
2171
2254
forward 2
TM
N in

41
LI:1084246.1:2000MAY01
2654
2734
forward 2
TM
N in

41
LI:1084246.1:2000MAY01
3065
3142
forward 2
TM
N in

41
LI:1084246.1:2000MAY01
3284
3358
forward 2
TM
N in

41
LI:1084246.1:2000MAY01
3479
3553
forward 2
TM
N in

41
LI:1084246.1:2000MAY01
582
641
forward 3
TM
N out

41
LI:1084246.1:2000MAY01
2127
2213
forward 3
TM
N out

41
LI:1084246.1:2000MAY01
2457
2543
forward 3
TM
N out

41
LI:1084246.1:2000MAY01
2580
2666
forward 3
TM
N out

41
LI:1084246.1:2000MAY01
2751
2813
forward 3
TM
N out

41
LI:1084246.1:2000MAY01
2826
2888
forward 3
TM
N out

41
LI:1084246.1:2000MAY01
2961
3047
forward 3
TM
N out

41
LI:1084246.1:2000MAY01
3249
3335
forward 3
TM
N out

41
LI:1084246.1:2000MAY01
3429
3515
forward 3
TM
N out

42
LI:1165828.1:2000MAY01
61
147
forward 1
TM
N out

42
LI:1165828.1:2000MAY01
244
312
forward 1
TM
N out

42
LI:1165828.1:2000MAY01
454
510
forward 1
TM
N out

42
LI:1165828.1:2000MAY01
3664
3750
forward 1
TM
N out

42
LI:1165828.1:2000MAY01
3937
4023
forward 1
TM
N out

42
LI:1165828.1:2000MAY01
4600
4653
forward 1
TM
N out

42
LI:1165828.1:2000MAY01
4855
4941
forward 1
TM
N out

42
LI:1165828.1:2000MAY01
5047
5133
forward 1
TM
N out

42
LI:1165828.1:2000MAY01
5227
5298
forward 1
TM
N out

42
LI:1165828.1:2000MAY01
5311
5388
forward 1
TM
N out

42
LI:1165828.1:2000MAY01
5491
5577
forward 1
TM
N out

42
LI:1165828.1:2000MAY01
5800
5871
forward 1
TM
N out

42
LI:1165828.1:2000MAY01
227
301
forward 2
TM
N in

42
LI:1165828.1:2000MAY01
713
775
forward 2
TM
N in

42
LI:1165828.1:2000MAY01
1769
1819
forward 2
TM
N in

42
LI:1165828.1:2000MAY01
2759
2845
forward 2
TM
N in

42
LI:1165828.1:2000MAY01
3869
3928
forward 2
TM
N in

42
LI:1165828.1:2000MAY01
4688
4774
forward 2
TM
N in

42
LI:1165828.1:2000MAY01
5048
5116
forward 2
TM
N in

42
LI:1165828.1:2000MAY01
5531
5617
forward 2
TM
N in

42
LI:1165828.1:2000MAY01
5816
5893
forward 2
TM
N in

42
LI:1165828.1:2000MAY01
39
113
forward 3
TM
N out

42
LI:1165828.1:2000MAY01
906
968
forward 3
TM
N out

42
LI:1165828.1:2000MAY01
1602
1688
forward 3
TM
N out

42
LI:1165828.1:2000MAY01
3471
3557
forward 3
TM
N out

42
LI:1165828.1:2000MAY01
3558
3608
forward 3
TM
N out

42
LI:1165828.1:2000MAY01
4203
4289
forward 3
TM
N out

42
LI:1165828.1:2000MAY01
4749
4835
forward 3
TM
N out

42
LI:1165828.1:2000MAY01
5625
5690
forward 3
TM
N out

42
LI:1165828.1:2000MAY01
5847
5918
forward 3
TM
N out

43
LI:007302.1:2000MAY01
346
426
forward 1
TM
N in

43
LI:007302.1:2000MAY01
2638
2721
forward 1
TM
N in

43
LI:007302.1:2000MAY01
59
145
forward 2
TM
N out

43
LI:007302.1:2000MAY01
653
718
forward 2
TM
N out

43
LI:007302.1:2000MAY01
1799
1885
forward 2
TM
N out

43
LI:007302.1:2000MAY01
321
407
forward 3
TM
N in

43
LI:007302.1:2000MAY01
480
566
forward 3
TM
N in

43
LI:007302.1:2000MAY01
645
704
forward 3
TM
N in

43
LI:007302.1:2000MAY01
807
890
forward 3
TM
N in

43
LI:007302.1:2000MAY01
1161
1223
forward 3
TM
N in

43
LI:007302.1:2000MAY01
1236
1298
forward 3
TM
N in

43
LI:007302.1:2000MAY01
1362
1448
forward 3
TM
N in

43
LI:007302.1:2000MAY01
1809
1868
forward 3
TM
N in

43
LI:007302.1:2000MAY01
1998
2084
forward 3
TM
N in

43
LI:007302.1:2000MAY01
2184
2234
forward 3
TM
N in

43
LI:007302.1:2000MAY01
2457
2540
forward 3
TM
N in

43
LI:007302.1:2000MAY01
2595
2681
forward 3
TM
N in

44
LI:236386.4:2000MAY01
3739
3792
forward 1
TM
N out

44
LI:236386.4:2000MAY01
53
118
forward 2
TM
N out

44
LI:236386.4:2000MAY01
218
304
forward 2
TM
N out

44
LI:236386.4:2000MAY01
3755
3823
forward 2
TM
N out

44
LI:236386.4:2000MAY01
2376
2435
forward 3
TM
N out

45
LI:252904.5:2000MAY01
494
550
forward 2
TM
N out

45
LI:252904.5:2000MAY01
300
374
forward 3
TM
N out

TABLE 4

SEQ ID
Component

NO:
ID
Start
Stop

1
g5813583
610
959

1
6817504J1
1
621

1
g1989978
3
264

1
4292280H1
10
242

1
483000R6
11
337

1
483000H1
11
252

1
g1424329
14
316

1
3255214H1
107
349

1
1450061H1
131
371

1
5388816H1
152
419

1
955673H1
181
406

1
2109273H1
286
547

1
5980116H1
373
651

1
g828864
376
596

1
3072657H1
380
488

1
2949928H1
416
680

1
6016294H1
580
677

1
g1855323
611
695

1
g1623907
611
667

1
g1855498
611
933

1
g1751162
689
928

1
1309114T6
716
955

1
1309114F6
716
979

1
1309114H1
716
971

1
3637614H1
807
1053

1
7065033H1
899
1165

1
6817504H1
971
1358

1
6013754H1
978
1245

1
g573231
1034
1316

1
g709283
1034
1322

1
g767017
1035
1345

1
g692230
1061
1388

1
1617090H1
1084
1209

1
1617090F6
1084
1380

1
g1157664
1112
1412

2
6131346H1
1
193

2
6871387H1
125
662

2
g2279352
352
634

3
7039759H1
1390
1914

3
6481201H1
1428
1542

3
6929893H1
1460
1891

3
160750H1
1643
1734

3
6201684H1
1659
2172

3
492554H1
36
275

3
6710369H1
84
594

3
g770845
369
639

3
6710369J1
538
1037

3
6866894H1
749
1339

3
2045879F6
796
1123

3
2045879H1
796
1064

3
g677645
854
1153

3
g570913
854
1235

3
2837088H1
1
79

3
g878213
855
1194

3
3637810H1
905
1188

3
382301R6
11
244

3
3637810F8
906
1347

3
5516287H1
938
1192

3
382301H1
11
273

3
310657H1
983
1184

3
381716R1
11
471

3
054856H1
1027
1268

3
2676843H1
1102
1294

3
2865460H1
1182
1413

3
5983503H1
1223
1521

3
3296833H1
24
289

3
492559R1
36
564

3
3903656H1
1288
1501

3
2554026H1
1322
1591

3
g1894266
1326
1800

3
3151953H1
2028
2266

3
6357422H1
2056
2344

3
382301T6
2063
2619

3
2498615F6
2077
2500

3
2498615H1
2077
2310

3
492559F1
2104
2658

3
2684917H1
1709
1950

3
3898190H1
1917
2210

3
381716F1
2106
2658

3
5952437H1
1960
2247

3
4701147H1
2134
2402

3
g5435909
2213
2663

3
7067611H1
2254
2764

3
g2563607
2282
2658

3
1889064H1
2300
2577

3
2400488H1
2302
2549

3
g817549
2307
2667

3
g566965
2343
2658

3
g1894154
2354
2658

3
g869609
2394
2667

3
g4291206
2396
2766

3
g646309
2398
2658

3
3249908H1
2467
2760

3
672907H1
2516
2658

3
672763R6
2516
2658

3
672763H1
2516
2658

3
672696H1
2516
2658

3
672763T6
2516
2621

4
g1939101
219
609

4
1749048T6
1
388

5
996489H1
1
289

5
996489R6
1
321

5
6807726H1
9
414

5
g1208184
74
603

5
g1146490
110
406

5
1391557H1
145
273

5
2054016H1
155
406

5
3564377H1
213
498

5
1389469H1
365
607

5
6178475H1
288
554

5
2490333H1
461
684

5
1498011F6
497
816

5
1498011H1
497
735

5
154577H1
512
727

5
2439861H1
600
846

5
6974170H1
655
1206

5
5557446H1
723
990

5
6821354J1
725
1336

5
3801324H1
751
1035

5
159257H1
753
952

5
1562163H1
801
1030

5
7161127H1
827
1358

5
1840238H1
834
989

5
1892815H1
944
1194

5
1893046H1
944
1185

5
1391452H1
962
1131

5
1391452F6
962
1223

5
1680496H1
1117
1345

5
2132470R6
1120
1456

5
1265470H1
1149
1401

5
6804038H1
1164
1555

5
3430883H1
1183
1428

5
2132470H1
1188
1456

5
1515410H1
1224
1442

5
g2056082
1221
1509

5
566614H1
1269
1530

5
4780315H1
1290
1553

5
1637781H1
1302
1454

5
1638827H1
1302
1455

5
1633937H1
1762
1969

5
6821354H1
1419
1971

5
1390745H1
1433
1557

5
1932110H1
1712
1868

5
1932110F6
1713
1960

5
1850028H1
1728
1970

5
386578H1
1753
2029

5
1862471H1
1759
1870

5
4588296H1
1799
1890

5
2028756H1
1816
1890

5
1988349T6
1824
2253

5
1498011T6
1829
2254

5
6157225H1
1842
2101

5
521110H1
1850
1975

5
6157733H1
1854
2051

5
4829815H1
1889
1962

5
4411517H1
1907
2157

5
541981H1
1927
2155

5
4558860H1
1944
2106

5
1391452T6
1958
2260

5
2752758H1
1963
2239

5
1807380T6
1965
2250

5
1807042F6
1970
2290

5
1807042H1
1970
2255

5
2311115H1
1992
2237

5
996489T6
1994
2332

5
6125387H1
2007
2356

5
4905520H1
2022
2280

5
4671595H1
2027
2277

5
318659H1
2041
2291

5
4902185H1
2096
2297

5
g2055975
2105
2298

5
1219763H1
2110
2288

5
1219763R6
2110
2290

5
1219763T6
2110
2251

5
1219763T1
2110
2250

5
581809H1
2110
2369

5
g2788727
2119
2369

5
2753294H1
2255
2364

6
2055577R6
766
1137

6
2055577T6
766
1096

6
g1578280
767
1137

6
g4897043
769
1147

6
g1897641
769
1137

6
g3004281
774
1138

6
6361438H2
776
1335

6
1273945F1
790
1131

6
1273945H1
790
948

6
2558966H1
791
1058

6
g2178992
831
1147

6
g1891843
842
1143

6
g1203333
844
1159

6
g1141073
845
1135

6
g1728655
851
1143

6
4618322H1
860
1133

6
g3179203
882
1147

6
4164817H1
9
261

6
5851107H1
12
270

6
4938618H1
1
285

6
2096384H1
13
274

6
4938518H1
1
184

6
6133436H1
6
304

6
5218795H1
14
282

6
3038155H1
6
294

6
3088308H1
14
285

6
6821608H1
14
578

6
5855412H1
14
297

6
2532161H1
6
258

6
5999068H1
6
559

6
g5431297
7
324

6
2715577H1
14
256

6
3717266H1
6
312

6
3088671H1
14
251

6
1690850T6
16
558

6
4978332H1
19
305

6
2525160H1
368
619

6
2811816H1
382
591

6
5285481H1
381
530

6
g1923667
380
575

6
2724519H1
385
586

6
4403213H1
397
537

6
2525196H1
368
597

6
g2111237
370
592

6
g1155753
370
731

6
g2111348
371
598

6
g3798474
371
588

6
g2968466
372
670

6
g1874430
374
675

6
g3933996
376
589

6
g2567131
409
663

6
g1422584
429
556

6
g2157052
435
744

6
3092788H1
437
722

6
1650634F6
441
871

6
1831391H1
637
867

6
2173245H1
652
888

6
768284H1
670
900

6
g2567185
671
1075

6
2522538H1
672
909

6
g3446544
676
1136

6
4377572H1
680
948

6
g4242762
685
1135

6
g5444329
685
1147

6
g4394905
687
1135

6
g4891466
689
1136

6
4534880T1
604
1111

6
g1422487
626
919

6
3213475H1
692
929

6
g3674532
698
1150

6
g3665343
700
1135

6
g5365390
705
1135

6
3362353H1
708
848

6
g3737258
707
1140

6
3801387H1
711
869

6
g1277444
717
1135

6
6045963H1
722
1176

6
g2236500
716
1139

6
4024228H1
722
1008

6
g4088002
718
1149

6
3553263H1
754
969

6
g2229274
762
1153

6
2055577H1
766
1031

6
5116334H1
19
290

6
1546662H1
19
218

6
2275605H1
19
291

6
5968841H1
19
591

6
1902261H1
1
288

6
6728620H1
29
590

6
1690850F6
29
482

6
1690850H1
29
237

6
5346772H1
29
227

6
5346890H1
29
141

6
4151612H1
31
258

6
g2229063
27
371

6
3074071H1
31
308

6
3717427H1
32
401

6
2467222H1
32
258

6
5687205H1
33
296

6
g2027890
31
188

6
2864630H1
34
341

6
3837823H1
35
321

6
5978027H1
35
298

6
3841249H1
35
236

6
5780416H1
37
313

6
4525495H1
38
294

6
2943180H1
35
281

6
3159688H1
36
136

6
g2156554
35
459

6
5989823H1
38
334

6
4525695H1
38
287

6
774424H1
38
269

6
4376239H1
38
242

6
222536R1
19
533

6
4951501H2
19
325

6
5986222H1
21
289

6
4782312H1
19
258

6
222536H1
19
150

6
6152094H1
26
301

6
3365655H1
27
286

6
2098005H1
27
209

6
2874828H1
27
311

6
4748012H1
29
297

6
5122477H1
27
278

6
5516387H1
27
270

6
5695974H1
27
203

6
4994832H1
36
185

6
g1728758
40
325

6
5993725H1
40
342

6
5995510H1
40
330

6
g4329715
40
406

6
2894305H1
47
310

6
2719394T6
303
625

6
g5658221
327
736

6
5857676H1
296
564

6
5726056H2
297
676

6
2097760H1
300
546

6
2873090H1
329
605

6
3136434H1
334
597

6
g1646811
339
596

6
2738075F6
321
767

6
2738075H1
321
564

6
2719394F6
318
683

6
2719394H1
267
521

6
g5527461
339
586

6
g2437242
340
551

6
4724150H1
343
607

6
g1312816
346
778

6
4787470H1
360
597

6
5003922H1
362
616

6
6156796H1
87
345

6
2895320H1
43
273

6
4665825H1
96
339

6
3232485H1
44
316

6
2399837H1
98
322

6
6904948H1
101
462

6
6411519H1
45
554

6
035304H1
55
324

6
4573015H1
116
388

6
5609131H1
123
365

6
g3598018
135
590

6
g3432506
136
593

6
g5431490
144
323

6
g1646810
57
324

6
g2555607
156
500

6
g1578371
53
198

6
g2229126
158
593

6
g3229125
173
598

6
g3898868
173
593

6
g4452177
180
323

6
g3182012
205
593

6
790141R1
222
746

6
790141H1
222
456

6
3599189H1
229
519

6
g2204943
229
593

6
3258218H1
232
529

6
g2355330
244
592

6
g2882852
65
382

6
g1950563
70
330

6
1548020H1
72
301

6
2823270H1
250
538

6
2873603H1
257
537

6
2755517H1
79
346

6
3718262H1
81
391

6
915491R6
260
597

6
915491H1
260
569

6
4979613H1
276
550

6
6821608J1
278
791

6
3246153H1
278
516

6
4008733H1
281
559

6
4989076H1
497
752

6
g5850851
503
739

6
g4738819
504
739

6
g5849856
504
739

6
6365612H1
519
816

6
5183801H1
525
789

6
3706413H1
529
812

6
4828553H1
532
762

6
2604912H1
539
791

6
g2107086
553
977

6
g5769539
555
733

6
5576107H1
559
800

6
g1891969
565
972

6
3620132H1
31
324

6
4605074H1
598
846

6
1650642F6
441
832

6
3443641H1
484
742

6
g3889543
490
917

6
g3095491
492
586

6
2738075T6
494
1096

6
4534880H1
441
701

6
4277322H1
497
751

6
4989476F8
496
967

6
1650634H1
441
687

6
g2575167
443
843

6
3718361H1
456
769

6
3267371H1
457
700

6
1902161H1
462
586

6
5056004H1
465
746

6
g3751871
477
736

6
2997314H1
482
786

6
2996840H1
483
745

6
4276994H1
497
635

6
g1923480
981
1130

6
6550669H1
1020
1619

6
g4083790
1388
1829

6
4700302H1
1388
1666

6
g3770915
1402
1832

6
g1224283
1032
1442

6
g2767747
1055
1135

6
2539090H1
1087
1334

6
1773532H1
1179
1391

6
6045963J1
1211
1801

6
1650634T6
1270
1789

6
g4373516
1308
1756

7
g2524924
315
730

7
g2161228
313
724

7
g3802198
329
703

7
g3147794
231
688

7
g2162211
119
550

7
2497157H1
78
310

7
2854513H1
1
290

8
1985316H1
1
269

8
1985316R6
1
310

8
197972T6
43
445

8
197972H1
43
274

8
197972R6
43
457

9
7197754H2
1
582

10
g5810426
1
449

10
g2219401
2
423

10
g4329377
27
489

10
g2537784
172
669

10
g1376965
259
669

10
4983705H1
270
539

10
7269840H1
339
848

11
6453567H1
1
503

11
4052122H1
185
457

11
4052122F7
185
636

11
g3897399
255
371

12
973628H1
996
1226

12
3014231H1
1097
1369

12
975169T6
1112
1714

12
3042767T6
1122
1713

12
6218188H1
1165
1678

12
5151940H1
1216
1440

12
975304T6
1231
1709

12
5531975T6
1266
1741

12
3577265H1
1286
1598

12
3016255H1
1291
1599

12
970343R6
1304
1757

12
970343H1
1304
1606

12
970343T6
1322
1714

12
3575519H1
1334
1616

12
5153116H1
1345
1469

12
988837H1
1422
1684

12
g4088627
1503
1756

12
6903302H1
1564
2110

12
975169H1
856
1057

12
g2156118
1
475

12
975304H1
2
248

12
3403717H1
1
249

12
4042617H1
1
256

12
3042767H1
3
267

12
3042767F6
3
275

12
4854092H1
4
234

12
4743545H1
6
265

12
5856186H1
20
270

12
535036H1
27
246

12
3960535H2
379
641

12
3960535F6
379
742

12
6216170H1
579
726

12
4456047H1
621
886

12
945050H1
762
1003

12
920681H1
855
1174

12
923436H1
855
1167

12
975169R6
855
1336

13
4745248H1
1
241

13
7158869H1
7
479

13
3335250F6
34
398

13
3335250H1
34
273

13
7077668H1
136
659

13
4318873H1
159
370

13
6992614H1
236
740

13
753174H1
356
543

13
7046749H1
453
1036

13
6983112H1
621
891

13
g570318
630
905

13
5266308H1
632
788

13
g778569
673
993

13
748982H1
672
901

13
744829R1
672
1226

13
744829H1
672
902

13
g869715
672
1004

13
g565684
901
1080

13
g1025621
1027
1340

13
g1059514
1027
1251

13
g714830
1108
1397

13
4311224H1
1203
1484

13
2292254R6
1398
1866

13
2292421R6
1398
1506

13
2291932H1
1398
1649

13
530715H1
1423
1644

13
7090888H1
1520
1659

13
g3086021
1518
1916

13
2291932T6
1559
2132

13
3335250T6
1562
2050

13
6841962H1
1748
2279

13
6855669H1
1881
2375

13
746910R6
1912
2375

13
746910H1
1912
2143

13
746910T6
1913
2371

13
6844175H1
1941
2375

13
2568562H1
1989
2222

13
g4393425
1996
2415

13
g4109519
2006
2375

13
g2694947
2036
2375

13
g2703845
2040
2375

13
g3884077
2042
2375

13
g3278030
2045
2423

13
4705947H1
2104
2256

13
g714831
2110
2411

13
750787H1
2121
2365

13
667235H1
2126
2370

13
g561290
2150
2375

13
g518739
2157
2375

13
g3230679
2187
2375

13
g717890
2318
2390

14
4145560H1
1
337

14
7182979H1
1
537

14
g4929686
1
1581

14
g1881193
113
359

14
798770H1
206
449

14
g1198695
214
498

14
g1637735
380
642

14
g2204679
39
511

14
5540595H1
1
195

14
g1970769
1
345

14
g1970753
1
325

14
g1971048
1
253

14
g1970777
1
223

14
g815792
8
284

14
g1441646
3
303

14
g4372035
14
479

14
g2930515
35
487

14
g4897951
44
477

14
609028H1
27
178

14
g2782816
15
417

14
g4326525
1
141

14
g2525795
28
236

15
g6450570
1077
1426

15
g6473965
97
472

15
525308H1
117
324

15
g2898932
121
456

15
526619H1
129
370

15
g2942591
134
271

15
2360586H1
145
399

15
2211028H1
228
438

15
987239R1
305
763

15
987239H1
305
478

15
1436565F1
354
824

15
7161757H1
1
521

15
g4372435
23
212

15
g5451540
23
516

15
g3884494
40
407

15
g5545276
40
499

15
2269559H1
44
305

15
2269559R6
44
350

15
g5152652
62
224

15
3222733H1
86
303

15
1664718F6
91
349

15
1664718H1
91
352

15
g880746
97
278

15
1436565H1
354
626

15
2520441H1
360
641

15
3460138H1
393
644

15
6881873J1
142
680

15
6881873H1
51
484

15
1670270F6
637
1077

15
g1921208
645
985

15
6523810H1
659
1052

15
3499282H1
423
706

15
5852917H1
661
921

15
2247228H1
692
959

15
g851799
704
1030

15
4946358H1
711
972

15
5951390H1
729
954

15
6345162H1
792
1031

15
3436737H1
794
1029

15
g2264229
426
815

15
3496822H1
430
703

15
6321740H1
805
1031

15
2112334H1
820
1080

15
1007012H1
470
767

15
2112334R6
820
1167

15
3215530H1
491
714

15
3144904H1
873
1217

15
g4073140
965
1444

15
g4523268
970
1426

15
g5673767
972
1444

15
2836020H1
496
741

15
960106H1
971
1049

15
962045H1
971
1248

15
5109444H1
498
723

15
g2070246
973
1335

15
g2206523
973
1266

15
9880857
501
815

15
g5637498
978
1401

15
g5449171
979
1439

15
3733518H1
980
1275

15
g4763832
981
1444

15
6807693H1
520
1140

15
1968707R6
522
920

15
g5754504
985
1444

15
g5511006
992
1444

15
6154958H1
991
1304

15
g2952676
993
1443

15
1968707H1
522
727

15
961381H1
997
1290

15
6344762H1
534
632

15
959580H1
997
1109

15
g2209838
548
972

15
6856259H1
554
1067

15
2479125H1
565
804

15
4345262H1
577
856

15
959580R1
997
1433

15
g4437873
998
1426

15
g5661623
1002
1410

15
g4332091
1006
1444

15
5031758H1
585
825

15
g1320158
1008
1439

15
g5391778
1012
1444

15
g5933236
1012
1444

15
g2901335
1014
1408

15
g1940416
1015
1444

15
g5113563
1021
1444

15
2517547H1
1043
1277

15
g5451354
1053
1284

15
g2220466
1062
1408

15
g2952784
1064
1440

15
3329431H1
607
885

15
5271370H1
618
855

15
1670270H1
637
862

15
g1367649
1071
1444

15
g3751105
1073
1444

15
g1367704
1083
1437

15
g5904784
1090
1444

15
g4852367
1094
1444

15
g1443408
1101
1445

15
2124915H1
1117
1402

15
g3412275
1126
1443

15
g5671642
1138
1407

15
g2056619
1211
1442

15
g4148637
1249
1426

15
g1921308
1253
1445

15
g2952936
1256
1443

15
g2728303
1276
1446

15
g4195307
1314
1444

15
g2841540
1351
1445

16
1601184H1
304
515

16
3540611H1
297
388

16
3111986H1
304
368

16
1673924H1
297
503

16
1569636H1
297
508

16
2696549F6
297
378

16
g2219716
1
359

16
g2898608
1
211

16
6755069H1
1
654

16
3539560H1
303
476

16
1515102H1
297
466

16
1572728H1
297
492

16
1347783H1
309
435

16
1691349H1
297
436

16
3686316H1
304
498

17
4563458H1
1
197

17
4381069H1
15
261

17
6205262H1
107
542

17
6202507H1
412
921

17
4620133F6
603
940

17
4620133H1
603
851

17
2158854T6
743
1154

17
g5543295
743
1201

17
g1385006
749
1056

17
2158854H1
749
1012

17
3973473H1
782
1055

17
3973473F8
783
1307

17
5629236F6
806
1288

17
3973473T8
883
1519

17
5629236H1
1062
1288

17
2777742H1
1069
1170

17
2509368H1
1108
1343

17
2793074H2
1138
1253

17
2793074F6
1142
1253

17
2793074T6
1177
1260

17
2364001H1
1404
1651

17
g3898774
1582
1927

18
3224948H1
1
177

18
3695977H1
7
312

18
7006140H1
8
566

18
2794410H1
13
150

18
6460326H1
40
396

18
6787346H1
51
555

18
3403667H1
53
289

18
3725949H1
56
297

18
2830626H1
61
333

18
g1646403
62
445

18
2830626F6
61
581

18
6784569H2
61
591

18
5959276H1
74
534

18
6804522J1
100
522

18
3697994H1
118
356

18
581170H1
133
223

18
5610623H1
133
408

18
2770068H1
157
405

18
7165406H1
159
535

18
6702265H1
312
825

18
7037116H1
372
699

18
6531787H1
511
922

18
1214116H1
519
662

18
6804522H1
637
1171

18
7218713H1
677
1237

18
3557937H1
687
987

18
6455665H1
825
1420

18
6701662H1
821
1297

18
6523244H1
847
1324

18
4004887H1
926
1204

18
4876106H1
945
1182

18
4067628F7
1082
1353

18
6932868H1
1082
1543

18
3191237H1
1103
1414

18
7088151H1
1126
1596

18
2818868H1
1173
1275

18
5582555H1
1189
1439

18
5582587H1
1188
1442

18
g4893540
1220
1631

18
4442851H1
1276
1544

18
3022715H1
1325
1618

18
3780205H1
1349
1644

18
g1947313
1365
1595

18
2996242H1
1384
1678

18
3052021H1
1414
1704

18
g3095711
1478
1951

18
3927236H1
1596
1856

18
2769806H1
1625
1854

18
5866616H1
1749
1842

18
3730361H1
1767
1870

18
7169445H1
1
343

19
6546889H1
1
339

19
1651460H1
83
301

19
6264819H1
186
461

19
4753777H1
214
338

19
2331424R6
333
638

19
2331424H1
333
560

19
3398569H1
339
582

19
2435387H1
342
570

19
506031H1
351
527

19
6118353H1
362
469

19
609565H1
377
628

19
2873416H1
397
540

20
2583409H1
204
430

20
g2823866
1
383

20
3488619H1
1
280

20
5633561F6
207
798

20
5633561H1
207
465

21
g4690049
1
195

21
1398471F6
1
410

21
1399832H1
1
227

21
2694772H1
126
337

21
2694772F6
125
338

21
1398471H1
1
238

22
5286647F9
1
615

22
3808866F8
5
457

22
7264977H1
17
605

22
4760775F6
38
607

22
5286647T9
242
819

22
5286647T8
506
825

22
5286647F8
5
552

23
628206T7
1954
2472

23
277808H1
1974
2264

23
278730H1
1976
2309

23
275057H1
1976
2160

23
275257H1
1976
2193

23
2586194T6
1977
2477

23
6479875H1
1990
2477

23
2856722H1
2000
2267

23
1298131T6
2038
2472

23
1298131H1
2038
2291

23
1298131F1
2038
2276

23
1298131F6
2038
2516

23
2300965T6
2040
2476

23
g4075934
2067
2517

23
g3415730
2098
2518

23
g2139392
2111
2489

23
g4735514
2111
2514

23
g4261130
2111
2518

23
g4665764
2111
2513

23
3483466H1
2111
2363

23
g5366013
2119
2512

23
g4599402
2126
2517

23
4096757H1
2144
2441

23
2254547H1
2151
2380

23
g1692867
2185
2513

23
g1157366
2204
2513

23
g1128313
2281
2514

23
g2524394
2295
2514

23
g1227222
2316
2513

23
5913552F6
2405
2537

23
2265479H1
2413
2516

23
5913552H1
2416
2504

23
5643316H1
1884
2089

23
5794438H1
1854
2089

23
5791230H1
1854
2089

23
5791375H1
1854
2089

23
856338H1
1129
1361

23
3280567H1
1148
1399

23
6551617H1
1183
1732

23
6552317H1
1183
1762

23
6751972H1
1191
1762

23
5759260H1
1193
1468

23
4190084H1
1198
1471

23
6136366H1
1270
1571

23
4205570H1
1301
1533

23
3354295H1
1305
1539

23
4303867H1
1317
1502

23
628206H1
1382
1615

23
628206R7
1382
1793

23
4337705H1
1443
1782

23
2881556H1
1467
1726

23
6875744H1
1469
2058

23
5677351H1
1496
1741

23
2772870H1
1505
1749

23
1212235R6
1541
1990

23
1212235H1
1541
1815

23
g1646733
1551
1869

23
2297674H2
1562
1829

23
2586194H1
1590
1839

23
2586194F6
1590
2059

23
2403715H1
1606
1845

23
6859287H1
1655
2089

23
5091604H1
1689
1969

23
2736946H1
1689
1940

23
2823882H1
1714
2005

23
2821225H1
1714
2025

23
573737H1
1740
1857

23
6350742H1
1769
2058

23
2300965H1
1775
2006

23
2300965R6
1775
2170

23
439474H1
1808
2043

23
5686929H1
1843
2106

23
5794171H1
1854
2162

23
5792646H1
1854
2162

23
5792285H1
1854
2089

23
5793871H1
1854
2089

23
4358460H1
1059
1303

23
g2142328
1
284

23
5662770H1
1
178

23
7004664H1
142
653

23
g1692967
194
528

23
265733H1
224
448

23
6406758H1
542
995

23
6259622H1
667
954

23
g1628822
753
1138

23
2587028H1
876
1152

23
3331574H1
913
1177

23
705890H1
915
1149

23
705979H1
915
1181

23
4114902H1
922
1125

23
2889650H1
968
1241

23
6507226H1
1058
1499

23
6258095H1
1059
1340

24
g314920
1324
1655

24
g615297
1324
1656

24
g517687
1324
1655

24
g615578
1370
1656

24
g614283
1374
1656

24
1456735T6
1422
1622

24
g4328099
1446
1662

24
g614262
1449
1656

24
g4152278
1455
1656

24
g562532
1461
1656

24
g671207
1462
1656

24
5945223H1
1578
1660

24
g2985356
1621
1848

24
5498383R6
1236
1619

24
4717574T6
1186
1635

24
1476570F6
1188
1656

24
1476571F6
1188
1532

24
1476570H1
1188
1394

24
g614326
1200
1657

24
1476571T6
1206
1619

24
g4152280
1219
1388

24
g4598685
1229
1657

24
g314775
1244
1656

24
2153570H1
1241
1515

24
4492503H1
1247
1657

24
g615988
1254
1656

24
g775420
1264
1670

24
5659105H1
1264
1344

24
g4617815
1272
1663

24
g5511164
1274
1656

24
g3649444
1275
1658

24
g314750
1287
1656

24
004952H1
1164
1423

24
1476570T6
1171
1617

24
4705993T9
1106
1554

24
1270695T6
1177
1617

24
2416693T6
1090
1611

24
748579R1
1076
1656

24
859218H1
1007
1221

24
g6086997
903
1254

24
533539T6
909
1226

24
5371992T9
942
1580

24
g314842
948
1254

24
g683067
970
1254

24
7290682H1
978
1513

24
009349H1
761
1103

24
6888770H1
772
1287

24
6866213H1
772
1377

24
4943311T6
785
1231

24
7292792H1
793
1368

24
g1192539
802
1254

24
g4223790
815
1254

24
6717166H1
821
1283

24
g3331126
836
1256

24
5310872H1
838
1064

24
5267191H1
858
1118

24
4940779H1
878
1150

24
1270258H1
880
1118

24
g794503
887
1267

24
g816007
884
1243

24
g901436
892
1254

24
6869327H1
724
1228

24
6855475H1
1045
1242

24
1270292T6
1048
1610

24
g822109
1058
1267

24
748579H1
1064
1304

24
859218R6
1007
1446

24
g567610
1012
1254

24
859218R1
1007
1527

24
859218T6
1046
1617

24
1270695F6
541
829

24
1270695H1
541
773

24
7067123H1
525
1069

24
6448066H1
400
951

24
g691925
443
755

24
533539R6
431
951

24
533539H1
427
622

24
5379139H1
434
679

24
6868778H1
494
1123

24
5674272H1
391
645

24
6120160H1
386
785

24
6866026H1
381
974

24
1456735F6
189
605

24
6721132H1
193
579

24
4203426H1
212
337

24
1992224H1
206
475

24
7259028H1
204
579

24
g766593
289
587

24
7058996H1
305
886

24
4092963H1
327
609

24
g614162
336
605

24
g677813
336
565

24
6985794H1
332
788

24
4338771H1
359
628

24
g708822
393
694

24
g764692
395
736

24
g816062
378
790

24
3864471H1
374
591

24
6990907H1
383
921

24
g1627181
208
330

24
5311056H1
591
753

24
5907142H1
659
938

24
5924427H1
681
971

24
2707020H1
557
850

24
5205391H1
565
805

24
5498383H1
573
811

24
5498383F6
573
1055

24
g4152281
207
277

24
7290347H1
188
672

24
1265660F1
176
785

24
1265660H1
181
469

24
3944530H1
184
461

24
g677040
204
322

24
g1950097
237
294

24
6773005J1
33
637

24
6765966J1
33
606

24
6768978J1
33
631

24
g2003419
45
421

24
g1551472
61
213

24
6147606H1
71
625

24
g615579
115
462

24
g389770
122
510

24
6888770J1
153
753

24
g615989
174
503

24
4943311H1
175
458

24
4943311F6
175
595

24
6818987H1
197
267

24
1853628H1
181
421

24
1456735H1
208
332

24
5920291H1
208
267

24
7290834H1
187
505

24
6818987J1
33
250

24
6818431J1
33
570

24
g2003054
31
344

24
6770575J1
35
555

24
g1192915
25
170

24
g1978747
1
307

24
g5553287
1
315

24
6989857H1
1
436

24
6955370H1
22
540

24
g4390046
24
500

24
g4534562
24
504

25
7177245H2
1
455

25
g3015541
154
2103

25
g1864084
221
2759

25
g694473
448
790

25
g710265
448
736

25
g900615
470
914

25
g900616
469
798

25
4720263F6
580
1018

25
4720263H1
582
820

25
g6142053
718
1125

25
g3095833
754
886

25
7213511H1
762
1242

25
g705775
879
1219

25
g1275210
960
1173

25
6551517H1
1098
1692

25
096164H1
1151
1387

25
5451192H1
1222
1451

25
1308461F6
1230
1655

25
1308461H1
1230
1360

25
385195H1
1364
1640

25
3415579H1
1387
1650

25
g1191407
1788
1959

25
4765883H1
2166
2412

25
4760585H1
2225
2489

25
1308461T6
2272
2720

25
658904H1
2278
2532

25
g2987356
2301
2759

25
g2987355
2304
2759

25
4720263T6
2360
2746

25
1308461R1
2486
2759

25
g3887078
2491
2762

25
g824280
2507
2769

26
3315579H1
1
246

26
2564790H1
4
144

26
7037134H1
17
591

26
g2214897
120
460

26
70879775V1
123
576

26
70882313V1
123
561

26
70881021V1
123
654

26
70881583V1
123
700

26
3539234F6
123
536

26
3539234H1
123
348

26
g4332214
139
571

26
5204807H1
152
395

26
7066891H1
196
711

26
70882460V1
324
844

26
6559677H1
357
941

26
70881844V1
392
965

26
70879312V1
427
993

26
7239855H1
468
1020

26
g830101
474
849

26
g889334
474
843

26
6559338H1
490
770

26
6721187H1
534
1104

26
70882570V1
535
1028

26
5780844H1
542
821

26
70882690V1
558
1104

26
5780844F6
565
1096

26
2154958H1
565
667

26
70880555V1
597
1241

26
70888508V1
603
936

26
1394886F6
630
1075

26
1394886H1
630
888

26
1392996H1
630
891

26
671307H1
655
933

26
1270677H1
663
905

26
6560774H1
677
1208

26
70885252V1
693
934

26
7289657H1
729
1231

26
g2215028
736
1137

26
6945491H1
746
1269

26
70887853V1
770
894

26
70881667V1
773
1363

26
6986634H1
816
1297

26
1374120H1
825
961

26
70882560V1
830
1440

26
70880461V1
839
1433

26
4761241H1
884
1159

26
4761249H1
885
1169

26
g901677
927
1310

26
g946847
928
1263

26
g953373
928
1130

26
70818743V1
944
1123

26
70879516V1
955
1615

26
70882124V1
977
1488

26
70881307V1
1002
1476

26
70879227V1
1036
1255

26
3803043H1
1037
1326

26
3013311H1
1056
1341

26
6883273J1
1061
1663

26
3457862H1
1084
1327

26
g316332
1120
1339

26
70880271V1
1130
1719

26
70882630V1
1138
1274

26
1391847F6
1155
1647

26
1391847H1
1155
1407

26
5292536H2
1163
1394

26
70879978V1
1205
1732

26
2453848H1
1218
1444

26
1703631H1
1230
1354

26
70879064V1
1237
1843

26
70881312V1
1275
1788

26
5385719H1
1276
1432

26
4753468H1
1281
1550

26
1966807H1
1286
1555

26
70881555V1
1332
1998

26
70818654V1
1368
1926

26
1350180H1
1376
1646

26
70879359V1
1382
1871

26
6020187H1
1410
2009

26
70881816V1
1422
2015

26
3027682T6
1438
2026

26
1394886T6
1450
2027

26
2301449H1
1455
1541

26
70885937V1
1452
1711

26
1391847T6
1461
2030

26
3447875H2
1468
1723

26
4030281T8
1479
1804

26
70881238V1
1492
2020

26
70880651V1
1539
2110

26
4061612H1
1580
1860

26
g5863332
1584
2067

26
g5111312
1587
2067

26
2877413H1
1607
1908

26
2877413F6
1607
2002

26
g3281621
1609
2068

26
70818645V1
1622
2077

26
g4535191
1624
2068

26
g3426844
1626
2067

26
g2322267
1644
2068

26
g6196543
1654
1928

26
g3134994
1660
2074

26
g2874749
1663
2068

26
2877413T6
1681
2018

26
g830043
1717
2080

26
g946801
1740
2052

26
3539234T6
1764
2255

26
g889242
1768
2079

26
g3178069
1789
2068

26
4000739H1
1795
2068

26
g1372960
1812
4328

26
g3094856
1852
2068

26
g5528202
1869
2072

26
70887416V1
1885
2293

26
g2875209
1886
2068

26
70879855V1
1958
2305

26
70882152V1
2018
2288

26
6554433H1
2886
3287

26
g5863770
4005
4350

27
5911592T6
1
523

27
5911592H1
1
290

27
5911592T8
1
473

27
5911592F8
1
569

27
5911592T9
1
473

27
5911592F6
1
565

28
g1187505
3265
3546

28
g1128275
3293
3495

28
g1507227
3296
3546

28
g899953
3306
3566

28
g1080424
3307
3542

28
962712H1
3307
3546

28
1923976H1
3314
3512

28
g2159328
3320
3551

28
g735553
3320
3545

28
g5913481
3323
3554

28
g3896209
3322
3546

28
g795225
3331
3556

28
g2185988
2435
2887

28
4716403H1
2441
2550

28
112524H1
2441
2661

28
g6142912
2452
3005

28
4582601H1
2503
2780

28
4733207H1
2515
2810

28
g1320604
2527
3046

28
3254646H1
2529
2781

28
2273834H1
2542
2797

28
2688820H1
2567
2829

28
3449902H1
2576
2832

28
g1406097
2583
3005

28
g1406068
2588
3005

28
g2703843
2588
3002

28
g1156665
2602
2792

28
852284H1
2611
2841

28
852284R6
2613
2844

28
3477842H1
2612
2706

28
g2714143
2634
3005

28
2362491H1
2657
2912

28
g1635193
2665
2792

28
552048H1
2670
2921

28
5912223H1
2682
2748

28
g3412761
2692
3005

28
3492839H1
2695
2980

28
g1507002
2710
2916

28
5041915H1
2710
2899

28
643875H1
2715
2976

28
2531919H1
2731
2885

28
g6138438
2732
3005

28
4623249H1
2732
3002

28
2890187H1
2734
2998

28
g1670564
2741
3248

28
1850848H1
2754
3062

28
g3659213
2760
3290

28
956983H1
2762
3049

28
019839H1
2786
3082

28
3813377H1
2823
3095

28
131061H1
2831
2930

28
7054832H1
2837
3406

28
804820H1
2856
3090

28
1842462H1
2878
3146

28
4792127H1
2882
3145

28
1494563H1
2882
3121

28
1753953H1
2883
3125

28
1755130H1
2883
3092

28
3941233H1
2902
3198

28
2116653H1
2902
3193

28
2404516H1
2914
3172

28
4524703H1
2917
3027

28
g1617791
2942
3256

28
4407776H1
2934
3211

28
5186425H1
2942
3195

28
2904404H1
2942
3200

28
3144463H1
2943
3262

28
2359103T6
2953
3498

28
4652661H1
2961
3062

28
2955930H1
2977
3261

28
3115379T6
2982
3507

28
852284T6
2987
3507

28
1661229T6
2988
3505

28
3822074H1
2994
3275

28
4229083H1
2994
3263

28
3842223H1
2994
3234

28
3607528H1
2996
3166

28
g1080514
2999
3320

28
1661229F6
3011
3447

28
1661225H1
3011
3202

28
008660H1
3047
3339

28
2321285H1
3047
3289

28
g2106118
3064
3549

28
868783H1
3065
3326

28
g5176750
3073
3550

28
g2899654
3073
3546

28
g4762266
3073
3549

28
6307419H1
3080
3547

28
g4269311
3078
3549

28
g4075892
3078
3546

28
g3740929
3094
3555

28
g3897396
3097
3546

28
612568H1
3098
3355

28
g3278888
3101
3551

28
g2899655
3101
3544

28
g3744156
3103
3546

28
g2185814
3109
3552

28
6715165H1
3111
3548

28
4864862H1
3117
3405

28
1968272R6
3132
3548

28
1968272T6
3132
3501

28
1968272H1
3132
3401

28
1492449H1
3133
3347

28
g4648047
3136
3547

28
g4438953
3138
3539

28
g2751861
3143
3349

28
g572806
3150
3528

28
g672266
3150
3466

28
g879603
3150
3402

28
g876360
3151
3531

28
g830456
3151
3412

28
321502H1
3151
3397

28
337082H1
3151
3381

28
g4891955
3153
3546

28
g5658866
3163
3547

28
3023052H1
3163
3443

28
g3884073
3170
3546

28
g5325327
3330
3546

28
g1140821
3332
3546

28
2893166T6
3341
3509

28
g2204552
3349
3551

28
g1670543
3357
3546

28
g1190688
3385
3493

28
2552971H1
3401
3550

28
5907555H1
3487
3644

28
3256027H1
3561
3626

28
3256027R6
3561
3626

28
g1959467
1
63

28
076140H1
1
230

28
3400145H1
42
272

28
7166689H1
77
373

28
5513977H1
89
336

28
4970421H1
89
348

28
g6300096
153
586

28
5335382H1
256
490

28
5335373H1
257
488

28
1437260F1
264
814

28
1437260F6
264
658

28
1437260H1
264
533

28
5373320H1
290
505

28
6485087H1
404
923

28
4181761H1
414
498

28
5026859H1
610
693

28
3230444H1
616
763

28
2134545F6
767
1341

28
2134545H1
767
1022

28
265345H1
787
970

28
1437260T6
791
1270

28
3792193H1
878
1098

28
7260531H1
921
1369

28
6986910H1
986
1376

28
4447338H1
1008
1169

28
6494154R9
1031
1550

28
4832434H1
1037
1301

28
2633783H1
1037
1287

28
g1984595
1056
1311

28
2359103R6
1060
1504

28
2359103H1
1060
1314

28
5215646H1
1093
1294

28
425878H1
1096
1306

28
288744H1
1164
1454

28
6531566H1
1238
1809

28
7191895H2
1327
1801

28
288744F1
1349
1793

28
g6140330
1356
1781

28
g6505751
1406
1704

28
7029795H1
1414
2023

28
5641161H1
1506
1745

28
4061776T6
1508
1704

28
4061776F6
1515
1875

28
4061776H1
1516
1704

28
g2106291
1517
1824

28
g1880733
1522
1738

28
g1441510
1522
1904

28
767028H1
1524
1704

28
4177249H1
1546
1816

28
g823676
1505
1807

28
g3230537
1592
2020

28
3115379H1
1620
1700

28
g3840134
1582
1751

28
109465H1
1628
1784

28
951131H1
1599
1811

28
2431313H1
1621
1683

28
2134834H1
1679
1912

28
3811087H1
1700
1965

28
3661827H1
1726
1863

28
3729456T6
1688
1751

28
g3755762
1742
1806

28
2292441H1
1742
1982

28
2293368H1
1745
1970

28
g1939049
1757
2016

28
717351H1
1759
1999

28
g827645
1759
1975

28
5845309H1
1816
1911

28
3806331F6
1820
1915

28
6736585H1
1754
1823

28
487499H1
1809
2069

28
5914004H1
1846
2125

28
6408595H1
1852
2414

28
g1523070
1921
2355

28
g900055
1922
2243

28
5019562H1
1931
2111

28
g2103229
1933
2320

28
g2204602
1939
2229

28
2501393H1
1944
2111

28
g1281535
1964
2431

28
g735660
1994
2170

28
2813574H1
2020
2303

28
2170420H1
2030
2277

28
3718831H1
2031
2320

28
4062530H1
2048
2342

28
g1190010
2075
2225

28
4151403H1
2147
2211

28
962698R2
2147
2672

28
g6301662
2147
2523

28
3716245H1
2147
2399

28
3090607H1
2147
2385

28
962698H1
2147
2367

28
2858893H1
2147
2351

28
5586368H1
2147
2348

28
2571180H1
2147
2332

28
4333921H1
2147
2350

28
6219737H1
2147
2352

28
6400836H1
2147
2227

28
g1196242
2168
2576

28
g1190446
2168
2444

28
g1832964
2172
2494

28
675502H1
2177
2446

28
3903169H1
2207
2492

28
3245445H1
2240
2454

28
g827828
2241
2461

28
4833872H1
2258
2461

28
g1273258
2260
2749

28
4833888H1
2262
2538

28
g1799398
2268
2712

28
g1406166
2268
2643

28
g1406194
2269
2631

28
5185315H1
2285
2542

28
2082955H1
2295
2598

28
6341726H1
2316
2810

28
594752H1
2355
2602

28
g942919
2366
2583

28
7249143H1
2381
2613

28
g1921577
2394
2864

28
2896518H1
2411
2658

28
g1987258
2429
2848

28
g2161140
2435
2928

28
g3430807
3172
3546

28
6737055H1
3179
3546

28
2118476H1
3179
3436

28
5511767H1
3182
3389

28
2782179F6
3201
3588

28
2782195H1
3201
3468

28
3526177H1
3202
3479

28
g4990081
3213
3546

28
3734501H1
3227
3528

28
g3043004
3236
3546

28
g1200843
3238
3546

28
g1243436
3243
3545

28
896988R1
3244
3546

28
896988H1
3245
3472

28
g4330537
3255
3553

28
g883772
3264
3559

29
2837088H1
1
79

29
382301H1
11
278

29
382301R6
11
248

29
381716R1
11
488

29
6853095H1
18
566

29
3296833H1
24
294

29
492559R1
36
582

29
492554H1
36
280

29
6710369H1
84
612

29
g770845
381
657

29
6710369J1
556
1057

29
6866894H1
767
1363

29
2045879F6
814
1144

29
2045879H1
814
1085

29
g677645
874
1174

29
g570913
874
1259

29
g878213
875
1218

29
3637810H1
925
1212

29
3637810F8
926
1371

29
5516287H1
958
1216

29
310657H1
1003
1205

29
054856H1
1048
1292

29
2676843H1
1123
1318

29
2865460H1
1206
1437

29
5983503F8
1245
1610

29
5983503H1
1247
1545

29
6540006H1
1281
1578

29
3903656H1
1312
1525

29
2554026H1
1346
1615

29
g1894266
1350
1824

29
7039759H1
1414
1941

29
6481201H1
1452
1566

29
6929893H1
1484
1917

29
160750H1
1667
1758

29
6201684H1
1683
2203

29
2684917H1
1733
1978

29
3898190H1
1945
2241

29
5983503T8
1966
2626

29
5952437H1
1989
2278

29
3637810T9
2048
2597

29
3151953H1
2057
2297

29
6357422H1
2085
2377

29
382301T6
2092
2657

29
2498615F6
2107
2537

29
2498615H1
2107
2341

29
492559F1
2134
2696

29
381716F1
2136
2696

29
4701147H1
2164
2436

29
g5435909
2244
2701

29
7067611H1
2285
2803

29
g2563607
2313
2696

29
1889064H1
2331
2615

29
5762206H1
2333
2712

29
2400488H1
2334
2587

29
g817549
2339
2706

29
g566965
2376
2696

29
g1894154
2387
2696

29
g869609
2428
2705

29
g4291206
2430
2805

29
g646309
2432
2696

29
7214349H1
2497
2879

29
3249908H1
2502
2799

29
672907H1
2553
2696

29
672763R6
2553
2696

29
672763H1
2553
2696

29
672696H1
2553
2696

29
672763T6
2553
2659

30
6572615H1
1
572

31
6991082H1
1
215

31
g4195018
4
167

31
g5444909
10
139

31
g5765521
10
480

31
g4736683
10
469

31
g5110384
10
474

31
g5744052
26
461

31
7181281H1
31
570

31
3801178H1
71
269

31
6606927H1
91
475

31
5725556H1
402
875

31
6459774H1
790
1082

32
g3744008
2026
2487

32
g3843455
2032
2490

32
g4334045
2035
2487

32
1295257F1
1686
2102

32
1295579H1
1686
1944

32
1295615H1
1686
1932

32
1295257H1
1686
1914

32
g1382787
1690
2060

32
3009590H1
1709
2019

32
g1327091
1710
2099

32
1496765H1
1766
2002

32
4604681H1
1772
2045

32
1596414H1
1772
1993

32
6413696H1
1785
2102

32
4534504H1
1813
2098

32
71227864V1
1847
2362

32
2210129H1
1863
2101

32
1447743H1
1866
2103

32
70861405V1
1894
2228

32
70861649V1
1895
2495

32
6846658H1
1908
2107

32
4534504T1
1907
2456

32
4198839H1
1920
2101

32
1738412T6
1927
2437

32
1737079H1
1932
2060

32
1738412H1
1932
2053

32
g776871
1597
1846

32
2477944H1
1596
1816

32
4250426H1
1611
1861

32
2920084H1
1623
1883

32
70862374V1
1651
2227

32
3602331H1
1634
1931

32
6868176H1
1636
2103

32
4675720H1
1639
1854

32
1561242F6
1658
2077

32
1561242H1
1658
1879

32
g1501696
1667
1973

32
g760301
1677
1915

32
g3278095
2137
2493

32
5900945H1
2134
2423

32
g6138412
2137
2496

32
g4330820
2257
2483

32
g1988368
2268
2493

32
g3843397
2293
2490

32
g3920269
2298
2486

32
4069039H1
2330
2505

32
g6475333
2337
2487

32
312604H1
2371
2483

32
313091H1
2371
2483

32
313091R6
2371
2483

32
311262H1
2371
2483

32
313091T6
2371
2444

32
g794966
2420
2488

32
5585271H1
2056
2170

32
g2657562
2083
2489

32
g5631144
2082
2483

32
70861820V1
2094
2484

32
g4534051
2102
2483

32
g653111
2102
2485

32
g2741121
2113
2483

32
g3900137
2112
2489

32
g4987139
2120
2488

32
g1327037
2121
2495

32
g3750723
2123
2491

32
1712684T6
2129
2444

32
5900418H1
2135
2462

32
5900174H1
2134
2421

32
6811079J1
1
540

32
60205155U1
12
248

32
6886573J1
39
560

32
6886573H1
111
596

32
6811079H1
185
755

32
1453667F1
262
721

32
1453667H1
262
526

32
1453667F6
262
546

32
70818382V1
262
390

32
3747731H1
327
524

32
973584H1
340
620

32
4043303H1
376
512

32
857173H1
550
783

32
6258691H1
598
695

32
3408105H1
614
890

32
6606911H1
661
1207

32
4579377H1
669
938

32
3232119H1
686
966

32
4142126H1
717
926

32
g3405461
764
1127

32
70818359V1
915
1488

32
70793876V1
950
1625

32
71228166V1
983
1533

32
3809253H1
1007
1304

32
1617271H1
1066
1279

32
2863928H1
1081
1360

32
3234412H1
1087
1342

32
70861726V1
1187
1695

32
6999153H1
1207
1857

32
71228213V1
1213
1797

32
754707H1
1226
1478

32
70860887V1
1228
1790

32
71228275V1
1235
1732

32
70861627V1
1248
1846

32
3807022H1
1269
1442

32
2950342H1
1277
1544

32
2952767H1
1277
1536

32
71227990V1
1298
1936

32
71228136V1
1303
1784

32
71227553V1
1310
1757

32
70861671V1
1323
1920

32
70794764V1
1336
1702

32
3140045H1
1338
1625

32
70864551V1
1353
1859

32
6210975H1
1357
1668

32
g653325
1358
1597

32
70862132V1
1378
2033

32
4701559H1
1384
1655

32
7159432H1
1388
1905

32
2109285H1
1398
1660

32
70864775V1
1403
2064

32
70863822V1
1406
2038

32
7343876H1
1408
2057

32
1679948H1
1413
1645

32
6210776H1
1438
1754

32
3866536H1
1442
1582

32
1712684F6
1443
1998

32
1712684H1
1443
1662

32
g758871
1444
1620

32
4426067H1
1466
1711

32
70795476V1
1472
1640

32
5599333H1
1493
1727

32
70797042V1
1502
1640

32
6835201H1
1538
2080

32
70863377V1
1540
1989

32
6844445H1
1560
2067

32
5155068H1
1560
1818

32
g852973
1573
1906

32
g851729
1573
1861

32
g793415
1573
1781

32
6124452H1
1584
2062

32
g788826
1597
1904

32
2130055H1
2435
2493

32
4238420H1
1936
2082

32
g2138791
1962
2385

32
4351833H1
1979
2053

32
71225822V1
1984
2102

32
71225814V1
1981
2104

32
g4390230
2003
2493

32
g4738336
2009
2484

32
g4902383
2012
2483

32
71228259V1
2018
2229

32
g4436056
2019
2491

32
71227844V1
2018
2304

32
g6037828
2021
2487

32
g3740552
2022
2489

32
g3418190
2137
2493

32
g3213525
2137
2487

32
1561242T6
2136
2435

32
g2139296
2137
2481

32
g1382788
2139
2484

32
1453667T6
2144
2442

32
g1501595
2147
2497

32
4401648H1
2175
2229

32
g760248
2190
2477

32
g3249913
2212
2489

32
g852879
2240
2477

32
g4509561
2255
2483

32
6532986H1
2257
2483

33
g779790
1220
1417

33
6117455H1
1343
1638

33
4733091H1
1405
1663

33
2614356H1
1420
1671

33
2614355H1
1420
1569

33
1340369F6
1474
1756

33
1340369H1
1474
1661

33
70920240V1
1488
2070

33
757294H1
1551
1778

33
2658667H1
1624
1866

33
2771444H1
1749
1989

33
1312886F6
1751
2202

33
1312886H1
1751
1949

33
2308711H1
1755
1965

33
3519383H1
1755
1939

33
2306567H1
1756
1936

33
1304465H1
1765
2003

33
5172484H1
1779
2028

33
4172237H1
1810
2077

33
2877775H1
1839
2116

33
869079H1
1839
2071

33
3939024H1
1856
2135

33
71273416V1
1860
2454

33
1420994H1
1918
2156

33
70917213V1
1926
2485

33
1420994F6
1937
2433

33
2661285H1
1939
2207

33
1690542H1
1958
2166

33
4044243H1
1965
2248

33
g841565
1971
2225

33
4633881H1
2015
2270

33
587465H1
2060
2372

33
756115R1
2094
2667

33
756115H1
2094
2348

33
3465750H1
2098
2249

33
71274483V1
2113
2783

33
6609076T2
2142
2819

33
71272794V1
2155
2817

33
3927045H1
2179
2474

33
3928245H1
2179
2470

33
3674253T9
2226
2768

33
2658953H1
2242
2504

33
70920349V1
2261
2805

33
4735215H1
2262
2523

33
1294470T6
2271
2833

33
2791572T6
2319
2835

33
5058201H2
2320
2433

33
1420994T6
2346
2837

33
1312886T6
2355
2836

33
1430732H1
2353
2616

33
2791668T6
2357
2837

33
2791572F6
645
894

33
6828289J1
663
1310

33
70919806V1
671
1312

33
124724H1
738
882

33
g652789
805
1068

33
2251573H1
819
1077

33
71274255V1
948
1609

33
70920002V1
965
1599

33
70919147V1
975
1630

33
70920073V1
974
1610

33
70917224V1
1001
1557

33
g988490
1047
1351

33
71272983V1
1049
1459

33
71031330V1
1104
1535

33
4156408F6
1156
1557

33
4156408H1
1156
1423

33
71031387V1
1159
1604

33
5998189H1
1177
1292

33
71273906V1
1179
1753

33
2791668F6
1216
1550

33
2791668H1
1216
1544

33
6609076H2
1
541

33
2807474H1
7
182

33
6491123H1
19
165

33
6783159H1
27
590

33
g1727301
32
157

33
6828289H1
438
965

33
3674253H1
471
632

33
6953528H1
597
886

33
70917171V1
645
1168

33
2791572H1
646
934

33
756115F1
2364
2872

33
g5658477
2374
2795

33
g2324579
2375
2789

33
2748719H1
2415
2696

33
g4533354
2425
2876

33
g4564567
2440
2876

33
4829083H1
2441
2731

33
g5528721
2457
2877

33
g788300
2535
2872

33
g4283575
2524
2872

33
g4892982
2537
2872

33
g2410925
2550
2875

33
g652629
2559
2857

33
5316017H1
2581
2854

33
5316857H1
2585
2854

33
5318171H1
2597
2854

33
g2337727
2598
2873

33
756115T6
2617
2848

33
4735116H1
2631
2876

33
1365975R6
2632
2872

33
1365975H1
2632
2872

33
1365975T6
2633
2853

33
g1211220
2687
2875

33
2560064H1
2725
2872

33
g988325
2753
2845

34
3373528H1
609
720

34
g5754867
731
968

34
2045586H1
1036
1288

34
6799054H1
1
622

34
6452403H2
29
524

34
g1978677
101
420

34
6982612H1
143
724

34
3359232H1
147
369

34
6834663H1
387
1001

34
7001130H1
504
866

34
7318752H1
574
1174

35
1999073H1
4939
5184

35
g4330742
4944
5258

35
4934920H1
4945
5258

35
g4393289
4948
5263

35
1659543H1
4959
5214

35
g3118267
4973
5261

35
g5849381
4977
5259

35
g1218351
4988
5256

35
3130050H1
4980
5253

35
6342848H1
4981
5253

35
g866163
4979
5254

35
143138F1
4992
5258

35
g3755072
4993
5261

35
g880989
4994
5263

35
g877984
5006
5255

35
1749391T6
4740
5217

35
1344542H1
4747
5062

35
g5176036
4752
5258

35
5595877H1
4753
4917

35
6505354H1
4757
5265

35
1880971T6
4758
5218

35
g5675620
4765
5258

35
g4372792
4767
5256

35
g4281732
4769
5257

35
g5810326
4772
5259

35
g4999023
4773
5253

35
5097726H1
4779
5029

35
5685655H1
4778
5025

35
g3086706
4784
5259

35
g3752346
4790
5264

35
2183473H1
4792
5046

35
g3016110
4805
5260

35
6751216H1
4811
5148

35
5325018H1
4813
5082

35
5321404T9
4813
5124

35
5323707H1
4813
5089

35
5321503H1
4813
5077

35
g5921006
4814
5258

35
5477528H1
4813
5119

35
5482768H1
4813
5046

35
5475712H1
4813
5014

35
5323312H1
4813
5048

35
g5511339
4816
5258

35
g6036549
4817
5262

35
6337194H1
4817
4949

35
g6399777
4829
5263

35
g6117467
4829
5264

35
g4435700
4839
5258

35
g5636554
4842
5258

35
g3594269
4843
5258

35
g4073072
4859
5258

35
g2458074
4844
5260

35
g4533318
4845
5258

35
g2987667
4847
5212

35
1924391R6
4847
5258

35
1924391T6
4847
5218

35
1924391H1
4847
5074

35
g2555756
4854
5257

35
g2054443
4858
5258

35
5771260H1
4874
5258

35
2246911H1
4872
5159

35
g1267895
4883
5266

35
1339830H1
4883
5135

35
g5590233
4886
5257

35
g4303732
4888
5259

35
g2054335
4892
5260

35
6722884H1
4893
5253

35
g1471105
4897
5262

35
g2963543
4895
5261

35
g4900893
4896
5263

35
g775422
4901
5265

35
g5362828
4902
5258

35
g5396797
4907
5264

35
3164806H1
4904
5221

35
g5768150
4907
5251

35
2252371H1
4909
5155

35
g847184
4909
5228

35
2198423T6
4911
5218

35
7063034H1
4916
5253

35
5485489H1
4916
5210

35
1690630H1
4920
5157

35
g5888136
4922
5258

35
723564H1
4923
5070

35
723580H1
4923
5158

35
g1860289
4937
5258

35
1568070H1
4938
5172

35
2775811H1
4069
4341

35
2836761H1
4073
4337

35
g2070265
4078
4492

35
6812440H1
4090
4428

35
6812440J1
4090
4428

35
3151404H1
4109
4352

35
6033478H1
4116
4485

35
g2878580
4117
4402

35
g3050962
4115
4372

35
6273920H2
4141
4414

35
1701815H1
4143
4330

35
6426867H1
4156
4711

35
6427663H1
4178
4711

35
3368975H1
4198
4330

35
g4125826
4225
4670

35
1531459H1
4225
4418

35
2966424H1
4227
4330

35
2684363H1
4237
4393

35
2116137H1
4273
4382

35
669344H1
4290
4560

35
2672272H1
4314
4418

35
1453860H1
4326
4539

35
1453827H1
4326
4491

35
6179108H1
4330
4609

35
g434467
4329
4560

35
1749391F6
4332
4392

35
1749391H1
4332
4386

35
701985H1
4412
4611

35
4407419H1
4419
4685

35
4708563H1
4446
4698

35
6852905H1
4459
5027

35
6264623H1
4490
5031

35
3640801H1
4504
4758

35
2744645H1
4504
4757

35
1879458H1
4505
4778

35
7287970H1
4530
5048

35
6333393H1
4550
5092

35
144995H1
4591
4772

35
3147774H1
4595
4831

35
6329285H1
4599
5271

35
661058H1
4600
4880

35
1834059R6
4601
5054

35
1834059H1
4601
4873

35
6158436H1
4618
4903

35
1622370H1
4620
4876

35
g1423847
4624
4905

35
4576478H1
4629
4893

35
600650H1
4631
4922

35
3316972H1
4638
4904

35
6954952H1
4640
5237

35
2759067H1
4643
4939

35
555514H1
4650
4902

35
5334364H1
4650
4864

35
5334363H1
4650
4806

35
g1367753
4648
5254

35
3526337H1
4662
4986

35
4864025H1
4665
4953

35
3803045H1
4668
4966

35
4002622H1
4679
4784

35
836008H1
4687
4806

35
2957630H1
4690
4989

35
2954183H1
4690
4974

35
6202637H1
4712
5026

35
6202437H1
4710
5128

35
2264722H1
4710
4941

35
2264938H1
4710
4910

35
g3675124
4711
5225

35
6862550H1
4721
5249

35
4941757H1
4711
5007

35
1478716H1
4711
4940

35
1476588H1
4711
4915

35
1476596H1
4711
4914

35
143138H1
4717
4918

35
145092H1
4717
4897

35
g395766
4724
5078

35
1834059T6
4728
5218

35
6393179H1
4736
5021

35
6386330H1
4737
5011

35
g866953
5008
5258

35
g867451
5014
5259

35
3865585H1
5017
5263

35
g2263181
5033
5257

35
g1741383
5041
5258

35
g3889402
5037
5258

35
g5444119
5046
5266

35
2117462H1
5071
5195

35
917065H1
5073
5258

35
g5637280
5073
5257

35
917065T1
5073
5239

35
g2464570
5078
5258

35
g2016352
5088
5258

35
5022709H1
5115
5268

35
2040433H1
5115
5221

35
4018392H1
5123
5241

35
g2079096
5140
5258

35
1453775H1
5143
5258

35
6536539H1
5171
5253

35
504486H1
5177
5246

35
g5554333
1
198

35
7030014H1
75
512

35
6984009H1
91
612

35
g2224552
197
5260

35
7092379H1
285
473

35
7193755H2
513
1006

35
6776509H1
515
1049

35
660357H1
525
791

35
661029H1
525
797

35
6990425H1
538
887

35
5623310H1
656
986

35
6939255H1
673
1165

35
6776509J1
970
1578

35
5629345H1
1070
1249

35
6348743H1
1585
1860

35
6774260J1
1597
2124

35
6765277H1
1861
2427

35
6774260H1
1904
2321

35
6516341H1
2086
2424

35
7012981H1
2178
2351

35
7075422H1
2231
2823

35
7185631H1
2343
2765

35
3101228H1
2529
2835

35
6036945H1
2608
3124

35
6637659H1
2635
3204

35
7331036H1
2646
3182

35
6637659J1
2647
3193

35
7180283H1
2692
3235

35
2708492H1
2897
2999

35
6463093H1
2925
3110

35
7091379H1
2969
3492

35
g1741484
3051
3230

35
3284115H1
3094
3353

35
1517309H1
3246
3455

35
6952950H1
3295
3883

35
3216127H1
3291
3579

35
7174368H1
3332
3903

35
3402651H1
3332
3589

35
7259765H1
3388
4023

35
6604779H1
3511
3997

35
1593761H1
3512
3747

35
7107055H1
3521
3579

35
7199042H1
3532
4116

35
6988147H1
3534
3899

35
6806336J1
3535
4013

35
6806336H1
3536
3983

35
7032229H1
3569
4118

35
3120776H1
3582
3716

35
3745702H1
3587
3892

35
3745703H1
3589
3889

35
7323378H1
3724
4337

35
7032660H1
3722
4284

35
3532688H1
3747
3964

35
6534296H1
3784
4031

35
1661311H1
3802
3897

35
2198423H1
3826
3970

35
1880971F6
3829
4311

35
1880971H1
3829
4098

35
1555666H1
3881
4099

35
1517127H1
3898
4106

35
3170592H1
3940
4237

35
6808106H1
3949
4234

35
6808106J1
3950
4234

35
7185914H1
3966
4388

35
6943659H1
3983
4468

35
g766595
3993
4326

36
4274433H1
3948
4086

36
7289132H1
2748
3156

36
3739607H1
2770
2954

36
2149153T6
2515
3015

36
g1880151
2565
2784

36
2148724T6
2583
3030

36
g5449141
2616
3056

36
1845983T6
2617
3015

36
2658150H1
2654
2950

36
g3181486
2726
3061

36
589633R6
2736
3083

36
589633T6
2736
3029

36
g3797974
2747
3063

36
6883937H1
2092
2600

36
6979204H1
2099
2630

36
g5768436
2174
2636

36
5589055H1
2255
2525

36
5589206H1
2255
2510

36
1845983R6
2276
2760

36
1845983H1
2276
2541

36
g846523
2282
2754

36
5120292T6
2319
2628

36
5771030H1
2355
2872

36
819494H1
2363
2622

36
2149153F6
2494
2777

36
2149153H1
2494
2762

36
2593534T6
5664
6026

36
2593534F6
5671
6070

36
2593534H1
5671
5908

36
g2541279
5708
6071

36
g1751107
5760
6066

36
g778115
5856
6056

36
g2876940
6002
6062

36
3219151H1
5058
5386

36
3203918T6
5064
5609

36
3739027H1
5140
5358

36
2645933H1
5149
5412

36
g3778574
5152
5629

36
g4244154
5153
5624

36
g4311781
5164
5626

36
g4175659
5175
5634

36
3620939H1
5187
5481

36
5113889H1
5186
5447

36
2656336T6
5202
5577

36
5700054H1
5204
5442

36
5700086H1
5204
5267

36
g1751351
5217
5521

36
1679842T6
5226
5584

36
1679842F6
5233
5624

36
1679842H1
5233
5434

36
g2659077
5240
5584

36
g5813116
5263
5626

36
g2659410
5288
5628

36
g4148675
5303
5627

36
g2051261
5311
5630

36
1234495H1
5320
5628

36
2188493H1
5320
5600

36
2683448T6
5334
5590

36
g840575
5338
5626

36
7245834H1
3231
3438

36
824598R6
3289
3534

36
891226H1
3289
3534

36
824598H1
3289
3534

36
824598T1
3289
3494

36
824598T6
3289
3492

36
g2047298
3323
3838

36
g2047291
3323
3820

36
7247410H1
3362
3587

36
3203918F6
3488
3984

36
3203918H1
3489
3685

36
6172362H1
3583
3870

36
5044786H1
3736
4008

36
70046502V1
3863
4274

36
70047585V1
3863
4328

36
1304976F6
3863
4282

36
1304976H1
3863
4108

36
70047549V1
3863
4010

36
826082R1
3920
4502

36
826082H1
3920
4203

36
2308804H1
2791
3054

36
g846473
2794
3065

36
g1218558
2851
3063

36
7291393H1
2960
3486

36
6524466H1
3005
3410

36
6524566H1
3005
3543

36
1599523F6
3076
3438

36
1599523H1
3076
3277

36
g1165330
3140
3528

36
7247361H1
3207
3719

36
g1983706
3207
3474

36
3070168H1
2500
2795

36
5519150H1
4199
4369

36
2717228H1
4200
4443

36
g839478
4978
5251

36
6217349H1
4984
5467

36
2970290H1
5053
5365

36
g1982712
4550
4796

36
613186H1
4558
4795

36
3724286H1
4560
4854

36
4365389H1
4562
4823

36
4754909H1
4583
4854

36
4354479H1
4604
4869

36
3330536H1
4650
4926

36
5581641H1
4650
4911

36
3528092H1
4659
4951

36
2750671H1
4685
4954

36
2668782H1
4691
4881

36
6372588H1
4723
4978

36
g778190
4793
5063

36
1917315H1
4825
5119

36
3621450H1
4843
5024

36
4783325H1
4844
5101

36
2656336F6
4877
5465

36
2656336H1
4877
5104

36
7336890H1
4913
5506

36
5920831H1
4918
5225

36
5096190H1
4963
5229

36
1928876H1
4970
5242

36
6217557H1
4978
5466

36
5744848H1
4239
4494

36
4176436H1
4277
4534

36
6740355H1
4458
5003

36
3487520H1
4498
4794

36
3659439H1
4514
4777

36
4274741H1
3949
4251

36
4274803H1
3949
4119

36
463357H1
4010
4201

36
4314429H1
4057
4342

36
g920351
4116
4382

36
g1149210
4133
4231

36
3766255H1
4149
4322

36
2683448F6
4167
4553

36
2683448H1
4167
4417

36
1300835T7
4174
4404

36
1307359H1
4194
4444

36
2760124H1
1934
2221

36
g858075
1936
2226

36
2760124T6
1983
2605

36
2923468H1
5441
5721

36
6838005H1
5463
5612

36
2923469T6
5476
6028

36
6838105H1
5493
5624

36
g4333756
5545
5629

36
4502184H1
5550
5622

36
5305353H1
5567
5817

36
g3647442
5625
6070

36
2733278T6
5625
6026

36
2294001H1
5633
5891

36
3993959H2
5355
5579

36
3629589H1
5367
5668

36
g2051240
5401
5630

36
1599523T6
5433
5582

36
2923469F6
5441
5868

36
2733278H1
745
977

36
g2538994
879
1084

36
7270376H1
1062
1618

36
g4242829
1103
1541

36
2780338F6
1250
1717

36
2780338H1
1250
1499

36
6244653H1
1330
1838

36
6308158H1
1771
2315

36
g2106835
1893
2201

36
2760124R6
1934
2378

36
g6330616
228
5624

36
2733278F6
745
1284

36
3994147H1
5353
5628

36
6883937J1
1
549

37
70554791V1
269
836

37
70555906V1
482
1070

37
70557145V1
488
1152

37
70328701D1
115
602

37
70557446V1
1746
2364

37
70557024V1
1777
2435

37
70326732D1
1800
2134

37
70326508D1
1800
1870

37
71304277V1
1830
2463

37
71156493V1
1852
2469

37
71303442V1
1864
2504

37
5542815H1
1873
2025

37
71157532V1
1881
2356

37
70555668V1
1893
2524

37
70555958V1
1930
2595

37
70555146V1
1931
2563

37
71303538V1
1959
2455

37
71304228V1
1958
2586

37
6496937H1
1967
2501

37
305090R6
1971
2342

37
305090H1
1970
2306

37
4598818H1
1996
2251

37
6349213H2
2054
2378

37
70556404V1
1493
2023

37
3696047F6
1521
2066

37
3696047H1
1522
1818

37
71158742V1
1536
2128

37
71156538V1
1542
2034

37
70327564D1
1550
2005

37
4670450H1
1563
1762

37
71157870V1
1598
2195

37
70556820V1
1615
2235

37
6416418H1
1667
1887

37
6389818H1
1667
1987

37
4518860H1
1672
1933

37
70554892V1
1703
2343

37
70554965V1
1703
2332

37
6830659J1
1705
2343

37
3279857H1
1719
1993

37
71304118V1
1741
2354

37
71158362V1
1743
2480

37
71155779V1
2409
2987

37
4172634F6
2447
3014

37
4172634H1
2447
2722

37
4438947H1
2448
2716

37
71156387V1
2457
2883

37
71303533V1
2512
2939

37
7353820H1
2529
2887

37
4539057H1
2561
2815

37
2328218H1
2633
2899

37
71304436V1
2666
3213

37
71157628V1
2710
3265

37
5106567H1
2713
2961

37
4599088H1
2761
3020

37
1501621F6
2190
2690

37
1501621H1
2190
2378

37
70557357V1
2284
2914

37
71157279V1
2290
2770

37
6116935H1
2291
2555

37
70325710D1
2321
2741

37
70325612D1
2363
2756

37
70328746D1
2363
2721

37
71156954V1
2388
2865

37
761848H1
2387
2597

37
2528759H1
2396
2656

37
70555774V1
2404
3076

37
3222459H1
2408
2765

37
70555710V1
602
1210

37
70554866V1
605
1225

37
70327790D1
614
1116

37
70325412D1
620
997

37
70326955D1
620
1007

37
6828695H1
703
1285

37
2868052H1
708
843

37
70555300V1
723
1261

37
1582746H1
3153
3386

37
g5848554
3164
3419

37
2770719T6
3195
3431

37
6416515H1
3258
3419

37
g4739984
3348
3419

37
6785591H1
12
523

37
2925464F6
16
568

37
4179240H1
17
287

37
2925464H1
16
274

37
4179553F8
21
514

37
4179553H1
21
247

37
4874914H1
4
263

37
4179741H1
4
294

37
6075277H1
2826
3033

37
1426361F6
2857
3303

37
1426357H1
2857
3060

37
71131546V1
2866
3169

37
5536040H1
2910
3142

37
1501621T6
2953
3435

37
71158019V1
2958
3419

37
4050931H1
2977
3284

37
70326238D1
2988
3419

37
4179553T9
2999
3343

37
71156430V1
3001
3419

37
g4665411
3004
3419

37
4172634T6
3023
3429

37
g2099982
3028
3419

37
2770719H1
3054
3325

37
2770719F6
3054
3249

37
g2077519
3061
3419

37
g2099950
3063
3288

37
g5664324
3092
3419

37
g5452554
3115
3474

37
71158855V1
1155
1627

37
5811393H1
1155
1458

37
71157014V1
1155
1753

37
g5850365
1172
1534

37
g5865429
1177
1479

37
70446257V1
1237
1854

37
70446298V1
1236
1858

37
70326574D1
1292
1722

37
70555309V1
1308
1895

37
70555528V1
1315
1998

37
70556256V1
1368
2053

37
70556149V1
1371
1998

37
70555054V1
1382
1948

37
70555206V1
1385
1982

37
4441126H1
1384
1659

37
70557288V1
1422
2021

37
70560338V1
1426
2013

37
70326191D1
1440
1766

37
70327556D1
1458
2005

37
3699373H1
25
340

37
70327386D1
26
382

37
6784564H2
35
536

37
6786847H2
39
668

37
70554782V1
730
1378

37
70555359V1
732
1309

37
6830659H1
734
1265

37
70555879V1
743
1324

37
70556961V1
761
1427

37
70557092V1
784
1383

37
70554523V1
792
1538

37
70557219V1
804
1427

37
70555075V1
854
1389

37
70555282V1
856
1303

37
70554784V1
862
1429

37
6785373H1
889
1448

37
70556389V1
938
1426

37
70556118V1
963
1544

37
70557489V1
1005
1631

37
70554717V1
1009
1418

37
6784929H1
1068
1464

37
6828695J1
1071
1726

37
70556000V1
1081
1742

37
6934607H1
1085
1599

37
70449057V1
1109
1224

37
71303301V1
1146
1592

37
5811393F6
1155
1729

37
71156205V1
1155
1718

37
71156521V1
1155
1693

37
70554574V1
568
1182

37
70556236V1
564
1260

37
70554808V1
577
1186

37
6788638H1
13
474

37
6787884H1
1
326

37
71303881V1
1465
2036

37
6788583H1
1
581

37
6788770H1
510
1086

37
70554811V1
2066
2662

37
4515767H1
2069
2207

37
71303748V1
2138
2612

37
70328165D1
2151
2705

37
70326303D1
2151
2673

37
70326287D1
2151
2447

37
71155657V1
2163
2702

37
4179741T9
2811
3358

37
70556579V1
2797
3121

37
71303602V1
2803
3455

37
g2051100
2822
3123

38
60100196D1
1959
2231

38
1859554H1
2167
2443

38
1859570H1
2167
2444

38
3361850H1
2214
2460

38
5272051H1
2369
2567

38
5272051F9
2369
2887

38
5272051F8
2369
2912

38
5090972F6
2471
2993

38
5090972H1
2471
2747

38
4274991F6
2519
2898

38
4274991H1
2519
2780

38
2185660H1
2581
2841

38
5090972R6
2805
3071

38
g5802614
1
3437

38
60100191D1
1682
2005

38
g1373056
1770
2132

38
6489031H1
1908
2435

38
5272051T9
2893
3324

38
4274991T6
2954
3393

38
g4196744
2957
3437

38
60100196B1
2968
3406

38
60100198B1
3119
3474

38
60100190B1
3184
3401

38
g3418913
3219
3438

38
60100191B1
3333
3472

38
196837H1
3382
3511

39
6775050J1
717
1394

39
6775050H1
925
1555

39
7361157H1
1029
1613

39
579137H1
1293
1511

39
g6197626
1359
1828

39
7156184J2
747
1335

39
7277468H1
854
1192

39
g2986601
375
462

39
5844017H1
418
618

39
7324537H1
307
843

39
g1277998
1
466

39
804517H1
25
265

39
4918488H1
31
303

39
7156184H2
35
641

39
1703886F6
35
435

39
1703886H1
35
245

39
3809668H1
45
350

39
g5152120
74
458

39
4550249H1
1
264

39
g6142263
81
462

39
g2254363
214
462

39
1703886T6
232
484

39
2656212F6
290
462

40
5314759H1
182
438

40
6222064U1
497
1056

40
g3003145
668
944

40
3818881F6
1
468

40
70536625V1
1
563

40
3818881H1
1
280

40
3345551H1
83
362

40
5988985F9
102
643

40
5988985H1
102
378

40
6267489H1
104
741

40
4072614H1
112
399

40
7167692H1
120
649

41
g1545026
2331
2704

41
g1062645
2331
2693

41
g1064773
2331
2676

41
g1482703
2331
2498

41
6549638H1
2430
3013

41
70300848D1
2452
2708

41
70300835D1
2479
2708

41
415443H1
2572
2798

41
419855H1
2572
2791

41
416163H1
2572
2762

41
1739793H1
3085
3321

41
1739793T6
3100
3767

41
4422806H1
3205
3454

41
415443F1
3205
3806

41
70300638D1
3222
3594

41
70300351D1
3251
3666

41
1595527T6
3300
3770

41
1595527H1
3307
3511

41
415986F1
3324
3806

41
4879243H1
3381
3654

41
g6139643
3394
3806

41
g1482608
3395
3806

41
2287181H1
3404
3604

41
2287181R6
3404
3572

41
g1162076
3447
3742

41
g1527588
3504
3806

41
g1481970
3516
3806

41
5779072H1
3534
3787

41
70300150D1
3556
3802

41
g1062646
3606
3790

41
g1064735
3701
3781

41
g4112497
3100
3288

41
684750H1
3105
3340

41
2402302H1
3037
3261

41
1739793R6
3085
3458

41
2497235H1
1745
2055

41
7190840H1
2160
2660

41
3285638H1
2171
2415

41
3285638F6
2171
2570

41
70300497D1
1250
1823

41
3348848H1
1522
1695

41
60133508V1
1520
1825

41
60131087B1
2209
2545

41
70300222D1
2312
2702

41
g1482020
2331
2775

41
2897538H1
1
259

41
g5457042
169
2567

41
3901248T9
378
1003

41
3899909T8
440
979

41
70516717D1
1091
1389

41
70300884D1
1130
1406

41
415991H1
2572
2642

41
415443R1
2572
3083

41
6362320H1
2606
2807

41
2783446H2
2623
2867

41
4442155H1
2651
2857

41
1849376H1
2685
2967

41
3285638T6
2784
3315

41
70300827D1
2787
3376

41
g3447015
2817
3261

41
2879330H1
2863
3165

41
g4110893
2891
3343

41
g6037968
2932
3343

41
g3693629
2952
3343

41
4113890H1
2979
3246

41
70300837D1
1134
1556

41
70300823D1
1230
1552

41
60211594U1
1243
1746

42
70866933V1
5034
5705

42
5926529H1
5081
5401

42
g1751265
5091
5420

42
4767333H1
5123
5429

42
70812418V1
5132
5800

42
5833936H1
5148
5428

42
g3016077
5152
5415

42
g4149219
5242
5421

42
70814699V1
5281
5854

42
70868813V1
5288
5908

42
1373555H1
5301
5546

42
g4307618
5322
5811

42
70867023V1
5332
5966

42
70869633V1
5404
6021

42
g2409915
5411
5811

42
1433020H1
5460
5705

42
70867216V1
5558
6222

42
1267718H1
4756
5019

42
g318200
4774
5165

42
1464866H1
3731
3992

42
70870570V1
3750
4457

42
71230331V1
3765
4290

42
71222361V1
3780
3934

42
71190090V1
3794
4487

42
70837174V1
3808
4000

42
71216238V1
3533
4246

42
71189613V1
3573
4128

42
4147558H1
4630
4860

42
71191702V1
4644
5197

42
3769383H1
4647
4965

42
71131533V1
4665
5137

42
70816797V1
4715
5387

42
71188635V1
4724
5165

42
7051349H1
3739
4208

42
71189574V1
4075
4700

42
g612859
4079
4410

42
71189238V1
4084
4700

42
g570718
4094
4400

42
71188405V1
4111
4753

42
g2805702
4165
4597

42
g3694501
4167
4598

42
71189379V1
4173
4848

42
g6144708
4176
4598

42
g2323168
4177
4598

42
g819401
4184
4610

42
70868193V1
4190
4727

42
g766671
4190
4568

42
g1516806
4197
4665

42
g1525425
4197
4612

42
g830693
4218
4610

42
71188787V1
4238
4612

42
4785755H1
4253
4533

42
70866811V1
4297
4860

42
g1614228
4303
4568

42
g3229742
467
888

42
g5457022
725
3257

42
g5456921
725
6222

42
g4683485
1334
1781

42
g5765573
1334
1759

42
g3075910
1387
1688

42
7190218H2
2401
2913

42
71229788V1
2815
3413

42
5014904F6
2815
3221

42
5014904H1
2815
3090

42
71229920V1
2973
3658

42
71228807V1
3182
3779

42
6884462H1
3181
3686

42
70868094V1
3257
3948

42
70869027V1
3256
3892

42
g823731
3267
3515

42
7044511H1
3272
3873

42
5919091H1
3287
3555

42
71188683V1
3333
3897

42
71191815V1
3333
3961

42
71191533V1
3333
3856

42
71191734V1
3333
3854

42
1600316F6
3333
3729

42
1600316H1
3333
3435

42
70867333V1
3341
3911

42
g839823
3355
3689

42
g824451
3355
3650

42
71190867V1
3363
3882

42
70870265V1
3396
4051

42
2013807H1
3391
3501

42
70866888V1
3809
4505

42
3673862H1
5564
5859

42
2499983T6
5584
6176

42
g815044
4346
4627

42
70869526V1
4360
4860

42
2499983H1
4367
4635

42
70867729V1
4376
5138

42
5386383H1
4385
4647

42
70868265V1
4388
5095

42
6274578H1
4416
4860

42
71190615V1
4434
5068

42
70866931V1
4477
5082

42
71189990V1
4513
5134

42
71190387V1
4513
5133

42
g672203
4544
4860

42
1269521F6
4577
5030

42
1269521H1
4577
4812

42
71230051V1
4584
5171

42
g670126
4590
4860

42
71188785V1
4601
5195

42
1817860T6
4773
5373

42
71230388V1
3476
4062

42
6337414H1
4795
5436

42
71188365V1
4864
5408

42
71129972V1
4882
5273

42
g3887571
4884
5422

42
7052610H1
3740
3875

42
71230123V1
4787
5357

42
1600316T6
4788
5379

42
g2224630
1
6155

42
g2142053
464
854

42
g3842828
466
883

42
1311611F6
4886
5420

42
1311611T6
4886
5378

42
g575078
4886
5176

42
1311611H1
4886
5148

42
71188609V1
4890
5438

42
71229950V1
4890
5346

42
2293604H1
4890
5151

42
621828H1
4890
5148

42
2626661H1
4890
5070

42
1269521T6
4892
5380

42
6327560H1
4893
5348

42
g2539162
4894
5429

42
g4852194
4905
5421

42
g2932593
4922
5424

42
g3148673
4928
5422

42
7098720H1
4931
5587

42
g5707120
4951
5413

42
3975608H1
4953
5272

42
3975908H1
4954
5274

42
70814653V1
4965
5676

42
g4971769
4971
5424

42
71188351V1
3626
4086

42
70838919V1
3631
4136

42
71188254V1
3638
4239

42
71189595V1
3651
3907

42
70870573V1
3655
4351

42
70868067V1
3679
4334

42
70867164V1
3682
4354

42
71230406V1
3685
4227

42
70869964V1
3682
4340

42
1817860F6
3725
4287

42
1817860H1
3725
4029

42
7050051H1
3739
4283

42
70816308V1
4604
5347

42
70813062V1
4615
5238

42
7103719H1
4627
5050

42
g883091
4613
5038

42
1963922R6
4615
5216

42
70825247V1
4615
5083

42
70815988V1
4615
5030

42
70649447V1
4615
5280

42
70814603V1
4615
5185

42
70812386V1
4615
5163

42
70813116V1
4615
5137

42
70812591V1
4615
5112

42
1963922H1
4615
4860

42
70817149V1
4615
5238

42
71190973V1
3394
4015

42
70866857V1
3421
4053

42
70869712V1
3422
4110

42
71190024V1
3462
4134

42
71222510V1
3809
4002

42
71229550V1
3828
4582

42
7317184H2
3840
4515

42
71191575V1
3866
4388

42
71190157V1
3909
4588

42
71230422V1
3911
4602

42
70868868V1
3926
4435

42
71191209V1
3938
4502

42
71229173V1
3944
4466

42
71191826V1
3939
4349

42
71188071V1
3982
4494

42
71222526V1
3993
4351

42
70868437V1
4000
4529

42
70867683V1
4003
4658

42
71190956V1
4017
4607

42
70867083V1
4019
4527

42
70869984V1
4019
4488

42
g775853
4047
4392

42
71189002V1
4049
4491

42
70870114V1
4057
4751

42
1963922T6
5617
6180

42
745052H1
5643
5869

42
3333795T6
5681
6181

42
4421884H1
5703
5956

42
g4989315
5743
6225

42
g3446159
5744
6227

42
g5853840
5747
6219

42
2280040T6
5748
6175

42
g4264936
5749
6222

42
g5590548
5767
6219

42
2280040R6
5769
6222

42
2280040H1
5769
6044

42
g4114692
5775
6229

42
2157793H1
5776
6020

42
g4269881
5783
6222

42
g314938
5790
6222

42
5014904T6
5789
6175

42
g1516807
5846
6222

42
71190271V1
3599
4339

42
5515021R7
3622
4216

42
71229150V1
3622
4275

42
70867419V1
3623
4261

42
g671390
5960
6219

42
g820781
5971
6244

42
g668623
6031
6222

42
71221653V1
6103
6222

42
g882914
6021
6129

42
71188120V1
4750
4951

42
1267718F1
4756
5198

42
71190911V1
4733
5379

42
71188586V1
4756
5397

42
70869357V1
4982
5696

42
g3756453
4981
5424

42
4776237H1
4985
5261

42
71190506V1
5033
5514

42
6608393T1
5498
6138

42
5907377H1
5524
5800

42
70870592V1
5528
6173

42
70813957V1
5544
6036

42
3333795F6
5552
6027

42
3333795H1
5552
5840

42
71188885V1
4599
5206

42
g1525426
5842
6222

42
g882983
5853
6245

42
g797506
5865
6230

42
g587184
5880
6222

42
70870719V1
5924
6239

42
g814957
5894
6223

42
g822523
5964
6230

42
g612999
4719
5074

43
g2034169
2102
2394

43
5540505T7
2291
2870

43
6377332H1
2417
2702

43
4947810H1
2612
2733

43
g5006247
1
2762

43
5540505F6
953
1415

43
5540505H1
953
1146

43
g2875734
2835
2940

43
g3735348
2634
2945

43
5118201T6
2631
2910

43
2749265F6
2448
2923

43
2749265H1
2448
2714

43
2749265T6
2551
2897

43
537065H1
2429
2663

44
1452312F1
3288
3835

44
70007188D1
3260
3637

44
g898311
3282
3460

44
1452312F6
3288
3736

44
1452312H1
3288
3560

44
2599007H1
3312
3589

44
6325947H1
3442
3749

44
840648H1
3415
3672

44
70012088D1
3420
3797

44
5852153H1
3426
3701

44
70604010V1
1419
2043

44
6952285H1
1480
2049

44
4458494F6
1493
1942

44
70608095V1
1492
1936

44
4458494H1
1494
1730

44
7255931H2
1571
1752

44
6909665J1
1608
2154

44
6969377U1
1616
2026

44
2272356R6
1622
1941

44
2272356H1
1622
1890

44
70608114V1
1801
1904

44
6553230H1
1811
2165

44
6559394H1
1811
2428

44
3382113H1
1881
2090

44
70606021V1
1880
2259

44
70879980V1
2089
2579

44
2661806F6
2089
2531

44
2661806H1
2089
2361

44
70879113V1
2089
2545

44
g6476309
2149
2506

44
2627073H1
2160
2391

44
2627315H1
2160
2389

44
3901711H1
2247
2491

44
70887530V1
2263
2344

44
6969302U1
2280
2623

44
70881572V1
2297
2821

44
5763849H1
2351
2873

44
7256511H1
2398
2905

44
70882796V1
2405
3030

44
70886211V1
2434
2594

44
70882791V1
2477
2906

44
70882271V1
2478
2974

44
70881365V1
2478
2973

44
70003939D1
2481
2947

44
70012299D1
2481
2829

44
70004016D1
2481
3025

44
3572311F6
2487
3077

44
3572311H1
2487
2699

44
70005627D1
2487
2687

44
70010847D1
2517
2952

44
7336064H1
2527
2982

44
70880257V1
2544
3145

44
70011933D1
2553
3044

44
2272356T6
2566
3001

44
70888761V1
2568
2873

44
3011048H1
3342
3641

44
4562117H1
3350
3613

44
4563263H1
3352
3636

44
70603379V1
1131
1723

44
70603933V1
1153
1782

44
70607414V1
1277
1412

44
70607363V1
1042
1396

44
2414751H1
3218
3489

44
389997H1
3676
3915

44
6357624H1
3682
3922

44
g3961665
3684
3920

44
g6477150
3686
3925

44
1689958F6
3693
3923

44
1689958H1
3693
3907

44
1689958T6
3698
3880

44
1702166T6
3718
3866

44
3572311T6
3740
3872

44
g4649451
3791
3915

44
4099042H2
3816
3927

44
4099042F8
3816
4438

44
1243554H1
3816
3923

44
g4325490
3834
3915

44
2968601H1
3954
4247

44
g5810032
3494
3926

44
7255223H1
3518
3915

44
g2237335
3527
3920

44
2878117H1
3530
3815

44
g1400734
3536
3915

44
5104505H1
3540
3772

44
g4081742
3542
3923

44
1452312T6
3546
3876

44
g898312
3565
3918

44
6499719H1
3564
3909

44
g4081564
3565
3923

44
g2335900
3599
3920

44
g6451467
3602
3915

44
g1521304
3605
3931

44
g4534027
3606
3923

44
5790863H1
3609
3903

44
5789451H1
3609
3898

44
5787849H1
3609
3915

44
g5528373
3621
3920

44
g1516463
3624
3931

44
g5912966
3660
3920

44
344685H1
3673
3922

44
2623608H1
3367
3604

44
840648R1
3415
3915

44
4333836H1
3415
3703

44
70881547V1
3400
3921

44
70886619V1
3404
3634

44
2414749F6
3218
3747

44
70605048V1
1033
1331

44
7267489H1
1034
1578

44
6346421H1
3442
3736

44
6317150H1
3442
3746

44
4897563H1
3129
3422

44
5379052H1
3137
3362

44
3406784H1
3145
3410

44
70008878D1
3156
3637

44
70608052V1
1080
1187

44
g3888759
1108
1488

44
2857322H1
2904
3183

44
70881851V1
2904
3275

44
792748R1
2910
3533

44
792748H1
2909
3154

44
793130H1
2910
3134

44
7159471H1
2922
3506

44
70880131V1
2923
3534

44
1541872H1
2940
3161

44
684595H1
2941
3207

44
70886274V1
2982
3197

44
70886318V1
2982
3196

44
6722223H1
3013
3202

44
2806050H1
3019
3347

44
1702166F6
3044
3568

44
1702166H1
3044
3271

44
4980587H1
3057
3327

44
6909665H1
3076
3619

44
4372755H1
3078
3384

44
6074761H1
3079
3396

44
685902H1
2605
2829

44
70880726V1
2616
3181

44
2615527H1
2623
2881

44
70879436V1
2671
3129

44
70882269V1
2673
3180

44
70887568V1
2676
2818

44
70882659V1
2688
3179

44
1438876F1
2686
3071

44
1438880H1
2686
2970

44
1438876H1
2686
2968

44
2258046H1
2717
2963

44
70003496D1
2721
3284

44
70011398D1
2733
3192

44
70882502V1
2739
3418

44
70879669V1
2748
3253

44
70006402D1
2745
3309

44
70004115D1
2745
3108

44
70011055D1
2745
3198

44
70882244V1
2768
3039

44
70007592D1
2769
2981

44
6479471H1
2787
3356

44
7054594H1
2797
3403

44
70879623V1
2807
3487

44
5274874H1
2829
3072

44
70007727D1
2843
3340

44
70010542D1
2843
3307

44
70010162D1
2843
3246

44
70005864D1
2843
3198

44
70002001D1
2843
3074

44
70002333D1
2844
3415

44
70011761D1
2844
3198

44
70001785D1
2849
3344

44
70007867D1
2874
3336

44
70006872D1
2875
3344

44
70004362D1
2885
3284

44
70604116V1
1123
1734

44
2658395H1
3490
3738

44
70879732V1
3478
3911

44
g3429071
3484
3920

44
6317128H1
3442
3575

44
70879089V1
3455
3925

44
2661806T6
3469
3883

44
700495H1
3477
3740

44
70608699V1
853
1342

44
70653541V1
904
1439

44
70607650V1
918
1337

44
6938224H1
924
1338

44
70608866V1
964
1616

44
3776430H1
3217
3522

44
709518H1
3215
3449

44
70888779V1
3218
3398

44
872814H1
3082
3286

44
5438843H1
3097
3403

44
70003362D1
3164
3424

44
70004958D1
3165
3415

44
2527855H1
3178
3528

44
g1521303
3198
3655

44
g1517127
3198
3698

44
2414483H1
3218
3454

44
70010299D1
3248
3632

44
70005831D1
3338
3877

44
70003405D1
3101
3415

44
70007838D1
3099
3382

44
4880465H1
3100
3351

44
70012577D1
3107
3637

44
1320150H1
3127
3364

44
70008556D1
3132
3440

44
4181419H1
1
167

44
6779195J1
66
705

44
113399R6
430
794

44
4507995F6
435
610

44
4507995H1
436
607

44
6831490H1
443
635

44
6831490J1
443
635

44
70604944V1
690
1146

44
70607511V1
785
1414

44
6454789H1
1287
1795

44
70603538V1
1322
1922

44
684735H1
1352
1601

44
70607606V1
1355
1770

44
70603837V1
1402
1982

44
70006129D1
3099
3637

45
3386984H1
1
235

45
3087717H1
1
207

45
4832592H1
11
232

45
3750644H1
15
214

45
3350574H1
18
296

45
3150464H1
24
307

45
3381160H1
29
281

45
3092918H1
38
363

45
3092958H1
38
329

45
1524230H1
43
257

45
3384786H1
92
329

45
6055559H1
174
688

45
6055841H1
174
688

45
4509676H1
259
437

45
3081417H1
405
589

45
2952165H1
422
670

45
70874349V1
542
987

TABLE 5

SEQ ID

NO:
Template ID
Tissue Distribution

1
LG:977683.1:2000FEB18
Nervous System - 21%, Skin - 19%, Embryonic Structures - 11%

2
LG:893050.1:2000FEB18
Digestive System - 40%, Hemic and Immune System - 40%, Nervous System - 20%

3
LG:980153.1:2000FEB18
Nervous System - 16%, Urinary Tract - 12%, Skin - 12%

4
LG:350398.1:2000FEB18
Digestive System - 50%, Hemic and Immune System - 50%

5
LG:475551.1:2000FEB18
Skin - 35%, Hemic and Immune System - 19%, Digestive System - 11%

6
LG:481407.2:2000FEB18
widely distributed

7
LI:443580.1:2000FEB01
Unclassified/Mixed - 60%, Connective Tissue - 17%, Endocrine System - 13%

8
LI:803015.1:2000FEB01
Urinary Tract - 63%, Respiratory System - 38%

9
LG:027410.3:2000MAY19
Respiratory System - 100%

10
LG:171377.1:2000MAY19
Unclassified/Mixed - 74%, Female Genitalia - 13%, Cardiovascular System - 10%

11
LG:352559.1:2000MAY19
Unclassified/Mixed - 71%, Digestive System - 29%

12
LG:247384.1:2000MAY19
Stomatognathic System - 39%, Musculoskeletal System - 28%, Cardiovascular

System - 19%

13
LG:403872.1:2000MAY19
Nervous System - 40%, Embryonic Structures - 23%, Urinary Tract - 14%

14
LG:1135213.1:2000MAY19
Embryonic Structures - 24%, Cardiovascular System - 20%, Unclassified/Mixed - 13%

15
LG:474284.2:2000MAY19
Unclassified/Mixed - 14%

16
LG:342147.1:2000MAY19
Pancreas - 21%, Male Genitalia - 19%, Female Genitalia - 17%, Urinary Tract - 17%

17
LG:1097300.1:2000MAY19
Endocrine System - 25%, Skin - 18%, Unclassified/Mixed - 13%

18
LG:444850.9:2000MAY19
Digestive System - 28%, Connective Tissue - 20%, Exocrine Glands - 10%

19
LG:402231.6:2000MAY19
Endocrine System - 23%, Hemic and Immune System - 23%, Digestive System - 18%

20
LG:1076157.1:2000MAY19
Embryonic Structures - 50%, Endocrine System - 28%, Respiratory System - 17%

21
LG:1083142.1:2000MAY19
Germ Cells - 84%

22
LG:1083264.1:2000MAY19
Liver - 52%, Connective Tissue - 33%

23
LG:350793.2:2000MAY19
Sense Organs - 25%, Connective Tissue - 14%

24
LG:408751.3:2000MAY19
Nervous System - 39%, Sense Organs - 39%

25
LI:336120.1:2000MAY01
Nervous System - 24%, Respiratory System - 22%, Endocrine System - 18%

26
LI:234104.2:2000MAY01
Female Genitalia - 21%, Unclassified/Mixed - 17%, Nervous System - 12%

27
LI:450887.1:2000MAY01
Nervous System - 100%

28
LI:119992.3:2000MAY01
Embryonic Structures - 10%

29
LI:197241.2:2000MAY01
Connective Tissue - 26%, Endocrine System - 12%

30
LI:406860.20:2000MAY01
Digestive System - 100%

31
LI:142384.1:2000MAY01
Connective Tissue - 44%, Germ Cells - 34%

32
LI:895427.1:2000MAY01
Cardiovascular System - 20%, Urinary Tract - 14%, Skin - 13%

33
LI:757439.1:2000MAY01
Digestive System - 18%, Embryonic Structures - 13%, Sense Organs - 12%

34
LI:1144066.1:2000MAY01
Cardiovascular System - 59%, Exocrine Glands - 25%

35
LI:243660.4:2000MAY01
Pancreas - 63%

36
LI:334386.1:2000MAY01
Exocrine Glands - 17%, Male Genitalia - 16%, Musculoskeletal System - 13%

37
LI:347572.1:2000MAY01
Digestive System - 30%, Digestive System - 23%, Respiratory System - 17%

38
LI:817314.1:2000MAY01
Unclassified/Mixed - 55%, Male Genitalia - 26%, Female Genitalia - 11%

39
LI:000290.1:2000MAY01
Female Genitalia - 54%

40
LI:023518.3:2000MAY01
Urinary Tract - 50%, Musculoskeletal System - 27%, Hemic and Immune System - 23%

41
LI:1084246.1:2000MAY01
Sense Organs - 72%

42
LI:1165828.1:2000MAY01
Musculoskeletal System - 19%, Germ Cells - 18%, Nervous System - 14%

43
LI:007302.1:2000MAY01
Connective Tissue - 29%, Respiratory System - 21%, Hemic and Immune System -

18%

44
LI:236386.4:2000MAY01
Skin - 30%, Female Genitalia - 11%

45
LI:252904.5:2000MAY01
Exocrine Glands - 20%, Nervous System - 16%, Endocrine System - 13%

TABLE 6

SEQ ID

Probability

NO:
Frame
Length
Start
Stop
GI Number
score
Annotation

46
3
263
27
815
g10764778
1e−131
phosphoinositol 3-phosphate-binding protein-2

[Homo sapiens]

g10045840
1e−58
TPC2 [unidentified]

g4589582
2e−28
KIAA0969 protein [Homo sapiens]

47
1
217
10
660
g6634025
1e−81
KIAA0379 protein [Homo sapiens]

g6453538
6e−77
hypothetical protein [Homo sapiens]

g4803678
7e−29
ankyrin (brank-2) [Homo sapiens]

48
1
716
613
2760
g7243215
0.0
KIAA1417 protein [Homo sapiens]

g7263990
0.0
dJ93K22.1 (novel protein (contains

DKFZP564B116)) [Homo sapiens]

g7302944
5e−57
CG8060 gene product [Drosophila melanogaster]

49
3
107
60
380

50
3
645
3
1937
g4826478
0.0
dJ37E16.2 (SH3-domain binding protein 1) [Homo

sapiens]

g861029
0.0
SH3 domain binding protein [Mus musculus]

g7018521
0.0
hypothetical protein [Homo sapiens]

51
3
177
93
623
g6119546
1e−45
hypothetical protein; 114721-113936

[Arabidopsis thaliana]

g6522593
3e−10
putative RNA binding protein [Arabidopsis

thaliana]

g950424
4e−10
splicing factor, arginine/serine-rich 7 [Homo

sapiens]

52
1
217
79
729
g4589566
3e−34
KIAA0961 protein [Homo sapiens]

g3970712
3e−26
zinc finger protein 10 [Homo sapiens]

g7630121
8e−25
zinc finger protein 92 [Mus musculus]

53
3
151
3
455
g5262560
2e−35
hypothetical protein [Homo sapiens]

g10434856
1e−29
unnamed protein product [Homo sapiens]

g930123
9e−27
zinc finger protein (583 AA) [Homo sapiens]

54
3
193
3
581
g10438267
1e−74
unnamed protein product [Homo sapiens]

g7290756
8e−16
CG4532 gene product [Drosophila melanogaster]

g5705877
8e−10
POD-1 [Caenorhabditis elegans]

55
3
282
3
848
g3077703
1e−111
mitsugumin29 [Oryctolagus cuniculus]

g3461888
1e−108
mitsugumin29 [Mus musculus]

g3761107
1e−108
mitsugumin29 [Mus musculus]

56
2
211
2
634
g7243243
2e−44
KIAA1431 protein [Homo sapiens]

g4567179
2e−43
BC37295_1 [Homo sapiens]

g3445181
1e−41
R31665_2 [Homo sapiens]

57
2
366
83
1180
g9945010
1e−120
RING-finger protein MURF [Mus musculus]

g9929937
5e−92
hypothetical protein [Macaca fascicularis]

g10439844
1e−36
unnamed protein product [Homo sapiens]

58
3
326
354
1331
g7020303
0.0
unnamed protein product [Homo sapiens]

g10434892
3e−79
unnamed protein product [Homo sapiens]

g6683707
2e−31
KIAA0455 protein [Homo sapiens]

59
1
156
70
537
g6692607
2e−69
MGA protein [Mus musculus]

g5931585
9e−47
T-box family member; T-box domain [Cynops

pyrrhogaster]

g4049463
3e−16
transcription factor TBX6 [Homo sapiens]

60
2
262
239
1024
g1488047
7e−12
RING finger protein [Xenopus laevis]

g3916727
1e−11
estrogen-responsive B box protein [Homo

sapiens]

g401763
1e−11
ataxia-telangiectasia group D-associated

protein [Homo sapiens]

61
3
132
138
533

62
2
167
2
502
g2078531
2e−71
Mlark [Mus musculus]

g2078529
2e−70
Hlark [Homo sapiens]

g1149523
8e−57
Neosin [Mus musculus]

63
1
570
160
1869
g183002
0.0
guanylate binding protein isoform I [Homo

sapiens]

g829177
0.0
guanylate binding protein isoform II [Homo

sapiens]

g7023332
0.0
unnamed protein product [Homo sapiens]

64
3
168
3
506
g7020737
2e−89
unnamed protein product [Homo sapiens]

g8920240
2e−89
AK000559 hypothetical protein, similar to

(U06944) PRAJA1 [Mus musculus] [Homo sapiens]

g2979531
2e−51
R33683_3 [Homo sapiens]

65
3
246
57
794
g5262560
3e−65
hypothetical protein [Homo sapiens]

g10434856
4e−64
unnamed protein product [Homo sapiens]

g930123
7e−56
zinc finger protein (583 AA) [Homo sapiens]

66
3
120
51
410
g4589566
2e−23
KIAA0961 protein [Homo sapiens]

g456269
7e−22
zinc finger protein 30 [Mus musculus

domesticus]

g5080758
2e−20
BC331191_1 [Homo sapiens]

67
2
122
329
694
g10047297
7e−26
KIAA1611 protein [Homo sapiens]

g8163824
2e−19
krueppel-like zinc finger protein HZF2 [Homo

sapiens]

g3329372
6e−19
DNA-binding protein [Homo sapiens]

68
3
428
132
1415
g6094684
0.0
similar to Kelch proteins; similar to

BAA77027 (PID: g4650844) [Homo sapiens]

g7242973
0.0
KIAA1309 protein [Homo sapiens]

g7243089
0.0
KIAA1354 protein [Homo sapiens]

69
2
307
2
922
g8671168
1e−135
hypothetical protein [Homo sapiens]

g8886025
1e−135
collapsin response mediator protein-5 [Homo

sapiens]

g8671360
1e−131
Ulip-like protein [Rattus norvegicus]

70
1
198
856
1449
g1864085
1e−103
glypican-5 [Homo sapiens]

g3015542
1e−103
glypican-5 [Homo sapiens]

g205800
7e−38
intestinal protein OCI-5 [Rattus norvegicus]

71
1
227
511
1191
g1155088
1e−06
zyxin [Homo sapiens]

g1545954
1e−06
zyxin [Homo sapiens]

g576623
2e−06
ESP-2 [Homo sapiens]

72
3
122
3
368
g7629994
4e−41
60S RIBOSOMAL PROTEIN L36 homolog

[Arabidopsis thaliana]

g3236242
5e−40
60S ribosomal protein L36 [Arabidopsis

thaliana]

g11908070
5e−40
60S ribosomal protein-like protein

[Arabidopsis thaliana]

73
2
209
500
1126
g10435614
1e−113
unnamed protein product [Homo sapiens]

g7243089
1e−113
KIAA1354 protein [Homo sapiens]

g7242973
1e−107
KIAA1309 protein [Homo sapiens]

74
1
312
961
1896
g7243215
1e−157
KIAA1417 protein [Homo sapiens]

g7263990
1e−157
dJ93K22.1 (novel protein (contains

DKFZP564B116)) [Homo sapiens]

g7302944
3e−17
CG8060 gene product [Drosophila melanogaster]

75
3
190
3
572
g10435919
6e−69
unnamed protein product [Homo sapiens]

g3327128
3e−33
KIAA0657 protein [Homo sapiens]

g10436504
4e−09
unnamed protein product [Homo sapiens]

76
3
295
3
887
g10436290
1e−105
unnamed protein product [Homo sapiens]

g10436002
6e−99
unnamed protein product [Homo sapiens]

g8489831
2e−27
ubiquitin-conjugating BIR-domain enzyme

APOLLON [Homo sapiens]

77
2
288
374
1237
g3184264
5e−94
F02569_2 [Homo sapiens]

g10435546
5e−84
unnamed protein product [Homo sapiens]

g6653742
4e−54
7h3 protein [Homo sapiens]

78
1
294
97
978
g7670362
1e−106
unnamed protein product [Mus musculus]

g6175860
4e−15
g1-related zinc finger protein [Mus musculus]

g6330555
1e−13
KIAA1214 protein [Homo sapiens]

79
3
196
3
590
g3513300
3e−65
F16601_1, partial CDS [Homo sapiens]

g3882281
3e−50
KIAA0780 protein [Homo sapiens]

g10567164
4e−50
gene amplified in squamous cell carcinoma-1

[Homo sapiens]

80
3
745
285
2519
g2224553
0.0
KIAA0306 [Homo sapiens]

g4210501
0.0
BC85722_1 [Homo sapiens]

g10728201
3e−20
CG2779 gene product [Drosophila melanogaster]

81
3
256
507
1274
g6330617
1e−132
KIAA1223 protein [Homo sapiens]

g7301689
2e−72
CG10011 gene product [Drosophila

melanogaster]

g4803678
2e−33
ankyrin (brank-2) [Homo sapiens]

82
1
235
841
1545
g9802433
2e−76
ACE-related carboxypeptidase ACE2 [Homo

sapiens]

g5817160
2e−76
hypothetical protein [Homo sapiens]

g11876766
2e−76
unnamed protein product [Homo sapiens]

83
1
617
229
2079
g6665594
0.0
trp-related protein 4 truncated variant delta

[Homo sapiens]

g6665592
0.0
trp-related protein 4 truncated variant beta

[Homo sapiens]

g6665590
0.0
trp-related protein 4 [Homo sapiens]

84
3
293
735
1613
g7242977
1e−143
KIAA1311 protein [Homo sapiens]

g912755
2e−15
B0336.3 gene product [Caenorhabditis elegans]

g7298595
8e−12
CG10084 gene product [Drosophila

melanogaster]

85
3
276
30
857
g3955100
2e−74
vacuolar adenosine triphosphatase subunit D

[Mus musculus]

g1226235
2e−74
Ac39/physophilin [Mus musculus]

g736727
2e−74
32 kd accessory protein [Bos taurus]

86
3
355
1392
2456
g5457043
0.0
protocadherin beta 4 [Homo sapiens]

g11142065
0.0
protocadherin beta 9 [Homo sapiens]

g8926617
0.0
protocadherin 3H [Homo sapiens]

87
2
745
716
2950
g5457023
0.0
protocadherin alpha 9 short form protein

[Homo sapiens]

g3540157
0.0
KIAA0345-like 5 [Homo sapiens]

g2224631
0.0
KIAA0345 [Homo sapiens]

88
2
781
50
2392
g5006248
0.0
TLR6 [Homo sapiens]

g11596326
0.0
toll-like receptor 6 [Mus musculus]

g5006250
0.0
TLR6 [Mus musculus]

89
2
293
1313
2191
g6164628
2e−27
SH3 and PX domain-containing protein SH3PX1

[Homo sapiens]

g5327052
2e−27
dJ403L10.1 (SNX9 (Sorting Nexin 9)) [Homo

sapiens]

g4689258
2e−27
sorting nexin 9 [Homo sapiens]

90
1
241
214
936
g7022971
1e−62
unnamed protein product [Homo sapiens]

g3882311
4e−15
KIAA0795 protein [Homo sapiens]

g4539520
4e−14
dA22D12.1 (novel protein similar to

Drosophila Kelch (Ring Canal protein, KEL)

and a heterogenous set of other types of

proteins) [Homo sapiens]

TABLE 7

Parameter

Program
Description
Reference
Threshold

ABI
A program that removes vector sequences and
Applied Biosystems, Foster City, CA.

FACTURA
masks ambiguous bases in nucleic acid

sequences.

ABI/
A Fast Data Finder useful in comparing and
Applied Biosystems, Foster City, CA;
Mismatch <50%

PARACEL FDF
annotating amino acid or nucleic acid
Paracel Inc., Pasadena, CA.

sequences.

ABI
A program that assembles nucleic
Applied Biosystems, Foster City, CA.

AutoAssembler
acid sequences.

BLAST
A Basic Local Alignment Search Tool useful
Altschul, S. F. et al. (1990) J. Mol. Biol.
ESTs: Probability

in sequence similarity search for amino acid
215: 403-410; Altschul, S. F. et al. (1997)
value = 1.0E−8 or less

and nucleic acid sequences. BLAST includes five
Nucleic Acids Res. 25: 3389-3402.
Full Length sequences:

functions: blastp, blastn, blastx, tblastn,

Probability value =

and tblastx.

1.0E−10 or less

FASTA
A Pearson and Lipman algorithm that
Pearson, W. R. and D. J. Lipman
ESTs: fasta E

searches for similarity between a query
(1988) Proc. Natl. Acad Sci. USA 85:
value = 1.06E−6

sequence and a group of sequences of the same
2444-2448; Pearson, W. R. (1990) Methods
Assembled ESTs: fasta

type. FASTA comprises as least five functions:
Enzymol. 183: 63-98; and Smith, T. F.
Identity = 95% or greater

fasta, tfasta, fastx, tfastx, and ssearch.
and M. S. Waterman (1981) Adv. Appl. Math.
and Match length = 200

2: 482-489.
bases or E value =

1.0E−8 or less greater;

fastx Full Length

sequences:

fastx score =100 or

greater

BLIMPS
A BLocks IMProved Searcher that matches
Henikoff, S. and J. G. Henikoff
Probability value =

a sequence against those in BLOCKS,
(1991) Nucleic Acids Res. 19: 6565-6572;
1.0E−3 or less

PRINTS, DOMO, PRODOM, and PFAM databases
Henikoff, J. G. and S. Henikoff (1996)

to search for gene families, sequence homology,
Methods Enzymol. 266: 88-105; and Attwood,

and structural fingerprint regions.
T. K. et al. (1997) J. Chem. Inf.

Comput. Sci. 37: 417-424.

HMMER
An algorithm for searching a query
Krogh, A. et al. (1994) J. Mol. Biol.,
PFAM hits:

sequence against hidden Markov model
235: 1501-1531; Sonnhammer,
Probability value =

(HMM)-based databases of protein family consensus
E. L. L. et al. (1988) Nucleic Acids Res. 26:
1.0E−3 or less Signal

sequences, such as PFAM.
320-322; Durbin, R. et al. (1998) Our World
peptide hits: Score = 0

View, in a Nutshell, Cambridge Univ. Press,
or greater

pp. 1-350.

ProfileScan
An algorithm that searches for structural
Gribskov, M. et al. (1988) CABIOS 4: 61-66;
Normalized quality

and sequence motifs in protein sequences that
Gribskov, M. et al. (1989) Methods Enzymol.
score ≧ GCG- specified

match sequence patterns defined in Prosite.
183: 146-159; Bairoch, A. et al. (1997)
“HIGH” value for that

Nucleic Acids Res. 25: 217-221.
particular Prosite motif.

Generally, score =

1.4-2.1.

Phred
A base-calling algorithm that examines
Ewing, B. et al. (1998) Genome Res.

automated sequencer traces with high sensitivity
8: 175-185; Ewing, B. and

and probability.
P. Green

(1998) Genome Res. 8: 186-194.

Phrap
A Phils Revised Assembly Program including
Smith, T. F. and M. S. Waterman (1981)
Score = 120 or greater;

SWAT and CrossMatch, programs based on
Adv. Appl. Math. 2: 482-489; Smith,
Match length = 56 or

efficient implementation of the Smith-Waterman
T. F. and M. S. Waterman (1981) J.
greater

algorithm, useful in searching sequence homology
Mol. Biol. 147: 195-197; and Green, P.,

and assembling DNA sequences.
University of Washington, Seattle, WA.

Consed
A graphical tool for viewing and editing Phrap
Gordon, D. et al. (1998) Genome Res. 8: 195-202.

assemblies.

SPScan
A weight matrix analysis program that
Nielson, H. et al. (1997) Protein Engineering
Score = 3.5 or greater

scans protein sequences for the presence of
10: 1-6; Claverie, J. M. and S. Audic (1997)

secretory signal peptides.
CABIOS 12: 431-439.

TMAP
A program that uses weight matrices to
Persson, B. and P. Argos (1994)

delineate transmembrane segments on protein
J. Mol. Biol. 237: 182-192; Persson, B. and

sequences and determine orientation.
P. Argos (1996) Protein Sci. 5: 363-371.

TMHMMER
A program that uses a hidden Markov
Sonnhammer, E. L. et al. (1998) Proc.

model (HMM) to delineate transmembrane
Sixth Intl. Conf. on Intelligent Systems for

segments on protein sequences and
Mol. Biol., Glasgow et al., eds., The Am.

determine orientation.
Assoc. for Artificial Intelligence Press, Menlo

Park, CA, pp. 175-182.

Motifs
A program that searches amino acid
Bairoch, A. et al. (1997) Nucleic Acids

sequences for patterns that matched those
Res. 25: 217-221; Wisconsin Package

defined in Prosite.
Program Manual, version 9, page

M51-59, Genetics Computer Group,

Madison, WI.

Number	Date	Country
60185213	Feb 2000	US
60205232	May 2000	US
60205285	May 2000	US
60205323	May 2000	US
60205287	May 2000	US
60205324	May 2000	US
60205286	May 2000	US

Molecules for disease detection and treatment

Information

Publication Number

Date Filed

Date Published

Inventors

CPC

US Classifications

International Classifications

Abstract

Description

Claims

PCT Information

Provisional Applications (7)