Methods for genetic control of plant pest infestation and compositions thereof

Abstract
The present invention is directed to controlling plant pest infestation, and particularly plant nematode infestation, by inhibiting one or more biological functions in the plant pest. The invention discloses methods and compositions for use in controlling plant pest infestation by providing one or more different recombinant double stranded RNA molecules in the diet of the pest in order to achieve a reduction in pest infestation through suppression of pest gene expression. The invention is also directed to methods for making transgenic plants that express the double stranded RNA molecules, to methods for detecting cells comprising the disclosed sequences, and to methods for detecting the disclosed sequences in biological samples.
Description
FIELD OF THE INVENTION

The present invention relates generally to the field of molecular biology and more specifically to the genetic control of plant pests, and even more particularly to the genetic control of Heterodera nematode infestations in plants. More specifically, the present invention relates to methods for modifying expression of one or more polynucleotide and/or protein molecules in one or more cells or tissues of a pest species. The present invention discloses substantially the entire genome sequence of the plant nematode pest, Heterodera glycines, and describes the use of these sequences to modify the expression of one or more target polynucleotide or protein molecules in at least the cells of a Heterodera species by providing in its diet a dsRNA that comprises a part of, or all, or substantially all of one or more polynucleotide molecules of the present invention.


BACKGROUND OF THE INVENTION

Plants and animals are targets of many different pests, including but not limited to nematode and insect pest species. Crops are often the targets of nematode infestations. Chemical nematicides are not effective in eradicating the nematode infestations. Chemical pesticidal agents are not selective and exert their effects on non-target fauna as well, often effectively sterilizing for a period of time a field over which the chemical nematicidal agents have been applied. Some chemical pesticidal agents have been shown to accumulate in food, and to exhibit adverse effects on workers that manufacture and apply such chemical agents. Thus there has been a long felt need for methods for controlling or eradicating nematode pest infestation on or in plants, i.e., methods which are selective, environmentally inert, non-persistent, biodegradable, and that fit well into pest resistance management schemes. Plant biotechnology provides a means to control pest infestations by providing plants that express one or more pest control agents. Recombinant pest control agents have generally been reported to be proteins selectively toxic to a target pest that are expressed by the cells of a recombinant plant. Recently, small RNA molecules provided in the diet of the pest species Meloidogyne incognita have been shown to exhibit effects on the viability of the pest by affecting gene expression in the pest cells (Tobias et al. WO 01/37654 A2). Recombinant approaches to plant pest control can be selective, and are environmentally inert and non-persistent because they are fully biodegradable.


The phenomenon of dsRNA mediated gene silencing has been demonstrated in a number of plant and animal systems (Fire et al. 1998 Nature 391:806-811; Waterhouse et al. 1998 PNAS USA 95:13959-13964; Tabara et al. 1998 Science 282:430-431; Fire et al. WO 99/32619 A1; Trick et al. WO 2004/005485 A2). Methods for delivering dsRNA into the animal systems involved generating transgenic insects that express double stranded RNA molecules or injecting dsRNA solutions into the body of the animal or within the egg sac prior to or during embryonic development. Double stranded RNA mediated gene suppression has been demonstrated in plant parasitic nematodes either by providing dsRNA or miRNA's in the nematodes' diet or by soaking the nematodes in solutions containing such RNA molecules (Atkinson et al., (The University of Leeds) WO 03/052110 A2; Trick et al., (Kansas State University Research Foundation) US 2004-009876A1). Cyst nematodes (Heterodera and Globodera species) are particularly damaging pests of crop plants. Cyst nematodes include but are not limited to Heterodera avenae, H. cruciferae, H. glycines, H. hordecalis, H. latipons, H. oryzae, H. oryzicola, H. rostochinesis, H. zeae, H. schachtii, G. achilleae, G. artemisiae, G. mexicana, G. millefolii, G. pallida, G. rostochiensis, G. tabacum, G. tabacum solanacearum, G. tabacum tabacum, G. tabacum virginiae, Globodera sp. Bouro, Globodera sp. Canha, Globodera sp. Ladeiro, Globodera sp. New Zealand-EK-2004, and Globodera sp. Peru-EK-2004. These species are known to parasitize a wide variety of crops including, but not limited to barley, corn, oats, rice, rye, wheat, cabbage, cauliflower, soybean, sugar beet, spinach, mustards, and potato. Cyst nematodes are particularly problematic. Eggs persist and remain viable in the soil for many years. Genetic resistance by conventional crop breeding has limited success in identifying resistance genes effective against the wide variety of races and biotypes of the cyst nematodes. Of particular concern is the soybean cyst nematode, Heterodera glycines, herein referred to as SCN.


Therefore, there exists a need for improved methods and compositions useful to modulate gene expression by repressing, delaying or otherwise reducing gene expression within a particular plant nematode pest for the purpose of controlling the nematode infestation or to introduce novel agronomically valuable phenotypic traits.


SUMMARY OF THE INVENTION

The present invention comprises methods and compositions for inhibiting expression of one or more target genes and proteins at least in cyst nematodes such as members of the Heterodera and Globodera species. More specifically, the present invention comprises a method of modulating or inhibiting expression of one or more target genes in Heterodera glycines and Heterodera schactii, to cause cessation of feeding, growth, development, reproduction and infectivity and eventually result in the death of the nematode pest. The method comprises introduction of partial or fully, stabilized double-stranded RNA (dsRNA) or its modified forms, such as small interfering RNA (siRNA) or micro RNA (miRNA) sequences, into the cells of the nematode wherein expression of at least one or more target genes is inhibited in the target nematode pest, wherein inhibition of the one or more target genes exerts a deleterious effect upon the nematode pest, wherein the dsRNA, siRNA, or miRNA are derived from the target gene sequences, specifically inhibit such target genes in the target pest, and so are specific to nematode pests such as those of the Heterodera species. It is specifically contemplated that the methods and compositions of the present invention will be useful in limiting or eliminating Heterodera infestation in or on any cyst nematode host by providing one or more compositions comprising dsRNA molecules in the diet of the nematode, wherein the diet is all or part of a plant cell.


In another aspect of the present invention, DNA molecules of the present invention comprise molecules that function as promoter sequences, polypeptide coding sequences, non-coding regulatory sequences, or polyadenylation sequences isolated from the genome of the soybean cyst nematode, the polynucleotide sequence of which is at least from about 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or about 100% identical to sequences selected from the group consisting of SEQ ID NO: 1 through SEQ ID NO:45568, the complement thereof, or a portion thereof. A DNA molecule selected from the group consisting of SEQ ID NO:97730 through SEQ ID NO:119145 exhibits promoter activity, and a DNA molecule selected from the group consisting of SEQ ID NO:45569 through SEQ ID NO:47643 comprises at least one protein coding sequence, whether or not a complete open reading frame is exhibited.


Accordingly, in another aspect of the present invention, a set of isolated and purified polynucleotide sequences as set forth in SEQ ID NO:45569 through SEQ ID NO:47643 are provided as target sequences for the design of DNA constructs that express a stabilized dsRNA, siRNA, or miRNA molecule for inhibition of expression of a target gene in a nematode pest. A stabilized dsRNA, siRNA, or miRNA molecule can comprise two or more polynucleotide molecules that are arranged in a sense and an antisense orientation relative to at least one promoter, wherein the polynucleotide molecule that comprises a sense strand and an antisense strand are linked or connected by a spacer sequence of at least from about five to about one thousand nucleotides, wherein the sense strand and the antisense strand are at least about the same length, and wherein each of the two polynucleotide sequences shares at least about 80% sequence identity, at least about 90%, at least about 95%, at least about 98%, or even about 100% sequence identity, to a polynucleotide sequence as set forth in one of SEQ ID NO:45569 through SEQ ID NO:47643.


The present invention provides a method for identifying a DNA molecule for use as a DNA construct expressing a dsRNA-mediated gene silencing sequence in a plant cell, comprising selecting a target polynucleotide molecule of a Heterodera glycines polynucleotide sequence comprising 21 or more contiguous nucleotides wherein said polynucleotide sequence is selected from the group consisting of SEQ ID NO:45569-50775, SEQ ID NO:45569-47643, and SEQ ID NO:47644-50775.


The present invention also provides a recombinant DNA molecule for use in plant transformation, constructed to contain at least one polynucleotide molecule transcribed as a single stranded RNA molecule. The single stranded RNA molecule is capable of forming in vivo a double stranded RNA molecule through intermolecular hybridization that, when provided in the diet of a nematode pest, inhibits the expression of at least one target gene in one or more cells of the target organism. The polynucleotide molecule is operably linked to at least one promoter sequence that functions in a transgenic plant cell to transcribe the polynucleotide molecule into one or more ribonucleic acid molecules. The RNA molecule(s) self assemble into double stranded RNA molecules and are provided in the diet of a target pest that feeds upon the transgenic plant. The provision of the dsRNA molecule in the diet of the pest achieves the desired inhibition of expression of one or more target genes within the pest organism, resulting in fecundicity, morbidity, and/or mortality of the target pest.


The present invention also provides a recombinant plant cell having in its genome at least one recombinant DNA sequence that is transcribed to produce at least one dsRNA molecule that functions when the cell and/or its contents are ingested by a target nematode or pest to inhibit the expression of at least one target gene in the target nematode or pest. The dsRNA molecule is transcribed from all or a portion of a polynucleotide molecule that at least in part exhibits from about 75 to about 100% identity to a target nematode specific polynucleotide sequence as set forth in SEQ ID NO:45569 through SEQ ID NO:50775.


The present invention also provides a recombinant DNA construct for expression of a dsRNA-mediated gene silencing sequence in a plant cell, which comprises at least two different target sequences that, when expressed in vivo as RNA sequences and provided in the diet of a target nematode pest, inhibit the expression of at least two different target genes in one or more cells or tissues of the target nematode pest. A first target sequence exhibits at least from about 75 to about 100 percent identity to a first specific polynucleotide sequence region as set forth in SEQ ID NO:45569-SEQ ID NO:50775, and a second target sequence that is different from the first target sequence exhibits at least about 75 to about 100 percent identity to a second specific polynucleotide sequence region as set forth in SEQ ID NO:45569 through SEQ ID NO:50775, wherein the two or more target sequences are assembled in a DNA construct and expressed together as a single RNA transcript, and are constructed to form one or more dsRNA's useful in suppression of the one or more target genes. The DNA construct is transformed into a soybean cell, and the cell is regenerated into a recombinant plant. The dsRNA molecules are thus provided in the diet of the target nematode pest in a target nematode pest inhibitory concentration. Ingestion by the target nematode pest of recombinant plant cells or tissues expressing the recombinant dsRNA achieves the desired inhibition of expression of one or more target genes in the nematode, resulting in the fecundicity, morbidity, and/or mortality of the target nematode pest.


Another aspect of the present invention is the use of the nucleotide sequences disclosed herein to identify target sequences that occur in the transcript RNA of other plant pests, in particular insect pests, fungal pests, and nematode pests, that could be targeted simultaneously and/or contemporaneously with a single expression construct designed to suppress related genes in multiple plant pests by identifying sequences of sufficient length and identity in two or more different plant pests, and ensuring that the DNA construct used to produce a recombinant plant expresses one or more dsRNA molecules that function to effectively suppress one or more related genes in the two or more different plant pests. In particular, some contiguous nucleotide sequences equal to or greater than about 21-24 nucleotides in length are identified herein to be present within the genome of the soybean cyst nematode (SCN) Heterodera glycines and have been identified to be present as well in Heterodera schactii, and to some extent, other pest species as well, such as in several other nematode pest species as well as in other plant pest species such as specific insect nucleotide sequences, and in animal pest species such as insect pest species. Such sequences may be useful for effectively suppressing a target sequence in these plant pests, particularly when expressed as a dsRNA molecule in a recombinant plant cell that is provided in the diet of the pest, and can provide resistance to the plant from pest infestation from all or substantially all pests in which such sequences appear, in particular if the sequences in common are within genes shown to be essential for survival, reproduction, mobility, and/or development and differentiation.


The target sequences disclosed in the present invention can be used to identify related target sequences that occur in the transcript RNA of other pest species, particularly nematode species including but not limited to pests such as Heterodera species such as H. avenae, H. ciceri, H. crucifera, H. cyperi, H. fici, H. goettingiana, H. hordecalis, H. humuli, H. latipons, H. litoralis, H. medicaginis, H. mediterranea, H. oryzae, H. oryzicola, H. riparia, H. rostochinesis, H. salixophila, H. schachtii, H. sorghi, H. trifolii, H. turcomanica, and H. zeae, Meloidogyne species such as M. arenaria, M. chitwoodi, M. artiellia, M. fallax, M. hapla, M. javanica, M. incognita, M. microtyla, M. partityla, M. panyuensis, and M. paranaensis, Globodera species such as G. pallida, G. rostochiensis, and G. tabacum, Pratylenchus species such as P. brachyrus, P. crenatus, P. coffeae, P. magnica, P. neglectu, P. penetrans, P. scribneri, P. thornei, and P. vulnus. Other plant pest nematode species hat are within the scope of the present invention include but are not limited to Xiphinema species, Nacobbus species, Hoplolaimus species, Paratylenchus species, Rotylenchulus species, Criconemella species, Hemicycliophora species, Helicotylenchus species, Rotylenchus species, Belonolaimus species, Trichodorus species, Tylenchorhynchus species, Radopholus species, Longidorus species, Dolichodorus species, Aphenlenchoides species, Ditylenchus species, Anguina species, and Tylenchulus species. A DNA construct that expresses a dsRNA molecule in a plant cell that has a target sequence common to multiple plant pests provides plant resistance to pest infestation from each pest containing such target sequences. A particular target sequence can be amplified within a single dsRNA transcript, and can contain only a single contiguous sequence of at least from about 17 to about 21 to about 50 nucleotides in common between any combination of pests, or can be comprised of a chimera consisting of various contiguous sequences of at least from about 17 to about 21 to about 50 or more nucleotides, each such contiguous sequence either being in common between two or more pests, or unique to only a single pest, such that the chimera, when present as a dsRNA sequence and provided in the diet of any one or more of the targeted pests, results in the effective control such one or more pests.


The present invention also provides a method for producing a transgenic plant by introducing into the genome of the plants' cells a polynucleotide sequence consisting of all or a portion of at least one of the aforementioned SCN specific recombinant DNA sequences, linked to linked substantially the complement of that sequence. Transgenic plants are generated from the transformed plant cell, and progeny plants, seeds, and plant products, each comprising the polynucleotide sequence, are produced from the transgenic plants.


The methods and compositions of the present invention may be applied to any monocot and dicot plant, depending on the pest species to be controlled and the host range of the nematode pest. Specifically, the plants are intended to comprise without limitation alfalfa, aneth, apple, apricot, artichoke, arugula, asparagus, avocado, banana, barley, beans, beet, blackberry, blueberry, broccoli, brussel sprouts, cabbage, canola, cantaloupe, carrot, cassaya, cauliflower, celery, cherry, cilantro, citrus, clementine, coffee, corn, cotton, cucumber, Douglas fir, eggplant, endive, escarole, eucalyptus, fennel, figs, gourd, grape, grapefruit, honey dew, jicama, kiwifruit, lettuce, leeks, lemon, lime, Loblolly pine, mango, melon, mushroom, nut, oat, okra, onion, orange, an ornamental plant, papaya, parsley, pea, peach, peanut, pear, pepper, persimmon, pine, pineapple, plantain, plum, pomegranate, poplar, potato, pumpkin, quince, radiata pine, radicchio, radish, raspberry, rice, rye, sorghum, Southern pine, soybean, spinach, squash, strawberry, sugar beet, sugarcane, sunflower, sweet potato, sweetgum, tangerine, tea, tobacco, tomato, turf, a vine, watermelon, wheat, yams, and zucchini plants. Preferably, the present invention is related to a transgenic soybean plant that contains in its genome a DNA construct that expresses a dsRNA molecule from any sequence of the present invention.


The invention also provides a computer readable medium having recorded thereon one or more of the sequences as set forth in SEQ ID NO:1 through SEQ ID NO:171306 and, with reference to nucleotide sequences, the complements thereof, for use in a number of computer based applications, including but not limited to DNA identity and similarity searching, protein identity and similarity searching, transcription profiling characterizations, comparisons between genomes, and artificial hybridization analyses.


BRIEF DESCRIPTION OF THE SEQUENCES

SEQ ID NO:1-SEQ ID NO:45568 correspond to individual sequences (singletons) and assembled singletons forming contiguous overlapping sequences (contigs) derived from DNA sequence analysis of one or more libraries produced from the genome of the soybean cyst nematode strain OP25.


SEQ ID NO:45569-SEQ ID NO:97729 correspond to sequences predicted to encode various proteins, tRNA's, rRNA's and the like, which were identified using the bioinformatics described herein as applied to SEQ ID NO:1-SEQ ID NO:45568, and are further defined in blocks of sequences corresponding to coding sequences characterized as (a) essential to SCN survival (SEQ ID NO:45569-SEQ ID NO:50775) and (b) other coding sequences and elements (SEQ ID NO:50776-SEQ ID NO:97729); and where the essential sequences are further defined in blocks of sequences corresponding to unigenes, EST's, or cDNA's which were (c) linked through bioinformatics analyses described herein to counterpart sequences entirely or partially known in the art (SEQ ID NO:47644-SEQ ID NO:50775) and (d) unique sequences exhibiting no known relationship to sequences known in the art (SEQ ID NO:45569-47643).


SEQ ID NO:97730-SEQ ID NO:119145 correspond to sequences predicted to comprise all or substantially all of one or more SCN promoter sequences.


SEQ ID NO:119146-SEQ ID NO:124352 correspond to amino acid sequences predicted to be encoded from the (a) essential and (b) other coding sequences set forth in SEQ ID NO:45569-SEQ ID NO:97729, and are further defined in blocks of sequences corresponding to (c) peptides essential to SCN survival, as set forth in SEQ ID NO:121221-SEQ ID NO:124352, each based on one or more BLASTP relationship to one or more proteins known to be essential to survival of C. elegans or other organisms (translated from SEQ ID NO:47644-SEQ ID NO:50775), and (d) other peptides lacking any BLASTP relationship to proteins known in the art, as set forth in SEQ ID NO:119146-SEQ ID NO:121220 (translated from SEQ ID NO:45569-SEQ ID NO:47643).







DETAILED DESCRIPTION OF THE INVENTION

The following is a detailed description of the invention provided to aid those skilled in the art in practicing the present invention. Those of ordinary skill in the art may make modifications and variations in the embodiments described herein without departing from the spirit or scope of the present invention.


The inventors have discovered all or substantially all of the polynucleotide sequences that comprise the genomic DNA obtained from the soybean cyst nematode Heterodera glycines, aligned the sequences to derived large blocks of sequence corresponding to genomic contigs set forth herein, analyzed these contigs to identify and characterize untranslated regulatory sequences, for example, promoters, introns, transcriptional initiation sequences, and polyadenylation signals. Genomic polynucleic acid sequences encoding all or part of one or more proteins and characterized as being essential for survival, such as amino acid sequences involved in various metabolic or catabolic biochemical pathways, cell division, reproduction, energy metabolism, digestion, neurological function and the like, are identified in the genomic DNA sequences disclosed in the present invention, and regions of such sequences are demonstrated herein as being useful for selection for use in preparing DNA constructs for use in transforming cells, and that also express double stranded RNA molecules from such constructs in the transformed cells, and when provided in the diet of a target pest, whether artificial diet or a natural diet, especially a plant cell, plant tissues or other plant parts, such as leaves, roots, stems, flowers, fruits or seeds, results in the fecundicity, morbidity, and/or mortality of the pest.


As described herein, ingestion by a target nematode pest of compositions containing one or more dsRNA molecules, wherein at least one segment of the dsRNA molecule corresponds to a substantially identical segment of RNA produced in the cells of the nematode, will result in death, growth inhibition, stunting, inhibition of maturation or fecundity of the nematode. These results indicate that a polynucleotide molecule, either DNA or RNA, derived from SCN can be used to design a DNA construct according to the methods of the present invention to express a recombinant gene product in a transgenic host cell. The host cell can be transformed to contain one or more of the polynucleotide molecules derived from sequences disclosed herein. The DNA construct transformed into the host cell transcribes one or more RNA sequences that form into a dsRNA molecule in the cell or biological fluids within the transformed host, thus making the dsRNA available for ingestion by nematode when it feeds upon the transgenic host. Therefore, the transformed host cell now contains within its genome the genetic potential to defend the host cell and its parents, siblings and children from attack by the nematode.


The present invention relates to genetic control of nematode infestations in host organisms. More particularly, the present invention includes the DNA constructs, selection of target polynucleotides and methods for delivery of nematode control agents to a nematode. The present invention provides methods for employing stabilized dsRNA molecules in the diet of the nematode as a means for suppression of targeted genes in the nematode, thus achieving desired control of nematode infestations in, or about the host or symbiont targeted by the nematode. The preferred host is a plant wherein the plant is transformed with a recombinant DNA construct that expresses recombinant stabilized dsRNA, siRNA, and/or miRNA molecules. The recombinant DNA construct comprises a nucleotide sequence that is transcribed into RNA by the host cell. The term “recombinant DNA” or “recombinant nucleotide sequence” refers to DNA that contains a genetically engineered modification through manipulation as a result of methods for mutagenesis, use of restriction enzymes, and thermal amplification methods, and the like.


The dsRNA molecules, siRNA molecules, and/or miRNA molecules of the present invention are homologous or complementary to at least about a contiguous 17-21 nucleotide sequence selected from the group consisting of SEQ ID NO:45569 through SEQ ID NO:97729. Isolated and purified nucleotide sequences from a SCN are provided from a genomic library constructed from polynucleotide sequences of the pest. The ingestion of such nucleotide sequences results in the reduction or elimination of an essential gene product necessary for the nematode's growth and development or other biological function.


The present invention also contemplates a transformed plant cell and transformed plants and their progeny. The transformed plant cells and transformed plants express one or more of the dsRNA, siRNA, or miRNA sequences of the present invention from one or more of the DNA sequences as set forth in SEQ ID NO:1-SEQ ID NO:45568 and SEQ ID NO:45569 through SEQ ID NO:97729, or the complement thereof.


As used herein the words “gene suppression”, when taken together, are intended to refer to any method for reducing the levels of a gene product as a result of gene transcription to mRNA. Gene suppression is also intended to mean the reduction of protein expression from a gene or a coding sequence including posttranscriptional gene suppression and transcriptional suppression. Posttranscriptional gene suppression is intended to refer to that suppression mediated by the homology between all or a part of an RNA transcript transcribed from a gene or coding sequence targeted for suppression and the corresponding double stranded RNA used for suppression, and refers to the substantial and measurable reduction of the amount of available mRNA available in the cell for binding by ribosomes. The transcribed RNA can be in the sense orientation to effect what is referred to as co-suppression, in the anti-sense orientation to effect what is referred to as anti-sense suppression, or in both orientations producing a dsRNA to effect what is referred to as RNA interference (RNAi). Transcriptional suppression is intended to refer to that suppression mediated by the presence in the cell of a dsRNA, a gene suppression agent, exhibiting substantial sequence identity to a promoter DNA sequence or the complement thereof to effect what is referred to as promoter trans suppression. Gene suppression may be effective against a native plant gene associated with a trait, e.g., to provide plants with reduced levels of a protein encoded by the native gene or with enhanced or reduced levels of an affected metabolite. Gene suppression can also be effective against target genes in plant nematodes that may ingest or contact plant material containing gene suppression agents, specifically designed to inhibit or suppress the expression of one or more homologous or complementary sequences in the cells of a nematode or other pest.


Post-transcriptional gene suppression by anti-sense or sense oriented RNA to regulate gene expression in plant cells is disclosed in U.S. Pat. Nos. 5,107,065, 5,759,829, 5,283,184, and 5,231,020. The use of dsRNA to suppress genes in plants is disclosed in WO 99/53050, WO 99/49029, U.S. Patent Application Publication 2003/0175965 A1, and 2003/0061626 A1, U.S. patent application Ser. No. 10/465,800, and U.S. Pat. Nos. 6,506,559, and 6,326,193.


A preferred method of post transcriptional gene suppression in plants employs both sense-oriented and anti-sense-oriented, transcribed RNA, which is stabilized, e.g., as a hairpin and stem and loop structure. A preferred DNA construct for effecting post-transcriptional gene suppression is one in which a first segment transcribes an RNA molecule in an anti-sense orientation relative to the mRNA of the gene transcript targeted for suppression, the first segment further linked to a second segment spacer region that is not homologous or complementary to the first segment, and linked to a third segment that transcribes an RNA, wherein a portion is substantially complementarity to the first segment. Such a construct would be expected to form a stem and loop structure by hybridization of the first segment with the third segment and a loop structure forms comprising the second segment (WO94/01550, WO98/05770, US 2002/0048814A1, and US 2003/0018993A1).


As used herein, the term “nucleic acid”, “polynucleic acid”, or “polynucleotide” refers to a single or double-stranded polymer of deoxyribonucleotide or ribonucleotide bases (also referred to as nucleotides) read from the 5′ to the 3′ end. The polynucleic acid may optionally contain non-naturally occurring or altered nucleotide bases that permit correct read through by a polymerase. The term “nucleotide sequence” or “polynucleic acid sequence” may refer to both the sense and antisense strands of a polynucleic acid molecule as either individual single strands or in the duplex. The term “ribonucleic acid” (RNA) is inclusive of RNAi (inhibitory RNA), dsRNA (double stranded RNA), siRNA (small interfering RNA), mRNA (messenger RNA), miRNA (micro-RNA), tRNA (transfer RNA, whether charged or discharged with a corresponding acylated amino acid), and cRNA (complementary RNA) and the term “deoxyribonucleic acid” (DNA) is inclusive of cDNA and genomic DNA and DNA-RNA hybrids. The words “nucleic acid segment”, “nucleotide sequence segment”, or more generally “segment” will be understood by those in the art as a functional term that includes genomic sequences, ribosomal RNA sequences, transfer RNA sequences, messenger RNA sequences, operon sequences and smaller engineered nucleotide sequences or portions thereof that control or affect the expression of a gene product or that may be adapted to express proteins, polypeptides or peptides. A polynucleic acid may optionally contain naturally occurring or altered nucleotide bases that prevent polymerization by a first polymerase copying the strand that contains such base(s), i.e., one or more bases that cannot be templated by the first polymerase while polymerizing the nascent or growing strand, so that any nucleotide sequence extending beyond the non-templated base(s) results in a cohesive end that can be used to link the polynucleic acid to one or more other nucleic acid sequences linked to the complement of the cohesive end, resulting in a chimeric nucleotide sequence. The naturally occurring or altered nucleotide base(s) can then be templated to link the fragments comprising the chimeric nucleotide sequence by exposing the chimera to a second polymerase that recognizes the naturally occurring or altered nucleotide base(s) and copies that/those base(s) with fidelity (Jarrell et al. U.S. Pat. No. 6,358,712; Newton et al. 1993 21:1155-1162). This method may be particularly useful when assembling multi-component sequences for expression of an RNA sequence that folds into a dsRNA sequence and functions to suppress one or more genes in one or more target organisms.


As used herein, the term “nematode” refers to plant parasitic nematodes, in particular to members of the Tylenchoidea superfamily, and more specifically to the Heteroderidae family of nematodes that include the cyst nematodes (including at least Heterodera and Globodera species) and the rootknot nematodes (Meloidogyne species). More specifically to Heterodera species and even more specifically to Heterodera glycines, the soybean cyst nematode. Nematode species that were shown to have homologous target sequences with H. glycines polynucleotides of the present invention were: rootknot nematode species—Meloidogyne species such as M. arenaria, M. chitwoodi, M. artiellia, M. fallax, M. hapla, M. javanica, M. incognita, M. microtyla, M. partityla, M. panyuensis, and M. paranaensis; cyst nematode species—Heterodera species such as H. schachtii, Globodera species such as G. rostochiensis, G. pallida, and G. tabacum, Heterodera species such as H. trifolii, H. medicaginis, H. ciceri, H. mediterranea, H. cyperi, H. salixophila, H. zeae, H. goettingiana, H. riparia, H. humuli, H latipons, H. sorghi, H. fici, H. litoralis, and H. turcomanica; lesion nematode species—Pratylenchus species such as P. scribneri, P. magnica, P. thornei, P. crenatus, P. brachyrus, P. vulnus, P. penetrans, P. coffeae, and P. neglectus; other plant parasitic nematode species include: Hirschmanniella species, Radopholus species such as R. similis, and Pratylenchoid magnicauda. Animal intestinal parasitic nematode species for which polynucleotides have been identified as a result of comparisons to the sequence data disclosed herein include Ascaris lumbricoides, and Ascaris suum.


As used herein, a “pest resistance” trait is a characteristic of a transgenic plant, transgenic animal, transgenic host or transgenic symbiont that causes the plant, animal, host, or symbiont to be resistant to attack from a pest that typically is capable of inflicting damage or loss to the plant, animal, host or symbiont. Such pest resistance can arise from a natural genetic variation or more typically from incorporation of recombinant DNA that confers pest resistance. Fire et al. (U.S. Pat. No. 6,506,599) generically described inhibition of pest infestation, and demonstrated gene suppression in the non-pest nematode species Caenorhabditis elegans. Similarly, Plaetinck et al. (US 2003/0061626A1) suggests using dsRNA to inhibit gene function in a variety of nematode pests. Mesa et al. (US 2003/0150017 A1) describe using DNA sequences to transform host cells to express dsRNA sequences that are substantially identical to target sequences in specific pathogens, and particularly describe constructing recombinant plants expressing such dsRNA sequences for ingestion by various plant pests, facilitating down-regulation of a gene in the genome of the pest, and improving the resistance of the plant to the pest infestation. As used herein, the term “expression” refers to the transcription and stable accumulation of a nucleotide sequence comprising both sense and antisense RNA derived from the nucleic acid sequences disclosed in the present invention, whether or not the RNA sequence is capped, spliced, and polyadenylated and trafficked into the cytoplasm of the cell. Expression may also refer to translation of mRNA into a polypeptide or protein. As used herein, the term “sense” RNA refers to an RNA transcript corresponding to a sequence or segment that, when produced by the target nematode, is in the form of a mRNA that is capable of being translated into polypeptide by the target nematode cell. As used herein, the term “antisense RNA” refers to an RNA transcript that is complementary to all or a part of a mRNA that is normally produced in a cell of a nematode. The complementarity of an antisense RNA may be with any part of the specific gene transcript, i.e., at the 5′ non-coding sequence, 3′ non-translated sequence, introns, or the coding sequence. As used herein, the term “RNA transcript” refers to the product resulting from RNA polymerase-catalyzed transcription of a DNA sequence. When the RNA transcript is a perfect complementary copy of the DNA sequence, it is referred to as the primary transcript or it may be an RNA sequence derived from post-transcriptional processing of the primary transcript and is referred to as the mature RNA.


Exposure of a plant cyst forming nematode to the dsRNA, siRNA, or miRNA sequences of the present invention may occur during the nematodes' juvenile J2, J3, J4, adult female or adult male developmental stages. Exposure may occur as the J2 or male nematode is migrating through the plant vasculature, for example the cortical cells, or during or after establishment of a feeding site within syncytial cells. Exposure may occur by the production of the dsRNA in neighboring transfer-like cells with movement into the feeding site. dsRNA, siRNA, or miRNA may enter the nematode through a variety of means including, for example, through the stylet and pharnyx, the anus, the extratory duct, or amphidial and phasmid channels. dsRNA produced in the tissues of the feeding site may enter the nematode by transport through the feeding tube (Hussey, R S and Grundler et. al., 1998, Nematode parasitism of plants, Ch. 9, The Physiology and Biochemistry of Free-living and Plant-parasitic Nematodes, eds R N Perry and D J Wright), directly from the cytoplasm, from extracellular regions, or from other plant compartments. Movement of dsRNA, siRNA, or miRNA into the nematode may require that the RNA exhibit a molecular weight of less than or substantially less than 25 Kda (feeding tube size threshold). Creating an siRNA or miRNA in the plant that is bioavailable to the nematode may require preventing the siRNA from entering or remaining within the plant RISC complex, a protein complex well in excess of 25 KD. For example, this may be accomplished through a number of means such as (1) by co-expressing a small RNA-binding protein that exhibits a greater affinity for the plant RISC complex compared to the nematode specific siRNA, (2) by producing in the transgenic cell a nematode specific siRNA that is incompatible with the plant RISC complex yet functional in the nematode RISC complex, or (3) by down-regulating RISC complex expression in the feeding site established by the nematode. Small RNA-binding proteins may be optimized for binding to a specific siRNA or miRNA by modifying amino acid residues by phage display or other peptide selection methods.


As used herein, the phrase “inhibition of gene expression” or “inhibiting expression of a target gene in the cell of a nematode” may refer to the absence (or observable decrease) in the level of protein and/or mRNA product from the target gene. In the event that a particular transcript or translation product is not detectable, whether or not the lack of detection is a result of the expression of a dsRNA specifically designed to suppress the levels of such transcript or translation product, the phrase “inhibition of gene expression” or “inhibiting expression of a target gene in the cell of a nematode” may refer to the observation of a phenotypic effect or the lack thereof within the plant or within or about the target pest that feeds upon the transgenic plant. Specificity refers to the ability to inhibit the target gene without manifest effects on other genes of the cell and without any effects on any gene within the cell that is producing the dsRNA molecule. The inhibition of gene expression of one or more target genes in the nematode may result in novel phenotypic traits in the nematode.


Without limiting the scope of the present invention, there is provided, in one aspect, a method for controlling plant infestation by a nematode or other plant pest using stabilized dsRNA strategies. The method involves generating stabilized dsRNA molecules as one type of nematode control agents, that when provided in the diet of the nematode, induce gene silencing. As used herein, the phrase “generating a stabilized dsRNA molecule” refers to the methods of employing recombinant DNA technologies to construct a DNA nucleotide sequence that transcribes a stabilized dsRNA. As used herein, the term “silencing” refers the effective “down-regulation” of expression of one or more targeted nucleotide sequences within one or more cells of a nematode or other plant pest and, hence, the elimination of the ability of the targeted nucleotide sequence(s) to cause its normal effect within the cell.


The present invention also provides in part a delivery system for providing a nematode control agent to a nematode through exposure of the nematode to a host, such as a plant containing the one or more control agents of the present invention by ingestion of the plants' cells or the contents of those cells. One embodiment of the present invention provides for generating a transgenic plant cell or a plant that contains a recombinant DNA construct transcribing the stabilized dsRNA molecules of the present invention. As used herein, the phrase “generating a transgenic plant cell or a plant” refers to the methods of employing recombinant DNA technologies to construct a plant transformation vector transcribing the stabilized dsRNA molecules of the present invention, to transform a plant cell or a plant with such vector, and to generate the transformed plant cell or transgenic plant containing a part of the vector that transcribes the stabilized dsRNA molecules. In particular, the method of the present invention may comprise a recombinant DNA construct in a cell of a plant that results in dsRNA transcripts that are substantially homologous to an RNA sequence expressed by a nucleotide sequence contained within the genome of a nematode. Where the nucleotide sequence within the genome of a nematode comprises a gene essential to the viability and infectivity of the nematode, its down-regulation results in a reduced capability of the nematode to survive and/or infect and/or cause damage to host cells. Hence, such down-regulation results in a “deleterious effect” on the maintenance, viability, and infectivity of the nematode, in that it prevents or reduces the nematode's ability to feed off of and survive on nutrients derived from the host cells. By virtue of this reduction in the nematode's viability and infectivity, resistance and/or enhanced tolerance to infection by a nematode or other plant pest is facilitated in the cells of a plant.


It is envisioned that the compositions of the present invention can be incorporated within the seeds of a plant species either as a product of expression from a recombinant gene incorporated into the genome of the plant cells, or incorporated into a coating or seed treatment that is applied to the seed before planting. A plant derived from a single plant cell transformed to contain a recombinant or heterologous gene is considered herein to be a transgenic event.


The present invention also includes seeds and plants having more that one agronomically important trait. Such combinations are referred to as “stacked” traits. These stacked traits can include a combination of traits that are directed at the same target nematode pest, or they can be directed at different target nematode pests, or to one or more insect pests, or can provide herbicide tolerance to the plant, for example tolerance to glyphosate herbicide. The stacked traits can be achieved by breeding to plants that have the trait or by building a chimeric DNA construct that contains multiple plant expression cassettes and transforming the expression cassettes into the genome of the plant.


Cells of a plant seed of the present invention may express one or more dsRNA's, the sequence of any one of which is derived from a target sequence, i.e., a nematode specific sequence disclosed herein in SEQ ID NO:1-SEQ ID NO:45569, and also may express a nucleotide sequence that provides herbicide tolerance, for example, resistance to glyphosate, N-(phosphonomethyl)glycine, including the isopropylamine salt form of such herbicide. Herbicides for which transgenic plant tolerance has been demonstrated include but are not limited to: glyphosate, glufosinate, sulfonylureas, imidazolinones, bromoxynil, delapon, cyclohezanedione, protoporphyrionogen oxidase inhibitors, and isoxasflutole herbicides. Polynucleotide molecules encoding proteins involved in herbicide tolerance are known in the art, and include, but are not limited to a polynucleotide molecule encoding 5-enolpyruvylshikimate-3-phosphate synthase, bromoxynil nitrilase, phytoene desaturase, norflurazon, acetohydroxyacid synthase and the bar gene for tolerance to glufosinate and bialaphos (U.S. Pat. Nos. 5,627,061, 5,633,435, 6,040,497, 5,094,945, and 4,810,648).


As used herein, the term “pest control agent”, or “gene suppression agent” refers to one or more particular RNA molecules consisting of a first RNA segment and a second RNA segment that are complimentary to each other and are linked by a third RNA segment. The complementarity between the first and the second RNA segments results in the ability of the two segments to hybridize in vivo and in vitro to form a double stranded molecule, i.e., a stem comprising the first and the second segments linked together by the third segment which forms a loop between the first and second segments, so that the entire structure forms into a stem and loop structure. Structures consisting of a first and a second segment that hybridize more tightly to each other may form into a stem-loop knotted structure. The first and the second segments, when hybridized together, correspond invariably, and not respectively, to a sense and an antisense sequence with respect to the target RNA transcribed from the target gene in the target nematode that is suppressed by the ingestion of the dsRNA molecule, or ingestion of an siRNA molecule derived from the dsRNA molecule. The pest control agent can also be a substantially purified (or isolated) nucleic acid molecule and more specifically nucleic acid molecules or nucleic acid fragment molecules thereof from a genomic DNA (gDNA) or cDNA library. Such substantially purified molecules can be applied to a seed, whether a seed from a transgenic plant or otherwise, in the form of a seed treatment, together with a pharmaceutically acceptable carrier for stabilizing the dsRNA molecules, resulting in the dsRNA being bioavailable within a plant grown from the seed, or bioavailability within the rhizosphere of the root system of the plant grown from the seed. A seed may be treated with one or more agents, each exhibiting different activities designed to provide the seed, the germinating seedling, and the growing plant or root with one or more advantages in comparison to other plants, such as pest resistance, including bacterial, fungal, and nematode resistance, fertilizers, growth stimulants, gene stimulants or suppressors, herbicide functions to which the seed, germ, and or roots and seedling are resistant, and the like. Alternatively, the fragments may comprise smaller dsRNA oligonucleotides comprising from about 15 to about 750 or more consecutive nucleotides selected from the group consisting of SEQ ID NO:1-SEQ ID NO:45569 and the complements thereof, or from about 15 to about 30 nucleotides, or from about 21 to about 24 consecutive nucleotides. The pest control agent may also refer to a DNA construct that comprises the polynucleic acid molecules or nucleic acid fragment molecules of the present invention and the DNA construct is a transgene incorporated into the genome of a host cell. The pest control agent may further refer to a plant comprising such a DNA construct in its genome or in the genome of a subcellular organelle that comprises the polynucleic acid molecules or nucleic acid fragment molecules described in the present invention. The method of the present invention provides for the production of a dsRNA transcript, the nucleotide sequence of which is substantially homologous to a targeted RNA sequence encoded by a target nucleotide sequence within the genome of a target pest.


As used herein, the term “genome” as it applies to cells of a nematode, a plant pest, or a host encompasses not only chromosomal DNA found within the nucleus, but organelle DNA found within subcellular components of the cell. The sequences of the present invention, when introduced into plant cells, can therefore be either chromosomally integrated or organelle-localized. The term “genome” as it applies to bacteria encompasses both the chromosome and plasmids within a bacterial host cell. The DNA's of the present invention introduced into bacterial host cells can therefore be either chromosomally integrated, localized to a plasmid, or to a viral vector capable of replication in the bacterial host.


In certain preferred embodiments expression of the gene targeted for suppression in the plant pest is inhibited by at least about 10%, at least about 33%, at least about 50%, at least about 80%, at least about 90%, at least about 95%, or by at least about 99% or more within cells of the nematode so a significant inhibition takes place. Significant inhibition is intended to refer to inhibition sufficient to result in a detectable phenotype (e.g., cessation of growth, paralysis, sterility, behavioral effects, second generation effects, effects observed on nematodes ingesting dsRNA or on their progeny, morbidity, or mortality, etc.) or a detectable decrease in RNA and/or protein corresponding to the target gene being inhibited. Although in certain embodiments of the invention inhibition occurs in substantially all cells of the nematode, in other preferred embodiments inhibition occurs in only a subset of cells that are contacted with the dsRNA, or that are expressing the target gene transcript.


The advantages of the present invention may include, but are not limited to the ease of introducing dsRNA into the nematode or other pest cells, the low concentration of dsRNA, siRNA, or miRNA which can be used, the stability of dsRNA, siRNA, or miRNA and the effectiveness of the inhibition. The present invention provides a method for selecting polynucleotide sequences of a target gene sequence and is not limited to in vitro use of specific sequence compositions identified by the method or to the set of exemplary target genes of the present invention. Segments of the nucleotide sequences of the present invention may be selected for their level of gene inhibition/suppression by scanning segments of the H. glycines sequences to identify segments that exhibit preferred levels of gene suppression or pest inhibition when provided as a dsRNA molecule in the diet of one or more target pests such as H. glycines.


As used herein, the term “sequence identity”, “sequence similarity” or “homology” is used to describe sequence relationships between two or more nucleotide or amino acid sequences. The percentage of “sequence identity” between two sequences is determined by comparing two optimally aligned sequences over a comparison window, wherein the portion of the sequence in the comparison window may comprise additions or deletions (i.e., gaps) as compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences. The percentage identity of a reference sequence to another is calculated by determining the number of positions at which the reference sequence (whether nucleic acid or amino acid sequence) is identical to another sequence to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison, and multiplying the result by 100 to yield the percentage of sequence identity. A sequence that is identical at every position in comparison to a reference sequence is said to be, with respect to a nucleotide sequence or amino acid sequence, identical to the reference sequence and vice-versa. A first nucleotide sequence when observed in the 5′ to 3′ direction is said to be the “complement” of, or complementary to, a second or reference nucleotide sequence observed in the 3′ to 5′ direction if the reverse complement of the first nucleotide sequence is identical at every nucleotide position with the second or reference sequence. As used herein, two nucleic acid sequence molecules are said to exhibit “complete complementarity” when every nucleotide of one of the sequences, when read 5′ to 3′, is complementary to every nucleotide of the other sequence when read 3′ to 5′. A nucleotide sequence that is complementary to a reference nucleotide sequence will exhibit a sequence identical to the reverse complement sequence of the reference nucleotide sequence.


In practicing the present invention, a target gene may be derived from a nematode or other pest species that causes damage to one or more different crop plants and/or yield losses to such plants. Several criteria may be employed in the selection of target genes. The gene may be one whose protein product has a rapid turnover rate, so that dsRNA inhibition will result in a rapid decrease in protein levels. In certain embodiments it is advantageous to select a gene for which a small decrease in expression level results in deleterious effects for the pest. It may be desirable to target a broad range of nematode species and so a nucleotide sequence is selected that is highly conserved across the targeted range of species. Conversely, for the purpose of conferring specificity, in certain embodiments a nucleotide sequence is selected that contains regions that are poorly conserved between individual targeted pest species, or between the targeted pest and other organisms. In certain embodiments it may be desirable to select a nucleotide sequence that exhibits no known homology to sequences in other organisms. As used herein, the term “derived from” refers to a specified nucleotide sequence that may be obtained from a particular source or species.


Target genes for use in the present invention may include, for example, those that play important roles in the viability, growth, development, reproduction and infectivity of a particular pest. These target genes may be one or more of any house keeping gene, transcription factor and pest specific gene that provides an observable phenotype, in particular a phenotype that results in the suppression of feeding on or the inability to utilize a transgenic soybean plant expressing a SCN derived dsRNA as a nutrient source. For example, target genes that are anticipated herein to be effective in producing such phenotypes are similar to those that have been shown to affect the viability, growth, development, mobility, neurological stimulation, muscular function, and reproduction in C. elegans, including but not limited to the following phenotypes: (Adl) adult lethal, (Age), (Bli) blistered, (Bmd) body morphology defect, (Ced) Cell death abnormality, (Clr) clear, (Daf) DAuer Formation, (Dpy) dumpy, (Egl) egg laying defect, (Emb) embryonic lethal, (Evl) everted vulva, (Fem) feminization of XX and XO animals, (Fgc) Fewer Germ Cells, (Fog) feminization of germline, (Gon) GONad development abnormal, (Gro) slow growth, (Him) high incidence of male progeny, (Hya) HYperActive, (Let) larval lethal, (Lin) lineage abnormal, (Lon) long body, (Lpd), (Lva) larval arrest, (Lvl) larval lethal, (Mab) Male ABnormal, (Mei) Defective meiosis, (Mig) MIGration of cells abnormal, (Mlt) molt defect, (Morphology), (Mut) Mutator, (Muv) MUltiVulva, (Oma) Oocyte MAturation defective, (Pat) Paralyzed, Arrested elongation at Two-fold, (Pch) PatCHy coloration, (Pnm) Pronuclear migration alteration in early embryo, (Prl) paralyzed, (Prz) PaRaLyzed, (Pvl) protruding vulva, (Pvu) protruding vulva, (Rde), (Reproductive), (Rol) roller, (Rot) centrosome pair and associated pronuclear rotation abnormal, (Rup) exploded, (Sck) sick, (Sle) Slow embryonic development, (Slu) SLUggish, (Sma) small, (Spd) SpinDle, abnormal embryonic, (Spo) Abnormal embryonic spindle position and orientation, (Step) sterile, (Stp) sterile progeny, (Unc) uncoordinated, (Unclassified), (Vul) vulvaless, (WT), (defect) morphological or behavioral defects. SCN genome sequences predicted to encode various gene products set forth herein annotated to the C. elegans specific genes previously shown to exert a negative effect or observable phenotype in Drosophila or in C. elegans are anticipated to be effective targets for achieving a similar phenotype when expressed in planta as a dsRNA for the purpose of suppressing a gene in SCN specifically targeted by the dsRNA. Gene sequences unique to SCN and not annotated to sequences or gene products from other organisms are also anticipated to be effective for achieving control of SCN when such sequences are provided in the diet of the SCN as a dsRNA because the target genes are unique to SCN pest metabolism, physiology, and pathogenicity.


DNA segments of the present invention are desired for use in constructing dsRNA expression sequences, particularly if the DNA segments exhibit at least from about 70% identity, or at least from about 75% identity, or at least from about 80% identity, or at least from about 90% identity, or at least from about 95% identity, or at least from about 98% identity, or at least about 100% identity to contiguous 17-24 nucleotide sequences found within the nematode genome or other pest sequences targeted for suppression. Sequences less than about 80% identical to a target gene are anticipated to be less effective and so less desirable. Inhibition is specific to the nematodes' gene or gene families, the sequence of which corresponds substantially to the dsRNA. Expression of unrelated genes is not affected. This specificity allows the selective targeting of a nematode or other pest species, resulting in the absence of an effect on non-target organisms exposed to the compositions of the present invention.


The regions predicted to be more effective at dsRNA-mediated gene silencing include regions that exhibit higher siRNA efficiency. Higher siRNA efficiency may be achieved by any technique, including, but not limited to, computational methods such as algorithms designed to predict siRNA efficiency based on thermodynamic characteristics of a given dsRNA (or DNA) sequence, generally considering sequences of from about 17, to about 18, to about 19, to about 20, to about 21, to about 22, to about 23, or even to about 24 contiguous nucleotides corresponding to a sequence that is being targeted for suppression (Schwarz et al., 200, Cell 115:199-208; Chalk et al. 2004, Biochem. Biophys. Res. Comm. 319:264-274; Ui-Tei et al., NAR 2004, 32:936-948; Reynolds et al., 2004, Nature Biotechnology 22:326-330).


Inhibition of a target gene using the stabilized dsRNA technology of the present invention is sequence-specific in that nucleotide sequences corresponding to the duplex region of the RNA are targeted for genetic inhibition. RNA containing a nucleotide sequences identical to a portion of the target gene is preferred for inhibition. RNA sequences with insertions, deletions, and single point mutations relative to the target sequence are also effective for gene specific inhibition. In performance of the present invention, it is preferred that the inhibitory dsRNA and the portion of the target gene share at least from about 75% sequence identity, or from about 80% sequence identity, or from about 90% sequence identity, or from about 95% sequence identity, or from about 99% sequence identity, or even about 100% sequence identity. Alternatively, the duplex region of the RNA may be defined functionally as a nucleotide sequence that hybridizes with a portion of the target gene transcript. A greater sequence homology across a target gene sequence that is less than full-length in comparison to the target gene compensates for a less homologous sequence that more closely approximates the full length of the target gene. The length of a nucleotide sequence that is identical to a portion of the target gene sequence can be from about 21, to about 25, to about 50, to about 100, to about 200, to about 300, or more contiguous bases. Normally, a sequence of greater than 20-100 nucleotides is preferable, although a sequence of greater than about 200-300 nucleotides may be preferred, depending on the length of the target gene. The invention has the advantage of being able to tolerate sequence variations due to genetic mutation, strain polymorphism, or evolutionary divergence. Therefore the nucleic acid molecule introduced into a plant for expression as a pest specific dsRNA gene suppression construct may not need to exhibit absolute homology, and may not need to represent the full length of the sequence targeted for suppression.


The dsRNA molecules may be synthesized either in vivo or in vitro. The dsRNA may be formed by a single self-complementary RNA strand or two complementary RNA strands expressed from separate expression constructs. Endogenous RNA polymerase of the cell may mediate transcription in vivo, or cloned RNA polymerase can be used for transcription in vivo or in vitro. Inhibition may be achieved by specific transcription in an organ, tissue, or cell type; stimulation of an environmental condition (e.g., infection, stress, temperature, chemical inducers); and/or engineering transcription at a developmental stage or age of the transgenic plant expressing the dsRNA construct. The RNA sequences expressed from the recombinant construct may or may not be polyadenylated. The RNA sequences expressed from the recombinant construct may or may not be capable of being translated into a polypeptide by a cell's translational apparatus.


The RNA, dsRNA, siRNA, or miRNA of the present invention intended for use in controlling plant pest infestation may be produced chemically or enzymatically through manual or automated reactions or in vivo in an organism other than the plant for which pest control is intended. RNA may also be produced by partial or total organic synthesis. Any modified ribonucleotide can be introduced by in vitro enzymatic or organic synthesis. The RNA may be synthesized by a cellular RNA polymerase or a bacteriophage RNA polymerase (e.g., T3, T7, SP6). If synthesized chemically or by in vitro enzymatic synthesis, the RNA may be purified prior to introduction into the cell or formulated in an agronomically acceptable carrier and applied to the soil, to the roots, or to the seed prior to planting. For example, RNA can be purified from a mixture by extraction with a solvent or resin, precipitation, electrophoresis, chromatography, or a combination thereof. Alternatively, the RNA may be used with no, or a minimum of, purification to avoid losses due to sample processing. The RNA may be dried for storage or dissolved in an aqueous solution. The solution may contain buffers or salts to promote annealing, and/or stabilization of the duplex strands.


For transcription from a transgene in vivo or from an expression cassette, a regulatory region (e.g., promoter, enhancer, silencer, leader, intron and polyadenylation) may be used to modulate the transcription of the RNA strand (or strands). Therefore, in one embodiment, the polynucleotide sequences constructed to facilitate transcription of the RNA molecules of the present invention are operably linked to one or more promoter sequences functional in a plant host. The polynucleotide sequences may be placed under the control of an endogenous promoter normally present in the host genome. The polynucleotide sequences of the present invention, under the control of an operably linked promoter sequence, may further be flanked by additional sequences that advantageously affect its transcription and/or the stability of a resulting transcript. Such sequences are generally located upstream of the promoter and/or downstream of the 3′ end of the expression construct. The term “operably linked”, as used in reference to a regulatory sequence and a structural nucleotide sequence, means that the regulatory sequence causes regulated expression of the linked structural nucleotide sequence. “Regulatory sequences” or “control elements” refer to nucleotide sequences located upstream, within, or downstream of a structural nucleotide sequence, and which influence the timing and level or amount of transcription, RNA processing or stability, or translation of the associated structural nucleotide sequence. Regulatory sequences may include promoters, translation leader sequences, introns, enhancers, stem-loop structures, repressor binding sequences, termination sequences, pausing sequences, polyadenylation recognition sequences, and the like.


In another embodiment, the nucleotide sequence of the present invention comprises an inverted repeat sequence separated by a spacer sequence. The spacer sequence may be a region comprising any sequence of nucleotides that facilitates secondary structure formation between the inverted repeat sequences. In one embodiment, the spacer sequence is part of the sense or antisense polynucleotide sequence for mRNA. The spacer sequence may alternatively comprise any combination of nucleotides or homologues thereof that are capable of being linked covalently to a nucleic acid molecule. The spacer sequence may comprise a contiguous sequence of nucleotides of from about 8-100 nucleotides in length, or alternatively from about 100-200 nucleotides in length, or from about 200-400 nucleotides in length, or from about 400-500 nucleotides in length, or from about 500 to about 1500 nucleotides in length.


The gene or genes targeted for suppression may be amplified using any thermal amplification means and the precise nucleotide sequence determined. One skilled in the art is able to modify the thermal amplification conditions in order to ensure optimal amplicon product formation, and the amplicon may be used as a template for in vitro transcription to generate sense and antisense RNA with the included minimal promoters.


As used herein, the phrase “a substantially purified nucleic acid”, “an artificial sequence”, “an isolated and substantially purified nucleic acid”, or “an isolated and substantially purified nucleotide sequence”, with respect to a naturally occurring nucleotide sequence, refers to a nucleic acid molecule that is substantially removed from the composition with which it is associated in its natural state. Examples of a substantially purified nucleic acid molecule include: (1) a DNA sequence comprising the contiguous sequence at least about 17, or about 18, or about 19 or more nucleotides in length consisting of a portion of a naturally occurring DNA molecule, but which is not flanked by polynucleotide sequences occur naturally on either end of the contiguous sequence; (2) a nucleic acid molecule comprising a naturally occurring contiguous nucleotide sequence isolated from its naturally occurring state and incorporated into a DNA construct; (3) a cDNA, a genomic DNA fragment isolated and purified substantially from all other genomic DNA to which it was originally naturally associated, an amplicon fragment produced using thermal amplification procedures, or a restriction fragment; (4) recombinant DNA; and (5) synthetic DNA. A substantially purified nucleic acid may also be comprised of one or more segments of any of the sequences referred to hereinabove.


Nucleic acid molecules, fragments thereof, and complements thereof selected from the group consisting of SEQ ID NO:1-45568 may be employed as probes or primers to identify related nucleic acid molecules from other species for use in the present invention to produce desired dsRNA, siRNA, and miRNA molecules. Such related nucleic acid molecules include the nucleic acid molecules that encode the complete amino acid sequence of a protein, and the promoters and flanking sequences of such molecules. In addition, such related nucleic acid molecules include nucleic acid molecules that encode gene family members. Such molecules can be readily obtained by using the above-described nucleic acid molecules or fragments thereof to screen complementary DNA or genomic DNA libraries obtained from a nematode or other plant pest species. The screen can be any physical means such as northern, southern, or any immunologically based screening method that detects either the specific sequence of a nucleotide molecule, or the transcribed and/or translated product of such nucleotide molecule, or any mathematical algorithm that is used for comparing nucleotide sequences in silico.


Nucleic acid molecules, fragments thereof, and complements thereof selected from the group consisting of SEQ ID NO:45569-SEQ ID NO:97729 may also be used in a similar fashion to screen other genomes, libraries, and organisms for related sequences. Such related sequences are expected to include but not be limited to homologues that include nucleic acid molecules that encode, in whole or in part, protein homologues of other pest species, plants or other organisms. Such molecules can be readily obtained by using the above-described nucleic acid molecules or fragments thereof to screen EST, cDNA or gDNA libraries. Such homologous molecules may differ in their nucleotide sequences from those found in one or more of SEQ ID NO:1-SEQ ID NO:45568 and SEQ ID NO:45569 through SEQ ID NO:97729 or complements thereof, because perfect complementarity is not required for such related sequences to hybridize to each other. In a particular embodiment, methods for 3′ or 5′ RACE may be used to obtain such sequences (Frohman, M. A. et al., Proc. Natl. Acad. Sci. (U.S.A.) 85:8998-9002, 1988; Ohara, O. et al., Proc. Natl. Acad. Sci. (U.S.A.) 86:5673-5677, 1989). In general, any of the above described nucleic acid molecules or fragments may be used to generate dsRNA's, siRNA's, and/or siRNA's that are suitable for use in a diet, in a spray-on mix, or in a recombinant DNA construct of the present invention.


As used herein, the phrase “coding sequence”, “structural nucleotide sequence” or “structural nucleic acid molecule” refers to a polynucleotide molecule that is translated into a polypeptide when placed under the control of appropriate regulatory sequences. The structural nucleotide sequence, coding sequence, or structural nucleic acid molecule can be referred to using other terms in the art, but is intended to include DNA as well as RNA molecules. A coding sequence can include, but is not limited to, genomic DNA sequences or portions thereof identified to encode or to be capable of encoding a polypeptide, a cDNA produced as a result of reverse transcription of mRNA that has been purified substantially because if its ability to hybridize to a polyT sequence, expressed sequence tagged (EST) sequences, and recombinant nucleotide sequences produced specifically for expression of a protein sequence.


Two molecules are said to be “minimally complementary” if they can hybridize to one another with sufficient stability to permit them to remain annealed to one another under at least conventional “low-stringency” conditions. Similarly, the molecules are said to be complementary if they can hybridize to one another with sufficient stability to permit them to remain annealed to one another under conventional “high-stringency” conditions. Conventional stringency conditions are described by Sambrook, et al., (1985). Appropriate stringency conditions which promotes hybridization of two different nucleic acid sequences are, for example, incubation of the two sequences together in 6.0× sodium chloride/sodium citrate (SSC) at about 45° C. where one of the two different sequences is tethered in some fashion to a solid support and the untethered sequence is linked to a reporter molecule such as a ligand that can be detected using an immunological means, a fluorophores, a radioisotope, or an enzyme. The hybridization of the two sequences under the above conditions can be followed by a wash in 2.0×SSC at 50° to remove any excess reagents or unbound or unhybridized probe or untethered molecules (Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6). For example, the salt concentration in the wash step can be selected from a low stringency of about 2.0×SSC at 50° C. to a high stringency of about 0.2×SSC at 50° C. In addition, the temperature in the wash step can be increased from low stringency conditions at room temperature (about 22° C.) to high stringency conditions (about 65° C.). Temperature and salt may be varied together or independent of each other.


A nucleic acid for use in the present invention may specifically hybridize to one or more of nucleic acid molecules from nematodes or complements thereof under moderately stringent conditions, for example at about 2.0×SSC and about 65° C. A nucleic acid for use in the present invention will include those nucleic acid molecules that specifically hybridize to one or more of the nucleic acid molecules disclosed therein as set forth in SEQ ID NO:1 through SEQ ID NO:47643 or complements thereof under high stringency conditions. Preferably, a nucleic acid for use in the present invention will exhibit at least from about 70%, at least from about 80%, at least from about 90%, at least from about 95%, at least from about 98% or even about 100% sequence identity with one or more nucleic acid molecules as set forth in SEQ ID NO:45569 through SEQ ID NO:47643.


Nucleic acids of the present invention may be entirely synthetically constructed or assembled piecemeal from naturally occurring or combinations of naturally occurring and synthetic components. All or any portion of the nucleic acids of the present invention may be synthesized without reference to codon usage calculated for any particular plant species, however when a particular sequence is intended to be effective in suppression of one or more genes in one or more pest species, it is preferable that the sequence be selected such that the sequence in any gene or species targeted for suppression be entirely or substantially entirely identical or entirely or substantially entirely complementary to the suppressor sequence.


The present invention also relates to recombinant DNA constructs for expression in a microorganism. Heterologous nucleic acids from which an RNA of interest is transcribed can be introduced into a microbial host cell, such as a bacterial cell or a fungal cell, in order to produce quantities of double stranded RNA for use in suppression of one or more genes in one or more plant pests.


The present invention also contemplates transformation of a polynucleotide sequence of the present invention into a plant to achieve nematode or other plant pest inhibitory levels of expression of one or more dsRNA molecules. A plant transformation vector comprises one or more nucleotide sequences that is/are capable of being transcribed as an RNA molecule and that is/are substantially homologous and/or complementary to one or more nucleotide sequences encoded by the genome of the nematode, or other plant pest, such that upon uptake of the RNA molecule, results in a down-regulation of expression of at least one of the respective nucleotide sequences of the nematode or other plant pest. In one embodiment the plant transformation vector is an isolated and purified DNA molecule comprising a promoter operatively linked to a contiguous nucleotide sequence comprising one or more polynucleotide molecules of the present invention selected from the group consisting of SEQ ID NO:45569 through SEQ ID NO:50775. The polynucleotide molecule includes a segment comprising all or part of a RNA molecule complementary to a targeted RNA within a nematode or pest cell, and may also contain a functional intron sequence positioned either upstream of or within the transcribed RNA sequence, and may also contain a five prime (5′) untranslated leader sequence (i.e., a UTR or 5′-UTR) positioned between the promoter and the point of transcription initiation.


A plant transformation vector may contain sequences for suppression of more than one gene, thus allowing production of more than one dsRNA for inhibiting expression of two or more genes. One skilled in the art will readily appreciate that segments of DNA whose sequence corresponds to that present in different genes can be combined into a single composite DNA segment for expression in a transgenic plant to achieve suppression of one or more nematode or pest genes, one or more plant genes, or a combination thereof. Alternatively, a plasmid of the present invention already containing at least one DNA segment can be modified by the sequential insertion of additional DNA segments between an enhancer and/or promoter and the terminator sequences. A nematode or other plant pest control agent of the present invention may be designed for the inhibition of multiple genes, and the genes to be inhibited can be obtained from the same nematode or other plant pest species in order to enhance the effectiveness of the pest control agent, or from different races/variants of the same pest species, or from different pest species or other organisms. In certain embodiments, the genes derived from different nematodes or other plant pests provide for a broadening of the range of nematodes and other plant pests against which the pest control agent is effective. When multiple genes in one pest are targeted for suppression, a polycistronic DNA element can be fabricated (Fillatti, US Patent Application Publication No. US 2004-0029283 A1).


A promoter that drives expression of a polynucleotide sequence in a particular species of plant is selected for use in expression constructs in which a nucleotide sequence of the present invention is to be used to transform a plant. Promoters that function in different plant species are known in the art. Promoters useful for expression of polypeptides in plants are those that are inducible, viral, synthetic, or constitutive as described in Odell et al. (1985 Nature 313:810-812), and/or promoters that are temporally regulated, spatially regulated, and spatio-temporally regulated. For the purpose of the present invention, e.g., for optimum control of species that feed on roots, it is preferable to achieve the highest levels of expression of these genes within the roots of plants. A number of promoters exhibiting root-enhanced levels of expression of operably linked sequences have been identified. (Lu et al., 2000 J. Plant Phys., 156(2):277-283; U.S. Pat. Nos. 5,837,848 and 6,489,542). Expression of the constructs of the present invention may preferably be from polymerase III promoters as an alternative to conventional polymerase II promoters, and also may be linked to inducible promoters, or heterologous promoters that require heterologous accessory proteins, such as for example, phage T7 promoters and the like. Promoters that are induced as a result of the establishment by a cyst nematode of a feeding site (feeding site specific promoters), and promoters up-regulated by nematode invasion are specifically contemplated for use in the present invention (Gheysen et al., 2002, Ann. Rev. Phytopathol. 40:191-219).


A recombinant DNA vector or construct of the present invention will typically comprise a marker that confers a selectable phenotype on transformed plant cells, and may also be used to select for plants or plant cells that contain the exogenous nucleic acids of the present invention. The marker may encode biocide resistance, antibiotic resistance (e.g., kanamycin, G418 bleomycin, hygromycin, etc.), or herbicide resistance (e.g., glyphosate, etc.). Examples of selectable markers include, but are not limited to, a neo gene (Potrykus et al., 1985 Mol. Gen. Genet. 199:183-188) which codes for kanamycin resistance and can be selected for using kanamycin, G418, etc.; a bar gene which codes for bialaphos resistance; a mutant EPSP synthase gene (Hinchee et al., 1988 Bio/Technology 6:915-922) which encodes glyphosate resistance; a nitrilase gene which confers resistance to bromoxynil (Stalker et al., 1988 J. Biol. Chem. 263:6310-6314); a mutant acetolactate synthase gene (ALS) which confers imidazolinone or sulphonylurea resistance (European Patent Application 154,204); an AMPA-acetyltransferase gene for resistance to phosphonates (U.S. Pat. No. 6,448,476), a methotrexate resistant DHFR gene (Thillet et al., 1988 J. Biol. Chem. 263:12500-12508), and compositions for chloroplast or plastid transformation selection (U.S. Pat. Nos. 5,693,507, 5,451,513, and WO 95/24492).


A recombinant vector or construct of the present invention may also include a screenable marker for monitoring expression. Exemplary screenable markers include a β-glucuronidase or uidA gene (GUS) which encodes an enzyme for which various chromogenic substrates are known (Jefferson, Plant Mol. Biol, Rep. 5.387-405, 1987; Jefferson et al., EMBO J. 6:3901-3907, 1987); an R-locus gene, which encodes a product that regulates the production of anthocyanin pigments (red color) in plant tissues (Dellaporta et al., Stadler Symposium 11:263-282, 1988); a β-lactamase gene (Sutcliffe et al., Proc. Natl. Acad. Sci. (U.S.A.) 75:3737-3741, 1978), a gene which encodes an enzyme for which various chromogenic substrates are known (e.g., PADAC, a chromogenic cephalosporin); a luciferase gene (Ow et al., Science 234:856-859, 1986) a xy/E gene (Zukowsky et al., Proc. Natl. Acad. Sci. (U.S.A.) 80:1101-1105, 1983) which encodes a catechol dioxygenase that can convert chromogenic catechols; an α-amylase gene (Ikatu et al., Bio/Technol. 8:241-242, 1990); a tyrosinase gene (Katz et al., J. Gen. Microbiol. 129:2703-2714, 1983) which encodes an enzyme capable of oxidizing tyrosine to DOPA and dopaquinone which in turn condenses to melanin; an α-galactosidase, which catalyzes a chromogenic α-galactose substrate; and a β-galactosidase which catalyzes the conversion of a chromogenic β-galactoside substrate.


In general a functional recombinant DNA is introduced at a non-specific location in a plant genome. In special cases it may be useful to insert a recombinant DNA construct by site-specific integration. Several site-specific recombination systems exist which are known to function implants include cre-lox as disclosed in U.S. Pat. No. 4,959,317 and FLP-FRT as disclosed in U.S. Pat. No. 5,527,695.


Preferred plant transformation vectors include those derived from a Ti plasmid of Agrobacterium tumefaciens (e.g. U.S. Pat. Nos. 4,536,475, 4,693,977, 4,886,937, 5,501,967 and European Patent Application No. 0122791). Agrobacterium rhizogenes plasmids (or “Ri”) are also useful. Other preferred plant transformation vectors include those disclosed, e.g., by Herrera-Estrella (1983 Nature 303:209-213), Bevan (1983 Nature 304:184-187), Klee (1985 Bio/Technol. 3:637-642) and Eur. Pat Appl. No. EP0 120 516.


Methods and compositions for transforming plants by introducing a recombinant DNA construct into a plant genome includes any of a number of methods known in the art. One method for constructing transformed plants is microprojectile bombardment as illustrated in U.S. Pat. Nos. 5,015,580 (soy), 5,550,318 (corn), 5,538,880 (corn), 6,153,812 (wheat), 6,160,208 (corn), 6,288,312 (rice) and 6,399,861 (corn). Another method for constructing transformed plants is Agrobacterium-mediated transformation in cotton (U.S. Pat. No. 5,159,135), corn (U.S. Pat. No. 5,591,616), and soy (U.S. Pat. Nos. 5,824,877 & 6,384,301).


The term “transgenic plant cell” or “transgenic plant” refers to a plant cell or a plant that contains an exogenous or heterologous polynucleotide sequence. A transgenic plant also comprises progeny (seeds, and plants and seeds produced from such seeds, etc.) of any generation of such a transgenic plant or a seed of any generation of all such transgenic plants wherein said progeny or seed comprises the exogenous or heterologous polynucleotide sequence. The heterologous or polynucleotide sequence is a DNA molecule that is transcribed into the RNA, sRNA, dsRNA, siRNA, or miRNA or fragment thereof of the present invention.


A transgenic plant formed using Agrobacterium mediated transformation methods contains at least a single recombinant DNA sequence inserted into the plant chromosome and is referred to as a transgenic event. Such transgenic plants are referred to as being heterozygous for the inserted exogenous sequence. A transgenic plant homozygous with respect to a transgene can be obtained by sexually mating (selfing) an independent segregant transgenic plant that contains a single exogenous gene sequence to itself, for example an F0 plant, to produce F1 seed. One fourth of the F1 seed produced will be homozygous with respect to the transgene. F1 seed can be tested using a SNP or related thermal amplification assay that allows distinction between heterozygotes and homozygotes (i.e., a zygosity assay).


Transgenic plants can also be prepared by crossing a first plant having a recombinant DNA construct with a second plant lacking the construct. For example, a recombinant DNA designed for targeting the suppression of a target gene can be introduced into a first plant line to produce a transgenic plant which can be crossed with a second plant line to introgress the recombinant gene suppression DNA into the second plant line. The second plant line may already contain or be later transformed or bred with another transgenic line to contain one or more transgenes that are different from the gene suppression construct being introgressed from the first plant line.


Without intending to be limited to any single embodiment, the nucleotide sequences of the present invention exhibit a wide variety of usefulness. For example, the sequences can be used to synthesize dsRNA molecules either in in vivo or in vitro systems selected for their ability to cause gene suppression and therefore pest inhibition and such moleculed can be expressed in a transgenic plant, applied to the rhizosphere or biosphere of a plant, or applied in a seed coating or treatment for causing geen suppression in a pest. The sequences can be used in kits incorporating methods for detecting DNA, RNA, or siRNA's in a seed, plant, tissue, biological sample, meal, oil, flour, food product, commodity product, and the like. The sequences can be used for detecting the presence of a homologous sequence in a biological sample. The sequences can be used to construct a dsRNA for suppression of a target gene and can be linked to an RNA segment that binds specifically to one or more receptor molecules, bringing the dsRNA segment into close proximity to a membrane surface, and increasing its likelihood of being taken up by a cell which contains a gene that is targeted for suppression by the dsRNA.


In one embodiment, a nucleotide sequence of the present invention can be recorded on one or more computer readable media. As used herein, “computer readable media” refers to any tangible medium of expression that can be read and accessed directly by a computer. Such media include, but are not limited to: magnetic storage media, such as floppy discs, hard discs, and magnetic tape. Optical storage media include physical storage devices such as compact diskettes. Electrical storage media include random access and read only memory devices (RAM and ROM). A skilled artisan can readily appreciate that any of the presently known computer readable mediums can be used to create a manufacture comprising a computer readable medium having recorded thereon one or more sequences of the present invention. These devices can be accessed with a computer and used to perform a search and comparison of any other sequence of like composition (i.e., nucleotide sequences compared to nucleotide sequences, amino acid sequences compared to amino acid sequences, etc) to determine whether and to what extent a similarity or identity is present between the sequences being compared.


As used herein, “recorded” refers to a process for storing information on computer readable medium. A skilled artisan can readily adopt any of the presently known methods for recording information on computer readable medium to generate media comprising the nucleotide sequence information of the present invention. A variety of data storage structures are available for creating a computer readable medium having recorded thereon one or more sequences of the present invention. The choice of the data storage structure will generally be based on the means chosen to access the stored information. In addition, a variety of data processor programs and formats can be used to store the sequence information of the present invention on computer readable medium. The sequence information can be represented in a word processing text file, formatted in commercially available software such as WordPerfect, Microsoft Word, or shareware such as Linux, or represented in the form of an ASCII text file, stored in a database application, such as DB2, Sybase, Oracle, or the like. The skilled artisan can readily adapt any number of data processor structuring formats (e.g. text file or database) in order to obtain computer readable medium having recorded thereon the nucleotide sequence information of the present invention.


Computer software is publicly available which allows a skilled artisan to access sequence information provided in a computer readable medium. Software which implements the BLAST (Altschul et al., J. Mol. Bio. 215: 403-410, 1990) and BLAZE (Brutlag, et al., Comp. Chem. 17: 203-207, 1993) search algorithms on a Sybase system can be used to identify open reading frames (ORFs) within sequences such as the EST's that are provided herein and that contain homology to ORFs or proteins from other organisms. Such ORFs are protein-encoding fragments within the sequences of the present invention and are useful in producing commercially important proteins such as enzymes used in amino acid biosynthesis, metabolism, transcription, translation, RNA processing, nucleic acid and a protein degradation, protein modification, and DNA replication, restriction, modification, recombination, and repair.


EXAMPLES
Example 1

This example illustrates the construction and DNA sequence analysis of SCN genome libraries.


SCN genomic DNA libraries (LIB5513, LIB 5514, LIB5519, and LIB5520) were constructed from SCN strain OP25 genomic DNA (Dong et al., 1997, Genetics 146:1311-1318). The libraries were generated by ligating size-selected physically sheared DNA into the high copy number plasmid pUC18 and the resulting ligation mixture was transformed into E coli by electroporation. 10 micrograms of SCN genomic DNA were resuspended into 30 microliters TE buffer. The DNA was sheared by sonication. The sonicated DNA was 5′ end-repaired using T4 DNA polymerase (New England BioLabs) and 10 mM dNTP's in a total reaction volume of 35 microliters and equilibrated to 1× ligation buffer (New England BioLabs). 3′ overhangs were repaired by treatment with T4 polynucleotide kinase. The mixture was incubated at 15° C. for 20 minutes, and transferred to 65° C. for 15 minutes to inactivate the kinase and polymerase, and incubated at room temperature for an additional 10 minutes. The repaired DNA was size fractionated by electrophoresis in a 0.7% agarose gel adjacent to a 1 Kb molecular weight marker at 80 volts for two hours in TBE buffer. The 2-4 KB and 4-8 KB DNA fragments were excised from the agarose gel and transferred into microcentrifuge tubes. The size-selected DNA fragments were isolated from the agarose gel and a second round of size selection was performed to eliminate small DNA fragments co-migrating with the selected range in first gel fractionation. Approximately 100 nanograms of the size-selected repaired DNA was inserted by ligation into a pUC18-HincII digested vector (molar ratio of 5 to 1). The ligated DNA was transformed into E. coli DH10B cells by electroporation and planted to LB plates containing 100 micrograms per milliliter ampicillin and incubated for 18-24 hours at 37 C. Several colonies that arose after incubation were randomly selected. The colonies were tested to determine the average DNA insert size and the average number of colonies in the library that appeared to contain no inserted recombinant DNA. Four libraries were constructed. The average insert size in library LIB5513 was 2-4 KB, in library LIB5514 was 4-8 KB, in library LIB5519 was 2-4 KB, and in library LIB5520 was 4-8 KB. Samples of each library were collected and combined together and deposited with the American Type Culture Collection (ATCC) at Rockville Md., USA on Feb. 15, 2005. The combined library was submitted to ATCC, designated as LIB5513141920, and the ATCC has assigned the patent deposit number PTA-6583 to the deposited material.


The cells of the libraries were then plated on large bioassay plates containing Luria Broth (Difco) supplemented with 100 microgram/milliliter carbenicillin (ICN Biomedical), 64 microgram/milliliter IPTG (Shelton Scientific) and 80 microgram/milliliter X-Gal (Shelton Scientific). Individual blue transformants were then picked into 1.2 ml Terrific Broth (Difco) supplemented with 125 μg/ml Ampicillin (Calbiochem) in 96 deep-well boxes by Genetix Q-bot. The boxes were incubated for 21 hours at 37° C., each well archived to individual wells in 384-well glycerol plates, and then pelleted and stored at −20 C.


Alkaline lysis DNA extraction was performed on samples of pelleted clones using a QUIAGEN bead based platform on an automated robotic preparation system. Eluted DNA was stored for sequencing at 4° C. in a 96-well COSTAR plate. Two microliters of the DNA solution was then transferred into a 384 well microtiter plate (AXYGEN) using a Hamilton MPH96 Pipetting Robot. The pipetted DNA was then denatured for 5 minutes at 95° C., and two microliters of Big Dye Reaction Mix (Big Dye Terminators v3.0, 3.2 pmol sequencing primer, 1×TNK, and 0.5M MgCl2) was then added to the denatured DNA using a Hamilton MPH96 Pipetting Robot. Each clone was sequenced using M13 forward and reverse primers in a PCR sequencing reaction using the conditions as follows: 95° C. for 5 seconds, 45° C. for 5 seconds, 60° C. for 2 minutes 30 seconds for a total of 25 cycles. The sequencing reactions were ethanol precipitated and re-suspended in water and loaded onto an ABI 3730xl Sequencing Analyzer (APPLIED BIOSYSTEMS) to generate sequence trace data for each sample. Approximately 400,000 sequencing reads were generated from the four Heterodera glycines genome libraries.


Example 2

This example illustrates the analysis, characterization, and assembly of the sequences obtained from DNA sequence analysis of the SCN genome libraries.


The sequence trace data was converted to sequence and quality files and standard quality control procedures were applied through the use of the block 0/1 pipelines. Quality control procedures included sequence quality trimming, sequence identity, cloning sequence removal, and contamination identification and removal. The results of the general sequencing pre-processing steps were stored in the sequence database SeqDB. Data passing quality controls were retrieved for inclusion in the assembly step. The dataset to be assembled consisted of 338,266 sequence reads that passed the block 0/1 process, represented by an initial output of 404,372 sequencing reads that were submitted to the block 0/1 process. A file of clone pair constraints was produced on the basis of known clone naming conventions and library construction details (insert size range). The clone pair constraint file consisted of 159,389 pairwise entries. Fasta, quality, and constraint files were used as input to the PCAP program (Version Date: Sep. 3, 2004, Huang, X., Wang, J., Aluru, S., Yang, S.-P. and Hillier, L. (2003): PCAP: A Whole-Genome Assembly Program. Genome Research, 13: 2164-2170), and the sequences were assembled. 45,568 output genomic contig sequences were produced whose sum length represented about 80.8 Million bases. These contig sequences are represented by the sequences as set forth in SEQ ID NO:1-SEQ ID NO:45568 and were subsequently used as input sequences to define generic regions of the SCN genome sequence corresponding to predicted coding sequences (referred to herein as vcDNA's or virtual complementary DNA's) and predicted promoter and intronic sequences.


SCN expressed sequences were collected from public sources and used to compare the genomic sequences identified herein as well as to identify unique sequences not present in any known public database set. Public sequences were collected into a file which contained non-identical contigs from (1) the Genome Sequencing Center at Washington University in St. Louis, Mo., USA (Nemagene clusters; McCarter et al., 2003, J. Nematology 35:465-469), (2) Parkinson contigs (Nembase clusters; Parkinson et al., 2004, Nature Genetics 36:1259-1267), (3) EST's in GenBank not contained in contigs (singletons), and (4) nucleotide sequences representing non-EST DNA sequences in GenBank (e.g., mRNAs). These sequences were compiled into and referred to herein as an essential gene sequence list corresponding to sequences as set forth herein at SEQ ID NO:47644-SEQ ID NO:50775.


Gene finding results were consolidated in a relational database in such a way that each predicted gene is represented by a set of coordinates that define the position of all segments of the gene on the genomic DNA contig (gDNA). The genes are described herein, and in particular in the Feature Fields of the Sequence Listing with reference to the nucleotide positions of each vcDNA giving rise to an amino acid sequence and in the amino acid sequence SEQ ID NO's as nucleotide sequences corresponding to portions of vcDNA's encoding the amino acid sequence. Sequences between the indicated protein coding portions correspond to predicted intronic sequences. Other sequence segments that are represented at least in the genomic sequences set forth in SEQ ID NO:1-SEQ ID NO:45568 include but are not limited to peptide-encoding segments such as initial exon, internal exon, terminal exon, or single exon and the like, and non-coding segments including promoter regions, transcription initiation sequences, transcription termination sequences, and polyadenylation signal sequences, and the like. Often the same position within a gDNA contig is predicted to contain a gene by more than one gene finding program. Thus, in order to prepare a library of genes where each position (locus) of the genome is represented by a single gene, several different gene prediction methods were applied and the results were consolidated according to the following algorithm.


1. For each gDNA contig, all clusters of overlapping genes were defined. Each cluster was assumed to correspond to a single gene. The cluster was defined as a set of sequences located on the same DNA strand and either predicted to overlap based on nucleotide sequence identity along the lengths of the sequences or predicted to be located closer than 50 nucleotides from each other. Only peptide-encoding segments were considered when defining a cluster. The start and end positions of the cluster define the maximal dimension of the gene.


2. For each cluster the preferred gene was selected, which represents this locus in the library. The selection algorithm is described as follows:


(a) All genes in a cluster were ranked by the gene-prediction method that produced them. The ranking by the different methods was intended to describe assumed accuracy of the method in predicting genes. The ranking was ordered arbitrarily using FgeneSH, Genemark.hmm and MT/NAP data results. The MT/GAP results were not ranked at all, but were used only if there were no other prediction for the locus, i.e. cluster contained only gene(s) predicted by MT/GAP.


(b) The highest ranking gene was selected unless there were several equally ranked genes (i.e. predicted by the same method) or the cluster coverage by this gene was below 60%. The cluster coverage was computed as the ratio of the gene length to the length of the cluster (maximal dimension of the gene).


(c) For equally ranked genes, the gene with highest cluster coverage was selected.


(d) If the cluster coverage for the best-ranking gene was below 60%, the lower ranking genes were considered (in the ranking order) and the first one providing a gain in cluster coverage of at least 10% was selected.


(e) If a cluster contained only AAT/GAP-predicted genes—the one with the best cluster coverage was selected.


(f) For all other clusters, additional filtering was completed—only sequences that exhibited a translation product of at least 16 amino acids in length were selected.


(g) If a cluster contained only Genemark.hmm-predicted genes—no gene was selected and the locus was assumed not to contain any gene.


The method described above resulted in a list of “preferred” genes. The actual DNA sequence for each of these genes was prepared by extracting a subsequence (region of a sequence) of a gDNA contig which corresponded to the coordinates of the gene. The sequences prepared contained all predicted exons and introns of the gene. In the case of AAT/GAP and FgeneSH genes they also may contain regions between transcription and translation initiation sequences, and between translation termination and polyadenylation sequences.


The three gene-predicting programs—FgeneSH, Genemark.hmm and MT/NAP—in addition to predicting positions of genes, also predict sequences of the translation product, if any. Thus, the “preferred” genes and their translated peptide sequences were simultaneously predicted by these methods. Virtual cDNA sequences (vcDNA) were prepared from genes derived only from MT/GAP prediction results by extracting regions of genomic DNA (gDNA) corresponding to the predicted exons and splicing them together. These virtual cDNA sequences were translated using a translator tool. The feature fields of indicated peptide SEQ ID NO's identify genomic contig sequence positions (for example, Contig_ID=SeqID_XXX) for the coding sequence contained therein. Additional information provided in the feature fields includes the identity of SCN-specific sequences, the nucleotide positions of these sequences in the vcDNA sequence, homology to existing sequences in publicly available databases, a numerical evaluation of the extent of the homology, and the predicted function if any associated with the peptide.


The vcDNA sequences were used to identify sequences corresponding to SCN specific promoter sequences using the following procedure:


1. For each gene predicted by either of the FgeneSH, Genemark.hmm or MT/NAP prediction algorithms, the position of the first peptide-encoding segment was used as the reference point for sequence extraction. The sequence of the gDNA contig which starts 1000 nucleotides upstream and ends 2 nucleotides downstream of the reference point was extracted.


2. The resulting sequence of the upstream region was shorter if the gene was located closer than 1000 nucleotides to the end of the genomic contig. If there was another gene located upstream and predicted by one of these methods—FgeneSH, Genemark.hmm or AAT/NAP, the upstream region was shortened (truncated) so that it did not overlap with the closest peptide-encoding segment of that gene. If the resulting sequence was shorter than 50 nucleotides, it was not included as a promoter sequence in the library of promoter sequences.


3. If the resulting sequence did not end with the translation initiation codon ATG, i.e., the predicted gene was not N-terminal complete—then the sequence was not included as a promoter sequence in the library of promoter sequences.


4. Sequences located upstream of MT/GAP-predicted genes were not included in the library of promoter sequences since this program did not predict a translation initiation position and in certain situations placed the predicted gene on the wrong strand of a gDNA contig.


Example 3

This example illustrates the annotation of predicted SCN genes.


Two methodologies were used to provide annotations of the predicted Heterodera glycines (SCN) peptides, including Gene ontology (GO) and SmartBlast. Both GO and SmartBlast procedures were developed through homology-based sequence searches. In GO procedures, the peptide sequences from SCN peptides were used to BLAST against a protein sequence database, for example, the non-redundant protein (nr-aa) database maintained by the National Center for Biotechnology Information as part of GenBank. The highly conserved homologues of nr-aa from a variety of species were further selected with a minimal E value of 1E-08. The selected SCN homologues were subjected to the sequence match with a protein sequence database (GO proteins from GO Ontology consortium). Finally, three categories according to the GO Ontology consortium (molecular function; biological process; and cellular component) were used to annotate the SCN sequences. In SmartBlast procedures, the peptide sequences from SCN peptides were also used to blast against the non-redundant protein as described above. The homologues were also selected with a minimal E value of 1E-08. Those homologues were subjected to filtering using some non-meaningful words, such as “putative”. The best meaningful homologues were used for SCN sequence annotation. The conditions used to provide the homolog annotation and the best hit with respect to any predicted SCN gene product were referred to in one or more of the feature fields for each of the SCN protein sequences selected from the group consisting of SEQ ID NO:119146-SEQ ID NO:121220, and were further identified as to molecular function, enzyme activity, cellular component and biological process. Genes characterized as encoding proteins that may be essential for survival based on the proteins' relationship at least to one or more C. elegans homologs and the phenotype of the knockout of the C. elegans homolog were further identified in one or more of the feature fields of each of the peptide sequences. The phenotype observed, abbreviations for each, and the standard nomenclature assigned for each with reference to that same phenotype and nomenclature in C. elegans was identified previously hereinabove.


Example 4

This example illustrates a method for screening the SCN genome sequences, the predicted vcDNA sequences, and the predicted amino acid sequence encoded therefrom, against other sequences and selecting sequences unique to SCN.


The sequences disclosed herein can be used in a method to provide a DNA construct for expression of a dsRNA that is effective for silencing of a gene in a soybean cyst nematode or other plant pest by expressing such DNA construct in the cells of a transgenic plant and providing the plant in the diet of the nematode or pest. DNA sequences can be selected from the sequences of the present invention that are useful in achieving dsRNA-mediated gene silencing by selecting from a target gene a DNA sequence consisting of at least from about 17 to about 21 or more contiguous nucleotides. Effective short interfering RNA's (siRNAs) for gene repression are normally from about 21 to about 23-nt long double-stranded RNA duplexes. These siRNA's are known to incorporate into the RNA-inducing silencing complex (RISC). Once unwound, the single-stranded antisense strand guides RISC to the target mRNA, and induces the cleavage of the target messages, resulting in translational inhibition (Dykxhoorn, et al. Molecular Cell Biology, 4:457-467, 2003). Plant siRNA sequences have been characterized generally as contiguous nucleotide sequences of from about 24 nucleotides in length (Tang, 2003, Genes & Development 17:49-63). It is preferred that interfering RNA molecules are selected from the sequences as set forth in SEQ ID NO:1-SEQ ID NO:97729 to limit the un-intended “off-target” effect of gene repression by limiting the potential base-pairing with unintended targets of the host or other non-target organisms.


Example 5

This example illustrates the identification of SCN genes that can be targeted for suppression using the nucleotide sequences of the present invention.


A comparison of the SCN genes was made to the genes identified in C. elegans for which knockouts have been previously identified to result in an observable phenotype. RNAi phenotypes include maternal sterile, embryonic lethal and a variety of postembryonic phenotypes. The relationship between C. elegans knockout phenotypes and their protein sequences were obtained. These protein sequences were then compared to the protein sequences translated from the SCN genomic sequences of the present invention.


A BLAST searchable “All Protein Database” was constructed, which was composed of genome-wise SCN peptides and C. elegans proteins. A reciprocal blast procedure was used to identify the possible orthologues of C. elegans for each SCN peptide.


The All Protein Database was queried using protein sequences of the SCN peptides using the “blastp” algorithm with an E-value cutoff of 1e-8. Up to 1000 hits were retained for each SCN peptide used in the query, and separated by organism names, either C. elegans or SCN. For C. elegans, a list was retained for the hits with SCN sequences exhibiting a more significant E-value than the best hit of the organism. The list contains likely duplicated SCN genes, and was referred to as a Core List. Another list was retained for all the hits from each organism, sorted by the E-value, and was referred to as a Hit List. The hit was identified as an orthologue of the query sequence if it was within the Core List.


Knockout phenotypes of SCN were inferred according to the degree of evolutionary relationship determined to exist between SCN and C. elegans proteins with reference to the knockout phenotypes of C. elegans genes, referred to herein above. For example, C. elegans C37H5.8 corresponds to a HSP-6 protein, and a knockout of this gene has been associated with the observed phenotypes of embryonic lethality and larval arrest. Orthologue identification from the above query indicated that an SCN amino acid sequence corresponding to SEQ ID NO:119310 is an orthologue of C37H5.8. Therefore, it is believed that because of the relationship of the SCN sequence corresponding to SEQ ID NO:119310 to the C. elegans orthologue C37H5.8, suppression of the SCN gene corresponding to SCN vcDNA sequence as set forth at SEQ ID NO:45733 encoding the C37H5.8 orthologue at SEQ ID NO:119310 would be expected to result in an observable phenotype corresponding to embryonic lethal and/or larva arrest in SCN. SCN genes have been categorized based on their relationship to identifiable orthologues with genes or sequences in other organisms and some are further identified as essential genes. Such information has been provided for each amino acid sequence predicted from the vcDNA sequences and is listed in the feature fields for each sequence in the sequence listing. The feature field in the sequence listing has been used to identify important features of the DNA molecules of the present invention. A DNA construct that contains target sequences from multiple SCN essential genes can be constructed to express a chimeric dsRNA molecule that affects more than one SCN gene. This aspect of the present invention reduces the possibility of selecting for a population of SCN that is unaffected by the dsRNA molecule.


SCN genes were grouped into Pfam protein families. Pfam is a comprehensive database of protein domain families, based on multiple alignments of protein domains or conserved protein regions (Nucleic Acids Research 2004 32:D138-D141; Proteins 28:405-420, 1997.). Peptide sequences of a subset of SCN genes have been matched to Pfam entries with HMMPFAM program, with an expectation value cutoff of 0.1 (Biological sequence analysis: probabilistic models of proteins and nucleic acids, Cambridge University Press, 1998.) The subset included 5207 SCN protein sequences that were analyzed by this method, and 3397 of the 5207 protein sequences were grouped into 909 families, as set forth in Table 1.


In order to target a protein gene family for suppression with a single dsRNA molecule, it may be necessary to identify conserved DNA sequence regions among protein gene family members. After the amino acid sequence translations from the virtual cDNA sequences were grouped into protein families, the conserved sequence regions were identified through multiple sequence alignment of the DNA sequences of the family members. For example, using the program CLSUTALW (ref. Nucleic Acids Res. 22:4673-4680), member sequences of a Pfam group can be aligned. One example is illustrated by an alignment of SEQ ID NO's representative of the nucleotide sequences encoding the protein family members in the MRP_L47 family, a mitochondrial ribosomal protein family, corresponding to SEQ ID NO:49132 (HG02471), SEQ ID NO:50709 (HGC08009), and SEQ ID NO:46538 (HG227019.C1.o1.np). An alignment of these three sequences allows the identification of conserved contiguous residues present in each of the three sequences. The conserved segments consisting of at least 21 contiguous nucleotides are representative of the preferred polynucleotide regions for expression in a double stranded RNA sequence for use in targeting the suppression of each member of the entire gene family. The comparison of protein sequences of family members identified and grouped in Table 1 enables the identification of related polynucleotide regions common among the family members by locating the corresponding cDNA and genomic contig sequences identified in the feature field of the Sequence Listing. Using this method of comparison, the protein sequences of family members identified in Table 1 and in SEQ ID NO:119146-SEQ ID NO:124352 allows the skilled artisan to identify the related polynucleotide regions that are common among the family members by locating the corresponding virtual cDNA (vcDNA) and genomic contiguous sequences as set forth in SEQ ID NO:1-SEQ ID NO:119145. These sequences can then be used in a DNA construct to express a dsRNA molecule in plant cells that is directed to the suppression of one or more genes in any of one or more plant pests. These polynucleotides can then be used in a DNA construct to express a homologous dsRNA molecule in plant cells.

TABLE 1SCN GENE FAMILIES and ANNOTATIONSSCN gene familiesGene nameSCN gene family membersProtein annotationbZIP_1SeqID_122625 SeqID_119811bZIP transcription factorSeqID_124042 SeqID_124331SeqID_121799Mito_carrSeqID_121264 SeqID_122047Mitochondrial carrier proteinSeqID_122051 SeqID_122111SeqID_122259 SeqID_122457SeqID_122504 SeqID_122594SeqID_121347 SeqID_121360SeqID_121361 SeqID_122965SeqID_122993 SeqID_123057SeqID_123135 SeqID_123186SeqID_123212 SeqID_123240SeqID_123284 SeqID_123348SeqID_123379 SeqID_123391SeqID_123930 SeqID_119420SeqID_119539 SeqID_120127SeqID_120194 SeqID_120280SeqID_121563 SeqID_121585SeqID_120988 SeqID_121115SeqID_123589 SeqID_123608SeqID_123626 SeqID_123732SeqID_123814 SeqID_123863SeqID_121678 SeqID_124088SeqID_124151 SeqID_124350SeqID_121751 SeqID_121752SeqID_121870bZIP_2SeqID_121260 SeqID_122546Basic region leucine zipperSeqID_122625 SeqID_119811SeqID_124042 SeqID_121799Sec7SeqID_120412Sec7 domainMutS_IVSeqID_119581MutS family domain IVCtaG_Cox11SeqID_119768Cytochrome c oxidase assemblyprotein CtaSynaptobrevinSeqID_122614 SeqID_122853SynaptobrevinSeqID_123133 SeqID_123159SeqID_123266 SeqID_120966SeqID_121065 SeqID_123723Fer2SeqID_122093 SeqID_1223572Fe—2S iron-sulfur cluster bindingSeqID_119370 SeqID_124223domainSeqID_124309WD40SeqID_121225 SeqID_121265WD domain, G-beta repeatSeqID_122030 SeqID_122050SeqID_122140 SeqID_122192SeqID_122212 SeqID_122243SeqID_122288 SeqID_122350SeqID_122349 SeqID_122434SeqID_122449 SeqID_122509SeqID_122514 SeqID_122539SeqID_122547 SeqID_121310SeqID_121337 SeqID_121375SeqID_122883 SeqID_122959SeqID_122996 SeqID_123045SeqID_123090 SeqID_123107SeqID_123122 SeqID_123198SeqID_123318 SeqID_123329SeqID_121391 SeqID_119248SeqID_119272 SeqID_119279SeqID_119343 SeqID_119431SeqID_119538 SeqID_119627SeqID_119670 SeqID_119769SeqID_119826 SeqID_119840SeqID_119913 SeqID_119990SeqID_121523 SeqID_120032SeqID_120112 SeqID_120134SeqID_120235 SeqID_120302SeqID_120344 SeqID_120362SeqID_120457 SeqID_120458SeqID_120507 SeqID_120508SeqID_120576 SeqID_120747SeqID_120826 SeqID_120866SeqID_120885 SeqID_120945SeqID_120994 SeqID_120997SeqID_121007 SeqID_121094SeqID_121096 SeqID_121116SeqID_121184 SeqID_121199SeqID_123563 SeqID_123612SeqID_123643 SeqID_123841SeqID_123880 SeqID_121640SeqID_123965 SeqID_124104SeqID_124118 SeqID_124150SeqID_124186 SeqID_124334SeqID_121790 SeqID_121804SeqID_121816 SeqID_121840SeqID_121844Skp1SeqID_121262 SeqID_122339Skp1 family, dimerisation domainSeqID_121318 SeqID_119848Fer4SeqID_122076 SeqID_1221984Fe—4S binding domainSeqID_120449 SeqID_120455SeqID_124131Enolase_CSeqID_122156 SeqID_123621Enolase, C-terminal TIM barreldomainMucinSeqID_120313Mucin-like glycoproteinNHLSeqID_121326NHL repeatFATSeqID_122973 SeqID_120212FAT domainIso_dhSeqID_122456 SeqID_120630Isocitrate/isopropylmalateSeqID_123561dehydrogenaseAPHSeqID_122384 SeqID_122682Phosphotransferase enzymeSeqID_121416 SeqID_119410familySeqID_119453 SeqID_119764SeqID_121622 SeqID_120808SeqID_124140 SeqID_121803SufSeqID_122937Suppressor of forked protein (Suf)Enolase_NSeqID_122156 SeqID_123621Enolase, N-terminal domainLdh_1_CSeqID_122492 SeqID_120389lactate/malate dehydrogenase,SeqID_123782alpha/beta C-tHMG_CoA_syntSeqID_122417 SeqID_120809Hydroxymethylglutaryl-coenzymeSeqID_121837A synthasPLDcSeqID_119325 SeqID_120704Phospholipase D Active site motifGlycos_transf_1SeqID_121164Glycosyl transferases group 1Dala_Dala_lig_CSeqID_119260D-ala D-ala ligase C-terminusKunitz_BPTISeqID_122170 SeqID_119622Kunitz/Bovine pancreatic trypsinSeqID_119623inhibitoNuc_sug_transpSeqID_119240Nucleotide-sugar transportercobWSeqID_122575CobW/HypB/UreG, nucleotide-binding domainL15SeqID_121270 SeqID_122374Ribosomal protein L15SeqID_123713Ldh_1_NSeqID_122492 SeqID_121371lactate/malate dehydrogenase,SeqID_122868 SeqID_120389NAD binding doSeqID_123782CLP_proteaseSeqID_122193 SeqID_122948Clp proteaseSeqID_124275 SeqID_120143SeqID_123619 SeqID_121952HEAT_PBSSeqID_122486 SeqID_120345PBS lyase HEAT-like repeatSeqID_123693NICSeqID_121904Nucleoporin interacting componentSecYSeqID_122609 SeqID_123948eubacterial secY proteinPCISeqID_122070 SeqID_122383PCI domainSeqID_119309 SeqID_119633SeqID_120088 SeqID_120892SeqID_123711 SeqID_123975SeqID_121783Abhydro_lipaseSeqID_119405ab-hydrolase associated lipaseregionMra1SeqID_123459Suppressor Mra1tRNA-synt_1bSeqID_123017 SeqID_120260tRNA synthetases class I (W andSeqID_121648Y)NIFSeqID_121832NLI interacting factor-likephosphataseLaminin_G_1SeqID_120488Laminin G domaintRNA-synt_1cSeqID_122217 SeqID_122354tRNA synthetases class I (E andSeqID_123425 SeqID_119559Q), cataSeqID_120889 SeqID_123797Laminin_G_2SeqID_120488Laminin G domaintRNA-synt_1eSeqID_119570 SeqID_119679tRNA synthetases class I (C)catalytic dAcyl-CoA_dh_MSeqID_123412 SeqID_119407Acyl-CoA dehydrogenase, middleSeqID_119554domainMak16SeqID_123061 SeqID_120231Mak16 proteinSeqID_120297Clp1SeqID_120900Pre-mRNA cleavage complex IIprotein Clp1Guanylate_cycSeqID_119460Adenylate and Guanylate cyclasecatalytAcyl-CoA_dh_NSeqID_122997 SeqID_123412Acyl-CoA dehydrogenase, N-SeqID_119407 SeqID_119740terminal domaSeqID_124296RNase_PH_CSeqID_122716 SeqID_1201463′ exoribonuclease family, domain 2MoeZ_MoeBSeqID_120491MoeZ/MoeB domainChitin_synth_2SeqID_120016Chitin synthasePAP_centralSeqID_12343 SeqID_119183Poly(A) polymerase central domainSeqID_120181 SeqID_120989SeqID_121025 SeqID_123810rveSeqID_119251 SeqID_119810Integrase core domainRED_NSeqID_120411RED-like protein N-terminal regionAnkSeqID_123070 SeqID_119179Ankyrin repeatSeqID_119568 SeqID_119583SeqID_119584 SeqID_119680SeqID_119792 SeqID_119874SeqID_120226 SeqID_121617CKSSeqID_122543 SeqID_121066Cyclin-dependent kinaseregulatory subunitBand_7SeqID_122693 SeqID_119534SPFH domain/Band 7 familySeqID_123829PAF-AH_p_IISeqID_120737 SEQID_121196Platelet-activating factoracetylhydrolasSF-assemblinSeqID_120078SF-assemblin/beta giardinRibosomal_S24eSeqID_122807 SeqID_120291Ribosomal protein S24eSeqID_124204 SeqID_124210SeqID_121954Ribosomal_S17eSeqID_122394 SeqID_119329Ribosomal S17SeqID_123964Sof1SeqID_122192 SeqID_120865Sof1-like domainSeqID_120866 SeqID_124334LrgBSeqID_120612LrgB-like familyDUF1650SeqID_122945 SeqID_123077Protein of unknown functionSeqID_120203(DUF1650)Laminin_EGFSeqID_121392 SeqID_119297Laminin EGF-like (Domains III andSeqID_119335 SeqID_119714V)SeqID_119816 SeqID_120849SeqID_120911TruB_NSeqID_119847TruB family pseudouridylatesynthase (N termtRNA-synt_2bSeqID_122133 SeqID_122759tRNA synthetase class II coreSeqID_119832 SeqID_120346domain (G,SeqID_120388 SeqID_120642SeqID_121608 SeqID_121085SeqID_121645 SeqID_124158SeqID_124247tRNA-synt_2cSeqID_120391tRNA synthetases class II (A)InnexinSeqID_123073 SeqID_123480InnexinSeqID_120601tRNA-synt_2dSeqID_122320 SeqID_119409tRNA synthetases class II coreSeqID_121659 SeqID_124019domain (FMFS_1SeqID_119430 SeqID_119607Major Facilitator SuperfamilySeqID_120612Cyt-b5SeqID_122136 SeqID_122505Cytochrome b5-like Heme/SteroidSeqID_119879 SeqID_123602bindinMAM33SeqID_122105 SeqID_123650Mitochondrial glycoproteinZZSeqID_120217Zinc finger, ZZ typeDpy-30SeqID_122847 SeqID_121204Dpy-30 motifK_tetraSeqID_122369 SeqID_119930K+ channel tetramerisation domainSeqID_120530Tim44SeqID_123264 SeqID_119402Tim44-like domainMtap_PNPSeqID_121534 SeqID_121548Phosphorylase family 2SeqID_121569PDZSeqID_119401 SeqID_120527PDZ domain (Also known as DHRor GLGF)CHCHSeqID_123527CHCH domainRibonuc_red_smSeqID_122881 SeqID_120548Ribonucleotide reductase, smallSeqID_121571 SeqID_121595chainPro_isomeraseSeqID_12368 SeqID_122436Cyclophilin type peptidyl-prolyl cis-SeqID_12839 SeqID_123273trSeqID_121384 SeqID_119354SeqID_120150 SeqID_123750SeqID_123891 SeqID_121707SeqID_124224DIXSeqID_123325DIX domainHydrolaseSeqID_119435haloacid dehalogenase-likehydrolasePeptidase_C1SeqID_121248 SeqID_121247Papain family cysteine proteaseSeqID_121267 SeqID_122029SeqID_122165 SeqID_122555SeqID_122593 SeqID_123403SeqID_123446 SeqID_121447SeqID_119313 SeqID_123567SeqID_123581 SeqID_123578SeqID_121782 SeqID_121853SeqID_121879Peptidase_C2SeqID_123259 SeqID_120192Calpain family cysteine proteaseSeqID_120374E1-E2_ATPaseSeqID_122184 SeqID_119574E1-E2 ATPaseSeqID_119844 SeqID_120162Peptidase_M13SeqID_122218Peptidase family M13FLYWCHSeqID_119876 SeqID_120405FLYWCH zinc finger domainPeptidase_M14SeqID_122059 SeqID_122194Zinc carboxypeptidaseSeqID_119546 SeqID_123807SeqID_124007Sec62SeqID_121454Translocation protein Sec62Sec63SeqID_120914Sec63 domainPeptidase_M16SeqID_123296 SeqID_119755Insulinase (Peptidase family M16)SeqID_124192EGFSeqID_119714 SeqID_119985EGF-like domainSeqID_120013 SeqID_120488SeqID_120732 SeqID_120849Ribonuc_red_IgcSeqID_121385Ribonucleotide reductase, barreldomaUPF0027SeqID_121387Uncharacterized protein familyUPF0027APC10SeqID_122598 SeqID_119693Anaphase-promoting complex,SeqID_123584subunit 10 (APC1Integrin_alphaSeqID_119890Integrin alpha cytoplasmic regionDynein_heavySeqID_120763 SeqID_120973Dynein heavy chainChromoSeqID_119766 SeqID_120561‘chromo’ (CHRromatinOrganisation MOdifier)SurpSeqID_122195 SeqID_120133Surp moduleSeqID_124100 SeqID_124099Lipase_GDSLSeqID_122975 SeqID_120739GDSL-like Lipase/AcylhydrolaseASCSeqID_119784Amiloride-sensitive sodiumchannelF-actin_cap_ASeqID_123125 SeqID_119252F-actin capping protein alphasubunitRibosomal_L2_CSeqID_122387 SeqID_122576Ribosomal Proteins L2, C-terminalSeqID_119709 SeqID_123662domaSeqID_124015 SeqID_121759SeqID_121891TPR_1SeqID_122178 SeqID_122894Tetratricopeptide repeatSeqID_123022 SeqID_119202SeqID_119274 SeqID_119429SeqID_119528 SeqID_119687SeqID_120074 SeqID_120605SeqID_121168 SeqID_121175SeqID_123940NTP_transf_2SeqID_122343 SeqID_123810Nucleotidyltransferase domainTPR_2SeqID_122178 SeqID_122894Tetratricopeptide repeatSeqID_123022 SeqID_119202SeqID_119274 SeqID_119429SeqID_119528 SeqID_119687SeqID_120074 SeqID_120591SeqID_120605 SeqID_120748SeqID_120947 SeqID_121168SeqID_121175 SeqID_123940TPR_4SeqID_119202Tetratricopeptide repeatCOesteraseSeqID_122237 SeqID_119940CarboxylesteraseSeqID_120067 SeqID_120891SeqID_124012TLE_NSeqID_123010 SeqID_119913Groucho/TLE N-terminal Q-richdomainF-boxSeqID_122666 SeqID_124313F-box domainMRP-L47SeqID_122709 SeqID_120115Mitochondrial 39-S ribosomalSeqID_124286protein L47 (MRCol_cuticle_NSeqID_121964 SeqID_121988Nematode cuticle collagen N-SeqID_121996 SeqID_122013terminal doSeqID_122019 SeqID_122027SeqID_122291 SeqID_122304SeqID_122313 SeqID_122472SeqID_122536 SeqID_122871SeqID_122911 SeqID_122927SeqID_122954 SeqID_123029SeqID_123087 SeqID_123176SeqID_123223 SeqID_123423SeqID_119404 SeqID_119564SeqID_119730 SeqID_119798SeqID_120106 SeqID_120175SeqID_120474 SeqID_120705SeqID_120750 SeqID_123663SeqID_121026 SeqID_121073SeqID_123664 SeqID_123677SeqID_123676 SeqID_123769SeqID_123783 SeqID_123827SeqID_121953 SeqID_121957Na_H_ExchangerSeqID_120225 SeqID_120403Sodium/hydrogen exchangerfamilyATP-synt_abSeqID_122131 SeqID_122660ATP synthase alpha/beta family,SeqID_122734 SeqID_122935nucleotSeqID_119160 SeqID_123656SeqID_123806 SeqID_123805SeqID_124022zf-B_boxSeqID_120931B-box zinc fingerFMO-likeSeqID_122732 SeqID_120410Flavin-binding monooxygenase-SeqID_124057likeRibosomal_S26eSeqID_122109 SeqID_120185Ribosomal protein S26eSeqID_123649Ribosomal_S19eSeqID_122794 SeqID_123065Ribosomal protein S19eSeqID_123504 SeqID_124003SeqID_124229 SeqID_121859Peptidase_C12SeqID_122328 SeqID_120007Ubiquitin carboxyl-terminalSeqID_121135 SeqID_124005hydrolase,Peptidase_C13SeqID_122230 SeqID_121453Peptidase C13 familySeqID_121467 SeqID_120219SeqID_120537 SeqID_121597SeqID_124041 SeqID_121721SeqID_121872 SeqID_121883Peptidase_C14SeqID_123489Caspase domainPeptidase_M24SeqID_122645 SeqID_124035metallopeptidase family M24SeqID_124033Ribosomal_L6e_NSeqID_122489 SeqID_123683Ribosomal protein L6, N-terminaldomaPaf1SeqID_119976Paf1DUF1671SeqID_123049Protein of unknown function(DUF1671)Complex1_51KSeqID_122912 SeqID_123104Respiratory-chain NADHSeqID_123995dehydrogenase 51Peptidase_M28SeqID_122327 SeqID_122621Peptidase family M28SeqID_119445 SeqID_123920PFKSeqID_120213PhosphofructokinaseDAGATSeqID_123153 SeqID_120048Diacylglycerol acyltransferaseSeqID_121202RNA_pol_Rpb7_NSeqID_122067 SeqID_121484RNA polymerase Rpb7, N-terminalSeqID_121111 SeqID_123690domainDnaJ_CSeqID_121279 SeqID_122722DnaJ C terminal regionSeqID_121405 SeqID_120879SeqID_123882zf-DNLSeqID_122584 SeqID_124310DNL zinc fingerCNHSeqID_123483 SeqID_120283CNH domainSeqID_121627DNA_ligase_A_MSeqID_123044 SeqID_119293ATP dependent DNA ligasedomainDNA_ligase_A_NSeqID_119293DNA ligase N terminusLACTSeqID_119637Lecithin:cholesterolacyltransferaseRibosomal_L11_NSeqID_122466 SeqID_122612Ribosomal protein L11, N-terminalSeqID_122798 SeqID_120200domSeqID_120321 SeqID_121603SeqID_123787 SeqID_124218Sec8_exocystSeqID_123021Sec8 exocyst complex componentspecificCoatomer_ESeqID_122494 SeqID_120679Coatomer epsilon subunitSeqID_123777TT_ORF2SeqID_122828 SeqID_123933TT viral ORF2DNA_primase_SSeqID_122299 SeqID_123968DNA primase small subunitNACHTSeqID_122493 SeqID_119176NACHT domainRibosomal_S27eSeqID_122380 SeqID_122513Ribosomal protein S27SeqID_119700 SeqID_120329SeqID_123636 SeqID_123738Na_K-ATPaseSeqID_122309 SeqID_124043Sodium/potassium ATPase betachainTIP49SeqID_122138 SeqID_121376TIP49 C-terminusSeqID_119157 SeqID_119793SeqID_123801GNT-ISeqID_119837GNT-I familyClathrinSeqID_119727Region in Clathrin and VPSMutS_VSeqID_119581 SeqID_119594MutS domain VSeqID_120288 SeqID_120369Acyl_CoA_thioSeqID_122480 SeqID_121327Acyl-CoA thioesteraseSeqID_123795 SeqID_121650SeqID_121919PTPASeqID_121340 SeqID_123363Phosphotyrosyl phosphateSeqID_119880activator (PTPA) prUPF0113SeqID_122707Uncharacterised protein family(UPF0113)dsrmSeqID_119344Double-stranded RNA bindingmotifTom22SeqID_122077 SeqID_124129Mitochondrial import receptorsubunit Tom22EGF_CASeqID_119714 SeqID_119985Calcium binding EGF domainSeqID_120013 SeqID_120732lsy1SeqID_122830 SeqID_120949lsy1-like splicing familyELM2SeqID_120201 SeqID_120354ELM2 domainSeqID_121080HA2SeqID_122331 SeqID_119298Helicase associated domain (HA2)SeqID_120256 SeqID_120575SeqID_123546 SeqID_124253RdRPSeqID_119588RNA dependent RNA polymerase2-oxoacid_dhSeqID_119602 SeqID_1215252-oxoacid dehydrogenasesSeqID_120978acyltransferaseUDPGPSeqID_123086UTP-glucose-1-phosphateuridylyltransferaseArfSeqID_121288 SeqID_122204ADP-ribosylation factor familySeqID_122360 SeqID_122392SeqID_122397 SeqID_122560SeqID_122575 SeqID_122579SeqID_122619 SeqID_122671SeqID_123132 SeqID_123241SeqID_123422 SeqID_119204SeqID_119645 SeqID_119853SeqID_120523 SeqID_120531SeqID_121613 SeqID_123568SeqID_123726 SeqID_123740SeqID_123960 SeqID_123988SeqID_123991UDPGTSeqID_119517UDP-glucoronosyl and UDP-glucosyl transferasCystatinSeqID_122804 SeqID_120505Cystatin domainSeqID_123974ATP-synt_F6SeqID_122448 SeqID_120742Mitochondrial ATP synthaseSeqID_123680coupling factoCyto_heme_lyaseSeqID_122106 SeqID_124078Cytochrome c/c1 heme lyaseDNA_pol_alpha_BSeqID_120691DNA polymerase alpha subunit BArmSeqID_122305 SeqID_122870Armadillo/beta-catenin-like repeatSeqID_121423 SeqID_119427SeqID_121148 SeqID_121201SeqID_124047 SeqID_121836NTP_transferaseSeqID_122569 SeqID_122914Nucleotidyl transferaseSeqID_120001LSMSeqID_122052 SeqID_122121LSM domainSeqID_122135 SeqID_122405SeqID_122422 SeqID_122654SeqID_122720 SeqID_122860SeqID_123047 SeqID_119181SeqID_119424 SeqID_120049SeqID_120122 SeqID_121030SeqID_123658 SeqID_123725SeqID_123789 SeqID_123944SeqID_124195NMTSeqID_120382Myristoyl-CoA:protein N-myristoyltransferaseTRAP-deltaSeqID_122068 SeqID_120359Translocon-associated protein,SeqID_123786delta subunDDOST_48 kDSeqID_121541 SeqID_121553Dolichyl-diphosphooligosaccharide-proteinRibosomal_S28eSeqID_122545 SeqID_120643Ribosomal protein S28eSeqID_123620UPF0120SeqID_122142 SeqID_121137Uncharacterised protein family(UPF0120)Ala_racemase_NSeqID_121220Alanine racemase, N-terminaldomainMFAP1_CSeqID_119946Micro-fibrillar-associated protein 1C-termiAminotran_3SeqID_119827Aminotransferase class-IIIACBPSeqID_121287 SeqID_119464Acyl CoA binding proteinPHDSeqID_123154 SeqID_119379PHD-fingerSeqID_119449 SeqID_119766SeqID_120788Aminotran_4SeqID_122578 SeqID_119653Aminotransferase class IVSeqID_119738E3_bindingSeqID_121975 SeqID_120333e3 binding domainRibosomal_L37aeSeqID_122763 SeqID_119302Ribosomal L37ae protein familySeqID_120838 SeqID_123781SeqID_124249zf-CCCHSeqID_121226 SeqID_122128Zinc finger C-x8-C-x5-C-x3-H typeSeqID_122531 SeqID_122888(and similSeqID_123330 SeqID_119331SeqID_119820 SeqID_120375SeqID_120658 SeqID_121599SeqID_121064 SeqID_123669HMG-CoA_redSeqID_120545Hydroxymethylglutaryl-coenzymeA reductusCRAL_TRIOSeqID_120518CRAL/TRIO domainHATSeqID_120676HAT (Half-A-TPR) repeatAsn_synthaseSeqID_120554 SeqID_120715Asparagine synthasePDCD9SeqID_122558 SeqID_123575Mitochondrial 28S ribosomalprotein S30 (PDCzf-CSLSeqID_122103 SeqID_120511CSL zinc fingerSeqID_123712UBACTSeqID_122346 SeqID_121362Repeat in ubiquitin-activatingSeqID_121432 SeqID_121452(UBA) proteinSeqID_119364 SeqID_124102SeqID_121935Biotin_carb_CSeqID_120976Biotin carboxylase C-terminaldomainFrizzledSeqID_120827Fizzled/Smoothened familymembrane regionFH2SeqID_120125Formin Homology 2 DomainCoatomer_WDADSeqID_119681 SeqID_120257Coatomer WD associated regionzf-RanBPSeqID_123008 SeqID_121015Zn-finger in Ran binding proteinand othCOQ7SeqID_122535 SeqID_119203Ubiquinone biosynthesis proteinSeqID_123627COQ7Ammonium_transpSeqID_120017Ammonium Transporter FamilyPTR2SeqID_119430 SeqID_120270POT familyMPPNSeqID_119150MPPN (rrm-like) domainPyr_redoxSeqID_119535 SeqID_120156Pyridine nucleotide-disulphideoxidoreductaCarn_acyltransfSeqID_119995Choline/Carnitine o-acyltransferaseTUDORSeqID_121121Tudor domainKrr1SeqID_120316Krr1 familyAspSeqID_122475 SeqID_123036Eukaryotic aspartyl proteaseSeqID_123118 SeqID_123359SeqID_121455 SeqID_120476SeqID_120494 SeqID_123971BACKSeqID_120833 SeqID_121637BTB And C-terminal KelchUso1_p115_headSeqID_122989Uso1/p115 like vesicle tetheringproATP-synt_ab_CSeqID_122371 SeqID_122660ATP synthase alpha/beta chain, CSeqID_123656 SeqID_123806terminSeqID_123805Thiolase_CSeqID_122429 SeqID_122772Thiolase, C-terminal domainSeqID_120310 SeqID_1238645_nucleotidSeqID_1206485′ nucleotidase familyFHASeqID_120176 SeqID_120836FHA domainPIDSeqID_123247Phosphotyrosine interactiondomain (PTB/PID)Citrate_syntSeqID_122637 SeqID_122705Citrate synthaseSeqID_120172 SeqID_124243Helicase_CSeqID_122054 SeqID_122318Helicase conserved C-terminalSeqID_122331 SeqID_122336domainSeqID_122624 SeqID_122677SeqID_119233 SeqID_119298SeqID_119324 SeqID_119620SeqID_119843 SeqID_119917SeqID_120153 SeqID_120168SeqID_120256 SeqID_120323SeqID_120342 SeqID_120575SeqID_120689 SeqID_120814SeqID_120914 SeqID_121032SeqID_123546 SeqID_121674SeqID_124119 SeqID_124137SeqID_124148 SeqID_124325Neur_chan_LBDSeqID_121005 SeqID_121087Neurotransmitter-gated ion-channel ligIon_trans_2SeqID_121451 SeqID_119613Ion channelSeqID_119930 SeqID_119939Myosin_tail_1SeqID_123386Myosin tailTMS_TDESeqID_122691 SeqID_119438TMS membrane protein/tumourSeqID_124263differentially eP16-ArcSeqID_119921ARP2/3 complex 16 kDa subunit(p16-Arc)ATP-synt_ab_NSeqID_122110 SeqID_122131ATP synthase alpha/beta family,SeqID_122660 SeqID_119160beta-baSeqID_123806 SeqID_123811SeqID_124022Endonuclease_NSSeqID_123055DNA/RNA non-specificendonucleaseThiolase_NSeqID_122429 SeqID_122772Thiolase, N-terminal domainSeqID_121350 SeqID_122890SeqID_122953 SeqID_123288SeqID_123365 SeqID_121440SeqID_121529 SeqID_120310SeqID_123593 SeqID_123864Glycogen_synSeqID_121477 SeqID_120492Glycogen synthaseBTBSeqID_119616 SeqID_120373BTB/POZ domainSeqID_121165 SeqID_121637DUF236SeqID_119224 SeqID_119247Protein of unknown functionSeqID_120712 SeqID_120713Ubiq_cyt_C_chapSeqID_122228 SeqID_121449Ubiquinol-cytochrome CSeqID_119888 SeqID_123755chaperoneRenin_rSeqID_122129 SeqID_121442Renin receptor-like proteinSeqID_123862Cwf_Cwc_15SeqID_122116 SeqID_120731Cwf15/Cwc15 cell cycle controlSeqID_121658 SeqID_124014proteinA_deaminaseSeqID_121003Adenosine/AMP deaminaseDUF164SeqID_121260 SeqID_123386Uncharacterized ACR, COG1579ATP-synt_CSeqID_122557 SeqID_122592ATP synthase subunit CSeqID_122805 SeqID_123522SeqID_119902 SeqID_119958SeqID_119959 SeqID_120137SeqID_124207 SeqID_124336ATP-synt_DSeqID_122796 SeqID_123297ATP synthase subunit DSeqID_123366 SeqID_123854LAG1SeqID_121272 SeqID_122595Longevity-assurance proteinSeqID_121367(LAG1)ATP-synt_ESeqID_123534ATP synthase E chainATP-synt_FSeqID_122088 SeqID_121348ATP synthase (F/14-kDa) subunitSeqID_121745 SeqID_121827SeqID_121881FilaminSeqID_120307 SeqID_120510Filamin/ABP280 repeatMazGSeqID_119952MazG nucleotidepyrophosphohydrolase domainATP-synt_GSeqID_122669 SeqID_119783Mitochondrial ATP synthase gSeqID_124110subunitEF1_GNESeqID_121982 SeqID_122152EF-1 guanine nucleotide exchangeSeqID_122572 SeqID_119628domainSeqID_119899 SeqID_123606FAD_binding_1SeqID_122094 SeqID_122444FAD binding domainSeqID_123823FAD_binding_2SeqID_120807FAD binding domainDynein_lightSeqID_122635 SeqID_123252Dynein light chain type 1SeqID_119812 SeqID_123564FAD_binding_3SeqID_122411 SeqID_123933FAD binding domainAstacinSeqID_122226 SeqID_122581Astacin (Peptidase family M12A)SeqID_123481 SeqID_119601SeqID_120015 SeqID_120571SeqID_120933 SeqID_121174SeqID_121216 SeqID_124300Tfb4SeqID_122561 SeqID_123565Transcription factor Tfb4Adaptin_NSeqID_120538 SeqID_120950Adaptin N terminal regionEFG_IVSeqID_121397 SeqID_120540Elongation factor G, domain IVS-AdoMet_synt_CSeqID_123143 SeqID_119596S-adenosylmethionine synthetase,C-teRibosomal_S10SeqID_122673 SeqID_119858Ribosomal protein S10p/S20eSeqID_123703 SeqID_121948Ribosomal_S11SeqID_122146 SeqID_123024Ribosomal protein S11SeqID_123430 SeqID_120073SeqID_120744 SeqID_123697SeqID_124177Ribosomal_S12SeqID_122060 SeqID_123235Ribosomal protein S12SeqID_123361 SeqID_123428SeqID_123434 SeqID_123629SeqID_124279UCR_hingeSeqID_122302 SeqID_123516Ubiquinol-cytochrome C reductaseSH3_1SeqID_123228 SeqID_119323SH3 domainSeqID_120958Ribosomal_S13SeqID_122782 SeqID_122809Ribosomal protein S13/S18SeqID_123978 SeqID_124200SAPSeqID_120587SAP domainSH3_2SeqID_123228 SeqID_120958Variant SH3 domainRibosomal_S14SeqID_122064 SeqID_122107Ribosomal protein S14p/S29eSeqID_120012 SeqID_121048SeqID_121107 SeqID_123630SeqID_123890BrixSeqID_121565 SeqID_123250Brix domainSeqID_119314 SeqID_120464SeqID_120490 SeqID_121589SeqID_121109Ribosomal_S15SeqID_122373 SeqID_123410Ribosomal protein S15SeqID_123433 SeqID_121412SeqID_123718RasSeqID_121288 SeqID_122204Ras familySeqID_122360 SeqID_122392SeqID_122397 SeqID_122560SeqID_122575 SeqID_122579SeqID_122619 SeqID_122671SeqID_122838 SeqID_122872SeqID_122903 SeqID_123052SeqID_123189 SeqID_123241SeqID_123414 SeqID_123422SeqID_119204 SeqID_119290SeqID_119645 SeqID_119708SeqID_119885 SeqID_119891SeqID_120379 SeqID_120664SeqID_121613 SeqID_120982SeqID_123605 SeqID_121093SeqID_123568 SeqID_123726SeqID_123740 SeqID_121691SeqID_123960 SeqID_123988SeqID_123991 SeqID_121918zf-TAZSeqID_120357TAZ zinc fingerRibosomal_S17SeqID_122602 SeqID_122629Ribosomal protein S17SeqID_119883 SeqID_123918SeqID_123925Ribosomal_S18SeqID_122652Ribosomal protein S18ELKSeqID_120763ELK domainRibosomal_S19SeqID_122056 SeqID_123333Ribosomal protein S19SeqID_123450 SeqID_123685C2SeqID_119599 SeqID_119684C2 domainSeqID_120248eIF-5_eIF-2BSeqID_123321 SeqID_119265Domain found in IF2B/IF5SeqID_120603 SeqID_121769VWDSeqID_122463 SeqID_123112von Willebrand factor type DSeqID_119726 SeqID_123592domainGHMP_kinasesSeqID_123199GHMP kinases putative ATP-PAP_RNA-bindSeqID_120989Poly(A) polymerase predicted RNAbindingC4SeqID_119273 SeqID_119634C-terminal tandem repeateddomain in $$P_proproteinSeqID_121230 SeqID_122663Proprotein convertase P-domainSeqID_123014 SeqID_123393ELOSeqID_122222 SeqID_122315GNS1/SUR4 familySeqID_119651 SeqID_121493SeqID_120991 SeqID_124009SeqID_124028p450SeqID_120075 SeqID_120579Cytochrome P450Complex1_30 kDaSeqID_121297 SeqID_119579Respiratory-chain NADHdehydrogenase,wntSeqID_122540wnt familyOATPSeqID_119717Organic Anion TransporterPolypeptide (OATP)RIIaSeqID_122847 SeqID_121204Regulatory subunit of type II PKAR-subunitPyrophosphataseSeqID_122209 SeqID_123850Inorganic pyrophosphataseA1_PropeptideSeqID_122475 SeqID_123036A1 PropeptideSeqID_123359 SeqID_121455SeqID_120476 SeqID_120494SeqID_123971PRP1_NSeqID_121212PRP1 splicing factor, N-terminalRhoGAPSeqID_120222 SeqID_120311RhoGAP domainG-alphaSeqID_120144 SeqID_120531G-protein alpha subunitSeqID_120703 SeqID_121138Guanylate_kinSeqID_120766Guanylate kinaseHSF_DNA-bindSeqID_123050HSF-type DNA-bindingDnaJ_CXXCXGXGSeqID_121279 SeqID_122722DnaJ central domain (4 repeats)SeqID_121405 SeqID_121507SeqID_120879 SeqID_121218SeqID_123882CollagenSeqID_121964 SeqID_121996Collagen triple helix repeat (20SeqID_122013 SeqID_122019copies)SeqID_122027 SeqID_122216SeqID_122291 SeqID_122304SeqID_122313 SeqID_122472SeqID_122536 SeqID_122650SeqID_121377 SeqID_122871SeqID_122911 SeqID_122954SeqID_123029 SeqID_123087SeqID_123176 SeqID_123223SeqID_123423 SeqID_119253SeqID_119404 SeqID_119418SeqID_119540 SeqID_119564SeqID_119634 SeqID_119798SeqID_119948 SeqID_119971SeqID_120106 SeqID_120175SeqID_120214 SeqID_120361SeqID_120474 SeqID_120626SeqID_120674 SeqID_120705SeqID_123663 SeqID_121026SeqID_121073 SeqID_121150SeqID_123664 SeqID_123677SeqID_123676 SeqID_123769SeqID_123783 SeqID_123827SeqID_121953 SeqID_121958Tubulin_CSeqID_122269 SeqID_122332Tubulin/FtsZ family, C-terminalSeqID_122440 SeqID_119159domainSeqID_119381 SeqID_119720SeqID_120775 SeqID_123545SeqID_123884 SeqID_123998SeqID_121849 SeqID_121940JmjCSeqID_122284 SeqID_120520jmjC domainZona_pellucidaSeqID_119201 SeqID_120047Zona pellucida-like domainCHSeqID_122025 SeqID_122264Calponin homology (CH) domainSeqID_122337 SeqID_122473SeqID_122788 SeqID_123499SeqID_119750 SeqID_120539SeqID_123855 SeqID_124085SeqID_124206E-MAP-115SeqID_120063E-MAP-115 familyCPSase_L_D2SeqID_119260 SeqID_119619Carbamoyl-phosphate synthase LSeqID_120976chain, AHMG_boxSeqID_122876 SeqID_122918HMG (high mobility group) boxSeqID_121497 SeqID_120582SeqID_120832 SeqID_120943Ribosomal_S25SeqID_122611 SeqID_122695S25 ribosomal proteinSeqID_123417 SeqID_123597SeqID_124222 SeqID_124340Ribosomal_S27SeqID_122053 SeqID_122726Ribosomal protein S27aSeqID_121490 SeqID_123637SeqID_123659Galactosyl_TSeqID_120408GalactosyltransferaseCSSeqID_122778 SeqID_122828CS domainSeqID_123335 SeqID_121480SeqID_120610Voltage_CLCSeqID_120735 SeqID_121213Voltage gated chloride channelLactamase_BSeqID_120930Metallo-beta-lactamasesuperfamilyeRF1_1SeqID_119244eRF1 domain 1eRF1_2SeqID_123076 SeqID_119244eRF1 domain 2FlavoproteinSeqID_122326 SeqID_119773FlavoproteineRF1_3SeqID_123076 SeqID_119244eRF1 domain 3EMP24_GP25LSeqID_122517 SeqID_119571emp24/gp25L/p24 familySeqID_123733NOT2_3_5SeqID_122445 SeqID_121346NOT2/NOT3/NOT5 familySeqID_123876Hydantoinase_ASeqID_121246 SeqID_121266Hydantoinase/oxoprolinasePlus-3SeqID_122270Plus-3 domainIBBSeqID_122870 SeqID_121423Importin beta binding domainSeqID_119952 SeqID_121148Ald_Xan_dh_C2SeqID_120485 SeqID_121912Aldehyde oxidase and xanthinedehydrogeComplex1_49 kDaSeqID_120479Respiratory-chain NADHdehydrogenase,SSrecogSeqID_120076Structure-specific recognitionproteinAldo_ket_redSeqID_123277Aldo/keto reductase familyTFIIS_CSeqID_122597 SeqID_123714Transcription factor S-II (TFIIS)ThioredoxinSeqID_121221 SeqID_122115ThioredoxinSeqID_122712 SeqID_123314SeqID_120315 SeqID_120528SeqID_121134 SeqID_123686SeqID_123796 SeqID_121887vATP-synt_ESeqID_122754 SeqID_121463ATP synthase (E/31 kDa) subunitSeqID_119218 SeqID_124020SeqID_121724MetallothioSeqID_119714MetallothineinMo-co_dimerSeqID_122426 SeqID_119301Mo-co oxidoreductase dimerisationSeqID_123908domaORC2SeqID_121926Origin recognition complex subunit 2Ribosomal_S30SeqID_122415 SeqID_123507Ribosomal protein S30SeqID_120390 SeqID_123707SCPSeqID_121229 SeqID_121227SCP-like extracellular proteinSeqID_121228 SeqID_122770SeqID_122767 SeqID_122864SeqID_123501 SeqID_121425SeqID_119399 SeqID_119556SeqID_120347 SeqID_121641GPP34SeqID_120614Golgi phosphoprotein 3 (GPP34)DMSeqID_121139DM DNA binding domainGTP_CDCSeqID_119791Cell division proteinAhpC-TSASeqID_122512 SeqID_122760AhpC/TSA familySeqID_123258 SeqID_123300SeqID_119348 SeqID_123739SeqID_123758CAP_GLYSeqID_123227CAP-Gly domainXRCC1_NSeqID_120679XRCC1 N terminal domainDUF26SeqID_122879Domain of unknown functionDUF26TRAP_betaSeqID_122465 SeqID_119980Translocon-associated proteinSeqID_123848beta (TRAPB)Cation_ATPase_NSeqID_122184 SeqID_119574Cation transporter/ATPase, N-SeqID_119844 SeqID_120162terminusSeqID_124141 SeqID_124234XPG_ISeqID_120903XPG I-regionTFIID-31 kDaSeqID_122432 SeqID_123899Transcription initiation factor IID,31 kDARPC4SeqID_123032 SeqID_121098ARP2/3 complex 20 kDa subunit(ARPC4)NopSeqID_122208 SeqID_119511Putative snoRNA binding domainSeqID_120274 SeqID_123696DXSeqID_119623DX moduleCholine_kinaseSeqID_122682 SeqID_124140Choline/ethanolamine kinaseSeqID_121776Seryl-tRNA_NSeqID_122169 SeqID_124027Seryl_tRNA synthetase N-terminaldomainUFD1SeqID_122278 SeqID_122991Ubiquitin fusion degradationSeqID_120363 SeqID_123652protein UFD1AICARFT_IMPCHasSeqID_121071AICARFT/IMPCHase bienzymeAdap_comp_subSeqID_122747 SeqID_119495Adaptor complexes mediumSeqID_120295subunit familyV_ATPase_ISeqID_121161V-type ATPase 116 kDa subunitfamilyeIF-6SeqID_122640 SeqID_120843eIF-6 familySeqID_123945TAFSeqID_121234 SeqID_122483TATA box binding proteinSeqID_122744 SeqID_122771associated facSeqID_119330 SeqID_121479SeqID_121482 SeqID_124258SeqID_124262HSP70SeqID_121224 SeqID_121246Hsp70 proteinSeqID_121258 SeqID_121266SeqID_122340 SeqID_122574SeqID_121338 SeqID_123128SeqID_123276 SeqID_121398SeqID_121394 SeqID_121403SeqID_121456 SeqID_119310SeqID_119878 SeqID_121465SeqID_121528 SeqID_120193SeqID_120276 SeqID_120425SeqID_120426 SeqID_120477SeqID_120618 SeqID_120675SeqID_123580 SeqID_121647SeqID_123958 SeqID_121785Rho_GDISeqID_123503 SeqID_119733RHO protein GDP dissociationSeqID_121942inhibitorE2F_TDPSeqID_122353 SeqID_120460E2F/DP family winged-helix DNA-SeqID_120823 SeqID_124183binding domaiVPS28SeqID_122699 SeqID_119677VPS28 proteinSeqID_124233SNARESeqID_122431 SeqID_123902SNARE domainCBM_14SeqID_120683Chitin binding Peritrophin-AdomainFicSeqID_119770Fic protein familyPeptidase_M16_CSeqID_123083 SeqID_119755Peptidase M16 inactive domainefhandSeqID_122031 SeqID_122171EF handSeqID_122356 SeqID_122443SeqID_122544 SeqID_122641SeqID_122769 SeqID_122783SeqID_122802 SeqID_122976SeqID_123119 SeqID_119228SeqID_119553 SeqID_119614zf-CCHCSeqID_122365 SeqID_122548Zinc knuckleSeqID_123105 SeqID_121406SeqID_121418 SeqID_119748SeqID_119884 SeqID_119889SeqID_121017 SeqID_121081SeqID_121192 SeqID_123993RieskeSeqID_119282Rieske [2Fe—2S] domainEF_TSSeqID_122638 SeqID_121030Elongation factor TSGTP_EFTUSeqID_122392 SeqID_122600Elongation factor Tu GTP bindingSeqID_122657 SeqID_121329domainSeqID_122792 SeqID_122872SeqID_123241 SeqID_121430SeqID_119311 SeqID_119332SeqID_119352 SeqID_119600SeqID_120413 SeqID_120540SeqID_121558 SeqID_121576SeqID_121613 SeqID_123754SeqID_121709 SeqID_123950SeqID_124180 SeqID_121717SeqID_121740 SeqID_121805SeqID_121824TPD52SeqID_120171Tumour protein D52 familyUCR_TMSeqID_119282 SeqID_121586Ubiquinol cytochrome reductaseSeqID_121732 SeqID_121833transmembraneCoq4SeqID_120870Coenzyme Q (ubiquinone)biosynthesis proteinzf-HITSeqID_123532HIT zinc fingerCUBSeqID_123097 SeqID_119304CUB domainDUF32SeqID_119459 SeqID_119801Domain of unknown functionSeqID_121149DUF32adh_shortSeqID_121285 SeqID_122087short chain dehydrogenaseSeqID_122187 SeqID_122421SeqID_122523 SeqID_122530SeqID_122596 SeqID_121476SeqID_120081 SeqID_120916SeqID_123628 SeqID_123706SeqID_123772 SeqID_123943RGSSeqID_121152Regulator of G protein signalingPMMSeqID_122570 SeqID_124322Eukaryotic phosphomannomutaseERSeqID_121000Enhancer of rudimentaryPatchedSeqID_120169 SeqID_120785Patched familySeqID_120998RRM_1SeqID_121236 SeqID_121239RNA recognition motif. (a.k.a.SeqID_122099 SeqID_122100RRM, RBD, orSeqID_122122 SeqID_122126SeqID_122158 SeqID_122202SeqID_122250 SeqID_122361SeqID_122469 SeqID_122526SeqID_122528 SeqID_122549SeqID_122676 SeqID_122700SeqID_122831 SeqID_122888SeqID_122938 SeqID_122962SeqID_123004 SeqID_123008SeqID_123071 SeqID_123101SeqID_123331 SeqID_123370SeqID_123384 SeqID_123396SeqID_123424 SeqID_119150SeqID_119180 SeqID_119191SeqID_119215 SeqID_119241SeqID_124159 SeqID_119361SeqID_119362 SeqID_119398SeqID_119489 SeqID_119527SeqID_119551 SeqID_124128I-setSeqID_119655 SeqID_119820Immunoglobulin I-set domainSeqID_119864 SeqID_119970SeqID_121498 SeqID_121513SeqID_120027 SeqID_123815SeqID_120174 SeqID_120251SeqID_120319 SeqID_120336SeqID_120348 SeqID_120467SeqID_120468 SeqID_120514SeqID_120570 SeqID_120741SeqID_120765 SeqID_123671SeqID_121559 SeqID_121578SeqID_121599 SeqID_121600SeqID_121609 SeqID_120880SeqID_120956 SeqID_121015SeqID_121055 SeqID_121119SeqID_121160 SeqID_123601SeqID_123673 SeqID_123785SeqID_123791 SeqID_123809SeqID_123819 SeqID_123820SeqID_123839 SeqID_121697SeqID_124070 SeqID_124091SeqID_124152 SeqID_124176SeqID_124194 SeqID_124259SeqID_121731 SeqID_121748SeqID_121845 SeqID_121882SeqID_121928 SeqID_121933SeqID_123529 SeqID_119158SeqID_119196 SeqID_119491SeqID_119569 SeqID_119713SeqID_119816 SeqID_120343TBCSeqID_119295 SeqID_119731TBC domainSeqID_121193Calpain_IIISeqID_120374Calpain large subunit, domain IIICBFD_NFYB_HMFSeqID_121234 SeqID_122039Histone-like transcription factorSeqID_122046 SeqID_122119(CBF/SeqID_122455 SeqID_122483SeqID_122582 SeqID_122688SeqID_122744 SeqID_122766SeqID_122771 SeqID_121314SeqID_123003 SeqID_123317SeqID_123463 SeqID_123473SeqID_121448 SeqID_121446SeqID_119330 SeqID_119493SeqID_119512 SeqID_119521SeqID_119609 SeqID_119776SeqID_121479 SeqID_121482SeqID_120790 SeqID_121601SeqID_123744 SeqID_121643SeqID_121663 SeqID_121675SeqID_121684 SeqID_121688SeqID_121706 SeqID_124105SeqID_124164 SeqID_124258SeqID_124262 SeqID_124276SeqID_121771 SeqID_121809SeqID_121823 SeqID_121905UDPG_MGDP_dh_CSeqID_123494UDP-glucose/GDP-mannosedehydrogenasezf-UBPSeqID_122511 SeqID_122662Zn-finger in ubiquitin-hydrolasesSeqID_120244and otherUPF0184SeqID_122379 SeqID_123860Uncharacterised protein family(UPF0184)FFSeqID_121478 SeqID_120846FF domainSeqID_121180RhodaneseSeqID_123181 SeqID_123269Rhodanese-like domainTBPSeqID_120108 SeqID_121146Transcription factor TFIID (orSeqID_121734TATA-bindingCytochrom_C1SeqID_119212Cytochrome C1 familyPI-PLC-YSeqID_120248Phosphatidylinositol-specificphospholipaseGlycolyticSeqID_121242 SeqID_121268Fructose-bisphosphate aldolaseSeqID_122585 SeqID_121503class-ISeqID_120227 SeqID_120758SeqID_123730 SeqID_124335SeqID_121856COX15-CtaASeqID_119736Cytochrome oxidase assemblyActinSeqID_121231 SeqID_121257ActinSeqID_122239 SeqID_122271SeqID_122678 SeqID_122922SeqID_122921 SeqID_122940SeqID_122974 SeqID_122977SeqID_122985 SeqID_123092SeqID_123347 SeqID_123374SeqID_123381 SeqID_123376SeqID_123378 SeqID_123388SeqID_123397 SeqID_123394SeqID_123402 SeqID_119184SeqID_121411 SeqID_119898SeqID_121581 SeqID_121592SeqID_123541 SeqID_123542SeqID_123817 SeqID_123843SeqID_121690 SeqID_124081SeqID_124084 SeqID_124093SeqID_124188 SeqID_124303SeqID_121834 SeqID_121888SeqID_121911ATP_synt_HSeqID_122810 SeqID_120261ATP synthase subunit HSeqID_124197SETSeqID_122866 SeqID_123437SET domainSeqID_119452 SeqID_119703SeqID_121141Ribosomal_L5_CSeqID_121970 SeqID_122399ribosomal L5P family C-terminusSeqID_122665 SeqID_123116SeqID_123183 SeqID_120376SeqID_123547 SeqID_124178ADK_lidSeqID_121282 SeqID_122317Adenylate kinase, active site lidGrpESeqID_122071 SeqID_123157GrpESeqID_123878 SeqID_121526SeqID_120867 SeqID_123661SeqID_121743XRN_NSeqID_122345 SeqID_121207XRN 5′-3′ exonuclease N-terminusSeqID_124106Ribosomal_L1SeqID_122679 SeqID_122715Ribosomal protein L1p/L10e familySeqID_124062RhoGEFSeqID_122254 SeqID_120745RhoGEF domainY_phosphataseSeqID_123060 SeqID_119220Protein-tyrosine phosphataseSeqID_119686 SeqID_119866SeqID_119975 SeqID_120184SeqID_120234 SeqID_120792SeqID_121037 SeqID_121036SeqID_121105Ribosomal_L2SeqID_122387 SeqID_122576Ribosomal Proteins L2, RNASeqID_119709 SeqID_121219binding domSeqID_123662 SeqID_124015SeqID_1218917tm_1SeqID_1197867 transmembrane receptor(rhodopsin family)Ribosomal_L3SeqID_122659 SeqID_122727Ribosomal protein L3SeqID_123246 SeqID_121145SeqID_123585Sdh_cytSeqID_122442 SeqID_123881Succinate dehydrogenasecytochrome b subunitDNA_topoisoIVSeqID_120500DNA gyrase/topoisomerase IV,subunit A7tm_2SeqID_1201137 transmembrane receptor(Secretin family)Ribosomal_L4SeqID_122113 SeqID_122385Ribosomal protein L4/L1 familySeqID_121324 SeqID_122845SeqID_121127 SeqID_123536SeqID_1237277tm_3SeqID_120501 SeqID_1207297 transmembrane receptorSeqID_120999(metabotropic glutaRibosomal_L5SeqID_121970 SeqID_122399Ribosomal protein L5SeqID_122665 SeqID_122908SeqID_123116 SeqID_123183SeqID_123517 SeqID_123547SeqID_124178PAPS_reductSeqID_120874Phosphoadenosinephosphosulfate reductaseRibosomal_L6SeqID_122583 SeqID_123234Ribosomal protein L6SeqID_121602 SeqID_120932SeqID_123631ADAM_spacer1SeqID_119479ADAM-TS Spacer 1HSP90SeqID_121981 SeqID_122028HSP90 proteinSeqID_122738 SeqID_121524SeqID_120445 SeqID_120787SeqID_121577 SeqID_123813SeqID_121774 SeqID_121829SeqID_121895Abhydrolase_1SeqID_119405 SeqID_121610alpha/beta hydrolase foldPeptidase_M1SeqID_120743 SeqID_120851Peptidase family M1Herpes_LPSeqID_120428Herpesvirus leader proteinPescadillo_NSeqID_122934Pescadillo N-terminusAbhydrolase_3SeqID_119940 SeqID_120282alpha/beta hydrolase foldSeqID_120891CPSase_L_chainSeqID_119260 SeqID_119619Carbamoyl-phosphate synthase Lchain,PMI_typeISeqID_122372 SeqID_123985Phosphomannose isomerase type IGlyco_hydro_18SeqID_122290 SeqID_119433Glycosyl hydrolases family 18SeqID_123731 SeqID_124053ProfilinSeqID_122484 SeqID_120556ProfilinSeqID_123780 SeqID_121762SeqID_121862RIO1SeqID_120808RIO1 familyTCTPSeqID_122571 SeqID_123113Translationally controlled tumourSeqID_121485 SeqID_120084proteinSeqID_120085 SeqID_123959NTF2SeqID_122143 SeqID_123505Nuclear transport factor 2 (NTF2)SeqID_123530 SeqID_119273domainSeqID_120761AP_endonuc_2SeqID_120685Xylose isomerase-like TIM barrelGATase_2SeqID_122634 SeqID_119643Glutamine amidotransferasesSeqID_123847class-IIRRS1SeqID_122559 SeqID_122857Ribosome biogenesis regulatorySeqID_123569protein (RRS1Gln-synt_CSeqID_122482 SeqID_120289Glutamine synthetase, catalyticdomainPribosyltranSeqID_122656 SeqID_122880Phosphoribosyl transferaseSeqID_120272 SeqID_124138domainDUF367SeqID_120808Domain of unknown function(DUF367)PWP2SeqID_121293Periodic tryptophan protein 2 WDrepeat assoRNA_pol_Rpa2_4SeqID_119390RNA polymerase I, Rpa2 specificdomainHesBSeqID_123420HesB-like domainSPRYSeqID_122960 SeqID_119817SPRY domainCOX4SeqID_122622 SeqID_122739Cytochrome c oxidase subunit IVSeqID_121444 SeqID_121470SeqID_120419 SeqID_121605SeqID_123898 SeqID_121775Gp-FAR-1SeqID_122790 SeqID_123533Nematode fatty acid retinoidSeqID_120157 SeqID_123934binding proteinSeqID_124215Gln-synt_NSeqID_122482 SeqID_123792Glutamine synthetase, beta-GraspdomainTransketolase_CSeqID_119346Transketolase, C-terminal domainCtrSeqID_119988Ctr copper transporter familyRCC1SeqID_122092 SeqID_120326Regulator of chromosomeSeqID_123910condensation (RCC1)Pkinase_TyrSeqID_121233 SeqID_121271Protein tyrosine kinaseSeqID_122086 SeqID_122117SeqID_122221 SeqID_122233SeqID_122285 SeqID_122384SeqID_122470 SeqID_122618SeqID_122687 SeqID_122758SeqID_123026 SeqID_123421SeqID_123491 SeqID_119148SeqID_121409 SeqID_121416SeqID_119206 SeqID_119219SeqID_119238 SeqID_119245SeqID_119296 SeqID_119338SeqID_119410 SeqID_119412SeqID_119422 SeqID_119436SeqID_119453 SeqID_119472SeqID_119483 SeqID_119520SeqID_119525 SeqID_119541SeqID_119728 SeqID_119764SeqID_119789 SeqID_119828SeqID_120006 SeqID_120051SeqID_120126 SeqID_120166SeqID_120298 SeqID_120406SeqID_120418 SeqID_120428SeqID_120452 SeqID_120453SeqID_120550 SeqID_120636SeqID_120663 SeqID_120672SeqID_120772 SeqID_121622SeqID_120934 SeqID_120996SeqID_121189 SeqID_123672SeqID_123838 SeqID_123977SeqID_123989 SeqID_124109SeqID_124294 SeqID_121714SeqID_121742 SeqID_121803SeqID_121897 SeqID_121907OSCPSeqID_122390 SeqID_123173ATP synthase delta (OSCP)SeqID_123852 SeqID_121694subunitHam1p_likeSeqID_120320Ham1 familyTransketolase_NSeqID_122651 SeqID_119261Transketolase, thiamineSeqID_123846diphosphate bHDSeqID_119514 SeqID_119787HD domainMreB_MblSeqID_121224 SeqID_121246MreB/Mbl proteinSeqID_121258 SeqID_121266SeqID_122340 SeqID_119310SeqID_120675 SeqID_123580Fzo_mitofusinSeqID_119682fzo-like conserved regionGCFCSeqID_120111GC-rich sequence DNA-bindingfactor-like proDER1SeqID_123172 SeqID_120680Der1-like familyPhosphorylaseSeqID_120862Carbohydrate phosphorylaseSH2SeqID_122470 SeqID_123228SH2 domainSeqID_119500 SeqID_120234SeqID_120452 SeqID_120453SeqID_120921 SeqID_120958CXCSeqID_121122Tesmin/TSO1-like CXC domainAldedhSeqID_120628Aldehyde dehydrogenase familyCK_II_betaSeqID_120267Casein kinase II regulatory subunitERMSeqID_121435 SeqID_121811Ezrin/radixin/moesin family3HCDH_NSeqID_122892 SeqID_1195033-hydroxyacyl-CoASeqID_120447dehydrogenase, NAD bindingTroponinSeqID_121972 SeqID_122915TroponinSeqID_123477 SeqID_120512SeqID_123537zf-U1SeqID_122257 SeqID_122817U1 zinc fingerSeqID_119881Dynamin_MSeqID_121459 SeqID_121020Dynamin central regionLBP_BPI_CETP_CSeqID_121024LBP/BPI/CETP family, C-terminal doUBASeqID_122024 SeqID_122255UBA/TS-N domainSeqID_122435 SeqID_122689SeqID_120244 SeqID_123742SeqID_123895 SeqID_121722SeqID_121739Dynamin_NSeqID_122325 SeqID_119276Dynamin familySeqID_119682 SeqID_121020SeqID_123830FG-GAPSeqID_120791FG-GAP repeatSupt5SeqID_119904Supt5 repeatCHORDSeqID_122778 SeqID_121480CHORDSeqID_120611 SeqID_124127Ribosomal_S6eSeqID_122091 SeqID_122108Ribosomal protein S6eSeqID_123156 SeqID_123278SeqID_123328 SeqID_123341SeqID_123427 SeqID_119761SeqID_123571 SeqID_124292Gtr1_RagASeqID_123193 SeqID_121217Gtr1/RagA G protein conservedregionCAF1SeqID_122750 SeqID_123054CAF1 family ribonucleaseSeqID_123367 SeqID_124146SeqID_121735RNA_pol_Rpb6SeqID_122713 SeqID_120963RNA polymerase Rpb6SeqID_124170Hist_deacetylSeqID_122658 SeqID_120496Histone deacetylase domainSeqID_124304RNA_pol_Rpb8SeqID_122008 SeqID_122789RNA polymerase Rpb8SeqID_119393 SeqID_123747Ribosomal_L10eSeqID_122247 SeqID_122929Ribosomal L10SeqID_123115 SeqID_123302SeqID_123354 SeqID_121429SeqID_119804 SeqID_123702SeqID_121800 SeqID_121851DUF1127SeqID_121449Domain of unknown function(DUF1127)FARPSeqID_121261 SeqID_122785FMRFamide related peptide familyubiquitinSeqID_121249 SeqID_121259Ubiquitin familySeqID_121959 SeqID_121966SeqID_121974 SeqID_121987SeqID_122018 SeqID_122017SeqID_122021 SeqID_122020SeqID_122023 SeqID_122026SeqID_122033 SeqID_122042SeqID_122053 SeqID_122150SeqID_122234 SeqID_122256SeqID_122415 SeqID_122461SeqID_122550 SeqID_122647SeqID_122726 SeqID_122768SeqID_121317 SeqID_121325SeqID_122784 SeqID_122823SeqID_123207 SeqID_123316SeqID_123462 SeqID_123488SeqID_123500 SeqID_119205SeqID_121408 SeqID_121458SeqID_119321 SeqID_119375SeqID_119457 SeqID_119597SeqID_119702 SeqID_119857mRNA_cap_enzymeSeqID_123044 SeqID_120040mRNA capping enzyme, catalyticdomainRibosomal_60sSeqID_122215 SeqID_12249060s Acidic ribosomal proteinSeqID_123784 SeqID_124059SHMTSeqID_123444Serine hydroxymethyltransferaseTSP_1SeqID_121277 SeqID_119765Thrombospondin type 1 domainSeqID_121174Bin3SeqID_122035 SeqID_120646Bicoid-interacting protein 3 (Bin3)SeqID_123803APS_kinaseSeqID_119710Adenylylsulphate kinaseGSH_synthaseSeqID_122183 SeqID_119391Eukaryotic glutathione synthaseSeqID_120121 SeqID_120409SeqID_120622 SeqID_120719SeqID_120886 SeqID_120895SeqID_121074 SeqID_123719SFT2SeqID_123142 SeqID_121421SFT2-like proteinSeqID_120229HomeoboxSeqID_119358 SeqID_119397Homeobox domainSeqID_119915 SeqID_120364SeqID_120535 SeqID_121140Pox_A_type_incSeqID_121260 SeqID_122546Viral A-type inclusion proteinSeqID_123618repeatiPGM_NSeqID_122467 SeqID_123844BPG-independent PGAM N-SeqID_120794terminus (iPGM_N)RNA_pol_LSeqID_122755 SeqID_120230RNA polymerase Rpb3/Rpb11SeqID_123595dimerisation domaV-setSeqID_123529 SeqID_119196Immunoglobulin V-set domainSeqID_119491 SeqID_119713SeqID_119816 SeqID_120343CTP_synth_NSeqID_122680 SeqID_120279CTP synthase N-terminusSeqID_120818 SeqID_123812AAASeqID_122138 SeqID_122219ATPase family associated withSeqID_122276 SeqID_122358various cellulSeqID_122493 SeqID_122675SeqID_121315 SeqID_121378SeqID_123161 SeqID_119176SeqID_121460 SeqID_119267SeqID_119300 SeqID_119612SeqID_119652 SeqID_119734SeqID_119875 SeqID_120119SeqID_120736 SeqID_121618SeqID_120857 SeqID_120931SeqID_120938 SeqID_123801SeqID_123871 SeqID_124113SeqID_124117 SeqID_121875SeqID_121923PP-bindingSeqID_123286 SeqID_123525Phosphopantetheine attachmentSeqID_120461 SeqID_124213siteSeqID_124214CDC37SeqID_120246Cdc37 familyFtsJSeqID_120165FtsJ-like methyltransferasePeroxin-13_NSeqID_119323Peroxin 13, N-terminalRibosomal_S7eSeqID_122524 SeqID_121334Ribosomal protein S7eSeqID_123124 SeqID_121922Sugar_trSeqID_119607 SeqID_120612Sugar (and other) transporterUCHSeqID_122246 SeqID_122435Ubiquitin carboxyl-terminalSeqID_122511 SeqID_121433hydrolaseSeqID_119366 SeqID_119785SeqID_119799 SeqID_120244SeqID_120577 SeqID_121090SeqID_123828 SeqID_123895SeqID_121784HATPase_cSeqID_121965 SeqID_121968Histidine kinase-, DNA gyrase B-,SeqID_121976 SeqID_122028and HSP90SeqID_122252 SeqID_123282SeqID_124089 SeqID_120445SeqID_123704Activin_recpSeqID_123267Activin types I and II receptordomainDUF602SeqID_119868Protein of unknown function,DUF602DUF1136SeqID_119196Repeat of unknown function(DUF1136)TAFII28SeqID_123141hTAFII28-like protein conservedregionPkinaseSeqID_121233 SeqID_121271Protein kinase domainSeqID_122086 SeqID_122117SeqID_122221 SeqID_122233SeqID_122285 SeqID_122384SeqID_122470 SeqID_122502SeqID_122618 SeqID_122687SeqID_122758 SeqID_122956SeqID_123026 SeqID_123491SeqID_119148 SeqID_121409SeqID_121416 SeqID_121441SeqID_119219 SeqID_119237SeqID_119238 SeqID_119245SeqID_119296 SeqID_119338SeqID_119365 SeqID_119410SeqID_119412 SeqID_119436SeqID_119453 SeqID_119472SeqID_119483 SeqID_119520SeqID_119525 SeqID_119576SeqID_119629 SeqID_119728SeqID_119764 SeqID_119789SeqID_119828 SeqID_119968SeqID_120006 SeqID_120126SeqID_120166 SeqID_120208SeqID_120298 SeqID_120406SeqID_120428 SeqID_120452SeqID_120453 SeqID_120550SeqID_120636 SeqID_120663SeqID_120672 SeqID_120701SeqID_120771 SeqID_120772SeqID_121561 SeqID_121583SeqID_121622 SeqID_120796SeqID_120934 SeqID_120996SeqID_121102 SeqID_121189SeqID_123672 SeqID_123947SeqID_123977 SeqID_123989SeqID_124109 SeqID_124130SeqID_124294 SeqID_121714SeqID_121742 SeqID_121803SeqID_121897 SeqID_121903SeqID_121907 SeqID_121920KH_1SeqID_122074 SeqID_122460KH domainSeqID_121313 SeqID_123392SeqID_123476 SeqID_119195SeqID_120972 SeqID_124132SeqID_121787FA_hydroxylaseSeqID_122136 SeqID_123602Fatty acid hydroxylaseClc-likeSeqID_122446 SeqID_120339Clc-likeSeqID_120721 SeqID_123875KH_2SeqID_122074 SeqID_122370KH domainSeqID_123179 SeqID_121535SeqID_121549 SeqID_123699SeqID_124132 SeqID_124172SeqID_124328 SeqID_121787Galactosyl_T_2SeqID_122573 SeqID_122681GalactosyltransferaseSeqID_119322 SeqID_124349PiwiSeqID_121243 SeqID_121305Piwi domainSeqID_123411 SeqID_119347SeqID_119676 SeqID_120129SeqID_121773 SeqID_121792RLISeqID_122076 SeqID_120449Possible metal-binding domain inSeqID_120808 SeqID_124131RNase L inhHORMASeqID_121286 SeqID_119567HORMA domainSeqID_121728RNA_pol_Rpb1_3SeqID_120039 SeqID_120437RNA polymerase Rpb1, domain 3SeqID_120506Ldh_2SeqID_122223 SeqID_121410Malate/L-lactate dehydrogenaseSeqID_121556 SeqID_121574SeqID_124002NeuralizedSeqID_122352 SeqID_120365NeuralizedSeqID_124000 SeqID_124184RNA_pol_Rpb1_4SeqID_120039 SeqID_120437RNA polymerase Rpb1, domain 4SeqID_120506 SeqID_120541RNA_pol_Rpb1_5SeqID_120506RNA polymerase Rpb1, domain 5RNA_pol_Rpb1_6SeqID_120506RNA polymerase Rpb1, domain 6Clat_adaptor_sSeqID_122102 SeqID_122224Clathrin adaptor complex smallSeqID_122801 SeqID_122988chainSeqID_120385 SeqID_120859SeqID_121131 SeqID_123790SeqID_124166IF4ESeqID_122412 SeqID_120589Eukaryotic initiation factor 4EKinesinSeqID_121296 SeqID_122362Kinesin motor domainSeqID_119669 SeqID_124181G10SeqID_122515G10 proteinGround-likeSeqID_120751Ground-like domainP34-ArcSeqID_123215 SeqID_121855Arp2/3 complex, 34 kD subunitp34-ArcRibosomal_S8eSeqID_122003 SeqID_122010Ribosomal protein S8eSeqID_122130 SeqID_122508SeqID_121333 SeqID_123027SeqID_121508 SeqID_121531SeqID_121545 SeqID_123644SeqID_121667Ribosomal_S3_CSeqID_122370 SeqID_123179Ribosomal protein S3, C-terminalSeqID_119376 SeqID_121535domaiSeqID_121126 SeqID_123699ResIIISeqID_122318 SeqID_119170Type III restriction enzyme, resSeqID_121443 SeqID_119233subunitSeqID_119766 SeqID_120168SeqID_123835 SeqID_124119TFIIE_betaSeqID_121012TFIIE beta subunit core domainAA_kinaseSeqID_122430Amino acid kinase familyExo_endo_phosSeqID_122447 SeqID_123874Endonuclease/Exonuclease/phosphatasefaHLHSeqID_122206 SeqID_119868Helix-loop-helix DNA-bindingSeqID_120631domainKeratin_B2SeqID_120481Keratin, high sulfur B2 proteinTspO_MBRSeqID_122231 SeqID_122587TspO/MBR familySeqID_123338 SeqID_121431SeqID_119499 SeqID_123778SeqID_123779 SeqID_124344SeqID_121908C1-setSeqID_119491 SeqID_120343Immunoglobulin C1-set domainSCO1-SenCSeqID_121873SCO1/SenCT-boxSeqID_122717 SeqID_123320T-boxPSISeqID_119510Plexin repeatAAA_2SeqID_122219 SeqID_122276ATPase family associated withSeqID_121378 SeqID_119652various cellulSeqID_119734 SeqID_121618SeqID_120931 SeqID_123871SeqID_124113 SeqID_121923DUF477SeqID_122322 SeqID_124018Domain of unknown function(DUF477)AAA_3SeqID_122493 SeqID_119176ATPase family associated withSeqID_119742 SeqID_119875various cellulSeqID_120649 SeqID_121618ABC_membraneSeqID_120493ABC transporter transmembraneregionfn3SeqID_119158 SeqID_119225Fibronectin type III domainSeqID_119569 SeqID_120170SeqID_120951 SeqID_121023AAA_5SeqID_122219 SeqID_122493ATPase family associated withSeqID_119176 SeqID_119875various cellulSeqID_120649 SeqID_120763SeqID_120931 SeqID_120936SeqID_123871DestabilaseSeqID_122806 SeqID_124205DestabilaseGlyco_transf_22SeqID_123460 SeqID_120564Alg9-like mannosyltransferasefamilyNot3SeqID_122203 SeqID_121162Not1 N-terminal domain, CCR4-Not complex comCDC50SeqID_122266 SeqID_121335LEM3 (ligand-effect modulator 3)SeqID_120160 SeqID_123808family/CDGlyco_transf_25SeqID_121052Glycosyltransferase family 25 (LPSbiPSSSeqID_119991Phosphatidyl serine synthasePRP38SeqID_123506 SeqID_121713PRP38 familyUCR_14kDSeqID_122477 SeqID_121481Ubiquinol-cytochrome C reductaseSeqID_121514 SeqID_123716complex 14kBiopterin_HSeqID_122189 SeqID_122692Biopterin-dependent aromaticSeqID_122958 SeqID_120567amino acid hSeqID_123832 SeqID_124293Cofilin_ADFSeqID_122563 SeqID_122620Cofilin/tropomyosin-type actin-SeqID_121306 SeqID_121419binding prSeqID_119918 SeqID_124332MOZ_SASSeqID_123033MOZ/SAS familySNaseSeqID_121415 SeqID_120155Staphylococcal nucleasehomologueSkp1_POZSeqID_121262 SeqID_122339Skp1 family, tetramerisationSeqID_122672 SeqID_121318domainSeqID_119994 SeqID_123642Acyl_transf_3SeqID_119415Acyltransferase familyRibosomal_L10SeqID_123472 SeqID_120199Ribosomal protein L10SeqID_121812HMASeqID_122636 SeqID_123900Heavy-metal-associated domainRibosomal_S3AeSeqID_122253 SeqID_123202Ribosomal S3Ae familySeqID_119906 SeqID_121492SeqID_123962Ribosomal_L11SeqID_122466 SeqID_122612Ribosomal protein L11, RNASeqID_122798 SeqID_120200binding doSeqID_121603 SeqID_120902SeqID_123787 SeqID_121653SeqID_124218eIF-1aSeqID_122566 SeqID_123523Eukaryotic initiation factor 1ASeqID_119774 SeqID_124173Ribosomal_L13eSeqID_121985 SeqID_121989Ribosomal protein L13eSeqID_122082 SeqID_119425SeqID_123942Ribosomal_L12SeqID_122236 SeqID_121124Ribosomal protein L7/L12 C-SeqID_123921terminal domS10_plectinSeqID_122608 SeqID_119224Plectin/S10 domainSeqID_123551Ribosomal_L14SeqID_122293 SeqID_123957Ribosomal protein L14p/L23eDUF625SeqID_120752Protein of unknown function(DUF625)Sec23_trunkSeqID_121381 SeqID_119647Sec23/Sec24 trunk domainigSeqID_123529 SeqID_119196Immunoglobulin domainSeqID_119491 SeqID_119569SeqID_119713 SeqID_119816SeqID_120343Ribosomal_L16SeqID_122247 SeqID_122653Ribosomal protein L16SeqID_123302 SeqID_124211Ion_transSeqID_119930 SeqID_120812Ion transport proteinNAF1SeqID_119903NAF1 domainAa_transSeqID_121971 SeqID_119531Transmembrane amino acidtransporter proteinAPG6SeqID_122694 SeqID_121500Autophagy protein Apg6SEC-CSeqID_120562SEC-C motifKE2SeqID_122599 SeqID_122773KE2 family proteinSeqID_124174 SeqID_124230SeqID_124235Lyase_1SeqID_120948 SeqID_121661LyaseSeqID_121786Ran_BP1SeqID_119729 SeqID_120815RanBP1 domainPGM_PMM_IVSeqID_119474 SeqID_120700Phosphoglucomutase/phosphomannomutase,C-tBAHSeqID_123089BAH domainUQ_conSeqID_121983 SeqID_122089Ubiquitin-conjugating enzymeSeqID_122437 SeqID_122519SeqID_122757 SeqID_123232ENTHSeqID_122942 SeqID_123389ENTH domainDUF6SeqID_119240Integral membrane protein DUF6Ribosomal_L21eSeqID_122144 SeqID_123108Ribosomal protein L21eSeqID_123268 SeqID_123293SeqID_123298 SeqID_121434SeqID_119745 SeqID_123751SeqID_124341 SeqID_121876Cyclin_CSeqID_120722Cyclin, C-terminal domainADKSeqID_121282 SeqID_122317Adenylate kinaseSeqID_119710MAS20SeqID_121991 SeqID_122410MAS20 protein import receptorSeqID_123936TIGSeqID_122994IPT/TIG domainDNA_pol_BSeqID_120301DNA polymerase family BRibosomal_L22SeqID_121995 SeqID_122261Ribosomal protein L22p/L17eSeqID_122840 SeqID_123164SeqID_123257 SeqID_121516SeqID_120038 SeqID_123674Ribosomal_L14eSeqID_122701 SeqID_123548Ribosomal protein L14Ribosomal_L23SeqID_122038 SeqID_123046Ribosomal protein L23SeqID_119380 SeqID_119408SeqID_123694 SeqID_124231SNF2_NSeqID_119766 SeqID_120168SNF2 family N-terminal domainSeqID_120323Cgr1SeqID_122506 SeqID_123609Cgr1 familyGlutaredoxinSeqID_123129GlutaredoxinPUASeqID_122707 SeqID_119847PUA domaintRNA_m1G_MT_9SeqID_122151 SeqID_123209tRNA m(1)G methyltransferaseRNA_pol_Rpb2_3SeqID_123375 SeqID_121089RNA polymerase Rpb2, domain 3Ribosomal_L29SeqID_122312 SeqID_122610Ribosomal L29 proteinSeqID_123285 SeqID_119200SeqID_123638 SeqID_123687SeqID_121725RNA_pol_Rpb2_4SeqID_123375 SeqID_121089RNA polymerase Rpb2, domain 4zf-nanosSeqID_121133Nanos RNA binding domainRNA_pol_Rpb2_5SeqID_122280 SeqID_119390RNA polymerase Rpb2, domain 5SeqID_121089 SeqID_124072Peptidase_S8SeqID_121230 SeqID_123401Subtilase familySeqID_119631PUFSeqID_122211 SeqID_122825Pumilio-family RNA binding repeatSeqID_119161 SeqID_121389SeqID_123824 SeqID_121723RNA_pol_Rpb2_6SeqID_122280 SeqID_122552RNA polymerase Rpb2, domain 6SeqID_121330 SeqID_123182SeqID_119390 SeqID_123587SeqID_123831Cyclin_NSeqID_122428 SeqID_122670Cyclin, N-terminal domainSeqID_122816 SeqID_121486SeqID_120722 SeqID_121612SeqID_123768 SeqID_123905Mod_rSeqID_123251 SeqID_123409Modifier of rudimentary (Mod(r))SeqID_120718 SeqID_123566proteinRNA_pol_Rpb2_7SeqID_122552RNA polymerase Rpb2, domain 7Ribosomal_L7AeSeqID_121269 SeqID_122134Ribosomal proteinSeqID_122180 SeqID_122335L7Ae/L30e/S12e/Gadd4SeqID_122367 SeqID_122404SeqID_122604 SeqID_122718SeqID_123155 SeqID_123294SeqID_123455 SeqID_119560SeqID_123888 SeqID_119859SeqID_121504 SeqID_123633SeqID_123660 SeqID_123698SeqID_123927 SeqID_123935SeqID_123992 SeqID_124144POLO_boxSeqID_119828 SeqID_121022POLO box duplicated regionNucleoporin2SeqID_120026Nucleoporin autopeptidasezf-BEDSeqID_119515BED zinc fingerEtsSeqID_119450Ets-domainRibosomal_S2SeqID_122439 SeqID_122525Ribosomal protein S2SeqID_122923 SeqID_122995SeqID_123127 SeqID_123308SeqID_123382 SeqID_120337SeqID_120668 SeqID_123668SeqID_124044Rcd1SeqID_121351 SeqID_119153Cell differentiation family, Rcd1-likeRibosomal_S4SeqID_121533 SeqID_121546Ribosomal protein S4/S9 N-terminal domaiGMC_oxred_CSeqID_122311GMC oxidoreductaseRibosomal_S5SeqID_122366 SeqID_123421Ribosomal protein S5, N-terminalSeqID_123632domaiDUF1240SeqID_120294Protein of unknown function(DUF1240)Topoisom_ISeqID_123167 SeqID_121520Eukaryotic DNA topoisomerase I,SeqID_121660 SeqID_121884catalyticRibosomal_S6SeqID_122928 SeqID_119470Ribosomal protein S6DUF1241SeqID_122262 SeqID_120893Protein of unknown function(DUF1241)Ribosomal_S7SeqID_122487 SeqID_122488Ribosomal protein S7p/S5eSeqID_122733 SeqID_123383SeqID_124307 SeqID_123688SeqID_124272Ssl1SeqID_122605 SeqID_123911Ssl1-likeSeqID_124066 SeqID_121741Ribosomal_S8SeqID_122779 SeqID_120839Ribosomal protein S8Nop52SeqID_122283 SeqID_120786Nucleolar protein, Nop52SeqID_121201 SeqID_124065Ribosomal_L22eSeqID_122791 SeqID_120882Ribosomal L22e protein familyRibosomal_L30SeqID_121300 SeqID_122251Ribosomal protein L30p/L7eSeqID_122565 SeqID_123737SeqID_123736 SeqID_121777SeqID_121894 SeqID_121896AdoHcyaseSeqID_122459 SeqID_123859S-adenosyl-L-homocysteinehydrolaseRibosomal_L15eSeqID_121969 SeqID_122199Ribosomal L15SeqID_123299 SeqID_119488SeqID_119941 SeqID_120030SeqID_123763 SeqID_123764SeqID_124342V-ATPase_CSeqID_120905V-ATPase subunit CProteasomeSeqID_121422 SeqID_121276Proteasome A-type and B-typeSeqID_121284 SeqID_121960SeqID_121962 SeqID_122045SeqID_122055 SeqID_122190SeqID_122396 SeqID_122413SeqID_122474 SeqID_122498SeqID_121342 SeqID_122780SeqID_122803 SeqID_123336SeqID_123362 SeqID_123419SeqID_119278 SeqID_119909SeqID_121468 SeqID_120588SeqID_120924 SeqID_123604SeqID_123611 SeqID_123646SeqID_123681 SeqID_123825SeqID_124136 SeqID_124227SeqID_121889GMC_oxred_NSeqID_122311 SeqID_124037GMC oxidoreductasePHF5SeqID_122542 SeqID_120850PHF5-like proteinSeqID_123639DNA_gyraseBSeqID_120500DNA gyrase BCullinSeqID_122420 SeqID_119796Cullin familySeqID_119797 SeqID_120045SeqID_123756 SeqID_120756SeqID_120773 SeqID_121682SeqID_121780 SeqID_121941DUF572SeqID_119715 SeqID_120330Family of unknown function(DUF572)FAA_hydrolaseSeqID_122355 SeqID_124011Fumarylacetoacetate (FAA)hydrolase famcNMP_bindingSeqID_119563 SeqID_121604Cyclic nucleotide-binding domainV-ATPase_GSeqID_122706 SeqID_123291Vacuolar (H+)-ATPase G subunitSeqID_119894 SeqID_119927SeqID_123872V-ATPase_HSeqID_120190V-ATPase subunit HEpimeraseSeqID_119815 SeqID_120284NAD dependentSeqID_121143epimerase/dehydratase familyLipase_2SeqID_119177Lipase (class 2)Ribosomal_L39SeqID_122520 SeqID_122799Ribosomal L39 proteinHCNGPSeqID_120697 SeqID_120698HCNGP-like proteinPOP1SeqID_123216Ribonucleases P/MRP proteinsubunit POP1SMNSeqID_122580 SeqID_124165Survival motor neuron protein(SMN)ACPSSeqID_122149 SeqID_1241634′-phosphopantetheinyl transferasesuperfamiLampSeqID_122690 SeqID_123937Lysosome-associated membraneglycoprotein (LFragX_IPSeqID_120422Cytoplasmic Fragile-X interactingfamilyAminotran_1_2SeqID_122344Aminotransferase class I and IIABC_tranSeqID_122402 SeqID_119524ABC transporterSeqID_119625 SeqID_120769SeqID_123949GRPSeqID_119747 SeqID_120570Glycine rich protein familyVps54SeqID_122919 SeqID_119956Vps54-like proteinAph-1SeqID_122155 SeqID_123059Aph-1 proteinSeqID_124013Radical_SAMSeqID_120962Radical SAM superfamilyJosephinSeqID_122683 SeqID_123774JosephinSeqID_123928EF1GSeqID_122057 SeqID_123454Elongation factor 1 gamma,SeqID_120547 SeqID_123557conserved domainMonooxygenaseSeqID_122333 SeqID_123771MonooxygenaseEXSSeqID_120654EXS familyPCNA_CSeqID_122062 SeqID_122819Proliferating cell nuclear antigen,SeqID_120167C-terminSad1_UNCSeqID_119813 SeqID_120250Sad1/UNC-like C-terminalAMP-bindingSeqID_123152 SeqID_120233AMP-binding enzymeDIM1SeqID_123449 SeqID_124285Mitosis protein DIM1ATP_bind_1SeqID_122375 SeqID_122533Conserved hypothetical ATPSeqID_122872 SeqID_119782binding proteinSeqID_123655 SeqID_123983DUF652SeqID_122800 SeqID_119259Protein of unknown function,SeqID_123553 SeqID_124216DUF652PCNA_NSeqID_122062 SeqID_122819Proliferating cell nuclear antigen,SeqID_122829 SeqID_121061N-terminSeqID_124283DUF727SeqID_122615 SeqID_119316Protein of unknown functionSeqID_121702 SeqID_124046(DUF727)Utp11SeqID_122295 SeqID_121156Utp11 proteinThiFSeqID_121273 SeqID_122346ThiF familySeqID_119266 SeqID_121104SeqID_124189MMR_HSR1SeqID_121288 SeqID_122204GTPase of unknown functionSeqID_122360 SeqID_122375SeqID_122392 SeqID_122397SeqID_122560 SeqID_122575SeqID_122579 SeqID_122600SeqID_122619 SeqID_122657SeqID_122671 SeqID_121329SeqID_122792 SeqID_122872SeqID_123422 SeqID_119204SeqID_121430 SeqID_119311SeqID_119332 SeqID_119352SeqID_119367 SeqID_119492SeqID_119523 SeqID_119645SeqID_119791 SeqID_120413SeqID_120523 SeqID_120766SeqID_121613 SeqID_120900SeqID_121038 SeqID_121093SeqID_123568 SeqID_123726SeqID_123740 SeqID_123754SeqID_121709 SeqID_123960SeqID_123950 SeqID_123988SeqID_123991 SeqID_124180SeqID_121717 SeqID_121740zf-C2H2SeqID_122179 SeqID_122249Zinc finger, C2H2 typeSeqID_122267 SeqID_123033SeqID_123091 SeqID_123110SeqID_123191 SeqID_119156SeqID_119250 SeqID_119283SeqID_119315 SeqID_119326SeqID_119455 SeqID_119515SeqID_119638 SeqID_119674SeqID_119915 SeqID_119925SeqID_120046 SeqID_120117SeqID_120617 SeqID_120707SeqID_121133 SeqID_121172SeqID_123906 SeqID_123914SeqID_124145 SeqID_121781SeqID_121822HEATSeqID_122200 SeqID_122532HEAT repeatSeqID_121374 SeqID_121380SeqID_122870 SeqID_119166SeqID_121402 SeqID_119389SeqID_119427 SeqID_119658SeqID_124126 SeqID_119790SeqID_119852 SeqID_119929SeqID_121537 SeqID_119998SeqID_120055 SeqID_120163SeqID_120538 SeqID_120549SeqID_120581 SeqID_120656SeqID_121550 SeqID_121614SeqID_121006 SeqID_121175SeqID_121201 SeqID_123600SeqID_121708PWISeqID_122736 SeqID_122749PWI domainSeqID_120835 SeqID_124267SeqID_124270Syja_NSeqID_119649 SeqID_120216Sacl homology domainzf-Sec23_Sec24SeqID_120593Sec23/Sec24 zinc fingerGcd10pSeqID_122163 SeqID_119505Gcd10p familySeqID_124154GelsolinSeqID_119286 SeqID_121187Gelsolin repeatFUN14SeqID_122589FUN14 familyUcrQSeqID_122007 SeqID_122808UcrQ familySeqID_119696 SeqID_124290Ribosomal_L31eSeqID_122378 SeqID_123337Ribosomal protein L31eSeqID_123982Ribosomal_L24eSeqID_122132 SeqID_123616Ribosomal protein L24eSeqID_123617 SeqID_124161CalreticulinSeqID_122265 SeqID_122837Calreticulin familySeqID_119911 SeqID_123897eIF-5aSeqID_122554 SeqID_123204Eukaryotic initiation factor 5ASeqID_120378 SeqID_123582hypusine, DNPex14_NSeqID_122174 SeqID_123766Peroxisomal membrane anchorprotein (Pex14p)DUF663SeqID_123123 SeqID_120018Protein of unknown function(DUF663)UIMSeqID_122683 SeqID_123389Ubiquitin interaction motifSeqID_123774COX5ASeqID_122529 SeqID_120371Cytochrome c oxidase subunit VaSeqID_123670COX5BSeqID_121281 SeqID_122395Cytochrome c oxidase subunit VbSeqID_122479 SeqID_121428SeqID_119688 SeqID_123743SeqID_124156Ribosomal_L23eNSeqID_122038 SeqID_123046Ribosomal protein L23, N-terminalSeqID_119380 SeqID_119408domSeqID_123694PHSeqID_121274 SeqID_123067PH domainSeqID_119258 SeqID_119463SeqID_120842 SeqID_120929SeqID_121878GTP_EFTU_D2SeqID_122391 SeqID_122657Elongation factor Tu domain 2SeqID_121299 SeqID_122792SeqID_123334 SeqID_123345SeqID_119311 SeqID_119352SeqID_121538 SeqID_121551SeqID_120413 SeqID_120540SeqID_123754 SeqID_123950SeqID_123967 SeqID_121805SeqID_121825 SeqID_121826Sas10_Utp3SeqID_120054Sas10/Utp3 familyPrp18SeqID_123355 SeqID_120394Prp18 domainGTP_EFTU_D3SeqID_122729 SeqID_122792Elongation factor Tu C-terminalSeqID_123334 SeqID_123345domainSeqID_121538 SeqID_121551SeqID_120413 SeqID_123754SeqID_121664 SeqID_121825SeqID_121826 SeqID_121939GATASeqID_121080GATA zinc fingerSpectrinSeqID_119246 SeqID_119482Spectrin repeatSeqID_119575 SeqID_120539SeqID_120627V-SNARESeqID_122066 SeqID_121095Vesicle transport v-SNARE proteinSeqID_123939Ribosomal_S5_CSeqID_122366 SeqID_123632Ribosomal protein S5, C-terminalSeqID_121839domaiPXSeqID_120853PX domainKIDSeqID_120763KID repeatGSH_synth_ATPSeqID_122183 SeqID_121436Eukaryotic glutathione synthase,SeqID_120021 SeqID_120101ATP biSeqID_120121 SeqID_120409SeqID_120622 SeqID_120666SeqID_120719 SeqID_120782SeqID_120886 SeqID_120895SeqID_121074 SeqID_123719MCMSeqID_119742 SeqID_119912MCM2/3/5 familySeqID_120936ETF_alphaSeqID_121607Electron transter flavoprotein alphasubuniL51_S25_CI-B8SeqID_122048 SeqID_123231Mitochondrial ribosomal proteinSeqID_121114 SeqID_123645L51/SCBSSeqID_120735 SeqID_121213CBS domainRibosomal_L18eSeqID_122342 SeqID_122374Eukaryotic ribosomal protein L18SeqID_123713 SeqID_123753SeqID_121703 SeqID_124289OTCaceSeqID_119435Aspartate/omithinecarbamoyltransterase, AGRAMSeqID_122481 SeqID_123794GRAM domainSeqID_121636Rad21_Rec8SeqID_122963Conserved region of Rad21/Rec8like protDUF676SeqID_120421Putative serine esterase (DUF676)Ribosomal_L18pSeqID_120495 SeqID_121687Ribosomal L18p/L5e familyMetallophosSeqID_122191 SeqID_122213Calcineurin-like phosphoesteraseSeqID_122289 SeqID_123311SeqID_123307 SeqID_121426SeqID_119337 SeqID_119448SeqID_119502 SeqID_119572SeqID_119659 SeqID_119751SeqID_119860 SeqID_121540SeqID_120128 SeqID_120238SeqID_120434 SeqID_120484SeqID_120553 SeqID_120710SeqID_120825 SeqID_120824SeqID_120834 SeqID_120873SeqID_121046 SeqID_121142SeqID_121188 SeqID_121700SeqID_124029 SeqID_124030SeqID_124061 SeqID_121738SeqID_121746 SeqID_121770SeqID_121899 SeqID_121900SeqID_121915HECTSeqID_121283 SeqID_121316HECT-domain (ubiquitin-SeqID_119865 SeqID_120854transferase)SeqID_121657 SeqID_121712Hormone_recepSeqID_120699 SeqID_121758Ligand-binding domain of nuclearhormonNACSeqID_122153 SeqID_122689NAC domainSeqID_123502 SeqID_120888SeqID_123742C1_1SeqID_120222 SeqID_120526Phorbol esters/diacylglycerolbinding domainCalponinSeqID_121222 SeqID_122248Calponin family repeatSeqID_122936 SeqID_123005SeqID_123171 SeqID_123188SeqID_123400 SeqID_120392SeqID_120928 SeqID_120927SeqID_123804 SeqID_123842SeqID_123853 SeqID_124187RmaADSeqID_122723 SeqID_119487Ribosomal RNA adenineSeqID_123996dimethylaseSPXSeqID_120654SPX domainC1_3SeqID_119379 SeqID_120217C1-like domainSeqID_120788GST_CSeqID_122057 SeqID_122407Glutathione S-transferase, C-SeqID_122628 SeqID_122642terminal domainSeqID_121383 SeqID_123011SeqID_121623 SeqID_121132SeqID_123557 SeqID_123625SeqID_123709 SeqID_123708SeqID_124298 SeqID_121838Na_Ca_exSeqID_120053Sodium/calcium exchanger proteinB3_4SeqID_120243B3/4 domainSec23_helicalSeqID_119286Sec23/Sec24 helical domainRibosomal_L4oeSeqID_122033 SeqID_121408Ribosomal L40e familySeqID_119857 SeqID_123721ICIn_channelSeqID_121998 SeqID_123531Nucleotide-sensitive chlorideconductancHistoneSeqID_121234 SeqID_122039Core histone H2A/H2B/H3/H4SeqID_122046 SeqID_122096SeqID_122119 SeqID_122455SeqID_122483 SeqID_122582SeqID_122688 SeqID_122744SeqID_122766 SeqID_122771SeqID_121314 SeqID_122884SeqID_122955 SeqID_123003SeqID_123317 SeqID_123463SeqID_123466 SeqID_123473SeqID_121448 SeqID_121446SeqID_119330 SeqID_119493SeqID_119512 SeqID_119521SeqID_119609 SeqID_119776SeqID_119983 SeqID_121479SeqID_121482 SeqID_120068SeqID_120077 SeqID_120480SeqID_120534 SeqID_120790SeqID_121570 SeqID_121594SeqID_121601 SeqID_120813SeqID_121054 SeqID_123744SeqID_121645 SeqID_121663SeqID_121675 SeqID_121683SeqID_121684 SeqID_121688SeqID_121706 SeqID_121705SeqID_124105 SeqID_124164SeqID_124258 SeqID_124262SeqID_124276 SeqID_124314SeqID_121771 SeqID_121809SeqID_121819 SeqID_121828SeqID_121823 SeqID_121905DisintegrinSeqID_120481Disintegrin3HCDHSeqID_1204473-hydroxyacyl-CoAdehydrogenase, C-terminalNAPSeqID_122303 SeqID_119982Nucleosome assembly proteinSeqID_124050(NAP)TubulinSeqID_122269 SeqID_122332Tubulin/FtsZ family, GTPaseSeqID_122440 SeqID_121336domainSeqID_123013 SeqID_119381SeqID_119720 SeqID_124114SeqID_121499 SeqID_121506SeqID_121543 SeqID_120775SeqID_121555 SeqID_123545SeqID_123884 SeqID_124069SeqID_121761 SeqID_121820SeqID_121849 SeqID_121893GST_NSeqID_122057 SeqID_122407Glutathione S-transferase, N-SeqID_122628 SeqID_122642terminal domainSeqID_121383 SeqID_123011SeqID_123274 SeqID_119360SeqID_121496 SeqID_120438SeqID_121623 SeqID_123557SeqID_123625 SeqID_123709SeqID_123708 SeqID_124297SeqID_121838ETC_C1_NDUFA5SeqID_122406 SeqID_123873ETC complex I subunit conservedregion2-Hacid_dhSeqID_120275D-isomer specific 2-hydroxyaciddehydrogenAdenylsucc_syntSeqID_122272 SeqID_121355Adenylosuccinate synthetaseSeqID_119162 SeqID_119650SeqID_120090 SeqID_120822SeqID_124006RTCSeqID_122518 SeqID_123729RNA 3′-terminal phosphate cyclaseRibosomal_L19eSeqID_122613 SeqID_122869Ribosomal protein L19eSeqID_123303 SeqID_120779TRAPP_Bet3SeqID_122875Transport protein particle (TRAPP)componeSMC_CSeqID_119625SMC family, C-terminal domainCDP-OH_P_transfSeqID_123356CDP-alcoholphosphatidyltransferaseFrataxin_CyaySeqID_122075 SeqID_123508Frataxin-like domainSeqID_120847 SeqID_124330VHSSeqID_120912VHS domainDUF689SeqID_122204Protein of unknown function(DUF689)SMC_NSeqID_119887 SeqID_119966RecF/RecN/SMC N terminalSeqID_120574 SeqID_120702domainPTPLASeqID_120767Protein tyrosine phosphatase-likeprotein, PPfkBSeqID_122553 SeqID_123583pfkB family carbohydrate kinaseDSPcSeqID_122616 SeqID_122746Dual specificity phosphatase,SeqID_124323 SeqID_119299catalytic domaSeqID_119527 SeqID_120116SeqID_123836 SeqID_124056Biotin_lipoylSeqID_120976Biotin-requiring enzymePkinase_CSeqID_122758 SeqID_122978Protein kinase C terminal domainSeqID_119365 SeqID_121625SeqID_120796DADSeqID_119716DAD familyAlpha_adaptin_CSeqID_120950Alpha adaptin AP2, C-terminaldomainRibosomal_L6eSeqID_122489 SeqID_122949Ribosomal protein L6eSeqID_121011 SeqID_123683S1SeqID_122067 SeqID_123689S1 RNA binding domainSeqID_123690Oxidored_q6SeqID_121980 SeqID_122154NADH ubiquinone oxidoreductase,SeqID_119667 SeqID_12408220 Kd subExtensin_2SeqID_120572 SeqID_120593Extensin-like regionSeqID_120713Gar1SeqID_119747Gar1 protein RNA binding regionS4SeqID_122623 SeqID_121533S4 domainSeqID_121546 SeqID_120098SeqID_124001BromodomainSeqID_119794BromodomainLaminin_NSeqID_119297 SeqID_119335Laminin N-terminal (Domain VI)CDISeqID_122878Cyclin-dependent kinase inhibitorMago_nashiSeqID_123521 SeqID_120904Mago nashi proteinSNF7SeqID_122586 SeqID_119719SNF7SeqID_119746 SeqID_120690SeqID_121136 SeqID_124098ShTKSeqID_122581 SeqID_119601ShTK domainSeqID_120015 SeqID_120669tRNA_antiSeqID_122633 SeqID_120602OB-fold nucleic acid bindingSeqID_124327domainLinker_histoneSeqID_122931 SeqID_120058linker histone H1 and H5 familyDAOSeqID_122333 SeqID_122338FAD dependent oxidoreductaseSeqID_122411 SeqID_120662SeqID_121620 SeqID_123933NDUF_B7SeqID_120001 SeqID_122009NADH-ubiquinone oxidoreductaseSeqID_122777 SeqID_123519B18 subunit (SeqID_123938 SeqID_124080Ribosomal_L34eSeqID_122631 SeqID_123111Ribsomal protein L34eSeqID_123332 SeqID_119760SeqID_123896DUF906SeqID_119497Domain of Unknown Function(DUF906)SPC12SeqID_122753 SeqID_121566Microsomal signal peptidase 12 kDaSeqID_121590 SeqID_124248subunit (CLN3SeqID_123346 SeqID_120607CLN3 proteinSeqID_120681RVT_1SeqID_119187 SeqID_119374Reverse transcriptase (RNA-SeqID_119417 SeqID_119451dependent DNA polSeqID_119461 SeqID_119462SeqID_119508 SeqID_119566SeqID_119662 SeqID_119663SeqID_119834 SeqID_119855SeqID_119943 SeqID_120004SeqID_120103 SeqID_120138SeqID_120352 SeqID_120433SeqID_120465 SeqID_120471SeqID_120546 SeqID_120566SeqID_120586 SeqID_120687SeqID_120716 SeqID_120852SeqID_120935 SeqID_121043SeqID_121058 SeqID_121128SeqID_121129 SeqID_121179Gp_dh_CSeqID_121978 SeqID_122147Glyceraldehyde 3-phosphateSeqID_121379 SeqID_121400dehydrogenase, C-SeqID_121457 SeqID_119718SeqID_123607Ldi_recept_bSeqID_119985 SeqID_120019Low-density lipoprotein receptorrepeatF_actin_cap_BSeqID_122095 SeqID_123254F-actin capping protein, betaSeqID_123555subunitMethyltransf_8SeqID_122377 SeqID_119368Hypothetical methyltransferaseSeqID_123981Mt_ATP-synt_BSeqID_122761 SeqID_120653Mitochondrial ATP synthase BSeqID_124245 SeqID_121789chain precKAP_NTPaseSeqID_120368KAP family P-loop domainMt_ATP-synt_DSeqID_122314 SeqID_123912ATP synthase D chain,mitochondrial (ATSAC3_GANPSeqID_121169SAC3/GANP familyGp_dh_NSeqID_121986 SeqID_122098Glyceraldehyde 3-phosphateSeqID_122147 SeqID_123064dehydrogenase, NASeqID_123079 SeqID_123357SeqID_123457 SeqID_121379SeqID_121457 SeqID_123840SeqID_123607 SeqID_123870SeqID_124339 SeqID_121930An_peroxidaseSeqID_120092 SeqID_120241Animal haem peroxidaseSeqID_120906EphrinSeqID_119824Ephrinpolyprenyl_syntSeqID_120635Polyprenyl synthetaseNeur_chan_membSeqID_121005 SeqID_121087Neurotransmitter-gated ion-channel trazf-NPL4SeqID_123465 SeqID_119522NPL4 family, putative zinc bindingregionXAP5SeqID_122814 SeqID_122850XAP5 proteinSeqID_121167RNA_polSeqID_120255DNA-dependent RNA polymeraseNMT_CSeqID_121407 SeqID_120382Myristoyl-CoA: protein N-myristoyltransferaseAldose_epimSeqID_122507 SeqID_120204Aldose 1-epimeraseSeqID_123603DUF841SeqID_120243Eukaryotic protein of unknownfunction (DUF8Mov34SeqID_121290 SeqID_121294Mov34/MPN/PAD-1 familySeqID_122120 SeqID_122260SeqID_122735 SeqID_121344SeqID_122844 SeqID_123535SeqID_121382 SeqID_121417SeqID_119999 SeqID_120711SeqID_123682 SeqID_121693SeqID_124040 SeqID_124087SeqID_121736NAD_binding_1SeqID_122444 SeqID_123823Oxidoreductase NAD-bindingdomainRibosomal_L28eSeqID_122648 SeqID_120682Ribosomal L28e protein familySeqID_123720LIMSeqID_122321 SeqID_123039LIM domainSeqID_123158 SeqID_123185SeqID_119388 SeqID_119555SeqID_119701 SeqID_120218SeqID_123816 SeqID_121638SeqID_124168SPC25SeqID_122737 SeqID_124254Microsomal signal peptidase 25 kDasubunit (WGRSeqID_121544WGR domainSTT3SeqID_120152Oligosaccharyl transferase STT3subunWH2SeqID_119377WH2 motif14-3-3SeqID_122166 SeqID_12286214-3-3 proteinSeqID_123514 SeqID_119268SeqID_119536 SeqID_119605Alpha_adaptinC2SeqID_120950Adaptin C-terminal domainCbiASeqID_122680CobQ/CobB/MinD/ParA nucleotidebinding dozf-MIZSeqID_120096MIZ zinc fingerLipocalinSeqID_122101 SeqID_122172Lipocalin/cytosolic fatty-acidSeqID_123222 SeqID_119931binding prSeqID_120164 SeqID_123572SeqID_123598 SeqID_121698DLICSeqID_122981Dynein light intermediate chain(DLIC)tRNA-synt_1c_CSeqID_120890tRNA synthetases class I (E andQ), anBestrophinSeqID_119636BestrophineIF-3_zetaSeqID_122685 SeqID_120237Eukaryotic translation initiationSeqID_123560factor 3Porin_3SeqID_122330 SeqID_121328Eukaryotic porinSeqID_119190 SeqID_121557SeqID_121575 SeqID_123999ARIDSeqID_120573ARID/BRIGHT DNA bindingdomainCybSSeqID_122786 SeqID_124202CybSBCAS2SeqID_122510 SeqID_123556Breast carcinoma amplifiedsequence 2 (BCAS2Motile_SpermSeqID_122721 SeqID_123310MSP (Major sperm protein) domainSeqID_119174 SeqID_119175SeqID_119281 SeqID_119604SeqID_119641 SeqID_119724SeqID_120035 SeqID_120095SeqID_120327 SeqID_120441SeqID_120517 SeqID_120563SeqID_120632 SeqID_120633SeqID_120910 SeqID_124175SeqID_121850Transket_pyrSeqID_122651 SeqID_119346Transketolase, pyridine bindingSeqID_120804 SeqID_120844domaiSeqID_123846FibrillarinSeqID_123301 SeqID_119795FibrillarinPABPSeqID_122526 SeqID_121469Poly-adenylate binding protein,SeqID_123820 SeqID_121869unique domaiBRCTSeqID_121491 SeqID_120459BRCA1 C Terminus (BRCT)SeqID_121621domainPsf2SeqID_119632Partner of SLD five, PSF2tRNA-synt_1SeqID_123431 SeqID_119678tRNA synthetases class I (I, L, MSeqID_121502and V)Psf3SeqID_122668 SeqID_123586Partner of SLD five, PSF3tRNA-synt_2SeqID_122301 SeqID_124123tRNA synthetases class II (D, KSeqID_119833 SeqID_124052and N)SeqID_121567 SeqID_121591SeqID_121045NDKSeqID_121977 SeqID_122294Nucleoside diphosphate kinaseSeqID_121345 SeqID_123201SeqID_123452 SeqID_123451SeqID_121388 SeqID_123559ATP-synt_DE_NSeqID_122751 SeqID_121401ATP synthase, Delta/EpsilonSeqID_124264 SeqID_123752chain, betazf-C4SeqID_122310 SeqID_122617Zinc-finger, C4 type (two domains)SeqID_122812 SeqID_123080SeqID_123445 SeqID_119666SeqID_120699 SeqID_120820SeqID_124333DIRPSeqID_119217DIRPRibosomal_L36eSeqID_122112 SeqID_123552Ribosomal protein L36eFilamentSeqID_121260 SeqID_122400Intermediate filament proteinSeqID_122546 SeqID_123386SeqID_119327 SeqID_121473SeqID_120922 SeqID_123579SeqID_123618 SeqID_123953TFIID_30 kDaSeqID_120863Transcription initiation factor TFIID23-DUF926SeqID_120692Domain of Unknown Function(DUF926)DUF854SeqID_119289 SeqID_120777Caenorhabditis elegans repeat ofunknown funTPP_enzyme_MSeqID_120595Thiamine pyrophosphate enzyme,central dPPI_Ypi1SeqID_122389 SeqID_120052Protein phosphatase inhibitorSeqID_123885Myosin_headSeqID_119861Myosin head (motor domain)MH1SeqID_122197 SeqID_123448MH1 domainSeqID_121021RWDSeqID_122537 SeqID_120442RWD domainDUF858SeqID_122961Eukaryotic protein of unknownfunction (DUF83Beta_HSDSeqID_123272 SeqID_1213963-beta hydroxysteroidSeqID_119630 SeqID_121143dehydrogenase/isomeraBIRSeqID_123175 SeqID_119442Inhibitor of Apoptosis domainSeqID_121665MTHFRSeqID_122188 SeqID_120829MethylenetetrahydrofolateSeqID_124115reductaseGYFSeqID_121994GYF domainE1_dhSeqID_122564 SeqID_122651Dehydrogenase E1 componentSeqID_123263 SeqID_123821SeqID_123846Fork_headSeqID_120305Fork head domainDUF1604SeqID_119654Protein of unknown function(DUF1604)OST3_OST6SeqID_123025OST3/OST6 familyCadherinSeqID_119235 SeqID_119705Cadherin domainPPTASeqID_122418 SeqID_123390Protein prenyltransferase alphaSeqID_121057 SeqID_123924subunit repeGCV_HSeqID_122462 SeqID_123213Glycine cleavage H-proteinSeqID_119530Aldolase_IISeqID_121238 SeqID_122401Class II Aldolase and Adducin N-SeqID_122499 SeqID_121462terminalSeqID_121475 SeqID_123767SeqID_121689 SeqID_123952SeqID_121934AIG1SeqID_119367AIG1 familyRNase_PHSeqID_122716 SeqID_1228873′ exoribonuclease family, domain 1SeqID_120679 SeqID_124199Ribosomal_L18aeSeqID_122646 SeqID_122664Ribosomal L18ae protein familySeqID_122858 SeqID_123114SeqID_123121 SeqID_120878SeqID_123624 SeqID_123886SeqID_124236 SeqID_121791SeqID_121797 SeqID_121925Nucleoside_tranSeqID_123441 SeqID_124095Nucleoside transporterRibosomal_L37eSeqID_122591 SeqID_121170Ribosomal protein L37eSeqID_123963PrefoldinSeqID_120211 SeqID_123679Prefoldin subunitBeta-lactamaseSeqID_122907 SeqID_120056Beta-lactamasePC_repSeqID_122969Proteasome/cyclosome repeatDEADSeqID_122207 SeqID_122277DEAD/DEAH box helicaseSeqID_122318 SeqID_122319SeqID_122381 SeqID_123007SeqID_123018 SeqID_119170SeqID_121414 SeqID_121443SeqID_119233 SeqID_119256SeqID_119300 SeqID_119324SeqID_119657 SeqID_119712SeqID_119766 SeqID_120168SeqID_120342 SeqID_120509SeqID_120575 SeqID_121032SeqID_121200 SeqID_123835SeqID_123976 SeqID_124021SeqID_124074 SeqID_124119SURF4SeqID_120816SURF4 familyNCD3GSeqID_120999Nine Cysteines Domain of family 3GPCRSURF6SeqID_122863Surfeit locus protein 6Sec10SeqID_120206Exocyst complex componentSec10Oxidored_molybSeqID_122426 SeqID_123908Oxidoreductase molybdopterinbinding dCation_effluxSeqID_119547 SeqID_119854Cation efflux familyHisKA_2SeqID_121848Histidine kinaseRNA_pol_Rpb5_CSeqID_122441 SeqID_123528RNA polymerase Rpb5, C-terminalSeqID_123883domaindUTPaseSeqID_120762dUTPaseCalx-betaSeqID_120919Calx-beta domainFA_desaturaseSeqID_122220 SeqID_122258Fatty acid desaturaseBRF1SeqID_119280Brf1-like TBP-binding domainW2SeqID_119265 SeqID_121487eIF4-gamma/eIF5/eIF2-epsilonPIP5KSeqID_123028Phosphatidylinositol-4-phosphate5-KinaseRibosomal_L35AeSeqID_122114 SeqID_122827Ribosomal protein L35AeSeqID_123435 SeqID_123889SeqID_124288RTC_insertSeqID_122518 SeqID_120776RNA 3′-terminal phosphate cyclaseSeqID_123973(RTC), iSKIP_SNWSeqID_121033SKIP/SNW domainPAP_assocSeqID_122124 SeqID_123088PAP/25A associated domainSeqID_120195DNA_pol_E_BSeqID_122568 SeqID_124351DNA polymerase epsilon subunit BRNA_pol_Rpb5_NSeqID_122441 SeqID_123137RNA polymerase Rpb5, N-terminalSeqID_123528 SeqID_119977domainSeqID_123883Vicilin_NSeqID_121108Vicilin N terminal regionDEPSeqID_120527Domain found in Dishevelled, Egl-10, and PleCytochrom_CSeqID_122036 SeqID_123651Cytochrome cRibosomal_L38eSeqID_122040 SeqID_119735Ribosomal L38e protein familySeqID_123834GRIM-19SeqID_123149 SeqID_119221GRIM-19 proteinSeqID_121885DUF947SeqID_120569Domain of unknown function(DUF947)DnaJSeqID_121245 SeqID_122232DnaJ domainSeqID_122521 SeqID_122541SeqID_122667 SeqID_122722SeqID_119277 SeqID_119288SeqID_119644 SeqID_120879SeqID_120983 SeqID_121206SeqID_123640 SeqID_123715SeqID_123882 SeqID_124112SeqID_124311G6PD_CSeqID_122743 SeqID_124271Glucose-6-phosphatedehydrogenase, C-terminaPHO4SeqID_122752 SeqID_120810Phosphate transporter familySeqID_124256ReprolysinSeqID_120801Reprolysin (M12B) family zincmetalloproteMITSeqID_120407MIT domainLRR_1SeqID_122148 SeqID_122601Leucine Rich RepeatSeqID_119275 SeqID_119350SeqID_120299 SeqID_120639SeqID_121187 SeqID_123576SeqID_123887Ribosomal_S21eSeqID_123512 SeqID_119578Ribosomal protein S21eSeqID_123570tRNA-synt_1d_CSeqID_122724DALR anticodon binding domainRNA_pol_A_bacSeqID_120230 SeqID_121039RNA polymerase Rpb3/RpoAinsert domainKOWSeqID_122127 SeqID_122516KOW motifSeqID_122554 SeqID_122701SeqID_122710 SeqID_123295SeqID_123458 SeqID_119904SeqID_123548 SeqID_123582SeqID_123634 SeqID_123734ECHSeqID_122281 SeqID_122454Enoyl-CoA hydratase/isomeraseSeqID_119372 SeqID_123865familySeqID_124071 SeqID_121747IF_tailSeqID_121260 SeqID_122400Intermediate filament tail domainSeqID_120920 SeqID_123579PANSeqID_120695PAN domainzf-C3HC4SeqID_122347 SeqID_122496Zinc finger, C3HC4 type (RINGSeqID_123384 SeqID_123511finger)SeqID_119241 SeqID_119357SeqID_120644 SeqID_123692SeqID_124075WWSeqID_122257 SeqID_122416WW domainSeqID_119608 SeqID_120846SeqID_120854 SeqID_123538PB1SeqID_120217PB1 domainNOG1SeqID_119367Nucleolar GTP-binding protein 1(NOG1)PASSeqID_122206PAS domainEI24SeqID_121718Etoposide-induced protein 2.4(EI24)MIF4GSeqID_120041MIF4G domainPI3_PI4_kinaseSeqID_119291 SeqID_119922Phosphatidylinositol 3- and 4-SeqID_120521 SeqID_121582kinaseSeqID_120853 SeqID_121831PAZSeqID_121243 SeqID_119347PAZ domainSeqID_119944 SeqID_121539SeqID_120129 SeqID_121552SeqID_121086 SeqID_121913Cpn60_TCP1SeqID_122080 SeqID_122245TCP-1/cpn60 chaperonin familySeqID_123243 SeqID_123369SeqID_119577 SeqID_119896SeqID_121517 SeqID_120191SeqID_120292 SeqID_120613SeqID_120876 SeqID_121091SeqID_123615 SeqID_123735SeqID_121655 SeqID_123955Tim17SeqID_122097 SeqID_124320Tim17/Tim22/Tim23 familySeqID_121063Ligase_CoASeqID_122725 SeqID_124060CoA-ligaseTrehalaseSeqID_121616TrehalasePQ-loopSeqID_122891 SeqID_123245PQ loop repeatSeqID_123591TTLSeqID_119185Tubulin-tyrosine ligase familyMyb_DNA-bindingSeqID_119288 SeqID_119345Myb-like DNA-binding domainSeqID_120201 SeqID_121080SeqID_121197Ribonuclease_3SeqID_121044RNase3 domainRibophorin_ISeqID_121320 SeqID_120333Ribophorin ISeqID_121666CAS_CSE1SeqID_119961CAS/CSE protein, C-terminusPex2_Pex12SeqID_122740 SeqID_123980Pex2/Pex12 amino terminalregion
Table 1 Legend:

Column 1 - pfam name or designation

Column 2 - gene family member listed by SEQ ID NO corresponding to amino acid sequence translation from vcDNA SEQ ID NO identified in feature field of peptide sequence

Column 3 - Protein annotation based on BLASTP comparisons


In order to construct a dsRNA sequence, or concatamers or chimeras of dsRNA sequences from various genes either within SCN, from other pest nucleotide sequences, or a combination thereof, nucleotide sequences corresponding to the SCN genome sequences were BLASTed against known vertebrate, soybean, and Rhizobium nucleotide sequences to first eliminate contiguous sequences in SCN that matched sequences in known vertebrate, soybean, and Rhozobium sequences that are at least about 21 nucleotides in length. This redacted set of SCN sequences was then compared to known nucleotide sequences in parasitic nematodes, insects, and fungi to identify sequences of substantial identity that could be useful in constructing sequences that, when expressed as a dsRNA sequence, are capable of effecting gene suppression in SCN as well as in another parasitic nematode, or insect, or fungal pest. The results of comparisons to other parasitic nematode sequences are shown in Table 2.

TABLE 2Diverse Parasitic Nematode Nucleotide Coding SequencesMatching H. glycines vcDNA sequencesH. glycinesSequence1Position2GeneID3Position4% identity5Seq ID NO: 5095021-138gi|289160761-11991%Seq ID NO: 5095014-41gi|15947334-61100%Seq ID NO: 5095014-41gi|2454547227-200100%Seq ID NO: 5095014-41gi|551594194-221100%Seq ID NO: 5095014-41gi|551595669-696100%Seq ID NO: 5095014-41gi|1803225434-61100%Seq ID NO: 5095018-41gi|18477256279-256100%Seq ID NO: 5095018-41gi|18477260325-302100%Seq ID NO: 5095018-41gi|18477262634-611100%Seq ID NO: 5095014-35gi|1847725922-1100%Seq ID NO: 5095014-35gi|1847726122-1100%Seq ID NO: 5095014-35gi|3778096822-1100%Seq ID NO: 5097388-108gi|19267657465-445100%Seq ID NO: 51012302-411gi|18080245178-6984%Seq ID NO: 51132182-360gi|323239555-18396%Seq ID NO: 51132184-359gi|331397786-18296%Seq ID NO: 51132182-351gi|331402745-17495%Seq ID NO: 51132195-360gi|323245101-16695%Seq ID NO: 51132218-354gi|331402641-13795%Seq ID NO: 51132182-284gi|323242195-10794%Seq ID NO: 51164117-137gi|2254364532-12100%Seq ID NO: 511691-236gi|47118285740-97597%Seq ID NO: 511691-236gi|47118286753-98897%Seq ID NO: 511691-219gi|16797830731-95097%Seq ID NO: 511691-219gi|21885260751-96997%Seq ID NO: 511691-219gi|26000759727-94597%Seq ID NO: 511691-219gi|31442322728-94697%Seq ID NO: 511691-219gi|31442320728-94696%Seq ID NO: 511694-219gi|16797831732-94796%Seq ID NO: 511691-219gi|38096133730-94894%Seq ID NO: 511691-33gi|16797827776-80796%Seq ID NO: 51169131-159gi|16797844862-89096%Seq ID NO: 51169216-236gi|484797191112-1132100%Seq ID NO: 5118416-47gi|32324419271-302100%Seq ID NO: 51378184-459gi|3232533536-31197%Seq ID NO: 51378504-626gi|32325335311-433100%Seq ID NO: 51378753-841gi|32325335453-54197%Seq ID NO: 51378101-139gi|323253351-39100%Seq ID NO: 51378676-702gi|32325335430-456100%Seq ID NO: 51561402-476gi|3313958480-6100%Seq ID NO: 51561192-229gi|33139584115-78100%Seq ID NO: 51561192-215gi|696991034-11100%Seq ID NO: 515791-138gi|33139914209-34697%Seq ID NO: 5182355-187gi|1808106080-21286%Seq ID NO: 518243-97gi|54548889100-19487%Seq ID NO: 518243-97gi|18089540421-51585%Seq ID NO: 518243-79gi|18081060404-48087%Seq ID NO: 519781-314gi|3313907185-39796%Seq ID NO: 51978307-395gi|33139071406-49497%Seq ID NO: 51978406-452gi|33139071505-55197%Seq ID NO: 51978406-475gi|32325389507-57697%Seq ID NO: 519781-237gi|3314052187-32394%Seq ID NO: 51991239-260gi|3002835764-85100%Seq ID NO: 520511-78gi|33140098496-573100%Seq ID NO: 520511-74gi|32324423496-569100%Seq ID NO: 520511-61gi|33139933499-559100%Seq ID NO: 520511-58gi|33139533495-552100%Seq ID NO: 5205156-168gi|18082418136-24885%Seq ID NO: 52162163-185gi|40670360123-101100%Seq ID NO: 52240388-410gi|3851367344-22100%Seq ID NO: 52285172-204gi|18090560293-325100%Seq ID NO: 522931-133gi|5454920031-16391%Seq ID NO: 5229311-133gi|714402438-16091%Seq ID NO: 5229330-133gi|180904161-10491%Seq ID NO: 5229311-133gi|545450229-13188%Seq ID NO: 5229341-133gi|545455911-9391%Seq ID NO: 523111-75gi|30028941235-30996%Seq ID NO: 523111-75gi|33140204235-30996%Seq ID NO: 524321-102gi|35504996472-57496%Seq ID NO: 524321-58gi|30028727523-580100%Seq ID NO: 524321-54gi|30028246523-576100%Seq ID NO: 524321-39gi|35505048461-499100%Seq ID NO: 52438179-210gi|183830561007-976100%Seq ID NO: 525691-311gi|3232463149-35995%Seq ID NO: 5259677-218gi|30028020420-27997%Seq ID NO: 5264279-324gi|33139639249-49386%Seq ID NO: 5264266-218gi|33140673201-4991%Seq ID NO: 52642286-342gi|32324211207-26391%Seq ID NO: 5271025-152gi|331403741-12894%Seq ID NO: 527621-90gi|341058131098-100998%Seq ID NO: 527621-90gi|27077481096-100797%Seq ID NO: 527621-90gi|510939621079-99097%Seq ID NO: 527621-90gi|313763221040-95295%Seq ID NO: 527621-90gi|310742781105-101794%Seq ID NO: 527621-90gi|313763231036-94894%Seq ID NO: 527621-90gi|69839591059-97194%Seq ID NO: 527623-90gi|310742791103-101794%Seq ID NO: 527621-87gi|341058061106-102094%Seq ID NO: 527621-87gi|308441801105-101993%Seq ID NO: 527621-90gi|305258121092-100292%Seq ID NO: 527621-90gi|313763311031-94292%Seq ID NO: 527621-90gi|54873771134-4492%Seq ID NO: 527621-64gi|341058101104-104198%Seq ID NO: 528311-47gi|32324245540-586100%Seq ID NO: 528311-40gi|33139824507-546100%Seq ID NO: 528312-101gi|18089712314-41384%Seq ID NO: 528311-29gi|33140223507-535100%Seq ID NO: 528312-52gi|7143499320-37186%Seq ID NO: 528311-23gi|33140453507-529100%Seq ID NO: 52854153-174gi|21378611322-301100%Seq ID NO: 5296521-138gi|289160761-11991%Seq ID NO: 5296514-41gi|15947334-61100%Seq ID NO: 5296514-41gi|2454547227-200100%Seq ID NO: 5296514-41gi|551594194-221100%Seq ID NO: 5296514-41gi|551595669-696100%Seq ID NO: 5296514-41gi|1803225434-61100%Seq ID NO: 5296518-41gi|18477256279-256100%Seq ID NO: 5296518-41gi|18477260325-302100%Seq ID NO: 5296518-41gi|18477262634-611100%Seq ID NO: 5296514-35gi|1847725922-1100%Seq ID NO: 5296514-35gi|1847726122-1100%Seq ID NO: 5296514-35gi|3778096822-1100%Seq ID NO: 52976617-707gi|32325459205-29083%Seq ID NO: 5298115-425gi|3314013692-50287%Seq ID NO: 53038793-899gi|18090533269-37589%Seq ID NO: 53058469-594gi|18081946320-19590%Seq ID NO: 53058172-199gi|1766021225-25296%Seq ID NO: 53058172-199gi|7641281173-20096%Seq ID NO: 5311073-218gi|3313913199-24489%Seq ID NO: 5311079-218gi|323242111-14088%Seq ID NO: 5311068-218gi|33140673199-4985%Seq ID NO: 5311086-218gi|33139639256-38884%Seq ID NO: 53110288-314gi|33139639467-49396%Seq ID NO: 5311315-54gi|289160761-4097%Seq ID NO: 531138-35gi|15947334-61100%Seq ID NO: 531138-35gi|2454547227-200100%Seq ID NO: 531138-35gi|551594194-221100%Seq ID NO: 531138-35gi|551595669-696100%Seq ID NO: 531138-35gi|1803225434-61100%Seq ID NO: 5311312-35gi|18477256279-256100%Seq ID NO: 5311312-35gi|18477260325-302100%Seq ID NO: 5311312-35gi|18477262634-611100%Seq ID NO: 531138-29gi|1847725922-1100%Seq ID NO: 531138-29gi|1847726122-1100%Seq ID NO: 531138-29gi|3778096822-1100%Seq ID NO: 5334533-68gi|32324562547-51297%Seq ID NO: 53381122-150gi|18382995332-30496%Seq ID NO: 5344420-42gi|800519186-64100%Seq ID NO: 535841-223gi|3313986241-26399%Seq ID NO: 53584266-445gi|33139862262-44199%Seq ID NO: 535841-129gi|32325297437-565100%Seq ID NO: 53584601-702gi|33139953453-55499%Seq ID NO: 535841-146gi|22544160265-41086%Seq ID NO: 53584767-855gi|7143567305-21788%Seq ID NO: 535841-128gi|30167496229-35682%Seq ID NO: 5377115-54gi|289160761-4097%Seq ID NO: 537718-35gi|15947334-61100%Seq ID NO: 537718-35gi|2454547227-200100%Seq ID NO: 537718-35gi|551594194-221100%Seq ID NO: 537718-35gi|551595669-696100%Seq ID NO: 537718-35gi|1803225434-61100%Seq ID NO: 5377112-35gi|18477256279-256100%Seq ID NO: 5377112-35gi|18477260325-302100%Seq ID NO: 5377112-35gi|18477262634-611100%Seq ID NO: 537718-29gi|1847725922-1100%Seq ID NO: 537718-29gi|1847726122-1100%Seq ID NO: 537718-29gi|3778096822-1100%Seq ID NO: 53788196-219gi|1926581151-74100%Seq ID NO: 5390349-125gi|51334373320-39690%Seq ID NO: 53903281-309gi|7143754460-488100%Seq ID NO: 53903281-309gi|54546250453-481100%Seq ID NO: 539421-127gi|33140348254-12887%Seq ID NO: 5399695-170gi|18080046260-33490%Seq ID NO: 54067614-700gi|1808012472-15984%Seq ID NO: 54067630-658gi|18082691129-15796%Seq ID NO: 542231-32gi|18082819494-463100%Seq ID NO: 542231-32gi|32324738389-420100%Seq ID NO: 542231-32gi|54546964145-176100%Seq ID NO: 542289-154gi|331390175-15099%Seq ID NO: 54361453-568gi|3314074429-14184%Seq ID NO: 5439496-242gi|33140673195-4990%Seq ID NO: 54394110-281gi|33139639256-42888%Seq ID NO: 544041-41gi|18381736808-84890%Seq ID NO: 5440419-63gi|18382098510-55489%Seq ID NO: 5440419-41gi|18381713520-542100%Seq ID NO: 5441169-177gi|1809057965-17686%Seq ID NO: 54431127-151gi|18089765337-361100%Seq ID NO: 45816323-510gi|52128303201-38881%Seq ID NO: 54533295-336gi|714432964-10597%Seq ID NO: 5459817-91gi|33139131116-19092%Seq ID NO: 545986-97gi|323242111-9287%Seq ID NO: 54650161-185gi|32183975527-551100%Seq ID NO: 546641-99gi|33139292225-323100%Seq ID NO: 547091-91gi|6562543742-83298%Seq ID NO: 5470922-59gi|18382129349-38692%Seq ID NO: 547101-112gi|1809057965-17690%Seq ID NO: 5478221-138gi|289160761-11991%Seq ID NO: 5478214-41gi|15947334-61100%Seq ID NO: 5478214-41gi|2454547227-200100%Seq ID NO: 5478214-41gi|551594194-221100%Seq ID NO: 5478214-41gi|551595669-696100%Seq ID NO: 5478214-41gi|1803225434-61100%Seq ID NO: 5478218-41gi|18477256279-256100%Seq ID NO: 5478218-41gi|18477260325-302100%Seq ID NO: 5478218-41gi|18477262634-611100%Seq ID NO: 5478214-35gi|1847725922-1100%Seq ID NO: 5478214-35gi|1847726122-1100%Seq ID NO: 5478214-35gi|3778096822-1100%Seq ID NO: 5481173-94gi|7641297281-302100%Seq ID NO: 54839194-282gi|33139570594-506100%Seq ID NO: 54839198-282gi|30029553597-513100%Seq ID NO: 54839131-282gi|18079986658-50788%Seq ID NO: 54839218-282gi|33139565577-513100%Seq ID NO: 54839132-220gi|21493943587-49989%Seq ID NO: 54839232-282gi|32324861563-513100%Seq ID NO: 54839240-282gi|32324744548-50697%Seq ID NO: 548473-101gi|28916076176-7989%Seq ID NO: 5486554-202gi|33140673199-5188%Seq ID NO: 5486561-202gi|33139639245-38688%Seq ID NO: 5486559-199gi|3313913199-23986%Seq ID NO: 45847183-308gi|18080348231-35685%Seq ID NO: 45847190-308gi|7144294198-31685%Seq ID NO: 54946128-285gi|54546846415-25785%Seq ID NO: 5498183-126gi|5515941028-107093%Seq ID NO: 5498183-126gi|551595267-30993%Seq ID NO: 54999328-498gi|18382277801-97182%Seq ID NO: 55025372-395gi|33138439263-286100%Seq ID NO: 55025419-439gi|33952838448-428100%Seq ID NO: 5510621-138gi|289160761-11991%Seq ID NO: 5510614-41gi|15947334-61100%Seq ID NO: 5510614-41gi|2454547227-200100%Seq ID NO: 5510614-41gi|551594194-221100%Seq ID NO: 5510614-41gi|551595669-696100%Seq ID NO: 5510614-41gi|1803225434-81100%Seq ID NO: 5510618-41gi|18477256279-256100%Seq ID NO: 5510618-41gi|18477260325-302100%Seq ID NO: 5510618-41gi|18477262634-611100%Seq ID NO: 5510614-35gi|1847725922-1100%Seq ID NO: 5510614-35gi|1847726122-1100%Seq ID NO: 5510614-35gi|3778096822-1100%Seq ID NO: 551081-93gi|714386442-13482%Seq ID NO: 5511735-57gi|7144284325-303100%Seq ID NO: 5511936-155gi|28916076140-2090%Seq ID NO: 551751-160gi|5107411189-30100%Seq ID NO: 551751-160gi|2149587189-3095%Seq ID NO: 5517547-160gi|341058131719-1606100%Seq ID NO: 551751-160gi|2149585189-3092%Seq ID NO: 551751-160gi|2738785190-3092%Seq ID NO: 551751-160gi|2738792190-3092%Seq ID NO: 551751-160gi|2738799190-3092%Seq ID NO: 551751-160gi|2738800190-3092%Seq ID NO: 551752-160gi|27077481766-160790%Seq ID NO: 551751-80gi|48479719191-11298%Seq ID NO: 551751-80gi|37674501194-11598%Seq ID NO: 55175115-160gi|3767450175-3095%Seq ID NO: 551751-78gi|11477292525-244898%Seq ID NO: 55175115-160gi|11477292413-236897%Seq ID NO: 551751-78gi|223202187-1098%Seq ID NO: 552261-24gi|33139696554-577100%Seq ID NO: 552451-25gi|3232495940-6496%Seq ID NO: 55392173-276gi|3550455057-16099%Seq ID NO: 55392326-425gi|35504550164-26397%Seq ID NO: 553921-66gi|3550486210-7489%Seq ID NO: 5542557-187gi|5454547593-22492%Seq ID NO: 55425231-321gi|54545475220-31090%Seq ID NO: 554681-141gi|32325191449-58994%Seq ID NO: 5554575-99gi|32325459266-290100%Seq ID NO: 555531-116gi|33139587121-694%Seq ID NO: 55566167-228gi|18083080182-24388%Seq ID NO: 55640887-1044gi|331395861-15896%Seq ID NO: 5566021-138gi|289160761-11991%Seq ID NO: 5566014-41gi|15947334-61100%Seq ID NO: 5566014-41gi|2454547227-200100%Seq ID NO: 5566014-41gi|551594194-221100%Seq ID NO: 5566014-41gi|551595669-696100%Seq ID NO: 5566014-41gi|1803225434-61100%Seq ID NO: 5566018-41gi|18477256279-256100%Seq ID NO: 5566018-41gi|18477260325-302100%Seq ID NO: 5566018-41gi|18477262634-611100%Seq ID NO: 5566014-35gi|1847725922-1100%Seq ID NO: 5566014-35gi|1847726122-1100%Seq ID NO: 5566014-35gi|3778096822-1100%Seq ID NO: 557191-146gi|18080802354-20997%Seq ID NO: 557191-131gi|7144157137-797%Seq ID NO: 557191-130gi|18088100130-197%Seq ID NO: 557191-124gi|54549688124-196%Seq ID NO: 557191-118gi|18089409118-197%Seq ID NO: 557191-117gi|18090782117-197%Seq ID NO: 5571925-146gi|5212897394-21588%Seq ID NO: 557191-78gi|18083086537-61496%Seq ID NO: 557191-75gi|1809040375-196%Seq ID NO: 5571925-121gi|52129301319-41586%Seq ID NO: 55719100-146gi|521290051-4795%Seq ID NO: 5571983-146gi|9033884441-50390%Seq ID NO: 5571983-146gi|15768080331-39390%Seq ID NO: 5571971-146gi|19383550397-47188%Seq ID NO: 5571983-146gi|39747546344-40690%Seq ID NO: 55719100-146gi|39747568266-31295%Seq ID NO: 55719100-146gi|46984679145-9995%Seq ID NO: 55719105-146gi|15769768391-43297%Seq ID NO: 55719105-146gi|40669510255-29697%Seq ID NO: 55719100-140gi|46986671175-13597%Seq ID NO: 55719108-146gi|313262721-3997%Seq ID NO: 55719105-146gi|159682379-42092%Seq ID NO: 55759302-322gi|4698448489-109100%Seq ID NO: 558081-74gi|54545841238-31191%Seq ID NO: 55827140-161gi|46983788125-146100%Seq ID NO: 5584995-269gi|54548527231-40587%Seq ID NO: 55849138-269gi|545459892-13388%Seq ID NO: 55931140-268gi|33139063130-25896%Seq ID NO: 559311-60gi|3313906371-13098%Seq ID NO: 56068123-187gi|21432594122-18686%Seq ID NO: 562111-101gi|714415739-13997%Seq ID NO: 562111-101gi|5454968826-12696%Seq ID NO: 5621122-101gi|18083086614-53596%Seq ID NO: 5621125-101gi|180904031-7796%Seq ID NO: 5621154-75gi|52129301340-319100%Seq ID NO: 5633336-56gi|33140432264-284100%Seq ID NO: 56402639-718gi|33139730156-7693%Seq ID NO: 564111-73gi|16797830749-821100%Seq ID NO: 564111-73gi|16797832747-819100%Seq ID NO: 564111-73gi|26000759745-817100%Seq ID NO: 564111-73gi|31442320746-818100%Seq ID NO: 564111-73gi|31442322746-81898%Seq ID NO: 564118-73gi|21885259776-84198%Seq ID NO: 45947974-995gi|46988291426-405100%Seq ID NO: 56551187-242gi|183813631139-119491%Seq ID NO: 566401-65gi|3314076516-8098%Seq ID NO: 5669936-174gi|28916076140-192%Seq ID NO: 56699154-181gi|15947361-34100%Seq ID NO: 56699154-181gi|2454547200-227100%Seq ID NO: 56699154-181gi|551594221-194100%Seq ID NO: 56699154-181gi|551595696-669100%Seq ID NO: 56699154-181gi|1803225461-34100%Seq ID NO: 56699154-177gi|18477256256-279100%Seq ID NO: 56699154-177gi|18477260302-325100%Seq ID NO: 56699154-177gi|18477262611-634100%Seq ID NO: 56699160-181gi|184772591-22100%Seq ID NO: 56699160-181gi|184772611-22100%Seq ID NO: 56699160-181gi|377809681-22100%Seq ID NO: 56752348-369gi|723545941-20100%Seq ID NO: 45973404-544gi|54546305483-62388%Seq ID NO: 45973181-293gi|54546305345-45783%Seq ID NO: 45973434-544gi|54544886269-37990%Seq ID NO: 56872764-956gi|33140201149-34199%Seq ID NO: 56872569-719gi|331402011-15198%Seq ID NO: 56889189-333gi|54544976245-38989%Seq ID NO: 5692775-226gi|3313913199-25087%Seq ID NO: 5692781-221gi|323242111-14187%Seq ID NO: 569721-101gi|35504916448-54896%Seq ID NO: 569721-85gi|35504725448-53394%Seq ID NO: 57065168-188gi|339526559-29100%Seq ID NO: 57065168-188gi|46422144284-304100%Seq ID NO: 5708462-214gi|33140673201-4992%Seq ID NO: 5708471-214gi|33139639245-38893%Seq ID NO: 57086190-217gi|54547991157-18496%Seq ID NO: 572201-66gi|375172441816-188196%Seq ID NO: 573515-196gi|32323971353-54498%Seq ID NO: 57416412-434gi|551595853-875100%Seq ID NO: 57422783-808gi|46986223315-290100%Seq ID NO: 574681-221gi|223910836-256100%Seq ID NO: 5753918-223gi|32325381184-38992%Seq ID NO: 57539415-498gi|32325381398-48197%Seq ID NO: 57539174-223gi|331398148-57100%Seq ID NO: 57713253-552gi|3313906828-32797%Seq ID NO: 57713633-701gi|33139068334-40298%Seq ID NO: 57713169-198gi|331390681-30100%Seq ID NO: 57713750-877gi|33139557441-56893%Seq ID NO: 57713176-198gi|331395571-23100%Seq ID NO: 57713750-892gi|33140099424-56692%Seq ID NO: 57713255-552gi|3313907916-31397%Seq ID NO: 57713750-849gi|33139079434-53394%Seq ID NO: 577131-198gi|3232466626-22396%Seq ID NO: 57713633-661gi|32324666527-555100%Seq ID NO: 57713271-552gi|331391031-28297%Seq ID NO: 57713287-554gi|3313930838-30697%Seq ID NO: 57713637-864gi|33139308315-54590%Seq ID NO: 577131232-1423gi|33139521162-35396%Seq ID NO: 57713811-966gi|331395211-15989%Seq ID NO: 57713750-878gi|33140185112-24093%Seq ID NO: 57713636-701gi|331401851-6698%Seq ID NO: 57713300-551gi|3232532951-30290%Seq ID NO: 577131229-1384gi|32325562408-56399%Seq ID NO: 57713750-966gi|32325562189-40889%Seq ID NO: 57713598-701gi|3232556240-14396%Seq ID NO: 57713300-534gi|32324015367-60190%Seq ID NO: 577131229-1374gi|33140373408-553100%Seq ID NO: 57713655-698gi|32325327481-52490%Seq ID NO: 57713655-776gi|32325443481-59983%Seq ID NO: 57713300-533gi|33140304225-45889%Seq ID NO: 57713300-517gi|33140764367-58489%Seq ID NO: 577131229-1347gi|32324619408-526100%Seq ID NO: 57713300-490gi|32325078367-55790%Seq ID NO: 577131254-1423gi|33139825258-42790%Seq ID NO: 57713300-453gi|33139892410-56390%Seq ID NO: 57713300-452gi|33140637410-56290%Seq ID NO: 577131288-1423gi|32324536247-38291%Seq ID NO: 57713414-551gi|3232453610-14789%Seq ID NO: 57713384-551gi|331405891-16888%Seq ID NO: 57713393-551gi|331400744-16288%Seq ID NO: 57713655-896gi|32325371173-41183%Seq ID NO: 57713300-401gi|32324162476-57792%Seq ID NO: 57713428-551gi|32324623225-34888%Seq ID NO: 57713300-375gi|32324623103-17893%Seq ID NO: 57713300-353gi|33139035491-54494%Seq ID NO: 577931-189gi|33139897347-53591%Seq ID NO: 578246-146gi|323242111-14188%Seq ID NO: 5782417-151gi|33139131116-25088%Seq ID NO: 578241-145gi|33140673193-4983%Seq ID NO: 5782413-145gi|33139639256-38882%Seq ID NO: 57824152-172gi|45563795521-501100%Seq ID NO: 579071213-1367gi|18081696127-28185%Seq ID NO: 57996664-684gi|3974757393-113100%Seq ID NO: 580014-65gi|27541441265-32684%Seq ID NO: 46037415-488gi|21393234329-40291%Seq ID NO: 46037117-143gi|21393234148-174100%Seq ID NO: 46037377-488gi|18080246261-37285%Seq ID NO: 46037418-488gi|18381808299-22990%Seq ID NO: 46037427-488gi|52128281271-33291%Seq ID NO: 5817185-106gi|22139607133-112100%Seq ID NO: 58223117-138gi|1674628049-70100%Seq ID NO: 58237123-269gi|33139639245-38889%Seq ID NO: 58237114-269gi|33140673201-4987%Seq ID NO: 58237121-246gi|3313913199-22482%Seq ID NO: 58237333-378gi|32324211211-25691%Seq ID NO: 58264869-1010gi|18082514100-24190%Seq ID NO: 582641203-1298gi|18082514245-34092%Seq ID NO: 58357136-238gi|54549492497-59988%Seq ID NO: 58357369-416gi|54549492633-68091%Seq ID NO: 5838212-159gi|183830121144-129184%Seq ID NO: 584271-143gi|1679783062-20497%Seq ID NO: 584271-143gi|1679783162-20297%Seq ID NO: 584271-143gi|1679783263-20397%Seq ID NO: 584271-143gi|4711828574-21496%Seq ID NO: 584271-142gi|2600076262-20195%Seq ID NO: 584271-143gi|3144232262-20295%Seq ID NO: 584271-143gi|3809613362-20293%Seq ID NO: 58438319-443gi|32324393180-30796%Seq ID NO: 584381-83gi|3232439325-10898%Seq ID NO: 58438154-195gi|32324393107-148100%Seq ID NO: 58438239-273gi|32324393147-181100%Seq ID NO: 58438319-383gi|33140257413-47798%Seq ID NO: 58438154-191gi|33140257345-382100%Seq ID NO: 58478192-238gi|1808926822-6889%Seq ID NO: 5867081-148gi|18090251452-51988%Seq ID NO: 58722346-432gi|1926610421-10784%Seq ID NO: 460831-119gi|33139765458-57699%Seq ID NO: 46083265-305gi|52128426396-43695%Seq ID NO: 46084295-415gi|33139765576-45699%Seq ID NO: 46084479-528gi|33139765454-405100%Seq ID NO: 46084109-149gi|52128426436-39695%Seq ID NO: 58765150-170gi|18081306417-437100%Seq ID NO: 587682-66gi|7143917219-28390%Seq ID NO: 58912483-676gi|7144233308-50187%Seq ID NO: 58912483-667gi|7144339311-49586%Seq ID NO: 58913201-271gi|52128794102-17290%Seq ID NO: 58913187-272gi|46985091319-23485%Seq ID NO: 5891385-121gi|18080422365-40194%Seq ID NO: 58913249-272gi|46985022259-236100%Seq ID NO: 5904823-135gi|33139660491-60398%Seq ID NO: 59083221-245gi|4698719349-2596%Seq ID NO: 591311-107gi|180821683-10994%Seq ID NO: 5913116-108gi|5133450937-12988%Seq ID NO: 5913170-108gi|52129549138-17689%Seq ID NO: 591321-56gi|7641359475-42091%Seq ID NO: 591321-59gi|1408590055-11390%Seq ID NO: 591321-47gi|1408611051-9793%Seq ID NO: 591321-56gi|1576766065-12089%Seq ID NO: 591321-56gi|18495580501-44689%Seq ID NO: 5913219-59gi|3974752761-10192%Seq ID NO: 5920121-138gi|289160761-11991%Seq ID NO: 5920114-41gi|15947334-61100%Seq ID NO: 5920114-41gi|2454547227-200100%Seq ID NO: 5920114-41gi|551594194-221100%Seq ID NO: 5920114-41gi|551595669-696100%Seq ID NO: 5920114-41gi|1803225434-61100%Seq ID NO: 5920118-41gi|18477256279-256100%Seq ID NO: 5920118-41gi|18477260325-302100%Seq ID NO: 5920118-41gi|18477262634-611100%Seq ID NO: 5920114-35gi|1847725922-1100%Seq ID NO: 5920114-35gi|1847726122-1100%Seq ID NO: 5920114-35gi|3778096822-1100%Seq ID NO: 59268131-256gi|33139189152-27799%Seq ID NO: 592681-56gi|3313918996-15198%Seq ID NO: 592693-85gi|7143543482-56488%Seq ID NO: 593111-81gi|7144052441-52190%Seq ID NO: 594841-118gi|18080802329-20995%Seq ID NO: 594841-103gi|7144157112-795%Seq ID NO: 594841-102gi|18088100105-195%Seq ID NO: 594841-96gi|5454968899-193%Seq ID NO: 594841-90gi|1808940993-194%Seq ID NO: 594841-89gi|1809078292-194%Seq ID NO: 5948456-118gi|9033884441-50390%Seq ID NO: 5948456-118gi|15768080331-39390%Seq ID NO: 5948456-118gi|39747546344-40690%Seq ID NO: 5948477-118gi|15769768391-43297%Seq ID NO: 5948477-118gi|19383550430-47197%Seq ID NO: 5948477-118gi|39747568271-31297%Seq ID NO: 5948477-112gi|46986671170-135100%Seq ID NO: 5948480-118gi|313262721-3997%Seq ID NO: 5948477-118gi|159682379-42092%Seq ID NO: 5948477-118gi|15784370177-21892%Seq ID NO: 595301-129gi|33139697192-6397%Seq ID NO: 5955815-54gi|289160761-4097%Seq ID NO: 595588-35gi|15947334-61100%Seq ID NO: 595588-35gi|2454547227-200100%Seq ID NO: 595588-35gi|551594194-221100%Seq ID NO: 595588-35gi|551595669-696100%Seq ID NO: 595588-35gi|1803225434-61100%Seq ID NO: 5955812-35gi|18477256279-256100%Seq ID NO: 5955812-35gi|18477260325-302100%Seq ID NO: 5955812-35gi|18477262634-611100%Seq ID NO: 595588-29gi|1847725922-1100%Seq ID NO: 595588-29gi|1847726122-1100%Seq ID NO: 595588-29gi|3778096822-1100%Seq ID NO: 59597141-230gi|33139639297-38685%Seq ID NO: 59597141-193gi|33140673140-8890%Seq ID NO: 596527-89gi|33140014182-264100%Seq ID NO: 5990194-114gi|7798143245-225100%Seq ID NO: 599111-141gi|22391061166-130798%Seq ID NO: 599111-141gi|375172431166-130798%Seq ID NO: 59944178-340gi|33139639257-41989%Seq ID NO: 59944159-310gi|33140673199-4887%Seq ID NO: 6009981-221gi|323242111-14190%Seq ID NO: 6009992-226gi|33139131116-25089%Seq ID NO: 60099305-349gi|33139131293-33793%Seq ID NO: 6009970-220gi|33140673199-4986%Seq ID NO: 6009989-264gi|33139639257-42984%Seq ID NO: 601551-67gi|7144157200-13491%Seq ID NO: 601551-67gi|54549688187-12191%Seq ID NO: 6020646-72gi|15947361-35100%Seq ID NO: 6020646-72gi|2454547200-226100%Seq ID NO: 6020646-72gi|551594221-195100%Seq ID NO: 6020646-72gi|551595696-670100%Seq ID NO: 6020646-72gi|1803225461-35100%Seq ID NO: 6020646-69gi|18477256256-279100%Seq ID NO: 6020646-69gi|18477260302-325100%Seq ID NO: 6020646-69gi|18477262611-634100%Seq ID NO: 6020652-72gi|184772591-21100%Seq ID NO: 6020652-72gi|184772611-21100%Seq ID NO: 6020652-72gi|377809681-21100%Seq ID NO: 60211305-361gi|18081798206-26291%Seq ID NO: 60211305-361gi|54547765204-26089%Seq ID NO: 6021152-77gi|982921155-80100%Seq ID NO: 6021152-77gi|2165272726-51100%Seq ID NO: 602201-95gi|33140348161-25596%Seq ID NO: 6037815-74gi|289160761-6093%Seq ID NO: 603788-35gi|15947334-61100%Seq ID NO: 603788-35gi|2454547227-200100%Seq ID NO: 603788-35gi|551594194-221100%Seq ID NO: 603788-35gi|551595669-696100%Seq ID NO: 603788-35gi|1803225434-61100%Seq ID NO: 6037812-35gi|18477256279-256100%Seq ID NO: 6037812-35gi|18477260325-302100%Seq ID NO: 6037812-35gi|18477262634-611100%Seq ID NO: 603788-29gi|1847725922-1100%Seq ID NO: 603788-29gi|1847726122-1100%Seq ID NO: 603788-29gi|3778096822-1100%Seq ID NO: 60409273-620gi|323243472-35096%Seq ID NO: 60409680-806gi|32324347351-47798%Seq ID NO: 60409680-784gi|33139657351-45598%Seq ID NO: 60409289-615gi|3314024113-33897%Seq ID NO: 60409677-806gi|33140241341-47098%Seq ID NO: 60409272-619gi|331392591-34895%Seq ID NO: 60409278-620gi|3313900012-35596%Seq ID NO: 60409272-615gi|331394781-33996%Seq ID NO: 60409268-620gi|323249191-35495%Seq ID NO: 60409273-609gi|331393902-33995%Seq ID NO: 60409391-620gi|3314035066-29696%Seq ID NO: 60409273-337gi|331403502-6693%Seq ID NO: 60409498-615gi|331396831-11898%Seq ID NO: 604951-88gi|5107411569-656100%Seq ID NO: 604951-88gi|16797830379-466100%Seq ID NO: 604951-88gi|16797831377-46498%Seq ID NO: 604951-88gi|16797832378-46598%Seq ID NO: 604951-88gi|31442322377-46498%Seq ID NO: 604955-88gi|26000760381-46498%Seq ID NO: 604951-88gi|38096135377-46595%Seq ID NO: 6049512-86gi|31442314392-46793%Seq ID NO: 604954-86gi|38096138386-46890%Seq ID NO: 604953-86gi|16797838399-48289%Seq ID NO: 60532243-263gi|4698692844-64100%Seq ID NO: 60533237-257gi|4698692864-44100%Seq ID NO: 60534122-248gi|33139568363-48999%Seq ID NO: 605341-75gi|33139568293-36798%Seq ID NO: 605961-99gi|33139131122-22089%Seq ID NO: 6059629-92gi|33140673143-8087%Seq ID NO: 60782367-405gi|5454798161-2494%Seq ID NO: 60782792-812gi|1943590124-4100%Seq ID NO: 60783218-256gi|5454798124-6194%Seq ID NO: 6078836-165gi|28916076140-1092%Seq ID NO: 6064426-217gi|1808021620-21186%Seq ID NO: 609471-85gi|33140348171-25597%Seq ID NO: 6105173-95gi|39746742647-625100%Seq ID NO: 61179139-160gi|34316675229-208100%Seq ID NO: 6121776-324gi|33139639245-49387%Seq ID NO: 6121771-219gi|33140673197-4989%Seq ID NO: 61217115-219gi|3232421136-14088%Seq ID NO: 6130336-174gi|28916076140-192%Seq ID NO: 61303154-181gi|15947361-34100%Seq ID NO: 61303154-181gi|2454547200-227100%Seq ID NO: 61303154-181gi|551594221-194100%Seq ID NO: 61303154-181gi|551595696-669100%Seq ID NO: 61303154-181gi|1803225461-34100%Seq ID NO: 61303154-177gi|18477256256-279100%Seq ID NO: 61303154-177gi|18477260302-325100%Seq ID NO: 61303154-177gi|18477262611-634100%Seq ID NO: 61303160-181gi|184772591-22100%Seq ID NO: 61303160-181gi|184772611-22100%Seq ID NO: 61303160-181gi|377809681-22100%Seq ID NO: 613514-140gi|18080057229-9385%Seq ID NO: 613514-140gi|54547707233-9784%Seq ID NO: 61352186-312gi|18080057303-42990%Seq ID NO: 613521-134gi|18080057159-29288%Seq ID NO: 613521-139gi|54547707163-30088%Seq ID NO: 61352186-312gi|54547707307-43389%Seq ID NO: 61352363-405gi|54547707443-48590%Seq ID NO: 6152629-138gi|289160769-11991%Seq ID NO: 6155331-90gi|51237572280-33986%Seq ID NO: 462461-165gi|33139608404-56798%Seq ID NO: 462461-62gi|32323997404-46498%Seq ID NO: 61561480-500gi|5454675460-40100%Seq ID NO: 616003-49gi|3232421194-14093%Seq ID NO: 61643680-700gi|23260226280-260100%Seq ID NO: 616861-73gi|16797830749-821100%Seq ID NO: 616861-73gi|16797832747-819100%Seq ID NO: 616861-73gi|26000759745-817100%Seq ID NO: 616861-73gi|31442320746-818100%Seq ID NO: 616861-73gi|31442322746-81898%Seq ID NO: 616868-73gi|21885259776-84198%Seq ID NO: 6168897-117gi|5212872561-81100%Seq ID NO: 61719701-721gi|32185118164-144100%Seq ID NO: 6181013-290gi|3314013639-31785%Seq ID NO: 619375-25gi|19267937180-160100%Seq ID NO: 619551-89gi|33139270471-383100%Seq ID NO: 619801-78gi|33140673126-4992%Seq ID NO: 6198019-84gi|33139131185-25092%Seq ID NO: 61980250-309gi|33139639521-58091%Seq ID NO: 46268777-960gi|3314077055-23799%Seq ID NO: 46268639-695gi|331407701-5798%Seq ID NO: 62170207-227gi|46422740369-349100%Seq ID NO: 62176200-225gi|30028086358-383100%Seq ID NO: 62176198-218gi|16005270477-497100%Seq ID NO: 462795-190gi|35504456365-18096%Seq ID NO: 4627963-188gi|7143616133-25891%Seq ID NO: 4627963-188gi|18381740382-50791%Seq ID NO: 4627963-184gi|18381740131-25291%Seq ID NO: 4627976-188gi|18381923603-49192%Seq ID NO: 4627973-188gi|21493480425-31085%Seq ID NO: 624141-128gi|32323987182-309100%Seq ID NO: 62414173-246gi|32323987309-382100%Seq ID NO: 62414173-222gi|33140527311-36098%Seq ID NO: 62539200-225gi|5454769579-10496%Seq ID NO: 62576170-298gi|3313955685-21394%Seq ID NO: 6257642-126gi|331395561-8598%Seq ID NO: 46293154-303gi|18080373337-48691%Seq ID NO: 626111-45gi|33139645197-24197%Seq ID NO: 62636254-285gi|18382597630-66196%Seq ID NO: 46302262-359gi|18381117806-70992%Seq ID NO: 46302262-362gi|7144103148-24891%Seq ID NO: 46305314-411gi|54548221329-23288%Seq ID NO: 627763-93gi|33140348248-15889%Seq ID NO: 628991-140gi|32325217127-26796%Seq ID NO: 62904154-306gi|18082113218-37089%Seq ID NO: 62904158-306gi|54549200528-38087%Seq ID NO: 62904159-306gi|54545591457-31087%Seq ID NO: 62904179-306gi|7144024503-37787%Seq ID NO: 62986504-558gi|32324842158-21292%Seq ID NO: 630151-98gi|32324253114-1794%Seq ID NO: 63100554-708gi|3232445981-23595%Seq ID NO: 6310736-174gi|28916076140-192%Seq ID NO: 63107154-181gi|15947361-34100%Seq ID NO: 63107154-181gi|2454547200-227100%Seq ID NO: 63107154-181gi|551594221-194100%Seq ID NO: 63107154-181gi|551595696-669100%Seq ID NO: 63107154-181gi|1803225461-34100%Seq ID NO: 63107154-177gi|18477256256-279100%Seq ID NO: 63107154-177gi|18477260302-325100%Seq ID NO: 63107154-177gi|18477262611-634100%Seq ID NO: 63107160-181gi|184772591-22100%Seq ID NO: 63107160-181gi|184772611-22100%Seq ID NO: 63107160-181gi|377809681-22100%Seq ID NO: 6318913-47gi|7143962302-26994%Seq ID NO: 63384576-772gi|32325427297-49397%Seq ID NO: 63384217-381gi|3232542750-21498%Seq ID NO: 63384428-514gi|32325427213-299100%Seq ID NO: 63384110-159gi|323254271-5094%Seq ID NO: 63384629-772gi|180899171-14488%Seq ID NO: 6345015-54gi|289160761-4097%Seq ID NO: 634508-35gi|15947334-61100%Seq ID NO: 634508-35gi|2454547227-200100%Seq ID NO: 634508-35gi|551594194-221100%Seq ID NO: 634508-35gi|551595669-696100%Seq ID NO: 634508-35gi|1803225434-61100%Seq ID NO: 6345012-35gi|18477256279-256100%Seq ID NO: 6345012-35gi|18477260325-302100%Seq ID NO: 6345012-35gi|18477262634-611100%Seq ID NO: 634508-29gi|1847725922-1100%Seq ID NO: 634508-29gi|1847726122-1100%Seq ID NO: 634508-29gi|3778096822-1100%Seq ID NO: 6350816-41gi|3016602939-6496%Seq ID NO: 635294-111gi|33140348148-25593%Seq ID NO: 463443-176gi|5454836726-19985%Seq ID NO: 46344461-533gi|54548367299-37190%Seq ID NO: 636261-138gi|1809077620-15788%Seq ID NO: 46358286-423gi|7144085453-59191%Seq ID NO: 639051-73gi|16797830749-821100%Seq ID NO: 639051-73gi|16797832747-819100%Seq ID NO: 639051-73gi|26000759745-817100%Seq ID NO: 639051-73gi|31442320746-818100%Seq ID NO: 639051-73gi|31442322746-81898%Seq ID NO: 639058-73gi|21885259776-84198%Seq ID NO: 6393036-103gi|18080500393-46086%Seq ID NO: 6403021-138gi|289160761-11991%Seq ID NO: 6403014-41gi|15947334-61100%Seq ID NO: 6403014-41gi|2454547227-200100%Seq ID NO: 6403014-41gi|551594194-221100%Seq ID NO: 6403014-41gi|551595669-696100%Seq ID NO: 6403014-41gi|1803225434-61100%Seq ID NO: 6403018-41gi|18477256279-256100%Seq ID NO: 6403018-41gi|18477260325-302100%Seq ID NO: 6403018-41gi|18477262634-611100%Seq ID NO: 6403014-35gi|1847725922-1100%Seq ID NO: 6403014-35gi|1847726122-1100%Seq ID NO: 6403014-35gi|3778096822-1100%Seq ID NO: 64057318-354gi|3313973082-11997%Seq ID NO: 64082276-364gi|3232503420-10896%Seq ID NO: 6408274-140gi|32324889134-68100%Seq ID NO: 64082325-364gi|331395891-4097%Seq ID NO: 6408274-147gi|2139325927-10085%Seq ID NO: 6408274-147gi|2049890425-9885%Seq ID NO: 641062-35gi|200645841-3497%Seq ID NO: 64146339-442gi|33140237332-229100%Seq ID NO: 6414621-109gi|33140237419-33197%Seq ID NO: 64146452-518gi|33140237219-15398%Seq ID NO: 64146580-617gi|33140237151-114100%Seq ID NO: 64148457-691gi|32325211200-43498%Seq ID NO: 64148302-406gi|3232521196-20096%Seq ID NO: 64148145-221gi|3232521124-9996%Seq ID NO: 64148735-788gi|32325211434-487100%Seq ID NO: 64148897-948gi|32325211540-591100%Seq ID NO: 64148798-842gi|32325211497-541100%Seq ID NO: 6414875-101gi|323252111-27100%Seq ID NO: 64148897-1003gi|32324168477-58399%Seq ID NO: 64148186-221gi|323241681-3697%Seq ID NO: 64148457-684gi|33139272319-54698%Seq ID NO: 641481-101gi|3313927246-14699%Seq ID NO: 64148897-996gi|33139975485-58499%Seq ID NO: 64148178-221gi|331399751-4497%Seq ID NO: 6418121-138gi|289160761-11991%Seq ID NO: 6418114-41gi|15947334-61100%Seq ID NO: 6418114-41gi|2454547227-200100%Seq ID NO: 6418114-41gi|551594194-221100%Seq ID NO: 6418114-41gi|551595669-696100%Seq ID NO: 6418114-41gi|1803225434-61100%Seq ID NO: 6418118-41gi|18477256279-256100%Seq ID NO: 6418118-41gi|18477260325-302100%Seq ID NO: 6418118-41gi|18477262634-611100%Seq ID NO: 6418114-35gi|1847725922-1100%Seq ID NO: 6418114-35gi|1847726122-1100%Seq ID NO: 6418114-35gi|3778096822-1100%Seq ID NO: 64254169-189gi|18083118116-136100%Seq ID NO: 64254169-189gi|18382418749-769100%Seq ID NO: 64268399-518gi|54546846257-38282%Seq ID NO: 64305240-260gi|2446785041-21100%Seq ID NO: 64309148-249gi|18088868401-30090%Seq ID NO: 46394286-347gi|46987704361-30085%Seq ID NO: 64348295-324gi|7922575190-21896%Seq ID NO: 64377645-702gi|54544818520-57791%Seq ID NO: 6439083-126gi|5515941028-107093%Seq ID NO: 6439083-126gi|551595267-30993%Seq ID NO: 6447721-138gi|289160761-11991%Seq ID NO: 6447714-41gi|15947334-61100%Seq ID NO: 6447714-41gi|2454547227-200100%Seq ID NO: 6447714-41gi|551594194-221100%Seq ID NO: 6447714-41gi|551595669-696100%Seq ID NO: 6447714-41gi|1803225434-61100%Seq ID NO: 6447718-41gi|18477256279-256100%Seq ID NO: 6447718-41gi|18477260325-302100%Seq ID NO: 6447718-41gi|18477262634-611100%Seq ID NO: 6447714-35gi|1847725922-1100%Seq ID NO: 6447714-35gi|1847726122-1100%Seq ID NO: 6447714-35gi|3778096822-1100%Seq ID NO: 6450827-157gi|323246929-13997%Seq ID NO: 6451452-204gi|32324211369-52185%Seq ID NO: 6457021-138gi|289160761-11991%Seq ID NO: 6457014-41gi|15947334-61100%Seq ID NO: 6457014-41gi|2454547227-200100%Seq ID NO: 6457014-41gi|551594194-221100%Seq ID NO: 6457014-41gi|551595669-696100%Seq ID NO: 6457014-41gi|1803225434-61100%Seq ID NO: 6457018-41gi|18477256279-256100%Seq ID NO: 6457018-41gi|18477260325-302100%Seq ID NO: 6457018-41gi|18477262634-611100%Seq ID NO: 6457014-35gi|1847725922-1100%Seq ID NO: 6457014-35gi|1847726122-1100%Seq ID NO: 6457014-35gi|3778096822-1100%Seq ID NO: 6458033-147gi|323251911-11592%Seq ID NO: 646332-114gi|331405301-113100%Seq ID NO: 64669300-320gi|52129537617-637100%Seq ID NO: 646921-57gi|3002894155-11198%Seq ID NO: 646921-57gi|3314020455-11198%Seq ID NO: 6469495-147gi|30028941402-454100%Seq ID NO: 646941-43gi|30028941308-350100%Seq ID NO: 6469495-146gi|30029072395-446100%Seq ID NO: 6469495-146gi|33140204402-453100%Seq ID NO: 646941-43gi|33140204308-350100%Seq ID NO: 647111-93gi|33139524233-14198%Seq ID NO: 6476019-144gi|54546846257-38384%Seq ID NO: 64762174-194gi|9829276404-424100%Seq ID NO: 64762174-194gi|39747004520-540100%Seq ID NO: 647801-118gi|3314054849-16695%Seq ID NO: 64780151-225gi|33140548199-273100%Seq ID NO: 6479733-77gi|5454964011-5593%Seq ID NO: 6484742-190gi|33140348108-25592%Seq ID NO: 648471-70gi|33140348105-17492%Seq ID NO: 648471-28gi|3232449817-4496%Seq ID NO: 648627-144gi|7144222169-30692%Seq ID NO: 648627-144gi|54547280139-27692%Seq ID NO: 6486232-149gi|51334182168-28588%Seq ID NO: 6486227-146gi|21494008204-32386%Seq ID NO: 64874205-293gi|18082074129-21783%Seq ID NO: 6493121-41gi|46984838273-293100%Seq ID NO: 649351-84gi|16797830749-83298%Seq ID NO: 649351-84gi|16797832747-83098%Seq ID NO: 649351-84gi|26000759745-82898%Seq ID NO: 649351-84gi|31442320746-82998%Seq ID NO: 649351-84gi|31442322746-82997%Seq ID NO: 649351-77gi|16797831747-82398%Seq ID NO: 649358-84gi|21885259776-85297%Seq ID NO: 6507815-54gi|289160761-4097%Seq ID NO: 650841-166gi|35504916448-61294%Seq ID NO: 650841-85gi|35504725448-53394%Seq ID NO: 65205338-610gi|33140653131-403100%Seq ID NO: 65205160-291gi|331406531-13297%Seq ID NO: 65205663-717gi|33140653401-455100%Seq ID NO: 65205725-746gi|33140653463-484100%Seq ID NO: 65205338-462gi|32325169130-254100%Seq ID NO: 65205161-291gi|323251691-13197%Seq ID NO: 65205500-610gi|32325169254-364100%Seq ID NO: 65205162-291gi|331399641-13098%Seq ID NO: 652611-243gi|331408069-25595%Seq ID NO: 6526162-234gi|3313970424-19995%Seq ID NO: 46445556-665gi|331394431-11195%Seq ID NO: 653281-145gi|1809048197-24185%Seq ID NO: 464521-163gi|714371057-22089%Seq ID NO: 464521-171gi|5454987042-21389%Seq ID NO: 4645216-163gi|180896015-15390%Seq ID NO: 4645229-163gi|545479291-13690%Seq ID NO: 464521-171gi|5123763023-19485%Seq ID NO: 65377274-524gi|33140174121-37198%Seq ID NO: 65377109-229gi|331401741-12199%Seq ID NO: 65377578-680gi|33140174372-474100%Seq ID NO: 65377274-527gi|32324003302-55598%Seq ID NO: 653771-229gi|3232400374-30297%Seq ID NO: 65377579-621gi|32324003554-59697%Seq ID NO: 6537742-229gi|3232493313-20097%Seq ID NO: 65377579-680gi|32324933452-55399%Seq ID NO: 6539790-152gi|18081066414-47690%Seq ID NO: 654742-53gi|2891607652-194%Seq ID NO: 6547433-59gi|15947361-35100%Seq ID NO: 6547433-59gi|2454547200-226100%Seq ID NO: 6547433-59gi|551594221-195100%Seq ID NO: 6547433-59gi|551595696-670100%Seq ID NO: 6547433-59gi|1803225461-35100%Seq ID NO: 6547433-56gi|18477256256-279100%Seq ID NO: 6547433-56gi|18477260302-325100%Seq ID NO: 6547433-56gi|18477262611-634100%Seq ID NO: 6547439-59gi|184772591-21100%Seq ID NO: 6547439-59gi|184772611-21100%Seq ID NO: 6547439-59gi|377809681-21100%Seq ID NO: 655531-63gi|33140188109-17198%Seq ID NO: 655532-50gi|33140188305-35390%Seq ID NO: 6555814-173gi|18383267801-96086%Seq ID NO: 65560480-623gi|35504861212-7097%Seq ID NO: 65572411-582gi|33139585153-32494%Seq ID NO: 65572135-258gi|331395851-12496%Seq ID NO: 65591217-309gi|3232490925-11793%Seq ID NO: 46459203-386gi|18081711111-29486%Seq ID NO: 464591-35gi|5454908864-9894%Seq ID NO: 4645914-35gi|545493191-22100%Seq ID NO: 656431-194gi|33140010238-43196%Seq ID NO: 65643121-194gi|331393141-7497%Seq ID NO: 46465159-238gi|9829340394-31587%Seq ID NO: 65738166-205gi|331408131-4095%Seq ID NO: 6575583-126gi|5515941028-107093%Seq ID NO: 6575583-126gi|551595267-30993%Seq ID NO: 658131-121gi|28916076122-191%Seq ID NO: 65813101-126gi|15947361-36100%Seq ID NO: 65813101-126gi|2454547200-225100%Seq ID NO: 65813101-126gi|551594221-196100%Seq ID NO: 65813101-126gi|551595696-671100%Seq ID NO: 65813101-126gi|1803225461-36100%Seq ID NO: 65813101-124gi|18477256256-279100%Seq ID NO: 65813101-124gi|18477260302-325100%Seq ID NO: 65813101-124gi|18477262611-634100%Seq ID NO: 6583180-203gi|33140348128-25292%Seq ID NO: 6593277-335gi|33139639245-50387%Seq ID NO: 6593270-225gi|33140673199-4488%Seq ID NO: 65932245-349gi|32324211165-26986%Seq ID NO: 65932293-327gi|33139131293-32794%Seq ID NO: 659661-215gi|33140309306-52097%Seq ID NO: 660291-143gi|1679783062-20498%Seq ID NO: 660291-143gi|1679783162-20297%Seq ID NO: 660291-143gi|1679783263-20397%Seq ID NO: 660291-143gi|2600075962-20296%Seq ID NO: 660291-142gi|2600076262-20195%Seq ID NO: 660291-143gi|3144232262-20295%Seq ID NO: 660291-143gi|3809613362-20293%Seq ID NO: 46479143-293gi|1809018596-24690%Seq ID NO: 46479412-474gi|18090185320-38292%Seq ID NO: 6611583-126gi|5515941028-107093%Seq ID NO: 6611583-126gi|551595267-30993%Seq ID NO: 6613821-168gi|54546322121-26887%Seq ID NO: 66223311-331gi|32184515225-245100%Seq ID NO: 66245118-139gi|54544778203-224100%Seq ID NO: 46493113-266gi|33139869326-47692%Seq ID NO: 46493115-269gi|18080529142-29390%Seq ID NO: 46493113-240gi|32325143467-59191%Seq ID NO: 46493113-229gi|33139507467-58092%Seq ID NO: 46493113-236gi|33139658467-58791%Seq ID NO: 46493113-217gi|33139078467-56891%Seq ID NO: 46493171-286gi|7143888280-39289%Seq ID NO: 46493113-216gi|33140145467-56791%Seq ID NO: 46493186-269gi|18080754174-25792%Seq ID NO: 46493144-269gi|52129866585-70886%Seq ID NO: 46493192-286gi|180808441-9589%Seq ID NO: 46493144-254gi|52129769700-80887%Seq ID NO: 46493113-190gi|32324728467-54492%Seq ID NO: 46493113-204gi|33140539467-55590%Seq ID NO: 46493118-239gi|19264211454-57282%Seq ID NO: 46493211-245gi|22140574144-17897%Seq ID NO: 46493220-254gi|521292623-3794%Seq ID NO: 46493113-182gi|15784199440-50984%Seq ID NO: 46493113-148gi|15785306458-49291%Seq ID NO: 663271-81gi|35504440192-27295%Seq ID NO: 66372317-509gi|54546686284-47685%Seq ID NO: 66372454-509gi|331399161-56100%Seq ID NO: 6645423-43gi|21285223168-148100%Seq ID NO: 6651328-80gi|54545135100-15296%Seq ID NO: 66513154-198gi|54549414410-45493%Seq ID NO: 6651338-80gi|545489681-4395%Seq ID NO: 666111-107gi|33140780433-53998%Seq ID NO: 465221-101gi|1838256665-16589%Seq ID NO: 465226-101gi|180898151-9688%Seq ID NO: 4652264-98gi|2149353440-7491%Seq ID NO: 6664580-246gi|323242111-16786%Seq ID NO: 6664569-219gi|33140673199-4983%Seq ID NO: 6664587-219gi|33139639256-38883%Seq ID NO: 666911-141gi|1679783062-20298%Seq ID NO: 666911-141gi|1679783162-20097%Seq ID NO: 666911-141gi|1679783263-20197%Seq ID NO: 666911-141gi|2600075962-20096%Seq ID NO: 666911-141gi|3144232262-20095%Seq ID NO: 666911-141gi|3809613362-20093%Seq ID NO: 666921-160gi|5107411189-30100%Seq ID NO: 666921-160gi|2149587189-3095%Seq ID NO: 6669247-160gi|341058131719-1606100%Seq ID NO: 666921-160gi|2149585189-3092%Seq ID NO: 666921-160gi|2738785190-3092%Seq ID NO: 666921-160gi|2738792190-3092%Seq ID NO: 666921-160gi|2738799190-3092%Seq ID NO: 666921-160gi|2738800190-3092%Seq ID NO: 666922-160gi|27077481766-160790%Seq ID NO: 666921-80gi|48479719191-11298%Seq ID NO: 666921-80gi|37674501194-11598%Seq ID NO: 66692115-160gi|3767450175-3095%Seq ID NO: 666921-78gi|11477292525-244898%Seq ID NO: 66692115-160gi|11477292413-236897%Seq ID NO: 666921-78gi|223202187-1098%Seq ID NO: 66718182-203gi|1576653177-56100%Seq ID NO: 66870256-452gi|18381486666-46984%Seq ID NO: 66870407-452gi|18381486257-21295%Seq ID NO: 670721-90gi|33139333250-16196%Seq ID NO: 4655313-97gi|54546152237-32194%Seq ID NO: 6719923-130gi|18089328199-9287%Seq ID NO: 46562169-329gi|1808114123-18390%Seq ID NO: 67282129-150gi|4556361632-53100%Seq ID NO: 6729519-157gi|54546846257-39685%Seq ID NO: 465719-99gi|1808248210-10092%Seq ID NO: 674141-211gi|355052852-21299%Seq ID NO: 67414251-397gi|35505285211-35799%Seq ID NO: 67414446-535gi|35505285357-44687%Seq ID NO: 67414152-211gi|7144110176-23591%Seq ID NO: 6746521-138gi|289160761-11991%Seq ID NO: 6746514-41gi|15947334-61100%Seq ID NO: 6746514-41gi|2454547227-200100%Seq ID NO: 6746514-41gi|551594194-221100%Seq ID NO: 6746514-41gi|551595669-696100%Seq ID NO: 6746514-41gi|1803225434-61100%Seq ID NO: 6746518-41gi|18477256279-256100%Seq ID NO: 6746518-41gi|18477260325-302100%Seq ID NO: 6746518-41gi|18477262634-611100%Seq ID NO: 6746514-35gi|1847725922-1100%Seq ID NO: 6746514-35gi|1847726122-1100%Seq ID NO: 6746514-35gi|3778096822-1100%Seq ID NO: 6747736-173gi|28916076140-190%Seq ID NO: 67477157-180gi|15947357-34100%Seq ID NO: 67477157-180gi|2454547204-227100%Seq ID NO: 67477157-180gi|551594217-194100%Seq ID NO: 67477157-180gi|551595692-669100%Seq ID NO: 67477157-180gi|1803225457-34100%Seq ID NO: 67477159-180gi|184772591-22100%Seq ID NO: 67477159-180gi|184772611-22100%Seq ID NO: 67477159-180gi|377809681-22100%Seq ID NO: 676821-101gi|714415739-13997%Seq ID NO: 676821-101gi|5454968826-12696%Seq ID NO: 6768222-101gi|18083086614-53596%Seq ID NO: 6768225-101gi|180904031-7796%Seq ID NO: 6768254-75gi|52129301340-319100%Seq ID NO: 6772580-268gi|331396351-18896%Seq ID NO: 67725324-389gi|33139635188-25398%Seq ID NO: 6772581-268gi|323253231-18893%Seq ID NO: 677625-150gi|1808174387-23283%Seq ID NO: 6785666-470gi|3314013692-49690%Seq ID NO: 679531-45gi|33139122150-19497%Seq ID NO: 67954469-489gi|19384388444-464100%Seq ID NO: 6795688-241gi|323242111-15489%Seq ID NO: 6795682-233gi|3313913199-25086%Seq ID NO: 67956312-349gi|33139131293-33094%Seq ID NO: 6795677-227gi|33140673199-4986%Seq ID NO: 6795695-227gi|33139639256-38885%Seq ID NO: 67994273-323gi|3313944512-6296%Seq ID NO: 679961-157gi|32325279266-42298%Seq ID NO: 6807321-187gi|76197381078-124488%Seq ID NO: 68073119-200gi|331393621-82100%Seq ID NO: 6807312-164gi|1235973712-86488%Seq ID NO: 6807312-164gi|54547532371-52387%Seq ID NO: 6807382-164gi|54549430381-46391%Seq ID NO: 6807392-114gi|6382623174-152100%Seq ID NO: 6807392-114gi|23259469284-306100%Seq ID NO: 6831736-174gi|28916076140-190%Seq ID NO: 68317161-181gi|551595689-669100%Seq ID NO: 68317161-181gi|3016862152-32100%Seq ID NO: 466081-166gi|18081390329-16487%Seq ID NO: 466081-166gi|54545329221-38686%Seq ID NO: 4660815-116gi|21056382128-22985%Seq ID NO: 683514-128gi|18083059281-40585%Seq ID NO: 683704-68gi|33140590143-7998%Seq ID NO: 6842315-54gi|289160761-4097%Seq ID NO: 684238-35gi|15947334-61100%Seq ID NO: 684238-35gi|2454547227-200100%Seq ID NO: 684238-35gi|551594194-221100%Seq ID NO: 684238-35gi|551595669-696100%Seq ID NO: 684238-35gi|1803225434-61100%Seq ID NO: 6842312-35gi|18477256279-256100%Seq ID NO: 6842312-35gi|18477260325-302100%Seq ID NO: 6842312-35gi|18477262634-611100%Seq ID NO: 684238-29gi|1847725922-1100%Seq ID NO: 684238-29gi|1847726122-1100%Seq ID NO: 684238-29gi|3778096822-1100%Seq ID NO: 6842922-42gi|28916076280-260100%Seq ID NO: 68464546-566gi|46985670328-348100%Seq ID NO: 685141-59gi|3232467660-11896%Seq ID NO: 68541305-325gi|19103835404-424100%Seq ID NO: 6861521-138gi|289160761-11991%Seq ID NO: 6861514-41gi|15947334-61100%Seq ID NO: 6861514-41gi|2454547227-200100%Seq ID NO: 6861514-41gi|551594194-221100%Seq ID NO: 6861514-41gi|551595669-696100%Seq ID NO: 6861514-41gi|1803225434-61100%Seq ID NO: 6861518-41gi|18477256279-256100%Seq ID NO: 6861518-41gi|18477260325-302100%Seq ID NO: 6861518-41gi|18477262634-611100%Seq ID NO: 6861514-35gi|1847725922-1100%Seq ID NO: 6861514-35gi|1847726122-1100%Seq ID NO: 6861514-35gi|3778096822-1100%Seq ID NO: 68654317-337gi|30165848173-193100%Seq ID NO: 687611-197gi|3313940383-28089%Seq ID NO: 68761515-623gi|33139403421-52885%Seq ID NO: 6876145-157gi|54546450388-50085%Seq ID NO: 68783328-492gi|3314047683-24799%Seq ID NO: 68783129-212gi|331404761-84100%Seq ID NO: 68783653-732gi|33140476245-32498%Seq ID NO: 6883321-138gi|289160761-11991%Seq ID NO: 6883314-41gi|15947334-61100%Seq ID NO: 6883314-41gi|2454547227-200100%Seq ID NO: 6883314-41gi|551594194-221100%Seq ID NO: 6883314-41gi|551595669-696100%Seq ID NO: 6883314-41gi|1803225434-61100%Seq ID NO: 6883318-41gi|18477256279-256100%Seq ID NO: 6883318-41gi|18477260325-302100%Seq ID NO: 6883318-41gi|18477262634-611100%Seq ID NO: 6883314-35gi|1847725922-1100%Seq ID NO: 6883314-35gi|1847726122-1100%Seq ID NO: 6883314-35gi|3778096822-1100%Seq ID NO: 46630218-408gi|331399571-19398%Seq ID NO: 46630486-631gi|33139957192-33798%Seq ID NO: 46630659-791gi|33139957325-45798%Seq ID NO: 46630838-944gi|33139957452-55896%Seq ID NO: 46630218-406gi|331393611-19197%Seq ID NO: 46630487-631gi|33139361193-33798%Seq ID NO: 46630838-887gi|33139361452-50196%Seq ID NO: 69016185-264gi|18383042974-89585%Seq ID NO: 69016230-264gi|183830541411-144597%Seq ID NO: 6904616-120gi|28916076252-35288%Seq ID NO: 6908915-54gi|289160761-4097%Seq ID NO: 690898-35gi|15947334-61100%Seq ID NO: 690898-35gi|2454547227-200100%Seq ID NO: 690898-35gi|551594194-221100%Seq ID NO: 690898-35gi|551595669-696100%Seq ID NO: 690898-35gi|1803225434-61100%Seq ID NO: 6908912-35gi|18477256279-256100%Seq ID NO: 6908912-35gi|18477260325-302100%Seq ID NO: 6908912-35gi|18477262634-611100%Seq ID NO: 690898-29gi|1847725922-1100%Seq ID NO: 690898-29gi|1847726122-1100%Seq ID NO: 690898-29gi|3778096822-1100%Seq ID NO: 6917015-54gi|289160761-4097%Seq ID NO: 691708-35gi|15947334-61100%Seq ID NO: 691708-35gi|2454547227-200100%Seq ID NO: 691708-35gi|551594194-221100%Seq ID NO: 691708-35gi|551595669-696100%Seq ID NO: 691708-35gi|1803225434-61100%Seq ID NO: 6917012-35gi|18477256279-256100%Seq ID NO: 6917012-35gi|18477260325-302100%Seq ID NO: 6917012-35gi|18477262634-611100%Seq ID NO: 691708-29gi|1847725922-1100%Seq ID NO: 691708-29gi|1847726122-1100%Seq ID NO: 691708-29gi|3778096822-1100%Seq ID NO: 6931436-174gi|28916076140-192%Seq ID NO: 69314154-181gi|15947361-34100%Seq ID NO: 69314154-181gi|2454547200-227100%Seq ID NO: 69314154-181gi|551594221-194100%Seq ID NO: 69314154-181gi|551595696-669100%Seq ID NO: 69314154-181gi|1803225461-34100%Seq ID NO: 69314154-177gi|18477256256-279100%Seq ID NO: 69314154-177gi|18477260302-325100%Seq ID NO: 69314154-177gi|18477262611-634100%Seq ID NO: 69314160-181gi|184772591-22100%Seq ID NO: 69314160-181gi|184772611-22100%Seq ID NO: 69314160-181gi|377809681-22100%Seq ID NO: 693241-369gi|51093880722-35284%Seq ID NO: 693241-369gi|51093884722-35283%Seq ID NO: 693241-252gi|51093883723-46985%Seq ID NO: 69324313-369gi|51093883407-35291%Seq ID NO: 69324134-369gi|51093882587-35284%Seq ID NO: 69324134-369gi|54545658621-38783%Seq ID NO: 693271-128gi|3313987474-20190%Seq ID NO: 693511-124gi|3314079230-15398%Seq ID NO: 6935143-124gi|456436421-8298%Seq ID NO: 6935158-124gi|456436461-6792%Seq ID NO: 466513-78gi|18081363236-31096%Seq ID NO: 466513-76gi|54544882242-31595%Seq ID NO: 694101-141gi|1679783062-20298%Seq ID NO: 694101-141gi|1679783162-20097%Seq ID NO: 694101-141gi|1679783263-20197%Seq ID NO: 694101-141gi|2600075962-20096%Seq ID NO: 694101-141gi|3144232262-20095%Seq ID NO: 694101-141gi|3809613362-20093%Seq ID NO: 69440221-245gi|4698719349-2596%Seq ID NO: 694481-53gi|1808105788-14094%Seq ID NO: 6944942-186gi|3550494649-19397%Seq ID NO: 694631-156gi|5454864339-18887%Seq ID NO: 6946321-156gi|180894809-13886%Seq ID NO: 694891-118gi|16797830147-26499%Seq ID NO: 694891-118gi|26000761147-26295%Seq ID NO: 694891-118gi|16797832148-26394%Seq ID NO: 694891-118gi|47118286171-28694%Seq ID NO: 694891-118gi|31442322147-26294%Seq ID NO: 694891-118gi|21885259169-28491%Seq ID NO: 69494323-427gi|33139639245-34990%Seq ID NO: 69494316-441gi|33140673199-7486%Seq ID NO: 69501113-190gi|54549674422-50094%Seq ID NO: 6950113-53gi|54549674321-36195%Seq ID NO: 6957016-153gi|714349858-19685%Seq ID NO: 695921-136gi|16797830686-821100%Seq ID NO: 695921-136gi|16797832684-819100%Seq ID NO: 695921-136gi|31442320683-818100%Seq ID NO: 695921-136gi|26000759682-81799%Seq ID NO: 695921-136gi|31442322683-81899%Seq ID NO: 695921-136gi|38096133685-82097%Seq ID NO: 695921-120gi|16797846702-82490%Seq ID NO: 695921-120gi|16797847692-81490%Seq ID NO: 695921-120gi|14600264704-82690%Seq ID NO: 695921-80gi|16797841691-76997%Seq ID NO: 695921-80gi|16797843725-80397%Seq ID NO: 695921-114gi|16797848691-80790%Seq ID NO: 695921-80gi|16797849692-77097%Seq ID NO: 695931-68gi|5107411156-89100%Seq ID NO: 6959314-68gi|341058131719-1665100%Seq ID NO: 695931-51gi|2149587156-106100%Seq ID NO: 695931-48gi|2738785157-110100%Seq ID NO: 695931-48gi|2738792157-110100%Seq ID NO: 695931-48gi|2738799157-110100%Seq ID NO: 695931-48gi|2738800157-110100%Seq ID NO: 695931-54gi|2149585156-10396%Seq ID NO: 695931-47gi|310742781741-169597%Seq ID NO: 695931-47gi|310742791741-169597%Seq ID NO: 695931-47gi|48479719158-11297%Seq ID NO: 695931-47gi|37674501161-11597%Seq ID NO: 695931-45gi|308441791735-169197%Seq ID NO: 697161-121gi|28916076122-191%Seq ID NO: 69716101-128gi|15947361-34100%Seq ID NO: 69716101-128gi|2454547200-227100%Seq ID NO: 69716101-128gi|551594221-194100%Seq ID NO: 69716101-128gi|551595696-669100%Seq ID NO: 69716101-128gi|1803225461-34100%Seq ID NO: 69716101-124gi|18477256256-279100%Seq ID NO: 69716101-124gi|18477260302-325100%Seq ID NO: 69716101-124gi|18477262611-634100%Seq ID NO: 69716107-128gi|184772591-22100%Seq ID NO: 69716107-128gi|184772611-22100%Seq ID NO: 69716107-128gi|377809681-22100%Seq ID NO: 697476-258gi|51093884252-791%Seq ID NO: 697476-258gi|18080808295-5089%Seq ID NO: 69747207-258gi|180322522936-288592%Seq ID NO: 69747219-252gi|3004983762-2997%Seq ID NO: 69747219-252gi|20064115400-36794%Seq ID NO: 69747219-251gi|1926751033-194%Seq ID NO: 69747219-241gi|1926836627-5100%Seq ID NO: 69795145-251gi|52129366202-30885%Seq ID NO: 6979550-74gi|52129366161-185100%Seq ID NO: 698211-101gi|32324237442-54198%Seq ID NO: 46679448-820gi|331396441-37393%Seq ID NO: 466791053-1213gi|33139644387-54695%Seq ID NO: 6982921-138gi|289160761-11991%Seq ID NO: 6982914-41gi|15947334-61100%Seq ID NO: 6982914-41gi|2454547227-200100%Seq ID NO: 6982914-41gi|551594194-221100%Seq ID NO: 6982914-41gi|551595669-696100%Seq ID NO: 6982914-41gi|1803225434-61100%Seq ID NO: 6982918-41gi|18477256279-256100%Seq ID NO: 6982918-41gi|18477260325-302100%Seq ID NO: 6982918-41gi|18477262634-611100%Seq ID NO: 6982914-35gi|1847725922-1100%Seq ID NO: 6982914-35gi|1847726122-1100%Seq ID NO: 6982914-35gi|3778096822-1100%Seq ID NO: 698381-98gi|183826031075-97889%Seq ID NO: 6983825-98gi|18382603520-44791%Seq ID NO: 6989321-138gi|289160761-11991%Seq ID NO: 6989314-41gi|15947334-61100%Seq ID NO: 6989314-41gi|2454547227-200100%Seq ID NO: 6989314-41gi|551594194-221100%Seq ID NO: 6989314-41gi|551595669-696100%Seq ID NO: 6989314-41gi|1803225434-61100%Seq ID NO: 6989318-41gi|18477256279-256100%Seq ID NO: 6989318-41gi|18477260325-302100%Seq ID NO: 6989318-41gi|18477262634-611100%Seq ID NO: 6989314-35gi|1847725922-1100%Seq ID NO: 6989314-35gi|1847726122-1100%Seq ID NO: 6989314-35gi|3778096822-1100%Seq ID NO: 699301-65gi|142805727058-7122100%Seq ID NO: 699721-247gi|47118286742-98897%Seq ID NO: 699721-247gi|47118285729-97596%Seq ID NO: 699721-230gi|16797830720-95096%Seq ID NO: 699721-230gi|21885260740-96996%Seq ID NO: 699721-230gi|31442322717-94696%Seq ID NO: 699721-230gi|26000759716-94596%Seq ID NO: 699721-230gi|31442320717-94696%Seq ID NO: 699721-230gi|38096133719-94894%Seq ID NO: 699721-44gi|16797827765-80795%Seq ID NO: 69972142-170gi|16797844862-89096%Seq ID NO: 69972227-247gi|484797191112-1132100%Seq ID NO: 699781-157gi|33139812208-36498%Seq ID NO: 699981-79gi|28916077109-3187%Seq ID NO: 70006213-380gi|331392041-16898%Seq ID NO: 7000696-177gi|33139204462-54297%Seq ID NO: 7000696-214gi|33139268461-57998%Seq ID NO: 700721-107gi|180879234-11087%Seq ID NO: 7007234-68gi|5123769640-7491%Seq ID NO: 7009621-138gi|289160761-11991%Seq ID NO: 7009614-41gi|15947334-61100%Seq ID NO: 7009614-41gi|2454547227-200100%Seq ID NO: 7009614-41gi|551594194-221100%Seq ID NO: 7009614-41gi|551595669-696100%Seq ID NO: 7009614-41gi|1803225434-61100%Seq ID NO: 7009618-41gi|18477256279-256100%Seq ID NO: 7009618-41gi|18477260325-302100%Seq ID NO: 7009618-41gi|18477262634-611100%Seq ID NO: 7009614-35gi|1847725922-1100%Seq ID NO: 7009614-35gi|1847726122-1100%Seq ID NO: 7009614-35gi|3778096822-1100%Seq ID NO: 70127759-900gi|33140064109-25098%Seq ID NO: 7012720-127gi|331400641-10898%Seq ID NO: 70140353-392gi|32324842178-21797%Seq ID NO: 7016361-147gi|323246571-8995%Seq ID NO: 7016359-153gi|323250041-9592%Seq ID NO: 7016361-153gi|323243531-9392%Seq ID NO: 7016359-147gi|331403821-8993%Seq ID NO: 7025317-151gi|33139131116-25088%Seq ID NO: 702536-145gi|323242111-14088%Seq ID NO: 70253464-572gi|32324211394-50287%Seq ID NO: 7025313-231gi|33139639256-47181%Seq ID NO: 702531-145gi|33140673193-4983%Seq ID NO: 7034415-54gi|289160761-4097%Seq ID NO: 703448-35gi|15947334-61100%Seq ID NO: 703448-35gi|2454547227-200100%Seq ID NO: 703448-35gi|551594194-221100%Seq ID NO: 703448-35gi|551595669-696100%Seq ID NO: 703448-35gi|1803225434-61100%Seq ID NO: 7034412-35gi|18477256279-256100%Seq ID NO: 7034412-35gi|18477260325-302100%Seq ID NO: 7034412-35gi|18477262634-611100%Seq ID NO: 703448-29gi|1847725922-1100%Seq ID NO: 703448-29gi|1847726122-1100%Seq ID NO: 703448-29gi|3778096822-1100%Seq ID NO: 703571-28gi|3313948828-1100%Seq ID NO: 70376160-259gi|18090001348-44789%Seq ID NO: 46707552-763gi|1808968157-26885%Seq ID NO: 46707463-495gi|1808968121-5396%Seq ID NO: 7045714-41gi|15947334-61100%Seq ID NO: 7045714-41gi|2454547227-200100%Seq ID NO: 7045714-41gi|551594194-221100%Seq ID NO: 7045714-41gi|551595669-696100%Seq ID NO: 7045714-41gi|1803225434-61100%Seq ID NO: 7045718-41gi|18477256279-256100%Seq ID NO: 7045718-41gi|18477260325-302100%Seq ID NO: 7045718-41gi|18477262634-611100%Seq ID NO: 7045714-35gi|1847725922-1100%Seq ID NO: 7045714-35gi|1847726122-1100%Seq ID NO: 7045714-35gi|3778096822-1100%Seq ID NO: 70484210-232gi|18382755877-899100%Seq ID NO: 70513141-344gi|3313974277-28098%Seq ID NO: 70513515-654gi|33139742368-50795%Seq ID NO: 70513384-471gi|33139742280-36795%Seq ID NO: 70513701-771gi|33139742510-57997%Seq ID NO: 7051325-97gi|331397425-7794%Seq ID NO: 7057536-174gi|28916076140-191%Seq ID NO: 70575156-181gi|15947359-34100%Seq ID NO: 70575156-181gi|2454547202-227100%Seq ID NO: 70575156-181gi|551594219-194100%Seq ID NO: 70575156-181gi|551595694-669100%Seq ID NO: 70575156-181gi|1803225459-34100%Seq ID NO: 70575160-181gi|184772591-22100%Seq ID NO: 70575156-177gi|18477257260-281100%Seq ID NO: 70575156-177gi|18477260304-325100%Seq ID NO: 70575160-181gi|184772611-22100%Seq ID NO: 70575156-177gi|18477262613-634100%Seq ID NO: 70575160-181gi|377809681-22100%Seq ID NO: 705811-143gi|1679783062-20498%Seq ID NO: 705811-143gi|1679783162-20297%Seq ID NO: 705811-143gi|1679783263-20397%Seq ID NO: 705811-143gi|2600075962-20296%Seq ID NO: 705811-142gi|2600076262-20195%Seq ID NO: 705811-143gi|3144232262-20295%Seq ID NO: 705811-143gi|3809613362-20293%Seq ID NO: 70595116-138gi|27925886275-297100%Seq ID NO: 7073624-50gi|3232495970-44100%Seq ID NO: 70765327-392gi|18082500283-34890%Seq ID NO: 707654-65gi|180825009-6987%Seq ID NO: 708081-78gi|33140673126-4991%Seq ID NO: 708081-110gi|33139639311-42084%Seq ID NO: 7080819-78gi|33139131185-24486%Seq ID NO: 708151-122gi|28916076122-185%Seq ID NO: 70815106-129gi|15947357-34100%Seq ID NO: 70815106-129gi|2454547204-227100%Seq ID NO: 70815106-129gi|551594217-194100%Seq ID NO: 70815106-129gi|551595692-669100%Seq ID NO: 70815106-129gi|1803225457-34100%Seq ID NO: 70815108-129gi|184772591-22100%Seq ID NO: 70815108-129gi|184772611-22100%Seq ID NO: 70815108-129gi|377809681-22100%Seq ID NO: 7083321-138gi|289160761-11991%Seq ID NO: 7083314-41gi|15947334-61100%Seq ID NO: 7083314-41gi|2454547227-200100%Seq ID NO: 7083314-41gi|551594194-221100%Seq ID NO: 7083314-41gi|551595669-696100%Seq ID NO: 7083314-41gi|1803225434-61100%Seq ID NO: 7083318-41gi|18477256279-256100%Seq ID NO: 7083318-41gi|18477260325-302100%Seq ID NO: 7083318-41gi|18477262634-611100%Seq ID NO: 7083314-35gi|1847725922-1100%Seq ID NO: 7083314-35gi|1847726122-1100%Seq ID NO: 7083314-35gi|3778096822-1100%Seq ID NO: 708381-119gi|3016934449-16799%Seq ID NO: 709681-78gi|33140331512-59290%Seq ID NO: 7097221-138gi|289160761-11991%Seq ID NO: 7097214-41gi|15947334-61100%Seq ID NO: 7097214-41gi|2454547227-200100%Seq ID NO: 7097214-41gi|551594194-221100%Seq ID NO: 7097214-41gi|551595669-696100%Seq ID NO: 7097214-41gi|1803225434-61100%Seq ID NO: 7097218-41gi|18477256279-256100%Seq ID NO: 7097218-41gi|18477260325-302100%Seq ID NO: 7097218-41gi|18477262634-611100%Seq ID NO: 7097214-35gi|1847725922-1100%Seq ID NO: 7097214-35gi|1847726122-1100%Seq ID NO: 7097214-35gi|3778096822-1100%Seq ID NO: 709871-88gi|32324011417-50594%Seq ID NO: 70987137-175gi|32324011506-544100%Seq ID NO: 4673261-204gi|1808113658-20182%Seq ID NO: 71126363-383gi|19103835404-424100%Seq ID NO: 711301-173gi|33140136324-49688%Seq ID NO: 711311-173gi|33140136324-49688%Seq ID NO: 712272-327gi|16797830368-4398%Seq ID NO: 712272-327gi|47118285378-5596%Seq ID NO: 712272-327gi|47118286390-6796%Seq ID NO: 712272-326gi|31442320365-4496%Seq ID NO: 712272-326gi|16797832367-4595%Seq ID NO: 712272-327gi|21885260388-6795%Seq ID NO: 712272-327gi|31442322366-4395%Seq ID NO: 712272-327gi|38096133366-4392%Seq ID NO: 7122718-165gi|31442314354-20786%Seq ID NO: 7122718-127gi|38096139352-24488%Seq ID NO: 7122718-116gi|31442318356-25989%Seq ID NO: 7122718-80gi|16797825322-26095%Seq ID NO: 7122718-80gi|31442305344-28295%Seq ID NO: 7122718-77gi|16797844348-28995%Seq ID NO: 7122720-100gi|28627583300-22089%Seq ID NO: 7122718-127gi|31442313354-24685%Seq ID NO: 712491-138gi|33140698375-51297%Seq ID NO: 71249147-190gi|33140698521-56497%Seq ID NO: 712881-134gi|16797830935-80297%Seq ID NO: 712881-134gi|26000759931-79897%Seq ID NO: 712881-134gi|31442322932-79997%Seq ID NO: 712881-134gi|16797832933-80096%Seq ID NO: 712881-134gi|47118286957-82496%Seq ID NO: 712881-122gi|26000763930-80997%Seq ID NO: 712881-133gi|16797831933-80195%Seq ID NO: 712881-134gi|38096133934-80194%Seq ID NO: 7128851-75gi|16797844886-862100%Seq ID NO: 7128851-72gi|16797845888-867100%Seq ID NO: 46750397-420gi|5212755372-95100%Seq ID NO: 4675012-32gi|403087324-344100%Seq ID NO: 4675012-32gi|37517247324-344100%Seq ID NO: 4675012-32gi|545485539-29100%Seq ID NO: 71318153-177gi|54547857370-34596%Seq ID NO: 713591-93gi|32324127486-57898%Seq ID NO: 71359140-211gi|18089994172-24388%Seq ID NO: 7137121-138gi|289160761-11991%Seq ID NO: 7137114-41gi|15947334-61100%Seq ID NO: 7137114-41gi|2454547227-200100%Seq ID NO: 7137114-41gi|551594194-221100%Seq ID NO: 7137114-41gi|551595669-696100%Seq ID NO: 7137114-41gi|1803225434-61100%Seq ID NO: 7137118-41gi|18477256279-256100%Seq ID NO: 7137118-41gi|18477260325-302100%Seq ID NO: 7137118-41gi|18477262634-611100%Seq ID NO: 7137114-35gi|1847725922-1100%Seq ID NO: 7137114-35gi|1847726122-1100%Seq ID NO: 7137114-35gi|3778096822-1100%Seq ID NO: 714284-75gi|33139530421-350100%Seq ID NO: 714284-75gi|18080188489-41898%Seq ID NO: 714284-71gi|54545749478-41197%Seq ID NO: 7142825-75gi|54545112474-42396%Seq ID NO: 7144615-54gi|289160761-4097%Seq ID NO: 714468-35gi|15947334-61100%Seq ID NO: 714468-35gi|2454547227-200100%Seq ID NO: 714468-35gi|551594194-221100%Seq ID NO: 714468-35gi|551595669-696100%Seq ID NO: 714468-35gi|1803225434-61100%Seq ID NO: 7144612-35gi|18477256279-256100%Seq ID NO: 7144612-35gi|18477260325-302100%Seq ID NO: 7144612-35gi|18477262634-611100%Seq ID NO: 714468-29gi|1847725922-1100%Seq ID NO: 714468-29gi|1847726122-1100%Seq ID NO: 714468-29gi|3778096822-1100%Seq ID NO: 71587297-344gi|33139758490-53793%Seq ID NO: 71587300-344gi|33140062109-6189%Seq ID NO: 715991-184gi|33139461183-36699%Seq ID NO: 716821-47gi|3313944516-6291%Seq ID NO: 7168986-112gi|323242231-27100%Seq ID NO: 717341-156gi|714415723-17993%Seq ID NO: 717341-156gi|5454968810-16692%Seq ID NO: 7173438-156gi|18083086614-49591%Seq ID NO: 7173441-156gi|180904031-11791%Seq ID NO: 717341-91gi|52129301409-31985%Seq ID NO: 7179719-95gi|18081000336-26092%Seq ID NO: 718181-136gi|16797830686-82199%Seq ID NO: 718181-136gi|16797832684-81999%Seq ID NO: 718181-136gi|31442320683-81899%Seq ID NO: 718181-136gi|26000759682-81798%Seq ID NO: 718181-136gi|31442322683-81898%Seq ID NO: 718181-136gi|38096133685-82097%Seq ID NO: 718181-80gi|16797841691-76997%Seq ID NO: 718181-80gi|16797843725-80397%Seq ID NO: 718181-98gi|16797846702-80193%Seq ID NO: 718181-98gi|16797847692-79193%Seq ID NO: 718181-98gi|16797848691-79093%Seq ID NO: 718181-80gi|16797849692-77097%Seq ID NO: 718181-98gi|14600264704-80393%Seq ID NO: 7184846-108gi|28916076140-7992%Seq ID NO: 7198533-55gi|32325183375-397100%Seq ID NO: 719961-141gi|1679783062-20298%Seq ID NO: 719961-141gi|1679783162-20097%Seq ID NO: 719961-141gi|1679783263-20197%Seq ID NO: 719961-141gi|2600075962-20096%Seq ID NO: 719961-141gi|3144232262-20095%Seq ID NO: 719961-141gi|3809613362-20093%Seq ID NO: 72109318-338gi|2446785041-21100%Seq ID NO: 722121-47gi|51093880468-51495%Seq ID NO: 722121-39gi|51093879469-50797%Seq ID NO: 722121-38gi|51093883469-50697%Seq ID NO: 722221-181gi|51093884193-1295%Seq ID NO: 722221-181gi|18080808236-5593%Seq ID NO: 72222129-180gi|180322522942-289192%Seq ID NO: 72222130-180gi|3004983779-2990%Seq ID NO: 72222147-180gi|31326537104-7197%Seq ID NO: 72222130-180gi|20064115417-36788%Seq ID NO: 72222130-179gi|1926751050-188%Seq ID NO: 72222130-169gi|1926836644-590%Seq ID NO: 722885-86gi|54546009456-37487%Seq ID NO: 723501-57gi|355050786-6296%Seq ID NO: 7235021-57gi|355044991-37100%Seq ID NO: 7237736-174gi|28916076140-192%Seq ID NO: 72377154-181gi|15947361-34100%Seq ID NO: 72377154-181gi|2454547200-227100%Seq ID NO: 72377154-181gi|551594221-194100%Seq ID NO: 72377154-181gi|551595696-669100%Seq ID NO: 72377154-181gi|1803225461-34100%Seq ID NO: 72377154-177gi|18477256256-279100%Seq ID NO: 72377154-177gi|18477260302-325100%Seq ID NO: 72377154-177gi|18477262611-634100%Seq ID NO: 72377160-181gi|184772591-22100%Seq ID NO: 72377160-181gi|184772611-22100%Seq ID NO: 72377160-181gi|377809681-22100%Seq ID NO: 72416345-500gi|18089572129-28482%Seq ID NO: 7242615-54gi|289160761-4097%Seq ID NO: 724268-35gi|15947334-61100%Seq ID NO: 724268-35gi|2454547227-200100%Seq ID NO: 724268-35gi|551594194-221100%Seq ID NO: 724268-35gi|551595669-696100%Seq ID NO: 724268-35gi|1803225434-61100%Seq ID NO: 7242612-35gi|18477256279-256100%Seq ID NO: 7242612-35gi|18477260325-302100%Seq ID NO: 7242612-35gi|18477262634-611100%Seq ID NO: 724268-29gi|1847725922-1100%Seq ID NO: 724268-29gi|1847726122-1100%Seq ID NO: 724268-29gi|3778096822-1100%Seq ID NO: 72433106-352gi|33139639245-48887%Seq ID NO: 7243399-257gi|33140673199-4485%Seq ID NO: 72529625-646gi|27926103451-472100%Seq ID NO: 7254477-272gi|33139337247-44197%Seq ID NO: 7254477-245gi|33139239401-56998%Seq ID NO: 72579564-667gi|18082454313-41689%Seq ID NO: 46809356-378gi|17991863197-219100%Seq ID NO: 7262115-54gi|289160761-4097%Seq ID NO: 726218-35gi|15947334-61100%Seq ID NO: 726218-35gi|2454547227-200100%Seq ID NO: 726218-35gi|551594194-221100%Seq ID NO: 726218-35gi|551595669-696100%Seq ID NO: 726218-35gi|1803225434-61100%Seq ID NO: 7262112-35gi|18477256279-256100%Seq ID NO: 7262112-35gi|18477260325-302100%Seq ID NO: 7262112-35gi|18477262634-611100%Seq ID NO: 726218-29gi|1847725922-1100%Seq ID NO: 726218-29gi|1847726122-1100%Seq ID NO: 726218-29gi|3778096822-1100%Seq ID NO: 726351-96gi|33140348160-25596%Seq ID NO: 72667361-457gi|54547761394-49086%Seq ID NO: 727061-121gi|28916076122-191%Seq ID NO: 72706101-128gi|15947361-34100%Seq ID NO: 72706101-128gi|2454547200-227100%Seq ID NO: 72706101-128gi|551594221-194100%Seq ID NO: 72706101-128gi|551595696-669100%Seq ID NO: 72706101-128gi|1803225461-34100%Seq ID NO: 72706101-124gi|18477256256-279100%Seq ID NO: 72706101-124gi|18477260302-325100%Seq ID NO: 72706101-124gi|18477262611-634100%Seq ID NO: 72706107-128gi|184772591-22100%Seq ID NO: 72706107-128gi|184772611-22100%Seq ID NO: 72706107-128gi|377809681-22100%Seq ID NO: 7280821-138gi|289160761-11991%Seq ID NO: 7280814-41gi|15947334-61100%Seq ID NO: 7280814-41gi|2454547227-200100%Seq ID NO: 7280814-41gi|551594194-221100%Seq ID NO: 7280814-41gi|551595669-696100%Seq ID NO: 7280814-41gi|1803225434-61100%Seq ID NO: 7280818-41gi|18477256279-256100%Seq ID NO: 7280818-41gi|18477260325-302100%Seq ID NO: 7280818-41gi|18477262634-611100%Seq ID NO: 7280814-35gi|1847725922-1100%Seq ID NO: 7280814-35gi|1847726122-1100%Seq ID NO: 7280814-35gi|3778096822-1100%Seq ID NO: 728205-100gi|33139131115-21386%Seq ID NO: 72859169-270gi|18090422452-55389%Seq ID NO: 730391-111gi|33139007235-12598%Seq ID NO: 73039119-235gi|33139007117-194%Seq ID NO: 73039132-235gi|33140543104-198%Seq ID NO: 7315673-218gi|32324285154-29997%Seq ID NO: 73156266-408gi|32324285298-44096%Seq ID NO: 73156454-534gi|32324285439-519100%Seq ID NO: 73156454-516gi|33139772439-501100%Seq ID NO: 73170167-212gi|32324562537-49291%Seq ID NO: 73219223-246gi|32325459267-290100%Seq ID NO: 7323375-95gi|19267575145-125100%Seq ID NO: 7342636-174gi|28916076140-192%Seq ID NO: 73426154-181gi|15947361-34100%Seq ID NO: 73426154-181gi|2454547200-227100%Seq ID NO: 73426154-181gi|551594221-194100%Seq ID NO: 73426154-181gi|551595696-669100%Seq ID NO: 73426154-181gi|1803225461-34100%Seq ID NO: 73426154-177gi|18477256256-279100%Seq ID NO: 73426154-177gi|18477260302-325100%Seq ID NO: 73426154-177gi|18477262611-634100%Seq ID NO: 73426160-181gi|184772591-22100%Seq ID NO: 73426160-181gi|184772611-22100%Seq ID NO: 73426160-181gi|377809681-22100%Seq ID NO: 734541-50gi|33140348252-20398%Seq ID NO: 46864225-248gi|5212755372-95100%Seq ID NO: 7359479-252gi|323242111-17184%Seq ID NO: 7359473-224gi|3313913199-25085%Seq ID NO: 73594294-331gi|33139131293-33094%Seq ID NO: 736161-285gi|32324082243-52797%Seq ID NO: 73616331-370gi|32324082527-566100%Seq ID NO: 468711109-1129gi|1576653177-57100%Seq ID NO: 468961-152gi|3232557617-16899%Seq ID NO: 468961-150gi|5454747244-19390%Seq ID NO: 468961-150gi|1808306822-17188%Seq ID NO: 7396036-174gi|28916076140-192%Seq ID NO: 73960154-181gi|15947361-34100%Seq ID NO: 73960154-181gi|2454547200-227100%Seq ID NO: 73960154-181gi|551594221-194100%Seq ID NO: 73960154-181gi|551595696-669100%Seq ID NO: 73960154-181gi|1803225461-34100%Seq ID NO: 73960154-177gi|18477256256-279100%Seq ID NO: 73960154-177gi|18477260302-325100%Seq ID NO: 73960154-177gi|18477262611-634100%Seq ID NO: 73960160-181gi|184772591-22100%Seq ID NO: 73960160-181gi|184772611-22100%Seq ID NO: 73960160-181gi|377809681-22100%Seq ID NO: 739861-101gi|33139929341-44199%Seq ID NO: 73986151-236gi|33139929439-524100%Seq ID NO: 7398616-101gi|323244611-8698%Seq ID NO: 739874-146gi|7144133498-64191%Seq ID NO: 7398752-152gi|183824141346-144693%Seq ID NO: 7451417-120gi|7143618355-25286%Seq ID NO: 74514155-180gi|1808032930-596%Seq ID NO: 74514154-187gi|18080516211-17894%Seq ID NO: 74514155-187gi|7143678478-44693%Seq ID NO: 74633161-181gi|38512984421-441100%Seq ID NO: 748321-56gi|7641359475-42092%Seq ID NO: 7483210-53gi|150035965-4897%Seq ID NO: 748321-57gi|1408590055-11191%Seq ID NO: 748321-53gi|1408611051-10392%Seq ID NO: 748321-53gi|1926557441-9392%Seq ID NO: 748321-57gi|3974752743-9991%Seq ID NO: 748321-56gi|1576766065-12091%Seq ID NO: 748321-56gi|18495580501-44691%Seq ID NO: 7483216-53gi|1500361864-10197%Seq ID NO: 748321-53gi|3402590942-9490%Seq ID NO: 7499272-256gi|33139639245-42992%Seq ID NO: 7499263-214gi|33140673201-5089%Seq ID NO: 7503576-97gi|19266802444-465100%Seq ID NO: 75105431-590gi|33139939312-47197%Seq ID NO: 75105240-388gi|33139939165-31296%Seq ID NO: 75105799-910gi|33139939472-58399%Seq ID NO: 7510564-176gi|3313993952-16494%Seq ID NO: 75105829-959gi|1809052033-16387%Seq ID NO: 75106327-348gi|33140085432-453100%Seq ID NO: 75118318-457gi|142805752710-2849100%Seq ID NO: 75118133-231gi|142805752614-2712100%Seq ID NO: 751182-77gi|142805752538-2613100%Seq ID NO: 75118318-355gi|142805703427-346592%Seq ID NO: 75173161-181gi|32324086604-584100%Seq ID NO: 7519067-108gi|714432964-105100%Seq ID NO: 75203175-233gi|2139334299-15788%Seq ID NO: 75203190-233gi|21393558107-15091%Seq ID NO: 75203175-201gi|7797714281-30796%Seq ID NO: 7528966-470gi|3314013692-49690%Seq ID NO: 75439292-321gi|7922575190-21896%Seq ID NO: 75663438-478gi|18081901558-51892%Seq ID NO: 47026543-822gi|18382431565-48691%Seq ID NO: 7591657-196gi|323239991-14095%Seq ID NO: 759506-261gi|323242111-25684%Seq ID NO: 7595017-145gi|33139131116-24489%Seq ID NO: 7595013-250gi|33139639256-49381%Seq ID NO: 759501-145gi|33140673193-4984%Seq ID NO: 759601-75gi|30028941235-30996%Seq ID NO: 759601-75gi|33140204235-30996%Seq ID NO: 47033313-382gi|3232468613-8298%Seq ID NO: 7609466-470gi|3314013692-49689%Seq ID NO: 76242181-203gi|27001433261-283100%Seq ID NO: 7628116-139gi|5454624015-13890%Seq ID NO: 7628116-139gi|1808939011-13489%Seq ID NO: 76311128-240gi|32324311117-22988%Seq ID NO: 763111-46gi|3232431155-10093%Seq ID NO: 76311130-241gi|33140078156-26787%Seq ID NO: 76311152-241gi|3313902810-9990%Seq ID NO: 47066255-278gi|7143978211-234100%Seq ID NO: 763581-57gi|32324009418-47498%Seq ID NO: 763581-53gi|33139304439-49198%Seq ID NO: 764095-25gi|19267937180-160100%Seq ID NO: 76432620-750gi|33139974141-1199%Seq ID NO: 76432339-516gi|33139974316-13991%Seq ID NO: 76470296-390gi|33140673141-4485%Seq ID NO: 764711-87gi|3550508514-10097%Seq ID NO: 764718-87gi|3550499121-10095%Seq ID NO: 7647149-87gi|355050461-39100%Seq ID NO: 76556591-615gi|19103651327-35296%Seq ID NO: 7663417-151gi|33139131116-25088%Seq ID NO: 766346-145gi|323242111-14088%Seq ID NO: 7663413-279gi|33139639256-51680%Seq ID NO: 766341-145gi|33140673193-4984%Seq ID NO: 766673-64gi|3232528320-8196%Seq ID NO: 767181-84gi|33139696197-28094%Seq ID NO: 767751-149gi|35505158357-50599%Seq ID NO: 4710582-256gi|1808049364-23887%Seq ID NO: 4710516-36gi|1808049348-68100%Seq ID NO: 4710582-254gi|714437951-22387%Seq ID NO: 7697061-82gi|1809001419-40100%Seq ID NO: 77032570-679gi|3313996911-12096%Seq ID NO: 770385-123gi|27077481563-144599%Seq ID NO: 770385-123gi|341058131562-144499%Seq ID NO: 770385-123gi|341058151562-144499%Seq ID NO: 7703817-123gi|308441791550-144495%Seq ID NO: 7703817-123gi|341058071434-132895%Seq ID NO: 7703817-123gi|341058081545-143995%Seq ID NO: 7703817-123gi|341058101561-145595%Seq ID NO: 7703817-123gi|313763221492-138694%Seq ID NO: 7703817-123gi|313763231488-138294%Seq ID NO: 7703817-123gi|341058061558-145293%Seq ID NO: 7703817-123gi|51093982386-28093%Seq ID NO: 7703817-123gi|69839591511-140592%Seq ID NO: 7703817-123gi|22544385274-16892%Seq ID NO: 7703817-123gi|3016995141-14792%Seq ID NO: 7703817-123gi|313763251503-139792%Seq ID NO: 77198308-501gi|33140188263-7096%Seq ID NO: 77198302-469gi|33140188464-29787%Seq ID NO: 771984-91gi|33140188353-26697%Seq ID NO: 771981-88gi|33140188551-46490%Seq ID NO: 77198389-469gi|33140188572-49291%Seq ID NO: 771981-60gi|33140188161-10290%Seq ID NO: 771995-155gi|33140188113-26396%Seq ID NO: 77199372-521gi|33140188266-41596%Seq ID NO: 77199403-519gi|33140188102-21894%Seq ID NO: 771995-161gi|33140188308-46488%Seq ID NO: 77199375-483gi|33140188464-57291%Seq ID NO: 771995-74gi|33140188503-57291%Seq ID NO: 77213199-266gi|18090001163-23085%Seq ID NO: 772531-146gi|33139770381-52698%Seq ID NO: 772604-60gi|33140397536-59298%Seq ID NO: 772604-58gi|33139624522-57698%Seq ID NO: 773421-101gi|714415739-13997%Seq ID NO: 773421-101gi|5454968826-12696%Seq ID NO: 7734222-101gi|18083086614-53596%Seq ID NO: 7734225-101gi|180904031-7796%Seq ID NO: 7734254-75gi|52129301340-319100%Seq ID NO: 773827-92gi|54546009456-37088%Seq ID NO: 773972-222gi|7144372296-51690%Seq ID NO: 773972-214gi|18081853341-55290%Seq ID NO: 774781-43gi|714408150-9290%Seq ID NO: 77496474-494gi|3974717732-12100%Seq ID NO: 77564203-337gi|33140348119-25386%Seq ID NO: 776491-160gi|18081259449-29193%Seq ID NO: 777251-24gi|33139696554-577100%Seq ID NO: 780001-163gi|3232495129-19794%Seq ID NO: 47181709-827gi|1808033913-13188%Seq ID NO: 78139149-236gi|3232436916-10397%Seq ID NO: 78157789-809gi|34025242232-212100%Seq ID NO: 78157789-809gi|32322566529-509100%Seq ID NO: 47188202-341gi|32324409329-46895%Seq ID NO: 471881-90gi|33139667502-59198%Seq ID NO: 78415107-386gi|3314012775-35496%Seq ID NO: 784151-44gi|3314012733-7695%Seq ID NO: 784368-67gi|2891607660-190%Seq ID NO: 7881818-77gi|3002894153-11295%Seq ID NO: 7881818-77gi|3314020453-11295%Seq ID NO: 789271-110gi|3232540574-18395%Seq ID NO: 78955118-142gi|18089765337-361100%Seq ID NO: 79093105-368gi|33139802311-57498%Seq ID NO: 790931-63gi|33139802249-311100%Seq ID NO: 79093105-344gi|33140772311-55098%Seq ID NO: 79160206-305gi|33139131107-20687%Seq ID NO: 79160429-458gi|33139131300-32893%Seq ID NO: 79160291-313gi|7143644362-384100%Seq ID NO: 79239183-270gi|180830612-8987%Seq ID NO: 7927594-131gi|15767717372-40992%Seq ID NO: 79292768-882gi|18082714142-25688%Seq ID NO: 79369124-211gi|32325459290-20884%Seq ID NO: 7937110-86gi|5454734734-11088%Seq ID NO: 794191-110gi|33139485331-44095%Seq ID NO: 79419161-248gi|33139485438-52598%Seq ID NO: 795331-179gi|33140348253-7589%Seq ID NO: 79533122-160gi|3232449844-592%Seq ID NO: 795451-68gi|714391849-11691%Seq ID NO: 795451-42gi|183829701050-100995%Seq ID NO: 47286317-516gi|18088906218-41788%Seq ID NO: 47286106-178gi|18088906140-21291%Seq ID NO: 7957677-189gi|32324732377-48998%Seq ID NO: 795761-69gi|32324732301-369100%Seq ID NO: 7960260-139gi|714365484-590%Seq ID NO: 79614287-458gi|30169344167-33897%Seq ID NO: 796141-119gi|3016934449-167100%Seq ID NO: 797291-28gi|33140324288-315100%Seq ID NO: 7974194-257gi|33140312390-55396%Seq ID NO: 7974194-145gi|33140619501-55294%Seq ID NO: 798181-75gi|3232531537-111100%Seq ID NO: 7981818-75gi|355044161-5898%Seq ID NO: 7981832-75gi|3550454415-58100%Seq ID NO: 7981836-75gi|355044834-43100%Seq ID NO: 798851-141gi|33139639311-45190%Seq ID NO: 798851-78gi|33140673126-4992%Seq ID NO: 7988532-78gi|3232421194-14095%Seq ID NO: 798931-62gi|32324007469-40895%Seq ID NO: 8001717-50gi|7144038422-455100%Seq ID NO: 801025-88gi|33139645485-57091%Seq ID NO: 80192397-417gi|17991359548-528100%Seq ID NO: 473241-98gi|35504434329-42691%Seq ID NO: 47325416-525gi|7143492304-19590%Seq ID NO: 803941-240gi|3313996221-26096%Seq ID NO: 80394295-405gi|33139962264-37497%Seq ID NO: 80394199-239gi|18089348186-22792%Seq ID NO: 8042239-59gi|27926823344-364100%Seq ID NO: 8060150-200gi|33140348105-25594%Seq ID NO: 806018-63gi|3314034853-10796%Seq ID NO: 8060150-77gi|3232449817-4496%Seq ID NO: 8075067-235gi|5454605758-22688%Seq ID NO: 8075012-33gi|33952487540-561100%Seq ID NO: 8075012-33gi|34025939541-562100%Seq ID NO: 80750205-226gi|4535791960-1939100%Seq ID NO: 80985458-478gi|8005746172-192100%Seq ID NO: 8107719-132gi|1808012472-18684%Seq ID NO: 8107735-63gi|18082691129-15796%Seq ID NO: 81120854-916gi|323242174-66100%Seq ID NO: 81120862-883gi|779794463-84100%Seq ID NO: 474001-136gi|33140804236-371100%Seq ID NO: 47400194-337gi|33140804372-51595%Seq ID NO: 813371-154gi|32324094298-45197%Seq ID NO: 813372-135gi|183815851162-102987%Seq ID NO: 81377309-329gi|30165848173-193100%Seq ID NO: 813811-110gi|33139131114-22587%Seq ID NO: 81412136-188gi|32324211450-50290%Seq ID NO: 8147150-242gi|1808861491-28387%Seq ID NO: 81471589-666gi|18088614289-36693%Seq ID NO: 81471606-666gi|32183705270-33088%Seq ID NO: 8147717-237gi|18082509367-58790%Seq ID NO: 8147717-237gi|54548304360-58090%Seq ID NO: 8147717-231gi|54548230373-58788%Seq ID NO: 81624114-264gi|33139681415-56598%Seq ID NO: 816241-72gi|33139681346-41798%Seq ID NO: 816341-153gi|711084931-18391%Seq ID NO: 816341-162gi|54546003335-49687%Seq ID NO: 8164815-54gi|289160761-4097%Seq ID NO: 816488-35gi|15947334-61100%Seq ID NO: 816488-35gi|2454547227-200100%Seq ID NO: 816488-35gi|551594194-221100%Seq ID NO: 816488-35gi|551595669-696100%Seq ID NO: 816488-35gi|1803225434-61100%Seq ID NO: 8164812-35gi|18477256279-256100%Seq ID NO: 8164812-35gi|18477260325-302100%Seq ID NO: 8164812-35gi|18477262634-611100%Seq ID NO: 816488-29gi|1847725922-1100%Seq ID NO: 816488-29gi|1847726122-1100%Seq ID NO: 816488-29gi|3778096822-1100%Seq ID NO: 8170217-91gi|33139131116-19092%Seq ID NO: 817026-97gi|323242111-9287%Seq ID NO: 8170384-347gi|33139639256-51684%Seq ID NO: 8170377-216gi|323242111-14089%Seq ID NO: 8170371-222gi|3313913199-25087%Seq ID NO: 8170366-216gi|33140673199-4986%Seq ID NO: 81813589-938gi|33139425112-46394%Seq ID NO: 81813429-539gi|331394251-11197%Seq ID NO: 81813664-822gi|32323963132-29098%Seq ID NO: 81813877-972gi|32323963290-38598%Seq ID NO: 81813877-971gi|32325615290-38498%Seq ID NO: 81813427-539gi|331392741-11397%Seq ID NO: 81813429-606gi|331405561-18192%Seq ID NO: 81813666-822gi|32324994135-29198%Seq ID NO: 81813428-539gi|323249941-11299%Seq ID NO: 81813877-947gi|33139381232-30298%Seq ID NO: 81813487-539gi|331393811-53100%Seq ID NO: 81813427-523gi|331405831-9796%Seq ID NO: 81817452-473gi|33952321139-160100%Seq ID NO: 8183916-77gi|714365136-9790%Seq ID NO: 81895515-875gi|32325036211-57195%Seq ID NO: 81895472-827gi|32325036216-57194%Seq ID NO: 81895563-897gi|32325036211-54594%Seq ID NO: 81895514-693gi|3232503618-19793%Seq ID NO: 81895482-645gi|3232503634-19794%Seq ID NO: 81895610-789gi|3232503618-19792%Seq ID NO: 81895562-741gi|3232503618-19792%Seq ID NO: 81895658-837gi|3232503618-19792%Seq ID NO: 81895706-885gi|3232503618-19792%Seq ID NO: 81895754-897gi|3232503618-16193%Seq ID NO: 81895472-597gi|3232503672-19793%Seq ID NO: 8189548-246gi|32325036231-42886%Seq ID NO: 818951-246gi|32325036232-47683%Seq ID NO: 81895137-270gi|32325036224-35687%Seq ID NO: 8189548-207gi|3232503639-19783%Seq ID NO: 81895138-253gi|3232503633-14786%Seq ID NO: 818951-157gi|3232503640-19680%Seq ID NO: 81895470-716gi|33140561167-41396%Seq ID NO: 81895520-764gi|33140561169-41395%Seq ID NO: 81895568-812gi|33140561169-41395%Seq ID NO: 81895616-860gi|33140561169-41395%Seq ID NO: 81895664-897gi|33140561169-40295%Seq ID NO: 81895457-668gi|33140561202-41395%Seq ID NO: 818951-230gi|33140561185-41383%Seq ID NO: 81895691-712gi|33140561412-433100%Seq ID NO: 81895835-856gi|33140561412-433100%Seq ID NO: 81895547-568gi|33140561412-433100%Seq ID NO: 81895787-808gi|33140561412-433100%Seq ID NO: 81895643-664gi|33140561412-433100%Seq ID NO: 81895739-760gi|33140561412-433100%Seq ID NO: 81895595-616gi|33140561412-433100%Seq ID NO: 81895499-520gi|33140561412-433100%Seq ID NO: 81895534-896gi|7143587175-53783%Seq ID NO: 81895486-848gi|7143587175-53783%Seq ID NO: 81895484-752gi|7143944428-69982%Seq ID NO: 81895724-896gi|7143944332-50484%Seq ID NO: 81895630-728gi|7143944133-23188%Seq ID NO: 81895582-671gi|7143944133-22289%Seq ID NO: 81895534-623gi|7143944133-22289%Seq ID NO: 81895774-863gi|7143944133-22289%Seq ID NO: 81895486-575gi|7143944133-22289%Seq ID NO: 81895822-896gi|7143944133-20790%Seq ID NO: 81895678-767gi|7143944133-22286%Seq ID NO: 81895470-569gi|33139222297-39697%Seq ID NO: 81895520-617gi|33139222299-39695%Seq ID NO: 81895760-857gi|33139222299-39695%Seq ID NO: 81895616-713gi|33139222299-39695%Seq ID NO: 81895568-665gi|33139222299-39695%Seq ID NO: 81895712-809gi|33139222299-39693%Seq ID NO: 81895664-761gi|33139222299-39693%Seq ID NO: 81895808-897gi|33139222299-38895%Seq ID NO: 81895457-521gi|33139222332-39692%Seq ID NO: 8208873-93gi|3314042113-33100%Seq ID NO: 8209418-129gi|33139131139-25090%Seq ID NO: 8209431-123gi|3232421148-14090%Seq ID NO: 820941-152gi|33139639266-41780%Seq ID NO: 8210919-196gi|18087933464-28790%Seq ID NO: 8210919-184gi|54547517403-56888%Seq ID NO: 8210919-166gi|18081843491-63889%Seq ID NO: 8210919-139gi|18083082505-62590%Seq ID NO: 8210966-191gi|51334233404-52981%Seq ID NO: 821421-79gi|33140136402-48093%Seq ID NO: 82163122-267gi|3313913199-24487%Seq ID NO: 82163128-267gi|323242111-14087%Seq ID NO: 821662-106gi|33139639245-34990%Seq ID NO: 821661-120gi|33140673193-7484%Seq ID NO: 8219966-470gi|3314013692-49689%Seq ID NO: 822211-87gi|18090662244-33088%Seq ID NO: 8225659-122gi|142805726490-655392%Seq ID NO: 82260698-718gi|37972243197-217100%Seq ID NO: 82260698-718gi|45566049213-233100%Seq ID NO: 474763-211gi|18089811280-48887%Seq ID NO: 474763-167gi|18080518317-48188%Seq ID NO: 825061-174gi|33140348248-7589%Seq ID NO: 82506117-155gi|3232449844-592%Seq ID NO: 825291-86gi|3232469254-13997%Seq ID NO: 825297-86gi|1808229852-13190%Seq ID NO: 827991-40gi|33140348254-215100%Seq ID NO: 8279978-135gi|33140348181-12390%Seq ID NO: 82849599-663gi|18080145213-27787%Seq ID NO: 82849358-380gi|40670113439-417100%Seq ID NO: 828651-127gi|33140348255-12893%Seq ID NO: 82865185-227gi|3314034870-2797%Seq ID NO: 82929772-793gi|893075949-28100%Seq ID NO: 82929772-793gi|3402621641-20100%Seq ID NO: 82965302-322gi|4698448489-109100%Seq ID NO: 83225316-338gi|2754067536-58100%Seq ID NO: 83226299-358gi|32325383584-52598%Seq ID NO: 832493-237gi|331395811-23599%Seq ID NO: 832741-55gi|21393574296-35089%Seq ID NO: 833391-268gi|51093880722-45286%Seq ID NO: 833391-252gi|51093883723-46985%Seq ID NO: 83339134-268gi|51093882587-45287%Seq ID NO: 83339134-252gi|54545658621-50387%Seq ID NO: 8337940-139gi|18081804201-30089%Seq ID NO: 8337940-130gi|18088266516-60690%Seq ID NO: 8338242-429gi|32325123663-27687%Seq ID NO: 83382132-519gi|32325123663-27687%Seq ID NO: 833821-384gi|32325123659-27687%Seq ID NO: 8338290-474gi|32325123660-27686%Seq ID NO: 833823-294gi|32325123567-27689%Seq ID NO: 83382177-526gi|32325123663-31486%Seq ID NO: 833821-204gi|32325123479-27690%Seq ID NO: 83382312-530gi|32325123663-44584%Seq ID NO: 833821-87gi|32325123344-25894%Seq ID NO: 8338217-165gi|32325123236-8483%Seq ID NO: 83382308-435gi|32325123214-8483%Seq ID NO: 83382332-384gi|32325123236-18390%Seq ID NO: 83382377-429gi|32325123236-18390%Seq ID NO: 83382219-345gi|32325123213-8481%Seq ID NO: 83382489-519gi|32325123213-183100%Seq ID NO: 83382129-159gi|32325123213-183100%Seq ID NO: 8338217-75gi|32325123143-8488%Seq ID NO: 83382242-294gi|32325123236-18388%Seq ID NO: 83382197-255gi|32325123143-8487%Seq ID NO: 83382467-525gi|32325123143-8487%Seq ID NO: 833821-24gi|32325123206-183100%Seq ID NO: 833829-30gi|32325123105-84100%Seq ID NO: 83382129-519gi|33139983660-27187%Seq ID NO: 8338238-519gi|33139983661-17884%Seq ID NO: 8338278-474gi|33139983666-27186%Seq ID NO: 83382173-526gi|33139983661-30987%Seq ID NO: 83382308-530gi|33139983661-44087%Seq ID NO: 83382351-519gi|33139983663-49588%Seq ID NO: 833821-337gi|33139983429-8782%Seq ID NO: 83382129-513gi|33139225635-25287%Seq ID NO: 833821-378gi|33139225628-25287%Seq ID NO: 8338238-423gi|33139225636-25286%Seq ID NO: 8338282-464gi|33139225637-25686%Seq ID NO: 83382352-519gi|33139225637-47088%Seq ID NO: 833823-159gi|33139225312-15382%Seq ID NO: 8338228-384gi|32324170684-32887%Seq ID NO: 83382129-474gi|32324170673-32887%Seq ID NO: 83382163-519gi|32324170684-32887%Seq ID NO: 8338281-519gi|32324170676-23584%Seq ID NO: 83382219-528gi|32324170673-36488%Seq ID NO: 833823-439gi|32324170574-13283%Seq ID NO: 83382253-526gi|32324170684-41187%Seq ID NO: 833821-357gi|32324170486-12482%Seq ID NO: 83382350-530gi|32324170677-49787%Seq ID NO: 833821-78gi|32324170666-58996%Seq ID NO: 83382433-528gi|32324170684-58990%Seq ID NO: 83382222-519gi|33140346602-30286%Seq ID NO: 83382132-429gi|33140346602-30286%Seq ID NO: 8338242-428gi|33140346602-21083%Seq ID NO: 833821-338gi|33140346553-21083%Seq ID NO: 83382357-526gi|33140346602-43390%Seq ID NO: 8338217-87gi|33140346168-9791%Seq ID NO: 83382377-439gi|33140346168-10590%Seq ID NO: 83382492-528gi|33140346602-566100%Seq ID NO: 83382467-529gi|33140346168-10589%Seq ID NO: 83382332-394gi|33140346168-10589%Seq ID NO: 83382107-169gi|33140346168-10589%Seq ID NO: 83382197-259gi|33140346168-10589%Seq ID NO: 83382377-434gi|3314034675-1790%Seq ID NO: 8338217-74gi|3314034675-1790%Seq ID NO: 833821-114gi|33140346418-30282%Seq ID NO: 833821-34gi|33140346138-10597%Seq ID NO: 83382332-389gi|3314034675-1788%Seq ID NO: 83382107-164gi|3314034675-1788%Seq ID NO: 83382467-524gi|3314034675-1788%Seq ID NO: 83382197-254gi|3314034675-1788%Seq ID NO: 833821-29gi|3314034645-1796%Seq ID NO: 83382197-384gi|33140426442-25488%Seq ID NO: 83382332-519gi|33140426442-25488%Seq ID NO: 83382287-474gi|33140426442-25487%Seq ID NO: 83382242-429gi|33140426442-25488%Seq ID NO: 83382107-294gi|33140426442-25487%Seq ID NO: 833821-159gi|33140426412-25489%Seq ID NO: 8338217-204gi|33140426442-25486%Seq ID NO: 83382377-528gi|33140426442-29088%Seq ID NO: 8338262-249gi|33140426442-25486%Seq ID NO: 833823-87gi|33140426320-23691%Seq ID NO: 8338228-169gi|32325556222-7883%Seq ID NO: 83382298-439gi|32325556222-7882%Seq ID NO: 83382208-357gi|32325556222-7080%Seq ID NO: 83382478-519gi|32325556222-18192%Seq ID NO: 8338217-153gi|32324836149-1083%Seq ID NO: 83382377-513gi|32324836149-1082%Seq ID NO: 83382197-329gi|32324836149-1481%Seq ID NO: 83382107-243gi|32324836149-1081%Seq ID NO: 83382467-519gi|32324836149-9790%Seq ID NO: 833821-104gi|32324836120-1481%Seq ID NO: 8338238-161gi|33139487219-9383%Seq ID NO: 83382350-384gi|33139487222-18897%Seq ID NO: 83382219-339gi|33139487218-9581%Seq ID NO: 83382398-429gi|33139487219-18896%Seq ID NO: 8338243-153gi|33139115124-1183%Seq ID NO: 83382133-243gi|33139115124-1182%Seq ID NO: 83382403-513gi|33139115124-1182%Seq ID NO: 83382223-329gi|33139115124-1582%Seq ID NO: 83382493-519gi|33139115124-98100%Seq ID NO: 83382358-384gi|33139115124-98100%Seq ID NO: 8340761-104gi|5454644655-9890%Seq ID NO: 835023-97gi|18090256296-39090%Seq ID NO: 835311-55gi|33140348201-25598%Seq ID NO: 8355631-110gi|545470371-8086%Seq ID NO: 8355632-110gi|545470331-7986%Seq ID NO: 83658281-303gi|33138709165-143100%Seq ID NO: 83658337-357gi|28916076280-260100%Seq ID NO: 83678289-309gi|37853736373-393100%Seq ID NO: 837311-134gi|3232463149-18294%Seq ID NO: 837311-107gi|3313936349-15292%Seq ID NO: 8373718-39gi|31326189232-211100%Seq ID NO: 83765226-357gi|7144169364-49587%Seq ID NO: 83765274-335gi|18081294455-51691%Seq ID NO: 83765274-358gi|18089822443-52791%Seq ID NO: 83781408-429gi|17971089242-221100%Seq ID NO: 83828539-605gi|33139668557-49192%Seq ID NO: 83828386-419gi|33139668530-49794%Seq ID NO: 83828347-368gi|33139668530-509100%Seq ID NO: 8382835-56gi|33139668530-509100%Seq ID NO: 83828230-251gi|33139668530-509100%Seq ID NO: 83828566-605gi|33139848530-49197%Seq ID NO: 8385527-51gi|331400211-25100%Seq ID NO: 475781-101gi|1838256665-16588%Seq ID NO: 475787-101gi|180898152-9688%Seq ID NO: 4757864-98gi|2149353440-7491%Seq ID NO: 8407675-164gi|18081616391-30287%Seq ID NO: 84205295-402gi|18079959358-46582%Seq ID NO: 84241159-179gi|52127565269-249100%Seq ID NO: 8439075-97gi|3402808828-50100%Seq ID NO: 84419109-281gi|33139639255-42888%Seq ID NO: 8441996-242gi|33140673195-4989%Seq ID NO: 845221-73gi|33140257196-26898%Seq ID NO: 8452244-73gi|323243931-30100%Seq ID NO: 8457673-112gi|21393592395-43490%Seq ID NO: 84645275-323gi|7143520433-38591%Seq ID NO: 846486-28gi|35504845119-97100%Seq ID NO: 84664125-168gi|33139131293-33693%Seq ID NO: 84696527-650gi|35504896483-60696%Seq ID NO: 846961-127gi|35504896213-33996%Seq ID NO: 84696189-267gi|35504896337-41597%Seq ID NO: 84696473-526gi|35504896413-46698%Seq ID NO: 8469640-98gi|7143674296-35493%Seq ID NO: 8469642-98gi|7144456296-35292%Seq ID NO: 8469640-92gi|18382222454-50694%Seq ID NO: 8469640-89gi|18090122491-54094%Seq ID NO: 8469655-92gi|18381283389-42694%Seq ID NO: 84779145-167gi|46985397123-101100%Seq ID NO: 8481420-58gi|289160761-4192%Seq ID NO: 47631473-497gi|54549558143-167100%Seq ID NO: 8495015-54gi|289160761-4097%Seq ID NO: 849508-35gi|15947334-61100%Seq ID NO: 849508-35gi|2454547227-200100%Seq ID NO: 849508-35gi|551594194-221100%Seq ID NO: 849508-35gi|551595669-696100%Seq ID NO: 849508-35gi|1803225434-61100%Seq ID NO: 8495012-35gi|18477256279-256100%Seq ID NO: 8495012-35gi|18477260325-302100%Seq ID NO: 8495012-35gi|18477262634-611100%Seq ID NO: 849508-29gi|1847725922-1100%Seq ID NO: 849508-29gi|1847726122-1100%Seq ID NO: 849508-29gi|3778096822-1100%Seq ID NO: 84996246-365gi|32324211375-49484%
Table 2 Legend:

1H. glycines Clone ID No as set forth in Sequence Listing feature fields; searching the H. glycines sequence identifier in column 1 identifies the corresponding SEQ ID NO for that sequence

2nucleotide position in SEQ ID NO corresponding to Clone ID No in column 1 that matches with position of sequence of GeneID in adjacent cell on same row of table 2

3GeneID number of corresponding matching sequence hit from public database that matches with position of Clone ID No from column 1; derivative organism information is associated with the GeneID No.

4nucleotide position in GeneID that matches with nucleotides specified on same row corresponding to sequence of Clone ID SEQ ID NO

5percent identity between the two sequences in Clone ID and GeneID


Surprisingly the inventors have also discovered that some polynucleotides of the present invention exhibit homology with various insect pests of plants and animals, as illustrated in Table 3. This provides an opportunity to express in plant cells polynucleotides exemplified in Table 3 as double stranded RNA sequences, providing control of many of these insect pests of plants and animals. Mosquitoes, for example, are well known as vectors for spreading malaria, yellow fever, encephalitis, filarial parasites and other serious diseases. Male mosquitoes feed exclusively on plant nectar and on plant cell exudates, and female mosquitoes feed on plants when a blood meal is not available. The present invention therefore provides a means for applying the exemplary sequences as dsRNA molecules expressed in plant cells as a means for controlling nematode and insect pests by expression of sequences identified as representative of common sequences between the two species.

TABLE 3SCN vcDNA Sequences and Insect Nucleotide Sequence HomologousSEQ ID NO1Position2Gene ID3Positions4% identity5Genus species6Seq ID NO: 68821138-159CRA|agCT42044945-966100%Anopheles gambiaeSeq ID NO: 79019537-557CRA|agCT431471437-1457100%Anopheles gambiaeSeq ID NO: 47443622-642CRA|agCT43876833-853100%Anopheles gambiaeSeq ID NO: 7324346-69CRA|agCT44110215-238100%Anopheles gambiaeSeq ID NO: 54820116-136CRA|agCT44330197-177100%Anopheles gambiaeSeq ID NO: 6592413-33CRA|agCT443783440-3460100%Anopheles gambiaeSeq ID NO: 53889589-609CRA|agCT4487113437-13457100%Anopheles gambiaeSeq ID NO: 6672938-59CRA|agCT45079733-754100%Anopheles gambiaeSeq ID NO: 60455259-279CRA|agCT453911624-1644100%Anopheles gambiaeSeq ID NO: 69523121-145CRA|agCT45432192-21696%Anopheles gambiaeSeq ID NO: 8067047-67CRA|agCT468461180-1160100%Anopheles gambiaeSeq ID NO: 8179147-69CRA|agCT469681752-1730100%Anopheles gambiaeSeq ID NO: 67053313-339CRA|agCT478741073-109896%Anopheles gambiaeSeq ID NO: 59568697-718CRA|agCT48203192-171100%Anopheles gambiaeSeq ID NO: 72334238-258CRA|agCT49436729-749100%Anopheles gambiaeSeq ID NO: 63951103-123CRA|agCT49483652-672100%Anopheles gambiaeSeq ID NO: 62555130-150CRA|agCT510962549-2569100%Anopheles gambiaeSeq ID NO: 46459342-364CRA|agCT51427215-237100%Anopheles gambiaeSeq ID NO: 66786307-327CRA|agCT525971900-1920100%Anopheles gambiaeSeq ID NO: 68784 4-24CRA|agCT5508220600-20620100%Anopheles gambiaeSeq ID NO: 53634761-804CRA|agCT55207620-57789%Anopheles gambiaeSeq ID NO: 53634754-774CRA|agCT55207597-577100%Anopheles gambiaeSeq ID NO: 53635 58-101CRA|agCT55207577-62089%Anopheles gambiaeSeq ID NO: 53635 88-108CRA|agCT55207577-597100%Anopheles gambiaeSeq ID NO: 73360270-291CRA|agCT55621744-723100%Anopheles gambiaeSeq ID NO: 82401107-127CRA|agCT55677848-868100%Anopheles gambiaeSeq ID NO: 55012650-670EBI|221281-261100%Anopheles gambiaeSeq ID NO: 78551 90-110EBI|2300624-644100%Anopheles gambiaeSeq ID NO: 68299 84-105EBI|23072080-2101100%Anopheles gambiaeSeq ID NO: 47206 84-105EBI|23072080-2101100%Anopheles gambiaeSeq ID NO: 68556241-262EBI|40533009-3030100%Anopheles gambiaeSeq ID NO: 70190174-194EBI|42835460-5440100%Anopheles gambiaeSeq ID NO: 53886405-425EBI|53262105-2085100%Anopheles gambiaeSeq ID NO: 70190243-266EBI|89821366-1389100%Anopheles gambiaeSeq ID NO: 51267164-184EBI|9090121-141100%Anopheles gambiaeSeq ID NO: 5517512-41gi|1111931436-7 100%Andrya cuniculiSeq ID NO: 6669212-41gi|1111931436-7 100%Andrya cuniculiSeq ID NO: 5517512-41gi|1111931536-7 100%Paranoplocephala sp.Seq ID NO: 6669212-41gi|1111931536-7 100%Paranoplocephala sp.Seq ID NO: 5517512-41gi|1111931736-7 100%Paranoplocephala arcticaSeq ID NO: 6669212-41gi|1111931736-7 100%Paranoplocephala arcticaSeq ID NO: 5517512-41gi|1111931936-7 100%Paranoplocephala serrataSeq ID NO: 6669212-41gi|1111931936-7 100%Paranoplocephala serrataSeq ID NO: 47304674-696gi|23186984477-499100%Echinococcus granulosusSeq ID NO: 7703816-43gi|2463301112-85 96%Neogryporhynchus cheilancristrotusSeq ID NO: 5990175-96gi|31365321176-155100%Toxoptera citricidaSeq ID NO: 51042482-505gi|31365444320-297100%Toxoptera citricidaSeq ID NO: 7735497-117gi|31365580644-664100%Toxoptera citricidaSeq ID NO: 82622494-514gi|3780457057-77100%Rhopalosiphum padiSeq ID NO: 6686173-95gi|46996593400-378100%Acyrthosiphon pisumSeq ID NO: 6400776-96gi|46996721468-448100%Acyrthosiphon pisumSeq ID NO: 6008170-96gi|46998065535-56296%Acyrthosiphon pisumSeq ID NO: 54610316-337gi|46998427300-279100%Acyrthosiphon pisumSeq ID NO: 47304674-695gi|47163116387-408100%Echinococcus granulosusSeq ID NO: 62977216-237gi|47514887260-281100%Acyrthosiphon pisumSeq ID NO: 58768293-314gi|4751713476-55100%Acyrthosiphon pisumSeq ID NO: 6382672-92gi|47522032581-561100%Acyrthosiphon pisumSeq ID NO: 82094183-203gi|47533062209-189100%Acyrthosiphon pisumSeq ID NO: 72433558-578gi|47536611364-384100%Acyrthosiphon pisumSeq ID NO: 57774203-223gi|47536768381-401100%Acyrthosiphon pisumSeq ID NO: 5517512-41gi|5430630936-7 100%Anoplocephaloides cf.Seq ID NO: 6669212-41gi|5430630936-7 100%Anoplocephaloides cf.Seq ID NO: 5517512-41gi|5430631136-7 100%Anoplocephaloides kontrimavichusiSeq ID NO: 6669212-41gi|5430631136-7 100%Anoplocephaloides kontrimavichusiSeq ID NO: 5517512-41gi|5430631236-7 100%Anoplocephaloides lemmiSeq ID NO: 6669212-41gi|5430631236-7 100%Anoplocephaloides lemmiSeq ID NO: 5517512-41gi|5430631636-7 100%Andrya rhopalocephalaSeq ID NO: 6669212-41gi|5430631636-7 100%Andrya rhopalocephalaSeq ID NO: 5517512-41gi|5430631836-7 100%Diandrya compositaSeq ID NO: 6669212-41gi|5430631836-7 100%Diandrya compositaSeq ID NO: 5517512-41gi|5430631936-7 100%Mosgovoyia pectinataSeq ID NO: 6669212-41gi|5430631936-7 100%Mosgovoyia pectinataSeq ID NO: 5517512-41gi|5430632036-7 100%Moniezia sp.Seq ID NO: 6669212-41gi|5430632036-7 100%Moniezia sp.Seq ID NO: 5517512-41gi|5430632136-7 100%Monoecocestus americanusSeq ID NO: 6669212-41gi|5430632136-7 100%Monoecocestus americanusSeq ID NO: 5517512-41gi|5430632236-7 100%Paranoplocephala blanchardiSeq ID NO: 6669212-41gi|5430632236-7 100%Paranoplocephala blanchardiSeq ID NO: 5517512-41gi|5430632336-7 100%Paranoplocephala etholeniSeq ID NO: 6669212-41gi|5430632336-7 100%Paranoplocephala etholeniSeq ID NO: 5517512-41gi|5430632436-7 100%Paranoplocephala fellmaniSeq ID NO: 6669212-41gi|5430632436-7 100%Paranoplocephala fellmaniSeq ID NO: 5517512-41gi|5430632536-7 100%Paranoplocephala gracilisSeq ID NO: 6669212-41gi|5430632536-7 100%Paranoplocephala gracilisSeq ID NO: 5517512-41gi|5430632636-7 100%Paranoplocephala longivaginataSeq ID NO: 6669212-41gi|5430632636-7 100%Paranoplocephala longivaginataSeq ID NO: 5517512-41gi|5430632736-7 100%Paranoplocephala macrocephalaSeq ID NO: 6669212-41gi|5430632736-7 100%Paranoplocephala macrocephalaSeq ID NO: 5517512-41gi|5430632836-7 100%Paranoplocephala cf.Seq ID NO: 6669212-41gi|5430632836-7 100%Paranoplocephala cf.Seq ID NO: 5517512-41gi|5430632936-7 100%Paranoplocephala kalelaiSeq ID NO: 6669212-41gi|5430632936-7 100%Paranoplocephala kalelaiSeq ID NO: 5517512-41gi|5430633136-7 100%Paranoplocephala primordialisSeq ID NO: 6669212-41gi|5430633136-7 100%Paranoplocephala primordialisSeq ID NO: 5517512-41gi|5430633236-7 100%Schizorchis sp.Seq ID NO: 6669212-41gi|5430633236-7 100%Schizorchis sp.Seq ID NO: 60083406-426gi|55794409449-429100%Acyrthosiphon pisumSeq ID NO: 62688 3-23gi|55802365769-789100%Acyrthosiphon pisumSeq ID NO: 82614269-290gi|55803329472-493100%Acyrthosiphon pisumSeq ID NO: 5486519-41gi|55806106709-731100%Acyrthosiphon pisumSeq ID NO: 65500126-146gi|55810448308-328100%Acyrthosiphon pisumSeq ID NO: 80198225-245gi|55810583245-265100%Acyrthosiphon pisumSeq ID NO: 5118474-94gi|55814836337-317100%Acyrthosiphon pisumSeq ID NO: 52762 1-36gi|6467344700-66591%Duplicibothrium paulum
Table 3 Legend:

1H. glycines Clone ID No as set forth in Sequence Listing feature fields; searching the H. glycines sequence identifier in column 1 identifies the corresponding SEQ ID NO for that sequence

2nucleotide position in SEQ ID NO corresponding to Clone ID No in column 1 that matches with position of sequence of GeneID in column 3 on same row

3GeneID number of corresponding matching sequence hit from public database that matches with position of Clone ID No from column 1; information in table is sorted by column 3

4GeneID nucleotide position in column 3 that matches with nucleotides specified on same row corresponding to sequence of SCN Clone ID

5percent identity between the two sequences in Clone ID and GeneID (comparison of identity between column 2 and column 4 sequences)

6Genus and species of organism corresponding to gene sequence set forth in Column 3


Example 6

This example illustrates the suppression of one or more genes in a soybean cyst nematode by providing in the diet of the nematode a double stranded RNA consisting of a nucleotide sequence that is complementary to the messenger RNA sequence expressed from the one or more soybean cyst nematode genes.


Soybean cyst nematode J2 larvae are treated with a double stranded RNA derived from a nucleotide sequence selected from the group consisting of SEQ ID NO:1-SEQ ID NO:97729 in a soaking assay as described in WO 03052110. Briefly, freshly hatched nematode larvae are treated in a soaking buffer (10 mM octopamine in M9 salts, 1 mg/ml FITC in DMF) with or without 2 microgram/microliter of dsRNA for four hours at room temperature. Larvae ingesting the solution are fluorescent. The fluorescent larvae are separated from non-fluorescent larvae, then inoculated into soil containing a germinated soybean seedling. The number of cysts on each plant are counted about 35 days after inoculation. The tested dsRNA molecules that demonstrate significant reduction in the number of cysts counted are then made into plant expression cassettes contained in DNA constructs designed for plant cell transformation. DNA constructs generally comprise constitutive promoters that cause transcription of a linked DNA that transcribes a dsRNA. Promoters that may exhibit enhanced expression in root tissue may be particularly useful for expressing dsRNA effective against soybean cyst nematodes.


These DNA constructs are transformed into soybean plant cells and the cells regenerated into plants. The plants are tested either as Ro plants for nematode resistance or seed is collected and the R1 seed is germinated and the R1 plant roots tested for nematode resistance. Resistance is demonstrated if the transgenic plants have a significant reduction in cyst number or cyst development.


Example 7

This example describes DNA constructs and the expression of a chimeric RNA molecule of the present invention in a transgenic soybean plant cell. The DNA constructs described herein comprise a promoter that causes transcription of an operably linked DNA into an RNA in a soybean cell, the DNA and the transcribed RNA of one or more segments exhibiting homology or complementarity to a soybean cyst nematode contiguous at least about 21-mer nucleotide sequence (DNA or RNA). Exemplary soybean cyst nematode DNA segments were previously described in Table 1 and are further identified in the Sequence Listing as SEQ ID NO:1-SEQ ID NO:45568. When expressed in a plant cell, the DNA construct provides an RNA transcript molecule comprising a self-complimentary segment, a portion of which folds into a double stranded RNA (dsRNA). Detection of the RNA transcript expressed in a cell or tissue of a transgenic plant is diagnostic for the DNA construct(s) that comprises a region of a soybean cyst nematode DNA molecule, and demonstrates that the DNA segment from which the dsRNA molecule is derived is transcribed/expressed in the transgenic soybean cells. Therefore, the transcribed RNA becomes available in the diet of the nematode as it feeds on a plant root cell. The RNA comprises a region that is double stranded and is complementary to a naturally occurring polynucleic acid molecule contained in a soybean cyst nematode cell, and when ingested by the nematode results in suppression of the normal level of the naturally occurring molecule.


Exemplary DNA constructs of the present invention have various regulatory elements that provide transcription or enhance expression or stability of an RNA molecule transcribed from a transgene in a plant cell. For example, a promoter element of a DNA construct of the present invention provides expression of an RNA transcript in a plant cell. Promoters, which can cause the transcription of a linked heterologous DNA are generally known in the art, for example, DNA plant virus promoters (P-CaMV35S, U.S. Pat. Nos. 5,352,605 and 5,196,525, comprising a duplicated enhancer region herein referred to as P-e35S; P-FMV35S, U.S. Pat. Nos. 5,378,619 and 5,018,100, herein incorporated by reference in their entirety), and various plant derived promoters, for example, plant actin promoters (P-Os.Act, U.S. Pat. Nos. 5,641,876 and 6,429,357, herein incorporated by reference in their entirety), and chimeric promoters, for example, P-FMV-Elf1α (U.S. Pat. No. 6,660,911 and other chimeric promoters disclosed therein, herein incorporated by reference in their entirety). Additionally, promoters that provide enhanced expression in root cells relative to other plant cells, may be tested and selected to express the RNA molecules of the present invention. The DNA constructs described in this example utilize the P-e35S and P-FMV promoters to drive the transcription of a DNA and expression of a dsRNA that exhibits homology to a portion of a soybean cyst nematode nucleotide sequence. For example, a nucleotide sequence was assembled consisting of two segments, the forward and reverse nucleotide sequence of SEQ ID NO:22219 from nucleotide position 552-699, linked by an amorphous 20-200 nucleotide segment that did not exhibit any known complementarity to the SCN genome sequences. Bioinformatics analysis indicates that the nucleotide sequence of SEQ ID NO:22219 corresponds to an open reading frame encoding an SCN specific proteasome A-type subunit peptide referred to herein as Pas-4. This chimeric sequence was incorporated into plant expression vectors for use in testing dsRNA mediated suppression of the pas-4 target gene. The DNA constructs 5749 (P-FMV/Pas-4-dsRNA/E6 3′ UTR) was thus assembled and comprises the necessary transfer molecules and regulatory molecules to provide integration into the genome of plant cells and expression of the dsRNA molecule therein.


The DNA constructs comprise a T-DNA region that is transferred into the genome of a plant cell by an Agrobacterium- or Rhizobium-mediated plant cell transformation method, and additional regulatory elements, for example, a 3′ untranslated region (3′ UTR) of the SIE6-3B gene from Gossypium barbadense, herein referred to as E6 3′ UTR (John Plant Mol Biol 30:297-306, 1996, NCBI accession U30508, nucleotide position from about 992-1304). The DNA construct 5749 (P-FMV/Pas-4-dsRNA/E6 3′ UTR) was transferred into Agrobacterium rhizogenes strain.


A transgenic root culture of soybean has been shown to support soybean cyst nematode infection and is useful for the expression of transgenes (Narayanan, et al., Crop Sci. 39:1680-1686, 1999 and Cho et al., Planta (2000) 210:195-204). Agrobacterium rhizogenes transformed to contain the described DNA construct 5749 was used to transformed soybean cells and create independent transgenic root cultures, referred to herein as events 5749-1, etc. Tissues from the transgenic root cultures were assayed for expression of the chimeric SCN gene suppression RNA molecule. Transgenic root tissues were selected using appropriate selection pressures. Transgenic root tissues from each event were screened for the presence of the fluorescence marker expression that was integrated into and adjacent to the dsRNA expression construct. The transgenic root tissues were also screened for the presence of siRNA segments produced from exposure to the root tissue cells' endogenous DICER molecules. siRNA segments were screened for identity to segments of the corresponding dsRNA coding sequences expressed from the plasmid construct expression cassettes. Methods for detecting the presence of an expressed RNA in a cell are known in the art. For example, in this example, the presence of the 3′ UTR was detected using primers that functioned to amplify the UTR sequence from the expressed RNA sequence. A TAQMAN method was then used along with a 3′ UTR specific fluorescence probe to detect the UTR as well as provide information on the relative level of expression from the construct. The data is shown in Table 4.

TABLE 4Levels of Pas-4-dsRNA in Transgenic Soybean Root Cells.EventAve fluorescenseSt DevsiRNA NorthernVector control0.000.00ND5749-12.301.05+5749-33.831.39+5749-45.472.44+5749-53.730.64+5749-80.450.14ND5749-100.160.02ND5749-110.240.04+5749-120.330.17ND5749-A3.140.70+5749-B2.910.27+
ND—not detected


The data in Table 4 indicates that ten events comprising the Pas-4-dsRNA contained detectable levels of the RNA molecule. Northern blot analysis of these events showed detectable levels of siRNA that specifically hybridizes to DNA probes made from a homologous fragment of the Pas-4 coding region. These results demonstrate that soybean cells can be transformed with DNA constructs for expression of dsRNA molecules specific for gene suppression of SCN target genes, and that the transformed plant cells recognize the RNA molecules and dice them into detectable siRNA molecules that may be useful for specific gene suppression of the target gene(s) when provided in the diet of soybean cyst nematodes.


All patent publications cited in this specification are incorporated herein by reference to the same extent as if each individual publication or patent application was specifically and individually indicated to be incorporated by reference.


The Sequence Listing is submitted along with this specification on two compact discs. One disc is labeled ‘Sequence Listing’ according to 37 CFR §1.52(e)(4), and the other disc is labeled ‘CRF’ (computer readable form) according to 37 CFR §1.821(e). Each disc contains a single 271,645 kilo-byte text file labeled ‘SCN_seqListing.txt’, created on Feb. 22, 2005, in IBM-PC format and is compatible with IBM-PC, MS-Windows, Macintosh, and UNIX operating systems. The sequence listing information recorded in computer readable form is identical to the written compact disc sequence listing. The Sequence Listing text file is incorporated herein by reference.

Claims
  • 1. An isolated Heterodera glycines nucleotide sequence selected from the group consisting of SEQ ID NO:22219, SEQ ID NO:45701, and SEQ ID NO:52890.
  • 2. An isolated nucleotide sequence selected from the group consisting of SEQ ID NO:45701 and SEQ ID NO:52890 encoding a polypeptide, wherein said polypeptide is isolated from Heterodera species.
  • 3. A method for suppressing a peptide coding sequence in a Heterodera glycines pest, comprising selecting a target polynucleotide sequence of a Heterodera glycines pest genome comprising at least from about 24 contiguous nucleotides, expressing the polynucleotide sequence as an RNA sequence that forms a double stranded RNA structure in a plant cell, and providing said plant cell in the diet of said Heterodera glycines pest, wherein uptake of the contents of said cell by said pest results in the morbidity and/or mortality of said pest, and wherein said target polynucleotide sequence is selected from the group consisting of SEQ ID NO:22219, SEQ ID NO:45701, and SEQ ID NO:52890.
  • 4. The method of claim 3, wherein said RNA sequence comprises contiguous nucleotides from at least two or more target polynucleotide sequences from two or more different coding sequences of a Heterodera glycines pest.
  • 5. The method of claim 4, wherein each of said two or more different coding sequences are isolated from different Heterodera glycines cDNA sequences.
  • 6. A method for controlling Heterodera glycines infection in a soybean plant comprising a) transforming a soybean plant cell with a DNA construct that expresses a dsRNA molecule in said soybean cell; and b) regenerating said transformed soybean plant cell into a fertile transgenic soybean plant that exhibits resistance to infection by Heterodera glycines; wherein said DNA construct comprises a polynucleotide sequence comprising at least about 24 contiguous nucleotides selected from the group consisting of SEQ ID NO:22219, SEQ ID NO:45701, and SEQ ID NO:52890.
  • 7. The method of claim 6, wherein said DNA construct comprises at least one selectable marker providing herbicide tolerance to the transgenic soybean plant.
  • 8. The method of claim 7, wherein said transgenic soybean plant exhibits tolerance to the herbicide glyphosate.
  • 9. A plasmid library comprising SCN genome sequences wherein said library is ATCC Patent Deposit No. PTA-6583 deposited on Feb. 15, 2005.
  • 10. An isolated and purified plasmid selected from the library of claim 9.
  • 11. A nucleotide sequence selected from the group consisting of SEQ ID NO:1-SEQ ID NO:45568, and the complement thereof, isolated from a plasmid in the library of claim 9.
  • 12. A vcDNA sequence selected from the group consisting of SEQ ID NO:45569-SEQ ID NO:97729, wherein said vcDNA sequence is isolated from a plasmid in the library of claim 9.
  • 13. An amino acid sequence selected from the group consisting of SEQ ID NO:119146-SEQ ID NO:121220 encoded by the vcDNA of claim 12.
  • 14. A transgenic soybean plant comprising the double stranded structure of claim 3.
  • 15. A seed produced from the plant of claim 14, wherein said seed comprises said structure.
  • 16. A plant grown from the seed of claim 15.
CROSS REFERENCE TO RELATED APPLICATIONS

This application claims the benefit of priority to U.S. Provisional Patent Application Ser. No. 60/655,875 filed Feb. 24, 2005, and the sequence listing filed along with that application, which is incorporated herein by reference in its entirety.

Provisional Applications (1)
Number Date Country
60655875 Feb 2005 US