Systematic in silico selection method for identifying drug targets in pathogens

Information

  • Patent Application: 20070141612
  • Publication Number: 20070141612
  • Date Filed: December 14, 2006
  • Date Published: June 21, 2007
Abstract
Methods and compositions are provided for selecting drug targets in silico. The methods include the following steps, performed in the order presented, in an alternative order, or partially or entirely at the same time: (a) identifying one or more essential or functionally important sequences from a model organism using pre-existing genomic and phenotypic data; (b) comparing sequences from (a) with a DNA or peptide sequence from a pathogen to store homologous sequences; (c) comparing sequences from the pathogen with the DNA or peptide sequence from a host organism to store those sequences absent in the host organism; and (d) comparing sequences from (b) and (c) to identify shared sequences, the shared sequences being a drug target. Additionally, identified drug targets are provided.
Description
BACKGROUND

Pathogenic organisms, whether prokaryotic or eukaryotic, infect host organisms to cause disease. It is desirable to treat the diseased host with a therapeutic agent or drug that is toxic to the pathogen but leaves the host unharmed. Unfortunately, finding suitable drug targets in the pathogen is frequently problematic because too little is known about the pathogen's biology.







DETAILED DESCRIPTION OF THE EMBODIMENTS

A method is provided for finding drug targets using a bioinformatics approach. This method requires selecting a suitable model organism so as to generate a set of essential genes or gene products that are likely to be shared by the pathogen. The choice of the model organism may vary over time according to the availability of genomic sequences and phenotypic data in the database. The closer in evolutionary terms the model organism is to the pathogen, the more likely it is that a subset of essential genes will be shared between the model and pathogenic organisms but absent in an evolutionarily distant host genome. Consequently, in one embodiment of the invention, the computer is capable of identifying the phylogenetically closest model organism to the pathogen for which an existing database contains an amount of genome sequence and phenotype data above an arbitrarily assigned threshold. Having selected the model organism using the above criteria, the computer searches the databases to identify a set of genomic sequences that encode essential or functionally important proteins. The essential nature of the protein is defined by the impact of its phenotype on the viability of the pathogen. The model organism can be virtual, consisting of genomic data and phenotypic data compiled in silico from a number of closely related real organisms. In the example provided herein, C. elegans is selected as a model organism for pathogenic nematodes.
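The selection rule for the model organism described above can be sketched in a few lines. This is only an illustration under stated assumptions: the `Candidate` fields, the distance values, the organism names and the threshold values are all hypothetical, and the text does not say how phylogenetic distance is measured.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Candidate:
    name: str
    distance: float      # hypothetical phylogenetic distance to the pathogen
    n_sequences: int     # genome sequences available in the database
    n_phenotypes: int    # sequences that have phenotype data


def pick_model_organism(candidates: List[Candidate],
                        min_sequences: int,
                        min_phenotypes: int) -> Optional[Candidate]:
    """Return the closest candidate whose data volume is above both thresholds."""
    eligible = [c for c in candidates
                if c.n_sequences > min_sequences and c.n_phenotypes > min_phenotypes]
    if not eligible:
        return None
    return min(eligible, key=lambda c: c.distance)


# Made-up candidates: the closest one lacks phenotype data, so the next is chosen.
candidates = [
    Candidate("organism_A", 0.2, 500, 10),
    Candidate("organism_B", 0.4, 20000, 15000),
    Candidate("organism_C", 0.9, 25000, 20000),
]
best = pick_model_organism(candidates, min_sequences=10000, min_phenotypes=5000)
print(best.name if best else None)
```

The thresholds are passed in as parameters because the text describes them as arbitrarily assigned.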


In one embodiment of the invention, a computer-based system is provided for identifying drug targets that includes: (a) a memory for storing a plurality of databases, wherein the plurality of databases comprise a plurality of sequence databases; (b) a processor in communication with the memory; and (c) an output device in communication with the processor, wherein the processor is configured to: (i) group sequences belonging to a model organism, a pathogen and a host from one or more of the plurality of sequence databases; (ii) identify essential or functionally important sequences from the model organism by matching phenotypic data with sequences of the model organism grouped in (i); (iii) compare sequences of the pathogen with sequences identified in (ii); (iv) compare sequences of the pathogen with the host sequences and select pathogen sequences that do not share sequence similarity with the host sequences; (v) compare sequences from (iii) and (iv) to identify sequences corresponding to sequences encoding drug targets; and (vi) cause at least some of the sequences identified in (v) to be displayed on the output device. The above steps can be performed in any order or partially or entirely at the same time.


In an embodiment of the invention, the method for identifying drug targets includes one or more of the following steps:


1. Group sequences belonging to the model organism in Database I.


2. Identify essential or functionally important sequences by matching phenotypic data with sequences in Database I and store these selected sequences in Database II.


3. Group pathogen sequences in Database III.


4. Group host sequences in Database IV.


5. Compare sequences from Database III with those in Database II and store sequences that are significantly similar to those in Database II in Database V.


6. Compare sequences from Database III with those in Database IV and store those sequences from Database III that do not have similarity to sequences in Database IV in Database VI.


7. Compare sequences in Database V and VI and store shared sequences in Database VII as encoding or constituting drug targets.
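Steps 5 to 7 above amount to two filters and a set intersection. The sketch below assumes the sequence comparisons have already been run and that their significant matches are available as dictionaries; the function name, the dictionary layout and the identifiers are hypothetical.

```python
from typing import Dict, Set


def build_database_vii(pathogen_ids: Set[str],
                       hits_vs_essential: Dict[str, Set[str]],
                       hits_vs_host: Dict[str, Set[str]]) -> Set[str]:
    """Combine precomputed similarity hits into the final target set."""
    # Database V: pathogen sequences with a significant match in Database II.
    database_v = {p for p in pathogen_ids if hits_vs_essential.get(p)}
    # Database VI: pathogen sequences with no significant match in Database IV.
    database_vi = {p for p in pathogen_ids if not hits_vs_host.get(p)}
    # Database VII: sequences present in both V and VI.
    return database_v & database_vi


# Toy data: p1 matches an essential model sequence and nothing in the host.
pathogen_ids = {"p1", "p2", "p3", "p4"}
hits_vs_essential = {"p1": {"m1"}, "p2": {"m2"}, "p4": set()}
hits_vs_host = {"p2": {"h9"}}
database_vii = build_database_vii(pathogen_ids, hits_vs_essential, hits_vs_host)
print(sorted(database_vii))
```

Because Databases V and VI are built independently from Database III, the two filters can run in either order or in parallel, which matches the statement that the steps need not be performed sequentially.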


In an embodiment of the invention, the above databases can be constructed from information about genomes of the target pathogen(s) and the target host(s), and phenotypic data relating to the essentiality or functional importance of various sequences (essential model sequences) from a real or virtual model organism representing the pathogen. The latter set of sequences could be pre-existing or may be constructed by systematically evaluating each sequence from a real or virtual model organism as follows (steps 1-4). If a set of essential model sequences is pre-existing, steps 1-4 of the following algorithm may be omitted.


1. Select an arbitrary DNA or peptide sequence from the model organism (model organism sequence) from the database of model organism sequences.


2. Retrieve functional/phenotypic data from auxiliary data sources corresponding to the model organism sequence.


3. If the functional/phenotypic data from step 2 indicates that the model organism sequence is essential or important for normal development or functioning of the model organism, add the sequence to the set of essential model sequences; otherwise, discard the sequence and return to step 1.


4. If more model organism sequences remain, return to step 1; otherwise proceed to step 5.


5. Select an arbitrary DNA or peptide sequence from the pathogen (pathogen sequence) from the database of pathogen sequences.


6. Using a comparative sequence analysis method applicable to DNA or peptide sequences, determine whether sequences orthologous to the pathogen sequence are present in the genome of the target host and whether they are present in the set of essential or functionally important model organism sequences.


7. If the results of step 6 indicate that orthologs of the pathogen sequence are absent in the genome of the target host but present in the set of essential or functionally important model sequences, record the identity of the pathogen sequence in a list of drug targets.


8. If more pathogen sequences are available, return to step 5.


9. Annotate the set of sequences in the list of drug targets using any auxiliary data sources.


10. Prioritize the list of drug targets on the basis of suitability, either manually or programmatically.
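Steps 1 to 8 of the algorithm above can be sketched as two loops. This is a minimal illustration, not the patented implementation: `is_essential` and `is_ortholog` are hypothetical callables standing in for the phenotype test of step 3 and the comparative sequence analysis of step 6, and the toy example uses exact string equality in place of a real similarity search.

```python
from typing import Callable, Dict, List, Optional


def build_essential_set(model_sequences: Dict[str, str],
                        phenotypes: Dict[str, str],
                        is_essential: Callable[[Optional[str]], bool]) -> Dict[str, str]:
    essential = {}
    for seq_id, seq in model_sequences.items():       # steps 1 and 4: visit each sequence
        phenotype = phenotypes.get(seq_id)             # step 2: look up phenotype data
        if is_essential(phenotype):                    # step 3: keep essential sequences
            essential[seq_id] = seq
    return essential


def find_drug_targets(pathogen_sequences: Dict[str, str],
                      essential: Dict[str, str],
                      host_sequences: Dict[str, str],
                      is_ortholog: Callable[[str, str], bool]) -> List[str]:
    targets = []
    for seq_id, seq in pathogen_sequences.items():     # steps 5 and 8
        in_model = any(is_ortholog(seq, m) for m in essential.values())      # step 6
        in_host = any(is_ortholog(seq, h) for h in host_sequences.values())  # step 6
        if in_model and not in_host:                   # step 7
            targets.append(seq_id)
    return targets


# Toy run with placeholder sequences and phenotype labels.
model = {"m1": "AAA", "m2": "CCC"}
phenotypes = {"m1": "lethal", "m2": "wild type"}
essential = build_essential_set(model, phenotypes, lambda p: p == "lethal")
pathogen = {"p1": "AAA", "p2": "CCC", "p3": "GGG"}
host = {"h1": "GGG"}
targets = find_drug_targets(pathogen, essential, host, lambda a, b: a == b)
print(targets)
```

Passing the two predicates in as arguments keeps the loop structure separate from the choice of comparison tool and essentiality criterion.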


The biology of the sequences in the model organism may be determined by gene silencing, gene knockout or other methods known in the art for studying gene function. Step 6 above may be accomplished using a comparative sequence analysis method that allows relatedness to be determined, for example, a BLAST program (Altschul et al., J. Mol. Biol. 215:403-410 (1990)) or FASTA (Pearson and Lipman, Proc. Natl. Acad. Sci. USA 85:2444-2448 (1988)). Step 10 may use a prioritization protocol based on predictions of a person of ordinary skill in the art with respect to, for example, (a) ability to clone and overexpress the DNA (size of gene, isoelectric point, etc.); (b) solubility of the protein product; (c) availability of an assay for the expression product; (d) availability of a protein structure; or (e) impact of the phenotype on the viability of the organism, e.g., lethality in the embryo, uncoordinated motion in the embryo, or disorganization in the development of tissue structures. Any of the above parameters may be evaluated by a subjective value structure selected by the experimenter.
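One way to make the prioritization of step 10 programmatic is a weighted score over criteria (a) to (e). The sketch below is an assumption throughout: the feature names, the weights and the candidate identifiers are hypothetical, since the text leaves the value structure to the experimenter.

```python
from typing import Dict, List

# Hypothetical weights for criteria (a)-(e); any other value structure could be used.
WEIGHTS = {
    "clonable": 1.0,             # (a) can be cloned and overexpressed
    "soluble": 1.0,              # (b) protein product is soluble
    "assay_available": 2.0,      # (c) an assay exists for the expression product
    "structure_available": 1.0,  # (d) a protein structure is available
    "severe_phenotype": 3.0,     # (e) strong impact on viability
}


def score(features: Dict[str, bool], weights: Dict[str, float] = WEIGHTS) -> float:
    """Sum the weights of all criteria that the candidate satisfies."""
    return sum(w for name, w in weights.items() if features.get(name, False))


def prioritize(candidates: Dict[str, Dict[str, bool]],
               weights: Dict[str, float] = WEIGHTS) -> List[str]:
    """Return candidate IDs ordered by descending score, ties broken by ID."""
    return sorted(candidates, key=lambda cid: (-score(candidates[cid], weights), cid))


candidates = {
    "t1": {"clonable": True, "soluble": True},
    "t2": {"severe_phenotype": True, "assay_available": True},
    "t3": {"structure_available": True},
}
ranking = prioritize(candidates)
print(ranking)
```

A manual review can still follow the automated ranking, as the text allows either approach.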


EXAMPLE
Example 1
Implementation of the Algorithm to Discover Drug Targets in Brugia malayi

The target pathogen was Brugia malayi. Databases of genomic DNA sequences as well as sequences predicted to encode protein sequences in Brugia malayi were available from TIGR (The Institute for Genomic Research, USA). The target host was Homo sapiens (human). A database of peptide sequences of human origin was constructed from the public sequence databases available at NCBI (National Center for Biotechnology Information, USA). The model organism for this study was Caenorhabditis elegans. C. elegans peptide sequences and the identities of the genes encoding them were obtained from WormBase (www.wormbase.org) as wormpep release 150. Phenotypic data for the model organism sequences consisted of the results of genome-wide RNAi knockdown scans of the C. elegans genome. The results were obtained from WormBase via its WormMart service at the time that release 150 of wormpep was current.


An essential model sequence set was established by retrieving the RNAi phenotype corresponding to the source gene for each peptide in Wormpep. Peptides were included in the essential model sequence set if the corresponding RNAi phenotype was anything other than Wild Type, unclassified, or missing.
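The inclusion rule above can be written as a small filter. This is a sketch under assumptions: the peptide and gene identifiers and the "embryonic lethal" label are made up for illustration, and the exact spelling of phenotype labels in the downloaded data is not given in the text, so labels are compared case-insensitively.

```python
from typing import Dict, Optional, Set

# Phenotype labels that exclude a peptide from the essential set.
EXCLUDED_LABELS = {"wild type", "unclassified"}


def is_informative(phenotype: Optional[str]) -> bool:
    """True when an RNAi phenotype is present and is not an excluded label."""
    if phenotype is None:
        return False
    label = phenotype.strip().lower()
    return label != "" and label not in EXCLUDED_LABELS


def essential_peptides(peptide_to_gene: Dict[str, str],
                       gene_to_phenotype: Dict[str, Optional[str]]) -> Set[str]:
    """Keep peptides whose source gene has an informative RNAi phenotype."""
    return {pep for pep, gene in peptide_to_gene.items()
            if is_informative(gene_to_phenotype.get(gene))}


# Hypothetical identifiers; gene4 has no phenotype record at all.
peptide_to_gene = {"pepA": "gene1", "pepB": "gene2", "pepC": "gene3", "pepD": "gene4"}
gene_to_phenotype = {"gene1": "embryonic lethal", "gene2": "Wild Type", "gene3": "unclassified"}
essential = essential_peptides(peptide_to_gene, gene_to_phenotype)
print(sorted(essential))
```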


Each pathogen sequence was compared to every sequence in the essential model sequence set and target host sequence databases using the BLAST sequence analysis program blastp. A variety of expectation value (e-value) cutoffs were employed in the blastp analysis to tune the stringency of the analysis and limit the size of the drug target list. For the list shown [below], the e-value cutoffs were 1.0×10^-13 and 1.0×10^-20 for the human and Brugia malayi sequence comparisons, respectively. Pathogen sequences that had no human orthologs with e-values smaller than the cutoff, but did have essential C. elegans orthologs with e-values smaller than the cutoff, were placed into the list of drug targets. In the list [below] they are shown with the identifier from the TIGR genome sequence database for the B. malayi genome, along with the contig on which the sequence was located as well as its start and stop coordinates along that contig.
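The e-value filtering described above can be sketched as follows. The sketch makes several assumptions that are not in the text: the blastp results are taken to be tab-separated 12-column tabular output with the query ID in column 1 and the e-value in column 11 (the layout produced by BLAST+ `-outfmt 6`), the cutoff of 1.0×10^-20 is assigned to the comparison against the essential C. elegans set and 1.0×10^-13 to the human comparison, and all identifiers and hit lines are made up.

```python
from typing import Dict, Iterable, List


def best_evalues(tabular_lines: Iterable[str]) -> Dict[str, float]:
    """Map each query ID to its smallest e-value in 12-column tabular output."""
    best: Dict[str, float] = {}
    for line in tabular_lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        fields = line.split("\t")
        query, evalue = fields[0], float(fields[10])
        if query not in best or evalue < best[query]:
            best[query] = evalue
    return best


def select_targets(pathogen_ids: Iterable[str],
                   vs_model: Dict[str, float],
                   vs_host: Dict[str, float],
                   model_cutoff: float = 1.0e-20,   # assumed assignment, see lead-in
                   host_cutoff: float = 1.0e-13) -> List[str]:
    targets = []
    for pid in pathogen_ids:
        # A missing entry means no hit was reported, so treat it as an infinite e-value.
        has_model_hit = vs_model.get(pid, float("inf")) < model_cutoff
        has_host_hit = vs_host.get(pid, float("inf")) < host_cutoff
        if has_model_hit and not has_host_hit:
            targets.append(pid)
    return targets


# Fabricated example hits, used only to exercise the functions.
model_hits = [
    "p1\tm1\t60.0\t200\t80\t0\t1\t200\t1\t200\t1e-40\t150",
    "p2\tm2\t55.0\t180\t81\t0\t1\t180\t1\t180\t1e-30\t120",
    "p3\tm3\t30.0\t90\t63\t0\t1\t90\t1\t90\t1e-05\t40",
]
host_hits = [
    "p2\th1\t50.0\t170\t85\t0\t1\t170\t1\t170\t1e-25\t100",
    "p1\th2\t28.0\t60\t43\t0\t1\t60\t1\t60\t0.002\t30",
]
targets = select_targets(["p1", "p2", "p3"], best_evalues(model_hits), best_evalues(host_hits))
print(targets)
```

Keeping the two cutoffs as separate parameters mirrors the tuning described in the text, where stringency was adjusted to limit the size of the target list.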

TABLE 1

TIGR gene ID | TIGR pub locus | Gene Description | Contig | Start and Stop (digits run together)
12415.m00014 | Bm1_00120 | ATP synthase epsilon chain, mitochondrial, putative | 12415 | 1145966
12422.m00027 | Bm1_00215 | conserved hypothetical protein | 12422 | 2992554
12443.m00018 | Bm1_00430 | hypothetical protein | 12443 | 16531709
12453.m00017 | Bm1_00490 | CEH-25 homeobox protein-related | 12453 | 5612316
12478.m00033 | Bm1_00640 | hypothetical protein | 12478 | 205717
12482.m00020 | Bm1_00705 | P. falciparum RESA-like protein with DnaJ domain-related | 12482 | 571507
12483.m00026 | Bm1_00720 | conserved hypothetical protein | 12483 | 371910670
12496.m00298 | Bm1_00815 | Cuticle collagen 14, putative | 12496 | 5073352051
12506.m00096 | Bm1_00910 | Zinc finger, C2H2 type family protein | 12506 | 1910722372
12524.m00104 | Bm1_01030 | hypothetical protein | 12524 | 65527247
12525.m00011 | Bm1_01060 | Calcium-binding protein.-related | 12525 | 48492413
12527.m00016 | Bm1_01075 | CG15780-PA-related | 12527 | 180763
12528.m00040 | Bm1_01100 | hypothetical protein | 12528 | 1139410495
12582.m00015 | Bm1_01505 | Hypothetical protein | 12582 | 26342840
12584.m00063 | Bm1_01555 | hypothetical protein | 12584 | 61194890
12601.m00336 | Bm1_01765 | Probable mitochondrial import receptor subunit TOM7-like | 12601 | 5835557798
12616.m00134 | Bm1_01935 | Hypothetical protein | 12616 | 4965047926
12617.m00005 | Bm1_01955 | LPXTG cell wall surface anchor family protein, putative | 12617 | 18771647
12646.m00015 | Bm1_02135 | ribosomal protein L9 domain containing protein | 12646 | 16772711
12647.m00026 | Bm1_02140 | Lethal protein 805, isoform d, putative | 12647 | 1694117
12649.m00085 | Bm1_02155 | Delta5 fatty acid desaturase-related | 12649 | 1471019
12653.m00011 | Bm1_02195 | hypothetical protein | 12653 | 14722549
12688.m00058 | Bm1_02410 | hypothetical protein | 12688 | 16852152
12694.m00015 | Bm1_02455 | WW domain containing protein | 12694 | 1923186
12698.m00331 | Bm1_02520 | 1300013D05Rik protein-related | 12698 | 3544333408
12701.m00057 | Bm1_02565 | 40S ribosomal protein S12.-related | 12701 | 46894996
12715.m00106 | Bm1_02630 | hypothetical protein | 12715 | 1194508
12720.m00024 | Bm1_02675 | GYF domain containing protein | 12720 | 38362233
12778.m00055 | Bm1_03010 | PRO0477p-related | 12778 | 2213122549
12791.m00115 | Bm1_03155 | gag protein-related | 12791 | 17863048
12823.m00025 | Bm1_03370 | LD15209p, putative | 12823 | 1206436
12845.m00017 | Bm1_03495 | Kunitz/Bovine pancreatic trypsin inhibitor domain containing protein | 12845 | 1710725
12870.m00009 | Bm1_03645 | Warthog protein-related | 12870 | 2461392
12888.m00015 | Bm1_03765 | hypothetical protein | 12888 | 254101
12888.m00016 | Bm1_03770 | FLYWCH zinc finger domain containing protein | 12888 | 19221186
12902.m00216 | Bm1_03840 | Hypothetical protein | 12902 | 100379119
12902.m00224 | Bm1_03880 | hypothetical protein | 12902 | 3950833744
12902.m00232 | Bm1_03920 | Cytochrome c oxidase subunit IV family protein | 12902 | 6806870046
12927.m00019 | Bm1_04070 | Salivary glue protein Sgs-3 precursor.-related | 12927 | 25378
13001.m00046 | Bm1_04435 | Hypothetical protein | 13001 | 86220
13047.m00009 | Bm1_04665 | 2,3-bisphosphoglycerate-independent phosphoglycerate mutase, putative | 13047 | 28341308
13058.m00015 | Bm1_04725 | Helix-loop-helix DNA-binding domain containing protein | 13058 | 16072715
13066.m00231 | Bm1_04775 | hypothetical protein | 13066 | 1536016129
13066.m00233 | Bm1_04785 | AN1-like Zinc finger family protein | 13066 | 2234220390
13066.m00250 | Bm1_04865 | DNA polymerase epsilon p17 subunit, putative | 13066 | 100521101389
13068.m00024 | Bm1_04880 | NADH-ubiquinone oxidoreductase AGGG subunit homolog, mitochondrial precursor.-related | 13068 | 69105979
13123.m00029 | Bm1_05160 | Troponin T.-related | 13123 | 1393283
13128.m00013 | Bm1_05190 | hypothetical protein | 13128 | 2015268
13132.m00016 | Bm1_05210 | hypothetical protein | 13132 | 15021583
13143.m00017 | Bm1_05345 | WD-repeat protein 3.-related | 13143 | 95378808
13154.m00137 | Bm1_05435 | conserved hypothetical protein | 13154 | 2540022653
13156.m00091 | Bm1_05470 | hypothetical protein | 13156 | 50067380
13192.m00017 | Bm1_05820 | hypothetical protein | 13192 | 21022272
13204.m00046 | Bm1_05895 | Gex interacting protein protein 16, isoform d-related | 13204 | 1013116129
13210.m00168 | Bm1_05960 | Patched family protein | 13210 | 5959853762
13223.m00106 | Bm1_06045 | conserved hypothetical protein | 13223 | 1142210670
13236.m00034 | Bm1_06130 | Nuclear anchorage protein 1-related | 13236 | 1391711658
13247.m00698 | Bm1_06290 | hypothetical protein | 13247 | 106697105212
13247.m00702 | Bm1_06310 | protein R52.2-related | 13247 | 109582109992
13247.m00708 | Bm1_06340 | hypothetical protein | 13247 | 140585142998
13250.m00034 | Bm1_06460 | hypothetical protein | 13250 | 2282921671
13260.m00102 | Bm1_06640 | Protein phosphatase inhibitor containing protein | 13260 | 3663438089
13261.m00254 | Bm1_06655 | hypothetical protein | 13261 | 110328811
13261.m00256 | Bm1_06665 | Transmembrane amino acid transporter protein | 13261 | 2561529392
13269.m00314 | Bm1_06785 | conserved hypothetical protein | 13269 | 4122243942
13278.m00098 | Bm1_06925 | F26F3.2 protein-related | 13278 | 2424271
13294.m00109 | Bm1_07190 | hypothetical protein | 13294 | 1820216400
13315.m00131 | Bm1_07440 | hypothetical protein | 13315 | 1034210163
13315.m00133 | Bm1_07450 | hypothetical protein | 13315 | 1354315318
13315.m00140 | Bm1_07485 | hypothetical protein | 13315 | 3229832753
13322.m00190 | Bm1_07615 | Peroxin-3 family protein | 13322 | 104877810
13322.m00194 | Bm1_07635 | GM16138p-related | 13322 | 2680825963
13325.m00229 | Bm1_07680 | EB module family protein | 13325 | 1949822956
13333.m00082 | Bm1_07780 | immunogenic protein 3, putative | 13333 | 1398314581
13335.m00032 | Bm1_07795 | hypothetical protein | 13335 | 24183861
13350.m00131 | Bm1_07885 | SD09147p-related | 13350 | 1194911314
13354.m00140 | Bm1_07925 | peroxisomal membrane anchor protein, putative | 13354 | 46983142
13356.m00233 | Bm1_08025 | F-box domain containing protein | 13356 | 112969151
13356.m00235 | Bm1_08035 | Hypothetical 36.0 kDa protein C45G9.5 in chromosome III.-related | 13356 | 2077423113
13366.m00256 | Bm1_08175 | conserved hypothetical protein | 13366 | 1308416928
13369.m00043 | Bm1_08225 | conserved hypothetical protein | 13369 | 1082011384
13388.m00076 | Bm1_08450 | hypothetical protein | 13388 | 2518526629
13398.m00096 | Bm1_08545 | Mediator protein 4-related | 13398 | 3312429280
13400.m00320 | Bm1_08610 | Hypothetical protein | 13400 | 5471051713
13409.m00048 | Bm1_08695 | trehalose-6-phosphate synthase-related | 13409 | 62879952
13411.m00122 | Bm1_08735 | hypothetical protein | 13411 | 1223714686
13411.m00124 | Bm1_08745 | hypothetical protein | 13411 | 1943417983
13415.m00462 | Bm1_08915 | hypothetical protein | 13415 | 164259170446
13444.m00042 | Bm1_09120 | Hypothetical protein | 13444 | 33112059
13449.m00084 | Bm1_09160 | conserved hypothetical protein | 13449 | 915211804
13460.m00194 | Bm1_09225 | HIT zinc finger family protein | 13460 | 59993075
13464.m00238 | Bm1_09270 | Skp1 related (ubiquitin ligase complex component) protein 18-like | 13464 | 846814561
13465.m00049 | Bm1_09360 | conserved hypothetical protein | 13465 | 81391009
13473.m00077 | Bm1_09495 | conserved hypothetical protein | 13473 | 2411120711
13491.m00026 | Bm1_09640 | Nematode cuticle collagen N-terminal domain containing protein | 13491 | 42471874
13497.m00179 | Bm1_09670 | NADH-ubiquinone oxidoreductase B22 subunit.-related | 13497 | 1363416174
13527.m00034 | Bm1_09930 | kinesin light chain, putative | 13527 | 41044196
13534.m00021 | Bm1_09975 | hypothetical protein | 13534 | 16694914
13558.m00035 | Bm1_10140 | Zinc finger, C2H2 type family protein | 13558 | 178710
13562.m00095 | Bm1_10195 | Hypothetical protein | 13562 | 2605326792
13572.m00009 | Bm1_10215 | Calcium-binding protein.-related | 13572 | 19042909
13579.m00037 | Bm1_10260 | gene model 83, putative | 13579 | 189327
13588.m00011 | Bm1_10315 | Long protein 1, isoform b, putative | 13588 | 10701198
13604.m00012 | Bm1_10425 | hypothetical protein | 13604 | 19141989
13613.m00040 | Bm1_10475 | hypothetical protein | 13613 | 77194650
13632.m00183 | Bm1_10660 | Hypothetical 20.9 kDa protein in PLB1-HXT2 intergenic region.-related | 13632 | 3831043
13644.m00292 | Bm1_10835 | hypothetical protein | 13644 | 8217278526
13645.m00040 | Bm1_10860 | hypothetical protein | 13645 | 974713525
13667.m00039 | Bm1_11075 | RNA-dependent helicase, putative | 13667 | 1958114799
13705.m00023 | Bm1_11340 | C2-HC type zinc finger protein C.e-MyT1, putative | 13705 | 18722063
13736.m00406 | Bm1_11590 | Hypothetical 30.1 kDa protein ZC434.4 in chromosome I.-related | 13736 | 4921251909
13736.m00410 | Bm1_11565 | predicted protein | 13736 |
13781.m00021 | Bm1_11825 | hypothetical protein | 13781 | 3393239
13785.m00207 | Bm1_11840 | hypothetical protein | 13785 | 91668088
13847.m00044 | Bm1_12400 | hypothetical protein | 13847 | 1122212502
13890.m00008 | Bm1_12550 | hypothetical protein | 13890 | 5641891
13920.m00451 | Bm1_12855 | hypothetical protein | 13920 | 125731121489
13939.m00060 | Bm1_13005 | hypothetical protein | 13939 | 5751045
13941.m00057 | Bm1_13030 | conserved hypothetical protein | 13941 | 3641466
13944.m00013 | Bm1_13050 | Hypothetical protein | 13944 | 84285
13955.m00009 | Bm1_13150 | Barrier-to-autointegration factor 1, putative | 13955 | 20732198
13961.m00024 | Bm1_13170 | conserved hypothetical protein | 13961 | 3393541
13965.m00025 | Bm1_13195 | hypothetical protein | 13965 | 1875112
14009.m00173 | Bm1_13520 | Hypothetical protein-conserved | 14009 | 2745129497
14012.m00014 | Bm1_13550 | conserved hypothetical protein | 14012 | 1962657
14015.m00090 | Bm1_13600 | major sperm protein 2, putative cytoskeletal MSP | 14015 | 2157121054
14015.m00091 | Bm1_13605 | Major Sperm Protein (MSP), putative cytoskeletal MSP | 14015 | 2202024645
14033.m00022 | Bm1_13715 | Fras1 protein-related | 14033 | 1642294
14039.m00119 | Bm1_13915 | Nematode astacin protease protein 9, isoform c-related | 14039 | 5411856826
14041.m00080 | Bm1_13965 | M-phase phosphoprotein-related | 14041 | 4358244602
14046.m00194 | Bm1_14055 | ShTK domain containing protein | 14046 | 5014449529
14052.m00191 | Bm1_14115 | hypothetical protein | 14052 | 3970643773
14058.m00558 | Bm1_14240 | PWWP domain containing protein | 14058 | 2392219492
14058.m00575 | Bm1_14325 | hypothetical protein | 14058 | 7955578604
14058.m00576 | Bm1_14330 | Mitochondrial ATP synthase coupling factor 6 family protein | 14058 | 8621187347
14058.m00579 | Bm1_14345 | Helix-loop-helix DNA-binding domain containing protein | 14058 | 101813104473
14094.m00132 | Bm1_14650 | hypothetical protein | 14094 | 2368816738
14094.m00138 | Bm1_14680 | PDZ domain containing protein | 14094 | 3260434066
14097.m00079 | Bm1_14750 | Ubiquinol-cytochrome C reductase hinge protein | 14097 | 2300025184
14122.m00164 | Bm1_14965 | hypothetical protein | 14122 | 2390626065
14151.m00029 | Bm1_15165 | hypothetical protein | 14151 | 71097699
14164.m00121 | Bm1_15245 | RH17657p-related | 14164 | 1491213414
14196.m00041 | Bm1_15510 | RE18450p, putative | 14196 | 15273260
14208.m00914 | Bm1_15680 | hypothetical protein | 14208 | 3497436381
14219.m00026 | Bm1_15840 | Surfeit locus protein 6 containing protein | 14219 | 52444204
14229.m00038 | Bm1_15990 | RE06140p-related | 14229 | 62033847
14230.m00222 | Bm1_16040 | hypothetical protein | 14230 | 3443833660
14237.m00398 | Bm1_16245 | symbol-related | 14237 | 6092863143
14239.m00342 | Bm1_16340 | hypothetical protein | 14239 | 3556039598
14248.m00663 | Bm1_16530 | Hint module family protein | 14248 | 17117852
14248.m00664 | Bm1_16540 | hypothetical protein | 14248 | 1346014984
14248.m00667 | Bm1_16555 | hypothetical protein | 14248 | 2665125215
14250.m00292 | Bm1_16675 | hypothetical protein | 14250 | 2258025202
14250.m00295 | Bm1_16685 | hypothetical protein | 14250 | 3421628377
14250.m00299 | Bm1_16705 | conserved hypothetical protein | 14250 | 4280841455
14253.m00158 | Bm1_16780 | hypothetical protein | 14253 | 9420996292
14276.m00246 | Bm1_17070 | Leucine Rich Repeat family protein | 14276 | 2357027964
14279.m00042 | Bm1_17120 | Hyaluronan/mRNA binding family protein | 14279 | 20384764
14282.m00452 | Bm1_17210 | conserved hypothetical protein | 14282 | 7135775320
14284.m00386 | Bm1_17305 | Hypothetical protein | 14284 | 8369981725
14318.m00072 | Bm1_17810 | vacuolar ATP synthase subunit H, putative | 14318 | 1356412905
14328.m00023 | Bm1_17930 | chitin synthase 2 (chs-2) fragment | 14328 | 400683
14341.m00010 | Bm1_18060 | hypothetical protein | 14341 | 12223936
14348.m00100 | Bm1_18115 | hypothetical protein | 14348 | 724114740
14355.m00214 | Bm1_18195 | Serine/threonine protein phosphatase PP1 isozyme 1, putative | 14355 | 1448315779
14379.m00149 | Bm1_18685 | conserved hypothetical protein | 14379 | 1655218189
14379.m00151 | Bm1_18695 | Nematode cuticle collagen N-terminal domain containing protein | 14379 | 2901030713
14386.m00052 | Bm1_18760 | conserved hypothetical protein | 14386 | 37322621
14387.m00349 | Bm1_18845 | GRIM-19 protein | 14387 | 8344585619
14396.m00009 | Bm1_19065 | conserved hypothetical protein | 14396 | 15226613
14409.m00256 | Bm1_19285 | Innexin family protein | 14409 | 109344103608
14411.m00015 | Bm1_19290 | hypothetical protein | 14411 | 765201
14417.m00065 | Bm1_19380 | hypothetical protein | 14417 | 1668519318
14418.m00019 | Bm1_19390 | Hypothetical protein | 14418 | 11772236
14420.m00010 | Bm1_19420 | hypothetical protein | 14420 | 168974
14421.m00015 | Bm1_19425 | MNN4 protein.-related | 14421 | 32612343
14423.m00101 | Bm1_19440 | hypothetical protein | 14423 | 1905123364
14450.m00173 | Bm1_19655 | Hypothetical protein | 14450 | 2464822911
14479.m00132 | Bm1_19985 | cuticle collagen 2 precursor, putative | 14479 | 1071513407
14489.m00060 | Bm1_20120 | PDZ domain containing protein | 14489 | 2052819152
14522.m00057 | Bm1_20495 | hypothetical protein | 14522 | 1427116331
14535.m00021 | Bm1_20745 | adenosine deaminase ADR-1C, putative | 14535 | 29501588
14538.m00475 | Bm1_20785 | hypothetical protein | 14538 | 3635929660
14539.m00055 | Bm1_20840 | calcium-binding protein, putative | 14539 | 54514903
14554.m00230 | Bm1_21040 | Hypothetical thiol protease C06G4.2 in chromosome III.-related | 14554 | 2138319767
14569.m00218 | Bm1_21225 | hypothetical protein | 14569 | 3031828980
14569.m00224 | Bm1_21255 | Chitin binding Peritrophin-A domain containing protein | 14569 | 8168668369
14588.m00024 | Bm1_21530 | hypothetical protein | 14588 | 1449310297
14590.m00346 | Bm1_21620 | Profilin family protein | 14590 | 4814049570
14592.m00176 | Bm1_21655 | hypothetical protein | 14592 | 3585234826
14593.m00155 | Bm1_21695 | hypothetical protein | 14593 | 3368338688
14599.m00264 | Bm1_21815 | hypothetical protein | 14599 | 5808256385
14599.m00266 | Bm1_21825 | hypothetical protein | 14599 | 6306961135
14601.m00160 | Bm1_21885 | SWIB/MDM2 domain containing protein | 14601 | 117857195
14603.m00270 | Bm1_21970 | conserved hypothetical protein | 14603 | 4684550470
14628.m00170 | Bm1_22440 | Homeobox protein goosecoid, putative | 14628 | 3335520
14631.m00037 | Bm1_22500 | hypothetical protein | 14631 | 972214473
14632.m00150 | Bm1_22525 | DNA-(Apurinic or apyrimidinic site) lyase-related | 14632 | 4136537539
14634.m00536 | Bm1_22560 | hypothetical protein | 14634 | 1969219799
14637.m00177 | Bm1_22670 | conserved hypothetical protein | 14637 | 2131723555
14638.m00118 | Bm1_22725 | RNA dependent RNA polymerase family protein | 14638 | 2071235208
14640.m00210 | Bm1_22765 | hypothetical protein | 14640 | 2957127602
14643.m00073 | Bm1_22805 | Hypothetical protein | 14643 | 88658466
14643.m00076 | Bm1_22820 | hypothetical protein | 14643 | 3391433180
14649.m00093 | Bm1_22905 | hypothetical protein | 14649 | 94605878
14652.m00402 | Bm1_22990 | actin-depolymerizing factor 1, putative | 14652 | 1375612623
14652.m00406 | Bm1_23010 | hypothetical protein | 14652 | 2911927550
14652.m00407 | Bm1_23015 | hypothetical protein, conserved | 14652 | 3266630721
14653.m00286 | Bm1_23080 | ATP synthase f chain, mitochondrial.-related | 14653 | 4145840460
14656.m00217 | Bm1_23135 | Hypothetical protein | 14656 | 5161535
14656.m00226 | Bm1_23180 | ribosomal protein L32 containing protein | 14656 | 5410355318
14668.m00161 | Bm1_23370 | Ulp1 protease family, C-terminal catalytic domain containing protein | 14668 | 2491127782
14669.m00054 | Bm1_23380 | Zinc finger, C2H2 type family protein | 14669 | 973612495
14669.m00055 | Bm1_23385 | conserved hypothetical protein | 14669 | 1548519178
14677.m00168 | Bm1_23555 | UcrQ family protein | 14677 | 1577516684
14677.m00171 | Bm1_23570 | conserved hypothetical protein | 14677 | 3300129833
14683.m00062 | Bm1_23670 | FRG1 protein homolog, putative | 14683 | 1721617368
14696.m00216 | Bm1_23935 | heavy metal-associated domain containing protein | 14696 | 3078331456
14704.m00455 | Bm1_24165 | TolA protein.-related | 14704 | 4940447337
14715.m01243 | Bm1_24555 | hypothetical protein | 14715 | 6014661013
14715.m01245 | Bm1_24565 | conserved hypothetical protein | 14715 | 7278869063
14715.m01248 | Bm1_24580 | ATP synthase e chain, mitochondrial.-related | 14715 | 8577286557
14715.m01255 | Bm1_24615 | ATP synthase B chain, mitochondrial precursor, putative | 14715 | 140193137886
14735.m00112 | Bm1_25025 | Spc97/Spc98 family protein | 14735 | 1416121918
14740.m00011 | Bm1_25060 | Nematode cuticle collagen N-terminal domain containing protein | 14740 | 31111193
14746.m00118 | Bm1_25120 | Tudor domain containing protein | 14746 | 1361616441
14758.m00155 | Bm1_25285 | Prion-like--related | 14758 | 37795015
14764.m00052 | Bm1_25440 | hypothetical protein | 14764 | 1010882
14770.m00165 | Bm1_25640 | hypothetical protein | 14770 | 1044016450
14773.m00912 | Bm1_25750 | Lipase family protein | 14773 | 8832385780
14773.m00925 | Bm1_25810 | Hepatocellular carcinoma-associated antigen 127, putative | 14773 | 181787182284
14776.m00033 | Bm1_25910 | RIKEN cDNA 2610002M06, putative | 14776 | 92797798
14786.m00011 | Bm1_26170 | hypothetical protein | 14786 | 7732150
14799.m00204 | Bm1_26345 | Hypothetical protein | 14799 | 77614406
14820.m00017 | Bm1_26495 | hypothetical protein | 14820 | 366516
14832.m00025 | Bm1_26605 | UPF0279 protein C14orf129 homolog.-related | 14832 | 48215981
14853.m00060 | Bm1_26745 | hypothetical protein | 14853 | 2857315
14878.m00010 | Bm1_27030 | Talin 1, putative | 14878 | 280701
14900.m00208 | Bm1_27220 | hypothetical protein | 14900 | 3665230195
14902.m00008 | Bm1_27240 | hypothetical protein | 14902 | 182268
14905.m00133 | Bm1_27280 | hypothetical protein | 14905 | 4805753498
14907.m00564 | Bm1_27330 | conserved hypothetical protein | 14907 | 5326752044
14912.m00013 | Bm1_27455 | Lectin C-type domain containing protein | 14912 | 25921279
14916.m00477 | Bm1_27515 | hypothetical protein | 14916 | 4716846186
14917.m00318 | Bm1_27615 | bZIP transcription factor family protein | 14917 | 83346572
14917.m00336 | Bm1_27705 | 7B2-related | 14917 | 140405137941
14921.m00195 | Bm1_28035 | Hypothetical protein | 14921 | 33211512
14921.m00200 | Bm1_28060 | Hypothetical protein | 14921 | 4904648087
14924.m00113 | Bm1_28165 | hypothetical protein | 14924 | 5049651915
14929.m00388 | Bm1_28315 | conserved hypothetical protein | 14929 | 5194642468
14930.m00348 | Bm1_28490 | conserved hypothetical protein | 14930 | 141151144901
14932.m00515 | Bm1_28625 | Mitochondrial glycoprotein | 14932 | 8115479346
14932.m00524 | Bm1_28670 | hypothetical protein | 14932 | 123272124217
14937.m00488 | Bm1_28945 | hypothetical protein | 14937 | 7025673739
14938.m00331 | Bm1_29000 | hypothetical protein | 14938 | 1395012625
14940.m00174 | Bm1_29140 | hypothetical protein | 14940 | 1467715750
14944.m00531 | Bm1_29320 | hypothetical protein | 14944 | 39102900
14944.m00552 | Bm1_29430 | Cytochrome c oxidase polypeptide Vb, mitochondrial precursor.-related | 14944 | 114518115864
14944.m00553 | Bm1_29435 | Hypothetical protein | 14944 | 119653118456
14946.m00545 | Bm1_29610 | hypothetical protein | 14946 | 116312113520
14947.m01145 | Bm1_29715 | Hypothetical protein | 14947 | 231798233231
14950.m01792 | Bm1_29880 | Ubiquitin carboxyl-terminal hydrolase family protein | 14950 | 3074134678
14950.m01808 | Bm1_29960 | von Willebrand factor type A domain containing protein | 14950 | 115733107271
14950.m01833 | Bm1_30085 | Apoptosis regulator proteins, Bcl-2 family protein | 14950 | 226346221668
14950.m01862 | Bm1_30230 | Hypothetical 19.4 kDa protein ZC395.10 in chromosome III.-related | 14950 | 399978401134
14953.m00217 | Bm1_30505 | Neurotransmitter-gated ion-channel transmembrane region family protein | 14953 | 3186228779
14954.m01603 | Bm1_30695 | hypothetical protein | 14954 | 228908230217
14954.m01678 | Bm1_31055 | RNA recognition motif containing protein, putative | 14954 | 677472683353
14954.m01709 | Bm1_31210 | Zinc finger, C2H2 type family protein | 14954 | 873180876894
14956.m00513 | Bm1_31660 | hypothetical protein | 14956 | 239559237010
14958.m00350 | Bm1_31870 | Surfeit locus protein 5 containing protein | 14958 | 99724101317
14961.m04897 | Bm1_32025 | hypothetical protein | 14961 | 3003429813
14961.m04921 | Bm1_32145 | Cuticle collagen dpy-7 precursor, putative | 14961 | 189438188307
14961.m04928 | Bm1_32180 | hypothetical protein | 14961 | 261282259422
14961.m04944 | Bm1_32260 | conserved hypothetical protein | 14961 | 346907342655
14961.m04948 | Bm1_32280 | hypothetical protein | 14961 | 385468387037
14961.m05035 | Bm1_32720 | hypothetical protein | 14961 | 909381906543
14961.m05037 | Bm1_32730 | LBP/BPI/CETP family, C-terminal domain containing protein | 14961 | 948962953135
14961.m05066 | Bm1_32875 | Calponin homolog OV9M.-related | 14961 | 11356701132938
14961.m05089 | Bm1_32990 | Apical junction molecule protein 1, isoform d-related | 14961 | 12859081277047
14961.m05095 | Bm1_33020 | LAMP family protein Imp-1 precursor.-related | 14961 | 13233271324602
14961.m05104 | Bm1_33065 | hypothetical protein | 14961 | 13645711367984
14961.m05112 | Bm1_33105 | hypothetical protein | 14961 | 14416311438388
14961.m05133 | Bm1_33205 | Protein cab-1.-related | 14961 | 15533521550253
14961.m05175 | Bm1_33410 | CG32584-PB-related | 14961 | 17767071777493
14961.m05181 | Bm1_33440 | Innexin family protein | 14961 | 18223561826065
14961.m05194 | Bm1_33500 | hypothetical protein | 14961 | 18927711890512
14961.m05207 | Bm1_33565 | conserved hypothetical protein | 14961 | 19775161974679
14961.m05209 | Bm1_33575 | hypothetical protein | 14961 | 19826851980209
14961.m05223 | Bm1_33635 | hypothetical protein | 14961 | 20782062080034
14961.m05249 | Bm1_33765 | hypothetical protein | 14961 | 22245662221646
14961.m05250 | Bm1_33770 | zgc: 92910-related | 14961 | 22248542226035
14961.m05267 | Bm1_33855 | Ubiquinol-cytochrome C chaperone family protein | 14961 | 23526282355445
14961.m05319 | Bm1_34110 | hypothetical protein | 14961 | 26780532679408
14961.m05325 | Bm1_34145 | RE35789p-related | 14961 | 27315202730227
14961.m05347 | Bm1_34260 | M-phase phosphoprotein, mpp8, putative | 14961 | 29128922916031
14961.m05378 | Bm1_33865 | predicted protein | 14961 |
14962.m00670 | Bm1_34425 | Ctr copper transporter family protein | 14962 | 112000113747
14962.m00674 | Bm1_34445 | zgc: 91831-related | 14962 | 127919126898
14963.m01764 | Bm1_34455 | amine oxidase, flavin-containing-related | 14963 | 46761119
14963.m01784 | Bm1_34560 | CGI-115 protein-related | 14963 | 176010177363
14967.m01533 | Bm1_35045 | hypothetical protein | 14967 | 174476175299
14967.m01536 | Bm1_35060 | Troponin family protein | 14967 | 184432182488
14967.m01540 | Bm1_35075 | Innexin inx-3, putative | 14967 | 199141202169
14967.m01549 | Bm1_35120 | PAN domain containing protein | 14967 | 271650266430
14967.m01570 | Bm1_35215 | chitin synthase 1, chs-1 | 14967 | 413010405294
14968.m01468 | Bm1_35390 | Slbp protein, putative | 14968 | 123987126605
14968.m01469 | Bm1_35395 | Acyltransferase family protein | 14968 | 132176136172
14968.m01473 | Bm1_35415 | 50S ribosomal protein L20.-related | 14968 | 144841145805
14968.m01521 | Bm1_35660 | Succinate dehydrogenase, putative | 14968 | 398078399458
14971.m02855 | Bm1_36075 | D10Ertd718e protein-related | 14971 | 437939437284
14971.m02856 | Bm1_36080 | hypothetical protein | 14971 | 438706444505
14971.m02876 | Bm1_36170 | PAN domain containing protein | 14971 | 515838522920
14971.m02895 | Bm1_36265 | conserved hypothetical protein | 14971 | 634824630514
14971.m02896 | Bm1_36270 | TspO/MBR family protein | 14971 | 637373635976
14972.m06948 | Bm1_36295 | Resistance to inhibitors of cholinesterase protein 3-related | 14972 | 2011821489
14972.m06952 | Bm1_36315 | spliced leader 175 kDa protein, putative | 14972 | 4911953484
14972.m06956 | Bm1_36335 | conserved hypothetical protein | 14972 | 7683377433
14972.m06981 | Bm1_36460 | hypothetical protein | 14972 | 223773222536
14972.m07000 | Bm1_36555 | collagen col-34 - Caenorhabditis elegans, putative | 14972 | 333451335360
14972.m07004 | Bm1_36575 | conserved hypothetical protein | 14972 | 402775401379
14972.m07044 | Bm1_36765 | SD01790p-related | 14972 | 603450612086
14972.m07139 | Bm1_37230 | conserved hypothetical protein | 14972 | 12555541253200
14972.m07141 | Bm1_37240 | ephrin EFN-4, putative | 14972 | 12791911280580
14972.m07143 | Bm1_37250 | RNA recognition motif. | 14972 | 12835171282842
14972.m07157 | Bm1_37315 | hypothetical protein | 14972 | 13393861338202
14972.m07193 | Bm1_37495 | conserved hypothetical protein | 14972 | 15667051571640
14972.m07197 | Bm1_37515 | hypothetical protein | 14972 | 16108961610169
14972.m07200 | Bm1_37530 | conserved hypothetical protein | 14972 | 16275671630330
14972.m07218 | Bm1_37610 | Destabilase family protein | 14972 | 17923361790947
14972.m07236 | Bm1_37705 | NHR1 homology to TAF family protein | 14972 | 19552921958341
14972.m07247 | Bm1_37760 | Zinc finger, C2H2 type family protein | 14972 | 20358642042000
14972.m07257 | Bm1_37810 | GGL domain containing protein | 14972 | 20921752093178
14972.m07267 | Bm1_37860 | NADH-dependent xylose reductase.-related | 14972 | 21679622170201
14972.m07286 | Bm1_37955 | hypothetical protein | 14972 | 23246262321009
14972.m07310 | Bm1_38065 | Clc-like | 14972 | 24938312491585
14972.m07318 | Bm1_38105 | hypothetical protein | 14972 | 25548362551961
14972.m07319 | Bm1_38110 | hypothetical protein | 14972 | 25597072556648
14972.m07321 | Bm1_38120 | hypothetical protein | 14972 | 25687772565415
14972.m07329 | Bm1_38160 | Fatty acid desaturase family protein | 14972 | 26115432607670
14972.m07378 | Bm1_38400 | Conserved hypothetical protein, putative | 14972 | 29629532963607
14972.m07383 | Bm1_38425 | 3-5 exonuclease family protein | 14972 | 30041273000790
14972.m07385 | Bm1_38435 | Conserved hypothetical protein, putative | 14972 | 30249063025501
14972.m07421 | Bm1_38610 | conserved hypothetical protein | 14972 | 33489793342705
14972.m07477 | Bm1_38875 | hypothetical protein | 14972 | 37366743735991
14972.m07478 | Bm1_38880 | Mitochondrial ATP synthase g subunit family protein | 14972 | 37374353738433
14972.m07542 | Bm1_39200 | hypothetical protein | 14972 | 42118044208933
14972.m07555 | Bm1_39265 | GH05862p-related | 14972 | 42731234274728
14972.m07565 | Bm1_39315 | Zinc finger, C2H2 type family protein | 14972 | 43225074321891
14972.m07569 | Bm1_39335 | conserved hypothetical protein | 14972 | 43359874339481
14972.m07582 | Bm1_39400 | BED zinc finger family protein | 14972 | 44242104426962
14972.m07626 | Bm1_39610 | hypothetical protein | 14972 | 47236254716074
14972.m07663 | Bm1_39790 | conserved hypothetical protein | 14972 | 49791464974663
14972.m07776 | Bm1_40345 | conserved hypothetical protein | 14972 | 57290185730606
14972.m07819 | Bm1_40540 | conserved hypothetical protein | 14972 | 59890385992470
14972.m07877 | Bm1_40800 | Helix-loop-helix DNA-binding domain containing protein | 14972 | 63733346371882
14972.m07922 | Bm1_37570 | predicted protein | 14972 | 17285851721938
14972.m07927 | Bm1_38270 | predicted protein | 14972 | 27755722771252
14972.m07928 | Bm1_38370 | predicted protein | 14972 | 29255192926537
14972.m07934 | Bm1_39000 | predicted protein | 14972 | 39374843934948
14973.m02594 | Bm1_40975 | hypothetical protein | 14973 | 105395103306
14973.m02604 | Bm1_41030 | conserved hypothetical protein | 14973 | 156180158084
14973.m02617 | Bm1_41110 | conserved hypothetical protein | 14973 | 233397231475
14973.m02628 | Bm1_41180 | Conserved hypothetical protein, putative | 14973 | 297573296744
14973.m02637 | Bm1_41220 | Helix-loop-helix DNA-binding domain containing protein | 14973 | 347444348724
14973.m02692 | Bm1_41495 | Gex interacting protein protein 4, isoform c-related | 14973 | 724480728710
14973.m02699 | Bm1_41530 | BED zinc finger family protein | 14973 | 785472788815
14973.m02709 | Bm1_41565 | hAT family dimerisation domain containing protein | 14973 | 874608881987
14973.m02715 | Bm1_41590 | Mitochondrial import inner membrane translocase subunit Tim17 family protein | 14973 | 935297933887
14973.m02724 | Bm1_41635 | hypothetical protein | 14973 | 979881974196
14974.m00805 | Bm1_41700 | Zn-finger in Ran binding protein and others containing protein | 14974 | 110944114736
14974.m00820 | Bm1_41775 | UBA/TS-N domain containing protein | 14974 | 208164212843
14974.m00848 | Bm1_41915 | NADH-ubiquinone oxidoreductase 18 kDa subunit, mitochondrial precursor.-related | 14974 | 401842402341
14975.m04329 | Bm1_41995 | Conserved hypothetical protein, putative | 14975 | 8274780128
14975.m04365 | Bm1_42145 | OTU-like cysteine protease family protein | 14975 | 258873256508
14975.m04411 | Bm1_42370 | hypothetical protein | 14975 | 593588585331
14975.m04436 | Bm1_42470 | hypothetical protein | 14975 | 742705740384
14975.m04449 | Bm1_42535 | Tudor domain containing protein | 14975 | 849661846729
14975.m04466 | Bm1_42620 | DNA segment, Chr 7, Wayne State University 180, expressed, putative | 14975 | 904749905826
14975.m04471 | Bm1_42645 | hypothetical protein | 14975 | 914532913197
14975.m04482 | Bm1_42700 | hypothetical protein | 14975 | 956252954973
14975.m04488 | Bm1_42730 | conserved hypothetical protein | 14975 | 983441982086
14975.m04537 | Bm1_42980 | Immunoglobulin I-set domain containing protein | 14975 | 12941641293295
14975.m04550 | Bm1_43045 | Dumpy: shorter than wild-type protein 10, isoform b, putative | 14975 | 13711331370569
14977.m04857 | Bm1_43075 | hypothetical protein | 14977 | 121647775
14977.m04900 | Bm1_43275 | membrane-associated RING-CH protein III, putative | 14977 | 334342331757
14977.m04938 | Bm1_43465 | Temporarily assigned gene name protein 40, putative | 14977 | 552656559748
14977.m04949 | Bm1_43515 | Hypothetical 36.5 kDa protein | 14977 | 618331621448
hypothetical14973233397231475protein14973.m02628Bm1_41180Conserved hypothetical14973297573296744protein, putative14973.m02637Bm1_41220Helix-loop-helix DNA-14973347444348724binding domaincontaining protein14973.m02692Bm1_41495Gex interacting protein14973724480728710protein 4, isoform c-related14973.m02699Bm1_41530BED zinc finger family14973785472788815protein14973.m02709Bm1_41565hAT family dimerisation14973874608881987domain containingprotein14973.m02715Bm1_41590Mitochondrial import14973935297933887inner membranetranslocase subunitTim17 family protein14973.m02724Bm1_41635hypothetical protein1497397988197419614974.m00805Bm1_41700Zn-finger in Ran binding14974110944114736protein and otherscontaining protein14974.m00820Bm1_41775UBA/TS-N domain14974208164212843containing protein14974.m00848Bm1_41915NADH-ubiquinone14974401842402341oxidoreductase 18 kDasubunit, mitochondrialprecursor.-related14975.m04329Bm1_41995Conserved hypothetical149758274780128protein, putative14975.m04365Bm1_42145OTU-like cysteine14975258873256508protease family protein14975.m04411Bm1_42370hypothetical protein1497559358858533114975.m04436Bm1_42470hypothetical protein1497574270574038414975.m04449Bm1_42535Tudor domain containing14975849661846729protein14975.m04466Bm1_42620DNA segment, Chr 7,14975904749905826Wayne State University180, expressed, putative14975.m04471Bm1_42645hypothetical protein1497591453291319714975.m04482Bm1_42700hypothetical protein1497595625295497314975.m04488Bm1_42730conserved hypothetical14975983441982086protein14975.m04537Bm1_42980Immunoglobulin I-set1497512941641293295domain containingprotein14975.m04550Bm1_43045Dumpy: shorter than1497513711331370569wild-type protein 10,isoform b, putative14977.m04857Bm1_43075hypothetical protein1497712164777514977.m04900Bm1_43275membrane-associated14977334342331757RING-CH protein III,putative14977.m04938Bm1_43465Temporarily assigned14977552656559748gene name protein 40,putative14977.m04949Bm1_43515Hypothetical 36.5 kDa14977618331621448protein 
C56G2.3 inchromosome III.-related14977.m04961Bm1_43570hypothetical protein1497771869672603414977.m04992Bm1_43720conserved hypothetical14977934807933638protein14977.m05040Bm1_43955hypothetical protein149771344816135626214977.m05049Bm1_44000NADH-ubiquinone1497714297281431068oxidoreductase ASHIsubunit, mitochondrialprecursor, putative14977.m05056Bm1_44035hypothetical protein149771496735149361914977.m05063Bm1_44070Zinc finger, C2H2 type1497716016741604813family protein14977.m05068Bm1_44095Conserved hypothetical1497716239561625650protein, putative14977.m05093Bm1_44220conserved hypothetical1497718051381807992protein14977.m05109Bm1_44295Helix-loop-helix DNA-1497719381011943053binding domaincontaining protein14979.m04465Bm1_44775hypothetical protein1497950599450906314979.m04486Bm1_44890hypothetical protein1497962330463041214979.m04520Bm1_45055Chitin binding14979810826812386Peritrophin-A domaincontaining protein14979.m04521Bm1_45060hypothetical protein1497981412581935214979.m04536Bm1_45135Conserved hypothetical14979897478895486protein, putative14979.m04544Bm1_45175conserved hypothetical14979985359986554protein14979.m04546Bm1_45185putative RNA binding14979998637996950protein14979.m04553Bm1_45220CG7038-PA-related149791052050105437814979.m04566Bm1_45285CG13018-PA, putative149791155460115466614979.m04593Bm1_45405LD03534p-related149791312336131321714979.m04631Bm1_45560hypothetical protein149791586951158199314979.m04654Bm1_45665Mediator protein 11-1497917318201731031related14979.m04655Bm1_45670WH2 motif family protein149791733467173730514980.m02723Bm1_46015F-box domain containing14980321906320502protein14980.m02730Bm1_46050Zinc finger, C2H2 type14980365468368689family protein14980.m02739Bm1_46095hypothetical protein1498041614741570814980.m02747Bm1_46130hypothetical protein1498044062844117214980.m02753Bm1_46160Conserved hypothetical14980467374468316protein, putative14980.m02754Bm1_46165Chain A, Structure Of A14980469157468525Brca2-Dss1 
Complex.,putative14980.m02796Bm1_46355hypothetical protein1498073689274402614980.m02801Bm1_46380Probable dolichol-14980771892772179phosphatemannosyltransferasesubunit 3, putative14980.m02805Bm1_46400Nematode cuticle14980796837794518collagen N-terminaldomain containingprotein14980.m02824Bm1_46495Zinc finger, C2H2 type14980956839950859family protein14980.m02854Bm1_46520predicted protein1498097788697560514981.m02374Bm1_46675membrane-associated149813989837539RING-CH protein III,putative14981.m02394Bm1_46775hypothetical protein1498118181318063414981.m02398Bm1_46795protein T01B7.5,14981197058199888putative14981.m02425Bm1_46940hypothetical protein1498141428841239914981.m02431Bm1_46970hypothetical protein1498143308143643014981.m02468Bm1_47145Cuticle collagen dpy-214981618741621413precursor, putative14982.m02229Bm1_47280WD-repeat protein14982145884148988WDC146.-related14982.m02233Bm1_47300conserved hypothetical14982176677178003protein14982.m02242Bm1_473453 exoribonuclease14982224027223358family, domain 2containing protein14988.m00035Bm1_47525LD18634p-related14988316151314990.m07639Bm1_47600Suppressor of lurcher14990133342131804protein 1 precursor,putative14990.m07735Bm1_48075M-phase phosphoprotein149907915287897196.-related14990.m07789Bm1_48330Dihydrofolate1499011222081122372reductase.-related14990.m07802Bm1_48395hypothetical protein149901223422123084414990.m07875Bm1_48760putative transcription1499016658551666819factor14990.m07934Bm1_49050hypothetical protein149901994394199284814990.m07962Bm1_49180NADH-ubiquinone1499021541572154862oxidoreductase 15 kDasubunit.-related14990.m07974Bm1_49240hypothetical protein149902235255223445414990.m07993Bm1_49335RUN domain containing1499023273532321510protein14990.m08056Bm1_49645Microcephalin., putative149902694625269965214990.m08061Bm1_49670zgc: 101594-related149902722366272361914990.m08076Bm1_49750ORF; putative149902844297284574614990.m08080Bm1_49770Mitochondrial ribosomal1499028591832860321protein L51/S25/CI-B8 domain 
containingprotein14990.m08109Bm1_49915conserved hypothetical1499030313693028987protein14990.m08113Bm1_49935Zn-finger in Ran binding1499030632583068759protein and otherscontaining protein14990.m08128Bm1_50000Conserved hypothetical1499031507353149495protein, putative14990.m08152Bm1_50115Hypothetical UPF01721499033679253366463protein CG3501.-related14992.m10844Bm1_50365conserved hypothetical14992261445264437protein14992.m10852Bm1_50395hypothetical protein1499229768630154314992.m10900Bm1_50630hypothetical protein1499256504956902014992.m10916Bm1_50710hypothetical protein1499263375563559014992.m10933Bm1_50790hypothetical protein1499275041575664314992.m10940Bm1_50825hypothetical protein1499281390181557614992.m10976Bm1_51010hypothetical protein149921084073108100514992.m10983Bm1_51095hypothetical protein149921120507112150614992.m10992Bm1_51095Innexin inx-10, putative149921168627117369814992.m11085Bm1_51520hypothetical protein149921810026181152314992.m11124Bm1_51715hypothetical protein149922031741202461614992.m11181Bm1_51995LBP/BPI/CETP family,1499224198762416185C-terminal domaincontaining protein14992.m11195Bm1_52065hypothetical protein149922529293253295414992.m11236Bm1_52255TB2/DP1, HVA22 family1499228213382824401protein14992.m11262Bm1_52385hypothetical protein149923033019302718914992.m11279Bm1_52475hypothetical protein149923152405314792014992.m11295Bm1_52560hypothetical protein149923250873325014414992.m11302Bm1_52595Ground-like domain1499232842343282575containing protein14992.m11304Bm1_52605hypothetical protein149923291373329420714992.m11309Bm1_52630Lipase family protein149923332305333470214992.m11311Bm1_52640hypothetical protein149923341057334188814992.m11327Bm1_52720Conserved hypothetical1499234368673435529protein, putative14992.m11343Bm1_52795hypothetical protein149923541900354318514992.m11347Bm1_52815conserved hypothetical1499235603163556758protein15009.m00162Bm1_53230hypothetical protein15009228112439915009.m00163Bm1_53210predicted 
protein1500915036.m00014Bm1_53390Ubiquinone biosynthesis1503625122359protein COQ4 homolog,putative15076.m00116Bm1_53630hypothetical protein150769415868215081.m00161Bm1_53755NADH-ubiquinone150813211931463oxidoreductase subunitB14.5b.-related15131.m00092Bm1_54105hypothetical protein1513114381912815131.m00094Bm1_54115RH01479p-related15131225982174915133.m00185Bm1_54140GH14561p-related151334090439615135.m00169Bm1_54230zgc: 101038 protein-151352443923471related15161.m00145Bm1_54490hypothetical protein15161440434659615182.m00047Bm1_54705Nematode cuticle1518256794458collagen N-terminaldomain containingprotein15201.m00015Bm1_54895hypothetical protein15201333137815215.m00045Bm1_55030hypothetical protein152156517801015256.m00006Bm1_55375carbamoyl-phosphate1525613991563synthase, large subunit,putative15258.m00019Bm1_55395Conserved hypothetical1525832524478protein, putative15285.m00017Bm1_55635hypothetical protein1528589411319215295.m00028Bm1_55690NADH-ubiquinone1529587479561oxidoreductase B12subunit.-related15304.m00109Bm1_55745sulfakinin receptor1530457428999protein, putative15304.m00111Bm1_55755major sperm protein,153041490115446putative15304.m00118Bm1_55790hypothetical protein15304391323658715304.m00121Bm1_55805hypothetical protein15304504775342215304.m00123Bm1_55755major sperm protein,153041490115446putative15330.m00019Bm1_56035hypothetical protein153305121168515344.m00010Bm1_56145conserved hypothetical153442134892protein15360.m00009Bm1_56230GH25683p, putative15360323160215384.m00013Bm1_56405hypothetical protein153842008319515404.m00031Bm1_56515Nematode cuticle1540431341277collagen N-terminaldomain containingprotein15406.m00015Bm1_5653028S ribosomal protein154061542845S30, mitochondrial,putative15418.m00009Bm1_56600cystatin-type cysteine154187802220proteinase inhibitor CPI-2, putative15425.m00019Bm1_56645Hypothetical 19.4 kDa1542558504756protein T09A5.5 inchromosome III,putative15443.m00042Bm1_5680550S ribosomal 
protein1544376388916L10.-related15452.m00015Bm1_56880hypothetical protein154521875202515458.m00030Bm1_56915conserved hypothetical15458124568protein15527.m00010Bm1_57420Zinc finger, C2H2 type1552722973106family protein15530.m00010Bm1_57435ELM2 domain containing155301682354protein15559.m00008Bm1_57645conserved hypothetical155592901201protein12479.m00026Bm1_00645Transmembrane cell12479110169951adhesion receptor mua-3precursor, putative12584.m00064Bm1_01560conserved hypothetical1258482706360protein12621.m00166Bm1_01990chitin synthase 2 (chs-2)126211851308fragment12631.m00043Bm1_02065Abnormal cell migration1263149781152protein 10, putative12673.m00009Bm1_02325Immunoglobulin I-set1267337871197domain containingprotein12849.m00036Kunitz/Bovine1284939444416pancreatictrypsininhibitordomaincontainingprotein13204.m00045Bm1_05890B20-1 protein, putative13204323165613207.m00045Bm1_05925EGF-like domain1320755692854containing protein13207.m00046Bm1_05930Muscle positioning13207123738852protein 4, putative13210.m00167Patched132105352452734related familyprotein 12,isoform b,putative13236.m00035Bm1_06130Nuclear anchorage132361391711658protein 1-related13271.m00048Bm1_06810Cuticle collagen dpy-7132711272210083precursor, putative13456.m00013Bm1_09195Thyroglobulin type-11345641466135repeat family protein13480.m00175Bm1_09565Transmembrane amino134804482450233acid transporter protein13488.m00014Bm1_09610Troponin T, putative13488235417713617.m00050Bm1_10505cuticle collagen 34,136171798620127putative13761.m00027Bm1_11715Phospholipase c like137611332309protein 1, isoform b,putative13832.m00022Bm1_12300Fibronectin type III138326492354domain containingprotein13879.m00034Bm1_12525RNA dependent RNA13879952410985polymerase familyprotein13894.m00035Bm1_12585small heat shock protein138942042254212.6, putative13939.m00062Bm1_13015Nematode cuticle139391422612675collagen N-terminaldomain containingprotein14134.m00015Bm1_15075PDZ domain containing141344403980protein14280.m00063Bm1_17130Kunitz/Bovine 
pancreatic142801116910679trypsin inhibitor domaincontaining protein14288.m00017Bm1_17360conserved hypothetical14288283211protein14289.m00013Bm1_17365RNA-directed RNA142892521302polymerase 1-related14332.m00026Bm1_18000Zinc finger, C2H2 type1433223915302family protein14356.m00524Bm1_18340LIN-7, putative14356722587159414383.m00056Bm1_18740Transmembrane cell1438368198155adhesion receptor mua-3precursor, putative14456.m00009Bm1_19740Hypothetical protein14456862106814479.m00133Bm1_19990cuticle collagen 2144791624419999precursor, putative14568.m00086Bm1_21210Innexin inx-14.-related145684954618414590.m00344Bm1_21610PDZ domain containing145903886640361protein14629.m00079Bm1_22470conserved hypothetical1462984294911protein14770.m00166Bm1_25645hypothetical protein14770164791694714789.m00059Bm1_26235hypothetical protein1478989131086814804.m00022Bm1_26400Fibronectin type III14804405820domain containingprotein14847.m00069Bm1_26685F25C8.3 protein-related14847116821257014899.m00015Bm1_27205hypothetical protein14899270122514907.m00570Bm1_27360hypothetical protein14907670586950514907.m00591Bm1_27330conserved hypothetical149075326752044protein14916.m00488Bm1_27570ARID/BRIGHT DNA14916120637119811binding domaincontaining protein14926.m00034Bm1_28210hypothetical protein14926213961932314956.m00537Bm1_31780Clc-4 protein., putative1495636658636437914961.m04954Bm1_32310Innexin family protein1496141890742211114961.m05117Bm1_33125conserved hypothetical1496114605751457288protein14961.m05334Bm1_34195Filamin/ABP280 repeat1496128265972815179family protein14961.m05355Bm1_34300Nematode cuticle1496129647252966387collagen N-terminaldomain containingprotein14965.m00415Bm1_34840collagen col-34-14965128392130818Caenorhabditis elegans,putative14968.m01485Bm1_35480hypothetical protein1496818955919175914972.m07061Bm1_36850hypothetical protein1497269032768698714972.m07168Bm1_37370conserved hypothetical1497213876881395188protein14972.m07534Bm1_39165cAMP-dependent protein1497241675514167742kinase regulatory 
chain,putative14972.m07898Bm1_40905Putative glutamate1497265092796512457synthase, putative14972.m07904Bm1_37335ankyrin-related unc-44-1497213701641371628related14973.m02693Bm1_41500conserved hypothetical14973730367737643protein14975.m04416Bm1_42395Nematode cuticle14975626425628055collagen N-terminaldomain containingprotein14975.m04428Patched14975708159704998family protein14975.m04547Bm1_43030protein C18H9.7,1497513561101354082putative14977.m04877Bm1_43170Transmembrane amino14977190435186932acid transporter protein14977.m04993Bm1_43725Mov34/MPN/PAD-114977936281935192family protein14977.m04996Bm1_43740conserved hypothetical14977960724956647protein14980.m02767Bm1_46225hypothetical protein1498053146853299214981.m02440hypothetical14981468578467689protein14990.m07640Bm1_47605CUB domain containing14990136663135121protein14990.m07659Bm1_47700Mitochondrial import14990232655234681inner membranetranslocase subunitTim17 family protein14990.m07706Bm1_47930VAB-10A protein-related1499057163657040414990.m07759Bm1_48195Innexin family protein1499093128293476914992.m10954Bm1_50900Conserved hypothetical14992926641931225protein, putative14992.m11025Bm1_51260Innexin family protein149921399989140658414992.m11129Bm1_51735Troponin I, putative149922080336208327714992.m11174Bm1_51960conserved hypothetical1499223737722376231protein14992.m11212Bm1_52140P40-related149922662635265946915002.m00035Bm1_53115hypothetical protein15002193329115009.m00158Bm1_53205hypothetical protein15009272632815059.m00090Bm1_53505myotactin form B,150598632410putative15059.m00092Bm1_53515conserved hypothetical150593245329060protein15081.m00162Bm1_53760cytochrome-c oxidase,150813293132512putative15121.m00020Bm1_54070F25C8.3 protein-related15121238541115165.m00050Bm1_54575Membrane calcium151652909031398atpase protein 3,isoform a, putative15213.m00007Bm1_55000Troponin T, putative152132590103115253.m00027Bm1_55355Protein FAM34A.-related1525384781022315297.m00023Bm1_55705Conserved hypothetical152972081397protein, 
putative15322.m00022Bm1_55970hypothetical protein153226778478315331.m00008Bm1_56045myosin-like protein,15331213111putative15370.m00010Bm1_56290ARID/BRIGHT DNA15370304765binding domaincontaining protein15398.m00011Bm1_56480RhoGAP domain1539835879833containing protein15487.m00008Bm1_57145Fibronectin type III15487402883domain containingprotein15560.m00010Bm1_57650conserved hypothetical15560228385protein

Claims
  • 1. A method for selecting drug targets, comprising: (a) identifying one or more essential or functionally important sequences from a model organism using pre-existing genomic and phenotypic data; (b) comparing sequences from (a) with a DNA or peptide sequence from a pathogen to store homologous sequences; (c) comparing sequences from the pathogen with the DNA or peptide sequence from a host organism to store those sequences absent in the host organism; and (d) comparing sequences from (b) and (c) to identify shared sequences, the shared sequences being a drug target.
  • 2. A method according to claim 1, wherein two or more of steps (a)-(d) are performed sequentially.
  • 3. A method according to claim 1, wherein two or more of steps (a)-(d) are performed in parallel.
  • 4. A drug target, comprising a DNA sequence or protein selected from Table 1.
  • 5. A computer-based system for identifying drug targets comprising: (a) a memory for storing a plurality of databases, wherein the plurality of databases comprises a plurality of sequence databases; (b) a processor in communication with the memory; and (c) an output device in communication with the processor, wherein the processor is configured to: (i) group sequences belonging to a model organism, a pathogen and a host from one or more of the plurality of sequence databases; (ii) identify essential or functionally important sequences from the model organism by matching phenotypic data with sequences of the model organism grouped in (i); (iii) compare sequences of the pathogen with sequences identified in (ii); (iv) compare sequences of the pathogen with the host sequences and select pathogen sequences that do not share sequence similarity with the host sequences; (v) compare sequences from (iii) and (iv) to identify sequences corresponding to sequences encoding drug targets; and (vi) cause at least some of the sequences identified as drug targets in (v) to be displayed on the output device.
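The selection logic recited in steps (a)-(d) of claim 1, and in steps (ii)-(v) of claim 5, can be read as a pair of filters followed by a set intersection. The following is a minimal illustrative sketch only, not the applicants' implementation. It assumes the sequence comparisons have already been run by some external similarity search and reduced to lookup tables. The function names, the phenotype labels in `ESSENTIAL_PHENOTYPES`, and the identifiers in the usage note are all hypothetical.

```python
from typing import Dict, Iterable, Set

# Hypothetical phenotype labels treated as "essential or functionally
# important"; the actual criteria are not fixed by the claims.
ESSENTIAL_PHENOTYPES = {"lethal", "sterile", "growth defect"}


def essential_model_genes(phenotypes: Dict[str, Set[str]]) -> Set[str]:
    """Step (a): model-organism genes with at least one qualifying phenotype."""
    return {gene for gene, seen in phenotypes.items() if seen & ESSENTIAL_PHENOTYPES}


def select_targets(
    phenotypes: Dict[str, Set[str]],
    model_hits: Dict[str, Set[str]],
    host_hits: Dict[str, Set[str]],
    pathogen_ids: Iterable[str],
) -> Set[str]:
    """Return pathogen sequence IDs that pass steps (a)-(d).

    model_hits maps a pathogen ID to the model-organism genes it is
    homologous to; host_hits maps a pathogen ID to its host matches.
    Both tables are assumed to be pre-computed.
    """
    pathogen = set(pathogen_ids)
    essential = essential_model_genes(phenotypes)

    # Step (b): pathogen sequences homologous to an essential model gene.
    with_essential_homolog = {
        p for p in pathogen if model_hits.get(p, set()) & essential
    }

    # Step (c): pathogen sequences with no match in the host.
    absent_in_host = {p for p in pathogen if not host_hits.get(p)}

    # Step (d): sequences shared by both sets are the candidate targets.
    return with_essential_homolog & absent_in_host
```

With toy inputs in which `p1` matches a lethal-phenotype model gene and has no host match, `p2` matches only a non-essential model gene, `p3` matches an essential gene but also a host sequence, and `p4` matches nothing, the function returns only `p1`. Because steps (b) and (c) are independent of each other, they could be computed sequentially or in parallel, as claims 2 and 3 contemplate.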
CROSS REFERENCE

This application claims priority from U.S. Provisional Application Ser. No. 60/751,396 filed Dec. 16, 2005.

Provisional Applications (1)
Number Date Country
60751396 Dec 2005 US