Angiogenesis associated proteins, and nucleic acids encoding the same

Information

  • Patent Application
  • 20060074023
  • Publication Number
    20060074023
  • Date Filed
    November 22, 2005
    19 years ago
  • Date Published
    April 06, 2006
    18 years ago
Abstract
An isolated polypeptide having at least 80% sequence identity to the sequence SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 or 16, and polynucleotides encoding the same, are useful for modulating angiogenesis.
Description
BACKGROUND

Cities have roads and alleys, plants have xylem and phloem, and people have arteries, veins and lymphatics. Without these byways, the vertebrate animal cells would starve or drown in their metabolic refuse. Not only do blood vessels deliver food and oxygen and carry away metabolic wastes, but they also transport signaling substances that apprise cells of situations remote to them but to which they need to respond. Hormonal messages are a common signal.


All blood vessels are ensheathed by a basal lamina and a delicate monolayer of remarkably plastic endothelial cells lining the luminal walls. Depending on location and function, smooth muscle and connective tissue may also be present.


Not only do healthy cells depend on the blood resources transported by the circulatory system, but so, too, unwanted cells: tumorigenic and malignant cells. These cells colonize and proliferate if they are able to divert blood resources to themselves. Angiogenesis, the type of blood vessel formation where new vessels emerge from the proliferation of preexisting vessels (Risau, 1995; Risau and Flamme, 1995), is exploited not only by usual processes, such as in wound healing or myocardial infarction repair, but also by tumors themselves and in cancers, diabetic retinopathy, macular degeneration, psoriasis, and rheumatoid arthritis. Regardless of the process, whether pathological or usual physiological, endothelial cells mediate angiogenesis in a multi-step fashion: (1) endothelia receive an extracellular cue, (2) the signaled cells breach the basal lamina sheath, abetted by proteases they secrete, (3) the cells then migrate to the signal and proliferate, and finally, (4) the cells form a tube, a morphogenic event (Alberts et al., 1994). The complexity of this process indicates complex changes in cellular physiology and morphology, gene expression, and signaling. Angiogenic accomplices that are cues include basic fibroblast growth factors (bFGF), angiopoietins (such as ANG1) and various forms of vascular endothelial growth factor (VEGF).


The molecular events and the order in which they occur and the pathways that are required for this process are of fundamental importance to understand angiogenesis. In vitro models are useful for identifying alterations in gene expression that occur during angiogenesis. A particularly fruitful model systems involves the supspension in a three-dimensional type I collagen gel and various stimuli, such as phorbol myristate acetate (PMA), basic fibroblast growth factor (bFGF), and VEGF. The combination of the stimuli and the collagen gel results in the formation of a three-dimensional tubular network of endothelial cells with interconnecting lumenal structures. In this model, endothelial differentiation into tubelike structures is completely blocked by inhibitors of new mRNA or protein synthesis. Furthermore, the cells progress through differentiation in a coordinated and synchronized manner, thus optimizing the profile of gene expression (Kahn et al., 2000; Yang et al., 1999).


Tumor cells exploit angiogenesis to facilitate tumor growth. Controlling angiogenesis, by controlling the activity or expression of genes and proteins associated with angiogenesis, provides a way to prevent tumor growth, or even destoy tumors.


SUMMARY

The invention is based in part upon the discovery of novel nucleic acid sequences encoding novel polypeptides. Nucleic acids encoding the polypeptides disclosed in the invention, and derivatives and fragments thereof, will hereinafter be collectively designated as “AAP” nucleic acid or polypeptide sequences. AAP, or angiogenesis associated polypeptides (AAP) comprises kelch-like polypeptide (KLP), human ortholog of mouse BAZF (hBAZF), hmt-elongation factor G (hEF-G), human ortholog of rat TRG (hTRG), human myosin X (hMX1) and its splice variant (hMX2), nuclear hormone receptor (NHR), and human mitochondrial protein (hMP).


The invention is based in part upon the discovery of novel nucleic acid sequences encoding novel polypeptides. Nucleic acids encoding the polypeptides disclosed in the invention, and derivatives and fragments thereof, will hereinafter be collectively designated as “AAP” nucleic acid or polypeptide sequences.”


In a first aspect, the present invention is an isolated polypeptide having at least 80% sequence identity to the sequence SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 or 16, polynucleotides encoding the same, and antibodies that specifically bind the same.


In a second aspect, the present invention is an isolated polynucleotide having at least 80% sequence identity to the sequence SEQ ID NOS:1, 3, 5, 7, 9, 11, 13 or 15, or a complement thereof.


In a third aspect, the present invention is a transgenic non-human animal, having a disrupted AAP gene or a transgenic non-human animal expressing an exogenous polynucleotide having at least 80% sequence identity to the sequence SEQ ID NOS:1, 3, 5, 7, 9, 11, 13 or 15, or a complement of said polynucleotide.


In a fourth aspect, the present invention is a method of screening a sample for a mutation in an AAP gene.


In a fifth aspect, the present invention is a method of modulating angiogenesis comprising modulating the activity of at least one AAP polypeptide.


In a sixth aspect, the present invention is a method of increasing, as well as decreasing angiogenesis, comprising modulating the activity of at least one AAP polypeptide. Activity modulation of AAP polypeptides may be over-expressing or eliminating expression of the gene, or impairing a AAP polypeptide's function by contact with specific antagonists or agonists, such as antibodies or aptamers.


In a seventh aspect, the present invention is a method of treating various pathologies, including tumors, cancers, myocardial infarctions and the like.


In an eighth aspect, the present invention is a method of measuring a AAP transcriptional and translational up-regulation or down-regulation activity of a compound.


In a ninth aspect, the invention is a method of screening a tissue sample for tumorigenic potential.


In a tenth aspect, the invention is a method of determining the clinical stage of tumor which compares the expression of at least one AAP in a sample with expression of said at least one gene in control samples.


Although methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, suitable methods and materials are described below. All publications, patent applications, patents, and other references mentioned herein are incorporated by reference in their entirety. In the case of conflict, the present specification, including definitions, will control. In addition, the materials, methods, and examples are illustrative only and not intended to be limiting.







DETAILED DESCRIPTION

A model of angiogenesis—the suspension of endothelial cells in type I collagen gels with various stimuli—was used to identify a molecular fingerprint or transcriptional profile of endothelial differentiation into tubelike structures, using amplification and an imaging approach called GeneCalling (Shimkets et al., 1999). This method was previously shown to provide a comprehensive sampling of cDNA populations in conjunction with the sensitive detection of quantitative differences in mRNA abundance for both known and novel genes. Many differentially expressed cDNA fragments were found. The identification and differential expression of these gens was confirmed by a second independent method employing real-time quantitative polymerase chain reaction (PCR). Although some of the identified cDNA fragments were genes known to play some role in angiogenesis, many other differentially expressed genes were unexpected. The inventors have identified the unexpected genes and polypeptides that are expressed in response to this model of angiogenesis, collectively refered to as angiogenesis associated polypeptides (AAP). AAP are kelch-like polypeptide (KLP), human ortholog of mouse BAZF (hBAZF), hmt-elongation factor G (hEF-G), human ortholog of rat TRG (hTRG), human myosin X (hMX1) and its splice variant (hMX2), nuclear hormone receptor (NHR), and human mitochondrial protein (hMP).


DEFINITIONS

Unless defined otherwise, all technical and scientific terms have the same meaning as is commonly understood by one of skill in the art to which this invention belongs. The definitions below are presented for clarity. All patents and publications referred to herein are, unless noted otherwise, incorporated by reference in their entirety.


The recommendations of (Demerec et al., 1966) where these are relevant to genetics are adapted herein. To distinguish between genes (and related nucleic acids) and the proteins that they encode, the abbreviations for genes are indicated by italicized (or underlined) text while abbreviations for the proteins start with a capital letter and are not italicized. Thus, AAP or AAP refers to the nucleotide sequence that encodes AAP.


“Isolated,” when referred to a molecule, refers to a molecule that has been identified and separated and/or recovered from a component of its natural environment. Contaminant components of its natural environment are materials that interfere with diagnostic or therapeutic use.


“Container” is used broadly to mean any receptacle for holding material or reagent. Containers may be fabricated of glass, plastic, ceramic, metal, or any other material that can hold reagents. Acceptable materials will not react adversely with the contents.


1. Nucleic Acid-related Definitions


(a) Control Sequences


Control sequences are DNA sequences that enable the expression of an operably-linked coding sequence in a particular host organism. Prokaryotic control sequences include promoters, operator sequences, and ribosome binding sites. Eukaryotic cells utilize promoters, polyadenylation signals, and enhancers.


(b) Operably-linked


Nucleic acid is operably-linked when it is placed into a functional relationship with another nucleic acid sequence. For example, a promoter or enhancer is operably-linked to a coding sequence if it affects the transcription of the sequence, or a ribosome-binding site is operably-linked to a coding sequence if positioned to facilitate translation. Generally, “operably-linked” means that the DNA sequences being linked are contiguous, and, in the case of a secretory leader, contiguous and in reading phase. However, enhancers do not have to be contiguous. Linking is accomplished by conventional recombinant DNA methods.


(c) Isolated Nucleic Acids


An isolated nucleic acid molecule is purified from the setting in which it is found in nature and is separated from at least one contaminant nucleic acid molecule. Isolated AAP molecules are distinguished from the specific AAP molecule, as it exists in cells. However, an isolated AAP molecule includes AAP molecules contained in cells that ordinarily express an AAP where, for example, the nucleic acid molecule is in a chromosomal location different from that of natural cells.


2. Protein-related Definitions


(a) Purified Polypeptide


When the molecule is a purified polypeptide, the polypeptide will be purified (1) to obtain at least 15 residues of N-terminal or internal amino acid sequence using a sequenator, or (2) to homogeneity by SDS-PAGE under non-reducing or reducing conditions using Coomassie blue or silver stain. Isolated polypeptides include those expressed heterologously in genetically-engineered cells or expressed in vitro, since at least one component of an AAP natural environment will not be present. Ordinarily, isolated polypeptides are prepared by at least one purification step.


(b) Active Polypeptide


An active AAP or AAP fragment retains a biological and/or an immunological activity of the native or naturally-occurring AAP. Immunological activity refers to the ability to induce the production of an antibody against an antigenic epitope possessed by a native AAP; biological activity refers to a function, either inhibitory or stimulatory, caused by a native AAP that excludes immunological activity. A biological activity of AAP includes, for example, modulating angiogenesis.


(c) Abs


Antibody may be single anti-AAP monoclonal Abs (including agonist, antagonist, and neutralizing Abs), anti-AAP antibody compositions with polyepitopic specificity, single chain anti-AAP Abs, and fragments of anti-AAP Abs. A “monoclonal antibody” refers to an antibody obtained from a population of substantially homogeneous Abs, i.e., the individual Abs comprising the population are identical except for naturally-occurring mutations that may be present in minor amounts


(d) Epitope Tags


An epitope tagged polypeptide refers to a chimeric polypeptide fused to a “tag polypeptide”. Such tags provide epitopes against which Abs can be made or are available, but do not interfere with polypeptide activity. To reduce anti-tag antibody reactivity with endogenous epitopes, the tag polypeptide is preferably unique. Suitable tag polypeptides generally have at least six amino acid residues and usually between about 8 and 50 amino acid residues, preferably between 8 and 20 amino acid residues). Examples of epitope tag sequences include HA from Influenza A virus and FLAG.


The novel AAP of the invention include the nucleic acids whose sequences are provided in Tables 1, 3, 5, 7, 9, 11, 13 and 15, or a fragment thereof. The invention also includes a mutant or variant AAP, any of whose bases may be changed from the corresponding base shown in Tables 1, 3, 5, 7, 9, 11, 13 and 15 while still encoding a protein that maintains the activities and physiological functions of the AAP fragment, or a fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences are complementary to those just described, including complementary nucleic acid fragments. The invention additionally includes nucleic acids or nucleic acid fragments, or complements thereto, whose structures include chemical modifications. Such modifications include, by way of nonlimiting example, modified bases, and nucleic acids whose sugar phosphate backbones are modified or derivatized. These modifications are carried out at least in part to enhance the chemical stability of the modified nucleic acid, such that they may be used, for example, as anti-sense binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic acids, and their complements, up to 20% or more of the bases may be so changed.


The novel AAP of the invention include the protein fragments whose sequences are provided in Tables 2, 4, 6, 8, 10, 12, 14 and 16. The invention also includes an AAP mutant or variant protein, any of whose residues may be changed from the corresponding residue shown in Tables 2, 4, 6, 8, 10, 12, 14 and 16 while still encoding a protein that maintains its native activities and physiological functions, or a functional fragment thereof. In the mutant or variant AAP, up to 20% or more of the residues may be so changed. The invention further encompasses Abs and antibody fragments, such as Fab or (Fab)′2, that bind immunospecifically to any of the AAP of the invention.


The AAP nucleic acids are shown in Tables 1, 3, 5, 7, 9, 11, 13 and 15, and the corresponding polypeptides are shown in Tables 2, 4, 6, 8, 10, 12, 14 and 16, respectivly. Start and stop codons in the polynucleotide sequences are indicated in boldface and with underlining. SEQ ID NO:3 lacks a stop codon. The sequences of hMX1 and hMX2 do not have start codons (see Table 17); consequently, hMX1 and hMX2 polypeptides do not start with a Met. For any lacking polynucleotide sequence, one of skill in the art may retrieve the full length sequence by, for example, probing cDNA or genomic libraries with probes designed according to the sequences of the instant invention.

TABLE 1KLP nucleotide sequence (SEQ ID NO:1)ctggcctaga tactacaact gaactttttt tctttttagt tactccacag gatccgctga60acataggatg ttgccacaaa atctacctcg tgtatttttc tctttcactc atgagctgca120caattgcaga tttgagcaca atgtctgcag actgtgttga aaaactctga agaacctaat180taacacagga tgacctagga gtgattctaa gtctgtgtaa caagatatta ctcattagtg240aatgtgtcag tcttggtact gaatgctgca gataacagca agtaggttct cctttatttc300tgaagtattc acttgacctt ccatcagtaa gacggacttt tctaatctgt tcctggagat360attaatggaa tacagtcatg tccactcaag acgagaggca gatcaatact gaatatgctg420tgtcattgtt ggaacagttg aaactgtttt atgaacagca gttgtttact gacatagtgt480taattgttga gggcactgaa ttcccttgtc ataagatggt tcttgcaaca tgtagctctt540atttcagggc catgtttatg agtggactaa gtgaaagcaa acaaacccat gtacacctga600ggaatgtcga tgctgccacc ttacagataa taataactta tgcatacacg ggtaacttgg660caatgaatga cagcactgta gaacagcttt atgaaacagc ttgcttccta caggtagaag720atgtgttaca acgttgtcgg gaatatttaa ttaaaaaaat aaatgcagag aattgtgtac780gattgttgag ttttgctgat ctcttcagtt gtgaggaatt aaaacagagt gctaaaagaa840tggtggagca caagttcact gctgtgtatc atcaggacgc gttcatgcag ctgtcacatg900acctactgat agatattctc agtagtgaca atttaaatgt agaaaaggaa gaaaccgttc960gagaagctgc tatgctgtgg ctagagtata acacagaatc acgatcccag tatttgtctt1020ctgttcttag ccaaatcaga attgatgcac tttcagaagt aacacagaga gcttggtttc1080aaggtctgcc acccaatgat aagtcagtgg tggttcaagg tctgtataag tccatgccca1140agtttttcaa accaagactt gggatgacta aagaggaaat gatgattttc attgaagcat1200cttcagaaaa tccttgtagt ctttactctt ctgtctgtta cagcccccaa gcagaaaaag1260tttacaagtt atgtagccca ccagctgatt tgcataaggt tgggaccgtt gtaactcctg1320ataatgatat ctacatagca gggggtcaag ttcctctgaa aaacacaaaa acaaatcaca1380gtaaaacaag caaacttcag actgccttca gaactgtgaa ttgcttttat tggtttgatg1440cacagcaaaa tacctggttt ccaaagaccc caatgctttt tgtccgcata aagccatctt1500tggtttgctg tgaaggctat atctatgcaa ttggaggaga tagcgtaggt ggagaactta1560atcggaggac cgtagaaaga tacgacactg agaaagatga gtggacgatg gtaagccctt1620taccttgtgc ttggcaatgg agtgcagcag ttgtggttca tgactgcatt tatgtgatga1680cactgaacct catgtactgt tattttccaa ggtctgactc atgggtagaa atggccatga1740gacagactag taggtccttt gcttcagctg cagcttttgg tgataaaatt ttctatattg1800gagggttgca tattgctacc aattccggca taagactccc ctctggcact gtagatgggt1860cttcagtaac tgtggaaatt tatgatgtga ataaaaatga gtggaaaatg gcagccaaca1920tccctgctaa gaggtactct gacccctgtg ttagagctgt tgtgatctca aattctctat1980gtgtgtttat gcgagaaacc cacttaaatg agcgagctaa atacgtcacc taccaatatg2040acctggaact tgaccggtgg tctctgcggc agcatatatc tgaacgtgta ctgtgggact2100tggggagaga ttttcgatgc actgtgggga aactctatcc atcctgcctt gaagagtctc2160catggaaacc accaacttat cttttttcaa cggatgggac agaagagttt gaactggatg2220gagaaatggt tgcactacca cctgtatagt ggggaagttc agggagtgca cgcctgagtt2280atgtgctttg tcattttctt tgctaaacaa aagaggctat gaaagaacta aatatgagta2340cataaaattc tatctttgat aaattttatt tttatgccct acttaatatt tgcatcagta2400taatatatat cagtgagtct tacagaaaga tatgcttcca taatatgaaa tagattattc2460aataattgag aaactttatg tgtaatcatg agagtataag aatctggatt atctaacagt2520gttagccctg tgtatgtaca gttcaaaaag ttcatttata aaagtagttt cctgttc2577









TABLE 2








KLP polypeptide sequence (SEQ ID NO:2)
















Met Ser Thr Gln Asp Gln Arg Gln Ile Asn Thr Gln Tyr Ala Val Ser



1               5                   10                  15





Len Leu Glu Gln Leu Lys Len Phe Tyr Gln Gln Gln Leu Phe Thr Asp


            20                  25                  30





Ile Val Leu Ile Val Gln Gly Thr Gln Phe Pro Cys His Lys Met Val


        35                  40                  45





Leu Ala Thr Cys Ser Ser Tyr Phe Arg Ala Met Phe Met Ser Gly Leu


    50                  55                  60





Ser Glu Ser Lys Glu Thr His Val His Leu Arg Asn Val Asp Ala Ala


65                  70                  75                  80





Thr Leu Gln Ile Ile Ile Thr Tyr Ala Tyr Thr Gly Asn Leu Ala Met


                85                  90                  95





Asn Asp Ser Thr Val Gln Gln Leu Tyr Gln Thr Ala Cys Phe Leu Gln


            100                 105                 110





Val Gln Asp Val Leu Gln Arg Cys Arg Gln Tyr Leu Ile Lys Lys Ile


        115                 120                 125





Asn Ala Gln Asn Cys Val Arg Leu Leu Ser Phe Ala Asp Leu Phe Ser


    130                 135                 140





Cys Gln Gln Leu Lys Gln Ser Ala Lys Arg Met Val Gln His Lys Phe


145                 150                 155                 160





Thr Ala Val Tyr His Gln Asp Ala Phe Met Gln Leu Ser His Asp Len


                165                 170                 175





Leu Ile Asp Ile Leu Ser Ser Asp Asn Len Asn Val Gln Lys Gln Glu


            180                 185                 190





Thr Val Arg Gln Ala Ala Met Leu Trp Len Glu Tyr Asn Thr Glu Ser


        195                 200                 205





Arg Ser Gln Tyr Leu Ser Ser Val Leu Ser Gln Ile Arg Ile Asp Ala


    210                 215                 220





Leu Ser Gln Val Thr Gln Arg Ala Trp Phe Gln Gly Len Pro Pro Asn


225                 230                 235                 240





Asp Lys Ser Val Val Val Gln Gly Leu Tyr Lys Ser Met Pro Lys Phe


                245                 250                 255





Phe Lys Pro Arg Leu Gly Met Thr Lys Gln Glu Met Met Ile Phe Ile


            260                 265                 270





Glu Ala Ser Ser Glu Asn Pro Cys Ser Leu Tyr Ser Ser Val Cys Tyr


        275                 280                 285





Ser Pro Gln Ala Glu Lys Val Tyr Lys Leu Cys Ser Pro Pro Ala Asp


    290                 295                 300





Leu His Lys Val Gly Thr Val Val Thr Pro Asp Asn Asp Ile Tyr Ile


305                 310                 315                 320





Ala Gly Gly Gln Val Pro Leu Lys Asn Thr Lys Thr Asn His Ser Lys


                325                 330                 335





Thr Ser Lys Leu Gln Thr Ala Phe Arg Thr Val Asn Cys Phe Tyr Trp


            340                 345                 350





Phe Asp Ala Gln Gln Asn Thr Trp Phe Pro Lys Thr Pro Met Leu Phe


        355                 360                 365





Val Arg Ile Lys Pro Ser Leu Val Cys Cys Glu Gly Tyr Ile Tyr Ala


    370                 375                 380





Ile Gly Gly Asp Ser Val Gly Gly Glu Leu Asn Arg Arg Thr Val Glu


385                 390                 395                 400





Arg Tyr Asp Thr Glu Lys Asp Glu Trp Thr Met Val Ser Pro Leu Pro


                405                 410                 415





Cys Ala Trp Gln Trp Ser Ala Ala Val Val Val His Asp Cys Ile Tyr


            420                 425                 430





Val Met Thr Leu Asn Leu Met Tyr Cys Tyr Phe Pro Arg Ser Asp Ser


        435                 440                 445





Trp Val Glu Met Ala Met Arg Gln Thr Ser Arg Ser Phe Ala Ser Ala


    450                 455                 460





Ala Ala Phe Gly Asp Lys Ile Phe Tyr Ile Gly Gly Leu His Ile Ala


465                 470                 475                 480





Thr Asn Ser Gly Ile Arg Leu Pro Ser Gly Thr Val Asp Gly Ser Ser


                485                 490                 495





Val Thr Val Glu Ile Tyr Asp Val Asn Lys Asn Glu Trp Lys Met Ala


            500                 505                 510





Ala Asn Ile Pro Ala Lys Arg Tyr Ser Asp Pro Cys Val Arg Ala Val


        515                 520                 525





Val Ile Ser Asn Ser Leu Cys Val Phe Met Arg Glu Thr His Leu Asn


    530                 535                 540





Glu Arg Ala Lys Tyr Val Thr Tyr Gln Tyr Asp Leu Glu Leu Asp Arg


545                 550                 555                 560





Trp Ser Leu Arg Gln His Ile Ser Glu Arg Val Leu Trp Asp Leu Gly


                565                 570                 575





Arg Asp Phe Arg Cys Thr Val Gly Lys Leu Tyr Pro Ser Cys Leu Glu


            580                 585                 590





Glu Ser Pro Trp Lys Pro Pro Thr Tyr Leu Phe Ser Thr Asp Gly Thr


        595                 600                 605





Glu Glu Phe Glu Leu Asp Gly Glu Met Val Ala Leu Pro Pro Val


    610                 615                 620
















TABLE 3








hBAZF nucleotide sequence (SEQ ID NO:3)

















caagggagcg agggtgtcgt agagggcaga atgaacaaga agaattagga gggaggctgc
60






gtgtgccggg gctaggggct ggaagtcctg gctctagttg cacctcggaa ggaaaaggca
120





aacagaggag ggaaggcgtc ttaggactgc ctggatccag agcactttcc tcggcctcta
180





caggcctgtg tcgctatggg ttcccccgcc gccccggagg gagcgctggg ctacgtccgc
240





gagttcactc gccactcctc cgacgtgctg ggcaacctca acgagctgcg cctgcgcggg
300





atcctcactg acgtcacgct gctggttggc gggcaacccc tcagagcaca caaggcagtt
360





ctcatcgcct gcagtggctt cttctattca attttccggg gccgtgcggg agtcggggtg
420





gacgtgctct ctctgcccgg gggtcccgaa gcgagaggct tcgcccctct attggacttc
480





atgtacactt cgcgcctgcg cctctctcca gccactgcac cagcagtcct agcggccgcc
540





acctatttgc agatggagca cgtggtccag gcatgccacc gcttcatcca ggccagctat
600





gaacctctgg gcatctccct gcgccccctg gaagcagaac ccccaacacc cccaacggcc
660





cctccaccag gtagtcccag gcgctccgaa ggacacccag acccacctac tgaatctcga
720





agctgcagtc aaggcccccc cagtccagcc agccctgacc ccaaggcctg caactggaaa
780





aagtacaagt acatcgtgct aaactctcag gcctcccaag cagggagcct ggtcggggag
840





agaagttctg gtcaaccttg cccccaagcc aggctcccca gtggagacga ggcctccagc
900





agcagcagca gcagcagcag cagcagcagt gaagaaggac ccattcctgg tccccagagc
960





aggctctctc caactgctgc cactgtgcag ttcaaatgtg gggctccagc cagtaccccc
1020





tacctcctca catcccaggc tcaagacacc tctggatcac cctctgaacg ggctcgtcca
1080





ctacccggga gtgaattttt cagctgccag aactgtgagg ctgtggcagg gtgctcatcg
1140





gggctggact ccttggttcc tggggacgaa gacaaaccct ataagtgtca gctgtgccgg
1200





tcttcgttcc gctacaaggg caaccttgcc agtcaccgta cagtgcacac aggggaaaag
1260





ccttaccact gctcaatctg cggagcccgt tttaaccggc cagcaaacct gaaaacgcac
1320





agccgcatcc attcgggaga gaagccgtat aagtgtgaga cgtgcggctc gcgctttgta
1380





caggtacgga gccagcctcc aagtggcttc caaggcaaac ctgcaagagg tggggtgggc
1440





caaaagggag ggttctgttc ctcccagagg caggacttga agtctcctcc ctcccaggtg
1500





gcacatctgc gggcgcacgt gctgatccac accggggaga agccctaccc ttgccctacc
1560





tgcggaaccc gcttccgcca cctgcagacc ctcaagagcc acgttcgcat ccacaccgga
1620





gagaagcctt accactgcga cccctgtggc ctgcatttcc ggcacaagag tcaactgcgg
1680





ctgcatctgc gccagaaaca cggagctgct accaacacca aagtgcacta ccacattctc
1740





ggggggccc
1749
















TABLE 4








hBAZF polypeptide sequence (SEQ ID NO:4)
















Met Gly Ser Pro Ala Ala Pro Glu Gly Ala Leu Gly Tyr Val Arg Glu



1               5                   10                  15





Phe Thr Arg His Ser Ser Asp Val Leu Gly Asn Leu Asn Glu Leu Arg


            20                  25                  30





Leu Arg Gly Ile Leu Thr Asp Val Thr Leu Leu Val Gly Gly Gln Pro


        35                  40                  45





Leu Arg Ala His Lys Ala Val Leu Ile Ala Cys Ser Gly Phe Phe Tyr


    50                  55                  60





Ser Ile Phe Arg Gly Arg Ala Gly Val Gly Val Asp Val Leu Ser Leu


65                  70                  75                  80





Pro Gly Gly Pro Glu Ala Arg Gly Phe Ala Pro Leu Leu Asp Phe Met


                85                  90                  95





Tyr Thr Ser Arg Leu Arg Leu Ser Pro Ala Thr Ala Pro Ala Val Leu


            100                 105                 110





Ala Ala Ala Thr Tyr Leu Gln Met Glu His Val Val Gln Ala Cys His


        115                 120                 125





Arg Phe Ile Gln Ala Ser Tyr Glu Pro Leu Gly Ile Ser Leu Arg Pro


    130                 135                 140





Leu Glu Ala Glu Pro Pro Thr Pro Pro Thr Ala Pro Pro Pro Gly Ser


145                 150                 155                 160





Pro Arg Arg Ser Glu Gly His Pro Asp Pro Pro Thr Glu Ser Arg Ser


                165                 170                 175





Cys Ser Gln Gly Pro Pro Ser Pro Ala Ser Pro Asp Pro Lys Ala Cys


            180                 185                 190





Asn Trp Lys Lys Tyr Lys Tyr Ile Val Leu Asn Ser Gln Ala Ser Gln


        195                 200                 205





Ala Gly Ser Leu Val Gly Glu Arg Ser Ser Gly Gln Pro Cys Pro Gln


    210                 215                 220





Ala Arg Leu Pro Ser Gly Asp Glu Ala Ser Ser Ser Ser Ser Ser Ser


225                 230                 235                 240





Ser Ser Ser Ser Ser Glu Glu Gly Pro Ile Pro Gly Pro Gln Ser Arg


                245                 250                 255





Leu Ser Pro Thr Ala Ala Thr Val Gln Phe Lys Cys Gly Ala Pro Ala


            260                 265                 270





Ser Thr Pro Tyr Leu Leu Thr Ser Gln Ala Gln Asp Thr Ser Gly Ser


        275                 280                 285





Pro Ser Glu Arg Ala Arg Pro Leu Pro Gly Ser Glu Phe Phe Ser Cys


    290                 295                 300





Gln Asn Cys Glu Ala Val Ala Gly Cys Ser Ser Gly Leu Asp Ser Leu


305                 310                 315                 320





Val Pro Gly Asp Glu Asp Lys Pro Tyr Lys Cys Gln Leu Cys Arg Ser


                325                 330                 335





Ser Phe Arg Tyr Lys Gly Asn Leu Ala Ser His Arg Thr Val His Thr


            340                 345                 350





Gly Glu Lys Pro Tyr His Cys Ser Ile Cys Gly Ala Arg Phe Asn Arg


        355                 360                 365





Pro Ala Asn Leu Lys Thr His Ser Arg Ile His Ser Gly Glu Lys Pro


    370                 375                 380





Tyr Lys Cys Glu Thr Cys Gly Ser Arg Phe Val Gln Val Arg Ser Gln


385                 390                 395                 400





Pro Pro Ser Gly Phe Gln Gly Lys Pro Ala Arg Gly Gly Val Gly Gln


                405                 410                 415





Lys Gly Gly Phe Cys Ser Ser Gln Arg Gln Asp Leu Lys Ser Pro Pro


            420                 425                 430





Ser Gln Val Ala His Leu Arg Ala His Val Leu Ile His Thr Gly Glu


        435                 440                 445





Lys Pro Tyr Pro Cys Pro Thr Cys Gly Thr Arg Phe Arg His Leu Gln


    450                 455                 460





Thr Leu Lys Ser His Val Arg Ile His Thr Gly Glu Lys Pro Tyr His


465                 470                 475                 480





Cys Asp Pro Cys Gly Leu His Phe Arg His Lys Ser Gln Leu Arg Leu


                485                 490                 495





His Leu Arg Gln Lys His Gly Ala Ala Thr Asn Thr Lys Val His Tyr


            500                 505                 510





His Ile Leu Gly Gly Pro


        515
















TABLE 5








hEF-G nucleotide sequence (SEQ ID NO:5)

















tctttttcct cgcgtccttt gccccggaag tgctcttaca acattggctg ccggcgtgac
60






tttgaccgct tcccggtgcg ttaccggcag ctgaacccac ccggcgccac gggactttga
120





cgcgtgctct gcgcttgcca tgagactcct gggagctgca gccgtcgcgg ctctggggcg
180





cggaagggcc cccgcctccc taggctggca gaggaagcag gttaattgga aggcctgccg
240





atggtcttca tcaggggtga ttcctaatga aaaaatacga aatattggaa tctcagctca
300





cattgattct gggaaaacta cattaacaga acgagtcctt tactacactg gcagaattgc
360





aaagatgcat gaggtgaaag gtaaagatgg agttggtgct gtcatggatt ccatggaact
420





agagagacaa agaggaatca ctattcagtc agcagccact ttcaccatgt ggaaagatgt
480





caatattaac attatagata ctcctgggca tgtggacttc acaatagaag tggaaagggc
540





cctgagagtg ttggatggtg cagtccttgt tctctgtgct gttggagggg tacagtgcca
600





gaccatgact gtcaatcgtc agatgaagcg ctacaacgtt ccgtttctaa cttttattaa
660





caaattggac cgaatgggct ccaacccagc cagggccctg cagcaaatga ggtctaaact
720





aaatcataat acagcgttta tgcagatacc catgggtttg gagggtaatt ttaaaggtat
780





tgtagatctt attgaggaac gagccatcta ttttgatgga gactttagtc agattgttcg
840





atatggtgag attccagctg aattaagggc ggcggccact gaccaccggc aggagctaat
900





tgaatgtgtt gccaattcag atgaacagct tggtgagatg tttctggaag aaaaaatccc
960





ctcgatttct gatttaaagc tagcaattcg aagagctact ctgaaaagat catttactcc
1020





tgtatttttg ggaagcgcct tgaagaacaa aggagttcag cctcttttag atgctgtttt
1080





agaatacctc ccaaatccat ctgaagtcca gaactatgct attctcaata aaaaggatga
1140





ctcaaaagag aaaaccaaaa tcctaatgaa ctccagtaga cacaattccc acccatttgt
1200





aggcctggct tttcccctgg aggtaggtcg atttggacaa ttaacttatg ttcgcagtta
1260





tcagggagag ctaaagaagg gtgacaccat ctataacaca aggacaagaa agaaagtacg
1320





gttgcaacgg ctggctcgca tgcatgccga catgatggag gcaagtacag aggaagtata
1380





tgccggagac atctgtgcat tgtttggcat tgactgtgct agtggagaca cattcacaga
1440





caaagccaac agcggccttt ctatggagtc aattcatgtt cctgatcctg tcatttcaat
1500





agcaatgaag ccttctaaca agaacgatct ggaaaaattt tcaaaaggta ttggcaggtt
1560





tacaagagaa gatcccacat ttaaagtata ctttgacact gagaacaaag agacagttat
1620





atctggaatg ggagaattac acctggaaat ctatgctcag aggctggaaa gagagtatgg
1680





ctgtccttgt atcacaggaa agccaaaagt tgcctttcga gagaccatta ctgcccctgt
1740





cccgtttgac tttacacata aaaaacaatc aggtggtgca ggccagtatg gaaaagtaat
1800





aggtgtcctg gagcctctgg acccagagga ctacactaaa ttggaatttt cagatgaaac
1860





attcggatca aatattccaa agcagtttgt gcctgctgta gaaaaggggt ttttagatgc
1920





ctgcgagaag ggccctcttt ctggtcacaa gctctctggg ctccggtttg tcctgcaaga
1980





tggagcacac cacatggttg attctaatga aatctctttc atccgagcag gagaaggtgc
2040





tcttaaacaa gccttggcaa atgcaacatt atgtattctt gaacctatta tggctgtgga
2100





agttgtagct ccaaatgaat ttcagggaca agtaattgca ggaattaacc gacgccatgg
2160





ggtaatcact gggcaagatg gagttgagga ctattttaca ctgtatgcag atgtccctct
2220





aaatgatatg tttggttatt ccactgaact taggtcatgc acagagggaa agggagaata
2280





cacaatggag tatagcaggt atcagccatg tttaccatcc acacaagaag acgtcattaa
2340





taagtatttg gaagctacag gtcaacttcc tgttaaaaaa ggaaaagcca agaactaact
2400





ttgcttactg tgagttgact gactctaatt gaatctgcgt ggttttgata ctttgatgga
2460





ttccagtgga ataaattcag gctgctgaaa caagaaattc tgagcccagg aagcgggctc
2520





ttctttcttc aaaagaagcc cttcttgttc atattcagga gcttctgtta tattcaaagg
2580





taattctatg tctatctcaa ctctattgat tggttttata gttcattgaa aatcctcaaa
2640





taaaatataa ttattactga aatatgttta atatttaagg ggaaaagaga ctaatttcag
2700





ttatactttt aagcttagaa tgtatgttca tttccaaatt ttgtatcata agagttttca
2760





acatagagaa aagctgaaaa aatgcaaaga ataaccacat actttccatc taccttcctt
2820





tgttaacggg ttgtttatca tataataatt tgttttgtca tatttgcttt cactgtctat
2880





tatctgttta agtctcataa ctctattttt agtttgctga agacttgaaa gtgaatcgca
2940





tatatcatga cacttcttgg agtgtcatta atgggcaggc ttttctgttg aagagtggat
3000





tccgtatgtt cttcatagag agtgtttttc agattcttca ttgggatatt aaaatattag
3060





ccaaatttcn ctctgtttta tatatgncag tttatttcag tttgtggttt ctgcaaattt
3120





gtaactgcct ctgttttagg agtataagta ttacttcctt gtggtctatt gtgaagtaaa
3180





aagtagaccc ttgcatatac tattcttgtt tgtgttcatc ttaatgtttt tgtacagcta
3240





aatcaaatgt aatttataga gttagtttca tcaacctaat gaatgctagt taaatttgaa
3300





ttccttggaa tttatcgtat attgtattca ctgagattat gaagggacaa atgttaatct
3360





tttgtttcca gaaaaagttg ggctttccca agcagttcta ttacccggtt cagaattgct
3420





tcatccaaaa atcatctgat ggtatagatg gatcctagtc cttttcatta cctgatggta
3480





gaaataaaat aattgatttt a
3501
















TABLE 6








hEF-G polypeptide (SEQ ID NO:6)
















Met Arg Leu Leu Gly Ala Ala Ala Val Ala Ala Leu Gly Arg Gly Arg



1               5                   10                  15





Ala Pro Ala Ser Leu Gly Trp Gln Arg Lys Gln Val Asn Trp Lys Ala


            20                  25                  30





Cys Arg Trp Ser Ser Ser Gly Val Ile Pro Asn Glu Lys Ile Arg Asn


        35                  40                  45





Ile Gly Ile Ser Ala His Ile Asp Ser Gly Lys Thr Thr Leu Thr Glu


    50                  55                  60





Arg Val Leu Tyr Tyr Thr Gly Arg Ile Ala Lys Met His Glu Val Lys


65                  70                  75                  80





Gly Lys Asp Gly Val Gly Ala Val Met Asp Ser Met Glu Leu Glu Arg


                85                  90                  95





Gln Arg Gly Ile Thr Ile Gln Ser Ala Ala Thr Phe Thr Met Trp Lys


            100                 105                 110





Asp Val Asn Ile Asn Ile Ile Asp Thr Pro Gly His Val Asp Phe Thr


        115                 120                 125





Ile Glu Val Glu Arg Ala Leu Arg Val Leu Asp Gly Ala Val Leu Val


    130                 135                 140





Leu Cys Ala Val Gly Gly Val Gln Cys Gln Thr Met Thr Val Asn Arg


145                 150                 155                 160





Gln Met Lys Arg Tyr Asn Val Pro Phe Leu Thr Phe Ile Asn Lys Leu


                165                 170                 175





Asp Arg Met Gly Ser Asn Pro Ala Arg Ala Leu Gln Gln Met Arg Ser


            180                 185                 190





Lys Leu Asn His Asn Thr Ala Phe Met Gln Ile Pro Met Gly Leu Glu


        195                 200                 205





Gly Asn Phe Lys Gly Ile Val Asp Leu Ile Glu Glu Arg Ala Ile Tyr


    210                 215                 220





Phe Asp Gly Asp Phe Ser Gln Ile Val Arg Tyr Gly Glu Ile Pro Ala


225                 230                 235                 240





Glu Leu Arg Ala Ala Ala Thr Asp His Arg Gln Glu Leu Ile Glu Cys


                245                 250                 255





Val Ala Asn Ser Asp Glu Gln Leu Gly Glu Met Phe Leu Glu Glu Lys


            260                 265                 270





Ile Pro Ser Ile Ser Asp Leu Lys Leu Ala Ile Arg Arg Ala Thr Leu


        275                 280                 285





Lys Arg Ser Phe Thr Pro Val Phe Leu Gly Ser Ala Leu Lys Asn Lys


    290                 295                 300





Gly Val Gln Pro Leu Leu Asp Ala Val Leu Glu Tyr Leu Pro Asn Pro


305                 310                 315                 320





Ser Glu Val Gln Asn Tyr Ala Ile Leu Asn Lys Lys Asp Asp Ser Lys


                325                 330                 335





Glu Lys Thr Lys Ile Leu Met Asn Ser Ser Arg His Asn Ser His Pro


            340                 345                 350





Phe Val Gly Leu Ala Phe Pro Leu Glu Val Gly Arg Phe Gly Gln Leu


        355                 360                 365





Thr Tyr Val Arg Ser Tyr Gln Gly Glu Leu Lys Lys Gly Asp Thr Ile


    370                 375                 380





Tyr Asn Thr Arg Thr Arg Lys Lys Val Arg Leu Gln Arg Leu Ala Arg


385                 390                 395                 400





Met His Ala Asp Met Met Glu Ala Ser Thr Glu Glu Val Tyr Ala Gly


                405                 410                 415





Asp Ile Cys Ala Leu Phe Gly Ile Asp Cys Ala Ser Gly Asp Thr Phe


            420                 425                 430





Thr Asp Lys Ala Asn Ser Gly Leu Ser Met Glu Ser Ile His Val Pro


        435                 440                 445





Asp Pro Val Ile Ser Ile Ala Met Lys Pro Ser Asn Lys Asn Asp Leu


    450                 455                 460





Glu Lys Phe Ser Lys Gly Ile Gly Arg Phe Thr Arg Glu Asp Pro Thr


465                 470                 475                 480





Phe Lys Val Tyr Phe Asp Thr Glu Asn Lys Glu Thr Val Ile Ser Gly


                485                 490                 495





Met Gly Glu Leu His Leu Glu Ile Tyr Ala Gln Arg Leu Glu Arg Glu


            500                 505                 510





Tyr Gly Cys Pro Cys Ile Thr Gly Lys Pro Lys Val Ala Phe Arg Glu


        515                 520                 525





Thr Ile Thr Ala Pro Val Pro Phe Asp Phe Thr His Lys Lys Gln Ser


    530                 535                 540





Gly Gly Ala Gly Gln Tyr Gly Lys Val Ile Gly Val Leu Glu Pro Leu


545                 550                 555                 560





Asp Pro Glu Asp Tyr Thr Lys Leu Glu Phe Ser Asp Glu Thr Phe Gly


                565                 570                 575





Ser Asn Ile Pro Lys Gln Phe Val Pro Ala Val Glu Lys Gly Phe Leu


            580                 585                 590





Asp Ala Cys Glu Lys Gly Pro Leu Ser Gly His Lys Leu Ser Gly Leu


        595                 600                 605





Arg Phe Val Leu Gln Asp Gly Ala His His Met Val Asp Ser Asn Glu


    610                 615                 620





Ile Ser Phe Ile Arg Ala Gly Glu Gly Ala Leu Lys Gln Ala Leu Ala


625                 630                 635                 640





Asn Ala Thr Leu Cys Ile Leu Glu Pro Ile Met Ala Val Glu Val Val


                645                 650                 655





Ala Pro Asn Glu Phe Gln Gly Gln Val Ile Ala Gly Ile Asn Arg Arg


            660                 665                 670





His Gly Val Ile Thr Gly Gln Asp Gly Val Glu Asp Tyr Phe Thr Leu


        675                 680                 685





Tyr Ala Asp Val Pro Leu Asn Asp Met Phe Gly Tyr Ser Thr Glu Leu


    690                 695                 700





Arg Ser Cys Thr Glu Gly Lys Gly Glu Tyr Thr Met Glu Tyr Ser Arg


705                 710                 715                 720





Tyr Gln Pro Cys Leu Pro Ser Thr Gln Glu Asp Val Ile Asn Lys Tyr


                725                 730                 735





Leu Glu Ala Thr Gly Gln Leu Pro Val Lys Lys Gly Lys Ala Lys Asn


            740                 745                 750
















TABLE 7








hTRG nucleotide sequence (SEQ ID NO:7)

















gccgcgggag caggcggagg cggaggcggc gggggcagga ggatgtcgca gccgccgctg
60






ctccccgcct cggcggagac tcggaagttc acccgggcgc tgagtaagcc gggcacggcg
120





gccgagctgc ggcagagcgt gtctgaggtg gtgcgcggct ccgtgctcct ggcaaagcca
180





aagctaattg agccactcga ctatgaaaat gtcatcgtcc agaagaagac tcagatcctg
240





aacgactgtt tacgggagat gctgctcttc ccttacgatg actttcagac ggccatcctg
300





agacgacagg gtcgatacat atgctcaaca gtgcctgcga aggcggaaga ggaagcacag
360





agcttgtttg ttacagagtg catcaaaacc tataactctg actggcatct tgtgaactat
420





aaatatgaag attactcagg agagtttcga cagcttccga acaaagtggt caagttggat
480





aaacttccag ttcatgtcta tgaagttgac gaggaggtcg acaaagatga ggatgctgcc
540





tcccttggtt cccagaaagg tgggatcacc aagcatggct ggctgtacaa aggcaacatg
600





aacagtgcca tcagcgtgac catgaggtca tttaagagac gatttttcca cctgattcaa
660





cttggcgatg gatcctataa atttgaattt ttaaaagatc tccaaaagga accaaaagga
720





tcaatatttc tgggattcct gtatggggtg tcgttcagga acaacaaagt caggcgtttt
780





gcttttgagc tcaagatgca ggacaaaagt agttatctct tggcagcaga cagtgaagtg
840





gaaatggaag aatggatcac aattctaaat aagatcctcc agctcaactt tgaagctgca
900





atgcaagaaa agcgaaatgg cgactctcac gaagatgatg aacaaagcaa attggaaggt
960





tctggttccg gtttagatag ctacctgccg gaacttgcca agagtgcaag agaagcagaa
1020





atcaaactga aaagtgaaag cagagtcaaa cttttttatt tggacccaga tgcccagaag
1080





cttgacttct catcagctga gccagaagtg aagtcatttg aagagaagtt tggaaaaagg
1140





atccttgtca agtgcaatga tttatctttc aatttgcaat gctgtgttgc cgaaaatgaa
1200





gaaggaccca ctacaaatgt tgaacctttc tttgttactc tatccctgtt tgacataaaa
1260





tacaaccgga agatttctgc cgatttccac gtagacctga accatttctc agtgaggcaa
1320





atgatcgcca ccacgtcccc ggcgctgatg aatggcagtg ggccggaaac ccaatctgcc
1380





ctcaggggca tccttcatga agccgccatg cagtatccga agcagggaat attttcagtc
1440





acttgtcctc atccagatat atttcttgtg gccagaattg aaaaagtcct tcaggggagc
1500





atcacacatt gcgctgagcc atatatgaaa agttcagact cttctaaggt ggcccagaag
1560





gtgctgaaga atgccaagca ggcatgccaa agactaggac agtatagaat gccatttgct
1620





tgggcagcaa ggacattgtt taaggatgca tctggaaatc ttgacaaaaa tgccagattt
1680





tctgccatct acaggcaaga cagcaataag ctatccaatg atgacatgct caagttactt
1740





gcagactttc ggaaacctga gaagatggct aagctcccag tgattttagg caatctagac
1800





attacaattg ataatgtttc ctcagacttc cctaattatg ttaattcatc atacattccc
1860





acaaaacaat ttgaaacctg cagtaaaact cccatcacgt ttgaagtgga ggaatttgtg
1920





ccctgcatac caaaacacac tcagccttac accatctaca ccaatcacct ttacgtttat
1980





cctaagtact tgaaatacga cagtcagaag tcttttgcca aggctagaaa tattgcgatt
2040





tgcattgaat tcaaagattc agatgaggaa gactctcagc cccttaagtg catttatggc
2100





agacctggtg ggccagtttt cacaagaagc gcctttgctg cagttttaca ccatcaccaa
2160





aacccagaat tttatgatga gattaaaata gagttgccca ctcagctgca tgaaaagcac
2220





cacctgttgc tcacattctt ccatgtcagc tgtgacaact caagtaaagg aagcacgaag
2280





aagagggatg tcgttgaaac ccaagttggc tactcctggc ttcccctcct gaaagacgga
2340





agggtggtga caagcgagca gcacatcccg gtctcggcga accttccttc gggctatctt
2400





ggctaccagg agcttgggat gggcaggcat tatggtccgg aaattaaatg ggtagatgga
2460





ggcaagccac tgctgaaaat ttccactcat ctggtttcta cagtgtatac tcaggatcag
2520





catttacata attttttcca gtactgtcag aaaaccgaat ctggagccca agccttagga
2580





aacgaacttg taaagtacct taagagtctg catgcgatgg aaggccacgt gatgatcgcc
2640





ttcttgccca ctatcctaaa ccagctgttc cgagtcctca ccagagccac acaggaagaa
2700





gtcgcggtta acgtgactcg ggtcattatt catgtggttg cccagtgcca tgaggaagga
2760





ttggagagcc acttgaggtc atatgttaag tacgcgtata aggctgagcc atatgttgcc
2820





tctgaataca agacagtgca tgaagaactg accaaatcca tgaccacgat tctcaagcct
2880





tctgccgatt tcctcaccag caacaaacta ctgaagtact catggttttt ctttgatgta
2940





ctgatcaaat ctatggctca gcatttgata gagaactcca aagttaagtt gctgcgaaac
3000





cagagatttc ctgcatccta tcatcatgca gtggaaaccg ttgtaaatat gctgatgcca
3060





cacatcactc agaagtttcg agataatcca gaggcatcta agaacgcgaa tcatagcctt
3120





gctgtcttca tcaagagatg tttcaccttc atggacaggg gctttgtctt caagcagatc
3180





aacaactaca ttagctgttt tgctcctgga gacccaaaga ccctctttga atacaagttt
3240





gaatttctcc gtgtagtgtg caaccatgaa cattatattc cgttgaactt accaatgcca
3300





tttggaaaag gcaggattca aagataccaa gacctccagc ttgactactc attaacagat
3360





gagttctgca gaaaccactt cttggtggga ctgttactga gggaggtggg gacagccctc
3420





caggagttcc gggaggtccg tctgatcgcc atcagtgtgc tcaagaacct gctgataaag
3480





cattcttttg atgacagata tgcttcaagg agccatcagg caaggatagc caccctctac
3540





ctgcctctgt ttggtctgct gattgaaaac gtccagcgga tcaatgtgag ggatgtgtca
3600





cccttccctg tgaacgcggg catgactgtg aaggatgaat ccctggctct accagctgtg
3660





aatccgctgg tgacgccgca gaagggaagc accctggaca acagcctgca caaggacctg
3720





ctgggcgcca tctccggcat tgcttctcca tatacaacct caactccaaa catcaacagt
3780





gtgagaaatg ctgattcgag aggatctctc ataagcacag attcgggtaa cagccttcca
3840





gaaaggaata gtgagaagag caattccctg gataagcacc aacaaagtag cacattggga
3900





aattccgtgg ttcgctgtga taaacttgac cagtctgaga ttaagagcct actgatgtgt
3960





ttcctctaca tcttaaagag catgtctgat gatgctttgt ttacatattg gaacaaggct
4020





tcaacatctg aacttatgga tttttttaca atatctgaag tctgcctgca ccagttccag
4080





tacatgggga agcgatacat agccagaaca ggaatgatgc atgccagatt gcagcagctg
4140





ggcagcctgg ataactctct cacttttaac cacagctatg gccactcgga cgcagatgtt
4200





ctgcaccagt cattacttga agccaacatt gctactgagg tttgcctgac agctctggac
4260





acgctttctc tatttacatt ggcgtttaag aaccagctcc tggccgacca tggacataat
4320





cctctcatga aaaaagtttt tgatgtctac ctgtgttttc ttcaaaaaca tcagtctgaa
4380





acggctttaa aaaatgtctt cactgcctta aggtccttaa tttataagtt tccctcaaca
4440





ttctatgaag ggagagcgga catgtgtgcg gctctgtgtt acgagattct caagtgctgt
4500





aactccaagc tgagctccat caggacggag gcctcccagc tgctctactt cctgatgagg
4560





aacaactttg attacactgg aaagaagtcc tttgtccgga cacatttgca agtcatcata
4620





tctgtcagcc agctgatagc agacgttgtt ggcattgggg gaaccagatt ccagcagtcc
4680





ctgtccatca tcaacaactg tgccaacagt gaccggctta ttaagcacac cagcttctcc
4740





tctgatgtga aggacttaac caaaaggata cgcacggtgc taatggccac cgcccagatg
4800





aaggagcatg agaacgaccc agagatgctg gtggacctcc agtacagcct ggccaaatcc
4860





tatgccagca cgcccgagct caggaagacg tggctcgada gcatggccag gatccatgtc
4920





aaaaatggcg atctctcaga ggcagcaatg tgctatgtcc acgtaacagc cctagtggca
4980





gaatatctca cacggaaaga agcagtccag tgggagccgc cccttctccc ccacagccat
5040





agcgcctgcc tgaggaggag ccggggaggc gtgtttagac aaggatgcac cgccttcagg
5100





gtcattaccc caaacatcga cgaggaggcc tccatgatgg aagacgtggg gatgcaggat
5160





gtccatttca acgaggatgt gctgatggag ctccttgagc agtgcgcaga tggactctgg
5220





aaagccgagc gctacgagct cattgccgac atctacaaac ttatcatccc catttatgag
5280





aagcggaggg attttgagag gctggcccat ctgtatgaca cgctgcaccg ggcctacagc
5340





aaagtgaccg aggtcatgca ctcgggccgc aggcttctgg ggacctactt ccgggtagcc
5400





ttcttcgggc aggcagcgca ataccagttt acagacagtg aaacagatgt ggagggattc
5460





tttgaagatg aagatggaaa ggagtatatt tacaaggaac ccaaactcac accgctgtcg
5520





gaaatttctc agagactcct taaactgtac tcggataaat ttggttctga aaatgtcaaa
5580





atgatacagg attctggcaa ggtcaaccct aaggatctgg attctaagta tgcctacatc
5640





caggtgactc acgtcatccc cttctttgac gaaaaagagt tgcaagaaag gaaaacagag
5700





tttgagagat cccacaacat ccgccgcttc atgtttgaga tgccatttac gcagaccggg
5760





aagaggcagg gcggggtgga agagcagtgc aaacggcgca ccatcctgac agccatacac
5820





tgcttccctt atgtgaagaa gcgcatccct gtcatgtacc agcaccacac tgacctgaac
5880





cccatcgagg tggccattga cgagatgagt aagaaggtgg cggagctccg gcagctgtgc
5940





tcctcggccg aggtggacat gatcaaactg cagctcaaac tccagggcag cgtgagtgtt
6000





caggtcaatg ctggcccact agcatatgcg cgagctttct tagatgatac aaacacaaag
6060





cgatatcctg acaataaagt gaagctgctt aaggaagttt tcaggcaatt tgtggaagct
6120





tgcggtcaag ccttagcggt aaacgaacgt ctgattaaag aagaccagct cgagtatcag
6180





gaagaaatga aagccaacta cagggaaatg gcgaaggagc tttctgaaat catgcatgag
6240





cagatctgcc ccctggagga gaagacgagc gtcttaccga attcccttca catcttcaac
6300





gccatcagtg ggactccaac aagcacaatg gttcacggga tgaccagctc gtcttcggtc
6360





gtgtgattac atctcatggc ccgtgtgtgg ggacttgctt tgtcatttgc aaactcagga
6420





tgctttccaa agccaatcac tggggagacc gagcacaggg aggaccaagg ggaaggggag
6480





agaaaggaaa taaagaacaa cgttatttct taacagactt tctataggag ttgtaagaag
6540





gtgcacatat ttttttaaat ctcactggca atattcaaag ttttcattgt gtcttaacaa
6600





aggtgtggta gacactcttg agctggactt agattttatt cttccttgca gagtagtgtt
6660





agaatagatg gcctacagaa aaaaaaggtt ctgggatcta catggcaggg agggctgcac
6720





tgacattgat gcctggggga ccttttgcct cgaggctgag ctggaaaatc ttgaaaatat
6780





tttttttttc ctgtggcaca ttcaggttga atacaagaac tatttttgtg actagttttt
6840





gatgacctaa gggaactgac cattgtaatt tttgtaccag tgaaccagga gatttagtgc
6900





ttttatattc atttccttgc atttaagaaa atatgaaagc ttaaggaatt atgtgagctt
6960





aaaactagtc aagcagttta gaaccaaagg cctatattaa taaccgcaac tatgctgaaa
7020





agtacaaagt agtacagtat attgttatgt acatatcatt gttaatacag tcctggcatt
7080





ctgtacatat atgtattaca tttctacatt tttaatactc acatgggctt atgcattaag
7140





tttaattgtg ataaatttgt gctgttccag tatatgcaat acactttaat gttttattct
7200





tgtacataaa aatgtgcaat atggagatgt atacagtctt tactatatta ggtttataaa
7260





cagttttaag aatttcatcc ttttgccaaa atggtggagt atgtaattgg taaatcataa
7320





atcctgtggt gaatggtggt gtactttaaa gctgtcacca tgttatattt tcttttaaga
7380





ctttaattta gtaattttat atttgggaaa ataaaggttt ttaattttat ttaactggaa
7440





tcactgccct gctgtaatta aacattctgt accacatctg tattaaaaag acattgctga
7500





ccatta
7506
















TABLE 8








hTRG polypeptide sequence (SEQ ID NO:8)
















Met Ser Gln Pro Pro Leu Leu Pro Ala Ser Ala Glu Thr Arg Lys Phe



1               5                   10                  15





Thr Arg Ala Leu Ser Lys Pro Gly Thr Ala Ala Glu Leu Arg Gln Ser


            20                  25                  30





Val Ser Glu Val Val Arg Gly Ser Val Leu Leu Ala Lys Pro Lys Leu


        35                  40                  45





Ile Glu Pro Leu Asp Tyr Glu Asn Val Ile Val Gln Lys Lys Thr Gln


    50                  55                  60





Ile Leu Asn Asp Cys Leu Arg Glu Met Leu Leu Phe Pro Tyr Asp Asp


65                  70                  75                  80





Phe Gln Thr Ala Ile Leu Arg Arg Gln Gly Arg Tyr Ile Cys Ser Thr


                85                  90                  95





Val Pro Ala Lys Ala Glu Glu Glu Ala Gln Ser Leu Phe Val Thr Glu


            100                 105                 110





Cys Ile Lys Thr Tyr Asn Ser Asp Trp His Leu Val Asn Tyr Lys Tyr


        115                 120                 125





Glu Asp Tyr Ser Gly Glu Phe Arg Gln Leu Pro Asn Lys Val Val Lys


    130                 135                 140





Leu Asp Lys Leu Pro Val His Val Tyr Glu Val Asp Glu Glu Val Asp


145                 150                 155                 160





Lys Asp Glu Asp Ala Ala Ser Leu Gly Ser Gln Lys Gly Gly Ile Thr


                165                 170                 175





Lys His Gly Trp Leu Tyr Lys Gly Asn Met Asn Ser Ala Ile Ser Val


            180                 185                 190





Thr Met Arg Ser Phe Lys Arg Arg Phe Phe His Leu Ile Gln Leu Gly


        195                 200                 205





Asp Gly Ser Tyr Lys Phe Glu Phe Leu Lys Asp Leu Gln Lys Glu Pro


    210                 215                 220





Lys Gly Ser Ile Phe Leu Gly Phe Leu Tyr Gly Val Ser Phe Arg Asn


225                 230                 235                 240





Asn Lys Val Arg Arg Phe Ala Phe Glu Leu Lys Met Gln Asp Lys Ser


                245                 250                 255





Ser Tyr Leu Leu Ala Ala Asp Ser Glu Val Glu Met Glu Glu Trp Ile


            260                 265                 270





Thr Ile Leu Asn Lys Ile Leu Gln Leu Asn Phe Glu Ala Ala Met Gln


        275                 280                 285





Glu Lys Arg Asn Gly Asp Ser His Glu Asp Asp Glu Gln Ser Lys Leu


    290                 295                 300





Glu Gly Ser Gly Ser Gly Leu Asp Ser Tyr Leu Pro Glu Leu Ala Lys


305                 310                 315                 320





Ser Ala Arg Glu Ala Glu Ile Lys Leu Lys Ser Glu Ser Arg Val Lys


                325                 330                 335





Leu Phe Tyr Leu Asp Pro Asp Ala Gln Lys Leu Asp Phe Ser Ser Ala


            340             345                     350





Glu Pro Glu Val Lys Ser Phe Glu Glu Lys Phe Gly Lys Arg Ile Leu


        355                 360                 365





Val Lys Cys Asn Asp Leu Ser Phe Asn Leu Gln Cys Cys Val Ala Glu


    370                 375                 380





Asn Glu Glu Gly Pro Thr Thr Asn Val Glu Pro Phe Phe Val Thr Leu


385                 390                 395                 400





Ser Leu Phe Asp Ile Lys Tyr Asn Arg Lys Ile Ser Ala Asp Phe His


                405                 410                 415





Val Asp Leu Asn His Phe Ser Val Arg Gln Met Ile Ala Thr Thr Ser


            420                 425                 430





Pro Ala Leu Met Asn Gly Ser Gly Pro Glu Thr Gln Ser Ala Leu Arg


        435                 440                 445





Gly Ile Leu His Glu Ala Ala Met Gln Tyr Pro Lys Gln Gly Ile Phe


    450                 455                 460





Ser Val Thr Cys Pro His Pro Asp Ile Phe Leu Val Ala Arg Ile Glu


465                 470                 475                 480





Lys Val Leu Gln Gly Ser Ile Thr His Cys Ala Glu Pro Tyr Met Lys


                485                 490                 495





Ser Ser Asp Ser Ser Lys Val Ala Gln Lys Val Leu Lys Asn Ala Lys


            500                 505                 510





Gln Ala Cys Gln Arg Leu Gly Gln Tyr Arg Met Pro Phe Ala Trp Ala


        515                 520                 525





Ala Arg Thr Leu Phe Lys Asp Ala Ser Gly Asn Leu Asp Lys Asn Ala


    530                 535                 540





Arg Phe Ser Ala Ile Tyr Arg Gln Asp Ser Asn Lys Leu Ser Asn Asp


545                 550                 555                 560





Asp Met Leu Lys Leu Leu Ala Asp Phe Arg Lys Pro Glu Lys Met Ala


                565                 570                 575





Lys Leu Pro Val Ile Leu Gly Asn Leu Asp Ile Thr Ile Asp Asn Val


            580                 585                 590





Ser Ser Asp Phe Pro Asn Tyr Val Asn Ser Ser Tyr Ile Pro Thr Lys


        595                 600                 605





Gln Phe Glu Thr Cys Ser Lys Thr Pro Ile Thr Phe Glu Val Glu Glu


    610                 615                 620





Phe Val Pro Cys Ile Pro Lys His Thr Gln Pro Tyr Thr Ile Tyr Thr


625                 630                 635                 640





Asn His Leu Tyr Val Tyr Pro Lys Tyr Leu Lys Tyr Asp Ser Gln Lys


                645                 650                 655





Ser Phe Ala Lys Ala Arg Asn Ile Ala Ile Cys Ile Glu Phe Lys Asp


            660                 665                 670





Ser Asp Glu Glu Asp Ser Gln Pro Leu Lys Cys Ile Tyr Gly Arg Pro


        675                 680                 685





Gly Gly Pro Val Phe Thr Arg Ser Ala Phe Ala Ala Val Leu His His


    690                 695                 700





His Gln Asn Pro Glu Phe Tyr Asp Glu Ile Lys Ile Glu Leu Pro Thr


705                 710                 715                 720





Gln Leu His Glu Lys His His Leu Leu Leu Thr Phe Phe His Val Ser


                725                 730                 735





Cys Asp Asn Ser Ser Lys Gly Ser Thr Lys Lys Arg Asp Val Val Glu


            740                 745                 750





Thr Gln Val Gly Tyr Ser Trp Leu Pro Leu Leu Lys Asp Gly Arg Val


        755                 760                 765





Val Thr Ser Glu Gln His Ile Pro Val Ser Ala Asn Leu Pro Ser Gly


    770                 775                 780





Tyr Leu Gly Tyr Gln Glu Leu Gly Met Gly Arg His Tyr Gly Pro Glu


785                 790                 795                 800





Ile Lys Trp Val Asp Gly Gly Lys Pro Leu Leu Lys Ile Ser Thr His


                805                 810                 815





Leu Val Ser Thr Val Tyr Thr Gln Asp Gln His Leu His Asn Phe Phe


            820                 825                 830





Gln Tyr Cys Gln Lys Thr Glu Ser Gly Ala Gln Ala Leu Gly Asn Glu


        835                 840                 845





Leu Val Lys Tyr Leu Lys Ser Leu His Ala Met Glu Gly His Val Met


    850                 855                 860





Ile Ala Phe Leu Pro Thr Ile Leu Asn Gln Leu Phe Arg Val Leu Thr


865                 870                 875                 880





Arg Ala Thr Gln Glu Glu Val Ala Val Asn Val Thr Arg Val Ile Ile


                885                 890                 895





His Val Val Ala Gln Cys His Glu Glu Gly Leu Glu Ser His Leu Arg


            900                 905                 910





Ser Tyr Val Lys Tyr Ala Tyr Lys Ala Glu Pro Tyr Val Ala Ser Glu


        915                 920                 925





Tyr Lys Thr Val His Glu Glu Leu Thr Lys Ser Met Thr Thr Ile Leu


    930                 935                 940





Lys Pro Ser Ala Asp Phe Leu Thr Ser Asn Lys Leu Leu Lys Tyr Ser


945                 950                 955                 960





Trp Phe Phe Phe Asp Val Leu Ile Lys Ser Met Ala Gln His Leu Ile


                965                 970                 975





Glu Asn Ser Lys Val Lys Leu Leu Arg Asn Gln Arg Phe Pro Ala Ser


            980                 985                 990





Tyr His His Ala Val Glu Thr Val Val Asn Met Leu Met Pro His Ile


        995                 1000                1005





Thr Gln Lys Phe Arg Asp Asn Pro Glu Ala Ser Lys Asn Ala Asn


    1010                1015                1020





His Ser Leu Ala Val Phe Ile Lys Arg Cys Phe Thr Phe Met Asp


    1025                1030                1035





Arg Gly Phe Val Phe Lys Gln Ile Asn Asn Tyr Ile Ser Cys Phe


    1040                1045                1050





Ala Pro Gly Asp Pro Lys Thr Leu Phe Glu Tyr Lys Phe Glu Phe


    1055                1060                1065





Leu Arg Val Val Cys Asn His Glu His Tyr Ile Pro Leu Asn Leu


    1070                1075                1080





Pro Met Pro Phe Gly Lys Gly Arg Ile Gln Arg Tyr Gln Asp Leu


    1085                1090                1095





Gln Leu Asp Tyr Ser Leu Thr Asp Glu Phe Cys Arg Asn His Phe


    1100                1105                1110





Leu Val Gly Leu Leu Leu Arg Glu Val Gly Thr Ala Leu Gln Glu


    1115                1120                1125





Phe Arg Glu Val Arg Leu Ile Ala Ile Ser Val Leu Lys Asn Leu


    1130                1135                1140





Leu Ile Lys His Ser Phe Asp Asp Arg Tyr Ala Ser Arg Ser His


    1145                1150                1155





Gln Ala Arg Ile Ala Thr Leu Tyr Leu Pro Leu Phe Gly Leu Leu


    1160                1165                1170





Ile Glu Asn Val Gln Arg Ile Asn Val Arg Asp Val Ser Pro Phe


    1175                1180                1185





Pro Val Asn Ala Gly Met Thr Val Lys Asp Glu Ser Leu Ala Leu


    1190                1195                1200





Pro Ala Val Asn Pro Leu Val Thr Pro Gln Lys Gly Ser Thr Leu


    1205                1210                1215





Asp Asn Ser Leu His Lys Asp Leu Leu Gly Ala Ile Ser Gly Ile


    1220                1225                1230





Ala Ser Pro Tyr Thr Thr Ser Thr Pro Asn Ile Asn Ser Val Arg


    1235                1240                1245





Asn Ala Asp Ser Arg Gly Ser Leu Ile Ser Thr Asp Ser Gly Asn


    1250                1255                1260





Ser Leu Pro Glu Arg Asn Ser Glu Lys Ser Asn Ser Leu Asp Lys


    1265                1270                1275





His Gln Gln Ser Ser Thr Leu Gly Asn Ser Val Val Arg Cys Asp


    1280                1285                1290





Lys Leu Asp Gln Ser Glu Ile Lys Ser Leu Leu Met Cys Phe Leu


    1295                1300                1305





Tyr Ile Leu Lys Ser Met Ser Asp Asp Ala Leu Phe Thr Tyr Trp


    1310                1315                1320





Asn Lys Ala Ser Thr Ser Glu Leu Met Asp Phe Phe Thr Ile Ser


    1325                1330                1335





Glu Val Cys Leu His Gln Phe Gln Tyr Met Gly Lys Arg Tyr Ile


    1340                1345                1350





Ala Arg Thr Gly Met Met His Ala Arg Leu Gln Gln Leu Gly Ser


    1355                1360                1365





Leu Asp Asn Ser Leu Thr Phe Asn His Ser Tyr Gly His Ser Asp


    1370                1375                1380





Ala Asp Val Leu His Gln Ser Leu Leu Glu Ala Asn Ile Ala Thr


    1385                1390                1395





Glu Val Cys Leu Thr Ala Leu Asp Thr Leu Ser Leu Phe Thr Leu


    1400                1405                1410





Ala Phe Lys Asn Gln Leu Leu Ala Asp His Gly His Asn Pro Leu


    1415                1420                1425





Met Lys Lys Val Phe Asp Val Tyr Leu Cys Phe Leu Gln Lys His


    1430                1435                1440





Gln Ser Glu Thr Ala Leu Lys Asn Val Phe Thr Ala Leu Arg Ser


    1445                1450                1455





Leu Ile Tyr Lys Phe Pro Ser Thr Phe Tyr Glu Gly Arg Ala Asp


    1460                1465                1470





Met Cys Ala Ala Leu Cys Tyr Glu Ile Leu Lys Cys Cys Asn Ser


    1475                1480                1485





Lys Leu Ser Ser Ile Arg Thr Glu Ala Ser Gln Leu Leu Tyr Phe


    1490                1495                1500





Leu Met Arg Asn Asn Phe Asp Tyr Thr Gly Lys Lys Ser Phe Val


    1505                1510                1515





Arg Thr His Leu Gln Val Ile Ile Ser Val Ser Gln Leu Ile Ala


    1520                1525                1530





Asp Val Val Gly Ile Gly Gly Thr Arg Phe Gln Gln Ser Leu Ser


    1535                1540                1545





Ile Ile Asn Asn Cys Ala Asn Ser Asp Arg Leu Ile Lys His Thr


    1550                1555                1560





Ser Phe Ser Ser Asp Val Lys Asp Leu Thr Lys Arg Ile Arg Thr


    1565                1570                1575





Val Leu Met Ala Thr Ala Gln Met Lys Glu His Glu Asn Asp Pro


    1580                1585                1590





Glu Met Leu Val Asp Leu Gln Tyr Ser Leu Ala Lys Ser Tyr Ala


    1595                1600                1605





Ser Thr Pro Glu Leu Arg Lys Thr Trp Leu Asp Ser Met Ala Arg


    1610                1615                1620





Ile His Val Lys Asn Gly Asp Leu Ser Glu Ala Ala Met Cys Tyr


    1625                1630                1635





Val His Val Thr Ala Leu Val Ala Glu Tyr Leu Thr Arg Lys Glu


    1640                1645                1650





Ala Val Gln Trp Glu Pro Pro Leu Leu Pro His Ser His Ser Ala


    1655                1660                1665





Cys Leu Arg Arg Ser Arg Gly Gly Val Phe Arg Gln Gly Cys Thr


    1670                1675                1680





Ala Phe Arg Val Ile Thr Pro Asn Ile Asp Glu Glu Ala Ser Met


    1685                1690                1695





Met Glu Asp Val Gly Met Gln Asp Val His Phe Asn Glu Asp Val


    1700                1705                1710





Leu Met Glu Leu Leu Glu Gln Cys Ala Asp Gly Leu Trp Lys Ala


    1715                1720                1725





Glu Arg Tyr Glu Leu Ile Ala Asp Ile Tyr Lys Leu Ile Ile Pro


    1730                1735                1740





Ile Tyr Glu Lys Arg Arg Asp Phe Glu Arg Leu Ala His Leu Tyr


    1745                1750                1755





Asp Thr Leu His Arg Ala Tyr Ser Lys Val Thr Glu Val Met His


    1760                1765                1770





Ser Gly Arg Arg Leu Leu Gly Thr Tyr Phe Arg Val Ala Phe Phe


    1775                1780                1785





Gly Gln Ala Ala Gln Tyr Gln Phe Thr Asp Ser Glu Thr Asp Val


    1790                1795                1800





Glu Gly Phe Phe Glu Asp Glu Asp Gly Lys Glu Tyr Ile Tyr Lys


    1805                1810                1815





Glu Pro Lys Leu Thr Pro Leu Ser Glu Ile Ser Gln Arg Leu Leu


    1820                1825                1830





Lys Leu Tyr Ser Asp Lys Phe Gly Ser Glu Asn Val Lys Met Ile


    1835                1840                1845





Gln Asp Ser Gly Lys Val Asn Pro Lys Asp Leu Asp Ser Lys Tyr


    1850                1855                1860





Ala Tyr Ile Gln Val Thr His Val Ile Pro Phe Phe Asp Glu Lys


    1865                1870                1875





Glu Leu Gln Glu Arg Lys Thr Glu Phe Glu Arg Ser His Asn Ile


    1880                1885                1890





Arg Arg Phe Met Phe Glu Met Pro Phe Thr Gln Thr Gly Lys Arg


    1895                1900                1905





Gln Gly Gly Val Glu Glu Gln Cys Lys Arg Arg Thr Ile Leu Thr


    1910                1915                1920





Ala Ile His Cys Phe Pro Tyr Val Lys Lys Arg Ile Pro Val Met


    1925                1930                1935





Tyr Gln His His Thr Asp Leu Asn Pro Ile Glu Val Ala Ile Asp


    1940                1945                1950





Glu Met Ser Lys Lys Val Ala Glu Leu Arg Gln Leu Cys Ser Ser


    1955                1960                1965





Ala Glu Val Asp Met Ile Lys Leu Gln Leu Lys Leu Gln Gly Ser


    1970                1975                1980





Val Ser Val Gln Val Asn Ala Gly Pro Leu Ala Tyr Ala Arg Ala


    1985                1990                1995





Phe Leu Asp Asp Thr Asn Thr Lys Arg Tyr Pro Asp Asn Lys Val


    2000                2005                2010





Lys Leu Leu Lys Glu Val Phe Arg Gln Phe Val Glu Ala Cys Gly


    2015                2020                2025





Gln Ala Leu Ala Val Asn Glu Arg Leu Ile Lys Glu Asp Gln Leu


    2030                2035                2040





Glu Tyr Gln Glu Glu Met Lys Ala Asn Tyr Arg Glu Met Ala Lys


    2045                2050                2055





Glu Leu Ser Glu Ile Met Hls Glu Gln Ile Cys Pro Leu Glu Glu


    2060                2065                2070





Lys Thr Ser Val Leu Pro Asn Ser Leu His Ile Phe Asn Ala Ile


    2075                2080                2085





Ser Gly Thr Pro Thr Ser Thr Met Val His Gly Met Thr Ser Ser


    2090                2095                2100





Ser Ser Val Val


    2105
















TABLE 9








hMX1 nucleotide sequence (SEQ ID NO:9)

















ttttgtttac agggaacacg ggtctggctg agagaaaatg gccagcattt tccaagtact
60






gtaaattcct gtgcagaagg catcgtcgtc ttccggacag actatggtca ggtattcact
120





tacaagcaga gcacaattac ccaccagaag gtgactgcta tgcaccccac gaacgaggag
180





ggcgtggatg acatggcgtc cttgacagag ctccatggcg gctccatcat gtataactta
240





ttccagcggt ataagagaaa tcaaatatgg acctacatcg gctccatcct ggcctctgtg
300





aacccctacc agcccatcgc cgggctgtac gagcctgcca ccatggagca gtacagccgg
360





cgccacctgg gcgagctgcc cccgcacatc ttcgccatcg ccaacgagtg ctaccgctgc
420





ctgtggaagc gccacgacaa ccagtgcatc ctcatcaagg gtgaaagtgg ggcaggtaaa
480





accgaaagca ctaaattgat cctcaagttt ctgtcagtca tcagtcaaca gtctttggaa
540





ttgtccttaa aggagaagac atcctgtgtt gaacgagcta ttcttgaaag cagccccatc
600





atggaagctt tcggcaatgc gaagaccgtg tacaacaaca actctagtcg ctttgggaag
660





tttgttcagc tgaacatctg tcagaaagga aatattcagg gcgggagaat tgtagattgt
720





atcctctctt cccagaaccg agtagtaagg caaaatcccg gggaaaggaa ttatcacata
780





ttttatgcac tgctggcagg gctggaacat gaagaaagag aagaatttta tttatctacg
840





ccagaaaact accactactt gaatcagtct ggatgtgtag aagacaagac aatcagtgac
900





caggaatcct ttagggaagt tattacggca atggacgtga tgcagttcag caaggaggaa
960





gttcgggaag tgtcgaggct gcttgctggt atactgcatc ttgggaacat agaatttatc
1020





actgctggtg gggcacaggt ttccttcaaa acagctttgg gcagatctgc ggagttactt
1080





gggctggacc caacacagct cacagatgct ttgacccaga gatcaatgtt cctcagggga
1140





gaagagatcc tcacgcctct caatgttcaa caggcagtag acagcaggga ctccctggcc
1200





atggctctgt atgcgtgctg ctttgagtgg gtaatcaaga agatcaacag caggatcaaa
1260





ggcaatgagg acttcaagtc tattggcatc ctcgacatct ttggatttga aaactttgag
1320





gttaatcact ttgaacagtt caatataaac tatgcaaacg agaaacttca ggagtacttc
1380





aacaagcata ttttttcttt agaacaacta gaatatagca gggaaggatt agtgtgggaa
1440





gatattgact ggatagacaa tggagaatgc ctggacttga ttgagaagaa acttggcctc
1500





ctagccctta tcaatgaaga aagccatttt cctcaagcca cagacagcac cttattggag
1560





aagctacaca gtcagcatgc gaataaccac ttttatgtga agcccagagt tgcagttaac
1620





aattttggag tgaagcacta tgctggagag gtgcaatatg atgtccgagg tatcttggag
1680





aagaacagag atacatttcg agatgacctt ctcaatttgc taagagaaag ccggtttgac
1740





tttatctacg atctttttga acatgtttca agccgcaaca accaggatac cttgaaatgt
1800





ggaagcaaac atcggcggcc tacagtcagc tcacagttca aggttgactc actgcattcc
1860





ttaatggcaa cgctaagctc ctctaatcct ttctttgttc gctgtatcaa gccaaacatg
1920





cagaagatgc cagaccagtt tgaccaggcg gttgtgctga accagctgcg gtactcaggg
1980





atgctggaga ctgtgagaat ccgcaaagct gggtatgcgg tccgaagacc ctttcaggac
2040





ttttacaaaa ggtataaagt gctgatgagg aatctggctc tgcctgagga cgtccgaggg
2100





aagtgcacga gcctgctgca gctctatgat gcctccaaca gcgagtggca gctggggaag
2160





accaaggtat ttcttcgaga atccttggaa cagaaactgg agaagcggag ggaagaggaa
2220





gtgagccacg cggccatggt gattcgggcc catgtcttgg gcttcttagc acggaaacaa
2280





tacagaaagg tcctttattg tgtggtgata atacagaaga attacagagc attccttctg
2340





aggaggagat ttttgcacct gaaaaaggca gccatagttt tccagaagca actcagaggt
2400





cagattgctc ggagagttta cagacaattg ctggcagaga aaagggagca agaagaaaag
2460





aagaaacagg aagaggaaga aaagaagaaa cgggaggaag aagaaagaga aagagagaga
2520





gagcgaagag aagccgagct ccgcgcccag caggaagaag aaacgaggaa gcagcaagaa
2580





ctcgaagcct tgcagaagag ccagaaggaa gctgaactga cccgtgaact ggagaaacag
2640





aaggaaaata agcaggtgga agagatcctc cgtctggaga aagaaatcga ggacctgcag
2700





cgcatgaagg agcagcagga gctgtcgctg accgaggctt ccctgcagaa gctgcaggag
2760





cggcgggacc aggagctccg caggctggag gaggaagcgt gcagggcggc ccaggagttc
2820





ctcgagtccc tcaatttcga cgagatcgac gagtgtgtcc ggaatatcga gcggtccctg
2880





tcggggggaa gcgaattttc cagcgagctg gctgagagcg catgcgagga gaagcccaac
2940





ttcaacttca gccagcccta cccagaggag gaggtcgatg agggcttcga agccgacgac
3000





gacgccttca aggactcccc caaccccagc gagcacggcc actcagacca gcgaacaagt
3060





ggcatccgga ccagcgatga ctcttcagag gaggacccat acatgaacga cacggtggtg
3120





cccaccagcc ccagtgcgga cagcacggtg ctgctcgccc catcagtgca ggactccggg
3180





agcctacaca actcctccag cggcgagtcc acctactgca tgccccagaa cgctggggac
3240





ttgccctccc cagacggcga ctacgactac gaccaggatg actatgagga cggtgccatc
3300





acttccggca gcagcgtgac cttctccaac tcctacggca gccagtggtc ccccgactac
3360





cgctgctctg tggggaccta caacagctcg ggtgcctacc ggttcagctc tgagggggcg
3420





cagtcctcgt ttgaagatag tgaagaggac tttgattcca ggtttgatac agatgatgag
3480





ctttcatacc ggcgtgactc tgtgtacagc tgtgtcactc tgccgtattt ccacagcttt
3540





ctgtacatga aaggtggcct gatgaactct tggaaacgcc gctggtgcgt cctcaaggat
3600





gaaaccttct tgtggttccg ctccaagcag gaggccctca agcaaggctg gctccacaaa
3660





aaaggggggg gctcctccac gctgtccagg agaaattgga agaagcgctg gtttgtcctc
3720





cgccagtcca agctgatgta ctttgaaaac gacagcgagg agaagctcaa gggcaccgta
3780





gaagtgcgaa cggcaaaaga gatcatagat aacaccacca aggagaatgg gatcgacatc
3840





attatggccg ataggacttt ccacctgatt gcagagtccc cagaagatgc cagccagtgg
3900





ttcagcgtgc tgagtcaggt ccacgcgtcc acggaccagg agatccagga gatgcatgat
3960





gagcaggcaa acccacagaa tgctgtgggc accttggatg tggggctgat tgattctgtg
4020





tgtgcctctg acagccctga tagacccaac tcgtttgtga tcatcacggc caaccgggtg
4080





ctgcactgca acgccgacac gccggaggag atgcaccact ggataaccct gctgcagagg
4140





tccaaagggg acaccagagt ggagggccag gaattcatcg tgagaggatg gttgcacaaa
4200





gaggtgaaga acagtccaaa gatgtcttca ctgaaactga agaaacggtg gtttgtactc
4260





acccacaatt ccctggatta ctacaagagt tcagagaaga acgcgctcaa actggggacc
4320





ctggtcctca acagcctctg ctctgtcgtc cccccagatg agaagatatt caaagagaca
4380





ggctactgga acgtcaccgt gtacgggcgc aagcactgtt accggctcta caccaagctg
4440





ctcaacgagg ccacccggtg gtccagtgtc attcaaaacg tgactgacac caaggccccg
4500





atcgacaccc ccacccagca gctgattcaa gatatcaagg agaactgcct gaactcggat
4560





gtggtggaac agatttacaa gcggaacccg atccttcgat acacccatca ccccttgcac
4620





tccccgctcc tgccccttcc gtatggggac ataaatctca acttgctgaa agacaaaggc
4680





tataccaccc ttcaggatga ggccatcaag atattcaatt ccctgcagca actggagtcc
4740





atgtctgacc caattccaat aatccagggc atcctacaga cagggcatga cctgcgacct
4800





ctgcgggacg agctgtactg ccagcttatc aaacagacca acaaagtgcc ccaccccggc
4860





agtgtgggca acctgtacag ctggcagatc ctgacatgcc tgagctgcac cttcctgccg
4920





agtcgaggga ttctcaagta tctcaagttc catctgaaaa ggatacggga acagtttcca
4980





ggaaccgaga tggaaaaata cgctctcttc acttacgaat ctcttaagaa aaccaaatgc
5040





cgagagtttg tgccttcccg agatgaaata gaagctctga tccacaggca ggaaatgaca
5100





tccacggtct attgccatgg cggcggctcc tgcaagatca ccatcaactc ccacaccacc
5160





gctggggagg tggtggagaa gctgatccga ggcctggcca tggaggacag caggaacatg
5220





tttgctttgt ttgaatacaa cggccacgtc gacaaagcca ttgaaagtcg aaccgtcgta
5280





gctgatgtct tagccaagtt tgaaaagctg gctgccacat ccgaggttgg ggacctgcca
5340





tggaaattct acttcaaact ttactgcttc ctggacacag acaacgtgcc aaaagacagt
5400





gtggagtttg catttatgtt tgaacaggcc cacgaagcgg ttatccatgg ccaccatcca
5460





gccccggaag aaaacctcca ggttcttgct gccctgcgac tccagtatct gcagggggat
5520





tatactctgc acgctgccat cccacctctc gaagaggttt attccctgca gagactcaag
5580





gcccgcatca gccagtcaac caaaaccttc accccttgtg aacggctgga gaagaggcgg
5640





acgagcttcc tagaggggac cctgaggcgg agcttccgga caggatccgt ggtccggcag
5700





aaggtcgagg aggagcagat gctggacatg tggattaagg aagaagtctc ctctgctcga
5760





gccagtatca ttgacaagtg gaggaaattt cagggaatga accaggaaca ggccatggcc
5820





aagtacatgg ccttgatcaa ggagtggcct ggctatggct cgacgctgtt tgatgtggag
5880





tgcaaggaag gtggcttccc tcaggaactc tggttgggtg tcagcgcgga cgccgtctcc
5940





gtctacaagc gtggagaggg aagaccactg gaagtcttcc agtatgaaca catcctctct
6000





tttggggcac ccctggcgaa tacgtataag atcgtggtcg atgagaggga gctgctcttt
6060





gaaaccagtg aggtagtgga tgtggccaag ctcatgaaag cctacatcag catgatcgtg
6120





aagaagcgct acagcacgac acgctccgcc agcagccagg gcagctccag g
6171
















TABLE 10








hMX1 polypeptide sequence (SEQ ID NO:10)
















Phe Cys Leu Gln Gly Thr Arg Val Trp Leu Arg Glu Asn Gly Gln His



1               5                   10                  15





Phe Pro Ser Thr Val Asn Ser Cys Ala Glu Gly Ile Val Val Phe Arg


            20                  25                  30





Thr Asp Tyr Gly Gln Val Phe Thr Tyr Lys Gln Ser Thr Ile Thr His


        35                  40                  45





Gln Lys Val Thr Ala Met His Pro Thr Asn Glu Glu Gly Val Asp Asp


    50                  55                  60





Met Ala Ser Leu Thr Glu Leu His Gly Gly Ser Ile Met Tyr Asn Leu


65                  70                  75                  80





Phe Gln Arg Tyr Lys Arg Asn Gln Ile Trp Thr Tyr Ile Gly Ser Ile


                85                  90                  95





Leu Ala Ser Val Asn Pro Tyr Gln Pro Ile Ala Gly Leu Tyr Glu Pro


            100                 105                 110





Ala Thr Met Glu Gln Tyr Ser Arg Arg His Leu Gly Glu Leu Pro Pro


        115                 120                 125





His Ile Phe Ala Ile Ala Asn Glu Cys Tyr Arg Cys Leu Trp Lys Arg


    130                 135                 140





His Asp Asn Gln Cys Ile Leu Ile Lys Gly Glu Ser Gly Ala Gly Lys


145                 150                 155                 160





Thr Glu Ser Thr Lys Leu Ile Leu Lys Phe Leu Ser Val Ile Ser Gln


                165                 170                 175





Gln Ser Leu Glu Leu Ser Leu Lys Glu Lys Thr Ser Cys Val Glu Arg


            180                 185                 190





Ala Ile Leu Glu Ser Ser Pro Ile Met Glu Ala Phe Gly Asn Ala Lys


        195                 200                 205





Thr Val Tyr Asn Asn Asn Ser Ser Arg Phe Gly Lys Phe Val Gln Leu


    210                 215                 220





Asn Ile Cys Gln Lys Gly Asn Ile Gln Gly Gly Arg Ile Val Asp Cys


225                 230                 235                 240





Ile Leu Ser Ser Gln Asn Arg Val Val Arg Gln Asn Pro Gly Glu Arg


                245                 250                 255





Asn Tyr His Ile Phe Tyr Ala Leu Leu Ala Gly Leu Glu His Glu Glu


            260                 265                 270





Arg Glu Glu Phe Tyr Leu Ser Thr Pro Glu Asn Tyr His Tyr Leu Asn


        275                 280                 285





Gln Ser Gly Cys Val Glu Asp Lys Thr Ile Ser Asp Gln Glu Ser Phe


    290                 295                 300





Arg Glu Val Ile Thr Ala Met Asp Val Met Gln Phe Ser Lys Glu Glu


305                 310                 315                 320





Val Arg Glu Val Ser Arg Leu Leu Ala Gly Ile Leu His Leu Gly Asn


                325                 330                 335





Ile Glu Phe Ile Thr Ala Gly Gly Ala Gln Val Ser Phe Lys Thr Ala


            340                 345                 350





Leu Gly Arg Ser Ala Glu Leu Leu Gly Leu Asp Pro Thr Gln Leu Thr


        355                 360                 365





Asp Ala Leu Thr Gln Arg Ser Met Phe Leu Arg Gly Glu Glu Ile Leu


    370                 375                 380





Thr Pro Leu Asn Val Gln Gln Ala Val Asp Ser Arg Asp Ser Leu Ala


385                 390                 395                 400





Met Ala Leu Tyr Ala Cys Cys Phe Glu Trp Val Ile Lys Lys Ile Asn


                405                 410                 415





Ser Arg Ile Lys Gly Asn Glu Asp Phe Lys Ser Ile Gly Ile Leu Asp


            420                 425                 430





Ile Phe Gly Phe Glu Asn Phe Glu Val Asn His Phe Glu Gln Phe Asn


        435                 440                 445





Ile Asn Tyr Ala Asn Glu Lys Leu Gln Glu Tyr Phe Asn Lys His Ile


    450                 455                 460





Phe Ser Leu Glu Gln Leu Glu Tyr Ser Arg Glu Gly Leu Val Trp Glu


465                 470                 475                 480





Asp Ile Asp Trp Ile Asp Asn Gly Glu Cys Leu Asp Leu Ile Glu Lys


                485                 490                 495





Lys Leu Gly Leu Leu Ala Leu Ile Asn Glu Glu Ser His Phe Pro Gln


            500                 505                 510





Ala Thr Asp Ser Thr Leu Leu Glu Lys Leu His Ser Gln His Ala Asn


        515                 520                 525





Asn His Phe Tyr Val Lys Pro Arg Val Ala Val Asn Asn Phe Gly Val


    530                 535                 540





Lys His Tyr Ala Gly Glu Val Gln Tyr Asp Val Arg Gly Ile Leu Glu


545                 550                 555                 560





Lys Asn Arg Asp Thr Phe Arg Asp Asp Leu Leu Asn Leu Leu Arg Glu


                565                 570                 575





Ser Arg Phe Asp Phe Ile Tyr Asp Leu Phe Glu His Val Ser Ser Arg


            580                 585                 590





Asn Asn Gln Asp Thr Leu Lys Cys Gly Ser Lys His Arg Arg Pro Thr


        595                 600                 605





Val Ser Ser Gln Phe Lys Val Asp Ser Leu His Ser Leu Met Ala Thr


    610                 615                 620





Leu Ser Ser Ser Asn Pro Phe Phe Val Arg Cys Ile Lys Pro Asn Met


625                 630                 635                 640





Gln Lys Met Pro Asp Gln Phe Asp Gln Ala Val Val Leu Asn Gln Leu


                645                 650                 655





Arg Tyr Ser Gly Met Leu Glu Thr Val Arg Ile Arg Lys Ala Gly Tyr


            660                 665                 670





Ala Val Arg Arg Pro Phe Gln Asp Phe Tyr Lys Arg Tyr Lys Val Leu


        675                 680                 685





Met Arg Asn Leu Ala Leu Pro Glu Asp Val Arg Gly Lys Cys Thr Ser


    690                 695                 700





Leu Leu Gln Leu Tyr Asp Ala Ser Asn Ser Glu Trp Gln Leu Gly Lys


705                 710                 715                 720





Thr Lys Val Phe Leu Arg Glu Ser Leu Glu Gln Lys Leu Glu Lys Arg


                725                 730                 735





Arg Glu Glu Glu Val Ser His Ala Ala Met Val Ile Arg Ala His Val


            740                 745                 750





Leu Gly Phe Leu Ala Arg Lys Gln Tyr Arg Lys Val Leu Tyr Cys Val


        755                 760                 765





Val Ile Ile Gln Lys Asn Tyr Arg Ala Phe Leu Leu Arg Arg Arg Phe


    770                 775                 780





Leu His Leu Lys Lys Ala Ala Ile Val Phe Gln Lys Gln Leu Arg Gly


785                 790                 795                 800





Gln Ile Ala Arg Arg Val Tyr Arg Gln Leu Leu Ala Glu Lys Arg Glu


                805                 810                 815





Gln Glu Glu Lys Lys Lys Gln Glu Glu Glu Glu Lys Lys Lys Arg Glu


            820                 825                 830





Glu Glu Glu Arg Glu Arg Glu Arg Glu Arg Arg Glu Ala Glu Leu Arg


        835                 840                 845





Ala Gln Gln Glu Glu Glu Thr Arg Lys Gln Gln Glu Leu Glu Ala Leu


    850                 855                 860





Gln Lys Ser Gln Lys Glu Ala Glu Leu Thr Arg Glu Leu Glu Lys Gln


865                 870                 875                 880





Lys Glu Asn Lys Gln Val Glu Glu Ile Leu Arg Leu Glu Lys Glu Ile


                885                 890                 895





Glu Asp Leu Gln Arg Met Lys Glu Gln Gln Glu Leu Ser Leu Thr Glu


            900                 905                 910





Ala Ser Leu Gln Lys Leu Gln Glu Arg Arg Asp Gln Glu Leu Arg Arg


        915                 920                 925





Leu Glu Glu Glu Ala Cys Arg Ala Ala Gln Glu Phe Leu Glu Ser Leu


    930                 935                 940





Asn Phe Asp Glu Ile Asp Glu Cys Val Arg Asn Ile Glu Arg Ser Leu


945                 950                 955                 960





Ser Gly Gly Ser Glu Phe Ser Ser Glu Leu Ala Glu Ser Ala Cys Glu


                965                 970                 975





Glu Lys Pro Asn Phe Asn Phe Ser Gln Pro Tyr Pro Glu Glu Glu Val


            980                 985                 990





Asp Glu Gly Phe Glu Ala Asp Asp Asp Ala Phe Lys Asp Ser Pro Asn


        995                 1000                1005





Pro Ser Glu His Gly His Ser Asp Gln Arg Thr Ser Gly Ile Arg


    1010                1015                1020





Thr Ser Asp Asp Ser Ser Glu Glu Asp Pro Tyr Met Asn Asp Thr


    1025                1030                1035





Val Val Pro Thr Ser Pro Ser Ala Asp Ser Thr Val Leu Leu Ala


    1040                1045                1050





Pro Ser Val Gln Asp Ser Gly Ser Leu His Asn Ser Ser Ser Gly


    1055                1060                1065





Glu Ser Thr Tyr Cys Met Pro Gln Asn Ala Gly Asp Leu Pro Ser


    1070                1075                1080





Pro Asp Gly Asp Tyr Asp Tyr Asp Gln Asp Asp Tyr Glu Asp Gly


    1085                1090                1095





Ala Ile Thr Ser Gly Ser Ser Val Thr Phe Ser Asn Ser Tyr Gly


    1100                1105                1110





Ser Gln Trp Ser Pro Asp Tyr Arg Cys Ser Val Gly Thr Tyr Asn


    1115                1120                1125





Ser Ser Gly Ala Tyr Arg Phe Ser Ser Glu Gly Ala Gln Ser Ser


    1130                1135                1140





Phe Glu Asp Ser Glu Glu Asp Phe Asp Ser Arg Phe Asp Thr Asp


    1145                1150                1155





Asp Glu Leu Ser Tyr Arg Arg Asp Ser Val Tyr Ser Cys Val Thr


    1160                1165                1170





Leu Pro Tyr Phe His Ser Phe Leu Tyr Met Lys Gly Gly Leu Met


    1175                1180                1185





Asn Ser Trp Lys Arg Arg Trp Cys Val Leu Lys Asp Glu Thr Phe


    1190                1195                1200





Leu Trp Phe Arg Ser Lys Gln Glu Ala Leu Lys Gln Gly Trp Leu


    1205                1210                1215





His Lys Lys Gly Gly Gly Ser Ser Thr Leu Ser Arg Arg Asn Trp


    1220                1225                1230





Lys Lys Arg Trp Phe Val Leu Arg Gln Ser Lys Leu Met Tyr Phe


    1235                1240                1245





Glu Asn Asp Ser Glu Glu Lys Leu Lys Gly Thr Val Glu Val Arg


    1250                1255                1260





Thr Ala Lys Glu Ile Ile Asp Asn Thr Thr Lys Glu Asn Gly Ile


    1265                1270                1275





Asp Ile Ile Met Ala Asp Arg Thr Phe His Leu Ile Ala Glu Ser


    1280                1285                1290





Pro Glu Asp Ala Ser Gln Trp Phe Ser Val Leu Ser Gln Val His


    1295                1300                1305





Ala Ser Thr Asp Gln Glu Ile Gln Glu Met His Asp Glu Gln Ala


    1310                1315                1320





Asn Pro Gln Asn Ala Val Gly Thr Leu Asp Val Gly Leu Ile Asp


    1325                1330                1335





Ser Val Cys Ala Ser Asp Ser Pro Asp Arg Pro Asn Ser Phe Val


    1340                1345                1350





Ile Ile Thr Ala Asn Arg Val Leu His Cys Asn Ala Asp Thr Pro


    1355                1360                1365





Glu Glu Met His His Trp Ile Thr Leu Leu Gln Arg Ser Lys Gly


    1370                1375                1380





Asp Thr Arg Val Glu Gly Gln Glu Phe Ile Val Arg Gly Trp Leu


    1385                1390                1395





His Lys Glu Val Lys Asn Ser Pro Lys Met Ser Ser Leu Lys Leu


    1400                1405                1410





Lys Lys Arg Trp Phe Val Leu Thr His Asn Ser Leu Asp Tyr Tyr


    1415                1420                1425





Lys Ser Ser Glu Lys Asn Ala Leu Lys Leu Gly Thr Leu Val Leu


    1430                1435                1440





Asn Ser Leu Cys Ser Val Val Pro Pro Asp Glu Lys Ile Phe Lys


    1445                1450                1455





Glu Thr Gly Tyr Trp Asn Val Thr Val Tyr Gly Arg Lys His Cys


    1460                1465                1470





Tyr Arg Leu Tyr Thr Lys Leu Leu Asn Glu Ala Thr Arg Trp Ser


    1475                1480                1485





Ser Val Ile Gln Asn Val Thr Asp Thr Lys Ala Pro Ile Asp Thr


    1490                1495                1500





Pro Thr Gln Gln Leu Ile Gln Asp Ile Lys Glu Asn Cys Leu Asn


    1505                1510                1515





Ser Asp Val Val Glu Gln Ile Tyr Lys Arg Asn Pro Ile Leu Arg


    1520                1525                1530





Tyr Thr His His Pro Leu His Ser Pro Leu Leu Pro Leu Pro Tyr


    1535                1540                1545





Gly Asp Ile Asn Leu Asn Leu Leu Lys Asp Lys Gly Tyr Thr Thr


    1550                1555                1560





Leu Gln Asp Glu Ala Ile Lys Ile Phe Asn Ser Leu Gln Gln Leu


    1565                1570                1575





Glu Ser Met Ser Asp Pro Ile Pro Ile Ile Gln Gly Ile Leu Gln


    1580                1585                1590





Thr Gly His Asp Leu Arg Pro Leu Arg Asp Glu Leu Tyr Cys Gln


    1595                1600                1605





Leu Ile Lys Gln Thr Asn Lys Val Pro His Pro Gly Ser Val Gly


    1610                1615                1620





Asn Leu Tyr Ser Trp Gln Ile Leu Thr Cys Leu Ser Cys Thr Phe


    1625                1630                1635





Leu Pro Ser Arg Gly Ile Leu Lys Tyr Leu Lys Phe His Leu Lys


    1640                1645                1650





Arg Ile Arg Glu Gln Phe Pro Gly Thr Glu Met Glu Lys Tyr Ala


    1655                1660                1665





Leu Phe Thr Tyr Glu Ser Leu Lys Lys Thr Lys Cys Arg Glu Phe


    1670                1675                1680





Val Pro Ser Arg Asp Glu Ile Glu Ala Leu Ile His Arg Gln Glu


    1685                1690                1695





Met Thr Ser Thr Val Tyr Cys His Gly Gly Gly Ser Cys Lys Ile


    1700                1705                1710





Thr Ile Asn Ser His Thr Thr Ala Gly Glu Val Val Glu Lys Leu


    1715                1720                1725





Ile Arg Gly Leu Ala Met Glu Asp Ser Arg Asn Met Phe Ala Leu


    1730                1735                1740





Phe Glu Tyr Asn Gly His Val Asp Lys Ala Ile Glu Ser Arg Thr


    1745                1750                1755





Val Val Ala Asp Val Leu Ala Lys Phe Glu Lys Leu Ala Ala Thr


    1760                1765                1770





Ser Glu Val Gly Asp Leu Pro Trp Lys Phe Tyr Phe Lys Leu Tyr


    1775                1780                1785





Cys Phe Leu Asp Thr Asp Asn Val Pro Lys Asp Ser Val Glu Phe


    1790                1795                1800





Ala Phe Met Phe Glu Gln Ala His Glu Ala Val Ile His Gly His


    1805                1810                1815





His Pro Ala Pro Glu Glu Asn Leu Gln Val Leu Ala Ala Leu Arg


    1820                1825                1830





Leu Gln Tyr Leu Gln Gly Asp Tyr Thr Leu His Ala Ala Ile Pro


    1835                1840                1845





Pro Leu Glu Glu Val Tyr Ser Leu Gln Arg Leu Lys Ala Arg Ile


    1850                1855                1860





Ser Gln Ser Thr Lys Thr Phe Thr Pro Cys Glu Arg Leu Glu Lys


    1865                1870                1875





Arg Arg Thr Ser Phe Leu Glu Gly Thr Leu Arg Arg Ser Phe Arg


    1880                1885                1890





Thr Gly Ser Val Val Arg Gln Lys Val Glu Glu Glu Gln Met Leu


    1895                1900                1905





Asp Met Trp Ile Lys Glu Glu Val Ser Ser Ala Arg Ala Ser Ile


    1910                1915                1920





Ile Asp Lys Trp Arg Lys Phe Gln Gly Met Asn Gln Glu Gln Ala


    1925                1930                1935





Met Ala Lys Tyr Met Ala Leu Ile Lys Glu Trp Pro Gly Tyr Gly


    1940                1945                1950





Ser Thr Leu Phe Asp Val Glu Cys Lys Glu Gly Gly Phe Pro Gln


    1955                1960                1965





Glu Leu Trp Leu Gly Val Ser Ala Asp Ala Val Ser Val Tyr Lys


    1970                1975                1980





Arg Gly Glu Gly Arg Pro Leu Glu Val Phe Gln Tyr Glu His Ile


    1985                1990                1995





Leu Ser Phe Gly Ala Pro Leu Ala Asn Thr Tyr Lys Ile Val Val


    2000                2005                2010





Asp Glu Arg Glu Leu Leu Phe Glu Thr Ser Glu Val Val Asp Val


    2015                2020                2025





Ala Lys Leu Met Lys Ala Tyr Ile Ser Met Ile Val Lys Lys Arg


    2030                2035                2040





Tyr Ser Thr Thr Arg Ser Ala Ser Ser Gln Gly Ser Ser Arg


    2045                2050                2055
















TABLE 11








hMX2 nucleotide sequence (SEQ ID NO:11)

















agctagtatc ttttattgtc agaacttctg tgagccaaca aacagttttg catggttgta
60






cacaaaggga caaggcaaat ttcttttttc gtgtgggtag acttagttgg cccaagtcct
120





taaaactttt ccatataaaa ataaaaagtc caagaccaga ttatttttct tctggtcata
180





aatgctgatt tatttacagg tgccttgttc agaccaccat tataaacttg ggataaaata
240





tgtgtgtatt aaagcctcag catttaatgt cagggtcctt tgaagattca ctcaagtgtt
300





aagacgtttc tggaatgcag cgtctctccc ccatagtcaa catggttatt atatctgtaa
360





tctatccaga atgatagaag ctaaccttcc aagtaacact ttgtttttaa cttaaatctt
420





ttagacatga aagactccaa aatgacttca ttcttgttct aaaaccagca ctggagccag
480





ctgttgaaga gtggtttata aatacagtta tcttgtaggc tgcttatctg tttataatac
540





agcagacaca gatggcagac tttgctacat gtaaaacaat ggagtcaaca cgtgtttttc
600





aaaatacagc aaagacagga aaatccagga tttgggtttg ttaataaaac caccttataa
660





agtaacaatt gagactatag ctctgcatta ttaaaatata cagactgtgt acaccattac
720





acatcctttt tccctttgct ttttaatgct catgaaacca tgattaaggt gttgagttta
780





tgaacacatg cacgaacagg caagcacgta cacttaaaag atgaaacaaa gaaaaaagtt
840





gattcatgtc attccatgag aaaggctgcc cgcagcactc cagctcaaac acactgtccc
900





ctcgagctct ccatccccct tcccactccc tcaccttccc tcagattcgg ggaaatcagg
960





ttgggaggtt agtgcatcat tgacagagaa tgcccccctt ccacgctctg ttaagtctcc
1020





cccagaaggg ggaaaggcag ttcccttcag tagcacagtt acggtcgatt agtgttggtt
1080





ccacaagtta aggcacttcc ggctgctttg gtggcagcgt ggttcctccc ctcctttttt
1140





aaggcatgtg tcctctaaga gtagtaaagc tttggaaact gtgcagactg ttaaagttga
1200





cagcttaata caggatcaat gaaggcggca ggcaaaagga tcctcggaga cacctccctc
1260





agaccagaag cttccagaaa gcctgggcag ctctgtgttt gttttggctg ggcatggcac
1320





actggagcca gcctaggcca gagggtggtg cgttcaggta gcaaagacag gtgggctctg
1380





tcccgccttc acctggagct gccctggctg ctggcggagc gtgtcgtgct gtagcgcttc
1440





ttcacgatca tgctgatgta ggctttcatg agcttggcca catccaccac ctcactggtt
1500





tcaaagagca gctccctctc atcgaccacg atcttatacg tattcgccag gggtgcccca
1560





aaagagagga tgtgttcata ctggaagact tccagtggtc ttccctctcc acgcttgtag
1620





acggagacgg cgtccgcgct gacacccaac cagagttcct gagggaagcc accttccttg
1680





cactggtggg caagagtcaa cagaagagtt aagtcatgaa gtggttggca acagaaagca
1740





tctaaaccat aagacaggct ttgagtgaag tcctctgtgc agaagattaa atatattcga
1800





tgtgcatgca tgcatggagg ggcctgaaat atgaaaaatg gcacctctct ggctatcttg
1860





atttctaact agttaatctc acgcttttgg gaaaacctca ctaactggca gagtctaaca
1920





tcttgctttg actctccact tctcagcatt attctactag ctgtttggat tagctacgtg
1980





gaagtggcct ggaaacgtac atgcttggcc gggggactta agaaagcttc cctgcaaccc
2040





aagccaagtc tactcttgta ttaatatctc cagttctgcc tccaatcctc tttgcggatg
2100





gttagtcttc aaatacaaaa tctaggatca cagaggaaaa ttctccaaat cacgcatgct
2160





gagcagttct ggctcctctt cacaaggagc agcaatggcc ttccatatgc agagtgggaa
2220





cagggacttt accagtttaa ctgtagactt tcctgtacag attggtggaa gaaaataaga
2280





ccccatatga aggggctaac aacacagggc tgatccaaac ctggacaagc aggagggcta
2340





taaattggag acgctgaaaa gagtctctag tttatatccc taataaccag acattctctg
2400





catcctccat gcaaaagcca gtagctttct ttttttcttt ttttttttga cggagtctca
2460





ctctgtcgcc caggctggag tgcagtggcg tgatcctggc tcactgcaac ctccacctcc
2520





tgagttcaag cgattctcct gtctcagcct ctcgagtagc tgggattaca ggtgcatgcc
2580





accacgccca gctaattttt tgcagagatg gggtttcacc gtgttagcca ggatggtctc
2640





gatctcctga cctcatgatc cgcccgcctt ggcctcccaa agcgctggga ttacaggcat
2700





gagccaccgc gcccggccaa gccagtagct ttctatgcta attcacagct cacgttttgc
2760





aggaagccaa gagtttaact gctattatct attccttgtc agggagaaat ggaattatgg
2820





ctttgtacaa agcacctgat ttttttatac ttaaaaacag gcataattga accaaccaaa
2880





ccaatcaaaa acatcaccta atgaaaagcc acccacggat tctagaattt ataatattta
2940





gaattttata cagcctcaat ataaagtcat cagatatacg ctgaattact gtgatcataa
3000





aaaatggaag ctaatctaga cgatgagctg gcacacttat ctgttaaggg ctgcatagta
3060





aagattttta gactttgtgg atcacatggt ctctgtcaca actactcaac tctgtacaaa
3120





aacagccagg gaatatatct aaaggaatga gcttggctgt gttccaataa aactttgttt
3180





agaaaaaaag gaggcaggca agatctgacc cacagaccag tttgccaaac tctcatctag
3240





acaattagta agatttcttt tcaataagcg gtctacttaa aacaaaacaa aaatcagtac
3300





tgggttgatg ccaatggcta aattccatta cgagatagac attcttcctt tcaaacagat
3360





ggctgtaaag aaaaaacaaa gtaaaatgca agtatatcca aagtttctaa tttgtatata
3420





cagctataac atttttttaa atgtagattt ttatcagtgt ttaaaaaatt agatctatag
3480





cttccctaag gaagggtaga agaatagatg acatcttaat tttgcattca ttcctaatat
3540





tacagatgca tttactacac aggagaagag aaactgtgag gagaagggag gcgttaatgg
3600





tacaattttg ggggctcgaa aaaaagaggt tgagagagca aaatgctcca tcttgtcttc
3660





tctccacatg aacttggccg tgatccatgt tctcagatgc cagcacccag cccaccccaa
3720





cacatggcag ccagttctca cctccacatc aaacagcgtc gagccatagc caggccactc
3780





cttgatcaag gccatgtact tggccatggc ctgttcctgg ttcattccct gaaatttcct
3840





ccacttgtca atgatactgg ctcgagcaga ggagacttct tccttaatcc acatgtccag
3900





catctgctcc tcctcgacct tctgccggac cacggatcct gtccggaagc tccgcctcag
3960





ggtcccctct aggaagctcg tccgcctctt ctccagccgt tcacaagggg tgaaggtttt
4020





ggttgactgg ctgatgcggg ccttgagtct ctgcagggaa taaacctctt cgagaggtgg
4080





gatggcagcg tgcagagtat aatccccctg cagatactgg agtcgcaggg cagcaagaac
4140





ctggaggttt tcttccgggg ctggatggtg gccatggata accgcttcgt gggcctgttc
4200





aaacataaat gcaaactcca cactgtcttt tggcacgttg tctgtgtcca ggaagcagta
4260





aagtttgaag tagaatttcc atggcaggtc cccaacctcg gatgtggcag ccagcttttc
4320





aaacttggct aagacatcag ctacgacggt tcgactttca atggctttgt cgacgtggcc
4380





gttgtattca aacaaagcaa acatgttcct gctgtcctcc atggccaggc ctcggatcag
4440





cttctccacc acctccccag cggtggtgtg ggagttgatg gtgatcttgc aggagccgcc
4500





gccatggcaa tagaccgtgg atgtcatttc ctgcctgtgg atcagagctt ctatttcatc
4560





tcgggaaggc acaaactctc ggcatttggt tttcttaaga gattcgtaag tgaagagagc
4620





gtatttttcc atctcggttc ctggaaactg ttcccgtatc cttttcagat ggaacttgag
4680





atacttgaga atccctcgac tcggcaggaa ggtgcagctc aggcatgtca ggatctgcca
4740





gctgtacagg ttgcccacac tgccggggtg gggcactttg ttggtctgtt tgataagctg
4800





gcagtacagc tcgtcccgca gaggtcgcag gtcatgccct gtctgtagga tgccctggat
4860





tattggaatt gggtcagaca tggactccag ttgctgcagg gaattgaata tcttgatggc
4920





ctcatcctga agggtggtat agcctttgtc tttcagcaag ttgagattta tgtccccata
4980





cggaaggggc aggagcgggg agtgcaaggg gtgatgggtg tatcgaagga tcgggttccg
5040





cttgtaaatc tgttccacca catccgagtt caggcagttc tccttgatat cttgaatcag
5100





ctgctgggtg ggggtgtcga tcggggcctt ggtgtcagtc acgttttgaa tgacactgga
5160





ccaccgggtg gcctcgttga gcagcttggt gtagagccgg taacagtgct tgcgcccgta
5220





cacggtgacg ttccagtagc ctgtctcttt gaatatcttc tcatctgggg ggacgacaga
5280





gcagaggctg ttgaggacca gggtccccag tttgagcgcg ttcttctctg aactcttgta
5340





gtaatccagg gaattgtggg tgagtacaaa ccaccgtttc ttcagtttca gtgaagacat
5400





ctttggactg ttcttcacct ctttgtgcaa ccatcctctc acgatgaatt cctggccctc
5460





cactctggtg tcccctttgg acctctgcag cagggttatc cagtggtgca tctcctccgg
5520





cgtgtcggcg ttgcagtgca gcacccggtt ggccgtgatg atcacaaacg agttgggtct
5580





atcagggctg tcagaggcac acacagaatc aatcagcccc acatccaagg tgcccacagc
5640





attctgtggg tttgcctgct catcatgcat ctcctggatc tcctggtccg tggacgcgtg
5700





gacctgactc agcacgctga accactggct ggcatcttct ggggactctg caatcaggtg
5760





gaaagtccta tcggccataa tgatgtcgat cccattctcc ttggtggtgt tatctatgat
5820





ctcttttgcc gttcgcactt ctacggtgcc cttgagcttc tcctcgctgt cgttttcaaa
5880





gtacatcagc ttggactggc ggaggacaaa ccagcgcttc ttccaatttc tcctggacag
5940





cgtggaggag cccccccctt ttttgtggag ccagccttgc ttgagggcct cctgcttgga
6000





gcggaaccac aagaaggttt catccttgag gacgcaccag cggcgtttcc aagagttcat
6060





caggccacct ttcatgtaca gaaagctgtg gaaatacggc agagtgacac agctgtacac
6120





agagtcacgc cggtatgaaa gctcatcatc tgtatcaaac ctggaatcaa agtcctcttc
6180





actatcttca aacgaggact gcgccccctc agagctgaac cggtaggcac ccgagctgtt
6240





gtaggtcccc acagagcagc ggtagtcggg ggaccactgg ctgccgtagg agttggagaa
6300





ggtcacgctg ctgccggaag tgatggcacc gtcctcatag tcatcctggt cgtagtcgta
6360





gtcgccgtct ggggagggca agtccccagc gttctggggc atgcagtagg tggactcgcc
6420





gctggaggag ttgtgtaggc tcccggagtc ctgcactgat ggggcgagca gcaccgtgct
6480





gtccgcactg gggctggtgg gcaccaccgt gtcgttcatg tatgggtcct cctctgaaga
6540





gtcatcgctg gtccggatgc cacttgttcg ctggtctgag tggccgtgct cgctggggtt
6600





gggggagtcc ttgaaggcgt cgtcgtcggc ttcgaagccc tcatcgacct cctcctctgg
6660





gtagggctgg ctgaagttga agttgggctt ctcctcgcat gcgctctcag ccagctcgct
6720





ggaaaattcg cttccccccg acagggaccg ctcgatattc cggacacact cgtcgatctc
6780





gtcgaaattg agggactcga ggaactcctg ggccgccctg cacgcttcct cctccagcct
6840





gcggagctcc tggtcccgcc gctcctgcag cttctgcagg gaagcctcgg tcagcgacag
6900





ctcctgctgc tccttcatgc gctgcaggtc ctcgatttct ttctccagac ggaggatctc
6960





ttccacctgc ttattttcct tctgtttctc cagttcacgg gtcagttcag cttccttctg
7020





gctcttctgc aaggcttcga gttcttgctg cttcctcgtt tcttcttcct gctgggcgcg
7080





gagctcggct tctcttcgct ctctctctct ttctctttct tcttcctccc gtttcttctt
7140





ttcttcctct tcctgtttct tcttttcttc ttgctccctt ttctctgcca gcaattgtct
7200





gtaaactctc cgagcaatct gacctctgag ttgcttctgg aaaactatgg ctgccttttt
7260





caggtgcaaa aatctcctcc tcagaaggaa tgctctgtaa ttcttctgta ttatcaccac
7320





acaataaagg acctttctgt attgtttccg tgctaagaag cccaagacat gggcccgaat
7380





caccatggcc gcgtggctca cttcctcttc cctccgcttc tccagtttct gttccaagga
7440





ttctcgaaga aataccttgg tcttccccag ctgccactcg ctgttggagg catcatagag
7500





ctgcagcagg ctcgtgcact tccctcggac gtcctcaggc agagccagat tcctcatcag
7560





cactttatac cttttgtaaa agtcctgaaa gggtcttcgg accgcatacc cagctttgcg
7620





gattctcaca gtctccagca tccctgagta ccgcagctgg ttcagcacaa ccgcctggtc
7680





aaactggtct ggcatcttct gcatgtttgg cttgatacag cgaacaaaga aaggattaga
7740





ggagcttagc gttgccatta aggaatgcag tgagtcaacc ttgaactgtg agctgactgt
7800





aggccgccga tgtttgcttc cacatttcaa ggtatcctgg ttgttgcggc ttgaaacatg
7860





ttcaaaaaga tcgtagataa agtcaaaccg gctttctctt agcaaattga gaaggtcatc
7920





tcgaaatgta tctctgttct tctccaagat acctcggaca tcatattgca cctctccagc
7980





atagtgcttc actccaaaat tgttaactgc aactctgggc ttcacataaa agtggttatt
8040





cgcatgctga ctgtgtagct tctccaataa ggtgctgtct gtggcttgag gaaaatggct
8100





ttcttcattg ataagggcta ggaggccaag tttcttctca atcaagtcca ggcattctcc
8160





attgtctatc cagtcaatat cttcccacac taatccttcc ctgctatatt ctagttgttc
8220





taaagaaaaa atatgcttgt tgaagtactc ctgaagtttc tcgtttgcat agtttatatt
8280





gaactgttca aagtgattaa cctcaaagtt ttcaaatcca aagatgtcga ggatgccaat
8340





agacttgaag tcctcattgc ctttgatcct gctgttgatc ttcttgatta cccactcaaa
8400





gcagcacgca tacagagcca tggccaggga gtccctgctg tctactgcct gttgaacatt
8460





gagaggcgtg aggatctctt ctcccctgag gaacattgat ctctgggtca aagcatctgt
8520





gagctgtgtt gggtccagcc caagtaactc cgcagatctg cccaaagctg ttttgaagga
8580





aacctgtgcc ccaccagcag tgataaattc tatgttccca agatgcagta taccagcaag
8640





cagcctcgac acttcccgaa cttcctcctt gctgaactgc atcacgtcca ttgccgtaat
8700





aacttcccta aaggattcct ggtcactgat tgtcttgtct tctacacatc cagactgatt
8760





caagtagtgg tagttttctg gcgtagataa ataaaattct tctctttctt catgttccag
8820





ccctgccagc agtgcataaa atatgtgata attcctttcc ccgggatttt gccttactac
8880





tcggttctgg gaagagagga tacaatctac aattctcccg ccctgaatat ttcctttctg
8940





acagatgttc agctgaacaa acttcccaaa gcgactagag ttgttgttgt acacggtctt
9000





cgcattgccg aaagcttcca tgatggggct gctttcaaga atagctcgtt caacacagga
9060





tgtcttctcc tttaaggaca attccaaaga ctgttgactg atgactgaca gaaacttgag
9120





gatcaattta gtgctttcgg ttttacctgc cccactttca cccttgatga ggatgcactg
9180





gttgtcgtgg cgcttccaca ggcagcggta gcactcgttg gcgatggcga agatgtgcgg
9240





gggcagctcg cccaggtggc gccggctgta ctgctccatg gtggcaggct cgtacagccc
9300





ggcgatgggc tggtaggggt tcacagaggc caggatggag ccgatgtagg tccatatttg
9360





atttctctta taccgctgga ataagttata catgatggag ccgccatgga gctctgtcaa
9420





ggacgccatg tcatccacgc cctcctcgtt cgtggggtgc atagcagtca ccttctggtg
9480





ggtaattgtg ctctgcttgt aagtgaatac ctgaccatag tctgtccgga agacgacgat
9540





gccttctgca caggaattta cagtacttgg aaaatgctgg ccattttctc tcagccagac
9600





ccgtgttccc tgtaaacaaa a
9621
















TABLE 12








hMX2 polypeptide sequence (SEQ ID NO:12)
















Phe Cys Leu Gln Gly Thr Arg Val Trp Leu Arg Glu Asn Gly Gln His



1               5                   10                  15





Phe Pro Ser Thr Val Asn Ser Cys Ala Glu Gly Ile Val Val Phe Arg


            20                  25                  30





Thr Asp Tyr Gly Gln Val Phe Thr Tyr Lys Gln Ser Thr Ile Thr His


        35                  40                  45





Gln Lys Val Thr Ala Met His Pro Thr Asn Glu Glu Gly Val Asp Asp


    50                  55                  60





Met Ala Ser Leu Thr Glu Leu His Gly Gly Ser Ile Met Tyr Asn Leu


65                  70                  75                  80





Phe Gln Arg Tyr Lys Arg Asn Gln Ile Trp Thr Tyr Ile Gly Ser Ile


                85                  90                  95





Leu Ala Ser Val Asn Pro Tyr Gln Pro Ile Ala Gly Leu Tyr Glu Pro


            100                 105                 110





Ala Thr Met Glu Gln Tyr Ser Arg Arg His Leu Gly Glu Leu Pro Pro


        115                 120                 125





His Ile Phe Ala Ile Ala Asn Glu Cys Tyr Arg Cys Leu Trp Lys Arg


    130                 135                 140





His Asp Asn Gln Cys Ile Leu Ile Lys Gly Glu Ser Gly Ala Gly Lys


145                 150                 155                 160





Thr Glu Ser Thr Lys Leu Ile Leu Lys Phe Leu Ser Val Ile Ser Gln


                165                 170                 175





Gln Ser Leu Glu Leu Ser Leu Lys Glu Lys Thr Ser Cys Val Glu Arg


            180                 185                 190





Ala Ile Leu Glu Ser Ser Pro Ile Met Glu Ala Phe Gly Asn Ala Lys


        195                 200                 205





Thr Val Tyr Asn Asn Asn Ser Ser Arg Phe Gly Lys Phe Val Gln Leu


    210                 215                 220





Asn Ile Cys Gln Lys Gly Asn Ile Gln Gly Gly Arg Ile Val Asp Cys


225                 230                 235                 240





Ile Leu Ser Ser Gln Asn Arg Val Val Arg Gln Asn Pro Gly Glu Arg


                245                 250                 255





Asn Tyr His Ile Phe Tyr Ala Leu Leu Ala Gly Leu Glu His Glu Glu


            260                 265                 270





Arg Glu Glu Phe Tyr Leu Ser Thr Pro Glu Asn Tyr His Tyr Leu Asn


        275                 280                 285





Gln Ser Gly Cys Val Glu Asp Lys Thr Ile Ser Asp Gln Glu Ser Phe


    290                 295                 300





Arg Glu Val Ile Thr Ala Met Asp Val Met Gln Phe Ser Lys Glu Glu


305                 310                 315                 320





Val Arg Glu Val Ser Arg Leu Leu Ala Gly Ile Leu His Leu Gly Asn


                325                 330                 335





Ile Glu Phe Ile Thr Ala Gly Gly Ala Gln Val Ser Phe Lys Thr Ala


            340                 345                 350





Leu Gly Arg Ser Ala Glu Leu Leu Gly Leu Asp Pro Thr Gln Leu Thr


        355                 360                 365





Asp Ala Leu Thr Gln Arg Ser Met Phe Leu Arg Gly Glu Glu Ile Leu


    370                 375                 380





Thr Pro Leu Asn Val Gln Gln Ala Val Asp Ser Arg Asp Ser Leu Ala


385                 390                 395                 400





Met Ala Leu Tyr Ala Cys Cys Phe Glu Trp Val Ile Lys Lys Ile Asn


                405                 410                 415





Ser Arg Ile Lys Gly Asn Glu Asp Phe Lys Ser Ile Gly Ile Leu Asp


            420                 425                 430





Ile Phe Gly Phe Glu Asn Phe Glu Val Asn His Phe Glu Gln Phe Asn


        435                 440                 445





Ile Asn Tyr Ala Asn Glu Lys Leu Gln Glu Tyr Phe Asn Lys His Ile


    450                 455                 460





Phe Ser Leu Glu Gln Leu Glu Tyr Ser Arg Glu Gly Leu Val Trp Glu


465                 470                 475                 480





Asp Ile Asp Trp Ile Asp Asn Gly Glu Cys Leu Asp Leu Ile Glu Lys


                485                 490                 495





Lys Leu Gly Leu Leu Ala Leu Ile Asn Glu Glu Ser His Phe Pro Gln


            500                 505                 510





Ala Thr Asp Ser Thr Leu Leu Glu Lys Leu His Ser Gln His Ala Asn


        515                 520                 525





Asn His Phe Tyr Val Lys Pro Arg Val Ala Val Asn Asn Phe Gly Val


    530                 535                 540





Lys His Tyr Ala Gly Glu Val Gln Tyr Asp Val Arg Gly Ile Leu Glu


545                 550                 555                 560





Lys Asn Arg Asp Thr Phe Arg Asp Asp Leu Leu Asn Leu Leu Arg Glu


                565                 570                 575





Ser Arg Phe Asp Phe Ile Tyr Asp Leu Phe Glu His Val Ser Ser Arg


            580                 585                 590





Asn Asn Gln Asp Thr Leu Lys Cys Gly Ser Lys His Arg Arg Pro Thr


        595                 600                 605





Val Ser Ser Gln Phe Lys Val Asp Ser Leu His Ser Leu Met Ala Thr


    610                 615                 620





Leu Ser Ser Ser Asn Pro Phe Phe Val Arg Cys Ile Lys Pro Asn Met


625                 630                 635                 640





Gln Lys Met Pro Asp Gln Phe Asp Gln Ala Val Val Leu Asn Gln Leu


                645                 650                 655





Arg Tyr Ser Gly Met Leu Glu Thr Val Arg Ile Arg Lys Ala Gly Tyr


            660                 665                 670





Ala Val Arg Arg Pro Phe Gln Asp Phe Tyr Lys Arg Tyr Lys Val Leu


        675                 680                 685





Met Arg Asn Leu Ala Leu Pro Glu Asp Val Arg Gly Lys Cys Thr Ser


    690                 695                 700





Leu Leu Gln Leu Tyr Asp Ala Ser Asn Ser Glu Trp Gln Leu Gly Lys


705                 710                 715                 720





Thr Lys Val Phe Leu Arg Glu Ser Leu Glu Gln Lys Leu Glu Lys Arg


                725                 730                 735





Arg Glu Glu Glu Val Ser His Ala Ala Met Val Ile Arg Ala His Val


            740                 745                 750





Leu Gly Phe Leu Ala Arg Lys Gln Tyr Arg Lys Val Leu Tyr Cys Val


        755                 760                 765





Val Ile Ile Gln Lys Asn Tyr Arg Ala Phe Leu Leu Arg Arg Arg Phe


    770                 775                 780





Leu His Leu Lys Lys Ala Ala Ile Val Phe Gln Lys Gln Leu Arg Gly


785                 790                 795                 800





Gln Ile Ala Arg Arg Val Tyr Arg Gln Leu Leu Ala Glu Lys Arg Glu


                805                 810                 815





Gln Glu Glu Lys Lys Lys Gln Glu Glu Glu Glu Lys Lys Lys Arg Glu


            820                 825                 830





Glu Glu Glu Arg Glu Arg Glu Arg Glu Arg Arg Glu Ala Glu Leu Arg


        835                 840                 845





Ala Gln Gln Glu Glu Glu Thr Arg Lys Gln Gln Glu Leu Glu Ala Leu


    850                 855                 860





Gln Lys Ser Gln Lys Glu Ala Glu Leu Thr Arg Glu Leu Glu Lys Gln


865                 870                 875                 880





Lys Glu Asn Lys Gln Val Glu Glu Ile Leu Arg Leu Glu Lys Glu Ile


                885                 890                 895





Glu Asp Leu Gln Arg Met Lys Glu Gln Gln Glu Leu Ser Leu Thr Glu


            900                 905                 910





Ala Ser Leu Gln Lys Leu Gln Glu Arg Arg Asp Gln Glu Leu Arg Arg


        915                 920                 925





Leu Glu Glu Glu Ala Cys Arg Ala Ala Gln Glu Phe Leu Glu Ser Leu


    930                 935                 940





Asn Phe Asp Glu Ile Asp Glu Cys Val Arg Asn Ile Glu Arg Ser Leu


945                 950                 955                 960





Ser Gly Gly Ser Glu Phe Ser Ser Glu Leu Ala Glu Ser Ala Cys Glu


                965                 970                 975





Glu Lys Pro Asn Phe Asn Phe Ser Gln Pro Tyr Pro Glu Glu Glu Val


            980                 985                 990





Asp Glu Gly Phe Glu Ala Asp Asp Asp Ala Phe Lys Asp Ser Pro Asn


        995                 1000                1005





Pro Ser Glu His Gly His Ser Asp Gln Arg Thr Ser Gly Ile Arg


    1010                1015                1020





Thr Ser Asp Asp Ser Ser Glu Glu Asp Pro Tyr Met Asn Asp Thr


    1025                1030                1035





Val Val Pro Thr Ser Pro Ser Ala Asp Ser Thr Val Leu Leu Ala


    1040                1045                1050





Pro Ser Val Gln Asp Ser Gly Ser Leu His Asn Ser Ser Ser Gly


    1055                1060                1065





Glu Ser Thr Tyr Cys Met Pro Gln Asn Ala Gly Asp Leu Pro Ser


    1070                1075                1080





Pro Asp Gly Asp Tyr Asp Tyr Asp Gln Asp Asp Tyr Glu Asp Gly


    1085                1090                1095





Ala Ile Thr Ser Gly Ser Ser Val Thr Phe Ser Asn Ser Tyr Gly


    1100                1105                1110





Ser Gln Trp Ser Pro Asp Tyr Arg Cys Ser Val Gly Thr Tyr Asn


    1115                1120                1125





Ser Ser Gly Ala Tyr Arg Phe Ser Ser Glu Gly Ala Gln Ser Ser


    1130                1135                1140





Phe Glu Asp Ser Glu Glu Asp Phe Asp Ser Arg Phe Asp Thr Asp


    1145                1150                1155





Asp Glu Leu Ser Tyr Arg Arg Asp Ser Val Tyr Ser Cys Val Thr


    1160                1165                1170





Leu Pro Tyr Phe His Ser Phe Leu Tyr Met Lys Gly Gly Leu Met


    1175                1180                1185





Asn Ser Trp Lys Arg Arg Trp Cys Val Leu Lys Asp Glu Thr Phe


    1190                1195                1200





Leu Trp Phe Arg Ser Lys Gln Glu Ala Leu Lys Gln Gly Trp Leu


    1205                1210                1215





His Lys Lys Gly Gly Gly Ser Ser Thr Leu Ser Arg Arg Asn Trp


    1220                1225                1230





Lys Lys Arg Trp Phe Val Leu Arg Gln Ser Lys Leu Met Tyr Phe


    1235                1240                1245





Glu Asn Asp Ser Glu Glu Lys Leu Lys Gly Thr Val Glu Val Arg


    1250                1255                1260





Thr Ala Lys Glu Ile Ile Asp Asn Thr Thr Lys Glu Asn Gly Ile


    1265                1270                1275





Asp Ile Ile Met Ala Asp Arg Thr Phe His Leu Ile Ala Glu Ser


    1280                1285                1290





Pro Glu Asp Ala Ser Gln Trp Phe Ser Val Leu Ser Gln Val His


    1295                1300                1305





Ala Ser Thr Asp Gln Glu Ile Gln Glu Met His Asp Glu Gln Ala


    1310                1315                1320





Asn Pro Gln Asn Ala Val Gly Thr Leu Asp Val Gly Leu Ile Asp


    1325                1330                1335





Ser Val Cys Ala Ser Asp Ser Pro Asp Arg Pro Asn Ser Phe Val


    1340                1345                1350





Ile Ile Thr Ala Asn Arg Val Leu His Cys Asn Ala Asp Thr Pro


    1355                1360                1365





Glu Glu Met His His Trp Ile Thr Leu Leu Gln Arg Ser Lys Gly


    1370                1375                1380





Asp Thr Arg Val Glu Gly Gln Glu Phe Ile Val Arg Gly Trp Leu


    1385                1390                1395





His Lys Glu Val Lys Asn Ser Pro Lys Met Ser Ser Leu Lys Leu


    1400                1405                1410





Lys Lys Arg Trp Phe Val Leu Thr His Asn Ser Leu Asp Tyr Tyr


    1415                1420                1425





Lys Ser Ser Glu Lys Asn Ala Leu Lys Leu Gly Thr Leu Val Leu


    1430                1435                1440





Asn Ser Leu Cys Ser Val Val Pro Pro Asp Glu Lys Ile Phe Lys


    1445                1450                1455





Glu Thr Gly Tyr Trp Asn Val Thr Val Tyr Gly Arg Lys His Cys


    1460                1465                1470





Tyr Arg Leu Tyr Thr Lys Leu Leu Asn Glu Ala Thr Arg Trp Ser


    1475                1480                1485





Ser Val Ile Gln Asn Val Thr Asp Thr Lys Ala Pro Ile Asp Thr


    1490                1495                1500





Pro Thr Gln Gln Leu Ile Gln Asp Ile Lys Glu Asn Cys Leu Asn


    1505                1510                1515





Ser Asp Val Val Glu Gln Ile Tyr Lys Arg Asn Pro Ile Leu Arg


    1520                1525                1530





Tyr Thr His His Pro Leu His Ser Pro Leu Leu Pro Leu Pro Tyr


    1535                1540                1545





Gly Asp Ile Asn Leu Asn Leu Leu Lys Asp Lys Gly Tyr Thr Thr


    1550                1555                1560





Leu Gln Asp Glu Ala Ile Lys Ile Phe Asn Ser Leu Gln Gln Leu


    1565                1570                1575





Glu Ser Met Ser Asp Pro Ile Pro Ile Ile Gln Gly Ile Leu Gln


    1580                1585                1590





Thr Gly His Asp Leu Arg Pro Leu Arg Asp Glu Leu Tyr Cys Gln


    1595                1600                1605





Leu Ile Lys Gln Thr Asn Lys Val Pro His Pro Gly Ser Val Gly


    1610                1615                1620





Asn Leu Tyr Ser Trp Gln Ile Leu Thr Cys Leu Ser Cys Thr Phe


    1625                1630                1635





Leu Pro Ser Arg Gly Ile Leu Lys Tyr Leu Lys Phe His Leu Lys


    1640                1645                1650





Arg Ile Arg Glu Gln Phe Pro Gly Thr Glu Met Glu Lys Tyr Ala


    1655                1660                1665





Leu Phe Thr Tyr Glu Ser Leu Lys Lys Thr Lys Cys Arg Glu Phe


    1670                1675                1680





Val Pro Ser Arg Asp Glu Ile Glu Ala Leu Ile His Arg Gln Glu


    1685                1690                1695





Met Thr Ser Thr Val Tyr Cys His Gly Gly Gly Ser Cys Lys Ile


    1700                1705                1710





Thr Ile Asn Ser His Thr Thr Ala Gly Glu Val Val Glu Lys Leu


    1715                1720                1725





Ile Arg Gly Leu Ala Met Glu Asp Ser Arg Asn Met Phe Ala Leu


    1730                1735                1740





Phe Glu Tyr Asn Gly His Val Asp Lys Ala Ile Glu Ser Arg Thr


    1745                1750                1755





Val Val Ala Asp Val Leu Ala Lys Phe Glu Lys Leu Ala Ala Thr


    1760                1765                1770





Ser Glu Val Gly Asp Leu Pro Trp Lys Phe Tyr Phe Lys Leu Tyr


    1775                1780                1785





Cys Phe Leu Asp Thr Asp Asn Val Pro Lys Asp Ser Val Glu Phe


    1790                1795                1800





Ala Phe Met Phe Glu Gln Ala His Glu Ala Val Ile His Gly His


    1805                1810                1815





His Pro Ala Pro Glu Glu Asn Leu Gln Val Leu Ala Ala Leu Arg


    1820                1825                1830





Leu Gln Tyr Leu Gln Gly Asp Tyr Thr Leu His Ala Ala Ile Pro


    1835                1840                1845





Pro Leu Glu Glu Val Tyr Ser Leu Gln Arg Leu Lys Ala Arg Ile


    1850                1855                1860





Ser Gln Ser Thr Lys Thr Phe Thr Pro Cys Glu Arg Leu Glu Lys


    1865                1870                1875





Arg Arg Thr Ser Phe Leu Glu Gly Thr Leu Arg Arg Ser Phe Arg


    1880                1885                1890





Thr Gly Ser Val Val Arg Gln Lys Val Glu Glu Glu Gln Met Leu


    1895                1900                1905





Asp Met Trp Ile Lys Glu Glu Val Ser Ser Ala Arg Ala Ser Ile


    1910                1915                1920





Ile Asp Lys Trp Arg Lys Phe Gln Gly Met Asn Gln Glu Gln Ala


    1925                1930                1935





Met Ala Lys Tyr Met Ala Leu Ile Lys Glu Trp Pro Gly Tyr Gly


    1940                1945                1950





Ser Thr Leu Phe Asp Val Glu Val Arg Thr Gly Cys His Val Leu


    1955                1960                1965





Gly Trp Ala Gly Cys Trp His Leu Arg Thr Trp Ile Thr Ala Lys


    1970                1975                1980





Phe Met Trp Arg Glu Asp Lys Met Glu His Phe Ala Leu Ser Thr


    1985                1990                1995





Ser Phe Phe Arg Ala Pro Lys Ile Val Pro Leu Thr Pro Pro Phe


    2000                2005                2010





Ser Ser Gln Phe Leu Phe Ser Cys Val Val Asn Ala Ser Val Ile


    2015                2020                2025





Leu Gly Met Asn Ala Lys Leu Arg Cys His Leu Phe Phe Tyr Pro


    2030                2035                2040





Ser Leu Gly Lys Leu


    2045
















TABLE 13








hMP nucleotide sequence (SEQ ID NO:13)

















ccaacttttg cagctccacc caggatgtgg cctcgctcca ccccagctgt gcgcctctct
60






ccacccctag gcgaaggcac tagaatttcc caaattaaga acgaagagga agtttggacc
120





ttttcggcca ccgctcgctt caatatggct gcccccaggg agagacgagg ctaccatgaa
180





ggagccgagc gcagaccctg agtccgtcac ccatggatcg cagcgcggag ttcaggaaat
240





ggaaggcgca atgtttgagc aaagcggacc tcagccggaa gggcagtgtt gacgaggatg
300





tggtagagct tgtgcagttt ctgaacatgc gagatcagtt tttcaccacc agctccttcg
360





ctggccgcat cctactcctt gaccggggta taaatggttt tgaggttcag aaacaaaact
420





gttgctggct actggttaca cacaaacttt gtgtaaaaga tgatgtgatt gtagctctga
480





agaaagcaaa tggtgatgcc actttgaaat ttgaaccatt tgttcttcat gtgcagtgtc
540





gacaattgca ggatgcacag attctgcatt ccatggcaat agattctggt ttcaggaact
600





ctggcataac ggtgggaaag agaggaaaaa ctatgttggc tgtccggagt acacatggct
660





tagaagttcc attaagccat aagggaaaac tgatggtgac agaggaatat attgacttcc
720





tgttaaatgt ggcaaatcaa aaaatggagg aaaacaagaa aagaattgag aggttttaca
780





actgcctaca gcatgctttg gaaagggaaa cgatgactaa cttacatccc aagatcaaag
840





agaaaaataa ctcatcatat attcataaga aaaaaagaaa cccagaaaaa acacgtgccc
900





agtgtattac taaagaaagt gatgaagaac ttgaaaatga tgatgatgat gatctaggaa
960





tcaatgttac catcttccct gaagattactaagctttggt tctgatgtgt cttggccgta 1020





atgtttctag taggttttat aaagctgctc ttcataagag tattttagtt tgttgagtgt
1080





atcagccatt cataagccag taatgacaag tgcagagctt caaactataa ctttgttgcc
1140





cagaggatgt gcagttgtca tctaagctct cagcagtacc cggcttatcc tacgacttca
1200





cctgaaatgc tatagttatc cctacttttt taccagtttc tcccagaagc acctgcttaa
1260





taaatcaaag atgtttgaaa aaaaaaaa
1288
















TABLE 14








hMP polypeptide sequence (SEQ ID NO:14)
















Met Asp Arg Ser Ala Glu Phe Arg Lys Trp Lys Ala Gln Cys Leu Ser



1               5                   10                  15





Lys Ala Asp Leu Ser Arg Lys Gly Ser Val Asp Glu Asp Val Val Glu


            20                  25                  30





Leu Val Gln Phe Leu Asn Met Arg Asp Gln Phe Phe Thr Thr Ser Ser


        35                  40                  45





Phe Ala Gly Arg Ile Leu Leu Leu Asp Arg Gly Ile Asn Gly Phe Glu


    50                  55                  60





Val Gln Lys Gln Asn Cys Cys Trp Leu Leu Val Thr His Lys Leu Cys


65                  70                  75                  80





Val Lys Asp Asp Val Ile Val Ala Leu Lys Lys Ala Asn Gly Asp Ala


                85                  90                  95





Thr Leu Lys Phe Glu Pro Phe Val Leu His Val Gln Cys Arg Gln Leu


            100                 105                 110





Gln Asp Ala Gln Ile Leu His Ser Met Ala Ile Asp Ser Gly Phe Arg


        115                 120                 125





Asn Ser Gly Ile Thr Val Gly Lys Arg Gly Lys Thr Met Leu Ala Val


    130                 135                 140





Arg Ser Thr His Gly Leu Glu Val Pro Leu Ser His Lys Gly Lys Leu


145                 150                 155                 160





Met Val Thr Glu Glu Tyr Ile Asp Phe Leu Leu Asn Val Ala Asn Gln


                165                 170                 175





Lys Met Glu Glu Asn Lys Lys Arg Ile Glu Arg Phe Tyr Asn Cys Leu


            180                 185                 190





Gln His Ala Leu Glu Arg Glu Thr Met Thr Asn Leu His Pro Lys Ile


        195                 200                 205





Lys Glu Lys Asn Asn Ser Ser Tyr Ile His Lys Lys Lys Arg Asn Pro


    210                 215                 220





Glu Lys Thr Arg Ala Gln Cys Ile Thr Lys Glu Ser Asp Glu Glu Leu


225                 230                 235                 240





Glu Asn Asp Asp Asp Asp Asp Leu Gly Ile Asn Val Thr Ile Phe Pro


                245                 250                 255





Glu Asp Tyr
















TABLE 15








NHR nucleotide sequence (SEQ ID NO:15)

















acgcgtgcag gtggcgtggc gccagggatt tgaaccgcgc tgacgaagtt tggtgatcca
60






tcttccgagt atcgccggga tttcgaatcg cgatgatcat cccctctcta gaggagctgg
120





actccctcaa gtacagtgac ctgcagaact tagccaagag tctgggtctc cgggccaacc
180





tgagggcaac caagttgtta aaagccttga aaggctacat taaacatgag gcaagaaaag
240





gaaatgagaa tcaggatgaa agtcaaactt ctgcatcctc ttgtgatgag actgagatac
300





agatcagcaa ccaggaagag ctgagagaca gccacttggc catgtcacca aaacaaggta
360





aaggtgcaag actgtccgtg tggaccctga ctcacagaga atcattcaga gataaaaata
420





agtaatccca ctgaattcca gaatcatgaa aagcaggaaa gccaggatct cagagcactg
480





caaaagttcc ttctccacca gacgagcacc aagaagctga gaatgctgtt tcctcaggta
540





acagagattc aaaggtacct tcagaaggaa agaaatctct ctacacagat gagtcatcca
600





aacctggaaa aaataaaaga actgcaatca ctactccaaa ctttaagaag cttcatgaag
660





ctcattttaa ggaaatggag tccattgatc caatatatng aggagaaaaa aagaaacatt
720





ttgaagaaca caattccatg aatgaactga agcagccgcc catcaataag ggaggggtca
780





ggactccagt acctccaaga ggaagactct ctgtggcttc tactcccatc agccaacgac
840





gctcgcaagg ccggtcttgt ggccctgcaa gtcagagtac cttgggtctg aaggggtcac
900





tcaagcgctc tgctatctct gcagctaaaa cgggtgtcag gttttcagct gctactaaag
960





ataatgagca taagcgttca ctgaccaaga ctccagccag aaagtctgca catgtgaccg
1020





tgtctggggg cacccaaaaa ggcgaggctg tgcttgggac acacaaatta aagaccatca
1080





cggggaattc tgctgctgtt attaccccat tcaagttgac aactgaggca acgcagactc
1140





cagtctccaa taagaaacca gtgtttgatc ttaaagcaag tttgtctcgt cccctcaact
1200





atgaaccaca caaaggaaag ctaaaaccat gggggcaatc taaagaaaat aattatctaa
1260





atcaacatgt caacaaatta acttctacaa gaaaacttac aaacaacccc atctccagac
1320





aaaggaagag caacggaaga aacgcgagca agaagaaagg agaagaaagc aaaggttttg
1380





ggaatgcgaa ggggcctcat tttggctgaa gattaataat tttttaacat cttgtaaata
1440





ttcctgtatt ctcaactttt ttccttttgt aaattttttt tttttgctgt catccccact
1500





ttagtcacga gatctttttc tgctaactgt tcatagtctg tgtagtgtcc atgggttctt
1560





catgtgctat gatctctgaa aagacgttat caccttaaag ctcaaattct ttgggatggt
1620





ttttacttaa gtccattaac aattcaggtt tctaacgaga cccatcctaa aattctcttt
1680





ctagtttttt aatgtcacca tcccaaactc ccgtttctgg atttttaatc cccagctccc
1740





cagttccctc ttatcgtact aatattaaca gaactgcagt cttctgctag ccaatagcat
1800





ttacctgatg gcagctagtt atgcaagctt caggagaatt tgaacaataa caagaatagg
1860





gtaagctggg atagaaaggc cacctcttca ctctctatag aatatagtaa cctttatgaa
1920





acggggccat atagtttggt tatgacatca atattttacc taggtgaaat tgtttaggct
1980





tatgtacctt cgttcaaata tcctcatgta attgccatct gtcactcact atattcacaa
2040





aaataaaact ctacaactca ttctaacatt gcttacttaa aagctacata gccctatcga
2100





aatgcgagga ttaatgcttt aatgctttta gagacagggt ctcactgtgt tgcccaggct
2160





ggtctcaaac tccaccaaat gtacttctta ttcattttat ggaaaagact aggctttgct
2220





tagtatcatg tccatgtttc cttcacctca gtggagcttc tgagttttat actgctcaag
2280





atcgtcataa ataaaatttt ttctcattgt caaaaaaaaa aaaaaaaaaa aaaaaaaaaa
2340





aaaaaaaaaa aa
2352
















TABLE 16








NHR polypeptide sequence (SEQ ID NO:16)
















Met Ile Ile Pro Ser Leu Glu Glu Leu Asp Ser Leu Lys Tyr Ser Asp



1               5                   10                  15





Leu Gln Asn Leu Ala Lys Ser Leu Gly Leu Arg Ala Asn Leu Arg Ala


            20                  25                  30





Thr Lys Leu Leu Lys Ala Leu Lys Gly Tyr Ile Lys His Glu Ala Arg


        35                  40                  45





Lys Gly Asn Glu Asn Gln Asp Glu Ser Gln Thr Ser Ala Ser Ser Cys


   50                  55                  60





Asp Glu Thr Glu Ile Gln Ile Ser Asn Gln Glu Glu Ala Glu Arg Gln


65                  70                  75                  80





Pro Leu Gly His Val Thr Lys Thr Arg Arg Arg Cys Lys Thr Val Arg


                85                  90                  95





Val Asp Pro Asp Ser Gln Gln Asn His Ser Glu Ile Lys Ile Ser Asn


            100                 105                 110





Pro Thr Glu Phe Gln Asn His Glu Lys Gln Glu Ser Gln Asp Leu Arg


        115                 120                 125





Ala Thr Ala Lys Val Pro Ser Pro Pro Asp Glu His Gln Glu Ala Glu


    130                 135                 140





Asn Ala Val Ser Ser Gly Asn Arg Asp Ser Lys Val Pro Ser Glu Gly


145                 150                 155                 160





Lys Lys Ser Leu Tyr Thr Asp Glu Ser Ser Lys Pro Gly Lys Asn Lys


                165                 170                 175





Arg Thr Ala Ile Thr Thr Pro Asn Phe Lys Lys Leu His Glu Ala His


            180                 185                 190





Phe Lys Glu Met Glu Ser Ile Asp Pro Ile Tyr Xaa Gly Glu Lys Lys


        195                 200                 205





Lys His Phe Glu Glu His Asn Ser Met Asn Glu Leu Lys Gln Pro Pro


    210                 215                 220





Ile Asn Lys Gly Gly Val Arg Thr Pro Val Pro Pro Arg Gly Arg Leu


225                 230                 235                 240





Ser Val Ala Ser Thr Pro Ile Ser Gln Arg Arg Ser Gln Gly Arg Ser


                245                 250                 255





Cys Gly Pro Ala Ser Gln Ser Thr Leu Gly Leu Lys Gly Ser Leu Lys


            260                 265                 270





Arg Ser Ala Ile Ser Ala Ala Lys Thr Gly Val Arg Phe Ser Ala Ala


        275                 280                 285





Thr Lys Asp Asn Glu His Lys Arg Ser Leu Thr Lys Thr Pro Ala Arg


    290                 295                 300





Lys Ser Ala His Val Thr Val Ser Gly Gly Thr Gln Lys Gly Glu Ala


305                 310                 315                 320





Val Leu Gly Thr His Lys Leu Lys Thr Ile Thr Gly Asn Ser Ala Ala


                325                 330                 335





Val Ile Thr Pro Phe Lys Leu Thr Thr Glu Ala Thr Gln Thr Pro Val


            340                 345                 350





Ser Asn Lys Lys Pro Val Phe Asp Leu Lys Ala Ser Leu Ser Arg Pro


        355                 360                 365





Leu Asn Tyr Glu Pro His Lys Gly Lys Leu Lys Pro Trp Gly Gln Ser


    370                 375                 380





Lys Glu Asn Asn Tyr Leu Asn Gln His Val Asn Arg Ile Asn Phe Tyr


385                 390                 395                 400





Lys Lys Thr Tyr Lys Gln Pro His Leu Gln Thr Lys Glu Glu Gln Arg


                405                 410                 415





Lys Lys Arg Glu Gln Glu Arg Lys Glu Lys Lys Ala Lys Val Leu Gly


            420                 425                 430





Met Arg Arg Gly Leu Ile Leu Ala Glu Asp


        435                 440









Table 17 displays alignment of hMX1, hMX2 with human myosin (SEQ ID NO:31; GenBank AF247457) (Berg et al., 2000). As seen from the alignment, hMX1 and hMX2 have a likely N-terminus of M N D residues. One of skill in the art can easily verify this observation by probing cDNA or genomic human libraries, or PCR techniques, to acquire the full length polynucleotide sequence.

TABLE 17Alignment of hMX1, hMX2 and human myosin X101---FCLQGTRVWLRENGQHFPSTVNSCAEGIVVFRTDYGQVFTYKQSTIT121---FCLQGTRVWLRENGQHFPSTVNSCAEGIVVFRTDYGQVFTYKQSTIThumX1MDNFFTEGTRVWLRENGQHFPSTVNSCAEGIVVFRTDYGQVFTYKQSTIT1048HQKVTAMHPTNEEGVDDMASLTELHGGSIMYNLFQRYKRNQIWTYIGSIL1248HQKVTAMHPTNEEGVDDMASLTELHGGSIMYNLFQRYKRNQIWTYIGSILhumX51HQKVTAMHPTNEEGVDDMASLTELHGGSIMYNLFQRYKRNQIYTYIGSIL1098ASVNPYQPIAGLYEPATMEQYSRRHLGELPPHIFAIANECYRCLWKRHDN1298ASVNPYQPIAGLYEPATMEQYSRRHLGELPPHIFAIANECYRCLWKRHDNhumX101ASVNPYQPIAGLYEPATMEQYSRRHLGELPPHIFAIANECYRCLWKRYDN10148QCILIKGESGAGKTESTKLILKFLSVISQQSLELSLKEKTSCVERAILES12148QCILIKGESGAGKTESTKLILKFLSVISQQSLELSLKEKTSCVERAILEShumX151QCILISGESGAGKTESTKLILKFLSVISQQSLELSLKEKTSCVERAILES10198SPIMEAFGNAKTVYNNNSSRFGKFVQLNICQKGNIQGGRIVDCILSSQNR12198SPIMEAFGNAKTVYNNNSSRFGKFVQLNICQKGNIQGGRIVDCILSSQNRhumX201SPIMEAFGNAKTVYNNNSSRFGKFVQLNICQKGNIQGGRIVDYLLE-KNR10248VVRQNPGERNYHIFYALLAGLEHEEREEFYLSTPENYHYLNQSGCVEDKT12248VVRQNPGERNYHIFYALLAGLEHEEREEFYLSTPENYHYLNQSGCVEDKThumX250VVRQNPGERNYHIFYALLAGLEHEEREEFYLSTPENYHYLNQSGCVEDKT10298ISDQESFREVITAMDVMQFSKEEVREVSRLLAGILHLGNIEFITAGGAQV12298ISDQESFREVITAMDVMQFSKEEVREVSRLLAGILHLGNIEFITAGGAQVhumX300ISDQESFREVITAMDVMQFSKEEVREVSRLLAGILHLGNIEFITAGGAQV10348SFKTALGRSAELLGLDPTQLTDALTQRSMFLRGEEILTPLNVQQAVDSRD12348SFKTALGRSAELLGLDPTQLTDALTQRSMFLRGEEILTPLNVQQAVDSRDhumX350SFKTALGRSAELLGLDPTQLTDALTQRSMFLRGEEILTPLNVQQAVDSRD10398SLAMALYACCFEWVIKKINSRIKGNEDFKSIGILDIFCFENFEVNHFEQF12398SLAMALYACCFEWVIKKINSRIKGNEDFKSIGILDIFGFENFEVNHFEQFhumX400SLAMALYACCFEWVIKKINSRIKGNEDFKSIGILDIFGFENFEVNHFEQF10448NINYANEKLQEYFNKHIFSLEQLEYSREGLVWEDIDWIDNCECLDLIEKK12448NINYANEKLQEYFNKHIFSLEQLEYSREGLVWEDIDWIDUGECLDLIEKKhumX450NINYANEKLQEYFNKHIFSLEQLEYSREGLVWEDIDWIDNCECLDLIEKK10498LGLLALINEESHFPQATDSTLLEKLHSQHANNHFYVKPRVAVNNFGVKHY12498LCLLALINEESHFPQATDSTLLEKLHSQHANNHFYVKPRVAVNNFGVKHYhumX500LGLLALINEESHFPQATDSTLLEKLHSQHANNHFYVKPRVAVNNFGVKHY10548AGEVQYDVRCILEKNRDTFRDDLLNLLRESRFDFIYDLFEHVSSRNNQDT12548AGEVQYDVRGILEKNRDTFRDDLLNLLRESRFDFIYDLFEHVSSRNNQDThumX550AGEVQYDVRGILEKNRDTFRDDLLNLLRESRFDFIYDLFEHVSSRNNQDT10598LKCGSKHRRPTVSSQFKVDSLHSLMATLSSSNPFFVRCIKPNMQKMPDQF12598LKCGSKHRRPTVSSQFKVDSLHSLMATLSSSNPFFVRCIKPNMQKMPDQFhumX600LKCGSKHRRPTVSSQFKDS-LHSLMATLSSSNPFFVRCIKPNMQKMPDQF10648DQAVVLNQLRYSGMLETVRIRKAGYAVRRPFQDFYKRYKVLMRNLALPED12648DQAVVLNQLRYSGMLETVRIRKAGYAVRRPFQDFYKRYKVLMRNLALPEDhumX649DQAVVLNQLRYSGMLETVRIRKAGYAVRRPFQDFYKRYKVLMRNLALPED10698VRGKCTSLLQLYDASNSEWQLGKTKVFLRESLEQKLEKRREEEVSHAAMV12698VRGKCTSLLQLYDASNSEWQLGKTKVFLRESLEQKLEKRREEEVSHAAMVhumX699VRGKCTSLLQLYDASNSEWQLGKTKVFLRESLEQKLEKRREEEVSHAAMV10748IRAHVLGFLARKQYRKVLYCVVIIQKNYRAFLLRRRFLHLKKAAIVFQKQ12748IRAHVLGFLARKQYRKVLYCVVIIQKNYRAFLLRRRFLHLKKAAIVFQKQhumX749IRAHVLGFLARKQYRKVLYCVVIIQKNYRAFLLRRRFLHLKKAAIVFQKQ10798LRGQIARRVYRQLLAEKREQEEKKKQEEEEKKKREEEERERERERREAEL12798LRGQIARRVYRQLLAEKREQEEKKKQEEEEKKKREEEERERERERREAELhumX799LRGQIARRVYRQLLAEKREQEEKKKQEEEEKKKREEEERERERERREAEL10848RAQQEEETRKQQELEALQKSQKEAELTRELEKQKENKQVEEILRLEKEIE12848RAQQEEETRKQQELEALQKSQKEAELTRELEKQKENKQVEEILRLEKEIEhumX849RAQQEEETRKQQELEALQKSQKEAELTRELEKQKEMKQVEEILRLEKEIE10898DLQRMKEQQELSLTEASLQKLQERRDQELRRLEEEACRAAQEFLESLNFD12898DLQRMKEQQELSLTEASLQKLQERRDQELRRLEEEACRAAQEFLESLNFDhumX899DLQRMKEQQELSLTEASLQKLQERRDQELRRLEEEACRAAQEFLESLNFD10948EIDECVRNIERSLSGGSEFSSELAESACEEKPNFNFSQPYPEEEVDEGFE12948EIDECVRNIERSLSGGSEFSSELAESACEEKPNFNFSQPYPEEEVDEGFEhumx949EIDECVRNIERSLSVGSEFSSELAESACEEKPNFNFSQPYPEEEVDEGFE10998ADDDAFKDSPNPSEHGHSDQRTSGIRTSDDSSEEDPYMNDTVVPTSPSAD12998ADDDAFKDSPNPSEHGHSDQRTSGIRTSDDSSEEDPYMMDTVVPTSPSADhumX999ADDDAFKDSPNPSEHGHSDQRTSGIRTSDDSSEEDPYMMDTVVPTSPSAD101048STVLLAPSVQDSGSLHNSSSGESTYCMPQNAGDLPSPDGDYDYDQDDYED121048STVLLAPSVQDSGSLHNSSSGESTYCMPQNAGDLPSPDGDYDYDQDDYEDhumX1049STVLLAPSVQDSGSLHNSSSGESTYCMPQNAGDLPSPDGDYDYDQDDYED101098GAITSGSSVTFSNSYGSQWSPDYRCSVGTYNSSGAYRFSSEGAQSSFEDS121098GAITSGSSVTFSNSYGSQWSPDYRCSVGTYNSSGAYRFSSEGAQSSFEDShumX1099GAITSGSSVTFSNSYGSQWSPDYRCSVGTYNSSGAYRFSSEGAQSSFEDS101148EEDFDSRFDTDDELSYRRDSVYSCVTLPYFHSFLYMKGGLMNSWKRRWCV121148EEDFDSRFDTDDELSYRRDSVYSCVTLPYFHSFLYMKGGLMNSWKRRWCVhumX1149EEDFDSRFDTDDELSYRRDSVYSCVTLPYFHSFLYMKGGLMNSWKRRWCV101198LKDETFLWFRSKQEALKQGWLHKKGGGSSTLSRRNWKKRWFVLRQSKLMY121198LKDETFLWFRSKQEALKQGWLHKKGGGSSTLSRRNWKKRWFVLRQSKLMYhumX1199LKDETFLWFRSKQEALKQGWLHKKGGGSSTLSRRNWKKRWFVLRQSKLMY101248FENDSEEKLKGTVEVRTAKEIIDNTTKENGIDIIMADRTFHLIAESPEDA121248FENDSEEKLKGTVEVRTAKEIIDNTTKENGIDIIMADRTFHLIAESPEDAhumX1249FENDSEEKLKGTVEVRTAKEIIDNTTKENGIDIIMADRTFHLIAESPEDA101298SQWFSVLSQVHASTDQEIQEMHDEQANPQNAVGTLDVGLIDSVCASDSPD121298SQWFSVLSQVHASTDQEIQEMHDEQANPQNAVGTLDVGLIDSVCASDSPDhumX1299SQWFSVLSQVHASTDQEIQEMHDEQANPQNAVGTLDVGLIDSVCASDSPD101348RPNSFVIITANRVLHCNADTPEEMHHWITLLQRSKGDTRVEGQEFIVRGW121348RPNSFVIITANRVLHCNADTPEEMHHWITLLQRSKGDTRVEGQEFIVRGWhumX1349RPNSFVIITANRVLHCNADTPEEMNHWITLLQRSKGDTRVEGQEFIVRGW101398LHKEVKNSPKMSSLKLKKRWFVLTHNSLDYYKSSEKNALKLGTLVLNSLC121398LHKEVKNSPKMSSLKLKKRWFVLTHNSLDYYKSSEKNALKLGTLVLNSLChumX1399LHKEVKNSPKMSSLKLKKRWFVLTHNSLDYYKSSEKNALKLGTLVLNSLC101448SVVPPDEKIFKETGYWNVTVYGRKHCYRLYTKLLNEATRWSSVIQNVTDT121448SVVPPDEKIFKETGYWNVTVYGRKHCYRLYTKLLNEATRWSSVIQNVTDThumX1449SVVPPDEKIFKETGYWNVTVYGRKHCYRLYTKLLNEATRWSSAIQNVTDT101498KAPIDTPTQQLIQDIKENCLNSDVVEQIYKRNPILRYTHHPLHSPLLPLP121498KAPIDTPTQQLIQDIKENCLNSDVVEQIYKRNPILRYTHHPLHSPLLPLPhumX1499KAPIDTPTQQLIQDIKENCLNSDVVEQIYKRNPILRYTHHPLHSPLLPLP101548YGDINLNLLKDKGYTTLQDEAIKIFNSLQQLESMSDPIPIIQGILQTGHD121548YGDINLNLLKDKGYTTLQDEAIKIFNSLQQLESMSDPIPIIQGILQTGHDhumX1549YGDINLNLLKDKGYTTLQDEAIKIFNSLQQLESMSDPIPIIQGILQTGHD101598LRPLRDELYCQLIKQTNKVPHPGSVGNLYSWQILTCLSCTFLPSRGILKY121598LRPLRDELYCQLIKQTNKVPHPGSVGNLYSWQILTCLSCTFLPSRGILKYhumX1599LRPLRDELYCQLIKQTNKVPHPGSVGNLYSWQILTCLSCTFLPSRGILKY101648LKFHLKRIREQFPGTEMEKYALFTYESLKKTKCREFVPSRDEIEALIHRQ121648LKFHLKRIREQFPGTEMEKYALFTYESLKKTKCREFVPSRDEIEALIHRQhumX1649LKFHLKRIREQFPGTEMEKYALFTYESLKKTKCREFVPSRDEIEALIHRQ101698EMTSTVYCHGGGSCKITINSHTTAGEVVEKLIRGLAMEDSRNMFALFEYN121698EMTSTVYCHGGGSCKITINSHTTAGEVVEKLIRGLAMEDSRNMFALFEYNhumX1699EMTSTVYCHGGGSCKITINSHTTAGEVVEKLIRGLAMEDSRNMFALFEYN101748GHVDKAIESRTVVADVLAKFEKLAATSEVGDLPWKFYFKLYCFLDTDNVP121748GHVDKAIESRTVVADVLAKFEKLAATSEVGDLPWKFYFKLYCFLDTDNVPhumX1749GHVDKAIESRTVVADVLAKFEKLAATSEVGDLPWKFYFKLYCFLDTDNVP101798KDSVEFAFMFEQAHEAVIHGHHPAPEENLQVLAALRLQYLQGDYTLHAAI121798KDSVEFAFMFEQAHEAVIHGHHPAPEENLQVLAALRLQYLQGDYTLHAAIhumX1799KDSVEFAFMFEQAHEAVIHGHHPAPEENLQVLAALRLQYLQGDYTLHAAI101848PPLEEVYSLQRLKARISQSTKTFTPCERLEKRRTSFLEGTLRRSFRTGSV121848PPLEEVYSLQRLKARISQSTKTFTPCERLEKRRTSFLEGTLRRSFRTGSVhumX1849PPLEEVYSLQRLKARISQSTKTFTPCERLEKRRTSFLEGTLRRSFRTGSV101898VRQKVEEEQMLDMWIKEEVSSARASIIDKWRKFQGMNQEQAMAKYMALIK121898VRQKVEEEQMLDMWIKEEVSSARASIIDKWRKFQGMNQEQAMAKYMALIKhumX1899VRQKVEEEQMLDMWIKEEVSSARASIIDKWRKFQGMNQEQAMAKYMALIK101948EWPGYGSTLFDVECKEGGFPQELWLGVSADAVSVYKRGEGRPLEVFQYEH121948EWPGYGSTLFDVEVRTG-CHVLGWAGCWHLRTWITAKFMWREDKMEHFALhumX1949EWPGYGSTLFDVECKEGGFPQELWLGVSADAVSVYKRGEGRPLEVFQYEH101998ILSFGAPLANTYKIVVDERELLFETSEVVDVAKLMKAYISMIVKKRYSTT121997STSFFRAPKIVPLTPPFSSQFLFSCVVNASVILGMNAKLRCHLFFYPSLGhumX1999ILSFGAPLANTYKIVVDERELLFETSEVVDVAKLMKAYISMIVKKRYSTT102048RSASSQGSSR122047KLhumX2049RSASSQGSSR


The invention also includes polypeptides and nucleotides having 80-100%, including 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 and 99%, sequence identity to SEQ ID NOS:1-16, as well as nucleotides encoding any of these polypeptides, and compliments of any of these nucleotides. In an alternative embodiment, polypeptides and/or nucleotides (and compliments thereof) identical to any one of, or more than one of, SEQ ID NOS:1-16 are excluded. In yet another embodiment, polypeptides and/or nucleotides (and compliments thereof) having 81-100% identical, including 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 and 99%, sequence identity to SEQ ID NOS:1-16 are excluded.


The nucleic acids and proteins of the invention are potentially useful in promoting wound healing, for example after organ transplantation, or in the treatment of myocardial infarction, but also in treating tumors, and in cancers, diabetic retinopathy, macular degeneration, psoriasis, and rheumatoid arthritis. For example, a cDNA encoding AAP may be useful in gene therapy, and AAP proteins may be useful when administered to a subject in need thereof. The novel nucleic acid encoding AAP, and the AAP proteins of the invention, or fragments thereof, may further be useful in diagnostic applications, wherein the presence or amount of the nucleic acid or the protein are to be assessed. These materials are further useful in the generation of Abs that bind immunospecifically to the novel substances of the invention for use in therapeutic or diagnostic methods.


Kelch-like Protein (KLP)


The putative protein encoded by KLP contains 1 putative BTB domain and 4 putative Kelch motifs. The BTB (broad complex, tramtrack, bric-a-brac)/POZ (poxvirus, zinc finger) domain is involved in protein protein interactions. The kelch motif is sixfold tandem element in the sequence of the Drosophila kelch ORF1 protein that also contains BTB. Kelch ORF1 localizes to the ring canals in the egg chamber and helps to organize the F-actin cytoskeleton (Adams et al., 2000). The repeated kelch motifs predict a conserved tertiary structure, a β-propeller. This module appears in many different polypeptide contexts and contains multiple potential protein-protein interaction sites. Members of this growing superfamily are present throughout the cell and extracellularly and have diverse activities (Adams et al., 2000). Such activities include cytoskeleton organization, as well as other morphological processes, gene expression, interactions with viruses, and various extracellular events, such as cell spreading.


Alignment with Drosophila kelch and other kelch-like proteins, human kelch-like protein (GenBank AAF20938 (SEQ ID NO:17)), hypothetical C. elegans (GenBank 061795 (SEQ ID NO:18) and the skeletal muscle-specific sarcosin (GenBank 060662 (SEQ ID NO:19); (Taylor et al., 1998)) reveals that the disclosed protein (SEQ ID NO:2) is a member of a new subfamily


KLP is associated with tube formation and angiogenesis because it is upregulated in the in vitro model of angiogenesis of Example 1. Kelch mediates cytoskeletal associations, it is involved in morphogenetic processes, such as tube formation, that depend on cytoskeletal arrangements and signaling. KLP represents an attractive target for small molecule drug therapy.


Human Ortholog of Mouse BAZF (hBAZF)


hBAZF (SEQ ID NO:4) is the human ortholog of mouse BAZF (GenBank AB011665; SEQ ID NO:20), BAZF is a Bcl-6 (LAZ3) homolog, a transcription repressor that controls germinal center formation and the T cell-dependent immune response. Expression of Bcl-6 negatively correlates with cellular proliferation: Bcl-6 suppresses growth associated with impaired mitotic S phase progression and apoptosis (Albagli et al., 1999).


BAZF contains a BTB/POZ domain and five repeats of the Kruppel-like zinc finger motifs, instead of 6 in Bcl-6 (Okabe et al., 1998). Expression of BAZF mRNA is relegated to heart and lung, unlike Bcl-6 mRNA, but is induced in activated lymphocytes as an immediate-early gene, like Bcl-6 (Okabe et al., 1998).


The hBAZF sequence was derived by using tblastn (protein query-translated database) (Altschul et al., 1997), with the mouse protein sequence (GenBank 088282; SEQ ID NO:21) that has homology to GenBank AC015918 (SEQ ID NO:22), a clone of Homo sapiens chromosome 17. Human BAZF contains five Kruppel-like zinc finger motif repeats and a BTB/POZ domain.


The peptide sequence, “RSQ . . . PQV” that is present in the human sequence, might represent an alternative spliced form of the gene. Aligment with mouse BAZF, and aligment with mouse and human Bcl-6 demonstrates that the four proteins are almost identical in this region, but only human BAZF has this inserted sequence.


hBAZF is upregulated in HUVE cells grown embedded in collagen gels but not as a monolayer grown on collagen. When HUVE cells are suspended in collagen, they do not proliferate. Analagous to the role of mBAZF plays a role in regulating cell proliferation (Okabe et al.,i998), hBAZF plays a roll in cell proliferation in HUVE suspended in collagen. Because of its high expression during vessel morphogenesis, hBAZF represents an excellent molecular marker, as well as an attractive target for various therapies to inhibit angiogenesis.


hmt-Elongation Factor G (hEF-G)


The original isolation of hEF-G (SEQ ID NO:6) is 84% identical and colinear with Rattus norvegicus nuclear encoded mitochondrial elongation factor G (GenBank L14684 (SEQ ID NO:23); (Barker et al., 1993). No human gene is described in GenBank. However, searching EST databases, the human gene is contained inside GenBank AC010936 (SEQ ID NO:24), a chromosome 3 clone. Aligment of hEF-G with rat mtEF-G and yeast EF-G 1 demonstrates that the novel sequence is the ortholog of rat nuclear-encoded mitochondrial elongation factor G.


Bacterial elongation factor G (EF-G) physically associates with translocation-competent ribosomes and facilitates transition to the subsequent codon through the coordinate binding and hydrolysis of GTP. The deduced amino acid sequence of hmt-EF-G reveals characteristic motifs shared by all GTP binding proteins. Therefore, similarly to other elongation factors, the enzymatic function of hmt-EF-G is predicted to depend on GTP binding and hydrolysis.


Hmt-EF-G is strongly induced (30-fold) in an in vitro model of angiogenesis (Example 1), and as such, hmt-EF-G represents an excellent molecular marker for vessel formation. Because of its putative localization to the mitochondrion, hmt-EF-G is also an attractive therapeutic target to treat disease states associated with mitochondrial dysfunction.


Human Thyroid Regulated Transcript (hTRG)


hTRG (SEQ ID NO:8) is the human ortholog of rat TRG, a novel thyroid transcript negatively regulated by TSH (GenBank KIAA1058 (SEQ ID NO:25); (Bonapace et al., 1990).


SEQ ID NO:25 appears to be a partial peptide since there are C. elegans homologous proteins of 2000 residues. Using tblastn (Altschul et al., 1997) against genomic sequences, the hTRG sequence (SEQ ID NO:8) was assembled.


In C. elegans, homologous proteins localize either to the plasma membrane or to the mitochondrial inner membrane. A partial sequence, KIAA0694 (SEQ ID NO:26) also localizes to the mitochondrial matrix. hTRG has a PH domain, and has weak homology to an extracellular fibronectin-binding protein precursor. SEQ ID NO:26 has homology to Drosophila DOS and mouse Gab-2 proteins; both of which are involved in signal transduction, acting as adapter proteins between receptors and kinases like Rasi (Hibi and Hirano, 2000).


Because of hTRG is upregulated during the in vitro model of angiogenesis (Example 1), and because of its homologies with adapter proteins, hTRG is likely to be involved in signal transduction between receptors and kinases. As such, hTRG represent an excellent candidate for small molecule drug therapy to modulate angiogenesis and treat angiogenesis-related diseases. In addition, because of its putative ability to respond to thyroid stimulating hormone (TSH), modulation of hTRG is useful to treat diseases related to TSH imbalance.


Human myosin X (hMX1 (SEQ ID NO:10) and hMX2 (SEQ ID NO:12)


The hMX proteins represent the human ortholog of bovine myosin X, (GenBank AAB39486; SEQ ID NO:27). Using tblastn (Altschul et al., 1997) and the bovine sequence, a series of genomic clones from human chromosome 5 were identified; GenBank AC010310 (SEQ ID NO:28) appears to contain the entire sequence. Interestingly, a partial cDNA sequence from mouse (GenBank AF184153; SEQ ID NO:29) localizes to a 0.8 cM interval on the short arm of chromosome 5, between the polymorphic microsatellite markers D5S416 and D5S2114. In this region lies the gene for familial chondrocalcinosis (CCAL2) (Rojas et al., 1999).


Another GenBank entry, AB018342 (SEQ ID NO:30) that represents the 3′ region of hMX, appears to encode an alternative splice form. Noteworthy, this variant (hMX2) has a very hydrophobic carboxy terminus, while the more prevalent form (hMX1) is hydrophilic and potentially interacts with DNA/RNA since it has homology to high mobility group box (HMG) and ribosomal proteins. Additionaly, a myosin head domain was found in the NH terminus, as well as a myosin talin domain, two calmodulin binding domains, four pleckstrin domains and a band 4.1 domain.


The band 4.1 domain represents a crossroads between cytoskeletal organization and signal transduction. The domain was first described in the red blood cell protein band 4.1. The ERM proteins ezrin, radixin, and moesin and the unconventional myosins VIla and X all possess the band 4.1 domain (Louvet-Vallee, 2000). The band 4.1 domain binds single transmembrane protein at the membrane-proximal region in the C-terminal cytoplasmic tail.


HMX is upregulated during angiogenesis in an in vitro model (Example 1). Because hMX contains the protein-protein interaction domains PH and band 4.1 domain, hMX1 and hMX2 are involved in angiogenesis, likely transducing signals from angiogenic factors, perhaps modulating the cytoskeleton.


Human Mitochondrial Protein (hMP)


Analysis of hMP (SEQ ID NO: 14) reveals several subdomain that are homologous to proteins involved in transport across membranes, K+ATPase α and γ chains. Further analysis indicates that hMP may bind DNA and or RNA, since hMP is homologous to histones and transcription factors, especially those possessing basic region plus leucine zipper domains.


Although PSORT analysis (Nakai and Horton, 1999) predicts nuclear localization (P=0.6), hMP may in fact be a nuclear-encoded mitochondrial protein. Homologies with mostly bacterial proteins and a PSORT prediction ofmitochondrial matrix space localization (P=0.4478) strongly support this contention.


Because hMP is upregulated in an in vitro model of angiogenesis (Example 1), and because of its homologies with mitochondrial and nuclear-localized polypeptides, hMP is important in vascular morphogenesis, most likely through either powering the cellular differentiation-redifferentiation process, and/or affecting changes in the nuclear matrix that change global gene expression. Alternatively, hMP may be a transcription factor for either the nuclear or mitochondrial genomes.


Nuclear Hormone Receptor (NHR)


NHR (SEQ ID NO: 16) has two domains: (1) the NH region is similar to Swi3 (yeast SWI/SNF complexes regulate transcription by chromatin remodeling), indicating a role in transcriptional regulation, and (2) the COOH region is similar to parathyroid hormone-related proteins that bind parathyroid hormones. PSORT (Nakai and Horton, 1999) predicts the protein to localize in the nucleus P=0.9600.


The identification of this new putative hormone receptor-transcriptional regulator and hBAZF suggest a novel human transcriptional pathway that resembles, to some extent, that of Bcl-6.


Bcl-6 suppresses transcription via the BTB domain, which recruits a complex containing SMRT, retinoid thyroid hormone receptor, nuclear receptor corepressor (N-CoR), mammalian Sin3A, and histone deacetylase (HDAC). hBAZF, which also possesses a BTB domain, might recruit a similar complex containing deacetylase. Expression data indicate that hBAZF is up-regulated while NHR is down-regulated. These data agree with other evidence related to tube formation. Testosterone (a steroid) and dexamethasone (a steroid-like molecule) strongly inhibit vessel formation, and all-trans retinoic acid (at-RA) and 9-cis retinoic acid (9-cis RA) stimulate capillary-like tubular structures (Lansink et al., 1998).


Upon angiogenic stimulation, endothelial cells may become incompetent to respond to anti-angiogenic responses mediated by hormones using a dual mechanism, sequestering hormnones and suppressing transcription. Because nHR is down-regulated during in vitro angiogenesis (Example 1), this polypeptide is likely to be involved in non-angiogenesis-specific gene transcription. nHR is an attractive therapeutic target, especially in therapies that are directed at preventing vascularization.


AAP Polynucleotides


One aspect of the invention pertains to isolated nucleic acid molecules that encode AAP or biologically-active portions thereof. Also included in the invention are nucleic acid fragments sufficient for use as hybridization probes to identify AAP-encoding nucleic acids (e.g., AAP mRNAs) and fragments for use as polymerase chain reaction (PCR) primers for the amplification and/or mutation of AAP molecules. A “nucleic acid molecule” includes DNA molecules (e.g., cDNA or genomic DNA), RNA molecules (e.g., mRNA), analogs of the DNA or RNA generated using nucleotide analogs, and derivatives, fragments and homologs. The nucleic acid molecule may be single-stranded or double-stranded, but preferably comprises double-stranded DNA.


1. Probes


Probes are nucleic acid sequences of variable length, preferably between at least about 10 nucleotides (nt), 100 nt, or many (e.g., 6,000 nt) depending on the specific use. Probes are used to detect identical, similar, or complementary nucleic acid sequences. Longer length probes can be obtained from a natural or recombinant source, are highly specific, and much slower to hybridize than shorter-length oligomer probes. Probes may be single- or double-stranded and designed to have specificity in PCR, membrane-based hybridization technologies, or ELISA-like technologies. Probes are substantially purified oligonucleotides that will hybridize under stringent conditions to at least optimally 12, 25, 50, 100, 150, 200, 250, 300, 350 or 400-consecutive sense strand nucleotide sequence of SEQ ID NOS:1, 3, 5, 7, 9, 11, 13, or 15; or an anti-sense strand nucleotide sequence of these sequences; or of a naturally occurring mutant of these sequences.


The full- or partial length native sequence AAP may be used to “pull out” similar (homologous) sequences (Ausubel et al., 1987; Sambrook, 1989), such as: (1) full-length or fragments of AAP cDNA from a cDNA library from any species (e.g. human, murine, feline, canine, bacterial, viral, retroviral, yeast), (2) from cells or tissues, (3) variants within a species, and (4) homologues and variants from other species. To find related sequences that may encode related genes, the probe may be designed to encode unique sequences or degenerate sequences. Sequences may also be genomic sequences including promoters, enhancer elements and introns of native sequence AAP.


For example, an AAP coding region in another species may be isolated using such probes. A probe of about 40 bases is designed, based on an AAP, and made. To detect hybridizations, probes are labeled using, for example, radionuclides such as 32P or 35S, or enzymatic labels such as alkaline phosphatase coupled to the probe via avidin-biotin systems. Labeled probes are used to detect nucleic acids having a complementary sequence to that of an AAP in libraries of cDNA, genomic DNA or mRNA of a desired species.


Such probes can be used as a part of a diagnostic test kit for identifying cells or tissues which mis-express an AAP, such as by measuring a level of an AAP in a sample of cells from a subject e.g., detecting AAP mRNA levels or determining whether a genomic AAP has been mutated or deleted.


2. Isolated Nucleic Acid


An isolated nucleic acid molecule is separated from other nucleic acid molecules that are present in the natural source of the nucleic acid. Preferably, an isolated nucleic acid is free of sequences that naturally flank the nucleic acid (i.e., sequences located at the 5′- and 3′-termini of the nucleic acid) in the genomic DNA of the organism from which the nucleic acid is derived. For example, in various embodiments, isolated AAP molecules can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of nucleotide sequences which naturally flank the nucleic acid molecule in genomic DNA of the cell/tissue from which the nucleic acid is derived (e.g., brain, heart, liver, spleen, etc.). Moreover, an isolated nucleic acid molecule, such as a cDNA molecule, can be substantially free of other cellular material or culture medium when produced by recombinant techniques, or of chemical precursors or other chemicals when chemically synthesized.


A nucleic acid molecule of the invention, e.g., a nucleic acid molecule having the nucleotide sequence of SEQ ID NOS:1, 3, 5, 7, 9, 11, 13 or 15, or a complement of this aforementioned nucleotide sequence, can be isolated using standard molecular biology techniques and the provided sequence information. Using all or a portion of the nucleic acid sequence of SEQ ID NOS:1, 3, 5, 7, 9, 11, 13 or 15 as a hybridization probe, AAP molecules can be isolated using standard hybridization and cloning techniques (Ausubel et al., 1987; Sambrook, 1989).


PCR amplification techniques can be used to amplify AAP using cDNA, mRNA or alternatively, genomic DNA, as a template and appropriate oligonucleotide primers. Such nucleic acids can be cloned into an appropriate vector and characterized by DNA sequence analysis. Furthermore, oligonucleotides corresponding to AAP sequences can be prepared by standard synthetic techniques, e.g., an automated DNA synthesizer.


3. Oligonucleotide


An oligonucleotide comprises a series of linked nucleotide residues, which oligonucleotide has a sufficient number of nucleotide bases to be used in a PCR reaction or other application. A short oligonucleotide sequence may be based on, or designed from, a genomic or cDNA sequence and is used to amplify, confirm, or reveal the presence of an identical, similar or complementary DNA or RNA in a particular cell or tissue. Oligonucleotides comprise portions of a nucleic acid sequence having about 10 nt, 50 nt, or 100 nt in length, preferably about 15 nt to 30 nt in length. In one embodiment of the invention, an oligonucleotide comprising a nucleic acid molecule less than 100 nt in length would further comprise at least 6 contiguous nucleotides of SEQ ID NOS:1, 3, 5, 7, 9, 11, 13 or 15, or a complement thereof. Oligonucleotides may be chemically synthesized and may also be used as probes.


4. Complementary Nucleic Acid Sequences; Binding


In another embodiment, an isolated nucleic acid molecule of the invention comprises a nucleic acid molecule that is a complement of the nucleotide sequence shown in SEQ ID NOS:1, 3, 5, 7, 9, 11, 13 or 15, or a portion of this nucleotide sequence (e.g., a fragment that can be used as a probe or primer or a fragment encoding a biologically-active portion of an AAP). A nucleic acid molecule that is complementary to the nucleotide sequence shown in SEQ ID NOS:1, 3, 5, 7, 9, 11, 13 or 15, is one that is sufficiently complementary to the nucleotide sequence shown in SEQ ID NOS:1, 3, 5, 7, 9, 11, 13 or 15, that it can hydrogen bond with little or no mismatches to the nucleotide sequence shown in SEQ ID NOS:1, 3, 5, 7, 9, 11, 13 or 15, thereby forming a stable duplex.


“Complementary” refers to Watson-Crick or Hoogsteen base pairing between nucleotides units of a nucleic acid molecule, and the term “binding” means the physical or chemical interaction between two polypeptides or compounds or associated polypeptides or compounds or combinations thereof. Binding includes ionic, non-ionic, van der Waals, hydrophobic interactions, and the like. A physical interaction can be either direct or indirect. Indirect interactions may be through or due to the effects of another polypeptide or compound. Direct binding refers to interactions that do not take. place through, or due to, the effect of another polypeptide or compound, but instead are without other substantial chemical intermediates.


Nucleic acid fragments are at least 6 (contiguous) nucleic acids or at least 4 (contiguous) amino acids, a length sufficient to allow for specific hybridization in the case of nucleic acids or for specific recognition of an epitope in the case of amino acids, respectively, and are at most some portion less than a full-length sequence. Fragments may be derived from any contiguous portion of a nucleic acid or amino acid sequence of choice.


5. Derivatives, and Analogs


Derivatives are nucleic acid sequences or amino acid sequences formed from the native compounds either directly or by modification or partial substitution. Analogs are nucleic acid sequences or amino acid sequences that have a structure similar to, but not identical to, the native compound but differ from it in respect to certain components or side chains. Analogs may be synthetic or from a different evolutionary origin and may have a similar or opposite metabolic activity compared to wild type. Homologs are nucleic acid sequences or amino acid sequences of a particular gene that are derived from different species.


Derivatives and analogs may be full length or other than full length, if the derivative or analog contains a modified nucleic acid or amino acid, as described below. Derivatives or analogs of the nucleic acids or proteins of the invention include, but are not limited to, molecules comprising regions that are substantially homologous to the nucleic acids or proteins of the invention, in various embodiments, by at least about 70%, 80%, or 95% identity (with a preferred identity of 80-95%) over a nucleic acid or amino acid sequence of identical size or when compared to an aligned sequence in which the alignment is done by a computer homology program known in the art, or whose encoding nucleic acid is capable of hybridizing to the complement of a sequence encoding the aforementioned proteins under stringent, moderately stringent, or low stringent conditions (Ausubel et al., 1987).


6. Homology


A “homologous nucleic acid sequence” or “homologous amino acid sequence,” or variations thereof, refer to sequences characterized by homology at the nucleotide level or amino acid level as discussed above. Homologous nucleotide sequences encode those sequences coding for isoforms of AAP. Isoforms can be expressed in different tissues of the same organism as a result of, for example, alternative splicing of RNA. Alternatively, different genes can encode isoforms. In the invention, homologous nucleotide sequences include nucleotide sequences encoding for an AAP of species other than humans, including, but not limited to: vertebrates, and thus can include, e.g., frog, mouse, rat, rabbit, dog, cat cow, horse, and other organisms. Homologous nucleotide sequences also include, but are not limited to, naturally occurring allelic variations and mutations of the nucleotide sequences set forth herein. A homologous nucleotide sequence does not, however, include the exact nucleotide sequence encoding human AAP. Homologous nucleic acid sequences include those nucleic acid sequences that encode conservative amino acid substitutions (see below) in SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 or 16, as well as a polypeptide possessing AAP biological activity. Various biological activities of the AAP are described below.


7. Open Reading Frames


The open reading frame (ORF) of an AAP gene encodes an AAP. An ORF is a nucleotide sequence that has a start codon (ATG) and terminates with one of the three “stop” codons (TAA, TAG, or TGA). In this invention, however, an ORF may be any part of a coding sequence that may or may not comprise a start codon and a stop codon. To achieve a unique sequence, preferable AAP ORFs encode at least 50 amino acids.


AAP Polypeptides


1. Mature


An AAP can encode a mature AAP. A “mature” form of a polypeptide or protein disclosed in the present invention is the product of a naturally occurring polypeptide or precursor form or proprotein. The naturally occurring polypeptide, precursor or proprotein includes, by way of nonlimiting example, the full-length gene product, encoded by the corresponding gene. Alternatively, it may be defined as the polypeptide, precursor or proprotein encoded by an open reading frame described herein. The product “mature” form arises, again by way of nonlimiting example, as a result of one or more naturally occurring processing steps as they may take place within the cell, or host cell, in which the gene product arises. Examples of such processing steps leading to a “mature” form of a polypeptide or protein include the cleavage of the N-terminal methionine residue encoded by the initiation codon of an open reading frame, or the proteolytic cleavage of a signal peptide or leader sequence. Thus a mature form arising from a precursor polypeptide or protein that has residues 1 to N, where residue 1 is the N-terminal methionine, would have residues 2 through N remaining after removal of the N-terminal methionine. Alternatively, a mature form arising from a precursor polypeptide or protein having residues 1 to N, in which an N-terminal signal sequence from residue 1 to residue M is cleaved, would have the residues from residue M+1 to residue N remaining. Further as used herein, a “mature” form of a polypeptide or protein may arise from a step of post-translational modification other than a proteolytic cleavage event. Such additional processes include, by way of non-limiting example, glycosylation, myristoylation or phosphorylation. In general, a mature polypeptide or protein may result from the operation of only one of these processes, or a combination of any of them.


2. Active


An active AAP polypeptide or AAP polypeptide fragment retains a biological and/or an immunological activity similar, but not necessarily identical, to an activity of a naturally-occuring (wild-type) AAP polypeptide of the invention, including mature forms. A particular biological assay, with or without dose dependency, can be used to determine AAP activity. A nucleic acid fragment encoding a biologically-active portion of AAP can be prepared by isolating a portion of SEQ ID NOS:1, 3, 5, 7, 9, 11, 13 or 15 that encodes a polypeptide having an AAP biological activity (the biological activities of the AAP are described below), expressing the encoded portion of AAP (e.g., by recombinant expression in vitro) and assessing the activity of the encoded portion of AAP. Immunological activity refers to the ability to induce the production of an antibody against an antigenic epitope possessed by a native AAP; biological activity refers to a function, either inhibitory or stimulatory, caused by a native AAP that excludes immunological activity.


AAP Nucleic Acid Variants and Hybridization


1. variant polynucleotides, genes and recombinant genes The invention further encompasses nucleic acid molecules that differ from the nucleotide sequences shown in SEQ ID NOS:1, 3, 5, 7, 9, 11, 13 or 15 due to degeneracy of the genetic code and thus encode the same AAP as that encoded by the nucleotide sequences shown in SEQ ID NO NOS:1, 3, 5, 7, 9, 11, 13 or 15. An isolated nucleic acid molecule of the invention has a nucleotide sequence encoding a protein having an amino acid sequence shown in SEQ ID NOS:2,4, 6, 8, 10,12, 14 or 16.


In addition to the AAP sequences shown in SEQ ID NOS:1, 3, 5, 7, 9, 11, 13 or 15, DNA sequence polymorphisms that change the amino acid sequences of the AAP may exist within a population. For example, allelic variation among individuals will exhibit genetic polymorphism in an AAP. The terms “gene” and “recombinant gene” refer to nucleic acid molecules comprising an open reading frame (ORF) encoding an AAP, preferably a vertebrate AAP. Such natural allelic variations can typically result in 1-5% variance in an AAP. Any and all such nucleotide variations and resulting amino acid polymorphisms in an AAP, which are the result of natural allelic variation and that do not alter the functional activity of an AAP are within the scope of the invention.


Moreover, AAP from other species that have a nucleotide sequence that differs from the human sequence of SEQ ID NOS:1, 3, 5, 7, 9, 11, 13 or 15, are contemplated. Nucleic acid molecules corresponding to natural allelic variants and homologues of an AAP cDNAs of the invention can be isolated based on their homology to an AAP of SEQ ID NOS:1, 3, 5, 7, 9, 11, 13 or 15 using cDNA-derived probes to hybridize to homologous AAP sequences under stringent conditions.


“AAP variant polynucleotide” or “AAP variant nucleic acid sequence” means a nucleic acid molecule which encodes an active AAP that (1) has at least about 80% nucleic acid sequence identity with a nucleotide acid sequence encoding a full-length native AAP, (2) a full-length native AAP lacking the signal peptide, (3) an extracellular domain of an AAP, with or without the signal peptide, or (4) any other fragment of a full-length AAP. Ordinarily, an AAP variant polynucleotide will have at least about 80% nucleic acid sequence identity, more preferably at least about 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, nucleic acid sequence identity and yet more preferably at least about 99% nucleic acid sequence identity with the nucleic acid sequence encoding a full-length native AAP. An AAP variant polynucleotide may encode a full-length native AAP lacking the signal peptide, an extracellular domain of an AAP, with or without the signal sequence, or any other fragment of a full-length AAP. Variants do not encompass the native nucleotide sequence.


Ordinarily, AAP variant polynucleotides are at least about 30 nucleotides in length, often at least about 60, 90, 120, 150, 180, 210, 240, 270, 300, 450, 600 nucleotides in length, more often at least about 900 nucleotides in length, or more.


“Percent (%) nucleic acid sequence identity” with respect to AAP-encoding nucleic acid sequences identified herein is defined as the percentage of nucleotides in a candidate sequence that are identical with the nucleotides in the AAP sequence of interest, after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent sequence identity. Alignment for purposes of determining % nucleic acid sequence identity can be achieved in various ways that are within the skill in the art, for instance, using publicly available computer software such as BLAST, BLAST-2, ALIGN or Megalign (DNASTAR) software. Those skilled in the art can determine appropriate parameters for measuring alignment, including any algorithms needed to achieve maximal alignment over the full length of the sequences being compared.


When nucleotide sequences are aligned, the % nucleic acid sequence identity of a given nucleic acid sequence C to, with, or against a given nucleic acid sequence D (which can alternatively be phrased as a given nucleic acid sequence C that has or comprises a certain % nucleic acid sequence identity to, with, or against a given nucleic acid sequence D) can be calculated as follows:

% nucleic acid sequence identity=W/Z·100


where


W is the number of nucleotides cored as identical matches by the sequence alignment program's or algorithm's alignment of C and D


and


Z is the total number of nucleotides in D.


When the length of nucleic acid sequence C is not equal to the length of nucleic acid sequence D, the % nucleic acid sequence identity of C to D will not equal the % nucleic acid sequence identity of D to C.


2. Stringency


Homologs (i.e., nucleic acids encoding an AAP derived from species other than human) or other related sequences (e.g., paralogs) can be obtained by low, moderate or high stringency hybridization with all or a portion of the particular human sequence as a probe using methods well known in the art for nucleic acid hybridization and cloning.


The specificity of single stranded DNA to hybridize complementary fragments is determined by the “stringency” of the reaction conditions. Hybridization stringency increases as the propensity to form DNA duplexes decreases. In nucleic acid hybridization reactions, the stringency can be chosen to either favor specific hybridizations (high stringency), which can be used to identify, for example, full-length clones from a library. Less-specific hybridizations (low stringency) can be used to identify related, but not exact, DNA molecules (homologous, but not identical) or segments.


DNA duplexes are stabilized by: (1) the number of complementary base pairs, (2) the type of base pairs, (3) salt concentration (ionic strength) of the reaction mixture, (4) the temperature of the reaction, and (5) the presence of certain organic solvents, such as formamide which decreases DNA duplex stability. In general, the longer the probe, the higher the temperature required for proper annealing. A common approach is to vary the temperature: higher relative temperatures result in more stringent reaction conditions. (Ausubel et al., 1987) provide an excellent explanation of stringency of hybridization reactions.


To hybridize under “stringent conditions” describes hybridization protocols in which nucleotide sequences at least 60% homologous to each other remain hybridized. Generally, stringent conditions are selected to be about 5° C. lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the temperature (under defined ionic strength, pH and nucleic acid concentration) at which 50% of the probes complementary to the target sequence hybridize to the target sequence at equilibrium. Since the target sequences are generally present at excess, at Tm, 50% of the probes are occupied at equilibrium.


(a) High Stringency


“Stringent hybridization conditions” conditions enable a probe, primer or oligonucleotide to hybridize only to its target sequence. Stringent conditions are sequence-dependent and will differ. Stringent conditions comprise: (1) low ionic strength and high temperature washes (e.g. 15 mM sodium chloride, 1.5 mM sodium citrate, 0.1 % sodium dodecyl sulfate at 50° C.); (2) a denaturing agent during hybridization (e.g. 50% (v/v) formamide, 0.1% bovine serum albumin, 0.1% Ficoll, 0.1% polyvinylpyrrolidone, 50 mM sodium phosphate buffer (pH 6.5; 750 mM sodium chloride, 75 mM sodium citrate at 42° C.); or (3) 50% formamide. Washes typically also comprise 5×SSC (0.75 M 10 NaCl, 75 mM sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5× Denhardt's solution, sonicated salmon sperm DNA (50 μg/ml), 0.1% SDS, and 10% dextran sulfate at 42° C., with washes at 42° C. in 0.2×SSC (sodium chloride/sodium citrate) and 50% formamide at 55° C., followed by a high-stringency wash consisting of 0.1×SSC containing EDTA at 55° C. Preferably, the conditions are such that sequences at least about 65%, 70%, 75%, 85%, 90%, 95%, 98%, or 99% homologous to each other typically remain hybridized to each other. These conditions are presented as examples and are not meant to be limiting.


(b) Moderate Stringency


“Moderately stringent conditions” use washing solutions and hybridization conditions that are less stringent (Sambrook, 1989), such that a polynucleotide will hybridize to the entire, fragments, derivatives or analogs of SEQ ID NOS:1, 3, 5, 7, 9, 11, 13 or 15. One example comprises hybridization in 6×SSC, 5× Denhardt's solution, 0.5% SDS and 100 mg/ml denatured salmon sperm DNA at 55° C., followed by one or more washes in 1×SSC, 0.1% SDS at 37° C. The temperature, ionic strength, etc., can be adjusted to accommodate experimental factors such as probe length. Other moderate stringency conditions are described in (Ausubel et al., 1987; Kriegler, 1990).


(c) Low Stringency


“Low stringent conditions” use washing solutions and hybridization conditions that are less stringent than those for moderate stringency (Sambrook, 1989), such that a polynucleotide will hybridize to the entire, fragments, derivatives or analogs of SEQ ID NOS:1, 3, 5, 7, 9, 11, 13 or 15. A non-limiting example of low stringency hybridization conditions are hybridization in 35% formnamide, 5×SSC, 50 mM Tris-HCl (pH 7.5),5 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100 mg/ml denatured salmon sperm DNA, 10% (wt/vol) dextran sulfate at 40° C., followed by one or more washes in 2×SSC, 25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS at 50° C. Other conditions of low stringency, such as those for cross-species hybridizations are described in (Ausubel et al., 1987; Kriegler, 1990; Shilo and Weinberg, 1981).


3. Conservative Mutations


In addition to naturally-occurring allelic variants of AAP, changes can be introduced by mutation into SEQ ID NO NOS:1, 3, 5, 7, 9, 11, 13 or 15 sequences that incur alterations in the amino acid sequences of the encoded AAP that do not alter the AAP function. For example, nucleotide substitutions leading to amino acid substitutions at “non-essential” amino acid residues can be made in the sequence of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 or 16. A “non-essential” amino acid residue is a residue that can be altered from the wild-type sequences of the AAP without altering their biological activity, whereas an “essential” amino acid residue is required for such biological activity. For example, amino acid residues that are conserved among the AAP of the invention are predicted to be particularly non-amenable to alteration. Amino acids for which conservative substitutions can be made are well-known in the art.


Useful conservative substitutions are shown in Table A, “Preferred substitutions.” Conservative substitutions whereby an amino acid of one class is replaced with another amino acid of the same type fall within the scope of the subject invention so long as the substitution does not materially alter the biological activity of the compound. If such substitutions result in a change in biological activity, then more substantial changes, indicated in Table B as exemplary are introduced and the products screened for an AAP polypeptide's biological activity.

TABLE APreferred substitutionsExemplaryPreferredOriginal residuesubstitutionssubstitutionsAla (A)Val, Leu, IleValArg (R)Lys, Gln, AsnLysAsn (N)Gln, His, Lys, ArgGlnAsp (D)GluGluCys (C)SerSerGln (Q)AsnAsnGlu (E)AspAspGly (G)Pro, AlaAlaHis (H)Asn, Gln, Lys, ArgArgIle (I)Leu, Val, Met, Ala, Phe,LeuNorleucineLeu (L)Norleucine, Ile, Val, Met, Ala,IlePheLys (K)Arg, Gln, AsnArgMet (M)Leu, Phe, IleLeuPhe (F)Leu, Val, Ile, Ala, TyrLeuPro (P)AlaAlaSer (S)ThrThrThr (T)SerSerTrp (W)Tyr, PheTyrTyr (Y)Trp, Phe, Thr, SerPheVal (V)Ile, Leu, Met, Phe, Ala,LeuNorleucine


Non-conservative substitutions that effect (1) the structure of the polypeptide backbone, such as a β-sheet or α-helical conformation, (2) the charge or (3) hydrophobicity, or (4) the bulk of the side chain of the target site can modify an AAP polypeptide's function or immunological identity. Residues are divided into groups based on common side-chain properties as denoted in Table B. Non-conservative substitutions entail exchanging a member of one of these classes for another class. Substitutions may be introduced into conservative substitution sites or more preferably into non-conserved sites.

TABLE BAmino acid classesClassAmino acidshydrophobicNorleucine, Met, Ala, Val, Leu, Ileneutral hydrophilicCys, Ser, ThracidicAsp, GlubasicAsn, Gln, His, Lys, Argdisrupt chain conformationGly, ProaromaticTrp, Tyr, Phe


The variant polypeptides can be made using methods known in the art such as oligonucleotide-mediated (site-directed) mutagenesis, alanine scanning, and PCR mutagenesis. Site-directed mutagenesis (Carter, 1986; Zoller and Smith, 1987), cassette mutagenesis, restriction selection mutagenesis (Wells et al., 1985) or other known techniques can be performed on the cloned DNA to produce the AAP variant DNA (Ausubel et al., 1987; Sambrook, 1989).


In one embodiment, the isolated nucleic acid molecule comprises a nucleotide sequence encoding a protein, wherein the protein comprises an amino acid sequence at least about 45%, preferably 60%, more preferably 70%, 80%, 90%, and most preferably about 95% homologous to SEQ ID NOS:1, 3, 5, 7, 9, 11, 13, or 15.


A mutant AAP can be assayed for blocking angiogenesis in vitro.


4. Anti-sense Nucleic Acids


Using antisense and sense AAP oligonucleotides can prevent AAP polypeptide expression. These oligonucleotides bind to target nucleic acid sequences, forming duplexes that block transcription or translation of the target sequence by enhancing degradation of the duplexes, terminating prematurely transcription or translation, or by other means.


Antisense or sense oligonucleotides are singe-stranded nucleic acids, either RNA or DNA, which can bind a target AAP mRNA (sense) or an AAP DNA (antisense) sequences. Anti-sense nucleic acids can be designed according to Watson and Crick or Hoogsteen base pairing rules. The anti-sense nucleic acid molecule can be complementary to the entire coding region of an AAP mRNA, but more preferably, to only a portion of the coding or noncoding region of an AAP mRNA. For example, the anti-sense oligonucleotide can be complementary to the region surrounding the translation start site of an AAP mRNA. Antisense or sense oligonucleotides may comprise a fragment of the AAP DNA coding region of at least about 14 nucleotides, preferably from about 14 to 30 nucleotides. In general, antisense RNA or DNA molecules can comprise at least 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100 bases in 25 length or more. Among others, (Stein and Cohen, 1988; van der Krol et al., 1988a) describe methods to derive antisense or a sense oligonucleotides from a given cDNA sequence.


Examples of modified nucleotides that can be used to generate the anti-sense nucleic acid include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyl uracil, dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D-mannosylqueosine, 5′-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, the anti-sense nucleic acid can be produced biologically using an expression vector into which a nucleic acid has been sub-cloned in an anti-sense orientation such that the transcribed RNA will be complementary to a target nucleic acid of interest.


To introduce antisense or sense oligonucleotides into target cells (cells containing the target nucleic acid sequence), any gene transfer method may be used. Examples of gene transfer methods include (1) biological, such as gene transfer vectors like Epstein-Barr virus or conjugating the exogenous DNA to a ligand-binding molecule, (2) physical, such as electroporation and injection, and (3) chemical, such as CaPO4 precipitation and oligonucleotide-lipid complexes.


An antisense or sense oligonucleotide is inserted into a suitable gene transfer retroviral vector. A cell containing the target nucleic acid sequence is contacted with the recombinant retroviral vector, either in vivo or ex vivo. Examples of suitable retroviral vectors include those derived from the murine retrovirus M-MuLV, N2 (a retrovirus derived from M-MuLV), or the double copy vectors designated DCT5A, DCT5B and DCT5C (WO 90/13641, 1990). To achieve sufficient nucleic acid molecule transcription, vector constructs in which the transcription of the anti-sense nucleic acid molecule is controlled by a strong pol II or pol III promoter are preferred.


To specify target cells in a mixed population of cells cell surface receptors that are specific to the target cells can be exploited. Antisense and sense oligonucleotides are conjugated to a ligand-binding molecule, as described in (WO 91/04753, 1991). Ligands are chosen for receptors that are specific to the target cells. Examples of suitable ligand-binding molecules include cell surface receptors, growth factors, cytokines, or other ligands that bind to cell surface receptors or molecules. Preferably, conjugation of the ligand-binding molecule does not substantially interfere with the ability of the receptors or molecule to bind the ligand-binding molecule conjugate, or block entry of the sense or antisense oligonucleotide or its conjugated version into the cell.


Liposomes efficiently transfer sense or an antisense oligonucleotide to cells (WO 90/10448, 1990). The sense or antisense oligonucleotide-lipid complex is preferably dissociated within the cell by an endogenous lipase.


The anti-sense nucleic acid molecule of the invention may be an α-anomeric nucleic acid molecule. An α-anomeric nucleic acid molecule forms specific double-stranded hybrids with complementary RNA in which, contrary to the usual α-units, the strands run parallel to each other (Gautier et al., 1987). The anti-sense nucleic acid molecule can also comprise a 2′-o-methylribonucleotide (Inoue et al., 1987a) or a chimeric RNA-DNA analogue (Inoue et al., 1987b).


In one embodiment, an anti-sense nucleic acid of the invention is a ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity that are capable of cleaving a single-stranded nucleic acid, such as an mRNA, to which they have a complementary region. Thus, ribozymes, such as hammerhead ribozymes (Haseloff and Gerlach, 1988) can be used to catalytically cleave AAP mRNA transcripts and thus inhibit translation. A ribozyme specific for an AAP-encoding nucleic acid can be designed based on the nucleotide sequence of an AAP cDNA (i.e., SEQ ID NOS:1, 3, 5, 7, 9, 11, 13 or 15). For example, a derivative of a Tetrahymena L-19 IVS RNA can be constructed in 20 which the nucleotide sequence of the active site is complementary to the nucleotide sequence to be cleaved in an AAP-encoding mRNA (Cech et al., U.S. Pat. No. 5,116,742, 1992; Cech et al., U.S. Pat. No. 4,987,071, 1991). An AAP mRNA can also be used to select a catalytic RNA having a specific ribonuclease activity from a pool of RNA molecules (Bartel and Szostak, 1993).


Alternatively, AAP expression can be inhibited by targeting nucleotide sequences complementary to the regulatory region of an AAP (e.g., an AAP promoter and/or enhancers) to form triple helical structures that prevent transcription of an AAP in target cells (Helene, 1991; Helene et al., 1992; Maher, 1992).


Modifications of antisense and sense oligonucleotides can augment their effectiveness. Modified sugar-phosphodiester bonds or other sugar linkages (WO 91/06629, 1991), increase in vivo stability by conferring resistance to endogenous nucleases without disrupting binding specificity to target sequences. Other modifications can increase the affinities of the oligonucleotides for their targets, such as covalently linked organic moieties (WO 90/10448, 1990) or poly-(L)-lysine. Other attachments modify binding specificities of the oligonucleotides for their targets, including metal complexes or intercalating (e.g. ellipticine) and alkylating agents.


For example, the deoxyribose phosphate backbone of the nucleic acids can be modified to generate peptide nucleic acids (Hyrup and Nielsen, 1996). “Peptide nucleic acids” or “PNAs” refer to nucleic acid mimics (e.g., DNA mimics) in that the deoxyribose phosphate backbone is replaced by a pseudopeptide backbone and only the four natural nucleobases are retained The neutral backbone of PNAs allows for specific hybridization to DNA and RNA under conditions of low ionic strength. The synthesis of PNA oligomers can be performed using standard solid phase peptide synthesis protocols (Hyrup and Nielsen, 1996; Perry-O'Keefe et al., 1996).


PNAs of an AAP can be used in therapeutic and diagnostic applications. For example, PNAs can be used as anti-sense or antigene agents for sequence-specific modulation of gene expression by inducing transcription or translation arrest or inhibiting replication. AAP PNAs may also be used in the analysis of single base pair mutations (e.g., PNA directed PCR clamping; as artificial restriction enzymes when used in combination with other enzymes, e.g., Si nucleases (Hyrup and Nielsen, 1996); or as probes or primers for DNA sequence and hybridization (Hyrup and Nielsen, 1996; Perry-O'Keefe et al., 1996).


PNAs of an AAP can be modified to enhance their stability or cellular uptake. Lipophilic or other helper groups may be attached to PNAs, PNA-DNA dimmers formed, or the use of liposomes or other drug delivery techniques. For example, PNA-DNA chimeras can be generated that may combine the advantageous properties of PNA and DNA. Such chimeras allow DNA recognition enzymes (e.g., RNase H and DNA polymerases) to interact with the DNA portion while the PNA portion provides high binding affinity and specificity. PNA-DNA chimeras can be linked using linkers of appropriate lengths selected in terms of base stacking, number of bonds between the nucleobases, and orientation (Hyrup and Nielsen, 1996). The synthesis of PNA-DNA chimeras can be performed (Finn et al., 1996; Hyrup and Nielsen, 1996). For example, a DNA chain can be synthesized on a solid support using standard phosphoramidite coupling chemistry, and modified nucleoside analogs, e.g., 5′-(4-methoxytrityl)amino-5′-deoxy-thymidine phosphoramidite, can be used between the PNA and the 5′ end of DNA (Finn et al., 1996; Hyrup and Nielsen, 1996). PNA monomers are then coupled in a stepwise manner to produce a chimeric molecule with a 5′ PNA segment and a 3′ DNA segment (Finn et al., 1996). Alternatively, chimeric molecules can be synthesized with a 5′ DNA segment and a 3′ PNA segment (Petersen et al., 1976).


The oligonucleotide may include other appended groups such as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating transport across the cell membrane (lemaitre et al., 1987; Letsinger et al., 1989) or PCT Publication No. WO88/09810) or the blood-brain barrier (e.g., PCT Publication No. WO 89/10134). In addition, oligonucleotides can be modified with hybridization-triggered cleavage agents (van der Krol et al., 1988b) or intercalating agents (Zon, 1988). The oligonucleotide may be conjugated to another molecule, e.g., a peptide, a hybridization triggered cross-linking agent, a transport agent, a hybridization-triggered cleavage agent, and the like.


AAP Polypeptides


One aspect of the invention pertains to isolated AAP, and biologically-active portions derivatives, fragments, analogs or homologs thereof. Also provided are polypeptide fragments suitable for use as immunogens to raise anti-AAP Abs. In one embodiment, a native AAP can be isolated from cells or tissue sources by an appropriate purification scheme using standard protein purification techniques. In another embodiment, AAP are produced by recombinant DNA techniques. Alternative to recombinant expression, an AAP can be synthesized chemically using standard peptide synthesis techniques.


1. Polypeptides


An AAP polypeptide includes the amino acid sequence of an AAP whose sequences are provided in SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 or 16. The invention also includes a mutant or variant protein any of whose residues may be changed from the corresponding residues shown in SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 or 16, while still encoding a protein that maintains its AAP activities and physiological functions, or a functional fragment thereof.


2. Variant AAP Polypeptides


In general, an AAP variant that preserves an AAP-like function and includes any variant in which residues at a particular position in the sequence have been substituted by other amino acids, and further includes the possibility of inserting an additional residue or residues between two residues of the parent protein as well as the possibility of deleting one or more residues from the parent sequence. Any amino acid substitution, insertion, or deletion is encompassed by the invention. In favorable circumstances, the substitution is a conservative substitution as defined above.


“AAP polypeptide variant” means an active AAP polypeptide having at least: (1) about 80% amino acid sequence identity with a full-length native sequence AAP polypeptide sequence, (2) an AAP polypeptide sequence lacking the signal peptide, (3) an extracellular domain of an AAP polypeptide, with or without the signal peptide, or (4) any other fragment of a full-length AAP polypeptide sequence. For example, AAP polypeptide variants include AAP polypeptides wherein one or more amino acid residues are added or deleted at the N- or C- terminus of the full-length native amino acid sequence. An AAP polypeptide variant will have at least about 80% amino acid sequence identity, preferably at least about 81% amino acid sequence identity, more preferably at least about 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% amino acid sequence identity and most preferably at least about 99% amino acid sequence identity with a full-length native sequence AAP polypeptide sequence. An AAP polypeptide variant may have a sequence lacking the signal peptide, an extracellular domain of an AAP polypeptide, with or without the signal peptide, or any other fragment of a full-length AAP polypeptide sequence. Ordinarily, AAP variant polypeptides are at least about 10 amino acids in length, often at least about 20 amino acids in length, more often at least about 30, 40, 50, 60, 70, 80, 90, 100, 150, 200, or 300 amino acids in length, or more.


“Percent (%) amino acid sequence identity” is defined as the percentage of amino acid residues that are identical with amino acid residues in a disclosed AAP polypeptide sequence in a candidate sequence when the two sequences are aligned. To determine % amino acid identity, sequences are aligned and if necessary, gaps are introduced to achieve the maximum % sequence identity; conservative substitutions are not considered as part of the sequence identity. Amino acid sequence alignment procedures to determine percent identity are well known to those of skill in the art. Often publicly available computer software such as BLAST, BLAST2, ALIGN2 or Megalign (DNASTAR) software is used to align peptide sequences. Those skilled in the art can determine appropriate parameters for measuring alignment, including any algorithms needed to achieve maximal alignment over the full length of the sequences being compared.


When amino acid sequences are aligned, the % amino acid sequence identity of a given amino acid sequence A to, with, or against a given amino acid sequence B (which can alternatively be phrased as a given amino acid sequence A that has or comprises a certain % amino acid sequence identity to, with, or against a given amino acid sequence B) can be calculated as:

% amino acid sequence identity=X/Y·100


where


X is the number of amino acid residues scored as identical matches by the sequence alignment program's or algorithm's alignment of A and B


and


Y is the total number of amino acid residues in B.


If the length of amino acid sequence A is not equal to the length of amino acid sequence B, the % amino acid sequence identity of A to B will not equal the % amino acid sequence identity of B to A.


3. Isolated/Purified Polypeptides


An “isolated” or “purified” polypeptide, protein or biologically active fragment is separated and/or recovered from a component of its natural environment. Contaminant components include materials that would typically interfere with diagnostic or therapeutic uses for the polypeptide, and may include enzymes, hormones, and other proteinaceous or non-proteinaceous materials. Preferably, the polypeptide is purified to a sufficient degree to obtain at least 15 residues of N-terminal or internal amino acid sequence. To be substantially isolated, preparations having less than 30% by dry weight of non-AAP contaminating material (contaminants), more preferably less than 20%, 10% and most preferably less than 5% contaminants. An isolated, recombinantly-produced AAP or biologically active portion is preferably substantially free of culture medium, i.e., culture medium represents less than 20%, more preferably less than about 10%, and most preferably less than about 5% of the volume of the AAP preparation. Examples of contaminants include cell debris, culture media, and substances used and produced during in vitro synthesis of an AAP.


4. Biologically Active


Biologically active portions of an AAP include peptides comprising amino acid sequences sufficiently homologous to or derived from the amino acid sequences of an AAP (SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 or 16) that include fewer amino acids than a full-length AAP, and exhibit at least one activity of an AAP. Biologically active portions comprise a domain or motif with at least one activity of a native AAP. A biologically active portion of an AAP can be a polypeptide that is, for example, 10, 25, 50, 100 or more amino acid residues in length. Other biologically active portions, in which other regions of the protein are deleted, can be prepared by recombinant techniques and evaluated for one or more of the functional activities of a native AAP.


Biologically active portions of an AAP may have an amino acid sequence shown in SEQ ID NOS:2,4, 6, 8, 10, 12, 14 or 16, or substantially homologous to SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 or 16, and retains the functional activity of the protein of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 or 16, yet differs in amino acid sequence due to natural allelic variation or mutagenesis. Other biologically active AAP may comprise an amino acid sequence at least 45% homologous to the amino acid sequence of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 or 16, and retains the functional activity of native AAP.


5. Determining Homology Between Two or More Sequences


“AAP variant” means an active AAP having at least: (1) about 80% amino acid sequence identity with a full-length native sequence AAP sequence, (2) an AAP sequence lacking the signal peptide, (3) an extracellular domain of an AAP, with or without the signal peptide, or (4) any other fragment of a full-length AAP sequence. For example, AAP variants include an AAP wherein one or more amino acid residues are added or deleted at the N- or C- terminus of the full-length native amino acid sequence. An AAP variant will have at least about 80% amino acid sequence identity, preferably at least about 81% amino acid sequence identity, more preferably at least about 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%,92%, 93%, 94%, 95%, 96%, 97%, 98% amino acid sequence identity and most preferably at least about 99% amino acid sequence identity with a full-length native sequence AAP sequence. An AAP variant may have a sequence lacking the signal peptide, an extracellular domain of an AAP, with or without the signal peptide, or any other fragment of a full-length AAP sequence. Ordinarily, AAP variant polypeptides are at least about 10 amino acids in length, often at least about 20 amino acids in length, more often at least about 30, 40, 50, 60, 70, 80, 90, 100, 150, 200, or 300 amino acids in length, or more.


“Percent (%) amino acid sequence identity” is defined as the percentage of amino acid residues that are identical with amino acid residues in a disclosed AAP sequence in a candidate sequence when the two sequences are aligned. To determine % amino acid identity, sequences are aligned and if necessary, gaps are introduced to achieve the maximum % sequence identity; conservative substitutions are not considered as part of the sequence identity. Amino acid sequence alignment procedures to determine percent identity are well known to those of skill in the art. Often publicly available computer software such as BLAST, BLAST2, ALIGN2 or Megalign (DNASTAR) software is used to align peptide sequences. Those skilled in the art can determine appropriate parameters for measuring alignment, including any algorithms needed to achieve maximal alignment over the full length of the sequences being compared.


When amino acid sequences are aligned, the % amino acid sequence identity of a given amino acid sequence A to, with, or against a given amino acid sequence B (which can alternatively be phrased as a given amino acid sequence A that has or comprises a certain % amino acid sequence identity to, with, or against a given amino acid sequence B) can be calculated as:

% amino acid sequence identity=X/Y·100


where


X is the number of amino acid residues scored as identical matches by the sequence alignment program's or algorithm's alignment of A and B


and


Y is the total number of amino acid residues in B.


If the length of amino acid sequence A is not equal to the length of amino acid sequence B, the % amino acid sequence identity of A to B will not equal the % amino acid sequence identity of B to A.


6. Chimeric and Fusion Proteins


Fusion polypeptides are useful in expression studies, cell-localization, bioassays, and AAP purification. An AAP “chimeric protein” or “fusion protein” comprises an AAP fused to a non-AAP polypeptide. A non-AAP polypeptide is not substantially homologous to an AAP (SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 or 16). An AAP fusion protein may include any portion to an entire AAP, including any number of the biologically active portions. An AAP may be fused to the C-terminus of the GST (glutathione S-transferase) sequences. Such fusion proteins facilitate the purification of a recombinant AAP. In certain host cells, (e.g. mammalian), heterologous signal sequences fusions may ameliorate AAP expression and/or secretion. Additional exemplary fusions are presented in Table C.


Other fusion partners can adapt an AAP therapeutically. Fusions with members of the immunoglobulin (Ig) protein family are useful in therapies that inhibit an AAP ligand or substrate interactions, consequently suppressing an AAP-mediated signal transduction in vivo. Such fusions, incorporated into pharmaceutical compositions, may be used to treat proliferative and differentiation disorders, as well as modulating cell survival. An AAP-Ig fusion polypeptides can also be used as immunogens to produce an anti-AAP Abs in a subject, to purify AAP ligands, and to screen for molecules that inhibit interactions of an AAP with other molecules.


Fusion proteins can be easily created using recombinant methods. A nucleic acid encoding an AAP can be fused in-frame with a non-AAP encoding nucleic acid, to an AAP NH2— or COO— -terminus, or internally. Fusion genes may also be synthesized by conventional techniques, including automated DNA synthesizers. PCR amplification using anchor primers that give rise to complementary overhangs between two consecutive gene fragments that can subsequently be annealed and reamplified to generate a chimeric gene sequence (Ausubel et al., 1987) is also useful. Many vectors are commercially available that facilitate sub-cloning an AAP in-frame to a fusion moiety.

TABLE CUseful non-AAP fusion polypeptidesReporterin vitroin vivoNotesReferenceHuman growthRadioimmunoassaynoneExpensive,(Selden et al.,hormone (hGH)insensitive,1986)narrow linearrange.β-glucuronidaseColorimetric,colorimetricsensitive,(Gallagher,(GUS)fluorescent, or(histo-chemicalbroad linear1992)chemiluminescentstaining with X-range, non-gluc)iostopic.GreenFluorescentfluorescentcan be used in(Chalfie et al.,fluorescentlive cells;1994)protein (GFP)resists photo-and relatedbleachingmolecules (RFP,BFP, AAP, etc.)LuciferasebioluminsecentBioluminescentprotein is(de Wet et al.,(firefly)unstable,1987)difficult toreproduce,signal is briefChloramphenicoalChromatography,noneExpensive(Gorman et al.,acetyltransferasedifferentialradioactive1982)(CAT)extraction,substrates,fluorescent, ortime-immunoassayconsuming,insensitive,narrow linearrangeβ-galacto-sidasecolorimetric,colorimetricsensitive,(Alam andfluorescence,(histochemicalbroad linearCook, 1990)chemiluminscencestaining with X-range; somegal), bioluminescentcells have highinendogenouslive cellsactivitySecrete alkalinecolorimetric,noneChemiluminscence(Berger et al.,phosphatasebioluminescent,assay is1988)(SEAP)chemiluminescentsensitive andbroad linearrange; somecells haveendogenousealkalinephosphataseactivity
Therapeutic applications of AAP


1. Agonists and Antagonists


“Antagonist” includes any molecule that partially or fully blocks, inhibits, or neutralizes a biological activity of an endogenous AAP. Similarly, “agonist” includes any molecule that mimics a biological activity of an endogenous AAP. Molecules that can act as agonists or antagonists include Abs or antibody fragments, fragments or variants of an endogenous AAP, peptides, antisense oligonucleotides, small organic molecules, etc.


2. Identifying Antagonists and Agonists


To assay for antagonists, an AAP is added to, or expressed in, a cell along with the compound to be screened for a particular activity. If the compound inhibits the activity of interest in the presence of an AAP, that compound is an antagonist to the AAP; if an AAP activity is enhanced, the compound is an agonist.


(a) Specific Examples of Potential Antagonists and Agonist


Any molecule that alters AAP cellular effects is a candidate antagonist or agonist. Screening techniques well known to those skilled in the art can identify these molecules. Examples of antagonists and agonists include: (1) small organic and inorganic compounds, (2) small peptides, (3) Abs and derivatives, (4) polypeptides closely related to an AAP, (5) antisense DNA and RNA, (6) ribozymes, (7) triple DNA helices and (8) nucleic acid aptamers.


Small molecules that bind to an AAP active site or other relevant part of the polypeptide and inhibit the biological activity of the AAP are antagonists. Examples of small molecule antagonists include small peptides, peptide-like molecules, preferably soluble, and synthetic non-peptidyl organic or inorganic compounds. These same molecules, if they enhance an AAP activity, are examples of agonists.


Almost any antibody that affects an AAP's function is a candidate antagonist, and occasionally, agonist. Examples of antibody antagonists include polyclonal, monoclonal, single-chain, anti-idiotypic, chimeric Abs, or humanized versions of such Abs or fragments. Abs may be from any species in which an immune response can be raised. Humanized Abs are also contemplated.


Alternatively, a potential antagonist or agonist may be a closely related protein, for example, a mutated form of an AAP that recognizes an AAP-interacting protein but imparts no effect, thereby competitively inhibiting AAP action. Alternatively, a mutated AAP may be constitutively activated and may act as an agonist.


Antisense RNA or DNA constructs can be effective antagonists. Antisense RNA or DNA molecules block function by inhibiting translation by hybridizing to targeted mRNA. Antisense technology can be used to control gene expression through triple-helix formation or antisense DNA or RNA, both of which depend on polynucleotide binding to DNA or RNA. For example, the 5′ coding portion of an AAP sequence is used to design an antisense RNA oligonucleotide of from about 10 to 40 base pairs in length. A DNA oligonucleotide is designed to be complementary to a region of the gene involved in transcription (triple helix) (Beal and Dervan, 1991; Cooney et al., 1988; Lee et al., 1979), thereby preventing transcription and the production of the AAP. The antisense RNA oligonucleotide hybridizes to the MRNA in vivo and blocks translation of the mRNA molecule into the AAP (antisense) (Cohen, 1989; Okano et al., 1991). These oligonucleotides can also be delivered to cells such that the antisense RNA or DNA may be expressed in vivo to inhibit production of the AAP. When antisense DNA is used, oligodeoxyribonucleotides derived from the translation-initiation site, e.g., between about −10 and +10 positions of the target gene nucleotide sequence, are preferred.


Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage of RNA. Ribozymes act by sequence-specific hybridization to the complementary target RNA, followed by endonucleolytic cleavage. Specific ribozyme cleavage sites within a potential RNA target can be identified by known techniques (WO 97/33551, 1997; Rossi, 1994).


To inhibit transcription, triple-helix nucleic acids that are single-stranded and comprise deoxynucleotides are useful antagonists. These oligonucleotides are designed such that triple-helix formation via Hoogsteen base-pairing rules is promoted, generally requiring stretches of purines or pyrimidines (WO 97/33551, 1997).


Aptamers are short oligonucleotide sequences that can be used to recognize and specifically bind almost any molecule. The systematic evolution of ligands by exponential enrichment (SELEX) process (Ausubel et al., 1987; Ellington and Szostak, 1990; Tuerk and Gold, 1990) is powerful and can be used to find such aptamers. Aptamers have many diagnostic and clinical uses; almost any use in which an antibody has been used clinically or diagnostically, aptamers too may be used. In addition, they are cheaper to make once they have been identified, and can be easily applied in a variety of formats, including administration in pharmaceutical compositions, in bioassays, and diagnostic tests (Jayasena, 1999).


Anti-AAP Abs


The invention encompasses Abs and antibody fragments, such as Fab or (Fab)2, that bind immunospecifically to any AAP epitopes.


“Antibody” (Ab) comprises single Abs directed against an AAP (anti-AAP Ab; including agonist, antagonist, and neutralizing Abs), anti-AAP Ab compositions with poly-epitope specificity, single chain anti-AAP Abs, and fragments of anti-AAP Abs. A “monoclonal antibody” is obtained from a population of substantially homogeneous Abs, i.e., the individual Abs comprising the population are identical except for possible naturally-occurring mutations that may be present in minor amounts. Exemplary Abs include polyclonal (pAb), monoclonal (mAb), humanized, bi-specific (bsAb), and heteroconjugate Abs.


1. Polyclonal Abs (pAbs)


Polyclonal Abs can be raised in a mammalian host, for example, by one or more injections of an immunogen and, if desired, an adjuvant. Typically, the immunogen and/or adjuvant are injected in the mammal by multiple subcutaneous or intraperitoneal injections. The immunogen may include an AAP or a fusion protein. Examples of adjuvants include Freund's complete and monophosphoryl Lipid A synthetic-trehalose dicorynomycolate (MPL-TDM). To improve the immune response, an immunogen may be conjugated to a protein that is immunogenic in the host, such as keyhole limpet hemocyanin (KLH), serum albumin, bovine thyroglobulin, and soybean trypsin inhibitor. Protocols for antibody production are described by (Ausubel et al., 1987; Harlow and Lane, 1988). Alternatively, pAbs may be made in chickens, producing IgY molecules (Schade et al., 1996).


2. Monoclonal Abs (mAbs)


Anti-AAP mAbs may be prepared using hybridoma methods (Milstein and Cuello, 1983). Hybridoma methods comprise at least four steps: (1) immunizing a host, or lymphocytes from a host; (2) harvesting the mAb secreting (or potentially secreting) lymphocytes, (3) fusing the lymphocytes to immortalized cells, and (4) selecting those cells that secrete the desired (anti-AAP) mAb.


A mouse, rat, guinea pig, hamster, or other appropriate host is immunized to elicit lymphocytes that produce or are capable of producing Abs that will specifically bind to the immunogen. Alternatively, the lymphocytes may be immunized in vitro. If human cells are desired, peripheral blood lymphocytes (PBLs) are generally used; however, spleen cells or lymphocytes from other mammalian sources are preferred The immunogen typically includes an AAP or a fusion protein.


The lymphocytes are then fused with an immortalized cell line to form hybridoma cells, facilitated by a fusing agent such as polyethylene glycol (Goding, 1996). Rodent, bovine, or human myeloma cells immortalized by transformation may be used, or rat or mouse myeloma cell lines. Because pure populations of hybridoma cells and not unfused immortalized cells are preferred, the cells after fusion are grown in a suitable medium that contains one or more substances that inhibit the growth or survival of unfused, immortalized cells. A common technique uses parental cells that lack the enzyme hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT). In this case, hypoxanthine, aminopterin and thymidine are added to the medium (HAT medium) to prevent the growth of HGPRT-deficient cells while permitting hybridomas to grow.


Preferred immortalized cells fuse efficiently, can be isolated from mixed populations by selecting in a medium such as HAT, and support stable and high-level expression of antibody after fusion. Preferred immortalized cell lines are murine myeloma lines, available from the American Type Culture Collection (Manassas, Va.). Human myeloma and mouse-human heteromyeloma cell lines also have been described for the production of human mAbs (Kozbor et al., 1984; Schook, 1987).


Because hybridoma cells secrete antibody extracellularly, the culture media can be assayed for the presence of mAbs directed against an AAP (anti-AAP mAbs). Immunoprecipitation or in vitro binding assays, such as radio immunoassay (RIA) or enzyme-linked immunoabsorbent assay (ELISA), measure the binding specificity of mAbs (Harlow and Lane, 1988; Harlow and Lane, 1999), including Scatchard analysis (Munson and Rodbard, 1980).


Anti-AAP mAb secreting hybridoma cells may be isolated as single clones by limiting dilution procedures and sub-cultured (Goding, 1996). Suitable culture media include Dulbecco's Modified Eagle's Medium, RPMI-1640, or if desired, a protein-free or -reduced or serum-free medium (e.g., Ultra DOMA PF or HL-1; Biowhittaker; Walkersville, Md.). The hybridoma cells may also be grown in vivo as ascites.


The mAbs may be isolated or purified from the culture medium or ascites fluid by conventional Ig purification procedures such as protein A-Sepharose, hydroxylapatite chromatography, gel electrophoresis, dialysis, ammonium sulfate precipitation or affinity chromatography (Harlow and Lane, 1988; Harlow and Lane, 1999).


The mAbs may also be made by recombinant methods (U.S. Pat. No. 4,166,452, 1979). DNA encoding anti-AAP mAbs can be readily isolated and sequenced using conventional procedures, e.g., using oligonucleotide probes that specifically bind to murine heavy and light antibody chain genes, to probe preferably DNA isolated from anti-AAP-secreting InAb hybridoma cell lines. Once isolated, the isolated DNA fragments are sub-cloned into expression vectors that are then transfected into host cells such as simian COS-7 cells, Chinese hamster ovary (CHO) cells, or myeloma cells that do not otherwise produce Ig protein, to express mAbs. The isolated DNA fragments can be modified, for example, by substituting the coding sequence for human heavy and light chain constant domains in place of the homologous murine sequences (U.S. Pat. No. 4,816,567, 1989; Morrison et al., 1987), or by fusing the Ig coding sequence to all or part of the coding sequence for a non-Ig polypeptide. Such a non-Ig polypeptide can be substituted for the constant domains of an antibody, or can be substituted for the variable domains of one antigen-combining site to create a chimeric bivalent antibody.


3. Monovalent Abs


The Abs may be monovalent Abs that consequently do not cross-link with each other. For example, one method involves recombinant expression of Ig light chain and modified heavy chain. Heavy chain truncations generally at any point in the Fc region will prevent heavy chain cross-linking. Alternatively, the relevant cysteine residues are substituted with another amino acid residue or are deleted, preventing crosslinking. In vitro methods are also suitable for preparing monovalent Abs. Abs can be digested to produce fragments, such as Fab fragments (Harlow and Lane, 1988; Harlow and Lane, 1999).


4. Humanized and Human Abs


Anti-AAP Abs may further comprise humanized or human Abs. Humanized forms of non-human Abs are chimeric Igs, Ig chains or fragments (such as Fv, Fab, Fab′, F(ab′)2 or other antigen-binding subsequences of Abs) that contain minimal sequence derived from non-human Ig.


Generally, a humanized antibody has one or more amino acid residues introduced from a non-human source. These non-human amino acid residues are often referred to as “import” residues, which are typically taken from an “import” variable domain. Humanization is accomplished by substituting rodent CDRs or CDR sequences for the corresponding sequences of a human antibody (Jones et al., 1986; Riechmann et al., 1988; Verhoeyen et al., 1988). Such “humanized” Abs are chimeric Abs (U.S. Pat. No. 4,816,567, 1989), wherein substantially less than an intact human variable domain has been substituted by the corresponding sequence from a non-human species. In practice, humanized Abs are typically human Abs in which some CDR residues and possibly some FR residues are substituted by residues from analogous sites in rodent Abs. Humanized Abs include human Igs (recipient antibody) in which residues from a complementary determining region (CDR) of the recipient are replaced by residues from a CDR of a non-human species (donor antibody) such as mouse, rat or rabbit, having the desired specificity, affinity and capacity. In some instances, corresponding non-human residues replace F, framework residues of the human Ig. Humanized Abs may comprise residues that are found neither in the recipient antibody nor in the imported CDR or framework sequences. In general, the humanized antibody comprises substantially all of at least one, and typically two, variable domains, in which most if not all of the CDR regions correspond to those of a non-human Ig and most if not all of the FR regions are those of a human Ig consensus sequence. The humanized antibody optimally also comprises at least a portion of an Ig constant region (Fc), typically that of a human Ig (Jones et al., 1986; Presta, 1992; Riechmann et al., 1988).


Human Abs can also be produced using various techniques, including phage display libraries (Hoogenboom et al., 1991; Marks et al., 1991) and the preparation of human mAbs (Boemer et al., 1991; Reisfeld and Sell, 1985). Similarly, introducing human Ig genes into transgenic animals in which the endogenous Ig genes have been partially or completely inactivated can be exploited to synthesize human Abs. Upon challenge, human antibody production is observed, which closely resembles that seen in humans in all respects, including gene rearrangement, assembly, and antibody repertoire (U.S. Pat. No. 5,545,807, 1996; U.S. Pat. No. 5,545,806, 1996; U.S. Pat. No. 5,569,825, 1996; U.S. Pat. No. 5,633,425, 1997; U.S. Pat. No. 5,661,016, 1997; U.S. Pat. No. 5,625,126, 1997; Fishwild et al., 1996; Lonberg and Huszar, 1995; Lonberg et al., 1994; Marks et al., 1992).


5. Bi-specific mAbs


Bi-specific Abs are monoclonal, preferably human or humanized, that have binding specificities for at least two different antigens. For example, a binding specificity is an AAP; the other is for any antigen of choice, preferably a cell-surface protein or receptor or receptor subunit.


Traditionally, the recombinant production of bi-specific Abs is based on the co-expression of two Ig heavy-chain/light-chain pairs, where the two heavy chains have different specificities (Milstein and Cuello, 1983). Because of the random assortment of Ig heavy and light chains, the resulting hybridomas (quadromas) produce a potential mixture of ten different antibody molecules, of which only one has the desired bi-specific structure. The desired antibody can be purified using affinity chromatography or other techniques (WO 93/08829,1993; Traunecker et al., 1991).


To manufacture a bi-specific antibody (Suresh et al., 1986), variable domains with the desired antibody-antigen combining sites are fused to Ig constant domain sequences. The fusion is preferably with an Ig heavy-chain constant domain, comprising at least part of the hinge, CH2, and CH3 regions. Preferably, the first heavy-chain constant region (CH1) containing the site necessary for light-chain binding is in at least one of the fusions. DNAs encoding the Ig heavy-chain fusions and, if desired, the Ig light chain, are inserted into separate expression vectors and are co-transfected into a suitable host organism.


The interface between a pair of antibody molecules can be engineered to maximize the percentage of heterodimers that are recovered from recombinant cell culture (WO 96/27011, 1996). The preferred interface comprises at least part of the CH3 region of an antibody constant domain. In this method, one or more small amino acid side chains from the interface of the first antibody molecule are replaced with larger side chains (e.g. tyrosine or tryptophan). Compensatory “cavities” of identical or similar size to the large side chain(s) are created on the interface of the second antibody molecule by replacing large amino acid side chains with smaller ones (e.g. alanine or threonine). This mechanism increases the yield of the heterodimer over unwanted end products such as homodimers.


Bi-specific Abs can be prepared as full length Abs or antibody fragments (e.g. F(ab′)2 bi-specific Abs). One technique to generate bi-specific Abs exploits chemical linkage. Intact Abs can be proteolytically cleaved to generate F(ab′)2 fragments (Brennan et al., 1985). Fragments are reduced with a dithiol complexing agent, such as sodium arsenite, to stabilize vicinal dithiols and prevent intermolecular disulfide formation. The generated Fab′ fragments are then converted to thionitrobenzoate (TNB) derivatives. One of the Fab′-TNB derivatives is then reconverted to the Fab′-thiol by reduction with mercaptoethylamine and is mixed with an equimolar amount of the other Fab′-TNB derivative to form the bi-specific antibody. The produced bi-specific Abs can be used as agents for the selective immobilization of enzymes.


Fab′ fragments may be directly recovered from E. coli and chemically coupled to form bi-specific Abs. For example, fully humanized bi-specific F(ab′)2 Abs can be produced (Shalaby et al., 1992). Each Fab′ fragment is separately secreted from E. coli and directly coupled chemically in vitro, forming the bi-specific antibody.


Various techniques for making and isolating bi-specific antibody fragments directly from recombinant cell culture have also been described. For example, leucine zipper motifs can be exploited (Kostelny et al., 1992). Peptides from the Fos and Jun proteins are linked to the Fab′ portions of two different Abs by gene fusion. The antibody homodimers are reduced at the hinge region to form monomers and then re-oxidized to form antibody heterodimers. This method can also produce antibody homodimers. The “diabody” technology (Holliger et al., 1993) provides an alternative method to generate bi-specific antibody fragments. The fragments comprise a heavy-chain variable domain (VH) connected to a light-chain variable domain (VL) by a linker that is too short to allow pairing between the two domains on the same chain. The VH and VL domains of one fragment are forced to pair with the complementary VL and VH domains of another fragment, forming two antigen-binding sites. Another strategy for making bi-specific antibody fragments is the use of single-chain Fv (sFv) dimers (Gruber et al., 1994). Abs with more than two valencies are also contemplated, such as tri-specific Abs (Tutt et al., 1991).


Exemplary bi-specific Abs may bind to two different epitopes on a given AAP. Alternatively, cellular defense mechanisms can be restricted to a particular cell expressing the particular AAP: an anti-AAP arm may be combined with an arm that binds to a leukocyte triggering molecule, such as a T-cell receptor molecule (e.g. CD2, CD3, CD28, or B7), or to Fc receptors for IgG (FcγR), such as FcγRI (CD64), FcγRII (CD32) and FcγRIII (CD16). Bi-specific Abs may also be used to target cytotoxic agents to cells that express a particular AAP. These Abs possess an AAP-binding arm and an arm that binds a cytotoxic agent or a radionuclide chelator.


6. Heteroconjugate Abs


Heteroconjugate Abs, consisting of two covalently joined Abs, have been proposed to target immune system cells to unwanted cells (U.S. Pat. No. 4,676,980, 1987) and for treatment of human immunodeficiency virus (HIV) infection (WO 91/00360, 1991; WO 92/20373, 1992). Abs prepared in vitro using synthetic protein chemistry methods, including those involving cross-linking agents, are contemplated. For example, immunotoxins may be constructed using a disulfide exchange reaction or by forming a thioether bond Examples of suitable reagents include iminothiolate and methyl-4-mercaptobutyrimidate (U.S. Pat. No. 4,676,980, 1987).


7. Immunoconjugates


Immunoconjugates may comprise an antibody conjugated to a cytotoxic agent such as a chemotherapeutic agent, toxin (e.g., an enzymatically active toxin or fragment of bacterial, fungal, plant, or animal origin), or a radioactive isotope (i.e., a radioconjugate).


Useful enzymatically-active toxins and fragments include Diphtheria A chain, non-binding active fragments of Diphtheria toxin, exotoxin A chain from Pseudomonas aeruginosa, ricin A chain, abrin A chain, modeccin A chain, α-sarcin, Aleurites fordii proteins, Dianthin proteins, Phytolaca americana proteins, Momordica charantia inhibitor, curcin, crotin, Sapaonaria officinalis inhibitor, gelonin, mitogellin, restrictocin, phenomycin, enomycin, and the tricothecenes. A variety of radionuclides are available for the production of radioconjugated Abs, such as 212Bi, 131I, 131In, 90Y, and 186Re.


Conjugates of the antibody and cytotoxic agent are made using a variety of bi-functional protein-coupling agents, such as N-succinimidyl-3-(2-pyridyldithiol) propionate (SPDP), iminothiolane (IT), bi-functional derivatives of imidoesters (such as dimethyl adipimidate HCl), active esters (such as disuccinimidyl suberate), aldehydes (such as glutareldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) hexanediamine), bis-diazonium derivatives (such as bis-(p-diazoniumbenzoyl)-ethylenediamine), diisocyanates (such as tolyene 2,6- diisocyanate), and bis-active fluorine compounds (such as 1,5-difluoro-2,4-dinitrobenzene). For example, a ricin immunotoxin can be prepared (Vitetta et al., 1987). 14C-labeled 1-isothiocyanatobenzyl-3-methyldiethylene triaminepentaacetic acid (MX-DTPA) is an exemplary chelating agent for conjugating radionuclide to antibody (WO 94/11026, 1994).


In another embodiment, the antibody may be conjugated to a “receptor” (such as streptavidin) for utilization in tumor pre-targeting wherein the antibody-receptor conjugate is administered to the patient, followed by removal of unbound conjugate from the circulation using a clearing agent and then administration of a streptavidin “ligand” (e.g., biotin) that is conjugated to a cytotoxic agent (e.g., a radionuclide).


8. Effector Function Engineering


The antibody can be modified to enhance its effectiveness in treating a disease, such as cancer. For example, cysteine residue(s) may be introduced into the Fc region, thereby allowing interchain disulfide bond formation in this region. Such homodimeric Abs may have improved internalization capability and/or increased complement-mediated cell killing and antibody-dependent cellular cytotoxicity (ADCC) (Caron et al., 1992; Shopes, 1992). Homodimeric Abs with enhanced anti-tumor activity can be prepared using hetero-bifunctional cross-linkers (Wolff et al., 1993). Alternatively, an antibody engineered with dual Fc regions may have enhanced complement lysis (Stevenson et al., 1989).


9. Immunoliposomes


Liposomes containing the antibody may also be formulated (U.S. Pat. No. 4,485,045, 1984; U.S. Pat. No. 4,544,545, 1985; U.S. Pat. No. 5,013,556, 1991; Eppstein et al., 1985; Hwang et al., 1980). Useful liposomes can be generated by a reverse-phase evaporation method with a lipid composition comprising phosphatidylcholine, cholesterol, and PEG-derivatized phosphatidylethanolamine (PEG- PE). Such preparations are extruded through filters of defined pore size to yield liposomes with a desired diameter. Fab′ fragments of the antibody can be conjugated to the liposomes (Martin and Papahadjopoulos, 1982) via a disulfide-interchange reaction. A chemotherapeutic agent, such as Doxorubicin, may also be contained in the liposome (Gabizon et al., 1989). Other useful liposomes with different compositions are contemplated.


10. Diagnostic Applications of Abs Directed Against an AAP


Anti-AAP Abs can be used to localize and/or quantitate anAAP (e.g., for use in measuring levels of an AAP within tissue samples or for use in diagnostic methods, etc.). Anti-AAP epitope Abs can be utilized as pharmacologically-active compounds.


Anti-AAP Abs can be used to isolate an AAP by standard techniques, such as immunoaffinity chromatography or immunoprecipitation. These approaches facilitate purifying an endogenous AAP antigen-containing polypeptides from cells and tissues. These approaches, as well as others, can be used to detect an AAP in a sample to evaluate the abundance and pattern of expression of the antigenic protein. Anti-AAP Abs can be used to monitor protein levels in tissues as part of a clinical testing procedure; for example, to determine the efficacy of a given treatment regimen. Coupling the antibody to a detectable substance (label) allows detection of Ab-antigen complexes. Classes of labels include fluorescent, luminescent, bioluminescent, and radioactive materials, enzymes and prosthetic groups. Useful labels include horseradish peroxidase, alkaline phosphatase, β-galactosidase, acetylcholinesterase, streptavidinfbiotin, avidin/biotin, umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride, phycoerythrin, luminol, luciferase, luciferin, aequorin, and 125I, 131I, 35S or 3H.


11. Antibody Therapeutics


Abs of the invention, including polyclonal, monoclonal, humanized and fully human Abs, can be used therapeutically. Such agents will generally be employed to treat or prevent a disease or pathology in a subject. An antibody preparation, preferably one having high antigen specificity and affinity generally mediates an effect by binding the target epitope(s). Generally, administration of such Abs may mediate one of two effects: (1) the antibody may prevent ligand binding, eliminating endogenous ligand binding and subsequent signal transduction, or (2) the antibody elicits a physiological result by binding an effector site on the target molecule, initiating signal transduction.


A therapeutically effective amount of an antibody relates generally to the amount needed to achieve a therapeutic objective, epitope binding affinity, administration rate, and depletion rate of the antibody from a subject. Common ranges for therapeutically effective doses may be, as a nonlimiting example, from about 0.1 mg/kg body weight to about 50 mg/kg body weight. Dosing frequencies may range, for example, from twice daily to once a week.


12. Pharmaceutical Compositions of Abs


Anti-AAP Abs, as well as other AAP interacting molecules (such as aptamers) identified in other assays, can be administered in pharmaceutical compositions to treat various disorders. Principles and considerations involved in preparing such compositions, as well as guidance in the choice of components can be found in (de Boer, 1994; Gennaro, 2000; Lee, 1990).


Since some AAP are intracellular, Abs that are internalized are preferred used when whole Abs are used as inhibitors. Liposomes may also be used as a delivery vehicle for intracellular introduction. Where antibody fragments are used, the smallest inhibitory fragment that specifically binds to the epitope is preferred. For example, peptide molecules can be designed that bind a preferred epitope based on the variable-region sequences of a useful antibody. Such peptides can be synthesized chemically and/or produced by recombinant DNA technology (Marasco et al., 1993). Formulations may also contain more than one active compound for a particular treatment, preferably those with activities that do not adversely affect each other. The composition may comprise an agent that enhances function, such as a cytotoxic agent, cytokine, chemotherapeutic agent, or growth-inhibitory agent.


The active ingredients can also be entrapped in microcapsules prepared by coacervation techniques or by interfacial polymerization; for example, hydroxymethylcellulose or gelatin-microcapsules and poly-(methylmethacrylate) microcapsules, respectively, in colloidal drug delivery systems (for example, liposomes, albumin microspheres, microemulsions, nano-particles, and nanocapsules) or in macroemulsions.


The formulations to be used for in vivo administration are highly preferred to be sterile. This is readily accomplished by filtration through sterile filtration membranes or any of a number of techniques.


Sustained-release preparations may also be prepared, such as semi-permeable matrices of solid hydrophobic polymers containing the antibody, which matrices are in the form of shaped articles, e.g., films, or microcapsules. Examples of sustained-release matrices include polyesters, hydrogels (for example, poly(2-hydroxyethyl-methacrylate), or poly(vinylalcohol)), polylactides (Boswell and Scribner, U.S. Pat. No. 3,773,919, 1973), copolymers of L-glutamic acid and γ ethyl-L-glutamate, non-degradable ethylene-vinyl acetate, degradable lactic acid-glycolic acid copolymers such as injectable microspheres composed of lactic acid-glycolic acid copolymer, and poly-D-(−)-3-hydroxybutyric acid. While polymers such as ethylene-vinyl acetate and lactic acid-glycolic acid enable release of molecules for over 100 days, certain hydrogels release proteins for shorter time periods and may be preferred.


AAP Recombinant Expression Vectors and Host Cells


Vectors are tools used to shuttle DNA between host cells or as a means to express a nucleotide sequence. Some vectors function only in prokaryotes, while others function in both prokaryotes and eukaryotes, enabling large-scale DNA preparation from prokaryotes for expression in eukaryotes. Inserting the DNA of interest, such as an AAP nucleotide sequence or a fragment, is accomplished by ligation techniques and/or mating protocols well-known to the skilled artisan. Such DNA is inserted such that its integration does not disrupt any necessary components of the vector. In the case of vectors that are used to express the inserted DNA protein, the introduced DNA is operably-linked to the vector elements that govern its transcription and translation.


Vectors can be divided into two general classes: Cloning vectors are replicating plasmid or phage with regions that are non-essential for propagation in an appropriate host cell, and into which foreign DNA can be inserted; the foreign DNA is replicated and propagated as if it were a component of the vector. An expression vector (such as a plasmid, yeast, or animal virus genome) is used to introduce foreign genetic material into a host cell or tissue in order to transcribe and translate the foreign DNA. In expression vectors, the introduced DNA is operably-linked to elements, such as promoters, that signal to the host cell to transcribe the inserted DNA. Some promoters are exceptionally useful, such as inducible promoters that control gene transcription in response to specific factors. Operably-linking an AAP or anti-sense construct to an inducible promoter can control the expression of an AAP or fragments, or anti-sense constructs. Examples of classic inducible promoters include those that are responsive to α-interferon, heat-shock, heavy metal ions, and steroids such as glucocorticoids (Kaufman, 1990) and tetracycline. Other desirable inducible promoters include those that are not endogenous to the cells in which the construct is being introduced, but, however, is responsive in those cells when the induction agent is exogenously supplied.


Vectors have many difference manifestations. A “plasmid” is a circular double stranded DNA molecule into which additional DNA segments can be introduced. Viral vectors can accept additional DNA segments into the viral genome. Certain vectors are capable of autonomous replication in a host cell (e.g., bacterial vectors having a bacterial origin of replication and episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the genome of a host cell upon introduction into the host cell, and thereby are replicated along with the host genome. In general, useful expression vectors are often plasmids. However, other forms of expression vectors, such as viral vectors (e. g., replication defective retroviruses, adenoviruses and adeno-associated viruses) are contemplated.


Recombinant expression vectors that comprise an AAP (or fragments) regulate an AAP transcription by exploiting one or more host cell-responsive (or that can be manipulated in vitro) regulatory sequences that is operably-linked to an AAP. “Operably-linked” indicates that a nucleotide sequence of interest is linked to regulatory sequences such that expression of the nucleotide sequence is achieved.


Vectors can be introduced in a variety of organisms and/or cells (Table D). Alternatively, the vectors can be transcribed and translated in vitro, for example using T7 promoter regulatory sequences and T7 polymerase.

TABLE DExamples of hosts for cloning or expressionOrganismsExamplesSources and References*ProkaryotesE. coliEnterobacteriaceaeK 12 strain MM294ATCC 31,446X1776ATCC 31,537W3110ATCC 27,325K5 772ATCC 53,635EnterobacterErwiniaKlebsiellaProteusSalmonella (S. tyhpimurium)Serratia (S. marcescans)ShigellaBacilli (B. subtilis and B. licheniformis)Pseudomonas (P. aeruginosa)StreptomycesEukaryotesSaccharomyces cerevisiaeYeastsSchizosaccharomyces pombeKluyver omyces(Fleer et al., 1991)K. lactis MW98-8C,(de Louvencourt et al., 1983)CBS683, CBS4574ATCC 12,424K. fragilisATCC 16,045K. bulgaricusATCC 24,178K. wickeramiiATCC 56,500K. waltiiATCC 36,906K. drosophilarum(EPO 402226, 1990)K. thermotoleransK. marxianus; yarrowiaPichia pastoris(Sreekrishna et al., 1988)CandidaTrichoderma reesiaNeurospora crassa(Case et al., 1979)TorulopsisRhodotorulaSchwanniomyces (S. occidentalis)Filamentous FungiNeurosporaPenicilliumTolypocladium(WO 91/00357, 1991)Aspergillus (A. nidulans and(Kelly and Hynes, 1985; TilburnA. niger)et al., 1983; Yelton et al., 1984)Invertebrate cellsDrosophila S2Spodoptera Sf9Vertebrate cellsChinese Hamster Ovary(CHO)simian COSATCC CRL 1651COS-7HEK 293
*Unreferenced cells are generally available from American Type Culture Collection (Manassas, VA).


Vector choice is dictated by the organism or cells being used and the desired fate of the vector. Vectors may replicate once in the target cells, or may be “suicide” vectors. In general, vectors comprise signal sequences, origins of replication, marker genes, enhancer elements, promoters, and transcription termination sequences. The choice of these elements depends on the organisms in which the vector will be used and are easily determined. Some of these elements may be conditional, such as an inducible or conditional promoter that is turned “on” when conditions are appropriate. Examples of inducible promoters include those that are tissue-specific, which relegate expression to certain cell types, steroid-responsive, or heat-shock reactive. Some bacterial repression systems, such as the lac operon, have been exploited in mammalian cells and transgenic animals (Fieck et al., 1992; Wyborski et al., 1996; Wyborski and Short, 1991). Vectors often use a selectable marker to facilitate identifying those cells that have incorporated the vector. Many selectable markers are well known in the art for the use with prokaryotes, usually antibiotic-resistance genes or the use of autotrophy and auxotrophy mutants.


Using antisense and sense AAP oligonucleotides can prevent an AAP polypeptide expression. These oligonucleotides bind to target nucleic acid sequences, forming duplexes that block transcription or translation of the target sequence by enhancing degradation of the duplexes, terminating prematurely transcription or translation, or by other means.


Antisense or sense oligonucleotides are singe-stranded nucleic acids, either RNA or DNA, which can bind a target AAP mRNA (sense) or an AAP DNA (antisense) sequences. According to the present invention, antisense or sense oligonucleotides comprise a fragment of an AAP DNA coding region of at least about 14 nucleotides, preferably from about 14 to 30 nucleotides. In general, antisense RNA or DNA molecules can comprise at least 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100 bases in length or more. Among others, (Stein and Cohen, 1988; van der Krol et al., 1988a) describe methods to derive antisense or a sense oligonucleotides from a given cDNA sequence.


Modifications of antisense and sense oligonucleotides can augment their effectiveness. Modified sugar-phosphodiester bonds or other sugar linkages (WO 91/06629, 1991), increase in vivo stability by conferring resistance to endogenous nucleases without disrupting binding specificity to target sequences. Other modifications can increase the affinities of the oligonucleotides for their targets, such as covalently linked organic moieties (WO 90/10448, 1990) or poly-(L)-lysine. Other attachments modify binding specificities of the oligonucleotides for their targets, including metal complexes or intercalating (e.g. ellipticine) and alkylating agents.


To introduce antisense or sense oligonucleotides into target cells (cells containing the target nucleic acid sequence), any gene transfer method may be used and are well known to those of skill in the art. Examples of gene transfer methods include 1) biological, such as gene transfer vectors like Epstein-Barr virus or conjugating the exogenous DNA to a ligand-binding molecule (WO 91/04753, 1991), 2) physical, such as electroporation, and 3) chemical, such as CaPO4 precipitation and oligonucleotide-lipid complexes (WO 90/10448, 1990).


The terms “host cell” and “recombinant host cell” are used interchangeably. Such terms refer not only to a particular subject cell but also to the progeny or potential progeny of such a cell. Because certain modifications may occur in succeeding generations due to either mutation or environmental influences, such progeny may not, in fact, be identical to the parent cell, but are still included within the scope of the term.


Methods of eukaryotic cell transfection and prokaryotic cell transformation are well known in the art. The choice of host cell will dictate the preferred technique for introducing the nucleic acid of interest. Table E, which is not meant to be limiting, summarizes many of the known techniques in the art. Introduction of nucleic acids into an organism may also be done with ex vivo techniques that use an in vitro method of transfection, as well as established genetic techniques, if any, for that particular organism.

TABLE EMethods to introduce nucleic acid into cellsCellsMethodsReferencesNotesProkaryotesCalcium chloride(Cohen et al., 1972;(bacteria)Hanahan, 1983; Mandel andHiga, 1970)Electroporation(Shigekawa and Dower,1988)EukaryotesCalcium phosphateN-(2-Cells may beMammaliantransfectionHydroxyethyl)piperazine-N′-“shocked” withcells(2-ethanesulfonic acidglycerol or(HEPES) buffered salinedimethylsulfoxidesolution (Chen and(DMSO) to increaseOkayama, 1988; Graham andtransfectionvan der Eb, 1973; Wigler etefficiency (Ausubelal., 1978)et al., 1987).BES (N,N-bis(2-hydroxyethyl)-2-aminoethanesulfonic acid)buffered solution (Ishiura etal., 1982)Diethylaminoethyl(Fujita et al., 1986; Lopata etMost useful for(DEAE)-Dextranal., 1984; Selden et al.,transient, but nottransfection1986)stable, transfections.Chloroquine can beused to increaseefficiency.Electroporation(Neumann et al., 1982;Especially useful forPotter, 1988; Potter et al.,hard-to-transfect1984; Wong and Neumann,lymphocytes.1982)Cationic lipid(Elroy-Stein and Moss, 1990;Applicable to bothreagentFelgner et al., 1987; Rose etin vivo and in vitrotransfectional., 1991; Whitt et al., 1990)transfection.RetroviralProduction exemplified byLengthy process,(Cepko et al., 1984; Millermany packagingand Buttimore, 1986; Pear etlines available atal., 1993)ATCC. ApplicableInfection in vitro and in vivo:to both in vivo and(Austin and Cepko, 1990;in vitro transfection.Bodine et al., 1991; Feketeand Cepko, 1993; Lemischkaet al., 1986; Turner et al.,1990; Williams et al, 1984)Polybrene(Chaney et al., 1986; Kawaiand Nishizawa, 1984)Microinjection(Capecchi, 1980)Can be used toestablish cell linescarrying integratedcopies of AAP DNAsequences.Protoplast fusion(Rassoulzadegan et al., 1982;Sandri-Goldin et al., 1981;Schaffner, 1980)Insect cellsBaculovirus(Luckow, 1991; Miller,Useful for in vitro(in vitro)systems1988; O'Reilly et al., 1992)production ofproteins witheukaryoticmodifications.YeastElectroporation(Becker and Guarente, 1991)Lithium acetate(Gietz et al., 1998; Ito et al.,1983)Spheroplast fusion(Beggs, 1978; Hinnen et al.,Laborious, can1978)produce aneuploids.Plant cellsAgrobacterium(Bechtold and Pelletier,(generaltransformation1998; Escudero and Hohn,reference:1997; Hansen and Chilton,(Hansen and1999; Touraev and al., 1997)Wright,Biolistics(Finer et al., 1999; Hansen1999))(microprojectiles)and Chilton, 1999; Shillito,1999)Electroporation(Fromm et al., 1985; Ou-Lee(protoplasts)et al., 1986; Rhodes et al.,1988; Saunders et al., 1989)May be combined withliposomes (Trick and al.,1997)Polyethylene(Shillito, 1999)glycol (PEG)treatmentLiposomesMay be combined withelectroporation (Trick andal., 1997)in planta(Leduc and al., 1996; Zhoumicroinjectionand al., 1983)Seed imbibition(Trick and al., 1997)Laser beam(Hoffman, 1996)Silicon carbide(Thompson and al., 1995)whiskers


Vectors often use a selectable marker to facilitate identifying those cells that have incorporated the vector. Many selectable markers are well known in the art for the use with prokaryotes, usually antibiotic-resistance genes or the use of autotrophy and auxotrophy mutants. Table F lists often-used selectable markers for mammalian cell transfection.

TABLE FUseful selectable markers for eukaryote cell transfectionSelectable MarkerSelectionActionReferenceAdenosine deaminaseMedia includes 9-β-D-Conversion of Xyl-A to(Kaufman et(ADA)xylofuranosyl adenineXyl-ATP, whichal., 1986)(Xyl-A)incorporates intonucleic acids, killingcells. ADA detoxifiesDihydrofolateMethotrexate (MTX)MTX competitive(Simonsenreductase (DHFR)and dialyzed seruminhibitor of DHFR. Inand(purine-free media)absence of exogenousLevinson,purines, cells require1983)DHFR, a necessaryenzyme in purinebiosynthesis.AminoglycosideG418G418, an(Southernphosphotransferaseaminoglycosideand Berg,(“APH”, “neo”,detoxified by APH,1982)“G418”)interferes withribosomal function andconsequently,translation.Hygromycin-B-hygromycin-BHygromycin-B, an(Palmer etphosphotransferaseaminocyclitolal., 1987)(HPH)detoxified by HPH,disrupts proteintranslocation andpromotesmistranslation.Thymidine kinaseForward selectionForward: Aminopterin(Littlefield,(TK)(TK+): Media (HAT)forces cells to synthesze1964)incorporatesdTTP from thymidine, aaminopterin.pathway requiring TK.Reverse selection (TK−):Reverse: TKMedia incorporatesphosphorylates BrdU,5-bromodeoxyuridinewhich incorporates into(BrdU).nucleic acids, killingcells.


A host cell of the invention, such as a prokaryotic or eukaryotic host cell in culture, can be used to produce an AAP. Accordingly, the invention provides methods for producing an AAP using the host cells of the invention. In one embodiment, the method comprises culturing the host cell of the invention (into which a recombinant expression vector encoding an AAP has been introduced) in a suitable medium, such that an AAP is produced. In another embodiment, the method further comprises isolating an AAP from the medium or the host cell.


Transgenic AAP Animals


Transgenic animals are useful for studying the function and/or activity of an AAP and for identifying and/or evaluating modulators of AAP activity. “Transgenic animals” are non-human animals, preferably mammals, more preferably a rodents such as rats or mice, in which one or more of the cells include a transgene. Other transgenic animals include primates, sheep, dogs, cows, goats, chickens, amphibians, etc. A “transgene” is exogenous DNA that is integrated into the genome of a cell from which a transgenic animal develops, and that remains in the genome of the mature animal. Transgenes preferably direct the expression of an encoded gene product in one or more cell types or tissues of the transgenic animal with the purpose of preventing expression of a naturally encoded gene product in one or more cell types or tissues (a “knockout” transgenic animal), or serving as a marker or indicator of an integration, chromosomal location, or region of recombination (e.g. cre/loxP mice). A “homologous recombinant animal” is a non-human animal, such as a rodent, in which an endogenous AAP has been altered by an exogenous DNA molecule that recombines homologously with an endogenous AAP in a (e.g. embryonic) cell prior to development the animal. Host cells with an exogenous AAP can be used to produce non-human transgenic animals, such as fertilized oocytes or embryonic stem cells into which an AAP-coding sequences have been introduced. Such host cells can then be used to create non-human transgenic animals or homologous recombinant animals.


1. Approaches to Transgenic Animal Production


A transgenic animal can be created by introducing an AAP into the male pronuclei of a fertilized oocyte (e.g., by microinjection, retroviral infection) and allowing the oocyte to develop in a pseudopregnant female foster animal (pffa). An AAP cDNA sequences (SEQ ID NO:1, 3, 5, 7, 9, 11, 13 or 15) can be introduced as a transgene into the genome of a non-human animal. Alternatively, a homologue of an AAP, such as the naturally-occuring variant of an AAP, can be used as a transgene. Intronic sequences and polyadenylation signals can also be included in the transgene to increase transgene expression. Tissue-specific regulatory sequences can be operably-linked to the AAP transgene to direct expression of the AAP to particular cells. Methods for generating transgenic animals via embryo manipulation and microinjection, particularly animals such as mice, have become conventional in the art, e.g. (Evans et al., U.S. Pat. No. 4,870,009, 1989; Hogan, 0879693843, 1994; Leder and Stewart, U.S. Pat. No. 4,736,866, 1988; Wagner and Hoppe, U.S. Pat. No. 4,873,191, 1989). Other non-mice transgenic animals may be made by similar methods. A transgenic founder animal, which can be used to breed additional transgenic animals, can be identified based upon the presence of the transgene in its genome and/or expression of the transgene mRNA in tissues or cells of the animals. Transgenic animals can be bred to other transgenic animals carrying other transgenes.


2. Vectors for Transgenic Animal Production


To create a homologous recombinant animal, a vector containing at least a portion of an AAP into which a deletion, addition or substitution has been introduced to thereby alter, e.g., disrupt or alter the expression of,-an AAP. An AAP can be a murine gene, or other AAP homologue, such as a naturally occurring variant. In one approach, a knockout vector functionally disrupts an endogenous AAP gene upon homologous recombination, and thus a non-functional AAP protein, if any, is expressed.


Alternatively, the vector can be designed such that, upon homologous recombination, an endogenous AAP is mutated or otherwise altered but still encodes functional protein (e.g., the upstream regulatory region can be altered to thereby alter the expression of an endogenous AAP). In this type of homologous recombination vector, the altered portion of the AAP is flanked at its 5′- and 3′-termini by additional nucleic acid of the AAP to allow for homologous recombination to occur between the exogenous AAP carried by the vector and an endogenous AAP in an embryonic stem cell. The additional flanking AAP nucleic acid is sufficient to engender homologous recombination with the endogenous AAP. Typically, several kilobases of flanking DNA (both at the 5′- and 3′-termini) are included in the vector (Thomas and Capecchi, 1987). The vector is then introduced into an embryonic stem cell line (e.g., by electroporation), and cells in which the introduced AAP has homologously-recombined with an endogenous AAP are selected (Li et al., 1992).


3. Introduction of an AAP Transgene Cells During Development


Selected cells are then injected into a blastocyst of an animal (e.g., a mouse) to form aggregation chimeras (Bradley, 1987). A chimeric embryo can then be implanted into a suitable pffa and the embryo brought to term. Progeny harboring the homologously-recombined DNA in their germ cells can be used to breed animals in which all cells of the animal contain the homologously-recombined DNA by germline transmission of the transgene. Methods for constructing homologous recombination vectors and homologous recombinant animals are described (Berns et al., WO 93/04169, 1993; Bradley, 1991; Kucherlapati et al., WO 91/01140, 1991; Le Mouellic and Brullet, WO 90/11354, 1990).


Alternatively, transgenic animals that contain selected systems that allow for regulated expression of the transgene can be produced. An example of such a system is the cre/loxP recombinase system of bacteriophage P1 (Lakso et al., 1992). Another recombinase system is the FLP recombinase system of Saccharoniyces cerevisiae (O'Gorman et al., 1991). If a cre/loxP recombinase system is used to regulate expression of the transgene, animals containing transgenes encoding both the Cre recombinase and a selected protein are required. Such animals can be produced as “double” transgenic animals, by mating an animal containing a transgene encoding a selected protein to another containing a transgene encoding a recombinase.


Clones of transgenic animals can also be produced (Wilmut et al., 1997). In brief, a cell from a transgenic animal can be isolated and induced to exit the growth cycle and enter G0 phase. The quiescent cell can then be fused to an enucleated oocyte from an animal of the same species from which the quiescent cell is isolated. The reconstructed oocyte is then cultured to develop to a morula or blastocyte and then transferred to a pffa The offspring bome of this female foster animal will be a clone of the “parent” transgenic animal.


Pharmaceutical Compositions


The AAP nucleic acid molecules, AAP polypeptides, and anti-AAP Abs (active compounds) of the invention, and derivatives, fragments, analogs and homologs thereof, can be incorporated into pharmaceutical compositions. Such compositions typically comprise the nucleic acid molecule, protein, or antibody and a pharmaceutically acceptable carrier. A “pharmaceutically acceptable carrier” includes any and all solvents, dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents, and the like, compatible with pharmaceutical administration (Gennaro, 2000). Preferred examples of such carriers or diluents include, but are not limited to, water, saline, finger's solutions, dextrose solution, and 5% human serum albumin. Liposomes and non-aqueous vehicles such as fixed oils may also be used Except when a conventional media or agent is incompatible with an active compound, use of these compositions is contemplated. Supplementary active compounds can also be incorporated into the compositions.


1. General Considerations


A pharmaceutical composition of the invention is formulated to be compatible with its intended route of administration, including intravenous, intradermal, subcutaneous, oral (e.g., inhalation), transdermal (i.e., topical), transmucosal, and rectal administration. Solutions or suspensions used for parenteral, intradermal, or subcutaneous application can include: a sterile diluent such as water for injection, saline solution, fixed oils, polyethylene glycols, glycerine, propylene glycol-or other synthetic solvents; antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such as ascorbic acid or sodium bisulfite; chelating agents such as ethylenediaminetetraacetic acid (EDTA); buffers such as acetates, citrates or phosphates, and agents for the adjustment of tonicity such as sodium chloride or dextrose. The pH can be adjusted with acids or bases, such as hydrochloric acid or sodium hydroxide. The parenteral preparation can be enclosed in ampoules, disposable syringes or multiple dose vials made of glass or plastic.


2. Injectable Formulations


Pharmaceutical compositions suitable for injection include sterile aqueous solutions (where water soluble) or dispersions and sterile powders for the extemporaneous preparation of sterile injectable solutions or dispersion. For intravenous administration, suitable carriers include physiological saline, bacteriostatic water, CREMOPHOR EL™ (BASF, Parsippany, N.J.) or phosphate buffered saline (PBS). In all cases, the composition must be sterile and should be fluid so as to be administered using a syringe. Such compositions should be stable during manufacture and storage and must be preserved against contamination from microorganisms such as bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for example, water, ethanol, polyol (such as glycerol, propylene glycol, and liquid polyethylene glycol), and suitable mixtures. Proper fluidity can be maintained, for example, by using a coating such as lecithin, by maintaining the required particle size in the case of dispersion and by using surfactants. Various antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, ascorbic acid, and thimerosal, can contain microorganism contamination. Isotonic agents, for example, sugars, polyalcohols such as manitol, sorbitol, and sodium chloride can be included in the composition. Compositions that can delay absorption include agents such as aluminum monostearate and gelatin.


Sterile injectable solutions can be prepared by incorporating the active compound (e.g., an AAP or anti-AAP antibody) in the required amount in an appropriate solvent with one or a combination of ingredients as required, followed by sterilization. Generally, dispersions are prepared by incorporating the active compound into a sterile vehicle that contains a basic dispersion medium, and the other required ingredients as discussed. Sterile powders for the preparation of sterile injectable solutions, methods of preparation include vacuum drying and freeze-drying that yield a powder containing the active ingredient and any desired ingredient from a sterile solutions.


3. Oral Compositions


Oral compositions generally include an inert diluent or an edible carrier. They can be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral therapeutic administration, the active compound can be incorporated with excipients and used in the form of tablets, troches, or capsules. Oral compositions can also be prepared using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is applied orally. Pharmaceutically compatible binding agents, and/or adjuvant materials can be included. Tablets, pills, capsules, troches and the like can contain any of the following ingredients, or compounds of a similar nature: a binder such as microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or lactose, a disintegrating agent such as alginic acid, PRIMOGEL, or corn starch; a lubricant such as magnesium stearate or STEROTES; a glidant such as colloidal silicon dioxide; a sweetening agent such as sucrose or saccharin; or a flavoring agent such as peppermint, methyl salicylate, or orange flavoring.


4. Compositions for Inhalation


For administration by inhalation, the compounds are delivered as an aerosol spray from a nebulizer or a pressurized container that contains a suitable propellant, e.g., a gas such as carbon dioxide.


5. Systemic Administration


Systemic administration can also be transmucosal or transdermal. For transmucosal or transdermal administration, penetrants that can permeate the target barrier(s) are selected. Transmucosal penetrants include, detergents, bile salts, and fusidic acid derivatives. Nasal sprays or suppositories can be used for transmucosal administration. For transdermal administration, the active compounds are formulated into ointments, salves, gels, or creams.


The compounds can also be prepared in the form of suppositories (e.g., with bases such as cocoa butter and other glycerides) or retention enemas for rectal delivery.


6. Carriers


In one embodiment, the active compounds are prepared with carriers that protect the compound against rapid elimination from the body, such as a controlled release formulation, including implants and microencapsulated delivery systems. Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate, polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid. Such materials can be obtained commercially from ALZA Corporation (Mountain View, Calif.) and NOVA Pharmaceuticals, Inc. (Lake Elsinore, Calif.), or prepared by one of skill in the art. Liposomal suspensions can also be used as pharmaceutically acceptable carriers. These can be prepared according to methods known to those skilled in the art, such as in (Eppstein et al., U.S. Pat. No. 4,522,811, 1985).


7. Unit Dosage


Oral formulations or parenteral compositions in unit dosage form can be created to facilitate administration and dosage uniformity. Unit dosage form refers to physically discrete units suited as single dosages for the subject to be treated, containing a therapeutically effective quantity of active compound in association with the required pharmaceutical carrier. The specification for the unit dosage forms of the invention are dictated by, and directly dependent on, the unique characteristics of the active compound and the particular desired therapeutic effect, and the inherent limitations of compounding the active compound.


8. Gene Therapy Compositions


The nucleic acid molecules of the invention can be inserted into vectors and used as gene therapy vectors. Gene therapy vectors can be delivered to a subject by, for example, intravenous injection, local administration (Nabel and Nabel, U.S. Pat. No. 5,328,470, 1994), or by stereotactic injection (Chen et al., 1994). The pharmaceutical preparation of a gene therapy vector can include an acceptable diluent, or can comprise a slow release matrix in which the gene delivery vehicle is imbedded. Alternatively, where the complete gene delivery vector can be produced intact from recombinant cells, e.g., retroviral vectors, the pharmaceutical preparation can include one or more cells that produce the gene delivery system.


9. Dosage


The pharmaceutical composition and method of the present invention may further 1 5 comprise other therapeutically active compounds as noted herein that are usually applied in the treatment of the above mentioned pathological conditions.


In the treatment or prevention of conditions which require AAP modulation an appropriate dosage level will generally be about 0.01 to 500 mg per kg patient body weight per day which can be administered in single or multiple doses. Preferably, the dosage level will be about 0.1 to about 250 mg/kg per day; more preferably about 0.5 to about 100 mg/kg per day. A suitable dosage level may be about 0.01 to 250 mg/kg per day, about 0.05 to 100 mg/kg per day, or about 0.1 to 50 mg/kg per day. Within this range the dosage may be 0.05 to 0.5, 0.5 to 5 or 5 to 50 mg/kg per day. For oral administration, the compositions are preferably provided in the form of tablets containing 1.0 to 1000 milligrams of the active ingredient, particularly 1.0, 5.0, 10.0, 15.0. 20.0, 25.0, 50.0, 75.0, 100.0, 150.0, 200.0, 250.0, 300.0, 400.0, 500.0, 600.0, 750.0, 800.0, 900.0, and 1000.0 milligrams of the active ingredient for the symptomatic adjustment of the dosage to the patient to be treated. The compounds may be administered on a regimen of 1 to 4 times per day, preferably once or twice per day.


It will be understood, however, that the specific dose level and frequency of dosage for any particular patient may be varied and will depend upon a variety of factors including the activity of the specific compound employed, the metabolic stability and length of action of that compound, the age, body weight, general health, sex, diet, mode and time of administration, rate of excretion, drug combination, the severity of the particular condition, and the host undergoing therapy.


10. Kits for Pharmaceutical Compositions


The pharmaceutical compositions can be included in a kit, container, pack, or dispenser together with instructions for administration. When the invention is supplied as a kit, the different components of the composition may be packaged in separate containers and admixed immediately before use. Such packaging of the components separately may permit long-term storage without losing the active components' functions.


Kits may also include reagents in separate containers that facilitate the execution of a specific test, such as diagnostic tests or tissue typing. For example, AAP DNA templates and suitable primers may be supplied for internal controls.


(a) Containers or Vessels


The reagents included in the kits can be supplied in containers of any sort such that the life of the different components are preserved, and are not adsorbed or altered by the materials of the container. For example, sealed glass ampules may contain lyophilized luciferase or buffer that have been packaged under a neutral, non-reacting gas, such as nitrogen. Ampoules may consist of any suitable material, such as glass, organic polymers, such as polycarbonate, polystyrene, etc., ceramic, metal or any other material typically employed to hold reagents. Other examples of suitable containers include simple bottles that may be fabricated from similar substances as ampules, and envelopes, that may consist of foil-lined interiors, such as aluminum or an alloy. Other containers include test tubes, vials, flasks, bottles, syringes, or the like. Containers may have a sterile access port, such as a bottle having a stopper that can be pierced by a hypodermic injection needle. Other containers may have two compartments that are separated by a readily removable membrane that upon removal permits the components to mix. Removable membranes may be glass, plastic, rubber, etc.


(b) Instructional Materials


Kits may also be supplied with instructional materials. Instructions may be printed on paper or other substrate, and/or may be supplied as an electronic-readable medium, such as a floppy disc, CD-ROM, DVD-ROM, Zip disc, videotape, audiotape, etc. Detailed instructions may not be physically associated with the kit; instead, a user may be directed to an internet web site specified by the manufacturer or distributor of the kit, or supplied as electronic mail.


Screening and Detection Methods


The isolated nucleic acid molecules of the invention can be used to express an AAP (e.g., via a recombinant expression vector in a host cell in gene therapy applications), to detect an AAP mRNA (e.g., in a biological sample) or a genetic lesion in an AAP, and to modulate AAP activity, as described below. In addition, AAP polypeptides can be used to screen drugs or compounds that modulate the AAP activity or expression as well as to treat disorders characterized by insufficient or excessive production of an AAP or production of AAP forms that have decreased or aberrant activity compared to an AAP wild-type protein, or modulate biological function that involve an AAP. In addition, the anti-AAP Abs of the invention can be used to detect and isolate an AAP and modulate AAP activity.


1. Screening Assays


The invention provides a method (screening assay) for identifying modalities, i.e., candidate or test compounds or agents (e.g., peptides, peptidomimetics, small molecules or other drugs), foods, combinations thereof, etc., that effect an AAP, a stimulatory or inhibitory effect, inlcuding translation, transcription, activity or copies of the gene in cells. The invention also includes compounds identified in screening assays.


Testing for compounds that increase or decrease AAP activity are desirable. A compound may modulate an AAP activity by affecting: (1) the number of copies of the gene in the cell (amplifiers and deamplifiers); (2) increasing or decreasing transcription of an AAP (transcription up-regulators and down-regulators); (3) by increasing or decreasing the translation of an AAP mRNA into protein (translation up-regulators and down-regulators); or (4) by increasing or decreasing the activity of an AAP itself (agonists and antagonists).


(a) Effects of Compounds


To identify compounds that affect an AAP at the DNA, RNA and protein levels, cells or organisms are contacted with a candidate compound and the corresponding change in an AAP DNA, RNA or protein is assessed (Ausubel et al., 1987). For DNA amplifiers and deamplifiers, the amount of an AAP DNA is measured, for those compounds that are transcription up-regulators and down-regulators the amount of an AAP MRNA is determined; for translational up- and down-regulators, the amount of an AAP polypeptide is measured. Compounds that are agonists or antagonists may be identified by contacting cells or organisms with the compound, and then examining, for example, the model of angiogenesis in vitro.


In one embodiment, many assays for screening candidate or test compounds that bind to or modulate the activity of an AAP or polypeptide or biologically-active portion are available. Ttest compounds can be obtained using any of the numerous approaches in combinatorial library methods, including: biological libraries; spatially addressable parallel solid phase or solution phase libraries; synthetic library methods requiring deconvolution; the “one-bead one-compound” library method; and synthetic library methods using affinity chromatography selection. The biological library approach is limited to peptides, while the other four approaches encompass peptide, non-peptide oligomer or small molecule libraries of compounds (Lam, 1997).


(b) Small Molecules


A “small molecule” refers to a composition that has a molecular weight of less than about 5 kD and more preferably less than about 4 kD, most preferably less than 0.6 kD. Small molecules can be, nucleic acids, peptides, polypeptides, peptidomimetics, carbohydrates, lipids or other organic or inorganic molecules. Libraries of chemical and/or biological mixtures, such as fungal, bacterial, or algal extracts, are known in the art and can be screened with any of the assays of the invention. Examples of methods for the synthesis of molecular libraries can be found in: (Carell et al., 1994a; Carell et al., 1994b; Cho et al., 1993; DeWitt et al., 1993; Gallop et al., 1994; Zuckermann et al., 1994).


Libraries of compounds may be presented in solution (Houghten et al., 1992) or on beads (Lam et al., 1991), on chips (Fodor et al., 1993), bacteria, spores (Ladner et al., U.S. Pat. No. 5,223,409, 1993), plasmids (Cull et al., 1992) or on phage (Cwirla et al., 1990; Devlin et al., 1990; Felici et al., 1991; Ladner et al., U.S. Pat. No. 5,223,409, 1993; Scott and Smith, 1990). A cell-free assay comprises contacting an AAP or biologically-active fragment with a known compound that binds the AAP to form an assay mixture, contacting the assay mixture with a test compound, and determining the ability of the test compound to interact with the AAP, where determining the ability of the test compound to interact with the AAP comprises determining the ability of the AAP to preferentially bind to or modulate the activity of an AAP target molecule.


(c) Cell-free Assays


The cell-free assays of the invention may be used with both soluble or a membrane-bound forms of an AAP. In the case of cell-free assays comprising the membrane-bound form, a solubilizing agent to maintain the AAP in solution. Examples of such solubilizing agents include non-ionic detergents such as n-octylglucoside, n-dodecylglucoside, n-dodecylmaltoside, octanoyl-N-methylglucamide, decanoyl-N-methylglucamide, TRITON® X-100 and others from the TRITON® series, THESIT®, Isotridecypoly(ethylene glycol ether)n, N-dodecyl-N,N-dimethyl-3-ammonio-1-propane sulfonate, 3-(3-cholamidopropyl) dimethylamminiol-1-propane sulfonate (CHAPS), or 3-(3-cholamidopropyl)dimethylamminiol-2-hydroxy- 1-propane sulfonate (CHAPSO).


(d) Immobilization of Target Molecules to Facilitate Screening


In more than one embodiment of the assay methods, immobilizing either an AAP or its partner molecules can facilitate separation of complexed from uncomplexed forms of one or both of the proteins, as well as to accommodate high throughput assays. Binding of a test compound to an AAP, or interaction of an AAP with a target molecule in the presence and absence of a candidate compound, can be accomplished in any vessel suitable for containing the reactants, such as microtiter plates, test tubes, and micro-centrifuge tubes. A fusion protein can be provided that adds a domain that allows one or both of the proteins to be bound to a matrix. For example, GST-AAP fusion proteins or GST-target fusion proteins can be adsorbed onto glutathione sepharose beads (SIGMA Chemical, St. Louis, Mo.) or glutathione derivatized microtiter plates that are then combined with the test compound or the test compound and either the non-adsorbed target protein or an AAP, and the mixture is incubated under conditions conducive to complex formation (e.g., at physiological conditions for salt and pH). Following incubation, the beads or microtiter plate wells are washed to remove any unbound components, the matrix immobilized in the case of beads, complex determined either directly or indirectly, for example, as described. Alternatively, the complexes can be dissociated from the matrix, and the level of AAP binding or activity determined using standard techniques.


Other techniques for immobilizing proteins on matrices can also be used in screening assays. Either an AAP or its target molecule can be immobilized using biotin-avidin or biotin-streptavidin systems. Biotinylation can be accomplished using many reagents, such as biotin-NHS (N-hydroxy-succinimide; PIERCE Chemicals, Rockford, Ill.), and immobilized in wells of streptavidin-coated 96 well plates (PIERCE Chemical). Alternatively, Abs reactive with an AAP or target molecules, but which do not interfere with binding of the AAP to its target molecule, can be derivatized to the wells of the plate, and unbound target or an AAP trapped in the wells by antibody conjugation. Methods for detecting such complexes, in addition to those described for the GST-immobilized complexes, include immunodetection of complexes using Abs reactive with an AAP or its target, as well as enzyme-linked assays that rely on detecting an enzymatic activity associated with the AAP or target molecule.


(e) Screens to Identify Modulators


Modulators of AAP expression can be identified in a method where a cell is contacted with a candidate compound and the expression of an AAP mRNA or protein in the cell is determined. The expression level of the AAP mRNA or protein in the presence of the candidate compound is compared to the AAP mRNA or protein levels in the absence of the candidate compound. The candidate compound can then be identified as a modulator of the AAP mRNA or protein expression based upon this comparison. For example, when expression of an AAP mRNA or protein is greater (i.e., statistically significant) in the presence of the candidate compound than in its absence, the candidate compound is identified as a stimulator of the AAP mRNA or protein expression. Alternatively, when expression of the AAP mRNA or protein is less (statistically significant) in the presence of the candidate compound than in its absence, the candidate compound is identified as an inhibitor of the AAP mRNA or protein expression. The level of an AAP mRNA or protein expression in the cells can be determined by methods described for detecting an AAP mRNA or protein.


(i) Hybrid Assays


In yet another aspect of the invention, an AAP can be used as “bait” in two-hybrid or three hybrid assays (Bartel et al., 1993; Brent et al., WO94/10300, 1994; Iwabuchi et al., 1993; Madura et al., 1993; Saifer et al., U.S. Pat. No. 5,283,317, 1994; Zervos et al., 1993) to identify other proteins that bind or interact with the AAP and modulate AAP activity. Such AAP-bps are also likely to be involved in the propagation of signals by the AAP as, for example, upstream or downstream elements of an AAP pathway.


The two-hybrid system is based on the modular nature of most transcription factors, which consist of separable DNA-binding and activation domains. Briefly, the assay utilizes two different DNA constructs. In one construct, the gene that codes for an AAP is fused to a gene encoding the DNA binding domain of a known transcription factor (e.g., GAL4). The other construct, a DNA sequence from a library of DNA sequences that encodes an unidentified protein (“prey” or “sample”) is fused to a gene that codes for the activation domain of the known transcription factor. If the “bait” and the “prey” proteins are able to interact in vivo, forming an AAP-dependent complex, the DNA-binding and activation domains of the transcription factor are brought into close proximity. This proximity allows transcription of a reporter gene (e.g., LacZ) that is operably-linked to a transcriptional regulatory site responsive to the transcription factor. Expression of the reporter gene can be detected, and cell colonies containing the functional transcription factor can be isolated and used to obtain the cloned gene that encodes the AAP-interacting protein.


The invention further pertains to novel agents identified by the aforementioned screening assays and uses thereof for treatments as described herein.


2. Detection Assays


Portions or fragments of an AAP cDNA sequences identified herein (and the complete AAP gene sequences) are useful in themselves. By way of non-limiting example, these sequences can be used to: (I) identify an individual from a minute biological sample (tissue typing); and (2) aid in forensic identification of a biological sample.


(a) Tissue Typing


The AAP sequences of the invention can be used to identify individuals from minute biological samples. In this technique, an individual's genomic DNA is digested with one or more restriction enzymes and probed on a Southern blot to yield unique bands. The sequences of the invention are useful as additional DNA markers for “restriction fragment length polymorphisms” (RFLP; (Smulson et al., U.S. Pat. No. 5,272,057, 1993)).


Furthermore, the AAP sequences can be used to determine the actual base-by-base DNA sequence of targeted portions of an individual's genome. AAP sequences can be used to prepare two PCR primers from the 5′- and 3′-termini of the sequences that can then be used to amplify an the corresponding sequences from an individual's genome and then sequence the amplified fragment.


Panels of corresponding DNA sequences from individuals can provide unique individual identifications, as each individual will have a unique set of such DNA sequences due to allelic differences. The sequences of the invention can be used to obtain such identification sequences from individuals and from tissue. The AAP sequences of the invention uniquely represent portions of an individual's genome. Allelic variation occurs to some degree in the coding regions of these sequences, and to a greater degree in the noncoding regions. The allelic variation between individual humans occurs with a frequency of about once ever 500 bases. Much of the allelic variation is due to single nucleotide polymorphisms (SNPs), which include RFLPs.


Each of the sequences described herein can, to some degree, be used as a standard against which DNA from an individual can be compared for identification purposes. Because greater numbers of polymorphisms occur in noncoding regions, fewer sequences are necessary to differentiate individuals. Noncoding sequences can positively identify individuals with a panel of 10 to 1,000 primers that each yield a noncoding amplified sequence of 100 bases. If predicted coding sequences, such as those in SEQ I) NOS:1, 3, 5, 7, 9, 11, 13, and 15 are used, a more appropriate number of primers for positive individual identification would be 500-2,000.


Predictive Medicine


The invention also pertains to the field of predictive medicine in which diagnostic assays, prognostic assays, pharmacogenomics, and monitoring clinical trials are used for prognostic (predictive) purposes to treat an individual prophylactically. Accordingly, one aspect of the invention relates to diagnostic assays for determining an AAP and/or nucleic acid expression as well as AAP activity, in the context of a biological sample (e.g., blood, serum, cells, tissue) to determine whether an individual is afflicted with a disease or disorder, or is at risk of developing a disorder, associated with aberrant AAP expression or activity, including cancer. The invention also provides for prognostic (or predictive) assays for determining whether an individual is at risk of developing a disorder associated with an AAP, nucleic acid expression or activity. For example, mutations in an AAP can be assayed in a biological sample. Such assays can be used for prognostic or predictive purpose to prophylactically treat an individual prior to the onset of a disorder characterized by or associated with the AAP, nucleic acid expression, or biological activity.


Another aspect of the invention provides methods for determining AAP activity, or nucleic acid expression, in an individual to select appropriate therapeutic or prophylactic agents for that individual (referred to herein as “pharmacogenomics”). Pharmacogenomics allows for the selection of modalities (e.g., drugs, foods) for therapeutic or prophylactic treatment of an individual based on the individual's genotype (e.g., the individual's genotype to determine the individual's ability to respond to a particular agent). Another aspect of the invention pertains to monitoring the influence of modalities (e.g., drugs, foods) on the expression or activity of an AAP in clinical trials.


1. Diagnostic Assays


An exemplary method for detecting the presence or absence of an AAP in a biological sample involves obtaining a biological sample from a subject and contacting the biological sample with a compound or an agent capable of detecting the AAP or the AAP nucleic acid (e.g., mRNA, genomic DNA) such that the presence of the AAP is confirmed in the sample. An agent for detecting the AAP mRNA or genomic DNA is a labeled nucleic acid probe that can hybridize to the AAP mRNA or genomic DNA. The nucleic acid probe can be, for example, a full-length AAP nucleic acid, such as the nucleic acid of SEQ ID NOS:1, 3, 5, 7, 9, 11, 13 or 15, or a portion thereof, such as an oligonucleotide of at least 15, 30, 50, 100, 250 or 500 nucleotides in length and sufficient to specifically hybridize under stringent conditions to an AAP mRNA or genomic DNA.


An agent for detecting an AAP polypeptide is an antibody capable of binding to the AAP, preferably an antibody with a detectable label. Abs can be polyclonal, or more preferably, monoclonal. An intact antibody, or a fragment (e.g., Fab or F(ab′)2) can be used. A labeled probe or antibody is coupled (i.e., physically linking) to a detectable substance, as well as indirect detection of the probe or antibody by reactivity with another reagent that is directly labeled. Examples of indirect labeling include detection of a primary antibody using a fluorescently-labeled secondary antibody and end-labeling of a DNA probe with biotin such that it can be detected with fluorescently-labeled streptavidin. The term “biological sample” includes tissues, cells and biological fluids isolated from a subject, as well as tissues, cells and fluids present within a subject. The detection method of the invention can be used to detect an AAP mRNA, protein, or genomic DNA in a biological sample in vitro as well as in vivo. For example, in vitro techniques for detection of an AAP mRNA include Northern hybridizations and in situ hybridizations. In vitro techniques for detection of an AAP polypeptide include enzyme linked immunosorbent assays (ELISAs), Western blots, immunoprecipitations, and immunofluorescence. In vitro techniques for detection of an AAP genomic DNA include Southern hybridizations and fluorescence in situ hybridization (FISH). Furthermore, in vivo techniques for detecting an AAP include introducing into a subject a labeled anti-AAP antibody. For example, the antibody can be labeled with a radioactive marker whose presence and location in a subject can be detected by standard imaging techniques.


In one embodiment, the biological sample from the subject contains protein molecules, and/or mRNA molecules, and/or genomic DNA molecules. A preferred biological sample is blood.


In another embodiment, the methods further involve obtaining a biological sample from a subject to provide a control, contacting the sample with a compound or agent to detect an AAP, mRNA, or genomic DNA, and comparing the presence of the AAP, mRNA or genomic DNA in the control sample with the presence of the AAP, mRNA or genomic DNA in the test sample.


The invention also encompasses kits for detecting an AAP in a biological sample. For example, the kit can comprise: a labeled compound or agent capable of detecting an AAP or an AAP mRNA in a sample; reagent and/or equipment for determining the amount of an AAP in the sample; and reagent and/or equipment for comparing the amount of an AAP in the sample with a standard. The compound or agent can be packaged in a suitable container. The kit can further comprise instructions for using the kit to detect the AAP or nucleic acid.


2. Prognostic Assays


The diagnostic methods described herein can furthermore be utilized to identify subjects having or at risk of developing a disease or disorder associated with an aberrant AAP expression or activity. For example, the assays described herein, can be used to identify a subject having or at risk of developing a disorder associated with AAP, nucleic acid expression or activity. Alternatively, the prognostic assays can be used to identify a subject having or at risk for developing a disease or disorder. Tthe invention provides a method for identifying a disease or disorder associated with an aberrant AAP expression or activity in which a test sample is obtained from a subject and the AAP or nucleic acid (e.g., mRNA, genomic DNA) is detected. A test sample is a biological sample obtained from a subject. For example, a test sample can be a biological fluid (e.g., serum), cell sample, or tissue.


Pognostic assays can be used to determine whether a subject can be administered a modality (e.g., an agonist, antagonist, peptidomimetic, protein, peptide, nucleic acid, small molecule, food, etc.) to treat a disease or disorder associated with an aberrant AAP expression or activity. Such methods can be used to determine whether a subject can be effectively treated with an agent for a disorder. The invention provides methods for determining whether a subject can be effectively treated with an agent for a disorder associated with an aberrant AAP expression or activity in which a test sample is obtained and the AAP or nucleic acid is detected (e.g., where the presence of the AAP or nucleic acid is diagnostic for a subject that can be administered the agent to treat a disorder associated with the aberrant AAP expression or activity).


The methods of the invention can also be used to detect genetic lesions in an AAP to determine if a subject with the genetic lesion is at risk for a disorder characterized by aberrant angiogenesis. Methods include detecting, in a sample from the subject, the presence or absence of a genetic lesion characterized by at an alteration affecting the integrity of a gene encoding an AAP polypeptide, or the mis-expression of an AAP. Such genetic lesions can be detected by ascertaining: (1) a deletion of one or more nucleotides from an AAP; (2) an addition of one or more nucleotides to an AAP; (3) a substitution of one or more nucleotides in an AAP, (4) a chromosomal rearrangement of an AAP gene; (5) an alteration in the level of an AAP mRNA transcripts, (6) aberrant modification of an AAP, such as a change genomic DNA methylation, (7) the presence of a non-wild-type splicing pattern of an AAP mRNA transcript, (8) a non-wild-type level of an AAP, (9) allelic loss of an AAP, and/or (10) inappropriate post-translational modification of an AAP polypeptide. There are a large number of known assay techniques that can be used to detect lesions in an AAP. Any biological sample containing nucleated cells may be used.


In certain embodiments, lesion detection may use a probe/primer in a polymerase chain reaction (PCR) (e.g., (Mullis, U.S. Pat. No. 4,683,202, 1987; Mullis et al., U.S. Pat. No. 4,683,195, 1987), such as anchor PCR or rapid amplification of cDNA ends (RACE) PCR, or, alternatively, in a ligation chain reaction (LCR) (e.g., (Landegren et al., 1988; Nakazawa et al., 1994), the latter is particularly useful for detecting point mutations in AAP-genes (Abravaya et al., 1995). This method may include collecting a sample from a patient, isolating nucleic acids from the sample, contacting the nucleic acids with one or more primers that specifically hybridize to an AAP under conditions such that hybridization and amplification of the AAP (if present) occurs, and detecting the presence or absence of an amplification product, or detecting the size of the amplification product and comparing the length to a control sample. It is anticipated that PCR and/or LCR may be desirable to use as a preliminary amplification step in conjunction with any of the techniques used for detecting mutations described herein.


Alternative amplification methods include: self sustained sequence replication (Guatelli et al., 1990), transcriptional amplification system (Kwoh et al., 1989); Qu Replicase (Lizardi et al., 1988), or any other nucleic acid amplification method, followed by the detection of the amplified molecules using techniques well known to those of skill in the art. These detection schemes are especially useful for the detection of nucleic acid molecules present in low abundance.


Mutations in an AAP from a sample can be identified by alterations in restriction enzyme cleavage patterns. For example, sample and control DNA is isolated, amplified (optionally), digested with one or more restriction endonucleases, and fragment length sizes are determined by gel electrophoresis and compared. Differences in fragment length sizes between sample and control DNA indicates mutations in the sample DNA. Moreover, the use of sequence specific ribozymes can be used to score for the presence of specific mutations by development or loss of a ribozyme cleavage site.


Hybridizing a sample and control nucleic acids, e.g., DNA or RNA, to high-density arrays containing hundreds or thousands of oligonucleotides probes, can identify genetic mutations in an AAP (Cronin et al., 1996; Kozal et al., 1996). For example, genetic mutations in an AAP can be identified in two-dimensional arrays containing light-generated DNA probes as described in Cronin, et al., supra. Briefly, a first hybridization array of probes can be used to scan through long stretches of DNA in a sample and control to identify base changes between the sequences by making linear arrays of sequential overlapping probes. This step allows the identification of point mutations. This is followed by a second hybridization array that allows the characterization of specific mutations by using smaller, specialized probe arrays complementary to all variants or mutations detected. Each mutation array is composed of parallel probe sets, one complementary to the wild-type gene and the other complementary to the mutant gene.


In yet another embodiment, any of a variety of sequencing reactions known in the art can be used to directly sequence an AAP and detect mutations by comparing the sequence of the sample AAP-with the corresponding wild-type (control) sequence. Examples of sequencing reactions include those based on classic techniques (Maxam and Gilbert, 1977; Sanger et al., 1977). Any of a variety of automated sequencing procedures can be used when performing diagnostic assays (Naeve et al., 1995) including sequencing by mass spectrometry (Cohen et al., 1996; Griffin and Griffin, 1993; Koster, WO94/16101, 1994).


Other methods for detecting mutations in an AAP include those in which protection from cleavage agents is used to detect mismatched bases in RNA/RNA or RNA/DNA heteroduplexes (Myers et al., 1985). In general, the technique of “mismatch cleavage” starts by providing heteroduplexes formed by hybridizing (labeled) RNA or DNA containing a wild-type AAP sequence with potentially mutant RNA or DNA obtained from a sample. The double-stranded duplexes are treated with an agent that cleaves single-stranded regions of the duplex such as those that arise from base pair mismatches between the control and sample strands. For instance, RNA/DNA duplexes can be treated with RNase and DNA/DNA hybrids treated with S1 nuclease to enzymatically digest the mismatched regions. In other embodiments, either DNA/DNA or RNA/DNA duplexes can be treated with hydroxylamine or osmium tetroxide and with piperidine in order to digest mismatched regions. The digested material is then separated by size on denaturing polyacrylamide gels to determine the mutation site (Grompe et al., 1989; Saleeba and Cotton, 1993). The control DNA or RNA can be labeled for detection.


Mismatch cleavage reactions may employ one or more proteins that recognize mismatched base pairs in double-stranded DNA (DNA mismatch repair) in defined systems for detecting and mapping point mutations in an AAP cDNAs obtained from samples of cells. For example, the mutY enzyme of E. coli cleaves A at G/A mismatches and the thymidine DNA glycosylase from HeLa cells cleaves T at G/T mismatches (Hsu et al., 1994). According to an exemplary embodiment, a probe based on a wild-type AAP sequence is hybridized to a cDNA or other DNA product from a test cell(s). The duplex is treated with a DNA mismatch repair enzyme, and the cleavage products, if any, can be detected from electrophoresis protocols or the like (Modrich et al., U.S. Pat. No. 5,459,039, 1995).


Electrophoretic mobility alterations can be used to identify mutations in an AAP. For example, single strand conformation polymorphism (SSCP) may be used to detect differences in electrophoretic mobility between mutant and wild type nucleic acids (Cotton, 1993; Hayashi, 1992; Orita et al., 1989). Single-stranded DNA fragments of sample and control AAP nucleic acids are denatured and then renatured The secondary structure of single-stranded nucleic acids varies according to sequence; the resulting alteration in electrophoretic mobility allows detection of even a single base change. The DNA fragments may be labeled or detected with labeled probes. The sensitivity of the assay may be enhanced by using RNA (rather than DNA), in which the secondary structure is more sensitive to a sequence changes. The subject method may use heteroduplex analysis to separate double stranded heteroduplex molecules on the basis of changes in electrophoretic mobility (Keen et al., 1991).


The migration of mutant or wild-type fragments can be assayed using denaturing gradient gel electrophoresis (DGGE; (Myers et al., 1985). In DGGE, DNA is modified to prevent complete denaturation, for example by adding a GC clamp of approximately 40 bp of high-melting GC-rich DNA by PCR. A temperature gradient may also be used in place of a denaturing gradient to identify differences in the mobility of control and sample DNA (Rossiter and Caskey, 1990).


Examples of other techniques for detecting point mutations include, but are not limited to, selective oligonucleotide hybridization, selective amplification, or selective primer extension. For example, oligonucleotide primers may be prepared in which the known mutation is placed centrally and then hybridized to target DNA under conditions that permit hybridization only if a perfect match is found (Saiki et al., 1986; Saiki et al., 1989). Such allele-specific oligonucleotides are hybridized to PCR-amplified target DNA or a number of different mutations when the oligonucleotides are attached to the hybridizing membrane and hybridized with labeled target DNA.


Alternatively, allele specific amplification technology that depends on selective PCR amplification may be used. Oligonucleotide primers for specific amplifications may carry the mutation of interest in the center of the molecule (so that amplification depends on differential hybridization (Gibbs et al., 1989)) or at the extreme 3′-terminus of one primer where, under appropriate conditions, mismatch can prevent, or reduce polymerase extension (Prosser, 1993). Novel restriction site in the region of the mutation may be introduced to create cleavage-based detection (Gasparini et al., 1992). Certain amplification may also be performed using Taq ligase for amplification (Barany, 1991).


In such cases, ligation occurs only if there is a perfect match at the 3′-terminus of the 5′ sequence, allowing detection of a known mutation by scoring for amplification.


The described methods may be performed, for example, by using pre-packaged kits comprising at least one probe (nucleic acid or antibody) that may be conveniently used, for example, in clinical settings to diagnose patients exhibiting symptoms or family history of a disease or illness involving an AAP.


Furthermore, any cell type or tissue in which an AAP is expressed may be utilized in the prognostic assays described herein.


3. Phannacogenomics


Agents, or modulators that have a stimulatory or inhibitory effect on AAP activity or expression, as identified by a screening assay can be administered to individuals to treat, prophylactically or therapeutically, disorders, including those associated with angiogenesis. In conjunction with such treatment, the pharmacogenomics (i.e., the study of the relationship between a subject's genotype and the subject's response to a foreign modality, such as a food, compound or drug) may be considered. Metabolic differences of therapeutics can lead to severe toxicity or therapeutic failure by altering the relation between dose and blood concentration of the pharmacologically active drug. Thus, the pharmacogenomics of the individual permits the selection of effective agents (e.g., drugs) for prophylactic or therapeutic treatments based on a consideration of the individual's genotype. Pharmacogenomics can further be used to determine appropriate dosages and therapeutic regimens. Accordingly, the activity of an AAP, expression of an AAP nucleic acid, or an AAP mutation(s) in an individual can be determined to guide the selection of appropriate agent(s) for therapeutic or prophylactic treatment.


Pharmacogenomics deals with clinically significant hereditary variations in the response to modalities due to altered modality disposition and abnormal action in affected persons (Eichelbaum and Evert, 1996; Linder et al., 1997). In general, two pharmacogenetic conditions can be differentiated: (1) genetic conditions transmitted as a single factor altering the interaction of a modality with the body (altered drug action) or (2) genetic conditions transmitted as single factors altering the way the body acts on a modality (altered drug metabolism). These pharmnacogenetic conditions can occur either as rare defects or as nucleic acid polymorphisms. For example, glucose-6-phosphate dehydrogenase (G6PD) deficiency is a common inherited enzymopathy in which the main clinical complication is hemolysis after ingestion of oxidant drugs (anti-malarials, sulfonamides, analgesics, nitrofurans) and consumption of fava beans.


As an illustrative embodiment, the activity of drug metabolizing enzymes is a major determinant of both the intensity and duration of drug action. The discovery of genetic polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 (NAT 2) and cytochrome P450 enzymes CYP2D6 and CYP2C19) explains the phenomena of some patients who show exaggerated drug response and/or serious toxicity after taking the standard and safe dose of a drug. These polymorphisms are expressed in two phenotypes in the population, the extensive metabolizer (EM) and poor metabolizer (PM). The prevalence of PM is different among different populations. For example, the CYP2D6 gene is highly polymorphic and several mutations have been identified in PM, which all lead to the absence of functional CYP2D6. Poor metabolizers due to mutant CYP2D6 and CYP2C]9 frequently experience exaggerated drug responses and side effects when they receive standard doses. If a metabolite is the active therapeutic moiety, PM shows no therapeutic response, as demonstrated for the analgesic effect of codeine mediated by its CYP2D6-formed metabolite morphine. At the other extreme are the so-called ultra-rapid metabolizers who are unresponsive to standard doses. Recently, the molecular basis of ultra-rapid metabolism has been identified to be due to CYP2D6 gene amplification.


The activity of an AAP, expression of an AAP nucleic acid, or mutation content of an AAP in an individual can be determined to select appropriate agent(s) for therapeutic or prophylactic treatment of the individual. In addition, pharmacogenetic studies can be used to apply genotyping of polymorphic alleles encoding drug-metabolizing enzymes to the identification of an individual's drug responsiveness phenotype. This knowledge, when applied to dosing or drug selection, can avoid adverse reactions or therapeutic failure and thus enhance therapeutic or prophylactic efficiency when treating a subject with an AAP modulator, such as a modulator identified by one of the described exemplary screening assays.


4. Monitoring Effects During Clinical Trials


Monitoring the influence of agents (e.g., drugs, compounds) on the expression or activity of an AAP (e.g., the ability to modulate angiogenesis) can be applied not only in basic drug screening, but also in clinical trials. For example, the effectiveness of an agent determined by a screening assay to increase an AAP expression, protein levels, or up-regulate an AAP's activity can be monitored in clinical trails of subjects exhibiting decreased AAP expression, protein levels, or down-regulated AAP activity. Alternatively, the effectiveness of an agent determined to decrease an AAP expression, protein levels, or down-regulate an AAP's activity, can be monitored in clinical trails of subjects exhibiting increased the AAP expression, protein levels, or up-regulated AAP activity. In such clinical trials, the expression or activity of the AAP and, preferably, other genes that have been implicated in, for example, angiogenesis can be used as a “read out” or markers for a particular cell's responsiveness.


For example, genes, including an AAP, that are modulated in cells by treatment with a modality (e.g., food, compound, drug or small molecule) can be identified. To study the effect of agents on angiogenesis, for example, in a clinical trial, cells can be isolated and RNA prepared and analyzed for the levels of expression of an AAP and other genes implicated in the disorder. The gene expression pattern can be quantified by Northern blot analysis, nuclear run-on or RT-PCR experiments, or by measuring the amount of protein, or by measuring the activity level of the AAP or other gene products. In this manner, the gene expression pattern itself can serve as a marker, indicative of the cellular physiological response to the agent. Accordingly, this response state may be determined before, and at various points during, treatment of the individual with the agent.


The invention provides a method for monitoring the effectiveness of treatment of a subject with an agent (e.g., an agonist, antagonist, protein, peptide, peptidomimetic, nucleic acid, small molecule, food or other drug candidate identified by the screening assays described herein) comprising the steps of (1) obtaining a pre-administration sample from a subject; (2) detecting the level of expression of an AAP, mRNA, or genomic DNA in the preadministration sample; (3) obtaining one or more post-administration samples from the subject; (4) detecting the level of expression or activity of the AAP, mRNA, or genomic DNA in the post-administration samples; (5) comparing the level of expression or activity of the AAP, mRNA, or genomic DNA in the pre-administration sample with the AAP, mRNA, or genomic DNA in the post administration sample or samples; and (6) altering the administration of the agent to the subject accordingly. For example, increased administration of the agent may be desirable to increase the expression or activity of the AAP to higher levels than detected, i.e., to increase the effectiveness of the agent. Alternatively, decreased administration of the agent may be desirable to decrease expression or activity of the AAP to lower levels than detected, i.e., to decrease the effectiveness of the agent.


5. Methods of Treatment


The invention provides for both prophylactic and therapeutic methods of treating a subject at risk of (or susceptible to) a disorder or having a disorder associated with aberrant AAP expression or activity. Furthermore, these same methods of treatment may be used to induce or inhibit angiogenesis, by changing the level of expression or activity of an AAP.


6. Disease and Disorders


Diseases and disorders that are characterized by increased AAP levels or biological activity may be treated with therapeutics that antagonize (i.e., reduce or inhibit) activity. Antognists may be administered in a therapeutic or prophylactic manner. Therapeutics that may be used include: (1) AAP peptides, or analogs, derivatives, fragments or homologs thereof; (2) Abs to an AAP peptide; (3) AAP nucleic acids; (4) administration of antisense nucleic acid and nucleic acids that are “dysfunctional” (i.e., due to a heterologous insertion within the coding sequences) that are used to eliminate endogenous function of by homologous recombination (Capecchi, 1989); or (5) modulators (i.e., inhibitors, agonists and antagonists, including additional peptide mimetic of the invention or Abs specific to an AAP) that alter the interaction between an AAP and its binding partner.


Diseases and disorders that are characterized by decreased AAP levels or biological activity may be treated with therapeutics that increase (i.e., are agonists to) activity. Therapeutics that upregulate activity may be administered therapeutically or prophylactically. Therapeutics that may be used include peptides, or analogs, derivatives, fragments or homologs thereof; or an agonist that increases bioavailability.


Increased or decreased levels can be readily detected by quantifying peptide and/or RNA, by obtaining a patient tissue sample (e.g., from biopsy tissue) and assaying in vitro for RNA or peptide levels, structure and/or activity of the expressed peptides (or AAP mRNAs). Methods include, but are not limited to, immunoassays (e.g., by Western blot analysis, immunoprecipitation followed by sodium dodecyl sulfate (SDS) polyacrylamide gel electrophoresis, immunocytochemistry, etc.) and/or hybridization assays to detect expression of mRNAs (e.g., Northern assays, dot blots, in situ hybridization, and the like).


7. Prophylactic Methods


The invention provides a method for preventing, in a subject, a disease or condition associated with an aberrant AAP expression or activity, by administering an agent that modulates an AAP expression or at least one AAP activity. Subjects at risk for a disease that is caused or contributed to by an aberrant AAP expression or activity can be identified by, for example, any or a combination of diagnostic or prognostic assays. Administration of a prophylactic agent can occur prior to the manifestation of symptoms characteristic of the AAP aberrancy, such that a disease or disorder is prevented or, alternatively, delayed in its progression. Depending on the type of AAP aberrancy, for example, an AAP agonist or AAP antagonist can be used to treat the subject. The appropriate agent can be determined based on screening assays.


8. Therapeutic Methods


Another aspect of the invention pertains to methods of modulating an AAP expression or activity for therapeutic purposes. The modulatory method of the invention involves contacting a cell with an agent that modulates one or more of the activities of AAP activity associated with the cell. An agent that modulates AAP activity can be a nucleic acid or a protein, a naturally occurring cognate ligand of an AAP, a peptide, an AAP peptidomimetic, or other small molecule. The agent may stimulate AAP activity. Examples of such stimulatory agents include an active AAP and an AAP nucleic acid molecule that has been introduced into the cell. In another embodiment, the agent inhibits AAP activity. Examples of inhibitory agents include antisense AAP nucleic acids and anti-AAP Abs. Modulatory methods can be performed in vitro (e.g., by culturing the cell with the agent) or, alternatively, in vivo (e.g., by administering the agent to a subject). As such, the invention provides methods of treating an individual afflicted with a disease or disorder characterized by aberrant expression or activity of an AAP or nucleic acid molecule. In one embodiment, the method involves administering an agent (e.g., an agent identified by a screening assay), or combination of agents that modulates (e.g., up-regulates or down-regulates) AAP expression or activity. In another embodiment, the method involves administering an AAP or nucleic acid molecule as therapy to compensate for reduced or aberrant AAP expression or activity.


Stimulation of AAP activity is desirable in situations in which AAP is abnormally down-regulated and/or in which increased AAP activity is likely to have a beneficial effect. One example of such a situation is where a subject has a disorder characterized by aberrant angiogenesis (e.g., cancer).


9. Deternination of the Biological Effect of the Therapeutic


Suitable in vitro or in vivo assays can be performed to determine the effect of a specific therapeutic and whether its administration is indicated for treatment of the affected tissue.


In various specific embodiiments, in vitro assays may be performed with representative cells of the type(s) involved in the patient's disorder, to determine if a given therapeutic exerts the desired effect upon the cell type(s). Modalities for use in therapy may be tested in suitable animal model systems including, but not limited to rats, mice, chicken, cows, monkeys, rabbits, and the like, prior to testing in human subjects. Similarly, for in vivo testing, any of the animal model system known in the art may be used prior to administration to human subjects.


10. Prophylactic and Therapeutic Uses of the Compositions of the Invention


AAP nucleic acids and proteins are useful in potential prophylactic and therapeutic applications implicated in a variety of disorders including, but not limited to those related to angiogenesis.


As an example, a cDNA encoding an AAP may be useful in gene therapy, and the protein may be useful when administered to a subject in need thereof.


AAP nucleic acids, or fragments thereof, may also be useful in diagnostic applications, wherein the presence or amount of the nucleic acid or the protein is to be assessed. A further use could be as an anti-bacterial molecule (i.e., some peptides have been found to possess anti-bacterial properties). These materials are further useful in the generation of Abs that immunospecifically bind to the novel substances of the invention for use in therapeutic or diagnostic methods.


The following example is meant to not be limiting.


EXAMPLE

Identification of Genes Differentially-regulated


A comprehensive mRNA profiling technique (GeneCalling) was used to determine differential gene expression profiles of human endothelial cells undergoing differentiation into tube-like structures (Kahn et al., 2000). To confirm the expression data from GeneCalling, independent experiments were undertaken that used gene-specific PCR oligonucleotide primer pairs and an oligonucleotide probe labeled with a fluorescent dye at the 5′ end and quencher fluorescent dye at the 3′ end. Total RNA (50 ng) was added to a 50 μl RT-PCR mixture and run.


The following data were collected:

hEF Gcollagen gel 24 hr versus 4 h4.5 fold upregulatedhTRGcollagen gel 24 hr versus 4 h3.5 fold upregulatedKLPcollagen gel 24 hr versus 4 h3.5 fold upregulatedmyosin Xcollagen gel 24 hr versus 4 h3.5 fold upregulatedNHRcollagen gel 24 hr versus 4 h7.3 fold downregulatedHBAZFcollagen gel 24 hr versus 4 h2.1 fold upregulated


Equivalents


Although particular embodiments have been disclosed herein in detail, this has been done by way of example for purposes of illustration only, and is not intended to be limiting with respect to the scope of the appended claims that follow. In particular, it is contemplated by the inventors that various substitutions, alterations, and modifications may be made to the invention without departing from the spirit and scope of the invention as defined by the claims. The choice of nucleic acid starting material, clone of interest, or library type is believed to be a matter of routine for a person of ordinary skill in the art with knowledge of the embodiments described herein. Other aspects, advantages, and modifications considered within the scope of the following claims.


REFERENCES



  • U.S. Pat. No. 4,166,452. Apparatus for testing human responses to stimuli. 1979.

  • U.S. Pat. No. 4,485,045. Synthetic phosphatidyl cholines useful in forming liposomes. 1984.

  • U.S. Pat. No. 4,544,545. Liposomes containing modified cholesterol for organ targeting. 1985.

  • U.S. Pat. No. 4,676,980. Target specific cross-linked heteroantibodies. 1987.

  • U.S. Pat. No. 4,816,567. Recombinant immunoglobin preparations. 1989.

  • WO 90/10448. Covalent conjugates of lipid and oligonucleotide. 1990.

  • WO 90/13641. Stably transformed eucaryotic cells comprisng a foreign transcribable DNA under the control of a pol im promoter. 1990.

  • EPO 402226. Transformation vectors for yeast Yarrowia. 1990.

  • WO 91/00360. Bispecific reagents for AIDS therapy. 1991.

  • WO 91/04753. Conjugates of antisense oligonucleotides and therapeutic uses thereof. 1991.

  • U.S. Pat. No. 5,013,556. Liposomes with enhanced circulation time. 1991.

  • WO 91/00357. New strain with filamentous fungi mutants, process for the production of recombinant proteins using said strain, and strains and proteins. 1991.

  • WO 91/06629. Oligonucleotide analogs with novel linkages. 1991.

  • WO 92/20373. Heteroconjugate antibodies for treatment of HIV infection. 1992.

  • WO 93/08829. Compositions that mediate killing of HIV-infected cells. 1993.

  • WO 94/11026. Therapeutic application of chimeric and radiolabeled antibodies to human B lymphocyte restricted differentiation antigen for treatment of B cells. 1994.

  • WO 96/27011. A method for making heteromultimeric polypeptides. 1996.

  • U.S. Pat. No. 5,545,807. Production of antibodies from transgenic animals. 1996.

  • U.S. Pat. No. 5,545,806. Ransgenic <sic> non-human animals for producing heterologous antibodies. 1996.

  • U.S. Pat. No. 5,569,825. Transgenic non-human animals capable of producing heterologous antibodies of various isotypes. 1996.

  • WO 97/33551. Compositions and methods for the diagnosis, prevention, and treatment of neoplastic cell growth and proliferation. 1997.



U.S. Pat. No.5,633,425. Transgenic non-human animals capable of producing heterologous antibodies. 1997.


U.S. Pat. No. 5,661,016. Transgenic non-human animals capable of producing heterologous antibodies of various isotypes. 1997.

  • U.S. Pat. No. 5,625,126. Transgenic non-human animals for producing heterologous antibodies. 1997.
  • Abravaya, K., J. J. Carrino, S. Muldoon, and H. H. Lee. 1995. Detection of point mutations with a modified ligase chain reaction (Gap-LCR). Nucleic Acids Res. 23:675-82.
  • Adams, J., R. Kelso, and L. Cooley. 2000. The kelch repeat superfamily of proteins: propellers of cell function. Trends Cell Biol. 10:17-24.
  • Alam, J., and J. L. Cook. 1990. Reporter genes: Application to the study of mammalian gene transcription. Anal. Biochem. 188:245-254.
  • Albagli, O., D. Lantoine, S. Quief, F. Quignon, et al. 1999. Overexpressed BCL6 (LAZ3) oncoprotein triggers apoptosis, delays S phase progression and associates with replication foci. Oncogene. 18:5063-75.
  • Alberts, B., D. Bray, J. Lewis, M. Raff, et al. 1994. Molecular Biology of the Cell. Garland Publishing, Inc., New York, N.Y. 1294 pp.
  • Altschul, S. F., T. L. Madden, A. A. Schaffer, J. Zhang, et al. 1997. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25:3389-402.
  • Austin, C. P., and C. L. Cepko. 1990. Cellular migration patterns in the developing mouse cerebral cortex. Development. 110:713-732.
  • Ausubel, F. M., R. Brent, R. E. Kingston, D. D. Moore, et al. 1987. Current protocols in molecular biology. John Wiley & Sons, New York.
  • Barany, F. 1991. Genetic disease detection and DNA amplification using cloned themnostable ligase. Proc Natl Acad Sci U S A. 88:189-93.
  • Barker, C., A. Makris, C. Patriotis, S. E. Bear, et al. 1993. Identification of the gene encoding the mitochondrial elongation factor G in mammals. Nucleic Acids Res. 21:2641-7.
  • Bartel, D. P., and J. W. Szostak. 1993. Isolation of new ribozymes from a large pool of random sequences [see comment]. Science. 261:1411-8.
  • Bartel, P., C. T. Chien, R. Sternglanz, and S. Fields. 1993. Elimination of false positives that arise in using the two-hybrid system. Biotechniques. 14:920-4.
  • Beal, P. A., and P. B. Dervan. 1991. Second structural motif for recognition of DNA by oligonucleotide- directed triple-helix formation. Science. 251:1360-3.
  • Bechtold, N., and G. Pelletier. 1998. In planta Agrobacterium-mediated transformation of adult Arabidopsis thaliana plants by vacuum infiltration. Methods Mol Biol. 82:259-66.
  • Becker, D. M., and L. Guarente. 1991. High-efficiency transformation of yeast by electroporation. Methods Enzymol. 194:182-187.
  • Beggs, J. D. 1978. Transformation of yeast by a replicating hybrid plasmid. Nature. 275:104-109.
  • Berg, J. S., B. H. Derfler, C. M. Pennisi, D. P. Corey, et al. 2000. Myosin-X, a novel myosin with pleckstrin homology domains, associates with regions of dynamic actin. J Cell Sci. 113 Pt 19:3439-51.
  • Berger, J., J. Hauber, R. Hauber, R. Geiger, et al. 1988. Secreted placental alkaline phosphatase: A powerful new qunatitative indicator of gene expression in eukaryotic cells. Gene. 66:1-10.
  • WO 93/04169. GENE TARGETING IN ANIMAL CELLS USING ISOGENIC DNA CONSTRUCTS. 1993.
  • Bodine, D. M., K. T. McDonagh, N. E. Seidel, and A. W. Nienhuis. 1991. Survival and retrovirus infection of murine hematopoietic stem cells in vitro: effects of 5-FU and method of infection. Exp. Hematol. 19:206-212.
  • Boerner, P., R. Lafond, W. Z. Lu, P. Brams, et al. 1991. Production of antigen-specific human monoclonal antibodies from in vitro-primed human splenocytes. J Immunol. 147:86-95.
  • Bonapace, I. M., M. Sanchez, S. Obici, A. Gallo, et al. 1990. Extinction and activation of the thyroglobulin promoter in hybrids of differentiated and transformed thyroid cells. Mol Cell Biol. 10:1033-40.
  • U.S. Pat. No. 3,773,919. Polylactide-drug mixtures. 1973.
  • Bradley. 1987. Teratocarcinomas and Embryonic Stem Cells: A Practical Approach.


Oxford University Press, Inc., Oxford. 268 pp.

  • Bradley, A. 1991. Modifying the mammalian genome by gene targeting. Curr Opin Biotechnol. 2:823-9.
  • Brennan, M., P. F. Davison, and H. Paulus. 1985. Preparation of bispecific antibodies by chemical recombination of monoclonal immunoglobulin G1 fragments. Science. 229:81-3.
  • WO94/10300. INTERACTION TRAP SYSTEM FOR ISOLATING NOVEL PROTEINS. 1994.
  • Capecchi, M. R. 1980. High efficiency transformation by direct microinjection of DNA into cultured mammalian cells. Cell. 22:479.
  • Capecchi, M. R. 1989. Altering the genome by homologous recombination. Science. 244:1288-92.
  • Carell, T., E. A. Wintner, and J. Rebek Jr. 1994a. A novel procedure for the synthesis of libraries containing small organic molecules. Angewandte Chemie International Edition. 33:2059-2061.
  • Carell, T., E. A. Wintner, and J. Rebek Jr. 1994b. A solution phase screening procedure for the isolation of active compounds from a molecular library. Angewandte Chemie International Edition. 33:2061-2064.
  • Caron, P. C., W. Laird, M. S. Co, N. M. Avdalovic, et al. 1992. Engineered humanized dimeric forms of IgG are more effective antibodies. J Exp Med. 176:1191-5.
  • Carter, P. 1986. Site-directed mutagenesis. Biochem J. 237:1-7.
  • Case, M. E., M. Schweizer, S. R. Kushner, and N. H. Giles. 1979. Efficient transformation of Neurospora crassa by utilizing hybrid plasmid DNA. Proc Natl Acad Sci U S A. 76:5259-63.
  • U.S. Pat. No. 5,116,742. RNA ribozyme restriction endoribonucleases and methods. 1992.
  • U.S. Pat. No. 4,987,071. RNA ribozyme polymerases, dephosphorylases, restriction endoribonucleases and methods. 1991.
  • Cepko, C. L., B. E. Roberts, and R. E. Mulligan. 1984. Construction and applications of a highly transmissible murine retrovirus shuttle vector. Cell. 37:1053-1062.
  • Chalfie, M., Y. tu, G. Euskirchen, W. W. Ward, et al. 1994. Green fluorescent protein as a marker for gene expression. Science. 263:802-805.
  • Chaney, W. G., D. R. Howard, J. W. Pollard, S. Sallustio, et al. 1986. High-frequency transfection of CHO cells using Polybrene. Somatic Cell Mol. Genet. 12:237.
  • Chen, C., and H. Okayama. 1988. Calcium phosphate-mediated gene transfer: A highly efficient system for stably transforming cells with plasmid DNA. BioTechniques. 6:632-638.
  • Chen, S. H., H. D. Shine, J. C. Goodman, R. G. Grossman, et al. 1994. Gene therapy for brain tumors: regression of experimental gliomas by adenovirus-mediated gene transfer in vivo. Proc Natl Acad Sci U S A. 91:3054-7.
  • Cho, C. Y., E. J. Moran, S. R. Cherry, J. C. Stephans, et al. 1993. An unnatural biopolymer. Science. 261:1303-5.
  • Cohen, A. S., D. L. Smisek, and B. H. Wang. 1996. Emerging technologies for sequencing antisense oligonucleotides: capillary electrophoresis and mass spectrometry. Adv Chromatogr. 36:127-62.
  • Cohen, J. S. 1989. Oligodeoxynucleotides: Antisense inhibitors of gene expression. CRC Press, Boca Raton, Fla. 255 pp.
  • Cohen, S. M. N., A. C. Y. Chang, and L. Hsu. 1972. Nonchromosomal antibiotic resistance in bacteria: Genetic transformation of Escherichia coli by R-factor DNA. Proc. Natl. Acad. Sci. USA. 69:2110.
  • Cooney, M., G. Czernuszewicz, E. H. Postel, S. J. Flint, et al. 1988. Site-specific oligonucleotide binding represses transcription of the human c-myc gene in vitro. Science. 241:456-9.
  • Cotton, R. G. 1993. Current methods of mutation detection. Mutat Res. 285:125-44.
  • Cronin, M. T., R. V. Fucini, S. M. Kim, R. S. Masino, et al. 1996. Cystic fibrosis mutation detection by hybridization to light-generated DNA probe arrays. Hum Mutat. 7:244-55.
  • Cull, M. G., J. F. Miller, and P. J. Schatz. 1992. Screening for receptor ligands using large libraries of peptides linked to the C terminus of the lac repressor. Proc Natl Acad Sci USA. 89:1865-9.
  • Cwirla, S. E., E. A. Peters, R. W. Barrett, and W. J. Dower. 1990. Peptides on phage: a vast library of peptides for identifying ligands. Proc Natl Acad Sci U S A. 87:6378-82.
  • de Boer, A. G. 1994. Drug absorption enhancement: Concepts, possibilities, limitations and trends. Harwood Academic Publishers, Langhorne, Pa.
  • de Louvencourt, L., H. Fukuhara, H. Heslot, and M. Wesolowski. 1983. Transformation of Kluyveromyces lactis by killer plasmid DNA. J Bacteriol. 154:737-42.
  • de Wet, J. R., K. V. Wood, M. DeLuca, D. R. Helinski, et al. 1987. Sturcture and expression in mammalian cells. Mol. Cell Biol. 7:725-737.
  • Demerec, M., E. A. Adelberg, A. J. Clark, and P. E. Hartman. 1966. A proposal for a uniform nomenclature in bacterial genetics. Genetics. 54:61-76.
  • Devlin, J. J., L. C. Panganiban, and P. E. Devlin. 1990. Random peptide libraries: a source of specific protein binding molecules. Science. 249:404-6.
  • DeWitt, S. H., J. S. Kiely, C. J. Stankovic, M. C. Schroeder, et al. 1993. “Diversomers”: an approach to nonpeptide, nonoligomeric chemical diversity. Proc Natl Acad Sci U SA. 90:6909-13.
  • Eichelbaum, M., and B. Evert. 1996. Influence of pharmacogenetics on drug disposition and response. Clin Exp Phannacol Physiol. 23:983-5.
  • Ellington, A. D., and J. W. Szostak. 1990. In vitro selection of RNA molecules that bind specific ligands. Nature. 346:818-22.
  • Elroy-Stein, O., and B. Moss. 1990. Cytoplasmic expression system based on constitutive synthesis of bacteriophage T7 RNA polymerase in mammalian cells. Proc. Natl. Acad. Sci. USA. 87:6743-6747.
  • U.S. Pat. No. 4,522,811. Serial injection of muramyldipeptides and liposomes enhances the anti-infective activity of muramyldipeptides Serial injection of muramyldipeptides and liposomes enhances the anti-infective activity of muramyldipeptides. 1985.
  • Eppstein, D. A., Y. V. Marsh, M. van der Pas, P. L. Feigner, et al. 1985. Biological activity of liposome-encapsulated murine interferon gamma is mediated by a cell membrane receptor. Proc Natl Acad Sci U S A. 82:3688-92.
  • Escudero, J., and B. Hohn. 1997. Transfer and integration of T-DNA without cell injury in the host plant. Plant Cell. 9:2135-2142.
  • U.S. Pat. No.4,870,009. Method of obtaining gene product through the generation of transgenic animals. 1989.
  • Fekete, D. M., and C. L. Cepko. 1993. Retroviral infection coupled with tissue transplantation limits gene transfer in the chick embryo. Proc. Natl. Acad. Sci. USA. 90:2350-2354.
  • Felgner, P. L., T. R. Gadek, M. Holm, R. Roman, et al. 1987. Lipofectin: A highly efficient, lipid-mediated DNA/transfection procedure. Proc. Natl. Acad. Sci. USA. 84:7413-7417.
  • Felici, F., L. Castagnoli, A. Musacchio, R. Jappelli, et al. 1991. Selection of antibody ligands from a large library of oligopeptides expressed on a multivalent exposition vector. J Mol Biol. 222:301-10.
  • Fieck, A., D. L. Wyborski, and J. M. Short. 1992. Modifications of the E. coli Lac repressor for expression in eukaryotic cells: effects of nuclear signal sequences on protein activity and nuclear accumulation. Nucleic Acids Res. 20:1785-91.
  • Finer, J. J., K. R. Finer, and T. Ponappa. 1999. Particle bombardment-mediated transformation. Current Topics in microbiology and immunology. 240:59-80.
  • Finn, P. J., N. J. Gibson, R. Fallon, A. Hamilton, et al. 1996. Synthesis and properties of DNA-PNA chimeric oligomers. Nucleic Acids Res. 24:3357-63.
  • Fishwild, D. M., S. L. O'Donnell, T. Bengoechea, D. V. Hudson, et al. 1996. High-avidity human IgG kappa monoclonal antibodies from a novel strain of minilocus transgenic mice [see comments]. Nat Biotechnol. 14:845-51.
  • Fleer, R., P. Yeh, N. Amellal, I. Maury, et al. 1991. Stable multicopy vectors for high-level secretion of recombinant human serum albumin by Kluyveromyces yeasts. Biotechnology (N Y). 9:968-75.
  • Fodor, S. P., R. P. Rava, X. C. Huang, A. C. Pease, et al. 1993. Multiplexed biochemical assays with biological chips. Nature. 364:555-6.
  • Fromm, M., L. P. Taylor, and V. Walbot. 1985. Expression of genes transferred into monocot and dicot plant cells by electroporation. Proc. Natl. Acad. Sci. USA. 82:5824-5828.
  • Fujita, T., H. Shubiya, T. Ohashi, K. Yamanishi, et al. 1986. Regulation of human interleukin-2 gene: Functional DNA sequences in the 5′ flanking region for the gene expression in activated T lymphocytes. Cell. 46:401-407.
  • Gabizon, A., R. Shiota, and D. Papahadjopoulos. 1989. Pharmacokinetics and tissue distribution of doxorubicin encapsulated in stable liposomes with long circulation times. J Natl Cancer Inst. 81:1484-8.
  • Gallagher, S. R. 1992. GUS protocols: Using the GUS gene as a reporter of gene expression. Academic Press, San Diego, Calif.
  • Gallop, M. A., R. W. Barrett, W. J. Dower, S. P. Fodor, et al. 1994. Applications of combinatorial technologies to drug discovery. 1. Background and peptide combinatorial libraries. J Med Chem. 37:1233-51.
  • Gasparini, P., A. Bonizzato, M. Dognini, and P. F. Pignatti. 1992. Restriction site generating-polymerase chain reaction (RG-PCR) for the probeless detection of hidden genetic variation: application to the study of some common cystic fibrosis mutations. Mol Cell Probes. 6:1-7.
  • Gautier, C., F. Morvan, B. Rayner, T. Huynh-Dinh, et al. 1987. Alpha-DNA. IV: Alpha-anomeric and beta-anomeric tetrathymidylates covalently linked to intercalating oxazolopyridocarbazole. Synthesis, physicochemical properties and poly (rA) binding. Nucleic Acids Res. 15:6625-41.
  • Gennaro, A. R. 2000. Remington: The science and practice of pharmacy. Lippincott, Williams & Wilkins, Philadelphia, Pa.
  • Gibbs, R. A., P. N. Nguyen, and C. T. Caskey. 1989. Detection of single DNA base differences by competitive oligonucleotide priming. Nucleic Acids Res. 17:2437-48.
  • Gietz, R. D., R. A. Woods, P. Manivasakam, and R. H. Schiestl. 1998. Growth and transformation of Saccharomyces cerevisiae. In Cells: A laboratory manual. Vol. I. D. Spector, R. Goldman, and L. Leinwand, editors. Cold Spring Harbor Press, Cold Spring Harbor, N.Y.
  • Goding, J. W. 1996. Monoclonal antibodies: Principles and Practice. Academic Press, San Diego. 492 pp.
  • Gorman, C. M., L. F. Moffat, and B. H. Howard. 1982. Recombinant genomes which express chloramphenicol acetyltransferase in mammalian cells. Mol. Cell. Biol. 2:1044-1051.
  • Graham, F. L., and A. J. van der Eb. 1973. A new technique for the assay of infectivity of human adenovirus 5 DNA. Virology. 52:456-.
  • Griffin, H. G., and A. M. Griffin. 1993. DNA sequencing. Recent innovations and future trends. Appl Biochem Biotechnol. 38:147-59.
  • Grompe, M., D. M. Muzny, and C. T. Caskey. 1989. Scanning detection of mutations in human ornithine transcarbamoylase by chemical mismatch cleavage. Proc Natl Acad Sci USA. 86:5888-92.
  • Gruber, M., B. A. Schodin, E. R. Wilson, and D. M. Kranz. 1994. Efficient tumor cell lysis mediated by a bispecific single chain antibody expressed in Escherichia coli. J Immunol. 152:5368-74.
  • Guatelli, J. C., K. M. Whitfield, D. Y. Kwoh, K. J. Barringer, et al. 1990. Isothermal, in vitro amplification of nucleic acids by a multienzyme reaction modeled after retroviral replication. Proc Natl Acad Sci USA. 87:1874-8.
  • Hanahan, D. 1983. Studies on transformation of Escherichia coli with plasmids. J. Mol. Biol. 166:557-580.
  • Hansen, G., and M.-D. Chilton. 1999. Lessons in gene transfer to plants by a gifted microbe. Curr. Top. Microbiol. Immunol. 240:21-57.
  • Hansen, G., and M. S. Wright. 1999. Recent advances in the transformation of plants. Trends Plant Sci. 4:226-231.
  • Harlow, E., and D. Lane. 1988. Antibodies: A laboratory manual. Cold Spring Harbor Laboratory Press, Cold Spring Harbor. 726 pp.
  • Harlow, E., and D. Lane. 1999. Using antibodies: A laboratory manual. Cold Spring Harbor Laboratory PRess, Cold Spring Harbor, N.Y.
  • Haseloff, J., and W. L. Gerlach. 1988. Simple RNA enzymes with new and highly specific endoribonuclease activities. Nature. 334:585-91.
  • Hayashi, K. 1992. PCR-SSCP: A method for detection of mutations. Genetic and Analytical Techniques Applications. 9:73-79.
  • Helene, C. 1991. The anti-gene strategy: control of gene expression by triplex-forming-oligonucleotides. Anticancer Drug Des. 6:569-84.
  • Helene, C., N. T. Thuong, and A. Harel-Bellan. 1992. Control of gene expression by triple helix-forming oligonucleotides. The antigene strategy. Ann N Y Acad Sci. 660:27-36.
  • Hibi, M., and T. Hirano. 2000. Gab-family adapter molecules in signal transduction of cytokine and growth factor receptors, and T and B cell antigen receptors. Leuk Lymphoma. 37:299-307.
  • Hinnen, A., J. B. Hicks, and G. R. Fink. 1978. Transformation of yeast. Proc. Natl. Acad. Sci. USA. 75:1929-1933.
  • Hoffman, F. 1996. Laser microbeams for the manipulation of plant cells and subcellular structures. Plant Sci. 113:1 -11.
  • Hogan, B., Beddington, R., Costantini, F., Lacy, E. 1994. Manipulating the Mouse Embryo: A Laboratory Manual. Cold Spring Harbor Laboratory Press. 500 pp.
  • Holliger, P., T. Prospero, and G. Winter. 1993. “Diabodies”: small bivalent and bispecific antibody fragments. Proc Natl Acad Sci U S A. 90:6444-8.
  • Hoogenboom, H. R., A. D. Griffiths, K. S. Johnson, D. J. Chiswell, et al. 1991. Multi-subunit proteins on the surface of filamentous phage: methodologies for displaying antibody (Fab) heavy and light chains. Nucleic Acids Res. 19:4133-7.
  • Houghten, R. A., J. R. Appel, S. E. Blondelle, J. H. Cuervo, et al. 1992. The use of synthetic peptide combinatorial libraries for the identification of bioactive peptides. Biotechniques. 13:412-21.
  • Hsu, I. C., Q. Yang, M. W. Kahng, and J. F. Xu. 1994. Detection of DNA point mutations with DNA mismatch repair enzymes. Carcinogenesis. 15:1657-62.
  • Hwang, K. J., K. F. Luk, and P. L. Beaumier. 1980. Hepatic uptake and degradation of unilamellar sphingomyelin/cholesterol liposomes: a kinetic study. Proc Natl Acad Sci U S A. 77:4030-4.
  • Hyrup, B., and P. E. Nielsen. 1996. Peptide nucleic acids (PNA): synthesis, properties and potential applications. Bioorg Med Chem. 4:5-23.
  • Inoue, H., Y. Hayase, A. Imura, S. Iwai, et al. 1987a. Synthesis and hybridization studies on two complementary nona(2′-O- methyl)ribonucleotides. Nucleic Acids Res. 15:6131-48.
  • Inoue, H., Y. Hayase, S. Iwai, and E. Ohtsuka. 1987b. Sequence-dependent hydrolysis of RNA using modified oligonucleotide splints and RNase H. FEBS Lett. 215:327-30.
  • Ishiura, M., S. Hirose, T. Uchida, Y. Hamada, et al. 1982. Phage particle-mediated gene transfer to cultured mammalian cells. Molecular and Cellular Biology. 2:607-616.
  • Ito, H., Y. Fukuda, K. Murata, and A. Kimura. 1983. Transformation of intact yeast cells treated with alkali cations. J. Bacteriol. 153:163-168.
  • Iwabuchi, K., B. Li, P. Bartel, and S. Fields. 1993. Use of the two-hybrid system to identify the domain of p53 involved in oligomerization. Oncogene. 8:1693-6.
  • Jayasena, S. D. 1999. Aptamers: an emerging class of molecules that rival antibodies in diagnostics. Clin Chem. 45:1628-50.
  • Jones, P. T., P. H. Dear, J. Foote, M. S. Neuberger, et al. 1986. Replacing the complementarity-determining regions in a human antibody with those from a mouse. Nature. 321:522-5.
  • Kahn, J., F. Mehraban, G. Ingle, X. Xin, et al. 2000. Gene expression profiling in an in vitro model of angiogenesis. Am J Pathol. 156:1887-900.
  • Kaufman, R. J. 1990. Vectors used for expression in mammalian cells. Methods Enzymol. 185:487-511.
  • Kaufman, R. J., P. Murtha, D. E. Ingolia, C.-Y. Yeung, et al. 1986. Selection and amplification of heterologous genes encoding adenosine deaminase in mammalian cells. Proc. Natl. Acad. Sci. USA. 83:3136-3140.
  • Kawai, S., and M. Nishizawa. 1984. New procedure for DNA transfection with polycation and dimethyl sulfoxide. Mol. Cell. Biol. 4:1172.
  • Keen, J., D. Lester, C. Inglehearn, A. Curtis, et al. 1991. Rapid detection of single base mismatches as heteroduplexes on Hydrolink gels. Trends Genet. 7:5.
  • Kelly, J. M., and M.J. Hynes. 1985. Transformation of Aspergillus niger by the amdS gene of Aspergillus nidulans. Embo J. 4:475-9.
  • Kostelny, S. A., M. S. Cole, and J. Y. Tso. 1992. Formation of a bispecific antibody by the use of leucine zippers. J Immunol. 148:1547-53.
  • WO94/16101. DNA SEQUENCING BY MASS SPECTROMETRY. 1994.
  • Kozal, M. J., N. Shah, N. Shen, R. Yang, et al. 1996. Extensive polymorphisms observed in HIV-1 clade B protease gene using high-density oligonucleotide arrays. Nat Med. 2:753-9.
  • Kozbor, D., P. Tripputi, J. C. Roder, and C. M. Croce. 1984. A human hybrid myeloma for production of human monoclonal antibodies. J Immunol. 133:3001-5.
  • Kriegler, M. 1990. Gene transfer and expression: A laboratory manual. Stockton Press, New York. 242 pp.
  • WO 91/01140. HOMOLOGOUS RECOMBINATION FOR UNIVERSAL DONOR CELLS AND CHIERIC MAMMALIAN HOSTS. 1991.
  • Kwoh, D. Y., G. R. Davis, K. M. Whitfield, H. L. Chappelle, et al. 1989. Transcription-based amplification system and detection of amplified human immunodeficiency virus type I with a bead-based sandwich hybridization format. Proc Natl Acad Sci USA. 86:1173-7.
  • U.S. Pat. No. 5,223,409. Directed evolution of novel binding proteins. 1993.
  • Lakso, M., B. Sauer, B. Mosinger, E. J. Lee, et al. 1992. Targeted oncogene activation by site-specific recombination in transgenic mice. Proc Natl Acad Sci U S A. 89:6232-6.
  • Lam, K. S. 1997. Application of combinatorial library methods in cancer research and drug discovery. Anticancer Drug Design. 12:145-167.
  • Lam, K. S., S. E. Salmon, E. M. Hersh, V. J. Hruby, et al. 1991. General method for rapid synthesis of multicomponent peptide mixtures. Nature. 354:82-84.
  • Landegren, U., R. Kaiser, J. Sanders, and L. Hood. 1988. A ligase-mediated gene detection technique. Science. 241:1077-80.
  • Lansink, M., P. Koolwijk, V. van Hinsbergh, and T. Kooistra. 1998. Effect of steroid hormones and retinoids on the formation of capillary-like tubular structures of human microvascular endothelial cells in fibrin matrices is related to urokinase expression. Blood. 92:927-38.
  • WO 90/11354. Process for the specific replacement of a copy of a gene present in the receiver genome via the integration of a gene. 1990.
  • U.S. Pat. No. 4,736,866. Transgenic non-human animals. 1988.
  • Leduc, N., and e. al. 1996. Isolated maize zygotes mimic in vivo embryogenic development and express microinjected genes when cultured in vitro. Dev. Biol. 10:190-203.
  • Lee, J. S., D. A. Johnson, and A. R. Morgan. 1979. Complexes formed by (pyrimidine)n . (purine)n DNAs on lowering the pH are three-stranded. Nucleic Acids Res. 6:3073-91.
  • Lee, V. H. L. 1990. Peptide and protein drug delivery. Marcel Dekker, New-York, N.Y.
  • Lemaitre, M., B. Bayard, and B. Lebleu. 1987. Specific antiviral activity of a poly(L-lysine)-conjugated oligodeoxyribonucleotide sequence complementary to vesicular stomatitis virus N protein mRNA initiation site. Proc Natl Acad Sci U S A. 84:648-52.
  • Lemischka, I. R., D. H. Raulet, and R. C. Mulligan. 1986. Developmental potential and dynamic behavior of hematopoietic stem cells. Cell. 45:917-927.
  • Letsinger, R. L., G. R. Zhang, D. K. Sun, T. Ikeuchi, et al. 1989. Cholesteryl-conjugated oligonucleotides: synthesis, properties, and activity as inhibitors of replication of human immunodeficiency virus in cell culture. Proc Natl Acad Sci U S A. 86:6553-6.
  • Li, E., T. H. Bestor, and R. Jaenisch. 1992. Targeted mutation of the DNA methyltransferase gene results in embryonic lethality. Cell. 69:915-26.
  • Linder, M. W., R. A. Prough, and R. Valdes. 1997. Pharmacogenetics: a laboratory tool for optimizing therapeutic efficiency. Clin Chem. 43:254-66.
  • Littlefield, J. W. 1964. Selection of hybrids from matings of fibroblasts in vitro and their presumed recombinants. Science. 145:709-710.
  • Lizardi, P. M., C. E. Guerra, H. Lomeli, I. Tussie-Luna, et al. 1988. Exponential amplification of recombinant-RNA hybridization probes. Biotechnology. 6:1197-1202.
  • Lonberg, N., and D. Huszar. 1995. Human antibodies from transgenic mice. Int Rev Immunol. 13:65-93.
  • Lonberg, N., L. D. Taylor, F. A. Harding, M. Trounstine, et al. 1994. Antigen-specific human antibodies from mice comprising four distinct genetic modifications [see comments]. Nature. 368:856-9.
  • Lopata, M. A., D. W. Cleveland, and B. Sollner-Webb. 1984. High-level expression of a chloramphenicol acetyltransferase gene by DEAEdextran-mediated DNA traansfection couled with a dimethylsulfoxide or glycerol shock treatment. Nucleic Acids Research. 12:5707.
  • Louvet-Vallee, S. 2000. ERM proteins: from cellular architecture to cell signaling. Biol Cell. 92:305-16.
  • Luckow, V. A. 1991. Cloning and expression of heterologous genes in insect cells with baculovirus vectors. In Recombinant DNA technology and applications. A. Prokop, R. K. Bajpai, and C. Ho, editors. McGraw-Hill, New York. 97-152.
  • Madura, K., R. J. Dohmen, and A. Varshavsky. 1993. N-recognin/Ubc2 interactions in the N-end rule pathway. J Biol Chem. 268:12046-54.
  • Maher, L. J. 1992. DNA triple-helix formation: an approach to artificial gene repressors? Bioessays. 14:807-15.
  • Mandel, M., and A. Higa. 1970. Calcium-dependent bacteriophage DNA infection. J. Mol. biol. 53:159-162.
  • Marasco, W. A., W. A. Haseltine, and S. Y. Chen. 1993. Design, intracellular expression, and activity of a human anti-human immunodeficiency virus type I gp120 single-chain antibody. Proc Natl Acad Sci USA. 90:7889-93.
  • Marks, J. D., A. D. Griffiths, M. Malmqvist, T. P. Clackson, et al. 1992. By-passing immunization: building high affinity human antibodies by chain shuffling. Biotechnology (N Y). 10:779-83.
  • Marks, J. D., H. R. Hoogenboom, T. P. Bonnert, J. McCafferty, et al. 1991. By-passing immunization. Human antibodies from V-gene libraries displayed on phage. J Mol Biol. 222:581-97.
  • Martin, F. J., and D. Papahadjopoulos. 1982. Irreversible coupling of immunoglobulin fragments to preformed vesicles. An improved method for liposome targeting. J Biol Chem. 257:286-8.
  • Maxam, A. M., and W. Gilbert. 1977. A new method for sequencing DNA. Proc Natl Acad Sci U S A. 74:5604.
  • Miller, A. D., and C. Buttimore. 1986. Redesign of retrovirus packaging cell lines to avoid recombination leading to helper virus production. Mol. Cell biol. 6:2895-2902.
  • Miller, L. K. 1988. Baculoviruses as gene expression vectors. Annu. Rev. Microbiol. 42:177-199.
  • Milstein, C., and A. C. Cuello. 1983. Hybrid hybridomas and their use in immunohistochemistry. Nature. 305:537-40.
  • U.S. Pat. No. 5,459,039. Methods for mapping genetic mutations. 1995.
  • Morrison, S. L., L. Wims, S. Wallick, L. Tan, et al. 1987. Genetically engineered antibody molecules and their application. Ann N Y Acad Sci. 507:187-98.
  • U.S. Pat. No. 4,683,202. Process for amplifying nucleic acid sequences. 1987.
  • U.S. Pat. No. 4,683,195. Process for amplifying, detecting, and/or cloning nucleic acid sequences. 1987.
  • Munson, P. J., and D. Rodbard. 1980. Ligand: a versatile computerized approach for characterization of ligand-binding systems. Anal Biochem. 107:220-39.
  • Myers, R. M., Z. Larin, and T. Maniatis. 1985. Detection of single base substitutions by ribonuclease cleavage at mismatches in RNA:DNA duplexes. Science. 230:1242-6.
  • U.S. Pat. No. 5,328,470. Treatment of diseases by site-specific instillation of cells or site-specific transformation of cells and kits therefor. 1994.
  • Naeve, C. W., G. A. Buck, R. L. Niece, R. T. Pon, et al. 1995. Accuracy of automated DNA sequencing: a multi-laboratory comparison of sequencing results. Biotechniques. 19:448-53.
  • Nakai, K., and P. Horton. 1999. PSORT: a program for detecting sorting signals in proteins and predicting their subcellular localization. Trends Biochem Sci. 24:34-6.
  • Nakazawa, H., D. English, P. L. Randell, K. Nakazawa, et al. 1994. UV and skin cancer: specific p53 gene mutation in normal skin as a biologically relevant exposure measurement. Proc Natl Acad Sci U SA. 91:360-4.
  • Neumann, E., M. Schaefer-Ridder, Y. Wang, and P. H. Hofschneider. 1982. Gene transfer into mouse lyoma cells by electroporation in high electric fields. EMBO J. 1:841-845.
  • O'Gorman, S., D. T. Fox, and G. M. Wahl. 1991. Recombinase-mediated gene activation and site-specific integration in mammalian cells. Science. 251:1351-5.
  • Okabe, S., T. Fukuda, K. Ishibashi, S. Kojima, et al. 1998. BAZF, a novel Bcl6 homolog, functions as a transcriptional repressor. Mol Cell Biol. 18:4235-44.
  • Okano, H., J. Aruga, T. Nakagawa, C. Shiota, et al. 1991. Myelin basic protein gene and the function of antisense RNA in its repression in myelin-deficient mutant mouse. J Neurochem. 56:560-7.
  • O'Reilly, D. R., L. K. Miller, and V. A. Luckow. 1992. Baculovirus expression vectors. W.H. Freeman and Company, New York.
  • Orita, M., H. Iwahana, H. Kanazawa, K. Hayashi, et al. 1989. Detection of polymorphisms of human DNA by gel electrophoresis as single-strand conformation polymorphisms. Proc Natl Acad Sci U S A. 86:2766-70.
  • Ou-Lee, T. M., R. Turgeon, and R. Wu. 1986. Uptake and expression of a foreign gene linked to either a plant virus or Drosophila promoter in protoplasts of rice, wheat and sorghum. Proc. Natl. Acad. Sci. USA. 83:6815-6819.
  • Palmer, T. D., R. A. Hock, W. R. A. osborne, and A. D. Miller. 1987. Efficient retrovirus-mediated transfer and expression of a human adenosine deaminase gene in diploid skin fibroblasts from an adenosie-deficient human. Proc. Natl. Acad. Sci. USA. 84:1055-1059.
  • Pear, W., G. Nolan, M. Scott, and D. Baltimore. 1993. Production of high-titer helper-free retroviruses by transient transfection. Proc. Natl. Acad. Sci. USA. 90:8392-8396.
  • Perry-O'Keefe, H., X. W. Yao, J. M. Coull, M. Fuchs, et al. 1996. Peptide nucleic acid pre-gel hybridization: an alternative to southern hybridization. Proc Natl Acad Sci U S A. 93:14670-5.
  • Petersen, K. H., D. K. Jensen, M. Egholm, O. Buchardt, et al. 1976. A PNA-DNA linker synthesis of N-((4,4′-dimethoxytrityloxy)ehtyl)-N-(thymin-1-ylacetyl)glycine. Biorganic and Medicianl Chemistry Letters. 5:1119-1124.
  • Potter, H. 1988. Electroporation in biology: Methods, applications,, and instrumentation. Analytical Biochemistry. 174:361-373.
  • Potter, H., L. Weir, and P. Leder. 1984. Enhancer-dependent expression of human kappa immunoglobulin genes introduced into mouse pre-B lymphocytes by electroporation. Proc. Natl. Acad. Sci. USA. 81:7161-7165.
  • Presta, L. G. 1992. Antibody engineering. Curr Opin Biotechnol. 3:394-8.
  • Prosser, J. 1993. Detecting single-base mutations. Trends Biotechnol. 11:238-46.
  • Rassoulzadegan, M., B. Binetruy, and F. Cuzin. 1982. High frequency of gene transfer after fusion between bacteria and eukaryotic cells. Nature. 295:257.
  • Reisfeld, R. A., and S. Sell. 1985. Monoclonal antibodies and cancer therapy: Proceedings of the Roche-UCLA symposium held in Park City, Utah, Jan. 26-Feb. 2, 1985. Alan R. Liss, New York. 609 pp.
  • Rhodes, C. A., D. A. Pierce, I. J. Mettler, D. Mascarenhas, et al. 1988. Genetically transformed maize plants from protoplasts. Science. 240:204-207.
  • Riechmann, L., M. Clark, H. Waldmann, and G. Winter. 1988. Reshaping human antibodies for therapy. Nature. 332:323-7.
  • Risau, W. 1995. Differentiation of endothelium. Faseb J. 9:926-33.
  • Risau, W., and I. Flamme. 1995. Vasculogenesis. Annu Rev Cell Dev Biol. 11:73-91.
  • Rojas, K., L. Serrano de la Pena, T. Gallardo, A. Simmons, et al. 1999. Physical map and characterization of transcripts in the candidate interval for familial chondrocalcinosis at chromosome 5p15.1. Genomics. 62:177-83.
  • Rose, J. K., L. Buonocore, and M. Whitt. 1991. A new cationic liposome reagent mediating nearly quantitative transfection of animal cells. BioTechniques. 10:520-525.
  • Rossi, J. J. 1994. Practical ribozymes. Making ribozymes work in cells. Curr Biol. 4:469-71.
  • Rossiter, B. J., and C. T. Caskey. 1990. Molecular scanning methods of mutation detection. J Biol Chem. 265:12753-6.
  • U.S. Pat. No. 5,283,317. Intermediates for conjugation of polypeptides with high molecular weight polyalkylene glycols. 1994.
  • Saiki, R. K., T. L. Bugawan, G. T. Horn, K. B. Mullis, et al. 1986. Analysis of enzymatically amplified beta-globin and HLA-DQ alpha DNA with allele-specific oligonucleotide probes. Nature. 324:163-6.
  • Saiki, R. K., P. S. Walsh, C. H. levenson, and H. A. Erlich. 1989. Genetic analysis of amplified DNA with immobilized sequence-specific oligonucleotide probes. Proc Natl Acad Sci U S A. 86:6230-4.
  • Saleeba, J. A., and R. G. Cotton. 1993. Chemical cleavage of mismatch to detect mutations. Methods Enzymol. 217:286-95.
  • Sambrook, J. 1989. Molecular cloning: a laboratory manual. Cold Spring Harbor Laboratory, Cold Spring Harbor.
  • Sandri-Goldin, R. M., A. L. Goldin, J. C. Glorioso, and M. Levine. 1981. High-frequency transfer of cloned herpes simjplex virus type I sequences to mammalian cells by protoplast fusion. Mol. Cell. Biol. 1:7453-752.
  • Sanger, F., S. Nicklen, and A. R. Coulson. 1977. DNA sequencing with chain-terminating inhibitors. Proc Natl Acad Sci U S A. 74:5463-7.
  • Saunders, J. A., B. F. Matthews, and P. D. Miller. 1989. Plant gene transfer using electrofusion and electroporation. In Electroporation and electrofusion in cell biology. E. Neumann, A. E. Sowers, and C. A. Jordan, editors. Plenum Press, New York. 343-354.
  • Schade, R., C. Staak, C. Hendriksen, M. Erhard, et al. 1996. The production of avian (egg yold) antibodies: IgY. The report and recommendations of ECVAM workshop. Alternatives to Laboratory Animals (ATLA). 24:925-934.
  • Schaffner, W. 1980. Direct transfer of cloned genes from bacteria to mammalian cells. Proc. Natl. Acad. Sci. USA. 77:2163.
  • Schook, L. B. 1987. Monoclonal antibody production techniques and applications. Marcel Dekker, Inc., New York. 336 pp.
  • Scott, J. K., and G. P. Smith. 1990. Searching for peptide ligands with an epitope library. Science. 249:386-90.
  • Selden, R. F., K. Burke-Howie, M. E. Rowe, H. M. Goodman, et al. 1986. Human growth hormone as a reporter gene in regulation studies employing transient gene expression. Molecular and Cellular Biololgy. 6:3173-3179.
  • Shalaby, M. R., H. M. Shepard, L. Presta, M. L. Rodrigues, et al. 1992. Development of humanized bispecific antibodies reactive with cytotoxic lymphocytes and tumor cells overexpressing the HER2 protooncogene. J Exp Med. 175:217-25.
  • Shigekawa, K., and W. J. Dower. 1988. Electroporation of eukaryotes and prokaryotes: A general approach to the introduction of macomolecules into cells. BioTechniques. 6:742-751.
  • Shillito, R. 1999. Methods of genetic transformations: Electroporation and polyethylene glycol treatment. In Molecular improvement of cereal crop. I. Vasil, editor. Kluwer, Dordrecht, The Netherlands. 9-20.
  • Shilo, B. Z., and R. A. Weinberg. 1981. DNA sequences homologous to vertebrate oncogenes are conserved in Drosophila melanogaster. Proc Natl Acad Sci U S A. 78:6789-92.
  • Shimkets, R. A., D. G. Lowe, J. T. Tai, P. Sehl, et al. 1999. Gene expression analysis by transcript profiling coupled to a gene database query. Nat Biotechnol. 17:798-803.
  • Shopes, B. 1992. A genetically engineered human IgG mutant with enhanced cytolytic activity. J Immunol. 148:2918-22.
  • Simonsen, C. C., and A. D. Levinson. 1983. Isolation and expression of an altered mouse dihydrofolate reductase cDNA. Proc. Natl. Acad. Sci. USA. 80:2495-2499.
  • U.S. Pat. No. 5,272,057. Method of detecting a predisposition to cancer by the use of restriction fragment length polymorphism of the gene for human poly (ADP-ribose) polymerase. 1993.
  • Southern, P. J., and P. Berg. 1982. Transformation of mammalian cells to antibiotic resistanced with a bacterial gene under control of the SV40 early region promoter. J. Mol. Appl. Gen. 1:327-341.
  • Sreekrishna, K., R. H. Potenz, J. A. Cruze, W. R. McCombie, et al. 1988. High level expression of heterologous proteins in methylotrophic yeast Pichia pastoris. J Basic Microbiol. 28:265-78.
  • Stein, C. A., and J. S. Cohen. 1988. Oligodeoxynucleotides as inhibitors of gene expression: a review. Cancer Res. 48:2659-68.
  • Stevenson, G. T., A. Pindar, and C. J. Slade. 1989. A chimeric antibody with dual Fc regions (bisFabFc) prepared by manipulations at the IgG hinge. Anticancer Drug Des. 3:219-30.
  • Suresh, M. R., A. C. Cuello, and C. Milstein. 1986. Bispecific monoclonal antibodies from hybrid hybridomas. Methods Enzymol. 121:210-28.
  • Taylor, A., K. Obholz, G. Linden, S. Sadiev, et al. 1998. DNA sequence and muscle-specific expression of human sarcosin transcripts. Mol Cell Biochem. 183:105-12.
  • Thomas, K. R., and M. R. Capecchi. 1987. Site-directed mutagenesis by gene targeting in mouse embryo-derived stem cells. Cell. 51:503-12.
  • Thompson, J. A., and e. al. 1995. Maize transformation utilizing silicon carbide whiskers: A review. Euphytica. 85:75-80.
  • Tilbum, J., C. Scazzocchio, G. G. Taylor, J. H. Zabicky-Zissman, et al. 1983. Transformation by integration in Aspergillus nidulans. Gene. 26:205-21.
  • Touraev, A., and e. al. 1997. Plant male germ line transformation. Plant J. 12:949-956.
  • Traunecker, A., F. Oliveri, and K. Karjalainen. 1991. Myeloma based expression system for production of large mammalian proteins. Trends Biotechnol. 9:109-13.
  • Trick, H. N., and e. al. 1997. Recent advances in soybean transformation. Plant Tissue Cult. Biotechnol: 3:9-26.
  • Tuerk, C., and L. Gold. 1990. Systematic evolution of ligands by exponential enrichment: RNA ligands to bacteriophage T4 DNA polymerase. Science. 249:505-10.
  • Turner, D. L., E. Y. Snyder, and C. L. Cepko. 1990. Lineage-independent determinationh of cell type in the embryonic mouse retina. Neuron. 4:833-845.
  • Tutt, A., G. T. Stevenson, and M. J. Glennie. 1991. Trispecific F(ab′)3 derivatives that use cooperative signaling via the TCR/CD3 complex and CD2 to activate and redirect resting cytotoxic T cells. J Immunol. 147:60-9.
  • van der Krol, A. R., J. N. Mol, and A. R. Stuitje. 1988b. Modulation of eukaryotic gene expression by complementary RNA or DNA sequences. Biotechniques. 6:958-76.
  • van der Krol, A. R., J. N. Mol, and A. R. Stuitje. 1988a. Modulation of eukaryotic gene expression by complementary RNA or DNA sequences. Biotechniques. 6:958-76.
  • Verhoeyen, M., C. Milstein, and G. Winter. 1988. Reshaping human antibodies: grafting an antilysozyme activity. Science. 239:1534-6.
  • Vitetta, E. S., R. J. Fulton, R. D. May, M. Till, et al. 1987. Redesigning nature's poisons to create anti-tumor reagents. Science. 238:1098-104.
  • U.S. Pat. No. 4,873,191. Genetic transformation of zygotes. 1989.
  • Wells, J. A., M. Vasser, and D. B. Powers. 1985. Cassette mutagenesis: an efficient method for generation of multiple mutations at defined sites. Gene. 34:315-23.
  • Whitt, M. A., L. Buonocore, J. K. Rose, V. Ciccarone, et al. 1990. TransfectACE reagent promotes transient transfection frequencies greater than 90%. Focus. 13:8-12.
  • Wigler, M., A. Pellicer, S. Silversttein, and R. Axel. 1978. Biochemical transfer of single-copy eucaryotic genes using total cellular DNA as donor. Cell. 14:725.
  • Williams, D. A., I. R. Lemischka, D. G. Nathan, and R. C. Mulligan. 1984. Introduction of a new genetic material into pluripotent haematopoietic stem cells of the mouse. Nature. 310:476-480.
  • Wilmut, I., A. E. Schnieke, J. McWhir, A. J. Kind, et al. 1997. Viable offspring derived from fetal and adult mammalian cells. Nature. 385:810-3.
  • Wolff, E. A., G. J. Schreiber, W. L. Cosand, and H. V. Raff. 1993. Monoclonal antibody homodimers: enhanced antitumor activity in nude mice. Cancer Res. 53:2560-5.
  • Wong, T. K., and E. Neumann. 1982. Electric field mediated gene transfer. Biochemical and Biophysical Research Communications. 107:584-587.
  • Wyborski, D. L., L. C. DuCoeur, and J. M. Short. 1996. Parameters affecting the use of the lac repressor system in eukaryotic cells and transgenic animals. Environ Mol Mutagen. 28:447-58.
  • Wyborski, D. L., and J. M. Short. 1991. Analysis of inducers of the E. coli lac repressor system in mammalian cells and whole animals. Nucleic Acids Res. 19:4647-53.
  • Yang, S., J. Graham, J. W. Kahn, E. A. Schwartz, et al. 1999. Functional roles for PECAM-1 (CD31) and VE-cadherin (CD144) in tube assembly and lumen formation in three-dimensional collagen gels. Am J Pathol. 155:887-95.
  • Yelton, M. M., J. E. Hamer, and W. E. Timberlake. 1984. Transformation of Aspergillus nidulans by using a trpc plasmid. Proc Natl Acad Sci U S A. 81:1470-4.
  • Zervos, A. S., J. Gyuris, and R. Brent. 1993. Mxi1, a protein that specifically interacts with Max to bind Myc-Max recognition sites. Cell. 72:223-32.
  • Zhou, G., and e. al. 1983. Introduction of exogenous DNA into cotton embryos. Methods Enzymol. 101:433-481.
  • Zoller, M. J., and M. Smith. 1987. Oligonucleotide-directed mutagenesis: a simple method using two oligonucleotide primers and a single-stranded DNA template. Methods Enzymol. 154:329-50.
  • Zon, G. 1988. Oligonucleotide analogues as potential chemotherapeutic agents. Pharm Res. 5:539-49.
  • Zuckermann, R. N., E. J. Martin, D. C. Spellmeyer, G. B. Stauber, et al. 1994. Discovery of nanomolar ligands for 7-transmembrane G-protein-coupled receptors from a diverse N-(substituted)glycine peptoid library. J Med Chem. 37:2678-85.


All publications and patents mentioned in the above specification are herein incorporated by reference.

Claims
  • 1-41. (canceled)
  • 42. An isolated polynucleotide encoding a polypeptide comprising an amino acid sequence having at least 80% sequence identity to an amino acid sequence of SEQ ID NO: 4, or a complement of said polynucleotide.
  • 43. An isolated polynucleotide comprising a nucleotide sequence having at least 80% sequence identity to the sequence of SEQ ID NO: 3, or a complement of said polynucleotide.
  • 44. The polynucleotide of claim 43, wherein said nucleotide sequence has at least 90% sequence identity to the sequence of SEQ ID NO: 3, or a complement of said polynucleotide.
  • 45. The polynucleotide of claim 43, wherein said nucleotide sequence has at least 98% sequence identity to the sequence of SEQ ID NO: 3, or a complement of said polynucleotide.
  • 46. The polynucleotide of claim 43, wherein said nucleotide sequence has the nucleotide sequence of SEQ ID NO: 3, or a complement of said polynucleotide.
  • 47. The isolated polynucleotide of claim 42 encoding a polypeptide comprising an amino acid sequence having at least 90% sequence identity to an amino acid sequence of SEQ ID NO: 4, or a complement of said polynucleotide.
  • 48. The isolated polynucleotide of claim 42 encoding a polypeptide comprising an amino acid sequence of SEQ ID NO: 4, or a complement of said polynucleotide.
  • 49. A vector comprising the polynucleotide of claim 42.
  • 50. A cell comprising the vector of claim 49.
  • 51. A method of detecting angiogenesis in a tissue sample, comprising measuring expression of a polynucleotide of claim 42 in the tissue sample, wherein an increase in expression of the polynucleotide is indicative of angiogenesis.
  • 52. The method of claim 51, wherein detecting angiogenesis indicates the presence of cancer or tumor.
  • 53. The method of claim 51, wherein expression of the polynucleotide is measured using a probe.
  • 54. The method of claim 51, wherein expression of the polynucleotide is measured using PCR.
  • 55. A method of identifying an antagonist of angiogenesis, comprising; a) contacting a cell or tissue sample undergoing angiogenesis with a candidate agent; and b) detecting expression of a polynucleotide of claim 42, wherein a candidate compound that decreases expression of the polynucleotide is identified as an antagonist of angiogenesis.
  • 56. A method of screening a tissue sample for tumorigenic potential, comprising measuring expression of a polynucleotide of claim 42 in the tissue sample, wherein an increase in expression of the polynucleotide of claim 42 indicates that the tissue has tumorigenic potential.
  • 57. The method of claim 56, wherein expression of the polynucleotide is measured using a probe.
  • 58. The method of claim 56, wherein expression of the polynucleotide is measured using PCR.
  • 59. A method of determining the clinical stage of a tumor in a subject, comprising: a) detecting expression of a polynucleotide of claim 42 in a tissue sample from the subject; and b) comparing expression of the polypeptide to expression of the polypeptide in a control sample.
  • 60. The method of claim 59, wherein expression of the polynucleotide is measured using a probe.
  • 61. The method of claim 59, wherein expression of the polynucleotide is measured using PCR.
  • 62. A method of monitoring the effectiveness of a treatment for an angiogenic disorder in a subject, comprising detecting expression of a polynucleotide of claim 42 in a tissue sample from the subject, wherein a decrease in expression of the polynucleotide indicates that the treatment is efficacious.
  • 63. The method of claim 62, wherein the angiogenic disorder is cancer or tumor.
  • 64. A transgenic non-human animal, having a disruption in a polynucleotide having at least 80% sequence identity to SEQ ID NO:3, wherein the disruption prevents expression of a polypeptide encoded by the sequence.
  • 65. The transgenic non-human animal of claim 64, wherein the non-human animal is a mouse.
  • 66. A transgenic non-human animal, comprising an exogenous polynucleotide having at least 80% sequence identity to the sequence of SEQ ID NO: 3, or a complement of said polynucleotide.
  • 67. The transgenic non-human animal of claim 66, wherein said exogenous polynucleotide has at least 90% sequence identity to the sequence of SEQ ID NO: 3, or a complement of said polynucleotide.
  • 68. A method of inhibiting angiogenesis comprising inhibiting expression of a polynucleotide of claim 42.
  • 69. The method of claim 68, wherein expression of the polynucleotide of claim 42 is inhibited by an antisense molecule that hybridizes to a polynucleotide of claim 42.
  • 70. The method of claim 68, wherein expression of the polynucleotide of claim 42 is inhibited by an aptamer.
  • 71. A method of treating a tumor in a patient comprising administering an inhibitor of expression of a polynucleotide of claim 42, wherein the inhibitor is an antisense molecule or aptamer.
RELATED APPLICATIONS

This application claims priority to U.S. provisional application Ser. No. 60/191,134 filed Mar. 22, 2000, which is incorporated herein by reference in its entirety.

Provisional Applications (1)
Number Date Country
60191134 Mar 2000 US
Divisions (1)
Number Date Country
Parent 09815379 Mar 2001 US
Child 11285818 Nov 2005 US