COMPOSITIONS AND METHODS RELATED TO ACTIVATABLE THERAPEUTIC AGENTS

SEQUENCE LISTING

The instant application contains a Sequence Listing which has been submitted electronically in XML format and is hereby incorporated by reference in its entirety. Said XML file, created on Dec. 19, 2022, is named 737804_SA9-740PCCON_ST26.xml and is 4,005,294 bytes in size.

BACKGROUND

A key challenge in developing prodrug therapeutics is avoiding unwanted immunogenicity and nonspecific activation at biological sites in vivo other than the target site. Various release sites have been optimized in vitro and incorporated into prodrugs for programmed and targeted activation, for example, by protease(s) natively produced at or near diseased tissue(s). Such engineered release segments can form T- or B-cell epitopes that can elicit undesired immunogenicity in patients. Further, there is currently a lack of methods for adequately predicting in vivo responses of patients to prodrugs. In particular, with respect to protease-activated prodrugs, diseased tissues being targeted often contain a multitude of proteases with varying activities and specificities, which is difficult to reconstitute in vitro and complicates any prediction of in vivo prodrug activation. There remains a need for identifying new peptide segments that can be incorporated into a variety of prodrug therapeutic, diagnostic and prophylactic compositions for a more effective and reliable release mechanism. There also remains a need for developing more accurate and robust methods for predicting therapeutic responses and outcomes upon administration of prodrugs or other activatable compositions.

SUMMARY

In certain aspects, the present disclosure provides a method for assessing a likelihood of a subject being responsive to a therapeutic agent that is activatable by a mammalian protease expressed in the subject, the method comprising:

- (a) determining, in a biological sample from the subject, a presence or an amount of
  - (i) a polypeptide comprising at least five, at least six, at least seven, at least eight, at least nine, or at least ten consecutive amino acid residues shown in a sequence set forth in Column V of Table A (or a subset thereof); or
  - (ii) a polypeptide comprising at least five, at least six, at least seven, at least eight, at least nine, or at least ten consecutive amino acids shown in a sequence set forth in Column IV of Table A (or a subset thereof); or
  - (iii) a polypeptide comprising at least five, at least six, at least seven, at least eight, at least nine, or at least ten consecutive amino acids shown in a sequence set forth in Column VI of Table A (or a subset thereof); and
- (b) designating the subject as being likely to respond to the therapeutic agent when the polypeptide of (i), (ii) or (iii) is present and/or if its amount exceeds a threshold.

In some embodiments of the method for assessing the likelihood of the subject being responsive to the therapeutic agent, the therapeutic agent comprises a peptide substrate, which peptide substrate is susceptible to cleavage by the mammalian protease at a scissile bond. In some embodiments, the polypeptide of (i), (ii), or (iii) comprises a portion containing at least four, at least five, at least six, at least seven, at least eight, at least nine, or at least ten consecutive amino acid residues of the peptide substrate that is either N-terminal or C-terminal side of the scissile bond. In some embodiments, the peptide substrate is susceptible to cleavage by the mammalian protease at a scissile bond, and wherein the polypeptide of (i), (ii), or (iii) is a cleavage product of a reporter polypeptide comprising a substrate sequence that is susceptible to cleavage by the same mammalian protease at a scissile bond and where the reporter polypeptide comprises a sequence set forth in Column II or III of Table A (or a subset thereof). In some embodiments, the peptide substrate is susceptible to cleavage by the mammalian protease at a scissile bond, and wherein the polypeptide of (i), (ii), or (iii) is a cleavage product of a human protein that comprises a portion containing at least five or six consecutive amino acid residues of the peptide substrate that includes the scissile bond.

In some embodiments of the method for assessing the likelihood of the subject being responsive to the therapeutic agent, the polypeptide of (i) comprises at least six, at least seven, at least eight, at least nine, or at least ten consecutive amino acid residues shown in a sequence set forth in Column V of Table A (or a subset thereof). In some embodiments, the polypeptide of (ii) comprises at least six, at least seven, at least eight, at least nine, or at least ten consecutive amino acids shown in a sequence set forth in Column IV of Table A (or a subset thereof). In some embodiments, the polypeptide of (iii) comprises at least six, at least seven, at least eight, at least nine, or at least ten consecutive amino acids shown in a sequence set forth in Column VI of Table A (or a subset thereof).

In some embodiments of the method for assessing the likelihood of the subject being responsive to the therapeutic agent, (a) comprises determining the presence or the amount of any two of (i)-(iii). In some embodiments, (a) comprises determining the presence or the amount of all three of (i)-(iii).

In some embodiments of the method for assessing the likelihood of the subject being responsive to the therapeutic agent, the threshold is zero or nominal. In some embodiments, the biological sample comprises a serum or plasma sample. In some embodiments, the biological sample comprises a serum sample. In some embodiments, the biological sample comprises a plasma sample.

In some embodiments of the method for assessing the likelihood of the subject being responsive to the therapeutic agent, the mammalian protease is a serine protease, a cysteine protease, an aspartate protease, a threonine protease, or a metalloproteinase. In some embodiments, the mammalian protease is selected from the group consisting of disintegrin and metalloproteinase domain-containing protein 10 (ADAM10), disintegrin and metalloproteinase domain-containing protein 12 (ADAM12), disintegrin and metalloproteinase domain-containing protein 15 (ADAM15), disintegrin and metalloproteinase domain-containing protein 17 (ADAM17), disintegrin and metalloproteinase domain-containing protein 9 (ADAM9), disintegrin and metalloproteinase with thrombospondin motifs 5 (ADAMTS5), Cathepsin B, Cathepsin D, Cathepsin E, Cathepsin K, cathepsin L, cathepsin S, Fibroblast activation protein alpha, Hepsin, kallikrein-2, kallikrein-4, kallikrein-3, Prostate-specific antigen (PSA), kallikrein-13, Legumain, matrix metallopeptidase 1 (MMP-1), matrix metallopeptidase 10 (MMP-10), matrix metallopeptidase 11 (MMP-11), matrix metallopeptidase 12 (MMP-12), matrix metallopeptidase 13 (MMP-13), matrix metallopeptidase 14 (MMP-14), matrix metallopeptidase 16 (MMP-16), matrix metallopeptidase 2 (MMP-2), matrix metallopeptidase 3 (MMP-3), matrix metallopeptidase 7 (MMP-7), matrix metallopeptidase 8 (MMP-8), matrix metallopeptidase 9 (MMP-9), matrix metallopeptidase 4 (MMP-4), matrix metallopeptidase 5 (MMP-5), matrix metallopeptidase 6 (MMP-6), matrix metallopeptidase 15 (MMP-15), neutrophil elastase, protease activated receptor 2 (PAR2), plasmin, prostasin, PSMA-FOLH1, membrane type serine protease 1 (MT-SP1), matriptase, and u-plasminogen. In some embodiments, the mammalian protease is selected from the group consisting of matrix metallopeptidase 1 (MMP1), matrix metallopeptidase 2 (MMP2), matrix metallopeptidase 7 (MMP1), matrix metallopeptidase 9 (MMP9), matrix metallopeptidase 11 (MMP11), matrix metallopeptidase 14 (MMP14), urokinase-type plasminogen activator (uPA), legumain, and matriptase. In some embodiments, the mammalian protease is preferentially expressed or activated in a target tissue or cell.

In some embodiments of the method for assessing the likelihood of the subject being responsive to the therapeutic agent, the target tissue or cell is a tumor. In some embodiments, the target tissue or cell produces or is co-localized with the mammalian protease.

In some embodiments of the method for assessing the likelihood of the subject being responsive to the therapeutic agent, the target tissue or cell contains therein or thereon, or is associated with in proximity thereto, a reporter polypeptide. In some embodiments, the reporter polypeptide is a polypeptide selected from the group consisting of coagulation factor, complement component, tubulin, immunoglobulin, apolipoprotein, serum amyloid, insulin, growth factor, fibrinogen, PDZ domain protein, LIM domain protein, c-reactive protein, serum albumin, versican, collagen, elastin, keratin, kininogen-1, alpha-2-antiplasmin, clusterin, biglycan, alpha-1-antitrypsin, transthyretin, alpha-1-antichymotrypsin, glucagon, hepcidin, thymosin beta-4, haptoglobin, hemoglobin subunit alpha, caveolae-associated protein 2, alpha-2-HS-glycoprotein, chromogranin-A, vitronectin, hemopexin, epididymis secretory sperm binding protein, secretogranin-2, angiotensinogen, transgelin-2, pancreatic prohormone, neurosecretory protein VGF, ceruloplasmin, PDZ and LIM domain protein 1, multimerin-1, inter-alpha-trypsin inhibitor heavy chain H2, N-acetylmuramoyl-L-alanine amidase, histone H1.4, adhesion G-protein coupled receptor G6, mannan-binding lectin serine protease 2, prothrombin, deleted in malignant brain tumors 1 protein, desmoglein-3, calsyntenin-1, alpha-2-macroglobulin, myosin-9, sodium/potassium-transporting ATPase subunit gamma, oncoprotein-induced transcript 3 protein, serglycin, histidine-rich glycoprotein, inter-alpha-trypsin inhibitor heavy chain H5, integrin alpha-IIb, membrane-associated progesterone receptor component 1, histone H1.2, rho GDP-dissociation inhibitor 2, zinc-alpha-2-glycoprotein, talin-1, secretogranin-1, neutrophil defensin 3, cytochrome P450 2E1, gastric inhibitory polypeptide, transcription initiation factor TFIID subunit 1, integral membrane protein 2B, pigment epithelium-derived factor, voltage-dependent N-type calcium channel subunit alpha-1B, ras GTPase-activating protein nGAP, type I cytoskeletal 17, sulfhydryl oxidase 1, homeobox protein Hox-B2, transcription factor SOX-10, E3 ubiquitin-protein ligase SIAH2, decorin, secreted protein acidic and rich in cysteine (SPARC), laminin gamma 1 chain, vimentin, and nidogen-1 (NID1). In some embodiments, the reporter polypeptide is a polypeptide selected from the group consisting of versican, type II collagen alpha-1 chain, kininogen-1, complement C4-A, complement C4-B, complement C3, alpha-2-antiplasmin, clusterin, biglycan, elastin, fibrinogen alpha chain, alpha-1-antitrypsin, fibrinogen beta chain, type III collagen alpha-1 chain, serum amyloid A-1 protein, transthyretin, apolipoprotein A-I, apolipoprotein A-I Isoform 1, alpha-1-antichymotrypsin, glucagon, hepcidin, serum amyloid A-2 protein, thymosin beta-4, haptoglobin, hemoglobin subunit alpha, caveolae-associated protein 2, alpha-2-HS-glycoprotein, chromogranin-A, vitronectin, hemopexin, epididymis secretory sperm binding protein, zyxin, apolipoprotein secretogranin-2, angiotensinogen, c-reactive protein, serum albumin, transgelin-2, pancreatic prohormone, neurosecretory protein VGF, ceruloplasmin, PDZ and LIM domain protein 1, tubulin alpha-4A chain, multimerin-1, inter-alpha-trypsin inhibitor heavy chain H2, apolipoprotein C-I, fibrinogen gamma chain, N-acetylmuramoyl-L-alanine amidase, immunoglobulin lambda variable 3-21, histone H1.4, adhesion G-protein coupled receptor G6, immunoglobulin lambda variable 3-25, immunoglobulin lambda variable 1-51, immunoglobulin lambda variable 1-36, mannan-binding lectin serine protease 2, immunoglobulin kappa variable 3-20, immunoglobulin kappa variable 2-30, insulin-like growth factor II, apolipoprotein A-II, probable non-functional immunoglobulin kappa variable 2D-24, prothrombin, coagulation factor IX, apolipoprotein L1, deleted in malignant brain tumors 1 protein, desmoglein-3, calsyntenin-1, immunoglobulin lambda constant 3, complement C5, alpha-2-macroglobulin, myosin-9, sodium/potassium-transporting ATPase subunit gamma, immunoglobulin kappa variable 2-28, oncoprotein-induced transcript 3 protein, serglycin, coagulation factor XII, coagulation factor XIII A chain, insulin, histidine-rich glycoprotein, immunoglobulin kappa variable 3-11, immunoglobulin kappa variable 1-39, collagen alpha-1(I) chain, inter-alpha-trypsin inhibitor heavy chain H5, latent-transforming growth factor beta-binding protein 2, integrin alpha-11b, membrane-associated progesterone receptor component 1, immunoglobulin lambda variable 6-57, immunoglobulin kappa variable 3-15, complement C1r subcomponent-like protein, histone H1.2, rho GDP-dissociation inhibitor 2, latent-transforming growth factor beta-binding protein 4, collagen alpha-1(XVIII) chain, immunoglobulin lambda variable 2-18, zinc-alpha-2-glycoprotein, talin-1, secretogranin-1, neutrophil defensin 3, cytochrome P450 2E1, gastric inhibitory polypeptide, immunoglobulin heavy variable 3-15, immunoglobulin lambda variable 2-11, transcription initiation factor TFIID subunit 1, collagen alpha-1(VII) chain, integral membrane protein 2B, pigment epithelium-derived factor, voltage-dependent N-type calcium channel subunit alpha-1B, immunoglobulin lambda variable 3-27, ras GTPase-activating protein nGAP, keratin, type I cytoskeletal 17, tubulin beta chain, sulfhydryl oxidase 1, immunoglobulin kappa variable 4-1, complement C1r subcomponent, homeobox protein Hox-B2, transcription factor SOX-10, E3 ubiquitin-protein ligase SIAH2, decorin, SPARC, type I collagen alpha-1 chain, type IV collagen alpha-1 chain, laminin gamma 1 chain, vimentin, type III collagen, type IV collagen alpha-3 chain, type VII collagen alpha-1 chain, type VI collagen alpha-1 chain, type V collagen alpha-1 chain, nidogen-1, and type VI collagen alpha-3 chain. In some embodiments, the reporter polypeptide comprises a sequence set forth in Columns II-VI of Table A (or a subset thereof). In some embodiments, the reporter polypeptide is selected from the group set forth in Column I of Table A (or a subset thereof).

In some embodiments of the method for assessing the likelihood of the subject being responsive to the therapeutic agent, the target tissue or cell is characterized by an increased amount or activity of the mammalian protease in proximity to the target tissue or cell as compared to a non-target tissue or cell in the subject. In some embodiments, the subject is suffering from, or is suspected of suffering from, a disease or condition characterized by an increased expression or activity of the mammalian protease in proximity to a target tissue or cell as compared to a corresponding non-target tissue or cell in the subject.

In some embodiments of the method for assessing the likelihood of the subject being responsive to the therapeutic agent, the disease or condition is a cancer or an inflammatory or autoimmune disease. In some embodiments, the disease or condition is selected from the group consisting of carcinoma, Hodgkin's lymphoma, and non-Hodgkin's lymphoma, diffuse large B cell lymphoma, follicular lymphoma, mantle cell lymphoma, blastoma, breast cancer, ER/PR+ breast cancer, Her2+ breast cancer, triple-negative breast cancer, colon cancer, colon cancer with malignant ascites, mucinous tumors, prostate cancer, head and neck cancer, skin cancer, melanoma, genito-urinary tract cancer, ovarian cancer, ovarian cancer with malignant ascites, peritoneal carcinomatosis, uterine serous carcinoma, endometrial cancer, cervix cancer, colorectal, uterine cancer, mesothelioma in the peritoneum, kidney cancer, Wilm's tumor, lung cancer, small-cell lung cancer, non-small cell lung cancer, gastric cancer, stomach cancer, small intestine cancer, liver cancer, hepatocarcinoma, hepatoblastoma, liposarcoma, pancreatic cancer, gall bladder cancer, cancers of the bile duct, esophageal cancer, salivary gland carcinoma, thyroid cancer, epithelial cancer, arrhenoblastoma, adenocarcinoma, sarcoma, and B-cell derived chronic lymphatic leukemia. In some embodiments, the disease or condition is selected from the group consisting of ankylosing spondylitis (AS), arthritis (for example, and not limited to, rheumatoid arthritis (RA), juvenile idiopathic arthritis (JIA), osteoarthritis (OA), psoriatic arthritis (PsA), gout, chronic arthritis), chagas disease, chronic obstructive pulmonary disease (COPD), dermatomyositis, type 1 diabetes, endometriosis, Goodpasture syndrome, Graves' disease, Guillain-Barre syndrome (GB S), Hashimoto's disease, suppurative scab, Kawasaki disease, IgA nephropathy, idiopathic thrombocytopenic purpura, inflammatory bowel disease (IBD) (for example, and not limited to, Crohn's disease (CD), clonal disease, ulcerative colitis, collagen colitis, lymphocytic colitis, ischemic colitis, empty colitis, Behcet's syndrome, infectious colitis, indeterminate colitis, interstitial Cystitis), lupus (for example, and not limited to, systemic lupus erythematosus, discoid lupus, subacute cutaneous lupus erythematosus, cutaneous lupus erythematosus (such as chilblain lupus erythematosus), drug-induced lupus, neonatal lupus, lupus nephritis), mixed connective tissue disease, morphea, multiple sclerosis (MS), severe muscle Force disorder, narcolepsy, neuromuscular angina, pemphigus vulgaris, pernicious anemia, psoriasis, psoriatic arthritis, polymyositis, primary biliary cirrhosis, relapsing polychondritis, schizophrenia, scleroderma, Sjogren's syndrome, systemic stiffness syndrome, temporal arteritis (also known as giant cell arteritis), vasculitis, vitiligo, Wegener's granulomatosis, transplant rejection-associated immune reaction(s) (for example, and not limited to, renal transplant rejection, lung transplant rejection, liver transplant rejection), psoriasis, Wiskott-Aldrich syndrome, autoimmune lymphoproliferative syndrome, myasthenia gravis, inflammatory chronic rhinosinusitis, colitis, celiac disease, Barrett's esophagus, inflammatory gastritis, autoimmune nephritis, autoimmune hepatitis, autoimmune carditis, autoimmune encephalitis, autoimmune mediated hematological disease, asthma, atopic dermatitis, atopy, allergy, allergic rhinitis, scleroderma, bronchitis, pericarditis, the inflammatory disease is, Alzheimer's disease, Parkinson's disease, amyotrophic lateral sclerosis, inflammatory lung disease, inflammatory skin disease, atherosclerosis, myocardial infarction, stroke, gram-positive shock, gram-negative shock, sepsis, septic shock, hemorrhagic shock, anaphylactic shock, systemic inflammatory response syndrome.

In some embodiments of the method for assessing the likelihood of the subject being responsive to the therapeutic agent, the therapeutic agent is an anti-cancer agent. In some embodiments, the therapeutic agent is an activatable therapeutic agent. In some embodiments, the therapeutic agent is an activatable therapeutic agent, or non-natural, activatable therapeutic agent as described herein.

In some embodiments of the method for assessing the likelihood of the subject being responsive to the therapeutic agent, the therapeutic agent further comprises a masking moiety (MM). In some embodiments of the method for assessing the likelihood of the subject being responsive to the therapeutic agent, the masking moiety (MM) is capable of being released from the therapeutic agent upon cleavage of the peptide substrate by the mammalian protease. In some embodiments, the masking moiety (MM) interferes with an interaction of the therapeutic agent, in an uncleaved state, to a target tissue or cell. In some embodiments, a bioactivity of the therapeutic agent is capable of being enhanced upon cleavage of the peptide substrate by the mammalian protease. In some embodiments, the masking moiety (MM) is an extended recombinant polypeptide (XTEN). In some embodiments, the XTEN is characterized in that: (i) it comprises at least 100 amino acids; (ii) at least 90% of the amino acid residues of it are selected from glycine (G), alanine (A), serine (S), threonine (T), glutamate (E) and proline (P); and (iii) it comprises at least 4 different types of amino acids selected from G, A, S, T, E, and P.

In some embodiments of the method for assessing the likelihood of the subject being responsive to the therapeutic agent, further comprises transmitting the designation to a healthcare provider and/or the subject.

In some embodiments of the method for assessing the likelihood of the subject being responsive to the therapeutic agent, further comprises, subsequent to (b), contacting the therapeutic agent with the mammalian protease.

In some embodiments of the method for assessing the likelihood of the subject being responsive to the therapeutic agent, further comprises, subsequent to (b), administering to the subject an effective amount of the therapeutic agent based on the designation of step (b).

In some embodiments of the method for assessing the likelihood of the subject being responsive to the therapeutic agent, (a) comprises detecting the polypeptide of (i), (ii) or (iii) in an immuno-assay. In some embodiments, the immuno-assay utilizes an antibody that specifically binds to the polypeptide of (i), (ii) or (iii), or an epitope thereof.

In some embodiments of the method for assessing the likelihood of the subject being responsive to the therapeutic agent, (a) comprises detecting the polypeptide of (i), (ii) or (iii) (or a derivative (including fragment(s)) thereof) by using a mass spectrometer (MS)

In some embodiment of the method is use of a diagnostic reagent for assessing a likelihood of a subject being responsive to a therapeutic agent that is activatable by a mammalian protease expressed in said subject having a disease or disorder.

In certain aspects the diagnostic reagent is used for assessing a likelihood of a subject being responsive to a therapeutic agent that is activatable by a mammalian protease expressed in said subject having a disease or disorder.

In some embodiments is a kit for the practice of a method for assessing a likelihood of a subject being responsive to a therapeutic agent that is activatable by a mammalian protease expressed in said subject having a disease or disorder comprising a reagent for detecting the presence or amount of a proteolytic peptide product produced by action of said mammalian protease.

In certain aspects, the present disclosure provides a method for treating a subject in need of a therapeutic agent that is activatable by a mammalian protease expressed in the subject, the method comprising: administering an effective amount of the therapeutic agent to the subject, wherein the subject has been shown to express in a biological sample from the subject:

- (i) a polypeptide comprising at least five, at least six, at least seven, at least eight, at least nine, or at least ten consecutive amino acid residues shown in a sequence set forth in Column V of Table A (or a subset thereof); or
- (ii) a polypeptide comprising at least five, at least six, at least seven, at least eight, at least nine, or at least ten consecutive amino acids shown in a sequence set forth in Column IV of Table A (or a subset thereof); or
- (iii) a polypeptide comprising at least five, at least six, at least seven, at least eight, at least nine, or at least ten consecutive amino acids shown in a sequence set forth in Column VI of Table A (or a subset thereof); or
- (iv) expression level of polypeptide (i), (ii) or (iii) exceeds a threshold.

In some embodiments for treating the subject with the therapeutic agent, the polypeptide of (i) comprises at least six, at least seven, at least eight, at least nine, or at least ten consecutive amino acid residues shown in a sequence set forth in Column V of Table A (or a subset thereof). In some embodiments, the polypeptide of (ii) comprises at least six, at least seven, at least eight, at least nine, or at least ten consecutive amino acids shown in a sequence set forth in Column IV of Table A (or a subset thereof). In some embodiments, the polypeptide of (iii) comprises at least six, at least seven, at least eight, at least nine, or at least ten consecutive amino acids shown in a sequence set forth in Column VI of Table A (or a subset thereof). In some embodiments, the subject has been shown to express in the biological sample any two of (i)-(iii). In some embodiments, the subject has been shown to express in the biological sample all three of (i)-(iii).

In some embodiments for treating the subject with the therapeutic agent, the therapeutic agent comprises a peptide substrate susceptible to cleavage by the mammalian protease. In some embodiments, the peptide substrate is susceptible to cleavage by the mammalian protease at a scissile bond, and wherein the polypeptide of (i), (ii), or (iii) comprises a portion containing at least four consecutive amino acid residues of the peptide substrate that is either N-terminal or C-terminal of the scissile bond. In some embodiments, a portion of the peptide substrate that is N-terminal of the scissile bond has at most three or two amino acid substitutions or at most one amino acid substitution with respect to a C-terminal end sequence containing from four to ten amino acid residues of a sequence set forth in Column IV or V of Table A (or a subset thereof), wherein none of the amino acid substitution is at a position corresponding to an amino acid residue immediately adjacent to a corresponding scissile bond. In some embodiments, a portion of the peptide substrate that is N-terminal of the scissile bond has at most three or two amino acid substitutions or at most one amino acid substitution with respect to a C-terminal end sequence containing from four to ten amino acid residues of a sequence set forth in Column IV of Table A (or a subset thereof), wherein none of the amino acid substitution is at a position corresponding to an amino acid residue immediately adjacent to a corresponding scissile bond. In some embodiments, a portion of the peptide substrate that is N-terminal of the scissile bond has at most three or two amino acid substitutions or at most one amino acid substitution with respect to a C-terminal end sequence containing from four to ten amino acid residues of a sequence set forth in Column V of Table A (or a subset thereof), wherein none of the amino acid substitution is at a position corresponding to an amino acid residue immediately adjacent to a corresponding scissile bond. In some embodiments, the portion of the peptide substrate that is N-terminal of the scissile bond comprises a C-terminal end sequence containing from four to ten amino acid residues of a sequence set forth in Column IV or V of Table A (or a subset thereof). In some embodiments, the portion of the peptide substrate that is N-terminal of the scissile bond comprises a C-terminal end sequence containing from four to ten amino acid residues of a sequence set forth in Column IV of Table A (or a subset thereof). In some embodiments, the portion of the peptide substrate that is N-terminal of the scissile bond comprises a C-terminal end sequence containing from four to ten amino acid residues of a sequence set forth in Column V of Table A (or a subset thereof). In some embodiments, a portion of the peptide substrate that is C-terminal of the scissile bond has at most three or two amino acid substitutions or at most one amino acid substitution with respect to an N-terminal end sequence containing from four to ten amino acid residues of a sequence set forth in Column V or VI of Table A (or a subset thereof), wherein none of the amino acid substitution is at a position corresponding to an amino acid residue immediately adjacent to a corresponding scissile bond. In some embodiments, a portion of the peptide substrate that is C-terminal of the scissile bond has at most three or two amino acid substitutions or at most one amino acid substitution with respect to an N-terminal end sequence containing from four to ten amino acid residues of a sequence set forth in Column V of Table A (or a subset thereof), wherein none of the amino acid substitution is at a position corresponding to an amino acid residue immediately adjacent to a corresponding scissile bond. In some embodiments, a portion of the peptide substrate that is C-terminal of the scissile bond has at most three or two amino acid substitutions or at most one amino acid substitution with respect to an N-terminal end sequence containing from four to ten amino acid residues of a sequence set forth in Column VI of Table A (or a subset thereof), wherein none of the amino acid substitution is at a position corresponding to an amino acid residue immediately adjacent to a corresponding scissile bond. In some embodiments, the portion of the peptide substrate that is C-terminal of the scissile bond comprises an N-terminal end sequence containing from four to ten amino acid residues of a sequence set forth in Column V or VI of Table A (or a subset thereof). In some embodiments, the portion of the peptide substrate that is C-terminal of the scissile bond comprises an N-terminal end sequence containing from four to ten amino acid residues of a sequence set forth in Column V of Table A (or a subset thereof). In some embodiments, the portion of the peptide substrate that is C-terminal of the scissile bond comprises an N-terminal end sequence containing from four to ten amino acid residues of a sequence set forth in Column VI of Table A (or a subset thereof).

In some embodiments for treating the subject with the therapeutic agent, the threshold is zero or nominal. In some embodiments, the biological sample comprises a serum or plasma sample. In some embodiments, the biological sample comprises a serum sample. In some embodiments, the biological sample comprises a plasma sample.

In some embodiments for treating the subject with the therapeutic agent, the mammalian protease is a serine protease, a cysteine protease, an aspartate protease, a threonine protease, or a metalloproteinase. In some embodiments, the mammalian protease is selected from the group consisting of disintegrin and metalloproteinase domain-containing protein 10 (ADAM10), disintegrin and metalloproteinase domain-containing protein 12 (ADAM12), disintegrin and metalloproteinase domain-containing protein 15 (ADAM15), disintegrin and metalloproteinase domain-containing protein 17 (ADAM17), disintegrin and metalloproteinase domain-containing protein 9 (ADAM9), disintegrin and metalloproteinase with thrombospondin motifs 5 (ADAMTS5), Cathepsin B, Cathepsin D, Cathepsin E, Cathepsin K, cathepsin L, cathepsin S, Fibroblast activation protein alpha, Hepsin, kallikrein-2, kallikrein-4, kallikrein-3, Prostate-specific antigen (PSA), kallikrein-13, Legumain, matrix metallopeptidase 1 (MMP-1), matrix metallopeptidase 10 (MMP-10), matrix metallopeptidase 11 (MMP-11), matrix metallopeptidase 12 (MMP-12), matrix metallopeptidase 13 (MMP-13), matrix metallopeptidase 14 (MMP-14), matrix metallopeptidase 16 (MMP-16), matrix metallopeptidase 2 (MMP-2), matrix metallopeptidase 3 (MMP-3), matrix metallopeptidase 7 (MMP-7), matrix metallopeptidase 8 (MMP-8), matrix metallopeptidase 9 (MMP-9), matrix metallopeptidase 4 (MMP-4), matrix metallopeptidase 5 (MMP-5), matrix metallopeptidase 6 (MMP-6), matrix metallopeptidase 15 (MMP-15), neutrophil elastase, protease activated receptor 2 (PAR2), plasmin, prostasin, PSMA-FOLH1, membrane type serine protease 1 (MT-SP1), matriptase, and u-plasminogen. In some embodiments, the mammalian protease is selected from the group consisting of matrix metallopeptidase 1 (MMP1), matrix metallopeptidase 2 (MMP2), matrix metallopeptidase 7 (MMP1), matrix metallopeptidase 9 (MMP9), matrix metallopeptidase 11 (MMP11), matrix metallopeptidase 14 (MMP14), urokinase-type plasminogen activator (uPA), legumain, and matriptase. In some embodiments, the mammalian protease is preferentially expressed or activated in a target tissue or cell. In some embodiments, the target tissue or cell is a tumor. In some embodiments, the target tissue or cell produces or is co-localized with the mammalian protease.

In some embodiments for treating the subject with the therapeutic agent, the target tissue or cell contains therein or thereon, or is associated with in proximity thereto, a reporter polypeptide. In some embodiments, the reporter polypeptide is a polypeptide selected from the group consisting of coagulation factor, complement component, tubulin, immunoglobulin, apolipoprotein, serum amyloid, insulin, growth factor, fibrinogen, PDZ domain protein, LIM domain protein, c-reactive protein, serum albumin, versican, collagen, elastin, keratin, kininogen-1, alpha-2-antiplasmin, clusterin, biglycan, alpha-1-antitrypsin, transthyretin, alpha-1-antichymotrypsin, glucagon, hepcidin, thymosin beta-4, haptoglobin, hemoglobin subunit alpha, caveolae-associated protein 2, alpha-2-HS-glycoprotein, chromogranin-A, vitronectin, hemopexin, epididymis secretory sperm binding protein, secretogranin-2, angiotensinogen, transgelin-2, pancreatic prohormone, neurosecretory protein VGF, ceruloplasmin, PDZ and LIM domain protein 1, multimerin-1, inter-alpha-trypsin inhibitor heavy chain H2, N-acetylmuramoyl-L-alanine amidase, histone H1.4, adhesion G-protein coupled receptor G6, mannan-binding lectin serine protease 2, prothrombin, deleted in malignant brain tumors 1 protein, desmoglein-3, calsyntenin-1, alpha-2-macroglobulin, myosin-9, sodium/potassium-transporting ATPase subunit gamma, oncoprotein-induced transcript 3 protein, serglycin, histidine-rich glycoprotein, inter-alpha-trypsin inhibitor heavy chain H5, integrin alpha-IIb, membrane-associated progesterone receptor component 1, histone H1.2, rho GDP-dissociation inhibitor 2, zinc-alpha-2-glycoprotein, talin-1, secretogranin-1, neutrophil defensin 3, cytochrome P450 2E1, gastric inhibitory polypeptide, transcription initiation factor TFIID subunit 1, integral membrane protein 2B, pigment epithelium-derived factor, voltage-dependent N-type calcium channel subunit alpha-1B, ras GTPase-activating protein nGAP, type I cytoskeletal 17, sulfhydryl oxidase 1, homeobox protein Hox-B2, transcription factor SOX-10, E3 ubiquitin-protein ligase SIAH2, decorin, secreted protein acidic and rich in cysteine (SPARC), laminin gamma 1 chain, vimentin, and nidogen-1 (NID1). In some embodiments, the reporter polypeptide is a polypeptide selected from the group consisting of versican, type II collagen alpha-1 chain, kininogen-1, complement C4-A, complement C4-B, complement C3, alpha-2-antiplasmin, clusterin, biglycan, elastin, fibrinogen alpha chain, alpha-1-antitrypsin, fibrinogen beta chain, type III collagen alpha-1 chain, serum amyloid A-1 protein, transthyretin, apolipoprotein A-I, apolipoprotein A-I Isoform 1, alpha-1-antichymotrypsin, glucagon, hepcidin, serum amyloid A-2 protein, thymosin beta-4, haptoglobin, hemoglobin subunit alpha, caveolae-associated protein 2, alpha-2-HS-glycoprotein, chromogranin-A, vitronectin, hemopexin, epididymis secretory sperm binding protein, zyxin, apolipoprotein secretogranin-2, angiotensinogen, c-reactive protein, serum albumin, transgelin-2, pancreatic prohormone, neurosecretory protein VGF, ceruloplasmin, PDZ and LIM domain protein 1, tubulin alpha-4A chain, multimerin-1, inter-alpha-trypsin inhibitor heavy chain H2, apolipoprotein C-I, fibrinogen gamma chain, N-acetylmuramoyl-L-alanine amidase, immunoglobulin lambda variable 3-21, histone H1.4, adhesion G-protein coupled receptor G6, immunoglobulin lambda variable 3-25, immunoglobulin lambda variable 1-51, immunoglobulin lambda variable 1-36, mannan-binding lectin serine protease 2, immunoglobulin kappa variable 3-20, immunoglobulin kappa variable 2-30, insulin-like growth factor II, apolipoprotein A-II, probable non-functional immunoglobulin kappa variable 2D-24, prothrombin, coagulation factor IX, apolipoprotein L1, deleted in malignant brain tumors 1 protein, desmoglein-3, calsyntenin-1, immunoglobulin lambda constant 3, complement C5, alpha-2-macroglobulin, myosin-9, sodium/potassium-transporting ATPase subunit gamma, immunoglobulin kappa variable 2-28, oncoprotein-induced transcript 3 protein, serglycin, coagulation factor XII, coagulation factor XIII A chain, insulin, histidine-rich glycoprotein, immunoglobulin kappa variable 3-11, immunoglobulin kappa variable 1-39, collagen alpha-1(I) chain, inter-alpha-trypsin inhibitor heavy chain H5, latent-transforming growth factor beta-binding protein 2, integrin alpha-IIb, membrane-associated progesterone receptor component 1, immunoglobulin lambda variable 6-57, immunoglobulin kappa variable 3-15, complement C1r subcomponent-like protein, histone H1.2, rho GDP-dissociation inhibitor 2, latent-transforming growth factor beta-binding protein 4, collagen alpha-1(XVIII) chain, immunoglobulin lambda variable 2-18, zinc-alpha-2-glycoprotein, talin-1, secretogranin-1, neutrophil defensin 3, cytochrome P450 2E1, gastric inhibitory polypeptide, immunoglobulin heavy variable 3-15, immunoglobulin lambda variable 2-11, transcription initiation factor TFIID subunit 1, collagen alpha-1(VII) chain, integral membrane protein 2B, pigment epithelium-derived factor, voltage-dependent N-type calcium channel subunit alpha-1B, immunoglobulin lambda variable 3-27, ras GTPase-activating protein nGAP, keratin, type I cytoskeletal 17, tubulin beta chain, sulfhydryl oxidase 1, immunoglobulin kappa variable 4-1, complement C1r subcomponent, homeobox protein Hox-B2, transcription factor SOX-10, E3 ubiquitin-protein ligase SIAH2, decorin, SPARC, type I collagen alpha-1 chain, type IV collagen alpha-1 chain, laminin gamma 1 chain, vimentin, type III collagen, type IV collagen alpha-3 chain, type VII collagen alpha-1 chain, type VI collagen alpha-1 chain, type V collagen alpha-1 chain, nidogen-1, and type VI collagen alpha-3 chain. In some embodiments, the reporter polypeptide comprises a sequence set forth in Columns II-VI of Table A (or a subset thereof). In some embodiments, the reporter polypeptide is selected from the group set forth in Column I of Table A (or a subset thereof).

In some embodiments for treating the subject with the therapeutic agent, the target tissue or cell is characterized by an increased amount or activity of the mammalian protease in proximity to the target tissue or cell as compared to a non-target tissue or cell in the subject. In some embodiments, the subject is suffering from, or is suspected of suffering from, a disease or condition characterized by an increased expression or activity of the mammalian protease in proximity to a target tissue or cell as compared to a corresponding non-target tissue or cell in the subject. In some embodiments, the disease or condition is a cancer or an inflammatory or autoimmune disease. In some embodiments, the disease or condition is selected from the group consisting of ankylosing spondylitis (AS), arthritis (for example, and not limited to, rheumatoid arthritis (RA), juvenile idiopathic arthritis (JIA), osteoarthritis (OA), psoriatic arthritis (PsA), gout, chronic arthritis), chagas disease, chronic obstructive pulmonary disease (COPD), dermatomyositis, type 1 diabetes, endometriosis, Goodpasture syndrome, Graves' disease, Guillain-Barre syndrome (GBS), Hashimoto's disease, suppurative scab, Kawasaki disease, IgA nephropathy, idiopathic thrombocytopenic purpura, inflammatory bowel disease (IBD) (for example, and not limited to, Crohn's disease (CD), clonal disease, ulcerative colitis, collagen colitis, lymphocytic colitis, ischemic colitis, empty colitis, Behcet's syndrome, infectious colitis, indeterminate colitis, interstitial Cystitis), lupus (for example, and not limited to, systemic lupus erythematosus, discoid lupus, subacute cutaneous lupus erythematosus, cutaneous lupus erythematosus (such as chilblain lupus erythematosus), drug-induced lupus, neonatal lupus, lupus nephritis), mixed connective tissue disease, morphea, multiple sclerosis (MS), severe muscle Force disorder, narcolepsy, neuromuscular angina, pemphigus vulgaris, pernicious anemia, psoriasis, psoriatic arthritis, polymyositis, primary biliary cirrhosis, relapsing polychondritis, schizophrenia, scleroderma, Sjogren's syndrome, systemic stiffness syndrome, temporal arteritis (also known as giant cell arteritis), vasculitis, vitiligo, Wegener's granulomatosis, transplant rejection-associated immune reaction(s) (for example, and not limited to, renal transplant rejection, lung transplant rejection, liver transplant rejection), psoriasis, Wiskott-Aldrich syndrome, autoimmune lymphoproliferative syndrome, myasthenia gravis, inflammatory chronic rhinosinusitis, colitis, celiac disease, Barrett's esophagus, inflammatory gastritis, autoimmune nephritis, autoimmune hepatitis, autoimmune carditis, autoimmune encephalitis, autoimmune mediated hematological disease, asthma, atopic dermatitis, atopy, allergy, allergic rhinitis, scleroderma, bronchitis, pericarditis, the inflammatory disease is, Alzheimer's disease, Parkinson's disease, amyotrophic lateral sclerosis, inflammatory lung disease, inflammatory skin disease, atherosclerosis, myocardial infarction, stroke, gram-positive shock, gram-negative shock, sepsis, septic shock, hemorrhagic shock, anaphylactic shock, systemic inflammatory response syndrome. In some embodiments, the disease or condition is selected from the group consisting of carcinoma, Hodgkin's lymphoma, and non-Hodgkin's lymphoma, diffuse large B cell lymphoma, follicular lymphoma, mantle cell lymphoma, blastoma, breast cancer, ER/PR+ breast cancer, Her2+ breast cancer, triple-negative breast cancer, colon cancer, colon cancer with malignant ascites, mucinous tumors, prostate cancer, head and neck cancer, skin cancer, melanoma, genito-urinary tract cancer, ovarian cancer, ovarian cancer with malignant ascites, peritoneal carcinomatosis, uterine serous carcinoma, endometrial cancer, cervix cancer, colorectal, uterine cancer, mesothelioma in the peritoneum, kidney cancer, Wilm's tumor, lung cancer, small-cell lung cancer, non-small cell lung cancer, gastric cancer, stomach cancer, small intestine cancer, liver cancer, hepatocarcinoma, hepatoblastoma, liposarcoma, pancreatic cancer, gall bladder cancer, cancers of the bile duct, esophageal cancer, salivary gland carcinoma, thyroid cancer, epithelial cancer, arrhenoblastoma, adenocarcinoma, sarcoma, and B-cell derived chronic lymphatic leukemia. In some embodiments, the therapeutic agent is an anti-cancer agent. In some embodiments, the therapeutic agent is an activatable therapeutic agent. In some embodiments, the therapeutic agent is a non-natural, activatable therapeutic agent as described herein.

In some embodiments for treating the subject with the therapeutic agent, the therapeutic agent comprises a masking moiety (MM). In some embodiments, the masking moiety (MM) is capable of being released from the therapeutic agent upon cleavage of the peptide substrate by the mammalian protease. In some embodiments, the masking moiety (MM) interferes with an interaction of the therapeutic agent, in an uncleaved state, to a target tissue or cell. In some embodiments, a bioactivity of the therapeutic agent is capable of being enhanced upon cleavage of the peptide substrate by the mammalian protease. In some embodiments, the masking moiety (MM) is an extended recombinant polypeptide (XTEN). In some embodiments, the XTEN is characterized in that: (i) it comprises at least 100 amino acids; (ii) at least 90% of the amino acid residues of it are selected from glycine (G), alanine (A), serine (S), threonine (T), glutamate (E) and proline (P); and (iii) it comprises at least 4 different types of amino acids selected from G, A, S, T, E, and P.

In some embodiments for treating the subject with the therapeutic agent, the subject is determined to have a likelihood of a response to the therapeutic agent by a method as described herein.

In certain aspects, the present disclosure provides a method for treating a disease or condition in a subject, comprising administering to the subject in need thereof one or more therapeutically effective doses of a therapeutic agent as described herein, or a pharmaceutical composition as described herein.

In some embodiments for the method for treating the disease or condition in the subject, the subject is selected from the group consisting of mouse, rat, monkey, and human. In some embodiments, the subject is a human. In some embodiments, the subject is determined to have a likelihood of a response to the therapeutic agent or the pharmaceutical composition. In some embodiments, the likelihood of the response is 50% or higher. In some embodiments, the likelihood of the response is determined by a method as described herein.

In some embodiments for the method for treating the disease or condition in the subject, the disease or condition is a cancer or an inflammatory or autoimmune disease. In some embodiments, the disease or condition is selected from the group consisting of ankylosing spondylitis (AS), arthritis (for example, and not limited to, rheumatoid arthritis (RA), juvenile idiopathic arthritis (JIA), osteoarthritis (OA), psoriatic arthritis (PsA), gout, chronic arthritis), chagas disease, chronic obstructive pulmonary disease (COPD), dermatomyositis, type 1 diabetes, endometriosis, Goodpasture syndrome, Graves' disease, Guillain-Barre syndrome (GBS), Hashimoto's disease, suppurative scab, Kawasaki disease, IgA nephropathy, idiopathic thrombocytopenic purpura, inflammatory bowel disease (IBD) (for example, and not limited to, Crohn's disease (CD), clonal disease, ulcerative colitis, collagen colitis, lymphocytic colitis, ischemic colitis, empty colitis, Behcet's syndrome, infectious colitis, indeterminate colitis, interstitial Cystitis), lupus (for example, and not limited to, systemic lupus erythematosus, discoid lupus, subacute cutaneous lupus erythematosus, cutaneous lupus erythematosus (such as chilblain lupus erythematosus), drug-induced lupus, neonatal lupus, lupus nephritis), mixed connective tissue disease, morphea, multiple sclerosis (MS), severe muscle Force disorder, narcolepsy, neuromuscular angina, pemphigus vulgaris, pernicious anemia, psoriasis, psoriatic arthritis, polymyositis, primary biliary cirrhosis, relapsing polychondritis, schizophrenia, scleroderma, Sjogren's syndrome, systemic stiffness syndrome, temporal arteritis (also known as giant cell arteritis), vasculitis, vitiligo, Wegener's granulomatosis, transplant rejection-associated immune reaction(s) (for example, and not limited to, renal transplant rejection, lung transplant rejection, liver transplant rejection), psoriasis, Wiskott-Aldrich syndrome, autoimmune lymphoproliferative syndrome, myasthenia gravis, inflammatory chronic rhinosinusitis, colitis, celiac disease, Barrett's esophagus, inflammatory gastritis, autoimmune nephritis, autoimmune hepatitis, autoimmune carditis, autoimmune encephalitis, autoimmune mediated hematological disease, asthma, atopic dermatitis, atopy, allergy, allergic rhinitis, scleroderma, bronchitis, pericarditis, the inflammatory disease is, Alzheimer's disease, Parkinson's disease, amyotrophic lateral sclerosis, inflammatory lung disease, inflammatory skin disease, atherosclerosis, myocardial infarction, stroke, gram-positive shock, gram-negative shock, sepsis, septic shock, hemorrhagic shock, anaphylactic shock, systemic inflammatory response syndrome. In some embodiments, the disease or condition is selected from the group consisting of carcinoma, Hodgkin's lymphoma, and non-Hodgkin's lymphoma, diffuse large B cell lymphoma, follicular lymphoma, mantle cell lymphoma, blastoma, breast cancer, ER/PR+ breast cancer, Her2+ breast cancer, triple-negative breast cancer, colon cancer, colon cancer with malignant ascites, mucinous tumors, prostate cancer, head and neck cancer, skin cancer, melanoma, genito-urinary tract cancer, ovarian cancer, ovarian cancer with malignant ascites, peritoneal carcinomatosis, uterine serous carcinoma, endometrial cancer, cervix cancer, colorectal, uterine cancer, mesothelioma in the peritoneum, kidney cancer, Wilm's tumor, lung cancer, small-cell lung cancer, non-small cell lung cancer, gastric cancer, stomach cancer, small intestine cancer, liver cancer, hepatocarcinoma, hepatoblastoma, liposarcoma, pancreatic cancer, gall bladder cancer, cancers of the bile duct, esophageal cancer, salivary gland carcinoma, thyroid cancer, epithelial cancer, arrhenoblastoma, adenocarcinoma, sarcoma, and B-cell derived chronic lymphatic leukemia.

In certain aspects, the present disclosure provides use of a therapeutic agent as described herein in the preparation of a medicament for the treatment of a disease or condition in a subject.

In certain aspects, the present disclosure provides use of a pharmaceutical composition as described herein in the preparation of a medicament for the treatment of a disease or condition in a subject.

In some embodiments of the use, the subject is selected from the group consisting of mouse, rat, monkey, and human. In some embodiments, the subject is a human. In some embodiments, the subject is determined to have a likelihood of a response to the therapeutic agent or the pharmaceutical composition. In some embodiments, the likelihood of the response is 50% or higher. In some embodiments, the likelihood of the response is determined by a method as described herein.

In some embodiments of the use, the disease or condition is a cancer or an inflammatory or autoimmune disease. In some embodiments, the disease or condition is selected from the group consisting of carcinoma, Hodgkin's lymphoma, and non-Hodgkin's lymphoma, diffuse large B cell lymphoma, follicular lymphoma, mantle cell lymphoma, blastoma, breast cancer, ER/PR+ breast cancer, Her2+ breast cancer, triple-negative breast cancer, colon cancer, colon cancer with malignant ascites, mucinous tumors, prostate cancer, head and neck cancer, skin cancer, melanoma, genito-urinary tract cancer, ovarian cancer, ovarian cancer with malignant ascites, peritoneal carcinomatosis, uterine serous carcinoma, endometrial cancer, cervix cancer, colorectal, uterine cancer, mesothelioma in the peritoneum, kidney cancer, Wilm's tumor, lung cancer, small-cell lung cancer, non-small cell lung cancer, gastric cancer, stomach cancer, small intestine cancer, liver cancer, hepatocarcinoma, hepatoblastoma, liposarcoma, pancreatic cancer, gall bladder cancer, cancers of the bile duct, esophageal cancer, salivary gland carcinoma, thyroid cancer, epithelial cancer, arrhenoblastoma, adenocarcinoma, sarcoma, and B-cell derived chronic lymphatic leukemia. In some embodiments, the disease or condition is selected from the group consisting of ankylosing spondylitis (AS), arthritis (for example, and not limited to, rheumatoid arthritis (RA), juvenile idiopathic arthritis (JIA), osteoarthritis (OA), psoriatic arthritis (PsA), gout, chronic arthritis), chagas disease, chronic obstructive pulmonary disease (COPD), dermatomyositis, type 1 diabetes, endometriosis, Goodpasture syndrome, Graves' disease, Guillain-Barre syndrome (GB S), Hashimoto's disease, suppurative scab, Kawasaki disease, IgA nephropathy, idiopathic thrombocytopenic purpura, inflammatory bowel disease (IBD) (for example, and not limited to, Crohn's disease (CD), clonal disease, ulcerative colitis, collagen colitis, lymphocytic colitis, ischemic colitis, empty colitis, Behcet's syndrome, infectious colitis, indeterminate colitis, interstitial Cystitis), lupus (for example, and not limited to, systemic lupus erythematosus, discoid lupus, subacute cutaneous lupus erythematosus, cutaneous lupus erythematosus (such as chilblain lupus erythematosus), drug-induced lupus, neonatal lupus, lupus nephritis), mixed connective tissue disease, morphea, multiple sclerosis (MS), severe muscle Force disorder, narcolepsy, neuromuscular angina, pemphigus vulgaris, pernicious anemia, psoriasis, psoriatic arthritis, polymyositis, primary biliary cirrhosis, relapsing polychondritis, schizophrenia, scleroderma, Sjogren's syndrome, systemic stiffness syndrome, temporal arteritis (also known as giant cell arteritis), vasculitis, vitiligo, Wegener's granulomatosis, transplant rejection-associated immune reaction(s) (for example, and not limited to, renal transplant rejection, lung transplant rejection, liver transplant rejection), psoriasis, Wiskott-Aldrich syndrome, autoimmune lymphoproliferative syndrome, myasthenia gravis, inflammatory chronic rhinosinusitis, colitis, celiac disease, Barrett's esophagus, inflammatory gastritis, autoimmune nephritis, autoimmune hepatitis, autoimmune carditis, autoimmune encephalitis, autoimmune mediated hematological disease, asthma, atopic dermatitis, atopy, allergy, allergic rhinitis, scleroderma, bronchitis, pericarditis, the inflammatory disease is, Alzheimer's disease, Parkinson's disease, amyotrophic lateral sclerosis, inflammatory lung disease, inflammatory skin disease, atherosclerosis, myocardial infarction, stroke, gram-positive shock, gram-negative shock, sepsis, septic shock, hemorrhagic shock, anaphylactic shock, systemic inflammatory response syndrome.

In some aspects, the present disclosure provides a therapeutic agent (e.g., activatable therapeutic agent, or non-natural, activatable therapeutic agent) comprising a release segment (RS) linked, directly or indirectly, to a biologically active moiety (BM), wherein the RS comprises a peptide substrate having an amino acid sequence susceptible to cleavage by a mammalian protease at a scissile bond, wherein the peptide substrate comprises an amino acid sequence having at most three amino acid substitutions (or at most two amino acid substitutions, or at most one amino acid substitution) with respect to a sequence set forth in Column II or III of Table A (or a subset thereof).

In some aspects, the present disclosure provides a therapeutic agent (e.g., activatable therapeutic agent, or non-natural, activatable therapeutic agent) comprising a release segment (RS) linked, directly or indirectly, to a biologically active moiety (BM), wherein the RS comprises a peptide substrate having an amino acid sequence susceptible to cleavage by a mammalian protease at a scissile bond, wherein the therapeutic agent is configured for activation at or in proximity to a target tissue or cell in a subject,

wherein the target tissue or cell contains therein or thereon, or is associated with in proximity thereto, a reporter sequence capable of being cleaved by the mammalian protease at a cleavage sequence, and

wherein the peptide substrate comprises an amino acid sequence having at most three amino acid substitutions (or at most two amino acid substitutions, or at most one amino acid substitution) with respect to the cleavage sequence of the reporter polypeptide.

In some embodiments of the therapeutic agent, the reporter polypeptide is a coagulation factor, complement component, tubulin, immunoglobulin, apolipoprotein, serum amyloid, insulin, growth factor, fibrinogen, PDZ domain protein, LIM domain protein, c-reactive protein, serum albumin, versican, collagen, elastin, keratin, kininogen-1, alpha-2-antiplasmin, clusterin, biglycan, alpha-1-antitrypsin, transthyretin, alpha-1-antichymotrypsin, glucagon, hepcidin, thymosin beta-4, haptoglobin, hemoglobin subunit alpha, caveolae-associated protein 2, alpha-2-HS-glycoprotein, chromogranin-A, vitronectin, hemopexin, epididymis secretory sperm binding protein, secretogranin-2, angiotensinogen, transgelin-2, pancreatic prohormone, neurosecretory protein VGF, ceruloplasmin, PDZ and LIM domain protein 1, multimerin-1, inter-alpha-trypsin inhibitor heavy chain H2, N-acetylmuramoyl-L-alanine amidase, histone H1.4, adhesion G-protein coupled receptor G6, mannan-binding lectin serine protease 2, prothrombin, deleted in malignant brain tumors 1 protein, desmoglein-3, calsyntenin-1, alpha-2-macroglobulin, myosin-9, sodium/potassium-transporting ATPase subunit gamma, oncoprotein-induced transcript 3 protein, serglycin, histidine-rich glycoprotein, inter-alpha-trypsin inhibitor heavy chain H5, integrin alpha-IIb, membrane-associated progesterone receptor component 1, histone H1.2, rho GDP-dissociation inhibitor 2, zinc-alpha-2-glycoprotein, talin-1, secretogranin-1, neutrophil defensin 3, cytochrome P450 2E1, gastric inhibitory polypeptide, transcription initiation factor TFIID subunit 1, integral membrane protein 2B, pigment epithelium-derived factor, voltage-dependent N-type calcium channel subunit alpha-1B, ras GTPase-activating protein nGAP, type I cytoskeletal 17, sulfhydryl oxidase 1, homeobox protein Hox-B2, transcription factor SOX-10, E3 ubiquitin-protein ligase SIAH2, decorin, secreted protein acidic and rich in cysteine (SPARC), laminin gamma 1 chain, vimentin, and nidogen-1 (NID1).

In some embodiments of the therapeutic agent, the reporter polypeptide is a polypeptide selected from the group consisting of versican, type II collagen alpha-1 chain, kininogen-1, complement C4-A, complement C4-B, complement C3, alpha-2-antiplasmin, clusterin, biglycan, elastin, fibrinogen alpha chain, alpha-1-antitrypsin, fibrinogen beta chain, type III collagen alpha-1 chain, serum amyloid A-1 protein, transthyretin, apolipoprotein A-I, apolipoprotein A-I Isoform 1, alpha-1-antichymotrypsin, glucagon, hepcidin, serum amyloid A-2 protein, thymosin beta-4, haptoglobin, hemoglobin subunit alpha, caveolae-associated protein 2, alpha-2-HS-glycoprotein, chromogranin-A, vitronectin, hemopexin, epididymis secretory sperm binding protein, zyxin, apolipoprotein secretogranin-2, angiotensinogen, c-reactive protein, serum albumin, transgelin-2, pancreatic prohormone, neurosecretory protein VGF, ceruloplasmin, PDZ and LIM domain protein 1, tubulin alpha-4A chain, multimerin-1, inter-alpha-trypsin inhibitor heavy chain H2, apolipoprotein C-I, fibrinogen gamma chain, N-acetylmuramoyl-L-alanine amidase, immunoglobulin lambda variable 3-21, histone H1.4, adhesion G-protein coupled receptor G6, immunoglobulin lambda variable 3-25, immunoglobulin lambda variable 1-51, immunoglobulin lambda variable 1-36, mannan-binding lectin serine protease 2, immunoglobulin kappa variable 3-20, immunoglobulin kappa variable 2-30, insulin-like growth factor II, apolipoprotein A-II, probable non-functional immunoglobulin kappa variable 2D-24, prothrombin, coagulation factor IX, apolipoprotein L1, deleted in malignant brain tumors 1 protein, desmoglein-3, calsyntenin-1, immunoglobulin lambda constant 3, complement C5, alpha-2-macroglobulin, myosin-9, sodium/potassium-transporting ATPase subunit gamma, immunoglobulin kappa variable 2-28, oncoprotein-induced transcript 3 protein, serglycin, coagulation factor XII, coagulation factor XIII A chain, insulin, histidine-rich glycoprotein, immunoglobulin kappa variable 3-11, immunoglobulin kappa variable 1-39, collagen alpha-1(I) chain, inter-alpha-trypsin inhibitor heavy chain H5, latent-transforming growth factor beta-binding protein 2, integrin alpha-IIb, membrane-associated progesterone receptor component 1, immunoglobulin lambda variable 6-57, immunoglobulin kappa variable 3-15, complement C1r subcomponent-like protein, histone H1.2, rho GDP-dissociation inhibitor 2, latent-transforming growth factor beta-binding protein 4, collagen alpha-1(XVIII) chain, immunoglobulin lambda variable 2-18, zinc-alpha-2-glycoprotein, talin-1, secretogranin-1, neutrophil defensin 3, cytochrome P450 2E1, gastric inhibitory polypeptide, immunoglobulin heavy variable 3-15, immunoglobulin lambda variable 2-11, transcription initiation factor TFIID subunit 1, collagen alpha-1(VII) chain, integral membrane protein 2B, pigment epithelium-derived factor, voltage-dependent N-type calcium channel subunit alpha-1B, immunoglobulin lambda variable 3-27, ras GTPase-activating protein nGAP, keratin, type I cytoskeletal 17, tubulin beta chain, sulfhydryl oxidase 1, immunoglobulin kappa variable 4-1, complement C1r subcomponent, homeobox protein Hox-B2, transcription factor SOX-10, E3 ubiquitin-protein ligase SIAH2, decorin, SPARC, type I collagen alpha-1 chain, type IV collagen alpha-1 chain, laminin gamma 1 chain, vimentin, type III collagen, type IV collagen alpha-3 chain, type VII collagen alpha-1 chain, type VI collagen alpha-1 chain, type V collagen alpha-1 chain, nidogen-1, and type VI collagen alpha-3 chain.

In some embodiments of the therapeutic agent, the cleavage sequence of the reporter polypeptide is a cleavage sequence set forth in Column II or III of Table A (or a subset thereof). In some embodiments, the cleavage sequence does not comprise a methionine residue immediately N-terminal to a scissile bond (contained therein), when the methionine is the first residue at N terminus of the reporter polypeptide. In some embodiments, the target tissue or cell is characterized by an increased amount or activity of the mammalian protease in proximity to the target tissue or cell as compared to a non-target tissue or cell in the subject. In some embodiments, the mammalian proatease is produced at the target tissue or cell. In some embodiments, the peptide substrate comprises an amino acid sequence having at most three amino acid substitutions, or at most two amino acid substitutions, or at most one amino acid substitution with respect to a sequence set forth in Column II or III of Table A (or a subset thereof). In some embodiments, the peptide substrate comprises an amino acid sequence having at most three amino acid substitutions with respect to a sequence set forth in Column II or III of Table A (or a subset thereof). In some embodiments, the scissile bond is not immediately C-terminal to a methionine residue.

In some embodiments of the therapeutic agent, the peptide substrate contains from six to twenty-five or six to twenty amino acid residues. In some embodiments of the therapeutic agent, the peptide substrate contains from six to twenty-five amino acid residues. In some embodiments of the therapeutic agent, the peptide substrate contains from six to twenty amino acid residues. In some embodiments, the peptide substrate contains from seven to twelve amino acid residues. In some embodiments, the peptide substrate comprises an amino acid sequence having at most two amino acid substitutions with respect to a sequence set forth in Column II or III of Table A (or a subset thereof). In some embodiments, the peptide substrate comprises an amino acid sequence having at most one amino acid substitution with respect to a sequence set forth in Column II or III of Table A (or a subset thereof). In some embodiments, none of the at most three amino acid substitutions, or the at most two amino acid substitutions, or the at most one amino acid substitution is at a position corresponding to an amino acid residue immediately adjacent to a corresponding scissile bond of the corresponding sequence shown in Column II or III of Table A (or a subset thereof). In some embodiments, the peptide substrate comprises an amino acid sequence identical to a sequence set forth in Column II or III of Table A (or a subset thereof). In some embodiments, the peptide substrate does not comprise a methionine residue immediately N-terminal to a scissile bond (contained therein). In some embodiments, the peptide substrate does not comprise an amino acid sequence selected from the group consisting of #279, #280, #282, #283, #298, #299, #302, #303, #305, #307, #308, #349, #396, #397, #416, #417, #418, #458, #459, #460, #466, #481 and #482 (or any combination thereof) of Column II of Table A. In some embodiments, the peptide substrate comprises two or three sequences set forth in Column II or III of Table A (or a subset thereof). In some embodiments, where the peptide substrate comprises two sequences set forth in Column II or III of Table A (or a subset thereof), the two sequences partially overlap one another. In some embodiments, where the peptide substrate comprises two sequences set forth in Column II or III of Table A (or a subset thereof), the two sequences do not overlap one another. In some embodiments, where the peptide substrate comprises three sequences set forth in Column II or III of Table A (or a subset thereof), two or all of the three sequences do not overlap one another. In some embodiments, where the peptide substrate comprises three sequences set forth in Column II or III of Table A (or a subset thereof), one of the three sequences partially overlaps with another sequence or both other sequences of the three sequences. In some embodiments, where the peptide substrate comprises three sequences set forth in Column II or III of Table A (or a subset thereof), two of the three sequences partially overlap with one another. In some embodiments, where the peptide substrate comprises three sequences set forth in Column II or III of Table A (or a subset thereof), each two of the three sequences partially overlap with one another. In some embodiments, where the peptide substrate comprises three sequences set forth in Column II or III of Table A (or a subset thereof), all of the three sequences partially overlap with one another. In some embodiments, the peptide substrate susceptible to cleavage by the mammalian protease is susceptible to cleavage by a plurality of mammalian proteases comprising the mammalian protease. In some embodiments, the peptide substrate susceptible to cleavage by the plurality of mammalian proteases has at most three amino acid substitutions, or at most two amino acid substitutions, or at most one amino acid substitution with respect to a sequence set forth in Table 1(j). In some embodiments, the peptide substrate susceptible to cleavage by the plurality of mammalian proteases has at most three amino acid substitutions with respect to a sequence set forth in Table 1(j). In some embodiments, the peptide substrate susceptible to cleavage by the plurality of mammalian proteases has at most two amino acid substitutions with respect to a sequence set forth in Table 1(j). In some embodiments, the peptide substrate susceptible to cleavage by the plurality of mammalian proteases has at most one amino acid substitution with respect to a sequence set forth in Table 1(j). In some embodiments, none of the at most three amino acid substitutions, or the at most two amino acid substitutions, or the at most one amino acid substitution is at a position corresponding to an amino acid residue immediately adjacent to a corresponding scissile bond of the corresponding sequence set forth in Table 1(j). In some embodiments, the peptide substrate susceptible to cleavage by the plurality of mammalian proteases comprises a sequence set forth in Table 1(j).

In some embodiments of the therapeutic agent, the release segment (RS) is capable of being cleaved when in proximity to a target tissue or cell, and wherein the target tissue or cell produces the mammalian protease for which the RS is a peptide substrate. In some embodiments, the mammalian protease for cleavage of the release segment (RS) is a serine protease, a cysteine protease, an aspartate protease, a threonine protease, or a metalloproteinase. In some embodiments, the mammalian protease for cleavage of the release segment (RS) is selected from the group consisting of disintegrin and metalloproteinase domain-containing protein 10 (ADAM10), disintegrin and metalloproteinase domain-containing protein 12 (ADAM12), disintegrin and metalloproteinase domain-containing protein 15 (ADAM15), disintegrin and metalloproteinase domain-containing protein 17 (ADAM17), disintegrin and metalloproteinase domain-containing protein 9 (ADAM9), disintegrin and metalloproteinase with thrombospondin motifs 5 (ADAMTS5), Cathepsin B, Cathepsin D, Cathepsin E, Cathepsin K, cathepsin L, cathepsin S, Fibroblast activation protein alpha, Hepsin, kallikrein-2, kallikrein-4, kallikrein-3, Prostate-specific antigen (PSA), kallikrein-13, Legumain, matrix metallopeptidase 1 (MMP-1), matrix metallopeptidase 10 (MMP-10), matrix metallopeptidase 11 (MMP-11), matrix metallopeptidase 12 (MMP-12), matrix metallopeptidase 13 (MMP-13), matrix metallopeptidase 14 (MMP-14), matrix metallopeptidase 16 (MMP-16), matrix metallopeptidase 2 (MMP-2), matrix metallopeptidase 3 (MMP-3), matrix metallopeptidase 7 (MMP-7), matrix metallopeptidase 8 (MMP-8), matrix metallopeptidase 9 (MMP-9), matrix metallopeptidase 4 (MMP-4), matrix metallopeptidase 5 (MMP-5), matrix metallopeptidase 6 (MMP-6), matrix metallopeptidase 15 (MMP-15), neutrophil elastase, protease activated receptor 2 (PAR2), plasmin, prostasin, PSMA-FOLH1, membrane type serine protease 1 (MT-SP1), matriptase, and u-plasminogen. In some embodiments, the mammalian protease for cleavage of the release segment (RS) is selected from the group consisting of matrix metallopeptidase 1 (MMP1), matrix metallopeptidase 2 (MMP2), matrix metallopeptidase 7 (MMP1), matrix metallopeptidase 9 (MMP9), matrix metallopeptidase 11 (MMP11), matrix metallopeptidase 14 (MMP14), urokinase-type plasminogen activator (uPA), legumain, and matriptase.

In some embodiments of the therapeutic agent, the therapeutic agent further comprises a masking moiety (MM) linked, directly or indirectly, to the release segment (RS). In some embodiments, the therapeutic agent, in an uncleaved state, has a structural arrangement from N-terminus to C-terminus of BM-RS-MM or MM-RS-BM. In some embodiments of the therapeutic agent, upon cleavage of the release segment (RS), the masking moiety (MM) is released from the therapeutic agent. In some embodiments, the masking moiety (MM) comprises an extended recombinant polypeptide (XTEN). In some embodiments, the XTEN is characterized in that: (i) it comprises at least 100 amino acids; (ii) at least 90% of the amino acid residues of it are selected from glycine (G), alanine (A), serine (S), threonine (T), glutamate (E) and proline (P); and (iii) it comprises at least 4 different types of amino acids selected from G, A, S, T, E, and P. In some embodiments, the extended recombinant polypeptide (XTEN) comprises an amino acid sequence having at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% sequence identity to a sequence set forth in Tables 2b-2c. In some embodiments, the masking moiety (MM), when linked to the therapeutic agent, interferes with an interaction of the biologically active moiety (BM) to the target tissue or cell such that a dissociation constant (K_d) of the BM of the therapeutic agent with a target cell marker borne by the target tissue or cell is greater, when the therapeutic agent is in an uncleaved state, compared to a dissociation constant (Kd) of a corresponding biologically active moiety with the target cell marker. In some embodiments, the therapeutic agent effects a broader therapeutic window in delivery of the BM to the target tissue or cell compared to a corresponding biologically active moiety. In some embodiments, the therapeutic agent has a longer terminal half-life compared to that of a corresponding biologically active moiety. In some embodiments, the therapeutic agent is less immunogenic compared to a corresponding biologically active moiety. In some embodiments, the immunogenicity is ascertained by measuring production of IgG antibodies that selectively bind to the biologically active moiety after administration of comparable doses to a subject. In some embodiments, the therapeutic agent has a greater apparent molecular weight factor under a physiological condition compared to a corresponding biologically active moiety.

In some embodiments of the therapeutic agent, the release segment (RS) is a first release segment (RS1), wherein the scissile bond is a first scissile bond, and wherein the therapeutic agent further comprises a second release segment (RS2) linked, directly or indirectly, to the biologically active moiety (BM), wherein the RS2 comprises a second peptide substrate or cleavage by a mammalian protease at a second scissile bond. In some embodiments, the mammalian protease for cleavage of the RS2 is identical to the mammalian protease for cleavage of the RS1. In some embodiments, the mammalian protease for cleavage of the RS2 is different from the mammalian protease for cleavage of the RS1. In some embodiments, the RS2 has an amino acid sequence identical to that of the RS1. In some embodiments, the RS2 has an amino acid sequence different from that of the RS1. In some embodiments, each of the RS1 and the RS2 comprises a peptide substrate for a different mammalian protease selected from the group consisting of disintegrin and metalloproteinase domain-containing protein 10 (ADAM10), disintegrin and metalloproteinase domain-containing protein 12 (ADAM12), disintegrin and metalloproteinase domain-containing protein 15 (ADAM15), disintegrin and metalloproteinase domain-containing protein 17 (ADAM17), disintegrin and metalloproteinase domain-containing protein 9 (ADAM9), disintegrin and metalloproteinase with thrombospondin motifs 5 (ADAMTS5), Cathepsin B, Cathepsin D, Cathepsin E, Cathepsin K, cathepsin L, cathepsin S, Fibroblast activation protein alpha, Hepsin, kallikrein-2, kallikrein-4, kallikrein-3, Prostate-specific antigen (PSA), kallikrein-13, Legumain, matrix metallopeptidase 1 (MMP-1), matrix metallopeptidase 10 (MMP-10), matrix metallopeptidase 11 (MMP-11), matrix metallopeptidase 12 (MMP-12), matrix metallopeptidase 13 (MMP-13), matrix metallopeptidase 14 (MMP-14), matrix metallopeptidase 16 (MMP-16), matrix metallopeptidase 2 (MMP-2), matrix metallopeptidase 3 (MMP-3), matrix metallopeptidase 7 (MMP-7), matrix metallopeptidase 8 (MMP-8), matrix metallopeptidase 9 (MMP-9), matrix metallopeptidase 4 (MMP-4), matrix metallopeptidase 5 (MMP-5), matrix metallopeptidase 6 (MMP-6), matrix metallopeptidase 15 (MMP-15), neutrophil elastase, protease activated receptor 2 (PAR2), plasmin, prostasin, PSMA-FOLH1, membrane type serine protease 1 (MT-SP1), matriptase, and u-plasminogen. In some embodiments, each of the RS1 and the RS2 comprises a peptide substrate for a different mammalian protease selected from the group consisting of matrix metallopeptidase 1 (MMP1), matrix metallopeptidase 2 (MMP2), matrix metallopeptidase 7 (MMP1), matrix metallopeptidase 9 (MMP9), matrix metallopeptidase 11 (MMP11), matrix metallopeptidase 14 (MMP14), urokinase-type plasminogen activator (uPA), legumain, and matriptase. In some embodiments, the second scissile bond is not immediately C-terminal to a methionine residue.

In some embodiments of the therapeutic agent, the second peptide substrate contains from six to twenty-five or six to twenty amino acid residues. In some embodiments of the therapeutic agent, the second peptide substrate contains from six to twenty-five amino acid residues. In some embodiments of the therapeutic agent, the second peptide substrate contains from six to twenty amino acid residues. In some embodiments, the second peptide substrate contains from seven to twelve amino acid residues. In some embodiments, the second peptide substrate comprises an amino acid sequence having at most three amino acid substitutions, or at most two amino acid substitutions, or at most one amino acid substitution with respect to a sequence set forth in Column II or III of Table A (or a subset thereof). In some embodiments, the second peptide substrate comprises an amino acid sequence having at most three amino acid substitutions with respect to a sequence set forth in Column II or III of Table A (or a subset thereof). In some embodiments, the second peptide substrate comprises an amino acid sequence having at most two amino acid substitutions with respect to a sequence set forth in Column II or III of Table A (or a subset thereof). In some embodiments, the second peptide substrate comprises an amino acid sequence having at most one amino acid substitution with respect to a sequence set forth in Column II or III of Table A (or a subset thereof). In some embodiments, none of the at most three amino acid substitutions, or the at most two amino acid substitutions, or the at most one amino acid substitution (of the second peptide substrate) is at a position corresponding to an amino acid residue immediately adjacent to a corresponding scissile bond of the corresponding sequence shown in Column II or III of Table A (or a subset thereof). In some embodiments, the second peptide substrate comprises an amino acid sequence identical to a sequence set forth in Column II or III of Table A (or a subset thereof). In some embodiments, the second peptide substrate does not comprise a methionine residue immediately N-terminal to a scissile bond (contained therein). In some embodiments, the second peptide substrate does not comprise an amino acid sequence selected from the group consisting of #279, #280, #282, #283, #298, #299, #302, #303, #305, #307, #308, #349, #396, #397, #416, #417, #418, #458, #459, #460, #466, #481 and #482 (or any combination thereof) of Column II of Table A. In some embodiments, the second peptide substrate comprises two or three sequences set forth in Column II or III of Table A (or a subset thereof). In some embodiments, where the second peptide substrate comprises two sequences set forth in Column II or III of Table A (or a subset thereof), the two sequences (of the second peptide substrate) partially overlap one another. In some embodiments, where the second peptide substrate comprises two sequences set forth in Column II or III of Table A (or a subset thereof), the two sequences (of the second peptide substrate) do not overlap one another. In some embodiments, where the second peptide substrate comprises three sequences set forth in Column II or III of Table A (or a subset thereof), two or all of the three sequences (of the second peptide substrate) do not overlap one another. In some embodiments, where the second peptide substrate comprises three sequences set forth in Column II or III of Table A (or a subset thereof), one of the three sequences (of the second peptide substrate) partially overlaps with another sequence or both other sequences of the three sequences (of the second peptide substrate). In some embodiments, where the second peptide substrate comprises three sequences set forth in Column II or III of Table A (or a subset thereof), two of the three sequences (of the second peptide substrate) partially overlap with one another. In some embodiments, where the second peptide substrate comprises three sequences set forth in Column II or III of Table A (or a subset thereof), each two of the three sequences (of the second peptide substrate) partially overlap with one another. In some embodiments, where the second peptide substrate comprises three sequences set forth in Column II or III of Table A (or a subset thereof), all of the three sequences (of the second peptide substrate) partially overlap with one another. In some embodiments, the second peptide substrate susceptible to cleavage by the mammalian protease is susceptible to cleavage by a plurality of mammalian proteases comprising the mammalian protease. In some embodiments, the second peptide substrate susceptible to cleavage by the plurality of mammalian proteases has at most three amino acid substitutions, or at most two amino acid substitutions, or at most one amino acid substitution with respect to a sequence set forth in Table 1(j). In some embodiments, the second peptide substrate susceptible to cleavage by the plurality of mammalian proteases has at most three amino acid substitutions with respect to a sequence set forth in Table 1(j). In some embodiments, the second peptide substrate susceptible to cleavage by the plurality of mammalian proteases has at most two amino acid substitutions with respect to a sequence set forth in Table 1(j). In some embodiments, the second peptide substrate susceptible to cleavage by the plurality of mammalian proteases has at most one amino acid substitution with respect to a sequence set forth in Table 1(j). In some embodiments, none of the at most three amino acid substitutions, or the at most two amino acid substitutions, or the at most one amino acid substitution (of the second peptide substrate) is at a position corresponding to an amino acid residue immediately adjacent to a corresponding scissile bond of the corresponding sequence set forth in Table 1(j). In some embodiments, the second peptide substrate susceptible to cleavage by the plurality of mammalian proteases comprises a sequence set forth in Table 1(j).

In some embodiments of the therapeutic agent, the second release segment (RS2) is capable of being cleaved when in proximity to the target tissue or cell, and wherein the target tissue or cell produces the mammalian protease for which the RS2 is a peptide substrate. This includes tumor produced proteases and tumor melieu produced proteases. In some embodiments, the mammalian protease for cleavage of the second release segment (RS2) is a serine protease, a cysteine protease, an aspartate protease, a threonine protease or a metalloproteinase. In some embodiments, the mammalian protease for cleavage of the release segment (RS) is selected from the group consisting of disintegrin and metalloproteinase domain-containing protein 10 (ADAM10), disintegrin and metalloproteinase domain-containing protein 12 (ADAM12), disintegrin and metalloproteinase domain-containing protein 15 (ADAM15), disintegrin and metalloproteinase domain-containing protein 17 (ADAM17), disintegrin and metalloproteinase domain-containing protein 9 (ADAM9), disintegrin and metalloproteinase with thrombospondin motifs 5 (ADAMTS5), Cathepsin B, Cathepsin D, Cathepsin E, Cathepsin K, cathepsin L, cathepsin S, Fibroblast activation protein alpha, Hepsin, kallikrein-2, kallikrein-4, kallikrein-3, Prostate-specific antigen (PSA), kallikrein-13, Legumain, matrix metallopeptidase 1 (MMP-1), matrix metallopeptidase 10 (MMP-10), matrix metallopeptidase 11 (MMP-11), matrix metallopeptidase 12 (MMP-12), matrix metallopeptidase 13 (MMP-13), matrix metallopeptidase 14 (MMP-14), matrix metallopeptidase 16 (MMP-16), matrix metallopeptidase 2 (MMP-2), matrix metallopeptidase 3 (MMP-3), matrix metallopeptidase 7 (MMP-7), matrix metallopeptidase 8 (MMP-8), matrix metallopeptidase 9 (MMP-9), matrix metallopeptidase 4 (MMP-4), matrix metallopeptidase 5 (MMP-5), matrix metallopeptidase 6 (MMP-6), matrix metallopeptidase 15 (MMP-15), neutrophil elastase, protease activated receptor 2 (PAR2), plasmin, prostasin, PSMA-FOLH1, membrane type serine protease 1 (MT-SP1), matriptase, and u-plasminogen. In some embodiments, the mammalian protease for cleavage of the second release segment (RS2) is selected from the group consisting of matrix metallopeptidase 1 (MMP1), matrix metallopeptidase 2 (MMP2), matrix metallopeptidase 7 (MMP1), matrix metallopeptidase 9 (MMP9), matrix metallopeptidase 11 (MMP11), matrix metallopeptidase 14 (MMP14), urokinase-type plasminogen activator (uPA), legumain, and matriptase.

In some embodiments of the therapeutic agent, the masking moiety (MM) is a first masking moiety (MM1), and wherein the therapeutic agent further comprises a second masking moiety (MM2) linked, directly or indirectly, to the second release segment (RS2). In some embodiments, the therapeutic agent, in an uncleaved state, has a structural arrangement from N-terminus to C-terminus of MM1-RS1-BM-RS2-MM2, MM1-RS2-BM-RS1-MM2, MM2-RS1-BM-RS2-MM1, or MM2-RS2-BM-RS1-MM1. In some embodiments of the therapeutic agent, upon cleavage of the second release segment (RS2), the second masking moiety (MM2) is released from the therapeutic agent. In some embodiments, the second masking moiety (MM2) comprises a second extended recombinant polypeptide (XTEN2). In some embodiments, the XTEN2 is characterized in that: (i) it comprises at least 100 amino acids; (ii) at least 90% of the amino acid residues of it are selected from glycine (G), alanine (A), serine (S), threonine (T), glutamate (E) and proline (P); and (iii) it comprises at least 4 different types of amino acids selected from G, A, S, T, E, and P. In some embodiments, the XTEN2 comprises an amino acid sequence having at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% sequence identity to a sequence selected from the group of sequences set forth in Tables 2b-2c. In some embodiments, the first masking moiety (MM1) and the second masking moiety (MM2), when both linked in the therapeutic agent, interfere with an interaction of the biologically active moiety (BM) to the target tissue or cell such that a dissociation constant (K_d) of the BM of the therapeutic agent with a target cell marker borne by the target tissue or cell is greater, when the therapeutic agent is in an uncleaved state, compared to a dissociation constant (Kd) of a corresponding biologically active moiety. In some embodiments, the therapeutic agent, in which the biologically active moiety (BM) is linked, directly or indirectly, to one or both of the first masking moiety (MM1) and the second masking moiety (MM2), effects a broader therapeutic window in delivery of the BM to the target tissue or cell compared to a corresponding biologically active moiety. In some embodiments, the therapeutic agent, in which the biologically active moiety (BM) is linked, directly or indirectly, to one or both of the first masking moiety (MM1) and the second masking moiety (MM2), has a longer terminal half-life compared to that of a corresponding biologically active moiety. In some embodiments, the therapeutic agent, in which the biologically active moiety (BM) is linked, directly or indirectly, to one or both of the first masking moiety (MM1) and the second masking moiety (MM2), is less immunogenic compared to a corresponding biologically active moiety. In some embodiments of the therapeutic agent, immunogenicity is ascertained by measuring production of IgG antibodies that selectively bind to the biologically active moiety after administration of comparable doses to a subject. In some embodiments, the therapeutic agent, in which the biologically active moiety (BM) is linked, directly or indirectly, to one or both of the first masking moiety (MM1) and the second masking moiety (MM2), has a greater apparent molecular weight factor under a physiological condition compared to a corresponding biologically active moiety. In some embodiments, the therapeutic agent comprises a fusion polypeptide or conjugate.

In some embodiments of the therapeutic agent, the biologically active moiety (BM) comprises a biologically active peptide (BP). In some embodiments, the BP comprises an antibody, a cytokine, a cell receptor, or a fragment thereof.

In some embodiments, the therapeutic agent comprises a recombinant polypeptide. In some embodiments, the recombinant polypeptide comprises the biologically active peptide (BP) and the release segment (RS). In some embodiments, the recombinant polypeptide comprises the biologically active peptide (BP), the release segment (RS), and the masking moiety (MM). In some embodiments, the recombinant polypeptide, in an uncleaved state, has a structural arrangement from N-terminus to C-terminus of BP-RS-MM or MM-RS-BP. In some embodiments, the recombinant polypeptide comprises the biologically active peptide (BP), the first release segment (RS1), and the second release segment (RS2). In some embodiments, the recombinant polypeptide comprises the biologically active peptide (BP), the first release segment (RS1), the second release segment (RS2), the first masking moiety (MM1), and the second masking moiety (MM2). In some embodiments, the recombinant polypeptide, in an uncleaved state, has a structural arrangement from N-terminus to C-terminus of MM1-RS1-BP-RS2-MM2, MM1-RS2-BP-RS1-MM2, MM2-RS1-BP-RS2-MM1, or MM2-RS2-BP-RS1-MM1. In some embodiments, the recombinant polypeptide comprises the biologically active peptide (BP), the first release segment (RS1), the second release segment (RS2), the first extended recombinant polypeptide (XTEN1), and the second extended recombinant polypeptide (XTEN2). In some embodiments, the recombinant polypeptide, in an uncleaved state, has a structural arrangement from N-terminus to C-terminus of XTEN1-RS1-BP-RS2-XTEN2, XTEN1-RS2-BP-RS1-XTEN2, XTEN2-RS1-BP-RS2-XTEN1, or XTEN2-RS2-BP-RS1-XTEN1.

In some embodiments of the therapeutic agent, the biologically active polypeptide (BP) comprises a binding moiety having a binding affinity for a target cell marker on the target tissue or cell. In some embodiments, the target cell marker is an effector cell antigen expressed on a surface of an effector cell. In some embodiments, the binding moiety is an antibody. In some embodiments, the binding moiety is an antibody selected from the group consisting of Fv, Fab, Fab′, Fab′-SH, nanobody (also known as single domain antibody or V_HH), linear antibody, and single-chain variable fragment (scFv). In some embodiments, the binding moiety is a first binding moiety, wherein the target cell marker is a first target cell marker, and wherein the biologically active polypeptide (BP) further comprises a second binding moiety linked, directly or indirectly to the first binding moiety, wherein the second binding moiety has a binding affinity for a second target cell marker on the target tissue or cell. In some embodiments, the second target cell marker is a marker on a tumor cell or a cancer cell. In some embodiments, the second binding moiety is an antibody. In some embodiments, the second binding moiety is an antibody selected from the group consisting of Fv, Fab, Fab′, Fab′-SH, nanobody (also known as single domain antibody or V_HH), linear antibody, and single-chain variable fragment (scFv).

Certain aspects of the present disclosure provide an isolated nucleic acid, the isolated nucleic acid comprising: (a) a polynucleotide encoding a recombinant polypeptide as described herein; or (b) a reverse complement of the polynucleotide of (a).

Certain aspects of the present disclosure provide an expression vector, the expression vector comprising a polynucleotide sequence as described herein and a recombinant regulatory sequence operably linked to the polynucleotide sequence.

Certain aspects of the present disclosure provide an isolated host cell, the isolated cell comprising the expression vector as described herein. In some embodiments, the host cell is a prokaryote. In some embodiments, the host cell is E. coli or a mammalian cell. In some embodiments, the host cell is E. coli. In some embodiments, the host cell is a mammalian cell.

Some aspects of the present disclosure provide a pharmaceutical composition, the pharmaceutical composition comprising a therapeutic agent as described herein and one or more pharmaceutically suitable excipients. In some embodiments, the pharmaceutical composition is formulated for oral, intradermal, subcutaneous, intravenous, intra-arterial, intraabdominal, intraperitoneal, intrathecal, or intramuscular administration. In some embodiments, the pharmaceutical composition is in a liquid form or frozen form. In some embodiments, the pharmaceutical composition is in a pre-filled syringe for a single injection. In some embodiments, the pharmaceutical composition is formulated as a lyophilized powder to be reconstituted prior to administration.

Some aspects of the present disclosure provide a kit, the kit comprising a pharmaceutical composition as described herein, a container, and a label or package insert on or associated with the container.

In certain aspects, the present disclosure provides a method for preparing a therapeutic agent (e.g., activatable therapeutic agent, or non-natural, activatable therapeutic agent) as provided herein.

In certain aspects, the present disclosure provides a method for preparing a therapeutic agent (e.g., activatable therapeutic agent, or non-natural, activatable therapeutic agent), the method comprising:

- (a) culturing a host cell comprising a nucleic acid construct that encodes a recombinant polypeptide under conditions sufficient to express the recombinant polypeptide in the host cell, wherein the recombinant polypeptide comprises a biologically active polypeptide (BP), a release segment (RS), and a masking moiety (MM), wherein:
  - the RS comprises a peptide substrate susceptible for cleavage by a mammalian protease at a scissile bond, wherein the peptide substrate comprises an amino acid sequence having at most three or two amino acid substitutions (or at most one amino acid substitution) with respect to a sequence set forth in Column II or III of Table A (or a subset thereof); and
  - the recombinant polypeptide has a structural arrangement from N-terminus to C-terminus of BP-RS-MM or MM-RS-BP; and
- (b) recovering the therapeutic agent (e.g., activatable therapeutic agent, or non-natural, activatable therapeutic agent) comprising the recombinant polypeptide.

In some embodiments of the method for preparing the therapeutic agent, the peptide substrate susceptible to cleavage by the mammalian protease is susceptible to cleavage by a plurality of mammalian proteases comprising the mammalian protease. In some embodiments, the peptide substrate susceptible to cleavage by the plurality of mammalian proteases has at most three amino acid substitutions, or at most two amino acid substitutions, or at most one amino acid substitution with respect to a sequence set forth in Table 1(j). In some embodiments, the peptide substrate susceptible to cleavage by the plurality of mammalian proteases comprises a sequence set forth in Table 1(j). In some embodiments, the peptide substrate does not comprise SEQ ID NO: 1. In some embodiments, the peptide substrate does not comprise SEQ ID NO: 2. In some embodiments, the peptide substrate does not comprise SEQ ID NO: 3. In some embodiments, the peptide substrate does not comprise SEQ ID NO: 4. In some embodiments, the peptide substrate does not comprise SEQ ID NO: 5. In some embodiments, the peptide substrate does not comprise SEQ ID NO: 6. In some embodiments, the peptide substrate does not comprise SEQ ID NO: 7. In some embodiments, the peptide substrate does not comprise SEQ ID NO: 8. In some embodiments, the masking moiety (MM) comprises an extended recombinant polypeptide (XTEN).

In some embodiments of the method for preparing the therapeutic agent, the release segment (RS) is a first release segment (RS1), wherein the peptide substrate is a first peptide substrate, wherein the scissile bond is a first scissile bond, wherein the masking moiety (MM) is a first masking moiety (MM1), and wherein the recombinant polypeptide further comprises a second release segment (RS2), and a second masking moiety (MM2), wherein: the RS2 comprises a second peptide substrate susceptible for cleavage by a mammalian protease at a second scissile bond, wherein the second peptide substrate comprises an amino acid sequence having at most three amino acid substitutions, or at most two amino acid substitutions, or at most one amino acid substitution with respect to a sequence set forth in Column II or III of Table A (or a subset thereof); and the recombinant polypeptide has a structural arrangement from N-terminus to C-terminus of MM1-RS1-BP-RS2-MM2, MM1-RS2-BP-RS1-MM2, MM2-RS1-BP-RS2-MM1, or MM2-RS2-BP-RS1-MM1.

In some embodiments of the method for preparing the therapeutic agent, the second peptide substrate susceptible to cleavage by the mammalian protease is susceptible to cleavage by a plurality of mammalian proteases comprising the mammalian protease. In some embodiments, the second peptide substrate susceptible to cleavage by the plurality of mammalian proteases has at most three amino acid substitutions, or at most two amino acid substitutions, or at most one amino acid substitution with respect to a sequence set forth in Table 1(j). In some embodiments, the second peptide substrate susceptible to cleavage by the plurality of mammalian proteases comprises a sequence set forth in Table 1(j). In some embodiments, the second peptide substrate does not comprise SEQ ID NO: 1. In some embodiments, the second peptide substrate does not comprise SEQ ID NO: 2. In some embodiments, the second peptide substrate does not comprise SEQ ID NO: 3. In some embodiments, the second peptide substrate does not comprise SEQ ID NO: 4. In some embodiments, the second peptide substrate does not comprise SEQ ID NO: 5. In some embodiments, the second peptide substrate does not comprise SEQ ID NO: 6. In some embodiments, the second peptide substrate does not comprise SEQ ID NO: 7. In some embodiments, the second peptide substrate does not comprise SEQ ID NO: 8. In some embodiments, one of the first masking moiety (MM1) and the second masking moiety (MM2) comprises an extended recombinant polypeptide (XTEN). In some embodiments, the extended recombinant polypeptide (XTEN) is characterized in that: (i) it comprises at least 100 amino acids; (ii) at least 90% of the amino acid residues of it are selected from glycine (G), alanine (A), serine (S), threonine (T), glutamate (E) and proline (P); and (iii) it comprises at least 4 different types of amino acids selected from G, A, S, T, E, and P. In some embodiments, the extended recombinant polypeptide (XTEN) comprises an amino acid sequence having at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% sequence identity to a sequence selected from the group set forth in Tables 2b-2c. In some embodiments, the extended recombinant polypeptide (XTEN) is a first extended recombinant polypeptide (XTEN1), and wherein the other one of the first masking moiety (MM1) and the second masking moiety (MM2) comprises a second extended recombinant polypeptide (XTEN2). In some embodiments, the second extended recombinant polypeptide (XTEN2) is characterized in that: (i) it comprises at least 100 amino acids; (ii) at least 90% of the amino acid residues of it are selected from glycine (G), alanine (A), serine (S), threonine (T), glutamate (E) and proline (P); and (iii) it comprises at least 4 different types of amino acids selected from G, A, S, T, E, and P. In some embodiments, the XTEN1 and the XTEN2 each comprise an amino acid sequence having at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% sequence identity to a sequence selected from the group of sequences set forth in Tables 2b-2c.

In some embodiments of the method for preparing the therapeutic agent, the masking moiety (MM), when linked to the recombinant polypeptide, interferes with an interaction of the BP to a target tissue or cell such that a dissociation constant (K_d) of the BP of the recombinant polypeptide with a target cell marker borne by the target tissue or cell is greater, when the recombinant polypeptide is in an uncleaved state, compared to a dissociation constant (K_d) of a corresponding biologically active peptide, as measured in an in vitro assay under equivalent molar concentrations. In some embodiments, the first masking moiety (MM1) and the second masking moiety (MM2), when both linked in the recombinant polypeptide, interfere with an interaction of the BP to a target tissue or cell such that a dissociation constant (K_d) of the BP of the recombinant polypeptide with a target cell marker borne by the target tissue or cell is greater, when the recombinant polypeptide is in an uncleaved state, compared to a dissociation constant (K_d) of a corresponding biologically active peptide, as measured in an in vitro assay under equivalent molar concentrations. In some embodiments, the in vitro assay is selected from cell membrane integrity assay, mixed cell culture assay, cell-based competitive binding assay, FACS based propidium Iodide assay, trypan Blue influx assay, photometric enzyme release assay, radiometric 51Cr release assay, fluorometric Europium release assay, CalceinAM release assay, photometric MTT assay, XTT assay, WST-1 assay, alamar blue assay, radiometric 3H-Thd incorporation assay, clonogenic assay measuring cell division activity, fluorometric rhodamine123 assay measuring mitochondrial transmembrane gradient, apoptosis assay monitored by FACS-based phosphatidylserine exposure, ELISA-based TUNEL test assay, sandwich ELISA, caspase activity assay, cell-based LDH release assay, and cell morphology assay, or any combination thereof. In some embodiments, the activatable therapeutic agent is an activatable therapeutic agent or non-natural, activatable therapeutic agent as described herein.

Additional aspects and advantages of the present disclosure will become readily apparent to those skilled in this art from the following detailed description, wherein only illustrative embodiments of the present disclosure are shown and described. As will be realized, the present disclosure is capable of other and different embodiments, and its several details are capable of modifications in various obvious respects, all without departing from the disclosure. Accordingly, the drawings and description are to be regarded as illustrative in nature, and not as restrictive.

INCORPORATION BY REFERENCE

All publications, patents, and patent applications mentioned in this specification are herein incorporated by reference to the same extent as if each individual publication, patent, or patent application was specifically and individually indicated to be incorporated by reference. To the extent publications and patents or patent applications incorporated by reference contradict the disclosure contained in the specification, the specification is intended to supersede and/or take precedence over any such contradictory material.

BRIEF DESCRIPTION OF THE DRAWINGS

The novel features of the invention are set forth with particularity in the appended claims. A better understanding of the features and advantages of the present invention will be obtained by reference to the following detailed description that sets forth illustrative embodiments, in which the principles of the invention are utilized, and the accompanying drawings (also “Figure” and “FIG.” herein), of which:

FIG. 1 illustrates the nomenclature of a peptide biomarker sequence in a reporter polypeptide (e.g., a protein within or adjacent to a target tissue or cell from which a biomarker sequence is generated) (such as any set forth in Table A). The illustrative reporter polypeptide sequence comprises two cleavage sequences, a first cleavage sequence and a second cleavage sequence (such as any set forth in Table A), both capable of being recognized and cleaved by mammalian enzyme(s) (such as mammalian protease(s)). For example, in some cases, the first and second cleavage sequences can be recognized and cleaved by the same enzyme or the same set of enzymes. As another example, in some cases, the first and second cleavage sequences can be recognized and cleaved by different enzymes or different sets of enzymes. The first cleavage sequence contains a first scissile bond; and the second cleavage sequence, which is C-terminal to the first cleavage sequence, contains a second scissile bond. The first and second scissile bonds (such as indicated with hyphen (-) in Table A) divide the illustrative reporter polypeptide into three portions. By cleaving the illustrative reporter polypeptide with the corresponding enzyme(s) for which both the first and second cleavage sequences are substrates for, an N-terminal fragment (N-terminal to the first scissile bond), a center fragment (between the first and second scissile bonds), and a C-terminal fragment (C-terminal to the second scissile bond) can be obtained. The N-terminal, center, or C-terminal fragment (if present) (such as any set forth in Table A), or a derivative thereof, can function as a peptide biomarker sequence. The first or second cleavage sequence (such as any set forth in Table A) can be incorporated into a release segment of an activatable therapeutic agent (such as any described herein).

FIG. 2 illustrates the nomenclature of a peptide substrate and a scissile bond thereof for cleavage. The illustrative peptide substrate contains eight consecutive amino acid residues, of which four amino acid residues (with side chain groups, in the order from the N-terminus to the C-terminus, R₄, R₃, R₂, and R₁) are immediately N-terminal to the scissile bond and four amino acid residues (with side chain groups, in the order from the N-terminus to the C-terminus, R′₁, R′₂, R′₃, and R′₄) are immediately C-terminal to the scissile bond. For example, mammalian proteases can recognize up to four residues on both sides of the scissile bond. Upon cleavage, the illustrative peptide substrate separates into an N-terminal proteolytic fragment and a C-terminal proteolytic fragment. The four amino acid residues immediately N-terminal to the scissile bond in the illustrative peptide substrate forms the C-terminus of the N-terminal proteolytic fragment; and the four amino acid residues immediately C-terminal to the scissile bond in the illustrative peptide substrate forms the N-terminus of the C-terminal proteolytic fragment.

FIG. 3 illustrates a structural configuration of an exemplary activatable antibody (AA) composition comprising an antibody or a fragment thereof, a masking moiety (MM), and a release segment (RS).

FIG. 4 illustrates a structural configuration of an exemplary activatable antibody complex (AAC) composition with cross-masking occurring such that target binding by both antibodies or fragments thereof is attenuated in its uncleaved state, and target binding is increased upon cleavage of the release segment (RS) allowing the complex to disassemble. In this figure, the two antibodies or fragments thereof are referred to as the antibody domain 1 (ABD1) and antibody domain 2 (ABD2), respectively.

FIG. 5 illustrates a structural configuration of an exemplary activatable antibody complex (AAC) composition comprising two antibodies or fragments thereof, a masking moiety (MM), and a release segment (RS).

FIG. 6 illustrates a structural configuration of an exemplary activatable antibody complex (AAC) composition comprising four antibodies or fragments thereof, two masking moieties (MM) and three release segments (RS).

FIG. 7 illustrates a structural configuration of an exemplary activatable antibody composition (AA) comprising one antibody or antibody fragment (AB), two masking moieties (MM), and two release segments (RS).

FIG. 8 illustrates a structural configuration of an XTENylated Protease-Activated T-Cell Engager (XPAT). The illustrative XPAT comprises two binding moieties, each linked to an XTEN via a release segment.

FIG. 9 illustrates the results of mammalian protease cleavage of release segments having sequence similarities to a sequence found in collagen I. The cleavage site is identified by a star (★) with portions of the sequences identical to the collagen site underlined. A sequence engineered not to be recognized or cleaved by proteases that recognize the collagen-derived cleavage site is set forth as 818-NonClv (RSR-3058) and amino acids that vary from the collagen sequence are shown in black type.

DETAILED DESCRIPTION

In various cancer therapy modalities, agents have been generated that are conditionally activatable in the tumor microenvironment. However, there remains a need for developing more accurate and robust methods for predicting whether administration of these therapies will actually lead to therapeutic responses and outcomes upon administration of prodrugs or other activatable compositions. It is recognized that there is a cascade of events that leads to metastatic growth of cancer cells. A central factor in these events is the interaction between cancer cells and their microenvironment through which the tumor cells proliferate, build new vessels, leave the primary tumor bed and finally enter and persist at secondary sites of metastatic tumor growth. The extracellular matrix (ECM) of the tumor microenvironment consists of a variety of macromolecules, including collagen and glycoproteins. While the basement membranes of the ECM are formed mostly by type IV collagen, type I and type III collagen are the most abundant proteins of the underlying interstitial matrix. In healthy tissue, the ECM undergoes constant remodeling, mediated mainly by matrix-metalloproteinases (MMP), and matrix degradation is balanced by protein formation. This controlled remodeling of the ECM becomes disrupted in cancer development and progression.

In the process of MMP-mediated ECM degradation, small fragments of ECM turnover products are generated and released into the bloodstream. Several studies have shown that serum levels of collagen degradation fragments are elevated in cancer patients compared to healthy controls. Bager et al. found levels of MMP-degraded collagen type I, III and IV (i.e., C1M, C3M and C4M, respectively, Cancer Biomark. 2015; 15:783-788) to be 1.5 to 6-fold higher in ovarian and breast cancer patients than in controls. In the present invention, it is demonstrated that cleavage of the ECM by MMPs results in a cleavage product that is highly similar to the MMP cleavage site in protease-cleavable linkers in XPATs. The data presented herein demonstrate that the protease cleavable linker employed in the XPATs of this invention are more efficiently cleaved than the ECM by purified MMPs. As such, it is shown that the presence of ECM peptides in cancer patients can serve as an indicator that the patients' tumors have a microenvironment that has the appropriate protease (e.g., MMP) activity that can cleave the protease-cleavable linker in an XPAT. In this manner, the presence of the ECM peptides in the sample of a cancer patient thereby predicts whether a given patient or tumor will be able to cleave the XPAT and hence result in treatment of the tumor. This allows for a personalized approach to determine whether an XPAT will be cleaved in a given tumor type by determining whether the subject that has said tumor type has elevated plasma levels of certain cleavage product(s) derived from the extracellular matrix.

Before the embodiments of the disclosure are described, it is to be understood that such embodiments are provided by way of example only, and that various alternatives to the embodiments of the disclosure described herein may be employed in practicing the invention. Numerous variations, changes, and substitutions will now occur to those skilled in the art without departing from the invention.

Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Although methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, suitable methods and materials are described below. In case of conflict, the patent specification, including definitions, will control. In addition, the materials, methods, and examples are illustrative only and not intended to be limiting. Numerous variations, changes, and substitutions will now occur to those skilled in the art without departing from the invention.

Definitions

In the context of the present application, the following terms have the meanings ascribed to them unless specified otherwise:

As used throughout the specification and claims, the terms “a”, “an” and “the” are generally used in the sense that they mean “at least one”, “at least a first”, “one or more” or “a plurality” of the referenced components or steps, except in instances wherein an upper limit is thereafter specifically stated. For example, a “cleavage sequence”, as used herein, means “at least a first cleavage sequence” but includes a plurality of cleavage sequences. The operable limits and parameters of combinations, as with the amounts of any single agent, will be known to those of ordinary skill in the art in light of the present disclosure.

The term “activatable,” as used herein with respect to a therapeutic agent, generally means that an activity or bioactivity of the therapeutic agent is capable of being enhanced upon activation, for example, via a physical, chemical or physiological process (e.g., enzymatic processes and metabolic processes).

As used herein, the term “activatable therapeutic agent,” generally refers to a therapeutic agent, of which an activity or bioactivity is capable of being enhanced upon activation, for example, via a physical, chemical or physiological process (e.g., enzymatic processes and metabolic processes). For example, the term “activatable therapeutic agent” may refer to a therapeutic agent in an inactive (or less active) state (at least inactive in one aspect) configured to be activated (i.e., in vitro, in vivo, or ex vivo) into an active (or more active) state (at least in the aspect that is inactive prior to activation). As another example, the term “activatable therapeutic agent” may refer to an active therapeutic agent (at least active in one aspect), of which an activity or bioactivity can be further enhanced (i.e., in vitro, in vivo, or ex vivo). Non-limiting examples of an activatable therapeutic agent include a prodrug, a probody, and a pro-moiety.

The terms “polypeptide”, “peptide”, and “protein” are used interchangeably herein to generally refer to polymers of amino acids of any length. The polymer may be linear or branched, it may comprise modified amino acids, and it may be interrupted by non-amino acids. The terms also encompass an amino acid polymer that has been modified, for example, by disulfide bond formation, glycosylation, lipidation, acetylation, phosphorylation, or any other manipulation, such as conjugation with a labeling component.

As used herein in the context of the structure of a polypeptide, “N-terminus” (or “amino terminus”) and “C-terminus” (or “carboxyl terminus”) generally refer to the extreme amino and carboxyl ends of the polypeptide, respectively.

The term “N-terminal end sequence,” as used herein with respect to a polypeptide or polynucleotide sequence of interest, generally means that no other amino acid or nucleotide residues precede the N-terminal end sequence in the polypeptide or polynucleotide sequence of interest at the N-terminal end. The term “C-terminal end sequence,” as used herein with respect to a polypeptide or polynucleotide sequence of interest, generally means that no other amino acid or nucleotide residues follows the C-terminal end sequence in the polypeptide or polynucleotide sequence of interest at the C-terminal end.

The terms “non-naturally occurring” and “non-natural” are used interchangeably herein. The term “non-naturally occurring” or “non-natural,” as used herein with respect to a therapeutic agent, generally means that the agent is not biologically derived in mammals (including but not limited to human). The term “non-naturally occurring” or “non-natural,” as applied to sequences and as used herein, means polypeptide or polynucleotide sequences that do not have a counterpart to, are not complementary to, or do not have a high degree of homology with a wild-type or naturally-occurring sequence found in a mammal. For example, a non-naturally occurring polypeptide or fragment may share no more than 99%, 98%, 95%, 90%, 80%, 70%, 60%, 50% or even less amino acid sequence identity as compared to a natural sequence when suitably aligned.

As used herein, the term “antibody” generally refers to an immunoglobulin molecule, or any fragment thereof, which is immunologically reactive with an antigen of interest. For example, an antibody fragment may retain the ability to bind its ligand yet have a smaller molecular size and be in a single-chain format. The term “antibody” is used herein in the broadest sense and encompasses various antibody structures, including but not limited to monoclonal antibodies, polyclonal antibodies, multispecific antibodies (e.g., bispecific antibodies), and antibody fragments so long as they exhibit the desired antigen-binding activity. The full-length antibodies may be for example monoclonal, recombinant, chimeric, deimmunized, humanized and human antibodies.

A “variant,” when applied to a biologically active protein is a protein with sequence homology to the native biologically active protein that retains at least a portion of the therapeutic and/or biological activity of the biologically active protein. For example, a variant protein may share at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% amino acid sequence identity compared with the reference biologically active protein. As used herein, the term “biologically active protein variant” includes proteins modified deliberately, as for example, by site directed mutagenesis, synthesis of the encoding gene, insertions, or accidentally through mutations and that retain activity.

The term “sequence variant” means polypeptides that have been modified compared to their native or original sequence by one or more amino acid insertions, deletions, or substitutions. Insertions may be located at either or both termini of the protein, and/or may be positioned within internal regions of the amino acid sequence. A non-limiting example is substitution of an amino acid in an XTEN with a different amino acid. In deletion variants, one or more amino acid residues in a polypeptide as described herein are removed. Deletion variants, therefore, include all fragments of a described polypeptide sequence. In substitution variants, one or more amino acid residues of a polypeptide are removed and replaced with alternative residues. In one aspect, the substitutions are conservative in nature and conservative substitutions of this type are well known in the art. In the context of an antibody or a biologically active polypeptide, a sequence variant would retain at least a portion of the binding affinity or biological activity, respectively, of the unmodified polypeptide.

The term “moiety” means a component of a larger composition or that is intended to be incorporated into a larger composition, such as a proteinaceous portion joined to a larger polypeptide as a contiguous or non-contiguous sequence. A moiety of a larger composition can confer a desired functionality. For example, an antibody fragment may retain the ability to bind its ligand yet have a smaller molecular size and be in a single-chain format. A masking moiety (including but not limited to an extended recombinant polypeptide (XTEN)) may confer the functionality of increasing molecular weight and/or half-life of a resulting larger composition with which the masking moiety is associated.

The terms “binding domain” and “binding moiety” are used interchangeably herein and each refer to a moiety having specific binding affinity to an antigen (such as an effector cell antigen, or a tumor-specific marker or an antigen of a target cell).

As used herein, a “release segment” or “RS” generally refers to a peptide with one or more cleavage sites in the sequence that can be recognized and cleaved by one or more mammalian enzymes (such as one or more proteases).

As used herein, a “peptide substrate” generally refers to an amino acid sequence recognized by an enzyme (such as a mammalian protease), leading to cleavage at a peptide bond (or the peptide bond) within the peptide substrate such that two consecutive amino acid residues connected by the peptide bond (or the scissile bond) prior to cleavage are separated upon cleavage. As used herein, a “scissile bond” generally refers to a peptide bond joining consecutive amino acids via an amide linkage that can be cleaved (or is cleaved) by an enzyme (such as a mammalian protease). For example, in the context of a peptide substrate, the scissile bond divides the peptide substrate into a C-terminal proteolytic fragment (or a C-terminal fragment) and an N-terminal proteolytic fragment (or an N-terminal fragment), where the C-terminal proteolytic fragment (or the C-terminal fragment) is N-terminal to the scissile bond in the peptide substrate and the N-terminal proteolytic fragment (or the N-terminal fragment) is C-terminal to the scissile bond in the peptide substrate. For example, the (putative) scissile bond of each cleavage sequence listed in Table A is indicated by a hyphen (-).

As used herein, the term “scissile bond” generally refers to a peptide bond between two amino acids which is capable of being cleaved by one or more proteases.

As used herein, the term “mammalian protease” generally means a protease that normally exists in the body fluids, cells, tissues, and may be found in higher levels in certain target tissues or cells, e.g., in diseased tissues (e.g., tumor) of a mammal.

The term “within”, when referring to a first polypeptide being linked to a second polypeptide, encompasses linking or fusion of an additional component that connects the N-terminus of the first or second polypeptide to the C-terminus of the second or first polypeptide, respectively, as well as insertion of the first polypeptide into the sequence of the second polypeptide. For example, when an RS component is linked “within” an recombinant polypeptide, the RS may be linked to the N-terminus, the C-terminus, or may be inserted between any two amino acids of an XTEN polypeptide.

The term “linked directly,” as used herein in the context of a therapeutic agent, generally refers to a structure in which a moiety is connected with or attached to another moiety without an intervening tether. The term “linked indirectly,” as used herein in the context of a therapeutic agent, generally refers to a structure in which a moiety of the therapeutic agent is connected with, or attached to, another moiety of the therapeutic agent via an intervening tether. The terms “link,” “linked,” and “linking,” as used herein in the context of a therapeutic agent, generally includes both covalent and non-covalent attachment of a moiety of the therapeutic agent to another moiety of the therapeutic agent.

“Activity” (such as “bioactivity”) as applied to form(s) of a composition provided herein, generally refers to an action or effect, including but not limited to receptor binding, antagonist activity, agonist activity, a cellular or physiologic response, cell lysis, cell death, or an effect generally known in the art for the effector component of the composition, whether measured by an in vitro, ex vivo or in vivo assay or a clinical effect.

“Effector cell”, as used herein, includes any eukaryotic cells capable of conferring an effect on a target cell. For example, an effect cell can induce loss of membrane integrity, pyknosis, karyorrhexis, apoptosis, lysis, and/or death of a target cell. In another example, an effector cell can induce division, growth, differentiation of a target cell or otherwise altering signal transduction of a target cell.

An “effector cell antigen” refers to molecules expressed by an effector cell, including without limitation cell surface molecules such as proteins, glycoproteins or lipoproteins. An effector cell antigen can serve as the binding counterpart of a binding moiety of the subject recombinant polypeptide.

As used herein, the term “ELISA” refers to an enzyme-linked immunosorbent assay as described herein or as otherwise known in the art.

A “host cell” generally includes an individual cell or cell culture which can be or has been a recipient for the subject vectors into which exogenous nucleic acid has been introduced, such as those described herein. Host cells include progeny of a single host cell. The progeny may not necessarily be completely identical (in morphology or in genomic of total DNA complement) to the original parent cell due to natural, accidental, or deliberate mutation. A host cell includes cells transfected in vivo with a vector of this disclosure.

The term “isolated”, when used to describe the various polypeptides disclosed herein, generally means polypeptide that has been identified and separated and/or recovered from a component of its natural environment or from a more complex mixture (such as during protein purification). Contaminant components of its natural environment are materials that would typically interfere with diagnostic or therapeutic uses for the polypeptide, and may include enzymes, hormones, and other proteinaceous or non-proteinaceous solutes. As is apparent to those of skill in the art, a non-naturally occurring polynucleotide, peptide, polypeptide, protein, antibody, or fragments thereof, does not require “isolation” to distinguish it from its naturally occurring counterpart. In addition, a “concentrated”, “separated” or “diluted” polynucleotide, peptide, polypeptide, protein, antibody, or fragments thereof, is distinguishable from its naturally occurring counterpart in that the concentration or number of molecules per volume is generally greater than that of its naturally occurring counterpart. In general, a polypeptide made by recombinant means and expressed in a host cell is considered to be “isolated.”

An “isolated nucleic acid” is a nucleic acid molecule that is identified and separated from at least one contaminant nucleic acid molecule with which it is ordinarily associated in the natural source of the polypeptide-encoding nucleic acid. For example, an isolated polypeptide-encoding nucleic acid molecule is other than in the form or setting in which it is found in nature. Isolated polypeptide-encoding nucleic acid molecules therefore are distinguished from the specific polypeptide-encoding nucleic acid molecule as it exists in natural cells. However, an isolated polypeptide-encoding nucleic acid molecule includes polypeptide-encoding nucleic acid molecules contained in cells that ordinarily express the polypeptide where, for example, the nucleic acid molecule is in a chromosomal or extra-chromosomal location different from that of natural cells.

A “chimeric” protein or polypeptide contains at least one fusion polypeptide comprising at least one region in a different position in the sequence than that which occurs in nature. The regions may normally exist in separate proteins and are brought together in the fusion polypeptide; or they may normally exist in the same protein but are placed in a new arrangement in the fusion polypeptide. A chimeric protein may be created, for example, by chemical synthesis, or by recombinantly creating and translating a polynucleotide in which the peptide regions are encoded in the desired relationship.

The terms “fused” and “fusion” are used interchangeably herein, and refers to the joining together of two or more peptide or polypeptide sequences by recombinant means. A “fusion protein” or “chimeric protein” comprises a first amino acid sequence linked to a second amino acid sequence with which it is not naturally linked in nature.

“Uncleaved” and “uncleaved state” are used interchangeably herein, and refers to a polypeptide that has not been cleaved or digested by a protease such that the polypeptide remains intact.

“XTENylated” is used to denote a peptide or polypeptide that has been modified by the linking or fusion of one or more XTEN polypeptides (described, below) to the peptide or polypeptide, whether by recombinant or chemical cross-linking means.

“Crosslinking,” and “conjugating,” are used interchangeably herein, and refer to the covalent joining of two different molecules by a chemical reaction. The crosslinking can occur in one or more chemical reactions, as known in the art.

In the context of polypeptides, a “linear sequence” or a “sequence” is an order of amino acids in a polypeptide in an amino to carboxyl terminus (N- to C-terminus) direction in which residues that neighbor each other in the sequence are contiguous in the primary structure of the polypeptide. A “partial sequence” is a linear sequence of part of a polypeptide that is known to comprise additional residues in one or both directions.

“Heterologous” means derived from a genotypically distinct entity from the rest of the entity to which it is being compared. For example, a glycine rich sequence removed from its native coding sequence and operatively linked to a coding sequence other than the native sequence is a heterologous glycine rich sequence. The term “heterologous” as applied to a polynucleotide, a polypeptide, means that the polynucleotide or polypeptide is derived from a genotypically distinct entity from that of the rest of the entity to which it is being compared.

The terms “polynucleotides”, “nucleic acids”, “nucleotides” and “oligonucleotides” are used interchangeably. They refer to nucleotides of any length, encompassing a singular nucleic acid as well as plural nucleic acids, either deoxyribonucleotides or ribonucleotides, or analogs thereof. Polynucleotides may have any three-dimensional structure, and may perform any function, known or unknown. The following are non-limiting examples of polynucleotides: coding or non-coding regions of a gene or gene fragment, loci (locus) defined from linkage analysis, exons, introns, messenger RNA (mRNA), transfer RNA, ribosomal RNA, ribozymes, cDNA, recombinant polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA of any sequence, isolated RNA of any sequence, nucleic acid probes, and primers. A polynucleotide may comprise modified nucleotides, such as methylated nucleotides and nucleotide analogs. If present, modifications to the nucleotide structure may be imparted before or after assembly of the polymer. The sequence of nucleotides may be interrupted by non-nucleotide components. A polynucleotide may be further modified after polymerization, such as by conjugation with a labeling component.

As used herein, the term “reporter polypeptide(s)” refers to human polypeptide(s) or protein(s) that, under certain circumstances, can be acted upon to generate a detectable signal (such as being enzymatically digested to produce detectable peptide sequence(s)) that can be identified and characterized from outside of a cell, organ, tissue, or body of a subject. For example, a “reporter polypeptide” can be a human protein capable of being cleaved by protease(s) that are also capable of cleaving activatable therapeutic agent(s) (such as described hereinbelow) comprising peptide substrate. Non-limiting examples of peptide substrates include those described hereinbelow in section “Release Segments (RS).”

The term “complement of a polynucleotide” denotes a polynucleotide molecule having a complementary base sequence and reverse orientation as compared to a reference sequence, such that it could hybridize with a reference sequence with complete fidelity.

“Recombinant” as applied to a polynucleotide means that the polynucleotide is the product of various combinations of recombination steps which may include cloning, restriction and/or ligation steps, and other procedures that result in expression of a recombinant protein in a host cell.

The terms “gene” and “gene fragment” are used interchangeably herein. They refer to a polynucleotide containing at least one open reading frame that is capable of encoding a particular protein after being transcribed and translated. A gene or gene fragment may be genomic or cDNA, as long as the polynucleotide contains at least one open reading frame, which may cover the entire coding region or a segment thereof. A “fusion gene” is a gene composed of at least two heterologous polynucleotides that are linked together.

The term “homology” or “homologous” or “identity” interchangably refers to sequence similarity between two or more polynucleotide sequences or between two or more polypeptide sequences. When using a program such as BestFit to determine sequence identity, similarity or homology between two different amino acid sequences, the default settings may be used, or an appropriate scoring matrix, such as blosum45 or blosum80, may be selected to optimize identity, similarity or homology scores. Preferably, polynucleotides that are homologous are those which hybridize under stringent conditions as defined herein and have at least 70%, preferably at least 80%, more preferably at least 90%, more preferably 95%, more preferably 97%, more preferably 98%, and even more preferably 99% sequence identity, when optimally aligned, compared to those sequences. Polypeptides that are homologous preferably have sequence identities that are at least 70%, preferably at least 80%, even more preferably at least 90%, even more preferably at least 95-99% identical when optimally aligned over sequences of comparable length.

The terms “percent identity,” percentage of sequence identity,” and “% identity,” as applied to polynucleotide sequences, refer to the percentage of residue matches between at least two polynucleotide sequences aligned using a standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps in the sequences being compared in order to optimize alignment between two sequences, and therefore achieve a more meaningful comparison of the two sequences. Percent identity may be measured over the length of an entire defined polynucleotide sequence, or may be measured over a shorter length, for example, over the length of a fragment taken from a larger, defined polynucleotide sequence, for instance, a fragment of at least 45, at least 60, at least 90, at least 120, at least 150, at least 210 or at least 450 contiguous residues. Such lengths are exemplary only, and it is understood that any fragment length supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be used to describe a length over which percentage identity may be measured. The percentage of sequence identity is calculated by comparing two optimally aligned sequences over the window of comparison, determining the number of matched positions (at which identical residues occur in both polypeptide sequences), dividing the number of matched positions by the total number of positions in the window of comparison (e.g., the window size), and multiplying the result by 100 to yield the percentage of sequence identity. When sequences of different length are to be compared, the shortest sequence defines the length of the window of comparison. Conservative substitutions are not considered when calculating sequence identity.

“Percent (%) sequence identity” and “percent (%) identity” with respect to the polypeptide sequences identified herein, is defined as the percentage of amino acid residues in a query sequence that are identical with the amino acid residues of a second, reference polypeptide sequence of comparable length or a portion thereof, after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent sequence identity, and not considering any conservative substitutions as part of the sequence identity, thereby resulting in optimal alignment. Alignment for purposes of determining percent amino acid sequence identity can be achieved in various ways that are within the skill in the art, for instance, using publicly available computer software such as BLAST, BLAST-2, ALIGN or Megalign (DNASTAR) software. Those skilled in the art can determine appropriate parameters for measuring alignment, including any algorithms needed to achieve optimal alignment over the full length of the sequences being compared. Percent identity may be measured over the length of an entire defined polypeptide sequence, or may be measured over a shorter length, for example, over the length of a fragment taken from a larger, defined polypeptide sequence, for instance, a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 150 contiguous residues. Such lengths are exemplary only, and it is understood that any fragment length supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be used to describe a length over which percentage identity may be measured.

The term “expression” as used herein refers to a process by which a polynucleotide produces a gene product, for example, an RNA or a polypeptide. It includes without limitation transcription ofthe polynucleotide into messenger RNA (mRNA), transfer RNA (tRNA), small hairpin RNA (shRNA), small interfering RNA (siRNA) or any other RNA product, and the translation of an mRNA into a polypeptide. Expression produces a “gene product.” As used herein, a gene product can be either a nucleic acid, e.g., a messenger RNA produced by transcription of a gene, or a polypeptide which is translated from a transcript. Gene products described herein further include nucleic acids with post transcriptional modifications, e.g., polyadenylation or splicing, or polypeptides with post translational modifications, e.g., methylation, glycosylation, the addition of lipids, association with other protein subunits, or proteolytic cleavage.

A “vector” or “expression vector” are used interchangeably and refers to a nucleic acid molecule, preferably self-replicating in an appropriate host, which transfers an inserted nucleic acid molecule into and/or between host cells. The term includes vectors that function primarily for insertion of DNA or RNA into a cell, replication of vectors that function primarily for the replication of DNA or RNA, and expression vectors that function for transcription and/or translation of the DNA or RNA. Also included are vectors that provide more than one of the above functions. An “expression vector” is a polynucleotide which, when introduced into an appropriate host cell, can be transcribed and translated into a polypeptide(s). An “expression system” usually connotes a suitable host cell comprised of an expression vector that can function to yield a desired expression product.

The terms “t_1/2”, “half-life”, “terminal half-life”, “elimination half-life” and “circulating half-life” are used interchangeably herein and, as used herein, generally means the terminal half-life calculated as ln(2)/K_el. K_elis the terminal elimination rate constant calculated by linear regression of the terminal linear portion of the log concentration vs. time curve. Half-life typically refers to the time required for half the quantity of an administered substance deposited in a living organism to be metabolized or eliminated by normal biological processes. When a clearance curve of a given polypeptide is constructed as a function of time, the curve is usually biphasic with a rapid α-phase and longer beta-phase. The typical beta-phase half-life of a human antibody in humans is 21 days. Half-life can be measured using timed samples from any body fluid, but is most typically measured in serum or plasma samples.

The term “molecular weight” generally refers to the sum of atomic weights of the constituent atoms in a molecule. Molecular weight can be determined theoretically by summing the atomic masses of the constituent atoms in a molecule. When applied in the context of a polypeptide, the molecular weight is calculated by adding, based on amino acid composition, the molecular weight of each type of amino acid in the composition or by estimation from comparison to molecular weight standards in an SDS electrophoresis gel. The calculated molecular weight of a molecule can differ from the apparent molecular weight of a molecule, which generally refers to the molecular weight of a molecule as determined by one or more analytical techniques. “Apparent molecular weight factor” and “apparent molecular weight” are related terms and when used in the context of a polypeptide, the terms refer to a measure of the relative increase or decrease in apparent molecular weight exhibited by a particular amino acid or polypeptide sequence. The apparent molecular weight can be determined, for example, using size exclusion chromatography (SEC) or similar methods by comparing to globular protein standards, as measured in “apparent kD” units. The apparent molecular weight factor is the ratio between the apparent molecular weight and the “molecular weight”; the latter is calculated by adding, based on amino acid composition as described above, or by estimation from comparison to molecular weight standards in an SDS electrophoresis gel. The determination of apparent molecular weight and apparent molecular weight factor is described inter alia in U.S. Pat. No. 8,673,860.

The terms “hydrodynamic radius” or “Stokes radius” is the effective radius (Rh in nm) of a molecule in a solution measured by assuming that it is a body moving through the solution and resisted by the solution's viscosity. In the embodiments of the disclosure, the hydrodynamic radius measurements of the XTEN polypeptides correlate with the “apparent molecular weight factor” which is a more intuitive measure. The “hydrodynamic radius” of a protein affects its rate of diffusion in aqueous solution as well as its ability to migrate in gels of macromolecules. The hydrodynamic radius of a protein is determined by its molecular weight as well as by its structure, including shape and compactness. Methods for determining the hydrodynamic radius are well known in the art, such as by the use of size exclusion chromatography (SEC), as described inter alia in U.S. Pat. Nos. 6,406,632 and 7,294,513. Most proteins have globular structure, which is the most compact three-dimensional structure a protein can have with the smallest hydrodynamic radius. Some proteins adopt a random and open, unstructured, or ‘linear’ conformation and as a result have a much larger hydrodynamic radius compared to typical globular proteins of similar molecular weight.

“Physiological conditions” refers to a set of conditions in a living host as well as in vitro conditions, including temperature, salt concentration, pH, that mimic those conditions of a living subject. A host of physiologically relevant conditions for use in in vitro assays have been established. Generally, a physiological buffer contains a physiological concentration of salt and is adjusted to a neutral pH ranging from about 6.5 to about 7.8, and preferably from about 7.0 to about 7.5. A variety of physiological buffers are listed in Sambrook et al. (2001). Physiologically relevant temperature ranges from about 25° C. to about 38° C., and preferably from about 35° C. to about 37° C.

The term “binding moiety” is used herein in the broadest sense, and is specifically intended to include the categories of cytokines, cell receptors, antibodies or antibody fragments that have specific affinity for an antigen or ligand such as cell-surface receptors, target cell markers, or antigens or glycoproteins, oligonucleotides, enzymatic substrates, antigenic determinants, or binding sites that may be present in or on the surface of a tissue or cell.

The term “monoclonal antibody” as used herein refers to an antibody obtained from a population of substantially homogeneous antibodies, e.g., the individual antibodies comprising the population are identical and/or bind the same epitope, except for possible variant antibodies, e.g., containing naturally occurring mutations or arising during production of a monoclonal antibody preparation, such variants generally being present in minor amounts. In contrast to polyclonal antibody preparations, which typically include different antibodies directed against different determinants (epitopes), each monoclonal antibody of a monoclonal antibody preparation is directed against a single determinant on an antigen. Thus, the modifier “monoclonal” indicates the character of the antibody as being obtained from a substantially homogeneous population of antibodies, and is not to be construed as requiring production of the antibody by any particular method. For example, the monoclonal antibodies to be used in accordance with the present invention may be made by a variety of techniques, including but not limited to the hybridoma method, recombinant DNA methods, phage-display methods, and methods utilizing transgenic animals containing all or part of the human immunoglobulin loci, such methods and other exemplary methods for making monoclonal antibodies being known in the art or described herein.

An “antibody fragment,” as used herein, generally refers to a molecule other than an intact antibody that comprises a portion of an intact antibody and that binds the antigen to which the intact antibody binds. Examples of antibody fragments include but are not limited to Fv, Fab, Fab′, Fab′-SH, F(ab′)2, diabodies, single chain diabodies, linear antibodies, nanobodies (also known as single domain antibodies (including single domain camelid antibodies) or V_HH) single-chain variable fragment (scFv) antibody molecules, and multispecific antibodies formed from antibody fragments.

“scFv” or “single chain fragment variable” are used interchangeably herein to refer to an antibody fragment format comprising regions of variable heavy (“VH”) and variable light (“VL”) chains or two copies of a VH or VL chain, which are joined together by a short flexible peptide linker. The scFv is not actually a fragment of an antibody, but is a fusion protein of the variable regions of the heavy (VH) and light chains (VL) of immunoglobulins, and can be easily expressed in functional form in E. coli or mammalian cell(s) in either N- to C-terminus orientation; VL-VH or VH-VL.

The terms “antigen”, “target cell marker” and “ligand” are used interchangeably herein to refer to the structure or binding determinant that a binding moiety, an antibody, antibody fragment or an antibody fragment-based molecule binds to or has binding specificity against.

The term “epitope” refers to the particular site on an antigen molecule to which an antibody, antibody fragment, or binding moiety binds. An epitope is a ligand of an antibody, antibody fragment, or a binding moiety.

As used herein, “CD3” or “cluster of differentiation 3” means the T cell surface antigen CD3 complex, which includes in individual form or independently combined form all known CD3 subunits, for example CD3 epsilon, CD3 delta, CD3 gamma, CD3 zeta, CD3 alpha and CD3 beta. The extracellular domains of CD3 epsilon, gamma and delta contain an immunoglobulin-like domain, so are therefore considered part of the immunoglobulin superfamily.

The terms “specific binding” or “specifically bind” or “binding specificity” are used interchangeably herein to refer to the high degree of binding affinity of a binding moiety to its corresponding target. Typically, specific binding as measured by one or more of the assays disclosed herein would have a dissociation constant or K_dof less than about 10⁻⁶M (e.g, of 10⁻⁷M to 10⁻¹²M).

The term “affinity,” as used herein, generally refers to the strength of the sum total of noncovalent interactions between a single binding site of a molecule (e.g., an antibody) and its binding partner (e.g., an antigen). Unless indicated otherwise, as used herein, “binding affinity” refers to intrinsic binding affinity which reflects a 1:1 interaction between members of a binding pair (e.g., antibody and antigen). The affinity of a molecule X for its partner Y can generally be represented by the dissociation constant (K_d). As used herein “a greater binding affinity” or “increased binding affinity” means a lower K_dvalue; e.g., 1×10⁻⁹M is a greater binding affinity than 1×10⁻⁸M, while a “lower binding affinity” means a greater K_dvalue; e.g., 1×10⁻⁷M is a lower binding affinity than 1×10⁻⁸M.

“Inhibition constant”, or “K_i”, are used interchangeably and mean the dissociation constant of the enzyme-inhibitor complex, or the reciprocal of the binding affinity of the inhibitor to the enzyme.

“Dissociation constant”, or “K_d”, are used interchangeably and mean the affinity between a ligand “L” and a protein “P”; e.g., how tightly a ligand binds to a particular protein. It can be calculated using the formula K_d=[L] [P]/[LP], where [P], [L] and [LP] represent molar concentrations of the protein, ligand and complex, respectively. The term “k_on”, as used herein, is intended to refer to the on rate constant for association of an antibody to the antigen to form the antibody/antigen complex as is known in the art. The term “k_off”, as used herein, is intended to refer to the off rate constant for dissociation of an antibody from the antibody/antigen complex as is known in the art. Techniques such as flow cytometry or surface plasmon resonance can be used to detect binding events. The assays may comprise soluble antigens or receptor molecules, or may determine the binding to cell-expressed receptors. Such assays may include cell-based assays, including assays for proliferation, cell death, apoptosis and cell migration. The binding affinity of the subject compositions for the target ligands can be assayed using binding or competitive binding assays, such as Biacore assays with chip-bound receptors or binding proteins or ELISA assays, as described in U.S. Pat. No. 5,534,617, assays described in the Examples herein, radio-receptor assays, reporter gene activity assays, or other assays known in the art. For example, an exemplary reporter gene activity assay can be based on genetically engineered cell(s), generated by stably introducing relevant gene(s) for the receptor(s)-of-interest and the signaling pathway(s)-of-interest, such that binding to the engineered receptor triggers a signaling cascade leading to the activation of the engineered gene pathway with a subsequent production of signature polypeptide(s) (such as an enzyme). The binding affinity constant can then be determined using standard methods, such as Scatchard analysis, as described by van Zoelen, et al., Trends Pharmacol Sciences (1998) 19)12):487, or other methods known in the art.

A “target cell marker” refers to a molecule expressed by a target cell including but not limited to cell-surface receptors, cytokine receptors, antigens, tumor-associated antigens, glycoproteins, oligonucleotides, enzymatic substrates, antigenic determinants, or binding sites that may be present in the on the surface of a target tissue or cell that may serve as ligands for a binding moiety. Non-limiting examples of target cell markers include the target markers of Table 6.

The term “target tissue” generally refers to a tissue that is the cause of or is part of a disease condition such as, but not limited to cancer or inflammatory conditions. Sources of diseased target tissue include a body organ, a tumor, a cancerous cell or population of cancerous cells or cells that form a matrix or are found in association with a population of cancerous cells, bone, skin, cells that produce cytokines or factors contributing to a disease condition.

The term “target cell” generally refers to a cell that has the ligand of a binding moiety, an antibody or antibody fragment of the subject compositions and is associated with or causes a disease or pathologic condition, including cancer cells, tumor cells, and inflammatory cells. The ligand of a target cell is referred to herein as a “target cell marker” or “target cell antigen” and includes, but is not limited to, cell surface receptors or antigens, cytokines, cytokine receptors, MHC proteins, and cytosol proteins or peptides that are exogenously presented. As used herein, “target cell” would not include an effector cell.

As used herein, an “immunoassay” generally refers to a biochemical test that measures the presence or concentration of a substance in a sample, such as a biological sample, using the reaction of an antibody (or a fragment thereof) to its cognate antigen, for example the specific binding of an antibody to a protein. Both the presence of the antigen or the amount of the antigen present can be measured.

As used herein, a “mass spectrometer (MS)” generally refers to an apparatus that includes a means for ionizing molecules and detecting charged molecules. A mass spectrum generated by a mass spectrometer can be used to identify molecule(s) of interest based on the molar mass. Non-limiting examples of “mass spectrometer (MS)” include all combinations with liquid chromatography (LC), such as liquid chromatography with mass spectrometry (LC-MS), liquid chromatography with tandem mass spectrometry (LC-MS/MS), etc.

As used herein, the terms “treatment” or “treating,” or “palliating” or “ameliorating” are used interchangeably herein. These terms generally refer to an approach for obtaining beneficial or desired results including but not limited to a therapeutic benefit and/or a prophylactic benefit. By therapeutic benefit is meant eradication or amelioration of the underlying disorder being treated. Also, a therapeutic benefit is achieved with the eradication or amelioration of one or more of the physiological symptoms or improvement in one or more clinical parameters associated with the underlying disorder such that an improvement is observed in the subject, notwithstanding that the subject may still be afflicted with the underlying disorder. For prophylactic benefit, the compositions may be administered to a subject at risk of developing a particular disease, or to a subject reporting one or more of the physiological symptoms of a disease, even though a diagnosis of this disease may not have been made.

A “therapeutic effect” or “therapeutic benefit,” as used herein, generally refers to a physiologic effect, including but not limited to the mitigation, amelioration, or prevention of disease or an improvement in one or more clinical parameters associated with the underlying disorder in humans or other animals, or to otherwise enhance physical or mental wellbeing of humans or animals, resulting from administration of a polypeptide of the disclosure other than the ability to induce the production of an antibody against an antigenic epitope possessed by the biologically active protein. For prophylactic benefit, the compositions may be administered to a subject at risk of developing a particular disease, a recurrence of a former disease, condition or symptom of the disease, or to a subject reporting one or more of the physiological symptoms of a disease, even though a diagnosis of this disease may not have been made.

The terms “therapeutically effective amount” and “therapeutically effective dose”, as used herein, generally refer to an amount of a drug or a biologically active protein, either alone or as a part of a polypeptide composition, that is capable of having any detectable, beneficial effect on any symptom, aspect, measured parameter or characteristics of a disease state or condition when administered in one or repeated doses to a subject. Such effect need not be absolute to be beneficial. Determination of a therapeutically effective amount is well within the capability of those skilled in the art, especially in light of the detailed disclosure provided herein.

The term “equivalent molar dose” generally means that the amounts of materials administered to a subject have an equivalent amount of moles, based on the molecular weight of the material used in the dose.

The term “therapeutically effective and non-toxic dose,” as used herein, generally refers to a tolerable dose of the compositions as defined herein that is high enough to cause depletion of tumor or cancer cells, tumor elimination, tumor shrinkage or stabilization of disease without or essentially without major toxic effects in the subject. Such therapeutically effective and non-toxic doses may be determined by dose escalation studies described in the art and should be below the dose inducing severe adverse side effects.

The terms “cancer” and “cancerous” refer to or describe the physiological condition in mammals that is typically characterized by unregulated cell growth/proliferation.

Compositions
Therapeutic Agents

Provided herein, in some embodiments, is a therapeutic agent (or an activatable therapeutic agent, or a non-natural, activatable therapeutic agent) that comprises a release segment (RS) (such as one described hereinbelow in the RELEASE SEGMENTS section or described anywhere else herein) linked, directly or indirectly, to a biologically active moiety (BM) (such as one described hereinbelow in the BIOLOGICALLY ACTIVE MOIETIES section or described anywhere else herein). The biologically active moiety (BM) can be a biologically active peptide (BP) (such as one described hereinbelow in the BIOLOGICALLY ACTIVE MOIETIES section or described anywhere else herein). The release segment (RS) can comprise a peptide substrate (such as one described hereinbelow in the RELEASE SEGMENTS section or described anywhere else herein) susceptible to cleavage by a mammalian protease (such as one described hereinbelow or described anywhere else herein) at a scissile bond. The therapeutic agent can further comprise a masking moiety (MM) (such as one described hereinbelow in the MASKING MOIETIES section or described anywhere else herein) linked, directly or indirectly, to the release segment (RS). A bioactivity of the therapeutic agent can be enhanced upon cleavage of the peptide substrate by the mammalian protease (thereby releasing the masking moiety). The therapeutic agent, in an uncleaved state, can have a structural arrangement from N-terminus to C-terminus of BM-RS-MM or MM-RS-BM. Upon cleavage of the release segment (RS), the masking moiety (MM) can be released from the therapeutic agent. The masking moiety (MM) can comprise an extended recombinant polypeptide (XTEN). The therapeutic agent, in an uncleaved state, can have a structural arrangement from N-terminus to C-terminus of BM-RS-XTEN or XTEN-RS-BM.

In some embodiments of the therapeutic agent (or the activatable therapeutic agent, or the non-natural, activatable therapeutic agent), where the release segment (RS) can be a first release segment (RS1), where the peptide substrate (of the RS1) can be a first peptide substrate, and where the scissile bond (of the RS1) can be a first scissile bond, the therapeutic agent can further comprise a second release segment (RS2) (such as one described hereinbelow in the RELEASE SEGMENTS section or described anywhere else herein) linked, directly or indirectly, to the biologically active moiety (BM). The second release segment (RS2) can comprise a second peptide substrate (such as one described hereinbelow in the RELEASE SEGMENTS section or described anywhere else herein) for cleavage by a mammalian protease (such as one described hereinbelow or described anywhere else herein) at a second scissile bond. A bioactivity of the therapeutic agent can be enhanced upon cleavage of one or both of the first and second peptide substrate by the mammalian protease (thereby releasing one or both of the first and second masking moieties). The mammalian protease for cleavage of the second release segment (RS2) can be identical to the mammalian protease for cleavage of the first release segment (RS1). The mammalian protease for cleavage of the second release segment (RS2) can be different from the mammalian protease for cleavage of the first release segment (RS1). The second release segment (RS2) can have an amino acid sequence identical to that of the first release segment (RS1). The second release segment (RS2) can have an amino acid sequence different from that of the first release segment (RS1). In some embodiments, the scissile bond (or the first scissile bond, or the second scissile bond) is not immediately C-terminal to a methionine residue. In some embodiments, the first scissile bond is not immediately C-terminal to a methionine residue. In some embodiments, the second scissile bond is not immediately C-terminal to a methionine residue.

In some embodiments of the therapeutic agent (or the activatable therapeutic agent, or the non-natural, activatable therapeutic agent), where the masking moiety (MM) can be a first masking moiety (MM1), the therapeutic agent can further comprise a second masking moiety (MM2) (such as one described hereinbelow in the MASKING MOIETIES section or described anywhere else herein) linked, directly or indirectly, to the second release segment (RS2). The therapeutic agent, in an uncleaved state, can have a structural arrangement from N-terminus to C-terminus of MM1-RS1-BM-RS2-MM2, MM1-RS2-BM-RS1-MM2, MM2-RS1-BM-RS2-MM1, or MM2-RS2-BM-RS1-MM1. Upon cleavage of the second release segment (RS2), the second masking moiety (MM2) can be released from the therapeutic agent. The first masking moiety (MM1) can comprise a first extended recombinant polypeptide (XTEN1). The second masking moiety (MM2) can comprise a second extended recombinant polypeptide (XTEN2). The therapeutic agent, in an uncleaved state, can have a structural arrangement from N-terminus to C-terminus of XTEN1-RS1-BP-RS2-XTEN2, XTEN1-RS2-BP-RS1-XTEN2, XTEN2-RS1-BP-RS2-XTEN1, or XTEN2-RS2-BP-RS1-XTEN1.

In some embodiments of the therapeutic agent (or the activatable therapeutic agent, or the non-natural, activatable therapeutic agent), the therapeutic agent can comprise a fusion polypeptide (e.g., a recombinant fusion protein) or conjugate (e.g., linked by chemical conjugation). In some embodiments, the therapeutic agent can be configured for activation at or in proximity to a target tissue or cell (such as one described hereinbelow in the TARGET TISSUES OR CELLS section or described anywhere else herein) in a subject. The therapeutic agent can be an anti-cancer agent (such as an activatable anti-cancer agent, or a non-natural, activatable anti-cancer agent). The therapeutic agent can be configured for activation by one or more mammalian proteases (such as one or any combination of those described herein).

In some embodiments of the therapeutic agent (or the activatable therapeutic agent, or the non-natural, activatable therapeutic agent), the therapeutic agent can comprise a recombinant polypeptide. The recombinant polypeptide can comprise the biologically active peptide (BP) and the release segment (RS). The recombinant polypeptide can comprise the biologically active peptide (BP), the release segment (RS), and the masking moiety (MM). The recombinant polypeptide, in an uncleaved state, can have a structural arrangement from N-terminus to C-terminus of BP-RS-MM or MM-RS-BP. The recombinant polypeptide can comprise the biologically active peptide (BP), the first release segment (RS1), and the second release segment (RS2). The recombinant polypeptide can comprise the biologically active peptide (BP), the first release segment (RS1), the second release segment (RS2), the first masking moiety (MM1), and the second masking moiety (MM2). The recombinant polypeptide, in an uncleaved state, can have a structural arrangement from N-terminus to C-terminus of MM1-RS1-BP-RS2-MM2, MM1-RS2-BP-RS1-MM2, MM2-RS1-BP-RS2-MM1, or MM2-RS2-BP-RS1-MM1. The recombinant polypeptide can comprise the biologically active peptide (BP), the first release segment (RS1), the second release segment (RS2), the first extended recombinant polypeptide (XTEN1), and the second extended recombinant polypeptide (XTEN2). The recombinant polypeptide, in an uncleaved state, can have a structural arrangement from N-terminus to C-terminus of XTEN1-RS1-BP-RS2-XTEN2, XTEN1-RS2-BP-RS1-XTEN2, XTEN2-RS1-BP-RS2-XTEN1, or XTEN2-RS2-BP-RS1-XTEN1.

Release Segments (RS)

In some embodiments of the therapeutic agent (or the activatable therapeutic agent, or the non-natural, activatable therapeutic agent), the peptide substrate (or the first peptide substrate, or the second peptide substrate) can have at most four, or at most three, or at most two, or at most one amino acid substitution(s) with respect to a cleavage sequence (such as one set forth in Tables 1(a)-1(j) or Table A) of a reporter polypeptide (such as one described hereinbelow in the TARGET TISSUES OR CELLS section or described anywhere else herein). The peptide substrate (or the first peptide substrate, or the second peptide substrate) can have at most four, or at most three, or at most two, or at most one amino acid substitution(s) with respect to a cleavage sequence (such as one set forth in Tables 1(a)-1(j) or Table A) of the reporter polypeptide. The peptide substrate (or the first peptide substrate, or the second peptide substrate) can comprise an amino acid sequence identical to a cleavage sequence (such as one set forth in Tables 1(a)-1(j) or Table A) of the reporter polypeptide. In some embodiments of the therapeutic agent (or the activatable therapeutic agent, or the non-natural, activatable therapeutic agent), the peptide substrate (or the first peptide substrate, or the second peptide substrate) can comprise an amino acid sequence having at most four, or at most three, or at most two, or at most one amino acid substitution(s) with respect to a sequence set forth in Column II or III of Table A (or a subset thereof) and/or the group set forth in Tables 1(a)-1(j) (or any subset thereof). The peptide substrate (or the first peptide substrate, or the second peptide substrate) can comprise an amino acid sequence having at most four, or at most three, or at most two, or at most one amino acid substitution(s) with respect to a sequence set forth in Column II or III of Table A (or a subset thereof) and/or the group set forth in Tables 1(a)-1(j) (or any subset thereof). The peptide substrate (or the first peptide substrate, or the second peptide substrate) can comprise an amino acid sequence identical to a sequence set forth in Column II or III of Table A (or a subset thereof) and/or the group set forth in Tables 1(a)-1(j) (or any subset thereof). In some embodiments, the peptide substrate (or the first peptide substrate, or the second peptide substrate) comprises two or three sequences set forth in Column II or III of Table A (or a subset thereof). In some embodiments, where the peptide substrate (or the first peptide substrate, or the second peptide substrate) comprises two sequences set forth in Column II or III of Table A (or a subset thereof), the two sequences partially overlap one another. In some embodiments, where the peptide substrate (or the first peptide substrate, or the second peptide substrate) comprises two sequences set forth in Column II or III of Table A (or a subset thereof), the two sequences do not overlap one another. In some embodiments, where the peptide substrate (or the first peptide substrate, or the second peptide substrate) comprises three sequences set forth in Column II or III of Table A (or a subset thereof), two or all of the three sequences do not overlap one another. In some embodiments, where the peptide substrate (or the first peptide substrate, or the second peptide substrate) comprises three sequences set forth in Column II or III of Table A (or a subset thereof), one of the three sequences partially overlaps with another sequence or both other sequences of the three sequences. In some embodiments, where the peptide substrate (or the first peptide substrate, or the second peptide substrate) comprises three sequences set forth in Column II or III of Table A (or a subset thereof), two of the three sequences partially overlap with one another. In some embodiments, where the peptide substrate (or the first peptide substrate, or the second peptide substrate) comprises three sequences set forth in Column II or III of Table A (or a subset thereof), each two of the three sequences partially overlap with one another. In some embodiments, where the peptide substrate (or the first peptide substrate, or the second peptide substrate) comprises three sequences set forth in Column II or III of Table A (or a subset thereof), all of the three sequences partially overlap with one another. In some embodiments, none of the at most four, at most three, at most two, or at most one amino acid substitution(s) is/are at a position corresponding to an amino acid residue immediately adjacent to a scissile bond of a sequence set forth in Column II or III of Table A (or a subset thereof). In some embodiments, none of the at most four, at most three, at most two, or at most one amino acid substitution(s) is/are at a position corresponding to an amino acid residue immediately adjacent to a scissile bond of a corresponding sequence selected from the group set forth in Tables 1(a)-1(i) (or any subset thereof). In some embodiments, none of the at most four, at most three, at most two, or at most one amino acid substitution(s) is/are at a position corresponding to an amino acid residue immediately adjacent to a scissile bond of a corresponding sequence selected from the group set forth in Table 1(j) (or any subset thereof). The peptide substrate (or the first peptide substrate, or the second peptide substrate) can contain 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25 amino acid residues or a range of any two of the foregoing values. The peptide substrate can contain from six to twenty-five or six to twenty amino acid residues. The peptide substrate can contain from six to twenty-five amino acid residues. The peptide substrate can contain from six to twenty amino acid residues. In some embodiments, the peptide substrate contains from seven to twelve amino acid residues. The peptide substrate can comprise a fragment of an amino acid sequence set forth in Column II or III of Table A (or a subset thereof) and/or the group set forth in Tables 1(a)-1(j) (or any subset thereof). The fragment of the peptide substrate can contain at least four amino acid residues and a corresponding scissile bond (such as indicated in Tables 1(a)-1(j) or Table A). The fragment of the peptide substrate can contain at least five, at least six, at least seven, at least eight, at least nine, or at least ten amino acid residues. In some cases, a portion of the peptide substrate that is N-terminal of the scissile bond can have at most four, or at most three, or at most two, or at most one amino acid substitution(s) with respect to a C-terminal end sequence containing from four to ten amino acid residues of a sequence set forth in Column IV or V of Table A (or a subset thereof). The portion of the peptide substrate that is N-terminal of the scissile bond can comprise a C-terminal end sequence containing from four to ten amino acid residues of a sequence set forth in Column IV or V of Table A (or a subset thereof). In some cases, a portion of the peptide substrate that is N-terminal of the scissile bond can have at most four, or at most three, or at most two, or at most one amino acid substitution(s) with respect to a C-terminal end sequence containing from four to ten amino acid residues of a sequence set forth in Column IV of Table A (or a subset thereof). The portion of the peptide substrate that is N-terminal of the scissile bond can comprise a C-terminal end sequence containing from four to ten amino acid residues of a sequence set forth in Column IV of Table A (or a subset thereof). In some cases, a portion of the peptide substrate that is N-terminal of the scissile bond can have at most four, or at most three, or at most two, or at most one amino acid substitution(s) with respect to a C-terminal end sequence containing from four to ten amino acid residues of a sequence set forth in Column V of Table A (or a subset thereof). The portion of the peptide substrate that is N-terminal of the scissile bond can comprise a C-terminal end sequence containing from four to ten amino acid residues of a sequence set forth in Column V of Table A (or a subset thereof). In some cases, a portion of the peptide substrate that is C-terminal of the scissile bond can have at most four, or at most three, or at most two, or at most one amino acid substitution(s) with respect to an N-terminal end sequence containing from four to ten amino acid residues of a sequence set forth in Column V or VI of Table A (or a subset thereof). The portion of the peptide substrate that is C-terminal of the scissile bond can an N-terminal end sequence containing from four to ten amino acid residues of a sequence set forth in Column V or VI of Table A (or a subset thereof). In some cases, a portion of the peptide substrate that is C-terminal of the scissile bond can have at most four, or at most three, or at most two, or at most one amino acid substitution(s) with respect to an N-terminal end sequence containing from four to ten amino acid residues of a sequence set forth in Column V of Table A (or a subset thereof). The portion of the peptide substrate that is C-terminal of the scissile bond can an N-terminal end sequence containing from four to ten amino acid residues of a sequence set forth in Column V of Table A (or a subset thereof). In some cases, a portion of the peptide substrate that is C-terminal of the scissile bond can have at most four, or at most three, or at most two, or at most one amino acid substitution(s) with respect to an N-terminal end sequence containing from four to ten amino acid residues of a sequence set forth in Column VI of Table A (or a subset thereof). The portion of the peptide substrate that is C-terminal of the scissile bond can an N-terminal end sequence containing from four to ten amino acid residues of a sequence set forth in Column VI of Table A (or a subset thereof). In some embodiments, where the peptide substrate comprises a scissile bond (for cleavage by one or more mammalian proteases), the peptide substrate does not comprise a methionine residue immediately N-terminal to the scissile bond. In some embodiments, where the peptide substrate comprises a plurality of scissile bonds, the peptide substrate does not comprise a methionine residue immediately N-terminal to at least one scissile bond of the plurality of scissile bonds. In some embodiments, where the peptide substrate comprises a plurality of scissile bonds, the peptide substrate does not comprise a methionine residue immediately N-terminal to each scissile bond of the plurality of scissile bonds. In some embodiments, the peptide substrate does not comprise an amino acid sequence selected from the group consisting of #279, #280, #282, #283, #298, #299, #302, #303, #305, #307, #308, #349, #396, #397, #416, #417, #418, #458, #459, #460, #466, #481 and #482 (or any combination thereof) of Column II of Table A.

In some embodiments of the therapeutic agent (or the activatable therapeutic agent, or the non-natural, activatable therapeutic agent) that comprises (1) a first release segment (RS1) comprising a first peptide substrate and (2) a second release segment (RS2) comprising a second peptide substrate, the second peptide substrate can contain 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25 amino acid residues or a range of any two of the foregoing values. The second peptide substrate can contain from six to twenty-five or six to twenty amino acid residues. The second peptide substrate can contain from six to twenty-five amino acid residues. The second peptide substrate can contain from six to twenty amino acid residues. The second peptide substrate can contain from seven to twelve amino acid residues. The second peptide substrate can comprise an amino acid sequence having at most four, or at most three, or at most two, or at most one amino acid substitution(s) with respect to a sequence set forth in Column II or III of Table A (or a subset thereof) and/or the group set forth in Tables 1(a)-1(j) (or any subset thereof). The second peptide substrate can comprise an amino acid sequence having at most four, or at most three, or at most two, or at most one amino acid substitution(s) with respect to a sequence set forth in Column II or III of Table A (or a subset thereof) and/or the group set forth in Tables 1(a)-1(j) (or any subset thereof). The second peptide substrate can comprise an amino acid sequence identical to a sequence set forth in Column II or III of Table A (or a subset thereof) and/or the group set forth in Tables 1(a)-1(j) (or any subset thereof). In some embodiments, the second peptide substrate comprises two or three sequences set forth in Column II or III of Table A (or a subset thereof). In some embodiments, where the second peptide substrate comprises two sequences set forth in Column II or III of Table A (or a subset thereof), the two sequences (of the second peptide substrate) partially overlap one another. In some embodiments, where the second peptide substrate comprises two sequences set forth in Column II or III of Table A (or a subset thereof), the two sequences (of the second peptide substrate) do not overlap one another. In some embodiments, where the second peptide substrate comprises three sequences set forth in Column II or III of Table A (or a subset thereof), two or all of the three sequences (of the second peptide substrate) do not overlap one another. In some embodiments, where the second peptide substrate comprises three sequences set forth in Column II or III of Table A (or a subset thereof), one of the three sequences (of the second peptide substrate) partially overlaps with another sequence or both other sequences of the three sequences (of the second peptide substrate). In some embodiments, where the second peptide substrate comprises three sequences set forth in Column II or III of Table A (or a subset thereof), two of the three sequences (of the second peptide substrate) partially overlap with one another. In some embodiments, where the second peptide substrate comprises three sequences set forth in Column II or III of Table A (or a subset thereof), each two of the three sequences (of the second peptide substrate) partially overlap with one another. In some embodiments, where the second peptide substrate comprises three sequences set forth in Column II or III of Table A (or a subset thereof), all of the three sequences (of the second peptide substrate) partially overlap with one another. In some embodiments, where the second peptide substrate comprises a scissile bond (for cleavage by one or more mammalian proteases), the second peptide substrate does not comprise a methionine residue immediately N-terminal to the scissile bond. In some embodiments, where the second peptide substrate comprises a plurality of scissile bonds, the second peptide substrate does not comprise a methionine residue immediately N-terminal to at least one scissile bond of the plurality of scissile bonds. In some embodiments, where the second peptide substrate comprises a plurality of scissile bonds, the second peptide substrate does not comprise a methionine residue immediately N-terminal to each scissile bond of the plurality of scissile bonds. In some embodiments, the second peptide substrate does not comprise an amino acid sequence selected from the group consisting of #279, #280, #282, #283, #298, #299, #302, #303, #305, #307, #308, #349, #396, #397, #416, #417, #418, #458, #459, #460, #466, #481 and #482 (or any combination thereof) of Column II of Table A.

In some embodiments of the present disclosure, the peptide substrate (or the first peptide substrate, or the second peptide substrate) does not comprise a sequence selected from SEQ ID NOS: 1-8. In some embodiments, the peptide substrate (or the first peptide substrate, or the second peptide substrate) does not comprise a sequence of SEQ ID NO: 1. In some embodiments, the peptide substrate (or the first peptide substrate, or the second peptide substrate) does not comprise a sequence of SEQ ID NO: 2. In some embodiments, the peptide substrate (or the first peptide substrate, or the second peptide substrate) does not comprise a sequence of SEQ ID NO: 3. In some embodiments, the peptide substrate (or the first peptide substrate, or the second peptide substrate) does not comprise a sequence of SEQ ID NO: 4. In some embodiments, the peptide substrate (or the first peptide substrate, or the second peptide substrate) does not comprise a sequence of SEQ ID NO: 5. In some embodiments, the peptide substrate (or the first peptide substrate, or the second peptide substrate) does not comprise a sequence of SEQ ID NO: 6. In some embodiments, the peptide substrate (or the first peptide substrate, or the second peptide substrate) does not comprise a sequence of SEQ ID NO: 7. In some embodiments, the peptide substrate (or the first peptide substrate, or the second peptide substrate) does not comprise a sequence of SEQ ID NO: 8. In some embodiments, the peptide substrate (or the first peptide substrate, or the second peptide substrate) does not comprise a methionine residue immediately N-terminal to a scissile bond (contained therein) (for cleavage by one or more mammalian proteases). In some embodiments, the peptide substrate (or the first peptide substrate, or the second peptide substrate) does not comprise a methionine residue immediately N-terminal to one or more scissile bonds (contained therein). In some embodiments, the peptide substrate (or the first peptide substrate, or the second peptide substrate) does not comprise a methionine residue immediately N-terminal to any scissile bond (contained therein). In some embodiments, the peptide substrate (or the first peptide substrate or the second peptide substrate) does not comprise an amino acid sequence selected from the group consisting of #279, #280, #282, #283, #298, #299, #302, #303, #305, #307, #308, #349, #396, #397, #416, #417, #418, #458, #459, #460, #466, #481 and #482 (or any combination thereof) of Column II of Table A.

In some embodiments of the therapeutic agent (or the activatable therapeutic agent, or the non-natural, activatable therapeutic agent), a six to ten consecutive amino acid sequence of a peptide substrate (e.g., a first peptide substrate, a second peptide substrate, etc.) comprises at most four, at most three, at most two, or at most one amino acid substitution(s), with respect to a corresponding six to ten consecutive amino acid sequence of a sequence set forth in Column II or III of Table A (or a subset thereof). In some embodiments, a six to ten consecutive amino acid sequence of a peptide substrate (e.g., a first peptide substrate, a second peptide substrate, etc.) is identical to a corresponding six to ten consecutive amino acid sequence of a sequence set forth in Column II or III of Table A (or a subset thereof). In some embodiments, an eight to ten consecutive amino acid sequence of a peptide substrate (e.g., a first peptide substrate, a second peptide substrate, etc.) comprises at most three, at most two, or at most one amino acid substitution(s), with respect to a corresponding eight to ten consecutive amino acid sequence of a sequence set forth in Column II or III of Table A (or a subset thereof). In some embodiments, an eight to ten consecutive amino acid sequence of a peptide substrate (e.g., a first peptide substrate, a second peptide substrate, etc.) is identical to a corresponding eight to ten consecutive amino acid sequence of a sequence set forth in Column II or III of Table A (or a subset thereof). In some embodiments, an eight consecutive amino acid sequence of a peptide substrate (e.g., a first peptide substrate, a second peptide substrate, etc.) comprises at most three, at most two, or at most one amino acid substitution(s), with respect to a corresponding eight consecutive amino acid sequence of a sequence set forth in Column II or III of Table A (or a subset thereof). In some embodiments, an eight consecutive amino acid sequence of a peptide substrate (e.g., a first peptide substrate, a second peptide substrate, etc.) is identical to a corresponding eight consecutive amino acid sequence of a sequence set forth in Column II or III of Table A (or a subset thereof). In some embodiments, a nine consecutive amino acid sequence of a peptide substrate (e.g., a first peptide substrate, a second peptide substrate, etc.) comprises at most three, at most two, or at most one amino acid substitution(s), with respect to a corresponding nine consecutive amino acid sequence of a sequence set forth in Column II or III of Table A (or a subset thereof). In some embodiments, a nine consecutive amino acid sequence of a peptide substrate (e.g., a first peptide substrate, a second peptide substrate, etc.) is identical to a corresponding nine consecutive amino acid sequence of a sequence set forth in Column II or III of Table A (or a subset thereof). In some embodiments, a ten consecutive amino acid sequence of a peptide substrate (e.g., a first peptide substrate, a second peptide substrate, etc.) comprises at most three, at most two, or at most one amino acid substitution(s), with respect to a corresponding ten consecutive amino acid sequence of a sequence set forth in Column II or III of Table A (or a subset thereof). In some embodiments, a ten consecutive amino acid sequence of a peptide substrate (e.g., a first peptide substrate, a second peptide substrate, etc.) is identical to a corresponding ten consecutive amino acid sequence of a sequence set forth in Column II or III of Table A (or a subset thereof).

In some embodiments of the therapeutic agent (or the activatable therapeutic agent, or the non-natural, activatable therapeutic agent), the release segment (RS) (or the first release segment (RS1), or the second release segment (RS2), can (each independently) comprise a peptide substrate (or a first peptide substrate, or a second peptide substrate) for cleavage by a mammalian protease, such as a serine protease, a cysteine protease, an aspartate protease, a threonine protease, or a metalloproteinase. The release segment (RS) (or the first release segment (RS1), or the second release segment (RS2), can (independently) comprise a peptide substrate (or a first peptide substrate, or a second peptide substrate) for cleavage by a mammalian protease selected from the group consisting of disintegrin and metalloproteinase domain-containing protein 10 (ADAM10), disintegrin and metalloproteinase domain-containing protein 12 (ADAM12), disintegrin and metalloproteinase domain-containing protein 15 (ADAM15), disintegrin and metalloproteinase domain-containing protein 17 (ADAM17), disintegrin and metalloproteinase domain-containing protein 9 (ADAM9), disintegrin and metalloproteinase with thrombospondin motifs 5 (ADAMTS5), Cathepsin B, Cathepsin D, Cathepsin E, Cathepsin K, cathepsin L, cathepsin S, Fibroblast activation protein alpha, Hepsin, kallikrein-2, kallikrein-4, kallikrein-3, Prostate-specific antigen (PSA), kallikrein-13, Legumain, matrix metallopeptidase 1 (MMP-1), matrix metallopeptidase 10 (MMP-10), matrix metallopeptidase 11 (MMP-11), matrix metallopeptidase 12 (MMP-12), matrix metallopeptidase 13 (MMP-13), matrix metallopeptidase 14 (MMP-14), matrix metallopeptidase 16 (MMP-16), matrix metallopeptidase 2 (MMP-2), matrix metallopeptidase 3 (MMP-3), matrix metallopeptidase 7 (MMP-7), matrix metallopeptidase 8 (MMP-8), matrix metallopeptidase 9 (MMP-9), matrix metallopeptidase 4 (MMP-4), matrix metallopeptidase 5 (MMP-5), matrix metallopeptidase 6 (MMP-6), matrix metallopeptidase 15 (MMP-15), neutrophil elastase, protease activated receptor 2 (PAR2), plasmin, prostasin, PSMA-FOLH1, membrane type serine protease 1 (MT-SP1), matriptase, and u-plasminogen. The release segment (RS) (or the first release segment (RS1), or the second release segment (RS2), can (independently) comprise a peptide substrate (or a first peptide substrate, or a second peptide substrate) for cleavage by a mammalian protease selected from the group consisting of matrix metallopeptidase 1 (MMP1) (for which the sequences listed in Table 1(a), as examples without being limited to, are substrate sequences), matrix metallopeptidase 2 (MMP2) (for which the sequences listed in Table 1(b), as examples without being limited to, are substrate sequences), matrix metallopeptidase 7 (MMP1) (for which the sequences listed in Table 1(c), as examples without being limited to, are substrate sequences), matrix metallopeptidase 9 (MMP9) (for which the sequences listed in Table 1(d), as examples without being limited to, are substrate sequences), matrix metallopeptidase 11 (MMP11) (for which the sequences listed in Table 1(e), as examples without being limited to, are substrate sequences), matrix metallopeptidase 14 (MMP14) (for which the sequences listed in Table 1(f), as examples without being limited to, are substrate sequences), urokinase-type plasminogen activator (uPA) (for which the sequences listed in Table 1(g), as examples without being limited to, are substrate sequences), legumain (for which the sequences listed in Table 1(h), as examples without being limited to, are substrate sequences), and matriptase (for which the sequences listed in Table 1(i), as examples without being limited to, are substrate sequences). The release segment (RS) (or the first release segment (RS1), or the second release segment (RS2), can (independently) comprise a peptide substrate (or a first peptide substrate, or a second peptide substrate) for cleavage by a plurality of mammalian proteases. The peptide substrate (or the first peptide substrate, or the second peptide substrate) susceptible to cleavage by the mammalian protease can be susceptible to cleavage by a plurality of mammalian proteases comprising the mammalian protease. The peptide substrate (or the first peptide substrate, or the second peptide substrate) susceptible to cleavage by the plurality of mammalian proteases can have at most four, or at most three, or at most two, or at most one amino acid substitution(s) with respect to a sequence set forth in Table 1(j). The peptide substrate (or the first peptide substrate, or the second peptide substrate) susceptible to cleavage by the plurality of mammalian proteases can have at most four, or at most three, or at most two, or at most one amino acid substitution(s) with respect to a sequence set forth in Table 1(j). The peptide substrate (or the first peptide substrate, or the second peptide substrate) susceptible to cleavage by the plurality of mammalian proteases can have at most four, or at most three, or at most two, or at most one amino acid substitution(s) with respect to a sequence set forth in Table 1(j). The peptide substrate (or the first peptide substrate, or the second peptide substrate) susceptible to cleavage by the plurality of mammalian proteases can comprise a sequence set forth in Table 1(j).

In some embodiments of the therapeutic agent (or the activatable therapeutic agent, or the non-natural, activatable therapeutic agent) that comprises a set of release segments, each release segment in the set can (independently) comprise a peptide substrate for cleavage by a mammalian protease, such as a serine protease, a cysteine protease, an aspartate protease, a threonine protease, or a metalloproteinase. Each release segment in the set can (independently) comprise a peptide substrate for a different mammalian protease (independently) selected from the group consisting of disintegrin and metalloproteinase domain-containing protein 10 (ADAM10), disintegrin and metalloproteinase domain-containing protein 12 (ADAM12), disintegrin and metalloproteinase domain-containing protein 15 (ADAM15), disintegrin and metalloproteinase domain-containing protein 17 (ADAM17), disintegrin and metalloproteinase domain-containing protein 9 (ADAM9), disintegrin and metalloproteinase with thrombospondin motifs 5 (ADAMTS5), Cathepsin B, Cathepsin D, Cathepsin E, Cathepsin K, cathepsin L, cathepsin S, Fibroblast activation protein alpha, Hepsin, kallikrein-2, kallikrein-4, kallikrein-3, Prostate-specific antigen (PSA), kallikrein-13, Legumain, matrix metallopeptidase 1 (MMP-1), matrix metallopeptidase 10 (MMP-10), matrix metallopeptidase 11 (MMP-11), matrix metallopeptidase 12 (MMP-12), matrix metallopeptidase 13 (MMP-13), matrix metallopeptidase 14 (MMP-14), matrix metallopeptidase 16 (MMP-16), matrix metallopeptidase 2 (MMP-2), matrix metallopeptidase 3 (MMP-3), matrix metallopeptidase 7 (MMP-7), matrix metallopeptidase 8 (MMP-8), matrix metallopeptidase 9 (MMP-9), matrix metallopeptidase 4 (MMP-4), matrix metallopeptidase 5 (MMP-5), matrix metallopeptidase 6 (MMP-6), matrix metallopeptidase 15 (MMP-15), neutrophil elastase, protease activated receptor 2 (PAR2), plasmin, prostasin, PSMA-FOLH1, membrane type serine protease 1 (MT-SP1), matriptase, and u-plasminogen. Each release segment in the set can (independently) comprise a peptide substrate for a different mammalian protease (independently) selected from the group consisting of matrix metallopeptidase 1 (MMP1) (for which the sequences listed in Table 1(a), as examples without being limited to, are substrate sequences), matrix metallopeptidase 2 (MMP2) (for which the sequences listed in Table 1(b), as examples without being limited to, are substrate sequences), matrix metallopeptidase 7 (MMP1) (for which the sequences listed in Table 1(c), as examples without being limited to, are substrate sequences), matrix metallopeptidase 9 (MMP9) (for which the sequences listed in Table 1(d), as examples without being limited to, are substrate sequences), matrix metallopeptidase 11 (MMP11) (for which the sequences listed in Table 1(e), as examples without being limited to, are substrate sequences), matrix metallopeptidase 14 (MMP14) (for which the sequences listed in Table 1(f), as examples without being limited to, are substrate sequences), urokinase-type plasminogen activator (uPA) (for which the sequences listed in Table 1(g), as examples without being limited to, are substrate sequences), legumain (for which the sequences listed in Table 1(h), as examples without being limited to, are substrate sequences), and matriptase (for which the sequences listed in Table 1(i), as examples without being limited to, are substrate sequences). In some cases, at least one release segment (RS) of the set of release segments can (independently) comprise a peptide substrate for cleavage by a plurality of mammalian proteases. The peptide substrate susceptible to cleavage by the plurality of mammalian proteases can have at most four, or at most three, or at most two, or at most one amino acid substitution(s) with respect to a sequence set forth in Table 1(j). The peptide substrate susceptible to cleavage by the plurality of mammalian proteases can have at most four, or at most three, or at most two, or at most one amino acid substitution(s) with respect to a sequence set forth in Table 1(j). The peptide substrate susceptible to cleavage by the plurality of mammalian proteases can have at most four, or at most three, or at most two, or at most one amino acid substitution(s) with respect to a sequence set forth in Table 1(j). The peptide substrate susceptible to cleavage by the plurality of mammalian proteases can comprise a sequence set forth in Table 1(j). One of skill in the art will understand that a sequence set forth in Tables 1(a)-1(j) may, alternatively or additionally, be cleaved by one or more other proteases with substrate specificity similar to that of a corresponding protease, identified in a corresponding table, as capable of cleaving the sequence.

TABLE 1a

Exemplary peptide substrates for cleavage by

matrix metallopeptidase 1 (MMP1)

SEQ

ID
Amino Acid

Name of Reporter Polypeptide
NO:
Sequence

elastin
36
IGPGG-VAAAA

alpha-1-antitrypsin
37
DPQG-DAAQ

type I collagen alpha-1 chain
38
DGVRG-LTGPI

type V collagen alpha-1 chain
39
RGPSG-HMGRE

elastin
40
ISPEA-QAAAA

Complement C4-B OR Complement
41
TPLQ-LFEG

C4-A

type III collagen alpha-1 chain
42
QGPPG-KNGET

alpha-2-HS-glycoprotein
43
PPLG-APGL

apolipoprotein L1
44
KPLG-DWAA

type II collagen alpha-1 chain
45
DGAAG-VKGDR

TABLE 1b

Exemplary peptide substrates for cleavage by

matrix metallopeptidase 2 (MMP2)

SEQ

ID
Amino Acid

Name of Reporter Polypeptide
NO:
Sequence

alpha-1-antichymotrypsin
46
LLSA-LVET

pigment epithelium-derived factor
47
QPAH-LTFP

SPARC
48
DHPVE-LLARD

integrin alpha-IIb
49
QPSR-LQDP

type I collagen alpha-1 chain
50
DGVRG-LTGPI

zyxin
51
QPVS-LANT

elastin
52
IGPGG-VAAAA

vitronectin
53
LTSD-LQAQ

immunoglobulin kappa variable 2-30
54
SPLS-LPVT

type IV collagen alpha-1 chain
55
GDPGE-ILGHV

TABLE 1c

Exemplary peptide substrates for cleavage by

matrix metallopeptidase 7 (MMP7)

SEQ

ID
Amino Acid

Name of Reporter Polypeptide
NO:
Sequence

elastin
56
IGPGG-VAAAA

Complement C4-B OR Complement C4-A
57
TPLQ-LFEG

SPARC
58
DHPVE-LLARD

type I collagen alpha-1 chain
59
DGVRG-LTGPI

immunoglobulin kappa variable 2-30
60
LPVT-LGQP

pigment epithelium-derived factor
61
QPAH-LTFP

probable non-functional
62
SPVT-LGQP

immunoglobulin kappa

variable 2D-24

immunoglobulin kappa variable 3-20
63
GTLS-LSPG

fibrinogen beta chain
64
EEAPS-LRPA

type II collagen alpha-1 chain
65
DGAAG-VKGDR

TABLE 1(d)

Exemplary peptide substrates for cleavage by

matrix metallopeptidase 9 (MMP9)

SEQ

ID
Amino Acid

Name of Reporter Polypeptide
NO:
Sequence

type I collagen alpha-1 chain
66
DGVRG-LTGPI

elastin
67
IGPGG-VAAAA

type III collagen alpha-1 chain
68
QGPPG-KNGET

type V collagen alpha-1 chain
69
RGPSG-HMGRE

type II collagen alpha-1 chain
70
DGAAG-VKGDR

type VI collagen alpha-1 chain
71
KGAKG-YRGPE

alpha-2-HS-glycoprotein
72
PPLG-APGL

type VI collagen alpha-3 chain
73
IGNRG-PRGET

chromogranin-A
74
GPQL-RRGW

transcription factor SOX-10
75
SPPG-VDAK

TABLE 1(e)

Exemplary peptide substrates for cleavage by

matrix metallopeptidase 11 (MMP11)

SEQ

ID
Amino Acid

Name of Reporter Polypeptide
NO:
Sequence

alpha-1-antitrypsin
76
AAGA-MFLE

serum amyloid A-1 protein
77
AAEA-ISDA

fibrinogen alpha chain
78
EAAF-FDTA

complement C4-A OR complement
79
KSHA-LQLN

C4-B

apolipoprotein C-III
80
SARA-SEAE

ceruloplasmin
81
PAWA-KEKH

serum amyloid A-2 protein
82
AWAA-EVIS

fibrinogen beta chain
83
EEAPS-LRPA

immunoglobulin lambda variable
84
SEAS-YELT

3-25

PDZ and LIM domain protein 1
85
PFTA-SPAS

TABLE 1(f)

Exemplary peptide substrates for cleavage by

matrix metallopeptidase 14 (MMP14)

SEQ

ID
Amino Acid

Name of Reporter Polypeptide
NO:
Sequence

integrin alpha-IIb
86
QPSR-LQDP

alpha-1-antichymotrypsin
87
LLSA-LVET

pigment epithelium-derived factor
88
QPAH-LTFP

Complement C4-B OR Complement
89
TPLQ-LFEG

C4-A

zyxin
90
QPVS-LANT

type I collagen alpha-1 chain
91
DGVRG-LTGPI

SPARC
92
DHPVE-LLARD

immunoglobulin kappa variable
93
SPLS-LPVT

2-30

immunoglobulin kappa variable
94
LPVT-LGQP

2-30

elastin
95
IGPGG-VAAAA

TABLE 1(g)

Exemplary peptide substrates for cleavage by

urokinase-type plasminogen activator (uPA)

SEQ
Amino

ID
Acid

Name of Reporter Polypeptide
NO:
Sequence

serum amyloid A-2 protein
96
RSGR-DPNH

serum amyloid A-2 protein
97
AAKR-GPGG

deleted in malignant brain
98
RSKR-DVGS

tumors 1 protein

secretogranin-2
99
VSKR-FPVG

serum amyloid A-1 protein OR
100
VSSR-SFFS

serum amyloid A-2 protein

haptoglobin
101
PVQR-ILGG

fibrinogen alpha chain
102
SSGP-GSTG

fibrinogen beta chain
103
FSAR-GHRP

complement C4-A OR complement
104
RQIR-GLEE

C4-B

oncoprotein-induced transcript
105
RMRR-GAGG

3 protein

TABLE 1(h)

Exemplary peptide substrates for cleavage by

legumain

SEQ

ID
Amino Acid

Name of Reporter Polypeptide
NO:
Sequence

neurosecretory protein VGF
106
RKKN-APPE

coagulation factor XII
107
GDRN-KPGV

Complement C4-B OR Complement
108
TGRN-GFKS

C4-A

fibrinogen alpha chain
109
GSWN-SGSS

tubulin beta chain
110
EPYN-ATLS

transthyretin
111
FTAN-DSGP

fibrinogen beta chain
112
QGVN-DNEE

fibrinogen alpha chain
113
SPRN-PSSA

angiotensinogen
114
QQLN-KPEV

multimerin-1
115
TSLN-TVGG

TABLE 1(i)

Exemplary peptide substrates for cleavage by

matriptase

SEQ
Amino

ID
Acid

Name of Reporter Polypeptide
NO:
Sequence

oncoprotein-induced transcript
116
RMRR-GAGG

3 protein

deleted in malignant brain
117
RSKR-DVGS

tumors 1 protein

serum amyloid A-2 protein
118
AAKR-GPGG

inter-alpha-trypsin inhibitor
119
RVPR-QVRL

heavy chain H5

haptoglobin
120
PVQR-ILGG

alpha-2-HS-glycoprotein
121
RKTR-TVVQ

sulfhydryl oxidase 1
122
PGLR-AAPG

gastric inhibitory polypeptide
123
RGPR-YAEG

keratin, type I cytoskeletal 17
124
RQVR-TIVE

complement C4-A OR complement
125
RQIR-GLEE

C4-B

TABLE 1(j)

Exemplary peptide substrates for cleavage by

multiple proteases

SEQ

Exemplary Proteases That

ID

May Cleave the Peptide

NO.
Amino Acid Sequence
substrate

1
GPGG-VAAAVSKR-FPVG
MMP2, MMP7, uPA

2
GVRG-LTGPVSKR-FPVG
MMP2, MMP7, uPA

3
VSKR-FPVGEAGR-SAN-H
uPA, matriptase,

legumain

4
EAGR-SAN-HGVRG-LTGP
matriptase, legumain,

MMP1

5
EAGR-SAN-HTPAG-LTGP
MMP2, MMP9, matriptase,

legumain

6
SPEA-QAAAEAGR-SAN-H
MMP1, matriptase,

legumain

7
QPAH-LTFPEAGR-SAN-H
MMP2, MMP14, legumain,

matriptase

8
AGSPGK-DGVRG-LTGP
matriptase, MMP2, MMP9

Masking Moieties (MM)

A masking moiety (MM) of the present disclosure may be capable of specifically or non-specifically interacting with a biologically active moiety (BM) (or any component(s) or fragment(s) thereof) of an activatable therapeutic agent composition (such as described herein), thereby masking the BM (at least in certain cases) by inhibiting or reducing the ability of the BM to bind with designated target(s). In some instances, the masking moiety (MM) may specifically bind to or have specific affinity for the biologically active moiety (e.g., an antibody or antibody fragment), thereby interfering and/or inhibiting binding of the BM to its designed target (e.g., antigen target). In some instances, the masking moiety does not have significant affinity for the biologically active moiety, but exerts it masking effect due to non-specific steric hinderance.

In some embodiments of the therapeutic agent (or the activatable therapeutic agent, or the non-natural, activatable therapeutic agent), the masking moiety (MM) (or the first masking moiety (MM1), or the second masking moiety (MM2)), when linked to the corresponding therapeutic agent, can (each independently, individually or collectively) interfere with an interaction of the biologically active moiety (BM) to a target tissue or cell (such as one described hereinbelow in the TARGET TISSUES OR CELLS section or described anywhere else herein) such that a dissociation constant (K_d) of the BM of the therapeutic agent with a target cell marker (such as one described hereinbelow in the TARGET TISSUES OR CELLS section or described anywhere else herein) borne by the target tissue or cell can be greater, when the therapeutic agent is in an uncleaved state, compared to a dissociation constant (K_d) of a corresponding biologically active moiety (as remaining after the release segment (RS) is cleaved and the MM is released) with the target cell marker. The dissociation constant (K_d) of the biologically active moiety (BM) of the therapeutic agent, when the therapeutic agent is in an uncleaved state, with the target cell marker can be at least (about) 2-fold greater, at least (about) 5-fold greater, at least (about) 10-fold greater, at least (about) 50-fold greater, at least (about) 100-fold greater, at least (about) 200-fold greater, at least (about) 300-fold greater, at least (about) 400-fold greater, at least (about) 500-fold greater, at least (about) 600-fold greater, at least (about) 700-fold greater, at least (about) 800-fold greater, at least (about) 900-fold greater, or at least (about) 1000-fold greater, than the dissociation constant (Kd) of the corresponding biologically active moiety with the target cell marker. The dissociation constant (Kd) can be measured in an in vitro assay under equivalent molar concentrations. The in vitro assay can be selected from cell membrane integrity assay, mixed cell culture assay, cell-based competitive binding assay, FACS based propidium Iodide assay, trypan Blue influx assay, photometric enzyme release assay, radiometric 51Cr release assay, fluorometric Europium release assay, CalceinAM release assay, photometric MTT assay, XTT assay, WST-1 assay, alamar blue assay, radiometric 3H-Thd incorporation assay, clonogenic assay measuring cell division activity, fluorometric rhodamine123 assay measuring mitochondrial transmembrane gradient, apoptosis assay monitored by FACS-based phosphatidylserine exposure, ELISA-based TUNEL test assay, sandwich ELISA, caspase activity assay, cell-based LDH release assay, and cell morphology assay, reporter gene activity assay, or any combination thereof.

In some embodiments of the therapeutic agent (or the activatable therapeutic agent, or the non-natural, activatable therapeutic agent), the therapeutic agent can effect an enhancement in a safety profile, for example, improve a maximum tolerable exposure level (MTEL), and/or reduce a side effect (e.g., cytotoxicity), in delivery of the BM to a target tissue or cell (such as one described hereinbelow in the TARGET TISSUES OR CELLS section or described anywhere else herein) compared to a corresponding biologically active moiety (as remaining after the release segment (RS) is cleaved and the MM is released). The therapeutic agent, in which the biologically active moiety (BM) is linked (directly or indirectly) to the masking moiety (MM) (or the first masking moiety (MM1), or the second masking moiety (MM2)) can effect an enhancement in a safety profile, for example, improve a maximum tolerable exposure level (MTEL), and/or reduce a side effect (e.g., cytotoxicity), by at least (about) 2-fold, by at least (about) 5-fold, by at least (about) 10 fold, by at least (about) 50-fold, by at least (about) 100-fold, by at least (about) 200-fold, by at least (about) 300-fold, by at least (about) 400-fold, or by at least (about) 500-fold higher, in delivery of the BM to the target tissue or cell, than the corresponding biologically active moiety.

In some embodiments of the therapeutic agent (or the activatable therapeutic agent, or the non-natural, activatable therapeutic agent), the therapeutic agent can have a longer terminal half-life compared to that of a corresponding biologically active moiety. The therapeutic agent, in which the biologically active moiety (BM) is linked (directly or indirectly) to the masking moiety (MM) (or the first masking moiety (MM1), or the second masking moiety (MM2)) can have a terminal half-life of at least (about) 2-fold longer, at least (about) 5-fold longer, at least (about) 10-fold longer, at least (about) 15-fold longer, at least (about) 20-fold longer, at least (about) 50-fold longer, or at least (about) 100-fold longer, than the terminal half-life of the corresponding biologically active moiety.

In some embodiments, the therapeutic agent can be less immunogenic compared to a corresponding biologically active moiety. The therapeutic agent, in which the biologically active moiety (BM) is linked (directly or indirectly) to the masking moiety (MM) (or the first masking moiety (MM1), or the second masking moiety (MM2)), can be at least (about) 2-fold less immunogenic, at least (about) 5-fold less immunogenic, or at least (about) 10-fold less immunogenic, than the corresponding biologically active moiety. The immunogenicity can be ascertained by measuring production of IgG antibodies that selectively bind to the biologically active moiety after administration of comparable doses to a subject.

In some embodiments, the therapeutic agent can have a greater apparent molecular weight factor under a physiological condition, compared to a corresponding biologically active moiety. The therapeutic agent, in which the biologically active moiety (BM) is linked (directly or indirectly) to the masking moiety (MM) (or the first masking moiety (MM1), or the second masking moiety (MM2)), can have an apparent molecular weight factor of at least (about) 1.5-fold greater, at least (about) 2-fold greater, at least (about) 5-fold greater, at least (about) 8-fold greater, at least (about) 10-fold greater, at least (about) 12-fold greater, at least (about) 15-fold greater, at least (about) 18-fold greater, or at least (about) 20-fold greater, under a physiological condition, than the corresponding biologically active moiety.

In some embodiments of the therapeutic agent (or the activatable therapeutic agent, or the non-natural, activatable therapeutic agent) that comprises a first masking moiety (MM1) and a second masking moiety (MM2), the MM1 and the MM2, when both linked in the therapeutic agent, can (each independently, individually or collectively) interfere with an interaction of the biologically active moiety (BM) to a target tissue or cell (such as one described hereinbelow in the TARGET TISSUES OR CELLS section or described anywhere else herein) such that a dissociation constant (K_d) of the biologically active moiety (BM) of the therapeutic agent with a target cell marker (such as one described hereinbelow in the TARGET TISSUES OR CELLS section or described anywhere else herein) borne by the target tissue or cell can be greater, when the therapeutic agent is in an uncleaved state, compared to a dissociation constant (Kd) of a corresponding biologically active peptide (as remaining after one or both of the first release segment (RS1) and the second release segment (RS2) is/are cleaved and one or both of the MM1 and the MM2 is/are released). The dissociation constant (Kd) of the biologically active moiety (BM) of the therapeutic agent, when the therapeutic agent is in an uncleaved state, with the target cell marker can be at least (about) 2-fold greater, at least (about) 5-fold greater, at least (about) 10-fold greater, at least (about) 50-fold greater, at least (about) 100-fold greater, at least (about) 200-fold greater, at least (about) 300-fold greater, at least (about) 400-fold greater, at least (about) 500-fold greater, at least (about) 600-fold greater, at least (about) 700-fold greater, at least (about) 800-fold greater, at least (about) 900-fold greater, or at least (about) 1000-fold greater, than the dissociation constant (Kd) of the corresponding biologically active peptide. The dissociation constant (Kd) can be measured in an in vitro assay under equivalent molar concentrations. The in vitro assay can be selected from cell membrane integrity assay, mixed cell culture assay, cell-based competitive binding assay, FACS based propidium Iodide assay, trypan Blue influx assay, photometric enzyme release assay, radiometric ⁵¹Cr release assay, fluorometric Europium release assay, CalceinAM release assay, photometric MTT assay, XTT assay, WST-1 assay, alamar blue assay, radiometric 3H-Thd incorporation assay, clonogenic assay measuring cell division activity, fluorometric rhodamine123 assay measuring mitochondrial transmembrane gradient, apoptosis assay monitored by FACS-based phosphatidylserine exposure, ELISA-based TUNEL test assay, sandwich ELISA, caspase activity assay, cell-based LDH release assay, reporter gene activity assay, and cell morphology assay, or any combination thereof.

In some embodiments of the therapeutic agent (or the activatable therapeutic agent, or the non-natural, activatable therapeutic agent) that comprises a first masking moiety (MM1) and a second masking moiety (MM2), the therapeutic agent, in which the biologically active moiety (BM) is linked, directly or indirectly, to one or both of the MM1 and the MM2, can effect an enhancement in a safety profile, for example, improve a maximum tolerable exposure level (MTEL), and/or reduce a side effect (e.g., cytotoxicity), in delivery of the biologically active moiety (BM) to the target tissue or cell compared to a corresponding biologically active moiety (as remaining after one or both of the first release segment (RS1) and the second release segment (RS2) is/are cleaved and one or both of the MM1 and the MM2 is/are released). The therapeutic agent, in which the biologically active moiety (BM) is linked (directly or indirectly) to one or both of the MM1 and the MM2, can effect an enhancement in a safety profile, for example, improve a maximum tolerable exposure level (MTEL), and/or reduce a side effect (e.g., cytotoxicity) by at least (about) 2-fold, by at least (about) 5-fold, by at least (about) 10 fold, by at least (about) 50-fold, by at least (about) 100-fold, by at least (about) 200-fold, by at least (about) 300-fold, by at least (about) 400-fold, or by at least (about) 500-fold higher in delivery of the BM to the target tissue or cell, than the corresponding biologically active moiety.

In some embodiments of the therapeutic agent (or the activatable therapeutic agent, or the non-natural, activatable therapeutic agent) that comprises a first masking moiety (MM1) and a second masking moiety (MM2), the therapeutic agent, in which the biologically active moiety (BM) is linked, directly or indirectly, to one or both of the MM1 and the MM2, can have a longer terminal half-life compared to that of a corresponding biologically active moiety (as remaining after one or both of the first release segment (RS1) and the second release segment (RS2) is/are cleaved and one or both of the MM1 and the MM2 is/are released). The therapeutic agent, in which the biologically active moiety (BM) is linked (directly or indirectly) to one or both of the MM1 and the MM2, can have a terminal half-life of at least (about) 2-fold longer, at least (about) 5-fold longer, at least (about) 10-fold longer, at least (about) 15-fold longer, at least (about) 20-fold longer, at least (about) 50-fold longer, at least (about) 100-fold longer, than the terminal half-life of the corresponding biologically active moiety.

In some embodiments of the therapeutic agent (or the activatable therapeutic agent, or the non-natural, activatable therapeutic agent) that comprises a first masking moiety (MM1) and a second masking moiety (MM2), the therapeutic agent, in which the biologically active moiety (BM) is linked, directly or indirectly, to one or both of the MM1 and MM2, can be less immunogenic compared to a corresponding biologically active moiety (as remaining after one or both of the first release segment (RS1) and the second release segment (RS2) is/are cleaved and one or both of the MM1 and the MM2 is/are released). The therapeutic agent, in which the biologically active moiety (BM) is linked (directly or indirectly) to one or both of the MM1 and the MM2, can be at least (about) 2-fold less immunogenic, at least (about) 5-fold less immunogenic, or at least (about) 10-fold less immunogenic, than the corresponding biologically active moiety. The immunogenicity can be ascertained by measuring production of IgG antibodies that selectively bind to the biologically active moiety after administration of comparable doses to a subject.

In some embodiments of the therapeutic agent (or the activatable therapeutic agent, or the non-natural, activatable therapeutic agent) that comprises a first masking moiety (MM1) and a second masking moiety (MM2), the therapeutic agent, in which the biologically active moiety (BM) is linked, directly or indirectly, to one or both of the MM1 and the MM2, can have a greater apparent molecular weight factor under a physiological condition compared to a corresponding biologically active moiety. The therapeutic agent, in which the biologically active moiety (BM) is linked (directly or indirectly) to one or both of the MM1 and the MM2, can have an apparent molecular weight factor of at least (about) 1.5-fold greater, at least (about) 2-fold greater, at least (about) 5-fold greater, at least (about) 8-fold greater, at least (about) 10-fold greater, at least (about) 12-fold greater, at least (about) 15-fold greater, at least (about) 18-fold greater, or at least (about) 20-fold greater, under a physiological condition, than the corresponding biologically active moiety.

In some embodiments of the therapeutic agent (or the activatable therapeutic agent, or the non-natural, activatable therapeutic agent), the masking moiety (MM) (or the first masking moiety (MM1), or the second masking moiety (MM2)) can (each independently) comprise an extended recombinant polypeptide (XTEN). The XTEN can be characterized in that: (i) it comprises at least 100 amino acids; (ii) at least 90% of the amino acid residues of it are selected from glycine (G), alanine (A), serine (S), threonine (T), glutamate (E) and proline (P); and (iii) it comprises at least 4 different types of amino acids selected from G, A, S, T, E, and P. The XTEN can be characterized in that: (i) it comprises at least 150 amino acids; (ii) at least 90% of the amino acid residues of it are selected from glycine (G), alanine (A), serine (S), threonine (T), glutamate (E) and proline (P); and (iii) it comprises at least 4 different types of amino acids selected from G, A, S, T, E, and P. The extended recombinant polypeptide (XTEN) can (each independently) comprise an amino acid sequence having at least (about) 90%, at least (about) 91%, at least (about) 92%, at least (about) 93%, at least (about) 94%, at least (about) 95%, at least (about) 96%, at least (about) 97%, at least (about) 98%, at least (about) 99%, or 100% sequence identity to a sequence set forth in Tables 2b-2c, or any subset thereof.

In some embodiments of the therapeutic agent (or the activatable therapeutic agent, or the non-natural, activatable therapeutic agent) that comprises (1) a first masking moiety (MM1) comprising a first extended recombinant polypeptide (XTEN1) and (2) a second masking moiety (MM2) comprising a second extended recombinant polypeptide (XTEN2), the XTEN2 can be characterized in that: (i) it comprises at least 100 amino acids; (ii) at least 90% of the amino acid residues of it are selected from glycine (G), alanine (A), serine (S), threonine (T), glutamate (E) and proline (P); and (iii) it comprises at least 4 different types of amino acids selected from G, A, S, T, E, and P. The XTEN2 can be characterized in that: (i) it comprises at least 150 amino acids; (ii) at least 90% of the amino acid residues of it are selected from glycine (G), alanine (A), serine (S), threonine (T), glutamate (E) and proline (P); and (iii) it comprises at least 4 different types of amino acids selected from G, A, S, T, E, and P. The XTEN2 can comprise an amino acid sequence having at least (about) 90%, at least (about) 91%, at least (about) 92%, at least (about) 93%, at least (about) 94%, at least (about) 95%, at least (about) 96%, at least (about) 97%, at least (about) 98%, at least (about) 99%, or 100% sequence identity to a sequence selected from the group of sequences set forth in Tables 2b-2c, or any subset thereof.

In some embodiments, the XTEN (or the XTEN1, or the XTEN2) can (each independently) comprise, or can (each independently) be formed from, a plurality of non-overlapping sequence motifs. At least one of the non-overlapping sequence motifs can be recurring (or repeated at least two times in the corresponding XTEN). At least one of the non-overlapping sequence motifs can be non-recurring (or found only once within the corresponding XTEN). The plurality of non-overlapping sequence motifs can comprise (i) a set of (recurring) non-overlapping sequence motifs, where each motif of the set is repeated at least two times in the corresponding XTEN and (ii) a non-overlapping (non-recurring) sequence motif that occurs (or is found) only once within the corresponding XTEN. Each non-overlapping sequence motif can be from 9 to 14 (or 10 to 14, or 11 to 13) amino acids in length. Each non-overlapping sequence motif can be 12 amino acids in length. The plurality of non-overlapping sequence motifs can comprise a set of non-overlapping (recurring) sequence motifs, where each motif of the set can be (1) repeated at least two times in the corresponding XTEN and (2) between 9 and 14 amino acids in length. The set of (recurring) non-overlapping sequence motifs can comprise 12-mer sequence motifs selected from the group set forth in Table 2a. The set of (recurring) non-overlapping sequence motifs can comprise 12-mer sequence motifs selected from the group set forth in Table 2a. The set of (recurring) non-overlapping sequence motifs can comprise at least two, at least three, or all four of 12-mer sequence motifs of the group set forth in Table 2a.

TABLE 2a

Exemplary 12-mer sequence motifs for construction

of the XTENs

SEQ

ID

Motif Family*
NO:
Amino Acid Sequence

AD
126
GESPGGSSGSES

AD
127
GSEGSSGPGESS

AD
128
GSSESGSSEGGP

AD
129
GSGGEPSESGSS

AE, AM
130
GSPAGSPTSTEE

AE, AM, AQ
131
GSEPATSGSETP

AE, AM, AQ
132
GTSESATPESGP

AE, AM, AQ
133
GTSTEPSEGSAP

AF, AM
134
GSTSESPSGTAP

AF, AM
135
GTSTPESGSASP

AF, AM
136
GTSPSGESSTAP

AF, AM
137
GSTSSTAESPGP

AG, AM
138
GTPGSGTASSSP

AG, AM
139
GSSTPSGATGSP

AG, AM
140
GSSPSASTGTGP

AG, AM
141
GASPGTSSTGSP

AQ
142
GEPAGSPTSTSE

AQ
143
GTGEPSSTPASE

AQ
144
GSGPSTESAPTE

AQ
145
GSETPSGPSETA

AQ
146
GPSETSTSEPGA

AQ
147
GSPSEPTEGTSA

BC
148
GSGASEPTSTEP

BC
149
GSEPATSGTEPS

BC
150
GTSEPSTSEPGA

BC
151
GTSTEPSEPGSA

BD
152
GSTAGSETSTEA

BD
153
GSETATSGSETA

BD
154
GTSESATSESGA

BD
155
GTSTEASEGSAS

*Denotes individual motif sequences that, when used together in various permutations, results in a “family sequence”

TABLE 2b

Exemplary XTEN polypeptides

XTEN
SEQ ID

Name
NO.
Amino Acid Sequence

AE144
156
GSEPATSGSETPGTSESATPESGPGSEPATSGSETPGSPAGSPTSTEEGTST

EPSEGSAPGSEPATSGSETPGSEPATSGSETPGSEPATSGSETPGTSTEPSE

GSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAP

AE144_1A
157
SPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSTE

PSEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGSEPATSGS

ETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPG

AE144_2A
158
TSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSES

ATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEG

SAPGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPG

AE144_2B
159
TSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSES

ATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEG

SAPGTSTEPSEGSAPGTS ESATPESGPGTSESATPESGPG

AE144_3A
160
SPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTE

PSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEG

SAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPG

AE144_3B
161
SPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTE

PSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEG

SAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPG

AE144_4A
162
TSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSES

ATPESGPGTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTS

TEEGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPG

AE144_4B
163
TSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSES

ATPESGPGTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTS

TEEGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPG

AE144_5A
164
TSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSES

ATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGSEPATSGS

ETPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEG

AE144_6B
165
TSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATPESGPGSEPA

TSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEG

SAPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPG

AE288_1
166
GTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSE

SATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGSEPATSG

SETPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPSEGSAP

GTSESATPESGPGTSESATPESGPGTSESATPESGPGSEPATSGSETPGSEP

ATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGSEPATSG

SETPGTSESATPESGPGTSTEPSEGSAP

AE288_2
167
GSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTST

EPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSE

GSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGP

GSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTST

EPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSPT

STEEGTSESATPESGPGTSTEPSEGSAP

AE576
168
GSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTST

EPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGSEPATSG

SETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAP

GSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTST

EPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTEPSE

GSAPGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESGP

GSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTST

EPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPT

STEEGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGP

GSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSPA

GSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEGTSESATPESGPGTSTEPSE

GSAP

AE624
169
MAEPAGSPTSTEEGTPGSGTASSSPGSSTPSGATGSPGASPGTSSTGSPGS

PAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEP

SEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGSEPATSGSE

TPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGS

PAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTSTEP

SEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTEPSEGS

APGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESGPGS

EPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEP

SEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTST

EEGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGS

EPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSPAGS

PTSTEEGSPAGSPTSTEEGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGS

AP

AE864
170
GSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTST

EPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGSEPATSG

SETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAP

GSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTST

EPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTEPSE

GSAPGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESGP

GSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTST

EPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPT

STEEGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGP

GSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSPA

GSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEGTSESATPESGPGTSTEPSE

GSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETP

GTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGSEP

ATSGSETPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPSE

GSAPGTSESATPESGPGTSESATPESGPGTSESATPESGPGSEPATSGSETP

GSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGSEP

ATSGSETPGTSESATPESGPGTSTEPSEGSAP

AE865
171
GGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTS

TEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGSEPATS

GSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSA

PGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTS

TEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTEPS

EGSAPGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESG

PGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTS

TEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSP

TSTEEGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESG

PGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSP

AGSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEGTSESATPESGPGTSTEPS

EGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSET

PGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGSE

PATSGSETPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPS

EGSAPGTSESATPESGPGTSESATPESGPGTSESATPESGPGSEPATSGSET

PGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGSE

PATSGSETPGTSESATPESGPGTSTEPSEGSAP

AE866
172
PGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTS

TEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGSEPATS

GSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSA

PGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTS

TEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTEPS

EGSAPGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESG

PGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTS

TEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSP

TSTEEGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESG

PGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSP

AGSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEGTSESATPESGPGTSTEPS

EGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSET

PGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGSE

PATSGSETPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPS

EGSAPGTSESATPESGPGTSESATPESGPGTSESATPESGPGSEPATSGSET

PGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGSE

PATSGSETPGTSESATPESGPGTSTEPSEGSAPG

AE1152
173
GSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTST

EPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGSEPATSG

SETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAP

GSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTST

EPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTEPSE

GSAPGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESGP

GSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTST

EPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPT

STEEGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGP

GSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSPA

GSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEGTSESATPESGPGTSTEPSE

GSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETP

GTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGSEP

ATSGSETPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPSE

GSAPGTSESATPESGPGTSESATPESGPGTSESATPESGPGSEPATSGSETP

GSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGSEP

ATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSG

SETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAP

GSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSPA

GSPTSTEEGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATP

ESGPGTSESATPESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEE

GTSTEPSEGSAPGTSTEPSEGSAPGSEPATSGSETPGTSESATPESGPGTST

EPSEGSAP

AE144
174
STEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGSEPAT

A

SGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGS

APGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGS

AE144
175
SEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTE

B

PSEGSAPGSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPE

SGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPSEGSAPG

AE180
176
TSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTSTE

A

EGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTS

ESATPESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPS

EGSAPGTSTEPSEGSAPGSEPATS

AE216
177
PESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGSEPATSGSET

A

PGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPSEGSAPGTS

ESATPESGPGTSESATPESGPGTSESATPESGPGSEPATSGSETPGSEPATS

GSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGSEPATSGSET

PGTSESAT

AE252
178
ESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEE

A

GTSESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTSTEEGSPA

GSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATP

ESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAP

GTSTEPSEGSAPGSEPATSGSETPGTSESATPESGPGTSTEPSE

AE288
179
TPESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPES

A

GPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGT

SESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTSTEEGSPAGS

PTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATPES

GPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGT

STEPSEGSAPGSEPATSGSETPGTSESA

AE324
180
PESGPGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEGTSESATPESG

A

PGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSE

PATSGSETPGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSESAT

PESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTE

EGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATPESGPGSE

PATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPS

EGSAPGSEPATS

AE360
181
PESGPGTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTE

A

EGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSE

PATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPS

EGSAPGSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESG

PGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTS

ESATPESGPGTSESATPESGPGSEPATSGSETPGSEPATSGSETPGSPAGSP

TSTEEGTSTEPSEGSAPGTSTEPSEGSAPGSEPATSGSETPGTSESAT

AE396
182
PESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSESATPESG

A

PGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEGTSESATPESGPGTS

TEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATS

GSETPGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSESATPESG

PGSEPATSGSETPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTS

TEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATPESGPGSEPATS

GSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPS

AE432
183
EGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGSEPATSGSET

A

PGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTS

ESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEGTSESAT

PESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESG

PGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTS

ESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTSTEEGSPAGSP

TSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATPESG

PGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTS

TEPSEGSAPGSEPATS

AE468
184
EGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSA

A

PGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTS

ESATPESGPGTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSP

TSTEEGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESG

PGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTS

TEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESAT

PESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESG

PGTSESATPESGPGTSESATPESGPGSEPATSGSETPGSEPATSGSETPGSP

AGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGSEPATSGSETPGTSESAT

AE504
185
EGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSA

A

PGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTS

ESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSESAT

PESGPGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEGTSESATPESG

PGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSE

PATSGSETPGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSESAT

PESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTE

EGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATPESGPGSE

PATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPS

EGSAPGSEPATSGSETPGTSESATPESGPGTSTEPS

AE540
186
TPESGPGSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPES

A

GPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPG

TSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSES

ATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPE

SGPGTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEG

SPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPA

TSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEG

SAPGSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPG

SPAGSPTSTEEGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSES

ATPESGPGTSESATPESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTS

TEEGTSTEPSEGSAPGTSTEP

AE576
187
TPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESGPGSEPATSGSE

A

TPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGT

STEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEP

SEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSE

TPGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGS

PAGSPTSTEEGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESA

TPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPES

GPGTSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPG

TSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPSEGSAPGTSES

ATPESGPGTSESATPESGPGTSESATPESGPGSEPATSGSETPGSEPATSGS

ETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGSEPATSGSETPG

TSESA

AE612
188
GSETPGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTSESATPESG

A

PGSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTS

TEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPS

EGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESG

PGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTS

TEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSP

TSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSET

PGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGSP

AGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSP

TSTEEGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESG

PGTSESATPESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTS

TEPSEGSAPGTSTEPSEGSAPGSEPATSGSETPGTSESAT

AE648
189
PESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSA

A

PGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTS

ESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPS

EGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSA

PGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTS

ESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSESAT

PESGPGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEGTSESATPESG

PGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSE

PATSGSETPGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSESAT

PESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTE

EGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATPESGPGSE

PATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPS

EGSAPGSEPATSGSETPGTSESAT

AE684
190
EGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESG

A

PGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTS

TEPSEGSAPGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESAT

PESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSA

PGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSP

AGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESAT

PESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSESATPESG

PGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEGTSESATPESGPGTS

TEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATS

GSETPGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSESATPESG

PGSEPATSGSETPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTS

TEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATPESGPGSEPATS

GSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSA

PGSEPATS

AE720
191
TSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEG

A

SAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPG

TSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTE

PSEGSAPGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGSPAGSPTS

TEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPG

TSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTE

PSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGSEPATSGS

ETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPG

TSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEGTSES

ATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPE

SGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEG

TSESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTSTEEGSPAG

SPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATPE

SGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTE

AE756
192
TSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEG

A

SAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPG

TSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTE

PSEGSAPGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGSPAGSPTS

TEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPG

TSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTE

PSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGSEPATSGS

ETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPG

TSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEGTSES

ATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPE

SGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEG

TSESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTSTEEGSPAG

SPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATPE

SGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPG

TSTEPSEGSAPGSEPATSGSETPGTSES

AE792
193
EGSAPGTSESATPESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTE

A

EGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTS

TEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTSTEPSEGSAPGTSESAT

PESGPGSEPATSGSETPGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESG

PGTSESATPESGPGSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTS

ESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPS

EGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSA

PGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTS

ESATPESGPGTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSP

TSTEEGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESG

PGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTS

TEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESAT

PESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESG

PGTSESATPESGPGTSESATPESGPGSEPATSGSETPGSEPATSGSETPGSP

AGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGSEPATSGSETPGTSESAT

PESGPGTSTEPS

AE828
194
PESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSA

A

PGTSESATPESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTS

ESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPS

EGSAPGTSTEPSEGSAPGTSESATPESGPGTSTEPSEGSAPGTSESATPESG

PGSEPATSGSETPGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTS

ESATPESGPGSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESAT

PESGPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSA

PGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTS

ESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESAT

PESGPGTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTE

EGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSE

PATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPS

EGSAPGSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESG

PGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTS

ESATPESGPGTSESATPESGPGSEPATSGSETPGSEPATSGSETPGSPAGSP

TSTEEGTSTEPSEGSAPGTSTEPSEGSAPGSEPATSGSETPGTSESAT

AE869
195
GSPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEG

TSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGSEPA

TSGSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSTEPSEG

SAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPG

TSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTE

PSEGSAPGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPE

SGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPG

TSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAG

SPTSTEEGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPE

SGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPG

SPAGSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEGTSESATPESGPGTSTE

PSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGS

ETPGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPG

SEPATSGSETPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTE

PSEGSAPGTSESATPESGPGTSESATPESGPGTSESATPESGPGSEPATSGS

ETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPG

SEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGR

AE144_R1
196
SAGSPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTE

EGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGSE

PATSGSETPGSPAGSPTSTEEGTSESATPESGPGTESASR

AE288_R1
197
SAGSPTGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATP

ESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGSEPATSGSETP

GTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPSEGSAPGTSE

SATPESGPGTSESATPESGPGTSESATPESGPGSEPATSGSETPGSEPATSG

SETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGSEPATSGSETP

GTSESATPESGPGTSTEPSEGSAPSASR

AE432_R1
198
SAGSPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTE

EGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGSE

PATSGSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSTEPS

EGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESG

PGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTS

TEPSEGSAPGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESAT

PESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSA

PGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSP

AGSPTSTEEGTESASR

AE576_R1
199
SAGSPTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSE

GSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAP

GTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGSEP

ATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSE

GSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEE

GTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSE

SATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGSPAGSPT

STEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTSTEE

GSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSE

SATPESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSE

GSAPGTSTEPSEGSAPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAP

SASR

AE864_R1
200
SAGSPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTE

EGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGSE

PATSGSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSTEPS

EGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESG

PGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTS

TEPSEGSAPGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESAT

PESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSA

PGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSP

AGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESAT

PESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSESATPESG

PGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEGTSESATPESGPGTS

TEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATS

GSETPGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSESATPESG

PGSEPATSGSETPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTS

TEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATPESGPGSEPATS

GSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSA

PGSEPATSGSETPGTSESATPESGPGTESASR

AE712
201
PGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTS

TEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGSEPATS

GSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSA

PGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTS

TEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTEPS

EGSAPGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESG

PGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTS

TEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSP

TSTEEGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESG

PGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSP

AGSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEGTSESATPESGPGTSTEPS

EGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSET

PGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGSE

PATSGSETPGTSESATPESGPGSPAGSPTSTEAHHH

AE864_R2
202
GSPGAGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTE

EGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGSE

PATSGSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSTEPS

EGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESG

PGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTS

TEPSEGSAPGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESAT

PESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSA

PGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSP

AGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESAT

PESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSESATPESG

PGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEGTSESATPESGPGTS

TEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATS

GSETPGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSESATPESG

PGSEPATSGSETPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTS

TEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATPESGPGSEPATS

GSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSA

PGSEPATSGSETPGTSESATPESGPGTESASR

AE288_3
203
SPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPA

TSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEG

SAPGSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPG

SPAGSPTSTEEGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSES

ATPESGPGTSESATPESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTS

TEEGTSTEPSEGSAPGTSTEPSEGSAPG

AE284
204
GTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSE

SATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGSEPATSG

SETPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPSEGSAP

GTSESATPESGPGTSESATPESGPGTSESATPESGPGSEPATSGSETPGSEP

ATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGSEPATSG

SETPGTSESATPESGPGTSTEPSE

AE292
205
SPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPA

TSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEG

SAPGSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPG

SPAGSPTSTEEGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSES

ATPESGPGTSESATPESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTS

TEEGTSTEPSEGSAPGTSTEPSEGSAPGGSAP

AE293
206
PGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTS

TEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGSEPATS

GSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSA

PGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTS

TEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTEPS

EGSAPGTSESATPESGPGTSESATPEGAAEPEA

AE300
207
PGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTS

TEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGSEPATS

GSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSA

PGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTS

TEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTEPS

EGSAPGTSESATPESGPGTSESATPESGPGSPAGAAEPEA

AE864_2
208
AGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPS

EGSAPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGSEPATSGSET

PGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGSP

AGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTSTEPS

EGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTEPSEGSA

PGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESGPGSE

PATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPS

EGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTE

EGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSE

PATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSPAGSP

TSTEEGSPAGSPTSTEEGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSA

PGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTS

ESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGSEPATS

GSETPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPSEGSA

PGTSESATPESGPGTSESATPESGPGTSESATPESGPGSEPATSGSETPGSE

PATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGSEPATS

GSETPGTSESATPESGPGTSTEPSEGAAEPEA

AE867
209
GSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTST

EPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGSEPATSG

SETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAP

GSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTST

EPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTEPSE

GSAPGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESGP

GSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTST

EPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPT

STEEGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGP

GSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSPA

GSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEGTSESATPESGPGTSTEPSE

GSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETP

GTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGSEP

ATSGSETPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPSE

GSAPGTSESATPESGPGTSESATPESGPGTSESATPESGPGSEPATSGSETP

GSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGSEP

ATSGSETPGTSESATPESGPGTSTEPSEGAAEPEA

AE867_2
210
SPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGT

STEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGSEPAT

SGSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGS

APGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGT

STEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTEP

SEGSAPGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPES

GPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPG

TSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAG

SPTSTEEGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPE

SGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPG

SPAGSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEGTSESATPESGPGTSTE

PSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGS

ETPGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPG

SEPATSGSETPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTE

PSEGSAPGTSESATPESGPGTSESATPESGPGTSESATPESGPGSEPATSGS

ETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPG

SEPATSGSETPGTSESATPESGPGTSTEPSEGSAPG

AE868
211
PGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTS

TEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGSEPATS

GSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSA

PGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTS

TEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTEPS

EGSAPGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESG

PGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTS

TEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSP

TSTEEGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESG

PGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSP

AGSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEGTSESATPESGPGTSTEPS

EGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSET

PGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGSE

PATSGSETPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPS

EGSAPGTSESATPESGPGTSESATPESGPGTSESATPESGPGSEPATSGSET

PGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGSE

PATSGSETPGTSESATPESGPGTSTEPSEGAAEPEA

AE584
212
PGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTS

TEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGSEPATS

GSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSA

PGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTS

TEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTEPS

EGSAPGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESG

PGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTS

TEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSP

TSTEEGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESG

PGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSP

AGSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEGTSESATPESGPGTSTEPS

EGSAPGAAEPEA

TABLE 2c

Exemplary XTEN polypeptides

Exemplary
SEQ ID

Use
NO.
Amino Acid Sequence

C-terminal
213
PGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEE

XTEN

GTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPG

SEPATSGSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGT

STEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTS

ESATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTST

EPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGSPAG

SPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEP

SEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPS

EGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSESATP

ESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPE

SGPGTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTST

EEGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESG

PGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGP

GTSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPG

TSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPSEGSAPGT

SESATPESGPGTSESATPESGPGftabTSESATPESGPGSEPATSGPTE

SGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTESTPSEGSAP

GSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGEPEA

C-terminal
214
PGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEE

XTEN

GTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPG

SEPATSGSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGT

STEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTS

ESATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTST

EPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGSPAG

SPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEP

SEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPS

EGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSESATP

ESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPE

SGPGTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTST

EEGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESG

PGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGP

GTSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPG

TSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPSEGSAPGT

SESATPESGPGTSESATPESGPGTSESATPESGPGSEPATSGPTESGSE

PATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGSEP

ATSGSETPGTSESATPESGPGTSTEPSEGSAPGEPEA

C-terminal
215
PGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEE

XTEN

GTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPG

SEPATSGSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGT

STEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTS

ESATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTST

EPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGSPAG

SPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEP

SEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPS

EGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSESATP

ESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPE

SGPGTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTST

EEGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESG

PGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGP

GTSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPG

TSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPSEGSAPGT

SESATPESGPGTSESATPESGPGTSESATPESGPGSEPATSGSETPGSE

PATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTESTPSEGSAPGSEP

ATSGSETPGTSESATPESGPGTSTEPSEGSAPGEPEA

N-terminal
216
ASSPAGSPTSTESGTSESATPESGPGTETEPSEGSAPGTSESATPESGP

XTEN

GSEPATSGSETPGTSESATPESGPGSTPAESGSETPGTSESATPESGPG

TSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGESPATSGSTPEGT

SESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPSEGSAPGTS

ESATPESGPGTSESATPESGPGTSESATPESGPGSEPATSGSETPGSEP

ATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGGSAP

N-terminal
217
ASSPAGSPTSTESGTSESATPESGPGTSTEPSEGSAPGTSESATPESGP

XTEN

GSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPG

TSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGESPATSGSTPEGT

SESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPSEGSAPGTS

ESATPESGPGTSESATPESGPGTSESATPESGPGSEPATSGSETPGSEP

ATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGGSAP

N-terminal
218
ASSPAGSPTSTESGTSESATPESGPGTSTEPSEGSAPGTSESATPESGP

XTEN

GSEPATSGSETPGTSESATPESGPGSTPAESGSETPGTSESATPESGPG

TSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGEEPATSGSTPEGT

SESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPSEGSAPGTS

ESATPESGPGTSESATPESGPGTSESATPESGPGSEPATSGSETPGSEP

ATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGGSAP

N-terminal
219
ASSPAGSPTSTESGTSESATPESGPGTSTEPSEGSAPGTSESATPESGP

XTEN

GSEPATSGSETPGTSESATPESGPGSTPAESGSETPGTSESATPESGPG

TSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGT

SESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPSEGSAPGTS

ESATPESGPGTSESATPESGPGTSESATPESGPGSEPATSGSETPGSEP

ATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGGSAP

C-terminal
220
PGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEE

XTEN

GTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPG

SEPATSGSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGT

STEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTS

ESATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTST

EPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGSPAG

SPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEP

SEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPS

EGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSESATP

ESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPE

SGPGTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTST

EEGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESG

PGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGP

GTSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPG

TSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPSEGSAPGT

SESATPESGPGTSESATPESGPGTSESATPESGPGSEPATSGSETPGSE

PATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTESTPSEGSAPGSEP

ATSGSETPGTSESATPESGPGTSTEPSEGSAPG

C-terminal
221
PGSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGP

XTEN

GTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPG

TSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGT

SESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTS

ESATPESGPGTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPA

GSPTSTEEGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSES

ATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESA

TPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGSEPATS

GSETPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPSE

GSAPGTSESATPESGPGTSESATPESGPGTSESATPESGPGSEPATSGS

ETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTESTPSEGS

APGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPG

N-terminal
222
SAGSPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGSPAGSPT

XTEN

STEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSTPAESGS

ETPGSEPATSGSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGS

APGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSA

PGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSTET

PGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTSESATPESGP

GSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPG

TSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGT

STEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTS

ESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSE

SATPESGPGTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAG

SPTSTEEGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESA

TPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESAT

PESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGSEPATSG

SETPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPSEG

SAPGTSESATPESGPGTSESATPESGPGTSESATPESGPGSEPATSGSE

TPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSA

PGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEE

GTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPG

TSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGSEPATSGSETPGS

PAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGSP

AGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTST

EPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTE

PSEGSAPGTSESATPESGPGTESAS

C-terminal
223
SAGSPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGSPAGSPT

XTEN

STEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPATSGS

ETPGSEPATSGSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGS

APGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSA

PGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETP

GTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPG

SPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGT

STEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTS

TEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSE

SATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSES

ATPESGPGTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGS

PTSTEEGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESAT

PESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATP

ESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGSEPATSGS

ETPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPSEGS

APGTSESATPESGPGTSESATPESGPGTSESATPESGPGSEPATSGSET

PGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAP

GSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEG

TSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGT

STEPSEGSAPGTSESATPESGPGSEPATSGSETPGSEPATSGSTETPGS

PAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGSP

AGSPTSTEEGTSTEPSEGSAPGTATESPEGSAPGTSESATPESGPGTST

EPSEGSAPGTSAESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTST

EPSEGSAPGTSESATPESGPGTESAS

N-terminal
224
GSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEG

XTEN

TSTEPSEGSAPGTSTEPSEGSAPATSESATPESGPGSEPATSGSETPGS

EPATSGSETPGSPAGSPTSTEEGTSESASPESGPGTSTEPSEGSAPGTS

TEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSE

SATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTE

PSEGSAPGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGSPAGS

PTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPS

EGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSE

GSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPE

SGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPES

GPGTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTE

EGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGP

GSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPG

TSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGT

SESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPSEGSAPGTS

ESATPESGPGTSESATPESGPGTSESATPESGPGSEPATSGSETPGSEP

ATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGSEPA

TSGSETPGTSESATPESGPGTSTEPSEGSAP

N-terminal
225
GSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEG

XTEN

TSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSESATSGSETPGS

EPATSGSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTS

TEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSE

SATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTE

PSEGSAPGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGSPAGS

PTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPS

EGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSE

GSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPE

SGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPES

GPGTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTE

EGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGP

GSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPG

TSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGT

SESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPSEGSAPGTS

ESATPESGPGTSESATPESGPGTSESATPESGPGSEPATSGSETPGSEP

ATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGSEPA

TSGSETPGTSESATPESGPGTSTEPSEGSAP

N-terminal
226
SPAGSPTSTESGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGS

XTEN (with

EPATSGSETPGTSESATPESGPGSTPAESGSETPGTSESATPESGPGTS

His-tag)

TEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSE

SATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPSEGSAPGTSES

ATPESGPGTSESATPESGPGTSESATPESGPGSEPATSGSETPGSEPAT

SGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGGSAP

C-terminal
227
PGSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGP

XTEN

GTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPG

TSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGT

SESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTS

ESATPESGPGTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPA

GSPTSTEEGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSES

ATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESA

TPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGSEPATS

GSETPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPSE

GSAPGTSESATPESGPGTSESATPESGPGTSESATPESGPGSEPATSGS

ETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTESTPSEGS

APGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGEPEA

C-terminal
228
TPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESGPGSEPATS

XTEN

GSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSE

GSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTS

TEEGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPES

GPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSESATPESG

PGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEGTSESATPESGP

GTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGPG

SEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGT

SESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTSTEEGSP

AGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSE

SATPESGPGSEPATSGSETPGSESATSGSETPGSPAGSPTSTEEGTSTE

PSEGSAPGTSTEPSEGSAPGSEPATSGSETPGTSESA

C-terminal
229
GTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGSEPATSGSETPG

XTEN

SPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGS

PAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTS

TEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTST

EPSEGSAPGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSES

ATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEP

SEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPS

EGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGSEPATS

GSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSE

GSAPGTSESASPESGPGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSPTS

TEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSE

TPGTSESATPESGPGSEPATSGSETPGTSESATPESGP

C-terminal
230
GSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSTEPSE

XTEN

GSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPE

SGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGS

APGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGSPAGSPTSTE

EGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAP

GTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPG

TSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGS

EPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTS

TEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGSPA

GSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPA

TSGSTETGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEP

SEGSAPGSPAGSPTSTEEGTSESATPESGPGSEPATS

C-terminal
231
EGSAPGTSTEPSEGSAPGTSESATPESGPGTSTEPSEGSAPGTSESATP

XTEN

ESGPGSEPATSGSETPGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPE

SGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESGPGSEPATSGSE

TPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSA

PGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEE

GTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGPG

SEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGS

PAGSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEGTSESATPESGPGTS

TEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEP

ATSGSETPGTSESASPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSES

ATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTSTEEGSPAGS

PTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESAT

N-terminal
232
ASSPAGSPTSTESGTSESATPESGPGTSTEPSEGSAPGTSESATPESGP

GSEPATSGSETPGTSESATPESGPGSTPAESGSETPGTSESATPESGPG

TSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGT

SESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPSEGSAPGTS

ESATPESGPGTSESATPESGPGTSESATPESGPGSEPATSGSETPGSEP

ATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGGSAP

Additional examples of XTEN sequences that can be used according to the present disclosure are disclosed in U.S. Patent Publication Nos. 2010/0239554 A1, 2010/0323956 A1, 2011/0046060 A1, 2011/0046061 A1, 2011/0077199 A1, 2011/0172146 A1, 2018/0244736 A1, 2018/0346952 A1, and 2019/0153115 A1; U.S. Pat. Nos. 8,673,860, 9,371,369, 9,926,351, 9,249,211, and 9,976,166; and International Patent Publication Nos. WO 2010/091122 A1, WO 2010/144502 A2, WO 2010/144508 A1, WO 2011/028228 A1, WO 2011/028229 A1, WO 2011/028344 A2, WO 2014/011819 A2, WO 2015/023891, WO 2016/077505 A2, WO 2017/040344 A2, and WO 2019/126576 A1.

In general, XTEN are polypeptides with non-naturally occurring, substantially non-repetitive sequences having a low degree or no secondary or tertiary structure under physiologic conditions, as well as additional properties described in the paragraphs that follow. XTEN can have at least (about) 100, at least (about) 150, at least (about) 200, at least (about) 300, at least (about) 400, at least (about) 500, at least (about) 600, at least (about) 700, at least (about) 800, at least (about) 900, at least (about) 1,000 amino acids, or a range between any of the foregoing. As used herein, XTEN specifically excludes whole antibodies or antibody fragments (e.g. single-chain antibodies and Fc fragments). XTEN polypeptides have utility as fusion partners in that they serve in various roles, conferring certain desirable properties when linked to a composition comprising, for example, one or more biologically active moieties (such as one described herein). The resulting compositions have enhanced properties, such as enhanced pharmacokinetic, physicochemical, pharmacologic, and improved toxicological and pharmaceutical properties compared to the corresponding one or more biologically active moieties not linked to XTEN, making them useful in the treatment of certain conditions for which the one or more biologically active moieties are known in the art to be used.

The unstructured characteristic and physicochemical properties of the XTEN result, in part, from the overall amino acid composition that is disproportionately limited to 4-6 types of hydrophilic amino acids, the sequence of the amino acids in a quantifiable, substantially non-repetitive design, and from the resulting length of the XTEN polypeptide. In an advantageous feature common to XTEN but uncommon to native polypeptides, the properties of XTEN disclosed herein may not be tied to an absolute primary amino acid sequence, as evidenced by the diversity of the exemplary sequences of Tables 2b-2c that, within varying ranges of length, possess similar properties and confer enhanced properties on the compositions to which they are linked, many of which are documented in the Examples. Indeed, it is specifically contemplated that the compositions of the disclosure not be limited to those XTEN specifically enumerated in Tables 8 or 10, but, rather, the embodiments at least include sequences having at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity, when optimally aligned, to the sequences of Tables 2b-2c as they exhibit the properties of XTEN described herein. It has been established that such XTEN have properties more like non-proteinaceous, hydrophilic polymers (such as polyethylene glycol, or “PEG”) than they do proteins. The XTEN of the present disclosure exhibit one or more of the following advantageous properties: defined and uniform length (for a given sequence), conformational flexibility, reduced or lack of secondary structure, high degree of random coil formation, high degree of aqueous solubility, high degree of protease resistance, low immunogenicity, low binding to mammalian receptors, a defined degree of charge, and increased hydrodynamic (or Stokes) radii; properties that are similar to certain hydrophilic polymers (e.g., polyethylene glycol) that make them particularly useful as fusion partners.

XTEN, as described herein, are designed to behave like denatured peptide sequences under physiological conditions, despite the extended length of the polymer. “Denatured” describes the state of a peptide in solution that is characterized by a large conformational freedom of the peptide backbone. Most peptides and proteins adopt a denatured conformation in the presence of high concentrations of denaturants or at elevated temperature. Peptides in denatured conformation have, for example, characteristic circular dichroism (CD) spectra and are characterized by a lack of long-range interactions as determined by NMR. “Denatured conformation” and “unstructured conformation” are used synonymously herein. In some embodiments, the disclosure provides compositions that comprise XTEN sequences that, under physiologic conditions, resemble denatured sequences that are substantially devoid of secondary structure under physiologic conditions. “Substantially devoid,” as used in this context, means that at least about 80%, or about 90%, or about 95%, or about 97%, or at least about 99% of the XTEN amino acid residues of the XTEN sequence do not contribute to secondary structure, as measured or determined by the methods described herein, including algorithms or spectrophotometric assays.

A variety of well-established methods and assays are known in the art for determining and confirming the physicochemical properties of the subject XTEN and the subject polypeptide compositions into which they are incorporated. Such properties include but are not limited to secondary or tertiary structure, solubility, protein aggregation, stability, absolute and apparent molecular weight, purity and uniformity, melting properties, contamination and water content. The methods to measure such properties include analytical centrifugation, EPR, HPLC-ion exchange, HPLC-size exclusion chromatography (SEC), HPLC-reverse phase, light scattering, capillary electrophoresis, circular dichroism, differential scanning calorimetry, fluorescence, HPLC-ion exchange, HPLC-size exclusion, IR, NMR, Raman spectroscopy, refractometry, and UV/Visible spectroscopy. In particular, secondary structure can be measured spectrophotometrically, e.g., by circular dichroism spectroscopy in the “far-UV” spectral region (190-250 nm). Secondary structure elements, such as alpha-helix and beta-sheet, each give rise to a characteristic shape and magnitude of CD spectra, as does the lack of these structure elements. Secondary structure can also be predicted for a polypeptide sequence via certain computer programs or algorithms, such as the well-known Chou-Fasman algorithm (Chou, P. Y., et al. (1974) Biochemistry, 13: 222-45) and the Garnier-Osguthorpe-Robson algorithm (“GOR IV algorithm”) (Gamier J, Gibrat J F, Robson B. (1996), GOR method for predicting protein secondary structure from amino acid sequence. Methods Enzymol 266:540-553), as described in US Patent Application Publication No. 20030228309A1. For a given sequence, the algorithms can predict whether there exists some or no secondary structure at all, expressed as the total and/or percentage of residues of the sequence that form, for example, alpha-helices or beta-sheets or the percentage of residues of the sequence predicted to result in random coil formation (which lacks secondary structure). Polypeptide sequences can be analyzed using the Chou-Fasman algorithm using sites on the world wide web at, for example, fasta.bioch.virginia.edu/fasta_www2/fasta_www.cgi?rm=misc1 and the GOR IV algorithm at npsa-pbil.ibcp.fr/cgi-bin/npsa_automat.pl?page=npsa_gor4.html (both accessed on Dec. 8, 2017). Random coil can be determined by a variety of methods, including by using intrinsic viscosity measurements, which scale with chain length in a conformation-dependent way (Tanford, C., Kawahara, K. & Lapanje, S. (1966) J. Biol. Chem. 241, 1921-1923), as well as by size-exclusion chromatography (Squire, P. G., Calculation of hydrodynamic parameters of random coil polymers from size exclusion chromatography and comparison with parameters by conventional methods. Journal of Chromatography, 1981, 5,433-442). Additional methods are disclosed in Arnau, et al., Prot Expr and Purif (2006) 48, 1-13.

In some embodiments of the present disclosure, the activatable therapeutic agent is an activatable antibody (AA) composition, where the masking moiety (MM) refers to an amino acid sequence coupled to an antibody or antibody fragment (AB) and positioned such that it reduces the ability of the AB to bind its designated binding target by specifically binding to the antigen-binding domain of the AB (such as the complementarity-determining region(s) (CDR(s)). Such binding can be non-covalent. In some embodiments, the activatable antibody composition can be prevented from binding to the designated binding target by binding the MM to an N- or C-terminus of the activatable antibody composition.

Alternatively, the MM may not specifically bind the AB, but rather interfere with AB-target binding through non-specific interactions such as steric hindrance. For example, the MM may be positioned in the uncleaved activatable antibody composition such that the tertiary or quaternary structure of the activatable antibody allows the MM to mask the AB through charge-based interaction, thereby holding the MM in place to interfere with target access to the AB. The masking moiety (MM) can interfere or/and inhibit binding of the antibody or antibody fragment (AB) to the target allosterically or sterically.

When the antibody or antibody fragment (AB) is modified with a MM and is in the presence of the target, specific binding of the AB to its target can be reduced or inhibited, as compared to the specific binding of the AB, not modified with an MM, to the target. A dissociation constant (K_d) of the AB modified with a MM towards the AB's target can be generally greater than a corresponding K_dof the AB, not modified with a MM, towards the target. Conversely, a binding affinity of the AB modified with a MM towards the target can be generally lower than a binding affinity of the AB, not modified with a MM, towards the target. In some embodiments, the masking moiety (MM) of the activatable antibody composition can have an equilibrium dissociation constant (K_d) for binding to the antibody or a fragment thereof which is greater than the equilibrium dissociation of the antibody or the fragment thereof for binding to its designated binding target (near or at a diseased site in a subject).

When the antibody or antibody fragment (AB) is modified with a release segment (RS) and a masking moiety (MM) and is in the presence of the target but not sufficient protease or protease activity to cleave the RS, specific binding of the modified AB to the target can be generally reduced or inhibited, as compared to the specific binding of the AB modified with a RS and a MM in the presence of the target and sufficient protease or protease activity to cleave the RS. For example, when the modified antibody is an activatable antibody composition and comprises a release segment (RS), the AB can be unmasked upon cleavage of the RS, in the presence of protease, preferably a disease-specific protease. Thus, the MM is one that when the activatable antibody composition is uncleaved provides for masking of the AB from target binding, but does not substantially or significantly interfere or compete for binding of the target to the AB when the activatable antibody composition is in the cleaved conformation. A schematic of an exemplary activatable antibody (AA) composition is provided in FIG. 3. As illustrated, the release segment (RS) is positioned such that in a cleaved (or relatively active state) and in the presence of a target, the antibody or antibody fragment (AB) binds a target, while in an uncleaved (or relatively inactive state) in the presence of the target, specific binding of the AB to its target is reduced or inhibited. The specific binding of the antibody or antibody fragment (AB) to its target can be reduced due to the due to the inhibition or masking of the AB's ability to specifically bind its target by the masking moiety (MM).

In some embodiments of the activatable antibody compositions, where an antibody or antibody fragment (AB) is capable of specifically binding its designated binding target, a coupling of the masking moiety (MM) to the antibody or antibody fragment (AB) can reduce the ability of the AB to bind its designated binding target as compared to the ability of the AB not coupled to the MM to bind the designated binding target (for example, when assayed in vitro using a target displacement assay). Such coupling of the MM to the AB can reduce the ability of the AB to bind its designated binding target for a duration.

The masking moiety (MM) can be provided in a variety of different forms. In certain embodiments, the MM can be selected to be a known binding partner of the antibody or antibody fragment (AB), provided that the MM binds the AB with less affinity and/or avidity than the target protein to which the AB is designed to bind following cleavage of the release segment (RS) so as to reduce interference of MM in target-AB binding Stated differently, as discussed above, the MM is one that masks the AB from target binding when the activatable antibody composition is uncleaved, but does not substantially or significantly interfere or compete for binding for target when the activatable antibody composition is in the cleaved conformation. In a specific embodiment, the AB and MM do not contain the amino acid sequences of a naturally-occurring binding partner pair, such that at least one of the AB and MM does not have the amino acid sequence of a member of a naturally occurring binding partner. The masking moiety (MM) may not comprise more than 50% amino acid sequence identity to a natural binding partner of the antibody or antibody fragment (AB). The masking moiety (MM) can comprise a consensus sequence specific for binding to a class of antibodies against a designated binding target (e.g., diseased target). The MM can be a polypeptide of no more than 40 (e.g., from 2 to 40) amino acids in length. The MM can be coupled to the activatable antibody composition by covalent binding.

In some embodiments, the present disclosure provides for an activatable antibody complex (AAC) composition (as illustrated in FIG. 4) comprising: (1) two antibodies or antibody fragments (AB1 and AB2), each capable of specifically binding its designated binding target, (2) at least one masking moiety (MM) coupled to either AB1 or AB2, capable of inhibiting the specific binding of AB1 and AB2 to their designated binding target(s), and (3) at least one release segment (RS) coupled to either AB1 or AB2, capable of being specifically cleaved by a protease whereby activating the AAC composition. In some embodiments, when the AAC is in an uncleaved state, the MM can inhibit the specific binding of AB1 and AB2 to their designated binding target(s) and when the AAC is in a cleaved state, the MM does not inhibit the specific binding of AB1 and AB2 to their designated binding targets. The two ABs can bind different targets, or different epitopes on the same target.

In some embodiments, the MM does not inhibit cellular entry of the activatable antibody composition.

In some embodiments, the masking moiety (MM) can comprise an anti-albumin domain, such as a single domain antibody (sdAb) anti-albumin domain. In some embodiments, the anti-albumin domain can comprise non-CDR loops, CDR loops, or any combination thereof. In some embodiments, the anti-albumin domain can comprise both non-CDR loops and CDR loops. The non-CDR loops can be capable of binding to one or more antibody or antibody fragment (AB) (for example, and not limited to, the CDRs of the AB) of an activatable antibody (AA) composition, thereby masking the AB (at least in some cases) by inhibiting or reducing the ability of the AB to bind to its designated target(s). The CDR loops can be capable of binding albumin (e.g., human serum albumin), thereby (at least in some cases) masking the AB in the activatable antibody (AA) composition from binding to its designated target(s) via steric or allosteric hindrance and/or conferring half-life extension for the AA composition. In some embodiments, the non-CDR loops can be engineered into different position of the anti-albumin sdAb domain. In some embodiments, the MM can (1) inhibit or reduce the ability of the AB to bind to its designated target(s) via (1a) specific binding to the target recognition region of the AB and/or (1b) steric masking of target recognition region of the AB, and/or the MM can (2) confer half-life extension for the AA containing the AB via binding to albumin. The MM can be coupled (directly or indirectly) to the activatable antibody composition by covalent binding.

As illustrated in the schematic shown in FIG. 5, an exemplary activatable antibody complex (AAC) composition can comprise: (1) at least two antibodies or antibody fragments (AB1 and AB2), each capable of specifically binding their designated binding target(s), (2) at least one masking moiety (MM) coupled to AB1 or AB2, capable of inhibiting the specific binding of AB1 or AB2 to their designated binding target(s), and (3) at least one release segment (RS) coupled to AB1 or AB2, capable of being specifically cleaved by a protease whereby activating the activatable antibody complex (AAC) composition. In some embodiments, when the AA is in an uncleaved state, the MM can inhibit the specific binding of AB1 or AB2 to their designated binding target(s), and when the activatable antibody complex (AAC) composition is in a cleaved state, the MM does not inhibit the specific binding of AB1 or AB2 to their designated binding target(s). In some embodiments, the masking moiety (MM) can be coupled to both AB1 and AB2 via two separate release segments (RS). In other words, the MM can be placed between AB1 and AB2, coupled either to the C end of AB1 and the N end of AB2, or coupled to the N end of AB1 and the C end of AB2.

In some embodiments of the present disclosure, the activatable therapeutic agent is an activatable antibody (AA) composition, where the masking moiety (MM) refers to an amino acid sequence coupled to an antibody or antibody fragment (AB) (for example, but not limited to, an scFv, an sdAb, or a fragment thereof) and positioned such that it reduces the ability of the AB to dimerize with another antibody or antibody fragment, preventing the formation of an antibody or an antibody fragment capable of binding to target. Such binding can be non-covalent. In some embodiments, the activatable antibody composition can be prevented from binding to the designated binding target by binding the MM to an N- or C-terminus of the activatable antibody composition.

When the antibody or antibody fragment (AB) is modified with a MM and is in the presence of the target, specific binding of the AB to its dimerization partner can be reduced or inhibited, as compared to the specific binding of the AB, not modified with an MM, to its dimerization partner. A dissociation constant (K_d) of the AB modified with a MM towards its dimerization partner can be generally greater than a corresponding K_dof the AB, not modified with a MM, towards its dimerization partner. Conversely, a binding affinity of the AB modified with a MM towards its dimerization partner can be generally lower than a binding affinity of the AB, not modified with a MM, towards its dimerization partner. In some embodiments, the masking moiety (MM) of the activatable antibody composition can have an equilibrium dissociation constant (K_d) for binding to the antibody or a fragment thereof which is greater than the equilibrium dissociation of the antibody or the fragment thereof for binding to its designated dimerization partner.

When the antibody or antibody fragment (AB) is modified with a release segment (RS) and a masking moiety (MM) and is in the presence of the target but not sufficient protease or protease activity to cleave the RS, specific ability of the modified AB to dimerize with another antibody or antibody fragment and the resulting ability of the dimer to bind to its designated binding target(s) can be generally reduced or inhibited, as compared to the specific dimerization ability of the AB modified with a RS and a MM and the subsequent ability of the dimer to bind to its designated binding target(s) in the presence of the target and sufficient protease or protease activity to cleave the RS. For example, when the modified antibody is an activatable antibody composition and comprises a release segment (RS), the AB can be unmasked upon cleavage of the RS, in the presence of protease, preferably a disease-specific protease. Thus, the MM is one that when the activatable antibody composition is uncleaved provides for masking of the AB from dimerization with another AB and for reduction or inhibition of binding of the resulting dimer to its designated binding target(s), but does not substantially or significantly interfere or compete for dimerization to another AB and for reduction or inhibition of binding of the resulting dimer to its designated binding target(s) when the activatable antibody composition is in the cleaved conformation.

The masking moiety can be provided in different forms. In some embodiments, the masking domain can be an inhibitory antibody or antibody fragment (IAB; for example, but not limited to, a VL or VH domain), provided that the MM binds the AB with less affinity and/or avidity than the dimerization partner with which AB is designed to dimerize following cleavage of the release segment (RS) so as to reduce interference of MM in AB-AB dimerization. Stated differently, as discussed above, the MM is one that masks the AB from dimerization to another AB when the activatable antibody composition is uncleaved, but does not substantially or significantly interfere or compete for dimerization with another AB when the activatable antibody composition is in the cleaved conformation. The MM can be coupled to the activatable antibody composition by covalent binding.

In some embodiments, the present disclosure provides for an activatable antibody complex (AAC) composition (as illustrated in FIG. 6) comprising: (1) two antibodies or antibody fragments (AB1 and AB2) (2) two masking moieties (MM) coupled to one each to AB1 and AB2, capable of reducing or inhibiting the specific dimerization of AB1 and AB2 and subsequent binding of AB1-AB2 complex to their designated binding target(s), (3) at least three release segments (RS) coupled to AB1, AB2 and MMs capable of being specifically cleaved by a protease whereby activating the AAC composition, (4) at least one additional antibody or antibody fragment (AB3 and/or AB4; for example, but not limited to, an scFv or an sdAb), coupled to AB1 and/or AB2. In some embodiments, when the AAC is in an uncleaved state, the MM can inhibit or reduce the specific dimerization of AB1 and AB2 and subsequently inhibit or reduce the binding of the resulting AB1-AB2 dimer to its designated binding target(s) and when the AAC is in a cleaved state, the MM does not reduce or inhibit the specific dimerization of AB1 and AB2 and does not reduce or inhibit the subsequent binding of the AB1-AB2 dimer to its designated binding target(s). When more than one additional AB is coupled to AB1 and/or AB2, the additional ABs can bind the same target or different targets.

In some embodiments, the MM can comprise a coiled-coil domain, for example, but not limited to, (1) high affinity parallel heterodimeric leucine zipper coiled-coil domain, containing or devoid of cysteines, (2) low affinity parallel heterodimeric coiled-coil leucine zipper domain, containing or devoid of cysteines, (3) disulfide-linked covalent coiled-coil domain, (4) antiparallel heterodimeric leucine zipper coiled-coil domain, (5) helix-turn-helix homodimeric leucine zipper coiled coil domain. The MM can be coupled (directly or indirectly) to the activatable antibody composition by covalent binding. In some embodiments, the MM can reduce or inhibit the binding of AB to its intended target(s) via steric or allosteric hindrance.

In some embodiments, the present disclosure provides for an activatable antibody complex (AAC) composition (as illustrated in FIG. 7) comprising: (1) at least one antibody or antibody fragment (AB), (2) at least one masking moiety (MM) coupled to AB, capable of inhibiting the specific binding of AB to its designated binding target, and (3) at least one release segment (RS) coupled to AB, capable of being specifically cleaved by a protease whereby activating the AAC composition. In some embodiments, when the AAC is in an uncleaved state, the MM can reduce or inhibit the specific binding of AB to its designated binding target(s) and when the AAC is in a cleaved state, the MM does not reduce or inhibit the specific binding of AB to its designated binding target(s).

In some embodiments, the activatable therapeutic agent may incorporate a cleavage sequence as described herein, and/or be administered to a patient who is identified as being a likely responder to the therapeutic agent based on the identification of a peptide biomarker in a biological sample from the subject (as described further herein).

Biologically Active Moieties (BM)

In some embodiments of the therapeutic agent (or the activatable therapeutic agent, or the non-natural, activatable therapeutic agent), the biologically active moiety (BM) can comprise a biologically active peptide (BP). The biologically active peptide (BP) can comprise an antibody, a cytokine, a cell receptor, or a fragment thereof. The biologically active polypeptide (BP) can comprise a binding moiety having a binding affinity for a target cell marker on a target tissue or cell. The target cell marker can be an effector cell antigen expressed on a surface of an effector cell. The binding moiety can be an antibody. The antibody can be selected from the group consisting of Fv, Fab, Fab′, Fab′-SH, nanobody (also known as single domain antibody or V_HH), linear antibody, and single-chain variable fragment (scFv).

In some embodiments of the therapeutic agent (or the activatable therapeutic agent, or the non-natural, activatable therapeutic agent), where the binding moiety can be a first binding moiety, and wherein the target cell marker can be a first target cell marker, the biologically active polypeptide (BP) can further comprise a second binding moiety linked, directly or indirectly to the first binding moiety. The second binding moiety can have a binding affinity for a second target cell marker on the target tissue or cell. The second target cell marker can be a marker on a tumor cell or a cancer cell. The second binding moiety can be an antibody. The second binding moiety can be an antibody selected from the group consisting of Fv, Fab, Fab′, Fab′-SH, nanobody (also known as single domain antibody or V_HH), linear antibody, and single-chain variable fragment (scFv).

In some embodiments as disclosed herein, a biologically active moiety (BM) or a biologically active peptide (BP) can exhibit a binding specificity to a given target (or a given number of targets) or/and another desired biological characteristic, when used in vivo or when utilized in an in vitro assay. For example, the BM or BP can be an agonist, a receptor, a ligand, an antagonist, an enzyme, an antibody (e.g., mono- or bi-specific), or a hormone. Of particular interest are BM or BP used, or known to be useful, for a disease or disorder where the native BM or BP have a relatively short terminal half-life and for which an enhancement of a pharmacokinetic parameter (which optionally could be released from a conjugate or a fusion polypeptide by cleavage of a spacer sequence) would permit less frequent dosing or an enhanced pharmacologic effect. Also of interest are BM or BP that have a relatively narrow therapeutic window between the minimum effective dose or blood concentration (C_min) and the maximum tolerated dose or blood concentration (C_max). In such cases, the linking of the BM or BP within a conjugate or a fusion polypeptide comprising a select masking moiety, such as XTEN, can result in an improvement in these properties, making them more useful as therapeutic or preventive agents compared to the BM or BP not linked to a masking moiety, such as XTEN. The BM or BP encompassed by the inventive compositions described herein can have utility in the treatment in various therapeutic or disease categories, including but not limited to glucose and insulin disorders, metabolic disorders, cardiovascular diseases, coagulation and bleeding disorders, growth disorders or conditions, endocrine disorders, eye diseases, kidney diseases, liver diseases, tumorigenic conditions, inflammatory conditions, autoimmune conditions, etc.

In some embodiments of the compositions disclosed herein, where the biologically active moiety is a biologically active peptide (BP), the BP can comprise a peptide sequence that exhibits at least (about) 80% sequence identity (e.g., at least (about) 81%, at least (about) 82%, at least (about) 83%, at least (about) 84%, at least (about) 85%, at least (about) 86%, at least (about) 87%, at least (about) 88%, at least (about) 89%, at least (about) 90%, at least (about) 91%, at least (about) 92%, at least (about) 93%, at least (about) 94%, at least (about) 95%, at least (about) 96%, at least (about) 97%, at least (about) 98%, at least (about) 99%, or 100% sequence identity to an amino acid sequence of a glucose regulating peptide or a glucagon-like peptide (native or synthetic analog) set forth in Tables 3a-3c (such as one described more fully hereinbelow in the GLUCOSE REGULATING PEPTIDES section), or to an amino acid sequence of a protein relating to metabolic disorders and cardiology set forth in Table 3d (such as one described more fully hereinbelow in the METABOLIC DISEASE AND CARDIOVASCULAR PROTEINS section), or to an amino acid sequence of a growth hormone set forth in Table 3f (such as one described more fully hereinbelow in the GROWTH HORMONE PROTEINS section), or to an amino acid sequence of a cytokine set forth in Table 3g (such as one described more fully hereinbelow in the CYTOKINES section), or to an amino acid sequence of a transduction domain in Table 3h (such as one described more fully hereinbelow). In some embodiments of the compositions of this disclosure, the sequence of the BP can comprise one or more substitutions shown in Table 4 (such as one described more fully hereinbelow).

In some embodiments of the compositions disclosed herein, where the biologically active moiety is a biologically active peptide (BP), the BP can comprise an antibody (e.g., a monospecific, bispecific, trispecific, or multispecific antibody) (as defined hereinabove, the term “antibody” includes, among other things, an antibody fragment) (such as one described more fully hereinbelow in the ANTIBODIES section). The antibody can comprise a binding domain (or binding moiety) having binding affinity for an effector cell antigen. The effector cell antigen can be expressed on the surface of an effector cell selected from a plasma cell, a T cell, a B cell, a cytokine induced killer cell (CIK cell), a mast cell, a dendritic cell, a regulatory T cell (RegT cell), a helper T cell, a myeloid cell, and a NK cell. The effector cell antigen can be expressed on or within an effector cell. The effector cell antigen can be expressed on a T cell, such as a CD4+, CD8+, or natural killer (NK) cell. The effector cell antigen can be expressed on the surface of a T cell. The effector cell antigen can be expressed on a B cell, master cell, dendritic cell, or myeloid cell. The binding domain (or binding moiety) can comprise VH and VL regions derived from a monoclonal antibody capable of binding human CD3. In some embodiments, where the binding domain (or binding moiety) having binding affinity for CD3, the binding domain (or binding moiety) can have binding affinity for a member of the CD3 complex, which includes in individual form or independently combined form all known CD3 subunits of the CD3 complex; for example, CD3 epsilon, CD3 delta, CD3 gamma, CD3 zeta, CD3 alpha and CD3 beta. The binding domain (or binding moiety) having binding affinity for CD3 can have binding affinity for CD3 epsilon, CD3 delta, CD3 gamma, CD3 zeta, CD3 alpha or CD3 beta. In some embodiments of the compositions of this disclosure, the binding domain (or binding moiety) binding human CD3 can be derived from an anti-CD3 antibody selected from the group of antibodies set forth in Tables 5a-5e. The binding domain (or binding moiety) binding human CD3 can comprise VH and VL regions, where each VH and VL regions exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99%, or 100% sequence identity to paired VL and VH sequences of an anti-CD3 antibody selected from those set forth in Table 5a or Table 5d. The binding domain (or binding moiety) binding human CD3 can comprise VH and VL regions, where each VH and VL regions exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99%, or 100% sequence identity to paired VL and VH sequences of the huUCHT1 anti-CD3 antibody of Table 5a. The binding domain (or binding moiety) binding human CD3 can comprise a CDR-H1 region, a CDR-H2 region, a CDR-H3 region, a CDR-L1 region, a CDR-L2 region, and a CDR-H3 region, wherein each of the regions can be derived from a monoclonal antibody selected from the group of antibodies set forth in Tables 5a-5b or Table 5d. The binding domain (or binding moiety) binding human CD3 can comprise FRs each independently exhibiting at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99%, or 100% sequence identity to a corresponding FR set forth in Table 5c. The binding domain (or binding moiety) binding human CD3 can comprise a single-chain variable fragment (scFv) sequence exhibiting at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99%, or 100% sequence identity to an anti-CD3 scFv sequence set forth in Table 5e. In the foregoing embodiments, the VH and/or VL domains can be configured as scFv, diabodies, a single domain antibody, or a single domain camelid antibody. The antibody can comprise a binding domain (or binding moiety) having specific binding affinity to a tumor-specific marker or an antigen of a target cell (or a target antigen). The tumor-specific marker or the antigen of the target cell can be selected from the group consisting of alpha 4 integrin, Ang2, B7-H3, B7-H6 (e.g., its natural ligand Nkp30 rather than an antibody fragment), CEACAM5, cMET, CTLA4, FOLR1, EpCAM (epithelial cell adhesion molecule), CCR5, CD19, HER2, HER2 neu, HER3, HER4, HER1 (EGFR), PD-L1, PSMA, CEA, TROP-2, MUC1(mucin), MUC-2, MUC3, MUC4, MUC5AC, MUC5B, MUC7, MUC16, βhCG, Lewis-Y, CD20, CD33, CD38, CD30, CD56 (NCAM), CD133, ganglioside GD3, 9-O-acetyl-GD3, GM2, Globo H, fucosyl GM1, GD2, carbonicanhydrase IX, CD44v6, Nectin-4, Sonic Hedgehog (Shh), Wue-1, plasma cell antigen 1 (PC-1), melanoma chondroitin sulfate proteoglycan (MCSP), CCR8, 6-transmembrane epithelial antigen of prostate (STEAP), mesothelin, A33 antigen, prostate stem cell antigen (PSCA), Ly-6, desmoglein 4, fetal acetylcholine receptor (fnAChR), CD25, cancer antigen 19-9 (CA19-9), cancer antigen 125 (CA-125), Muellerian inhibitory substance receptor type II (MISIIR), sialylated Tn antigen (sTN), fibroblast activation antigen (FAP), endosialin (CD248), epidermal growth factor receptor variant III (EGFRvIII), tumor-associated antigen L6 (TAL6), SAS, CD63, TAG72, Thomsen-Friedenreich antigen (TF-antigen), insulin-like growth factor I receptor (IGF-IR), Cora antigen, CD7, CD22, CD70 (e.g., its natural ligand, CD27 rather than an antibody fragment), CD79a, CD79b, G250, MT-MMPs, fibroblast activation antigen (FAP), alpha-fetoprotein (AFP), VEGFR1, VEGFR2, DLK1, SP17, ROR1, EphA2, ENPP3, glypican 3 (GPC3), and TPBG/5T4 (trophoblast glycoprotein). The tumor-specific marker or the antigen of the target cell can be selected from alpha 4 integrin, Ang2, CEACAM5, cMET, CTLA4, FOLR1, EpCAM (epithelial cell adhesion molecule), CD19, HER2, HER2 neu, HER3, HER4, HER1 (EGFR), PD-L1, PSMA, CEA, TROP-2, MUC1(mucin), Lewis-Y, CD20, CD33, CD38, mesothelin, CD70 (e.g., its natural ligand, CD27 rather than an antibody fragment), VEGFR1, VEGFR2, ROR1, EphA2, ENPP3, glypican 3 (GPC3), and TPBG/5T4 (trophoblast glycoprotein). The tumor-specific marker or the antigen of the target cell can be any one set forth in the “Target” column of Table 6. The binding domain (or binding moiety) with binding affinity to the tumor-specific marker or the target cell antigen can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99%, or 100%, sequence identity to any one of the paired VL and VH sequences set forth in the “VH Sequences” and “VL Sequences” columns of Table 6. Without limiting the scope, additional exemplary tumor antigen target(s) can be selected from the group consisting of: FGFR2, LIV1, TRK, RET, BCMA, CD71, CD166, SSTR2, cKIT, VISTA, GPNMB, DLL3, CD123, LAMP1, P-Cadherin, Ephrin-A4, PTK7, NaPi2b, GCC, C4.4a, Mucin 17, FLT3, NKG2D ligands, SLAMF7, IL13a2R, CLL-1/CLEC12A, CD66e, IL3Ra, CD5, ULBP1, B7H4, CSPG4, SDC1, IL1RAP, Survivin, CD138, CD74, TIM1, SLITRK6, CD37, CD142, AXL, ETBR, Cadherin 6, FGFR3, CA6, CanAg (novel glycophorm of Muc 1), Integrin alpha V, Cripto 1 (TDGF1), CD352, and NOTCH3.

The bioactivity of the BP embodiments described herein can be evaluated by using assays or measured/determined parameters as described herein, and those sequences that retain at least (about) 40%, or at least (about) 50%, or at least (about) 55%, or at least (about) 60%, or at least (about) 70%, or at least (about) 80%, or at least (about) 90%, or at least (about) 95% or more activity compared to the corresponding native BP sequence would be considered suitable for inclusion in the compositions of this disclosure.

Glucose Regulating Peptides

Endocrine and obesity-related diseases or disorders have reached epidemic proportions in most developed nations, and represent a substantial and increasing health care burden in most developed nations, which include a large variety of conditions affecting the organs, tissues, and circulatory system of the body. Of particular concern are endocrine and obesity-related diseases and disorders, which. Chief amongst these is diabetes; one of the leading causes of death in the United States. Diabetes is divided into two major sub-classes-Type I, also known as juvenile diabetes, or Insulin-Dependent Diabetes Mellitus (IDDM), and Type II, also known as adult onset diabetes, or Non-Insulin-Dependent Diabetes Mellitus (NIDDM). Type I Diabetes is a form of autoimmune disease that completely or partially destroys the insulin producing cells of the pancreas in such subjects, and requires use of exogenous insulin during their lifetime. Even in well-managed subjects, episodic complications can occur, some of which are life-threatening.

In Type II diabetics, rising blood glucose levels after meals do not properly stimulate insulin production by the pancreas. Additionally, peripheral tissues are generally resistant to the effects of insulin, and such subjects often have higher than normal plasma insulin levels (hyperinsulinemia) as the body attempts to overcome its insulin resistance. In advanced disease states insulin secretion is also impaired.

Insulin resistance and hyperinsulinemia have also been linked with two other metabolic disorders that pose considerable health risks: impaired glucose tolerance and metabolic obesity. Impaired glucose tolerance is characterized by normal glucose levels before eating, with a tendency toward elevated levels (hyperglycemia) following a meal. These individuals are considered to be at higher risk for diabetes and coronary artery disease. Obesity is also a risk factor for the group of conditions called insulin resistance syndrome, or “Syndrome X,” as is hypertension, coronary artery disease (arteriosclerosis), and lactic acidosis, as well as related disease states. The pathogenesis of obesity is believed to be multifactorial but an underlying problem is that in the obese, nutrient availability and energy expenditure are not in balance until there is excess adipose tissue. Other related diseases or disorders include, but are not limited to, gestational diabetes, juvenile diabetes, obesity, excessive appetite, insufficient satiety, metabolic disorder, glucagonomas, retinal neurodegenerative processes, and the “honeymoon period” of Type I diabetes.

Dyslipidemia is a frequent occurrence among diabetics; typically characterized by elevated plasma triglycerides, low HDL (high density lipoprotein) cholesterol, normal to elevated levels of LDL (low density lipoprotein) cholesterol and increased levels of small dense, LDL particles in the blood. Dyslipidemia is a main contributor to an increased incidence of coronary events and deaths among diabetic subjects.

Most metabolic processes in glucose homeostatis and insulin response are regulated by multiple peptides and hormones, and many such peptides and hormones, as well as analogues thereof, have found utility in the treatment of metabolic diseases and disorders. Many of these peptides tend to be highly homologous to each other, even when they possess opposite biological functions. Glucose-increasing peptides are exemplified by the peptide hormone glucagon, while glucose-lowering peptides include exendin-4, glucagon-like peptide 1, and amylin. However, the use of therapeutic peptides and/or hormones, even when augmented by the use of small molecule drugs, has met with limited success in the management of such diseases and disorders. In particular, dose optimization is important for drugs and biologics used in the treatment of metabolic diseases, especially those with a narrow therapeutic window. Hormones in general, and peptides involved in glucose homeostasis often have a narrow therapeutic window. The narrow therapeutic window, coupled with the fact that such hormones and peptides typically have a short half-life, which necessitates frequent dosing in order to achieve clinical benefit, results in difficulties in the management of such patients. While chemical modifications to a therapeutic protein, such as pegylation, can modify its in vivo clearance rate and subsequent serum half-life, it requires additional manufacturing steps and results in a heterogeneous final product. In addition, unacceptable side effects from chronic administration have been reported. Alternatively, genetic modification by fusion of an Fc domain to the therapeutic protein or peptide increases the size of the therapeutic protein, reducing the rate of clearance through the kidney, and promotes recycling from lysosomes by the FcRn receptor. Unfortunately, the Fc domain does not fold efficiently during recombinant expression and tends to form insoluble precipitates known as inclusion bodies. These inclusion bodies must be solubilized and functional protein must be renatured; a time-consuming, inefficient, and expensive process.

In some embodiments of the compositions of this disclosure, the biologically active peptide (BP) can comprise peptides involved in glucose homoestasis, insulin resistance and obesity (collectively, “glucose regulating peptides”), which compositions have utility in the treatment of glucose, insulin, and obesity disorders, disease and related conditions. Glucose regulating peptides can include any protein of biologic, therapeutic, or prophylactic interest or function that is useful for preventing, treating, mediating, or ameliorating a disease, disorder or condition of glucose homeostasis or insulin resistance or obesity. Suitable glucose-regulating peptides that can be linked to a masking moiety (such as XTEN) can include all biologically active polypeptides that increase glucose-dependent secretion of insulin by pancreatic beta-cells or potentiate the action of insulin. Glucose-regulating peptides can also include all biologically active polypeptides that stimulate pro-insulin gene transcription in the pancreatic beta-cells. Furthermore, glucose-regulating peptides can also include all biologically active polypeptides that slow down gastric emptying time and reduce food intake. Glucose-regulating peptides can also include all biologically active polypeptides that inhibit glucagon release from the alpha cells of the Islets of Langerhans. Table 3a provides a non-limiting list of sequences of glucose regulating peptides that can be encompassed by the compositions of this disclosure. In some embodiments of the compositions disclosed herein, where the biologically active moiety can be a biologically active peptide (BP), the BP can comprise a peptide sequence that exhibits at least (about) 80% sequence identity (e.g., at least (about) 81%, at least (about) 82%, at least (about) 83%, at least (about) 84%, at least (about) 85%, at least (about) 86%, at least (about) 87%, at least (about) 88%, at least (about) 89%, at least (about) 90%, at least (about) 91%, at least (about) 92%, at least (about) 93%, at least (about) 94%, at least (about) 95%, at least (about) 96%, at least (about) 97%, at least (about) 98%, at least (about) 99%, or 100% sequence identity) to an amino acid sequence of a glucose regulating peptide set forth in Table 3a.

TABLE 3a

Glucose-Regulating Peptides

Name of Protein
SEQ ID

(Synonym)
NO:
Amino Acid Sequence

Adrenomedullin (ADM)
233
YRQSMNNFQGLRSFGCRFGTCTVQKLAHQIYQFTDKDKD

NVAPRSKISPQGY

Amylin, rat
234
KCNTATCATQRLANFLVRSSNNLGPVLPPTNVGSNTY

Amylin, human
235
KCNTATCATQRLANFLVHSSNNFGAILSSTNVGSNTY

Calcitonin (hCT)
236
CGNLSTCMLGTYTQDFNKFHTFPQTAIGVGAP

Calcitonin, salmon
237
CSNLSTCVLGKLSQELHKLQTYPRTNTGSGTP

Calcitonin gene related peptide
238
ACDTATCVTHRLAGLLSRSGGVVKNMVPTNVGSKAF

(h-CGRP α)

Calcitonin gene related peptide
239
ACNTATCVTHRLAGLLSRSGGMVKSNFVPTNVGSKAF

(h-CGRP β)

cholecystokinin (CCK)
240
MNSGVCLCVLMAVLAAGALTQPVPPADPAGSGLQRAEE

APRRQLRVSQRTDGESRAHLGALLARYIQQARKAPSGRM

SIVKNLQNLDPSHRISDRDYMGWMDFGRRSAEEYEYPS

CCK-33
241
KAPSGRMSIVKNLQNLDPSHRISDRDYMGWMDF

CCK-8
242
DYMGWMDF

Exendin-3
243
HSDGTFTSDLSKQMEEEAVRLFIEWLKNGGPSSGAPPPS

Exendin-4
244
HGEGTFTSDLSKQMEEEAVR LFIEWLKNGGPSSGAPPPS

FGF-19
245
MRSGCVVVHVWILAGLWLAVAGRPLAFSDAGPHVHYG

WGDPIRLRHLYTSGPHGLSSCFLRIRADGVVDCARGQSAH

SLLEIKAVALRTVAIKGVHSVRYLCMGADGKMQGLLQYS

EEDCAFEEEIRPDGYNVYRSEKHRLPVSLSSAKQRQLYKN

RGFLPLSHFLPMLPMVPEEPEDLRGHLESDMFSSPLETDSM

DPFGLVTGLEAVRSPSFEK

FGF-21
246
MDSDETGFEHSGLWVSVLAGLLLGACQAHPIPDSSPLLQF

GGQVRQRYLYTDDAQQTEAHLEIREDGTVGGAADQSPES

LLQLKALKPGVIQILGVKTSRFLCQRPDGALYGSLHFDPEA

CSFRELLLEDGYNVYQSEAHGLPLHLPGNKSPHRDPAPRG

PARFLPLPGLPPALPEPPGILAPQPPDVGSSDPLSMVGPSQG

RSPSYAS

Gastrin
247
QLGPQGPPHLVADPSKKQGPWLEEEEEAYGWMDF

Gastrin-17
248
DPSKKQGPWLEEEEEAYGWMDF

Gastric inhibitory polypeptide
249
YAEGTFISDYSIAMDKIHQQDFVNWLLAQKGKKNDWKH

(GIP)

NITQ

Ghrelin
250
GSSFLSPEHQRVQQRKESKKPPAKLQPR

Glucagon
251
HSQGTFTSDYSKYLDSRRAQDFVQWLMNT

Glucagon-like peptide-1
252
HDEFERHAEGTFTSDVSSTLEGQAALEFIAWLVKGRG

(hGLP-1) (GLP-1; 1-37)

GLP-1 (7-36), human
253
HAEGTFTSDVSSYLEGQAALEFIAWLVKGR

GLP-1 (7-37), human
254
HAEGTFTSDVSSTLEGQAALEFIAWLVKGRG

GLP-1, frog
255
HAEGTYTNDVTEYLEEKAAKEFIEWLIKGKPKKIRYS

Glucagon-like peptide 2 (GLP-
256
HADGSFSDEMNTILDNLAARDFINWLIETKITD

2), human

GLP-2, frog
257
HAEGTFTNDMTNYLEEKAAKEFVGWLIKGRP-OH

IGF-1
258
GPETLCGAELVDALQFVCGDRGFYFNKPTGYGSSSRRAPQ

TGIVDECCFRSCDLRRLEMYCAPLKPAKSA

IGF-2
259
AYRPSETLCGGELVDTLQFVCGDRGFYFSRPASRVSRRSR

GIVEECCFRSCDLALLETYCATPAKSE

INGAP peptide
260
EESQKKLPSSRITCPQGSVAYGSYCYSLILIPQTWSNAELSC

(islet neogenesis-associated

QMHFSGHLAFLLSTGEITFVSSLVKNSLTAYQYIWIGLHDP

protein)

SHGTLPNGSGWKWSSSNVLTFYNWERNPSIAADRGYCAV

LSQKSGFQKWRDFNCENELPYICKFKV

Intermedin (AFP-6)
261
TQAQLLRVGCVLGTCQVQNLSHRLWQLMGPAGRQDSAP

VDPSSPHSY

Leptin, human
262
VPIQKVQDDTKTLIKTIVTRINDISHTQSVSSKQKVTGLDFI

PGLHPILTLSKMDQTLAVYQQILTSMPSRNVIQISNDLENL

RDLLHVLAFSKSCHLPWASGLETLDSLGGVLEASGYSTEV

VALSRLQGSLQDMLWQLDLSPGC

Neuromedin (U-8) porcine
263
YFLFRPRN

Neuromedin (U-9)
264
GYFLFRPRN

neuromedin (U25) human)
265
FRVDEEFQSPFASQSRGYFLFRPRN

Neuromedin (U25) pig
266
FKVDEEFQGPIVSQNRRYFLFRPRN

Neuromedin S, human
267
ILQRGSGTAAVDFTKKDHTATWGRPFFLFRPRN

Neuromedin U, rat
268
YKVNEYQGPVAPSGGFFLFRPRN

oxyntomodulin (OXM)
269
HSQGTFTSDYSKYLDSRRAQDFVQWLMNTKRNRNNIA

Peptide YY (PYY)
270
YPIKPEAPGEDASPEELNRYYASLRHYLNLVTRQRY

Pramlintide
271
KCNTATCATNRLANFLVHSSNNFGPILPPTNVGSNTY-NH2

Urocortin (Ucn-1)
272
DNPSLSIDLTFHLLRTLLELARTQSQRERAEQNRIIFDSV

Urocortin (Ucn-2)
273
IVLSLDVPIGLLQILLEQARARAAREQATTNARILARVGHC

Urocortin (Ucn-3)
274
FTLSLDVPTNIMNLLFNIAKAKNLRAQAAANAHLMAQI

“Adrenomedullin” or “ADM” means the human adrenomedulin peptide hormone and species and sequence variants thereof having at least a portion of the biological activity of mature ADM. ADM is generated from a 185 amino acid preprohormone through consecutive enzymatic cleavage and amidation, resulting in a 52 amino acid bioactive peptide with a measured plasma half-life of 22 min. ADM-containing fusion proteins of the invention may find particular use in diabetes for stimulatory effects on insulin secretion from islet cells for glucose regulation or in subjects with sustained hypotension. The complete genomic infrastructure for human AM has been reported (Ishimitsu, et al., Biochem. Biophys. Res. Commun 203:631-639 (1994)), and analogs of ADM peptides have been cloned, as described in U.S. Pat. No. 6,320,022.

“Amylin” means the human peptide hormone referred to as amylin, pramlintide, and species variations thereof, as described in U.S. Pat. No. 5,234,906, having at least a portion of the biological activity of mature amylin. Amylin is a 37-amino acid polypeptide hormone co-secreted with insulin by pancreatic beta cells in response to nutrient intake (Koda et al., Lancet 339:1179-1180. 1992), and has been reported to modulate several key pathways of carbohydrate metabolism, including incorporation of glucose into glycogen. Amylin-containing fusion proteins of the invention may find particular use in diabetes and obesity for regulating gastric emptying, suppressing glucagon secretion and food intake, thereby affecting the rate of glucose appearance in the circulation. Thus, the fusion proteins may complement the action of insulin, which regulates the rate of glucose disappearance from the circulation and its uptake by peripheral tissues. Amylin analogues have been cloned, as described in U.S. Pat. Nos. 5,686,411 and 7,271,238. Amylin mimetics can be created that retain biologic activity. For example, pramlintide has the sequence KCNTATCATNRLANFLVHSSNNFGPILPPTNVGSNTY (SEQ ID NO: 271), wherein amino acids from the rat amylin sequence are substituted for amino acids in the human amylin sequence. In one embodiment, the invention contemplates fusion proteins comprising amylin mimetics of the sequence KCNTATCATX₁RLANFLVHSSNNFGX₂ILX₂X₂TNVGSNTY (SEQ ID NO: 275), wherein X₁is independently N or Q and X₂is independently S, P or G. In one embodiment, the amylin mimetic incorporated into a composition of this disclosure can have the sequence KCNTATCATNRLANFLVHSSNNFGGILGGTNVGSNTY (SEQ ID NO: 276). In another embodiment, wherein the amylin mimetic is used at the C-terminus of the composition, the mimetic can have the sequence KCNTATCATNRLANFLVHSSNNFGGILGGTNVGSNTY(NH₂) (SEQ ID NO: 276).

“Calcitonin” (CT) means the human calcitonin protein and species and sequence variants thereof, including salmon calcitonin (“sCT”), having at least a portion of the biological activity of mature CT. CT is a 32 amino acid peptide cleaved from a larger prohormone of the thyroid that appears to function in the nervous and vascular systems, but has also been reported to be a potent hormonal mediator of the satiety reflex. CT is named for its secretion in response to induced hypercalcemia and its rapid hypocalcemic effect. It is produced in and secreted from neuroendocrine cells in the thyroid termed C cells. CT has effects on the osteoclast, and the inhibition of osteoclast functions by CT results in a decrease in bone resorption. In vitro effects of CT include the rapid loss of ruffled borders and decreased release of lysosomal enzymes. A major function of CT(1-32) is to combat acute hypercalcemia in emergency situations and/or protect the skeleton during periods of “calcium stress” such as growth, pregnancy, and lactation. (Reviewed in Becker, JCEM, 89(4): 1512-1525 (2004) and Sexton, Current Medicinal Chemistry 6: 1067-1093 (1999)). Calcitonin-containing fusion proteins of the invention may find particular use for the treatment of osteoporosis and as a therapy for Paget's disease of bone. Synthetic calcitonin peptides have been created, as described in U.S. Pat. Nos. 5,175,146 and 5,364,840.

“Calcitonin gene related peptide” or “CGRP” means the human CGRP peptide and species and sequence variants thereof having at least a portion of the biological activity of mature CGRP. Calcitonin gene related peptide is a member of the calcitonin family of peptides, which in humans exists in two forms, α-CGRP (a 37 amino acid peptide) and β-CGRP. CGRP has 43-46% sequence identity with human amylin. CGRP-containing fusion proteins of the invention may find particular use in decreasing morbidity associated with diabetes, ameliorating hyperglycemia and insulin deficiency, inhibition of lymphocyte infiltration into the islets, and protection of beta cells against autoimmune destruction. Methods for making synthetic and recombinant CGRP are described in U.S. Pat. No. 5,374,618.

“Cholecystokinin” or “CCK” means the human CCK peptide and species and sequence variants thereof having at least a portion of the biological activity of mature CCK. CCK-58 is the mature sequence, while the CCK-33 amino acid sequence first identified in humans is the major circulating form of the peptide. The CCK family also includes an 8-amino acid in vivo C-terminal fragment (“CCK-8”), pentagastrin or CCK-5 being the C-terminal peptide CCK(29-33), and CCK-4 being the C-terminal tetrapeptide CCK(30-33). CCK is a peptide hormone of the gastrointestinal system responsible for stimulating the digestion of fat and protein. CCK-33 and CCK-8-containing fusion proteins of the invention may find particular use in reducing the increase in circulating glucose after meal ingestion and potentiating the increase in circulating insulin. Analogues of CCK-8 have been prepared, as described in U.S. Pat. No. 5,631,230. 1002171″Exendin-3″ means a glucose regulating peptide isolated from Heloderma horridum and sequence variants thereof having at least a portion of the biological activity of mature exendin-3. Exendin-3 amide is a specific exendin receptor antagonist from that mediates an increase in pancreatic cAMP, and release of insulin and amylase. Exendin-3-containing fusion proteins of the invention may find particular use in the treatment of diabetes and insulin resistance disorders. The sequence and methods for its assay are described in U.S. Pat. No. 5,424,286.

Exendin-4″ means a glucose regulating peptide found in the saliva of the Gila-monster Heloderma suspectum, as well as species and sequence variants thereof, and includes the native 39 amino acid sequence His-Gly-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Leu-Ser-Lys-Gln-Met-Glu-Glu-Glu-Ala-Val-Arg-Leu-Phe-Ile-Glu-Trp-Leu-Lys-Asn-Gly-Gly-Pro-Ser-Ser-Gly-Ala-Pro-Pro-Pro-Ser and homologous sequences and peptide mimetics, and variants thereof; natural sequences, such as from primates and non-natural having at least a portion of the biological activity of mature exendin-4. Exendin-4 is an incretin polypeptide hormone that decreases blood glucose, promotes insulin secretion, slows gastric emptying and improves satiety, providing a marked improvement in postprandial hyperglycemia. The exendins have some sequence similarity to members of the glucagon-like peptide family, with the highest identity being to GLP-1 (Goke, et al., J. Biol. Chem., 268:19650-55 (1993)). A variety of homologous sequences can be functionally equivalent to native exendin-4 and GLP-1. Conservation of GLP-1 sequences from different species are presented in Regulatory Peptides 2001 98 p. 1-12. Table 3b shows the sequences from a wide variety of species, while Table 3c shows a list of synthetic GLP-1 analogs; all of which are contemplated for use in the composition described herein. Exendin-4 binds at GLP-1 receptors on insulin-secreting OTC′ cells, and also stimulates somatostatin release and inhibits gastrin release in isolated stomachs (Goke, et al., J. Biol. Chem. 268:19650-55, 1993). As a mimetic of GLP-1, exendin-4 displays a similar broad range of biological activities, yet has a longer half-life than GLP-1, with a mean terminal half-life of 2.4 h. Exenatide is a synthetic version of exendin-4, marketed as Byetta. However, due to its short half-life, exenatide is currently dosed twice daily, limiting its utility. Exendin-4-containing fusion proteins of the invention may find particular use in the treatment of diabetes and insulin resistance disorders.

‘Fibroblast growth factor 21’, or “FGF-21” means the human protein encoded by the FGF21 gene, or species and sequence variants thereof having at least a portion of the biological activity of mature FGF21. FGF-21 stimulates glucose uptake in adipocytes but not in other cell types; the effect is additive to the activity of insulin. FGF-21 injection in ob/ob mice results in an increase in Glut1 in adipose tissue. FGF21 also protects animals from diet-induced obesity when over expressed in transgenic mice and lowers blood glucose and triglyceride levels when administered to diabetic rodents (Kharitonenkov A, et al., (2005). “FGF-21 as a novel metabolic regulator”. J. Clin. Invest. 115: 1627-35). FGF-21-containing fusion proteins of the invention may find particular use in treatment of diabetes, including causing increased energy expenditure, fat utilization and lipid excretion. FGF-21 has been cloned, as disclosed in U.S. Pat. No. 6,716,626.

“FGF-19”, or “fibroblast growth factor 19” means the human protein encoded by the FGF19 gene, or species and sequence variants thereof having at least a portion of the biological activity of mature FGF-19. FGF-19 is a protein member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes. FGF-19 increases liver expression of the leptin receptor, metabolic rate, stimulates glucose uptake in adipocytes, and leads to loss of weight in an obese mouse model (Fu, L, et al. FGF-19-containing fusion proteins of the invention may find particular use in increasing metabolic rate and reversal of dietary and leptin-deficient diabetes. FGF-19 has been cloned and expressed, as described in US Patent Application No. 20020042367.

“Gastrin” means the human gastrin peptide, truncated versions, and species and sequence variants thereof having at least a portion of the biological activity of mature gastrin. Gastrin is a linear peptide hormone produced by G cells of the duodenum and in the pyloric antrum of the stomach and is secreted into the bloodstream. Gastrin is found primarily in three forms: gastrin-34 (“big gastrin”); gastrin-17 (“little gastrin”); and gastrin-14 (“minigastrin”). It shares sequence homology with CCK. Gastrin-containing fusion proteins of the invention may find particular use in the treatment of obesity and diabetes for glucose regulation. Gastrin has been synthesized, as described in U.S. Pat. No. 5,843,446.

“Ghrelin” means the human hormone that induces satiation, or species and sequence variants thereof, including the native, processed 27 or 28 amino acid sequence and homologous sequences. Ghrelin is produced mainly by P/D1 cells lining the fundus of the human stomach and epsilon cells of the pancreas that stimulates hunger, and is considered the counterpart hormone to leptin. Ghrelin levels increase before meals and decrease after meals, and can result in increased food intake and increase fat mass by an action exerted at the level of the hypothalamus. Ghrelin also stimulates the release of growth hormone. Ghrelin is acylated at a serine residue by n-octanoic acid; this acylation is essential for binding to the GHS1a receptor and for the GH-releasing capacity of ghrelin. Ghrelin-containing fusion proteins of the invention may find particular use as agonists; e.g., to selectively stimulate motility of the GI tract in gastrointestinal motility disorder, to accelerate gastric emptying, or to stimulate the release of growth hormone. Ghrelin analogs with sequence substitutions or truncated variants, such as described in U.S. Pat. No. 7,385,026, may find particular use as fusion partners to XTEN for use as antagonists for improved glucose homeostasis, treatment of insulin resistance and treatment of obesity. The isolation and characterization of ghrelin has been reported (Kojima M, et al., Ghrelin is a growth-hormone-releasing acylated peptide from stomach. Nature. 1999; 402(6762):656-660.) and synthetic analogs have been prepared by peptide synthesis, as described in U.S. Pat. No. 6,967,237.

“Glucagon” means the human glucagon glucose regulating peptide, or species and sequence variants thereof, including the native 29 amino acid sequence and homologous sequences; natural, such as from primates, and non-natural sequence variants having at least a portion of the biological activity of mature glucagon. The term “glucagon” as used herein also includes peptide mimetics of glucagon. Native glucagon is produced by the pancreas, released when blood glucose levels start to fall too low, causing the liver to convert stored glycogen into glucose and release it into the bloodstream. While the action of glucagon is opposite that of insulin, which signals the body's cells to take in glucose from the blood, glucagon also stimulates the release of insulin, so that newly-available glucose in the bloodstream can be taken up and used by insulin-dependent tissues. Glucagon-containing fusion proteins of the invention may find particular use in increasing blood glucose levels in individuals with extant hepatic glycogen stores and maintaining glucose homeostasis in diabetes. Glucagon has been cloned, as disclosed in U.S. Pat. No. 4,826,763.

“GLP-1” means human glucagon like peptide-1 and sequence variants thereof having at least a portion of the biological activity of mature GLP-1. The term “GLP-1” includes human GLP-1(1-37), GLP-1(7-37), and GLP-1(7-36)amide. GLP-1 stimulates insulin secretion, but only during periods of hyperglycemia. The safety of GLP-1 compared to insulin is enhanced by this property and by the observation that the amount of insulin secreted is proportional to the magnitude of the hyperglycemia. The biological half-life of GLP-1(7-37)OH is a mere 3 to 5 minutes (U.S. Pat. No. 5,118,666). GLP-1-containing fusion proteins of the invention may find particular use in the treatment of diabetes and insulin-resistance disorders for glucose regulation. GLP-1 has been cloned and derivatives prepared, as described in U.S. Pat. No. 5,118,666. Non-limited examples of glucagon-like peptide sequences from a wide variety of species, and synthetic analogs thereof, are shown in Tables 3b-3c. In some embodiments of the compositions disclosed herein, where the biologically active moiety can be a biologically active peptide (BP), the BP can comprise a peptide sequence that exhibits at least (about) 80% sequence identity (e.g., at least (about) 81%, at least (about) 82%, at least (about) 83%, at least (about) 84%, at least (about) 85%, at least (about) 86%, at least (about) 87%, at least (about) 88%, at least (about) 89%, at least (about) 90%, at least (about) 91%, at least (about) 92%, at least (about) 93%, at least (about) 94%, at least (about) 95%, at least (about) 96%, at least (about) 97%, at least (about) 98%, at least (about) 99%, or 100% sequence identity) to an amino acid sequence of a glucagon-like peptide (native or synthetic analog) set forth in Tables 3b-3c.

TABLE 3b

Representative Naturally-Occurring GLP-1 Homologs as BP Candidates

SEQ ID

Gene Name
NO:
Amino Acid Sequence

GLP-1 [frog]
277
HAEGTYTNDVTEYLEEKAAKEFIEWLIKGKPKKIRYS

GLP-la [Xenopus laevis]
278
HAEGTFTSDVTQQLDEKAAKEFIDWLINGGPSKEIIS

GLP-16 [Xenopus laevis]
279
HAEGTYTNDVTEYLEEKAAKEFIIEWLIKGKPK

GLP-1c [Xenopus laevis]
280
HAEGTFTNDMTNYLEEKAAKEFVGWLIKGRPK

Gastric Inhibitory
281
HAEGTFISDYSIAMDKIRQQDFVNWLL

Polypeptide [Mus musculus]

Glucose-dependent
282
HAEGTFISDYSIAMDKIRQQDFVNWLL

insulinotropic polypeptide

[Equus caballus]

Glucagon-like peptide
283
HADGTFTNDMTSYLDAKAARDFVSWLARSDKS

[Petromyzon marinus]

Glucagon-like peptide
284
HAEGTYTSDVSSYLQDQAAKEFVSWLKTGR

[Anguilla rostrata]

Glucagon-like peptide
285
HAEGTYTSDVSSYLQDQAAKEFVSWLKTGR

[Anguilla anguilla]

Glucagon-like peptide
286
HADGIYTSDVASLTDYLKSKRFVESLSNYNKRQNDRRM

[Hydrolagus colliei]

Glucagon-like peptide
287
YADAPYISDVYSYLQDQVAKKWLKSGQDRRE

[Amia calva]

GLUC_ICTPU/38-65
288
HADGTYTSDVSSYLQEQAAKDFITWLKS

GLUCL_ANGRO/1-28
289
HAEGTYTSDVSSYLQDQAAKEFVSWLKT

GLUC_BOVIN/98-125
290
HAEGTFTSDVSSYLEGQAAKEFIAWLVK

GLUC1_LOPAM/91-118
291
HADGTFTSDVSSYLKDQAIKDFVDRLKA

GLUCL_HYDCO/1-28
292
HADGIYTSDVASLTDYLKSKRFVESLSN

GLUC_CAVPO/53-80
293
HSQGTFTSDYSKYLDSRRAQQFLKWLLN

GLUC_CHIBR/1-28
294
HSQGTFTSDYSKHLDSRYAQEFVQWLMN

GLUC1_LOPAM/53-80
295
HSEGTFSNDYSKYLEDRKAQEFVRWLMN

GLUC_HYDCO/1-28
296
HTDGIFSSDYSKYLDNRRTKDFVQWLLS

GLUC_CALMI/1-28
297
HSEGTFSSDYSKYLDSRRAKDFVQWLMS

GIP_BOVIN/1-28
298
YAEGTFISDYSIAMDKIRQQDFVNWLLA

VIP_MELGA/89-116
299
HADGIFTTVYSHLLAKLAVKRYLHSLIR

PACA_CHICK/131-158
300
HIDGIFTDSYSRYRKQMAVKKYLAAVLG

VIP_CAVPO/45-72
301
HSDALFTDTYTRLRKQMAMKKYLNSVLN

VIP_DIDMA/1-28
302
HSDAVFTDSYTRLLKQMAMRKYLDSILN

EXE1_HELSU/1-28
303
HSDATFTAEYSKLLAKLALQKYLESILG

SLIB_CAPHI/1-28
304
YADAIFTNSYRKVLGQLSARKLLQDIMN

SLIB_RAT/31-58
305
HADAIFTSSYRRILGQLYARKLLHEIMN

SLIB_MOUSE/31-58
306
HVDAIFTTNYRKLLSQLYARKVIQDIMN

PACA_HUMAN/83-110
307
VAHGILNEAYRKVLDQLSAGKHLQSLVA

PACA_SHEEP/83-110
308
VAHGILDKAYRKVLDQLSARRYLQTLMA

PACA_ONCNE/82-109
309
HADGMFNKAYRKALGQLSARKYLHSLMA

GLUC_BOVIN/146-173
310
HADGSFSDEMNTVLDSLATRDFINWLLQ

SECR_CANFA/1-27
311
HSDGTFTSELSRLRESARLQRLLQGLV

SECR_CHICK/1-27
312
HSDGLFTSEYSKMRGNAQVQKFIQNLM

EXE3_HELHO/48-75
313
HSDGTFTSDLSKQMEEEAVRLFIEWLKN

TABLE 3c

Representative GLP-1 Synthetic Analogs

SEQ

ID

NO:
Amino Acid Sequence

314
HAEGTFTSDVSSYLEGQAAREFIAWLVKGRG

315
HAEGTFTSDVSSYLEGQAAKEFIAWLVRGRG

316
HAEGTFTSDVSSYLEGQAAKEFIAWLVKGKG

317
HAEGTFTSDVSSYLEGQAAREFIAWLVRGKG

318
HAEGTFTSDVSSYLEGQAAREFIAWLVRGKGR

319
HAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRK

320
HAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRRK

321
HAEGTFTSDVSSYLEGQAAREFIAWLVKGKG

322
HAEGTFTSDVSSYLEGQAAKEFIAWLVRGKG

323
HAEGTFTSDVSSYLEGQAAREFIAWLVKGRGRK

324
HAEGTFTSDVSSYLEGQAAKEFIAWLVRGRGRRK

325
HAEGTFTSDVSSYLEGQAAREFIAWLVRGKGRK

326
HAEGTFTSDVSSYLEGQAAREFIAWLVRGKGRRK

327
HGEGTFTSDVSSYLEGQAAREFIAWLVKGRG

328
HGEGTFTSDVSSYLEGQAAKEFIAWLVRGRG

329
HGEGTFTSDVSSYLEGQAAKEFIAWLVKGKG

330
HGEGTFTSDVSSYLEGQAAREFIAWLVRGKG

331
HGEGTFTSDVSSYLEGQAAREFIAWLVRGRGRK

332
HGEGTFTSDVSSYLEGQAAREFIAWLVRGRGRRK

333
HGEGTFTSDVSSYLEGQAAREFIAWLVKGKG

334
HGEGTFTSDVSSYLEGQAAKEFIAWLVRGKG

335
HGEGTFTSDVSSYLEGQAAREFIAWLVKGRGRK

336
HGEGTFTSDVSSYLEGQAAKEFIAWLVRGRGRRK

337
HGEGTFTSDVSSYLEGQAAREFIAWLVRGKGRK

338
HGEGTFTSDVSSYLEGQAAREFIAWLVRGKGRRK

339
HAEGTFTSDVSSYLEGQAAREFIAWLVRGRGK

340
HAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRK

341
HAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRRK

342
HAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREK

343
HAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREFK

344
HAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREFPK

345
HAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREFPEK

346
HAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREFPEEK

347
HDEFERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGK

348
HDEFERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRK

349
HDEFERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRRK

350
HDEFERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREK

351
HDEFERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREFK

352
HDEFERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREFPK

353
HDEFERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREFPEK

354
HDEFERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREFPEEK

355
DEFERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRK

356
DEFERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRRK

357
DEFERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREK

358
DEFERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREFK

359
DEFERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREFPK

360
DEFERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREFPEK

361
DEFERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREFPEEK

362
EFERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGK

363
EFERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRK

364
EFERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRRK

365
EFERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREK

366
EFERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREFK

367
EFERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREFPK

368
EFERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREFPEK

369
EFERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREFPEEK

370
FERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGK

371
FERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRK

372
FERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRRK

373
FERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREK

374
FERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREFK

375
FERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREFPK

376
FERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREFPEK

377
FERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREFPEEK

378
ERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGK

379
ERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRK

380
ERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRRK

381
ERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREK

382
ERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREFK

383
ERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREFPK

384
ERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREFPEK

385
ERHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREFPEEK

386
RHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGK

387
RHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRK

388
RHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRRK

389
RHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREK

390
RHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREFK

391
RHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREFPK

392
RHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREFPEK

393
RHAEGTFTSDVSSYLEGQAAREFIAWLVRGRGRREFPEEK

394
HDEFERHAEGTFTSDVSSYLEGQAAREFIAWLVKGRGK

395
HDEFERHAEGTFTSDVSSYLEGQAAKEFIAWLVRGRGK

396
HDEFERHAEGTFTSDVSSYLEGQAAREFIAWLVRGKGK

397
HAEGTFTSDVSSYLEGQAAREFIAWLVKGRGK

398
HAEGTFTSDVSSYLEGQAAKEFIAWLVRGRGK

399
HAEGTFTSDVSSYLEGQAAREFIAWLVRGKGK

400
HAEGTFTSDVSSYLEGQAAREFIAWLVRGRGK

401
HDEFERHAEGTFTSDVSSYLEGQAAREFIAWLVKGRGRK

402
HDEFERHAEGTFTSDVSSYLEGQAAKEFIAWLVRGRGRK

403
HDEFERHAEGTFTSDVSSYLEGQAAREFIAWLVRGKGRK

404
HAEGTFTSDVSSYLEGQAAREFIAWLVKGRGRK

405
HAEGTFTSDVSSYLEGQAAKEFIAWLVRGRGRK

406
HAEGTFTSDVSSYLEGQAAREFIAWLVRGKGRK

407
HGEGTFTSDVSSYLEGQAAREFIAWLVKGRGK

408
HGEGTFTSDVSSYLEGQAAREFIAWLVRGKGK

GLP native sequences may be described by several sequence motifs, which are presented below. Letters in brackets represent acceptable amino acids at each sequence position: [HVY] [AGISTV] [DEHQ] [AG] [ILMPSTV] [FLY] [DINST] [ADEKNST] [ADENSTV] [LMVY] [ANRSTY] [EHIKNQRST] [AHILMQVY] [LMRT] [ADEGKQS] [ADEGKNQSY] [AEIKLMQR] [AKQRSVY] [AILMQSTV] [GKQR] [DEKLQR] [FHLVWY] [ILV] [ADEGHIKNQRST] [ADEGNRSTW] [GILVW] [AIKLMQSV] [ADGIKNQRST] [GKRSY]. In addition, synthetic analogs of GLP-1 can be useful as fusion partners to a masking moiety (such as XTEN) to create a fusion composition with biological activity useful in treatment of glucose-related disorders. Further sequences homologous to Exendin-4 or GLP-1 may be found by standard homology searching techniques.

“GLP-2” means human glucagon like peptide-2 and sequence variants thereof having at least a portion of the biological activity of mature GLP-2. More particularly, GLP-2 is a 33 amino acid peptide, co-secreted along with GLP-1 from intestinal endocrine cells in the small and large intestine.

“IGF-1” or “Insulin-like growth factor 1” means the human IGF-1 protein and species and sequence variants thereof having at least a portion of the biological activity of mature IGF-1. IGF-1, which was once called somatomedin C, is a polypeptide protein anabolic hormone similar in molecular structure to insulin, and that modulates the action of growth hormone. IGF-1 consists of 70 amino acids and is produced primarily by the liver as an endocrine hormone as well as in target tissues in a paracrine/autocrine fashion. IGF-1-containing fusion proteins of the invention may find particular use in the treatment of diabetes and insulin-resistance disorders for glucose regulation. IGF-1 has been cloned and expressed in E. coli and yeast, as described in U.S. Pat. No. 5,324,639.

“IGF-2” or “Insulin-like growth factor 2” means the human IGF-2 protein and species and sequence variants thereof having at least a portion of the biological activity of mature IGF-2. IGF-2 is a polypeptide protein hormone similar in molecular structure to insulin, with a primary role as a growth-promoting hormone during gestation. IGF-2 has been cloned, as described in Bell G I, et al. Isolation of the human insulin-like growth factor genes: insulin-like growth factor II and insulin genes are contiguous. Proc Natl Acad Sci USA. 1985. 82(19):6450-4.

“INGAP”, or “islet neogenesis-associated protein”, or “pancreatic beta cell growth factor” means the human INGAP peptide and species and sequence variants thereof having at least a portion of the biological activity of mature INGAP. INGAP is capable of initiating duct cell proliferation, a prerequisite for islet neogenesis. INGAP-containing fusion proteins of the invention may find particular use in the treatment or prevention of diabetes and insulin-resistance disorders. INGAP has been cloned and expressed, as described in R Rafaeloff R, et al., Cloning and sequencing of the pancreatic islet neogenesis associated protein (INGAP) gene and its expression in islet neogenesis in hamsters. J Clin Invest. 1997. 99(9): 2100-2109.

“Intermedin” or “AFP-6” means the human intermedin peptide and species and sequence variants thereof having at least a portion of the biological activity of mature intermedin. Intermedin is a ligand for the calcitonin receptor-like receptor. Intermedin treatment leads to blood pressure reduction both in normal and hypertensive subjects, as well as the suppression of gastric emptying activity, and is implicated in glucose homeostasis. Intermedin-containing fusion proteins of the invention may find particular use in the treatment of diabetes, insulin-resistance disorders, and obesity. Intermedin peptides and variants have been cloned, as described in U.S. Pat. No. 6,965,013.

“Leptin” means the naturally occurring leptin from any species, as well as biologically active D-isoforms, or fragments and sequence variants thereof. Leptin plays a key role in regulating energy intake and energy expenditure, including appetite and metabolism. Leptin-containing fusion proteins of the invention may find particular use in the treatment of diabetes for glucose regulation, insulin-resistance disorders, and obesity. Leptin is the polypeptide product of the ob gene as described in the International Patent Pub. No. WO 96/05309. Leptin has been cloned, as described in U.S. Pat. No. 7,112,659, and leptin analogs and fragments in U.S. Pat. Nos. 5,521,283, 5,532,336, PCT/US96/22308 and PCT/US96/01471.

“Neuromedin” means the neuromedin family of peptides including neuromedin U and S peptides, and sequence variants thereof. The native active human neuromedin U peptide hormone is neuromedin-U25, particularly its amide form. Of particular interest are their processed active peptide hormones and analogs, derivatives and fragments thereof. Included in the neuromedin U family are various truncated or splice variants, e.g., FLFHYSKTQKLGKSNVVEELQSPFASQSRGYFLFRPRN (SEQ ID NO: 409). Exemplary of the neuromedin S family is human neuromedin S with the sequence ILQRGSGTAAVDFTKKDHTATWGRPFFLFRPRN (SEQ ID NO: 267), particularly its amide form. Neuromedin fusion proteins of the invention may find particular use in treating obesity, diabetes, reducing food intake, and other related conditions and disorders as described herein. Of particular interest are neuromedin modules combined with an amylin family peptide, an exendin peptide family or a GLP I peptide family module.

“Oxyntomodulin”, or “OXM” means human oxyntomodulin and species and sequence variants thereof having at least a portion of the biological activity of mature OXM. OXM is a 37 amino acid peptide produced in the colon that contains the 29 amino acid sequence of glucagon followed by an 8 amino acid carboxyterminal extension. OXM has been found to suppress appetite. OXM-containing fusion proteins of the invention may find particular use in the treatment of diabetes for glucose regulation, insulin-resistance disorders, obesity, and can be used as a weight loss treatment.

“PYY” means human peptide YY polypeptide and species and sequence variants thereof having at least a portion of the biological activity of mature PYY. PYY includes both the human full length, 36 amino acid peptide, PYY_1-36and PYY_3-36which have the PP fold structural motif. PYY inhibits gastric motility and increases water and electrolyte absorption in the colon. PYY may also suppress pancreatic secretion. PPY-containing fusion proteins of the invention may find particular use in the treatment of diabetes for glucose regulation, insulin-resistance disorders, and obesity. Analogs of PYY have been prepared, as described in U.S. Pat. Nos. 5,604,203, 5,574,010 and 7,166,575.

“Urocortin” means a human urocortin peptide hormone and sequence variants thereof having at least a portion of the biological activity of mature urocortin. There are three human urocortins: Ucn-1, Ucn-2 and Ucn-3. Further urocortins and analogs have been described in U.S. Pat. No. 6,214,797. Urocortins Ucn-2 and Ucn-3 have food-intake suppression, antihypertensive, cardioprotective, and inotropic properties. Ucn-2 and Ucn-3 have the ability to suppress the chronic HPA activation following a stressful stimulus such as dieting/fasting, and are specific for the CRF type 2 receptor and do not activate CRF-R1 which mediates ACTH release. Therapeutic agents comprising urocortin, e.g., Ucn-2 or Ucn-3, may be useful for vasodilation and thus for cardiovascular uses such as chronic heart failure. Urocortin-containing fusion proteins of the invention may also find particular use in treating or preventing conditions associated with stimulating ACTH release, hypertension due to vasodilatory effects, inflammation mediated via other than ACTH elevation, hyperthermia, appetite disorder, congestive heart failure, stress, anxiety, and psoriasis. Urocortin-containing fusion proteins may also be combined with a natriuretic peptide module, amylin family, and exendin family, or a GLP 1 family module to provide an enhanced cardiovascular benefit, e.g. treating CHF, as by providing a beneficial vasodilation effect.

Metabolic Disease and Cardiovascular Proteins

Metabolic and cardiovascular diseases represent a substantial health care burden in most developed nations, with cardiovascular diseases remaining the number one cause of death and disability in the United States and most European countries. Metabolic diseases and disorders include a large variety of conditions affecting the organs, tissues, and circulatory system of the body. Chief amongst these is diabetes; one of the leading causes of death in the United States, as it results in pathology and metabolic dysfunction in both the vasculature, central nervous system, major organs, and peripheral tissues. Insulin resistance and hyperinsulinemia have also been linked with two other metabolic disorders that pose considerable health risks: impaired glucose tolerance and metabolic obesity. Impaired glucose tolerance is characterized by normal glucose levels before eating, with a tendency toward elevated levels (hyperglycemia) following a meal. These individuals are considered to be at higher risk for diabetes and coronary artery disease. Obesity is also a risk factor for the group of conditions called insulin resistance syndrome, or “Syndrome X,” as is hypertension, coronary artery disease (arteriosclerosis), and lactic acidosis, as well as related disease states. The pathogenesis of obesity is believed to be multifactorial but an underlying problem is that in the obese, nutrient availability and energy expenditure are not in balance until there is excess adipose tissue.

Dyslipidemia is a frequent occurrence among diabetics and subjects with cardiovascular disease; typically characterized by parameters such as elevated plasma triglycerides, low HDL (high density lipoprotein) cholesterol, normal to elevated levels of LDL (low density lipoprotein) cholesterol and increased levels of small dense, LDL particles in the blood. Dyslipidemia and hypertension is a main contributor to an increased incidence of coronary events, renal disease, and deaths among subjects with metabolic diseases like diabetes and cardiovascular disease.

Cardiovascular disease can be manifest by many disorders, symptoms and changes in clinical parameters involving the heart, vasculature and organ systems throughout the body, including aneurysms, angina, atherosclerosis, cerebrovascular accident (Stroke), cerebrovascular disease, congestive heart failure, coronary artery disease, myocardial infarction, reduced cardiac output and peripheral vascular disease, hypertension, hypotension, blood markers (e.g., C-reactive protein, BNP, and enzymes such as CPK, LDH, SGPT, SGOT), amongst others.

Most metabolic processes and many cardiovascular parameters are regulated by multiple peptides and hormones (“metabolic proteins”), and many such peptides and hormones, as well as analogues thereof, have found utility in the treatment of such diseases and disorders. However, the use of therapeutic peptides and/or hormones, even when augmented by the use of small molecule drugs, has met with limited success in the management of such diseases and disorders. In particular, dose optimization is important for drugs and biologics used in the treatment of metabolic diseases, especially those with a narrow therapeutic window. Hormones in general, and peptides involved in glucose homeostasis often have a narrow therapeutic window. The narrow therapeutic window, coupled with the fact that such hormones and peptides typically have a short half-life which necessitates frequent dosing in order to achieve clinical benefit, results in difficulties in the management of such patients. Therefore, there remains a need for therapeutics with broader therapeutic window and increased efficacy and safety in the treatment of metabolic diseases.

In some embodiments of the compositions, as disclosed herein in this disclosure, the biologically active peptide (BP) can comprise a biologically active metabolic protein, and the composition can have utility in the treatment of metabolic and cardiovascular diseases and disorders. The metabolic proteins can include any protein of biologic, therapeutic, or prophylactic interest or function that is useful for preventing, treating, mediating, or ameliorating a metabolic or cardiovascular disease, disorder or condition. Table 3d provides a non-limiting list of such sequences of metabolic BPs that can be encompassed by the compositions (e.g., the therapeutic agents) of the invention. In some embodiments of the compositions disclosed herein, where the biologically active moiety is a biologically active peptide (BP), the BP can comprise a peptide sequence that exhibits at least (about) 80% sequence identity (e.g., at least (about) 81%, at least (about) 82%, at least (about) 83%, at least (about) 84%, at least (about) 85%, at least (about) 86%, at least (about) 87%, at least (about) 88%, at least (about) 89%, at least (about) 90%, at least (about) 91%, at least (about) 92%, at least (about) 93%, at least (about) 94%, at least (about) 95%, at least (about) 96%, at least (about) 97%, at least (about) 98%, at least (about) 99%, or 100% sequence identity) to an amino acid sequence of a metabolic protein set forth in Table 3d.

TABLE 3d

Biologically Active Proteins Relating to Metabolic Disorders and Cardiology

SEQ ID

Name of Protein
NO:
Sequence

Anti-CD3

See U.S. Pat. Nos. 5,885,573 and 6,491,916

IL-1ra, human full length
410
MEICRGLRSHLITLLLFLFHSETICRPSGRKSSKMQAFRIWD

VNQKTFYLRNNQLVAGYLQGPNVNLEEKIDVVPIEPHALF

LGIHGGKMCLSCVKSGDETRLQLEAVNITDLSENRKQDKR

FAFIRSDSGPTTSFESAACPGWFLCTAMEADQPVSLTNMP

DEGVMVTKFYFQEDE

IL-1ra, Dog
411
METCRCPLSYLISFLLFLPHSETACRLGKRPCRMQAFRIWD

VNQKTFYLRNNQLVAGYLQGSNTKLEEKLDVVPVEPHAV

FLGIHGGKLCLACVKSGDETRLQLEAVNITDLSKNKDQDK

RFTFILSDSGPTTSFESAACPGWFLCTALEADRPVSLTNRPE

EAMMVTKFYFQKE

IL-1ra, Rabbit
412
MRPSRSTRRHLISLLLFLFHSETACRPSGKRPCRMQAFRIW

DVNQKTFYLRNNQLVAGYLQGPNAKLEERIDVVPLEPQLL

FLGIQRGKLCLSCVKSGDKMKLHLEAVNITDLGKNKEQD

KRFTFIRSNSGPTTTFESASCPGWFLCTALEADQPVSLTNTP

DDSIVVTKFYFQED

IL-1ra, Rat
413
MEICRGPYSHLISLLLILLFRSESAGHIPAGKRPCKMQAFRI

WDTNQKTFYLRNNQLIAGYLQGPNTKLEEKIDMVPIDFRN

VFLGIHGGKLCLSCVKSGDDTKLQLEEVNITDLNKNKEED

KRFTFIRSETGPTTSFESLACPGWFLCTTLEADHPVSLTNTP

KEPCTVTKFYFQED

IL-1ra, Mouse
414
MEICWGPYSHLISLLLILLFHSEAACRPSGKRPCKMQAFRI

WDTNQKTFYLRNNQLIAGYLQGPNIKLEEKIDMVPIDLHS

VFLGIHGGKLCLSCAKSGDDIKLQLEEVNITDLSKNKEEDK

RFTFIRSEKGPTTSFESAACPGWFLCTTLEADRPVSLTNTPE

EPLIVTKFYFQEDQ

Anakinra
415
MRPSGRKSSKMQAFRIWDVNQKTFYLRNNQLVAGYLQGP

NVNLEEKIDVVPIEPHALFLGIHGGKMCLSCVKSGDETRLQ

LEAVNITDLSENRKQDKRFAFIRSDSGPTTSFESAACPGWF

LCTAMEADQPVSLTNMPDEGVMVTKFYFQEDE

α-natriuretic peptide (ANP)
416
SLRRSSCFGGRMDRIGAQSGLGCNSFRY

β-natriuretic peptide, human
417
SPKMVQGSGGFGRKMDRISSSSGLGCKVLRRH

(BNP human)

Brain natriuretic
418
NSKMAHSSSCFGQKIDRIGAVSRLGCDGLRLF

peptide, Rat; (BNP Rat)

C-type natriuretic peptide
419
GLSKGCFGLKLDRIGSMSGLGC

(CNP, porcine)

Fibroblast growth factor 2
420
PALPEDGGSGAFPPGHFKDPKRLYCKNGGFFLRIHPDGRV

(FGF-2)

DGVREKSDPHIKLQLQAEERGVVSIKGVCANRYLAMKED

GRLLASKCVTDECFFFERLESNNYNTYRS RKYTSWYVAL

KRTGQYKLGS KTGPGQKAIL FLPMSAKS

TNF receptor (TNFR)
421
LPAQVAFTPYAPEPGSTCRLREYYDQTAQMCCSKCSPGQH

AKVFCTKTSDTVCDSCEDSTYTQLWNWVPECLSCGSRCSS

DQVETQACTREQNRICTCRPGWYCALSKQEGCRLCAPLR

KCRPGFGVARPGTETSDVVCKPCAPGTFSNTTSSTDICRPH

QICNVVAIPGNASMDAVCTSTSPTRSMAPGAVHLPQPVST

RSQHTQPTPEPSTAPSTSFLLPMGPSPPAEGSTGD

“Anti-CD3” means a monoclonal antibody against the T cell surface protein CD3, species and sequence variants, and fragments thereof, including OKT3 (also called muromonab) and humanized anti-CD3 monoclonal antibody (hOKT31(Ala-Ala))(KC Herold et al., New England Journal of Medicine 346:1692-1698. 2002) Anti-CD3 prevents T-cell activation and proliferation by binding the T-cell receptor complex present on all differentiated T cells. Anti-CD3-containing fusion proteins of the invention may find particular use to slow new-onset Type 1 diabetes, including use of the anti-CD3 as a therapeutic effector as well as a targeting moiety for a second therapeutic BP in the composition of this disclosure. The sequences for the variable region and the creation of an anti-CD3 have been described in U.S. Pat. Nos. 5,885,573 and 6,491,916.

“IL-1ra” means the human IL-1 receptor antagonist protein and species and sequence variants thereof, including the sequence variant anakinra (Kineret®), having at least a portion of the biological activity of mature IL-1ra. Human IL-1ra is a mature glycoprotein of 152 amino acid residues. The inhibitory action of IL-1ra results from its binding to the type I IL-1 receptor. The protein has a native molecular weight of 25 kDa, and the molecule shows limited sequence homology to IL-1α (19%) and IL-1β (26%). Anakinra is a nonglycosylated, recombinant human IL-1ra and differs from endogenous human IL-1ra by the addition of an N-terminal methionine. A commercialized version of anakinra is marketed as Kineret®. It binds with the same avidity to IL-1 receptor as native IL-1ra and IL-1b, but does not result in receptor activation (signal transduction), an effect attributed to the presence of only one receptor binding motif on IL-1ra versus two such motifs on IL-1α and IL-1β. Anakinra has 153 amino acids and 17.3 kD in size, and has a reported half-life of approximately 4-6 hours.

Increased IL-1 production has been reported in patients with various viral, bacterial, fungal, and parasitic infections; intravascular coagulation; high-dose IL-2 therapy; solid tumors; leukemias; Alzheimer's disease; HIV-1 infection; autoimmune disorders; trauma (surgery); hemodialysis; ischemic diseases (myocardial infarction); noninfectious hepatitis; asthma; UV radiation; closed head injury; pancreatitis; peritonitis; graft-versus-host disease; transplant rejection; and in healthy subjects after strenuous exercise. There is an association of increased IL-1b production in patients with Alzheimer's disease and a possible role for IL 1 in the release of the amyloid precursor protein. IL-1 has also been associated with diseases such as type 2 diabetes, obesity, hyperglycemia, hyperinsulinemia, type 1 diabetes, insulin resistance, retinal neurodegenerative processes, disease states and conditions characterized by insulin resistance, acute myocardial infarction (AMI), acute coronary syndrome (ACS), atherosclerosis, chronic inflammatory disorders, rheumatoid arthritis, degenerative intervertebral disc disease, sarcoidosis, Crohn's disease, ulcerative colitis, gestational diabetes, excessive appetite, insufficient satiety, metabolic disorders, glucagonomas, secretory disorders of the airway, osteoporosis, central nervous system disease, restenosis, neurodegenerative disease, renal failure, congestive heart failure, nephrotic syndrome, cirrhosis, pulmonary edema, hypertension, disorders wherein the reduction of food intake is desired, irritable bowel syndrome, myocardial infarction, stroke, post-surgical catabolic changes, hibernating myocardium, diabetic cardiomyopathy, insufficient urinary sodium excretion, excessive urinary potassium concentration, conditions or disorders associated with toxic hypervolemia, polycystic ovary syndrome, respiratory distress, chronic skin ulcers, nephropathy, left ventricular systolic dysfunction, gastrointestinal diarrhea, postoperative dumping syndrome, irritable bowel syndrome, critical illness polyneuropathy (CIPN), systemic inflammatory response syndrome (SIRS), dyslipidemia, reperfusion injury following ischemia, and coronary heart disease risk factor (CHDRF) syndrome. IL-1ra-containing fusion proteins of the invention may find particular use in the treatment of any of the foregoing diseases and disorders. IL-1ra has been cloned, as described in U.S. Pat. Nos. 5,075,222 and 6,858,409.

“Natriuretic peptides” means atrial natriuretic peptide (ANP), brain natriuretic peptide (BNP or B-type natriuretic peptide) and C-type natriuretic peptide (CNP); both human and non-human species and sequence variants thereof having at least a portion of the biological activity of the mature counterpart natriuretic peptides. Alpha atrial natriuretic peptide (aANP) or (ANP) and brain natriuretic peptide (BNP) and type C natriuretic peptide (CNP) are homologous polypeptide hormones involved in the regulation of fluid and electrolyte homeostasis. Sequences of useful forms of natriuretic peptides are disclosed in U.S. Patent Publication 20010027181. Examples of ANPs include human ANP (Kangawa et al., BBRC 118:131 (1984)) or that from various species, including pig and rat ANP (Kangawa et al., BBRC 121:585 (1984)). Sequence analysis reveals that preproBNP consists of 134 residues and is cleaved to a 108-amino acid ProBNP. Cleavage of a 32-amino acid sequence from the C-terminal end of ProBNP results in human BNP (77-108), which is the circulating, physiologically active form. The 32-amino acid human BNP involves the formation of a disulfide bond (Sudoh et al., BBRC 159:1420 (1989)) and U.S. Pat. Nos. 5,114,923, 5,674,710, 5,674,710, and 5,948,761. Compositions-containing one or more natriuretic functions may be useful in treating hypertension, diuresis inducement, natriuresis inducement, vascular conduct dilatation or relaxation, natriuretic peptide receptors (such as NPR-A) binding, 112apida secretion suppression from the kidney, aldostrerone secretion suppression from the adrenal gland, treatment of cardiovascular diseases and disorders, reducing, stopping or reversing cardiac remodeling after a cardiac event or as a result of congestive heart failure, treatment of renal diseases and disorders; treatment or prevention of ischemic stroke, and treatment of asthma.

“FGF-2” or heparin-binding growth factor 2, means the human FGF-2 protein, and species and sequence variants thereof having at least a portion of the biological activity of the mature counterpart. FGF-2 had been shown to stimulate proliferation of neural stem cells differentiated into striatal-like neurons and protect striatal neurons in toxin-induced models of Huntington Disease, and also my have utility in treatment of cardiac reperfusion injury, and may have endothelial cell growth, anti-angiogenic and tumor suppressive properties, wound healing, as well as promoting fracture healing in bones. FGF-2 has been cloned, as described in Burgess, W. H. and Maciag, T., Ann. Rev. Biochem., 58:575-606 (1989); Coulier, F., et al., 1994, Prog. Growth Factor Res. 5:1; and the PCT publication WO 87/01728.

“TNF receptor” means the human receptor for TNF, and species and sequence variants thereof having at least a portion of the biological receptor activity of mature TNFR. P75 TNF Receptor molecule is the extracellular domain of p75 TNF receptor, which is from a family of structurally homologous receptors which includes the p55 TNF receptor. TNFα and TNFβ (TNF ligands) compete for binding to the p55 and p75 TNF receptors. The x-ray crystal structure of the complex formed by the extracellular domain of the human p55 TNF receptor and TNFβ has been determined (Banner et al. Cell 73:431, 1993, incorporated herein by reference).

Growth Hormone Proteins

“Growth Hormone” or “GH” means the human growth hormone protein and species and sequence variants thereof, and includes, but is not limited to, the 191 single-chain amino acid human sequence of GH. Thus, GH can be the native, full-length protein or can be a truncated fragment or a sequence variant that retains at least a portion of the biological activity of the native protein. Effects of GH on the tissues of the body can generally be described as anabolic. Like most other protein hormones, GH acts by interacting with a specific plasma membrane receptor, referred to as growth hormone receptor. There are two known types of human GH (hereinafter “hGH”) derived from the pituitary gland: one having a molecular weight of about 22,000 daltons (22 kD hGH) and the other having a molecular weight of about 20,000 daltons (20 kD hGH). The 20 kD HGH has an amino acid sequence that corresponds to that of 22 kD hGH consisting of 191 amino acids except that 15 amino acid residues from the 32^ndto the 46^thof 22 kD hGH are missing. Some reports have shown that the 20 kD hGH has been found to exhibit lower risks and higher activity than 22 kD hGH. The invention also contemplates use of the 20 kD hGH as being appropriate for use as a biologically active polypeptide for the compositions of this disclosure.

The invention contemplates inclusion in the compositions of any GH homologous sequences, sequence fragments that are natural, such as from primates, mammals (including domestic animals), and non-natural sequence variants which retain at least a portion of the biologic activity or biological function of GH and/or that are useful for preventing, treating, mediating, or ameliorating a GH-related disease, deficiency, disorder or condition. Non-mammalian GH sequences are well-described in the literature. For example, a sequence alignment of fish GHs can be found in Genetics and Molecular Biology 2003 26 p. 295-300. An analysis of the evolution of avian GH sequences is presented in Journal of Evolutionary Biology 2006 19 p. 844-854. In addition, native sequences homologous to human GH may be found by standard homology searching techniques, such as NCBI BLAST.

In one embodiment, the GH incorporated into the subject compositions can be a recombinant polypeptide with a sequence corresponding to a protein found in nature. In another embodiment, the GH can be a sequence variant, fragment, homolog, or a mimetics of a natural sequence that retains at least a portion of the biological activity of the native GH. Table 3f provides a non-limiting list of sequences of GHs from a wide variety of mammalian species that are encompassed by the compositions of this disclosure. Any of these GH sequences or homologous derivatives constructed by shuffling individual mutations between species or families may be useful for the fusion proteins of this invention. In some embodiments of the compositions disclosed herein, where the biologically active moiety can be a biologically active peptide (BP), the BP can comprise a peptide sequence that exhibits at least (about) 80% sequence identity (e.g., at least (about) 81%, at least (about) 82%, at least (about) 83%, at least (about) 84%, at least (about) 85%, at least (about) 86%, at least (about) 87%, at least (about) 88%, at least (about) 89%, at least (about) 90%, at least (about) 91%, at least (about) 92%, at least (about) 93%, at least (about) 94%, at least (about) 95%, at least (about) 96%, at least (about) 97%, at least (about) 98%, at least (about) 99%, or 100% sequence identity) to an amino acid sequence of a growth hormone set forth in Table 3f.

TABLE 3f

Growth Hormone Amino Acid Sequences from Animal Species

SEQ

Species GH
ID NO:
Amino Acid Sequence

Man
422
FPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNPQTSL

CFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGAS

DSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDDALL

KNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF

Pig
423
FPAMPLSSLFANAVLRAQHLHQLAADTYKEFERAYIPEGQRYSIQNAQAAF

CFSETIPAPTGKDEAQQRSDVELLRFSLLLIQSWLGPVQFLSRVFTNSLVFGT

SDRVYEKLKDLEEGIQALMRELEDGSPRAGQILKQTYDKFDTNLRSDDALL

KNYGLLSCFKKDLHKAETYLRV MKCRRFVESSCAF

Alpaca
424
FPAMPLSSLFANAVLRAQHLHQLAADTYKEFERTYIPEGQRYSIQNAQAAF

CFSETIPAPTGKDEAQQRSDVELLRFSLLLIQSWLGPVQFLSRVFTNSLVFGT

SDRVYEKLKDLEEGIQALMRELEDGSPRAGQILRQTYDKFDTNLRSDDALL

KNYGLLSCFKKDLHKAETYLRV MKCRRFVESSCAF

Camel
425
FPAMPLSSLFANAVLRAQHLHQLAADTYKEFERTYIPEGQRYSIQNAQAAF

CFSETIPAPTGKDEAQQRSDVELLRFSLLLIQSWLGPVQFLSRVFTNSLVFGT

SDRVYEKLKDLEEGIQALMRELEDGSPRAGQILRQTYDKFDTNLRSDDALL

KNYGLLSCFKKDLHKAETYLRV MKCRRFVESSCAF

Horse
426
FPAMPLSSLFANAVLRAQHLHQLAADTYKEFERAYIPEGQRYSIQNAQAAF

CFSETIPAPTGKDEAQQRSDMELLRFSLLLIQSWLGPVQLLSRVFTNSLVFG

TSDRVYEKLRDLEEGIQALMRELEDGSPRAGQILKQTYDKFDTNLRSDDAL

LKNYGLLSCFKKDLHKAETYLRV MKCRRFVESSCAF

Elephant
427
FPAMPLSSLFANAVLRAQHLHQLAADTYKEFERAYIPEGQRYSIQNAQAAF

CFSETIPAPTGKDEAQQRSDVELLRFSLLLIQSWLGPVQFLSRVFTNSLVFGT

SDRVYEKLKDLEEGIQALMRELEDGSPRPGQVLKQTYDKFDTNMRSDDAL

LKNYGLLSCFKKDLHKAETYLRV MKCRRFVESSCAF

Red fox
428
FPAMPLSSLFANAVLRAQHLHQLAADTYKEFERAYIPEGQRYSIQNAQAAF

CFSETIPAPTGKDEAQQRSDVELLRFSLVLIQSWLGPLQFLSRVFTNSLVFGT

SDRVYEKLKDLEEGIQALMRELEDGSPRAGQILKQTYDKFDTNLRSDDALL

KNYGLLSCFKKDLHKAETYLRV MKCRRFVESSCAF

Dog
429
FPAMPLSSLFANAVLRAQHLHQLAADTYKEFERAYIPEGQRYSIQNAQAAF

CFSETIPAPTGKDEAQQRSDVELLRFSLLLIQSWLGPVQFLSRVFTNSLVFGT

SDRVYEKLKDLEEGIQALMRELEDGSPRAGQILKQTYDKFDTNLRSDDALL

KNYGLLSCFKKDLHKAETYLRV MKCRRFVESSCAF

Cat
430
FPAMPLSSLFANAVLRAQHLHQLAADTYKEFERAYIPEGQRYSIQNAQAAF

CFSETIPAPTGKDEAQQRSDVELLRFSLLLIQSWLGPVQFLSRVFTNSLVFGT

SDRVYEKLKDLEEGIQALMRELEDGSPRGGQILKQTYDKFDTNLRSDDALL

KNYGLLSCFKKDLHKAETYLRV MKCRRFVESSCAF

American
431
FPAMPLSSLFANAVLRAQHLHQLAADTYKDFERAYIPEGQRYSIQNAQAAF

mink

CFSETIPAPTGKDEAQQRSDMELLRFSLLLIQSWLGPVQFLSRVFTNSLVFGT

SDRVYEKLKDLEEGIQALMRELEDGSPRAGPILKQTYDKFDTNLRSDDALL

KNYGLLSCFKKDLHKAETYLRV MKCRRFVESSCAF

Finback
432
FPAMPLSSLFANAVLRAQHLHQLAADTYKEFERAYIPEGQRYSIQNAQAAF

whale

CFSETIPAPTGKDEAQQRSDVELLRFSLLLIQSWLGPVQFLSRVFTNSLVFGT

SDRVYEKLKDLEEGIQALMRELEDGSPRAGQILKQTYDKFDTNMRSDDAL

LKNYGLLSCFKKDLHKAETYLRV MKCRRFVESSCAF

Dolphin
433
FPAMPLSSLFANAVLRAQHLHQLAADTYKEFERAYIPEGQRYSIQNTQAAF

CFSETIPAPTGKDEAQQRSDVELLRFSLLLIQSWLGPVQFLSRVFTNSLVFGT

SDRVYEKLKDLEEGIQALMRELEDGSPRAGQILKQTYDKFDTNMRSDDAL

LKNYGLLSCFKKDLHKAETYLRV MKCRRFVESSCAF

Hippo
434
FPAMPLSSLFANAVLRAQHLHQLAADTYKEFERAYIPEGQRYSIQNTQAAF

CFSETIPAPTGKDEAQQRSDVELLRFSLLLIQSWLGPVQFLSRVFTNSLVFGT

SDRVYEKLKDLEEGIQALMRELEDGSPRAGQILKQTYDKFDTNMRSDDAL

LKNYGLLSCFKKDLHKAETYLRV MKCRRFVESSCAF

Rabbit
435
FPAMPLSSLFANAVLRAQHLHQLAADTYKEFERAYIPEGQRYSIQNAQAAF

CFSETIPAPTGKDEAQQRSDMELLRFSLLLIQSWLGPVQFLSRAFTNTLVFG

TSDRVYEKLKDLEEGIQALMRELEDGSPRVGQLLKQTYDKFDTNLRGDDA

LLKNYGLLSCFKKDLHKAETYLRV MKCRRFVESSCVF

Rat
436
FPAMPLSSLFANAVLRAQHLHQLAADTYKEFERAYIPEGQRYSIQNAQAAF

CFSETIPAPTGKEEAQQRTDMELLRFSLLLIQSWLGPVQFLSRIFTNSLMFGT

SDRVYEKLKDLEEGIQALMQELEDGSPRIGQILKQTYDKFDANMRSDDALL

KNYGLLSCFKKDLHKAETYLRV MKCRRFAESSCAF

Mouse
437
FPAMPLSSLFSNAVLRAQHLHQLAADTYKEFERAYIPEGQRYSIQNAQAAF

CFSETIPAPTGKEEAQQRTDMELLRFSLLLIQSWLGPVQFLSRIFTNSLMFGT

SDRVYEKLKDLEEGIQALMQELEDGSPRVGQILKQTYDKFDANMRSDDAL

LKNYGLLSCFKKDLHKAETYLRV MKCRRFVESSCAF

Hamster
438
FPAMPLSSLFANAVLRAQHLHQLAADTYKEFERAYIPEGQRYSIQNAQTAF

CFSETIPAPTGKEEAQQRSDMELLRFSLLLIQSWLGPVQFLSRIFTNSLMFGT

SDRVYEKLKDLEEGIQALMQELEDGSPRVGQILKQTYDKFDTNMRSDDAL

LKNYGLLSCFKKDLHKAETYLRV MKCRRFVESSCAF

Mole rat
439
FPAMPLSNLFANAVLRAQHLHQLAADTYKEFERAYIPEGQRYSIQNAQAAF

CFSETIPAPTGKEEAQQRSDMELLRFSLLLIQSWLGPVQFLSRVFTNSLVFGT

SDRVFEKLKDLEEGIQALMRELEDGSLRAGQLLKQTYDKFDTNMRSDDAL

LKNYGLLSCFKKDLHKAETYLRV MKCRRFVESSCAF

Guinea pig
440
FPAMPLSSLFGNAVLRAQHLHQLAADTYKEFERTYIPEGQRYSIHNTQTAF

CFSETIPAPTDKEEAQQRSDVELLHFSLLLIQSWLGPVQFLSRVFTNSLVFGT

SDRVYEKLKDLEEGIQALMRELEDGTPRAGQILKQTYDKFDTNLRSNDALL

KNYGLLSCFRKDLHRTETYLRV MKCRRFVESSCAF

Ox
441
AFPAMSLSGLFANAVLRAQHLHQLAADTFKEFERTYIPEGQRYSIQNTQVA

FCFSETIPAPTGKNEAQQKSDLELLRISLLLIQSWLGPLQFLSRVFTNSLVFGT

SDRVYEKLKDLEEGILALMRELEDGTPRAGQILKQTYDKFDTNMRSDDAL

LKNYGLLSCFRKDLHKTETYLRV MKCRRFGEASCAF

Sheep/Goat
442
AFPAMSLSGLFANAVLRAQHLHQLAADTFKEFERTYIPEGQRYSIQNTQVA

FCFSETIPAPTGKNEAQQKSDLELLRISLLLIQSWLGPLQFLSRVFTNSLVFGT

SDRVYEKLKDLEEGILALMRELEDVTPRAGQILKQTYDKFDTNMRSDDAL

LKNYGLLSCFRKDLHKTETYLRV MKCRRFGEASCAF

Red deer
443
FPAMSLSGLFANAVLRAQHLHQLAADTFKEFERTYIPEGQRYSIQNTQVAF

CFSETIPAPTGKNEAQQKSDLELLRISLLLIQSWLGPLQFLSRVFTNSLVFGTS

DRVYEKLKDLEEGILALMRELEDGTPRAGQILKQTYDKFDTNMRSDDALL

KNYGLLSCFRKDLHKTETYLRV MKCRRFGEASCAF

Giraffe
444
AFPAMSLSGLFANAVLRAQHLHQLAADTFKEFERTYIPEGQRYSIQNTQVA

FCFSETIPAPTGKNEAQQKSDLELLRISLLLIQSWLGPLQFLSRVFSNSLVFGT

SDRVYEKLKDLEEGILALMRELEDGTPRAGQILKQTYDKFDTNMRSDDAL

LKNYGLLSCFRKDLHKTETYLRV MKCRRFGEASCAF

Chevrotain-
445
FPAMSLSGLFANAVLRVQHLHQLAADTFKEFERTYIPEGQRYSIQNTQVAF

1

CFSETIPAPTGKNEAQQKSDLELLRISLLLIQSWLGPLQFLSRVFTNSLVFGTS

DRVYEKLKDLEEGILALMRELEDGPPRAGQILKQTYDKFDTNMRSDDALL

KNYGLLSCFRKDLHKTETYLRV MKCRRFGEASCAF

Slow loris
446
FPAMPLSSLFANAVLRAQHLHQLAADTYKEFERAYIPEGQRYSIQNAQAAF

CFSETIPAPTGKDEAQQRSDMELLRFSLLLIQSWLGPVQLLSRVFTNSLVLG

TSDRVYEKLKDLEEGIQALMRELEDGSPRVGQILKQTYDKFDTNLRSDDAL

LKNYGLLSCFKKDLHKAETYLRV MKCRRFVESSCAF

Marmoset
447
FPTIPLSRLLDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNPQTSL

CFSESIPTPASKKETQQKSNLELLRMSLLLIQSWFEPVQFLRSVFANSLLYGV

SDSDVYEYLKDLEEGIQTLMGRLEDGSPRTGEIFMQTYRKFDVNSQNNDAL

LKNYGLLYCFRKDMDKVETFLRI VQCR-SVEGSCGF

BrTailed
448
FPAMPLSSLFANAVLRAQHLHQLVADTYKEFERTYIPEAQRHSIQSTQTAFC

Possum

FSETIPAPTGKDEAQQRSDVELLRFSLLLIQSWLSPVQFLSRVFTNSLVFGTS

DRVYEKLRDLEEGIQALMQELEDGSSRGGLVLKTTYDKFDTNLRSDEALL

KNYGLLSCFKKDLHKAETYLRV MKCRRFVESSCAF

Monkey
449
FPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNPQTSL

(rhesus)

CFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGTS

YSDVYDLLKDLEEGIQTLMGRLEDGSSRTGQIFKQTYSKFDTNSHNNDALL

KNYGLLYCFRKDMDKIETFLRI VQCR-SVEGSCGF

Cytokines

The BP can be a cytokine. Cytokines encompassed by the inventive compositions can have utility in the treatment in various therapeutic or disease categories, including but not limited to cancer, rheumatoid arthritis, multiple sclerosis, myasthenia gravis, systemic lupus erythematosus, Alzheimer's disease, Schizophrenia, viral infections (e.g., chronic hepatitis C, AIDS), allergic asthma, retinal neurodegenerative processes, metabolic disorder, insulin resistance, and diabetic cardiomyopathy. Cytokines can be especially useful in treating inflammatory conditions and autoimmune conditions.

The BP can be one or more cytokines. The cytokines refer to proteins (e.g., chemokines, interferons, lymphokines, interleukins, and tumor necrosis factors) released by cells which can affect cell behavior. Cytokines can be produced by a broad range of cells, including immune cells such as macrophages, B lymphocytes, T lymphocytes and mast cells, as well as endothelial cells, fibroblasts, and various stromal cells. A given cytokine can be produced by more than one type of cell. Cytokines can be involved in producing systemic or local immunomodulatory effects.

Certain cytokines can function as pro-inflammatory cytokines. Pro-inflammatory cytokines refer to cytokines involved in inducing or amplifying an inflammatory reaction. Pro-inflammatory cytokines can work with various cells of the immune system, such as neutrophils and leukocytes, to generate an immune response. Certain cytokines can function as anti-inflammatory cytokines. Anti-inflammatory cytokines refer to cytokines involved in the reduction of an inflammatory reaction. Anti-inflammatory cytokines, in some cases, can regulate a pro-inflammatory cytokine response. Some cytokines can function as both pro- and anti-inflammatory cytokines.

Examples of cytokines that are regulatable by systems and compositions of the present disclosure include, but are not limited to lymphokines, monokines, and traditional polypeptide hormones except for human growth hormone. Included among the cytokines are parathyroid hormone; thyroxine; insulin; proinsulin; relaxin; prorelaxin; glycoprotein hormones such as follicle stimulating hormone (FSH), thyroid stimulating hormone (TSH), and luteinizing hormone (LH); hepatic growth factor; fibroblast growth factor; prolactin; placental lactogen; tumor necrosis factor-alpha; mullerian-inhibiting substance; mouse gonadotropin-associated peptide; inhibin; activin; vascular endothelial growth factor; integrin; thrombopoietin (TPO); nerve growth factors such as NGF-alpha; platelet-growth factor; transforming growth factors (TGFs) such as TGF-alpha, TGF-beta, TGF-beta1, TGF-beta2, and TGF-beta3; insulin-like growth factor-I and —II; erythropoietin (EPO); Flt-3L; stem cell factor (SCF); osteoinductive factors; interferons (IFNs) such as IFN-α, IFN-β, IFN-γ; colony stimulating factors (CSFs) such as macrophage-CSF (M-CSF); granulocyte-macrophage-CSF (GM-CSF); granulocyte-CSF (G-CSF); macrophage stimulating factor (MSP); interleukins (ILs) such as IL-1, IL-1a, IL-1b, IL-IRA, IL-18, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-11, IL-12, IL-12b, IL-13, IL-14, IL-15, IL-16, IL-17, IL-20; a tumor necrosis factor such as CD154, LT-beta, TNF-alpha, TNF-beta, 4-1BBL, APRIL, CD70, CD153, CD178, GITRL, LIGHT, OX40L, TALL-1, TRAIL, TWEAK, TRANCE; and other polypeptide factors including LIF, oncostatin M (OSM) and kit ligand (KL). Cytokine receptors refer to the receptor proteins which bind cytokines. Cytokine receptors may be both membrane-bound and soluble.

The target polynucleotide can encode for a cytokine. Non-limiting examples of cytokines include 4-1BBL, activin βA, activin βB, activin βC, activin 13E, artemin (ARTN), BAFF/BLyS/TNFSF138, BMP10, BMP15, BMP2, BMP3, BMP4, BMP5, BMP6, BMP7, BMP8a, BMP8b, bone morphogenetic protein 1 (BMP1), CCL1/TCA3, CCL11, CCL12/MCP-5, CCL13/MCP-4, CCL14, CCL15, CCL16, CCL17/TARC, CCL18, CCL19, CCL2/MCP-1, CCL20, CCL21, CCL22/MDC, CCL23, CCL24, CCL25, CCL26, CCL27, CCL28, CCL3, CCL3L3, CCL4, CCL4L1/LAG-1, CCL5, CCL6, CCL7, CCL8, CCL9, CD153/CD30L/TNFSF8, CD40L/CD154/TNFSF5, CD40LG, CD70, CD70/CD27L/TNFSF7, CLCF1, c-MPL/CD110/TPOR, CNTF, CX3CL1, CXCL1, CXCL10, CXCL11, CXCL12, CXCL13, CXCL14, CXCL15, CXCL16, CXCL17, CXCL2/MIP-2, CXCL3, CXCL4, CXCL5, CXCL6, CXCL7/Ppbp, CXCL9, EDA-A1, FAM19A1, FAM19A2, FAM19A3, FAM19A4, FAM19A5, Fas Ligand/FASLG/CD95L/CD178, GDF10, GDF11, GDF15, GDF2, GDF3, GDF4, GDF5, GDF6, GDF7, GDF8, GDF9, glial cell line-derived neurotrophic factor (GDNF), growth differentiation factor 1 (GDF1), IFNA1, IFNA10, IFNA13, IFNA14, IFNA2, IFNA4, IFNA5/IFNaG, IFNA7, IFNA8, IFNB1, IFNE, IFNG, IFNZ, IFNω/IFNW1, IL11, IL18, IL18BP, ILIA, IL1B, IL1F10, IL1F3/IL1RA, IL1F5, IL1F6, IL1F7, IL1F8, IL1F9, IL1RL2, IL31, IL33, IL6, IL8/CXCL8, inhibin-A, inhibin-B, Leptin, LIF, LTA/TNFB/TNFSF1, LTB/TNFC, neurturin (NRTN), OSM, OX-40L/TNFSF4/CD252, persephin (PSPN), RANKL/OPGL/TNFSF11(CD254), TL1A/TNFSF15, TNFA, TNF-alpha/TNFA, TNFSF10/TRAIL/APO-2L(CD253), TNFSF12, TNFSF13, TNFSF14/LIGHT/CD258, XCL1, and XCL2. In some embodiments, the target gene encodes for an immune checkpoint inhibitor. Non-limiting examples of such immune checkpoint inhibitors include PD-1, CTLA-4, LAG3, TIM-3, A2AR, B7-H3, B7-H4, BTLA, IDO, KIR, and VISTA. In some embodiments, the target gene encodes for a T cell receptor (TCR) alpha, beta, gamma, and/or delta chain.

In some cases, the cytokine can be a chemokine. The chemokine can be selected from a group including, but not limited to, ARMCX2, BCA-1/CXCL13, CCL11, CCL12/MCP-5, CCL13/MCP-4, CCL15/MIP-5/MIP-1 delta, CCL16/HCC-4/NCC4, CCL17/TARC, CCL18/PARC/MIP-4, CCL19/MIP-3b, CCL2/MCP-1, CCL20/MIP-3 alpha/MIP3A, CCL21/6Ckine, CCL22/MDC, CCL23/MIP 3, CCL24/Eotaxin-2/MPIF-2, CCL25/TECK, CCL26/Eotaxin-3, CCL27/CTACK, CCL28, CCL3/Mip1a, CCL4/MIP1B, CCL4L1/LAG-1, CCL5/RANTES, CCL6/C10, CCL8/MCP-2, CCL9, CML5, CXCL1, CXCL10/Crg-2, CXCL12/SDF-1 beta, CXCL14/BRAK, CXCL15/Lungkine, CXCL16/SR-PSOX, CXCL17, CXCL2/MIP-2, CXCL3/GRO gamma, CXCL4/PF4, CXCL5, CXCL6/GCP-2, CXCL9/MIG, FAM19A1, FAM19A2, FAM19A3, FAM19A4/TAFA4, FAM19A5, Fractalkine/CX3CL1, I-309/CCL1/TCA-3, IL-8/CXCL8, MCP-3/CCL7, NAP-2/PPBP/CXCL7, XCL2, and IL10.

Table 3g provides a non-limiting list of such sequences of BPs that are encompassed by the compositions of this disclosure. In some embodiments of the compositions disclosed herein, where the biologically active moiety can be a biologically active peptide (BP), the BP can comprise a peptide sequence that exhibits at least (about) 80% sequence identity (e.g., at least (about) 81%, at least (about) 82%, at least (about) 83%, at least (about) 84%, at least (about) 85%, at least (about) 86%, at least (about) 87%, at least (about) 88%, at least (about) 89%, at least (about) 90%, at least (about) 91%, at least (about) 92%, at least (about) 93%, at least (about) 94%, at least (about) 95%, at least (about) 96%, at least (about) 97%, at least (about) 98%, at least (about) 99%, or 100% sequence identity) to an amino acid sequence of a cytokine set forth in Table 3g.

TABLE 3g

Cytokines for Conjugation

Name of

Protein
SEQ ID

(Synonym)
NO:
Amino Acid Sequence

Anti-CD3

See U.S. Pat. Nos. 5,885,573 and 6,491,916

IL-1ra, human
450
MEICRGLRSHLITLLLFLFHSETICRPSGRKSSKMQAFRIWDVNQKTFYLR

full length

NNQLVAGYLQGPNVNLEEKIDVVPIEPHALFLGIHGGKMCLSCVKSGDE

TRLQLEAVNITDLSENRKQDKRFAFIRSDSGPTTSFESAACPGWFLCTAM

EADQPVSLTNMPDEGVMVTKFYFQEDE

IL-1ra, Dog
451
METCRCPLSYLISFLLFLPHSETACRLGKRPCRMQAFRIWDVNQKTFYLR

NNQLVAGYLQGSNTKLEEKLDVVPVEPHAVFLGIHGGKLCLACVKSGD

ETRLQLEAVNITDLSKNKDQDKRFTFILSDSGPTTSFESAACPGWFLCTAL

EADRPVSLTNRPEEAMMVTKFYFQKE

IL-1ra, Rabbit
452
MRPSRSTRRHLISLLLFLFHSETACRPSGKRPCRMQAFRIWDVNQKTFYL

RNNQLVAGYLQGPNAKLEERIDVVPLEPQLLFLGIQRGKLCLSCVKSGD

KMKLHLEAVNITDLGKNKEQDKRFTFIRSNSGPTTTFESASCPGWFLCTA

LEADQPVSLTNTPDDSIVVTKFYFQED

IL-1ra, Rat
453
MEICRGPYSHLISLLLILLFRSESAGHIPAGKRPCKMQAFRIWDTNQKTFY

LRNNQLIAGYLQGPNTKLEEKIDMVPIDFRNVFLGIHGGKLCLSCVKSGD

DTKLQLEEVNITDLNKNKEEDKRFTFIRSETGPTTSFESLACPGWFLCTTL

EADHPVSLTNTPKEPCTVTKFYFQED

IL-1ra, Mouse
454
MEICWGPYSHLISLLLILLFHSEAACRPSGKRPCKMQAFRIWDTNQKTFY

LRNNQLIAGYLQGPNIKLEEKIDMVPIDLHSVFLGIHGGKLCLSCAKSGD

DIKLQLEEVNITDLSKNKEEDKRFTFIRSEKGPTTSFESAACPGWFLCTTL

EADRPVSLTNTPEEPLIVTKFYFQEDQ

Anakinra
455
MRPSGRKSSKMQAFRIWDVNQKTFYLRNNQLVAGYLQGPNVNLEEKID

VVPIEPHALFLGIHGGKMCLSCVKSGDETRLQLEAVNITDLSENRKQDKR

FAFIRSDSGPTTSFESAACPGWFLCTAMEADQPVSLTNMPDEGVMVTKF

YFQEDE

IL-10
456
MHSSALLCCLVLLTGVRASPGQGTQSENSCTHFPGNLPNMLRDLRDAFS

RVKTFFQMKDQLDNLLLKESLLEDFKGYLGCQALSEMIQFYLEEVMPQA

ENQDPDIKAHVNSLGENLKTLRLRLRRCHRFLPCENKSKAVEQVKNAFN

KLQEKGIYKAMSEFDIFINYIEAYMTMKIRN

In some cases, the BP can be IL-10. IL-10 can be an effective anti-inflammatory cytokine that represses the production of the proinflammatory cytokines and chemokines. IL-10 is the one of the major TH2-type cytokine that increases humoral immune responses and lowers cell-mediated immune reactions. IL-10 can be useful for the treatment of autoimmune diseases and inflammatory diseases such as rheumatoid arthritis, multiple sclerosis, myasthenia gravis, systemic lupus erythematosus, Alzheimer's, Schizophrenia, allergic asthma, retinal neurodegenerative processes, and diabetes.

In some cases, IL-10 can be modified to improve stability and decrease prolytic degradation. The modification can be one or more amide bond substitution. In some cases, one or more amide bonds within backbone of IL-10 can be substituted to achieve the abovementioned effects. The one or more amide linkages (—CONH—) in IL-10 can be replaced with a linkage which is an isostere of an amide linkage, such as —CH₂NH—, —CH₂S—, —CH₂CH₂—, —CH═CH— (cis and trans), —COCH₂—, —CH(OH)CH₂— or —CH₂SO—. Furthermore, the amide linkages in IL-10 can also be replaced by a reduced isostere pseudopeptide bond. See Couder et al. (1993) Int. J. Peptide Protein Res. 41:181-184, which is hereby incorporated by reference in its entirety.

The one or more acidic amino acids, including aspartic acid, glutamic acid, homoglutamic acid, tyrosine, alkyl, aryl, arylalkyl, and heteroaryl sulfonamides of 2,4-diaminopriopionic acid, ornithine or lysine and tetrazole-substituted alkyl amino acids; and side chain amide residues such as asparagine, glutamine, and alkyl or aromatic substituted derivatives of asparagine or glutamine; as well as hydroxyl-containing amino acids, including serine, threonine, homoserine, 2,3-diaminopropionic acid, and alkyl or aromatic substituted derivatives of serine or threonine can be substituted.

The one or more hydrophobic amino acids in IL-10 such as alanine, leucine, isoleucine, valine, norleucine, (S)-2-aminobutyric acid, (5)-cyclohexylalanine or other simple alpha-amino acids can be substituted with amino acids including, but not limited to, an aliphatic side chain from C1-C10 carbons including branched, cyclic and straight chain alkyl, alkenyl or alkynyl substitutions

In some cases, the one or more hydrophobic amino acids in IL-10 such as can be substituted substitution of aromatic-substituted hydrophobic amino acids, including phenylalanine, tryptophan, tyrosine, sulfotyrosine, biphenylalanine, 1-naphthylalanine, 2-naphthylalanine, 2-benzothienylalanine, 3-benzothienylalanine, histidine, including amino, alkylamino, dialkylamino, aza, halogenated (fluoro, chloro, bromo, or iodo) or alkoxy (from C₁-C₄)-substituted forms of the above-listed aromatic amino acids, illustrative examples of which are: 2-, 3- or 4-aminophenylalanine, 2-, 3- or 4-chlorophenylalanine, 2-, 3- or 4-methylphenylalanine, 2-, 3- or 4-methoxyphenylalanine, 5-amino-, 5-chloro-, 5-methyl- or 5-methoxytryptophan, 2′-, 3′-, or 4′-amino-, 2′-, 3′-, or 4′-chloro-, 2, 3, or 4-biphenylalanine, 2′-, 3′-, or 4′-methyl-, 2-, 3- or 4-biphenylalanine, and 2- or 3-pyridylalanine;

The one or more hydrophobic amino acids in IL-10 such as phenylalanine, tryptophan, tyrosine, sulfotyrosine, biphenylalanine, 1-naphthylalanine, 2-naphthylalanine, 2-benzothienylalanine, 3-benzothienylalanine, histidine, including amino, alkylamino, dialkylamino, aza, halogenated (fluoro, chloro, bromo, or iodo) or alkox can be substituted by aromatic amino acids including: 2-, 3- or 4-aminophenylalanine, 2-, 3- or 4-chlorophenylalanine, 2-, 3- or 4-methylphenylalanine, 2-, 3- or 4-methoxyphenylalanine, 5-amino-, 5-chloro-, 5-methyl- or 5-methoxytryptophan, 2′-, 3′-, or 4′-amino-, 2′-, 3′-, or 4′-chloro-, 2, 3, or 4-biphenylalanine, 2′-, 3′-, or 4′-methyl-, 2-, 3- or 4-biphenylalanine, and 2- or 3-pyridylalanine

The amino acids comprising basic side chains, including arginine, lysine, histidine, ornithine, 2,3-diaminopropionic acid, homoarginine, including alkyl, alkenyl, or aryl-substituted derivatives of the previous amino acids, can be substituted. Examples are N-epsilon-isopropyl-lysine, 3-(4-tetrahydropyridyl)-glycine, 3-(4-tetrahydropyridyl)-alanine, N,N-gamma, gamma′-diethyl-homoarginine, alpha-methyl-arginine, alpha-methyl-2,3-diaminopropionic acid, alpha-methyl-histidine, and alpha-methyl-ornithine where the alkyl group occupies the pro-R position of the alpha-carbon. The modified IL-10 can comprise amides formed from any combination of alkyl, aromatic, heteroaromatic, ornithine, or 2,3-diaminopropionic acid, carboxylic acids or any of the many well-known activated derivatives such as acid chlorides, active esters, active azolides and related derivatives, lysine, and ornithine.

In some cases, IL-10 comprises can comprise one or more naturally occurring L-amino acids, synthetic L-amino acids, and/or D-enantiomers of an amino acid. The IL-10 polypeptide can comprise one or more of the following amino acids: ω-aminodecanoic acid, ω-aminotetradecanoic acid, cyclohexylalanine, α,γ-diaminobutyric acid, α,β-diaminopropionic acid, δ-amino valeric acid, t-butylalanine, t-butylglycine, N-methylisoleucine, phenylglycine, cyclohexylalanine, norleucine, naphthylalanine, ornithine, citrulline, 4-chlorophenylalanine, 2-fluorophenylalanine, pyridylalanine 3-benzothienyl alanine, hydroxyproline, β-alanine, o-aminobenzoic acid, m-aminobenzoic acid, p-aminobenzoic acid, m-aminomethylbenzoic acid, 2,3-diaminopropionic acid, α-aminoisobutyric acid, N-methylglycine(sarcosine), 3-fluorophenylalanine, 4-fluorophenylalanine, penicillamine, 1,2,3,4-tetrahydroisoquinoline-3-carboxylic acid, β-2-thienylalanine, methionine sulfoxide, homoarginine, N-acetyl lysine, 2,4-diamino butyric acid, rho-aminophenylalanine, N-methylvaline, homocysteine, homoserine, ε-amino hexanoic acid, ω-aminohexanoic acid, ω-aminoheptanoic acid, ω-aminooctanoic acid, and 2,3-diaminobutyric acid.

IL-10 can comprise a cysteine residue or a cysteine which can act as linker to another peptide via a disulfide linkage or to provide for cyclization of the IL-10 polypeptide. Methods of introducing a cysteine or cysteine analog are known in the art; see, e.g., U.S. Pat. No. 8,067,532. An IL-10 polypeptide can be cyclized. Other means of cyclization include introduction of an oxime linker or a lanthionine linker; see, e.g., U.S. Pat. No. 8,044,175. Any combination of amino acids (or non-amino acid moieties) that can form a cyclizing bond can be used and/or introduced. A cyclizing bond can be generated with any combination of amino acids (or with an amino acid and —(CH₂)_nCO— or —(CH₂)_nC₆H₄—CO—) with functional groups which allow for the introduction of a bridge. Some examples are disulfides, disulfide mimetics such as the —(CH₂)_n-carba bridge, thioacetal, thioether bridges (cystathionine or lanthionine) and bridges containing esters and ethers.

The IL-10 can be substituted with an N-alkyl, aryl, or backbone crosslinking to construct lactams and other cyclic structures, C-terminal hydroxymethyl derivatives, o-modified derivatives, N-terminally modified derivatives including substituted amides such as alkylamides and hydrazides. In some cases, an IL-10 polypeptide is a retroinverso analog.

IL-10 can be IL-10 can be native protein, peptide fragment IL-10, or modified peptide, having at least a portion of the biological activity of native IL-10. IL-10 can be modified to improve intracellular uptake. One such modification can be attachment of a protein transduction domain. The protein transduction domain can be attached to the C-terminus of the IL-10. Alternatively, the protein transduction domain can be attached to the N-terminus of the IL-10. The protein transduction domain can be attached to IL-10 via covalent bond. The protein transduction domain can be chosen from any of the sequences listed in Table 3h.

TABLE 3h

Exemplary protein transduction domains

SEQ ID NO:
Amino Acid Sequence

457
YGRKKRRQRRR

458
RRQRRTSKLMKR

459
GWTLNSAGYLLGKINLKALAALAKKIL

460
KALAWEAKLAKALAKALAKHLAKALAKALKCEA

461
RQIKIWFQNRRMKWKK

462
YGRKKRRQRRR

463
RKKRRQRRR

464
YGRKKRRQRRR

465
RKKRRQRR

466
YARAAARQARA

467
THRLPRRRRRR

468
GGRRARRRRRR

The BP of the subject compositions are not limited to native, full-length polypeptides, but also include recombinant versions as well as biologically and/or pharmacologically active variants or fragments thereof. For example, it will be appreciated that various amino acid substitutions can be made in the GP to create variants without departing from the spirit of the invention with respect to the biological activity or pharmacologic properties of the BP. Examples of conservative substitutions for amino acids in polypeptide sequences are shown in Table 4. However, in embodiments of the compositions of this disclosure in which the sequence identity of the BP is less than 100% compared to a specific sequence disclosed herein, the invention contemplates substitution of any of the other 19 natural L-amino acids for a given amino acid residue of the given BP, which may be at any position within the sequence of the BP, including adjacent amino acid residues. If any one substitution results in an undesirable change in biological activity, then one of the alternative amino acids can be employed and the construct evaluated by the methods described herein, or using any of the techniques and guidelines for conservative and non-conservative mutations set forth, for instance, in U.S. Pat. No. 5,364,934, the contents of which is incorporated by reference in its entirety, or using methods generally known to those of skill in the art. In addition, variants can also include, for instance, polypeptides wherein one or more amino acid residues are added or deleted at the N- or C-terminus of the full-length native amino acid sequence of a BP that retains at least a portion of the biological activity of the native peptide.

TABLE 4

Exemplary conservative amino acid substitutions

Original Residue
Exemplary Substitutions

Ala (A)
val; leu; ile

Arg (R)
lys; gln; asn

Asn (N)
gln; his; Iys; arg

Asp (D)
glu

Cys (C)
ser

Gln (Q)
asn

Glu (E)
asp

Gly (G)
pro

His (H)
asn: gln: Iys: arg

xIle (I)
leu; val; met; ala; phe: norleucine

Leu (L)
norleucine: ile: val; met; ala: phe

Lys (K)
arg: gln: asn

Met (M)
leu; phe; ile

Phe (F)
leu: val: ile; ala

Pro (P)
gly

Ser (S)
thr

Thr (T)
ser

Trp (W)
tyr

Tyr(Y)
trp: phe: thr: ser

Val (V)
ile; leu; met; phe; ala; norleucine

In some embodiments, a BP incorporated into a composition of this disclosure can have a sequence that exhibits at least (about) 80% (or at least (about) 81%, or at least (about) 82%, or at least (about) 83%, or at least (about) 84%, or at least (about) 85%, or at least (about) 86%, or at least (about) 87%, or at least (about) 88%, or at least (about) 89%, or at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99%, or (about) 100% sequence identity to a sequence from Tables 3a-3h. In some embodiments of the compositions of this disclosure, the sequence of the BP can comprise one or more substitutions shown in Table 4.

Antibodies:

In some embodiments of the compositions of this disclosure, the biologically active peptide (BP) can comprise an antibody, such as a monospecific, bispecific, or multispecific antibody. The antibody can comprise a binding domain (or binding moiety) having specific binding affinity to a tumor-specific marker or an antigen of a target cell (or a target cell antigen) (such as one described more fully hereinbelow). The antibody can comprise a binding domain (or binding moiety) that binds to an effector cell antigen (such as one described more fully hereinbelow). In some embodiments of the compositions of this disclosure, the antibody, such as a bispecific or multi-specific antibody, can comprise (1) a binding domain (e.g., a first or second binding domain) having specific binding affinity to a tumor-specific marker or a target cell antigen (such as one described more fully hereinbelow) and (2) another binding domain (e.g., a second or first binding domain) that binds to an effector cell antigen (such as one described more fully hereinbelow). The disclosure contemplates use of single chain binding domains, such as but not limited to Fv, Fab, Fab′, Fab′-SH, nanobodies (also known as single domain antibodies or V_HH), F(ab′)2, linear antibodies, single domain antibody, single domain camelid antibody, single-chain antibody molecules (scFv), multispecific antibodies formed from antibody fragments, and diabodies capable of binding ligands or receptors associated with effector cells and antigens of diseased tissues or cells (such as cancers, tumors, or other malignant tissues). The binding domain (or the first binding domain, or the second binding domain) can be a non-antibody scaffold selected from anticalins, adnectins, fynomers, affilins, affibodies, centyrins, DARPins. The binding domain (or the first binding domain, or the second binding domain) for a tumor cell target can be a variable domain of a T cell receptor engineered to bind major histocompatibility complex (MHC) that is loaded with a peptide fragment of a protein that is overexpressed by tumor cells. In some embodiments of the compositions of this disclosure (such as XTENylated Protease-Activated T Cell Engagers (“XPAT” or “XPATs”), other masked therapeutic antibodies, etc.) the biologically active peptide (BP) can be a bispecific antibody (e.g., a bispecific T-cell engager).

With respect to single chain binding domains (or binding moieties), as is well established, an active antibody fragment (Fv) is the minimum antibody fragment which contains a complete antigen recognition and binding site; consisting of a dimer of one heavy (VH) and one light chain variable domain (VL) in non-covalent association. Each scFv can comprise one VL and one VH. Within each VH and VL chain are three complementarity determining regions (CDRs) that interact to define an antigen binding site on the surface of the VH-VL dimer; the six CDRs of a binding domain (or binding moiety) confer antigen binding specificity to the antibody or single chain binding domain (or binding moiety). In some cases, scFv are created in which each has 3, 4, or 5 CHRs within each binding domain (or binding moiety). Framework sequences flanking the CDRs have a tertiary structure that is essentially conserved in native immunoglobulins across species, and the framework residues (FR) serve to hold the CDRs in their appropriate orientation. The constant domains are not required for binding function, but may aid in stabilizing VH-VL interaction. In some embodiments, the domain of the binding site of the polypeptide can be a pair of VH-VL, VH-VH or VL-VL domains either of the same or of different immunoglobulins, however it is generally preferred to make single chain binding domains (or binding moieties) using the respective VH and VL chains from the parental antibody. The order of VH and VL domains within the polypeptide chain is not limiting for the present invention; the order of domains given may be reversed usually without any loss of function, but it is understood that the VH and VL domains are arranged so that the antigen binding site can properly fold. Thus, the single chain binding domains of the bispecific scFv embodiments of the subject compositions can be in the order (VL-VH)1-(VL-VH)2, wherein “1” and “2” represent the first and second binding domains (or the first and second binding moieties), respectively, or (VL-VH)1-(VH-VL)2, or (VH-VL)1-(VL-VH)2, or (VH-VL)1-(VH-VL)2, wherein the paired binding domains (or binding moieties) are linked by a polypeptide linker as described hereinbelow.

In some embodiments of the compositions, wherein the BP comprises (1) a binding domain (or binding moiety) having specific binding affinity to a tumor-specific marker or an antigen of a target cell (or a target cell antigen) and (2) a binding domain (or binding moiety) that binds to an effector cell antigen, the arrangement of the binding domains (or binding moieties) in an exemplary bispecific single chain antibody disclosed herein may therefore be one in which the first binding domain (or first binding moiety) can be located C-terminally to the second binding domain (or second binding moiety). The arrangement of the V chains can be VH (target cell surface antigen)-VL (target cell surface antigen)-VL (effector cell antigen)-VH (effector cell antigen), VH (target cell surface antigen)-VL (target cell surface antigen)-VH (effector cell antigen)-VL (effector cell antigen), VL (target cell surface antigen)-VH (target cell surface antigen)-VL (effector cell antigen)-VH (effector cell antigen) or VL (target cell surface antigen)-VH (target cell surface antigen)-VH (effector cell antigen)-VL (effector cell antigen). For an arrangement, in which the second binding domain (or second binding moiety) can be located N-terminally to the first binding domain (or first binding moiety), the following orders are possible: VH (effector cell antigen)-VL (effector cell antigen)-VL (target cell surface antigen)-VH (target cell surface antigen), VH (effector cell antigen)-VL (effector cell antigen)-VH (target cell surface antigen)-VL (target cell surface antigen), VL (effector cell antigen)-VH (effector cell antigen)-VL (target cell surface antigen)-VH (target cell surface antigen) or VL (effector cell antigen)-VH (effector cell antigen)-VH (target cell surface antigen)-VL (target cell surface antigen). As used herein, “N-terminally to” or “C-terminally to” and grammatical variants thereof denote relative location within the primary amino acid sequence rather than placement at the absolute N- or C-terminus of the bispecific single chain antibody. Hence, as a non-limiting example, a first binding domain (or first binding moiety) which is “located C-terminally to the second binding domain” denotes that the first binding is located on the carboxyl side of the second binding domain (or second binding moiety) within the bispecific single chain antibody, and does not exclude the possibility that an additional sequence, for example a His-tag, or another compound such as a radioisotope, is located at the C-terminus of the bispecific single chain antibody.

The VL and VH domains can be derived from monoclonal antibodies with binding specificity to the tumor-specific marker or the antigen of the target cell and effector cell antigens, respectively. In other cases, the first and second binding domains (or the first and second binding moieties) each comprise six CDRs derived from monoclonal antibodies with binding specificity to a target cell marker, such as a tumor-specific marker and effector cell antigens, respectively. In other embodiments, the first and second binding domains (or the first and second binding moieties) of the subject compositions can have 3, 4, or 5 CHRs within each binding domain (or each binding moiety). In other embodiments, the embodiments of the invention comprise a first binding domain and a second binding domain wherein each comprises a CDR-H1 region, a CDR-H2 region, a CDR-H3 region, a CDR-L1 region, a CDR-L2 region, and a CDR-H3 region, where each of the regions can be derived from a monoclonal antibody capable of binding the tumor-specific marker or the antigen of the target cell, and effector cell antigens, respectively.

In some embodiments, where the BP comprises a binding domain (or binding moiety) (or a first binding domain, or a second binding domain) having binding affinity for an effector cell antigen, the effector cell antigen can be expressed on the surface of an effector cell selected from a plasma cell, a T cell, a B cell, a cytokine induced killer cell (CIK cell), a mast cell, a dendritic cell, a regulatory T cell (RegT cell), a helper T cell, a myeloid cell, and a NK cell. The effector cell antigen can be expressed on or within an effector cell. The effector cell antigen can be expressed on a T cell, such as a CD4+, CD8+, or natural killer (NK) cell. The effector cell antigen can be expressed on the surface of a T cell. The effector cell antigen can be expressed on a B cell, master cell, dendritic cell, or myeloid cell.

In some embodiments of the compositions herein, the BP can comprise a binding domain (or binding moiety) (or a first binding domain, or a second binding domain) having specific binding affinity to a tumor-specific marker or an antigen of a target cell (or a target cell antigen). The tumor-specific marker or the target cell antigen can be associated with a tumor cell. The tumor cell can be of a tumor, such as stroma cell tumor, fibroblast tumor, myofibroblast tumor, glial cell tumor, epithelial cell tumor, fat cell tumor, immune cell tumor, vascular cell tumor, or smooth muscle cell tumor. The tumor-specific marker or the antigen of the target cell can be selected from the group consisting of alpha 4 integrin, Ang2, B7-H3, B7-H6 (e.g., its natural ligand Nkp30 rather than an antibody fragment), CEACAM5, cMET, CTLA4, FOLR1, EpCAM, CCR5, CD19, HER2, HER2 neu, HER3, HER4, HER1 (EGFR), PD-L1, PSMA, CEA, TROP-2, MUC1(mucin), MUC-2, MUC3, MUC4, MUC5AC, MUC5B, MUC7, MUC16 βhCG, Lewis-Y, CD20, CD33, CD38, CD30, CD56 (NCAM), CD133, ganglioside GD3; 9-O-Acetyl-GD3, GM2, Globo H, fucosyl GM1, GD2, carbonicanhydrase IX, CD44v6, Nectin-4, Sonic Hedgehog (Shh), Wue-1, plasma cell antigen 1, melanoma chondroitin sulfate proteoglycan (MCSP), CCR8, 6-transmembrane epithelial antigen of prostate (STEAP), mesothelin, A33 antigen, prostate stem cell antigen (PSCA), Ly-6, desmoglein 4, fetal acetylcholine receptor (fnAChR), CD25, cancer antigen 19-9 (CA19-9), cancer antigen 125 (CA-125), Muellerian inhibitory substance receptor type II (MISIIR), sialylated Tn antigen (s TN), fibroblast activation antigen (FAP), endosialin (CD248), epidermal growth factor receptor variant III (EGFRvIII), tumor-associated antigen L6 (TAL6), SAS, CD63, TAG72, Thomsen-Friedenreich antigen (TF-antigen), insulin-like growth factor I receptor (IGF-IR), Cora antigen, CD7, CD22, CD70 (e.g., its natural ligand, CD27 rather than an antibody fragment), CD79a, CD79b, G250, MT-MMPs, CA19-9, CA-125, alpha-fetoprotein (AFP), VEGFR1, VEGFR2, DLK1, SP17, ROR1, and EphA2. The tumor-specific marker or the antigen of the target cell can be selected from the group consisting of alpha 4 integrin, Ang2, B7-H3, B7-H6 (e.g., its natural ligand Nkp30 rather than an antibody fragment), CEACAM5, cMET, CTLA4, FOLR1, EpCAM (epithelial cell adhesion molecule), CCR5, CD19, HER2, HER2 neu, HER3, HER4, HER1 (EGFR), PD-L1, PSMA, CEA, TROP-2, MUC1(mucin), MUC-2, MUC3, MUC4, MUC5AC, MUC5B, MUC7, MUC16, βhCG, Lewis-Y, CD20, CD33, CD38, CD30, CD56 (NCAM), CD133, ganglioside GD3, 9-O-acetyl-GD3, GM2, Globo H, fucosyl GM1, GD2, carbonicanhydrase IX, CD44v6, Nectin-4, Sonic Hedgehog (Shh), Wue-1, plasma cell antigen 1 (PC-1), melanoma chondroitin sulfate proteoglycan (MCSP), CCR8, 6-transmembrane epithelial antigen of prostate (STEAP), mesothelin, A33 antigen, prostate stem cell antigen (PSCA), Ly-6, desmoglein 4, fetal acetylcholine receptor (fnAChR), CD25, cancer antigen 19-9 (CA19-9), cancer antigen 125 (CA-125), Muellerian inhibitory substance receptor type II (MISIIR), sialylated Tn antigen (sTN), fibroblast activation antigen (FAP), endosialin (CD248), epidermal growth factor receptor variant III (EGFRvIII), tumor-associated antigen L6 (TAL6), SAS, CD63, TAG72, Thomsen-Friedenreich antigen (TF-antigen), insulin-like growth factor I receptor (IGF-IR), Cora antigen, CD7, CD22, CD70 (e.g., its natural ligand, CD27 rather than an antibody fragment), CD79a, CD79b, G250, MT-MMPs, alpha-fetoprotein (AFP), VEGFR1, VEGFR2, DLK1, SP17, ROR1, EphA2, ENPP3, glypican 3 (GPC3), and TPBG/5T4 (trophoblast glycoprotein). The tumor-specific marker or the antigen of the target cell can be selected from alpha 4 integrin, Ang2, CEACAM5, cMET, CTLA4, FOLR1, EpCAM (epithelial cell adhesion molecule), CD19, HER2, HER2 neu, HER3, HER4, HER1 (EGFR), PD-L1, PSMA, CEA, TROP-2, MUC1(mucin), Lewis-Y, CD20, CD33, CD38, mesothelin, CD70 (e.g., its natural ligand, CD27 rather than an antibody fragment), VEGFR1, VEGFR2, ROR1, EphA2, ENPP3, glypican 3 (GPC3), and TPBG/5T4 (trophoblast glycoprotein). The VL and VH sequences of the binding domain (or binding moiety) (or the first binding domain, or the second binding domain) having specific binding affinity to a tumor-specific marker or an antigen of a target cell (or a target antigen) can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99%, or 100%, sequence identity to any one of the paired VL and VH sequences set forth in the “VH Sequences” and “VL Sequences” columns of Table 6 (as described more fully hereinbelow).

Therapeutic monoclonal antibodies from which VL and VH and CDR domains can be derived for the subject compositions are known in the art. Such therapeutic antibodies include, but are not limited to, rituximab, IDEC/Genentech/Roche (see for example U.S. Pat. No. 5,736,137), a chimeric anti-CD20 antibody used in the treatment of many lymphomas, leukemias, and some autoimmune disorders; ofatumumab, an anti-CD20 antibody approved for use for chronic lymphocytic leukemia, and under development for follicular non-Hodgkin's lymphoma, diffuse large B cell lymphoma, rheumatoid arthritis and relapsing remitting multiple sclerosis, being developed by GlaxoSmithKline; lucatumumab (HCD122), an anti-CD40 antibody developed by Novartis for Non-Hodgkin's or Hodgkin's Lymphoma (see, for example, U.S. Pat. No. 6,899,879), AME-133, an antibody developed by Applied Molecular Evolution which binds to cells expressing CD20 to treat non-Hodgkin's lymphoma, veltuzumab (hA20), an antibody developed by Immunomedics, Inc. which binds to cells expressing CD20 to treat immune thrombocytopenic purpura, HumaLYM developed by Intracel for the treatment of low-grade B-cell lymphoma, and ocrelizumab, developed by Genentech which is an anti-CD20 monoclonal antibody for treatment of rheumatoid arthritis (see for example U.S. Patent Application 20090155257), trastuzumab (see for example U.S. Pat. No. 5,677,171), a humanized anti-Her2/neu antibody approved to treat breast cancer developed by Genentech; pertuzumab, an anti-HER2 dimerization inhibitor antibody developed by Genentech in treatment of in prostate, breast, and ovarian cancers; (see for example U.S. Pat. No. 4,753,894); cetuximab, an anti-EGFR antibody used to treat epidermal growth factor receptor (EGFR)-expressing, KRAS wild-type metastatic colorectal cancer and head and neck cancer, developed by Imclone and BMS (see U.S. Pat. No. 4,943,533; PCT WO 96/40210); panitumumab, a fully human monoclonal antibody specific to the epidermal growth factor receptor (also known as EGF receptor, EGFR, ErbB-1 and HER1, currently marketed by Amgen for treatment of metastatic colorectal cancer (see U.S. Pat. No. 6,235,883); zalutumumab, a fully human IgG1 monoclonal antibody developed by Genmab that is directed towards the epidermal growth factor receptor (EGFR) for the treatment of squamous cell carcinoma of the head and neck (see for example U.S. Pat. No. 7,247,301); nimotuzumab, a chimeric antibody to EGFR developed by Biocon, YM Biosciences, Cuba, and Oncosciences, Europe) in the treatment of squamous cell carcinomas of the head and neck, nasopharyngeal cancer and glioma (see for example U.S. Pat. Nos. 5,891,996; 6,506,883); alemtuzumab, a humanized monoclonal antibody to CD52 marketed by Bayer Schering Pharma for the treatment of chronic lymphocytic leukemia (CLL), cutaneous T-cell lymphoma (CTCL) and T-cell lymphoma; muromonab-CD3, an anti-CD3 antibody developed by Ortho Biotech/Johnson & Johnson used as an immunosuppressant biologic given to reduce acute rejection in patients with organ transplants; ibritumomab tiuxetan, an anti-CD20 monoclonal antibody developed by IDEC/Schering AG as treatment for some forms of B cell non-Hodgkin's lymphoma; gemtuzumab ozogamicin, an anti-CD33 (p67 protein) antibody linked to a cytotoxic chelator tiuxetan, to which a radioactive isotope can be attached, developed by Celltech/Wyeth used to treat acute myelogenous leukemia; ABX-CBL, an anti-CD147 antibody developed by Abgenix; ABX-IL8, an anti-IL8 antibody developed by Abgenix, ABX-MA1, an anti-MUC18 antibody developed by Abgenix, Pemtumomab (R1549, 90Y-muHMFG1), an anti-MUC1 in development by Antisoma, Therex (R1550), an anti-MUC1 antibody developed by Antisoma, AngioMab (AS1405), developed by Antisoma, HuBC-1, developed by Antisoma, Thioplatin (AS1407) developed by Antisoma, ANTEGREN (natalizumab), an anti-alpha-4-beta-1 (VLA4) and alpha-4-beta-7 antibody developed by Biogen, VLA-1 mAb, an anti-VLA-1 integrin antibody developed by Biogen, LTBR mAb, an anti-lymphotoxin beta receptor (LTBR) antibody developed by Biogen, CAT-152, an anti-TGF-β2 antibody developed by Cambridge Antibody Technology, J695, an anti-IL-12 antibody developed by Cambridge Antibody Technology and Abbott, CAT-192, an anti-TGFβ1 antibody developed by Cambridge Antibody Technology and Genzyme, CAT-213, an anti-Eotaxin 1 antibody developed by Cambridge Antibody Technology, LYMPHOSTAT-B, an anti-Blys antibody developed by Cambridge Antibody Technology and Human Genome Sciences Inc., TRAIL-R1mAb, an anti-TRAIL-R1 antibody developed by Cambridge Antibody Technology and Human Genome Sciences, Inc.; HERCEPTIN, an anti-HER receptor family antibody developed by Genentech; Anti-Tissue Factor (ATF), an anti-Tissue Factor antibody developed by Genentech; XOLAIR (Omalizumab), an anti-IgE antibody developed by Genentech, MLN-02 Antibody (formerly LDP-02), developed by Genentech and Millennium Pharmaceuticals; HUMAX CD4®, an anti-CD4 antibody developed by Genmab; tocilizuma, and anti-IL6R antibody developed by Chugai; HUMAX-IL15, an anti-IL15 antibody developed by Genmab and Amgen, HUMAX-Inflam, developed by Genmab and Medarex; HUMAX-Cancer, an anti-Heparanase I antibody developed by Genmab and Medarex and Oxford GlycoSciences; HUMAX-Lymphoma, developed by Genmab and Amgen, HUMAX-TAC, developed by Genmab; IDEC-131, an anti-CD40L antibody developed by IDEC Pharmaceuticals; IDEC-151 (Clenoliximab), an anti-CD4 antibody developed by IDEC Pharmaceuticals; IDEC-114, an anti-CD80 antibody developed by IDEC Pharmaceuticals; IDEC-152, an anti-CD23 developed by IDEC Pharmaceuticals; an anti-KDR antibody developed by Imclone, DC101, an anti-flk-1 antibody developed by Imclone; anti-VE cadherin antibodies developed by Imclone; CEA-CIDE (labetuzumab), an anti-carcinoembryonic antigen (CEA) antibody developed by Immunomedics; Yervoy (ipilimumab), an anti-CTLA4 antibody developed by Bristol-Myers Squibb in the treatment of melanoma; Lumphocide® (Epratuzumab), an anti-CD22 antibody developed by Immunomedics, AFP-Cide, developed by Immunomedics; MyelomaCide, developed by Immunomedics; LkoCide, developed by Immunomedics; ProstaCide, developed by Immunomedics; MDX-010, an anti-CTLA4 antibody developed by Medarex; MDX-060, an anti-CD30 antibody developed by Medarex; MDX-070 developed by Medarex; MDX-018 developed by Medarex; OSIDEM (IDM-1), an anti-HER2 antibody developed by Medarex and Immuno-Designed Molecules; HUMAX®-CD4, an anti-CD4 antibody developed by Medarex and Genmab; HuMax-IL15, an anti-IL15 antibody developed by Medarex and Genmab; anti-intercellular adhesion molecule-1 (ICAM-1) (CD54) antibodies developed by MorphoSys, MOR201; tremelimumab, an anti-CTLA-4 antibody developed by Pfizer; visilizumab, an anti-CD3 antibody developed by Protein Design Labs; Anti-a 5β1 Integrin, developed by Protein Design Labs; anti-IL-12, developed by Protein Design Labs; ING-1, an anti-Ep-CAM antibody developed by Xoma; and MLN01, an anti-Beta2 integrin antibody developed by Xoma; all of the above-cited antibody references in this paragraph are expressly incorporated herein by reference. The sequences for the above antibodies can be obtained from publicly available databases, patents, or literature references.

Methods to measure binding affinity and/or other biologic activity of the subject compositions of the invention can be those disclosed herein or methods generally known in the art. For example, the binding affinity of a binding pair (e.g., antibody and antigen), denoted as K_d, can be determined using various suitable assays including, but not limited to, radioactive binding assays, non-radioactive binding assays such as fluorescence resonance energy transfer and surface plasmon resonance (SPR, Biacore), and enzyme-linked immunosorbent assays (ELISA), kinetic exclusion assay (KinExA®), reporter gene activity assay, or as described in the Examples. An increase or decrease in binding affinity, for example of a subject therapeutic agent (e.g., a chimeric polypeptide assembly) which has been cleaved to remove a masking moiety compared to the therapeutic agent (e.g., the chimeric polypeptide assembly) with the masking moiety attached, can be determined by measuring the binding affinity of the therapeutic agent (e.g., the chimeric polypeptide assembly) to its target binding partner with and without the masking moiety.

Measurement of half-life of a subject therapeutic agent can be performed by various suitable methods. For example, the half-life of a substance can be determined by administering the substance to a subject and periodically sampling a biological sample (e.g., biological fluid such as blood or plasma or ascites) to determine the concentration and/or amount of that substance in the sample over time. The concentration of a substance in a biological sample can be determined using various suitable methods, including enzyme-linked immunosorbent assays (ELISA), reporter gene activity assays, immunoblots, and chromatography techniques including high-pressure liquid chromatography and fast protein liquid chromatography. In some cases, the substance may be labeled with a detectable tag, such as a radioactive tag or a fluorescence tag, which can be used to determine the concentration of the substance in the sample (e.g., a blood sample, a serum sample, or a plasma sample. The various pharmacokinetic parameters are then determined from the results, which can be done using software packages such as SoftMax Pro software, or by manual calculations known in the art.

In addition, the physicochemical properties of the subject therapeutic agents (e.g., the chimeric polypeptide assembly compositions) may be measured to ascertain the degree of solubility, structure and retention of stability. Assays of the subject compositions are conducted that allow determination of binding characteristics of the binding domains (or binding moieties) towards a ligand, including binding dissociation constant (K_d, K_onand K_off), the half-life of dissociation of the ligand-receptor complex, as well as the activity of the binding domain (or binding moiety) to inhibit the biologic activity of the sequestered ligand compared to free ligand (IC₅₀values). The term “IC₅₀” refers to the concentration needed to inhibit half of the maximum biological response of the ligand agonist, and can be generally determined by competition binding assays. The term “EC₅₀” refers to the concentration needed to achieve half of the maximum biological response of the active substance, and can be generally determined by ELISA or cell-based assays, and/or reporter gene activity assay, including the methods of the Examples described herein.

Anti-CD3 Binding Domains

The CD3 complex is a group of cell surface molecules that associates with the T-cell antigen receptor (TCR) and functions in the cell surface expression of TCR and in the signaling transduction cascade that originates when a peptide:MHC ligand binds to the TCR. Typically, when an antigen binds to the T-cell receptor, the CD3 sends signals through the cell membrane to the cytoplasm inside the T cell. This causes activation of the T cell that rapidly divide to produce new T cells sensitized to attack the particular antigen to which the TCR were exposed. The CD3 complex is comprised of the CD3epsilon molecule, along with four other membrane-bound polypeptides (CD3-gamma, -delta, -zeta, and -beta). In humans, CD3-epsilon is encoded by the CD3E gene on Chromosome 11. The intracellular domains of each of the CD3 chains contain immunoreceptor tyrosine-based activation motifs (ITAMs) that serve as the nucleating point for the intracellular signal transduction machinery upon T cell receptor engagement.

A number of therapeutic strategies modulate T cell immunity by targeting TCR signalling, particularly the anti-human CD3 monoclonal antibodies (mAbs) that are widely used clinically in immunosuppressive regimes. The CD3-specific mouse mAb OKT3 was the first mAb licensed for use in humans (Sgro, C. Side-effects of a monoclonal antibody, muromonab CD3/orthoclone OKT3: bibliographic review. Toxicology 105:23-29, 1995) and is widely used clinically as an immunosuppressive agent in transplantation (Chatenoud, Clin. Transplant 7:422-430, (1993); Chatenoud, Nat. Rev. Immunol. 3:123-132 (2003); Kumar, Transplant. Proc. 30:1351-1352 (1998)), type 1 diabetes, and psoriasis. Importantly, anti-CD3 mAbs can induce partial T cell signalling and clonal anergy (Smith, JA, Nonmitogenic Anti-CD3 Monoclonal Antibodies Deliver a Partial T Cell Receptor Signal and Induce Clonal Anergy J. Exp. Med. 185:1413-1422 (1997)). OKT3 has been described in the literature as a T cell mitogen as well as a potent T cell killer (Wong, JT. The mechanism of anti-CD3 monoclonal antibodies. Mediation of cytolysis by inter-T cell bridging. Transplantation 50:683-689 (1990)). In particular, the studies of Wong demonstrated that by bridging CD3 T cells and target cells, one could achieve killing of the target and that neither FcR-mediated ADCC nor complement fixation was necessary for bivalent anti-CD3 MAB to lyse the target cells.

OKT3 exhibits both a mitogenic and T-cell killing activity in a time-dependent fashion; following early activation of T cells leading to cytokine release, upon further administration OKT3 later blocks all known T-cell functions. It is due to this later blocking of T cell function that OKT3 has found such wide application as an immunosuppressant in therapy regimens for reduction or even abolition of allograft tissue rejection. Other antibodies specific for the CD3 molecule are disclosed in Tunnacliffe, Int. Immunol. 1 (1989), 546-50, WO2005/118635 and WO2007/033230 describe anti-human monoclonal CD3 epsilon antibodies, U.S. Pat. No. 5,821,337 describes the VL and VH sequences of murine anti-CD3 monoclonal Ab UCHT1 (muxCD3, Shalaby et al., J. Exp. Med. 175, 217-225 (1992) and a humanized variant of this antibody (hu UCHT1), and United States Patent Application 20120034228 discloses binding domains capable of binding to an epitope of human and non-chimpanzee primate CD3 epsilon chain.

TABLE 5a

Anti-CD3 Monoclonal Antibodies and VH & VL Sequences

Clone
Antibody

Name
Name
Target
VH Sequence
VL Sequence

huOKT3

CD3
QVQLVQSGGGVVQPGRSLRL
DIQMTQSPSSLSASVGDRVTIT

SCKASGYTFTRYTMHWVRQ
CSASSSVSYMNWYQQTPGKA

APGKGLEWIGYINPSRGYTN
PKRWIYDTSKLASGVPSRFSG

YNQKVKD
RFTISRDNSKNTA
SGSGTDYTFTISSLQPEDIATY

FLQMDSLRPEDTGVYFCARY
YCQQWSSNPFTFGQGTKLQI

YDDHYCLDY
WGQGTPVTVS
TR (SEQ ID NO: 479)

S (SEQ ID NO: 469)

huUCHT1

CD3
EVQLVESGGGLVQPGGSLRLS
DIQMTQSPSSLSASVGDRVTIT

CAASGYSFTGYTMNWVRQA
CRASQDIRNYLNWYQQKPG

PGKGLEWVALINPYKGVSTY
KAPKLLIYYTSRLESGVPSRF

NQKFKDRFTISVDKSKNTAYL
SGSGSGTDYTLTISSLQPEDFA

QMNSLRAEDTAVYYCARSG
TYYCQQGNTLPWTFGQGTK

YYGDSDWYFDV
WGQGTLVT
VEIK (SEQ ID NO: 480)

VSS (SEQ ID NO: 470)

hu12F6

CD3
QVQLVQSGGGVVQPGRSLRL
DIQMTQSPSSLSASVGDRVTM

SCKASGYTFTSYTMHWVRQ
TCRASSSVSYMHWYQQTPG

APGKGLEWIGYINPSSGYTK
KAPKPWIYATSNLASGVPSRF

YNQKFKD
RFTISADKSKSTAF
SGSGSGTDYTLTISSLQPEDIA

LQMDSLRPEDTGVYFCARW
TYYCQQWSSNPPTFGQGTKL

QDYDVYFDY
WGQGTPVTVS
QITR (SEQ ID NO: 481)

S (SEQ ID NO: 471)

mOKT3

CD3
QVQLQQSGAELARPGASVKM
QIVLTQSPAIMSASPGEKVTM

SCKASGYTFTRYTMHWVKQ
TCSASSSVSYMNWYQQKSGT

RPGQGLEWIGYINPSRGYTN
SPKRWIYDTSKLASGVPAHF

YNQKFKD
KATLTTDKSSSTA
RGSGSGTSYSLTISGMEAEDA

YMQLSSLTSEDSAVYYCARY
ATYYCQQWSSNPFTFGSGTK

YDDHYCLDY
WGQGTTLTVS
LEINR (SEQ ID NO: 482)

S (SEQ ID NO: 472)

MT103
blinatumo
CD3
DIKLQQSGAELARPGASVKM
DIQLTQSPAIMSASPGEKVTM

mab

SCKTSGYTFTRYTMHWVKQ
TCRASSSVSYMNWYQQKSGT

RPGQGLEWIGYINPSRGYTN
SPKRWIYDTSKVASGVPYRFS

YNQKFKD
KATLTTDKSSSTA
GSGSGTSYSLTISSMEAEDAA

YMQLSSLTSEDSAVYYCARY
TYYCQQWSSNPLTFGAGTKL

YDDHYCLDY
WGQGTTLTVS
ELK (SEQ ID NO: 483)

S (SEQ ID NO: 473)

MT110
solitomab
CD3
DVQLVQSGAEVKKPGASVKV
DIVLTQSPATLSLSPGERATLS

SCKASGYTFTRYTMHWVRQ
CRASQSVSYMNWYQQKPGK

APGQGLEWIGYINPSRGYTN
APKRWIYDTSKVASGVPARF

YADSVKG
RFTITTDKSTSTAY
SGSGSGTDYSLTINSLEAEDA

MELSSLRSEDTATYYCARYY
ATYYCQQWSSNPLTFGGGT

DDHYCLDY
WGQGTTVTVSS
KVEIK (SEQ ID NO: 484)

(SEQ ID NO: 474)

CD3.7

CD3
EVQLVESGGGLVQPGGSLKL
QTVVTQEPSLTVSPGGTVTLT

SCAASGFTFNKYAMNWVRQ
CGSSTGAVTSGYYPNWVQQK

APGKGLEWVARIRSKYNNYA
PGQAPRGLIGGTKFLAPGTPA

TYYADSVKDRFTISRDDSKNT
RFSGSLLGGKAALTLSGVQPE

AYLQMNNLKTEDTAVYYCV
DEAEYYCALWYSNRWVFGG

RHGNFGNSYISYWAYWGQG
GTKLTVL (SEQ ID NO: 485)

TLVTVSS (SEQ ID NO: 475)

CD3.8

CD3
EVQLVESGGGLVQPGGSLRLS
QAVVTQEPSLTVSPGGTVTLT

CAASGFTFNTYAMNWVRQA
CGSSTGAVTTSNYANWVQQK

PGKGLEWVGRIRSKYNNYAT
PGQAPRGLIGGTNKRAPGVPA

YYADSVKGRFTISRDDSKNTL
RFSGSLLGGKAALTLSGAQPE

YLQMNSLRAEDTAVYYCVR
DEAEYYCALWYSNLWVFGG

HGNFGNSYVSWFAYWGQGT
GTKLTVL (SEQ ID NO: 486)

LVTVSS (SEQ ID NO: 476)

CD3.9

CD3
EVQLLESGGGLVQPGGSLKLS
ELVVTQEPSLTVSPGGTVTLT

CAASGFTFNTYAMNWVRQA
CRSSTGAVTTSNYANWVQQK

PGKGLEWVARIRSKYNNYAT
PGQAPRGLIGGTNKRAPGTPA

YYADSVKDRFTISRDDSKNTA
RFSGSLLGGKAALTLSGVQPE

YLQMNNLKTEDTAVYYCVR
DEAEYYCALWYSNLWVFGG

HGNFGNSYVSWFAYWGQGT
GTKLTVL (SEQ ID NO: 487)

LVTVSS (SEQ ID NO: 477)

CD3.10

CD3
EVKLLESGGGLVQPKGSLKLS
QAVVTQESALTTSPGETVTLT

CAASGFTFNTYAMNWVRQA
CRSSTGAVTTSNYANWVQEK

PGKGLEWVARIRSKYNNYAT
PDHLFTGLIGGTNKRAPGVPA

YYADSVKDRFTISRDDSQSIL
RFSGSLIGDKAALTITGAQTE

YLQMNNLKTEDTAMYYCVR
DEAIYFCALWYSNLWVFGGG

HGNFGNSYVSWFAYWGQGT
TKLTVL (SEQ ID NO: 488)

LVTVSS (SEQ ID NO: 478)

*underlined sequences, if present, are CDRs within the VL and VH

In some embodiments of the compositions of this disclosure, the BP can comprise a binding domain (or a binding moiety) (such as an antigen binding fragment) having specific binding affinity for an effector cell antigen. The effector cell antigen can be expressed on the surface of an effector cell selected from a plasma cell, a T cell, a B cell, a cytokine induced killer cell (CIK cell), a mast cell, a dendritic cell, a regulatory T cell (RegT cell), a helper T cell, a myeloid cell, and a NK cell. The effector cell antigen can be expressed on the surface of a T cell. The binding domain (or binding moiety) can have binding affinity for CD3. In some embodiments, where the binding domain (or binding moiety) having binding affinity for CD3, the binding domain (or binding moiety) can have binding affinity for a member of the CD3 complex, which includes in individual form or independently combined form all known CD3 subunits of the CD3 complex; for example, CD3 epsilon, CD3 delta, CD3 gamma, CD3 zeta, CD3 alpha and CD3 beta. The binding domain (or binding moiety) having binding affinity for CD3 can have binding affinity for CD3 epsilon, CD3 delta, CD3 gamma, CD3 zeta, CD3 alpha or CD3 beta.

The origin of the antigen binding fragments (comprised in the binding domain or binding moiety) contemplated by the disclosure can be derived from a naturally occurring antibody or fragment thereof, a non-naturally occurring antibody or fragment thereof, a humanized antibody or fragment thereof, a synthetic antibody or fragment thereof, a hybrid antibody or fragment thereof, or an engineered antibody or fragment thereof. Methods for generating an antibody for a given target marker are well known in the art. For example, the monoclonal antibodies may be made using the hybridoma method first described by Kohler et al., Nature, 256:495 (1975), or may be made by recombinant DNA methods (U.S. Pat. No. 4,816,567). The structure of antibodies and fragments thereof, variable regions of heavy and light chains of an antibody (VH and VL), single chain variable regions (scFv), complementarity determining regions (CDR), and domain antibodies (dAbs) are well understood. Methods for generating a polypeptide having a desired antigen binding fragment with binding affinity to a given antigen are known in the art.

It will be understood that use of the term antigen binding fragments for the composition embodiments disclosed herein is intended to include portions or fragments of antibodies that retain the ability to bind the antigens that are the ligands of the corresponding intact antibody. In such embodiments, the antigen binding fragment can be, but is not limited to, CDRs and intervening framework regions, variable or hypervariable regions of light and/or heavy chains of an antibody (VL, VH), variable fragments (Fv), Fab′ fragments, F(ab′)2 fragments, Fab fragments, single chain antibodies (scAb), VHH camelid antibodies, single chain variable fragment (scFv), linear antibodies, a single domain antibody, complementarity determining regions (CDR), domain antibodies (dAbs), single domain heavy chain immunoglobulins of the BHH or BNAR type, single domain light chain immunoglobulins, or other polypeptides known in the art containing a fragment of an antibody capable of binding an antigen. The antigen binding fragments having CDR-H and CDR-L can be configured in a (CDR-H)-(CDR-L) or a (CDR-H)-(CDR-L) orientation, N-terminus to C-terminus. The VL and VH of two antigen binding fragments can also be configured in a single chain diabody configuration; e.g., the VL and VH of the first and second binding domains (or binding moieties) configured with linkers of an appropriate length to permit arrangement as a diabody.

Various CD3 binding domains of the disclosure have been specifically modified to enhance their stability in the polypeptide embodiments described herein. Binding specificity can be determined by complementarity determining regions (CDRs), such as light chain CDRs or heavy chain CDRs. In many cases, binding specificity is determined by light chain CDRs and heavy chain CDRs. A given combination of heavy chain CDRs and light chain CDRs provides a given binding pocket that confers greater affinity and/or specificity towards an effector cell antigen as compared to other reference antigens. Protein aggregation of antibodies continues to be a significant problem in their developability and remains a major area of focus in antibody production. Antibody aggregation can be triggered by partial unfolding of its domains, leading to monomer-monomer association followed by nucleation and aggregate growth. Although the aggregation propensities of antibodies and antibody-based proteins can be affected by the external experimental conditions, they are strongly dependent on the intrinsic antibody properties as determined by their sequences and structures. Although it is well known that proteins are only marginally stable in their folded states, it is often less well appreciated that most proteins are inherently aggregation-prone in their unfolded or partially unfolded states, and the resulting aggregates can be extremely stable and long-lived. Reduction in aggregation propensity has also been shown to be accompanied by an increase in expression titer, showing that reducing protein aggregation is beneficial throughout the development process and can lead to a more efficient path to clinical studies. For therapeutic proteins, aggregates are a significant risk factor for deleterious immune responses in patients, and can form via a variety of mechanisms. Controlling aggregation can improve protein stability, manufacturability, attrition rates, safety, formulation, titers, immunogenicity, and solubility. The intrinsic properties of proteins such as size, hydrophobicity, electrostatics and charge distribution play important roles in protein solubility. Low solubility of therapeutic proteins due to surface hydrophobicity has been shown to render formulation development more difficult and may lead to poor bio-distribution, undesirable pharmacokinetics behavior and immunogenicity in vivo. Decreasing the overall surface hydrophobicity of candidate monoclonal antibodies can also provide benefits and cost savings relating to purification and dosing regimens. Individual amino acids can be identified by structural analysis as being contributory to aggregation potential in an antibody, and can be located in CDR as well as framework regions. In particular, residues can be predicted to be at high risk of causing hydrophobicity issues in a given antibody.

In some embodiments, the invention provides therapeutic agents that comprise binding domain(s) with binding affinity to T cell antigen(s). In some embodiments, the binding domain with binding affinity to a T cell antigen can comprise VL and VH derived from a monoclonal antibody to an antigen of the cluster of differentiation 3 T cell receptor (CD3). The binding domain can comprise VL and VH derived from a monoclonal antibody to CD3epsilon and CD3delta subunits. Monoclonal antibodies to CD3 neu are known in the art. Exemplary, non-limiting examples of VL and VH sequences of monoclonal antibodies to CD3 are presented in Table 5a. The binding domain with binding affinity to CD3 can comprise anti-CD3 VL and VH sequences set forth in Table 5a. The binding domain with binding affinity to CD3epsilon can comprise anti-CD3epsilon VL and VH sequences set forth in Table 5a. The binding domain with binding affinity to CD3 can comprise VH and VL regions wherein each VH and VL regions exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99%, or 100% identity to paired VL and VH sequences of the huUCHT1 anti-CD3 antibody of Table 5a. The binding domain with binding affinity to CD3 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each is derived from the respective anti-CD3 VL and VH sequences set forth in Table 5a. The binding domain with binding affinity to CD3 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein the CDR sequences. The binding domain with binding affinity to CD3 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein the CDR sequences are RASQDIRNYLN (SEQ ID NO: 489), YTSRLES (SEQ ID NO: 490), QQGNTLPWT (SEQ ID NO: 491), GYSFTGYTMN (SEQ ID NO: 492), LINPYKGVST (SEQ ID NO: 493), and SGYYGDSDWYFDV (SEQ ID NO: 494).

In some embodiments, the present disclosure provides a binding domain (or binding moiety) that binds CD3, for incorporation into the compositions described herein, can comprise CDR-L and CDR-H. The binding domain binding CD3 can comprise a CDR-H1, a CDR-H2, and a CDR-H3, each (independently) having an amino acid sequence exhibiting at least 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% sequence identity or is identical to the amino acid sequence set forth in Table 5b. The binding domain binding CD3 can comprise a CDR-L1, a CDR-L2, and a CDR-L3, each (independently) having an amino acid sequence exhibiting at least 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% sequence identity or is identical to the amino acid sequence set forth in Table 5b.

In some embodiments, the present disclosure provides a binding domain (or binding moiety) that binds CD3, for incorporation into the compositions described herein, can comprise light chain framework regions (FR-L) and heavy chain framework regions (FR-H). The binding domain binding CD3 can comprise a FR-L1 exhibiting at least 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% sequence identity or is identical to a FR-L1 sequence set forth in Table 5c. The binding domain binding CD3 can comprise a FR-L2 exhibiting at least 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% sequence identity or is identical to a FR-L2 sequence set forth in Table 5c. The binding domain binding CD3 can comprise a FR-L3 exhibiting at least 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% sequence identity or is identical to a FR-L3 sequence set forth in Table 5c. The binding domain binding CD3 can comprise a FR-L4 exhibiting at least 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% sequence identity or is identical to a FR-L4 sequence set forth in Table 5c. The binding domain binding CD3 can comprise a FR-H1 exhibiting at least 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% sequence identity or is identical to a FR-H1 sequence set forth in Table 5c. The binding domain binding CD3 can comprise a FR-H2 exhibiting at least 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% sequence identity or is identical to a FR-H2 sequence set forth in Table 5c. The binding domain binding CD3 can comprise a FR-H3 exhibiting at least 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% sequence identity or is identical to a FR-H3 sequence set forth in Table 5c. The binding domain binding CD3 can comprise a FR-H4 exhibiting at least 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% sequence identity or is identical to a FR-H4 sequence set forth in Table 5c.

In some embodiments, the present disclosure provides a binding domain (or binding moiety) that binds CD3, for incorporation into the compositions described herein, can comprise a variable light (VL) amino acid sequence and a variable heavy (VH) amino acid sequence. The binding domain that binds CD3 can comprise a VL exhibiting at least 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% sequence identity or is identical to a VL sequence set forth in Table 5d. The binding domain that binds CD3 can comprise a VH exhibiting at least 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% sequence identity or is identical to a VH sequence set forth in Table 5d. The binding domain that binds CD3 can comprise an amino acid sequence exhibiting at least 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% sequence identity or is identical to a scFv sequence set forth in Table 5d.

In some embodiments of the compositions of this disclosure, the VL and VH of the antigen binding fragments can be fused by relatively long linkers, consisting 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, or 35 hydrophilic amino acids that, when joined together, have a flexible characteristic. In some embodiment, the VL and VH of any of the scFv embodiments described herein can be linked by relatively long linkers of hydrophilic amino acids selected from the sequences GSGEGSEGEGGGEGSEGEGSGEGGEGEGSG (SEQ ID NO: 495), TGSGEGSEGEGGGEGSEGEGSGEGGEGEGSGT (SEQ ID NO: 496), GATPPETGAETESPGETTGGSAESEPPGEG (SEQ ID NO: 497), or GSAAPTAGTTPSASPAPPTGGSSAAGSPST (SEQ ID NO: 498).

In some embodiments of the compositions of this disclosure, where the BP comprises a first binding domain (or first binding moiety) and a second binding domain (or second binding moiety), the first and second binding domains (or the first and second binding moieties) can be linked together by a short linker of hydrophilic amino acids having 3, 4, 5, 6, or 7 amino acids. The short linker sequences can be selected from the group of sequences SGGGGS (SEQ ID NO: 499), GGGGS (SEQ ID NO: 500), GGSGGS (SEQ ID NO: 501), GGS, or GSP. In some embodiment, the disclosure provides compositions comprising a single chain diabody in which after folding, the first domain (VL or VH) is paired with the last domain (VH or VL) to form one scFv and the two domains in the middle are paired to form the other scFv in which the first and second domains, as well as the third and last domains, are fused together by one of the foregoing short linkers and the second and the third variable domains are fused by one of the foregoing relatively long linkers. As will be appreciated by one of skill in the art, the selection of the short linker and relatively long linker can be to prevent the incorrect pairing of adjacent variable domains, thereby facilitating the formation of the single chain diabody configuration comprising the VL and VH of the first antigen binding fragment and the second antigen binding fragment.

TABLE 5b

Exemplary CD3 CDR Sequences

SEQ ID

Construct
Region
NO:
Amino Acid Sequence

3.23, 3.30, 3.31, 3.32
CDR-L1
502
RSSNGAVTSSNYAN

3.24
CDR-L1
503
RSSNGEVTTSNYAN

3.33, 3.9
CDR-L1
504
RSSTGAVTTSNYAN

3.23, 3.30, 3.31, 3.32, 3.9, 3.33
CDR-L2
505
GTNKRAP

3.24
CDR-L2
506
GTIKRAP

3.23, 3.24, 3.30, 3.31, 3.32
CDR-L3
507
ALWYPNLWVF

3.33, 3.9
CDR-L3
508
ALWYSNLWVF

3.23, 3.24, 3.30, 3.31, 3.32,
CDR-H1
509
GFTFNTYAMN

3.9, 3.33

3.23, 3.24, 3.30, 3.31, 3.32,
CDR-H2
510
RIRSKYNNYATYYADSVKD

3.9, 3.33

3.23. 3.24, 3.30, 3.31, 3.32
CDR-H3
511
HENFGNSYVSWFAH

3.9, 3.33
CDR-H3
512
HGNFGNSYVSWFAY

TABLE 5c

Exemplary CD3 FR Sequences

SEQ

Construct
Region
ID NO:
Amino Acid Sequence

3.23, 3.24, 3.30,
FR-L1
513
ELVVTQEPSLTVSPGGTVTLTC

3.31, 3.32, 3.9,

3.33

3.23, 3.24, 3.30,
FR-L2
514
WVQQKPGQAPRGLIG

3.31, 3.32, 3.9,

3.33

3.23, 3.24
FR-L3
515
GTPARFSGSLLGGKAALTLSGVQPEDEAVYYC

3.30
FR-L3
516
GTPARFSGSSLGGKAALTLSGVQPEDEAVYYC

3.31
FR-L3
517
GTPARFSGSLLGGSAALTLSGVQPEDEAVYYC

3.32
FR-L3
518
GTPARFSGSSLGGSAALTLSGVQPEDEAVYYC

3.9
FR-L3
519
GTPARFSGSLLGGKAALTLSGVQPEDEAEYYC

3.33
FR-L3
520
GTPARFSGSSLGGSAALTLSGVQPEDEAEYYC

3.23, 3.24, 3.30,
FR-L4
521
GGGTKLTVL

3.31, 3.32, 3.9,

3.33

3.23. 3.24
FR-H1
522
EVQLLESGGGIVQPGGSLKLSCAAS

3.30, 3.31, 3.32
FR-H1
523
EVQLQESGGGIVQPGGSLKLSCAAS

3.33
FR-H1
524
EVQLQESGGGLVQPGGSLKLSCAAS

3.9
FR-H1
525
EVQLLESGGGLVQPGGSLKLSCAAS

3.23, 3.24, 3.30,
FR-H2
526
WVRQAPGKGLEWVA

3.31, 3.32, 3.9,

3.33

3.23, 3.24, 3.30,
FR-H3
527
RFTISRDDSKNTVYLQMNNLKTEDTAVYYCVR

3.31, 3.32

3.9. 3.33
FR-H3
528
RFTISRDDSKNTAYLQMNNLKTEDTAVYYCVR

3.23, 3.24, 3.30,
FR-H4
529
WGQGTLVTVSS

3.31, 3.32, 3.9,

3.33

TABLE 5d

Exemplary VL & VH Sequences

SEQ ID

Construct
Region
NO:
Amino Acid Sequence

3.23
VL
530
ELVVTQEPSLTVSPGGTVTLTCRSSNGAVTSSNYANWVQQKPGQAPR

GLIGGTNKRAPGTPARFSGSLLGGKAALTLSGVQPEDEAVYYCALWY

PNLWVFGGGTKLTVL

3.23,
VH
531
EVQLLESGGGIVQPGGSLKLSCAASGFTFNTYAMNWVRQAPGKGLEW

3.24

VARIRSKYNNYATYYADSVKDRFTISRDDSKNTVYLQMNNLKTEDTA

VYYCVRHENFGNSYVSWFAHWGQGTLVTVSS

3.24
VL
532
ELVVTQEPSLTVSPGGTVTLTCRSSNGEVTTSNYANWVQQKPGQAPR

GLIGGTIKRAPGTPARFSGSLLGGKAALTLSGVQPEDEAVYYCALWYP

NLWVFGGGTKLTVL

3.30
VL
533
ELVVTQEPSLTVSPGGTVTLTCRSSNGAVTSSNYANWVQQKPGQAPR

GLIGGTNKRAPGTPARFSGSSLGGKAALTLSGVQPEDEAVYYCALWY

PNLWVFGGGTKLTVL

3.30,
VH
534
EVQLQESGGGIVQPGGSLKLSCAASGFTFNTYAMNWVRQAPGKGLEW

3.31,

VARIRSKYNNYATYYADSVKDRFTISRDDSKNTVYLQMNNLKTEDTA

3.32

VYYCVRHENFGNSYVSWFAHWGQGTLVTVSS

3.31
VL
535
ELVVTQEPSLTVSPGGTVTLTCRSSNGAVTSSNYANWVQQKPGQAPR

GLIGGTNKRAPGTPARFSGSLLGGSAALTLSGVQPEDEAVYYCALWYP

NLWVFGGGTKLTVL

3.32
VL
536
ELVVTQEPSLTVSPGGTVTLTCRSSNGAVTSSNYANWVQQKPGQAPR

GLIGGTNKRAPGTPARFSGSSLGGSAALTLSGVQPEDEAVYYCALWYP

NLWVFGGGTKLTVL

3.9
VL
537
ELVVTQEPSLTVSPGGTVTLTCRSSTGAVTTSNYANWVQQKPGQAPR

GLIGGTNKRAPGTPARFSGSLLGGKAALTLSGVQPEDEAEYYCALWYS

NLWVFGGGTKLTVL

3.9
VH
538
EVQLLESGGGLVQPGGSLKLSCAASGFTFNTYAMNWVRQAPGKGLE

WVARIRSKYNNYATYYADSVKDRFTISRDDSKNTAYLQMNNLKTEDT

AVYYCVRHGNFGNSYVSWFAYWGQGTLVTVSS

3.33
VL
539
ELVVTQEPSLTVSPGGTVTLTCRSSTGAVTTSNYANWVQQKPGQAPR

GLIGGTNKRAPGTPARFSGSSLGGSAALTLSGVQPEDEAEYYCALWYS

NLWVFGGGTKLTVL

3.33
VH
540
EVQLQESGGGLVQPGGSLKLSCAASGFTFNTYAMNWVRQAPGKGLE

WVARIRSKYNNYATYYADSVKDRFTISRDDSKNTAYLQMNNLKTEDT

AVYYCVRHGNFGNSYVSWFAYWGQGTLVTVSS

TABLE 5e

Exemplary scFv Sequences

SEQ ID

Construct
NO:
Amino Acid Sequence

3.23
541
ELVVTQEPSLTVSPGGTVTLTCRSSNGAVTSSNYANWVQQKPGQAPRGLIGG

TNKRAPGTPARFSGSLLGGKAALTLSGVQPEDEAVYYCALWYPNLWVFGG

GTKLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQLLESGGGIVQPGG

SLKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNYATYYADSV

KDRFTISRDDSKNTVYLQMNNLKTEDTAVYYCVRHENFGNSYVSWFAHWG

QGTLVTVSS

3.24
542
ELVVTQEPSLTVSPGGTVTLTCRSSNGEVTTSNYANWVQQKPGQAPRGLIGG

TIKRAPGTPARFSGSLLGGKAALTLSGVQPEDEAVYYCALWYPNLWVFGGG

TKLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQLLESGGGIVQPGGS

LKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNYATYYADSVK

DRFTISRDDSKNTVYLQMNNLKTEDTAVYYCVRHENFGNSYVSWFAHWGQ

GTLVTVSS

3.30
543
ELVVTQEPSLTVSPGGTVTLTCRSSNGAVTSSNYANWVQQKPGQAPRGLIGG

TNKRAPGTPARFSGSSLGGKAALTLSGVQPEDEAVYYCALWYPNLWVFGGG

TKLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQLQESGGGIVQPGGS

LKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNYATYYADSVK

DRFTISRDDSKNTVYLQMNNLKTEDTAVYYCVRHENFGNSYVSWFAHWGQ

GTLVTVSS

3.31
544
ELVVTQEPSLTVSPGGTVTLTCRSSNGAVTSSNYANWVQQKPGQAPRGLIGG

TNKRAPGTPARFSGSLLGGSAALTLSGVQPEDEAVYYCALWYPNLWVFGGG

TKLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQLQESGGGIVQPGGS

LKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNYATYYADSVK

DRFTISRDDSKNTVYLQMNNLKTEDTAVYYCVRHENFGNSYVSWFAHWGQ

GTLVTVSS

3.32
545
ELVVTQEPSLTVSPGGTVTLTCRSSNGAVTSSNYANWVQQKPGQAPRGLIGG

TNKRAPGTPARFSGSSLGGSAALTLSGVQPEDEAVYYCALWYPNLWVFGGG

TKLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQLQESGGGIVQPGGS

LKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNYATYYADSVK

DRFTISRDDSKNTVYLQMNNLKTEDTAVYYCVRHENFGNSYVSWFAHWGQ

GTLVTVSS

3.9
546
ELVVTQEPSLTVSPGGTVTLTCRSSTGAVTTSNYANWVQQKPGQAPRGLIGG

TNKRAPGTPARFSGSLLGGKAALTLSGVQPEDEAEYYCALWYSNLWVFGGG

TKLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQLLESGGGLVQPGGS

LKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNYATYYADSVK

DRFTISRDDSKNTAYLQMNNLKTEDTAVYYCVRHGNFGNSYVSWFAYWGQ

GTLVTVSS

3.33
547
ELVVTQEPSLTVSPGGTVTLTCRSSTGAVTTSNYANWVQQKPGQAPRGLIGG

TNKRAPGTPARFSGSSLGGSAALTLSGVQPEDEAEYYCALWYSNLWVFGGG

TKLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQLQESGGGLVQPGGS

LKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNYATYYADSVK

DRFTISRDDSKNTAYLQMNNLKTEDTAVYYCVRHGNFGNSYVSWFAYWGQ

GTLVTVSS

4.11
548
QSVLTQPPSASGTPGQRVTISCSGSSSNIGSNYVYWYQQLPGTAPKLLIYRNN

QRPSGVPDRFSGSKSGTSASLAISGLRSEDEADYYCAAWDDSLSGLWVFGGG

TKLTVLGATPPETGAETESPGETTGGSAESEPPGEGQVQLQQWGGGLVKPGG

SLRLSCAASGFTFSSYSMNWVRQAPGKGLEWVSRINSDGSSTNYADSVKGRF

TISRDNAKNTLYLQMNSLRAEDTAVYYCARELRWGNWGQGTLVTVSS

4.12
549
QAGLTQPPSASGTPGQRVTLSCSGSYSNIGTYYVYWYQQLPGTAPKLLIYSN

DQRLSGVPDRFSGSKSGTSASLAISGLQSEDEAAYYCAAWDDSLNGWAFGG

GTKLTVLGATPPETGAETESPGETTGGSAESEPPGEGQVQLQQWGGGLVKPG

GSLRLSCAASGFTFSSYSMNWVRQAPGKGLEWVSRINSDGSSTNYADSVKG

RFTISRDNAKNTLYLQMNSLRAEDTAVYYCARELRWGNWGQGTLVTVSS

4.13
550
QPGLTQPPSASGTPGQRVTLSCSGRSSNIGSYYVYWYQHLPGMAPKLLIYRN

SRRPSGVPDRFSGSKSGTSASLVISGLQSDDEADYYCAAWDDSLKSWVFGGG

TKLTVLGATPPETGAETESPGETTGGSAESEPPGEGQVQLQQWGGGLVKPGG

SLRLSCAASGFTFSSYSMNWVRQAPGKGLEWVSRINSDGSSTNYADSVKGRF

TISRDNAKNTLYLQMNSLRAEDTAVYYCARELRWGNWGQGTLVTVSS

4.14
551
QSVLTQPPSASGTPGQRVTISCSGSSSNIGTNYVYWYQQFPGTAPKLLIYSNN

QRPSGVPDRFSGSKSGTSGSLAISGLQSEDEADYSCAAWDDSLNGWVFGGGT

KLTVLGATPPETGAETESPGETTGGSAESEPPGEGQVQLVQWGGGLVKPGGS

LRLSCAASGFTFSSYSMNWVRQAPGKGLEWVSRINSDGSSTNYADSVKGRF

TISRDNAKNTLYLQMNSLRAEDTAVYYCARELRWGNWGQGTLVTVSS

4.15
552
QPGLTQPPSASGTPGQRVTISCSGSSSNIGSNYVYWYQQLPGTAPKLLIYRNN

QRPSGVPDRLSGSKSGTSASLAISGLRSEDEADYYCAAWDDSLSGWVFGGGT

KLTVLGATPPETGAETESPGETTGGSAESEPPGEGQVQLVQWGGGLVKPGGS

LRLSCAASGFTFSSYSMNWVRQAPGKGLEWVSRINSDGSSTNYADSVKGRF

TISRDNAKNTLYLQMNSLRAEDTAVYYCARELRWGNWGQGTLVTVSS

4.16
553
QAVLTQPPSASGTPGQRVTISCSGSSSNIGSYYVYWYQQVPGAAPKLLMRLN

NQRPSGVPDRFSGAKSGTSASLVISGLRSEDEADYYCAAWDDSLSGQWVFG

GGTKLTVLGATPPETGAETESPGETTGGSAESEPPGEGQVQLQQWGGGLVKP

GGSLRLSCAASGFTFSSYSMNWVRQAPGKGLEWVSRINSDGSSTNYADSVK

GRFTISRDNAKNTLYLQMNSLRAEDTAVYYCARELRWGNWGQGTLVTVSS

4.17
554
QAGLTQPPSASGTPGQRVTISCSGSSSNIGSNYVYWYQQLPGTAPKLLIYRNN

QRPSGVPDRFSGSKSGTSASLAISGLRSEDEADYYCATWDASLSGWVFGGGT

KLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQLVQWGGGLVKPGGS

LRLSCAASGFTFSSYSMNWVRQAPGKGLEWVSRINSDGSSTNYADSVKGRF

TISRDNAKNTLYLQMNSLRAEDTAVYYCARELRWGNWGQGTLVTVSS

Tumor-Specific Markers or Antigens of Target Cells

In some embodiments of the compositions of this disclosure, the binding domain (e.g., the first binding domain) can have specific binding affinity to a tumor-specific marker or an antigen of a target cell. Some embodiments of the compositions of this disclosure can comprise another binding domain (e.g., the second binding domain) that binds to an effector cell antigen. The tumor-specific marker can be associated with a tumor cell (such as of stroma cell tumor, fibroblast tumor, myofibroblast tumor, glial cell tumor, epithelial cell tumor, fat cell tumor, immune cell tumor, vascular cell tumor, or smooth muscle cell tumor). The tumor-specific marker or the antigen of the target cell can be selected from the group consisting of alpha 4 integrin, Ang2, B7-H3, B7-H6 (e.g., its natural ligand Nkp30 rather than an antibody fragment), CEACAM5, cMET, CTLA4, FOLR1, EpCAM (epithelial cell adhesion molecule), CCR5, CD19, HER2, HER2 neu, HER3, HER4, HER1 (EGFR), PD-L1, PSMA, CEA, TROP-2, MUC1(mucin), MUC-2, MUC3, MUC4, MUC5AC, MUC5B, MUC7, MUC16, βhCG, Lewis-Y, CD20, CD33, CD38, CD30, CD56 (NCAM), CD133, ganglioside GD3, 9-O-acetyl-GD3, GM2, Globo H, fucosyl GM1, GD2, carbonicanhydrase IX, CD44v6, Nectin-4, Sonic Hedgehog (Shh), Wue-1, plasma cell antigen 1 (PC-1), melanoma chondroitin sulfate proteoglycan (MCSP), CCR8, 6-transmembrane epithelial antigen of prostate (STEAP), mesothelin, A33 antigen, prostate stem cell antigen (PSCA), Ly-6, desmoglein 4, fetal acetylcholine receptor (fnAChR), CD25, cancer antigen 19-9 (CA19-9), cancer antigen 125 (CA-125), Muellerian inhibitory substance receptor type II (MISIIR), sialylated Tn antigen (sTN), fibroblast activation antigen (FAP), endosialin (CD248), epidermal growth factor receptor variant III (EGFRvIII), tumor-associated antigen L6 (TAL6), SAS, CD63, TAG72, Thomsen-Friedenreich antigen (TF-antigen), insulin-like growth factor I receptor (IGF-IR), Cora antigen, CD7, CD22, CD70 (e.g., its natural ligand, CD27 rather than an antibody fragment), CD79a, CD79b, G250, MT-MMPs, alpha-fetoprotein (AFP), VEGFR1, VEGFR2, DLK1, SP17, ROR1, EphA2, ENPP3, glypican 3 (GPC3), and TPBG/5T4 (trophoblast glycoprotein). The tumor-specific marker or the antigen of the target cell can be selected from alpha 4 integrin, Ang2, CEACAM5, cMET, CTLA4, FOLR1, EpCAM (epithelial cell adhesion molecule), CD19, HER2, HER2 neu, HER3, HER4, HER1 (EGFR), PD-L1, PSMA, CEA, TROP-2, MUC1(mucin), Lewis-Y, CD20, CD33, CD38, mesothelin, CD70 (e.g., its natural ligand, CD27 rather than an antibody fragment), VEGFR1, VEGFR2, ROR1, EphA2, ENPP3, glypican 3 (GPC3), and TPBG/5T4 (trophoblast glycoprotein). The tumor-specific marker or the antigen of the target cell can be any one set forth in the “Target” column of Table 6. The binding domain with binding affinity to the tumor-specific marker or the target cell antigen can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99%, or 100%, sequence identity to any one of the paired VL and VH sequences set forth in the “VH Sequences” and “VL Sequences” columns of Table 6.

TABLE 6

Anti-target Cell Monoclonal Antibodies and Sequences

Trade
Antibody

SEQ

SEQ

Name
Name
Target
ID NO:
VH Sequence
ID NO:
VL Sequence

Tysabri™
natalizumab
Alpha
555
QVQLVQSGAEVKKPG
654
DIQMTQSPSSLSASVG

4

ASVKVSCKASGFNIKD

DRVTITCKTSQDINK

Integrin

TYIH
WVRQAPGQRLE

YMA
WYQQTPGKAPR

WMGRIDPANGYTKY

LLIHYTSALQPGIPSR

DPKFQG
RVTITADTSA

FSGSGSGRDYTFTISS

STAYMELSSLRSEDTA

LQPEDIATYYCLQYD

VYYCAREGYYGNYG

NLWT
FGQGTKVEIK

VYAMDY
WGQGTLVT

VSS

REGN910
nesvacumab
Ang2
556
EVQLVESGGGLVQPGG
655
EIVLTQSPGTLSLSPG

SLRLSCAASGFTFSSY

ERATLSCRASQSVSS

DIH
WVRQATGKGLEW

TYLA
WYQQKPGQAP

VSAIGPAGDTYYPGSV

RLLIYGASSRATGIPD

KGRFTISRENAKNSLY

RFSGSGSGTDFTLTIS

LQMNSLRAGDTAVYY

RLEPEDFAVYYCQH

CARGLITFGGLIAPFD

YDNSQ
TFGQGTKVEI

YWGQGTLVTVSS

K

hMFE23

CEA
557
QVKLEQSGAEVVKPG
656
ENVLTQSPSSMSASV

ASVKLSCKASGFNIKD

GDRVNIACSASSSVS

S
YMHWLRQGPGQRLE

YMHWFQQKPGKSPK

WIGWIDPENGDTEYAP

LWIYSTSNLASGVPS

KFQGKATFTTDTSANT

RFSGSGSGTDYSLTIS

AYLGLSSLRPEDTAVY

SMQPEDAATYYCQQ

YCNEGTPTGPYYFDY

RSSYPL
TFGGGTKLEI

WGQGTLVTVSS

K

M5A

CEA
558
EVQLVESGGGLVQPGG
657
DIQLTQSPSSLSASVG

(humanized

SLRLSCAASGFNIKDT

DRVTITCRAGESVDI

T84.66)

YMH
WVRQAPGKGLE

FGVGFLH
WYQQKPG

WVARIDPANGNSKYA

KAPKLLIYRASNLES

DSVKG
RFTISADTSKN

GVPSRFSGSGSRTDFT

TAYLQMNSLRAEDTA

LTISSLQPEDFATYYC

VYYCAPFGYYVSDYA

QQTNEDPYT
FGQGT

MAY
WGQGTLVTVSS

KVEIK

M5B

CEA
559
EVQLVESGGGLVQPGG
658
DIQLTQSPSSLSASVG

(humanized

SLRLSCAASGFNIKDT

DRVTITCRAGESVDI

T84.66)

YM
HWVRQAPGKGLE

FGVGFLH
WYQQKPG

WVARIDPANGNSKYV

KAPKLLIYRASNLES

PKFQG
RATISADTSKN

GVPSRFSGSGSRTDFT

TAYLQMNSLRAEDTA

LTISSLQPEDFATYYC

VYYCAPFGYYVSDYA

QQTNEDPYT
FGQGT

MAY
WGQGTLVTVSS

KVEIK

CEA-Cide
Labetuzumab
CEAC
560
EVQLVESGGGVVQPG
659
DIQLTQSPSSLSASVG

(MN-14)
AM5

RSLRLSCSASGFDFTTY

DRVTITCKASQDVGT

WMS
WVRQAPGKGLE

SVA
WYQQKPGKAPK

WIGEIHPDSSTINYAPS

LLIYWTSTRHTGVPS

LKD
RFTISRDNAKNTL

RFSGSGSGTDFTFTIS

FLQMDSLRPEDTGVYF

SLQPEDIATYYCQQY

CASLYFGFPWFAYWG

SLYRS
FGQGTKVEIK

QGTPVTVSS

CEA-Scan
arcitumomab
CEAC
561
EVKLVESGGGLVQPGG
660
QTVLSQSPAILSASPG

AM5

SLRLSCATSGFTFTDY

EKVTMTCRASSSVTY

YMN
WVRQPPGKALE

IHW
YQQKPGSSPKS

WLGFIGNKANGYTTE

WIYATSNLASGVPAR

YSAS
VKGRFTISRDKS

FSGSGSGTSYSLTISR

QSILYLQMNTLRAEDS

VEAEDAATYYCQHW

ATYYCTRDRGLRFYF

SSKPPT
FGGGTKLEIK

DYWGQGTTLTVSS

R

MT110

CEAC
562
EVQLVESGGGLVQPGR
661
QAVLTQPASLSASPG

AM5

SLRLSCAASGFTVSSY

ASASLTCTLRRGINV

WMH
WVRQAPGKGLE

GAYSIY
WYQQKPGSP

WVGFIRNKANGGTTE

PQYLLRYKSDSDKQ

YAASVKG
RFTISRDDS

QGS
GVSSRFSASKDA

KNTLYLQMNSLRAED

SANAGILLISGLQSED

TAVYYCARDRGLRFY

EADYYCMIWHSGAS

FDY
WGQGTTVTVSS

AV
FGGGTKLTVL

MT103
blinatumom
CD19
563
QVQLQQSGAELVRPGS
662
DIQLTQSPASLAVSLG

SVKISCKASGYAFSSY

QRATISCKASQSVDY

WMN
WVKQRPGQGLE

DGDSY
LNWYQQIPG

at

WIGQIWPGDGDTNYN

QPPKLLIYDASNLVS

GKFKG
KATLTADESSS

GIPPRFSGSGSGTDFT

TAYMQLSSLASEDSAV

LNIHPVEKVDAATYH

YFCARRETTTVGRYY

CQQSTEDPWTFGGG

YAMDY
WGQGTTVTVS

TKLEIK

S

Arzerra
ofatumumab
CD20
564
EVQLVESGGGLVQPGR
663
EIVLTQSPATLSLSPG

SLRLSCAASGFTFNDY

ERATLSCRASQSVSS

AMH
WVRQAPGKGLE

YLA
WYQQKPGQAPR

WVSTISWNSGSIGYAD

LLIYDASNRATGIPAR

SVKG
RFTISRDNAKKS

FSGSGSGTDFTLTISS

LYLQMNSLRAEDTAL

LEPEDFAVYYCQQRS

YYCAKDIQYGNYYYG

NWPIT
FGQGTRLEIK

MDV
WGQGTTVTVSS

Bexxar™
tositumomab
CD20
565
QAYLQQSGAELVRPG
664
QIVLSQSPAILSASPG

ASVKMSCKASGYTFTS

EKVTMTCRASSSVSY

YNMH
WVKQTPRQGLE

MHW
YQQKPGSSPKP

WIGAIYPGNGDTSYN

WIYAPSNLASGVPAR

QKFKG
KATLTVDKSS

FSGSGSGTSYSLTISR

STAYMQLSSLTSEDSA

VEAEDAATYYCQQW

VYFCARVVYYSNSYW

SFNPPT
FGAGTKLEL

YFDV
WGTGTTVTVSG

K

GAZYVA
Obinutuzumab
CD20
566
QVQLVQSGAEVKKPG
665
DIVMTQTPLSLPVTPG

SSVKVSCKASGYAFSY

EPASISCRSSKSLLHS

SWIN
WVRQAPGQGLE

NGITYLY
WYLQKPG

WMGRIFPGDGDTDYN

QSPQLLIYQMSNLVS

GKFKG
RVTITADKSTS

GVPDRFSGSGSGTDF

TAYMELSSLRSEDTAV

TLKISRVEAEDVGVY

YYCARNVFDGYWLV

YCAQNLELPYTFGG

Y
WGQGTLVTVSS

GTKVEIK

Ocrelizuma
CD20
567
EVQLVESGGGLVQPGG
666
DIQMTQSPSSLSASVG

b/2H7 v16

SLRLSCAASGYTFTSY

DRVTITCRASSSVSY

N
MHWVRQAPGKGLE

MH
WYQQKPGKAPKP

WVGAIYPGNGDTSYN

LIYAPSNLASGVPSRF

QKFKGRFTISVDKSKN

SGSGSGTDFTLTISSL

TLYLQMNSLRAEDTA

QPEDFATYYCQQWS

VYYCARVVYYSNSYW

FNPPT
FGQGTKVEIK

YFDV
WGQGTLVTVSS

Rituxan™
rituximab
CD20
568
QVQLQQPGAELVKPG
667
QIVLSQSPAILSASPG

ASVKMSCKASGYTFT

EKVTMTCRASSSVSY

SY
NMHWVKQTPGRGL

IHWFQQKPGSSPKPW

EWIGAIYPGNGDTSYN

IYATSNLASGVPVRFS

QKFKGKATLTADKSSS

GSGSGTSYSLTISRVE

TAYMQLSSLTSEDSAV

AEDAATYYCQQWTS

YYCARSTYYGGDWY

NPPT
FGGGTKLEIK

FNV
WGAGTTVTVSA

Zevalin™
ibritumomab
CD20
569
QAYLQQSGAELVRPG
668
QIVLSQSPAILSASPG

tieuxetan

ASVKMSCKASGYTFT

EKVTMTCRASSSVSY

SYN
MHWVKQTPRQGL

MH
WYQQKPGSSPKP

EWIGAIYPGNGDTSYN

WIYAPSNLASGVPAR

QKF
KGKATLTVDKSSS

FSGSGSGTSYSLTISR

TAYMQLSSLTSEDSAV

VEAEDAATYYCQQW

YFCARVVYYSNSYWY

SFNPPT
FGAGTKLEL

FDV
WGTGTTVTVSA

K

Mylotarg
Gemtuzumab
CD33
570
QLVQSGAEVKKPGSSV
669
DIQLTQSPSTLSASVG

(hP67.6)

KVSCKASGYTITDSNI

DRVTITCRASESLDN

H
WVRQAPGQSLEWIG

YGIRFLT
WFQQKPG

YIYPYNGGTDYNQKF

KAPKLLMYAASNQG

KN
RATLTVDNPTNTA

S
GVPSRFSGSGSGTEF

YMELSSLRSEDTDFYY

TLTISSLQPDDFATYY

CVNGNPWLAYWGQG

CQQTKEVPWSFGQG

TLVTVSS

TKVEVK

Daratumumab

CD38
571
EVQLLESGGGLVQPGG
670
EIVLTQSPATLSLSPG

SLRLSCAVSGFTFNSF

ERATLSCRASQSVSS

A
MSWVRQAPGKGLE

Y
LAWYQQKPGQAPR

WVSAISGSGGGTYYA

LLIYDASNRATGIPAR

DSVKGRFTISRDNSKN

FSGSGSGTDFTLTISS

TLYLQMNSLRAEDTA

LEPEDFAVYYCQQRS

VYFCAKDKILWFGEP

NWPPT
FGQGTKVEIK

VFDY
WGQGTLVTVSS

1F6
CD70
572
QIQLVQSGPEVKKPGE
671
DIVLTQSPASLAVSLG

TVKISCKASGYTFTNY

QRATISCRASKSVSTS

GMN
WVKQAPGKGLK

GYSFMH
WYQQKPG

WMGWINTYTGEPTY

QPPKLLIYLASNLESG

ADAFKG
RFAFSLETSA

VPARFSGSGSGTDFT

STAYLQINNLKNEDTA

LNIHPVEEEDAATY

TYFCARDYGDYGMDY

YCQHSREVPWTFGG

WGQGTSVTVSS

GTKLEIK

2F2
CD70
573
QVQLQQSGTELMTPG
672
DIVLTQSPASLTVSLG

ASVTMSCKTSGYTFST

QKTTISCRASKSVSTS

YWIE
WVKQRPGHGLE

GYSFMH
WYQLKPGQ

WIGEILGPSGYTDYNE

SPKLLIYLASDLPSGV

KFKA
KATFTADTSSNT

PARFSGSGSGTDFTL

AYMQLSSLASEDSAVY

KIHPVEEEDAATY

YCARWDRLYAMDYW

YCQHSREIPYTFGGG

GGGTSVTVSS

TKLEIT

2H5
CD70
574
QVQLVESGGGVVQPG
673
EIVLTQSPATLSLSPG

RSLRLSCAASGFTFSSY

ERATLSCRASQSVSS

IMH
WVRQAPGKGLEW

YLA
WYQQKPGQAPR

VAVISYDGRNKYYAD

LLIYDASNRATGIPAR

SVKG
RFTISRDNSKNT

FSGSGSGTDFTLTISS

LYLQMNSLRAED

LEPEDFAVYYCQQ

TAVYYCARDTDGYDF

RTNWPLT
FGGGTKV

DY
WGQGTLVTVSS

EIK

10B4
CD70
575
QIQLVESGGGVVQPGR
674
AIQLTQSPSSLSASVG

SLRLSCAASGFTFGYY

DRVTITCRASQGISSA

AMH
WVRQAPGKGLE

LA
WYQQKPGKAPKF

WVAVISYDGSIKYYA

LIYDASSLESGVPSRF

DSVKG
RFTISRDNSKN

SGSGSGTDFTLTISSL

TLYLQMNSLRAED

QPEDFATYYCQQ

TAVYYCAREGPYSNY

FNSYPFT
FGPGTKVD

LDY
WGQGTLVTVSS

IK

8B5
CD70
576
QVQLVESGGGVVQPG
675
DIQMTQSPSSLSASVG

RSLRLSCATSGFTFSDY

DRVTITCRASQGISS

GMH
WVRQAPGKGLE

WLA
WYQQKPEKAPK

WVAVIWYDGSNKYY

SLIYAASSLQSGVPSR

ADSVKG
RFTISRDNSK

FSGSGSGTDFTLTISS

KTLSLQMNSLRAED

LQPEDFATYYCQQ

TAVYYCARDSIMVRG

YNSYPLT
FGGGTKVE

DY
WGQGTLVTVSS

IK

18E7
CD70
577
QVQLVESGGGVVQPG
676
DIQMTQSPSSLSASVG

RSLRLSCAASGFTFSD

DRVTITCRASQGISS

HGMH
WVRQAPGKGL

WLA
WYQQKPEKAPK

EWVAVIWYDGSNKY

SLIYAASSLQSGVPSR

YADSVKG
RFTISRDNS

FSGSGSGTDFTLTISS

KNTLYLQMNSLRAED

LQPEDFATYYCQQ

TAVYYCARDSIMVRG

YNSYPLT
FGGGTKVE

DY
WGQGTLVTVSS

IK

69A7
CD70
578
QVQLQESGPGLVKPSE
677
EIVLTQSPATLSLSPG

TLSLTCTVSGGSVSSD

ERATLSCRASQSVSS

YYYWS
WIRQPPGKGL

YLA
WYQQKPGQAPR

EWLGYIYYSGSTNYNP

LLIFDASNRATGIPAR

SLKS
RVTISVDTSKNQF

FSGSGSGTDFTLTISS

SLKLRSVTTA

LEPEDFAVYYCQQ

DTAVYYCARGDGDYG

RSNWPLT
FGGGTKV

GNCFDYWGQGTLVTV

EIK

SS

CE-

cMET
579
QVQLVQSGAEVKKPG
678
DIQMTQSPSSVSASV

355621

ASVKVSCKASGYTFTS

GDRVTITCRASQGIN

YGFS
WVRQAPGQGLE

TWLA
WYQQKPGKA

WMGWISASNGNTYY

PKLLIYAASSLKSGVP

AQKLQG
RVTMTTDTS

SRFSGSGSGTDFTLTI

TSTAYMELRSLRSDDT

SSLQPEDFATYYCQQ

AVYYCARVYADYADY

ANSFPLT
FGGGTKVE

WGQGTLVTVSS

IK

LY28753
emibetuzumab
cMET
580
QVQLVQSGAEVKKPG
679
DIQMTQSPSSLSASVG

58

ASVKVSCKASGYTFT

DRVTITCSVSSSVSSI

DYY
MHWVRQAPGQG

Y
LHWYQQKPGKAPK

LEWMGRVNPNRRGTT

LLIYSTSNLASGVPSR

YNQKFEGRVTMTTDTS

FSGSGSGTDFTLTISS

TSTAYMELRSLRSDDT

LQPEDFATYYCQVYS

AVYYCARANWLDYW

GYPLT
FGGGTKVEIK

GQGTTVTVSS

MetMAb
onartuzumab
cMET
581
EVQLVESGGGLVQPGG
680
DIQMTQSPSSLSASVG

SLRLSCAASGYTFTSY

DRVTITCKSSQSLLY

WLH
WVRQAPGKGLE

TSSQKNYLA
WYQQK

WVGMIDPSNSDTRFN

PGKAPKLLIYWASTR

PNFKD
RFTISADTSKN

ES
GVPSRFSGSGSGT

TAYLQMNSLRAEDTA

DFTLTISSLQPEDFAT

VYYCATYRSYVTPLD

YYCQQYYAYPWTFG

Y
WGQGTLVTVSS

QGTKVEIK

tremelimumab
CTLA
582
QVQLVESGGGVVQPG
681
DIQMTQSPSSLSASVG

(CP-675206,
4

RSLRLSCAASGFTFSS

DRVTITCRASQSINSY

or 11.2.1)

YGMH
WVRQAPGKGL

LD
WYQQKPGKAPKL

EWVAVIWYDGSNKY

LIYAASSLQSGVPSRF

YADSV
KGRFTISRDNS

SGSGSGTDFTLTISSL

KNTLYLQMNSLRAED

QPEDFATYYCQQYY

TAVYYCARDPRGATL

STPFT
FGPGTKVEIK

YYYYYGMDV
WGQGT

TVTVSS

Yervoy
Ipilimumab
CTLA
583
QVQLVESGGGVVQPG
682
EIVLTQSPGTLSLSPG

10D1
4

RSLRLSCAASGFTFSSY

ERATLSCRASQSVGS

TMH
WVRQAPGKGLE

SYLA
WYQQKPGQAP

WVTFISYDGNNKYYA

RLLIYGAFSRATGIPD

DSVKG
RFTISRDNSKN

RFSGSGSGTDFTLTIS

TLYLQMNSLRAEDTAI

RLEPEDFAVYYCQQ

YYCARTGWLGPFDY

YGSSPWT
FGQGTKV

WGQGTLVTVSS

EIK

AGS16F
H16-7.8
ENPP3
584
QVQLQESGPGLVKPSQ
683
EIVLTQSPDFQSVTPK

TLSLTCTVSGGSISSGG

EKVTITCRASQSIGIS

YY
WSWIRQHPGKGLE

LH
WYQQKPDQSPKL

WIGIIYYSGSTYYNPSL

LIKYASQSFSGVPSRF

KS
RVTISVDTSKNQFSL

SGSGSGTDFTLTINSL

KLNSVTAADTAVFYC

EAEDAATYYCHQSR

ARVAIVTTIPGGMDV

SFPWT
FGQGTKVEIK

WGQGTTVTVSS

MT110
solitomab
EpCA
585
EVQLLEQSGAELVRPG
684
ELVMTQSPSSLTVTA

M

TSVKISCKASGYAFTN

GEKVTMSCKSSQSLL

YWLG
WVKQRPGHGL

NSGNQKNYLT
WYQ

EWIGDIFPGSGNIHYN

QKPGQPPKLLIYWAS

EKFKG
KATLTADKSSS

TRES
GVPDRFTGSGS

TAYMQLSSLTFEDSAV

GTDFTLTISSVQAEDL

YFCARLRNWDEPMD

AVYYCQNDYSYPLT

Y
WGQGTTVTVSS

FGAGTKLEIK

MT201
Adecatumumab
EpCA
586
EVQLLESGGGVVQPGR
685
ELQMTQSPSSLSASV

M

SLRLSCAASGFTFSSYG

GDRVTITCRTSQSISS

MH
WVRQAPGKGLEW

YLN
WYQQKPGQPPK

VAVISYDGSNKYYAD

LLIYWASTRESGVPD

SVKG
RFTISRDNSKNT

RFSGSGSGTDFTLTIS

LYLQMNSLRAEDTAV

SLQPEDSATYYCQQS

YYCAKDMGWGSGW

YDIPYT
FGQGTKLEI

RPYYYYGMDV
WGQG

K

TTVTVSS

Panorex
Edrecolomab
EpCA
587
QVQLQQSGAELVRPGT
686
NIVMTQSPKSMSMSV

Mab CO17-
M

SVKVSCKASGYAFTN

GERVTLTCKASENVV

1A

YL
IEWVKQRPGQGLE

TY
VSWYQQKPEQSP

WIGVINPGSGGTNYNE

KLLIYGASNRYTGVP

KFKGKATLTADKSSST

DRFTGSGSATDFTLTI

AYMQLSSLTSDDSAVY

SSVQAEDLADYHCG

FCARDGPWFAYWGQ

QGYSYPYT
FGGGTK

GTLVTVSA

LEIK

tucotuzumab
EpCA
588
QIQLVQSGPELKKPGE
687
QILLTQSPAIMSASPG

M

TVKISCKASGYTFTNY

EKVTMTCSASSSVSY

GMN
WVRQAPGKGLK

MLWYQQKPGSSPKP

WMGWINTYTGEPTY

WIFDTSNLASGFPAR

AD
DFKGRFVFSLETSA

FSGSGSGTSYSLIISSM

STAFLQLNNLRSEDTA

EAEDAATYYCHQRS

TYFCVRFISKGDYWGQ

GYPYT
FGGGTKLEIK

GTSVTVSS

UBS-54

EpCA
589
VQLQQSDAELVKPGAS
688
DIVMTQSPDSLAVSL

M

VKISCKASGYTFTDHA

GERATINCKSSQSVL

IH
WVKQNPEQGLEWI

YSSNNKNYLA
WYQQ

GYFSPGNDDFKYNER

KPGQPPKLLIYWAST

FKGKATLTADKSSSTA

RES
GVPDRFSGSGSG

YVQLNSLTSEDSAVYF

TDFTLTISSLQAEDVA

CTRSLNMAYWGQGTS

VYYCQQYYSYPLTF

VTVSS

GGGTKVKES

3622W94
323/A3
EpCA
590
EVQLVQSGPEVKKPGA
689
DIVMTQSPLSLPVTPG

M

SVKVSCKASGYTFTN

EPASISCRSSINKKGS

YGMN
WVRQAPGQGL

NGITY
LYWYLQKPG

EWMGWINTYTGEPTY

QSPQLLIYQMSNLAS

GE
DFKGRFAFSLDTSA

GVPDRFSGSGSGTDF

STAYMELSSLRSEDTA

TLKISRVEAEDVGVY

VYFCARFGNYVDYWG

YCAQNLEIPRTFGQG

QGSLVTVSS

TKVEIK

4D5MOC

EpCA
591
EVQLVQSGPGLVQPGG
690
DIQMTQSPSSLSASVG

Bv2

M

SVRISCAASGYTFTNY

DRVTITCRSTKSLLH

GMN
WVKQAPGKGLE

SNGITYLY
WYQQKP

WMGWINTYTGESTY

GKAPKLLIYQMSNLA

ADSFKG
RFTFSLDTSA

S
GVPSRFSSSGSGTDF

SAAYLQINSLRAEDTA

TLTISSLQPEDFATYY

VYYCARFAIKGDYWG

CAQNLEIPRTFGQGT

QGTLLTVSS

KVEIK

4D5MOC

EpCA
592
EVQLVQSGPGLVQPGG
691
DIQMTQSPSSLSASVG

B

M

SVRISCAASGYTFTNY

DRVTITCRSTKSLLH

GMN
WVKQAPGKGLE

SNGITYLY
WYQQKP

WMGWINTYTGESTY

GKAPKLLIYQMSNLA

ADSFKG
RFTFSLDTSA

S
GVPSRFSSSGSGTDF

SAAYLQINSLRAEDTA

TLTISSLQPEDFATYY

VYYCARFAIKGDYWG

CAQNLEIPRTFGQGT

QGTLLTVSS

KVELK

MEDI-
1C1
EphA2
593
EVQLLESGGGLVQPGG
692
DIQMTQSPSSLSASVG

547

SLRLSCAASGFTFSHY

DRVTITCRASQSIST

MMA
WVRQAPGKGLE

WLA
WYQQKPGKAP

WVSRIGPSGGPTHYA

KLLIYKASNLHTGVP

DSVKG
RFTISRDNSKN

SRFSGSGSGTEFSLTIS

TLYLQMNSLRAEDTA

GLQPDDFATYYCQQ

VYYCAGYDSGYDYVA

YNSYSRT
FGQGTKVE

VAGPAEYFQH
WGQG

IK

TLVTVSS

MORAb-
farletuzumab
FOLR1
594
EVQLVESGGGVVQPG
693
DIQLTQSPSSLSASVG

003

RSLRLSCSASGFTFSG

DRVTITCSVSSSISSN

YG
LSWVRQAPGKGLE

N
LHWYQQKPGKAPK

WVAMISSGGSYTYYA

PWIYGTSNLASGVPS

DSVKGRFAISRDNAKN

RFSGSGSGTDYTFTIS

TLFLQMDSLRPEDTGV

SLQPEDIATYYCQQW

YFCARHGDDPAWFAY

SSYPYMYT
FGQGTK

WGQGTPVTVSS

VEIK

M9346A
huMOV19
FOLR1
595
QVQLVQSGAEVVKPG
694
DIVLTQSPLSLAVSLG

(vLCv1.00)

ASVKISCKASGYTFTG

QPAIISCKASQSVSFA

YFMN
WVKQSPGQSLE

GTSLMH
WYHQKPG

WIGRIHPYDGDTFYN

QQPRLLIYRASNLEA

QKFQG
KATLTVDKSS

GVPDRFSGSGSKTDF

NTAHMELLSLTSEDFA

TLNISPVEAEDAATY

VYYCTRYDGSRAMDY

YCQQSREYPYTFGG

WGQGTTVTVSS

GTKLEIK

M9346A
huMOV19
FOLR1
596
QVQLVQSGAEVVKPG
695
DIVLTQSPLSLAVSLG

(vLCv1.60)

ASVKISCKASGYTFTG

QPAIISCKASQSVSFA

YFMN
WVKQSPGQSLE

GTSLMH
WYHQKPG

WIGRIHPYDGDTFYN

QQPRLLIYRASNLEA

QKFQG
KATLTVDKSS

GVPDRFSGSGSKTDF

NTAHMELLSLTSEDFA

TLTISPVEAEDAATY

VYYCTRYDGSRAMDY

YCQQSREYPYTFGG

WGQGTTVTVSS

GTKLEIK

26B3.F2

FOLR1
597
GPELVKPGASVKISCK
696
PASLSASVGETVTITC

ASDYSFTGYFMNWVM

RTSENIFSYLA
WYQQ

QSHGKSLEWIGRIFPY

KQGISPQLLVYNAKT

NGDTFYNQKFKG
RAT

LAE
GVPSRFSGSGSG

LTVDKSSSTAHMELRS

TQFSLKINSLQPEDFG

LASEDSAVYFCARGTH

SYYCQHHYAFPWTF

YFDY
WGQGTTLTVSS

GGGSKLEIK

RG7686
GC33
GPC3
598
QVQLVQSGAEVKKPG
697
DVVMTQSPLSLPVTP

ASVKVSCKASGYTFTD

GEPASISCRSSQSLVH

YEMH
WVRQAPGQGL

SNGNTYLH
WYLQKP

EWMGALDPKTGDTA

GQSPQLLIYKVSNRF

YSQKFKG
RVTLTADK

S
GVPDRFSGSGSGTD

STSTAYMELSSLTSED

FTLKISRVEAEDVGV

TAVYYCTRFYSYTYW

YYCSQNTHVPPTFG

GQGTLVTVSS

QGTKLEIK

4A6
GPC3
599
EVQLVQSGAEVKKPGE
698
EIVLTQSPGTLSLSPG

SLKISCKGSGYSFTSY

ERATLSCRAVQSVSS

WIA
WVRQMPGKGLE

SYLA
WYQQKPGQAP

WMGIIFPGDSDTRYSP

RLLIYGASSRATGIPD

SFQ
GQVTISADRSIRTA

RFSGSGSGTDFTLTIS

YLQWSSLKASD

RLEPEDFAVYYCQ

TALYYCARTREGYFD

QYGSSPT
FGGGTKVE

YWGQGTLVTVSS

IK

11E7
GPC3
600
EVQLVQSGAEVKKPGE
699
EIVLTQSPGTLSLSPG

SLKISCKGSGYSFTNY

ERATLSCRASQSVSS

WIA
WVRQMPGKGLE

SYLA
WYQQKPGQAP

WMGIIYPGDSDTRYSP

RLLIYGASSRATGIPD

SFQG
QVTISADKSIRTA

RFSGSGSGTDFTLTIS

YLQWSSLKASD

RLEPEDFAVYYCQ

TAMYYCARTREGYFD

QYGSSPT
FGGGTKVE

Y
WGQGTLVTVSS

IK

16D10
GPC3
601
EVQLVQSGADVTKPGE
700
EILLTQSPGTLSLSPG

SLKISCKVSGYRFTNY

ERATLSCRASQSVSS

WIG
WMRQMSGKGLE

SYLA
WYQQKPGQAP

WMGIIYPGDSDTRYSP

RLLIYGASSRATGIPD

SFQG
HVTISADKSINTA

RFSGSGSGTDFTLTIS

YLRWSSLKASD

RLEPEDFAVYYCQ

TAIYYCARTREGFFDY

QYGSSPT
FGQGTKVE

WGQGTPVTVSS

IK

AMG-595

HER1
602
QVQLVESGGGVVQSG
701
DTVMTQTPLSSHVTL

(EGFR)

RSLRLSCAASGFTFRN

GQPASISCRSSQSLV

YGMH
WVRQAPGKGL

HSDGNTYLS
WLQQR

EWVAVIWYDGSDKY

PGQPPRLLIYRISRRF

YADSVRG
RFTISRDNS

S
GVPDRFSGSGAGTD

KNTLYLQMNSLRAED

FTLEISRVEAEDVGV

TAVYYCARDGYDILT

YYCMQSTHVPRTFG

GNPRDFDY
WGQGTLV

QGTKVEIK

TVSS

Erubitux™
cetutximab
HER1
603
QVQLKQSGPGLVQPSQ
702
DILLTQSPVILSVSPGE

(EGFR)

SLSITCTVSGFSLTNYG

RVSFSCRASQSIGTNI

VHWVRQSPGKGLEWL

HWYQQRTNGSPRLLI

GVIWSGGNTDYNTPF

KYASESISGIPSRFSGS

TSRLSINKDNSKSQVFF

GSGTDFTLSINSVESE

KMNSLQSNDTAIYYCA

DIADYYCQQNNNWP

RALTYYDYEFAY
WGQ

TT
FGAGTKLELK

GTLVTVSA

GA201
Imgatuzumab
HER1
604
QVQLVQSGAEVKKPG
703
DIQMTQSPSSLSASVG

(EGFR)

SSVKVSCKASGFTFTD

DRVTITCRASQGINN

YKIH
WVRQAPGQGLE

YLN
WYQQKPGKAPK

WMGYFNPNSGYSTYA

RLIYNTNNLQTGVPS

QKFQG
RVTITADKSTS

RFSGSGSGTEFTLTISS

TAYMELSSLRSEDTAV

LQPEDFATYYCLQH

YYCARLSPGGYYVMD

NSFPT
FGQGTKLEIK

A
WGQGTTVTVSS

Humax
zalutumumab
HER1
605
QVQLVESGGGVVQPG
704
AIQLTQSPSSLSASVG

(EGFR)

RSLRLSCAASGFTFSTY

DRVTITCRASQDISSA

GMH
WVRQAPGKGLE

LV
WYQQKPGKAPKL

WVAVIWDDGSYKYY

LIYDASSLESGVPSRF

GDSVKG
RFTISRDNSK

SGSESGTDFTLTISSL

NTLYLQMNSLRAEDT

QPEDFATYYCQQFNS

AVYYCARDGITMVRG

YPLT
FGGGTKVEIK

VMKDYFDY
WGQGTL

VTVSS

IMC-11F8
necitumumab
HER1
606
QVQLQESGPGLVKPSQ
705
EIVMTQSPATLSLSPG

(EGFR)

TLSLTCTVSGGSISSGD

ERATLSCRASQSVSS

YYWS
WIRQPPGKGLE

YLA
WYQQKPGQAPR

WIGYIYYSGSTDYNPS

LLIYDASNRATGIPAR

LKS
RVTMSVDTSKNQF

FSGSGSGTDFTLTISS

SLKVNSVTAADTAVY

LEPEDFAVYYCHQY

YCARVSIFGVGTFDY

GSTPLT
FGGGTKAEI

WGQGTLVTVSS

K

MM-151
PIX
HER1
607
QVQLVQSGAEVKKPG
706
DIQMTQSPSTLSASV

(EGFR)

SSVKVSCKASGGTFSS

GDRVTITCRASQSISS

YAIS
WVRQAPGQGLE

WWA
WYQQKPGKAP

WMGSIIPIFGTVNYAQ

KLLIYDASSLESGVPS

KFQG
RVTITADESTST

RFSGSGSGTEFTLTISS

AYMELSSLRSEDTAVY

LQPDDFATYYCQQY

YCARDPSVNLYWYFD

HAHPTT
FGGGTKVEI

L
WGRGTLVTVSS

K

MM-151
P2X
HER1
608
QVQLVQSGAEVKKPG
707
DIVMTQSPDSLAVSL

(EGFR)

SSVKVSCKASGGTFGS

GERATINCKSSQSVL

YAIS
WVRQAPGQGLE

YSPNNKNYLA
WYQQ

WMGSIIPIFGAANPAQ

KPGQPPKLLIYWAST

KSQG
RVTITADESTST

RES
GVPDRFSGSGSG

AYMELSSLRSEDTAVY

TDFTLTISSLQAEDVA

YCAKMGRGKVAFDI

VYYCQQYYGSPITFG

WGQGTMVTVSS

GGTKVEIK

MM-151
P3X
HER1
609
QVQLVQSGAEVKKPG
708
EIVMTQSPATLSVSPG

(EGFR)

ASVKVSCKASGYAFTS

ERATLSCRASQSVSS

YGIN
WVRQAPGQGLE

NLA
WYQQKPGQAPR

WMGWISAYNGNTYY

LLIYGASTRATGIPAR

AQKLRG
RVTMTTDTS

FSGSGSGTEFTLTISSL

TSTAYMELRSLRSDDT

QSEDFAVYYCQDYR

AVYYCARDLGGYGSG

TWPRRV
FGGGTKVE

SVPFDP
WGQGTLVTVSS

IK

TheraCIM
nimotuzumab
HER1
610
QVQLQQSGAEVKKPG
709
DIQMTQSPSSLSASVG

(EGFR)

SSVKVSCKASGYTFTN

DRVTITCRSSQNIVHS

YYIY
WVRQAPGQGLE

NGNTYLD
WYQQTPG

WIGGINPTSGGSNFNE

KAPKLLIYKVSNRFS

KFKT
RVTITADESSTT

GVPSRFSGSGSGTDFT

AYMELSSLRSEDTAFY

FTISSLQPEDIATYYC

FCTRQGLWFDSDGRG

FQYSHVPWT
FGQGT

FDF
WGQGTTVTVSS

KLQIT

Vectibix™
panitumimab
HER1
611
QVQLQESGPGLVKPSE
710
DIQMTQSPSSLSASVG

(EGFR)

TLSLTCTVSGGSVSSG

DRVTITCQASQDISN

DYY
WTWIRQSPGKGL

Y
LNWYQQKPGKAPK

EWIGHIYYSGNTNYNP

LLIYDASNLETGVPSR

SLKSRLTISIDTSKTQFS

FSGSGSGTDFTFTISSL

LKLSSVTAADTAIYYC

QPEDIATYFCQHFDH

VRDRVTGAFDI
WGQG

LPLA
FGGGTKVEIK

TMVTVSS

07D06

HER1
612
QIQLVQSGPELKKPGE
711
DVVMTQTPLSLPVSL

(EGFR)

TVKISCKASGYTFTEY

GDQASISCRSSQSLV

P
IHWVKQAPGKGFKW

HSNGNTY
LHWYLQK

MGMIYTDIGKPTYAE

PGQSPKLLIYKVSNR

EFKGRFAFSLETSASTA

FSGVPDRFSGSGSGT

YLQINNLKNEDTATYF

DFTLKISRVEAEDLG

CVRDRYDSLFDYWGQ

VYFCSQSTHVPWTF

GTTLTVSS

GGGTKLEIK

12D03

HER1
613
EMQLVESGGGFVKPG
712
DVVMTQTPLSLPVSL

(EGFR)

GSLKLSCAASGFAFSH

GDQASISCRSSQSLV

YDMS
WVRQTPKQRLE

HSNGNTYLH
WYLQK

WVAYIASGGDITYYA

PGQSPKLLIYKVSNR

DTVKG
RFTISRDNAQN

FS
GVPDRFSGSGSGT

TLYLQMSSLKSEDTAM

DFTLKISRVEAEDLG

FYCSRSSYGNNGDAL

VYFCSQSTHVLTFGS

DF
WGQGTSVTVSS

GTKLEIK

C1
HER2
614
QVQLVESGGGLVQPG
713
QSPSFLSAFVGDRITIT

GSLRLSCAASGFTFSSY

CRASPGIRNYLAWY

AMG
WVRQAPGKGLE

QQKPGKAPKLLIYAA

WVSSISGSSRYIYYAD

STLQS
GVPSRFSGSGS

SVKG
RFTISRDNSKNT

GTDFTLTISSLQPEDF

LYLQMNSLRAEDTAV

ATYYCQQYNSYPLSF

YYCAKMDASGSYFNF

GGGTKVEIK

WGQGTLVTVSS

Erbicin

HER2
615
QVQLLQSAAEVKKPGE
714
QAVVTQEPSFSVSPG

SLKISCKGSGYSFTSY

GTVTLTCGLSSGSVS

WIG
WVRQMPGKGLE

TSYYPS
WYQQTPGQ

WMGIIYPGDSDTRYSP

APRTLIYSTNTRSSGV

SFQG
QVTISADKSISTA

PDRFSGSILGNKAALT

YLQWSSLKASDTAVY

ITGAQADDESDYYCV

YCARWRDSPLWGQGT

LYMGSGQYV
FGGGT

LVTVSS

KLTVL

Herceptin
trastuzumab
HER2
616
EVQLVESGGGLVQPGG
715
DIQMTQSPSSLSASVG

SLRLSCAASGFNIKDT

DRVTITCRASQDVNT

Y
IHWVRQAPGKGLEW

A
VAWYQQKPGKAPK

VARIYPTNGYTRYADS

LLIYSASFLYSGVPSR

VKGRFTISADTSKNTA

FSGSRSGTDFTLTISSL

YLQMNSLRAEDTAVY

QPEDFATYYCQQHY

YCSRWGGDGFYAMD

TTPPT
FGQGTKVEIK

Y
WGQGTLVTVSS

MAGH22
margetuximab
HER2
617
QVQLQQSGPELVKPGA
716
DIVMTQSHKFMSTSV

SLKLSCTASGFNIKDT

GDRVSITCKASQDVN

Y
IHWVKQRPEQGLEWI

TA
VAWYQQKPGHSP

GRIYPTNGYTRYDPKF

KLLIYSASFRYTGVPD

QDKATITADTSSNTAY

RFTGSRSGTDFTFTIS

LQVSRLTSEDTAVYYC

SVQAEDLAVYYCQQ

SRWGGDGFYAMDY
W

HYTTPPT
FGGGTKVE

GQGASVTVSS

IK

MM-302
F5
HER2
618
QVQLVESGGGLVQPG
717
QSVLTQPPSVSGAPG

GSLRLSCAASGFTFRSY

QRVTISCTGSSSNIGA

AMS
WVRQAPGKGLE

GYGVH
WYQQLPGTA

WVSAISGRGDNTYYA

PKLLIYGNTNRPSGV

DSVKG
RFTISRDNSKN

PDRFSGFKSGTSASLA

TLYLQMNSLRAEDTA

ITGLQAEDEADYYCQ

VYYCAKMTSNAFAFD

FYDSSLSGWV
FGGG

Y
WGQGTLVTVSS

TKLTVL

Perjeta
pertuzumab
HER2
619
EVQLVESGGGLVQPGG
718
DIQMTQSPSSLSASVG

SLRLSCAASGFTFTDY

DRVTITCKASQDVSI

T
MDWVRQAPGKGLE

G
VAWYQQKPGKAPK

WVADVNPNSGGSIYN

LLIYSASYRYTGVPSR

QRFKGRFTLSVDRSKN

FSGSGSGTDFTLTISS

TLYLQMNSLRAEDTA

LQPEDFATYYCQQY

VYYCARNLGPSFYFD

YIYPYT
FGQGTKVEI

Y
WGQGTLVTVSS

K

MM-121/

HER3
620
EVQLLESGGGLVQPGG
719
QSALTQPASVSGSPG

SAR2562

SLRLSCAASGFTFSHY

QSITISCTGTSSDVGS

12

VMA
WVRQAPGKGLE

YNVVS
WYQQHPGKA

WVSSISSSGGWTLYA

PKLIIYEVSQRPSGVS

DSVKG
RFTISRDNSKN

NRFSGSKSGNTASLTI

TLYLQMNSLRAEDTA

SGLQTEDEADYYCCS

VYYCTRGLKMATIFD

YAGSSIFVI
FGGGTK

Y
WGQGTLVTVSS

VTVL

MEHD79
Duligotumab
HER1
621
EVQLVESGGGLVQPGG
720
DIQMTQSPSSLSASVG

45A

(EGFR)/

SLRLSCAASGFTLSGD

DRVTITCRASQNIAT

HER3

WIH
WVRQAPGKGLE

DVA
WYQQKPGKAPK

WVGEISAAGGYTDYA

LLIYSASFLYSGVPSR

DSVKG
RFTISADTSKN

FSGSGSGTDFTLTISS

TAYLQMNSLRAEDTA

LQPEDFATYYCQQSE

VYYCARESRVSFEAA

PEPYT
FGQGTKVEIK

MDY
WGQGTLVTVSS

MM-111

HER2/
622
QVQLQESGGGLVKPG
721
QSALTQPASVSGSPG

3

GSLRLSCAASGFTFSSY

QSITISCTGTSSDVGG

WMS
WVRQAPGKGLE

YNFVS
WYQQHPGKA

WVANINRDGSASYYV

PKLMIYDVSDRPSGV

DSVKG
RFTISRDDAKN

SDRFSGSKSGNTASLI

SLYLQMNSLRAEDTAV

ISGLQADDEADYYCS

YYCARDRGVGYFDL

SYGSSSTHVI
FGGGT

WGRGTLVTVSS

KVTVL

MM-111

HER2/
623
QVQLVQSGAEVKKPG
722
QSVLTQPPSVSAAPGQ

3

ESLKISCKGSGYSFTSY

KVTISCSGSSSNIGNN

WIA
WVRQMPGKGLEY

YVS
WYQQLPGTAPK

MGLIYPGDSDTKYSPS

LLIYDHTNRPAGVPD

FQG
QVTISVDKSVSTA

RFSGSKSGTSASLAIS

YLQWSSLKPSDSAVYF

GFRSEDEADYYCAS

CARHDVGYCTDRTCA

WDYTLSGWV
FGGG

KWPEWLGV
WGQGTL

TKLTVL

VTVSS

Hu3S193
Lewis-
624
EVQLVESGGGVVQPG
723
DIQMTQSPSSLSASVG

Y

RSLRLSCSTSGFTFSDY

DRVTITCRSSQRIVHS

YMY
WVRQAPGKGLE

NGNTYLE
WYQQTPG

WVAYMSNVGAITDYP

KAPKLLIYKVSNRFS

DTVKG
RFTISRDNSKN

GVPSRFSGSGSGTDFT

TLFLQMDSLRPEDTGV

FTISSLQPEDIATYYC

YFCARGTRDGSWFAY

FQGSHVPFT
FGQGT

WGQGTPVTVSS

KLQIT

BAY 94-
anetumab
Mesothelin
625
QVELVQSGAEVKKPGE
724
DIALTQPASVSGSPGQ

9343
ravtansine

SLKISCKGSGYSFTSY

SITISCTGTSSDIGGY

W
IGWVRQAPGKGLEW

NS
VSWYQQHPGKAP

MGIIDPGDSRTRYSPSF

KLMIYGVNNRPSGVS

QGQVTISADKSISTAYL

NRFSGSKSGNTASLTI

QWSSLKASDTAMYYC

SGLQAEDEADYYCSS

ARGQLYGGTYMDG
W

YDIESATPV
FGGGTK

GQGTLVTVSS

LTVL

SS1
Mesothelin
626
QVQLQQSGPELEKPGA
725
DIELTQSPAIMSASPG

SVKISCKASGYSFTGYT

EKVTMTCSASSSVSY

MNWVKQSHGKSLEWI

MHWYQQKSGTSPKR

GLITPYNGASSYNQKF

WIYDTSKLASGVPGR

RGKATLTVDKSSSTAY

FSGSGSGNSYSLTISS

MDLLSLTSEDSAVYFC

VEAEDDATYYCQQW

ARGGYDGRGFDYWGQ

SGYPLTFGAGTKLEIK

GTTVTVSS

Mesothelin
627
QVYLVESGGGVVQPG
726
EIVLTQSPATLSLSPG

RSLRLSCAASGITFSIY

ERATLSCRASQSVSS

GMH
WVRQAPGKGLE

YLA
WYQQKPGQAPR

WVAVIWYDGSHEYY

LLIYDASNRATGIPAR

ADSVKG
RFTISRDNSK

FSGSGSGTDFTLTISS7

NTLYLLMNSLRAED

LEPEDFAVYYCQQ

TAVYYCARDGDYYDS

RSNWPLT
FGGGTKV

GSPLDY
WGQGTLVTV

EIK

SS

Mesothelin
628
QVHLVESGGGVVQPG
727
EIVLTQSPATLSLSPG

RSLRLSCVASGITFRIY

ERATLSCRASQSVSS

GM
HWVRQAPGKGLE

YLA
WYQQKPGQAPR

WVAVLWYDGSHEYY

LLIYDASNRATGIPAR

ADSVKG
RFTISRDNSK

FSGSGSGTDFTLTISS

NTLYLQMNSLRAED

LEPEDFAVYYCQQ

TAIYYCARDGDYYDS

RSNWPLT
FGGGTKV

GSPLDY
WGQGTLVTV

EIK

SS

Mesothelin
629
EVHLVESGGGLVQPGG
728
EIVLTQSPGTLSLSPG

SLRLSCAASGFTFSRY

ERATLSCRASQSVSS

WMS
WVRQAQGKGLE

SYLA
WYQQKPGQAP

WVASIKQAGSEKTYV

RLLIYGASSRATGIPD

DSVKG
RFTISRDNAKN

RFSGSGSGTDFTLTIS

SLSLQMNSLRAED

RLEPEDFAVYYCQ

TAVYYCAREGAYYYD

QYGSSQYT
FGQGTK

SASYYPYYYYYSMDV

LEIK

WGQGTTVTVSS

MORAb-
amatuximab
Mesothelin
630
QVQLQQSGPELEKPGA
729
DIELTQSPAIMSASPG

009

SVKISCKASGYSFTGY

EKVTMTCSASSSVSY

TMN
WVKQSHGKSLE

MH
WYQQKSGTSPKR

WIGLITPYNGASSYNQ

WIYDTSKLASGVPGR

KFRG
KATLTVDKSSST

FSGSGSGNSYSLTISS

AYMDLLSLTSEDSAVY

VEAEDDATYYCQQW

FCARGGYDGRGFDY

SKHPLT
FGSGTKVEI

WGSGTPVTVSS

K

hPAM4

MUC-
631
EVQLQESGPELVKPGA
730
DIVMTQSPAIMSASP

1

SVKMSCKASGYTFPSY

GEKVTMTCSASSSVS

VLH
WVKQKPGQGLE

SSYLY
WYQQKPGSSP

WIGYINPYNDGTQYN

KLWIYSTSNLASGVP

EKFKG
KATLTSDKSSS

ARFSGSGSGTSYSLTI

TAYMELSRLTSED

SSMEAEDAASYFCH

SAVYYCARGFGGSYG

QWNRYPYT
FGGGTK

FAY
WGQGTLITVSA

LEIK

hPAM4-
clivatuzumab
MUC1
632
QVQLQQSGAEVKKFG
731
DIQLTQSPSSLSASVG

Cide

ASVKVSCEASGYTFPS

DRVTMTCSASSSVSS

YVLH
WVKQAPGQGLE

SYLY
WYQQKPGKAP

WIGYINPYNDGTQTN

KLWIYSTSNLASGVP

KKFKG
KATLTRDTSIN

ARFSGSGSGTDFTLTI

TAYMELSRLRSDDTAV

SSLQPEDSASYFCHQ

YYCARGFGGSYGFAY

WNRYPYT
FGGGTRL

NGQGTLVTVSS

EIK

SAR5666
huDS6v1.01
MUC1
754
QAQLQVSGAEVVKPG
732
EIVLTQSPATMSASPG

58

ASVKMSCKASGYTFTS

ERVTITCSAHSSVSF

YNMH
WVKQTPGQGL

MH
WFQQKPGTSPKL

EWIGYIYPGNGATNY

WIYSTSSLASGVPAR

NQKFQG
KATLTADTS

FGGSGSGTSYSLTISS

SSTAYMQISSLTSEDSA

MEAEDAATYYCQQR

VYFCARGDSVPFAYW

SSFPLT
FGAGTKLEL

GQGTLVTVSA

K

Theragyn
Pemtumomab
MUC1
633
QVQLQQSGAELMKPG
733
DIVMSQSPSSLAVSV

muHMFG1

ASVKISCKATGYTFSA

GEKVTMSCKSSQSLL

YWIE
WVKQRPGHGLE

YSSNQKIYLA
WYQQ

WIGEILPGSNNSRYNE

KPGQSPKLLIYWAST

KFKG
KATFTADTSSNT

RES
GVPDRFTGGGSG

AYMQLSSLTSEDSAVY

TDFTLTISSVKAEDLA

YCSRSYDFAWFAYWG

VYYCQQYYRYPRTF

QGTPVTVSA

GGGTKLEIK

Therex
Sontuzumab
MUC1
634
QVQLVQSGAEVKKPG
734
DIQMTQSPSSLSASVG

huHMFG1

ASVKVSCKASGYTFSA

DRVTITCKSSQSLLY

AS1402

YWIE
WVRQAPGKGLE

SSNQKIYLA
WYQQK

R1150

WVGEILPGSNNSRYN

PGKAPKLLIYWASTR

EKFKG
RVTVTRDTST

ES
GVPSRFSGSGSGT

NTAYMELSSLRSEDTA

DFTFTISSLQPEDIATY

VYYCARSYDFAWFAY

YCQQYYRYPRTFGQ

WGQGTLVTVSS

GTKVEIK

MDX-

PD-L1
635
QVQLVQSGAEVKKPG
735
EIVLTQSPATLSLSPG

1105 or

SSVKVSCKTSGDTFST

ERATLSCRASQSVSS

BMS-

YAIS
WVRQAPGQGLE

YLA
WYQQKPGQAPR

936559

WMGGIIPIFGKAHYA

LLIYDASNRATGIPAR

QKFQG
RVTITADESTS

FSGSGSGTDFTLTISS

TAYMELSSLRSEDTAV

LEPEDFAVYYCQQRS

YFCARKFHFVSGSPFG

NWPT
FGQGTKVEIK

MDV
WGQGTTVTVSS

MEDI-
durvalumab
PD-L1
636
EVQLVESGGGLVQPGG
736
EIVLTQSPGTLSLSPG

4736

SLRLSCAASGFTFSRY

ERATLSCRASQRVSS

W
MSWVRQAPGKGLE

SY
LAWYQQKPGQAP

WVANIKQDGSEKYYV

RLLIYDASSRATGIPD

DSVKGRFTISRDNAKN

RFSGSGSGTDFTLTIS

SLYLQMNSLRAEDTAV

RLEPEDFAVYYCQQ

YYCAREGGWFGELA

YGSLPWT
FGQGTKV

FDY
WGQGTLVTVSS

EIK

MPDL328
atezolizumab
PD-L1
637
EVQLVESGGGLVQPGG
737
DIQMTQSPSSLSASVG

0A

SLRLSCAASGFTFSDS

DRVTITCRASQDVST

W
IHWVRQAPGKGLEW

A
VAWYQQKPGKAPK

VAWISPYGGSTYYAD

LLIYSASFLYSGVPSR

SVKGRFTISADTSKNT

FSGSGSGTDFTLTISS

AYLQMNSLRAEDTAV

LQPEDFATYYCQQY

YYCARRHWPGGFDY

LYHPAT
FGQGTKVEI

WGQGTLVTVSS

K

MSB0010
avelumab
PD-L1
638
EVQLLESGGGLVQPGG
738
QSALTQPASVSGSPG

718C

SLRLSCAASGFTFSSYI

QSITISCTGTSSDVGG

MMWVRQAPGKGLEW

YNY
VSWYQQHPGKA

VSSIYPSGGITFYADTV

PKLMIYDVSNRPSGV

KGRFTISRDNSKNTLY

SNRFSGSKSGNTASL

LQMNSLRAEDTAVYY

TISGLQAEDEADYYC

CARIKLGTVTTVDYW

SSYTSSSTRV
FGTGT

GQGTLVTVSS

KVTVL

MLN591

PSMA
639
EVQLVQSGPEVKKPGA
739
DIQMTQSPSSLSTSVG

TVKISCKTSGYTFTEY

DRVTLTCKASQDVG

TIH
WVKQAPGKGLEW

TAVD
WYQQKPGPSP

IGNINPNNGGTTYNQ

KLLIYWASTRHTGIP

KFED
KATLTVDKSTDT

SRFSGSGSGTDFTLTI

AYMELSSLRSEDTAVY

SSLQPEDFADYYCQQ

YCAAGWNFDYWGQG

YNSYPLT
FGPGTKVD

TLLTVSS

IK

MT112
pasotuxizumab
PSMA
640
QVQLVESGGGLVKPGE
740
DIQMTQSPSSLSASVG

SLRLSCAASGFTFSDY

DRVTITCKASQNVDT

Y
MYWVRQAPGKGLE

N
VAWYQQKPGQAPK

WVAIISDGGYYTYYSD

SLIYSASYRYSDVPSR

IIKGRFTISRDNAKNSL

FSGSASGTDFTLTISS

YLQMNSLKAEDTAVY

VQSEDFATYYCQQY

YCARGFPLLRHGAM

DSYPYT
FGGGTKLEI

DY
WGQGTLVTVSS

K

ROR1
641
QEQLVESGGRLVTPGG
741
ELVLTQSPSVSAALG

SLTLSCKASGFDFSAY

SPAKITCTLSSAHKT

YMS
WVRQAPGKGLE

DTID
WYQQLQGEAP

WIATIYPSSGKTYYAT

RYLMQVQSDGSYTK

WVNG
RFTISSDNAQNT

RP
GVPDRFSGSSSGA

VDLQMNSLTAAD

DRYLIIPSVQADDEA

RATYFCARDSYADDG

DY

ALFNI
WGPGTLVTISS

YCGADYIGGYVFGG

GTQLTVTG

ROR1
642
EVKLVESGGGLVKPGG
742
DIKMTQSPSSMYASL

SLKLSCAASGFTFSSYA

GERVTITCKASPDINS

MS
WVRQIPEKRLEWV

YLS
WFQQKPGKSPKT

ASISRGGTTYYPDSVK

LIYRANRLVDGVPSR

G
RFTISRDNVRNILYLQ

FSGGGSGQDYSLTINS

MSSLRSEDT

LEYEDMGIYYCLQ

AMYYCGRYDYDGYY

YDEFPYT
FGGGTKLE

AMDY
WGQGTSVTVSS

MK

ROR1
643
QSLEESGGRLVTPGTPL
743
ELVMTQTPSSVSAAV

TLTCTVSGIDLNSHWM

GGTVTINCQASQSIG

S
WVRQAPGKGLEWIGI

SYLA
WYQQKPGQPP

IAASGSTYYANWAKG

KLLIYYASNLASGVP

RFTISKTSTTVDLRIASP

SRFSGSGSGTEYTLTI

TTEDTATY

SGVQREDAATYYCLG

FCARDYGDYRLVTFNI

SLSNSDNV
FGGGTEL

WGPGTLVTVSS

EIL

ROR1
644
QSVKESEGDLVTPAGN
744
ELVMTQTPSSTSGAV

LTLTCTASGSDINDYPI

GGTVTINCQASQSID

S
WVRQAPGKGLEWIG

SNLA
WFQQKPGQPPT

FINSGGSTWYASWVK

LLIYRASNLASGVPS

G
RFTISRTSTTVDLKM

RFSGSRSGTEYTLTIS

TSLTTDDTATY

GVQREDAATYYCLG

FCARGYSTYYCDFNI

GVGNVSYRTS
FGGG

WGPGTLVTISS

TEVVVK

CC49

TAG-
645
QVQLVQSGAEVVKPG
745
DIVMSQSPDSLAVSL

(Humanized)

72

ASVKISCKASGYTFTD

GERVTLNCKSSQSLL

HAIH
WVKQNPGQRLE

YSGNQKNYLA
WYQ

WIGYFSPGNDDFKYN

QKPGQSPKLLIYWAS

ERFKG
KATLTADTSAS

ARES
GVPDRFSGSGS

TAYVELSSLRSEDTAV

GTDFTLTISSVQAEDV

YFCTRSLNMAYWGQG

AVYYCQQYYSYPLT

TLVTVSS

FGAGTKLELK

Murine A1
TPBG/
646
QIQLVQSGPELKKPGE
746
SIVMTQTPKFLLVSA

5T4

TVKISCKASGYTFTNF

GDRVTITCKASQSVS

GMN
WVKQGPGEGLK

NDVA
WYQQKPGQSP

WMGWINTNTGEPRY

KLLINFATNRYTGVP

AEEFKG
RXAFSLETTA

NRFTGSGYGTDFTFTI

STAYLQINNLKNEDTA

STVQAEDLALYFCQQ

TYFCARDWDGAYFFD

DYSSPWT
FGGGTKLE

Y
WGQGTTLTVSS

IK

Murine A2
TPBG/
647
QVQLQQSRPELVKPGA
747
SVIMSRGQIVLTQSPA

5T4

SVKMSCKASGYTFTD

IMSASLGERVTLTCT

YVIS
WVKQRTGQGLE

ASSSVNSNYLH
WYQ

WIGEIYPGSNSIYYNE

QKPGSSPKLWIYSTS

KFKG
RATLTA

NLAS
GVPARFSGSGS

DKSSSTAYMQLSSLTS

GTSYSLTISSMEAEDA

EDSAVYFCAMGGNYG

ATYYCHQYHRSPLT

FDY
WGQGTTLTVSS

FGAGTKLELK

Murine A3
TPBG/
648
EVQLVESGGGLVQPKG
748
DIVMTQSHIFMSTSV

5T4

SLKLSCAASGFTFNTY

GDRVSITCKASQDVD

AMN
WVRQAPGKGLE

TAVA
WYQQKPGQSP

WVARIRSKSNNYATY

KLLIYWASTRLTGVP

YADSVKD
RFTISRDDS

DRFTGSGSGTDFTLTI

QSMLYLQMNNLKTED

SNVQSEDLADYFCQQ

TAMYXCVRQWDYDV

YSSYPYT
FGGGTKLE

RAMNYWGQGTSVTVSS

IK

IMMU-
hRS-7
TROP-
649
QVQLQQSGSELKKPGA
749
DIQLTQSPSSLSASVG

132

2

SVKVSCKASGYTFTNY

DRVSITCKASQDVSI

GMN
WVKQAPGQGLK

AVAW
YQQKPGKAPK

WMGWINTYTGEPTY

LLIYSASYRYTGVPD

TDDFKG
RFAFSLDTSV

RFSGSGSGTDFTLTIS

STAYLQISSLKADDTA

SLQPEDFAVYYCQQ

VYFCARGGFGSSYWY

HYITPLT
FGAGTKVE

FDV
WGQGSLVTVSS

IK

IMC-18F1
icrucumab
VEGF
650
QAQVVESGGGVVQSG
750
EIVLTQSPGTLSLSPG

RI

RSLRLSCAASGFAFSS

ERATLSCRASQSVSS

YGMHW
VRQAPGKGL

SYLA
WYQQKPGQAP

EWVAVIWYDGSNKY

RLLIYGASSRATGIPD

YADSVRG
RFTISRDNS

RFSGSGSGTDFTLTIS

ENTLYLQMNSLRAEDT

RLEPEDFAVYYCQQ

AVYYCARDHYGSGVH

YGSSPLT
FGGGTKVE

HYFYYGLDV
WGQGTT

IK

VTVSS

Cyramza
ramucirumab
VEGF
651
EVQLVQSGGGLVKPG
751
DIQMTQSPSSVSASIG

R2

GSLRLSCAASGFTFSS

DRVTITCRASQGIDN

YSMN
WVRQAPGKGLE

WLGWYQQKPGKAPK

WVSSISSSSSYIYYADS

LLIYDASNLDTGVPS

VKG
RFTISRDNAKNSL

RFSGSGSGTYFTLTIS

YLQMNSLRAEDTAVY

SLQAEDFAVYFCQQ

YCARVTDAFDIWGQG

AKAFPPT
FGGGTKV

TMVTVSSA

DIK

g165DFM-
alacizumabpegol
VEGF
652
EVQLVESGGGLVQPGG
752
DIQMTQSPSSLSASVG

PEG

R2

SLRLSCAASGFTFSSY

DRVTITCRASQDIAG

G
MSWVRQAPGKGLE

S
LNWLQQKPGKAIKR

WVATITSGGSYTYYV

LIYATSSLDSGVPKRF

DSVKGRFTISRDNAKN

SGSRSGSDYTLTISSL

TLYLQMNSLRAEDTA

QPEDFATYYCLQYGS

VYYCVRIGEDALDYW

FPPT
FGQGTKVEIK

GQGTLVTVSS

Imclone6.64

VEGF
653
KVQLQQSGTELVKPGA
753
DIVLTQSPASLAVSLG

R2

SVKVSCKASGYIFTEYI

QRATISCRASESVDSY

IHWVKQRSGQGLEWIG

GNSFMHWYQQKPGQ

WLYPESNIIKYNEKFK

PPKLLIYRASNLESGI

DKATLTADKSSSTVYM

PARFSGSGSRTDFTLT

ELSRLTSEDSAVYFCTR

INPVEADDVATYYCQ

HDGTNFDYWGQGTTL

QSNEDPLTFGAGTKL

TVSSA

ELK

*underlined & bolded sequences, if present, are CDRs within the VL and VH

Anti-Epcam (Epithelial Cell Adhesion Molecule) Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the tumor-specific marker EpCAM. The binding domain can comprise VL and VH derived from a monoclonal antibody to EpCAM. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for EpCAM and another binding domain (e.g., having specific binding affinity to an effector cell).

Monoclonal antibodies to EpCAM are known in the art (such as described more fully in the following paragraphs). Exemplary, non-limiting examples of EpCAM monoclonal antibodies and the VL and VH sequences thereof are presented in Table 6. Some embodiments of the binding domain with binding affinity to the tumor-specific marker EpCAM can comprise anti-EpCAM VL and VH sequences set forth in Table 6. Some embodiments of the binding domain with binding affinity to the tumor-specific marker EpCAM can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequences of the anti-EpCAM antibodies (such as 4D5MUCB) of Table 6. Some embodiments of the binding domain with binding affinity to the tumor-specific marker EpCAM can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequences set forth in Table 6. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising a binding domain specific for EpCAM and another binding domain (e.g., having specific binding affinity to an effector cell). In some embodiments of the compositions of this disclosure, the binding domain specific for EpCAM can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay. The binding domains can be in a scFv format. The binding domains can be in a single chain diabody format.

In general, epithelial cell adhesion molecule (EpCAM, also known as 17-1A antigen) is a 40-kDa membrane-integrated glycoprotein composed of 314 amino acids expressed in certain epithelia and on many human carcinomas (see, Balzar, The biology of the 17-1A antigen (Ep-CAM), J. Mol. Med. 1999, 77:699-712). EpCAM was initially discovered by use of the murine monoclonal antibody 17-1A/edrecolomab that was generated by immunization of mice with colon carcinoma cells (Goettlinger, Int J Cancer. 1986; 38, 47-53 and Simon, Proc. Natl. Acad. Sci. USA. 1990; 87, 2755-2759). Because of their epithelial cell origin, tumor cells from most carcinomas express EpCAM on their surface (more so than normal, healthy cells), including the majority of primary, metastatic, and disseminated non-small cell lung carcinoma cells (Passlick, B., et al. The 17-1A antigen is expressed on primary, metastatic and disseminated non-small cell lung carcinoma cells. Int. J. Cancer 87(4):548-552, 2000), gastric and gastro-oesophageal junction adenocarcinomas (Martin, I. G., Expression of the 17-1A antigen in gastric and gastro-oesophageal junction adenocarcinomas: a potential immunotherapeutic target? J Clin Pathol 1999; 52:701-704), and breast and colorectal cancer (Packeisen J, et al. Detection of surface antigen 17-1A in breast and colorectal cancer. Hybridoma. 1999 18(1):37-40) and, therefore, are an attractive target for immunotherapy approaches. Indeed, increased expression of EpCAM correlates to increased epithelial proliferation; in breast cancer, overexpression of EpCAM on tumor cells is a predictor of survival (Gastl, Lancet. 2000, 356, 1981-1982). Due to their epithelial cell origin, tumor cells from most carcinomas still express EpCAM on their surface, and the bispecific solitomab single-chain antibody composition that targets EpCAM on tumor cells and also contains a CD3 binding region has been proposed for use against primary uterine and ovarian CS cell lines (Ferrari F, et al., Solitomab, an EpCAM/CD3 bispecific antibody construct (BITE®), is highly active against primary uterine and ovarian carcinosarcoma cell lines in vitro. J Exp Clin Cancer Res. 2015 34:123). Monoclonal antibodies to EpCAM are known in the art. The EpCAM monclonals ING-1, 3622W94, adecatumumab and edrecolomab have been described as having been tested in human patients (Münz, M. Side-by-side analysis of five clinically tested anti-EpCAM monoclonal antibodies Cancer Cell International, 10:44-56, 2010). Bispecific antibodies directed against EpCAM and against CD3 have also been described, including construction of two different bispecific antibodies by fusing a hybridoma producing monoclonal antibody against EpCAM with either of the two hybridomas OKT3 and 9.3 (Möller, SA, Reisfeld, RA, Bispecific-monoclonal-antibody-directed lysis of ovarian carcinoma cells by activated human T lymphocytes. Cancer Immunol. Immunother. 33:210-216, 1991). Other examples of bispecific antibodies against EpCAM include BiUII, (anti-CD3 (rat) x anti-EpCAM (mouse)) (Zeidler, J. Immunol., 1999, 163:1247-1252), a scFv CD3/17-1A-bispecific (Mack, M. A small bispecific antibody composition expressed as a functional single-chain molecule with high tumor cell cytotoxicity. Proc. Natl. Acad. Sci., 1995, 92:7021-7025), and a partially humanized bispecific diabody having anti-CD3 and antiEpCAM specificity (Helfrich, W. Construction and characterization of a bispecific diabody for retargeting T cells to human carcinomas. Int. J. Cancer, 1998, 76:232-239).

Anti-CCR5 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen CCR5. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for CCR5 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to CCR5. Monoclonal antibodies to CCR5 are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen CCR5 can comprise anti-CCR5 VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen CCR5 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-CCR5 antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen CCR5 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for CCR5 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-CD19 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen CD19. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for CD19 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to CD19. Monoclonal antibodies to CD19 are known in the art. Exemplary, non-limiting example(s) of CD19 monoclonal antibodies and the VL and VH sequences thereof are presented in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen CD19 can comprise anti-CD19 VL and VH sequence(s) set forth in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen CD19 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of the anti-CD19 antibody/antibodies (e.g., MT103) of Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen CD19 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s) set forth in Table 6. In some embodiments of the compositions of this disclosure, the binding domain specific for CD19 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-HER-2 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen HER-2. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for HER-2 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to HER-2. Monoclonal antibodies to HER-2 are known in the art. Exemplary, non-limiting example(s) of HER-2 monoclonal antibodies and the VL and VH sequences thereof are presented in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen HER-2 can comprise anti-HER-2 VL and VH sequence(s) set forth in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen HER-2 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of the anti-HER-2 antibody/antibodies of Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen HER-2 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s) set forth in Table 6. In some embodiments of the compositions of this disclosure, the binding domain specific for HER-2 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-HER-3 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen HER-3. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for HER-3 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to HER-3. Monoclonal antibodies to HER-3 are known in the art. Exemplary, non-limiting example(s) of HER-3 monoclonal antibodies and the VL and VH sequences thereof are presented in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen HER-3 can comprise anti-HER-3 VL and VH sequence(s) set forth in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen HER-3 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of the anti-HER-3 antibody/antibodies of Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen HER-3 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s) set forth in Table 6. In some embodiments of the compositions of this disclosure, the binding domain specific for HER-3 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-HER-4 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen HER-4. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for HER-4 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to HER-4. Monoclonal antibodies to HER-4 are known in the art. Exemplary, non-limiting example(s) of HER-4 monoclonal antibodies and the VL and VH sequences thereof are presented in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen HER-4 can comprise anti-HER-4 VL and VH sequence(s) set forth in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen HER-4 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of the anti-HER-4 antibody/antibodies of Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen HER-4 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s) set forth in Table 6. In some embodiments of the compositions of this disclosure, the binding domain specific for HER-4 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-EGFR (Epidermal Growth Factor Receptor) Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen EGFR. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for EGFR and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to EGFR. Monoclonal antibodies to EGFR are known in the art. Exemplary, non-limiting example(s) of EGFR monoclonal antibodies and the VL and VH sequences thereof are presented in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen EGFR can comprise anti-EGFR VL and VH sequence(s) set forth in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen EGFR can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of the anti-EGFR antibody/antibodies of Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen EGFR can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s) set forth in Table 6. In some embodiments of the compositions of this disclosure, the binding domain specific for EGFR can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-PSMA Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen PSMA (prostate-specific membrane antigen). Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for PSMA and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to PSMA. Monoclonal antibodies to PSMA are known in the art. Exemplary, non-limiting example(s) of PSMA monoclonal antibodies and the VL and VH sequences thereof are presented in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen PSMA can comprise anti-PSMA VL and VH sequence(s) set forth in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen PSMA can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of the anti-PSMA antibody/antibodies of Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen PSMA can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s) set forth in Table 6. In some embodiments of the compositions of this disclosure, the binding domain specific for PSMA can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-CEA Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen CEA (carcinoembryonic antigen). Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for CEA and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to CEA. Monoclonal antibodies to CEA are known in the art. Exemplary, non-limiting example(s) of CEA monoclonal antibodies and the VL and VH sequences thereof are presented in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen CEA can comprise anti-CEA VL and VH sequence(s) set forth in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen CEA can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of the anti-CEA antibody/antibodies of Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen CEA can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s) set forth in Table 6. In some embodiments of the compositions of this disclosure, the binding domain specific for CEA can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-MUC1 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen MUC1. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for MUC1 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to MUC1. Monoclonal antibodies to MUC1 are known in the art. Exemplary, non-limiting example(s) of MUC1 monoclonal antibodies and the VL and VH sequences thereof are presented in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen MUC1 can comprise anti-MUC1 VL and VH sequence(s) set forth in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen MUC1 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of the anti-MUC1 antibody/antibodies of Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen MUC1 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s) set forth in Table 6. In some embodiments of the compositions of this disclosure, the binding domain specific for MUC1 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-MUC2 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen MUC2. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for MUC2 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to MUC2. Monoclonal antibodies to MUC2 are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen MUC2 can comprise anti-MUC2 VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen MUC2 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-MUC2 antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen MUC2 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for MUC2 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-MUC3 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen MUC3. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for MUC3 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to MUC3. Monoclonal antibodies to MUC3 are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen MUC3 can comprise anti-MUC3 VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen MUC3 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-MUC3 antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen MUC3 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for MUC3 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-MUC4 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen MUC4. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for MUC4 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to MUC4. Monoclonal antibodies to MUC4 are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen MUC4 can comprise anti-MUC4 VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen MUC4 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-MUC4 antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen MUC4 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for MUC4 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-MUC5AC Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen MUC5AC. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for MUC5AC and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to MUC5AC. Monoclonal antibodies to MUC5AC are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen MUC5AC can comprise anti-MUC5AC VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen MUC5AC can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-MUC5AC antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen MUC5AC can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for MUC5AC can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-MUC5B Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen MUC5B. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for MUC5B and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to MUC5B. Monoclonal antibodies to MUC5B are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen MUC5B can comprise anti-MUC5B VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen MUC5B can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-MUC5B antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen MUC5B can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for MUC5B can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-MUC7 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen MUC7. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for MUC7 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to MUC7. Monoclonal antibodies to MUC7 are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen MUC7 can comprise anti-MUC7 VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen MUC7 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-MUC7 antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen MUC7 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for MUC7 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-bhCG Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen βhCG. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for βhCG and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to βhCG. Monoclonal antibodies to βhCG are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen βhCG can comprise anti-βhCG VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen βhCG can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-βhCG antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen βhCG can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for βhCG can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-Lewis-Y Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen Lewis-Y. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for Lewis-Y and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to Lewis-Y. Monoclonal antibodies to Lewis-Y are known in the art. Exemplary, non-limiting example(s) of Lewis-Y monoclonal antibodies and the VL and VH sequences thereof are presented in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen Lewis-Y can comprise anti-Lewis-Y VL and VH sequence(s) set forth in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen Lewis-Y can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of the anti-Lewis-Y antibody/antibodies of Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen Lewis-Y can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s) set forth in Table 6. In some embodiments of the compositions of this disclosure, the binding domain specific for Lewis-Y can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-CD20 Binding Domains

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen CD20. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for CD20 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to CD20. Monoclonal antibodies to CD20 are known in the art. Exemplary, non-limiting example(s) of CD20 monoclonal antibodies and the VL and VH sequences thereof are presented in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen CD20 can comprise anti-CD20 VL and VH sequence(s) set forth in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen CD20 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of the anti-CD20 antibody/antibodies of Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen CD20 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s) set forth in Table 6. In some embodiments of the compositions of this disclosure, the binding domain specific for CD20 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-CD33 Binding Domains

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen CD33. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for CD33 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to CD33. Monoclonal antibodies to CD33 are known in the art. Exemplary, non-limiting example(s) of CD33 monoclonal antibodies and the VL and VH sequences thereof are presented in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen CD33 can comprise anti-CD33 VL and VH sequence(s) set forth in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen CD33 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of the anti-CD33 antibody/antibodies of Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen CD33 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s) set forth in Table 6. In some embodiments of the compositions of this disclosure, the binding domain specific for CD33 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-CD30 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen CD30. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for CD30 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to CD30. Monoclonal antibodies to CD30 are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen CD30 can comprise anti-CD30 VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen CD30 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-CD30 antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen CD30 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for CD30 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-Ganglioside GD3 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen ganglioside GD3. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for ganglioside GD3 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to ganglioside GD3. Monoclonal antibodies to ganglioside GD3 are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen ganglioside GD3 can comprise anti-ganglioside GD3 VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen ganglioside GD3 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-ganglioside GD3 antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen ganglioside GD3 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for ganglioside GD3 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-9-O-Acetyl-GD3 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen 9-O-Acetyl-GD3. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for 9-O-Acetyl-GD3 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to 9-O-Acetyl-GD3. Monoclonal antibodies to 9-O-Acetyl-GD3 are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen 9-O-Acetyl-GD3 can comprise anti-9-O-Acetyl-GD3 VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen 9-O-Acetyl-GD3 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-9-O-Acetyl-GD3 antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen 9-O-Acetyl-GD3 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for 9-O-Acetyl-GD3 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-Globo H Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen globo H. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for globo H and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to globo H. Monoclonal antibodies to globo H are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen globo H can comprise anti-globo H VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen globo H can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-globo H antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen globo H can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for globo H can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-Fucosyl GM1 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen fucosyl GM1. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for fucosyl GM1 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to fucosyl GM1. Monoclonal antibodies to fucosyl GM1 are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen fucosyl GM1 can comprise anti-fucosyl GM1 VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen fucosyl GM1 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-fucosyl GM1 antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen fucosyl GM1 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for fucosyl GM1 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-GD2 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen GD2. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for GD2 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to GD2. Monoclonal antibodies to GD2 are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen GD2 can comprise anti-GD2 VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen GD2 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-GD2 antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen GD2 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for GD2 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-Carbonicanhydrase IX Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen CA IX (carbonicanhydrase IX). Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for CA IX and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to CA IX. Monoclonal antibodies to CA IX are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen CA IX can comprise anti-CA IX VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen CA IX can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-CA IX antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen CA IX can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for CA IX can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-CD44v6 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen CD44v6. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for CD44v6 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to CD44v6. Monoclonal antibodies to CD44v6 are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen CD44v6 can comprise anti-CD44v6 VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen CD44v6 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-CD44v6 antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen CD44v6 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for CD44v6 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-Sonic Hedgehog (Shh) Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen Shh (sonic hedgehog). Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for Shh and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to Shh. Monoclonal antibodies to Shh are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen Shh can comprise anti-Shh VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen Shh can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-Shh antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen Shh can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for Shh can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-Wue-1 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen Wue-1. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for Wue-1 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to Wue-1. Monoclonal antibodies to Wue-1 are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen Wue-1 can comprise anti-Wue-1 VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen Wue-1 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-Wue-1 antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen Wue-1 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for Wue-1 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-Plasma Cell Antigen 1 (PC-1) Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen PC-1 (plasma cell antigen). Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for PC-1 (plasma cell antigen) and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to PC-1 (plasma cell antigen). Monoclonal antibodies to PC-1 (plasma cell antigen) are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen PC-1 (plasma cell antigen) can comprise anti-PC-1 (plasma cell antigen) VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen PC-1 (plasma cell antigen) can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-PC-1 (plasma cell antigen) antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen PC-1 (plasma cell antigen) can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for PC-1 (plasma cell antigen) can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-Melanoma Chondroitin Sulfate Proteoglycan (MCSP) Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen MCSP (melanoma chondroitin sulfate proteoglycan). Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for MCSP and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to MCSP. Monoclonal antibodies to MCSP are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen MCSP can comprise anti-MCSP VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen MCSP can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-MCSP antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen MCSP can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for MCSP can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-CCR8 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen CCR8. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for CCR8 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to CCR8. Monoclonal antibodies to CCR8 are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen CCR8 can comprise anti-CCR8 VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen CCR8 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-CCR8 antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen CCR8 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for CCR8 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-6-Transmembrane Epithelial Antigen of Prostate (STEAP) Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen STEAP. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for STEAP and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to STEAP. Monoclonal antibodies to STEAP are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen STEAP can comprise anti-STEAP VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen STEAP can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-STEAP antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen STEAP can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for STEAP can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-Mesothelin Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen mesothelin. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for mesothelin and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to mesothelin. Monoclonal antibodies to mesothelin are known in the art. Exemplary, non-limiting example(s) of mesothelin monoclonal antibodies and the VL and VH sequences thereof are presented in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen mesothelin can comprise anti-mesothelin VL and VH sequence(s) set forth in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen mesothelin can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of the anti-mesothelin antibody/antibodies of Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen mesothelin can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s) set forth in Table 6. In some embodiments of the compositions of this disclosure, the binding domain specific for mesothelin can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-A33 Antigen Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen A33. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for A33 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to A33. Monoclonal antibodies to A33 are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen A33 can comprise anti-A33 VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen A33 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-A33 antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen A33 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for A33 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-PSCA Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen PSCA. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for PSCA and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to PSCA. Monoclonal antibodies to PSCA are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen PSCA can comprise anti-PSCA VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen PSCA can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-PSCA antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen PSCA can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for PSCA can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-Ly-6 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen Ly-6. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for Ly-6 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to Ly-6. Monoclonal antibodies to Ly-6 are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen Ly-6 can comprise anti-Ly-6 VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen Ly-6 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-Ly-6 antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen Ly-6 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for Ly-6 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-SAS Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen SAS. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for SAS and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to SAS. Monoclonal antibodies to SAS are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen SAS can comprise anti-SAS VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen SAS can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-SAS antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen SAS can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for SAS can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-Desmoglein 4 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen desmoglein 4. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for desmoglein 4 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to desmoglein 4. Monoclonal antibodies to desmoglein 4 are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen desmoglein 4 can comprise anti-desmoglein 4 VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen desmoglein 4 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-desmoglein 4 antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen desmoglein 4 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for desmoglein 4 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-fnAChR (Fetal Acetylcholine Receptor) Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen fnAChR (fetal acetylcholine receptor). Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for fnAChR and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to fnAChR. Monoclonal antibodies to fnAChR are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen fnAChR can comprise anti-fnAChR VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen fnAChR can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-fnAChR antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen fnAChR can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for fnAChR can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-CD25 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen CD25. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for CD25 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to CD25. Monoclonal antibodies to CD25 are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen CD25 can comprise anti-CD25 VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen CD25 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-CD25 antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen CD25 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for CD25 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-Cancer Antigen 19-9 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen cancer antigen 19-9. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for cancer antigen 19-9 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to cancer antigen 19-9. Monoclonal antibodies to cancer antigen 19-9 are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen cancer antigen 19-9 can comprise anti-cancer antigen 19-9 VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen cancer antigen 19-9 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-cancer antigen 19-9 antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen cancer antigen 19-9 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for cancer antigen 19-9 (CA 19-9) can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-Misiir Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen MISIIR (müllerian inhibiting substance type II receptor). Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for MISIIR and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to MISIIR. Monoclonal antibodies to MISIIR are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen MISIIR can comprise anti-MISIIR VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen MISIIR can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-MISIIR antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen MISIIR can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for MISIIR can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-sTn (Sialylated Tn Antigen) Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen sTn (sialylated to antigen). Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for sTn and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to sTn. Monoclonal antibodies to sTn are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen sTn can comprise anti-sTn VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen sTn can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-sTn antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen sTn can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for sTn can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-FAP Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen FAP. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for FAP and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to FAP. Monoclonal antibodies to FAP are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen FAP can comprise anti-FAP VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen FAP can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-FAP antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen FAP can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for FAP can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-CD248 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen CD248. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for CD248 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to CD248. Monoclonal antibodies to CD248 are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen CD248 can comprise anti-CD248 VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen CD248 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-CD248 antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen CD248 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for CD248 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-EGFRvIII Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen EGFRvIII. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for EGFRvIII and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to EGFRvIII. Monoclonal antibodies to EGFRvIII are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen EGFRvIII can comprise anti-EGFRvIII VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen EGFRvIII can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-EGFRvIII antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen EGFRvIII can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for EGFRvIII can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-TAL6 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen TAL6. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for TAL6 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to TAL6. Monoclonal antibodies to TAL6 are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen TAL6 can comprise anti-TAL6 VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen TAL6 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-TAL6 antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen TAL6 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for TAL6 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-CD63 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen CD63. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for CD63 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to CD63. Monoclonal antibodies to CD63 are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen CD63 can comprise anti-CD63 VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen CD63 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-CD63 antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen CD63 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for CD63 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-TAG72 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen TAG72. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for TAG72 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to TAG72. Monoclonal antibodies to TAG72 are known in the art. Exemplary, non-limiting example(s) of TAG72 monoclonal antibodies and the VL and VH sequences thereof are presented in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen TAG72 can comprise anti-TAG72 VL and VH sequence(s) set forth in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen TAG72 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of the anti-TAG72 antibody/antibodies of Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen TAG72 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s) set forth in Table 6. In some embodiments of the compositions of this disclosure, the binding domain specific for TAG72 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-TF-Antigen Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen TF antigen. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for TF antigen and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to TF antigen. Monoclonal antibodies to TF antigen are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen TF antigen can comprise anti-TF antigen VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen TF antigen can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-TF antigen antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen TF antigen can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for TF antigen can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-IGF-IR Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen IGF-IR. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for IGF-IR and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to IGF-IR. Monoclonal antibodies to IGF-IR are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen IGF-IR can comprise anti-IGF-IR VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen IGF-IR can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-IGF-IR antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen IGF-IR can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for IGF-IR can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-Cora Antigen Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen cora antigen. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for cora antigen and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to cora antigen. Monoclonal antibodies to cora antigen are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen cora antigen can comprise anti-cora antigen VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen cora antigen can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-cora antigen antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen cora antigen can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for cora antigen can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-CD7 Binding Domains

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen CD7. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for CD7 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to CD7. Monoclonal antibodies to CD7 are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen CD7 can comprise anti-CD7 VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen CD7 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-CD7 antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen CD7 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for CD7 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-CD22 Binding Domains

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen CD22. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for CD22 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to CD22. Monoclonal antibodies to CD22 are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen CD22 can comprise anti-CD22 VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen CD22 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-CD22 antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen CD22 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for CD22 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-CD79a Binding Domains

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen CD79a. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for CD79a and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to CD79a. Monoclonal antibodies to CD79a are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen CD79a can comprise anti-CD79a VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen CD79a can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-CD79a antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen CD79a can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for CD79a can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-CD79b Binding Domains

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen CD79b. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for CD79b and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to CD79b. Monoclonal antibodies to CD79b are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen CD79b can comprise anti-CD79b VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen CD79b can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-CD79b antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen CD79b can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for CD79b can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-G250 Binding Domains

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen G250. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for G250 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to G250. Monoclonal antibodies to G250 are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen G250 can comprise anti-G250 VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen G250 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-G250 antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen G250 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for G250 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-MT-MMPs Binding Domains

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen MT-MMPs. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for MT-MMPs and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to MT-MMPs. Monoclonal antibodies to MT-MMPs are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen MT-MMPs can comprise anti-MT-MMPs VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen MT-MMPs can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-MT-MMPs antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen MT-MMPs can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for MT-MMPs can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-F19 Antigen Binding Domains

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen F19. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for F19 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to F19. Monoclonal antibodies to F19 are known in the art. Some embodiments of the binding domain with binding affinity to the marker/antigen F19 can comprise anti-F19 VL and VH sequence(s). Some embodiments of the binding domain with binding affinity to the marker/antigen F19 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of anti-F19 antibody/antibodies. Some embodiments of the binding domain with binding affinity to the marker/antigen F19 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s). In some embodiments of the compositions of this disclosure, the binding domain specific for F19 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-EphA2 Receptor Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen EphA2. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for EphA2 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to EphA2. Monoclonal antibodies to EphA2 are known in the art. Exemplary, non-limiting example(s) of EphA2 monoclonal antibodies and the VL and VH sequences thereof are presented in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen EphA2 can comprise anti-EphA2 VL and VH sequence(s) set forth in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen EphA2 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of the anti-EphA2 antibody/antibodies of Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen EphA2 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s) set forth in Table 6. In some embodiments of the compositions of this disclosure, the binding domain specific for EphA2 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-Alpha 4 Integrin Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen alpha 4 integrin. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for alpha 4 integrin and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to alpha 4 integrin. Monoclonal antibodies to alpha 4 integrin are known in the art. Exemplary, non-limiting example(s) of alpha 4 integrin monoclonal antibodies and the VL and VH sequences thereof are presented in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen alpha 4 integrin can comprise anti-alpha 4 integrin VL and VH sequence(s) set forth in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen alpha 4 integrin can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of the natalizumab antibody of Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen alpha 4 integrin can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s) set forth in Table 6. In some embodiments of the compositions of this disclosure, the binding domain specific for alpha 4 integrin can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-Ang2 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen Ang2 (Angiopoietin-2). Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for Ang2 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to Ang2. Monoclonal antibodies to Ang2 are known in the art. Exemplary, non-limiting example(s) of Ang2 monoclonal antibodies and the VL and VH sequences thereof are presented in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen Ang2 can comprise anti-Ang2 VL and VH sequence(s) set forth in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen Ang2 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of the nesvacumab antibody of Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen Ang2 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s) set forth in Table 6. In some embodiments of the compositions of this disclosure, the binding domain specific for Ang2 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-CEACAM5 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen CEACAM5 (Carcinoembryonic Antigen-Related Cell Adhesion Molecule 5). Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for CEACAM5 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to CEACAM5. Monoclonal antibodies to CEACAM5 are known in the art. Exemplary, non-limiting example(s) of CEACAM5 monoclonal antibodies and the VL and VH sequences thereof are presented in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen CEACAM5 can comprise anti-CEACAM5 VL and VH sequence(s) set forth in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen CEACAM5 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of the anti-CEACAM5 antibodies of Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen CEACAM5 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s) set forth in Table 6. In some embodiments of the compositions of this disclosure, the binding domain specific for CEACAM5 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-CD38 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen CD38. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for CD38 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to CD38. Monoclonal antibodies to CD38 are known in the art. Exemplary, non-limiting example(s) of CD38 monoclonal antibodies and the VL and VH sequences thereof are presented in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen CD38 can comprise anti-CD38 VL and VH sequence(s) set forth in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen CD38 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of the anti-CD38 antibody/antibodies of Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen CD38 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s) set forth in Table 6. In some embodiments of the compositions of this disclosure, the binding domain specific for CD38 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-CD70 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen CD70. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for CD70 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to CD70. Monoclonal antibodies to CD70 are known in the art. Exemplary, non-limiting example(s) of CD70 monoclonal antibodies and the VL and VH sequences thereof are presented in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen CD70 can comprise anti-CD70 VL and VH sequence(s) set forth in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen CD70 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of the anti-CD70 antibodies of Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen CD70 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s) set forth in Table 6. In some embodiments of the compositions of this disclosure, the binding domain specific for CD70 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-cMET (Mesenchymal Epithelial Transition Factor) Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen cMET. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for cMET and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to cMET. Monoclonal antibodies to cMET are known in the art. Exemplary, non-limiting example(s) of cMET monoclonal antibodies and the VL and VH sequences thereof are presented in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen cMET can comprise anti-cMET VL and VH sequence(s) set forth in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen cMET can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of the anti-cMET antibodies of Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen cMET can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s) set forth in Table 6. In some embodiments of the compositions of this disclosure, the binding domain specific for cMET can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-CTLA4 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen CTLA4. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for CTLA4 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to CTLA4. Monoclonal antibodies to CTLA4 are known in the art. Exemplary, non-limiting example(s) of CTLA4 monoclonal antibodies and the VL and VH sequences thereof are presented in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen CTLA4 can comprise anti-CTLA4 VL and VH sequence(s) set forth in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen CTLA4 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of the anti-CTLA4 antibodies of Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen CTLA4 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s) set forth in Table 6. In some embodiments of the compositions of this disclosure, the binding domain specific for CTLA4 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-ENPP3 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen ENPP3 (ectonucleotide pyrophosphatase/phosphodiesterase 3). Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for ENPP3 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to ENPP3. Monoclonal antibodies to ENPP3 are known in the art. Exemplary, non-limiting example(s) of ENPP3 monoclonal antibodies and the VL and VH sequences thereof are presented in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen ENPP3 can comprise anti-ENPP3 VL and VH sequence(s) set forth in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen ENPP3 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of the H16-7.8 antibody of Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen ENPP3 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s) set forth in Table 6. In some embodiments of the compositions of this disclosure, the binding domain specific for ENPP3 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-FOLR1 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen FOLR1. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for FOLR1 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to FOLR1. Monoclonal antibodies to FOLR1 are known in the art. Exemplary, non-limiting example(s) of FOLR1 monoclonal antibodies and the VL and VH sequences thereof are presented in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen FOLR1 can comprise anti-FOLR1 VL and VH sequence(s) set forth in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen FOLR1 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of the anti-FOLR1 antibody/antibodies of Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen FOLR1 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s) set forth in Table 6. In some embodiments of the compositions of this disclosure, the binding domain specific for FOLR1 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-GPC3 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen GPC3 (glypican 3). Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for GPC3 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to GPC3. Monoclonal antibodies to GPC3 are known in the art. Exemplary, non-limiting example(s) of GPC3 monoclonal antibodies and the VL and VH sequences thereof are presented in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen GPC3 can comprise anti-GPC3 VL and VH sequence(s) set forth in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen GPC3 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of the anti-GPC3 antibody/antibodies of Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen GPC3 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s) set forth in Table 6. In some embodiments of the compositions of this disclosure, the binding domain specific for GPC3 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-PD-L1 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen PD-L1. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for PD-L1 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to PD-L1. Monoclonal antibodies to PD-L1 are known in the art. Exemplary, non-limiting example(s) of PD-L1 monoclonal antibodies and the VL and VH sequences thereof are presented in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen PD-L1 can comprise anti-PD-L1 VL and VH sequence(s) set forth in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen PD-L1 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of the anti-PD-L1 antibody/antibodies of Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen PD-L1 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s) set forth in Table 6. In some embodiments of the compositions of this disclosure, the binding domain specific for PD-L1 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-ROR1 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen ROR1. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for ROR1 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to ROR1. Monoclonal antibodies to ROR1 are known in the art. Exemplary, non-limiting example(s) of ROR1 monoclonal antibodies and the VL and VH sequences thereof are presented in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen ROR1 can comprise anti-ROR1 VL and VH sequence(s) set forth in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen ROR1 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of the anti-ROR1 antibody/antibodies of Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen ROR1 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s) set forth in Table 6. In some embodiments of the compositions of this disclosure, the binding domain specific for ROR1 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-TPBG/5T4 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen TPBG/5T4 (trophoblast glycoprotein). Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for TPBG/5T4 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to TPBG/5T4. Monoclonal antibodies to TPBG/5T4 are known in the art. Exemplary, non-limiting example(s) of TPBG/5T4 monoclonal antibodies and the VL and VH sequences thereof are presented in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen TPBG/5T4 can comprise anti-TPBG/5T4 VL and VH sequence(s) set forth in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen TPBG/5T4 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of the anti-TPBG/5T4 antibody/antibodies of Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen TPBG/5T4 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s) set forth in Table 6. In some embodiments of the compositions of this disclosure, the binding domain specific for TPBG/5T4 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-TROP-2 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen TROP-2. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for TROP-2 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to TROP-2. Monoclonal antibodies to TROP-2 are known in the art. Exemplary, non-limiting example(s) of TROP-2 monoclonal antibodies and the VL and VH sequences thereof are presented in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen TROP-2 can comprise anti-TROP-2 VL and VH sequence(s) set forth in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen TROP-2 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of the anti-TROP-2 antibody/antibodies of Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen TROP-2 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s) set forth in Table 6. In some embodiments of the compositions of this disclosure, the binding domain specific for TROP-2 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-VEGFR1 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen VEGFR1. Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for VEGFR1 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to VEGFR1. Monoclonal antibodies to VEGFR1 are known in the art. Exemplary, non-limiting example(s) of VEGFR1 monoclonal antibodies and the VL and VH sequences thereof are presented in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen VEGFR1 can comprise anti-VEGFR1 VL and VH sequence(s) set forth in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen VEGFR1 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of the anti-VEGFR1 antibody/antibodies of Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen VEGFR1 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s) set forth in Table 6. In some embodiments of the compositions of this disclosure, the binding domain specific for VEGFR1 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

Anti-VEGFR2 Binding Domains:

In some embodiments of the compositions of this disclosure, the binding domain can have specific binding affinity to the marker/antigen VEGFR2 (vascular endothelial growth factor receptor 2). Some embodiments of the compositions of this disclosure can comprise a bispecific bioactive assembly comprising the binding domain specific for VEGFR2 and another binding domain (e.g., having specific binding affinity to an effector cell). The binding domain can comprise VL and VH derived from a monoclonal antibody to VEGFR2. Monoclonal antibodies to VEGFR2 are known in the art. Exemplary, non-limiting example(s) of VEGFR2 monoclonal antibodies and the VL and VH sequences thereof are presented in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen VEGFR2 can comprise anti-VEGFR2 VL and VH sequence(s) set forth in Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen VEGFR2 can comprise VH and VL regions wherein each VH and VL regions can exhibit at least (about) 90%, or at least (about) 91%, or at least (about) 92%, or at least (about) 93%, or at least (about) 94%, or at least (about) 95%, or at least (about) 96%, or at least (about) 97%, or at least (about) 98%, or at least (about) 99% identity to, or is identical to, paired VL and VH sequence(s) of the anti-VEGFR2 antibodies of Table 6. Some embodiments of the binding domain with binding affinity to the marker/antigen VEGFR2 can comprise the CDR-L1 region, the CDR-L2 region, the CDR-L3 region, the CDR-H1 region, the CDR-H2 region, and the CDR-H3 region, wherein each can be derived from the respective VL and VH sequence(s) set forth in Table 6. In some embodiments of the compositions of this disclosure, the binding domain specific for VEGFR2 can have a K_dvalue of greater than 10⁻⁷to 10⁻¹⁰M, as determined using an in vitro binding assay.

It is specifically contemplated that the compositions of this disclosure can comprise any one of the foregoing binding domains or sequence variants thereof so long as the variants exhibit binding specificity for the described antigen. A sequence variant can be created by substitution of an amino acid in the VL or VH sequence with a different amino acid. In deletion variants, one or more amino acid residues in a VL or VH sequence as described herein are removed. Deletion variants, therefore, include all fragments of a binding domain polypeptide sequence. In substitution variants, one or more amino acid residues of a VL or VH (or CDR) polypeptide are removed and replaced with alternative residues. The substitutions can be conservative in nature and conservative substitutions of this type are well known in the art. In addition, it is specifically contemplated that the compositions comprising the first and the second binding domains disclosed herein can be utilized in any of the methods disclosed herein.

Exemplary Activatable Therapeutic Agents

In some embodiments of the compositions of this disclosure, the activatable therapeutic agent is a recombinant polypeptide comprising an amino acid sequence having at least (about) 80% sequence identity to a sequence set forth in Table 7, or a subset thereof. The activatable therapeutic agent can comprise an amino acid sequence having at least (about) 81%, at least (about) 82%, at least (about) 83%, at least (about) 84%, at least (about) 85%, at least (about) 86%, at least (about) 87%, at least (about) 88%, at least (about) 89%, at least (about) 90%, at least (about) 91%, at least (about) 92%, at least (about) 93%, at least (about) 94%, at least (about) 95%, at least (about) 96%, at least (about) 97%, at least (about) 98%, or at least (about) 99% sequence identity to a sequence set forth in Table 7, or a subset thereof. The activatable therapeutic agent can comprise an amino acid sequence identical to a sequence set forth in Table 7, or a subset thereof. It is specifically contemplated that the compositions of this disclosure can comprise sequence variants of the amino acid sequences set forth in Table 7, or a subset thereof, such as with linker sequence(s) inserted or with purification tag sequence(s) attached thereto, so long as the variants exhibit substantially similar or same bioactivity/bioactivities and/or activation mechanism(s).

TABLE 7

Amino acid sequences of exemplary recombinant polypeptides

SEQ ID

NOS.
Amino Acid Sequence

9
ASHHHHHHSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGS

EPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAP

GSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTST

EEGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATP

ESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEP

SEGSAPGGSAPGPSGHMGRATSGSETPGTDIQMTQSPSSLSASVGDRVTITCKASQ

DVSIGVAWYQQKPGKAPKLLIYSASYRYTGVPSRFSGSGSGTDFTLTISSLQPEDF

ATYYCQQYYTYPYTFGQGTKVEIKGATPPETGAETESPGETTGGSAESEPPGEGE

VQLVESGGGLVQPGGSLRLSCAASGFTFTDYTMDWVRQAPGKGLEWVADVNPN

SGGSIYNQRFKGRFTLSVDRSKNTLYLQMNSLRAEDTAVYYCARNLGPSFYFDY

WGQGTLVTVSSGGGGSELVVTQEPSLTVSPGGTVTLTCRSSNGAVTSSNYANWV

QQKPGQAPRGLIGGTNKRAPGTPARFSGSLLGGKAALTLSGVQPEDEAVYYCAL

WYPNLWVFGGGTKLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQLLES

GGGIVQPGGSLKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNYAT

YYADSVKDRFTISRDDSKNTVYLQMNNLKTEDTAVYYCVRHENFGNSYVSWFAH

WGQGTLVTVSSGTAEAASASGGPSGHMGRPGSPAGSPTSTEEGTSESATPESGPG

TSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESG

PGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEG

SAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESAT

PESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTE

PSEGSAPGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESGPGSE

PATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPG

TSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSA

PGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPE

SGPGTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSP

TSTEEGTSESATPESGPGTSTEPSEGSAPGAAEPEA

10
ASHHHHHHSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGS

EPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAP

GSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTST

EEGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATP

ESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEP

SEGSAPGGSAPHPVELLARATSGSETPGTDIQMTQSPSSLSASVGDRVTITCQASQ

DISNYLNWYQQKPGKAPKLLIYDASNLETGVPSRFSGSGSGTDFTFTISSLQPEDIA

TYFCQHFDHLPLAFGGGTKVEIKGATPPETGAETESPGETTGGSAESEPPGEGQV

QLQESGPGLVKPSETLSLTCTVSGGSVSSGDYYWTWIRQSPGKGLEWIGHIYYSG

NTNYNPSLKSRLTISIDTSKTQFSLKLSSVTAADTAIYYCVRDRVTGAFDIWGQGT

MVTVSSGGGGSELVVTQEPSLTVSPGGTVTLTCRSSTGAVTTSNYANWVQQKPG

QAPRGLIGGTNKRAPGTPARFSGSLLGGKAALTLSGVQPEDEAEYYCALWYSNL

WVFGGGTKLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQLLESGGGLV

QPGGSLKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNYATYYADS

VKDRFTISRDDSKNTAYLQMNNLKTEDTAVYYCVRHGNFGNSYVSWFAYWGQG

TLVTVSSGTAEAASASGHPVELLARPGSPAGSPTSTEEGTSESATPESGPGTSTEPS

EGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPA

TSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTS

TEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPG

TSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTEPSEGSA

PGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESGPGSEPATSGS

ETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPS

EGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSES

ATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTS

TEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEG

TSESATPESGPGTSTEPSEGSAPGAAEPEA

11
ASHHHHHHSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGS

EPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAP

GSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTST

EEGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATP

ESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEP

SEGSAPGGSAPVSKRFPVGATSGSETPGTDIQMTQSPSSLSASVGDRVTITCQASQ

DISNYLNWYQQKPGKAPKLLIYDASNLETGVPSRFSGSGSGTDFTFTISSLQPEDIA

TYFCQHFDHLPLAFGGGTKVEIKGATPPETGAETESPGETTGGSAESEPPGEGQV

QLQESGPGLVKPSETLSLTCTVSGGSVSSGDYYWTWIRQSPGKGLEWIGHIYYSG

NTNYNPSLKSRLTISIDTSKTQFSLKLSSVTAADTAIYYCVRDRVTGAFDIWGQGT

MVTVSSGGGGSELVVTQEPSLTVSPGGTVTLTCRSSTGAVTTSNYANWVQQKPG

QAPRGLIGGTNKRAPGTPARFSGSLLGGKAALTLSGVQPEDEAEYYCALWYSNL

WVFGGGTKLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQLLESGGGLV

QPGGSLKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNYATYYADS

VKDRFTISRDDSKNTAYLQMNNLKTEDTAVYYCVRHGNFGNSYVSWFAYWGQG

TLVTVSSGTAEAASASGVSKRFPVGPGSPAGSPTSTEEGTSESATPESGPGTSTEPS

EGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPA

TSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTS

TEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPG

TSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTEPSEGSA

PGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESGPGSEPATSGS

ETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPS

EGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSES

ATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTS

TEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEG

TSESATPESGPGTSTEPSEGSAPGAAEPEA

12
ASSPAGSPTSTESGTSESATPESGPGTETEPSEGSAPGTSESATPESGPGSEPATSGS

ETPGTSESATPESGPGSTPAESGSETPGTSESATPESGPGTSTEPSEGSAPGSPAGSP

TSTEEGTSESATPESGPGESPATSGSTPEGTSESATPESGPGSPAGSPTSTEEGSPAG

SPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATPESGPGSE

PATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPG

GSAPEAGRSANHTPAGLTGPATSGSETPGTEIVLTQSPATLSLSPGERATLSCKAS

QDVSIGVAWYQQKPGQAPRLLIYSASYRYSGVPARFSGSGSGTDFTLTISSLEPED

FAVYYCQQYYIYPYTFGQGTKVEIKGATPPETGAETESPGETTGGSAESEPPGEG

QVQLVQSGVEVKKPGASVKVSCKASGFTFTDYTMDWVRQAPGQGLEWMADVN

PNSGGSIYNQRFKGRVTLTTDSSTTTAYMELKSLQFDDTAVYYCARNLGPSFYFD

YWGQGTLVTVSSGGGSELVVTQEPSLTVSPGGTVTLTCRSSNGAVTSSNYANWV

QQKPGQAPRGLIGGTNKRAPGTPARFSGSSLGGKAALTLSGVQPEDEAVYYCAL

WYPNLWVFGGGTKLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQLQES

GGGIVQPGGSLKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNYAT

YYADSVKDRFTISRDDSKNTVYLQMNNLKTEDTAVYYCVRHENFGNSYVSWFAH

WGQGTLVTVSSGTAEAASASGEAGRSANHTPAGLTGPTPESGPGTSESATPESGP

GSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGS

APGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSE

GSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESA

TPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSPA

GSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGT

SESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGP

GTSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPES

GPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATP

ESGPGTSESATPESGPGSEPATSGSETPGSESATSGSETPGSPAGSPTSTEEGTSTEP

SEGSAPGTSTEPSEGSAPGSEPATSGSETPGTSESA

13
ASHHHHHHSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGS

EPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAP

GSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTST

EEGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATP

ESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEP

SEGSAPGGSAPSPEAQAAAATSGSETPGTDIQMTQSPSSLSASVGDRVTITCQASQ

DISNYLNWYQQKPGKAPKLLIYDASNLETGVPSRFSGSGSGTDFTFTISSLQPEDIA

TYFCQHFDHLPLAFGGGTKVEIKGATPPETGAETESPGETTGGSAESEPPGEGQV

QLQESGPGLVKPSETLSLTCTVSGGSVSSGDYYWTWIRQSPGKGLEWIGHIYYSG

NTNYNPSLKSRLTISIDTSKTQFSLKLSSVTAADTAIYYCVRDRVTGAFDIWGQGT

MVTVSSGGGGSELVVTQEPSLTVSPGGTVTLTCRSSTGAVTTSNYANWVQQKPG

QAPRGLIGGTNKRAPGTPARFSGSLLGGKAALTLSGVQPEDEAEYYCALWYSNL

WVFGGGTKLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQLLESGGGLV

QPGGSLKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNYATYYADS

VKDRFTISRDDSKNTAYLQMNNLKTEDTAVYYCVRHGNFGNSYVSWFAYWGQG

TLVTVSSGTAEAASASGSPEAQAAAPGSPAGSPTSTEEGTSESATPESGPGTSTEPS

EGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPA

TSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTS

TEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPG

TSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTEPSEGSA

PGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESGPGSEPATSGS

ETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPS

EGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSES

ATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTS

TEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEG

TSESATPESGPGTSTEPSEGSAPGAAEPEA

14
ASHHHHHHSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGS

EPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAP

GSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTST

EEGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATP

ESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEP

SEGSAPGGSAPEAGRSANHGVRGLTGPATSGSETPGTDIQMTQSPSSLSASVGDRV

TITCKASQDVSIGVAWYQQKPGKAPKLLIYSASYRYTGVPSRFSGSGSGTDFTLTI

SSLQPEDFATYYCQQYYTYPYTFGQGTKVEIKGATPPETGAETESPGETTGGSAES

EPPGEGEVQLVESGGGLVQPGGSLRLSCAASGFTFTDYTMDWVRQAPGKGLEW

VADVNPNSGGSIYNQRFKGRFTLSVDRSKNTLYLQMNSLRAEDTAVYYCARNLG

PSFYFDYWGQGTLVTVSSGGGGSELVVTQEPSLTVSPGGTVTLTCRSSNGAVTSS

NYANWVQQKPGQAPRGLIGGTNKRAPGTPARFSGSLLGGKAALTLSGVQPEDEA

VYYCALWYPNLWVFGGGTKLTVLGATPPETGAETESPGETTGGSAESEPPGEGE

VQLLESGGGIVQPGGSLKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKY

NNYATYYADSVKDRFTISRDDSKNTVYLQMNNLKTEDTAVYYCVRHENFGNSYV

SWFAHWGQGTLVTVSSGTAEAASASGEAGRSANHGVRGLTGPPGSPAGSPTSTE

EGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEG

SAPGTSESATPESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSESAT

PESGPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTE

PSEGSAPGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTS

TEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEG

TSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSA

PGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTS

TEEGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATS

GSETPGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAG

SPTSTEEGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGAAEPEA

15
ASHHHHHHSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGS

EPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAP

GSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTST

EEGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATP

ESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEP

SEGSAPGGSAPLTSDLQAQATSGSETPGTDIQMTQSPSSLSASVGDRVTITCKASQ

DVSIGVAWYQQKPGKAPKLLIYSASYRYTGVPSRFSGSGSGTDFTLTISSLQPEDF

ATYYCQQYYTYPYTFGQGTKVEIKGATPPETGAETESPGETTGGSAESEPPGEGE

VQLVESGGGLVQPGGSLRLSCAASGFTFTDYTMDWVRQAPGKGLEWVADVNPN

SGGSIYNQRFKGRFTLSVDRSKNTLYLQMNSLRAEDTAVYYCARNLGPSFYFDY

WGQGTLVTVSSGGGGSELVVTQEPSLTVSPGGTVTLTCRSSNGAVTSSNYANWV

QQKPGQAPRGLIGGTNKRAPGTPARFSGSLLGGKAALTLSGVQPEDEAVYYCAL

WYPNLWVFGGGTKLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQLLES

GGGIVQPGGSLKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNYAT

YYADSVKDRFTISRDDSKNTVYLQMNNLKTEDTAVYYCVRHENFGNSYVSWFAH

WGQGTLVTVSSGTAEAASASGLTSDLQAQPGSPAGSPTSTEEGTSESATPESGPGT

STEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGP

GSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGS

APGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATP

ESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTEP

SEGSAPGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESGPGSEP

ATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGT

STEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAP

GTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPES

GPGTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSPT

STEEGTSESATPESGPGTSTEPSEGSAPGAAEPEA

16
ASHHHHHHSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGS

EPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAP

GSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTST

EEGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATP

ESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEP

SEGSAPGGSAPQPVSLANTATSGSETPGTDIQMTQSPSSLSASVGDRVTITCKASQD

VSIGVAWYQQKPGKAPKLLIYSASYRYTGVPSRFSGSGSGTDFTLTISSLQPEDFA

TYYCQQYYTYPYTFGQGTKVEIKGATPPETGAETESPGETTGGSAESEPPGEGEV

QLVESGGGLVQPGGSLRLSCAASGFTFTDYTMDWVRQAPGKGLEWVADVNPNS

GGSIYNQRFKGRFTLSVDRSKNTLYLQMNSLRAEDTAVYYCARNLGPSFYFDYW

GQGTLVTVSSGGGGSELVVTQEPSLTVSPGGTVTLTCRSSNGAVTSSNYANWVQ

QKPGQAPRGLIGGTNKRAPGTPARFSGSLLGGKAALTLSGVQPEDEAVYYCALW

YPNLWVFGGGTKLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQLLESGG

GIVQPGGSLKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNYATYY

ADSVKDRFTISRDDSKNTVYLQMNNLKTEDTAVYYCVRHENFGNSYVSWFAHW

GQGTLVTVSSGTAEAASASGQPVSLANTPGSPAGSPTSTEEGTSESATPESGPGTST

EPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGS

EPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAP

GTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPES

GPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTEPSE

GSAPGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESGPGSEPAT

SGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTST

EPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGT

SESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGP

GTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSPTST

EEGTSESATPESGPGTSTEPSEGSAPGAAEPEA

17
ASHHHHHHSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGS

EPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAP

GSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTST

EEGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATP

ESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEP

SEGSAPGGSAPGVRGLTGPATSGSETPGTDIQMTQSPSSLSASVGDRVTITCKASQ

DVSIGVAWYQQKPGKAPKLLIYSASYRYTGVPSRFSGSGSGTDFTLTISSLQPEDF

ATYYCQQYYTYPYTFGQGTKVEIKGATPPETGAETESPGETTGGSAESEPPGEGE

VQLVESGGGLVQPGGSLRLSCAASGFTFTDYTMDWVRQAPGKGLEWVADVNPN

SGGSIYNQRFKGRFTLSVDRSKNTLYLQMNSLRAEDTAVYYCARNLGPSFYFDY

WGQGTLVTVSSGGGGSELVVTQEPSLTVSPGGTVTLTCRSSNGAVTSSNYANWV

QQKPGQAPRGLIGGTNKRAPGTPARFSGSLLGGKAALTLSGVQPEDEAVYYCAL

WYPNLWVFGGGTKLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQLLES

GGGIVQPGGSLKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNYAT

YYADSVKDRFTISRDDSKNTVYLQMNNLKTEDTAVYYCVRHENFGNSYVSWFAH

WGQGTLVTVSSGTAEAASASGGVRGLTGPPGSPAGSPTSTEEGTSESATPESGPG

TSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESG

PGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEG

SAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESAT

PESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTE

PSEGSAPGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESGPGSE

PATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPG

TSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSA

PGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPE

SGPGTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSP

TSTEEGTSESATPESGPGTSTEPSEGSAPGAAEPEA

18
SAGSPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSTE

PSEGSAPGTSTEPSEGSAPGTSESATPESGPGSTPAESGSETPGSEPATSGSETPGSP

AGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEG

TSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTSTEPSEGSAPGTSESATPESG

PGSEPATSGSTETPGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTSESATP

ESGPGSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEP

SEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTST

EPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGT

SESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSESATPESGP

GSPAGSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGS

APGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATP

ESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESA

TPESGPGSPAGSPTSTEEGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSE

SATPESGPGTSESATPESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGT

STEPSEGSAPGTSTEPSEGSAPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAP

GSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGS

APGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPT

STEEGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEP

SEGSAPGTSTEPSEGSAPGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSEP

ATSGSETPGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTESASASGRAAN

ETPPGLTGAATSGSETPGTEIVLTQSPATLSLSPGERATLSCKASQDVSIGVAWYQ

QKPGQAPRLLIYSASYRYSGVPARFSGSGSGTDFTLTISSLEPEDFAVYYCQQYYIY

PYTFGQGTKVEIKGATPPETGAETESPGETTGGSAESEPPGEGQVQLVQSGVEVK

KPGASVKVSCKASGFTFTDYTMDWVRQAPGQGLEWMADVNPNSGGSIYNQRFK

GRVTLTTDSSTTTAYMELKSLQFDDTAVYYCARNLGPSFYFDYWGQGTLVTVSS

GGGSELVVTQEPSLTVSPGGTVTLTCRSSNGAVTSSNYANWVQQKPGQAPRGLIG

GTNKRAPGTPARFSGSLLGGKAALTLSGVQPEDEAVYYCALWYPNLWVFGGGT

KLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQLLESGGGIVQPGGSLKLS

CAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNYATYYADSVKDRFTISRD

DSKNTVYLQMNNLKTEDTAVYYCVRHENFGNSYVSWFAHWGQGTLVTVSSGTA

EAASASGASGRAANETPPGLTGAGSETPGSPAGSPTSTEEGTSESATPESGPGTSTE

PSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTS

ESATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPG

TSTEPSEGSAPGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESG

PGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEG

SAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPS

EGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSES

ATPESGPGTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGSP

AGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSTETG

TSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTE

EGTSESATPESGPGSEPATS

19
ASSPAGSPTSTESGTSESATPESGPGTETEPSEGSAPGTSESATPESGPGSEPATSGS

ETPGTSESATPESGPGSTPAESGSETPGTSESATPESGPGTSTEPSEGSAPGSPAGSP

TSTEEGTSESATPESGPGESPATSGSTPEGTSESATPESGPGSPAGSPTSTEEGSPAG

SPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATPESGPGSE

PATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPG

GSAPTTGRAGEAANATSAGATGPATSGSETPGTDIQMTQSPSSLSASVGDRVTITC

RASQDVNTAVAWYQQKPGKAPKLLIYSASFLYSGVPSRFSGSRSGTDFTLTISSLQ

PEDFATYYCQQHYTTPPTFGQGTKVEIKGATPPETGAETESPGETTGGSAESEPP

GEGEVQLVESGGGLVQPGGSLRLSCAASGFNIKDTYIHWVRQAPGKGLEWVARI

YPTNGYTRYADSVKGRFTISADTSKNTAYLQMNSLRAEDTAVYYCSRWGGDGFY

AMDYWGQGTLVTVSSGGGSELVVTQEPSLTVSPGGTVTLTCRSSNGAVTSSNYA

NWVQQKPGQAPRGLIGGTNKRAPGTPARFSGSLLGGKAALTLSGVQPEDEAVYY

CALWYPNLWVFGGGTKLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQL

LESGGGIVQPGGSLKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNY

ATYYADSVKDRFTISRDDSKNTVYLQMNNLKTEDTAVYYCVRHENFGNSYVSWF

AHWGQGTLVTVSSGTAEAASASGTTGRAGEAANATSAGATGPSAGSPGSPAGSPT

STEEGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEP

SEGSAPGTSESATPESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSE

SATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGT

STEPSEGSAPGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETP

GTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGSPAGSPTST

EEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSE

GSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGS

PTSTEEGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEP

ATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGS

PAGSPTSTEEGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGP

GSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGS

APGSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPT

STEEGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESA

TPESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTST

EPSEGSAPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGT

SESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAP

GTSESATPESGPGSEPATSGSETPGSEPATSGSTETPGSPAGSPTSTEEGTSESATPE

SGPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTATESP

EGSAPGTSESATPESGPGTSTEPSEGSAPGTSAESATPESGPGSEPATSGSETPGTST

EPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTESAS

20
ASHHHHHHSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGS

EPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAP

GSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTST

EEGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATP

ESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEP

SEGSAPGGSAPGPGGVAAAATSGSETPGTDIQMTQSPSSLSASVGDRVTITCKASQ

DVSIGVAWYQQKPGKAPKLLIYSASYRYTGVPSRFSGSGSGTDFTLTISSLQPEDF

ATYYCQQYYTYPYTFGQGTKVEIKGATPPETGAETESPGETTGGSAESEPPGEGE

VQLVESGGGLVQPGGSLRLSCAASGFTFTDYTMDWVRQAPGKGLEWVADVNPN

SGGSIYNQRFKGRFTLSVDRSKNTLYLQMNSLRAEDTAVYYCARNLGPSFYFDY

WGQGTLVTVSSGGGGSELVVTQEPSLTVSPGGTVTLTCRSSNGAVTSSNYANWV

QQKPGQAPRGLIGGTNKRAPGTPARFSGSLLGGKAALTLSGVQPEDEAVYYCAL

WYPNLWVFGGGTKLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQLLES

GGGIVQPGGSLKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNYAT

YYADSVKDRFTISRDDSKNTVYLQMNNLKTEDTAVYYCVRHENFGNSYVSWFAH

WGQGTLVTVSSGTAEAASASGGPGGVAAAPGSPAGSPTSTEEGTSESATPESGPG

TSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESG

PGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEG

SAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESAT

PESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTE

PSEGSAPGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESGPGSE

PATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPG

TSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSA

PGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPE

SGPGTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSP

TSTEEGTSESATPESGPGTSTEPSEGSAPGAAEPEA

21
ASHHHHHHSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGS

EPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAP

GSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTST

EEGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATP

ESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEP

SEGSAPGGSAPVSKRFPVGATSGSETPGTDIQMTQSPSSLSASVGDRVTITCKASQ

DVSIGVAWYQQKPGKAPKLLIYSASYRYTGVPSRFSGSGSGTDFTLTISSLQPEDF

ATYYCQQYYTYPYTFGQGTKVEIKGATPPETGAETESPGETTGGSAESEPPGEGE

VQLVESGGGLVQPGGSLRLSCAASGFTFTDYTMDWVRQAPGKGLEWVADVNPN

SGGSIYNQRFKGRFTLSVDRSKNTLYLQMNSLRAEDTAVYYCARNLGPSFYFDY

WGQGTLVTVSSGGGGSELVVTQEPSLTVSPGGTVTLTCRSSNGAVTSSNYANWV

QQKPGQAPRGLIGGTNKRAPGTPARFSGSLLGGKAALTLSGVQPEDEAVYYCAL

WYPNLWVFGGGTKLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQLLES

GGGIVQPGGSLKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNYAT

YYADSVKDRFTISRDDSKNTVYLQMNNLKTEDTAVYYCVRHENFGNSYVSWFAH

WGQGTLVTVSSGTAEAASASGVSKRFPVGPGSPAGSPTSTEEGTSESATPESGPGT

STEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGP

GSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGS

APGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATP

ESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTEP

SEGSAPGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESGPGSEP

ATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGT

STEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAP

GTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPES

GPGTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSPT

STEEGTSESATPESGPGTSTEPSEGSAPGAAEPEA

22
ASHHHHHHSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGS

EPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAP

GSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTST

EEGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATP

ESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEP

SEGSAPGGSAPGPGGVAAAATSGSETPGTDIQMTQSPSSLSASVGDRVTITCQASQ

DISNYLNWYQQKPGKAPKLLIYDASNLETGVPSRFSGSGSGTDFTFTISSLQPEDIA

TYFCQHFDHLPLAFGGGTKVEIKGATPPETGAETESPGETTGGSAESEPPGEGQV

QLQESGPGLVKPSETLSLTCTVSGGSVSSGDYYWTWIRQSPGKGLEWIGHIYYSG

NTNYNPSLKSRLTISIDTSKTQFSLKLSSVTAADTAIYYCVRDRVTGAFDIWGQGT

MVTVSSGGGGSELVVTQEPSLTVSPGGTVTLTCRSSTGAVTTSNYANWVQQKPG

QAPRGLIGGTNKRAPGTPARFSGSLLGGKAALTLSGVQPEDEAEYYCALWYSNL

WVFGGGTKLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQLLESGGGLV

QPGGSLKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNYATYYADS

VKDRFTISRDDSKNTAYLQMNNLKTEDTAVYYCVRHGNFGNSYVSWFAYWGQG

TLVTVSSGTAEAASASGGPGGVAAAPGSPAGSPTSTEEGTSESATPESGPGTSTEPS

EGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPA

TSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTS

TEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPG

TSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTEPSEGSA

PGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESGPGSEPATSGS

ETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPS

EGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSES

ATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTS

TEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEG

TSESATPESGPGTSTEPSEGSAPGAAEPEA

23
ASSPAGSPTSTESGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGS

ETPGTSESATPESGPGSTPAESGSETPGTSESATPESGPGTSTEPSEGSAPGSPAGSP

TSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTSTEEGSPAG

SPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATPESGPGSE

PATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPG

GSAPASGRSTNAGPPGLTGPATSGSETPGTDIQMTQSPSSLSASVGDRVTITCRAS

QDVNTAVAWYQQKPGKAPKLLIYSASFLYSGVPSRFSGSRSGTDFTLTISSLQPED

FATYYCQQHYTTPPTFGQGTKVEIKGATPPETGAETESPGETTGGSAESEPPGEG

EVQLVESGGGLVQPGGSLRLSCAASGFNIKDTYIHWVRQAPGKGLEWVARIYPT

NGYTRYADSVKGRFTISADTSKNTAYLQMNSLRAEDTAVYYCSRWGGDGFYAM

DYWGQGTLVTVSSGGGSELVVTQEPSLTVSPGGTVTLTCRSSTGAVTTSNYANW

VQQKPGQAPRGLIGGTNKRAPGTPARFSGSLLGGKAALTLSGVQPEDEAEYYCA

LWYSNLWVFGGGTKLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQLLE

SGGGLVQPGGSLKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNYA

TYYADSVKDRFTISRDDSKNTAYLQMNNLKTEDTAVYYCVRHGNFGNSYVSWFA

YWGQGTLVTVSSGTAEAASASGASGRSTNAGPPGLTGPPGSPAGSPTSTEEGTSES

ATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTS

TEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEG

TSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSET

PGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTS

TEEGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPATS

GSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGSPAG

SPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTSTEEGSP

AGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATPESGPG

SEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTESTPSEGSA

PGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPG

24
ASHHHHHHSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGS

EPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAP

GSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTST

EEGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATP

ESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEP

SEGSAPGGSAPQPAHLTFPATSGSETPGTDIQMTQSPSSLSASVGDRVTITCKASQD

VSIGVAWYQQKPGKAPKLLIYSASYRYTGVPSRFSGSGSGTDFTLTISSLQPEDFA

TYYCQQYYIYPYTFGQGTKVEIKGATPPETGAETESPGETTGGSAESEPPGEGEV

QLVESGGGLVQPGGSLRLSCAASGFTFTDYTMDWVRQAPGKGLEWVADVNPNS

GGSIYNQRFKGRFTLSVDRSKNTLYLQMNSLRAEDTAVYYCARNLGPSFYFDYW

GQGTLVTVSSGGGGSELVVTQEPSLTVSPGGTVTLTCRSSNGAVTSSNYANWVQ

QKPGQAPRGLIGGTNKRAPGTPARFSGSLLGGKAALTLSGVQPEDEAVYYCALW

YPNLWVFGGGTKLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQLLESGG

GIVQPGGSLKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNYATYY

ADSVKDRFTISRDDSKNTVYLQMNNLKTEDTAVYYCVRHENFGNSYVSWFAHW

GQGTLVTVSSGTAEAASASGQPAHLTFPPGSPAGSPTSTEEGTSESATPESGPGTST

EPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGS

EPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAP

GTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPES

GPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTEPSE

GSAPGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESGPGSEPAT

SGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTST

EPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGT

SESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGP

GTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSPTST

EEGTSESATPESGPGTSTEPSEGSAPGAAEPEA

25
ASSPAGSPTSTESGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGS

ETPGTSESATPESGPGSTPAESGSETPGTSESATPESGPGTSTEPSEGSAPGSPAGSP

TSTEEGTSESATPESGPGEEPATSGSTPEGTSESATPESGPGSPAGSPTSTEEGSPAG

SPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATPESGPGSE

PATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPG

GSAPEAGRSANHTPAGLTGPATSGSETPGTDIQMTQSPSSLSASVGDRVTITCRAS

QDVNTAVAWYQQKPGKAPKLLIYSASFLYSGVPSRFSGSRSGTDFTLTISSLQPED

FATYYCQQHYTTPPTFGQGTKVEIKGATPPETGAETESPGETTGGSAESEPPGEG

EVQLVESGGGLVQPGGSLRLSCAASGFNIKDTYIHWVRQAPGKGLEWVARIYPT

NGYTRYADSVKGRFTISADTSKNTAYLQMNSLRAEDTAVYYCSRWGGDGFYAM

DYWGQGTLVTVSSGGGSELVVTQEPSLTVSPGGTVTLTCRSSNGAVTSSNYANW

VQQKPGQAPRGLIGGTNKRAPGTPARFSGSLLGGKAALTLSGVQPEDEAVYYCA

LWYPNLWVFGGGTKLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQLLE

SGGGIVQPGGSLKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNYAT

YYADSVKDRFTISRDDSKNTVYLQMNNLKTEDTAVYYCVRHENFGNSYVSWFAH

WGQGTLVTVSSGTAEAASASGEAGRSANHTPAGLTGPSAGSPGSPAGSPTSTEEG

TSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSA

PGTSESATPESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSESATPE

SGPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPS

EGSAPGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTE

PSEGSAPGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTS

ESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPG

TSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTE

EGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGS

ETPGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSP

TSTEEGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPA

TSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGSP

AGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTSTEEG

SPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATPESG

PGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEG

SAPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSESAT

PESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSES

ATPESGPGSEPATSGSETPGSEPATSGSTETPGSPAGSPTSTEEGTSESATPESGPGT

STEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTATESPEGSAP

GTSESATPESGPGTSTEPSEGSAPGTSAESATPESGPGSEPATSGSETPGTSTEPSEG

SAPGTSTEPSEGSAPGTSESATPESGPGTESAS

26
ASHHHHHHSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGS

EPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAP

GSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTST

EEGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATP

ESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEP

SEGSAPGGSAPEAGSPGKDGVRGLTGPATSGSETPGTDIQMTQSPSSLSASVGDRV

TITCKASQDVSIGVAWYQQKPGKAPKLLIYSASYRYTGVPSRFSGSGSGTDFTLTI

SSLQPEDFATYYCQQYYTYPYTFGQGTKVEIKGATPPETGAETESPGETTGGSAES

EPPGEGEVQLVESGGGLVQPGGSLRLSCAASGFTFTDYTMDWVRQAPGKGLEW

VADVNPNSGGSIYNQRFKGRFTLSVDRSKNTLYLQMNSLRAEDTAVYYCARNLG

PSFYFDYWGQGTLVTVSSGGGGSELVVTQEPSLTVSPGGTVTLTCRSSNGAVTSS

NYANWVQQKPGQAPRGLIGGTNKRAPGTPARFSGSLLGGKAALTLSGVQPEDEA

VYYCALWYPNLWVFGGGTKLTVLGATPPETGAETESPGETTGGSAESEPPGEGE

VQLLESGGGIVQPGGSLKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKY

NNYATYYADSVKDRFTISRDDSKNTVYLQMNNLKTEDTAVYYCVRHENFGNSYV

SWFAHWGQGTLVTVSSGTAEAASASGEAGSPGKDGVRGLTGPPGSPAGSPTSTE

EGTSESATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEG

SAPGTSESATPESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSESAT

PESGPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTE

PSEGSAPGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTS

TEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEG

TSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSA

PGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTS

TEEGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATS

GSETPGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAG

SPTSTEEGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGAAEPEA

27
ASSPAGSPTSTESGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGS

ETPGTSESATPESGPGSTPAESGSETPGTSESATPESGPGTSTEPSEGSAPGSPAGSP

TSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTSTEEGSPAG

SPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATPESGPGSE

PATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPG

GSAPRTGRTGESANETPAGLGGPATSGSETPGTEIVLTQSPATLSLSPGERATLSC

KASQDVSIGVAWYQQKPGQAPRLLIYSASYRYSGVPARFSGSGSGTDFTLTISSLE

PEDFAVYYCQQYYIYPYTFGQGTKVEIKGATPPETGAETESPGETTGGSAESEPPG

EGQVQLVQSGVEVKKPGASVKVSCKASGFTFTDYTMDWVRQAPGQGLEWMAD

VNPNSGGSIYNQRFKGRVTLTTDSSTTTAYMELKSLQFDDTAVYYCARNLGPSFY

FDYWGQGTLVTVSSGGGSELVVTQEPSLTVSPGGTVTLTCRSSNGAVTSSNYAN

WVQQKPGQAPRGLIGGTNKRAPGTPARFSGSLLGGKAALTLSGVQPEDEAVYYC

ALWYPNLWVFGGGTKLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQLL

ESGGGIVQPGGSLKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNYA

TYYADSVKDRFTISRDDSKNTVYLQMNNLKTEDTAVYYCVRHENFGNSYVSWFA

HWGQGTLVTVSSGTAEAASASGRTGRTGESANETPAGLGGPGSETPGSPAGSPTS

TEEGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPS

EGSAPGTSTEPSEGSAPGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPA

TSGSETPGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGSP

AGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPG

TSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSA

PGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPE

SGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSPAGSP

TSTEEGSPAGSPTSTEEGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSES

ATPESGPGSEPATSGSTETGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTS

TEPSEGSAPGSPAGSPTSTEEGTSESATPESGPGSEPATS

28
ASHHHHHHSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGS

EPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAP

GSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTST

EEGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATP

ESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEP

SEGSAPGGSAPSPEAQAAAATSGSETPGTDIQMTQSPSSLSASVGDRVTITCKASQ

DVSIGVAWYQQKPGKAPKLLIYSASYRYTGVPSRFSGSGSGTDFTLTISSLQPEDF

ATYYCQQYYTYPYTFGQGTKVEIKGATPPETGAETESPGETTGGSAESEPPGEGE

VQLVESGGGLVQPGGSLRLSCAASGFTFTDYTMDWVRQAPGKGLEWVADVNPN

SGGSIYNQRFKGRFTLSVDRSKNTLYLQMNSLRAEDTAVYYCARNLGPSFYFDY

WGQGTLVTVSSGGGGSELVVTQEPSLTVSPGGTVTLTCRSSNGAVTSSNYANWV

QQKPGQAPRGLIGGTNKRAPGTPARFSGSLLGGKAALTLSGVQPEDEAVYYCAL

WYPNLWVFGGGTKLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQLLES

GGGIVQPGGSLKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNYAT

YYADSVKDRFTISRDDSKNTVYLQMNNLKTEDTAVYYCVRHENFGNSYVSWFAH

WGQGTLVTVSSGTAEAASASGSPEAQAAAPGSPAGSPTSTEEGTSESATPESGPGT

STEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGP

GSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGS

APGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATP

ESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTEP

SEGSAPGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESGPGSEP

ATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGT

STEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAP

GTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPES

GPGTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSPT

STEEGTSESATPESGPGTSTEPSEGSAPGAAEPEA

29
ASHHHHHHSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGS

EPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAP

GSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTST

EEGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATP

ESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEP

SEGSAPGGSAPLTSDLQAQATSGSETPGTDIQMTQSPSSLSASVGDRVTITCQASQ

DISNYLNWYQQKPGKAPKLLIYDASNLETGVPSRFSGSGSGTDFTFTISSLQPEDIA

TYFCQHFDHLPLAFGGGTKVEIKGATPPETGAETESPGETTGGSAESEPPGEGQV

QLQESGPGLVKPSETLSLTCTVSGGSVSSGDYYWTWIRQSPGKGLEWIGHIYYSG

NTNYNPSLKSRLTISIDTSKTQFSLKLSSVTAADTAIYYCVRDRVTGAFDIWGQGT

MVTVSSGGGGSELVVTQEPSLTVSPGGTVTLTCRSSTGAVTTSNYANWVQQKPG

QAPRGLIGGTNKRAPGTPARFSGSLLGGKAALTLSGVQPEDEAEYYCALWYSNL

WVFGGGTKLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQLLESGGGLV

QPGGSLKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNYATYYADS

VKDRFTISRDDSKNTAYLQMNNLKTEDTAVYYCVRHGNFGNSYVSWFAYWGQG

TLVTVSSGTAEAASASGLTSDLQAQPGSPAGSPTSTEEGTSESATPESGPGTSTEPS

EGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPA

TSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTS

TEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPG

TSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTEPSEGSA

PGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESGPGSEPATSGS

ETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPS

EGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSES

ATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTS

TEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEG

TSESATPESGPGTSTEPSEGSAPGAAEPEA

30
ASHHHHHHSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGS

EPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAP

GSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTST

EEGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATP

ESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEP

SEGSAPGGSAPQPAHLTFPATSGSETPGTDIQMTQSPSSLSASVGDRVTITCQASQD

ISNYLNWYQQKPGKAPKLLIYDASNLETGVPSRFSGSGSGTDFTFTISSLQPEDIAT

YFCQHFDHLPLAFGGGTKVEIKGATPPETGAETESPGETTGGSAESEPPGEGQVQ

LQESGPGLVKPSETLSLTCTVSGGSVSSGDYYWTWIRQSPGKGLEWIGHIYYSGN

TNYNPSLKSRLTISIDTSKTQFSLKLSSVTAADTAIYYCVRDRVTGAFDIWGQGTM

VTVSSGGGGSELVVTQEPSLTVSPGGTVTLTCRSSTGAVTTSNYANWVQQKPGQ

APRGLIGGTNKRAPGTPARFSGSLLGGKAALTLSGVQPEDEAEYYCALWYSNLW

VFGGGTKLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQLLESGGGLVQP

GGSLKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNYATYYADSVK

DRFTISRDDSKNTAYLQMNNLKTEDTAVYYCVRHGNFGNSYVSWFAYWGQGTL

VTVSSGTAEAASASGQPAHLTFPPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEG

SAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPATS

GSETPGSEPATSGSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSTE

PSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTS

TEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTEPSEGSAPG

TSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESGPGSEPATSGSET

PGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEG

SAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSESAT

PESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTE

PSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEGTS

ESATPESGPGTSTEPSEGSAPGAAEPEA

31
ASSPAGSPTSTESGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGS

ETPGTSESATPESGPGSTPAESGSETPGTSESATPESGPGTSTEPSEGSAPGSPAGSP

TSTEEGTSESATPESGPGEEPATSGSTPEGTSESATPESGPGSPAGSPTSTEEGSPAG

SPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATPESGPGSE

PATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPG

GSAPGAGRTDNHEPLELGAAATSGSETPGTDIQMTQSPSSLSASVGDRVTITCRAS

QDVNTAVAWYQQKPGKAPKLLIYSASFLYSGVPSRFSGSRSGTDFTLTISSLQPED

FATYYCQQHYTTPPTFGQGTKVEIKGATPPETGAETESPGETTGGSAESEPPGEG

EVQLVESGGGLVQPGGSLRLSCAASGFNIKDTYIHWVRQAPGKGLEWVARIYPT

NGYTRYADSVKGRFTISADTSKNTAYLQMNSLRAEDTAVYYCSRWGGDGFYAM

DYWGQGTLVTVSSGGGSELVVTQEPSLTVSPGGTVTLTCRSSTGAVTTSNYANW

VQQKPGQAPRGLIGGTNKRAPGTPARFSGSSLGGSAALTLSGVQPEDEAEYYCAL

WYSNLWVFGGGTKLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQLQES

GGGLVQPGGSLKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNYAT

YYADSVKDRFTISRDDSKNTAYLQMNNLKTEDTAVYYCVRHGNFGNSYVSWFAY

WGQGTLVTVSSGTAEAASASGGAGRTDNHEPLELGAAPGSPAGSPTSTEEGTSES

ATPESGPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTS

ESATPESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSESATPESGPG

TSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSA

PGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEG

SAPGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESAT

PESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTSTE

PSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTS

TEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPG

TSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTE

EGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGS

ETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGSPAGSP

TSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTSTEEGSPAG

SPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGftabTSESATPESGP

GSEPATSGPTESGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTESTPSEGS

APGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAPGEPEA

32
ASHHHHHHSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGS

EPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAP

GSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTST

EEGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATP

ESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEP

SEGSAPGGSAPQPVSLANTATSGSETPGTDIQMTQSPSSLSASVGDRVTITCQASQD

ISNYLNWYQQKPGKAPKLLIYDASNLETGVPSRFSGSGSGTDFTFTISSLQPEDIAT

YFCQHFDHLPLAFGGGTKVEIKGATPPETGAETESPGETTGGSAESEPPGEGQVQ

LQESGPGLVKPSETLSLTCTVSGGSVSSGDYYWTWIRQSPGKGLEWIGHIYYSGN

TNYNPSLKSRLTISIDTSKTQFSLKLSSVTAADTAIYYCVRDRVTGAFDIWGQGTM

VTVSSGGGGSELVVTQEPSLTVSPGGTVTLTCRSSTGAVTTSNYANWVQQKPGQ

APRGLIGGTNKRAPGTPARFSGSLLGGKAALTLSGVQPEDEAEYYCALWYSNLW

VFGGGTKLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQLLESGGGLVQP

GGSLKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNYATYYADSVK

DRFTISRDDSKNTAYLQMNNLKTEDTAVYYCVRHGNFGNSYVSWFAYWGQGTL

VTVSSGTAEAASASGQPVSLANTPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEG

SAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPATS

GSETPGSEPATSGSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSTE

PSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGTS

TEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTEPSEGSAPG

TSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESGPGSEPATSGSET

PGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEG

SAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSESAT

PESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTE

PSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEGTS

ESATPESGPGTSTEPSEGSAPGAAEPEA

33
ASHHHHHHSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGS

EPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAP

GSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTST

EEGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATP

ESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEP

SEGSAPGGSAPGPSGHMGRATSGSETPGTDIQMTQSPSSLSASVGDRVTITCQASQ

DISNYLNWYQQKPGKAPKLLIYDASNLETGVPSRFSGSGSGTDFTFTISSLQPEDIA

TYFCQHFDHLPLAFGGGTKVEIKGATPPETGAETESPGETTGGSAESEPPGEGQV

QLQESGPGLVKPSETLSLTCTVSGGSVSSGDYYWTWIRQSPGKGLEWIGHIYYSG

NTNYNPSLKSRLTISIDTSKTQFSLKLSSVTAADTAIYYCVRDRVTGAFDIWGQGT

MVTVSSGGGGSELVVTQEPSLTVSPGGTVTLTCRSSTGAVTTSNYANWVQQKPG

QAPRGLIGGTNKRAPGTPARFSGSLLGGKAALTLSGVQPEDEAEYYCALWYSNL

WVFGGGTKLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQLLESGGGLV

QPGGSLKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNYATYYADS

VKDRFTISRDDSKNTAYLQMNNLKTEDTAVYYCVRHGNFGNSYVSWFAYWGQG

TLVTVSSGTAEAASASGGPSGHMGRPGSPAGSPTSTEEGTSESATPESGPGTSTEPS

EGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPA

TSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTS

TEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPG

TSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTEPSEGSA

PGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESGPGSEPATSGS

ETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPS

EGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSES

ATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTS

TEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEG

TSESATPESGPGTSTEPSEGSAPGAAEPEA

34
ASHHHHHHSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGS

EPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAP

GSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTST

EEGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATP

ESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEP

SEGSAPGGSAPGVRGLTGPATSGSETPGTDIQMTQSPSSLSASVGDRVTITCQASQ

DISNYLNWYQQKPGKAPKLLIYDASNLETGVPSRFSGSGSGTDFTFTISSLQPEDIA

TYFCQHFDHLPLAFGGGTKVEIKGATPPETGAETESPGETTGGSAESEPPGEGQV

QLQESGPGLVKPSETLSLTCTVSGGSVSSGDYYWTWIRQSPGKGLEWIGHIYYSG

NTNYNPSLKSRLTISIDTSKTQFSLKLSSVTAADTAIYYCVRDRVTGAFDIWGQGT

MVTVSSGGGGSELVVTQEPSLTVSPGGTVTLTCRSSTGAVTTSNYANWVQQKPG

QAPRGLIGGTNKRAPGTPARFSGSLLGGKAALTLSGVQPEDEAEYYCALWYSNL

WVFGGGTKLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQLLESGGGLV

QPGGSLKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNYATYYADS

VKDRFTISRDDSKNTAYLQMNNLKTEDTAVYYCVRHGNFGNSYVSWFAYWGQG

TLVTVSSGTAEAASASGGVRGLTGPPGSPAGSPTSTEEGTSESATPESGPGTSTEPS

EGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPGSEPA

TSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTS

TEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGPG

TSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTEPSEGSA

PGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESGPGSEPATSGS

ETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPS

EGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSES

ATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTS

TEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSPTSTEEG

TSESATPESGPGTSTEPSEGSAPGAAEPEA

35
ASHHHHHHSPAGSPTSTEEGTSESATPESGPGTSTEPSEGSAPGTSESATPESGPGS

EPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPESGPGTSTEPSEGSAP

GSPAGSPTSTEEGTSESATPESGPGSEPATSGSETPGTSESATPESGPGSPAGSPTST

EEGSPAGSPTSTEEGTSTEPSEGSAPGTSESATPESGPGTSESATPESGPGTSESATP

ESGPGSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEP

SEGSAPGGSAPHPVELLARATSGSETPGTDIQMTQSPSSLSASVGDRVTITCKASQ

DVSIGVAWYQQKPGKAPKLLIYSASYRYTGVPSRFSGSGSGTDFTLTISSLQPEDF

ATYYCQQYYTYPYTFGQGTKVEIKGATPPETGAETESPGETTGGSAESEPPGEGE

VQLVESGGGLVQPGGSLRLSCAASGFTFTDYTMDWVRQAPGKGLEWVADVNPN

SGGSIYNQRFKGRFTLSVDRSKNTLYLQMNSLRAEDTAVYYCARNLGPSFYFDY

WGQGTLVTVSSGGGGSELVVTQEPSLTVSPGGTVTLTCRSSNGAVTSSNYANWV

QQKPGQAPRGLIGGTNKRAPGTPARFSGSLLGGKAALTLSGVQPEDEAVYYCAL

WYPNLWVFGGGTKLTVLGATPPETGAETESPGETTGGSAESEPPGEGEVQLLES

GGGIVQPGGSLKLSCAASGFTFNTYAMNWVRQAPGKGLEWVARIRSKYNNYAT

YYADSVKDRFTISRDDSKNTVYLQMNNLKTEDTAVYYCVRHENFGNSYVSWFAH

WGQGTLVTVSSGTAEAASASGHPVELLARPGSPAGSPTSTEEGTSESATPESGPGT

STEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATPESGP

GSEPATSGSETPGSEPATSGSETPGSPAGSPTSTEEGTSESATPESGPGTSTEPSEGS

APGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAPGTSTEPSEGSAPGTSESATP

ESGPGTSTEPSEGSAPGTSESATPESGPGSEPATSGSETPGTSTEPSEGSAPGTSTEP

SEGSAPGTSESATPESGPGTSESATPESGPGSPAGSPTSTEEGTSESATPESGPGSEP

ATSGSETPGTSESATPESGPGTSTEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGT

STEPSEGSAPGTSTEPSEGSAPGTSTEPSEGSAPGSPAGSPTSTEEGTSTEPSEGSAP

GTSESATPESGPGSEPATSGSETPGTSESATPESGPGSEPATSGSETPGTSESATPES

GPGTSTEPSEGSAPGTSESATPESGPGSPAGSPTSTEEGSPAGSPTSTEEGSPAGSPT

STEEGTSESATPESGPGTSTEPSEGSAPGAAEPEA

Target Tissues or Cells

In some embodiments of the compositions (such as the therapeutic agents, or activatable therapeutic agents described hereinabove) or methods described herein, the target tissue or cell can contain therein or thereon, or can be associated with in proximity thereto, a reporter polypeptide (such as one described herein this TARGET TISSUES OR CELLS section) capable of being cleaved by a mammalian protease at a cleavage sequence (such as one set forth in Table A). The reporter polypeptide can be a polypeptide set forth in the “Report Protein” column of Table A (or any subset thereof). In some embodiments, the reporter polypeptide can be selected from coagulation factor, complement component, tubulin, immunoglobulin, apolipoprotein, serum amyloid, insulin, growth factor, fibrinogen, PDZ domain protein, LIM domain protein, c-reactive protein, serum albumin, versican, collagen, elastin, keratin, kininogen-1, alpha-2-antiplasmin, clusterin, biglycan, alpha-1-antitrypsin, transthyretin, alpha-1-antichymotrypsin, glucagon, hepcidin, thymosin beta-4, haptoglobin, hemoglobin subunit alpha, caveolae-associated protein 2, alpha-2-HS-glycoprotein, chromogranin-A, vitronectin, hemopexin, epididymis secretory sperm binding protein, secretogranin-2, angiotensinogen, transgelin-2, pancreatic prohormone, neurosecretory protein VGF, ceruloplasmin, PDZ and LIM domain protein 1, multimerin-1, inter-alpha-trypsin inhibitor heavy chain H2, N-acetylmuramoyl-L-alanine amidase, histone H1.4, adhesion G-protein coupled receptor G6, mannan-binding lectin serine protease 2, prothrombin, deleted in malignant brain tumors 1 protein, desmoglein-3, calsyntenin-1, alpha-2-macroglobulin, myosin-9, sodium/potassium-transporting ATPase subunit gamma, oncoprotein-induced transcript 3 protein, serglycin, histidine-rich glycoprotein, inter-alpha-trypsin inhibitor heavy chain H5, integrin alpha-IIb, membrane-associated progesterone receptor component 1, histone H1.2, rho GDP-dissociation inhibitor 2, zinc-alpha-2-glycoprotein, talin-1, secretogranin-1, neutrophil defensin 3, cytochrome P450 2E1, gastric inhibitory polypeptide, transcription initiation factor TFIID subunit 1, integral membrane protein 2B, pigment epithelium-derived factor, voltage-dependent N-type calcium channel subunit alpha-1B, ras GTPase-activating protein nGAP, type I cytoskeletal 17, sulfhydryl oxidase 1, homeobox protein Hox-B2, transcription factor SOX-10, E3 ubiquitin-protein ligase SIAH2, decorin, secreted protein acidic and rich in cysteine (SPARC), laminin gamma 1 chain, vimentin, and nidogen-1 (NID1). In some embodiments, the reporter polypeptide can be selected from collagen, elastin, keratin, coagulation factor, complement component, tubulin, immunoglobulin, apolipoprotein, serum amyloid, insulin, growth factor, fibrinogen, PDZ domain protein, LIM domain protein, c-reactive protein, and serum albumin. The collagen can comprise alpha chain(s) (such as alpha-1, alpha-2, alpha-3, or a combination thereof) of collagen type I, collagen type II, collagen type III, collagen type IV, collagen type V, collagen type VI, collagen type VII, collagen type VIII, collagen type IX, collagen type X, collagen type XI, collagen type XII, collagen type XIII, collagen type XIV, collagen type XV, collagen type XVI, collagen type XVII, collagen type XVIII, collagen type XIX, collagen type XX, collagen type XXI, collagen type XXII, collagen type XXIII, collagen type XXIV, collagen type XXV, collagen type XXVI, collagen type XXVII, collagen type XXVIII, collagen type XXIX, or a combination thereof. The coagulation factor can be selected from coagulation factor IX, coagulation factor XII, and coagulation factor XIII A chain. The complement component can be selected from C1 (for example, and not limited to, complement C1r subcomponent-like protein, complement C1r subcomponent), C3, C4 (for example, and not limited to, complement C4-A, complement C4-B), and C5. The tubulin can be selected from tubulin alpha chain (for example, and not limited to, tubulin alpha-4A chain), and tubulin beta chain. The immunoglobulin can be selected from immunoglobulin lambda variable 3-21, immunoglobulin lambda variable 3-25, immunoglobulin lambda variable 1-51, immunoglobulin lambda variable 1-36, immunoglobulin kappa variable 3-20, immunoglobulin kappa variable 2-30, probable non-functional immunoglobulin kappa variable 2D-24, immunoglobulin lambda constant 3, immunoglobulin kappa variable 2-28, immunoglobulin kappa variable 3-11, immunoglobulin kappa variable 1-39, immunoglobulin lambda variable 6-57, immunoglobulin kappa variable 3-15, immunoglobulin lambda variable 2-18, immunoglobulin heavy variable 3-15, immunoglobulin lambda variable 2-11, immunoglobulin lambda variable 3-27, and immunoglobulin kappa variable 4-1. The apolipoprotein can be selected from apolipoprotein A-I, apolipoprotein A-I Isoform 1, apolipoprotein apolipoprotein C-I, apolipoprotein A-II, and apolipoprotein L1. The serum amyloid protein can be selected from serum amyloid A-1 protein, and serum amyloid A-2 protein. The growth factor can be selected from insulin-like growth factor II, latent-transforming growth factor beta-binding protein 2, and latent-transforming growth factor beta-binding protein 4. The fibrinogen can be selected from fibrinogen alpha chain, fibrinogen beta chain, and fibrinogen gamma chain. The LIM domain protein can be zyxin. In some embodiments, the reporter polypeptide can be selected from the group consisting of versican, type II collagen alpha-1 chain, kininogen-1, complement C4-A, complement C4-B, complement C3, alpha-2-antiplasmin, clusterin, biglycan, elastin, fibrinogen alpha chain, alpha-1-antitrypsin, fibrinogen beta chain, type III collagen alpha-1 chain, serum amyloid A-1 protein, transthyretin, apolipoprotein A-I, apolipoprotein A-I Isoform 1, alpha-1-antichymotrypsin, glucagon, hepcidin, serum amyloid A-2 protein, thymosin beta-4, haptoglobin, hemoglobin subunit alpha, caveolae-associated protein 2, alpha-2-HS-glycoprotein, chromogranin-A, vitronectin, hemopexin, epididymis secretory sperm binding protein, zyxin, apolipoprotein secretogranin-2, angiotensinogen, c-reactive protein, serum albumin, transgelin-2, pancreatic prohormone, neurosecretory protein VGF, ceruloplasmin, PDZ and LIM domain protein 1, tubulin alpha-4A chain, multimerin-1, inter-alpha-trypsin inhibitor heavy chain H2, apolipoprotein C-I, fibrinogen gamma chain, N-acetylmuramoyl-L-alanine amidase, immunoglobulin lambda variable 3-21, histone H1.4, adhesion G-protein coupled receptor G6, immunoglobulin lambda variable 3-25, immunoglobulin lambda variable 1-51, immunoglobulin lambda variable 1-36, mannan-binding lectin serine protease 2, immunoglobulin kappa variable 3-20, immunoglobulin kappa variable 2-30, insulin-like growth factor II, apolipoprotein A-II, probable non-functional immunoglobulin kappa variable 2D-24, prothrombin, coagulation factor IX, apolipoprotein L1, deleted in malignant brain tumors 1 protein, desmoglein-3, calsyntenin-1, immunoglobulin lambda constant 3, complement C5, alpha-2-macroglobulin, myosin-9, sodium/potassium-transporting ATPase subunit gamma, immunoglobulin kappa variable 2-28, oncoprotein-induced transcript 3 protein, serglycin, coagulation factor XII, coagulation factor XIII A chain, insulin, histidine-rich glycoprotein, immunoglobulin kappa variable 3-11, immunoglobulin kappa variable 1-39, collagen alpha-1(I) chain, inter-alpha-trypsin inhibitor heavy chain H5, latent-transforming growth factor beta-binding protein 2, integrin alpha-IIb, membrane-associated progesterone receptor component 1, immunoglobulin lambda variable 6-57, immunoglobulin kappa variable 3-15, complement C1r subcomponent-like protein, histone H1.2, rho GDP-dissociation inhibitor 2, latent-transforming growth factor beta-binding protein 4, collagen alpha-1(XVIII) chain, immunoglobulin lambda variable 2-18, zinc-alpha-2-glycoprotein, talin-1, secretogranin-1, neutrophil defensin 3, cytochrome P450 2E1, gastric inhibitory polypeptide, immunoglobulin heavy variable 3-15, immunoglobulin lambda variable 2-11, transcription initiation factor TFIID subunit 1, collagen alpha-1(VII) chain, integral membrane protein 2B, pigment epithelium-derived factor, voltage-dependent N-type calcium channel subunit alpha-1B, immunoglobulin lambda variable 3-27, ras GTPase-activating protein nGAP, keratin, type I cytoskeletal 17, tubulin beta chain, sulfhydryl oxidase 1, immunoglobulin kappa variable 4-1, complement C1r subcomponent, homeobox protein Hox-B2, transcription factor SOX-10, E3 ubiquitin-protein ligase SIAH2, decorin, SPARC, type I collagen alpha-1 chain, type IV collagen alpha-1 chain, laminin gamma 1 chain, vimentin, type III collagen, type IV collagen alpha-3 chain, type VII collagen alpha-1 chain, type VI collagen alpha-1 chain, type V collagen alpha-1 chain, nidogen-1, and type VI collagen alpha-3 chain. In some embodiments, the reporter polypeptide can comprise a cleavage sequence set forth in Column II or III of Table A (or a subset thereof) and/or the group set forth in Tables 1(a)-1(j) (or any subset thereof). The reporter polypeptide can comprise a sequence set forth in Column IV of Table A (or a subset thereof). The reporter polypeptide can comprise a sequence set forth in Column V of Table A (or a subset thereof). The reporter polypeptide can comprise a sequence set forth in Column VI of Table A (or a subset thereof). The reporter polypeptide can comprise a peptide biomarker (or a peptide biomarker sequence) (such as one shown in Table A) capable of being identified from a biological sample of the subject. The peptide biomarker can comprise a sequence set forth in Column IV of Table A (or a subset thereof). The peptide biomarker can comprise a sequence set forth in Column V of Table A (or a subset thereof). The peptide biomarker can comprise a sequence set forth in Column VI of Table A (or a subset thereof). In some embodiments, the reporter polypeptide is selected from the group set forth in Column I of Table A (or a subset thereof). In some embodiments, the cleavage sequence of the reporter polypeptide does not comprise a methionine residue immediately N-terminal to a scissile bond (contained therein), when the methionine is the first residue at N terminus of the reporter polypeptide.

TABLE A

Exemplary cleavage sequences and biomarker sequences in exemplary

reporter polypeptides

Column II
Column III

Clea-

Clea-

Column I

vage

vage
Column IV
Column V
Column VI

Reporter
SEQ
Se-
SEQ
Se-
SEQ
N-
SEQ

SEQ
C-

Poly-
ID
quence
ID
quence
ID
terminal
ID
Center
ID
terminal

#
peptide
NO:
1*
NO:
2
NO:
Fragment
NO:
Fragment
NO:
Fragment

1
type III
755
GPPG-

N/A
1688
GGPGPQGPPG

N/A
2598
KNGETGPQGP

collagen

KNGE

alpha-1

chain

2
versican
756
VENA-

N/A
1689
CGQPPVVENA

N/A
2599
KTFGKMKPRY

KTFG

3
type II
757
GAAG-

N/A
1690
GPPGRDGAAG

N/A
2600
VKGDRGETGA

collagen

VKGD

alpha-1

chain

4
kininogen-
758
SLMK-
1291
FSPF-
1691
QPLGMISLMK
2199
RPPGFSPF
2601
RSSRIGEIKE

1

RPPG

RSSR

5
complement
759
NGFK-
1292
LNNR-
1692
LSSTGRNGFK
2200
SHALQLNNR
2602
QIRGLEEELQ

C4-A OR

SHAL

QIRG

complement

C4-B

6
kininogen-
760
SLMK-
1293
SPFR-
1693
QPLGMISLMK
2201
RPPGFSPFR
2603
SSRIGEIKEE

1

RPPG

SSRI

7
complement
761
THRI-
1294
SLLR-
1694
SRSSKITHRI
2202
HWESASLLR
2604
SEETKENEGF

C3

HWES

SEET

8
complement
762
SSKI-
1295
WESA-
1695
LQLPSRSSKI
2203
THRIHWESA
2605
SLLRSEETKE

C3

THRI

SLLR

9
complement
763
KSHA-
1296
RQIR-
1696
TGRNGFKSHA
2204
LQLNNRQIR
2606
GLEEELQFSL

C4-A OR

LQLN

GLEE

complement

C4-B

10
complement
764
ITHR-
1297
SLLR-
1697
PSRSSKITHR
2205
IHWESASLLR
2607
SEETKENEGF

C3

IHWE

SEET

11
complement
765
FKSH-
1298
RQIR-
1698
STGRNGFKSH
2206
ALQLNNRQIR
2608
GLEEELQFSL

C4-A OR

ALQL

GLEE

complement

C4-B

12
complement
766
RSSK-
1299
WESA-
1699
SLQLPSRSSK
2207
ITHRIHWESA
2609
SLLRSEETKE

C3

ITHR

SLLR

13
alpha-2-
767
PVSA-
1300
TSGP-
1700
PCSVFSPVSA
2208
MEPLGRQLTSGP
2610
NQEQVSPLTL

anti-

MEPL

NQEQ

plasmin

14
kininogen-
768
GHTR-
1301
EKQR-
1701
DSGKEQGHTR
2209
RHDWGHEKQR
2611
KHNLGHGHKH

1

RHDW

KHNL

15
complement
769
KITH-
1302
SLLR-
1702
LPSRSSKITH
2210
RIHWESASLLR
2612
SEETKENEGF

C3

RIHW

SEET

16
complement
770
SRSS-
1303
WESA-
1703
VSLQLPSRSS
2211
KITHRIHWESA
2613
SLLRSEETKE

C3

KITH

SLLR

17
complement
771
RQIR-
1304
LGSK-
1704
ALQLNNRQIR
2212
GLEEELQFSLGS
2614
INVKVGGNSK

C4-A OR

GLEE

INVK

K

complement

C4-B

18
complement
772
NGFK-
1305
RQIR-
1705
LSSTGRNGFK
2213
SHALQLNNRQIR
2615
GLEEELQFSL

C4-A OR

SHAL

GLEE

complement

C4-B

19
complement
773
PSRS-
1306
WESA-
1706
DVSLQLPSRS
2214
SKITHRIHWESA
2616
SLLRSEETKE

C3

SKIT

SLLR

20
complement
774
STGR-
1307
LNNR-
1707
LNVTLSSTGR
2215
NGFKSHALQLNN
2617
QIRGLEEELQ

C4-A OR

NGFK

QIRG

R

complement

C4-B

21
clusterin
775
LPHR-
1308
SRIV-
1708
HYLPFSLPHR
2216
RPHFFFPKSRIV
2618
RSLMPFSPYE

RPHF

RSLM

22
biglycan
776
NNPVP-

N/A
1709
GISLENNPVP

N/A
2619
YWEVQPATFR

YWEVQ

23
elastin
777
GLPYT-

N/A
1710
LPGGYGLPYT

N/A
2620
TGKLPYGYGP

TGKLP

24
elastin
778
ARPGF-

N/A
1711
LGGVAARPGF

N/A
2621
GLSPIFPGGA

GLSPI

25
fibrinogen
779
SRGK-
1309
ESKS-
1712
GIAEFPSRGK
2217
SSSYSKQFTSST
2622
YKMADEAGSE

alpha

SSSY

YKMA

SYNRGDSTFESK

chain

S

26
fibrinogen
780
SRGK-
1310
SKSY-
1713
GIAEFPSRGK
2218
SSSYSKQFTSST
2623
KMADEAGSEA

alpha

SSSY

KMAD

SYNRGDSTFESK

chain

SY

27
fibrinogen
781
DGFR-
1311
PSRG-
1714
SGIGTLDGFR
2219
HRHPDEAAFFDT
2624
KSSSYSKQFT

alpha

HRHP

KSSS

ASTGKTFPGFFS

chain

PMLGEFVSETES

RGSESGIFTNTK

ESSSHHPGIAEF

PSRG

28
fibrinogen
782
SRGK-
1312
SYKM-
1715
GIAEFPSRGK
2220
SSSYSKQFTSST
2625
ADEAGSEADH

alpha

SSSY

ADEA

SYNRGDSTFESK

chain

SYKM

29
fibrinogen
783
TAWT-
1313
GGVR-
1716
VLSVVGTAWT
2221
ADSGEGDFLAEG
2626
GPRVVERHQS

alpha

ADSG

GPRV

GGVR

chain

30
fibrinogen
784
AWTA-
1314
GGVR-
1717
LSVVGTAWTA
2222
DSGEGDFLAEGG
2627
GPRVVERHQS

alpha

DSGE

GPRV

GVR

chain

31
fibrinogen
785
AWTA-
1315
GGGV-
1718
LSVVGTAWTA
2223
DSGEGDFLAEGG
2628
RGPRVVERHQ

alpha

DSGE

RGPR

GV

chain

32
fibrinogen
786
KNNK-
1316
ANNR-
1719
SLFEYQKNNK
2224
DSHSLTTNIMEI
2629
DNTYNRVSED

alpha

DSHS

DNTY

LRGDFSSANNR

chain

33
fibrinogen
787
WTAD-
788
GGVR-
1720
SVVGTAWTAD
2225
SGEGDFLAEGGG
2630
GPRVVERHQS

alpha

SGEG

GPRV

VR

chain

34
fibrinogen
789
SRGK-
1317
YKMA-
1721
GIAEFPSRGK
2226
SSSYSKQFTSST
2631
DEAGSEADHE

alpha

SSSY

DEAG

SYNRGDSTFESK

chain

SYKMA

35
fibrinogen
790
TAWT-
1318
GGGV-
1722
VLSVVGTAWT
2227
ADSGEGDFLAEG
2632
RGPRVVERHQ

alpha

ADSG

RGPR

GGV

chain

36
fibrinogen
791
GNFK-
1319
MRME-
1723
VPDLVPGNFK
2228
SQLQKVPPEWKA
2633
LERPGGNEIT

alpha

SQLQ

LERP

LTDMPQMRME

chain

37
fibrinogen
792
TADS-
1320
GGVR-
1724
VVGTAWTADS
2229
GEGDFLAEGGGV
2634
GPRVVERHQS

alpha

GEGD

GPRV

R

chain

38
fibrinogen
793
SSGP-
1321
SSGP-
1725
GSWNSGSSGP
2230
GSTGNRNPGSSG
2635
GSTGSWNSGS

alpha

GSTG

GSTG

TGGTATWKPGSS

chain

GP

39
fibrinogen
794
ADSG-
1322
GGVR-
1726
VGTAWTADSG
2231
EGDFLAEGGGVR
2636
GPRVVERHQS

alpha

EGDF

GPRV

chain

40
fibrinogen
795
SRGK-
1323
YKMA-
1727
GIAEFPSRGK
2232
SSSYSKQFTSST
2637
DEAGSEADHE

alpha

SSSY

DEAG

SYNRGDSTFESK

chain

SYKMA

41
fibrinogen
796
SRGK-
1324
TFES-
1728
GIAEFPSRGK
2233
SSSYSKQFTSST
2638
KSYKMADEAG

alpha

SSSY

KSYK

SYNRGDSTFES

chain

42
fibrinogen
797
TAWT-
1325
PRVV-
1729
VLSVVGTAWT
2234
ADSGEGDFLAEG
2639
ERHQSACKDS

alpha

ADSG

ERHQ

GGVRGPRVV

chain

43
fibrinogen
798
NFKS-
1326
PEWK-
1730
PDLVPGNFKS
2235
QLQKVPPEWK
2640
ALTDMPQMRM

alpha

QLQK

ALTD

chain

44
fibrinogen
799
KMKP-
1327
GNFK-
1731
QHLPLIKMKP
2236
VPDLVPGNFK
2641
SQLQKVPPEW

alpha

VPDL

SQLQ

chain

45
fibrinogen
800
SGEG-
1328
GGVR-
1732
TAWTADSGEG
2237
DFLAEGGGVR
2642
GPRVVERHQS

alpha

DFLA

GPRV

chain

46
fibrinogen
801
TAWT-
1329
GPRV-
1733
VLSVVGTAWT
2238
ADSGEGDFLAEG
2643
VERHQSACKD

alpha

ADSG

VERH

GGVRGPRV

chain

47
fibrinogen
802
STGK-
1330
PSRG-
1734
AFFDTASTGK
2239
TFPGFFSPMLGE
2644
KSSSYSKQFT

alpha

TFPG

KSSS

FVSETESRGSES

chain

GIFTNTKESSSH

HPGIAEFPSRG

48
fibrinogen
803
PLIK-
1331
GNFK-
1735
RDRQHLPLIK
2240
MKPVPDLVPGNF
2645
SQLQKVPPEW

alpha

MKPV

SQLQ

K

chain

49
fibrinogen
804
PLIK-
1332
SQLQ-
1736
RDRQHLPLIK
2241
MKPVPDLVPGNF
2646
KVPPEWKALT

alpha

MKPV

KVPP

KSQLQ

chain

50
fibrinogen
805
GSWN-
1333
SSGP-
1737
RNPSSAGSWN
2242
SGSSGPGSTGNR
2647
GSTGSWNSGS

alpha

SGSS

GSTG

NPGSSGTGGTAT

chain

WKPGSSGP

51
fibrinogen
806
PLIK-
1334
MRME-
1738
RDRQHLPLIK
2243
MKPVPDLVPGNF
2648
LERPGGNEIT

alpha

MKPV

LERP

KSQLQKVPPEWK

chain

ALTDMPQMRME

52
fibrinogen
807
GEFV-
1335
PSRG-
1739
FFSPMLGEFV
2244
SETESRGSESGI
2649
KSSSYSKQFT

alpha

SETE

KSSS

FTNTKESSSHHP

chain

GIAEFPSRG

53
fibrinogen
808
AWTA-
1336
PRVV-
1740
LSVVGTAWTA
2245
DSGEGDFLAEGG
2650
ERHQSACKDS

alpha

DSGE

ERHQ

GVRGPRVV

chain

54
fibrinogen
809
KNNK-
1337
SANN-
1741
SLFEYQKNNK
2246
DSHSLTTNIMEI
2651
RDNTYNRVSE

alpha

DSHS

RDNT

LRGDFSSANN

chain

55
fibrinogen
810
TAWT-
1338
RVVE-
1742
VLSVVGTAWT
2247
ADSGEGDFLAEG
2652
RHQSACKDSD

alpha

ADSG

RHQS

GGVRGPRVVE

chain

56
fibrinogen
811
PLIK-
1339
KALT-
1743
RDRQHLPLIK
2248
MKPVPDLVPGNF
2653
DMPQMRMELE

alpha

MKPV

DMPQ

KSQLQKVPPEWK

chain

ALT

57
fibrinogen
812
GQWH-
1340
WGTF-
1744
SVSGSTGQWH
2249
SESGSFRPDSPG
2654
EEVSGNVSPG

alpha

SESG

EEVS

SGNARPNNPDWG

chain

TF

58
fibrinogen
813
KMKP-
1341
PGNF-
1745
QHLPLIKMKP
2250
VPDLVPGNF
2655
KSQLQKVPPE

alpha

VPDL

KSQL

chain

59
fibrinogen
814
MRME-
1342
SPRN-
1746
LTDMPQMRME
2251
LERPGGNEITRG
2656
PSSAGSWNSG

alpha

LERP

PSSA

GSTSYGTGSETE

chain

SPRN

60
fibrinogen
815
KPVP-
1343
PGNF-
1747
LPLIKMKPVP
2252
DLVPGNF
2657
KSQLQKVPPE

alpha

DLVP

KSQL

chain

61
fibrinogen
816
PLIK-
1344
PQMR-
1748
RDRQHLPLIK
2253
MKPVPDLVPGNF
2658
MELERPGGNE

alpha

MKPV

MELE

KSQLQKVPPEWK

chain

ALTDMPQMR

62
fibrinogen
817
TAWT-
1345
ERHQ-
1749
VLSVVGTAWT
2254
ADSGEGDFLAEG
2659
SACKDSDWPF

alpha

ADSG

SACK

GGVRGPRVVERH

chain

Q

63
fibrinogen
818
GNFK-
1346
KALT-
1750
VPDLVPGNFK
2255
SQLQKVPPEWKA
2660
DMPQMRMELE

alpha

SQLQ

DMPQ

LT

chain

64
fibrinogen
819
RGKS-
1347
TFES-
1751
IAEFPSRGKS
2256
SSYSKQFTSSTS
2661
KSYKMADEAG

alpha

SSYS

KSYK

YNRGDSTFES

chain

65
fibrinogen
820
MLGE-
1348
PSRG-
1752
PGFFSPMLGE
2257
FVSETESRGSES
2662
KSSSYSKQFT

alpha

FVSE

KSSS

GIFTNTKESSSH

chain

HPGIAEFPSRG

66
fibrinogen
821
MRME-
1349
GNRN-
1753
LTDMPQMRME
2258
LERPGGNEITRG
2663
PGSSGTGGTA

alpha

LERP

PGSS

GSTSYGTGSETE

chain

SPRNPSSAGSWN

SGSSGPGSTGNR

N

67
fibrinogen
822
SYSK-
1350
SKSY-
1754
SRGKSSSYSK
2259
QFTSSTSYNRGD
2664
KMADEAGSEA

alpha

QFTS

KMAD

STFESKSY

chain

68
fibrinogen
823
ESSV-
1351
WGTF-
1755
AGHWTSESSV
2260
SGSTGQWHSESG
2665
EEVSGNVSPG

alpha

SGST

EEVS

SFRPDSPGSGNA

chain

RPNNPDWGTF

69
fibrinogen
824
DGFR-
1352
PSRG-
1756
SGIGTLDGFR
2261
HRHPDEAAFFDT
2666
KSSSYSKQFT

alpha

HRHP

KSSS

ASTGKTFPGFFS

chain

PMLGEFVSETES

RGSESGIFTNTK

ESSSHHPGIAEF

PSRG

70
fibrinogen
825
PLIK-
1353
MELE-
1757
RDRQHLPLIK
2262
MKPVPDLVPGNF
2667
RPGGNEITRG

alpha

MKPV

RPGG

KSQLQKVPPEWK

chain

ALTDMPQMRMEL

E

71
fibrinogen
826
WGTF-
1354
LVTS-
1758
RPNNPDWGTF
2263
EEVSGNVSPGTR
2668
KGDKELRTGK

alpha

EEVS

KGDK

REYHTEKLVTS

chain

72
fibrinogen
827
MRME-
1355
GSWN-
1759
LTDMPQMRME
2264
LERPGGNEITRG
2669
SGSSGPGSTG

alpha

LERP

SGSS

GSTSYGTGSETE

chain

SPRNPSSAGSWN

73
fibrinogen
828
EAAF-
1356
TFPG-
1760
RHRHPDEAAF
2265
FDTASTGKTFPG
2670
FFSPMLGEFV

alpha

FDTA

FFSP

chain

74
fibrinogen
829
SRGK-
1357
RGHA-
1761
GIAEFPSRGK
2266
SSSYSKQFTSST
2671
KSRPVRDCDD

alpha

SSSY

KSRP

SYNRGDSTFESK

chain

SYKMADEAGSEA

DHEGTHSTKRGH

A

75
fibrinogen
830
PLIK-
1358
PGNF-
1762
RDRQHLPLIK
2267
MKPVPDLVPGNF
2672
KSQLQKVPPE

alpha

MKPV

KSQL

chain

76
alpha-1-
831
DPQG-
1359
LAHQ-
1763
PVSLAEDPQG
2268
DAAQKTDTSHHD
2673
SNSTNIFFSP

anti-

DAAQ

SNST

QDHPTFNKITPN

trypsin

LAEFAFSLYRQL

AHQ

77
alpha-1-
832
FVFL-

N/A
1764
VKFNKPFVFL

N/A
2674
MIEQNTKSPL

anti-

MIEQ

(end

FMGKVVNPTQ

trypsin

of

K

pro-

tein)

78
alpha-1-
833
MFLE-

N/A
1765
TEAAGAMFLE

N/A
2675
AIPMSIPPEV

anti-

AIPM

(end

KFNKPFVFLM

trypsin

of

IEQNTKSPLF

pro-

MGKVVNPTQK

tein)

79
alpha-1-
834
IPMS-

N/A
1766
AMFLEAIPMS

N/A
2676
IPPEVKFNKP

anti-

IPPE

(end

FVFLMIEQNT

trypsin

of

KSPLFMGKV

pro-

VNPTQK

tein}

80
alpha-1-
835
GTEA-

N/A
1767
LTIDEKGTEA

N/A
2677
AGAMFLEAIP

anti-

AGAM

(end

MSIPPEVKFN

trypsin

of

KPFVFLMIEQ

pro-

NTKSPLFMGK

tein)

VVNPTQK

81
alpha-1-
836
AIPM-

N/A
1768
GAMFLEAIPM

N/A
2678
SIPPEVKFNK

anti-

SIPP

(end

PFVFLMIEQN

trypsin

of

TKSPLFMGKV

pro-

VNPTQK

tein)

82
alpha-1-
837
PFVF-

N/A
1769
EVKENKPFVF

N/A
2679
LMIEQNTKSP

anti-

LMIE

(end

LFMGKVVNPT

trypsin

of

QK

pro-

tein)

83
alpha-1-
838
PEVK-

N/A
1770
IPMSIPPEVK

N/A
2680
FNKPFVFLMI

anti-

FNKP

(end

EQNTKSPLFM

trypsin

of

GKVVNPTQK

pro-

tein)

84
alpha-1-
839
AMFL-

N/A
1771
GTEAAGAMFL

N/A
2681
EAIPMSIPPE

anti-

EAIP

(end

VKFNKPFVFL

trypsin

of

MIEQNTKSPL

pro-

FMGKVVNPTQ

tein)

K

85
alpha-1-
840
VSLA-
1360
LAHQ-
1772
LCCLVPVSLA
2269
EDPQGDAAQKTD
2682
SNSTNIFFSP

anti-

EDPQ

SNST

TSHHDQDHPTFN

trypsin

KITPNLAEFAFS

LYRQLAHQ

86
alpha-1-
841
VFLM-

N/A
1773
KFNKPFVFLM

N/A
2683
IEQNTKSPLF

anti-

IEQN

(end

MGKVVNPTQK

trypsin

of

pro-

tein)

87
alpha-1-
842
AAGA-

N/A
1774
DEKGTEAAGA

N/A
2684
MFLEAIPMSI

anti-

MFLE

(end

PPEVKFNKPF

trypsin

of

VFLMIEQNTK

pro-

SPLFMGKVVN

tein)

PTQK

88
alpha-1-
843
EAIP-

N/A
1775
AGAMFLEAIP

N/A
2685
MSIPPEVKFN

anti-

MSIP

(end

KPFVFLMIEQ

trypsin

of

NTKSPLFMGK

pro-

VVNPTQK

tein)

89
alpha-1-
844
AGAM-

N/A
1776
EKGTEAAGAM

N/A
2686
FLEAIPMSIP

anti-

FLEA

(end

PEVKFNKPFV

trypsin

of

FLMIEQNTKS

pro-

PLFMGKVVNP

tein)

TQK

90
alpha-1-
845
DPQG-
1361
AHQS-
1777
PVSLAEDPQG
2270
DAAQKTDTSHHD
2687
NSTNIFFSPV

anti-

DAAQ

NSTN

QDHPTFNKITPN

trypsin

LAEFAFSLYRQL

AHQS

91
alpha-1-
846
DEKG-

N/A
1778
KAVLTIDEKG

N/A
2688
TEAAGAMFLE

anti-

TEAA

(end

AIPMSIPPEV

trypsin

of

KFNKPFVFLM

pro-

IEQNTKSPLF

tein)

MGKVVNPTQK

92
alpha-1-
847
EKGT-

N/A
1779
AVLTIDEKGT

N/A
2689
EAAGAMFLEA

anti-

EAAG

(end

IPMSIPPEVK

trypsin

of

FNKPFVFLMI

pro-

EQNTKSPLFM

tein)

GKVVNPTQK

93
alpha-1-
848
AMFL-

N/A
1780
GTEAAGAMFL

N/A
2690
EAIPMSIPPE

anti-

EAIP

(end

VKFNKPFVFL

trypsin

of

MIEQNTKSPL

pro-

FMGKVVNPTQ

tein)

K

94
alpha-1-
849
VSLA-
1362
AHQS-
1781
LCCLVPVSLA
2271
EDPQGDAAQKTD
2691
NSTNIFFSPV

anti-

EDPQ

NSTN

TSHHDQDHPTFN

trypsin

KITPNLAEFAFS

LYRQLAHQS

95
alpha-1-
850
VSLA-
1363
QLAH-
1782
LCCLVPVSLA
2272
EDPQGDAAQKTD
2692
QSNSTNIFFS

anti-

EDPQ

QSNS

TSHHDQDHPTFN

trypsin

KITPNLAEFAFS

LYRQLAH

96
alpha-1-
851
KGTE-

N/A
1783
VLTIDEKGTE

N/A
2693
AAGAMFLEAI

anti-

AAGA

(end

PMSIPPEVKF

trypsin

of

NKPFVFLMIE

pro-

QNTKSPLFMG

tein)

KVVNPTQK

97
alpha-1-
852
DPQG-
1364
QLAH-
1784
PVSLAEDPQG
2273
DAAQKTDTSHHD
2694
QSNSTNIFFS

anti-

DAAQ

QSNS

QDHPTFNKITPN

trypsin

LAEFAFSLYRQL

AH

98
alpha-1-
853
TEAA-

N/A
1785
TIDEKGTEAA

N/A
2695
GAMFLEAIPM

anti-

GAMF

(end

SIPPEVKFNK

trypsin

of

PFVFLMIEQN

pro-

TKSPLFMGKV

tein)

VNPTQK

99
alpha-1-
854
IDEK-

N/A
1786
HKAVLTIDEK

N/A
2696
GTEAAGAMFL

anti-

GTEA

(end

EAIPMSIPPE

trypsin

of

VKFNKPFVFL

pro-

MIEQNTKSPL

tein)

FMGKVVNPTQ

K

100
alpha-1-
855
EAAG-

N/A
1787
IDEKGTEAAG

N/A
2697
AMFLEAIPMS

anti-

AMFL

(end

IPPEVKFNKP

trypsin

of

FVFLMIEQNT

pro-

KSPLFMGKVV

tein)

NPTQK

101
alpha-1-
856
VSLA-
1365
RQLA-
1788
LCCLVPVSLA
2274
EDPQGDAAQKTD
2698
HQSNSTNIFF

anti-

EDPQ

HQSN

TSHHDQDHPTFN

trypsin

KITPNLAEFAFS

LYRQLA

102
alpha-1-
857
LAED-
1366
LAHQ-
1789
CLVPVSLAED
2275
PQGDAAQKTDTS
2699
SNSTNIFFSP

anti-

PQGD

SNST

HHDQDHPTFNKI

trypsin

TPNLAEFAFSLY

RQLAHQ

103
Complement
858
LSLQ-
1367
QVVK-
1790
ASSFFTLSLQ
2276
KPRLLLFSPSVV
2700
GSVFLRNPSR

C4-B OR

KPRL

GSVF

HLGVPLSVGVQL

Complement

QDVPRGQVVK

C4-A

104
Complement
859
LSLQ-
1368
VPRG-
1791
ASSFFTLSLQ
2277
KPRLLLFSPSVV
2701
QVVKGSVFLR

C4-B OR

KPRL

QVVK

HLGVPLSVGVQL

Complement

QDVPRG

C4-A

105
Complement
860
LSLQ-
1369
PSRN-
1792
ASSFFTLSLQ
2278
KPRLLLFSPSVV
2702
NVPCSPKVDF

C4-B OR

KPRL

NVPC

HLGVPLSVGVQL

Complement

QDVPRGQVVKGS

C4-A

VFLRNPSRN

106
Complement
861
LSLQ-
1370
SRNN-
1793
ASSFFTLSLQ
2279
KPRLLLFSPSVV
2703
VPCSPKVDFT

C4-B OR

KPRL

VPCS

HLGVPLSVGVQL

Complement

QDVPRGQVVKGS

C4-A

VFLRNPSRNN

107
Complement
862
LSLQ-
1371
VPLS-
1794
ASSFFTLSLQ
2280
KPRLLLFSPSVV
2704
VGVQLQDVPR

C4-B OR

KPRL

VGVQ

HLGVPLS

Complement

C4-A

108
Complement
863
LSLQ-
1372
GSVF-
1795
ASSFFTLSLQ
2281
KPRLLLFSPSVV
2705
LRNPSRNNVP

C4-B OR

KPRL

LRNP

HLGVPLSVGVQL

Complement

QDVPRGQVVKGS

C4-A

VF

109
Complement
864
LSLQ-
1373
VQLQ-
1796
ASSFFTLSLQ
2282
KPRLLLFSPSVV
2706
DVPRGQVVKG

C4-B OR

KPRL

DVPR

HLGVPLSVGVQL

Complement

Q

C4-A

110
Complement
865
LSLQ-
1374
VGVQ-
1797
ASSFFTLSLQ
2283
KPRLLLFSPSVV
2707
LQDVPRGQVV

C4-B OR

KPRL

LQDV

HLGVPLSVGVQ

Complement

C4-A

111
Complement
866
STGR-
1375
NRQI-
1798
LNVTLSSTGR
2284
NGFKSHALQLNN
2708
RGLEEELQFS

C4-B OR

NGFK

RGLE

RQI

Complement

C4-A

112
Complement
867
LSLQ-
1376
VHLG-
1799
ASSFFTLSLQ
2285
KPRLLLFSPSVV
2709
VPLSVGVQLQ

C4-B OR

KPRL

VPLS

HLG

Complement

C4-A

113
Complement
868
STGR-
1377
RQIR-
1800
LNVTLSSTGR
2286
NGFKSHALQLNN
2710
GLEEELQFSL

C4-B OR

NGFK

GLEE

RQIR

Complement

C4-A

114
Complement
869
LPAK-
1378
GRRN-
1801
DYEYDELPAK
2287
DDPDAPLQPVTP
2711
RRRREAPKVV

C4-B OR

DDPD

RRRR

LQLFEGRRN

Complement

C4-A

115
Complement
870
LSLQ-
1379
SVVH-
1802
ASSFFTLSLQ
2288
KPRLLLFSPSVV
2712
LGVPLSVGVQ

C4-B OR

KPRL

LGVP

H

Complement

C4-A

116
Complement
871
RQIR-
1380
INVK-
1803
ALQLNNRQIR
2289
GLEEELQFSLGS
2713
VGGNSKGTLK

C4-B OR

GLEE

VGGN

KINVK

Complement

C4-A

117
Complement
872
PAKD-
1381
PVTP-
1804
YEYDELPAKD
2290
DPDAPLQPVTP
2714
LQLFEGRRNR

C4-B OR

DPDA

LQLF

Complement

C4-A

118
Complement
873
LPAK-
1382
PVTP-
1805
DYEYDELPAK
2291
DDPDAPLQPVTP
2715
LQLFEGRRNR

C4-B OR

DDPD

LQLF

Complement

C4-A

119
Complement
874
TGRN-
1383
NRQI-
1806
NVTLSSTGRN
2292
GFKSHALQLNNR
2716
RGLEEELQFS

C4-B OR

GFKS

RGLE

QI

Complement

C4-A

120
Complement
875
LPAK-
1384
TPLQ-
1807
DYEYDELPAK
2293
DDPDAPLQPVTP
2717
LFEGRRNRRR

C4-B OR

DDPD

LFEG

LQ

Complement

C4-A

121
Complement
876
LSLQ-
1385
VFLR-
1808
ASSFFTLSLQ
2294
KPRLLLFSPSVV
2718
NPSRNNVPCS

C4-B OR

KPRL

NPSR

HLGVPLSVGVQL

Complement

QDVPRGQVVKGS

C4-A

VFLR

122
Complement
877
LPAK-
1386
PLQL-
1809
DYEYDELPAK
2295
DDPDAPLQPVTP
2719
FEGRRNRRRR

C4-B OR

DDPD

FEGR

LQL

Complement

C4-A

123
Complement
878
RQIR-
1387
LGSK-
1810
ALQLNNRQIR
2296
GLEEELQFSLGS
2720
INVKVGGNSK

C4-B OR

GLEE

INVK

K

Complement

C4-A

124
Complement
879
HRGR-
1388
VRVT-
1811
ELNPLDHRGR
2297
TLEIPGNSDPNM
2721
ASDPLDTLGS

C4-B OR

TLEI

ASDP

IPDGDFNSYVRV

Complement

T

C4-A

125
Complement
880
PRLL-
1389
VGVQ-
1812
TLSLQKPRLL
2298
LFSPSVVHLGVP
2722
LQDVPRGQVV

C4-B OR

LFSP

LQDV

LSVGVQ

Complement

C4-A

126
Complement
881
SELQ-
1390
ARLT-
1813
IIPQTISELQ
2299
LSVSAGSPHPAI
2723
VAAPPSGGPG

C4-B OR

LSVS

VAAP

ARLT

Complement

C4-A

127
Complement
882
ARLT-
1391
PRVG-
1814
SPHPAIARLT
2300
VAAPPSGGPGFL
2724
DTLNLNLRAV

C4-B OR

VAAP

DTLN

SIERPDSRPPRV

Complement

G

C4-A

128
Complement
883
PRLL-
1392
VPLS-
1815
TLSLQKPRLL
2301
LFSPSVVHLGVP
2725
VGVQLQDVPR

C4-B OR

LFSP

VGVQ

LS

Complement

C4-A

129
Complement
884
AKDD-
1393
GRRN-
1816
EYDELPAKDD
2302
PDAPLQPVTPLQ
2726
RRRREAPKVV

C4-B OR

PDAP

RRRR

LFEGRRN

Complement

C4-A

130
Complement
885
PAKD-
1394
GRRN-
1817
YEYDELPAKD
2303
DPDAPLQPVTPL
2727
RRRREAPKVV

C4-B OR

DPDA

RRRR

QLFEGRRN

Complement

C4-A

131
Complement
886
RLLL-
1395
VPLS-
1818
LSLQKPRLLL
2304
FSPSVVHLGVPL
2728
VGVQLQDVPR

C4-B OR

FSPS

VGVQ

S

Complement

C4-A

132
Complement
887
DYEY-
1396
GRRN-
1819
ANEDYEDYEY
2305
DELPAKDDPDAP
2729
RRRREAPKVV

C4-B OR

DELP

RRRR

LQPVTPLQLFEG

Complement

RRN

C4-A

133
Complement
888
PRLL-
1397
QVVK-
1820
TLSLQKPRLL
2306
LFSPSVVHLGVP
2730
GSVFLRNPSR

C4-B OR

LFSP

GSVF

LSVGVQLQDVPR

Complement

GQVVK

C4-A

134
Complement
889
PRLL-
1398
VHLG-
1821
TLSLQKPRLL
2307
LFSPSVVHLG
2731
VPLSVGVQLQ

C4-B OR

LFSP

VPLS

Complement

C4-A

135
Complement
890
LTVA-
1399
PRVG-
1822
HPAIARLTVA
2308
APPSGGPGFLSI
2732
DTLNLNLRAV

C4-B OR

APPS

DTLN

ERPDSRPPRVG

Complement

C4-A

136
fibrinogen
891
DKKR-
1400
GGGY-
1823
RGHRPLDKKR
2309
EEAPSLRPAPPP
2733
RARPAKAAAT

beta

EEAP

RARP

ISGGGY

chain

137
fibrinogen
892
KKRE-
1401
GGGY-
1824
GHRPLDKKRE
2310
EAPSLRPAPPPI
2734
RARPAKAAAT

beta

EAPS

RARP

SGGGY

chain

138
fibrinogen
893
LDKK-
1402
GGGY-
1825
ARGHRPLDKK
2311
REEAPSLRPAPP
2735
RARPAKAAAT

beta

REEA

RARP

PISGGGY

chain

139
fibrinogen
894
LVKS-
1403
GGGY-
1826
LLLCVFLVKS
2312
QGVNDNEEGFFS
2736
RARPAKAAAT

beta

QGVN

RARP

ARGHRPLDKKRE

chain

EAPSLRPAPPPI

SGGGY

140
fibrinogen
895
LVKS-
1404
FSAR-
1827
LLLCVFLVKS
2313
QGVNDNEEGFFS
2737
GHRPLDKKRE

beta

QGVN

GHRP

AR

chain

141
fibrinogen
896
LVKS-
1405
ARGH-
1828
LLLCVFLVKS
2314
QGVNDNEEGFFS
2738
RPLDKKREEA

beta

QGVN

RPLD

ARGH

chain

142
fibrinogen
897
LVKS-
1406
FFSA-
1829
LLLCVFLVKS
2315
QGVNDNEEGFFS
2739
RGHRPLDKKR

beta

QGVN

RGHR

A

chain

143
fibrinogen
898
LVKS-
1407
KKRE-
1830
LLLCVFLVKS
2316
QGVNDNEEGFFS
2740
EAPSLRPAPP

beta

QGVN

EAPS

ARGHRPLDKKRE

chain

144
fibrinogen
899
PLDK-
1408
GGGY-
1831
SARGHRPLDK
2317
KREEAPSLRPAP
2741
RARPAKAAAT

beta

KREE

RARP

PPISGGGY

chain

145
fibrinogen
900
LVKS-
1409
GGYR-
1832
LLLCVFLVKS
2318
QGVNDNEEGFFS
2742
ARPAKAAAT

beta

QGVN

ARPA

ARGHRPLDKKRE

chain

EAPSLRPAPPPI

SGGGYR

146
fibrinogen
901
LVKS-
1410
RPLD-
1833
LLLCVFLVKS
2319
QGVNDNEEGFFS
2743
KKREEAPSLR

beta

QGVN

KKRE

ARGHRPLD

chain

147
fibrinogen
902
HRPL-
1411
GGGY-
1834
FFSARGHRPL
2320
DKKREEAPSLRP
2744
RARPAKAAAT

beta

DKKR

RARP

APPPISGGGY

chain

148
fibrinogen
903
LVKS-
1412
DKKR-
1835
LLLCVFLVKS
2321
QGVNDNEEGFFS
2745
EEAPSLRPAP

beta

QGVN

EEAP

ARGHRPLDKKR

chain

149
fibrinogen
904
SQGV-
1413
FFSA-
1836
CVFLVKSQGV
2322
NDNEEGFFSA
2746
RGHRPLDKKR

beta

NDNE

RGHR

chain

150
fibrinogen
905
LVKS-
1414
RPAK-
1837
LLLCVFLVKS
2323
QGVNDNEEGFFS
2747
AAATQKKVER

beta

QGVN

AAAT

ARGHRPLDKKRE

chain

EAPSLRPAPPPI

SGGGYRARPAK

151
fibrinogen
906
LVKS-
1415
GFFS-
1838
LLLCVFLVKS
2324
QGVNDNEEGFFS
2748
ARGHRPLDKK

beta

QGVN

ARGH

chain

152
fibrinogen
907
KSQG-
1416
RPLD-
1839
LCVFLVKSQG
2325
VNDNEEGFFSAR
2749
KKREEAPSLR

beta

VNDN

KKRE

GHRPLD

chain

153
fibrinogen
908
KSQG-
1417
GGYR-
1840
LCVFLVKSQG
2326
VNDNEEGFFSAR
2750
ARPAKAAATQ

beta

VNDN

ARPA

GHRPLDKKREEA

chain

PSLRPAPPPISG

GGYR

154
fibrinogen
909
KSQG-
1418
PLDK-
1841
LCVFLVKSQG
2327
VNDNEEGFFSAR
2751
KREEAPSLRP

beta

VNDN

KREE

GHRPLDK

chain

155
fibrinogen
910
LVKS-
1419
APSL-
1842
LLLCVFLVKS
2328
QGVNDNEEGFFS
2752
RPAPPPISGG

beta

QGVN

RPAP

ARGHRPLDKKRE

chain

EAPSL

156
fibrinogen
911
LVKS-
1420
EAPS-
1843
LLLCVFLVKS
2329
QGVNDNEEGFFS
2753
LRPAPPPISG

beta

QGVN

LRPA

ARGHRPLDKKRE

chain

EAPS

157
fibrinogen
912
LVKS-
1421
LDKK-
1844
LLLCVFLVKS
2330
QGVNDNEEGFFS
2754
REEAPSLRPA

beta

QGVN

REEA

ARGHRPLDKK

chain

158
fibrinogen
913
KSQG-
1422
FFSA-
1845
LCVFLVKSQG
2331
VNDNEEGFFSA
2755
RGHRPLDKKR

beta

VNDN

RGHR

chain

159
fibrinogen
914
PLDK-
1423
GGYR-
1846
SARGHRPLDK
2332
KREEAPSLRPAP
2756
ARPAKAAATQ

beta

KREE

ARPA

PPISGGGYR

chain

160
fibrinogen
915
FSAR-
1424
GGGY-
1847
DNEEGFFSAR
2333
GHRPLDKKREEA
2757
RARPAKAAAT

beta

GHRP

RARP

PSLRPAPPPISG

chain

GGY

161
fibrinogen
916
SQGV-
1425
RPLD-
1848
CVFLVKSQGV
2334
NDNEEGFFSARG
2758
KKREEAPSLR

beta

NDNE

KKRE

HRPLD

chain

162
fibrinogen
917
LVKS-
1426
RARP-
1849
LLLCVFLVKS
2335
QGVNDNEEGFFS
2759
AKAAATQKKV

beta

QGVN

AKAA

ARGHRPLDKKRE

chain

EAPSLRPAPPPI

SGGGYRARP

163
fibrinogen
918
LVKS-
1427
AKAA-
1850
LLLCVFLVKS
2336
QGVNDNEEGFFS
2760
ATQKKVERKA

beta

QGVN

ATQK

ARGHRPLDKKRE

chain

EAPSLRPAPPPI

SGGGYRARPAKA

A

164
fibrinogen
919
MSMK-

N/A
1851
WYSMRKMSMK

N/A
2761
IRPFFPQQ

beta

IRPF

(end

chain

of

pro-

tein)

165
fibrinogen
920
QGVN-
1428
FFSA-
1852
VFLVKSQGVN
2337
DNEEGFFSA
2762
RGHRPLDKKR

beta

DNEE

RGHR

chain

166
fibrinogen
921
LVKS-
1429
PPIS-
1853
LLLCVFLVKS
2338
QGVNDNEEGFFS
2763
GGGYRARPAK

beta

QGVN

GGGY

ARGHRPLDKKRE

chain

EAPSLRPAPPPI

S

167
fibrinogen
922
LVKS-
1430
KREE-
1854
LLLCVFLVKS
2339
QGVNDNEEGFFS
2764
APSLRPAPPP

beta

QGVN

APSL

ARGHRPLDKKRE

chain

E

168
fibrinogen
923
LVKS-
1431
ARPA-
1855
LLLCVFLVKS
2340
QGVNDNEEGFFS
2765
KAAATQKKVE

beta

QGVN

KAAA

ARGHRPLDKKRE

chain

EAPSLRPAPPPI

SGGGYRARPA

169
serum
924
NIQR-

N/A
1856
ISDARENIQR

N/A
2766
FFGHGAEDSL

amyloid

FFGH

(end

ADQAANEWGR

A-1

of

SGKDPNHFRP

protein

pro-

AGLPEKY

tein)

170
serum
925
AAEA-

N/A
1857
GPGGVWAAEA

N/A
2767
ISDARENIQR

amyloid

ISDA

(end

FFGHGAEDSL

A-1

of

ADQAANEWGR

protein

pro-

SGKDPNHFRP

tein)

AGLPEKY

171
serum
926
GAED-

N/A
1858
QRFFGHGAED

N/A
2768
SLADQAANEW

amyloid

SLAD

(end

GRSGKDPNHF

A-1

of

RPAGLPEKY

protein

pro-

tein)

172
serum
927
IQRF-

N/A
1859
SDARENIQRF

N/A
2769
FGHGAEDSLA

amyloid

FGHG

(end

DQAANEWGRS

A-1

of

GKDPNHFRPA

protein

pro-

GLPEKY

tein)

173
serum
928
GVSS-
1432
EAFD-
1860
FCSLVLGVSS
2341
RSFFSFLGEAFD
2770
GARDMWRAYS

amyloid

RSFF

GARD

A-1

protein

174
serum
929
SGKD-

N/A
1861
ANEWGRSGKD

N/A
2771
PNHFRPAGLP

amyloid

PNHF

(end

EKY

A-1

of

protein

pro-

tein)

175
serum
930
GVWA-

N/A
1862
AKRGPGGVWA

N/A
2772
AEAISDAREN

amyloid

AEAI

(end

IQRFFGHGAE

A-1

of

DSLADQAANE

protein

pro-

WGRSGKDPNH

tein)

FRPAGLPEKY

176
serum
931
ADQA-

N/A
1863
GAEDSLADQA

N/A
2773
ANEWGRSGKD

amyloid

ANEW

(end

PNHFRPAGLP

A-1

of

EKY

protein

pro-

tein)

177
serum
932
GEAF-
1433
KRGP-
1864
SFFSFLGEAF
2342
DGARDMWRAYSD
2774
GGVWAAEAIS

amyloid

DGAR

GGVW

MREANYIGSDKY

A-1

FHARGNYDAAKR

protein

GP

178
serum
933
AISD-

N/A
1865
GVWAAEAISD

N/A
2775
ARENIQRFFG

amyloid

AREN

(end

HGAEDSLADQ

A-1

of

AANEWGRSGK

protein

pro-

DPNHFRPAGL

tein)

PEKY

179
serum
934
VSSR-
935
EAFD-
1866
CSLVLSVSSR
2343
SFFSFLGEAFD
2776
GARDMWRAYS

amyloid

SFFS

GARD

A-1 pro-

tein OR

serum

amyloid

A-2

protein

180
serum
936
DSLA-

N/A
1867
FGHGAEDSLA

N/A
2777
DQAANEWGRS

amyloid

DQAA

(end

GKDPNHFRPA

A-1

of

GLPEKY

protein

pro-

tein)

181
serum
937
SFFS-
1434
KRGP-
1868
LSVSSRSFFS
2344
FLGEAFDGARDM
2778
GGVWAAEVIS

amyloid

FLGE

GGVW

WRAYSDMREANY

A-1

IGSDKYFHARGN

protein

YDAAKRGP

182
serum
938
SLAD-

N/A
1869
GHGAEDSLAD

N/A
2779
QAANEWGRSG

amyloid

QAAN

(end

KDPNHFRPAG

A-1

of

LPEKY

protein

pro-

tein)

183
serum
939
ENIQ-

N/A
1870
AISDARENIQ

N/A
2780
RFFGHGAEDS

amyloid

RFFG

(end

LADQAANEWG

A-1

of

RSGKDPNHFR

protein

pro-

PAGLPEKY

tein)

184
serum
940
NIQR-
1435
LPEK-Y
1871
ISDARENIQR
2345
FFGHGAEDSLAD

N/A

amyloid

FFGH

QAANEWGRSGKD

A-1

PNHFRPAGLPEK

protein

185
serum
941
FFGH-

N/A
1872
RENIQRFFGH

N/A
2781
GAEDSLADQA

amyloid

GAED

(end

ANEWGRSGKD

A-1

of

PNHFRPAGLP

protein

pro-

EKY

tein)

186
serum
942
RSGK-

N/A
1873
AANEWGRSGK

N/A
2782
DPNHFRPAGL

amyloid

DPNH

(end

PEKY

A-1

of

protein

pro-

tein)

187
serum
943
RFFG-

N/A
1874
ARENIQRFFG

N/A
2783
HGAEDSLADQ

amyloid

HGAE

(end

AANEWGRSGK

A-1

of

DPNHFRPAGL

protein

pro-

PEKY

tein)

188
serum
944
EWGR-

N/A
1875
ADQAANEWGR

N/A
2784
SGKDPNHFRP

amyloid

SGKD

(end

AGLPEKY

A-1

of

protein

pro-

tein)

189
serum
945
DQAA-

N/A
1876
AEDSLADQAA

N/A
2785
NEWGRSGKDP

amyloid

NEWG

(end

NHFRPAGLPE

A-1

of

KY

protein

pro-

tein)

190
serum
946
EAIS-

N/A
1877
GGVWAAEAIS

N/A
2786
DARENIQRFF

amyloid

DARE

(end

GHGAEDSLAD

A-1

of

QAANEWGRSG

protein

pro-

KDPNHFRPAG

tein)

LPEKY

191
serum
947
RSFF-
1436
KRGP-
1878
VLSVSSRSFF
2346
SFLGEAFDGARD
2787
GGVWAAEAIS

amyloid

SFLG

GGVW

MWRAYSDMREAN

A-1

YIGSDKYFHARG

protein

NYDAAKRGP

192
serum
948
DSLA-
1437
FRPA-
1879
FGHGAEDSLA
2347
DQAANEWGRSGK
2788
GLPEKY

amyloid

DQAA

GLPE

DPNHFRPA

A-1

protein

193
serum
949
AISD-
1438
GAED-
1880
GVWAAEAISD
2348
ARENIQRFFGHG
2789
SLADQAANEW

amyloid

AREN

SLAD

AED

A-1

protein

194
serum
950
EDSL-

N/A
1881
FFGHGAEDSL

N/A
2790
ADQAANEWGR

amyloid

ADQA

(end

SGKDPNHFRP

A-1

of

AGLPEKY

protein

pro-

tein)

195
serum
951
AEAI-

N/A
1882
PGGVWAAEAI

N/A
2791
SDARENIQRF

amyloid

SDAR

(end

FGHGAEDSLA

A-1

of

DQAANEWGRS

protein

pro-

GKDPNHFRPA

tein)

GLPEKY

196
serum
952
FSFL-
1439
KRGP-
1883
VSSRSFFSFL
2349
GEAFDGARDMWR
2792
GGVWAAEAIS

amyloid

GEAF

GGVW

AYSDMREANYIG

A-1

SDKYFHARGNYD

protein

AAKRGP

197
serum
953
FHAR-

N/A
1884
IGSDKYFHAR

N/A
2793
GNYDAAKRGP

amyloid

GNYD

(end

GGVWAAEAIS

A-1

of

DARENIQRFF

protein

pro-

GHGAEDSLAD

tein)

QAANEWGRSG

KDPNHFRPA

GLPEKY

198
serum
954
GVSS-
1440
FSFL-
1885
FCSLVLGVSS
2350
RSFFSFL
2794
GEAFDGARDM

amyloid

RSFF

GEAF

A-1 pro-

tein OR

serum

amyloid

A-2

protein

199
serum
955
KRGP-

N/A
1886
GNYDAAKRGP

N/A
2795
GGVWAAEAIS

amyloid

GGVW

(end

DARENIQRFF

A-1

of

GHGAEDSLAD

protein

pro

QAANEWGRSG

tein)

KDPNHFRPAG

LPEKY

200
trans-
956
FTAN-

N/A
1887
EHAEVVFTAN

N/A
2796
DSGPRRYTIA

thyretin

DSGP

(end

ALLSPYSYST

of

TAVVTNPKE

pro-

tein)

201
trans-
957
ANDS-

N/A
1888
AEVVFTANDS

N/A
2797
GPRRYTIAAL

thyretin

GPRR

(end

LSPYSYSTTA

of

VVTNPKE

pro-

tein)

202
trans-
958
ANDS-
1441
TNPK-E
1889
AEVVFTANDS
2351
GPRRYTIAALLS

N/A

thyretin

GPRR

PYSYSTTAVVTN

PK

203
trans-
959
TAND-

N/A
1890
HAEVVFTAND

N/A
2798
SGPRRYTIAA

thyretin

SGPR

(end

LLSPYSYSTT

of

AVVTNPKE

pro-

tein)

204
trans-
960
NDSG-

N/A
1891
EVVFTANDSG

N/A
2799
PRRYTIAALL

thyretin

PRRY

(end

SPYSYSTTAV

of

VTNPKE

pro-

tein)

205
trans-
961
AALL-

N/A
1892
PRRYTIAALL

N/A
2800
SPYSYSTTAV

thyretin

SPYS

(end

VTNPKE

of

pro-

tein)

206
trans-
962
YTIA-

N/A
1893
DSGPRRYTIA

N/A
2801
ALLSPYSYST

thyretin

ALLS

(end

TAVVTNPKE

of

pro-

tein)

207
trans-
963
RRYT-

N/A
1894
ANDSGPRRYT

N/A
2802
IAALLSPYSY

thyretin

IAAL

(end

STTAVVTNPK

of

E

pro-

tein)

208
trans-
964
EVVF-

N/A
1895
PFHEHAEVVF

N/A
2803
TANDSGPRRY

thyretin

TAND

(end

TIAALL

of

SPYSYSTTAV

pro-

VTNPKE

tein)

209
trans-
965
KALG-

N/A
1896
DTKSYWKALG

N/A
2804
ISPFHEHAEV

thyretin

ISPF

(end

VFTANDSGPR

of

RYTIAALLSP

pro-

YSYSTTAVVT

tein)

NPKE

210
alpha-2-
966
PVSA-
1442
TALK-
1897
PCSVFSPVSA
2352
MEPLGRQLTSGP
2805
SPPGVCSRDP

anti-

MEPL

SPPG

NQEQVSPLTLLK

plasmin

LGNQEPGGQTAL

K

211
alpha-2-
967
PVSA-
1443
TLLK-
1898
PCSVFSPVSA
2353
MEPLGRQLTSGP
2806
LGNQEPGGQT

anti-

MEPL

LGNQ

NQEQVSPLTLLK

plasmin

212
alpha-2-
968
TSGP-
1444
TALK-
1899
PLGRQLTSGP
2354
NQEQVSPLTLLK
2807
SPPGVCSRDP

anti-

NQEQ

SPPG

LGNQEPGGQTAL

plasmin

K

213
alpha-2-
969
PDLK-

N/A
1900
GDKLFGPDLK

N/A
2808
LVPPMEEDYP

anti-

LVPP

(end

QFGSPK

plasmin

of

pro-

tein)

214
alpha-2-
970
RQLT-
1445
TLLK-
1901
AMEPLGRQLT
2355
SGPNQEQVSPLT
2809
LGNQEPGGQT

anti-

SGPN

LGNQ

LLK

plasmin

215
alpha-2-
971
PVSA-
1446
TSGP-
1902
PCSVFSPVSA
2356
MEPLGRQLTSGP
2810
NQEQVSPLTL

anti-

MEPL

NQEQ

plasmin

216
alpha-2-
972
VSAM-
1447
TSGP-
1903
CSVFSPVSAM
2357
EPLGRQLTSGP
2811
NQEQVSPLTL

anti-

EPLG

NQEQ

plasmin

217
alpha-2-
973
TSGP-
1448
TLLK-
1904
PLGRQLTSGP
2358
NQEQVSPLTLLK
2812
LGNQEPGGQT

anti-

NQEQ

LGNQ

plasmin

218
alpha-2-
974
TSGP-
1449
LKLG-
1905
PLGRQLTSGP
2359
NQEQVSPLTLLK
2813
NQEPGGQTAL

anti-

NQEQ

NQEP

LG

plasmin

219
alpha-2-
975
AMSR-
1450
KEQQ-
1906
AAATSIAMSR
2360
MSLSSFSVNRPF
2814
DSPGNKDFLQ

anti-

MSLS

DSPG

LFFIFEDTTGLP

plasmin

LFVGSVRNPNPS

APRELKEQQ

220
alpha-2-
976
PVSA-
1451
PLTL-
1907
PCSVFSPVSA
2361
MEPLGRQLTSGP
2815
LKLGNQEPGG

anti-

MEPL

LKLG

NQEQVSPLTL

plasmin

221
apolipo-
977
LLPV-

N/A
1908
EDLRQGLLPV

N/A
2816
LESFKVSFLS

protein

LESF

(end

ALEEYTKKLN

A-I

of

TQ

pro-

tein)

222
apolipo-
978
LPVL-

N/A
1909
DLRQGLLPVL

N/A
2817
ESFKVSFLSA

protein

ESFK

(end

LEEYTKKLNT

A-I

of

Q

pro-

tein)

223
apolipo
979
RVKD-
1452
GKQL-
1910
PQSPWDRVKD
2362
LATVYVDVLKDS
2818
NLKLLDNWDS

protein

LATV

NLKL

GRDYVSQFEGSA

A-I

LGKQL

Isoform 1

224
apolipo-
980
FWQQ-
1453
DVLK-
1911
GSQARHFWQQ
2363
DEPPQSPWDRVK
2819
DSGRDYVSQF

protein

DEPP

DSGR

DLATVYVDVLK

A-I

225
apolipo-
981
PVLE-

N/A
1912
LRQGLLPVLE

N/A
2820
SFKVSFLSAL

protein

SFKV

(end

EEYTKKLNTQ

A-I

of

pro-

tein)

226
apolipo-
982
ESFK-

N/A
1913
GLLPVLESFK

N/A
2821
VSFLSALEEY

protein

VSFL

(end

TKKLNTQ

A-I

of

pro-

tein)

227
apolipo-
983
EFWD-
1454
RQEM-
1914
LGPVTQEFWD
2364
NLEKETEGLRQE
2822
SKDLEEVKAK

protein

NLEK

SKDL

M

A-I

228
apolipo-
984
VSFL-

N/A
1915
VLESFKVSFL

N/A
2823
SALEEYTKKL

protein

SALE

(end

NTQ

A-I

of

pro-

tein)

229
apolipo-
985
SFKV-

N/A
1916
LLPVLESFKV

N/A
2824
SFLSALEEYT

protein

SFLS

(end

KKLNTQ

A-I

of

pro-

tein)

230
apolipo-
986
FWQQ-
1455
RVKD-
1917
GSQARHFWQQ
2365
DEPPQSPWDRVK
2825
LATVYVDVLK

protein

DEPP

LATV

D

A-I

231
apolipo-
987
EPLR-
1456
HELQ-
1918
LYRQKVEPLR
2366
AELQEGARQKLH
2826
EKLSPLGEEM

protein

AELQ

EKLS

ELQ

A-I

232
apolipo-
988
EPLR-
1457
SPLG-
1919
LYRQKVEPLR
2367
AELQEGARQKLH
2827
EEMRDRARAH

protein

AELQ

EEMR

ELQEKLSPLG

A-I

233
apolipo-
989
FWQQ-
1458
ATVY-
1920
GSQARHFWQQ
2368
DEPPQSPWDRVK
2828
VDVLKDSGRD

protein

DEPP

VDVL

DLATVY

A-I

234
alpha-1-
990
LLSA-

N/A
1921
TAVKITLLSA

N/A
2829
LVETRTIVRF

antichy-

LVET

(end

NRPFLMIIVP

motrypsin

of

TDTQNIFFMS

pro-

KVTNPKQA

tein)

235
alpha-1-
991
ITLL-

N/A
1922
AATAVKITLL

N/A
2830
SALVETRTIV

antichy-

SALV

(end

RFNRPFLMII

motrypsin

of

VPTDTQNIFF

pro-

MSKVTNPKQA

tein)

236
alpha-1-
992
LSAL-

N/A
1923
AVKITLLSAL

N/A
2831
VETRTIVRFN

antichy-

VETR

(end

RPFLMIIVPT

motrypsin

of

DTQNIFFMSK

pro-

VTNPKQA

tein)

237
alpha-1-
993
ALVE-

N/A
1924
KITLLSALVE

N/A
2832
TRTIVRFNRP

antichy-

TRTI

(end

FLMIIVPTDT

motrypsin

of

QNIFFMSKVT

pro-

NPKQA

tein)

238
alpha-1-
994
VPTD-

N/A
1925
PFLMIIVPTD

N/A
2833
TQNIFFMSKV

antichy-

TQNI

(end

TNPKQA

motrypsin

of

pro-

tein)

239
alpha-1-
995
TLLS-

N/A
1926
ATAVKITLLS

N/A
2834
ALVETRTIVR

antichy-

ALVE

(end

FNRPFLMIIV

motrypsin

of

PTDTQNIFFM

pro-

SKVTNPKQA

tein)

240
alpha-1-
996
LCHP-
1459
ENLT-
1927
GFCPAVLCHP
2369
NSPLDEENLT
2835
QENQDRGTHV

antichy-

NSPL

QENQ

motrypsin

241
alpha-1-
997
PFLM-

N/A
1928
IVRFNRPFLM

N/A
2836
IIVPTDTQNI

antichy-

IIVP

(end

FFMSKVTNPK

motrypsin

of

QA

pro-

tein)

242
alpha-1-
998
AVLC-
1460
LGLA-
1929
AAGFCPAVLC
2370
HPNSPLDEENLT
2837
SANVDFAFSL

antichy-

HPNS

SANV

QENQDRGTHVDL

motrypsin

GLA

243
glucagon
999
GSWQ-
1461
MNED-
1930
FVMLVQGSWQ
2371
RSLQDTEEKSRS
2838
KRHSQGTFTS

RSLQ

KRHS

FSASQADPLSDP

DQMNED

244
glucagon
1000
GSWQ-
1462
QMNE-
1931
FVMLVQGSWQ
2372
RSLQDTEEKSRS
2839
DKRHSQGTFT

RSLQ

DKRH

FSASQADPLSDP

DQMNE

245
glucagon
1001
WQRS-
1463
MNED-
1932
MLVQGSWQRS
2373
LQDTEEKSRSFS
2840
KRHSQGTFTS

LQDT

KRHS

ASQADPLSDPDQ

MNED

246
glucagon
1002
SWQR-
1464
MNED-
1933
VMLVQGSWQR
2374
SLQDTEEKSRSF
2841
KRHSQGTFTS

SLQD

KRHS

SASQADPLSDPD

QMNED

247
glucagon
1003
EDKR-
1465
RAQD-
1934
DPDQMNEDKR
2375
HSQGTFTSDYSK
2842
FVQWLMNTKR

HSQG

FVQW

YLDSRRAQD

248
glucagon
1004
WQRS-
1466
QMNE-
1935
MLVQGSWQRS
2376
LQDTEEKSRSFS
2843
DKRHSQGTFT

LQDT

DKRH

ASQADPLSDPDQ

MNE

249
glucagon
1005
EDKR-
1467
LMNT-
1936
DPDQMNEDKR
2377
HSQGTFTSDYSK
2844
KRNRNNIAKR

HSQG

KRNR

YLDSRRAQDFVQ

WLMNT

250
glucagon
1006
QRSL-
1468
MNED-
1937
LVQGSWQRSL
2378
QDTEEKSRSFSA
2845
KRHSQGTFTS

QDTE

KRHS

SQADPLSDPDQM

NED

251
glucagon
1007
SWQR-
1469
QMNE-
1938
VMLVQGSWQR
2379
SLQDTEEKSRSF
2846
DKRHSQGTFT

SLQD

DKRH

SASQADPLSDPD

QMNE

252
glucagon
1008
RGRR-
1470
KITD-
1939
AWLVKGRGRR
2380
DFPEEVAIVEEL

N/A

DFPE

RK

GRRHADGSFSDE

MNTILDNLAARD

FINWLIQTKITD

253
glucagon
1009
IAKR-
1471
KGRG-
1940
KRNRNNIAKR
2381
HDEFERHAEGTF
2847
RRDFPEEVAI

HDEF

RRDF

TSDVSSYLEGQA

AKEFIAWLVKGR

G

254
hepcidin
1010
LTSG-
1472
PMFQ-
1941
LLLLASLTSG
2382
SVFPQQTGQLAE
2848
RRRRRDTHFP

SVFP

RRRR

LQPQDRAGARAS

WMPMFQ

255
hepcidin
1011
LTSG-
1473
WMPM-
1942
LLLLASLTSG
2383
SVFPQQTGQLAE
2849
FQRRRRRDTH

SVFP

FQRR

LQPQDRAGARAS

WMPM

256
hepcidin
1012
LTSG-
1474
SWMP-
1943
LLLLASLTSG
2384
SVFPQQTGQLAE
2850
MFQRRRRRDT

SVFP

MFQR

LQPQDRAGARAS

WMP

257
hepcidin
1013
LTSG-
1475
GARA-
1944
LLLLASLTSG
2385
SVFPQQTGQLAE
2851
SWMPMFQRRR

SVFP

SWMP

LQPQDRAGARA

258
hepcidin
1014
LTSG-
1476
ARAS-
1945
LLLLASLTSG
2386
SVFPQQTGQLAE
2852
WMPMFQRRRR

SVFP

WMPM

LQPQDRAGARAS

259
hepcidin
1015
LTSG-
1477
RAGA-
1946
LLLLASLTSG
2387
SVFPQQTGQLAE
2853
RASWMPMFQR

SVFP

RASW

LQPQDRAGA

260
hepcidin
1016
SVFP-
1478
ARAS-
1947
ASLTSGSVFP
2388
QQTGQLAELQPQ
2854
WMPMFQRRRR

QQTG

WMPM

DRAGARAS

261
hepcidin
1017
TSGS-
1479
ARAS-
1948
LLLASLTSGS
2389
VFPQQTGQLAEL
2855
WMPMFQRRRR

VFPQ

WMPM

QPQDRAGARAS

262
hepcidin
1018
SVFP-
1480
RAGA-
1949
ASLTSGSVFP
2390
QQTGQLAELQPQ
2856
RASWMPMFQR

QQTG

RASW

DRAGA

263
hepcidin
1019
LTSG-
1481
QLAE-
1950
LLLLASLTSG
2391
SVFPQQTGQLAE
2857
LQPQDRAGAR

SVFP

LQPQ

264
hepcidin
1020
LTSG-
1482
AGAR-
1951
LLLLASLTSG
2392
SVFPQQTGQLAE
2858
ASWMPMFQRR

SVFP

ASWM

LQPQDRAGAR

265
hepcidin
1021
LTSG-
1483
ELQP-
1952
LLLLASLTSG
2393
SVFPQQTGQLAE
2859
QDRAGARASW

SVFP

QDRA

LQP

266
serum
1022
NIQR-

N/A
1953
GPNARENIQR

N/A
2860
LTGRGAEDSL

amyloid

LTGR

(end

ADQAANKWGR

A-2

of

SGRDPNHFRP

protein

pro-

AGLPEKY

tein)

267
serum
1023
IQRL-

N/A
1954
PNARENIQRL

N/A
2861
TGRGAEDSLA

amyloid

TGRG

(end

DQAANKWGRS

A-2

of

GRDPNHFRPA

protein

pro-

GLPEKY

tein)

268
serum
1024
RSGR-

N/A
1955
AANKWGRSGR

N/A
2862
DPNHFRPAGL

amyloid

DPNH

(end

PEKY

A-2

of

protein

pro-

tein)

269
serum
1025
QRLT-

N/A
1956
NARENIQRLT

N/A
2863
GRGAEDSLAD

amyloid

GRGA

(end

QAANKWGRSG

A-2

of

RDPNHFRPAG

protein

pro-

LPEKY

tein)

270
serum
1026
IQRL-
1484
DPNH-
1957
PNARENIQRL
2394
TGRGAEDSLADQ
2864
FRPAGLPEKY

amyloid

TGRG

FRPA

AANKWGRSGRDP

A-2

NH

protein

271
serum
1027
NIQR-
1485
DPNH-
1958
GPNARENIQR
2395
LTGRGAEDSLAD
2865
FRPAGLPEKY

amyloid

LTGR

FRPA

QAANKWGRSGRD

A-2

PNH

protein

272
serum
1028
GAED-

N/A
1959
QRLTGRGAED

N/A
2866
SLADQAANKW

amyloid

SLAD

(end

GRSGRDPNHF

A-2

of

RPAGLPEKY

protein

pro-

tein)

273
serum
1029
RLTG-

N/A
1960
ARENIQRLTG

N/A
2867
RGAEDSLADQ

amyloid

RGAE

(end

AANKWGRSGR

A-2

of

DPNHFRPAGL

protein

pro-

PEKY

tein)

274
serum
1030
TGRG-
1486
DPNH-
1961
ENIQRLTGRG
2396
AEDSLADQAANK
2868
FRPAGLPEKY

amyloid

AEDS

FRPA

WGRSGRDPNH

A-2

protein

275
serum
1031
AAKR-
1487
AEVI-
1962
ARGNYDAAKR
2397
GPGGAWAAEVI
2869
SNARENIQRL

amyloid

GPGG

SNAR

A-2

protein

276
serum
1032
GAWA-

N/A
1963
AKRGPGGAWA

N/A
2870
AEVISNAREN

amyloid

AEVI

(end

IQRLTGRGAE

A-2

of

DSLADQAANK

protein

pro-

WGRSGRDPNH

tein)

FRPAGLPEKY

277
serum
1033
DSLA-

N/A
1964
TGRGAEDSLA

N/A
2871
DQAANKWGRS

amyloid

DQAA

(end

GRDPNHFRPA

A-2

of

GLPEKY

protein

pro-

tein)

278
serum
1034
AWAA-

N/A
1965
KRGPGGAWAA

N/A
2872
EVISNARENI

amyloid

EVIS

(end

QRLTGRGAED

A-2

of

SLADQAANKW

protein

pro-

GRSGRDPNHF

tein)

RPAGLPEKY

279
thymosin
1035
M-
1488
KSKL-

N/A
2398
SDKPDMAEIEKF
2873
KKTETQEKNP

beta-4

SDKP

KKTE

DKSKL

280
thymosin
1036
M-
1489
KKTE-

N/A
2399
SDKPDMAEIEKF
2874
TQEKNPLPSK

beta-4

SDKP

TQEK

DKSKLKKTE

281
thymosin
1037
KTET-

N/A
1966
DKSKLKKTET

N/A
2875
QEKNPLPSKE

beta-4

QEKN

(end

TIEQEKQAGE

of

S

pro-

tein)

282
thymosin
1038
M-

N/A

N/A

N/A
2876
SDKPDMAEIE

beta-4

SDKP

(end

KFDKSKLKKT

of

ETQEKNPLPS

pro-

KETIEQEKQA

tein)

GES

283
thymosin
1039
M-
1490
ETQE-

N/A
2400
SDKPDMAEIEKF
2877
KNPLPSKETI

beta-4

SDKP

KNPL

DKSKLKKTETQE

284
thymosin
1040
KKTE-

N/A
1967
FDKSKLKKTE

N/A
2878
TQEKNPLPSK

beta-4

TQEK

(end

ETIEQEKQAG

of

ES

pro-

tein)

285
thymosin
1041
ETQE-

N/A
1968
SKLKKTETQE

N/A
2879
KNPLPSKETI

beta-4

KNPL

(end

EQEKQAGES

of

pro-

tein)

286
thymosin
1042
KLKK-

N/A
1969
EKFDKSKLKK

N/A
2880
TETQEKNPLP

beta-4

TETQ

(end

SKETIEQEKQ

of

AGES

pro-

tein)

287
thymosin
1043
TETQ-

N/A
1970
KSKLKKTETQ

N/A
2881
EKNPLPSKET

beta-4

EKNP

(end

IEQEKQAGES

of

pro-

tein)

288
thymosin
1044
TQEK-

N/A
1971
KLKKTETQEK

N/A
2882
NPLPSKETIE

beta-4

NPLP

(end

QEKQAGES

of

pro-

tein)

289
hapto-
1045
PVQR-
1491
MVSH-
1972
PKNPANPVQR
2401
ILGGHLDAKGSF
2883
HNLTTGATLI

globin

ILGG

HNLT

PWQAKMVSH

290
hapto-
1046
PVQR-
1492
VSHH-
1973
PKNPANPVQR
2402
ILGGHLDAKGSF
2884
NLTTGATLIN

globin

ILGG

NLTT

PWQAKMVSHH

291
hapto-
1047
GVYV-

N/A
1974
CAVAEYGVYV

N/A
2885
KVTSIQDWVQ

globin

KVTS

(end

KTIAEN

of

pro-

tein)

292
hapto-
1048
PVQR-
1493
AKMV-
1975
PKNPANPVQR
2403
ILGGHLDAKGSF
2886
SHHNLTTGAT

globin

ILGG

SHHN

PWQAKMV

293
hapto-
1049
PVQR-
1494
FPWQ-
1976
PKNPANPVQR
2404
ILGGHLDAKGSF
2887
AKMVSHHNLT

globin

ILGG

AKMV

PWQ

294
hapto-
1050
SALG-
1495
LWGQ-
1977
MSALG
2405
AVIALLLWGQ
2888
LFAVDSGNDV

globin

AVIA

LFAV

295
hapto-
1051
MSAL-
1496
LWGQ-
1978
MSAL
2406
GAVIALLLWGQ
2889
LFAVDSGNDV

globin

GAVI

LFAV

296
hemoglobin
1052
HCLL-

N/A
1979
NFKLLSHCLL

N/A
2890
VTLAAHLPAE

subunit

VTLA

(end

FTPAVHASLD

alpha

of

KFLASVSTVL

pro-

TSKYR

tein)

297
hemoglobin
1053
LLVT-

N/A
1980
KLLSHCLLVT

N/A
2891
LAAHLPAEFT

subunit

LAAH

(end

PAVHASLDKF

alpha

of

LASVSTVLTS

pro-

KYR

tein)

298
hemoglobin
1054
M-
1497
LERM-

N/A
2407
VLSPADKTNVKA
2892
FLSFPTTKTY

subunit

VLSP

FLSF

AWGKVGAHAGEY

alpha

GAEALERM

299
hemoglobin
1055
M-
1498
ERMF-

N/A
2408
VLSPADKTNVKA
2893
LSFPTTKTYF

subunit

VLSP

LSFP

AWGKVGAHAGEY

alpha

GAEALERMF

300
hemoglobin
1056
ASLD-

N/A
1981
FTPAVHASLD

N/A
2894
KFLASVSTVL

subunit

KFLA

(end

TSKYR

alpha

of

pro-

tein)

301
hemoglobin
1057
LVTL-

N/A
1982
LLSHCLLVTL

N/A
2895
AAHLPAEFTP

subunit

AAHL

(end

AVHASLDKFL

alpha

of

ASVSTVLTSK

pro-

YR

tein)

302
hemoglobin
1058
M-
1499
FLSF-

N/A
2409
VLSPADKTNVKA
2896
PTTKTYFPHF

subunit

VLSP

PTTK

AWGKVGAHAGEY

alpha

GAEALERMFLSF

303
hemoglobin
1059
M-
1500
GKVG-

N/A
2410
VLSPADKTNVKA
2897
AHAGEYGAEA

subunit

VLSP

AHAG

AWGKVG

alpha

304
caveolae-
1060
QKVR-

N/A
1983
VALEQAQKVR

N/A
2898
YEGSYALTSE

associated

YEGS

(end

EAERSDGDPV

protein 2

of

QPAVLQVHQT

pro-

S

tein)

305
caveolae-
1061
M-
1501
SDMR-

N/A
2411
GEDAAQAEKFQH
2899
QEKPSSPSPM

associated

GEDA

QEKP

PGSDMR

protein 2

306
caveolae-
1062
EGSY-

N/A
1984
AQKVRYEGSY

N/A
2900
ALTSEEAERS

associated

ALTS

(end

DGDPVQPAVL

protein 2

of

QVHQTS

pro-

tein)

307
caveolae-
1063
M-
1502
QHPG-

N/A
2412
GEDAAQAEKFQH
2901
SDMRQEKPSS

associated

GEDA

SDMR

PG

protein 2

308
caveolae-
1064
M-
1503
GSDM-

N/A
2413
GEDAAQAEKFQH
2902
RQEKPSSPSP

associated

GEDA

RQEK

PGSDM

protein 2

309
caveolae-
1065
RYEG-

N/A
1985
EQAQKVRYEG

N/A
2903
SYALTSEEAE

associated

SYAL

(end

RSDGDPVQPA

protein 2

of

VLQVHQTS

pro-

tein)

310
alpha-2-
1066
PPLG-
1504
HVLL-
1986
PDAPPSPPLG
2414
APGLPPAGSPPD
2904
AAPPGHQLHR

HS-glyco-

APGL

AAPP

SHVLL

protein

311
alpha-2-
1067
RKTR-

N/A
1987
GEVSHPRKTR

N/A
2905
TVVQPSVGAA

HS-glyco-

TVVQ

(end

AGPVVPPCPG

protein

of

RIRHFKV

pro-

tein)

312
alpha-2-
1068
PPLG-
1505
VLLA-
1988
PDAPPSPPLG
2415
APGLPPAGSPPD
2906
APPGHQLHRA

HS-glyco-

APGL

APPG

SHVLLA

protein

313
alpha-2-
1069
HVLL-
1506
PRKT-
1989
GSPPDSHVLL
2416
AAPPGHQLHRAH
2907
RTVVQPSVGA

HS-glyco-

AAPP

RTVV

YDLRHTFMGVVS

protein

LGSPSGEVSHPR

KT

314
alpha-2-
1070
PPLG-
1507
PRKT-
1990
PDAPPSPPLG
2417
APGLPPAGSPPD
2908
RTVVQPSVGA

HS-glyco-

APGL

RTVV

SHVLLAAPPGHQ

protein

LHRAHYDLRHTF

MGVVSLGSPSGE

VSHPRKT

315
alpha-2-
1071
VLLA-
1508
PRKT-
1991
SPPDSHVLLA
2418
APPGHQLHRAHY
2909
RTVVQPSVGA

HS-glyco-

APPG

RTVV

DLRHTFMGVVSL

protein

GSPSGEVSHPRK

T

316
alpha-2-
1072
PPLG-
1509
SHVL-
1992
PDAPPSPPLG
2419
APGLPPAGSPPD
2910
LAAPPGHQLH

HS-glyco-

APGL

LAAP

SHVL

protein

317
alpha-2-
1073
HVLL-
1510
SHPR-
1993
GSPPDSHVLL
2420
AAPPGHQLHRAH
2911
KTRTVVQPSV

HS-glyco-

AAPP

KTRT

YDLRHTFMGVVS

protein

LGSPSGEVSHPR

318
chromo-
1074
QQKK-
1511
KDVM-
1994
AKERAHQQKK
2421
HSGFEDELSEVL
2912
EKREDSKEAE

granin-A

HSGF

EKRE

ENQSSQAELKEA

VEEPSSKDVM

319
chromo-
1075
QQKK-
1512
DVME-
1995
AKERAHQQKK
2422
HSGFEDELSEVL
2913
KREDSKEAEK

granin-A

HSGF

KRED

ENQSSQAELKEA

VEEPSSKDVME

320
chromo-
1076
LQVR-
1513
ALRR-
1996
LEAGLPLQVR
2423
GYPEEKKEEEGS

N/A

granin-A

GYPE

G

ANRRPEDQELES

LSAIEAELEKVA

HQLQALRR

321
chromo-
1077
QQKK-
1514
KDVM-
1997
AKERAHQQKK
2424
HSGFEDELSEVL
2914
EKREDSKEAE

granin-A

HSGF

EKRE

ENQSSQAELKEA

VEEPSSKDVM

322
chromo-
1078
AEKR-
1515
GPQL-
1998
LAKELTAEKR
2425
LEGQEEEEDNRD
2915
RRGWRPSSRE

granin-A

LEGQ

RRGW

SSMKLSFRARAY

GFRGPGPQL

323
chromo-
1079
AEKR-
1516
KLSF-
1999
LAKELTAEKR
2426
LEGQEEEEDNRD
2916
RARAYGFRGP

granin-A

LEGQ

RARA

SSMKLSF

324
complement
1080
LPSR-
1517
SLLR-
2000
LDVSLQLPSR
2427
SSKITHRIHWES
2917
SEETKENEGF

C3

SSKI

SEET

ASLLR

325
complement
1081
LPSR-
1518
ASLL-
2001
LDVSLQLPSR
2428
SSKITHRIHWES
2918
RSEETKENEG

C3

SSKI

RSEE

ASLL

326
complement
1082
SSKI-
1519
ASLL-
2002
LQLPSRSSKI
2429
THRIHWESASLL
2919
RSEETKENEG

C3

THRI

RSEE

327
complement
1083
SRSS-
1520
ASLL-
2003
VSLQLPSRSS
2430
KITHRIHWESAS
2920
RSEETKENEG

C3

KITH

RSEE

LL

328
complement
1084
LALG-
1521
HDAQ-
2004
LLTHLPLALG
2431
SPMYSIITPNIL
2921
GDVPVTVTVH

C3

SPMY

GDVP

RLESEETMVLEA

HDAQ

329
complement
1085
KITH-
1522
ASLL-
2005
LPSRSSKITH
2432
RIHWESASLL
2922
RSEETKENEG

C3

RIHW

RSEE

330
complement
1086
SKIT-
1523
ASLL-
2006
QLPSRSSKIT
2433
HRIHWESASLL
2923
RSEETKENEG

C3

HRIH

RSEE

331
complement
1087
THRI -
1524
ASLL-
2007
SRSSKITHRI
2434
HWESASLL
2924
RSEETKENEG

C3

HWES

RSEE

332
complement
1088
ITHR-
1525
ASLL-
2008
PSRSSKITHR
2435
IHWESASLL
2925
RSEETKENEG

C3

IHWE

RSEE

333
vitronec-
1089
FWGR-
1526
PSLA-
2009
DIFELLEWGR
2436
TSAGTRQPQFIS
2926
KKQRFRHRNR

tin

TSAG

KKQR

RDWHGVPGQVDA

AMAGRIYISGMA

PRPSLA

334
vitronec-
1090
TSAG-
1527
IYIS-
2010
LLFWGRTSAG
2437
TRQPQFISRDWH
2927
GMAPRPSLAK

tin

TRQP

GMAP

GVPGQVDAAMAG

RIYIS

335
vitronec-
1091
LTSD-
1528
KPEG-
2011
QVGGPSLTSD
2438
LQAQSKGNPEQT
2928
IDSRPETLHP

tin

LQAQ

IDSR

PVLKPEEEAPAP

EVGASKPEG

336
vitronec-
1092
FELL-
1529
PSLA-
2012
DSWEDIFELL
2439
FWGRTSAGTRQP
2929
KKQRFRHRNR

tin

FWGR

KKQR

QFISRDWHGVPG

QVDAAMAGRIYI

SGMAPRPSLA

337
vitronec-
1093
TSAG-
1530
APRP-
2013
LLFWGRTSAG
2440
TRQPQFISRDWH
2930
SLAKKQRFRH

tin

TRQP

SLAK

GVPGQVDAAMAG

RIYISGMAPRP

338
vitronec-
1094
FWGR-
1531
PSLA-
2014
DIFELLFWGR
2441
TSAGTRQPQFIS
2931
KKQRFRHRNR

tin

TSAG

KKQR

RDWHGVPGQVDA

AMAGRIYISGMA

PRPSLA

339
vitronec-
1095
FELL-
1532
PRPS-
2015
DSWEDIFELL
2442
FWGRTSAGTRQP
2932
LAKKQRFRHR

tin

FWGR

LAKK

QFISRDWHGVPG

QVDAAMAGRIYI

SGMAPRPS

340
hemopexin
1096
QGHN-
1533
KLLQ-
2016
VDAAFRQGHN
2443
SVFLIKGDKVWV
2933
DEFPGIPSPL

OR

SVFL

DEFP

YPPEKKEKGYPK

epididymis

LLQ

secretory

sperm

binding

protein

341
hemopexin
1097
QGHN-
1534
PPEK-
2017
VDAAFRQGHN
2444
SVFLIKGDKVWV
2934
KEKGYPKLLQ

OR

SVFL

KEKG

YPPEK

epididymis

secretory

sperm

binding

protein

342
hemopexin
1098
QGHN-
1535
EKKE-
2018
VDAAFRQGHN
2445
SVFLIKGDKVWV
2935
KGYPKLLQDE

OR

SVFL

KGYP

YPPEKKE

epididymis

secretory

sperm

binding

protein

343
hemopexin
1099
RWKN-
1536
QGHN-
2019
RELISERWKN
2446
FPSPVDAAFRQG
2936
SVFLIKGDKV

OR

FPSP

SVFL

HN

epididymis

secretory

sperm

binding

protein

344
hemopexin
1100
QGHN-
1537
YPPE-
2020
VDAAFRQGHN
2447
SVFLIKGDKVWV
2937
KKEKGYPKLL

OR

SVFL

KKEK

YPPE

epididymis

secretory

sperm

binding

protein

345
hemopexin
1101
DKVW-
1538
KLLQ-
2021
VFLIKGDKVW
2448
VYPPEKKEKGYP
2938
DEFPGIPSPL

OR

VYPP

DEFP

KLLQ

epididymis

secretory

sperm

binding

protein

346
zyxin
1102
QTQF-
1539
QSQT-
2022
PAPAQSQTQF
2449
HVQPQPQPKPQV
2939
QPVSLANTQP

HVQP

QPVS

QLHVQSQT

347
zyxin
1103
QPVS-
1540
PVAS-
2023
HVQSQTQPVS
2450
LANTQPRGPPAS
2940
KFSPGAPGGS

LANT

KFSP

SPAPAPKFSPVT

PKFTPVAS

348
zyxin
1104
QTQF-
1541
LHVQ-
2024
PAPAQSQTQF
2451
HVQPQPQPKPQV
2941
SQTQPVSLAN

HVQP

SQTQ

QLHVQ

349
zyxin
1105
M-
1542
APAF-

N/A
2452
AAPRPSPAISVS
2942
YAPQKKFGPV

AAPR

YAPQ

VSAPAF

350
zyxin
1106
QTQF-
1543
QPVS-
2025
PAPAQSQTQF
2453
HVQPQPQPKPQV
2943
LANTQPRGPP

HVQP

LANT

QLHVQSQTQPVS

351
zyxin
1107
PKPK-
1544
QRAQ-
2026
FGPVVAPKPK
2454
VNPFRPGDSEPP
2944
MGRVGEIPPP

VNPF

MGRV

PAPGAQRAQ

352
apolipo-
1108
SARA-
1545
TAKD-
2027
LLALLASARA
2455
SEAEDASLLSFM
2945
ALSSVQESQV

protein

SEAE

ALSS

QGYMKHATKTAK

C-III

D

353
apolipo-
1109
SARA-
1546
TKTA-
2028
LLALLASARA
2456
SEAEDASLLSFM
2946
KDALSSVQES

protein

SEAE

KDAL

QGYMKHATKTA

C-III

354
apolipo-
1110
FSEF-

N/A
2029
STVKDKFSEF

N/A
2947
WDLDPEVRPT

protein

WDLD

(end

SAVAA

C-III

of

pro-

tein)

355
apolipo-
1111
SARA-
1547
AQQA-
2030
LLALLASARA
2457
SEAEDASLLSFM
2948
RGWVTDGFSS

protein

SEAE

RGWV

QGYMKHATKTAK

C-III

DALSSVQESQVA

QQA

356
apolipo-
1112
WDLD-

N/A
2031
DKFSEFWDLD

N/A
2949
PEVRPTSAVA

protein

PEVR

(end

A

of

pro-

tein)

357
secreto-
1113
KRFP-
1548
EHIA-
2032
KLAPVSKRFP
2458
VGPPKNDDTPNR
2950
KRAMENM

granin-2

VGPP

KRAM

QYWDEDLLMKVL

EYLNQEKAEKGR

EHIA

358
secreto-
1114
VSKR-
1549
EHIA-
2033
TDKLAPVSKR
2459
FPVGPPKNDDTP
2951
KRAMENM

granin-2

FPVG

KRAM

NRQYWDEDLLMK

VLEYLNQEKAEK

GREHIA

359
secreto-
1115
KRFP-
1550
GREH-
2034
KLAPVSKRFP
2460
VGPPKNDDTPNR
2952
IAKRAMENM

granin-2

VGPP

IAKR

QYWDEDLLMKVL

EYLNQEKAEKGR

EH

360
secreto-
1116
KRVP-
1551
APVS-
2035
INSNQVKRVP
2461
GQGSSEDDLQEE
2953
KRFPVGPPKN

granin-2

GQGS

KRFP

EQIEQAIKEHLN

QGSSQETDKLAP

VS

361
secreto-
1117
QVKR-
1552
APVS-
2036
EIINSNQVKR
2462
VPGQGSSEDDLQ
2954
KRFPVGPPKN

granin-2

VPGQ

KRFP

EEEQIEQAIKEH

LNQGSSQETDKL

APVS

362
secreto-
1118
PKTP-
1553
DGLS-
2037
LSKSGYPKTP
2463
GRAGTEALPDGL
2955
VEDILNLLGM

granin-2

GRAG

VEDI

S

363
secreto-
1119
ERKL-
1554
PMYE-
2038
ETQQWPERKL
2464
KHMQFPPMYE
2956
ENSRDNPFKR

granin-2

KHMQ

ENSR

364
angioten-
1120
QQLN-

N/A
2039
EPTESTQQLN

N/A
2957
KPEVLEVTLN

sinogen

KPEV

(end

RPFLFAVYDQ

of

SATALHFLGR

pro-

VANPLSTA

tein)

365
angioten-
1121
EPTE-

N/A
2040
LEADEREPTE

N/A
2958
STQQLNKPEV

sinogen

STQQ

(end

LEVTLNRPFL

of

FAVYDQSATA

pro-

LHFLGRVAN

tein)

PLSTA

366
angioten-
1122
TEST-

N/A
2041
ADEREPTEST

N/A
2959
QQLNKPEVLE

sinogen

QQLN

(end

VTLNRPFLFA

of

VYDQSATALH

pro-

FLGRVANPLS

tein)

TA

367
c-reactive
1123
HAFG-
1555
VSLK-
2042
VLTSLSHAFG
2465
QTDMSRKAFVFP
2960
APLTKPLKAF

protein

QTDM

APLT

KESDTSYVSLK

368
c-reactive
1124
HAFG-
1556
APLT-
2043
VLTSLSHAFG
2466
QTDMSRKAFVFP
2961
KPLKAFTVCL

protein

QTDM

KPLK

KESDTSYVSLKA

PLT

369
c-reactive
1125
HAFG-
1557
SYVS-
2044
VLTSLSHAFG
2467
QTDMSRKAFVFP
2962
LKAPLTKPLK

protein

QTDM

LKAP

KESDTSYVS

370
serum
1126
VFRR-
1558
QYLQ-
2045
SAYSRGVFRR
2468
DAHKSEVAHRFK
2963
QCPFEDHVKL

albumin

DAHK

QCPF

DLGEENFKALVL

IAFAQYLQ

371
serum
1127
VFRR-
1559
ENFK-
2046
SAYSRGVFRR
2469
DAHKSEVAHRFK
2964
ALVLIAFAQY

albumin

DAHK

ALVL

DLGEENFK

372
serum
1128
VFRR-
1560
ALVL-
2047
SAYSRGVFRR
2470
DAHKSEVAHRFK
2965
IAFAQYLQQC

albumin

DAHK

IAFA

DLGEENFKALVL

373
trans-
1129
GLQM-

N/A
2048
EGKNVIGLQM

N/A
2966
GTNRGASQAG

gelin-2

GTNR

(end

MTGYGMPRQI

of

L

pro-

tein)

374
trans-
1130
IGLQ-

N/A
2049
QEGKNVIGLQ

N/A
2967
MGTNRGASQA

gelin-2

MGTN

(end

GMTGYGMPRQ

of

IL

pro-

tein)

375
trans-
1131
QAGM-

N/A
2050
TNRGASQAGM

N/A
2968
TGYGMPRQIL

gelin-2

TGYG

(end

of

pro-

tein)

376
pancreatic
1132
YGKR-
1561
AVPR-
2051
MLTRPRYGKR
2471
HKEDTLAFSEWG
2969
ELSPLDL

prohormone

HKED

ELSP

SPHAAVPR

377
pancreatic
1133
QGAP-
1562
LRRY-
2052
QPLLGAQGAP
2472
LEPVYPGDNATP
2970
INMLTRPRYG

prohormone

LEPV

INML

EQMAQYAADLRR

Y

378
pancreatic
1134
PLLG-
1563
RPRY-
2053
VALLLQPLLG
2473
AQGAPLEPVYPG
2971
GKRHKEDTLA

prohormone

AQGA

GKRH

DNATPEQMAQYA

ADLRRYINMLTR

PRY

379
pancreatic
1135
QGAP-
1564
RPRY-
2054
QPLLGAQGAP
2474
LEPVYPGDNATP
2972
GKRHKEDTLA

prohormone

LEPV

GKRH

EQMAQYAADLRR

YINMLTRPRY

380
neurosec-
1136
AAPP-
1565
VRGA-
2055
LINGLGAAPP
2475
GRPEAQPPPLSS
2973
RNSEPQDEGE

retory

GRPE

RNSE

EHKEPVAGDAVP

protein

GPKDGSAPEVRG

VGF

A

381
neurosec-
1137
AAPP-
1566
APEV-
2056
LINGLGAAPP
2476
GRPEAQPPPLSS
2974
RGARNSEPQD

retory

GRPE

RGAR

EHKEPVAGDAVP

protein

GPKDGSAPEV

VGF

382
neurosec-
1138
GLGA-
1567
VRGA-
2057
CLLLINGLGA
2477
APPGRPEAQPPP
2975
RNSEPQDEGE

retory

APPG

RNSE

LSSEHKEPVAGD

protein

AVPGPKDGSAPE

VGF

VRGA

383
neurosec-
1139
RKKN-
1568
PTHV-
2058
VEEKRKRKKN
2478
APPEPVPPPRAA
2976
RSPQPPPPAP

retory

APPE

RSPQ

PAPTHV

protein

VGF

384
neurosec-
1140
KRKK-
1569
PTHV-
2059
EVEEKRKRKK
2479
NAPPEPVPPPRA
2977
RSPQPPPPAP

retory

NAPP

RSPQ

APAPTHV

protein

VGF

385
neurosec-
1141
GLGA-
1570
APEV-
2060
CLLLINGLGA
2480
APPGRPEAQPPP
2978
RGARNSEPQD

retory

APPG

RGAR

LSSEHKEPVAGD

protein

AVPGPKDGSAPE

VGF

V

386
neurosec-
1142
EEEA-
1571
LTET-
2061
GSQQGPEEEA
2481
AEALLTET
2979
VRSQTHSLPA

retory

AEAL

VRSQ

protein

VGF

387
neurosec-
1143
KRKK-
1572
ELPD-
2062
EVEEKRKRKK
2482
NAPPEPVPPPRA
2980
WNEVLPPWDR

retory

NAPP

WNEV

APAPTHVRSPQP

protein

PPPAPAPARDEL

VGF

PD

388
cerulo-
1144
LLHC-
1573
EDTK-
2063
RTPGIWLLHC
2483
HVTDHIHAGMET

N/A

plasmin

HVTD

SG

TYTVLQNEDTK

389
cerulo-
1145
PAWA-
1574
WDYA-
2064
LFLCSTPAWA
2484
KEKHYYIGIIET
2981
SDHGEKKLIS

plasmin

KEKH

SDHG

TWDYA

390
PDZ and
1146
PFTA-
1575
TNQY-
2065
HNRSAMPFTA
2485
SPASSTTARVIT
2982
NNPAGLYSSE

LIM

SPAS

NNPA

NQY

domain

protein 1

391
PDZ and
1147
LVLQ-
1576
APVT-
2066
KQSTSFLVLQ
2486
EILESEEKGDPN
2983
KVAASIGNAQ

LIM

EILE

KVAA

KPSGFRSVKAPV

domain

T

protein 1

392
PDZ and
1148
TNQY-
1577
SGVE-
2067
TTARVITNQY
2487
NNPAGLYSSENI
2984
ANSRPLDHAQ

LIM

NNPA

ANSR

SNFNNALESKTA

domain

ASGVE

protein 1

393
tubulin
1149
FPLA-
1578
YHEQ-
2068
PYPRIHFPLA
2488
TYAPVISAEKAY
2985
LSVAEITNAC

alpha-4A

TYAP

LSVA

HEQ

chain

394
tubulin
1150
FPLA-
1579
HEQL-
2069
PYPRIHFPLA
2489
TYAPVISAEKAY
2986
SVAEITNACE

alpha-4A

TYAP

SVAE

HEQL

chain

395
tubulin
1151
PVIS-
1580
EITN-
2070
PLATYAPVIS
2490
AEKAYHEQLSVA
2987
ACFEPANQMV

alpha-4A

AEKA

ACFE

EITN

chain

396
tubulin
1152
M-
1581
LGRL-

N/A
2491
VHLTPEEKSAVT
2988
LVVYPWTQRF

alpha-4A

VHLT

LVVY

ALWGKVNVDEVG

chain

GEALGRL

397
tubulin
1153
M-
1582
TQRF-

N/A
2492
VHLTPEEKSAVT
2989
FESFGDLSTP

alpha-4A

VHLT

FESF

ALWGKVNVDEVG

chain

GEALGRLLVVYP

WTQRF

398
tubulin
1154
LGRL-
1583
TQRF-
2071
EVGGEALGRL
2493
LVVYPWTQRF
2990
FESFGDLSTP

alpha-4A

LVVY

FESF

chain

399
multi-
1155
SLNT-
1584
RAPR-
2072
SNEQATSLNT
2494
VGGTGGIGGVGG
2991
ETYLSRGDSS

merin-1

VGGT

ETYL

TGGVGNRAPR

400
multi-
1156
LNTV-
1585
RAPR-
2073
NEQATSLNTV
2495
GGTGGIGGVGGT
2992
ETYLSRGDSS

merin-1

GGTG

ETYL

GGVGNRAPR

401
multi-
1157
SLNT-
1586
NRAP-
2074
SNEQATSLNT
2496
VGGTGGIGGVGG
2993
RETYLSRGDS

merin-1

VGGT

RETY

TGGVGNRAP

402
multi-
1158
TSLN-
1587
RAPR-
2075
KSNEQATSLN
2497
TVGGTGGIGGVG
2994
ETYLSRGDSS

merin-1

TVGG

ETYL

GTGGVGNRAPR

403
inter-
1159
EVSG-
1588
RRYQ-
2076
ICFFLSEVSG
2498
FEIPINGLSEFV
2995
RSLPGESEEM

alpha-

FEIP

RSLP

DYEDLVELAPGK

trypsin

FQLVAENRRYQ

inhibitor

heavy

chain H2

404
inter-
1160
RITR-
1589
RMLA-
2077
TAAAKRRITR
2499
SILQMSLDHHIV
2996
DAPPQDPSCC

alpha-

SILQ

DAPP

TPLTSLVIENEA

trypsin

GDERMLA

inhibitor

heavy

chain H2

405
clusterin
1161
VTTV-
1590
PVEV-
2078
DQYYLRVTTV
2500
ASHTSDSDVPSG
2997
SRKNPKFMET

ASHT

SRKN

VTEVVVKLFDSD

PITVTVPVEV

406
clusterin
1162
KNPK-

N/A
2079
PVEVSRKNPK

N/A
2998
FMETVAEKAL

FMET

(end

QEYRKKHREE

of

pro-

tein)

407
apolipo-
1163
PAQG-

N/A
2080
VLEGPAPAQG

N/A
2999
TPDVSSALDK

protein

TPDV

(end

LKEFGNTLED

C-I

of

KARELISRIK

pro-

QSELSAKMRE

tein)

WFSETFQKVK

EKLKIDS

408
apolipo-
1164
QGTP-

N/A
2081
EGPAPAQGTP

N/A
3000
DVSSALDKLK

protein

DVSS

(end

EFGNTLEDKA

C-I

of

RELISRIKQS

pro-

ELSAKMREWF

tein)

SETFQKVKEK

LKIDS

409
fibrinogen
1165
QLIK-
1591
ATLK-
2082
KTSEVKQLIK
2501
AIQLTYNPDESS
3001
SRKMLEEIMK

gamma

AIQL

SRKM

KPNMIDAATLK

chain

410
fibrinogen
1166
EGFG-
1592
HLIS-
2083
NWIQYKEGFG
2502
HLSPTGTTEFWL
3002
TQSAIPYALR

gamma

HLSP

TQSA

GNEKIHLIS

chain

411
fibrinogen
1167
NRLT-

N/A
2084
MKIIPFNRLT

N/A
3003
IGEGQQHHLG

gamma

IGEG

(end

GAKQAGDV

chain

of

pro-

tein)

412
N-acetyl-
1168
RSRR-

N/A
2085
ARSVSKRSRR

N/A
3004
EPPPRTLPAT

muramoyl-

EPPP

(end

DLQ

L-alanine

of

amidase

pro-

tein)

413
N-acetyl-
1169
SRRE-

N/A
2086
RSVSKRSRRE

N/A
3005
PPPRTLPATD

muramoyl-

PPPR

(end

LQ

L-alanine

of

amidase

pro-

tein)

414
immuno-
1170
GSVT-
1593
PSVS-
2087
LLSHCTGSVT
2503
SYVLTQPPSVS
3006
VAPGQTARIT

globulin

SYVL

VAPG

lambda

variable

3-21

415
immuno-
1171
SVTS-
1594
PSVS-
2088
LSHCTGSVTS
2504
YVLTQPPSVS
3007
VAPGQTARIT

globulin

YVLT

VAPG

lambda

variable

3-21

416
histone
1172
M-
1595
KKKA-

N/A
2505
SETAPAAPAAPA
3008
RKSAGAAKRK

H1.4

SETA

RKSA

PAEKTPVKKKA

417
histone
1173
M-
1596
KTPV-

N/A
2506
SETAPAAPAAPA
3009
KKKARKSAGA

H1.4

SETA

KKKA

PAEKTPV

418
histone
1174
M-
1597
TPVK-

N/A
2507
SETAPAAPAAPA
3010
KKARKSAGAA

H1.4

SETA

KKAR

PAEKTPVK

419
adhesion
1175
CNHF-
1598
RSAS-
2089
SETVCLCNHF
2508
THFGVLMDLPRS
3011
QLDARNTKVL

G-protein

THFG

QLDA

AS

coupled

receptor

G6

420
adhesion
1176
CNHF-
1599
QLDA-
2090
SETVCLCNHF
2509
THFGVLMDLPRS
3012
RNTKVLTFIS

G-protein

THFG

RNTK

ASQLDA

coupled

receptor

G6

421
immuno-
1177
SEAS-
1600
PSVS-
2091
LTLCTGSEAS
2510
YELTQPPSVS
3013
VSPGQTARIT

globulin

YELT

VSPG

lambda

variable

3-25

422
immuno-
1178
GSVA-
1601
PSVS-
2092
VLAYCTGSVA
2511
SYELTQPPSVS
3014
VSPGQTASIT

globulin

SYEL

VSPG

lambda

variable

3-25

423
immuno-
1179
SWAQ-
1602
PSVS-
2093
LIHCTGSWAQ
2512
SVLTQPPSVS
3015
AAPGQKVTIS

globulin

SVLT

AAPG

lambda

variable

1-51

424
immuno-
1180
QSVL-
1603
PSVS-
2094
CTGSWAQSVL
2513
TQPPSVS
3016
AAPGQKVTIS

globulin

TQPP

AAPG

lambda

variable

1-51

425
immuno-
1181
GSWA-
1604
PSVS-
2095
LITHCAGSWA
2514
QSVLTQPPSVS
3017
EAPRQRVTIS

globulin

QSVL

EAPR

lambda

variable

1-36

426
immuno-
1182
SWAQ-
1605
PSVS-
2096
LITHCAGSWA
2515
SVLTQPPSVS
3018
EAPRQRVTIS

globulin

SVLT

EAPR

Q

lambda

variable

1-36

427
immuno-
1183
SYEL-
1606
PSVS-
2097
CTGSVASYEL
2516
TQPPSVS
3019
VSPGQTASIT

globulin

TQPP

VSPG

lambda

variable

1-36

428
mannan-
1184
GSVA-
1607
GRLA-
2098
LLGLLCGSVA
2517
TPLGPKWPEPVF
3020
SPGFPGEYAN

binding

TPLG

SPGF

GRLA

lectin

serine

protease 2

429
immuno-
1185
DTTG-
1608
GTLS-
2099
LLLWLPDTTG
2518
EIVLTQSPGTLS
3021
LSPGERATLS

globulin

EIVL

LSPG

kappa

variable

3-20

430
immuno-
1186
TTGE-
1609
GTLS-
2100
LLWLPDTTGE
2519
IVLTQSPGTLS
3022
LSPGERATLS

globulin

IVLT

LSPG

kappa

variable

3-20

431
immuno-
1187
GSSG-
1610
LPVT-
2101
LMLWVPGSSG
2520
DVVMTQSPLSLP
3023
LGQPASISCR

globulin

DVVM

LGQP

VT

kappa

variable

2-30

432
immuno-
1188
GSSG-
1611
SPLS-
2102
LMLWVPGSSG
2521
DVVMTQSPLS
3024
LPVTLGQPAS

globulin

DVVM

LPVT

kappa

variable

2-30

433
insulin-
1189
PVGK-
1612
QSTQ-
2103
DNFPRYPVGK
2522
FFQYDTWKQSTQ
3025
RLRRGLPALL

like

FFQY

RLRR

growth

factor II

434
insulin-
1190
PVGK-
1613
TQRL-
2104
DNFPRYPVGK
2523
FFQYDTWKQSTQ
3026
RRGLPALLRA

like

FFQY

RRGL

RL

growth

factor II

435
apolipo-
1191
VNFL-
1614
TQPA-
2105
KAGTELVNFL
2524
SYFVELGTQPA

N/A

protein

SYFV

TQ

A-II

436
apolipo-
1192
IKKA-
1615
TQPA-
2106
EQLTPLIKKA
2525
GTELVNFLSYFV

N/A

protein

GTEL

TQ

ELGTQPA

A-II

437
apolipo-
1193
FQTV-
1616
SPEL-
2107
SLVSQYFQTV
2526
TDYGKDLMEKVK
3027
QAEAKSYFEK

protein

TDYG

QAEA

SPEL

A-II

438
probable
1194
GSSG-
1617
SPVT-
2108
LMLWVPGSSG
2527
DIVMTQTPLSSP
3028
LGQPASISFR

non-

DIVM

LGQP

VT

functional

immuno-

globulin

kappa

variable

2D-24

439
probable
1195
GSSG-
1618
TPLS-
2109
LMLWVPGSSG
2528
DIVMTQTPLS
3029
SPVTLGQPAS

non-

DIVM

SPVT

functional

immuno-

globulin

kappa

variable

2D-24

440
pro-
1196
RTAT-
1619
FNPR-
2110
DRAIEGRTAT
2529
SEYQTFFNPR
3030
TFGSGEADCG

thrombin

SEYQ

TFGS

441
pro-
1197
TATS-
1620
FNPR-
2111
RAIEGRTATS
2530
EYQTFFNPR
3031
TFGSGEADCG

thrombin

EYQT

TFGS

442
pro-
1198
LVHS-
1621
LQRV-
2112
LAALCSLVHS
2531
QHVFLAPQQARS
3032
RRANTFLEEV

thrombin

QHVF

RRAN

LLQRV

443
coagu-
1199
SAEC-
1622
NRPK-
2113
LLGYLLSAEC
2532
TVFLDHENANKI
3033
RYNSGKLEEF

lation

TVFL

RYNS

LNRPK

factor IX

444
coagu-
1200
ECTV-
1623
NRPK-
2114
GYLLSAECTV
2533
FLDHENANKILN
3034
RYNSGKLEEF

lation

FLDH

RYNS

RPK

factor IX

445
apolipo-
1201
GVRA-
1624
KPLG-
2115
ALFLGVGVRA
2534
EEAGARVQQNVP
3035
DWAAGTMDPE

protein L1

EEAG

DWAA

SGTDTGDPQSKP

LG

446
apolipo-
1202
GVRA-
1625
QSKP-
2116
ALFLGVGVRA
2535
EEAGARVQQNVP
3036
LGDWAAGTMD

protein L1

EEAG

LGDW

SGTDTGDPQSKP

447
deleted in
1203
RSKR-

N/A
2117
YRGCVLRSKR

N/A
3037
DVGSYQEKVD

malignant

DVGS

(end

VVLGPIQLQT

brain

of

PPRREEEPR

tumors 1

pro-

protein

tein)

448
desmo-
1204
GELR-
1626
KRRQ-
2118
VVILVHGELR
2536
IETKGQYDEEEM
3038
KREWVKFAKP

glein-3

IETK

KREW

TMQQAKRRQ

449
desmo-
1205
LVHG-
1627
KRRQ-
2119
IFVVVILVHG
2537
ELRIETKGQYDE
3039
KREWVKFAKP

glein-3

ELRI

KREW

EEMTMQQAKRRQ

450
calsyn-
1206
NHMA-
1628
PHPF-
2120
NPMEHANHMA
2538
AQPQFVHPEHRS
3040
AVVPSTATVV

tenin-1

AQPQ

AVVP

FVDLSGHNLANP

HPF

451
calsyn-
1207
HANH-
1629
PHPF-
2121
TANPMEHANH
2539
MAAQPQFVHPEH
3041
AVVPSTATVV

tenin-1

MAAQ

AVVP

RSFVDLSGHNLA

NPHPF

452
immuno-
1208
GAVT-
1630
AGVE-
2122
ISDFYPGAVT
2540
VAWKADSSPVKA
3042
TTTPSKQSNN

globulin

VAWK

TTTP

GVE

lambda

constant 3

453
immuno-
1209
GAVT-
1631
SYLS-
2123
ISDFYPGAVT
2541
VAWKADSSPVKA
3043
LTPEQWKSHK

globulin

VAWK

LTPE

GVETTTPSKQSN

lambda

NKYAASSYLS

constant 3

454
immuno-
1210
SPVK-
1632
SYLS-
2124
AWKADSSPVK
2542
AGVETTTPSKQS
3044
LTPEQWKSHK

globulin

AGVE

LTPE

NNKYAASSYLS

lambda

constant 3

455
complement
1211
KTWG-
1633
FRVG-
2125
FLIFLGKTWG
2543
QEQTYVISAPKI
3045
ASENIVIQVY

C5

QEQT

ASEN

FRVG

456
alpha-2-
1212
NKVD-
1634
LRVT-
2126
VENCLANKVD
2544
LSFSPSQSLPAS
3046
AAPQSVCALR

macro-

LSFS

AAPQ

HAHLRVT

globulin

457
alpha-2-
1213
PTDA-
1635
SLLH-
2127
LLLVLLPTDA
2545
SVSGKPQYMVLV
3047
TETTEKGCVL

macro-

SVSG

TETT

PSLLH

globulin

458
myosin-9
1214
M-
1636
DKNF-

N/A
2546
AQQAADKYLYVD
3048
INNPLAQADW

AQQA

INNP

KNF

459
sodium/
1215
M-
1637
NGGL-

N/A
2547
TGLSMDGGGSPK
3049
IFAGLAFIVG

potassium-

TGLS

IFAG

GDVDPFYYDYET

transport-

VRNGGL

ing

ATPase

subunit

gamma

460
sodium/
1216
M-
1638
DVDP-

N/A
2548
TGLSMDGGGSPK
3050
FYYDYETVRN

potassium-

TGLS

FYYD

GDVDP

transport-

ing

ATPase

subunit

gamma

461
immuno-
1217
GSSG-
1639
SPLS-
2128
LMLWVSGSSG
2549
DIVMTQSPLS
3051
LPVTPGEPAS

globulin

DIVM

LPVT

kappa

variable

2-28

462
immuno-
1218
GSSG-
1640
LPVT-
2129
LMLWVSGSSG
2550
DIVMTQSPLSLP
3052
PGEPASISCR

globulin

DIVM

PGEP

VT

kappa

variable

2-28

463
onco-
1219
RMRR-

N/A
2130
AQGCHRRMRR

N/A
3053
GAGGEDSAGL

protein-

GAGG

(end

QGQTLTGGPI

induced

of

RIDWED

transcript

pro-

3 protein

tein)

464
serglycin
1220
SDAF-

N/A
2131
YQLVDESDAF

N/A
3054
HDNLRSLDRN

HDNL

(end

LPSDSQDLGQ

of

HGLEEDFML

pro-

tein)

465
coagula-
1221
GDRN-

N/A
2132
SWGSGCGDRN

N/A
3055
KPGVYTDVAY

tion

KPGV

(end

YLAWIREHTV

factor XII

of

S

pro-

tein)

466
coagula-
1222
M-
1641
VVPR-

N/A
2551
SETSRTAFGGRR
3056
GVNLQEFLNV

tion

SETS

GVNL

AVPPNNSNAAED

factor

DLPTVELQGVVP

XIII

R

A chain

467
insulin
1223
KTRR-
1642
GSLQ-
2133
GFFYTPKTRR
2552
EAEDLQVGQVEL
3057
KRGIVEQCCT

EAED

KRGI

GGGPGAGSLQPL

ALEGSLQ Q

468
histidine-
1224
GKFK-

N/A
2134
VSESCPGKFK

N/A
3058
SGFPQVSMFF

rich

SGFP

(end

THTFPK

glyco-

of

protein

pro-

tein)

469
immuno-
1225
DTTG-
1643
ATLS-
2135
LLLWLPDTTG
2553
EIVLTQSPATLS
3059
LSPGERATLS

globulin

EIVL

LSPG

kappa

variable

3-11

470
immuno-
1226
GARC-
1644
SSLS-
2136
LLLWLRGARC
2554
DIQMTQSPSSLS
3060
ASVGDRVTIT

globulin

DIQM

ASVG

kappa

variable

1-3

471
collagen
1227
AGFD-
1645
QPPQ-
2137
PPGPPSAGFD
2555
FSFLPQPPQ
3061
EKAHDGGRYY

alpha-1(1)

FSFL

EKAH

chain

472
inter-
1228
CVGS-
1646
RVPR-
2138
CLGLSLCVGS
2556
QEEAQSWGHSSE
3062
QVRLLQRLKT

alpha-

QEEA

QVRL

QDGLRVPR

trypsin

inhibitor

heavy

chain H5

473
latent-
1229
AGHA-
1647
AAKV-
2139
LALFVGAGHA
2557
QRDPVGRYEPAG
3063
YSLFREQDAP

trans-

QRDP

YSLF

GDANRLRRPGGS

forming

YPAAAAAKV

growth

factor

beta-

binding

protein 2

474
latent-
1230
AGHA-
1648
RPGG-
2140
LALFVGAGHA
2558
QRDPVGRYEPAG
3064
SYPAAAAAKV

trans-

QRDP

SYPA

GDANRLRRPGG

forming

growth

factor

beta-

binding

protein 2

475
integrin
1231
RDRR-
1649
QPSR-
2141
HPAHHKRDRR
2559
QIFLPEPEQPSR
3065
LQDPVLVSCD

alpha-IIb

QIFL

LQDP

476
membrane-
1232
LLYK-
1650
LPRL-
2142
LGLCIFLLYK
2560
IVRGDQPAASGD
3066
KRRDFTPAEL

associated

IVRG

KRRD

SDDDEPPPLPRL

progester-

one

receptor

component

1

477
immuno-
1233
GSWA-
1651
HSVS-
2143
LLAHCTGSWA
2561
NFMLTQPHSVS
3067
ESPGKTVTIS

globulin

NFML

ESPG

lambda

variable

6-57

478
immuno-
1234
SWAN-
1652
HSVS-
2144
LAHCTGSWAN
2562
FMLTQPHSVS
3068
ESPGKTVTIS

globulin

FMLT

ESPG

lambda

variable

6-57

479
immuno-
1235
DTTG-
1653
ATLS-
2145
LLLWLPDTTG
2563
EIVMTQSPATLS
3069
VSPGERATLS

globulin

EIVM

VSPG

kappa

variable

3-15

480
complement
1236
PTRG-
1654
QQLT-
2146
GVLQACPTRG
2564
SVLLAQELPQQL
3070
SPGYPEPYGK

C1r

SVLL

SPGY

T

sub-

component-

like

protein

481
histone
1237
M-
1655
KKAA-

N/A
2565
SETAPAAPAAAP
3071
KKAGGTPRKA

H1.2

SETA

KKAG

PAEKAPVKKKAA

482
rho GDP-
1238
M-
1656
KLNY-

N/A
2566
TEKAPEPHVEED
3072
KPPPQKSLKE

dissoci-

TEKA

KPPP

DDDELDSKLNY

ation

inhibitor

2

483
latent-
1239
FARR-
1657
SRDT-
2147
PPPPGPFARR
2567
EAPYGAPRFDMP
3073
RRSFPEPEEP

trans-

EAPY

RRSF

DFEDDGGPYGES

forming

EAPAPPGPGTRW

growth

PYRSRDT

factor

beta-

binding

protein 4

484
collagen
1240
PWRA-
1658
HHSS-
2148
PHPTARPWRA
2568
DDILASPPRLPE
3074
YVHLRPARPT

alpha-

DDIL

YVHL

PQPYPGAPHHSS

1(XVIII)

chain

485
immuno-
1241
GSWA-
1659
PSVS-
2149
LLTQGTGSWA
2569
QSALTQPPSVS
3075
GSPGQSVTIS

globulin

QSAL

GSPG

lambda

variable

2-18

486
immuno-
1242
QSAL-
1660
PSVS-
2150
GTGSWAQSAL
2570
TQPPSVS
3076
GSPGQSVTIS

globulin

TQPP

GSPG

lambda

variable

2-18

487
zinc-
1243
SSLA-

N/A
2151
SCHVQHSSLA

N/A
3077
QPLVVPWEAS

alpha-2-

QPLV

(end

glycopro-

of

tein

pro-

tein)

488
talin-1
1244
TVLQ-
1661
FQVG-
2152
VSPKKSTVLQ
2571
QQYNRVGKVEHG
3078
SMPPAQQQIT

QQYN

SMPP

SVALPAIMRSGA

SGPENFQVG

489
secreto-
1245
LRDP-
1662
HSRE-
2153
KFEVRLLRDP
2572
ADASEAHESSSR
3079
RADEPQWSLY

granin-1

ADAS

RADE

GEAGAPGEEDIQ

GPTKADTEKWAE

GGGHSRE

490
neutrophil
1246
QAQA-
1663
PEQI-
2154
ILLVALQAQA
2573
EPLQARADEVAA
3080
AADIPEVVVS

defensin 3

EPLQ

AADI

APEQI

491
cytochrome
1247
LYDN-
1664
PEKF-
2155
PTLDSVLYDN
2574
QEFPDPEKF
3081
KPEHFLNENG

P450 2E1

QEFP

KPEH

492
gastric
1248
VGLG-
1665
RGPR-
2156
LSLFLAVGLG
2575
EKKEGHFSALPS
3082
YAEGTFISDY

inhibitory

EKKE

YAEG

LPVGSHAKVSSP

polypep-

QPRGPR

tide

493
immuno-
1249
GVQC-
1666
GSLR-
2157
LAAILKGVQC
2576
EVQLVESGGGLV
3083
LSCAASGFTF

globulin

EVQL

LSCA

KPGGSLR

heavy

variable

3-15

494
immuno-
1250
SWAQ-
1667
RSVS-
2158
LTQGTGSWAQ
2577
SALTQPRSVS
3084
GSPGQSVTIS

globulin

SALT

GSPG

lambda

variable

2-11

495
trans-
1251
DIDC-
1668
PPPP-
2159
YDADCEDIDC
2578
KLMPPPPPP
3085
PGPMKKDKDQ

cription

KLMP

PGPM

initiation

factor

TFIID

subunit 1

496
collagen
1252
EQGR-
1669
PPGP-
2160
EKGERGEQGR
2579
DGPPGLPGTPGP
3086
KVSVDEPGPG

alpha-

DGPP

KVSV

PGPPGP

1(VII)

chain

497
kinino-
1253
SLMK-
1670
FSPF-
2161
QPLGMISLMK
2580
RPPGFSPF
3087
RSSRIGEIKE

gen-1

RPPG

RSSR

498
integral
1254
AIRH-
1671
AVET-
2162
EASNCFAIRH
2581
FENKFAVET
3088
LICS

membrane

FENK

LICS

protein

2B

499
pigment
1255
QPAH-

N/A
2163
TPSPGLQPAH

N/A
3089
LTFPLDYHLN

epithel-

LTFP

(end

QPFIFVLRDT

ium-

of

DTGALLFIGK

derived

pro-

ILDPRGP

factor

tein)

500
voltage-
1256
RHRA-
1672
ADKE-
2164
GEEPARRHRA
2582
RHKAQPAHEAVE
3090
KELRNHQPRE

dependent

RHKA

KELR

KETTEKEATEKE

N-type

AEIVEADKE

calcium

channel

subunit

alpha-1B

501
immuno-
1257
SVAS-
1673
SSVS-
2165
LILCTVSVAS
2583
YELTQPSSVS
3091
VSPGQTARIT

globulin

YELT

VSPG

lambda

variable

3-27

502
ras
1258
PGGL-
1674
LSFQ-
2166
HPALNQPGGL
2584
QPLSFQ
3092
NPVYHLNNPI

GTPase-

QPLS

NPVY

activating

protein

nGAP

503
keratin,
1259
RQVR-
1675
HQTT-
2167
KEPVTTRQVR
2585
TIVEEVQDGKVI

N/A

type I

TIVE

R

SSREQVHQTT

cytoskel-

etal 17

504
tubulin
1260
MNTF-
1676
EPYN-
2168
EYPDRIMNTF
2586
SVVPSPKVSDTV
3093
ATLSVHQLVE

beta chain

SVVP

ATLS

VEPYN

505
sulfhydryl
1261
PGLR-
1677
WHLS-
2169
RPPKLHPGLR
2587
AAPGQEPPEHMA
3094
KRDTGAALLA

oxidase 1

AAPG

KRDT

ELQRNEQEQPLG

QWHLS

506
immuno-
1262
GAYG-
1678
SLAV-
2170
LLLWISGAYG
2588
DIVMTQSPDSLA
3095
SLGERATINC

globulin

DIVM

SLGE

V

kappa

variable

4-1

507
complement
1263
RAGG-
1679
GEVT-
2171
VPALFCRAGG
2589
SIPIPQKLFGEV
3096
SPLFPKPYPN

C1r sub-

SIPI

SPLF

T

component

508
homeobox
1264
KKPS-
1680
SPSP-
2172
KEKKSAKKPS
2590
QSATSPSP
3097
AASAVPASGV

protein

QSAT

AASA

Hox-B2

509
trans-
1265
VALP-
1681
SPPG-
2173
ISKPPGVALP
2591
TVSPPG
3098
VDAKAQVKTE

cription

TVSP

VDAK

factor

SOX-10

510
E3
1266
NKPC-
1682
TPSP-
2174
STGPSANKPC
2592
SKQPPPQPQHTP
3099
AAPPAAATIS

ubiquitin-

SKQP

AAPP

SP

protein

ligase

SIAH2

511
decorin
1267
GLDK-

N/A
2175
VQCSDLGLDK

N/A
3100
VPKDLPPDTT

VPKD

512
SPARC
1268
HPVE-

N/A
2176
RLEAGDHPVE

N/A
3101
LLARDFEKNY

LLAR

513
elastin
1269
LGYP-

N/A
2177
PQPGVPLGYP

N/A
3102
IKAPKLPGGY

IKAP

514
elastin
1270
PGVV-

N/A
2178
GGPGFGPGVV

N/A
3103
GVPGAGVPGV

GVPG

515
type I
1271
GVRG-

N/A
2179
GSPGKDGVRG

N/A
3104
LTGPIGPPGP

collagen

LTGP

alpha-1

chain

516
type IV
1272
SDGL-

N/A
2180
EPGPAGSDGL

N/A
3105
PGLKGKRGDS

collagen

PGLK

alpha-1

chain

517
laminin
1273
QAKN-

N/A
2181
LNRKYEQAKN

N/A
3106
ISQDLEKQAA

gamma 1

ISQD

chain

518
vimentin
1274
PGVR-

N/A
2182
RLRSSVPGVR

N/A
3107
LLQDSVDFSL

LLQD

519
type III
1275
QGLQ-

N/A
2183
TGPPGPQGLQ

N/A
3108
GLPGTGGPPG

collagen

GLPG

520
type IV
1276
DPGE-

N/A
2184
LPGMKGDPGE

N/A
3109
ILGHVPGMLL

collagen

ILGH

alpha-1

chain

521
type IV
1277
PPGP-

N/A
2185
LPGSPGPPGP

N/A
3110
PGDIVFRKGP

collagen

PGDI

alpha-3

chain

522
type VII
1278
GRLV-

N/A
2186
GPPGPPGRLV

N/A
3111
DTGPGAREKG

collagen

DTGP

E

alpha-1

chain

523
fibrinogen
1279
ADSG-
1683
GGVR-
2187
VGTAWTADSG
2593
EGDFLAEGGGVR
3112
GPRVVERHQS

alpha

EGDF

GPRV

chain

524
fibrinogen
1280
AWTA-
1684
GGVR-
2188
LSVVGTAWTA
2594
DSGEGDFLAEGG
3113
GPRVVERHQS

alpha

DSGE

GPRV

GVR

chain

525
elastin
1281
SPEA-

N/A
2189
VPGVGISPEA

N/A
3114
QAAAAAKAAK

QAAA

526
C-reactive
1282
DMSR-

N/A
2190
HAFGQTDMSR

N/A
3115
KAFVFP

protein

KAFV

527
elastin
1283
GPGG-

N/A
2191
GVAPGIGPGG

N/A
3116
VAAAAKSAAK

VAAA

528
type VI
1284
GAKG-

N/A
2192
APRGVKGAKG

N/A
3117
YRGPEGPQGP

collagen

YRGP

alpha-1

chain

529
type V
1285
GPSG-

N/A
2193
GPPGKRGPSG

N/A
3118
HMGREGREGE

collagen

HMGR

alpha-1

chain

530
complement
1286
STGR-
1685
RQIR-
2194
LNVTLSSTGR
2595
NGFKSHALQLNN
3119
GLEEELQFSL

C4-A OR

NGFK

GLEE

RQIR

complement

C4-B

531
complement
1287
LPSR-
1686
SLLR-
2195
LDVSLQLPSR
2596
SSKITHRIHWES
3120
SEETKENEGF

C3

SSKI

SEET

ASLLR

532
fibrinogen
1288
FESK-
1687
RPVR-
2196
NRGDSTFESK
2597
SYKMADEAGSEA
3121
DCDDVLQTHP

alpha

SYKM

DCDD

DHEGTHSTKRGH

chain

AKSRPVR

533
nidogen-1
1289
HERE-

N/A
2197
VEKTRCQHER

N/A
3122
HILGAAGATD

HILG

E

534
type VI
1290
GNRG-

N/A
2198
GPKGGIGNRG

N/A
3123
PRGETGDDGR

collagen

PRGE

alpha-3

chain

*The (putative) scissile bond of each cleavage sequence listed in Table A, cleavage sequence 1 and cleavage sequence 2 (if present) in each reporter polypeptide, is indicated by a hyphen (-).

“N/A” indicates that the amino acid sequence of the corresponding cleavage sequence is not, or cannot be, specified in the instance.

In some embodiments of the compositions (such as the therapeutic agents, or activatable therapeutic agents described hereinabove) or methods described herein, the mammalian protease (for cleavage of the release segment (RS), or the first release segment (RS1), or the second release segment (RS2)) can be a serine protease, a cysteine protease, an aspartate protease, a threonine protease, or a metalloproteinase. The mammalian protease (for cleavage of the release segment (RS), or the first release segment (RS1), or the second release segment (RS2)) can be selected from the group consisting of disintegrin and metalloproteinase domain-containing protein 10 (ADAM10), disintegrin and metalloproteinase domain-containing protein 12 (ADAM12), disintegrin and metalloproteinase domain-containing protein 15 (ADAM15), disintegrin and metalloproteinase domain-containing protein 17 (ADAM17), disintegrin and metalloproteinase domain-containing protein 9 (ADAM9), disintegrin and metalloproteinase with thrombospondin motifs 5 (ADAMTS5), Cathepsin B, Cathepsin D, Cathepsin E, Cathepsin K, cathepsin L, cathepsin S, Fibroblast activation protein alpha, Hepsin, kallikrein-2, kallikrein-4, kallikrein-3, Prostate-specific antigen (PSA), kallikrein-13, Legumain, matrix metallopeptidase 1 (MMP-1), matrix metallopeptidase 10 (MMP-10), matrix metallopeptidase 11 (MMP-11), matrix metallopeptidase 12 (MMP-12), matrix metallopeptidase 13 (MMP-13), matrix metallopeptidase 14 (MMP-14), matrix metallopeptidase 16 (MMP-16), matrix metallopeptidase 2 (MMP-2), matrix metallopeptidase 3 (MMP-3), matrix metallopeptidase 7 (MMP-7), matrix metallopeptidase 8 (MMP-8), matrix metallopeptidase 9 (MMP-9), matrix metallopeptidase 4 (MMP-4), matrix metallopeptidase 5 (MMP-5), matrix metallopeptidase 6 (MMP-6), matrix metallopeptidase 15 (MMP-15), neutrophil elastase, protease activated receptor 2 (PAR2), plasmin, prostasin, PSMA-FOLH1, membrane type serine protease 1 (MT-SP1), matriptase, and u-plasminogen. The mammalian protease (for cleavage of the release segment (RS), or the first release segment (RS1), or the second release segment (RS2)) can be selected from the group consisting of matrix metallopeptidase 1 (MMP1), matrix metallopeptidase 2 (MMP2), matrix metallopeptidase 7 (MMP1), matrix metallopeptidase 9 (MMP9), matrix metallopeptidase 11 (MMP11), matrix metallopeptidase 14 (MMP14), urokinase-type plasminogen activator (uPA), legumain, and matriptase. The mammalian protease can be preferentially expressed or activated in the target tissue or cell.

In some embodiments of the compositions (such as the therapeutic agents, or activatable therapeutic agents described hereinabove) or methods described herein, the target tissue or cell can be characterized by an increased amount or activity of a mammalian protease (such as one described herein) in proximity to the target tissue or cell as compared to a non-target tissue or cell in a subject. The target tissue or cell can be characterized by a presence, in proximity thereto, of at least (about) 10% more, at least (about) 20% more, at least (about) 30% more, at least (about) 40% more, at least (about) 50% more, at least (about) 60% more, at least (about) 70% more, at least (about) 80% more, at least (about) 90% more, at least (about) 100% more, or at least (about) 200% more amount of the mammalian protease as compared to a non-target tissue or cell in the subject. The target tissue or cell can be characterized by an activity, in proximity thereto, of the mammalian protease of at least (about) 10% higher, at least (about) 20% higher, at least (about) 30% higher, at least (about) 40% higher, at least (about) 50% higher, at least (about) 60% higher, at least (about) 70% higher, at least (about) 80% higher, at least (about) 90% higher, at least (about) 100% higher, or at least (about) 200% higher as compared to a non-target tissue or cell in the subject. The target tissue or cell can produce or can be co-localized with the mammalian protease (such as one described herein). The target tissue or cell can be a tumor.

In some embodiments, the compositions of this disclosure (such as activatable therapeutic agents) are designed with considerations of the location of the target tissue protease as well as the presence of the same protease in healthy tissues not intended to be targeted, but a greater presence of the ligand in unhealthy target tissue, in order to provide a wide therapeutic window. A “therapeutic window” refers to the largest difference between the minimal effective dose and the maximal tolerated dose for a given therapeutic composition. To help achieve a wide therapeutic window, the binding domains of the compositions are shielded by the proximity of the masking moiety (e.g., XTEN) such that the binding affinity of the intact composition for one or both of the ligands is reduced compared to the composition that has been cleaved by a mammalian protease, thereby releasing the biologically active moiety from the shielding effects of the masking moiety.

Nucleic Acids, Expression Vectors, Host Cells

Provided herein, in some embodiments, is an isolated nucleic acid comprising: (a) a polynucleotide encoding a recombinant polypeptide as described herein; or (b) a reverse complement of the polynucleotide of (a).

Provided herein, in some embodiments, is an expression vector comprising a polynucleotide sequence as described herein and a recombinant regulatory sequence operably linked to the polynucleotide sequence.

Provided herein, in some embodiments, is an isolated host cell, comprising an expression vector as described herein. The isolated host cell can be a prokaryote. The isolated host cell can be E. coli. The isolated host cell can be mammalian cell(s).

Pharmaceutical Compositions

Provided herein, in some embodiments, is a pharmaceutical composition comprising a therapeutic agent (such as described hereinabove or described anywhere else herein) and one or more pharmaceutically suitable excipients. The pharmaceutical composition can be formulated for oral, intradermal, subcutaneous, intravenous, intra-arterial, intraabdominal, intraperitoneal, intrathecal, or intramuscular administration. The pharmaceutical composition can be in a liquid form or frozen form. The pharmaceutical composition can be in a pre-filled syringe for a single injection. The pharmaceutical composition can be formulated as a lyophilized powder to be reconstituted prior to administration.

Kits

Provided herein, in some embodiments, is a kit comprising a pharmaceutical composition described herein (or a therapeutic agent described herein), a container, and a label or package insert on or associated with the container.

Methods
Methods for Assessing a Likelihood of a Response to Therapeutic Agent(s)

Provided herein, in some embodiments, is a method for assessing a likelihood of a subject being responsive to a therapeutic agent that is activatable by a mammalian protease expressed in the subject, the method comprising:

- (a) determining, in a biological sample from the subject, a presence or an amount of
  - (i) a polypeptide comprising at least four, at least five, at least six, at least seven, at least eight, at least nine, or at least ten consecutive amino acid residues shown in a sequence set forth in Column V of Table A (or a subset thereof); or
  - (ii) a polypeptide comprising at least four, at least five, at least six, at least seven, at least eight, at least nine, or at least ten consecutive amino acids shown in a sequence set forth in Column IV of Table A (or a subset thereof); or
  - (iii) a polypeptide comprising at least four, at least five, at least six, at least seven, at least eight, at least nine, or at least ten consecutive amino acids shown in a sequence set forth in Column VI of Table A (or a subset thereof); and
- (b) designating the subject as being likely to respond to the therapeutic agent when the polypeptide of (i), (ii) or (iii) is present and/or if its amount exceeds a threshold.

In some embodiments of the method for assessing the likelihood of the subject being responsive to the therapeutic agent, the therapeutic agent can comprise a peptide substrate susceptible to cleavage by the mammalian protease (e.g., at a scissile bond). The peptide substrate can be susceptible to cleavage by the mammalian protease at a scissile bond. The polypeptide of (i), (ii), or (iii) can comprise a portion (e.g., containing at least four, at least five, at least six, at least seven, at least eight, at least nine, at least ten, at least eleven, at least twelve, at least thirteen, at least fourteen, or at least fifteen consecutive amino acid residues) of the peptide substrate that is either N-terminal or C-terminal side of the scissile bond. The portion (e.g., containing at least four, at least five, at least six, at least seven, at least eight, at least nine, at least ten, at least eleven, at least twelve, at least thirteen, at least fourteen, or at least fifteen consecutive amino acid residues) of the peptide substrate can be either immediately N-terminal or immediately C-terminal of the scissile bond. The polypeptide of (i) can comprise at least four, at least five, at least six, at least seven, at least eight, at least nine, or at least ten consecutive amino acid residues shown in a sequence set forth in Column V of Table A (or a subset thereof). The polypeptide of (i) can comprise a sequence set forth in Column V of Table A (or a subset thereof). The polypeptide of (ii) can comprise at least four, at least five, at least six, at least seven, at least eight, at least nine, or at least ten consecutive amino acids shown in a sequence set forth in Column IV of Table A (or a subset thereof). The polypeptide of (ii) can comprise a sequence set forth in Column IV of Table A (or a subset thereof). The polypeptide of (iii) can comprise at least four, at least five, at least six, at least seven, at least eight, at least nine, or at least ten consecutive amino acids shown in a sequence set forth in Column VI of Table A (or a subset thereof). The polypeptide of (iii) can comprise a sequence set forth in Column VI of Table A (or a subset thereof). In some embodiments of the method for assessing the likelihood, (a) comprises determining the presence or the amount of any two of (i)-(iii). In some embodiments of the method for assessing the likelihood, (a) comprises determining the presence or the amount of all three of (i)-(iii). Additionally or alternatively, the subject designated, by the method described herein in the section entitled “METHODS FOR ASSESSING A LIKELIHOOD OF A RESPONSE TO THERAPEUTIC AGENT(S),” as being likely to respond to the activatable therapeutic agent (such as one described herein) can be one with an expression profile of biomarker(s) such that, upon administering an activatable therapeutic agent (such as one described herein) to the subject, the activatable therapeutic agent is more likely than not to be cleaved at or near the target tissue(s) or cell(s) (such as described herein in the “Target Tissues or Cells” section), e.g., by mammalian protease(s), thereby activating the therapeutic agent. In some embodiments of the method for assessing the likelihood, the threshold can be zero or nominal. The peptide substrate can be any peptide substrate described hereinabove in the RELEASE SEGMENTS section or described anywhere else herein. The activatable therapeutic agent can be any therapeutic agent (or any activatable therapeutic agent, or any non-natural, activatable therapeutic agent) as described hereinabove in the THERAPEUTIC AGENTS section or described anywhere else herein. The mammalian protease can be any mammalian protease as described hereinabove in the TARGET TISSUES OR CELLS section or described anywhere else herein. The target tissue or cell can be any one described hereinabove in the TARGET TISSUES OR CELLS section or described anywhere else herein. The target tissue or cell can be a tumor.

In some embodiments of the method for assessing the likelihood, the biological sample can be selected from serum, plasma, blood, spinal fluid, semen, and saliva. The biological sample can comprise a serum or plasma sample. The biological sample can comprise a serum sample. The biological sample can comprise a plasma sample. The biological sample can comprise a blood sample. The biological sample can comprise a spinal fluid sample. The biological sample can comprise a semen sample. The biological sample can comprise a saliva sample.

In some embodiments of the method for assessing the likelihood, the subject can be suffering from, or can be suspected of suffering from, a disease or condition characterized by an increased expression or activity of the mammalian protease in proximity to a target tissue or cell (such as one described hereinabove in the TARGET TISSUES OR CELLS section or described anywhere else herein) as compared to a corresponding non-target tissue or cell in the subject. The subject can be selected from mouse, rat, monkey, and human. The subject can be a human. In some embodiments, the disease or condition can be a cancer or an inflammatory or autoimmune disease. In some embodiments, the disease or condition can be a cancer. The cancer can be selected from the group consisting of carcinoma, Hodgkin's lymphoma, and non-Hodgkin's lymphoma, diffuse large B cell lymphoma, follicular lymphoma, mantle cell lymphoma, blastoma, breast cancer, ER/PR+ breast cancer, Her2+ breast cancer, triple-negative breast cancer, colon cancer, colon cancer with malignant ascites, mucinous tumors, prostate cancer, head and neck cancer, skin cancer, melanoma, genito-urinary tract cancer, ovarian cancer, ovarian cancer with malignant ascites, peritoneal carcinomatosis, uterine serous carcinoma, endometrial cancer, cervix cancer, colorectal, uterine cancer, mesothelioma in the peritoneum, kidney cancer, Wilm's tumor, lung cancer, small-cell lung cancer, non-small cell lung cancer, gastric cancer, stomach cancer, small intestine cancer, liver cancer, hepatocarcinoma, hepatoblastoma, liposarcoma, pancreatic cancer, gall bladder cancer, cancers of the bile duct, esophageal cancer, salivary gland carcinoma, thyroid cancer, epithelial cancer, arrhenoblastoma, adenocarcinoma, sarcoma, and B-cell derived chronic lymphatic leukemia. In some embodiments, the disease or condition can be an inflammatory or autoimmune disease. The inflammatory or autoimmune disease can be selected from the group consisting of ankylosing spondylitis (AS), arthritis (for example, and not limited to, rheumatoid arthritis (RA), juvenile idiopathic arthritis (JIA), osteoarthritis (OA), psoriatic arthritis (PsA), gout, chronic arthritis), chagas disease, chronic obstructive pulmonary disease (COPD), dermatomyositis, type 1 diabetes, endometriosis, Goodpasture syndrome, Graves' disease, Guillain-Barre syndrome (GB S), Hashimoto's disease, suppurative scab, Kawasaki disease, IgA nephropathy, idiopathic thrombocytopenic purpura, inflammatory bowel disease (IBD) (for example, and not limited to, Crohn's disease (CD), clonal disease, ulcerative colitis, collagen colitis, lymphocytic colitis, ischemic colitis, empty colitis, Behcet's syndrome, infectious colitis, indeterminate colitis, interstitial Cystitis), lupus (for example, and not limited to, systemic lupus erythematosus, discoid lupus, subacute cutaneous lupus erythematosus, cutaneous lupus erythematosus (such as chilblain lupus erythematosus), drug-induced lupus, neonatal lupus, lupus nephritis), mixed connective tissue disease, morphea, multiple sclerosis (MS), severe muscle Force disorder, narcolepsy, neuromuscular angina, pemphigus vulgaris, pernicious anemia, psoriasis, psoriatic arthritis, polymyositis, primary biliary cirrhosis, relapsing polychondritis, schizophrenia, scleroderma, Sjogren's syndrome, systemic stiffness syndrome, temporal arteritis (also known as giant cell arteritis), vasculitis, vitiligo, Wegener's granulomatosis, transplant rejection-associated immune reaction(s) (for example, and not limited to, renal transplant rejection, lung transplant rejection, liver transplant rejection), psoriasis, Wiskott-Aldrich syndrome, autoimmune lymphoproliferative syndrome, myasthenia gravis, inflammatory chronic rhinosinusitis, colitis, celiac disease, Barrett's esophagus, inflammatory gastritis, autoimmune nephritis, autoimmune hepatitis, autoimmune carditis, autoimmune encephalitis, autoimmune mediated hematological disease, asthma, atopic dermatitis, atopy, allergy, allergic rhinitis, scleroderma, bronchitis, pericarditis, the inflammatory disease is, Alzheimer's disease, Parkinson's disease, amyotrophic lateral sclerosis, inflammatory lung disease, inflammatory skin disease, atherosclerosis, myocardial infarction, stroke, gram-positive shock, gram-negative shock, sepsis, septic shock, hemorrhagic shock, anaphylactic shock, systemic inflammatory response syndrome. Additionally or alternatively, the subject designated, by the method described herein in the section entitled “METHODS FOR ASSESSING A LIKELIHOOD OF A RESPONSE TO THERAPEUTIC AGENT(S),” as being likely to respond to the activatable therapeutic agent (such as one described herein) can be one with an expression profile of biomarker(s) such that, upon administering an activatable therapeutic agent (such as one described herein) to the subject, the activatable therapeutic agent is more likely than not to be cleaved at or near the target tissue(s) or cell(s) (such as described herein in the “Target Tissues or Cells” section), e.g., by mammalian protease(s), thereby activating the therapeutic agent. In some embodiments, the method for assessing the likelihood can further comprise transmitting the designation to a healthcare provider and/or the subject. In some embodiments, the method for assessing the likelihood can further comprise, subsequent to (b), contacting the therapeutic agent with the mammalian protease. In some embodiments, the method for assessing the likelihood can further comprise, subsequent to (b), administering to the subject an effective amount of the therapeutic agent based on the designation of step (b). In some embodiments of the method for assessing the likelihood, (a) can comprise detecting the polypeptide of (i), (ii) or (iii) in an immunoassay. The immunoassay can utilize an antibody that specifically binds to the polypeptide of (i), (ii) or (iii), or an epitope thereof. In some embodiments of the method for assessing the likelihood, (a) can comprise detecting the polypeptide of (i), (ii) or (iii) by using a mass spectrometer (MS) (including but not limited to LC-MS, LC-MS/MS, etc.).

Methods for Preparing Therapeutic Agent(s)

Provided herein, in some embodiments, is a method for preparing an activatable therapeutic agent, the method comprising:

- (a) culturing a host cell comprising a nucleic acid construct that encodes a recombinant polypeptide under conditions sufficient to express the recombinant polypeptide in the host cell, wherein the recombinant polypeptide comprises a biologically active polypeptide (BP), a release segment (RS), and a masking moiety (MM), wherein:
  - the RS comprises a peptide substrate susceptible for cleavage by a mammalian protease at a scissile bond, wherein the peptide substrate comprises an amino acid sequence having at least 80% sequence identity to a sequence set forth in Column II or III of Table A (or a subset thereof) and/or the group set forth in Tables 1(a)-1(j); and
  - the recombinant polypeptide has a structural arrangement from N-terminus to C-terminus of BP-RS-MM or MM-RS-BP; and
- (b) recovering the activatable therapeutic agent comprising the recombinant polypeptide.

In some embodiments of the method for preparing the activatable therapeutic agent, the release segment (RS) can be a first release segment (RS1), the peptide substrate can be a first peptide substrate, the scissile bond can be a first scissile bond, the masking moiety (MM) can be a first masking moiety (MM1), and the recombinant polypeptide can further comprise a second release segment (RS2), and a second masking moiety (MM2), where:

- the RS2 comprises a second peptide substrate susceptible for cleavage by a mammalian protease at a second scissile bond, where the second peptide substrate can comprise an amino acid sequence having at least 80% sequence identity to a sequence set forth in Column II or III of Table A (or a subset thereof) and/or the group set forth in Tables 1(a)-1(j); and
- the recombinant polypeptide can have a structural arrangement from N-terminus to C-terminus of MM1-RS1-BP-RS2-MM2, MM1-RS2-BP-RS1-MM2, MM2-RS1-BP-RS2-MM1, or MM2-RS2-BP-RS1-MM1.

In some embodiments of the method for preparing the activatable therapeutic agent, the masking moiety (MM) can comprise an extended recombinant polypeptide (XTEN) (such as one described hereinabove in the MASKING MOIETIES section or described anywhere else herein). In some embodiments of the method for preparing the activatable therapeutic agent, where the activatable therapeutic agent comprises a first masking moiety (MM1) and a second masking moiety (MM2), one of the MM1 and the MM2 can be a first extended recombinant polypeptide (XTEN1) (such as one described hereinabove in the MASKING MOIETIES section or described anywhere else herein). The other one of the MM1 and the MM2 can comprise a second extended recombinant polypeptide (XTEN2) (such as one described hereinabove in the MASKING MOIETIES section or described anywhere else herein).

In some embodiments of the method for preparing the activatable therapeutic agent, the recombinant polypeptide can be anyone described herein. The masking moiety (MM), when linked to the recombinant polypeptide, can interfere with an interaction of the biologically active peptide (BP) to a target tissue or cell such that a dissociation constant (K_d) of the BP of the recombinant polypeptide with a target cell marker borne by the target tissue or cell can be greater, when the recombinant polypeptide is in an uncleaved state, compared to a dissociation constant (K_d) of a corresponding biologically active peptide released from the recombinant polypeptide. The first masking moiety (MM1) and the second masking moiety (MM2), when both linked in the recombinant polypeptide, can (each independently, individually or collectively) interfere with an interaction of the biologically active peptide (BP) to a target tissue or cell such that a dissociation constant (K_d) of the BP of the recombinant polypeptide with a target cell marker borne by the target tissue or cell can be greater, when the recombinant polypeptide is in an uncleaved state, compared to a dissociation constant (K_d) of a corresponding biologically active peptide, when one or both of the first release segment (RS1) and the second release segment (RS2) is/are cleaved. The dissociation constant (Kd) can be measured in an in vitro assay under equivalent molar concentrations. The in vitro assay can be selected from cell membrane integrity assay, mixed cell culture assay, cell-based competitive binding assay, FACS based propidium Iodide assay, trypan Blue influx assay, photometric enzyme release assay, radiometric 51Cr release assay, fluorometric Europium release assay, CalceinAM release assay, photometric MTT assay, XTT assay, WST-1 assay, alamar blue assay, radiometric 3H-Thd incorporation assay, clonogenic assay measuring cell division activity, fluorometric rhodamine123 assay measuring mitochondrial transmembrane gradient, apoptosis assay monitored by FACS-based phosphatidylserine exposure, ELISA-based TUNEL test assay, sandwich ELISA, caspase activity assay, cell-based LDH release assay, reporter gene activity assay, and cell morphology assay, or any combination thereof.

Methods for Treating Subjects with Therapeutic Agent(s)

Provided herein, in some embodiments, is a method for treating a subject with an activatable therapeutic agent, the method comprising:

- (a) identifying the subject as having a likelihood of a response to the activatable therapeutic agent based on identification of a peptide biomarker in a biological sample from the subject, which activatable therapeutic agent comprises a peptide substrate susceptible to cleavage by a mammalian protease at a scissile bond; and
- (b) administering the activatable therapeutic agent to the subject based on the identification of the subject in (a);
- wherein the peptide biomarker comprises a portion identical to at least four consecutive amino acid residues of the peptide substrate that is either N-terminal or C-terminal of the scissile bond.

In some embodiments described in the immediately preceding paragraph, the peptide substrate can be any peptide substrate described hereinabove in the RELEASE SEGMENTS section or described anywhere else herein. The activatable therapeutic agent can be any therapeutic agent (or any activatable therapeutic agent, or any non-natural, activatable therapeutic agent) as described hereinabove in the THERAPEUTIC AGENTS section or described anywhere else herein. The mammalian protease can be any mammalian protease as described hereinabove in the TARGET TISSUES OR CELLS section or described anywhere else herein. The peptide biomarker can be any peptide biomarker as described hereinabove in the TARGET TISSUES OR CELLS section (such as those set forth in Table A) or described anywhere else herein. The likelihood of the response can be determined by a method as described hereinabove in the METHODS FOR ASSESSING A LIKELIHOOD OF A RESPONSE TO THERAPEUTIC AGENT(S) section or described anywhere else herein. The portion containing at least four consecutive amino acid residues can contain at least five, at least six, at least seven, at least eight, at least nine, at least ten, at least eleven, at least twelve, at least thirteen, at least fourteen, or at least fifteen consecutive amino acid residues of the peptide substrate that is either N-terminal or C-terminal of the scissile bond. The portion containing at least four (e.g., at least five, at least six, at least seven, at least eight, at least nine, at least ten, at least eleven, at least twelve, at least thirteen, at least fourteen, or at least fifteen) consecutive amino acid residues of the peptide substrate can be either immediately N-terminal or immediately C-terminal of the scissile bond. Additionally or alternatively, the subject designated, by the method described herein in the section entitled “METHODS FOR ASSESSING A LIKELIHOOD OF A RESPONSE TO THERAPEUTIC AGENT(S),” as being likely to respond to the activatable therapeutic agent (such as one described herein) can be one with an expression profile of biomarker(s) such that, upon administering an activatable therapeutic agent (such as one described herein) to the subject, the activatable therapeutic agent is more likely than not to be cleaved at or near the target tissue(s) or cell(s) (such as described herein in the “Target Tissues or Cells” section), e.g., by mammalian protease(s), thereby activating the therapeutic agent. In some embodiments, the peptide biomarker can be derived from a reporter polypeptide (such as described herein). In some embodiments, the peptide biomarker can have an amino acid sequence that is identical to a sequence of a reporter polypeptide. The reporter polypeptide can comprise a sequence set forth in Columns II-VI of Table A (or a subset thereof). In some embodiments, the peptide substrate can comprise an amino acid sequence having at most three, at most two, or at most one amino acid substitution(s) with respect to a sequence set forth in Column II or III of Table A (or a subset thereof). In some embodiments, none of the amino acid substitution can be at a position corresponding to an amino acid residue immediately adjacent to a corresponding scissile bond as indicated in Table A. In some embodiments, the peptide substrate can comprise an amino acid sequence set forth in Column II or III of Table A (or a subset thereof). In some embodiments, the peptide substrate can comprise an amino acid sequence having at most three, at most two or at most one amino acid substitution(s) with respect to a sequence set forth in Table 1(j). In some embodiments, none of the amino acid substitution can be at a position corresponding to an amino acid residue immediately adjacent to a corresponding scissile bond set forth in Table 1(j). In some embodiments, the peptide substrate can comprise an amino acid sequence set forth in Table 1(j).

Provided herein, in some embodiments, is a method for treating a subject in need of a therapeutic agent that is activatable by a mammalian protease expressed in the subject, the method comprising:

- administering an effective amount of the therapeutic agent to the subject, wherein the subject has been shown to express in a biological sample from the subject:
  - (i) a polypeptide comprising at least four, at least five, at least six, at least seven, at least eight, at least nine, or at least ten consecutive amino acid residues shown in a sequence set forth in Column V of Table A (or a subset thereof); or
  - (ii) a polypeptide comprising at least four, at least five, at least six, at least seven, at least eight, at least nine, or at least ten consecutive amino acids shown in a sequence set forth in Column IV of Table A (or a subset thereof); or
  - (iii) a polypeptide comprising at least four, at least five, at least six, at least seven, at least eight, at least nine, or at least ten consecutive amino acids shown in a sequence set forth in Column VI of Table A (or a subset thereof); or
  - (iv) expression level of polypeptide (i), (ii) or (iii) exceeds a threshold.

In some embodiments described in the immediately preceding paragraph, the threshold can be zero or nominal. The peptide substrate can be any peptide substrate described hereinabove in the RELEASE SEGMENTS section or described anywhere else herein. The activatable therapeutic agent can be any therapeutic agent (or any activatable therapeutic agent, or any non-natural, activatable therapeutic agent) as described hereinabove in the THERAPEUTIC AGENTS section or described anywhere else herein. The mammalian protease can be any mammalian protease as described hereinabove in the TARGET TISSUES OR CELLS section or described anywhere else herein. The likelihood of the response can be determined by a method described hereinabove in the METHODS FOR ASSESSING A LIKELIHOOD OF A RESPONSE TO THERAPEUTIC AGENT(S) section or described anywhere else herein. The polypeptide of (i) can comprise at least four, at least five, at least six, at least seven, at least eight, at least nine, or at least ten consecutive amino acid residues shown in a sequence set forth in Column V of Table A (or a subset thereof). The polypeptide of (i) can comprise a sequence set forth in Column V of Table A (or a subset thereof). The polypeptide of (ii) can comprise at least four, at least five, at least six, at least seven, at least eight, at least nine, or at least ten consecutive amino acids shown in a sequence set forth in Column IV of Table A (or a subset thereof). The polypeptide of (ii) can comprise a sequence set forth in Column IV of Table A (or a subset thereof). The polypeptide of (iii) can comprise at least four, at least five, at least six, at least seven, at least eight, at least nine, or at least ten consecutive amino acids shown in a sequence set forth in Column VI of Table A (or a subset thereof). The polypeptide of (iii) can comprise a sequence set forth in Column VI of Table A (or a subset thereof). The therapeutic agent can comprise a peptide substrate susceptible to cleavage by the mammalian protease (e.g., at a scissile bond). The peptide substrate can be susceptible to cleavage by the mammalian protease at a scissile bond, and the polypeptide of (i), (ii), or (iii) can comprise a portion (e.g., containing at least four, at least five, at least six, at least seven, at least eight, at least nine, at least ten, at least eleven, at least twelve, at least thirteen, at least fourteen, or at least fifteen consecutive amino acid residues) of the peptide substrate that is either N-terminal or C-terminal of the scissile bond. The portion (e.g., containing at least four, at least five, at least six, at least seven, at least eight, at least nine, at least ten, at least eleven, at least twelve, at least thirteen, at least fourteen, or at least fifteen consecutive amino acid residues) of the peptide substrate can be either immediately N-terminal or immediately C-terminal of the scissile bond. In some embodiments, the subject has been shown to express in the biological sample any two of (i)-(iii). In some embodiments, the subject has been shown to express in the biological sample all three of (i)-(iii).

In some embodiments of the method described herein this METHODS FOR TREATING SUBJECTS WITH THERAPEUTIC AGENT(S) section, the biological sample can be selected from serum, plasma, blood, spinal fluid, semen, and saliva. The biological sample can comprise a serum or plasma sample. The biological sample can comprise a serum sample. The biological sample can comprise a plasma sample. The biological sample can comprise a blood sample. The biological sample can comprise a spinal fluid sample. The biological sample can comprise a semen sample. The biological sample can comprise a saliva sample.

In some embodiments of the method described herein this METHODS FOR TREATING SUBJECTS WITH THERAPEUTIC AGENT(S) section, the subject can be suffering from, or can be suspected of suffering from, a disease or condition characterized by an increased expression or activity of the mammalian protease in proximity to a target tissue or cell (such as one described hereinabove in the TARGET TISSUES OR CELLS section or described anywhere else herein) as compared to a corresponding non-target tissue or cell in the subject. The subject can be selected from mouse, rat, monkey, and human. The subject can be a human. The subject can be determined to have a likelihood of a response to the therapeutic agent or the pharmaceutical composition. The likelihood of the response can be 50% or higher. The likelihood of the response can be determined by a method as described herein (such as one described hereinabove in the METHODS FOR ASSESSING A LIKELIHOOD OF A RESPONSE TO THERAPEUTIC AGENT(S) section). In some embodiments, the disease or condition can be a cancer or an inflammatory or autoimmune disease. In some embodiments, the disease or condition can be a cancer. The cancer can be selected from the group consisting of carcinoma, Hodgkin's lymphoma, and non-Hodgkin's lymphoma, diffuse large B cell lymphoma, follicular lymphoma, mantle cell lymphoma, blastoma, breast cancer, ER/PR+ breast cancer, Her2+ breast cancer, triple-negative breast cancer, colon cancer, colon cancer with malignant ascites, mucinous tumors, prostate cancer, head and neck cancer, skin cancer, melanoma, genito-urinary tract cancer, ovarian cancer, ovarian cancer with malignant ascites, peritoneal carcinomatosis, uterine serous carcinoma, endometrial cancer, cervix cancer, colorectal, uterine cancer, mesothelioma in the peritoneum, kidney cancer, Wilm's tumor, lung cancer, small-cell lung cancer, non-small cell lung cancer, gastric cancer, stomach cancer, small intestine cancer, liver cancer, hepatocarcinoma, hepatoblastoma, liposarcoma, pancreatic cancer, gall bladder cancer, cancers of the bile duct, esophageal cancer, salivary gland carcinoma, thyroid cancer, epithelial cancer, arrhenoblastoma, adenocarcinoma, sarcoma, and B-cell derived chronic lymphatic leukemia. In some embodiments, the disease or condition can be an inflammatory or autoimmune disease. The inflammatory or autoimmune disease can be selected from the group consisting of ankylosing spondylitis (AS), arthritis (for example, and not limited to, rheumatoid arthritis (RA), juvenile idiopathic arthritis (JIA), osteoarthritis (OA), psoriatic arthritis (PsA), gout, chronic arthritis), chagas disease, chronic obstructive pulmonary disease (COPD), dermatomyositis, type 1 diabetes, endometriosis, Goodpasture syndrome, Graves' disease, Guillain-Barre syndrome (GBS), Hashimoto's disease, suppurative scab, Kawasaki disease, IgA nephropathy, idiopathic thrombocytopenic purpura, inflammatory bowel disease (IBD) (for example, and not limited to, Crohn's disease (CD), clonal disease, ulcerative colitis, collagen colitis, lymphocytic colitis, ischemic colitis, empty colitis, Behcet's syndrome, infectious colitis, indeterminate colitis, interstitial Cystitis), lupus (for example, and not limited to, systemic lupus erythematosus, discoid lupus, subacute cutaneous lupus erythematosus, cutaneous lupus erythematosus (such as chilblain lupus erythematosus), drug-induced lupus, neonatal lupus, lupus nephritis), mixed connective tissue disease, morphea, multiple sclerosis (MS), severe muscle Force disorder, narcolepsy, neuromuscular angina, pemphigus vulgaris, pernicious anemia, psoriasis, psoriatic arthritis, polymyositis, primary biliary cirrhosis, relapsing polychondritis, schizophrenia, scleroderma, Sjogren's syndrome, systemic stiffness syndrome, temporal arteritis (also known as giant cell arteritis), vasculitis, vitiligo, Wegener's granulomatosis, transplant rejection-associated immune reaction(s) (for example, and not limited to, renal transplant rejection, lung transplant rejection, liver transplant rejection), psoriasis, Wiskott-Aldrich syndrome, autoimmune lymphoproliferative syndrome, myasthenia gravis, inflammatory chronic rhinosinusitis, colitis, celiac disease, Barrett's esophagus, inflammatory gastritis, autoimmune nephritis, autoimmune hepatitis, autoimmune carditis, autoimmune encephalitis, autoimmune mediated hematological disease, asthma, atopic dermatitis, atopy, allergy, allergic rhinitis, scleroderma, bronchitis, pericarditis, the inflammatory disease is, Alzheimer's disease, Parkinson's disease, amyotrophic lateral sclerosis, inflammatory lung disease, inflammatory skin disease, atherosclerosis, myocardial infarction, stroke, gram-positive shock, gram-negative shock, sepsis, septic shock, hemorrhagic shock, anaphylactic shock, systemic inflammatory response syndrome. Additionally or alternatively, the subject designated, by the method described herein in the section entitled “METHODS FOR ASSESSING A LIKELIHOOD OF A RESPONSE TO THERAPEUTIC AGENT(S),” as being likely to respond to the activatable therapeutic agent (such as one described herein) can be one with an expression profile of biomarker(s) such that, upon administering an activatable therapeutic agent (such as one described herein) to the subject, the activatable therapeutic agent is more likely than not to be cleaved at or near the target tissue(s) or cell(s) (such as described herein in the “Target Tissues or Cells” section), e.g., by mammalian protease(s), thereby activating the therapeutic agent.

Methods and Uses of Therapeutic Agent(s)

Provided herein, in some embodiments, is a method for treating a disease or condition in a subject, comprising administering to the subject in need thereof one or more therapeutically effective doses of a therapeutic agent (such as one described herein) or a pharmaceutical composition (such as one described herein). The subject can be selected from mouse, rat, monkey, and human. The subject can be a human. The subject can be determined to have a likelihood of a response to the therapeutic agent or the pharmaceutical composition. The likelihood of the response can be 50% or higher. The likelihood of the response can be determined by a method as described herein (such as one described hereinabove in the METHODS FOR ASSESSING A LIKELIHOOD OF A RESPONSE TO THERAPEUTIC AGENT(S) section). In some embodiments, the disease or condition can be a cancer or an inflammatory or autoimmune disease. In some embodiments, the disease or condition can be a cancer. The cancer can be selected from the group consisting of carcinoma, Hodgkin's lymphoma, and non-Hodgkin's lymphoma, diffuse large B cell lymphoma, follicular lymphoma, mantle cell lymphoma, blastoma, breast cancer, ER/PR+ breast cancer, Her2+ breast cancer, triple-negative breast cancer, colon cancer, colon cancer with malignant ascites, mucinous tumors, prostate cancer, head and neck cancer, skin cancer, melanoma, genito-urinary tract cancer, ovarian cancer, ovarian cancer with malignant ascites, peritoneal carcinomatosis, uterine serous carcinoma, endometrial cancer, cervix cancer, colorectal, uterine cancer, mesothelioma in the peritoneum, kidney cancer, Wilm's tumor, lung cancer, small-cell lung cancer, non-small cell lung cancer, gastric cancer, stomach cancer, small intestine cancer, liver cancer, hepatocarcinoma, hepatoblastoma, liposarcoma, pancreatic cancer, gall bladder cancer, cancers of the bile duct, esophageal cancer, salivary gland carcinoma, thyroid cancer, epithelial cancer, arrhenoblastoma, adenocarcinoma, sarcoma, and B-cell derived chronic lymphatic leukemia. In some embodiments, the disease or condition can be an inflammatory or autoimmune disease. The inflammatory or autoimmune disease can be selected from the group consisting of ankylosing spondylitis (AS), arthritis (for example, and not limited to, rheumatoid arthritis (RA), juvenile idiopathic arthritis (JIA), osteoarthritis (OA), psoriatic arthritis (PsA), gout, chronic arthritis), chagas disease, chronic obstructive pulmonary disease (COPD), dermatomyositis, type 1 diabetes, endometriosis, Goodpasture syndrome, Graves' disease, Guillain-Barre syndrome (GBS), Hashimoto's disease, suppurative scab, Kawasaki disease, IgA nephropathy, idiopathic thrombocytopenic purpura, inflammatory bowel disease (IBD) (for example, and not limited to, Crohn's disease (CD), clonal disease, ulcerative colitis, collagen colitis, lymphocytic colitis, ischemic colitis, empty colitis, Behcet's syndrome, infectious colitis, indeterminate colitis, interstitial Cystitis), lupus (for example, and not limited to, systemic lupus erythematosus, discoid lupus, subacute cutaneous lupus erythematosus, cutaneous lupus erythematosus (such as chilblain lupus erythematosus), drug-induced lupus, neonatal lupus, lupus nephritis), mixed connective tissue disease, morphea, multiple sclerosis (MS), severe muscle Force disorder, narcolepsy, neuromuscular angina, pemphigus vulgaris, pernicious anemia, psoriasis, psoriatic arthritis, polymyositis, primary biliary cirrhosis, relapsing polychondritis, schizophrenia, scleroderma, Sjogren's syndrome, systemic stiffness syndrome, temporal arteritis (also known as giant cell arteritis), vasculitis, vitiligo, Wegener's granulomatosis, transplant rejection-associated immune reaction(s) (for example, and not limited to, renal transplant rejection, lung transplant rejection, liver transplant rejection), psoriasis, Wiskott-Aldrich syndrome, autoimmune lymphoproliferative syndrome, myasthenia gravis, inflammatory chronic rhinosinusitis, colitis, celiac disease, Barrett's esophagus, inflammatory gastritis, autoimmune nephritis, autoimmune hepatitis, autoimmune carditis, autoimmune encephalitis, autoimmune mediated hematological disease, asthma, atopic dermatitis, atopy, allergy, allergic rhinitis, scleroderma, bronchitis, pericarditis, the inflammatory disease is, Alzheimer's disease, Parkinson's disease, amyotrophic lateral sclerosis, inflammatory lung disease, inflammatory skin disease, atherosclerosis, myocardial infarction, stroke, gram-positive shock, gram-negative shock, sepsis, septic shock, hemorrhagic shock, anaphylactic shock, systemic inflammatory response syndrome. Additionally or alternatively, the subject designated, by the method described herein in the section entitled “METHODS FOR ASSESSING A LIKELIHOOD OF A RESPONSE TO THERAPEUTIC AGENT(S),” as being likely to respond to the activatable therapeutic agent (such as one described herein) can be one with an expression profile of biomarker(s) such that, upon administering an activatable therapeutic agent (such as one described herein) to the subject, the activatable therapeutic agent is more likely than not to be cleaved at or near the target tissue(s) or cell(s) (such as described herein in the “Target Tissues or Cells” section), e.g., by mammalian protease(s), thereby activating the therapeutic agent.

Provided herein, in some embodiments, is use of a therapeutic agent (such as one described herein) or a pharmaceutical composition (such as one described herein) in the preparation of a medicament for the treatment of a disease or condition in a subject. The subject can be selected from mouse, rat, monkey, and human. The subject can be a human. The subject can be determined to have a likelihood of a response to the therapeutic agent or the pharmaceutical composition. The likelihood of the response can be 50% or higher. The likelihood of the response can be determined by a method as described herein (such as one described hereinabove in the METHODS FOR ASSESSING A LIKELIHOOD OF A RESPONSE TO THERAPEUTIC AGENT(S) section). In some embodiments, the disease or condition can be a cancer or an inflammatory or autoimmune disease. In some embodiments, the disease or condition can be a cancer. The cancer can be selected from the group consisting of carcinoma, Hodgkin's lymphoma, and non-Hodgkin's lymphoma, diffuse large B cell lymphoma, follicular lymphoma, mantle cell lymphoma, blastoma, breast cancer, ER/PR+ breast cancer, Her2+ breast cancer, triple-negative breast cancer, colon cancer, colon cancer with malignant ascites, mucinous tumors, prostate cancer, head and neck cancer, skin cancer, melanoma, genito-urinary tract cancer, ovarian cancer, ovarian cancer with malignant ascites, peritoneal carcinomatosis, uterine serous carcinoma, endometrial cancer, cervix cancer, colorectal, uterine cancer, mesothelioma in the peritoneum, kidney cancer, Wilm's tumor, lung cancer, small-cell lung cancer, non-small cell lung cancer, gastric cancer, stomach cancer, small intestine cancer, liver cancer, hepatocarcinoma, hepatoblastoma, liposarcoma, pancreatic cancer, gall bladder cancer, cancers of the bile duct, esophageal cancer, salivary gland carcinoma, thyroid cancer, epithelial cancer, arrhenoblastoma, adenocarcinoma, sarcoma, and B-cell derived chronic lymphatic leukemia. In some embodiments, the disease or condition can be an inflammatory or autoimmune disease. The inflammatory or autoimmune disease can be selected from the group consisting of ankylosing spondylitis (AS), arthritis (for example, and not limited to, rheumatoid arthritis (RA), juvenile idiopathic arthritis (JIA), osteoarthritis (OA), psoriatic arthritis (PsA), gout, chronic arthritis), chagas disease, chronic obstructive pulmonary disease (COPD), dermatomyositis, type 1 diabetes, endometriosis, Goodpasture syndrome, Graves' disease, Guillain-Barre syndrome (GB S), Hashimoto's disease, suppurative scab, Kawasaki disease, IgA nephropathy, idiopathic thrombocytopenic purpura, inflammatory bowel disease (IBD) (for example, and not limited to, Crohn's disease (CD), clonal disease, ulcerative colitis, collagen colitis, lymphocytic colitis, ischemic colitis, empty colitis, Behcet's syndrome, infectious colitis, indeterminate colitis, interstitial Cystitis), lupus (for example, and not limited to, systemic lupus erythematosus, discoid lupus, subacute cutaneous lupus erythematosus, cutaneous lupus erythematosus (such as chilblain lupus erythematosus), drug-induced lupus, neonatal lupus, lupus nephritis), mixed connective tissue disease, morphea, multiple sclerosis (MS), severe muscle Force disorder, narcolepsy, neuromuscular angina, pemphigus vulgaris, pernicious anemia, psoriasis, psoriatic arthritis, polymyositis, primary biliary cirrhosis, relapsing polychondritis, schizophrenia, scleroderma, Sjogren's syndrome, systemic stiffness syndrome, temporal arteritis (also known as giant cell arteritis), vasculitis, vitiligo, Wegener's granulomatosis, transplant rejection-associated immune reaction(s) (for example, and not limited to, renal transplant rejection, lung transplant rejection, liver transplant rejection), psoriasis, Wiskott-Aldrich syndrome, autoimmune lymphoproliferative syndrome, myasthenia gravis, inflammatory chronic rhinosinusitis, colitis, celiac disease, Barrett's esophagus, inflammatory gastritis, autoimmune nephritis, autoimmune hepatitis, autoimmune carditis, autoimmune encephalitis, autoimmune mediated hematological disease, asthma, atopic dermatitis, atopy, allergy, allergic rhinitis, scleroderma, bronchitis, pericarditis, the inflammatory disease is, Alzheimer's disease, Parkinson's disease, amyotrophic lateral sclerosis, inflammatory lung disease, inflammatory skin disease, atherosclerosis, myocardial infarction, stroke, gram-positive shock, gram-negative shock, sepsis, septic shock, hemorrhagic shock, anaphylactic shock, systemic inflammatory response syndrome. Additionally or alternatively, the subject designated, by the method described herein in the section entitled “METHODS FOR ASSESSING A LIKELIHOOD OF A RESPONSE TO THERAPEUTIC AGENT(S),” as being likely to respond to the activatable therapeutic agent (such as one described herein) can be one with an expression profile of biomarker(s) such that, upon administering an activatable therapeutic agent (such as one described herein) to the subject, the activatable therapeutic agent is more likely than not to be cleaved at or near the target tissue(s) or cell(s) (such as described herein in the “Target Tissues or Cells” section), e.g., by mammalian protease(s), thereby activating the therapeutic agent.

EXAMPLES
Example 1. Recombinant Production of an XTENylated Fusion Polypeptide Containing an Exemplary Peptide Substrate

This example illustrates recombinant construction, production, and purification of an XTENylated fusion polypeptide containing an exemplary peptide substrate using the methods disclosed herein.

EXPRESSION: Constructs encoding an XTENylated fusion polypeptide comprising an amino acid sequence of SEQ ID NO: 20 or 22, containing two elastin-based peptide substrates, both of the sequence GPGG-VAAA (SEQ ID NO: 1283) (shown in #527 of Column II of Table A), are expressed in a proprietary E. coli AmE098 strain and partitioned into the periplasm via an N-terminal secretory leader sequence (MKKNIAFLLASMFVFSIATNAYA-) (SEQ ID NO: 3129), which is cleaved during translocation. Fermentation cultures are grown with animal-free complex medium at 37° C.; and the temperature is shifted to 26° C. prior to phosphate depletion. During harvest, fermentation whole broth is centrifuged to pellet the cells. At harvest, the total volume and the wet cell weight (WCW; ratio of pellet to supernatant) is recorded, and the pelleted cells are collected and frozen at −80° C.

RECOVERY: The frozen cell pellet is resuspended in Lysis Buffer (17.7 mM citric acid, 22.3 mM Na₂HPO₄, 75 mM NaCl, 2 mM EDTA, pH 4.0) targeting 30% wet cell weight. The resuspension is allowed to equilibrate at pH 4 then homogenized via two passes at 800±50 bar while output temperature is monitored and maintained at 15±5° C. The pH of the homogenate is confirmed to be within the specified range (pH 4.0±0.2).

CLARIFICATION: To reduce endotoxin and host cell impurities, the homogenate is allowed to undergo low-temperature (10±5° C.), acidic (pH 4.0±0.2) flocculation overnight (15-20 hours). To remove the insoluble fraction, the flocculated homogenate is centrifuged for 40 minutes at 16,900 RCF at 2-8° C., and the supernatant is retained. The supernatant is diluted approximately 3-fold with Milli-Q water (MQ), then adjusted to 7±1 mS/cm with 5 M NaCl. To remove nucleic acid, lipids, and endotoxin and to act as a filter aid, the supernatant is adjusted to 0.1% (m/m) diatomaceous earth. To keep the filter aid suspended, the supernatant is mixed via impeller and allowed to equilibrate for 30 minutes. A filter train, consisting of a depth filter followed by a 0.22 μm filter, is assembled then flushed with MQ. The supernatant is pumped through the filter train while modulating flow to maintain a pressure drop of 25±5 psig. To adjust the composite buffer system (based on the ratio of citric acid and Na₂HPO₄) to the desired range for capture chromatography, the filtrate is adjusted with 500 mM Na₂HPO₄such that the final ratio of Na₂HPO₄to citric acid is 9.33:1, and the pH of the buffered filtrate is confirmed to be within the specified range (pH 7.0±0.2).

Purification

AEX Capture: To separate dimer, aggregate, and large truncates from monomeric product, and to remove endotoxin and nucleic acids, anion exchange (AEX) chromatography is utilized to capture the electronegative C-terminal XTEN domain. The AEX1 stationary phase (GE Q Sepharose FF), AEX1 mobile phase A (12.2 mM Na₂HPO₄, 7.8 mM Na₂HPO₄, 40 mM NaCl), and AEX1 mobile phase B (12.2 mM Na₂HPO₄, 7.8 mM Na₂HPO₄, 500 mM NaCl) are used herein. The column is equilibrated with AEX1 mobile phase A. Based on the total protein concentration measured by bicinchoninic acid (BCA) assay, the filtrate is loaded onto the column targeting 28±4 g/L-resin, chased with AEX1 mobile phase A, then washed with a step to 30% B. Bound material is eluted with a gradient from 30% B to 60% B over 20 CV. Fractions are collected in 1 CV aliquots while A220≥100 mAU above (local) baseline. Elution fractions are analyzed and pooled on the basis of SDS-PAGE and SE-HPLC.

IMAC Intermediate Purification: To ensure C-terminal integrity, immobilized metal affinity chromatography (IMAC) is used to capture the C-terminal polyhistidine tag (His(6)). The IMAC stationary phase (GE IMAC Sepharose FF), IMAC mobile phase A (18.3 mM Na₂HPO₄, 1.7 mM Na₂HPO₄, 500 mM NaCl, 1 mM imidazole), and IMAC mobile phase B (18.3 mM Na₂HPO₄, 1.7 mM Na₂HPO₄, 500 mM NaCl, 500 mM imidazole) are used herein. The column is charged with zinc solution and equilibrated with IMAC mobile phase A. The AEX1 Pool is adjusted to pH 7.8±0.1, 50±5 mS/cm (with 5 M NaCl), and 1 mM imidazole, loaded onto the IMAC column targeting 2 g/L-resin, and chased with IMAC mobile phase A until absorbance at 280 nm (A280) returned to (local) baseline. Bound material is eluted with a step to 25% IMAC mobile phase B. The IMAC Elution collection is initiated when A280≥10 mAU above (local) baseline, directed into a container pre-spiked with EDTA sufficient to bring 2 CV to 2 mM EDTA, and terminated once 2 CV were collected. The elution is analyzed by SDS-PAGE.

Protein-L Intermediate Purification: To ensure N-terminal integrity, Protein-L is used to capture kappa domains present close to the N-terminus of the fusion polypeptide (specifically the aEpCAM scFv). Protein-L stationary phase (GE Capto L), Protein-L mobile phase A (16.0 mM citric acid, 20.0 mM Na₂HPO₄, pH 4.0±0.1), Protein-L mobile phase B (29.0 mM citric acid, 7.0 mM Na₂HPO₄, pH 2.60±0.02), and Protein-L mobile phase C (3.5 mM citric acid, 32.5 mM Na₂HPO₄, 250 mM NaCl, pH 7.0±0.1) are used herein. The column is equilibrated with Protein-L mobile phase C. The IMAC Elution is adjusted to pH 7.0±0.1 and 30±3 mS/cm (with 5 M NaCl and MQ) and loaded onto the Protein-L column targeting 2 g/L-resin then chased with Protein-L mobile phase C until absorbance at 280 nm (A280) returns to (local) baseline. The column is washed with Protein-L mobile phase A, and Protein-L mobile phases A and B are used to effect low-pH elution. Bound material is eluted at approximately pH 3.0 and collected into a container pre-spiked with one part 0.5 M Na₂HPO₄for every 10 parts collected volume. Fractions are analyzed by SDS-PAGE.

HIC Polishing: To separate N-terminal variants (4 residues at the absolute N-terminus are not essential for Protein-L binding) and overall conformation variants, hydrophobic interaction chromatography (HIC) is used. HIC stationary phase (GE Capto Phenyl ImpRes), HIC mobile phase A (20 mM histidine, 0.02% (w/v) polysorbate 80, pH 6.5±0.1) and HIC mobile phase B (1 M ammonium sulfate, 20 mM histidine, 0.02% (w/v) polysorbate 80, pH 6.5±0.1) are used herein. The column is equilibrated with HIC mobile phase B. The adjusted Protein-L Elution is loaded onto the HIC column targeting 2 g/L-resin and chased with HIC mobile phase B until absorbance at 280 nm (A280) returned to (local) baseline. The column is washed with 50% B. Bound material is eluted with a gradient from 50% B to 0% B over 75 CV. Fractions are collected in 1 CV aliquots while A280≥3 mAU above (local) baseline. Elution fractions are analyzed and pooled on the basis of SE-HPLC and HI-HPLC.

FORMULATION: To exchange the product into formulation buffer and to bring the product to the target concentration (0.5 g/L), anion exchange is again used to capture the C-terminal XTEN. AEX2 stationary phase (GE Q Sepharose FF), AEX2 mobile phase A (20 mM histidine, 40 mM NaCl, 0.02% (w/v) polysorbate 80, pH 6.5±0.2), AEX2 mobile phase B (20 mM histidine, 1 M NaCl, 0.02% (w/v) polysorbate 80, pH 6.5±0.2), and AEX2 mobile phase C (12.2 mM Na₂HPO₄, 7.8 mM NaH₂PO₄, 40 mM NaCl, 0.02% (w/v) polysorbate 80, pH 7.0±0.2) are used herein. The column is equilibrated with AEX2 mobile phase C. The HIC Pool is adjusted to pH 7.0±0.1 and 7±1 mS/cm (with MQ) and loaded onto the AEX2 column targeting 2 g/L-resin then chased with AEX2 mobile phase C until A280 returned to (local) baseline. The column is washed with AEX2 mobile phase A (20 mM histidine, 40 mM NaCl, 0.02% (w/v) polysorbate 80, pH 6.5±0.2). AEX2 mobile phases A and B are used to generate an [NaCl] step and effect elution. Bound material is eluted with a step to 38% AEX2 mobile phase B. The AEX2 Elution collection is initiated when A280≥5 mAU above (local) baseline and terminated once 2 CV were collected. The AEX2 Elution is 0.22 μm filtered within a BSC, aliquoted, labeled, and stored at −80° C. as Bulk Drug Substance (BDS). The bulk drug substance (BDS) is confirmed by various analytical methods to meet all lot release criteria. Overall quality is analyzed by SDS-PAGE, the ratio of monomer to dimer and aggregate is analyzed by SE-HPLC, and N-terminal quality and product homogeneity are analyzed by HI-HPLC.

Example 2. Preparation of Plasma Samples

This example illustrates preparation of plasma samples from patients suffering from, or is suspected of suffering from, a disease or condition known to be associated with an elevated level of elastin at or near a diseased site.

Blood is collected from a patient of choice into an EDTA plasma tube and centrifuged for 10 minutes at 4° C. and 3,500 g. Plasma is then aliquoted and flash-frozen on dry ice within 30 minutes of collection. 250 μL aliquots of plasma are later thawed on ice and precipitated with 1 mL of water containing 80% acetonitrile and 1 nanogram (ng) of bovine insulin as an internal standard. The solid phase extraction eluant is transferred and evaporated to dryness, then diluted with 75 μL of water with 0.1% formic acid, thereby obtaining a sample of plasma peptides.

Possible variations in sample preparation, including those for a nano LC/MS, may be found in Kay et al. 2018 (Rapid Communications in Mass Spectrometry 32 (16), 1414-1424, 2018.

Example 3. Liquid Chromatography-Mass Spectrometry (LC-MS)

This example illustrates liquid chromatography-mass spectrometry (LC-MS) methods used to determine the presence and/or amount of biomarker peptides in plasma samples from subjects using the methods disclosed herein.

50 μL of the plasma peptides as obtained according to Example 2 is injected into a liquid chromatography-mass spectrometry (LC-MS) system with a high flow configuration. Two buffers, buffer A (0.1% formic acid in water) and buffer B (0.1% formic acid in 80:20 acetonitrile/water), for liquid chromatography (LC) separations are prepared. 50 μL of sample extract is injected into a HSS T3 column (2.1×50 mm) at 15% buffer A and 85% buffer B with a flow rate of 300 μL/min, then separated to 40% buffer B using a 6.5 minute gradient. The column is then washed at 90% buffer B for 1.5 minutes and returned to initial conditions after 8 minutes. A scan from 600 mass per charge (m/z) to 1,600 m/z is conducted for information-dependent acquisition using a resolution of 75,000, a maximum fill time of 200 ms, and an automatic gain control of 3×10⁶.

Peptides are identified using Peaks 8.0 software searched against the human Swissprot database. The search configuration includes precursor and product ion tolerances of 10 ppm and 0.05 Da (respectively), the no-digest setting, a false discovery rate threshold of 1%, and allowance of modifications such as C-terminal amidation.

Example 4. Matrix-Assisted Laser Desorption/Ionization-Time of Flight (MALDI-TOF) Mass Spectrometry

This example illustrates matrix-assisted laser desorption/ionization-time of flight (MALDI-TOF) mass spectrometry methods used to determine the presence and/or amount of biomarker peptides in plasma samples from subjects using the methods disclosed herein.

As an alternative to Example 3, plasma peptides obtained according to Example 2 is isolated by loading plasma samples, mixed in a 3:1 ratio with a solution of 20% acetonitrile and 1% trifluoroacetic acid, onto nanoporous silica chips for analysis by a matrix-assisted laser desorption/ionization-time of flight (MALDI-TOF) mass spectrometer, as described in details in Bedin et al. 2015 (J Cell Physiol., 231(4):915-25). The plasma peptides are identified using Mascot and MS-Tag search engines with preprocessing steps performed by flexAnalysis and Snap™ softwares. The presence or/and amount of the plasma peptides having (i) a sequence of GVAPGIGPGG (shown in #527 of column IV of Table A), or (ii) a sequence of VAAAAKSAAK (SEQ ID NO. 3116; shown in #527 of column VI of Table A) (or a fragment thereof) is determined.

Example 5. Enzyme-Linked Immunosorbent Assay (ELISA)

This example illustrates immunoassay methods used to determine the presence and/or amount of biomarker peptides in plasma samples from subjects using the methods disclosed herein.

Capture antibodies specific to one or more biomarker(s) of (i) a sequence of GVAPGIGPGG (SEQ ID NO: ) (shown in #527 of column IV of Table A), (ii) a sequence of VAAAAKSAAK (SEQ ID NO: ) (shown in #527 of column VI of Table A), and (iii) a sequence of GPGGVAAA (SEQ ID NO: ) (shown in #527 of column II of Table A) (or a fragment thereof) are obtained.

The plasma sample obtained according to Example 2 is diluted and the plasma concentrations of the biomarker peptide(s) are measured using a competitive ELISA. Primary antibody (unlabeled) is incubated with sample antigen. Antibody-antigen complexes are then added to 96-well plates which are pre-coated with the same antigen. Unbound antibody is removed by washing the plate. (The more antigen in the sample, the less antibody will be able to bind to the antigen in the well, hence “competition.”) The secondary antibody that is specific to the primary antibody and conjugated with an enzyme is added. A substrate is added, and remaining enzymes elicit a chromogenic or fluorescent signal.

Example 6. Patient Designations

This example illustrates designating patients as being likely to respond to activatable therapeutic agents using the methods disclosed herein.

The presence or/and amount of biomarker peptide(s) as determined according to one of Examples 3-5 is analyzed manually or with semi-automated/automated procedures/instruments. If the biomarker peptide(s) is/determined to be present in the plasma sample from the patient, or if the amount of biomarker peptide(s) of the patient is determined to exceed a pre-determined threshold, the patient is designated as having a likeliness of more than 50% to respond to the therapeutic agent constructed and produced according to Example 1 which comprises the elastin-based peptide substrate (shown in #527 of Column II of Table A) in its release segment.

Example 7. Assessment of Protease Cleavage of Release Segments Having Collagen I Derived Amino Acid Sequences

This invention provides non-natural, activatable therapeutic agents (e.g. XPATs) wherein a biologically active moiety (BM) is preferentially released at a target site associated with expression of a mammalian protease that cleaves a scissile bond in a release segment linked directly or indirectly to the BM. Successful therapeutic use of these agents in an individual depends on whether the agent comprises a release segment linked directly or indirectly to the BM that is cleaved by a mammalian protease expressed at a target site in that individual. An assessment of whether an individual having a target site to be targeted for delivery and release of the BM expresses a mammalian protease that cleaves a release segment can be valuable in identifying and matching therapeutically effective agents for a particular individual. Achieving such a beneficial assessment is dependent on determining the relative efficiency of cleavage of release segment sequences by mammalian proteases known to be expressed at therapeutic target sites, such as tumors and inflammatory sites.

Set forth in this example are the results of experiments that demonstrated unmasking rates of ECP-based release sites. The substrates 818-P1, C1MA, and C1 MB were digested by proteases and cleavage rates measured.

Protease digestion was performed under varying conditions and were based on comparison of 818-C1MA and 818-C1 MB to 818-P1 digestion. Substrate (1 μM) was digested at 37° C. with MMPs for two hours, Legumain and ST14 for four hours, or Urokinase-type Plasminogen Activator (uPA) for 6 hours as shown in Table 8. Digestion buffers varied in composition and enzyme concentration, MMP (5 nM), Legumain, ST14 (50 nM) and uPA (100 nM). Cleavage of 818-P1, C1MA and C1 MB at lysine/leucine residues similar to collagen (a known component of the extracellular matrix, ECM) are demonstrated in FIG. 9.

Results demonstrated that MMP 2, 7, and 9 unmasked 818-P1 faster than 818-C1MA and 818-C1 MB (MMP2: 818-P1>818-C1MA>818-C1 MB; MMP1: 818-P1>818-C1MA=818-C1 MB; MMP9: 818-P1>818-C1 MB>818-C1MA). Legumain and ST 14 required a higher concentration and longer time for unmasking. Legumain demonstrated minimal unmasking differences whereas ST14 unmaking was characterized by 818-C1MA>818-P1>818-C1 MB. Unmasking activity attributable to uPA required higher concentrations of proteases and longer digestion times.

Proteases expressed during cancer growth and metastasis remodel the ECM and can lead to elevated plasma levels of ECM protein cleavage products that are elevated in the plasma of patients with a wide variety of tumors. The current example demonstrates that a cleavage product resulting from MMP cleavage of an ECM protein is highly similar to the MMP cleavage site in protease-cleavable linkers in XPATs. These results demonstrated that the protease cleavable linker employed in the XPATs of this invention are more efficiently cleaved than the ECM by purified MMPs and that the presence of ECM peptides in cancer patients can serve as an indicator that the patients' tumors are expressing MMPs that can cleave the protease-cleavable linker in an XPAT, thereby predicting whether a given patient or tumor will be able to cleave the XPAT and hence result in treatment of the tumor. This allows for a personalized approach to determine whether an XPAT will be cleaved in a given tumor type by determining whether the subject that has said tumor type has elevated plasma levels of certain cleavage product(s) derived from the extracellular matrix.

TABLE 8

Protease Sources and Partial Digest Conditions

Protease

Conc
Time

Protease
(nM)
(hr)
Digest Buffer

MMP2
5
2
20 mM Histidine, 154 mM NaCl,

0.005% PS-80, 10 mM CaCl2, pH 6.5

MMP7
5
2
20 mM Histidine, 154 mM NaCl,

0.005% PS-80, 10 mM CaCl2, pH 6.5

MMP9
5
2
20 mM Histidine, 154 mM NaCl,

0.005% PS-80, 10 mM CaCl2, pH 6.5

Legumain
50
4
50 mM MES, 250 mM NaCl, pH 5.0

ST14/
50
4
20 mM Histidine, 154 mM NaCl,

Matriptase

0.005% PS-80, 10 mM CaCl2, pH 6.5

uPA
100
6
50 mM Tris-HCl, pH 8.0

Trypsin
N/A (~20 μL
0.5-2
PBS

(immo-
slurry/

bilized)
100 μL)

TABLE 9

Protease Cleavage Release Segment Sequences

Name
SEQ ID NO
Sequence

Collagen I
3124
GADGSPGKDGVRGLTGPIGPPGP

818-NonClv
3225
APTTGEAGEAAGATSAGATGPATSGS

AMX-818
3126
GGSAPEAGRSANHTPAGLTGPATSGS

AC2566
3127
GGSAPEAGRSANHGVRGLTGPATSGS

AC2567
3128
GGSAPEAGSPGKDGVRGLTGPATSGS

While preferred embodiments of the present invention have been shown and described herein, it will be obvious to those skilled in the art that such embodiments are provided by way of example only. It is not intended that the invention be limited by the specific examples provided within the specification. While the invention has been described with reference to the aforementioned specification, the descriptions and illustrations of the embodiments herein are not meant to be construed in a limiting sense. Numerous variations, changes, and substitutions will now occur to those skilled in the art without departing from the invention. Furthermore, it shall be understood that all aspects of the invention are not limited to the specific depictions, configurations or relative proportions set forth herein which depend upon a variety of conditions and variables. It should be understood that various alternatives to the embodiments of the invention described herein may be employed in practicing the invention. It is therefore contemplated that the invention shall also cover any such alternatives, modifications, variations or equivalents. It is intended that the following claims define the scope of the invention and that methods and structures within the scope of these claims and their equivalents be covered thereby.

	Number	Date	Country
Parent	PCT/US2021/042426	Jul 2021	US
Child	18068872		US

COMPOSITIONS AND METHODS RELATED TO ACTIVATABLE THERAPEUTIC AGENTS

Information

Publication Number

Date Filed

Date Published

Inventors

Original Assignees

CPC

International Classifications

Abstract

Description

Claims

RELATED APPLICATIONS

Provisional Applications (1)

Continuations (1)