Activatable Therapeutic Peptides and Uses Thereof

Information

  • Patent Application
  • 20230365635
  • Publication Number
    20230365635
  • Date Filed
    March 10, 2023
    a year ago
  • Date Published
    November 16, 2023
    5 months ago
Abstract
Disclosed herein are fusion protein, comprising (a) at least one X1 domain comprising a half-life extension compound, including but not limited to a half-life extension polypeptide; (b) at least one X2 domain comprising an anionic block; (c) at least one X3 domain comprising a linker susceptible to cleavage at a site of disease, including but not limited to a microbial infection site or tumor site; and (d) at least one X4 domain comprising a therapeutic peptide; wherein the at least one X1, X2, X3, and X4 domains are covalently linked, and each X4 domain is linked to an X3 domain without an intervening X1 or X2 domain; compositions containing such fusion proteins, and methods for their use.
Description
SEQUENCE LISTING STATEMENT

A computer readable form of the Sequence Listing is filed with this application by electronic submission and is incorporated into this application by reference in its entirety. The Sequence Listing is contained in the file created on Mar. 1, 2023 having the file name “22-0288-US.xml” and is 445,171 bytes in size.


BACKGROUND

Peptides represent an important class of therapeutics due to their diverse functionalities, ease of synthesis, and improved screening/design technologies. Nonetheless, when administered systemically, peptide therapeutics often face the challenges of short circulation half-life and insufficient bioavailability at the target diseased site. Further, membranolytic peptide therapeutics such as antimicrobial peptides (AMPs) and anticancer peptides (ACPs), which disrupt bacteria membrane and cancer cell membrane respectively, tend to elicit off-target toxicity, limiting their utility beyond local administration. To unleash clinical potential of these therapeutics, there is a great need in development of formulations to overcome these barriers to systemic administration of therapeutic peptides.


SUMMARY

In one aspect, the disclosure provides fusion proteins, comprising:

    • (a) at least one X1 domain comprising a half-life extension compound, including but not limited to a half-life extension polypeptide;
    • (b) at least one X2 domain comprising an anionic block
    • (c) at least one X3 domain comprising a linker susceptible to cleavage at a site of disease, including but not limited to a microbial infection site or tumor site; and
    • (d) at least one X4 domain comprising a therapeutic peptide;
    • wherein the at least one X1, X2, X3, and X4 domains are covalently linked, and each X4 domain is linked to an X3 domain without an intervening X1 or X2 domain.


In one embodiment, the fusion protein comprises a linear fusion protein. In another embodiment, each X4 domain is linked to at least one X3 domain without an intervening X1 or X2 domain. In another embodiment, an X4 domain is present at one terminus of the fusion protein. In one embodiment, the fusion protein includes only 1 X2 domain. In another embodiment, the fusion protein includes only 1 X4 domain. In another embodiment, the fusion protein comprises 1, 2, or 3 X1 domains. In a further embodiment, the fusion protein comprises 2 or 3 X1 domains, and wherein at least 2 X1 domains are linked without an intervening X2, X3, or X4 domain. In one embodiment, the fusion protein comprises 1, 2, 3, or 4 X3 domains. In another embodiment, the fusion protein comprises 2, 3, or 4 X3 domains, and wherein at least 2 X3 domains are linked without an intervening X1, X2, or X4 domain, and optionally wherein at least one X1 domain is linked to an X2 domain without an intervening X3 or X4 domain. In a further embodiment, the fusion protein further comprises at least one X5 domain comprising a targeting polypeptide.


In another embodiment, the fusion protein comprises a branched fusion protein. In one embodiment, an X4 domain is present at a terminus of the fusion protein. In a further embodiment, a branch of the fusion protein comprises at least one X3 domain. In another embodiment, a branch of the fusion protein comprises at least one X3 domain linked to an X4 domain. In one embodiment, the branched fusion protein further comprises at least one X5 domain comprising a targeting polypeptide. In another embodiment, a branch of the fusion protein is linked to the primary fusion protein backbone at a location between an X2 domain and one of an X1, X2, X3, or X5 domain. In various embodiments, the fusion protein includes only 1 X2 domain; the fusion protein includes only 1 X4 domain; and/or the fusion protein comprises 1 or 2 X1 domains. In one embodiment, the branched fusion protein comprises 2 X1 domains, and wherein the 2 X1 domains are linked without an intervening X2, X3, or X4 domain. In another embodiment, the branched fusion comprises 1, 2, 3, or 4 X3 domains.


In another embodiment of any of the fusion proteins herein, each X1 domain independently comprises a half-life extension compound selected from the group consisting of an albumin-binding polypeptide, an antibody/Fc domain (such as human Fc or mouse Fc), an unstructured XTEN polypeptide, a proline/alanine-rich sequence polypeptide (PAS), and poly(ethylene glycol). In another embodiment, each X1 domain independently comprises an albumin or albumin-binding polypeptide, including but not limited to human serum albumin, mouse serum albumin and albumin-binding domain. In a further embodiment, each X1 domain independently comprises the amino acid sequence each X1 domain independently comprises the amino acid sequence selected from the group consisting of SEQ ID NO:1-5.


In one embodiment, each X2 domain independently comprises an amino acid sequence selected from the group consisting of

    • (E/D)x, x=1-20;
    • (E/D)x(G/S/A)x, x=1-20;
    • ((E/D)(E/D)(G/S/A))x, x=1-20;
    • ((E/D)(G/S/A)(E/D))x, x=1-20;
    • (E/D)x(G/S/A)x(E/D)x, x=1-20; and
    • (E/D)x((E/D)K)x, x=1-20.


In another embodiment, each X2 domain independently comprises the amino acid sequence (EEG)x, wherein “x” is 1-20, 2-16, 3-12, 4-10, 5-8, 1-15, 1-10, 2-10, 3-10, 4-10, 5-10, 2-8, 3-8, 4-8, 5-8, 5-7, 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10.


In a further embodiment, each X3 domain independently comprises an amino acid sequence selected from the linker amino acid sequences selected from SEQ ID NO: 6-70, 171, and 173-187 and/or the non-peptide linkers shown in Table 4 (S86-S96).


In one embodiment, at least one, or each, X4 domain comprises a cationic therapeutic peptide, an anti-microbial peptide, an anti-cancer peptide, and/or a hydrophobic therapeutic peptide. In a further embodiment, at least one, or each, X4 domain comprises an amino acid sequence selected from the group consisting of SEQ ID NO:74-140.


In another embodiment,

    • (a) each X1 domain comprises the amino acid sequence of SEQ ID NO:4 or 5;
    • (b) each X2 domain comprises the amino acid sequence (EEG)x, wherein “x” is 1-20, 2-16, 3-12, 4-10, 5-8, 1-15, 1-10, 2-10, 3-10, 4-10, 5-10, 2-8, 3-8, 4-8, 5-8, 5-7, 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10;
    • (c) each X3 domain comprises the amino acid sequence of SEQ ID NO: 16; and
    • (d) each X4 domain comprises the amino acid sequence selected from the group consisting of SEQ ID NO:137-140.


In a further embodiment, the X5 domain, when present, comprises the amino acid sequence selected from the group consisting of SEQ ID NO:141-155 and 188-192. In a further embodiment, the fusion protein comprises a structure selected from SEQ ID NO:156-170 and 193-204, wherein any detectable labels are optional.


In another embodiment, the disclosure provides compositions, comprising a plurality of fusion proteins according to any embodiment herein. In some embodiments, the composition further comprises a pharmaceutically acceptable carrier.


In one embodiment, the fusion protein is genetically encodable. In other embodiments, the disclosure provides nucleic acids encoding a genetically encodable fusion protein herein, expression vectors comprising the nucleic acid operatively linked to a suitable regulatory sequence, and host cells comprising the nucleic acid and/or the expression vector.


The disclosure also provides methods for treating a microbial infection or cancer in a subject, comprising administering to the subject an amount effective to treat the microbial infection or cancer of the fusion protein, composition, nucleic acid, expression vector, or host cell of any embodiment herein.





DESCRIPTION OF THE FIGURES


FIG. 1. Conceptual design of activatable antimicrobial peptides. Desirable features of pro-AMP therapeutics include (1) long circulation with masked activity, (2) accumulation at infection site either by passive or active targeting, (3) activation by diseased site microenvironmental trigger (e.g. infection site proteases), and (4) exhibition of on-target therapeutic activity.



FIG. 2. Design and synthesis of activatable AMP. (A) Components of the therapeutic (1) half-life extension domain (e.g. albumin-binding domain), (2) anionic block, (3) cleavable linker, and (4) therapeutic payload (AMP). (B) Synthesis of ABD-AMP conjugate (from top to bottom SEQ ID NO: 171, 104, 133).



FIG. 3. In vitro evaluation of a model ABD-AMP conjugate. (A) Schematic of (ABD)2-(EEG)6-S12-(D)Pex. (B) Antibacterial assay on PAO1 treated with conjugate with and without thrombin pre-cleavage. OD600 indicates extent of bacteria viability. (C) MTS-based mammalian toxicity evaluation of L929 fibroblasts. (D) Hemolysis assay on mouse red blood cells.



FIG. 4. In vivo pharmacokinetic and biodistribution evaluations of a model ABD-AMP conjugate. (A) Schematic of (ABD)2-(EEG)6-S12-(D)Pex-Cy7. (B) Pharmacokinetics of ABD-AMP-Cy7 conjugate and a free AMP-Cy7 control. (C) Experimental timeline for time-lapse biodistribution and conjugate activation evaluation in a PAO1 lung infection model. Quantification of intact and active AMP contents in (D) lungs, (E) kidneys, (F) liver, and (G) spleen at different time points. (H) Fold change in AUC of active (D)Pex-Cy7 of the conjugate group versus free AMP control group. (I) Estimated fold change at equivalent lung exposure of active AMP content.



FIG. 5. In vivo toxicity evaluation of a model ABD-AMP conjugate. (A) Treatment groups and their body conditions. (B) Serum chemistry analysis of ALT, AST, and BUN. (C) H&E staining of spleen and kidneys sections. Arrows indicate protein casts or apoptotic bodies.



FIG. 6. Evaluation of activity masking for alternative AMPs and alternative application. Antibacterial assay on PAO1 treated with (A) (ABD)2-(EEG)6-512-POL7080 and (B) (ABD)2-(EEG)6-S12-Colistin with and without thrombin pre-cleavage. OD600 indicates extent of bacteria viability. (C) Evaluation of anticancer activity masking of (ABD)2-(EEG)6-S12-(D)Pex on HepG2 cell line by an MTS assay.



FIG. 7. Biodistribution of a targeted ABD-AMP conjugate. (A) Components of the targeted therapeutic (1) half-life extension domain (e.g. albumin-binding domain), (2) anionic block, (3) cleavable linker, (4) therapeutic payload (AMP), and (5) targeting domain. (B) Experimental timeline biodistribution and conjugate activation evaluation in a PAO1 lung infection model. (C) Quantification of intact and active AMP contents in the infected lungs for the targeted conjugate VHH3-(ABD)2-(EEG)6-POL7080-Cy7, the non-targeted control, and the free POL7080-Cy7 control.





DETAILED DESCRIPTION

As used herein, the amino acid residues are abbreviated as follows: alanine (Ala; A), asparagine (Asn; N), aspartic acid (Asp; D), arginine (Arg; R), cysteine (Cys; C), glutamic acid (Glu; E), glutamine (Gln; Q), glycine (Gly; G), histidine (His; H), isoleucine (Ile; I), leucine (Leu; L), lysine (Lys; K), methionine (Met; M), phenylalanine (Phe; F), proline (Pro; P), serine (Ser; S), threonine (Thr; T), tryptophan (Trp; W), tyrosine (Tyr; Y), and valine (Val; V).


In all embodiments of polypeptides disclosed herein, any N-terminal methionine residues are optional (i.e.: the N-terminal methionine residue may be present or may be absent).


As used herein and unless otherwise indicated, the terms “a” and “an” are taken to mean “one”, “at least one” or “one or more”. Unless otherwise required by context, singular terms used herein shall include pluralities and plural terms shall include the singular.


Unless the context clearly requires otherwise, throughout the description and the claims, the words ‘comprise’, ‘comprising’, and the like are to be construed in an inclusive sense as opposed to an exclusive or exhaustive sense; that is to say, in the sense of “including, but not limited to”. Words using the singular or plural number also include the plural or singular number, respectively. Additionally, the words “herein,” “above” and “below” and words of similar import, when used in this application, shall refer to this application as a whole and not to any particular portions of this application.


All embodiments of any aspect of the invention can be used in combination, unless the context clearly dictates otherwise.


In one embodiment, the disclosure provides fusion protein, comprising:

    • (a) at least one X1 domain comprising a half-life extension compound, including but not limited to an albumin or albumin-binding polypeptide;
    • (b) at least one X2 domain comprising an anionic block
    • (c) at least one X3 domain comprising a linker susceptible to cleavage at a site of disease, including but not limited to a microbial infection site or tumor site;
    • (d) at least one X4 domain comprising a therapeutic peptide;
    • wherein the at least one X1, X2, X3, and X4 domains are covalently linked, and each X4 domain is linked to an X3 domain without an intervening X1 or X2 domain.


The fusion proteins disclosed herein comprise pro-therapeutic peptides based on a half-life extension compound-therapeutic peptide conjugate which is long-circulating and has a masked biological activity that can be activated upon cleavage of the linker.


As used herein, “fusion protein” requires at least some polypeptide components, but may include non-polypeptide components as well, as disclosed herein. The fusion protein may comprise one each of X1, X2, X3, and X4 domains, or may independently comprise 2, 3, 4, or more such domains.


In one embodiment, the fusion protein comprises a linear fusion protein (i.e., non-branched). In one embodiment, each X4 domain (therapeutic peptide) is linked to at least one X3 domain (cleavage-susceptible linker) without an intervening X1 or X2 domain. This embodiment enables activation of therapeutic peptide X4 from the fusion protein. In this embodiment, the X3-X4 linkage may be direct (with no linker), or may include a linker, such as an amino acid linker, between the X3 and X4 domains, but cannot include an X1 or X2 domain between the X3 and X4 domains. Other embodiments disclosed herein regarding such linkage between two domains without any intervening other domains can similarly be direct (with no linker), or may include a linker, such as an amino acid linker.


In another embodiment, an X4 domain (therapeutic peptide) is present at one terminus of the fusion protein. As used herein, “at one terminus” means that there are no X1, X2, X3, or X5 (discussed below) domains between the terminus and the X4 domain at the terminus. In this embodiment, there may be other moieties present between the X4 domain and the terminus, but none of the defined X1, X2, X3, or X5 domains. In one embodiment, the X4 domain may be present at an amino terminus of the linear fusion protein. In another embodiment, the X4 domain may be present at the carboxy terminus of the linear fusion protein. In a further embodiment, the fusion protein includes only 1 X4 domain.


In one embodiment, the linear fusion proteins comprise comprising 1, 2, or 3 X1 domains (half-life extension compounds). In another embodiment, the fusion protein comprises 2 or 3 X1 domains, wherein at least 2 X1 domains are linked without an intervening X2, X3, or X4 domain. This embodiment may be helpful in tuning biological activity masking and half-life of therapeutics.


In a further embodiment, the linear fusion proteins comprise 1, 2, 3, or 4 X3 domains (linkers susceptible to cleavage at a site of disease). In one embodiment, the linear fusion protein comprises 2, 3, or 4 X3 domains, wherein at least 2 X3 domains are linked without an intervening X1, X2, or X4 domain. Multiple X3 domains enable tuning of therapeutic peptide activation.


In another embodiment of the linear fusion proteins, at least one X1 domain is linked to an X2 domain without an intervening X3 or X4 domain.


The fusion proteins may further comprise at least one X5 domain comprising a targeting polypeptide. As used herein, a “targeting polypeptide” is any polypeptide that can serve to direct the fusion protein to a desired cell or tissue location.


In various non-limiting embodiments, the linear fusion proteins comprise a general formula selected from the group consisting of:

    • X1-X2-X3-X4;
    • X4-X3-X2-X1;
    • X1-X1-X2-X3-X4;
    • X4-X3-X2-X1-X1;
    • X1-X1-X2-X3-X3-X4;
    • X1-X1-X2-X3-X3-X3-X4;
    • X1-X3-X1-X2-X3-X4;
    • X1-X1-X3-X2-X3-X4;
    • X1-X3-X1-X3-X2-X3-X4;
    • X1-X3-X1-X2-X3-X3-X4;
    • X1-X1-X3-X2-X3-X3-X4;
    • X1-X3-X1-X3-X2-X3-X3-X4;
    • X5-X1-X2-X3-X4;
    • X1-X5-X2-X3-X4;
    • X4-X3-X2-X1-X5;
    • X4-X3-X2-X5-X1;
    • X1-X5-X1-X2-X3-X4;
    • X5-X1-X1-X2-X3-X4;
    • X5-X5-X1-X1-X2-X3-X4;
    • X5-X1-X5-X1-X2-X3-X4;
    • X1-X5-X1-X5-X2-X3-X4;
    • X1-X5-X1-X2-X3-X3-X4;
    • X5-X1-X1-X2-X3-X3-X4;
    • X5-X1-X1-X3-X2-X3-X3-X4;
    • X5-X5-X1-X1-X2-X3-X3-X4;
    • X5-X5-X1-X1-X2-X3-X3-X3-X4;
    • X1-X1-X5-X2-X3-X3-X4;
    • X1-X1-X5-X5-X3-X2-X3-X3-X4;
    • X1-X1-X3-X5-X5-X2-X3-X3-X4;
    • X1-X1-X3-X5-X5-X3-X2-X3-X3-X4;
    • X5-X5-X1-X1-X3-X2-X3-X3-X4;
    • X5-X5-X5-X1-X1-X2-X3-X3-X4;
    • X5-X5-X5-X1-X1-X3-X2-X3-X3-X4;
    • X5-X1-X5-X1-X2-X3-X3-X4;
    • X1-X5-X1-X5-X2-X3-X3-X4; and
    • X5-X1-X5-X1-X5-X1-X2-X3-X3-X4.


In another embodiment, the fusion protein comprises a branched fusion protein (i.e., at least one domain is covalently linked to a side chain of an amino acid along a protein backbone using a chemical crosslinker). The branched covalent linkage may be accomplished using any conjugation technique suitable for a specific branched fusion protein. In one non-limiting embodiment, the branched covalent linkage may be accomplished via chemical linkages between Cys-Maleimide and DBCO-azide, as exemplified herein.


All disclosure relative to linear fusion protein domains and linkages above are equally relevant to branched fusion proteins unless the context clearly indicates otherwise.


In one embodiment, an X4 domain is present at a terminus of the branched fusion protein. In another embodiment, a branch of the fusion protein comprises at least one X3 domain. In another embodiment, a branch of the fusion protein comprises at least one X3 domain linked to an X4 domain. Flexible positioning of X3 domains ensure optimal activation at the disease site. In another embodiment, the branched fusion protein further comprises at least one X5 domain comprising a targeting polypeptide.


In a further embodiment, a branch of the fusion protein is linked to the primary fusion protein backbone at a location between an X2 domain and one of an X1, X2, X3, or X5 domain. The embodiment ensures proximity of X2 to X4 for activity masking via electrostatic interaction.


In one embodiment, the branched fusion protein includes only 1 X2 domain. In another embodiment, the branched fusion protein includes only 1 X4 domain.


In a further embodiment, the branched fusion protein comprises 1 or 2 X1 domains. In another embodiment, the branched fusion protein comprises 2 X1 domains, wherein the 2 X1 domains are linked without an intervening X2, X3, or X4 domain. In one embodiment, the branched fusion protein comprises 1, 2, 3, or 4 X3 domains.


In various non-limiting embodiments, the branched fusion proteins comprise a general formula selected from the group consisting of formula 1-34 s shown in Table 1.









TABLE 1





Exemplary branched fusion protein formulae
















1


embedded image







2


embedded image







3


embedded image







4


embedded image







5


embedded image







6


embedded image







7


embedded image







8


embedded image







9


embedded image







10


embedded image







11


embedded image







12


embedded image







13


embedded image







14


embedded image







15


embedded image







16


embedded image







17


embedded image







18


embedded image







19


embedded image







20


embedded image







21


embedded image







22


embedded image







23


embedded image







24


embedded image







25


embedded image







26


embedded image







27


embedded image







28


embedded image







29


embedded image







30


embedded image







31


embedded image







32


embedded image







33


embedded image







34


embedded image







35


embedded image











The X1 domains may comprise any compound that can serve to increase the half-life of the fusion protein upon administration to a subject. In one embodiment, each X1 domain independently comprises a half-life extension compound selected from the group consisting of an albumin-binding polypeptide, an antibody/Fc domain (such as human Fc or mouse Fc), an unstructured XTEN polypeptide (see, for example, Nature Biotechnology volume 27, pages 1186-1190 (2009), incorporated herein by reference), a proline/alanine-rich sequence polypeptide (PAS), and/or poly(ethylene glycol). In one embodiment, the half-life extension compound is a half-life extension polypeptide. In embodiments where the fusion protein comprises more than one X1 domain, each X1 domain may independently be the same or different. In another embodiment, each X1 domain independently comprises an albumin or albumin-binding polypeptide, including but not limited to human serum albumin, mouse serum albumin and/or albumin-binding domain. In another embodiment, each X1 domain independently comprises the amino acid sequence selected from

    • (ABD035): LAEAKVLANRELDKYGVSDFYKRLINKAKTVEGVEALKLHILAALP (SEQ ID NO: 1);
    • (ABD(DSA))-LAEAKVLANRELDKYGVSDYYKNLIDNAKSAEGVKALIDEILAALP (SEQ ID NO: 2);
    • Albumin-binding SSo7d M11.1.3; ATVKSTYRGEEKQVDISKIKWVIRWGQHLAFKYDEGGGAAGYGWVSEKDAPKELLQMLEKQGGGGSGGGGS GGGGSATVKSTYRGEEKQVDISKIKWVIRWGQHLAFKYDEGGGAAGYGWVSEKDAPKELLQMLEKQ (SEQ ID NO: 3); and
    • (MG)LKEAKEKA IEELKKAGIT SDYYFDLINK AKTVEGVNAL KDEILKA (SEQ ID NO: 4),


wherein the residues in parentheses are optional and may be present or absent. In one embodiment, each X1 domain independently comprises the amino acid sequence LKEAKEKA IEELKKAGIT SDYYFDLINK AKTVEGVNAL KDEILKA (SEQ ID NO: 5).


Any X2 domain can be used that provides an anionic block as suitable for an intended use. In one embodiment, the anionic block is may be combined with cationic therapeutic peptides (X4), including but not limited to antimicrobial peptides or anticancer peptides. In other embodiments, the anionic block may serve as a solubility enhancer for hydrophobic therapeutic peptides. In one embodiment, each X2 domain independently comprises an amino acid sequence selected from those in Table 2, wherein residues within parentheses are optional amino acids at that position.









TABLE 2







Exemplary anionic block formulae









Anionic block formula














A
(E/D)x, x = 1-20



B
(E/D)x(G/S/A)x, x = 1-20



C
((E/D)(E/D)(G/S/A))x, x = 1-20



D
((E/D)(G/S/A)(E/D))x, x = 1-20



E
(E/D)x(G/S/A)x(E/D)x, x = 1-20



F
(E/D)x((E/D)K)x, x = 1-20










In one embodiment, each X2 domain independently comprises the amino acid sequence (EEG)x, wherein “x” is 1-20, 2-16, 3-12, 4-10, 5-8, 1-15, 1-10, 2-10, 3-10, 4-10, 5-10, 2-8, 3-8, 4-8, 5-8, 5-7, 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10.


Any linker susceptible to cleavage at a site of disease can be used as the X3 domain(s), as appropriate for an intended use. In one embodiment, each X3 domain independently comprises an amino acid sequence selected from the linker amino acid sequences listed in Table 3 (SEQ ID NO:6-70, 171, and 173-187), and/or the non-peptide linkers shown in Table 4 (S86-S96) In embodiments with more than one X3 domain, each X3 domain may be the same or different.









TABLE 3







Exemplary cleavable linkers










Name
Sequence
Name






S1
PLGVRGKLVPR (SEQ ID NO:
S36
PLAQAVRSS (SEQ ID NO: 38)



171)







S2
PLG-(4-iodo-Phe)-GAR (SEQ
S37
LAQAVRS (SEQ ID NO: 39)



ID NO: 6)







S3
RSLSRLTA (SEQ ID NO: 7)
S38
LAQAFTS (SEQ ID NO: 40)





S4
GLGRG (SEQ ID NO: 8)
S39
LAAAVVS (SEQ ID NO: 41)





S5
GAGLG (SEQ ID NO: 9)
S40
KIEAVKS (SEQ ID NO: 42)





S6
Nle(O-Bzl)-Met(O)2-Oic-Abu
S41
PRAEALKG (SEQ ID NO: 43)



(SEQ ID NO: 10)







S7
LVPRG (SEQ ID NO: 11)
S42
PRYEAYKMG (SEQ ID NO: 44)





S8
PLGVRGK (SEQ ID NO: 12)
S43
PRYEAYK (SEQ ID NO: 45)





S9
P-(Cha)-G-Cys(Me)-HA (SEQ
S44
PRAAAVKS (SEQ ID NO: 46)



ID NO: 13)







S10
PVPLSLVM (SEQ ID NO: 14)
S45
PTTSALKG (SEQ ID NO: 47)





S11
PVGLIGG (SEQ ID NO: 15)
S46
SLPVQDS (SEQ ID NO: 48)





S12
PLGLRSW (SEQ ID NO: 16)
S47
GLTLPVE (SEQ ID NO: 49)





S13
VLK
S48
YKIEAVK (SEQ ID NO: 50)





S14
PQGIWGQ (SEQ ID NO: 17)
S49
VHHQKLV (SEQ ID NO: 51)





S15
KPISLISS (SEQ ID NO: 18)
S50
TNMKHMA (SEQ ID NO: 52)





S16
GGP
S51
KTNMKHM (SEQ ID NO: 53)





S17
ILSRIV (SEQ ID NO: 19)
S52
VVSTQLI (SEQ ID NO: 54)





S18
KPILFFRL (SEQ ID NO: 20)
S53
QETNRSF (SEQ ID NO: 55)





S19
KAFRRSG (SEQ ID NO: 21)
S54
GPARQYY (SEQ ID NO: 56)





S20
RQRRALEK (SEQ ID NO: 22)
S55
YGSLPQK (SEQ ID NO: 57)





S21
FSRPFR (SEQ ID NO: 23)
S56
PPVAASS (SEQ ID NO: 58)





S22
TTFYRRGA (SEQ ID NO: 24)
S57
RSANAK (SEQ ID NO: 59)





S23
ARLYSRG (SEQ ID NO: 25)
S58
V-(Cit)





S24
KLRSSKQ (SEQ ID NO: 26)
S59
V-(Cit)-E





S25
TSVLMAAPQ (SEQ ID NO: 27)
S60
GFLG (SEQ ID NO: 60)





S26
VGPSQG (SEQ ID NO: 28)
S61
LPETG (SEQ ID NO: 61)





S27
VRFRST (SEQ ID NO: 29)
S62
RWARKK (SEQ ID NO: 62)





S28
IQQRSL (SEQ ID NO: 30)
S63
AAPV (SEQ ID NO: 63)





S29
RQSRIV (SEQ ID NO: 31)
S64
PLGLAR (SEQ ID NO: 64)





S30
SQPRIV (SEQ ID NO: 32)
S65
LSGRSDNH (SEQ ID NO: 65)





S31
FPRS (SEQ ID NO: 33)
S66
SGRSANAK (SEQ ID NO: 66)





S32
LAQA-(homo-F)-RS (SEQ ID
S67
VPLSLYSG (SEQ ID NO: 67)



NO: 34)







S33
LAQAFRS (SEQ ID NO: 35)
S68
HPVGLLAR (SEQ ID NO: 68)





S34
TRFYSR (SEQ ID NO: 36)
S69
VHMPLGFLGP (SEQ ID NO: 69)





S35
YVADAP (SEQ ID NO: 37)
S70
PMAKK (SEQ ID NO: 70)





S71
PRAEALK (SEQ ID NO: 173)
S79
PRAEALY (SEQ ID NO: 181)





S72
PRAEAL (SEQ ID NO: 174)
S80
PRAAALK (SEQ ID NO: 182)





S73
RAEALK (SEQ ID NO: 175)
S81
PTTSALT (SEQ ID NO: 183)





S74
PRAEALS (SEQ ID NO: 176)
S82
SAQAVV (SEQ ID NO: 184)





S75
PRAEALT (SEQ ID NO: 177)
S83
VFRMLSV (SEQ ID NO: 185)





S76
PRAEALA (SEQ ID NO: 178)
S84
PYSARLA (SEQ ID NO: 186)





S77
PRAEALV (SEQ ID NO: 179)
S85
QYAYLT (SEQ ID NO: 187)





S78
PRAEALP (SEQ ID NO: 180)





Amino acids are denoted by a single-letter code.


Upper case letters denote L amino acids.


4-iodo-Phe: 4-iodo-L-phenylalanine


Met (O) 2: L-methionine sulfone


Abu: L-2-aminobutyric acid


Cys (Me): S-methyl-L-cysteine


Nle (O-Bzl): 6-benzyloxy-L-norleucine


Oic: octahydroindole-2-carboxylic Acid


Cha: 3-cyclohexyl-L-alanine


Homo-F: L-homophenylalaine


Cit: L-citrulline













TABLE 4





Exemplary non-peptidic cleavable linkers


















S86


embedded image








S87


embedded image








S88


embedded image








S89


embedded image








S90


embedded image








S91


embedded image








S92


embedded image








S93


embedded image








S94


embedded image








S95


embedded image








S96


embedded image











In one embodiment, a spacing sequence may independently be inserted between different domains. In one embodiment, a spacing sequence is provided between each domain. In other embodiments, a spacing sequence is provided between some, but not all of the domains. Any spacing sequence may be used as appropriate for an intended use. In various embodiments, the spacing sequences independently comprise the amino acid sequence selected from Gx, (GS)x, (GGS)x, (GSA)x, (GGGS)x (SEQ ID NO: 71), (GGGGS)x (SEQ ID NO: 72), where x=1-4, or SPSTPPTPSPSTPP (SEQ ID NO: 73)


Any therapeutic peptide may be used as an X4 domain as appropriate for an intended use. In one embodiment, the therapeutic peptide is a cationic therapeutic peptide. In another embodiment, the therapeutic peptide is an anti-microbial peptide. In additional embodiment, the therapeutic peptide is an anti-cancer peptide. In another embodiment, the therapeutic peptide is a cell-penetrating peptide fused to therapeutic peptide for intracellular targeting. In other embodiments, the therapeutic peptide may be a hydrophobic therapeutic peptide.


In various embodiments, each X4 domain comprises an amino acid sequence selected from the therapeutic peptide sequences listed in Table 5 SEQ ID NO:74-103).









TABLE 5







Exemplary therapeutic peptides








Name
Sequence





(D)Pexiganan
GiGkflkkakkfGkafvkilkk-CONH2 (SEQ ID NO: 74)





(D)CAMEL0
kwklfkkiGavlkvl-CONH2 (SEQ ID NO: 75)





Tachyplesin I
KWCFRVCYRGICYRRCR-CONH2, Disulfide bond: C3 & C16 and



C7 & C12 (SEQ ID NO: 76)





(D)KLA
klaklakklaklak-CONH2 (SEQ ID NO: 77)





r9
rrrrrrrrr-CONH2 (SEQ ID NO: 78)





Cyclic-r9
Cyclo(rrrrrrrrr) (SEQ ID NO: 79)



Amide-cyclized between N-terminal amine and C-



terminal carboxylic acid





Tat
YGRKKRRQRRR-CONH2 (SEQ ID NO: 80)





(D)Tat
yGrkkrrqrrr-CONH2 (SEQ ID NO: 81)





Cyclic-(D)Tat
Cyclo(yGrkkrrqrrr) (SEQ ID NO: 82)



Amide-cyclized between N-terminal amine and C-



terminal carboxylic acid





(D)Transportan
GwtlnsaGyllGkinlkalaalakkil-CONH2 (SEQ ID NO: 83)





Colistin
6-methyloctanoic acid-Dab-Thr-Dab-(Dab-Dab-(D-Leu)-



Leu-Dab-Dab-Thr) (SEQ ID NO: 84)



Amide-cyclized between C-terminal carboxylic acid



and amine-side chain of Dab5 (Bold)



6-methyloctanoic acid can be replaced with 6-



methylheptanoic acid





PMB
6-methyloctanoic acid-Dab-Thr-Dab-(Dab-Dab-(D-Phe)-



Leu-Dab-Dab-Thr) (SEQ ID NO: 85)



Amide-cyclized between C-terminal carboxylic acid



and amine-side chain of Dab5 (Bold)



6-methyloctanoic acid can be replaced with 6-



methylheptanoic acid





Macolacin
6-methyloctanoic acid-Dab-Thr-(D-Ser)-(Dab-Dab-(D-



Leu)-Ile-Dab-Dab-Leu) (SEQ ID NO: 86)



Amide-cyclized between C-terminal carboxylic acid



and amine-side chain of Dab5 (Bold)





Biphenyl-
Bipheny1-4-carboxylic acid-Dab-Thr-(D-Ser)-(Dab-


Macolacin
Dab-(D-Leu-Ile-Dab-Dab-Leu) (SEQ ID NO: 87)



Amide-cyclized between C-terminal carboxylic acid



and amine-side chain of Dab5 (Bold)





POL7080
Cyclo-(T-W-I-(Dab)-(Orn)-(Dab)-(Dab)-W-(Dab)-(Dab)-



A-S-p-P) (SEQ ID NO: 88)



Amide-cyclized between N-terminal amine and C-



terminal carboxylic acid





LL-37
LLGDFFRKSKEKIGKEFKRIVQRIKDFLRNLVPRTES (SEQ ID NO:



89)





(D)LL-37
llGdffrkskekiGkefkrivqrikdflrnlvprtes (SEQ ID NO:



90)





LTX-315
KKWWKK-Dip-K-CONH2 (SEQ ID NO: 91)





(D)LTX-315
kkwwkk-(D-Dip)-k-CONH2 (SEQ ID NO: 92)





LfcinB
FKCRRWQWRMKKLGAPSITCVRRAF-CONH2 (SEQ ID NO: 93)



Disulfide bond between Cys





(D)LfcinB
fkcrrwqwrmkklgapsitcvrraf-CONH2 (SEQ ID NO: 94)



Disulfide bond between Cys





OLP-4
Ac-ILKKWPWWPWRRK (SEQ ID NO: 95)





Magainin 2
GIGKFLHSAKKFGKAFVGEIMNS (SEQ ID NO: 96)





(D)Magainin 2
GiGkflhsakkfGkafvGeimns (SEQ ID NO: 97)





Buforin IIb
RAGLQFPVGRLLRRLLRRLLR (SEQ ID NO: 98)





(D)Buforin IIb
raGlqfpvGrllrrllrrllr (SEQ ID NO: 99)





PR-39
RRRPRPPYLPRPRPPPFFPPRLPPRIPPGFPPRFPPRFP (SEQ ID NO:



100)





RP-182
KERKAFKRFF (SEQ ID NO: 101)





P28
TAADMQGVVTDGMASGLDKDYLKPDD (SEQ ID NO: 102)





Cyclorasin 9A5
Cyclo(WTaRRR-(D-2-Nal)-R-(4-F-Phe)-(D-Nle)-Q) (SEQ



ID NO: 103)



Amide-cyclized between N-terminal amine and C-



terminal carboxylic acid





Amino acids are denoted by either one or three-letter codes.


CONH2 denotes C-terminal amide.


Upper case letters denote L amino acids.


Lower case letters denote D-amino acids.


Dab: L-diaminobutyric acid


Orn: L-ornithine


Dip stands for diphenylalanine


(D-Dip) stands for D-diphenylalanine


D-2-Nal: D-2-naphthylalanine


4-F-Phe: L-4-fluorophenylalanine


D-Nle: D-norleucine






In some embodiments, each X4 domains may comprise a detectable marker, including but not limited to Cy7, Cy5, AF647, AF680, or small molecule therapeutics (ciprofloxacin, LpxC inhibitors, aminoglycosides, rifampicin, linezolid, chemotherapeutics (doxorubicin, monomethyl auristatin E, monomethyl auristatin F, trabectedin, SN-38)). In one embodiment, the detectable marker may be located at the carboxy-terminus of a peptide X4 domain. In other embodiments, 1, 2, 3, or all X4 domains may comprise an azide linkage or other functional groups for conjugation, including but not limited to an Azidoacetyl linkage. In one embodiment, the azide linkage may be at the N-terminus of a peptide X4 domain. Non-limiting examples of such “functionalized” X4 domains are provided in Table 6 (SEQ ID NO:104-136).









TABLE 6





Exemplary functionalized X4 domains
















(D)Pexiganan-
Azidoacetyl-GiGkflkkakkfGkafvkilkk-CONH2 (SEQ ID


azide
NO: 104)





(D)CAMEL0-azide
Azidoacetyl-GkwklfkkiGavlkvl-CONH2 (SEQ ID NO: 105)





Tachyplesin I-
Azidoacetyl-GKWCFRVCYRGICYRRCR-CONH2, Disulfide


azide
bond: C4 & C17 and C8 & C13 (SEQ ID NO: 106)





(D)KLA-azide
Azidoacetyl-klaklakklaklak-CONH2 (SEQ ID NO: 107)





r9-azide
Azidoacetyl-rrrrrrrrr-CONH2 (SEQ ID NO: 108)





Cyclic-r9-azide
Cyclo(K(N3)-rrrrrrrrr) (SEQ ID NO: 109)



Amide-cyclized between N-terminal amine and C-



terminal carboxylic acid





(D)Tat-azide
Azidoacetyl-yGrkkrrqrrr-CONH2 (SEQ ID NO: 110)





Cyclic-(D)Tat-
Cyclo(K(N3)-yGrkkrrqrrr) (SEQ ID NO: 111)


azide
Amide-cyclized between N-terminal amine and C-



terminal carboxylic acid





(D)Transportan-
Azidoacetyl-GwtlnsaGyllGkinlkalaalakkil-CONH2 (SEQ


azide
ID NO: 112)





Colistin-azide
6-methyloctanoic acid-K(N3)-Dab-Thr-Dab-(Dab-Dab-



(D-Leu)-Leu-Dab-Dab-Thr) (SEQ ID NO: 113)



Amide-cyclized between C-terminal carboxylic acid



and amine-side chain of Dab5 (Bold)



6-methyloctanoic acid can be replaced with 6-



methylheptanoic acid





Colistin-X3-
6-methyloctanoic acid-K(Azidoacetyl-X3-)-Dab-Thr-


azide
Dab-(Dab-Dab-(D-Leu)-Leu-Dab-Dab-Thr) (SEQ ID NO:



114)



Amide-cyclized between C-terminal carboxylic acid



and amine-side chain of Dab5 (Bold)



X3 denotes cleavable linker



6-methyloctanoic acid can be replaced with 6-



methylheptanoic acid





Colistin-(X3-
6-methyloctanoic acid-Dab(Azidoacetyl-X3-)-Thr-


NH)-azide
Dab (Azidoacetyl-X3-)-(Dab-Dab (Azidoacetyl-X3-)-(D-



Leu)-Leu-Dab (Azidoacetyl-X3-)-Dab(Azidoacetyl-X3-)-



Thr) (SEQ ID NO: 115)



Amide-cyclized between C-terminal carboxylic acid



and amine-side chain of Dab4 (Bold)




X3 denotes cleavable linker connected to one or




more amino acid (s) with amine side chain



6-methyloctanoic acid can be replaced with 6-



methylheptanoic acid





PMB-azide
6-methyloctanoic acid-K(N3)-Dab-Thr-Dab-(Dab-Dab-



(D-Phe)-Leu-Dab-Dab-Thr) (SEQ ID NO: 116)



Amide-cyclized between C-terminal carboxylic acid



and amine-side chain of Dab5 (Bold)



6-methyloctanoic acid can be replaced with 6-



methylheptanoic acid





PMB-X3-azide
6-methyloctanoic acid-K(Azidoacetyl-X3-)-Dab-Thr-



Dab-(Dab-Dab-(D-Phe)-Leu-Dab-Dab-Thr) (SEQ ID NO:



117)



Amide-cyclized between C-terminal carboxylic acid



and amine-side chain of Dab5 (Bold)




X3 denotes cleavable linker




6-methyloctanoic acid can be replaced with 6-



methylheptanoic acid





PMB-(X3-NH)-
6-methyloctanoic acid-Dab(Azidoacetyl-X3-)-Thr-


azide
Dab(Azidoacetyl-X3-)-(Dab-Dab(Azidoacetyl-X3-)-(D-



Phe)-Leu-Dab(Azidoacetyl-X3-)-Dab(Azidoacetyl-X3-)-



Thr) (SEQ ID NO: 118)



Amide-cyclized between C-terminal carboxylic acid



and amine-side chain of Dab4 (Bold)




X3 denotes cleavable linker connected to one or




more amino acid(s) with amine side chain



6-methyloctanoic acid can be replaced with 6-



methylheptanoic acid





Macolacin-azide
6-methyloctanoic acid-K(N3)-Dab-Thr-(D-Ser)-(Dab-



Dab-(D-Leu)-Ile-Dab-Dab-Leu) (SEQ ID NO: 119)



Amide-cyclized between C-terminal carboxylic acid



and amine-side chain of Dab5 (Bold)





Macolacin-X3-
6-methyloctanoic acid-K(Azidoacetyl-X3-)-Dab-Thr-


azide
(D-Ser)-(Dab-Dab-(D-Leu)-Ile-Dab-Dab-Leu) (SEQ ID



NO: 120)



Amide-cyclized between C-terminal carboxylic acid



and amine-side chain of Dab5 (Bold)




X3 denotes cleavable linker






Macolacin-(X3-
6-methyloctanoic acid-Dab(Azidoacetyl-X3-)-Thr-(D-


NH)-azide
Ser)-(Dab-Dab(Azidoacetyl-X3-)-(D-Leu)-Ile-



Dab(Azidoacetyl-X3-)-Dab(Azidoacetyl-X3-)-Leu) (SEQ



ID NO: 121)



Amide-cyclized between C-terminal carboxylic acid



and amine-side chain of Dab4 (Bold)



X3 denotes cleavable linker connected to one or



more amino acid(s) with amine side chain





Biphenyl-
Biphenyl-4-carboxylic acid-K(N3)-Dab-Thr-(D-Ser)-


Macolacin-azide
(Dab-Dab-(D-Leu)-Ile-Dab-Dab-Leu) (SEQ ID NO: 122)



Amide-cyclized between C-terminal carboxylic acid



and amine-side chain of Dab5 (Bold)





Biphenyl-
Biphenyl-4-carboxylic acid-K(Azidoacetyl-X3-)-Dab-


Macolacin-X3-
Thr-(D-Ser)-(Dab-Dab-(D-Leu)-Ile-Dab-Dab-Leu) (SEQ


azide
ID NO: 123)



Amide-cyclized between C-terminal carboxylic acid



and amine-side chain of Dab5 (Bold)



X3 denotes cleavable linker





Biphenyl-
Biphenyl-4-carboxylic acid-Dab(Azidoacetyl-X3-)-


Macolacin-(X3-
Thr-(D-Ser)-(Dab-Dab (Azidoacetyl-X3-)-(D-Leu)-Ile-


NH)-azide
Dab(Azidoacetyl-X3-)-Dab(Azidoacetyl-X3-)-Leu) (SEQ



ID NO: 124)



Amide-cyclized between C-terminal carboxylic acid



and amine-side chain of Dab5 (Bold)




X3 denotes cleavable linker connected to one or




more amino acid (s) with amine side chain





POL7080-azide
Cyclo-(K(N3)-W-I-(Dab)-(Orn)-(Dab)-(Dab)-W-(Dab)-



(Dab)-A-S-p-P) (SEQ ID NO: 125)



Amide-cyclized between N-terminal amine and C-



terminal carboxylic acid





POL7080-X3-
Cyclo-(K(Azidoacetyl-X3-)-W-I-(Dab)-(Orn)-(Dab)-


azide
(Dab)-W-(Dab)-(Dab)-A-S-p-P) (SEQ ID NO: 126)



Amide-cyclized between N-terminal amine and C-



terminal carboxylic acid




X3 denotes cleavable linker






POL7080-(X3-
Cyclo-(T-W-I-(Dab(Azidoacetyl-X3-))-


NH)-azide
(Orn(Azidoacetyl-X3-))-(Dab(Azidoacetyl-X3-))-



(Dab(Azidoacetyl-X3-))-W-(Dab(Azidoacetyl-X3-))-



(Dab(Azidoacetyl-X3-))-A-S-p-P) (SEQ ID NO: 127)



Amide-cyclized between N-terminal amine and C-



terminal carboxylic acid




X3 denotes cleavable linker connected to one or




more amino acid(s) with amine side chain





POL7080-Cy7
Cyclo-(K(Cy7)-W-I-(Dab)-(Orn)-(Dab)-(Dab)-W-(Dab)-



(Dab)-A-S-p-P) (SEQ ID NO: 128)





POL7080-Cy7-
Cyclo-(K(N3)-W-I-(Dab)-(Orn)-(Dab)-(Dab)-W-(Dab)-


azide
(Dab)-K(Cy7)-S-p-P) (SEQ ID NO: 129)





Colistin-Cy7-
6-methyloctanoic acid-K(N3)-Dab-K(Cy7)-Dab-(Dab-


azide
Dab-(D-Leu)-Leu-Dab-Dab-Thr) (SEQ ID NO: 130)



Amide-cyclized between C-terminal carboxylic acid



and amine-side chain of Dab5 (Bold)



6-methyloctanoic acid can be replaced with 6-



methylheptanoic acid





PMB-Cy7-azide
6-methyloctanoic acid-K(N3)-Dab-K(Cy7)-Dab-(Dab-



Dab-(D-Phe)-Leu-Dab-Dab-Thr) (SEQ ID NO: 131)



Amide-cyclized between C-terminal carboxylic acid



and amine-side chain of Dab5 (Bold)



6-methyloctanoic acid can be replaced with 6-



methylheptanoic acid





Macolacin-Cy7-
6-methyloctanoic acid-K(N3)-Dab-K(Cy7)-(D-Ser)-


azide
(Dab-Dab-(D-Leu)-Ile-Dab-Dab-Leu) (SEQ ID NO: 132)



Amide-cyclized between C-terminal carboxylic acid



and amine-side chain of Dab5 (Bold)





(D)Pexiganan-
Azidoacetyl-GiGkflkkakkfGkafvkilkk-K(Cy7)-CONH2


Cy7-azide
(SEQ ID NO: 133)





(D)KLA-Cy7-
Azidoacetyl-Gklaklakklaklak-K(Cy7)-CONH2 (SEQ ID


azide
NO: 134)





Tachyplesin I-
Azidoacetyl-GKWCFRVCYRGICYRRCRG-K(Cy7)-CONH2,


Cy7-azide
Disulfide bond: C4 & C17 and C8 & C13 (SEQ ID NO: 135)





r9-Cy7-azide
Azidoacetyl-rrrrrrrrr-(Ahx)-K(Cy7)-CONH2 (SEQ ID



NO: 136)





Amino acids are denoted by either one or three-letter codes.


CONH2 denotes C-terminal amide.


Upper case letters denote L amino acids.


Lower case letters denote D-amino acids.


K(N3): L-azidolysine.


Dab: L-diaminobutyric acid.


Orn: L-ornithine.


Azide group can be replaced with other functional groups for conjugation


Cy7 can be replaced with other reporter molecules (e.g. Cy5, AF647, AF680) or small molecule antibiotics (ciprofloxacin, LpxC inhibitors, aminoglycosides)






In specific embodiments, each X4 domain may independently comprise the amino acid sequence selected from SEQ ID NO:136-139.









(SEQ ID NO: 137)


GiGkflkkakkfGkafvkilkk;


and/or





(SEQ ID NO: 138)


GiGkflkkakkfGkafvkilkkK;


and/or





(SEQ ID NO: 139)


Cyclo-(T-W-I-(Dab)-(Orn)-(Dab)-(Dab)-W-(Dab)-


(Dab)-A-S-p-P);


and/or





(SEQ ID NO: 140)


6-methyloctanoic acid-Dab-T-Dab-(Dab-Dab-1-L-


Dab-Dab-T);






wherein residues in lower case are D amino acids and residues in upper case are L amino acids or glycine (which is achiral) and Dab and Orn denote L-diaminobutyric acid and L-ornithine respectively.


In other specific embodiments, of the fusion proteins:

    • (a) each X1 domain comprises the amino acid sequence (MG)LKEAKEKA IEELKKAGIT SDYYFDLINK AKTVEGVNAL KDEILKA (SEQ ID NO: 4), wherein the residues in parentheses are optional and may be present or absent, or wherein each X1 domain independently comprises the amino acid sequence LKEAKEKA IEELKKAGIT SDYYFDLINK AKTVEGVNAL KDEILKA (SEQ ID NO: 5);
    • (b) each X2 domain comprises the amino acid sequence (EEG)x, wherein “x” is 1-20, 2-16, 3-12, 4-10, 5-8, 1-15, 1-10, 2-10, 3-10, 4-10, 5-10, 2-8, 3-8, 4-8, 5-8, 5-7, 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10;
    • (c) each X3 domain comprises the amino acid sequence SPSTPPTPSPSTPP (SEQ ID NO: 73); and
    • (d) each X4 domain comprises the amino acid sequence GiGkflkkakkfGkafvkilkk (SEQ ID NO: 137); and/or GiGkflkkakkfGkafvkilkkK (SEQ ID NO: 138); and/or
    • Cyclo-(T-W-I-(Dab)-(Orn)-(Dab)-(Dab)-W-(Dab)-(Dab)-A-S-p-P) (SEQ ID NO: 139); and/or 6-methyloctanoic acid-Dab-T-Dab-(Dab-Dab-1-L-Dab-Dab-T) (SEQ ID NO: 140);
    • wherein residues in lower case are D amino acids and residues in upper case are L amino acids or glycine (which is achiral) and Dab and Orn denote L-diaminobutyric acid and L-ornithine respectively.


In one such embodiment, the fusion protein comprises 1 X1 domain, 1 X2 domain, 1 X3 domain, and 1 X4 domain.


Any targeting domain may be used as the X5 domain(s), as deemed appropriate for an intended purpose. In various embodiments, the X5 domain comprises the amino acid sequence selected from the group consisting of those shown in Table 7 (SEQ ID NO:141-155 and 188-192).









TABLE 7







Exemplary Targeting domains








Target
Sequence





ICAM1
FEGFSFLAFEDFVSSI (SEQ ID NO: 141)





ICAM1
QVQLVESGGGLVQPGGSLRLSCAASGSISSLYVMGWYRQAPG



KQRELVADITSSGSIYYVDSLKGRFTISRDNARSTVYLQMNSLEP



EDTAVYYCMAHVRQDSGSEYLTYWGQGTQVTVSS (SEQ ID NO:



142)





ICAM1
NVDLVFLFDGSMSLQPDEFQKILDFMKDVMKKLSNTSYQFAAV



QFSTSYKTEFDFSDYVKWKDPDALLKHVKHMLLLTNTFGAINYV



ATEVFREELGARPDATKVLIIITDGEATDSGNIDAAKDIIRYIIGIGK



HSQTKESQETLHKFASKPASEFVKILDTGEKLKDLFTELQKKIYVIE



GTSKQDLT (SEQ ID NO: 143)





Aminopeptidase P
DVVMTQTPSSLSASRGDRVTISCSASQAISKYLNWYQQKPDGTV



KLLINYTSRLHSGVPSRFSGSGSGTDYSLTISNLEPEDIATYYCQQY



NKLPYTFGGGTKLEIKGGGGSGGGGSGGGGSEVQLQQSGAELM



KSGASVKISCKATGYTFSSYWIEWIKQRPGHGLEWIGEILPGSGST



NYNEKFKGKATVTTDTSSNTAYMQFSSLTSEDSAVYYCARWYDG



HFDYWGQGTTLTVSSTS (SEQ ID NO: 144)





CD177
DFYKPMPNLRIT (SEQ ID NO: 145)





CD177
LQIQSWSSSP (SEQ ID NO: 146)





CD177
FPLETSHMSAPL (SEQ ID NO: 147)





CD177
KFPDLDSRRLPHMSL (SEQ ID NO: 148)





P-selectin
KYIKFKHDYNILEFNDGTFE (SEQ ID NO: 149)





P-selectin
EWVDV (SEQ ID NO: 150)





P-selectin
DVEWVDVA (SEQ ID NO: 151)





E-selectin
DITWDQLWDLMK (SEQ ID NO: 152)





PcrV
EVQLVESGGGLVQPGGSLRLSCAASGSTLDYYAIGWFRQAPG



KEREGVSCTSNSGSTYYGGSVKGRFTASRDNAKNTVYLQMNSLR



PEDTAVYYCVATIGCATLGGTLDVQRYYYRGQGTQVTVSS (SEQ



ID NO: 153)





Flagellin
QVQLQESGGGLVQAGGSLTLSCAASGRTFSNYAMGWFRQAP



GKEREFVAMISWNGENTYYADSVKGRFTISRDNAKNTVYLQM



NSLKPEDTAVYYCAVRILSGWYDRPDEYGYWGQGTQVTVSS (SEQ



ID NO: 154)





IRGD
CRGDKGPDC (SEQ ID NO: 155)



Disulfide bond between cysteines





ADAM10
EMQLVESGGGLVQTGGSLRLSCAASGRTFTSYCVGWWRQAP



GKERDVVAAITRGSNSTDYVDSVKGRFTISRDNAENTVYLQ



MNSLKPEDTAVYYCAADINCRNLYTGRPEYWGQGTQVTVSS (SEQ



ID NO: 188)





ADAM10
EVQLVESGGGLVQAGGSLRLSCAASERIFSTYFMGWFRQAP



GKEREFVAFISGNGGSTDYADSVKGRFAISRDNVKNTLYLQ



MSSLKPDDTAVYYCAVAGRQIKSTWDYWGQGTQVTVSS (SEQ ID



NO: 189)





ADAM10
EVQLVESGGGLVQAGGSLRLSCARSGRISNINIMSWYRQAP



GKTRDMVAAIIGDSTNYADSVKGRFTISRDNAKNTVHLQMN



RLKPEDTGVYYCNIPGVDWGQGTQVTVSS (SEQ ID NO: 190)





ADAM10
EVQLVESGGGLVQAGGSLRLSCARSGRISNINIMAWYRQAP



GKTRDMVAAIIGDSTNYADSVKGRFTISRDNAKNTVYLHMN



RLKPEDTGVYYCKISGVDWGQGTQVTVSS (SEQ ID NO: 191)





ADAM10
KVQLVESGGGLVQAGGSLRLSCAASGNIFINNAVGWYRQAP



GKQREMVAAMLSGGSTNYADSVKGRFTISRDNAKNTVYLQM



NSLKPEDTAVYYCNVQVNGTWARWGQGTQVTVSS (SEQ ID NO:



192)









The fusion proteins may be made via any suitable technique, including chemical synthesis. In some embodiments of the linear fusion proteins, the fusion protein may be genetically encodable and can be expressed using standard recombinant techniques.


The disclosure further comprises nucleic acids encoding a genetically encodable fusion protein, expression vectors comprising a nucleic acid encoding the fusion protein operatively linked to a suitable regulatory sequence, and host cells comprising the nucleic acids and/or the expression vector. The nucleic acid may comprise RNA or DNA, and may comprise additional sequences useful for promoting expression and/or purification of the encoded protein, including but not limited to polyA sequences, modified Kozak sequences. “Expression vectors” includes vectors that operatively link a nucleic acid coding region or gene to any control sequences capable of effecting expression of the gene product. “Control sequences” operably linked to the nucleic acid sequences of the invention are nucleic acid sequences capable of effecting the expression of the nucleic acid molecules. The control sequences need not be contiguous with the nucleic acid sequences, so long as they function to direct the expression thereof. Thus, for example, intervening untranslated yet transcribed sequences can be present between a promoter sequence and the nucleic acid sequences and the promoter sequence can still be considered “operably linked” to the coding sequence. Other such control sequences include, but are not limited to, polyadenylation signals, termination signals, and ribosome binding sites. Such expression vectors include but are not limited to, plasmid and viral-based expression vectors. The control sequence used to drive expression of the disclosed nucleic acid sequences in a mammalian system may be constitutive (driven by any of a variety of promoters, including but not limited to, CMV, SV40, RSV, actin, EF) or inducible (driven by any of a number of inducible promoters including, but not limited to, tetracycline, ecdysone, steroid-responsive). The expression vector must be replicable in the host organisms either as an episome or by integration into host chromosomal DNA. In various embodiments, the expression vector may comprise a plasmid, viral-based vector (including but not limited to a retroviral vector or oncolytic virus), or any other suitable expression vector. The host cells can be transiently or stably engineered to incorporate the expression vector, using techniques including but not limited to bacterial transformations, calcium phosphate co-precipitation, electroporation, or liposome mediated-, DEAE dextran mediated-, polycationic mediated-, or viral mediated transfection. (See, for example, Molecular Cloning: A Laboratory Manual (Sambrook, et al., 1989, Cold Spring Harbor Laboratory Press); Culture of Animal Cells: A Manual of Basic Technique, 2nd Ed. (R. I. Freshney. 1987. Liss, Inc. New York, NY)). A method of producing a polypeptide according to the invention is an additional part of the invention. The method comprises the steps of (a) culturing a host according to this aspect of the invention under conditions conducive to the expression of the fusion protein, and (b) optionally, recovering the expressed fusion protein.


In various non-limiting embodiments, the fusion protein comprises the structure selected from the fusion proteins listed in Table 8 (SEQ ID No: 156-170 and 193-204, wherein any detectable labels are optional.









TABLE 8







Examples of final fusion proteins








Fusion



protein
Sequence





ABD-(EEG)6-
GLKEAKEKAIEELKKAGITSDYYFDLINKAKTVEGVNALKDEILKAHHHHHH


S1-(D)Pex
SPSTPPTPSPSTPPEEGEEGEEGEEGEEGEEGPLGVRGKLVPRGC-(DBCO-



Maleimide)-Azidoacetyl-GiGkflkkakkfGkafvkilkk-CONH2



(SEQ ID NO: 156)



Chemically linked at Cys-Maleimide and DBCO-azide





(ABD)2-
GLKEAKEKAIEELKKAGITSDYYFDLINKAKTVEGVNALKDEILKAGGGGSG


(EEG)6-S12-
GGGSGGGGSLKEAKEKAIEELKKAGITSDYYFDLINKAKTVEGVNALKDEIL


(D)Pex
KAHHHHHHSPSTPPTPSPSTPPEEGEEGEEGEEGEEGEEGPLGLRSWGC-



(DBCO-Maleimide)-Azidoacetyl-GiGkflkkakkfGkafvkilkk-



CONH2 (SEQ ID NO: 157)



Chemically linked at Cys-Maleimide and DBCO-azide





(ABD)2-
GLKEAKEKAIEELKKAGITSDYYFDLINKAKTVEGVNALKDEILKAGGGGSG


(EEG)6-S12-
GGGSGGGGSLKEAKEKAIEELKKAGITSDYYFDLINKAKTVEGVNALKDEIL


(D)Pex-Cy7
KAHHHHHHSPSTPPTPSPSTPPEEGEEGEEGEEGEEGEEGPLGLRSWGC-



(DBCO-Maleimide)-Azidoacetyl-GiGkflkkakkfGkafvkilkk-



K(Cy7)-CONH2 (SEQ ID NO: 158)



Chemically linked at Cys-Maleimide and DBCO-azide





(ABD)2-
GLKEAKEKAIEELKKAGITSDYYFDLINKAKTVEGVNALKDEILKAGGGGSG


(EEG)6-S12-
GGGSGGGGSLKEAKEKAIEELKKAGITSDYYFDLINKAKTVEGVNALKDEIL


S12-POL7080
KAHHHHHHSPSTPPTPSPSTPPEEGEEGEEGEEGEEGEEGC-(DBCO-



Maleimide)-Cyclo-(K(Azidoacetyl-GPLGLRSWGPLGLRSW-)-



W-I-(Dab)-(Orn)-(Dab)-(Dab)-W-(Dab)-(Dab)-A-S-p-P)



(SEQ ID NO: 159)



Chemically linked at Cys-Maleimide and DBCO-azide





(ABD)2-
GLKEAKEKAIEELKKAGITSDYYFDLINKAKTVEGVNALKDEILKAGGGGSG


(EEG)6-S12-
GGGSGGGGSLKEAKEKAIEELKKAGITSDYYFDLINKAKTVEGVNALKDEIL


S6-POL7080
KAHHHHHHSPSTPPTPSPSTPPEEGEEGEEGEEGEEGEEGC-(DBCO-



Maleimide)-Cyclo-(T-W-I-(Dab(Azidoacetyl-GPLGLRSWG-



Nle(O-Bzl)-Met(O)2-Oic-Abu-))-(Orn)-(Dab)-(Dab)-W-



(Dab)-(Dab)-A-S-p-P) (SEQ ID NO: 160)



Chemically linked at Cys-Maleimide and DBCO-azide





(ABD)2-
GLKEAKEKAIEELKKAGITSDYYFDLINKAKTVEGVNALKDEILKAGGGGSG


(EEG)6-S12-
GGGSGGGGSLKEAKEKAIEELKKAGITSDYYFDLINKAKTVEGVNALKDEIL


S12-Colistin
KAHHHHHHSPSTPPTPSPSTPPEEGEEGEEGEEGEEGEEGC-(DBCO-



Maleimide)-6-methyloctanoic acid-K(Azidoacetyl-



GPLGLRSWGPLGLRSW-)-Dab-Thr-Dab-(Dab-Dab-(D-Leu)-Leu-



Dab-Dab-Thr) (SEQ ID NO: 161)



Chemically linked at Cys-Maleimide and DBCO-azide





(ABD)2-
GLKEAKEKAIEELKKAGITSDYYFDLINKAKTVEGVNALKDEILKAGGGGSG


(EEG)6-S12-
GGGSGGGGSLKEAKEKAIEELKKAGITSDYYFDLINKAKTVEGVNALKDEIL


S6-Colistin
KAHHHHHHSPSTPPTPSPSTPPEEGEEGEEGEEGEEGEEGC-(DBCO-



Maleimide)-6-methyloctanoic acid-Dab(Azidoacetyl-



GPLGLRSWGPLGLRSW-Nle(O-Bzl)-Met(O)2-Oic-Abu-)-Thr-



Dab-(Dab-Dab-(D-Leu)-Leu-Dab-Dab-Thr) (SEQ ID NO:



162)



Chemically linked at Cys-Maleimide and DBCO-azide





(ABD)2-
GLKEAKEKAIEELKKAGITSDYYFDLINKAKTVEGVNALKDEILKAGGGGSG


(EEG)6-S12-
GGGSGGGGSLKEAKEKAIEELKKAGITSDYYFDLINKAKTVEGVNALKDEIL


S12-
KAHHHHHHSPSTPPTPSPSTPPEEGEEGEEGEEGEEGEEGC-(DBCO-


Biphenyl-
Maleimide)-Biphenyl-4-carboxylic acid-K(Azidoacetyl-


Macolacin
GPLGLRSWGPLGLRSW-)-Dab-Thr-(D-Ser)-(Dab-Dab-(D-Leu)-



Ile-Dab-Dab-Leu) (SEQ ID NO: 163)



Chemically linked at Cys-Maleimide and DBCO-azide





(ABD)2-
GLKEAKEKAIEELKKAGITSDYYFDLINKAKTVEGVNALKDEILKAGGGGSG


(EEG)6-S12-
GGGSGGGGSLKEAKEKAIEELKKAGITSDYYFDLINKAKTVEGVNALKDEIL


S6-Biphenyl-
KAHHHHHHSPSTPPTPSPSTPPEEGEEGEEGEEGEEGEEGC-(DBCO-


Macolacin
Maleimide)-Biphenyl-4-carboxylic acid-



Dab(Azidoacetyl-GPLGLRSWG-Nle(O-Bzl)-Met(O)2-Oic-



Abu-)-Thr-(D-Ser)-(Dab-Dab-(D-Leu)-Ile-Dab-Dab-Leu)



(SEQ ID NO: 164)



Chemically linked at Cys-Maleimide and DBCO-azide





DFY-(ABD)2-
GDFYKPMPNLRITGGGGSLKEAKEKAIEELKKAGITSDYYFDLINKA


(EEG)6-S12-
KTVEGVNALKDEILKAGGGGSGGGGSGGGGSLKEAKEKAIEELKKAGITS


S12-POL7080
DYYFDLINKAKTVEGVNALKDEILKAHHHHHHSPSTPPTPSPSTPPEEGEE



GEEGEEGEEGEEGC-(DBCO-Maleimide)-Cyclo-



(K(Azidoacetyl-GPLGLRSWGPLGLRSW-)-W-I-(Dab)-(Orn)-



(Dab)-(Dab)-W-(Dab)-(Dab)-A-S-p-P) (SEQ ID NO: 165)



Chemically linked at Cys-Maleimide and DBCO-azide





(DFY)2-
GDFYKPMPNLRITGGGGSDFYKPMPNLRITGGGGSLKEAKEKAIEELKKA


(ABD)2-
GITSDYYFDLINKAKTVEGVNALKDEILKAGGGGSGGGGSGGGGSLKEAK


(EEG)6-S12-
EKAIEELKKAGITSDYYFDLINKAKTVEGVNALKDEILKAHHHHHHSPSTPP


S12-POL7080
TPSPSTPPEEGEEGEEGEEGEEGEEGC-(DBCO-Maleimide)-Cyclo-



(K(Azidoacetyl-GPLGLRSWGPLGLRSW-)-W-I-(Dab)-(Orn)-



(Dab)-(Dab)-W-(Dab)-(Dab)-A-S-p-P) (SEQ ID NO: 166)



Chemically linked at Cys-Maleimide and DBCO-azide





(DFY-ABD)2-
GDFYKPMPNLRITGGGGSLKEAKEKAIEELKKAGITSDYYFDLINKAKTVEG


(EEG)6-S12-
VNALKDEILKAGGGGSDFYKPMPNLRITGGGGSLKEAKEKAIEELKKAGITS


S12-POL7080
DYYFDLINKAKTVEGVNALKDEILKAHHHHHHSPSTPPTPSPSTPPEEGEEG



EEGEEGEEGEEGC-(DBCO-Maleimide)-Cyclo-(K(Azidoacetyl-



GPLGLRSWGPLGLRSW-)-W-I-(Dab)-(Orn)-(Dab)-(Dab)-W-



(Dab)-(Dab)-A-S-p-P) (SEQ ID NO: 167)



Chemically linked at Cys-Maleimide and DBCO-azide





DFY-(ABD)2-
GDFYKPMPNLRITGGGGSLKEAKEKAIEELKKAGITSDYYFDLINKA


(EEG)6-S12-
KTVEGVNALKDEILKAGGGGSGGGGSGGGGSLKEAKEKATEELKKAGITS


S6-Colistin
DYYFDLINKAKTVEGVNALKDEILKAHHHHHHSPSTPPTPSPSTPPEEGEE



GEEGEEGEEGEEGC-(DBCO-Maleimide)-6-methyloctanoic



acid-Dab(Azidoacetyl-GPLGLRSWGPLGLRSW-Nle(O-Bzl)-



Met(O)2-Oic-Abu-)-Thr-Dab-(Dab-Dab-(D-Leu)-Leu-Dab-



Dab-Thr) (SEQ ID NO: 168)



Chemically linked at Cys-Maleimide and DBCO-azide





(DFY)2-
GDFYKPMPNLRITGGGGSDFYKPMPNLRITGGGGSLKEAKEKAIEELKKA


(ABD)2-
GITSDYYFDLINKAKTVEGVNALKDEILKAGGGGSGGGGSGGGGSLKEAK


(EEG)6-S12-
EKAIEELKKAGITSDYYFDLINKAKTVEGVNALKDEILKAHHHHHHSPSTPP


S6-Colistin
TPSPSTPPEEGEEGEEGEEGEEGEEGC-(DBCO-Maleimide)-6-



methyloctanoic acid-Dab(Azidoacetyl-



GPLGLRSWGPLGLRSW-Nle(O-Bzl)-Met(O)2-Oic-Abu-)-Thr-



Dab-(Dab-Dab-(D-Leu)-Leu-Dab-Dab-Thr) (SEQ ID NO:



169)



Chemically linked at Cys-Maleimide and DBCO-azide





(DFY-ABD)2-
GDFYKPMPNLRITGGGGSLKEAKEKAIEELKKAGITSDYYFDLINKAKTVEG


(EEG)6-S12-
VNALKDEILKAGGGGSDFYKPMPNLRITGGGGSLKEAKEKAIEELKKAGITS


S6-Colistin
DYYFDLINKAKTVEGVNALKDEILKAHHHHHHSPSTPPTPSPSTPPEEGEEG



EEGEEGEEGEEGC-(DBCO-Maleimide)-6-methyloctanoic



acid-Dab(Azidoacetyl-GPLGLRSWGPLGLRSW-Nle(O-Bzl)-



Met(O)2-Oic-Abu-)-Thr-Dab-(Dab-Dab-(D-Leu)-Leu-Dab-



Dab-Thr) (SEQ ID NO: 170)



Chemically linked at Cys-Maleimide and DBCO-azide





(ABD)2-
GLKEAKEKAIEELKKAGITSDYYFDLINKAKTVEGVNALKDEILKAGGGGSG


(EEG)6-S74-
GGGSGGGGSLKEAKEKAIEELKKAGITSDYYFDLINKAKTVEGVNALKDEIL


S74-POL7080
KAHHHHHHSPSTPPTPSPSTPPEEGEEGEEGEEGEEGEEGC-(DBCO-



Maleimide)-Cyclo-(K(Azidoacetyl-GPRAEALSPRAEALS-)-W-



I-(Dab)-(Orn)-(Dab)-(Dab)-W-(Dab)-(Dab)-A-S-p-P)



(SEQ ID NO: 193)



Chemically linked at Cys-Maleimide and DBCO-azide





(ABD)2-
GLKEAKEKAIEELKKAGITSDYYFDLINKAKTVEGVNALKDEILKAGGGGSG


(EEG)6-S75-
GGGSGGGGSLKEAKEKAIEELKKAGITSDYYFDLINKAKTVEGVNALKDEIL


S75-POL7080
KAHHHHHHSPSTPPTPSPSTPPEEGEEGEEGEEGEEGEEGC-(DBCO-



Maleimide)-Cyclo-(K(Azidoacetyl-GPRAEALTPRAEALT-)-W-



I-(Dab)-(Orn)-(Dab)-(Dab)-W-(Dab)-(Dab)-A-S-p-P)



(SEQ ID NO: 194)



Chemically linked at Cys-Maleimide and DBCO-azide





VHH1-(ABD)2-
EMQLVESGGGLVQTGGSLRLSCAASGRTFTSYCVGWWRQAP


(EEG)6-S75-
GKERDVVAAITRGSNSTDYVDSVKGRFTISRDNAENTVYLQ


POL7080
MNSLKPEDTAVYYCAADINCRNLYTGRPEYWGQGTQVTVSS



GGGGSGGGGSGGGGSLKEAKEKAIEELKKAGITSDYYFDLI



NKAKTVEGVNALKDEILKAGGGGSGGGGSGGGGSLKEAKEK



AIEELKKAGITSDYYFDLINKAKTVEGVNALKDEILKAHHH



HHHSPSTPPTPSPSTPPEEGEEGEEGEEGEEGEEGC-(DBCO-



Maleimide)-Cyclo-(K(Azidoacetyl-GPRAEALT-)-W-I-



(Dab)-(Orn)-(Dab)-(Dab)-W-(Dab)-(Dab)-A-S-p-P) (SEQ



ID NO: 195)



Chemically linked at Cys-Maleimide and DBCO-azide





VHH2-(ABD)2-
EVQLVESGGGLVQAGGSLRLSCAASERIFSTYFMGWFRQAP


(EEG)6-S75-
GKEREFVAFISGNGGSTDYADSVKGRFAISRDNVKNTLYLQ


POL7080
MSSLKPDDTAVYYCAVAGRQIKSTWDYWGQGTQVTVSS



GGGGSGGGGSGGGGSLKEAKEKAIEELKKAGITSDYYFDLI



NKAKTVEGVNALKDEILKAGGGGSGGGGSGGGGSLKEAKEK



AIEELKKAGITSDYYFDLINKAKTVEGVNALKDEILKAHHH



HHHSPSTPPTPSPSTPPEEGEEGEEGEEGEEGEEGC-(DBCO-



Maleimide)-Cyclo-(K(Azidoacetyl-GPRAEALT-)-W-I-



(Dab)-(Orn)-(Dab)-(Dab)-W-(Dab)-(Dab)-A-S-p-P) (SEQ



ID NO: 196)



Chemically linked at Cys-Maleimide and DBCO-azide





VHH3-(ABD)2-
EVQLVESGGGLVQAGGSLRLSCARSGRISNINIMSWYRQAP


(EEG)6-S75-
GKTRDMVAAIIGDSTNYADSVKGRFTISRDNAKNTVHLQMN


POL7080
RLKPEDTGVYYCNIPGVDWGQGTQVTVSS



GGGGSGGGGSGGGGSLKEAKEKAIEELKKAGITSDYYFDLI



NKAKTVEGVNALKDEILKAGGGGSGGGGSGGGGSLKEAKEK



AIEELKKAGITSDYYFDLINKAKTVEGVNALKDEILKAHHH



HHHSPSTPPTPSPSTPPEEGEEGEEGEEGEEGEEGC-(DBCO-



Maleimide)-Cyclo-(K(Azidoacetyl-GPRAEALT-)-W-I-



(Dab)-(Orn)-(Dab)-(Dab)-W-(Dab)-(Dab)-A-S-p-P) (SEQ



ID NO: 197)



Chemically linked at Cys-Maleimide and DBCO-azide





VHH4-(ABD)2-
EVQLVESGGGLVQAGGSLRLSCARSGRISNINIMAWYRQAP


(EEG)6-S75-
GKTRDMVAAIIGDSTNYADSVKGRFTISRDNAKNTVYLHMN


POL7080
RLKPEDTGVYYCKISGVDWGQGTQVTVSS



GGGGSGGGGSGGGGSLKEAKEKAIEELKKAGITSDYYFDLI



NKAKTVEGVNALKDEILKAGGGGSGGGGSGGGGSLKEAKEK



AIEELKKAGITSDYYFDLINKAKTVEGVNALKDEILKAHHH



HHHSPSTPPTPSPSTPPEEGEEGEEGEEGEEGEEGC-(DBCO-



Maleimide)-Cyclo-(K(Azidoacetyl-GPRAEALT-)-W-I-



(Dab)-(Orn)-(Dab)-(Dab)-W-(Dab)-(Dab)-A-S-p-P) (SEQ



ID NO: 198)



Chemically linked at Cys-Maleimide and DBCO-azide





VHH5-(ABD)2-
KVQLVESGGGLVQAGGSLRLSCAASGNIFINNAVGWYRQAP


(EEG)6-S75-
GKQREMVAAMLSGGSTNYADSVKGRFTISRDNAKNTVYLQM


POL7080
NSLKPEDTAVYYCNVQVNGTWARWGQGTQVTVSS



GGGGSGGGGSGGGGSLKEAKEKAIEELKKAGITSDYYFDLI



NKAKTVEGVNALKDEILKAGGGGSGGGGSGGGGSLKEAKEK



AIEELKKAGITSDYYFDLINKAKTVEGVNALKDEILKAHHH



HHHSPSTPPTPSPSTPPEEGEEGEEGEEGEEGEEGC-(DBCO-



Maleimide)-Cyclo-(K(Azidoacetyl-GPRAEALT-)-W-I-



(Dab)-(Orn)-(Dab)-(Dab)-W-(Dab)-(Dab)-A-S-p-P) (SEQ



ID NO: 199)



Chemically linked at Cys-Maleimide and DBCO-azide





VHH1-(ABD)2-
EMQLVESGGGLVQTGGSLRLSCAASGRTFTSYCVGWWRQAP


(EEG)6-S75-
GKERDVVAAITRGSNSTDYVDSVKGRFTISRDNAENTVYLQ


Colistin
MNSLKPEDTAVYYCAADINCRNLYTGRPEYWGQGTQVTVSS



GGGGSGGGGSGGGGSLKEAKEKAIEELKKAGITSDYYFDLI



NKAKTVEGVNALKDEILKAGGGGSGGGGSGGGGSLKEAKEK



AIEELKKAGITSDYYFDLINKAKTVEGVNALKDEILKAHHH



HHHSPSTPPTPSPSTPPEEGEEGEEGEEGEEGEEGC-(DBCO-



Maleimide)-6-methyloctanoic acid-Dab(Azidoacetyl-



GPRAEALT-)-Thr-Dab-(Dab-Dab-(D-Leu)-Leu-Dab-Dab-Thr)



(SEQ ID NO: 200)



Chemically linked at Cys-Maleimide and DBCO-azide





VHH2-(ABD)2-
EVQLVESGGGLVQAGGSLRLSCAASERIFSTYFMGWFRQAP


(EEG)6-S75-
GKEREFVAFISGNGGSTDYADSVKGRFAISRDNVKNTLYLQ


Colistin
MSSLKPDDTAVYYCAVAGRQIKSTWDYWGQGTQVTVSS



GGGGSGGGGSGGGGSLKEAKEKAIEELKKAGITSDYYFDLI



NKAKTVEGVNALKDEILKAGGGGSGGGGSGGGGSLKEAKEK



AIEELKKAGITSDYYFDLINKAKTVEGVNALKDEILKAHHH



HHHSPSTPPTPSPSTPPEEGEEGEEGEEGEEGEEGC-(DBCO-



Maleimide)-6-methyloctanoic acid-Dab(Azidoacetyl-



GPRAEALT-)-Thr-Dab-(Dab-Dab-(D-Leu)-Leu-Dab-Dab-Thr)



(SEQ ID NO: 201)



Chemically linked at Cys-Maleimide and DBCO-azide





VHH3-(ABD)2-
EVQLVESGGGLVQAGGSLRLSCARSGRISNINIMSWYRQAP


(EEG)6-S75-
GKTRDMVAAIIGDSTNYADSVKGRFTISRDNAKNTVHLQMN


Colistin
RLKPEDTGVYYCNIPGVDWGQGTQVTVSS



GGGGSGGGGSGGGGSLKEAKEKAIEELKKAGITSDYYFDLI



NKAKTVEGVNALKDEILKAGGGGSGGGGSGGGGSLKEAKEK



AIEELKKAGITSDYYFDLINKAKTVEGVNALKDEILKAHHH



HHHSPSTPPTPSPSTPPEEGEEGEEGEEGEEGEEGC-(DBCO-



Maleimide)-6-methyloctanoic acid-Dab(Azidoacetyl-



GPRAEALT-)-Thr-Dab-(Dab-Dab-(D-Leu)-Leu-Dab-Dab-Thr)



(SEQ ID NO: 202)



Chemically linked at Cys-Maleimide and DBCO-azide





VHH4-(ABD)2-
EVQLVESGGGLVQAGGSLRLSCARSGRISNINIMAWYRQAP


(EEG)6-S75-
GKTRDMVAAIIGDSTNYADSVKGRFTISRDNAKNTVYLHMN


Colistin
RLKPEDTGVYYCKISGVDWGQGTQVTVSS



GGGGSGGGGSGGGGSLKEAKEKAIEELKKAGITSDYYFDLI



NKAKTVEGVNALKDEILKAGGGGSGGGGSGGGGSLKEAKEK



AIEELKKAGITSDYYFDLINKAKTVEGVNALKDEILKAHHH



HHHSPSTPPTPSPSTPPEEGEEGEEGEEGEEGEEGC-(DBCO-



Maleimide)-6-methyloctanoic acid-Dab(Azidoacetyl-



GPRAEALT-)-Thr-Dab-(Dab-Dab-(D-Leu)-Leu-Dab-Dab-Thr)



(SEQ ID NO: 203)



Chemically linked at Cys-Maleimide and DBCO-azide





VHH5-(ABD)2-
KVQLVESGGGLVQAGGSLRLSCAASGNIFINNAVGWYRQAP


(EEG)6-S75-
GKQREMVAAMLSGGSTNYADSVKGRFTISRDNAKNTVYLQM


Colistin
NSLKPEDTAVYYCNVQVNGTWARWGQGTQVTVSS



GGGGSGGGGSGGGGSLKEAKEKAIEELKKAGITSDYYFDLI



NKAKTVEGVNALKDEILKAGGGGSGGGGSGGGGSLKEAKEK



AIEELKKAGITSDYYFDLINKAKTVEGVNALKDEILKAHHH



HHHSPSTPPTPSPSTPPEEGEEGEEGEEGEEGEEGC-(DBCO-



Maleimide)-6-methyloctanoic acid-Dab(Azidoacetyl-



GPRAEALT-)-Thr-Dab-(Dab-Dab-(D-Leu)-Leu-Dab-Dab-Thr)



(SEQ ID NO: 204)



Chemically linked at Cys-Maleimide and DBCO-azide









In one embodiment, the fusion protein comprises the structure of (ABD)2-(EEG)6-S12-(D)Pex (SEQ ID NO:157), (ABD)2-(EEG)6-S12-(D)Pex-Cy7 (SEQ ID NO:158), (ABD)2-(EEG)6-S12-S12-POL7080 (SEQ ID NO:159), or (ABD)2-(EEG)6-S12-S12-Colistin (SEQ ID NO:161).


In another embodiment, the disclosure provides compositions, comprising a plurality of fusion proteins according to any embodiment or embodiments herein. Such compositions may comprise fusion proteins having the same therapeutic peptide or different therapeutic peptides, or the same fusion protein or different fusion proteins (with the same or different therapeutic peptides). In various embodiments, the plurality of fusion proteins comprise 2, 3, 4, 5, 6, 7, 8, 9, 10, or more different therapeutic peptides in total. This embodiment is useful for therapeutic peptide combination that is synergistic in activity.


In another embodiment, the compositions further comprises a pharmaceutically acceptable carrier. Such pharmaceutical compositions of the disclosure can be used, for example, in the methods of the disclosure described herein. The pharmaceutical compositions may further comprise (a) a lyoprotectant; (b) a surfactant; (c) a bulking agent; (d) a tonicity adjusting agent; (e) a stabilizer; (f) a preservative and/or (g) a buffer. In some embodiments, the buffer in the pharmaceutical composition is a Tris buffer, a histidine buffer, a phosphate buffer, a citrate buffer or an acetate buffer. The pharmaceutical composition may also include a lyoprotectant, e.g. sucrose, sorbitol or trehalose. In certain embodiments, the pharmaceutical composition includes a preservative e.g. benzalkonium chloride, benzethonium, chlorohexidine, phenol, m-cresol, benzyl alcohol, methylparaben, propylparaben, chlorobutanol, o-cresol, p-cresol, chlorocresol, phenylmercuric nitrate, thimerosal, benzoic acid, and various mixtures thereof. In other embodiments, the pharmaceutical composition includes a bulking agent, like glycine. In yet other embodiments, the pharmaceutical composition includes a surfactant e.g., polysorbate-20, polysorbate-40, polysorbate-60, polysorbate-65, polysorbate-80 polysorbate-85, poloxamer-188, sorbitan monolaurate, sorbitan monopalmitate, sorbitan monostearate, sorbitan monooleate, sorbitan trilaurate, sorbitan tristearate, sorbitan trioleaste, or a combination thereof. The pharmaceutical composition may also include a tonicity adjusting agent, e.g., a compound that renders the formulation substantially isotonic or isoosmotic with human blood. Exemplary tonicity adjusting agents include sucrose, sorbitol, glycine, methionine, mannitol, dextrose, inositol, sodium chloride, arginine and arginine hydrochloride. In other embodiments, the pharmaceutical composition additionally includes a stabilizer, e.g., a molecule which, when combined with a protein of interest substantially prevents or reduces chemical and/or physical instability of the protein of interest in lyophilized or liquid form. Exemplary stabilizers include sucrose, sorbitol, glycine, inositol, sodium chloride, methionine, arginine, and arginine hydrochloride.


The fusion protein(s) may be the sole active agent in the pharmaceutical compositions, or the compositions may further comprise one or more other active agents suitable for an intended use.


In another embodiment, the disclosure provides methods for treating a microbial infection in a subject, comprising administering to the subject an amount effective to treat the microbial infection of the fusion protein, composition, nucleic acid, expression vector, or host cell of any embodiment or combination of embodiments herein, particularly when the therapeutic peptide comprises an anti-microbial peptide.


As used herein, “treat” or “treating” means accomplishing one or more of the following: (a) reducing the severity of the disorder; (b) limiting or preventing development of symptoms characteristic of the disorder(s) being treated; (c) inhibiting worsening of symptoms characteristic of the disorder(s) being treated; (d) limiting or preventing recurrence of the disorder(s) in patients that have previously had the disorder(s); and (e) limiting or preventing recurrence of symptoms in patients that were previously symptomatic for the disorder(s).


The subject may be any subject that has a relevant disorder. In one embodiment, the subject is a mammal, including but not limited to humans, dogs, cats, horses, cattle, etc.


The methods may be used to treat any microbial infection that an AMP in the fusion protein has activity against, and at the site of which the linker will be susceptible to cleavage. In one embodiment, the microbial infection comprises a bacterial infection. In one such embodiment, the linker is susceptible to cleavage by a protease produced by the bacterial that is the cause/a cause of the infection or a protease produced by host cells in response to infection. In various non-limiting embodiments, the bacterial infection comprises one or more of pneumonia, a soft tissue infection, and endocarditis.


In another embodiment, the disclosure provides method for treating a cancer in a subject, comprising administering to the subject an amount effective to treat cancer of the fusion protein, composition, nucleic acid, expression vector, or host cell of any embodiment or combination of embodiments herein, particularly when the therapeutic peptide comprises an anti-cancer therapeutic. In one such embodiment, the methods may be used to treat cancer that a therapeutic peptide in the fusion protein has activity against, and at the site of which the linker will be susceptible to cleavage. In one such embodiment, the linker is susceptible to cleavage by a protease produced by cancer and stromal cells in tumor microenvironment.


Examples
A. General Purposes

We report design and development of activatable therapeutic peptide using antimicrobial peptide (AMP) as a model therapeutic peptide. As a specific example, we demonstrate development of a pro-AMP therapeutic based on albumin-binding domain (ABD)-AMP conjugate which is long-circulating with a masked biological activity that can be activated upon cleavage of the cleavable linker (FIG. 1). Comparing to a free AMP administration, the optimized ABD-AMP conjugate enhances delivery of active AMP to target diseased (infected) organ while lowering exposure of active AMP in other off-target organs (liver, kidneys, and spleen). Additionally, a targeting domain could be added to enhance on-target activation of the conjugate. This shift in biodistribution of active AMP with the conjugate formulation leads to improved safety profile of AMP supporting its systemic application.


B. Technical Description

The novel activatable AMP therapeutics exemplified herein comprise 4 components in tandem; (1) half-life extension domain (including but not limited to albumin-binding domain (ABD) and Fc), (2) anionic block, (3) cleavable linker (e.g. protease substrate), and (4) therapeutic payload (e.g. AMPs ((D)Pexiganan, (D)CAMEL0, Tachyplesin I, POL7080, and colistin)) (FIG. 2A). ABD associates with serum albumin following systemic administration, increasing effective size of the conjugate beyond renal filtration threshold leading to improved circulation time. Anionic block neutralizes cationic charge of AMPs to reduce liver sequestration of AMPs. Both albumin association and charge neutralization also confer activity masking on the AMPs. Cleavable linker facilitates conditional release of the AMPs upon specific trigger such as dysregulated proteases at the site of infection. Additional components such as targeting domain and imaging probe may be included to further increase functionality of the therapeutics (e.g. enhanced conjugate activation). For a specific example, ABD-AMP conjugate was formulated by site-specific conjugation of a chemically synthesized AMP to a recombinantly expressed carrier domain (ABD-anionic block-cleavable linker) via a small molecule cross linker (DBCO-Maleimide) (FIG. 2B).


To verify activity masking of our therapeutics, we evaluated antibacterial activity of our model ABD-AMP conjugate candidate ((ABD)2-(EEG)6-S12-(D)Pex) (FIG. 3A) with and without thrombin pre-activation on Pseudomonas aeruginosa PAO1. The intact conjugate exhibits 64 fold higher minimum inhibitory concentration (MIC) (defined as the lowest treatment concentration that fully inhibits bacteria growth), indicating efficient activity masking (FIG. 3B). Similarly, we verified that our intact conjugate could effectively mask mammalian toxicity on L929 fibroblasts (FIG. 3C) and hemolysis (FIG. 3D). Overall, formulating AMP as an ABD-AMP conjugate provides an activity mask that can be activated with cleavage of the cleavable linker.


Next, we comprehensively characterized our model ABD-AMP conjugate ((ABD)2-(EEG)6-S12-(D)Pex-Cy7) (FIG. 4A) in vivo in regard to pharmacokinetics and time-lapse biodistribution and activation in different organs. (ABD)2-(EEG)6-S12-(D)Pex-Cy7 exhibited significantly prolonged circulation time (91-fold increase in area-under-the-concentration-time curve (AUC)) compared to the free (D)Pex-Cy7 control (FIG. 4B). To study time-lapse biodistribution and conjugate activation in an infection context, we first infected mice with PAO1 via intratracheal instillation (IT) (FIG. 4C). The conjugate or free AMP control were intravenously (IV) administered 6 h post infection. The mice were euthanized at different time points, and organs were harvested and homogenized for quantification of conjugate and released AMP contents via SDS-PAGE analysis. We observed a time-dependent increase in active (D)Pex-Cy7 of the conjugate in the infected lungs whereas the free (D)Pex-Cy7 control group showed a steadily lower level of AMP which further declined at later time points (FIG. 4D). There is no significant difference in active AMP levels in the kidneys (FIG. 4E). In the liver and spleen, the levels of active AMP of conjugate are lower than those of the free AMP treatment groups, indicating less exposure of active AMP for the conjugate formulation in these organs (FIGS. 4F and G). When quantifying AUC fold difference in exposure of active AMP in different organs, our conjugate delivered 2.6 fold higher active AMP to the lungs compared to free AMP control while simultaneously reducing exposure to kidneys, liver, and spleen by 1.2, 3.6, and 1.6 fold, respectively (FIG. 4H). Alternatively, when estimating at the equivalent lung exposure of active AMP, our conjugate reduced exposure of active AMP to kidneys, liver, and spleen by 3.1, 9.3, and 4.2 fold, respectively (FIG. 4I). Hence, we demonstrated that formulating AMP as an ABD-AMP conjugate increases in vivo circulation time and improves on-target activation of active AMP while minimizing its exposure in other off-target organs.


Given a favorable biodistribution profile of ABD-AMP conjugate, we next evaluated if our conjugate could improve safety profile of AMP. We performed toxicity evaluation in mice following intravenous treatments with free AMP (D)Pex or conjugate (ABD)2-(EEG)6-S12-(D)Pex at 5 and 10 mg/kg AMP equivalent doses (FIG. 5A). Mice were observed for 24 h before euthanasia to collect serum and organs for analysis unless significant morbidity was observed that necessitated earlier euthanasia. Our conjugate was well tolerated up to the highest dose tested with no signs of distress (FIG. 5A). On the contrary, mice treated with 10 mg/kg (D)Pex showed evident signs of distress, and 3 out of 5 mice did not survive to the end point. In regard to serum chemistry analysis, mice treated with (D)Pex (10 mg/kg) had elevated serum levels of alanine aminotransferase (ALT), aspartate aminotransferase (AST) and blood nitrogen urea (BUN) exceeding the reference range, indicating liver and kidney malfunctions in those mice (FIG. 5B). In contrast, mice treated with the conjugate at the same equivalent AMP dose had ALT, AST, and BUN levels within the normal reference range. When evaluating H&E staining of organ sections, significant damage was observed in kidneys and spleen of mice treated with (D)Pex at 10 mg/kg (FIG. 5C). In the kidneys of the (D)Pex-treated mice, protein casts and apoptotic epithelial cells were observed in the renal tubule spaces which were also dilated. In the spleens of the (D)Pex-treated mice, patches of apoptotic lymphocytes were observed. These pathological features were not observed in mice treated with the conjugate at the equivalent AMP dose. Altogether, our conjugate formulation effectively improves safety profile of AMP following systemic administration.


To ensure broad application across different therapeutic peptides, we formulated additional AMPS (POL7080 and colistin) into ABD-AMP conjugates and evaluated the extent of activity masking. A 64-fold activity masking was similarly observed for both peptides in term of antibacterial activity on PAO1 (FIGS. 6A and B) confirming robust activity masking of AMPS across diverse secondary structures. Given that several AMPS are widely explored for anticancer property capitalizing on higher negative charge of cancer cell membrane, we determined whether ABD-AMP conjugate formulation could be utilized in the cancer context. Cell toxicity assay of (ABD)2-(EEG)6-S12-(D)Pex was performed on human hepatocellular carcinoma cell line HepG2 (FIG. 6C). As similarly observed in the antibacterial activity assay, the intact conjugate was natively inactive and the cell toxicity was only restored upon activation by target protease (Thrombin). Altogether, ABD-AMP conjugate platform represents a generalizable platform for formulating diverse activatable therapeutic peptides for different disease applications with aberrant protease microenvironment such as microbial infection and cancer.


Finally, we provided an example of a targeted ABD-AMP conjugate where a targeting domain is fused to the N-terminus of the construct to promote engagement in the infected microenvironment (FIG. 7A). Specifically, we formulated an LptD inhibitor POL7080 (a model AMP) as VHH3-(ABD)2-(EEG)6-S41-POL7080-Cy7 where VHH3 encodes a nanobody that targets ADAM10 protease abundantly expressed at the infection site. A biodistribution study of the targeted conjugate was performed in a non-neutropenic PAO1 lung infection model to evaluate the amount of intact and released AMPs at the infection site (FIG. 7B). Remarkably, the targeted conjugate increased the amount of released POL7080-Cy7 in the infected lungs by 3.3 fold compared to the free POL7080-Cy7 control group while the non-targeted conjugate had a similar level of released POL7080-Cy7 as the control group. In summary, incorporation of a targeting domain could provide a further improvement in functionalities of the conjugate as illustrated by the enhanced conjugate activation for the ADAM10-targeted conjugate. Generally, alternative targets (e.g. immune cell targets (Ly6G, Ly6C, CD177, CD11b, CD11c, CD8) or other proteases (Neutrophil elastase, matrix metalloprotienases, ADAMs, fibroblast activation protein)) could be suitably employed in different disease contexts.


C. Conclusions

We designed and developed a platform for formulating activatable therapeutic peptides. Our model construct (ABD-AMP conjugate) exhibits robust activity masking that can be liberated upon cleavage of the cleavable linker. Following systemic administration, the conjugate selectively delivers active AMP to target infected organ while minimizing exposure in other off-target organs leading to improved safety profile of the conjugate. The formulation will increase utility of AMPs for systemic application by increasing circulation time, improving target organ bioavailability and preventing off-target organ toxicity.


D. Advantages and Improvements Over Existing Methods, Devices, or Materials

Long circulation. AMPS are typically small in size (<5 kDa) and cationic. Hence, they are rapidly cleared off the body either via kidney filtration and/or liver sequestration. Our formulation provides a charge stealth on AMP which reduces liver sequestration while simultaneously increasing its effective size via albumin association to reduce renal filtration. No existing formulation has capitalized on both cationic charge neutralization and steric albumin association in a single conjugate to formulate a pro-AMP therapeutic.


On-target delivery to diseased (infection) organ. Conditional release of AMP upon cleavage of cleavable linker provides an opportunity to tune release profile via optimization of the linker.


Reduction in off-target organ exposure. Our therapeutic conjugate is inherently inactive until activated. Optimization of cleavable linker ensures minimal activation in other off-target organs which leads to improved safety profile of systemically administered AMP.


REFERENCE



  • 1. Global burden of bacterial antimicrobial resistance in 2019: a systematic analysis. Lancet (London, England) (2022), doi:10.1016/S0140-6736(21)02724-0.

  • 2. N. Mookherjee, M. A. Anderson, H. P. Haagsman, D. J. Davidson, Antimicrobial host defence peptides: functions and clinical potential. Nat. Rev. Drug Discov. 19, 311-332 (2020).

  • 3. C. D. Fjell, J. A. Hiss, R. E. W. Hancock, G. Schneider, Designing antimicrobial peptides: form follows function. Nat. Rev. Drug Discov. 11, 37-51 (2011).

  • 4. M. Lei, A. Jayaraman, J. A. Van Deventer, K. Lee, Engineering Selectively Targeting Antimicrobial Peptides. Annu. Rev. Biomed. Eng. 23, 339-357 (2021).

  • 5. B. H. Gan, J. Gaynord, S. M. Rowe, T. Deingruber, D. R. Spring, The multifaceted nature of antimicrobial peptides: current synthetic chemistry approaches and future directions. Chem. Soc. Rev. 50, 7820-7880 (2021).


Claims
  • 1. A fusion protein, comprising: (a) at least one X1 domain comprising a half-life extension compound, including but not limited to a half-life extension polypeptide;(b) at least one X2 domain comprising an anionic block(c) at least one X3 domain comprising a linker susceptible to cleavage at a site of disease, including but not limited to a microbial infection site or tumor site; and(d) at least one X4 domain comprising a therapeutic peptide;wherein the at least one X1, X2, X3, and X4 domains are covalently linked, and each X4 domain is linked to an X3 domain without an intervening X1 or X2 domain.
  • 2. The fusion protein of claim 1, comprising a linear fusion protein.
  • 3. The fusion protein of claim 2, wherein one or more of the following is true: (a) each X4 domain is linked to at least one X3 domain without an intervening X1 or X2 domain,(b) an X4 domain is present at one terminus of the fusion protein,(c) the fusion protein includes only 1 X2 domain,(d) the fusion protein includes only 1 X4 domain,(e) the fusion protein comprises 1, 2, or 3 X1 domain,(f) the fusion protein comprises 2 or 3 X1 domains, and wherein at least 2 X1 domains are linked without an intervening X2, X3, or X4 domain,(g) the fusion protein comprises 1, 2, 3, or 4 X3 domains,(h) the fusion protein comprises 2, 3, or 4 X3 domains, and wherein at least 2 X3 domains are linked without an intervening X1, X2, or X4 domain, and optionally wherein at least one X1 domain is linked to an X2 domain without an intervening X3 or X4 domain, and/or(i) the protein further comprises at least one X5 domain comprising a targeting polypeptide.
  • 4-11. (canceled)
  • 12. The fusion protein of claim 2, wherein the fusion proteins comprise a general formula selected from the group consisting of: X1-X2-X3-X4;X4-X3-X2-X1;X1-X1-X2-X3-X4;X4-X3-X2-X1-X1;X1-X1-X2-X3-X3-X4;X1-X1-X2-X3-X3-X3-X4;X1-X3-X1-X2-X3-X4;X1-X1-X3-X2-X3-X4;X1-X3-X1-X3-X2-X3-X4;X1-X3-X1-X2-X3-X3-X4;X1-X1-X3-X2-X3-X3-X4;X1-X3-X1-X3-X2-X3-X3-X4;X5-X1-X2-X3-X4;X1-X5-X2-X3-X4;X4-X3-X2-X1-X5;X4-X3-X2-X5-X1;X1-X5-X1-X2-X3-X4;X5-X1-X1-X2-X3-X4;X5-X5-X1-X1-X2-X3-X4;X5-X1-X5-X1-X2-X3-X4;X1-X5-X1-X5-X2-X3-X4;X1-X5-X1-X2-X3-X3-X4;X5-X1-X1-X2-X3-X3-X4;X5-X1-X1-X3-X2-X3-X3-X4;X5-X5-X1-X1-X2-X3-X3-X4;X5-X5-X1-X1-X2-X3-X3-X3-X4;X1-X1-X5-X5-X2-X3-X3-X4;X1-X1-X5-X5-X3-X2-X3-X3-X4;X1-X1-X3-X5-X5-X2-X3-X3-X4;X1-X1-X3-X5-X5-X3-X2-X3-X3-X4;X5-X5-X1-X1-X3-X2-X3-X3-X4;X5-X5-X5-X1-X1-X2-X3-X3-X4;X5-X5-X5-X1-X1-X3-X2-X3-X3-X4;X5-X1-X5-X1-X2-X3-X3-X4;X1-X5-X1-X5-X2-X3-X3-X4; andX5-X1-X5-X1-X5-X1-X2-X3-X3-X4.
  • 13. The fusion protein of claim 1, wherein the fusion protein comprises a branched fusion protein.
  • 14. The fusion protein of claim 13, wherein one or more of the following are true: (a) an X4 domain is present at a terminus of the fusion protein,(b) a branch of the fusion protein comprises at least one X3 domain,(c) a branch of the fusion protein comprises at least one X3 domain linked to an X4 domain,(d) the fusion protein further comprises at least one X5 domain comprising a targeting polypeptide,(e) a branch of the fusion protein is linked to the primary fusion protein backbone at a location between an X2 domain and one of an X1, X2, X3, or X5 domain,(f) the fusion protein includes only 1 X2 domain,(g) the fusion protein includes only 1 X4 domain,(h) the fusion protein comprises 1 or 2 X1 domains,(i) the fusion protein comprises 2 X1 domains, and wherein the 2 X1 domains are linked without an intervening X2, X3, or X4 domain,(j) the fusion protein comprises 1, 2, 3, or 4 X3 domains, and/or(k) the fusion proteins comprise a general formula selected from the group listed in Table 1.
  • 15-24. (canceled)
  • 25. The fusion protein of claim 1, wherein each X1 domain independently comprises a half-life extension compound selected from the group consisting of an albumin-binding polypeptide, an antibody/Fc domain (such as human Fc or mouse Fc), an unstructured XTEN polypeptide, a proline/alanine-rich sequence polypeptide (PAS), and poly(ethylene glycol).
  • 26. (canceled)
  • 27. The fusion protein of claim 1, wherein each X1 domain independently comprises the amino acid sequence selected from the group consisting of SEQ ID NO:1-5.
  • 28. The fusion protein of claim 1, wherein each X2 domain independently comprises the amino acid sequence selected from the group consisting of (E/D)x, x=1-20;(E/D)x(G/S/A)x, x=1-20;((E/D)(E/D)(G/S/A))x, x=1-20;((E/D)(G/S/A)(E/D))x, x=1-20;(E/D)x(G/S/A)x(E/D)x, x=1-20;(E/D)x((E/D)K)x, x=1-20; and(EEG)x, wherein “x” is 1-20, 2-16, 3-12, 4-10, 5-8, 1-15, 1-10, 2-10, 3-10, 4-10, 5-10, 2-8, 3-8, 4-8, 5-8, 5-7, 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10.
  • 29. (canceled)
  • 30. The fusion protein of claim 1, wherein each X3 domain independently comprises an amino acid sequence selected from the linker amino acid sequences selected from SEQ ID NO: 6-70, 171, and 173-187 and/or the non-peptide linkers shown in Table 4 (S86-S96).
  • 31. The fusion protein of claim 1, wherein at least one spacing sequence is inserted between one or more different domains, optionally wherein at least one spacing sequence independently comprises the amino acid sequence selected from Gx, (GS)x, (GGS)x, (GSA)x, (GGGS)x (SEQ ID NO: 71), (GGGGS)x (SEQ ID NO: 72), where x=1-4, and SPSTPPTPSPSTPP (SEQ ID NO: 73).
  • 32. The fusion protein of claim 1, wherein at least one, or each, X4 domain comprises a cationic therapeutic peptide, an anti-microbial peptide, an anti-cancer peptide, ad/or a hydrophobic therapeutic peptide, or wherein at least one, or each, X4 domain comprises an amino acid sequence selected from the group consisting of SEQ ID NO:74-103.
  • 33. (canceled)
  • 34. The fusion protein of claim 1, wherein each X4 domain comprises an amino acid sequence selected from the group consisting of SEQ ID NO:104-136, or wherein each X4 domain independently comprises the amino acid sequence selected from the group consisting of
  • 35. (canceled)
  • 36. The fusion protein of claim 1, wherein: (a) each X1 domain comprises the amino acid sequence (MG)LKEAKEKA IEELKKAGIT SDYYFDLINK AKTVEGVNAL KDEILKA (SEQ ID NO: 4), wherein the residues in parentheses are optional and may be present or absent, or wherein each X1 domain independently comprises the amino acid sequence LKEAKEKA IEELKKAGIT SDYYFDLINK AKTVEGVNAL KDEILKA (SEQ ID NO: 5);(b) each X2 domain comprises the amino acid sequence (EEG)x, wherein “x” is 1-20, 2-16, 3-12, 4-10, 5-8, 1-15, 1-10, 2-10, 3-10, 4-10, 5-10, 2-8, 3-8, 4-8, 5-8, 5-7, 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10;(c) each X3 domain comprises the amino acid sequence PLGLRSW (SEQ ID NO: 16); and(d) each X4 domain comprises the amino acid sequence GiGkflkkakkfGkafvkilkk (SEQ ID NO: 137); and/or GiGkflkkakkfGkafvkilkkK (SEQ ID NO: 138); and/or Cyclo-(T-W-I-(Dab)-(Orn)-(Dab)-(Dab)-W-(Dab)-(Dab)-A-S-p-P) (SEQ ID NO: 139); and/or 6-methyloctanoic acid-Dab-T-Dab-(Dab-Dab-1-L-Dab-Dab-T) (SEQ ID NO: 140);wherein residues in lower case are D amino acids and residues in upper case are L amino acids or glycine (which is achiral) and Dab and Orn denote L-diaminobutyric acid and L-ornithine respectively.
  • 37-40. (canceled)
  • 41. The fusion protein of claim 1, comprising the structure selected from SEQ ID NO:156-170 and 193-204, wherein any detectable labels are optional.
  • 42. A composition, comprising a plurality of fusion proteins according to claim 1.
  • 43-45. (canceled)
  • 46. A nucleic acid encoding the fusion protein of claim 40.
  • 47. An expression vector comprising the nucleic acid of claim 46 operatively linked to a suitable regulatory sequence.
  • 48. A host cell comprising the expression vector of claim 47.
  • 49. A method for treating cancer or a microbial infection in a subject, comprising administering to the subject an amount effective to treat the cancer or the microbial infection of the fusion protein, composition, nucleic acid, expression vector, or host cell of any preceding claim.
  • 50-52. (canceled)
RELATED APPLICATIONS

This application claims priority to U.S. Provisional Patent Application Ser. No. 63/341,061 filed May 12, 2022, incorporated by reference herein in its entirety.

STATEMENT OF GOVERNMENT SUPPORT

This invention was made with government support under AI142780 and AI132413 awarded by the National Institutes of Health. The government has certain rights in the invention.

Provisional Applications (1)
Number Date Country
63341061 May 2022 US