The transforming growth factor-beta (TGF-beta) superfamily contains a variety of growth factors that share common sequence elements and structural motifs. These proteins are known to exert biological effects on a large variety of cell types in both vertebrates and invertebrates. Members of the superfamily perform important functions during embryonic development in pattern formation and tissue specification and can influence a variety of differentiation processes, including adipogenesis, myogenesis, chondrogenesis, cardiogenesis, hematopoiesis, neurogenesis, and epithelial cell differentiation. The superfamily is divided into two general phylogenetic clades: the more recently evolved members of the superfamily, which includes TGF-betas, activins, and nodal and the clade of more distantly related proteins of the superfamily, which includes a number of BMPs and GDFs. Hinck (2012) FEBS Letters 586:1860-1870. TGF-beta superfamily members have diverse, often complementary biological effects. By manipulating the activity of a member of the TGF-beta superfamily, it is often possible to cause significant physiological changes in an organism. For example, the Piedmontese and Belgian Blue cattle breeds carry a loss-of-function mutation in the GDF8 (also called myostatin) gene that causes a marked increase in muscle mass. Grobet et al. (1997) Nat Genet., 17(1):71-4. Furthermore, in humans, inactive alleles of GDF8 are associated with increased muscle mass and, reportedly, exceptional strength. Schuelke et al. (2004) N Engl J Med, 350:2682-8.
Changes in muscle, bone, fat, red blood cells, and other tissues may be achieved by enhancing or inhibiting signaling (e.g., SMAD 1, 2, 3, 5, and/or 8) that is mediated by ligands of the TGF-beta superfamily. Thus, there is a need for agents that regulate the activity of various ligands of the TGF-beta superfamily.
In part, the disclosure provides heteromultimeric complexes comprising a single TGF-beta superfamily co-receptor polypeptide (e.g., an endoglin, betaglycan, Cripto-1, Cryptic, Cryptic family protein 1B, Crim1, Crim2, BAMBI, BMPER, RGM-A, RGM-B, MuSK, and hemojuvelin polypeptide), including fragments and variants thereof. These constructs may be referred to herein as “single-arm” polypeptide complexes. Optionally, single-arm polypeptide complexes disclosed herein have different ligand-binding specificities/profiles compared to a corresponding homodimeric complex.
Heteromultimeric structures include, for example, heterodimers, heterotrimers, and higher order complexes. Preferably, TGF-beta superfamily co-receptor polypeptides as described herein comprise a ligand-binding domain of the receptor, for example, an extracellular domain of a TGF-beta superfamily co-receptor. Accordingly, in certain aspects, protein complexes described herein comprise a ligand-biding domain of a TGF-beta superfamily co-receptor selected from: endoglin, betaglycan, Cripto-1, Cryptic, Cryptic family protein 1B, Crim1, Crim2, BAMBI, BMPER, RGM-A, RGM-B, MuSK, and hemojuvelin, as well as truncations and variants thereof. Preferably, TGF-beta superfamily co-receptor polypeptides as described herein, as well as protein complexes comprising the same, are soluble. In certain aspects, heteromultimer of the disclosure bind to one or more TGF-beta superfamily ligands (e.g., BMP2, BMP2/7, BMP3, BMP4, BMP4/7, BMP5, BMP6, BMP7, BMP8a, BMP8b, BMP9, BMP10, GDF3, GDF5, GDF6/BMP13, GDF7, GDF8, GDF9b/BMP15, GDF11/BMP11, GDF15/MIC1, TGF-β1, TGF-β2, TGF-β3, activin A, activin B, activin C, activin E, activin AB, activin AC, activin AE, activin BC, activin BE, nodal, glial cell-derived neurotrophic factor (GDNF), neurturin, artemin, persephin, Müllerian-inhibiting substance (MIS), and Lefty). Optionally, heteromers of the disclosure bind to one or more of these ligands with a KD of less than or equal to 10−8, 10−9, 10−10, 10−11, or 10−12. In general, heteromultimer complexes of the disclosure antagonize (inhibit) one or more activities of at least one TGF-beta superfamily ligand, and such alterations in activity may be measured using various assays known in the art, including, for example, a cell-based assay as described herein. Preferably, protein complexes of the disclosure exhibit a serum half-life of at least 4, 6, 12, 24, 36, 48, or 72 hours in a mammal (e.g., a mouse or a human). Optionally, protein complexes of the disclosure may exhibit a serum half-life of at least 6, 8, 10, 12, 14, 20, 25, or 30 days in a mammal (e.g., a mouse or a human).
In certain aspects, protein complexes described herein comprise a first polypeptide covalently or non-covalently associated with a second polypeptide wherein the first polypeptide comprises the amino acid sequence of a TGF-beta superfamily co-receptor polypeptide and the amino acid sequence of a first member of an interaction pair and the second polypeptide comprises a second member of the interaction pair and does not contain an amino acid sequence of a TGF-beta superfamily co-receptor polypeptide. Optionally, the second polypeptide comprises, in addition to the second member of the interaction pair, a further polypeptide sequence that is not a TGF-beta superfamily co-receptor polypeptide and may optionally comprise not more than 5, 10, 15, 20, 30, 40, 50, 100, 200, 300, 400 or 500 amino acids. Optionally, the TGF-beta superfamily co-receptor polypeptide is connected directly to the first member of the interaction pair, or an intervening sequence, such as a linker, may be positioned between the amino acid sequence of the TGF-beta superfamily co-receptor polypeptide and the amino acid sequence of the first member of the interaction pair. Examples of linkers include, but are not limited to, the sequences TGGG (SEQ ID NO: 162), TGGGG (SEQ ID NO: 160), SGGGG (SEQ ID NO: 161), SGGG (SEQ ID NO: 163), GGGG (SEQ ID NO: 159), and GGG (SEQ ID NO: 158).
Interaction pairs described herein are designed to promote dimerization or form higher order multimers. In some embodiments, the interaction pair may be any two polypeptide sequences that interact to form a complex, particularly a heterodimeric complex although operative embodiments may also employ an interaction pair that forms a homodimeric complex. The first and second members of the interaction pair may be an asymmetric pair, meaning that the members of the pair preferentially associate with each other rather than self-associate. Accordingly, first and second members of an asymmetric interaction pair may associate to form a heterodimeric complex. Alternatively, the interaction pair may be unguided, meaning that the members of the pair may associate with each other or self-associate without substantial preference and thus may have the same or different amino acid sequences. Accordingly, first and second members of an unguided interaction pair may associate to form a homodimer complex or a heterodimeric complex. Optionally, the first member of the interaction pair (e.g., an asymmetric pair or an unguided interaction pair) associates covalently with the second member of the interaction pair. Optionally, the first member of the interaction pair (e.g., an asymmetric pair or an unguided interaction pair) associates non-covalently with the second member of the interaction pair.
Traditional Fc fusion proteins and antibodies are examples of unguided interaction pairs, whereas a variety of engineered Fc domains have been designed as asymmetric interaction pairs. Therefore, a first member and/or a second member of an interaction pair described herein may comprise a constant domain of an immunoglobulin, including, for example, the Fc portion of an immunoglobulin. Optionally, a first member of an interaction pair may comprise an amino acid sequence that is derived from an Fc domain of an IgG1, IgG2, IgG3, or IgG4 immunoglobulin. For example, the first member of an interaction pair may comprise, consist essentially of, or consist of an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to any one of SEQ ID NOs: 200-214, 502, 503, 506, or 507. Optionally, a second member of an interaction pair may comprise an amino acid sequence that is derived from an Fc domain of an IgG1, IgG2, IgG3, or IgG4. For example, the second member of an interaction pair may comprise, consist essentially of, or consist of an amino acid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to any one of SEQ ID NOs: 200-214, 502, 503, 506, or 507. In some embodiments, a first member and a second member of an interaction pair comprise Fc domains derived from the same immunoglobulin class and subtype. In other embodiments, a first member and a second member of an interaction pair comprise Fc domains derived from different immunoglobulin classes or subtypes. Optionally, a first member and/or a second member of an interaction pair (e.g., an asymmetric pair or an unguided interaction pair) comprise a modified constant domain of an immunoglobulin, including, for example, a modified Fc portion of an immunoglobulin. For example, protein complexes of the disclosure may comprise a first Fc portion of an IgG comprising an amino acid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to an amino acid sequence selected from the group: SEQ ID NOs: 200-214, 502, 503, 506, or 507 and a second Fc portion of an IgG, which may be the same or different from the amino acid sequence of the first modified Fc portion of the IgG, comprising an amino acid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to an amino acid sequence selected from the group: SEQ ID NOs: 200-214, 502, 503, 506, or 507.
In some embodiments, the disclosure provides heteromeric polypeptide complexes comprising a single TGF-beta superfamily co-receptor polypeptide, wherein the TGF-beta superfamily receptor polypeptide is derived from an endoglin polypeptide. For example, endoglin polypeptides may comprise of an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to an endoglin sequence disclosed herein (e.g., SEQ ID NOs: 1, 2, 5, 6, 9, 10, 500, 501, 504, and 505). Optionally, endoglin polypeptides of the disclosure may be fusion proteins that further comprise one or more portions (domains) that are heterologous to endoglin. For example, an endoglin polypeptide may be fused to a heterologous polypeptide that comprises a multimerization domain, optionally with a linker domain positioned between the endoglin polypeptide and the heterologous polypeptide (e.g., SEQ ID NOs: 500, 501, 504, and 505). In some embodiments, multimerization domains described herein comprise one component of an interaction pair. Heteromeric complexes that comprise an endoglin polypeptide do not comprise a type I receptor, type II receptor, or another co-receptor TGF-beta superfamily polypeptide but may contain additional polypeptides that are not type I receptor, type II receptor, or co-receptor TGF-beta superfamily polypeptides.
In some embodiments, the disclosure provides heteromeric polypeptide complexes comprising a single TGF-beta superfamily co-receptor polypeptide, wherein the TGF-beta superfamily receptor polypeptide is derived from a betaglycan polypeptide. For example, betaglycan polypeptides may comprise of an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to an betaglycan sequence disclosed herein (e.g., SEQ ID NOs: 85, 86, 89, 90, 548, 549, 550, or 551). Optionally, betaglycan polypeptides of the disclosure may be fusion proteins that further comprise one or more portions (domains) that are heterologous to endoglin. For example, an betaglycan polypeptide may be fused to a heterologous polypeptide that comprises a multimerization domain, optionally with a linker domain positioned between the betaglycan polypeptide and the heterologous polypeptide (e.g., SEQ ID NOs: 548, 549, 550, or 551). In some embodiments, multimerization domains described herein comprise one component of an interaction pair. Heteromeric complexes that comprise a betaglycan polypeptide do not comprise a type I receptor, type II receptor, or another co-receptor TGF-beta superfamily polypeptide but may contain additional polypeptides that are not type I receptor, type II receptor, or co-receptor TGF-beta superfamily polypeptides.
In some embodiments, the disclosure provides heteromeric polypeptide complexes comprising a single TGF-beta superfamily co-receptor polypeptide, wherein the TGF-beta superfamily receptor polypeptide is derived from a Cripto-1 polypeptide. For example, Cripto-1 polypeptides may comprise of an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to an Cripto-1 sequence disclosed herein (e.g., SEQ ID NOs: 13, 14, 17, 18, 508, 509, 510, or 511). Optionally, Cripto-1 polypeptides of the disclosure may be fusion proteins that further comprise one or more portions (domains) that are heterologous to endoglin. For example, a Cripto-1 polypeptide may be fused to a heterologous polypeptide that comprises a multimerization domain, optionally with a linker domain positioned between the Cripto-1 polypeptide and the heterologous polypeptide (e.g., SEQ ID NOs: 508, 509, 510, or 511). In some embodiments, multimerization domains described herein comprise one component of an interaction pair. Heteromeric complexes that comprise an Cripto-1 polypeptide do not comprise a type I receptor, type II receptor, or another co-receptor TGF-beta superfamily polypeptide but may contain additional polypeptides that are not type I receptor, type II receptor, or co-receptor TGF-beta superfamily polypeptides.
In some embodiments, the disclosure provides heteromeric polypeptide complexes comprising a single TGF-beta superfamily co-receptor polypeptide, wherein the TGF-beta superfamily receptor polypeptide is derived from a Cryptic polypeptide. For example, Cryptic polypeptides may comprise of an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to a Cryptic sequence disclosed herein (e.g., SEQ ID NOs: 21, 22, 25, 26, 29, 30, 512, 513, 514, or 515). Optionally, Cryptic polypeptides of the disclosure may be fusion proteins that further comprise one or more portions (domains) that are heterologous to endoglin. For example, a Cryptic polypeptide may be fused to a heterologous polypeptide that comprises a multimerization domain, optionally with a linker domain positioned between the Cryptic polypeptide and the heterologous polypeptide (e.g., SEQ ID NOs: 512, 513, 514, or 515). In some embodiments, multimerization domains described herein comprise one component of an interaction pair. Heteromeric complexes that comprise an Cryptic polypeptide do not comprise a type I receptor, type II receptor, or another co-receptor TGF-beta superfamily polypeptide but may contain additional polypeptides that are not type I receptor, type II receptor, or co-receptor TGF-beta superfamily polypeptides.
In some embodiments, the disclosure provides heteromeric polypeptide complexes comprising a single TGF-beta superfamily co-receptor polypeptide, wherein the TGF-beta superfamily receptor polypeptide is derived from a Cryptic family protein 1B polypeptide. For example, Cryptic family protein 1B polypeptides may comprise of an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to a Cryptic family protein 1B sequence disclosed herein (e.g., SEQ ID NOs: 33, 34, 516, 517, 518, or 519). Optionally, Cryptic family protein 1B polypeptides of the disclosure may be fusion proteins that further comprise one or more portions (domains) that are heterologous to endoglin. For example, a Cryptic family protein 1B polypeptide may be fused to a heterologous polypeptide that comprises a multimerization domain, optionally with a linker domain positioned between the Cryptic family protein 1B polypeptide and the heterologous polypeptide (e.g., SEQ ID NOs: 516, 517, 518, or 519). In some embodiments, multimerization domains described herein comprise one component of an interaction pair. Heteromeric complexes that comprise a Cryptic family protein 1B polypeptide do not comprise a type I receptor, type II receptor, or another co-receptor TGF-beta superfamily polypeptide but may contain additional polypeptides that are not type I receptor, type II receptor, or co-receptor TGF-beta superfamily polypeptides.
In some embodiments, the disclosure provides heteromeric polypeptide complexes comprising a single TGF-beta superfamily co-receptor polypeptide, wherein the TGF-beta superfamily receptor polypeptide is derived from a Crim1 polypeptide. For example, Crim1 polypeptides may comprise of an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to a Crim1 sequence disclosed herein (e.g., SEQ ID NOs: 37, 38, 520, 521, 522, or 523). Optionally, Crim1 polypeptides of the disclosure may be fusion proteins that further comprise one or more portions (domains) that are heterologous to endoglin. For example, a Crim1 polypeptide may be fused to a heterologous polypeptide that comprises a multimerization domain, optionally with a linker domain positioned between the Crim1 polypeptide and the heterologous polypeptide (e.g., SEQ ID NOs: 520, 521, 522, or 523). In some embodiments, multimerization domains described herein comprise one component of an interaction pair. Heteromeric complexes that comprise a CrimI polypeptide do not comprise a type I receptor, type II receptor, or another co-receptor TGF-beta superfamily polypeptide but may contain additional polypeptides that are not type I receptor, type II receptor, or co-receptor TGF-beta superfamily polypeptides.
In some embodiments, the disclosure provides heteromeric polypeptide complexes comprising a single TGF-beta superfamily co-receptor polypeptide, wherein the TGF-beta superfamily receptor polypeptide is derived from a Crim2 polypeptide. For example, Crim2 polypeptides may comprise of an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to a Crim2 sequence disclosed herein (e.g., SEQ ID NOs: 41, 42, 45, 46, 524, 525, 526, or 527). Optionally, Crim2 polypeptides of the disclosure may be fusion proteins that further comprise one or more portions (domains) that are heterologous to endoglin. For example, a Crim2 polypeptide may be fused to a heterologous polypeptide that comprises a multimerization domain, optionally with a linker domain positioned between the Crim2 polypeptide and the heterologous polypeptide (e.g., SEQ ID NOs: 524, 525, 526, or 527). In some embodiments, multimerization domains described herein comprise one component of an interaction pair. Heteromeric complexes that comprise a Crim2 polypeptide do not comprise a type I receptor, type II receptor, or another co-receptor TGF-beta superfamily polypeptide but may contain additional polypeptides that are not type I receptor, type II receptor, or co-receptor TGF-beta superfamily polypeptides.
In some embodiments, the disclosure provides heteromeric polypeptide complexes comprising a single TGF-beta superfamily co-receptor polypeptide, wherein the TGF-beta superfamily receptor polypeptide is derived from a BAMBI polypeptide. For example, BAMBI polypeptides may comprise of an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to a BAMBI sequence disclosed herein (e.g., SEQ ID NOs: 49, 50, 528, 529, 530, or 531). Optionally, BAMBI polypeptides of the disclosure may be fusion proteins that further comprise one or more portions (domains) that are heterologous to endoglin. For example, a BAMBI polypeptide may be fused to a heterologous polypeptide that comprises a multimerization domain, optionally with a linker domain positioned between the BAMBI polypeptide and the heterologous polypeptide (e.g., SEQ ID NOs: 528, 529, 530, or 531). In some embodiments, multimerization domains described herein comprise one component of an interaction pair. Heteromeric complexes that comprise a BAMBI polypeptide do not comprise a type I receptor, type II receptor, or another co-receptor TGF-beta superfamily polypeptide but may contain additional polypeptides that are not type I receptor, type II receptor, or co-receptor TGF-beta superfamily polypeptides.
In some embodiments, the disclosure provides heteromeric polypeptide complexes comprising a single TGF-beta superfamily co-receptor polypeptide, wherein the TGF-beta superfamily receptor polypeptide is derived from a BMPER polypeptide. For example, BMPER polypeptides may comprise of an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to a BMPER sequence disclosed herein (e.g., SEQ ID NOs: 53, 54, 532, 533, 534, or 535). Optionally, BMPER polypeptides of the disclosure may be fusion proteins that further comprise one or more portions (domains) that are heterologous to endoglin. For example, a BMPER polypeptide may be fused to a heterologous polypeptide that comprises a multimerization domain, optionally with a linker domain positioned between the BMPER polypeptide and the heterologous polypeptide (e.g., SEQ ID NOs: 532, 533, 534, or 535). In some embodiments, multimerization domains described herein comprise one component of an interaction pair. Heteromeric complexes that comprise a BMPER polypeptide do not comprise a type I receptor, type II receptor, or another co-receptor TGF-beta superfamily polypeptide but may contain additional polypeptides that are not type I receptor, type II receptor, or co-receptor TGF-beta superfamily polypeptides.
In some embodiments, the disclosure provides heteromeric polypeptide complexes comprising a single TGF-beta superfamily co-receptor polypeptide, wherein the TGF-beta superfamily receptor polypeptide is derived from a RGM-A polypeptide. For example, RGM-A polypeptides may comprise of an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to a RGM-A sequence disclosed herein (e.g., SEQ ID NOs: 61, 62, 65, 66, 69, 70, 540, 541, 542, or 543). Optionally, RGM-A polypeptides of the disclosure may be fusion proteins that further comprise one or more portions (domains) that are heterologous to endoglin. For example, a RGM-A polypeptide may be fused to a heterologous polypeptide that comprises a multimerization domain, optionally with a linker domain positioned between the RGM-A polypeptide and the heterologous polypeptide (e.g., SEQ ID NOs: 540, 541, 542, or 543). In some embodiments, multimerization domains described herein comprise one component of an interaction pair. Heteromeric complexes that comprise a RGM-A polypeptide do not comprise a type I receptor, type II receptor, or another co-receptor TGF-beta superfamily polypeptide but may contain additional polypeptides that are not type I receptor, type II receptor, or co-receptor TGF-beta superfamily polypeptides.
In some embodiments, the disclosure provides heteromeric polypeptide complexes comprising a single TGF-beta superfamily co-receptor polypeptide, wherein the TGF-beta superfamily receptor polypeptide is derived from a RGM-B polypeptide. For example, RGM-B polypeptides may comprise of an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to an RGM-B sequence disclosed herein (e.g., SEQ ID NOs: 57, 58, 536, 537, 538, or 539). Optionally, RGM-B polypeptides of the disclosure may be fusion proteins that further comprise one or more portions (domains) that are heterologous to endoglin. For example, a RGM-B polypeptide may be fused to a heterologous polypeptide that comprises a multimerization domain, optionally with a linker domain positioned between the RGM-B polypeptide and the heterologous polypeptide (e.g., SEQ ID NOs: 536, 537, 538, or 539). In some embodiments, multimerization domains described herein comprise one component of an interaction pair. Heteromeric complexes that comprise a RGM-B polypeptide do not comprise a type I receptor, type II receptor, or another co-receptor TGF-beta superfamily polypeptide but may contain additional polypeptides that are not type I receptor, type II receptor, or co-receptor TGF-beta superfamily polypeptides.
In some embodiments, the disclosure provides heteromeric polypeptide complexes comprising a single TGF-beta superfamily co-receptor polypeptide, wherein the TGF-beta superfamily receptor polypeptide is derived from a hemojuvelin polypeptide. For example, hemojuvelin polypeptides may comprise of an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to a hemojuvelin sequence disclosed herein (e.g., SEQ ID NOs: 73, 74, 77, 78, 81, 82, 544, 545, 546, or 547). Optionally, hemojuvelin polypeptides of the disclosure may be fusion proteins that further comprise one or more portions (domains) that are heterologous to endoglin. For example, a hemojuvelin polypeptide may be fused to a heterologous polypeptide that comprises a multimerization domain, optionally with a linker domain positioned between the hemojuvelin polypeptide and the heterologous polypeptide (e.g., SEQ ID NOs: 544, 545, 546, or 547). In some embodiments, multimerization domains described herein comprise one component of an interaction pair. Heteromeric complexes that comprise a hemojuvelin polypeptide do not comprise a type I receptor, type II receptor, or another co-receptor TGF-beta superfamily polypeptide but may contain additional polypeptides that are not type I receptor, type II receptor, or co-receptor TGF-beta superfamily polypeptides.
In some embodiments, the disclosure provides heteromeric polypeptide complexes comprising a single TGF-beta superfamily co-receptor polypeptide, wherein the TGF-beta superfamily receptor polypeptide is derived from a MuSK polypeptide. For example, MuSK polypeptides may comprise of an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to a MuSK sequence disclosed herein (e.g., SEQ ID NOs: 95, 96, 99, 100, 103, 104, 552, 553, 554, or 555). Optionally, MuSK polypeptides of the disclosure may be fusion proteins that further comprise one or more portions (domains) that are heterologous to endoglin. For example, a MuSK polypeptide may be fused to a heterologous polypeptide that comprises a multimerization domain, optionally with a linker domain positioned between the MuSK polypeptide and the heterologous polypeptide (e.g., SEQ ID NOs: 552, 553, 554, or 555). In some embodiments, multimerization domains described herein comprise one component of an interaction pair. Heteromeric complexes that comprise a MuSK polypeptide do not comprise a type I receptor, type II receptor, or another co-receptor TGF-beta superfamily polypeptide but may contain additional polypeptides that are not type I receptor, type II receptor, or co-receptor TGF-beta superfamily polypeptides.
In some embodiments, the TGF-beta superfamily co-receptor polypeptides disclosed herein comprise one or more modified amino acid residues selected from: a glycosylated amino acid, a PEGylated amino acid, a farnesylated amino acid, an acetylated amino acid, a biotinylated amino acid, an amino acid conjugated to a lipid moiety, and an amino acid conjugated to an organic derivatizing agent. In some embodiments, the co-receptor polypeptides described herein are glycosylated and have a glycosylation pattern obtainable from the expression of the polypeptides in a mammalian cell, including, for example, a CHO cell.
In certain aspects the disclosure provides nucleic acids encoding any of the TGF-beta superfamily co-receptor polypeptides described herein, including any fusion proteins comprising members of an interaction pair. Nucleic acids disclosed herein may be operably linked to a promoter for expression, and the disclosure further provides cells transformed with such recombinant polynucleotides. Preferably the cell is a mammalian cell such as a COS cell or a CHO cell.
In certain aspects, the disclosure provides methods for making any of the TGF-beta superfamily co-receptor polypeptides described herein as well as protein complexes comprising such a polypeptide. Such a method may include expressing any of the nucleic acids disclosed herein in a suitable cell (e.g., CHO cell or a COS cell). Such a method may comprise: a) culturing a cell under conditions suitable for expression of a TGF-beta superfamily co-receptor polypeptides described herein, wherein said cell is transformed with a co-receptor polypeptide expression construct; and b) recovering the co-receptor polypeptides so expressed. TGF-beta superfamily co-receptor polypeptides described herein, as well as protein complexes of the same, may be recovered as crude, partially purified, or highly purified fractions using any of the well-known techniques for obtaining protein from cell cultures.
Any of the protein complexes described herein may be incorporated into a pharmaceutical preparation. Optionally, such pharmaceutical preparations are at least 80%, 85%, 90%, 95%, 97%, 98% or 99% pure with respect to other polypeptide components. Optionally, pharmaceutical preparations disclosed herein may comprise one or more additional active agents.
The disclosure further provides methods for use of the protein complexes and pharmaceutical preparations described herein for the treatment or prevention of various TGF-beta associated conditions, including without limitation diseases and disorders associated with, for example, cancer, muscle, bone, fat, red blood cells, metabolism, fibrosis and other tissues that are affected by one or more ligands of the TGF-beta superfamily.
In part, the present disclosure relates to single-arm heteromultimer complexes comprising a ligand-binding domain of a TGFβ superfamily co-receptor polypeptide, methods of making such single-arm heteromultimer complexes, and uses thereof. As described herein, single-arm heteromultimer complexes may comprise a ligand-binding domain of a TGFβ superfamily co-receptor polypeptide selected from: endoglin, betaglycan, Cripto-1, Cryptic, Cryptic family protein 1B, Crim1, Crim2, BAMBI, BMPER, RGM-A, RGM-B, MuSK, and hemojuvelin. In some embodiments, heteromultimer complexes of the disclosure have an altered profile of binding to TGFβ superfamily ligands relative to a corresponding homomultimer complex.
The TGF-β superfamily is comprised of over thirty secreted factors including TGF-betas, activins, nodals, bone morphogenetic proteins (BMPs), growth and differentiation factors (GDFs), and anti-Mullerian hormone (AMH). See, e.g., Weiss et al. (2013) Developmental Biology, 2(1): 47-63. Members of the superfamily, which are found in both vertebrates and invertebrates, are ubiquitously expressed in diverse tissues and function during the earliest stages of development throughout the lifetime of an animal. Indeed, TGF-β superfamily proteins are key mediators of stem cell self-renewal, gastrulation, differentiation, organ morphogenesis, and adult tissue homeostasis. Consistent with this ubiquitous activity, aberrant TGF-beta superfamily signaling is associated with a wide range of human pathologies including, for example, autoimmune disease, cardiovascular disease, fibrotic disease, and cancer.
Ligands of the TGF-beta superfamily share the same dimeric structure in which the central 3-1/2 turn helix of one monomer packs against the concave surface formed by the beta-strands of the other monomer. The majority of TGF-beta family members are further stabilized by an intermolecular disulfide bond. This disulfide bonds traverses through a ring formed by two other disulfide bonds generating what has been termed a ‘cysteine knot’ motif. See, e.g., Lin et al., (2006) Reproduction 132: 179-190 and Hinck (2012) FEBS Letters 586: 1860-1870.
TGF-beta superfamily signaling is mediated by heteromeric complexes of type I and type II serine/threonine kinase receptors, which phosphorylate and activate downstream SMAD proteins (e.g., SMAD proteins 1, 2, 3, 5, and 8) upon ligand stimulation. See, e.g., Massagud (2000) Nat. Rev. Mol. Cell Biol. 1:169-178. These type I and type II receptors are transmembrane proteins, composed of a ligand-binding extracellular domain with cysteine-rich region, a transmembrane domain, and a cytoplasmic domain with predicted serine/threonine kinase specificity. In general, type I receptors mediate intracellular signaling while the type II receptors are required for binding TGF-beta superfamily ligands. Type I and II receptors form a stable complex after ligand binding, resulting in phosphorylation of type I receptors by type II receptors.
The TGF-beta family can be divided into two phylogenetic branches based on the type I receptors they bind and the Smad proteins they activate. One is the more recently evolved branch, which includes, e.g., the TGF-betas, activins, GDF8, GDF9, GDF11, BMP3 and nodal. The other branch comprises the more distantly related proteins of the superfamily and includes, e.g., BMP2, BMP4, BMP5, BMP6, BMP7, BMP8a, BMP8b, BMP9, BMP10, GDF1, GDF5, GDF6, and GDF7. See, e.g. Hinck (2012) FEBS Letters 586:1860-1870.
TGF-beta isoforms are the founding members of the TGF-beta superfamily, of which there are 3 known isoforms in mammals designated as TGF-beta1, TGF-beta2 and TGF-beta3. Mature bioactive TGF-beta ligands function as homodimers and predominantly signal through the type I receptor ALK5 but have also been found to signal through ALK1 in endothelial cells. See, e.g., Goumans et al. (2003) Mol Cell 12(4): 817-828. TGF-beta1 is the most abundant and ubiquitously expressed isoform. TGF-beta1 is known to have an important role in wound healing, and mice expressing a constitutively active TGF-beta1 transgene develop fibrosis. See e.g., Clouthier et al., (1997) J Clin. Invest. 100(11): 2697-2713. TGF-beta1 is also involved in T cell activation and maintenance of T regulatory cells. See, e.g., Li et al., (2006) Immunity 25(3): 455-471. TGF-beta2 expression was first described in human glioblastoma cells and occurs in neurons and astroglial cells of the embryonic nervous system. TGF-beta2 is also known to suppress interleukin-2-dependent growth of T lymphocytes. TGF-beta3 was initially isolated from a human rhabdomyosarcoma cell line and since has been found in lung adenocarcinoma and kidney carcinoma cell lines. TGF-beta3 is known to be important for palate and lung morphogenesis. See, e.g., Kubiczkova et al., (2012) Journal of Translational Medicine 10:183.
Activins are members of the TGF-beta superfamily that were initially discovered as regulators of follicle-stimulating hormone secretion, but subsequently various reproductive and non-reproductive roles have been characterized. Principal activin forms A, B, and AB are homo/heterodimers of two closely related β subunits (βAβA, βBβB, and βAβB, respectively). The human genome also encodes an activin C and an activin E, which are primarily expressed in the liver, and heterodimeric forms containing βC or βE are also known. In the TGF-beta superfamily, activins are unique and multifunctional factors that can stimulate hormone production in ovarian and placental cells, support neuronal cell survival, influence cell-cycle progress positively or negatively depending on cell type, and induce mesodermal differentiation at least in amphibian embryos. See, e.g., DePaolo et al. (1991) Proc Soc Ep Biol Med. 198:500-512; Dyson et al. (1997) Curr Biol. 7:81-84; and Woodruff (1998) Biochem Pharmacol. 55:953-963. In several tissues, activin signaling is antagonized by its related heterodimer, inhibin. For example, in the regulation of follicle-stimulating hormone (FSH) secretion from the pituitary, activin promotes FSH synthesis and secretion, while inhibin reduces FSH synthesis and secretion. Other proteins that may regulate activin bioactivity and/or bind to activin include follistatin (FS), follistatin-related protein (FSRP, also known as FLRG or FSTL3), and α2-macroglobulin.
As described herein, agents that bind to “activin A” are agents that specifically bind to the βA subunit, whether in the context of an isolated βA subunit or as a dimeric complex (e.g., a βAβA homodimer or a βAβB heterodimer). In the case of a heterodimer complex (e.g., a βAβB heterodimer), agents that bind to “activin A” are specific for epitopes present within the βA subunit, but do not bind to epitopes present within the non-βA subunit of the complex (e.g., the βB subunit of the complex). Similarly, agents disclosed herein that antagonize (inhibit) “activin A” are agents that inhibit one or more activities as mediated by a βA subunit, whether in the context of an isolated DA subunit or as a dimeric complex (e.g., a βAβA homodimer or a βAβB heterodimer). In the case of βAβB heterodimers, agents that inhibit “activin A” are agents that specifically inhibit one or more activities of the βA subunit but do not inhibit the activity of the non-βA subunit of the complex (e.g., the βB subunit of the complex). This principle applies also to agents that bind to and/or inhibit “activin B”, “activin C”, and “activin E”. Agents disclosed herein that antagonize “activin AB” are agents that inhibit one or more activities as mediated by the βA subunit and one or more activities as mediated by the βB subunit.
The BMPs and GDFs together form a family of cysteine-knot cytokines sharing the characteristic fold of the TGF-beta superfamily. See, e.g., Rider et al. (2010) Biochem J., 429(1):1-12. This family includes, for example, BMP2, BMP4, BMP6, BMP7, BMP2a, BMP3, BMP3b (also known as GDF10), BMP4, BMP5, BMP6, BMP7, BMP8, BMP8a, BMP8b, BMP9 (also known as GDF2), BMP10, BMP11 (also known as GDF11), BMP12 (also known as GDF7), BMP13 (also known as GDF6), BMP14 (also known as GDF5), BMP15, GDF1, GDF3 (also known as VGR2), GDF8 (also known as myostatin), GDF9, GDF15, and decapentaplegic. Besides the ability to induce bone formation, which gave the BMPs their name, the BMP/GDFs display morphogenetic activities in the development of a wide range of tissues. BMP/GDF homo- and hetero-dimers interact with combinations of type I and type II receptor dimers to produce multiple possible signaling complexes, leading to the activation of one of two competing sets of SMAD transcription factors. BMP/GDFs have highly specific and localized functions. These are regulated in a number of ways, including the developmental restriction of BMP/GDF expression and through the secretion of several proteins that bind certain TGF-beta superfamily ligands with high affinity and thereby inhibit ligand activity. Curiously, some of these endogenous antagonists resemble TGF-beta superfamily ligands themselves.
Growth and differentiation factor-8 (GDF8) is also known as myostatin. GDF8 is a negative regulator of skeletal muscle mass and is highly expressed in developing and adult skeletal muscle. The GDF8 null mutation in transgenic mice is characterized by a marked hypertrophy and hyperplasia of skeletal muscle. See, e.g., McPherron et al., Nature (1997) 387:83-90. Similar increases in skeletal muscle mass are evident in naturally occurring mutations of GDF8 in cattle and, strikingly, in humans. See, e.g., Ashmore et al. (1974) Growth, 38:501-507; Swatland and Kieffer, J. Anim. Sci. (1994) 38:752-757; McPherron and Lee, Proc. Natl. Acad. Sci. USA (1997) 94:12457-12461; Kambadur et al., Genome Res. (1997) 7:910-915; and Schuelke et al. (2004) N Engl J Med, 350:2682-8. Studies have also shown that muscle wasting associated with HIV-infection in humans is accompanied by increases in GDF8 protein expression. See, e.g., Gonzalez-Cadavid et al., PNAS (1998) 95:14938-43. In addition, GDF8 can modulate the production of muscle-specific enzymes (e.g., creatine kinase) and modulate myoblast cell proliferation. See, e.g., International Patent Application Publication No. WO 00/43781). The GDF8 propeptide can noncovalently bind to the mature GDF8 domain dimer, inactivating its biological activity. See, e.g., Miyazono et al. (1988) J. Biol. Chem., 263: 6407-6415; Wakefield et al. (1988) J. Biol. Chem., 263; 7646-7654; and Brown et al. (1990) Growth Factors, 3: 35-43. Other proteins which bind to GDF8 or structurally related proteins and inhibit their biological activity include follistatin, and potentially, follistatin-related proteins. See, e.g., Gamer et al. (1999) Dev. Biol., 208: 222-232.
GDF11, also known as BMP11, is a secreted protein that is expressed in the tail bud, limb bud, maxillary and mandibular arches, and dorsal root ganglia during mouse development. See, e.g., McPherron et al. (1999) Nat. Genet., 22: 260-264; and Nakashima et al. (1999) Mech. Dev., 80: 185-189. GDF11 plays a unique role in patterning both mesodermal and neural tissues. See, e.g., Gamer et al. (1999) Dev Biol., 208:222-32. GDF11 was shown to be a negative regulator of chondrogenesis and myogenesis in developing chick limb. See, e.g., Gamer et al. (2001) Dev Biol., 229:407-20. The expression of GDF11 in muscle also suggests its role in regulating muscle growth in a similar way to GDF8. In addition, the expression of GDF11 in brain suggests that GDF11 may also possess activities that relate to the function of the nervous system. Interestingly, GDF11 was found to inhibit neurogenesis in the olfactory epithelium. See, e.g., Wu et al. (2003) Neuron., 37:197-207. Hence, GDF11 may have in vitro and in vivo applications in the treatment of diseases such as muscle diseases and neurodegenerative diseases (e.g., amyotrophic lateral sclerosis).
BMP7, also called osteogenic protein-1 (OP-1), is well known to induce cartilage and bone formation. In addition, BMP7 regulates a wide array of physiological processes. For example, BMP7 may be the osteoinductive factor responsible for the phenomenon of epithelial osteogenesis. It is also found that BMP7 plays a role in calcium regulation and bone homeostasis. Like activin, BMP7 binds to type II receptors, ActRIIA and ActRIIB. However, BMP7 and activin recruit distinct type I receptors into heteromeric receptor complexes. The major BMP7 type I receptor observed was ALK2, while activin bound exclusively to ALK4 (ActRIIB). BMP7 and activin elicited distinct biological responses and activated different SMAD pathways. See, e.g., Macias-Silva et al. (1998) J Biol Chem. 273:25628-36.
Anti-Mullerian hormone (AMH), also known as Mullerian-inhibiting substance (MIS), is a TGF-beta family glycoprotein. One AMH-associated type II receptor has been identified and is designated as AMHRII, or alternatively MISRII. AMH induces regression of the Mullerian ducts in the human male embryo. AMH is expressed in reproductive age women and does not fluctuate with cycle or pregnancy, but was found to gradually decrease as both oocyte quantity and quality decrease, suggesting AMH could serve as a biomarker for ovarian physiology. See e.g. Zec et al., (2011) Biochemia Medica 21(3): 219-30.
In certain aspects, the present invention relates to ENG polypeptides. The protein endoglin (ENG), also known as CD105 and encoded by ENG, is considered a co-receptor for the transforming growth factor-β (TGF-β) superfamily of ligands and is implicated in normal and pathological fibrosis and angiogenesis. Structurally, ENG is a homodimeric cell-surface glycoprotein. It belongs to the zona pellucida (ZP) family of proteins and consists of a short C-terminal cytoplasmic domain, a single hydrophobic transmembrane domain, and a long extracellular domain (ECD) (Gougos et al, 1990, J Biol Chem 265:8361-8364). As determined by electron microscopy, monomeric ENG ECD consists of two ZP regions and an orphan domain located at the N-terminus (Llorca et al, 2007, J Mol Biol 365:694-705).
ENG expression is low in quiescent vascular endothelium but upregulated in endothelial cells of healing wounds, developing embryos, inflammatory tissues, and solid tumors (Dallas et al, 2008, Clin Cancer Res 14:1931-1937). Mice homozygous for null ENG alleles die early in gestation due to defective vascular development (Li et al, 1999, Science 284:1534-1537), whereas heterozygous null ENG mice display angiogenic abnormalities as adults (Jerkic et al, 2006, Cardiovasc Res 69:845-854). In humans, ENG gene mutations have been identified as the cause of hereditary hemorrhagic telangiectasia (Osler-Rendu-Weber syndrome) type-1 (HHT-1), an autosomal dominant form of vascular dysplasia characterized by arteriovenous malformations resulting in direct flow (communication) from artery to vein (arteriovenous shunt) without an intervening capillary bed (McAllister et al, 1994, Nat Genet 8:345-351; Fernandez-L et al, 2006, Clin Med Res 4:66-78). Typical symptoms of patients with HHT include recurrent epistaxis, gastrointestinal hemorrhage, cutaneous and mucocutaneous telangiectases, and arteriovenous malformations in the pulmonary, cerebral, or hepatic vasculature.
As a co-receptor, ENG is thought to modulate responses of other receptors to TGF-β family ligands without direct mediation of ligand signaling by itself. Ligands in the TGF-β family typically signal by binding to a homodimeric type II receptor, which triggers recruitment and transphosphorylation of a homodimeric type I receptor, thereby leading to phosphorylation of Smad proteins responsible for transcriptional activation of specific genes (Massague, 2000, Nat Rev Mol Cell Biol 1:169-178). Based on ectopic cellular expression assays, it has been reported that ENG cannot bind ligands on its own and that its binding to TGF-β1, TGF-β3, activin A, bone morphogenetic protein-2 (BMP-2), and BMP-7 requires the presence of an appropriate type I and/or type II receptor (Barbara et al, 1999, J Biol Chem 274:584-594). Nevertheless, there is evidence that ENG expressed by a fibroblast cell line can bind TGF-β1 (St.-Jacques et al, 1994, Endocrinology 134:2645-2657), and recent results in COS cells indicate that transfected full-length ENG can bind BMP-9 in the absence of transfected type I or type II receptors (Scharpfenecker et al, 2007, J Cell Sci 120:964-972).
In addition to the foregoing, ENG can occur in a soluble form in vivo under certain conditions after proteolytic cleavage of the full-length membrane-bound protein (Hawinkels et al, 2010, Cancer Res 70:4141-4150). Elevated levels of soluble ENG have been observed in the circulation of patients with cancer and preeclampsia (Li et al, 2000, Int J Cancer 89:122-126; Calabro et al, 2003, J Cell Physiol 194:171-175; Venkatesha et al, 2006, Nat Med 12:642-649; Levine et al, 2006, N Engl J Med 355:992-1005). Although the role of endogenous soluble ENG is poorly understood, a protein corresponding to residues 26-437 of the ENG precursor (amino acids 26-437 of SEQ ID NO: 1) has been proposed to act as a scavenger or trap for TGF-β family ligands (Venkatesha et al, 2006, Nat Med 12:642-649; WO-2007/143023), of which only TGF-β1 and TGF-β3 have specifically been implicated.
In certain aspects, the present invention relates to betaglycan polypeptides. Betaglycan, also known as TGFβ receptor type III (TβRIII, TGFβRIII) and encoded by TGFBR3, is a single-pass transmembrane protein consisting of a large extracellular domain, transmembrane domain, and relatively short cytoplasmic domain (43 amino acids). It is thought that betaglycan is not directly involved in signal transduction since its cytoplasmic domain lacks an obvious signaling motif Consistent with a co-receptor role, the presence of betaglycan on the cell surface increases the binding of TGFβ isoforms to their type II receptor (TGFβRII) and increases ligand efficacy in biologic assays (Bilandzic et al., 2011, Mol Cell Endocrinol 339:180-189). This effect is most pronounced for TGFβ2, which binds weakly to TGFβRII in the absence of betaglycan (Lopez-Casillas et al., 1993, 1994). In addition, the extracellular domain of betaglycan is released from some cells in a soluble form whose physiologic role remains to be determined.
Betaglycan can alter signaling by superfamily ligands besides TGFβ. For example, inhibin is capable of binding ActRIIA or ActRIIB and functionally antagonizing activins by preventing recruitment of activin type I receptors. However, inhibin requires the presence of betaglycan for high potency inhibition of activin signaling (Lewis et al., 2000, Nature 404:411-414; Wiater et al., 2009, Mol Endocrinol 23:1033-1042). Betaglycan forms a stable complex with inhibin and activin type II receptors, thus reducing the availability of these receptors to transmit activin signaling (Lewis et al., 2000, Nature 404:411-414). In a similar manner, betaglycan enables inhibin to antagonize the binding of BMPs to ActRIIA, ActRIIB, or BMPRII, thereby inhibiting BMP signaling (Wiater et al., 2003, J Biol Chem 278:7934-7941).
In certain aspects, the present invention relates to EGF-CFC family polypeptides. Members of the epidermal growth factor-Cripto-1/FRL-1/Cryptic (EGF-CFC) family in humans include founder Cripto-1 (encoded by TDGF1) as well as Cryptic protein (encoded by CFC1) and Cryptic family protein 1B (encoded by CFC1B). EGF-CFC genes encode small extracellular proteins that contain a divergent EGF motif and a novel conserved cysteine-rich domain termed the CFC motif, with most sequence similarity occurring in the central EGF and CFC motifs (Shen et al., 2000, Trends Genet 16:303-309). Most EGF-CFC proteins have been shown or predicted to possess a glycosylphosphatidylinositol (GPI) anchor site at the C-terminus. However, soluble extracellular forms of these proteins also exist (see, e.g., Watanabe et al., 2007, J Biol Chem 282:31643-31655).
In certain aspects, the present invention relates to Cripto-1 polypeptides. Cripto-1, also known as Cripto orteratocarcinoma-derived growth factor (TDGF-1), regulates the activity of multiple TGFβ superfamily ligands that signal via the Smad2/3 pathway. Cripto-1 functions as an obligatory cell-surface co-receptor for a subset of ligands including Nodal, GDF1, and GDF3 (Gray et al., 2012, FEBS Lett 586:1836-1845). Cripto-1 acts as a co-receptor for Nodal by recruiting ALK4, leading to formation of an ActRIIB-ALK4-Cripto-Nodal complex for signaling (Rosa, 2002, Sci STKE 2002 (158):pe47; Yan et al., 2002, Mol Cell Biol 22:4439-4449; Blanchet et al., 2008, Sci Signal 1 (45):ra13). This co-receptor function plays essential roles in regulating stem cell differentiation and vertebrate embryogenesis and regulates normal tissue growth and remodeling in adult tissues. See, e.g., Guardiola et al. (2012) Proc Natl Acad Sci USA 109:E3231-E3240. Cripto-1 co-receptor function has also been linked to tumor growth since Nodal signaling plays a key role in promoting tumorigenicity. In addition to facilitating signaling by some ligands, Cripto-1 inhibits receptor activation by activin A, activin B, myostatin (GDF8), and TGFβ (Gray et al., 2003, Proc Natl Acad Sci USA 100:5193-5198; Gray et al., 2006, Mol Cell Biol 26:9268-9278; Guardiola et al., 2012, Proc Natl Acad Sci USA 109:E3231-E3240). It has been shown in a detailed analysis that Cripto-1 forms analogous receptor complexes with Nodal and activin and thereby functions as a noncompetitive activin antagonist (Kelber et al., 2008, J Biol Chem 283:4490-4500).
In certain aspects, the present invention relates to Cryptic and Cryptic family 1B polypeptides. On the basis of phenotypes in double null mutant mice, Cryptic and Cripto-1 have been found to serve partially redundant functions during early embryonic development, and most if not all Nodal activity in early mouse embryogenesis is thought to be dependent on these two EGF-CFC proteins (Chu et al., 2010, Dev Biol 342:63-73). A separate study of mice deficient only in Cryptic has revealed a role for this protein in correct establishment of left-right asymmetry during embryogenesis (Gaio et al., 1999, Curr Biol 9:1339-1342).
In certain aspects, the present invention relates to chordin-related polypeptides. Proteins in this family contain chordin-like cysteine-rich repeat (CRR) motifs of the von Willebrand C (VWC) type which are important for protein binding to superfamily ligands. Such CRRs have a conserved consensus sequence based on ten cysteines (CXnWX4CX2CXCX6CX4CX4-6CX9-11CCPXC) (Sasai et al., 1994, Cell 79:779-790; Garcia-Abreu et al., 2002, Gene 287:39-47). Examples of chordin-related proteins include BMPER, CRIM1, and CRIM2.
In certain aspects, the present invention relates to BMPER polypeptides. BMP-binding endothelial cell precursor-derived regulator (BMPER) is encoded by BMPER and is the human homolog of Drosophila Crossveinless-2 (CV-2). BMPER is a secreted protein containing five CCR motifs and is reported to be proteolytically cleaved to generate two fragments that are disulfide-linked (Moser et al., 2003, Mol Cell Biol 23:5664-5679; Binnerts et al., 2004, Biochem Biophys Res Commun 315:272-280). Mammalian BMPER was originally identified as an inhibitor of BMP signaling. However, subsequent investigation determined that BMPER can exert biphasic activity depending on concentration, enhancing BMP-mediated signaling at molar concentrations less than that of ligand but inhibiting such signaling at concentrations exceeding those of ligand (Kelley et al., 2009, J Cell Biol 184:597-609). BMPER is implicated in a wide range of BMP-mediated differentiation processes during embryonic development and also implicated as an important postnatal regulator of BMP-mediated vascular inflammation in mice (Pi et al., 2012, Arterioscler Thromb Vase Biol 32:2214-2222).
In certain aspects, the present invention relates to CRIM1 polypeptides. Cysteine-rich motor neuron 1 (CRIM1), also known as “cysteine-rich transmembrane BMP regulator 1”, is encoded by CRIM1. This type I transmembrane protein contains a signal sequence, an extracellular domain (905 amino acids), a transmembrane domain (21 amino acids), and an intracellular domain (76 amino acids). The extracellular domain can also be released from the cell as a soluble form, likely via cleavage of the full protein at the membrane (Wilkinson et al., 2003, J Biol Chem 278:34181-34188), and contains an N-terminal insulin-like growth factor-binding motif and six chordin-like CRR motifs of the VWC type. These CRRs mediate protein binding to superfamily ligands such as TGFβ isoforms, BMP4, and BMP7 (see, e.g., Wilkinson et al., 2003, J Biol Chem 278:34181-34188). CRIM1 inhibits BMP signaling in part by reducing the rate of processing and delivery of BMPs to the cell surface. Studies in transgenic mice expressing a dominant negative (truncated) CRIM1 isoform indicate the importance of CRIM1 for normal development of the eye, central nervous system, and kidney (Pennisi et al., 2007, Dev Dyn 236:502-511; Wilkinson et al., 2007, J Am Soc Nephrol 18:1697-1708).
In certain aspects, the present invention relates to CRIM2 polypeptides. CRIM2 is a secreted protein encoded by the human gene KCP (kielin/chordin-like protein 1), named in recognition of the protein's sequence similarity to Xenopus kielin and mouse chordin. The longest CRIM2 isoform, which is nearly 1500 amino acids in human, contains many CRR motifs of the VWC type. Unlike most inhibitory proteins containing CRR motifs, CRIM2 is a potent enhancer of BMP signaling and is able to increase the affinity of BMP7 for its type I receptor ALK3 and/or enhance the stability of this ligand-receptor complex in mice (Lin et al., 2005, Nat Med 11:387-393). Mice homozygous for a CRIM2 null allele are viable and fertile but are hypersensitive to developing renal interstitial fibrosis, a disease stimulated by TGFβ but inhibited by BMP7. In contrast to the enhancing effect on BMPs, CRIM2 inhibits both activin A-mediated and TGFβ 1-mediated signaling through the Smad2/3 pathway (Lin et al., 2006, Mol Cell Biol 26:4577-4585). These inhibitory effects of CRIM2 are mediated in a paracrine manner, suggesting that direct binding of CRIM2 to TGF 1 or activin A can block interactions of these ligands with prospective receptors. The ability to enhance BMP signaling while suppressing activation by TGFβ and activin indicates an important role for CRIM2 in modulating responses between these antifibrotic and profibrotic cytokines in the initiation and progression of renal interstitial fibrosis.
In certain aspects, the present invention relates to BAMBI polypeptides. The protein named “BMP and activin membrane-bound inhibitor” (BAMBI), also known as “non-metastatic gene A” (NMA), is encoded by BAMBI. BAMBI resembles a type I receptor from the TGFβ superfamily, with an extracellular domain (132 amino acids), a transmembrane domain, and a cytoplasmic domain. However, BAMBI lacks an intracellular kinase domain and has therefore been described as a pseudoreceptor (Onichtchouk et al., 1999, Nature 401:480-485). BAMBI competes with type I receptors to form stable complexes with type II receptors and thereby prevents the formation of active complexes of type I and type II receptors. Additionally, BAMBI cooperates with Smad7 to inhibit ligand-mediated signaling (Yan et al., 2009, J Biol Chem 284:30097-30104). Ligands inhibited by BAMBI include BMPs, activin, and TGFβ. During development, BAMBI is prominent in gastrulation, neurulation, and development of bones and teeth, and is often co-expressed with BMP family members (Onichtchouk et al., 1999, Nature 401:480-485; Knight et al., J Dent Res 80:1895; Paulsen et al., 2011, Proc Natl Acad Sci USA 108:10202-). In the adult, BAMBI modulates processes such as diabetic nephropathy, thrombus formation, response to cardiac overload, and TGFβ-mediated tumor invasiveness (Villar et al., 2013, Biochim Biophys Acta 1832:323-335; Salles-Crawley et al., 2014, Blood 123:2873-2881; Fan et al., 2015, Diabetes 64:2220-2233; Marwitz et al., 2016, Cancer Res 76:3785-3801).
In certain aspects, the present invention relates to repulsive guidance molecule (RGM) polypeptides. RGMs constitute a family of structurally related proteins that have been proposed to act as co-receptors for BMP signaling and also interact with an unrelated transmembrane protein known as neogenin. The three mammalian proteins, RGM-A, RGM-B, and RGM-C, are approximately 50-60% identical in primary amino acid sequence and share structural features such as a proteolytic cleavage site and GPI anchor but undergo distinct biosynthetic and processing steps. Each RGM exhibits a distinct tissue-specific pattern of gene expression (Oldecamp et al., 2004, Gene Expr Patterns 4:283-288) and is thought to serve distinct biologic functions (see below). Soluble RGM proteins, which could form by shedding (Lin et al., 2008, Blood Cells Mol Dis 40:122-131; Tassew et al., 2012, Dev Cell 22:391-402), have been shown to inhibit BMP activity (Lin et al., 2005, Blood 106:2884-2889). A recent structural study reveals that the N-terminal domains of RGMs mimic a key BMP-binding motif of type I superfamily receptors, which could enable membrane-anchored RGMs to compete with type I receptors for BMP binding in a pH-dependent manner and yet eventually enhance BMP signaling from within an endosomal compartment (Healey et al., 2015, Nat Struct Mol Biol 22:458-465; Mueller, 2015, Nat Struct Mol Biol 22:439-440). As determined by surface plasmon resonance, the three RGM proteins exhibit differential binding kinetics for BMPs, which may contribute to their context-specific effects in vivo (Wu et al., 2012, PLOS One 7:e46307).
The protein RGM-A, encoded by RGMA, is expressed in the central nervous system during embryonic development in a largely non-overlapping manner with RGM-B. In the adult, RGM-A is expressed in brain as well as many other tissues, and it has been implicated in cancer, immune regulation, and as a sarcoplasmic protein regulating differentiation and size of skeletal muscle cells (Tian et al., 2013, Mol Reprod Dev 80:700-717; Martins et al., 2014, Cells Tissues Organs 200:326-338). Studies of RGM-A in several cell types in vitro suggest that it increases BMP signaling by facilitating use of ActRIIA by endogenous BMP2 and BMP4 ligands that otherwise prefer signaling through BMPRII (Xia et al., 2007, J Biol Chem 282:18129-18140).
RGM-B, also known as DRAGON and encoded by RGMB. Like RGM-A, RGM-B is expressed in brain as well as many other tissues of the adult. RGM-B knockout mice die several weeks after birth for undetermined reasons (Xia et al., 2011, J Immunol 186:1369-1376). RGM-B binds BMP2 and BMP4 but not BMP7, activin A, or TGFβ isoforms, as determined by surface plasmon resonance, and interacts directly with type I receptors (ALK2, ALK3, and ALK6) and type II receptors (ActRIIA and ActRIIB), as determined by co-immunoprecipitation and blockade with dominant negative receptors (Samad et al., 2005, J Biol Chem 280:14122-14129). The ability of RGM-B to increase BMP signaling requires membrane association through its C-terminal GPI anchor.
The protein RGM-C, also known as hemojuvelin (HJV) and encoded by HFE2, is associated with juvenile hemochromatosis, a rare recessive disease characterized by early-onset systemic iron overload with severe clinical complications. Hemojuvelin is now known to be an essential factor in the regulation of hepcidin, a master regulator of iron homeostasis (Niederkofler et al., 2005, J Clin Invest 115:2180-2186). Hemojuvelin is expressed primarily in liver, consistent with the predominant site of hepcidin regulation, and also in heart and skeletal muscle, where the role of hemojuvelin is unclear. Multiple studies have demonstrated that hemojuvelin regulates hepcidin expression in the liver by altering BMP signaling. Unlike RGM-A and RGM-B, hemojuvelin binds with high affinity to BMP6, a key ligand regulating hepcidin expression (Andriopoulos et al., 2009, Nat Genet 41:482-487), in addition to binding BMP2 and BMP4. On the basis of siRNA knockdown experiments in cell lines and hepatic expression of superfamily proteins, it has been suggested that hemojuvelin promotes endogenous signaling of BMP2, BMP4, and BMP6 through ALK2 or ALK3 and ActRIIA (Xia et al., 2008, Blood 111:5195-5204).
In certain aspects, the present invention relates to MuSK polypeptides. Muscle-associated receptor tyrosine kinase (MuSK), also known as muscle-specific kinase, CMS9, or FADS, is encoded by MUSK. MuSK is a single-pass transmembrane protein originally identified as a receptor tyrosine kinase expressed prominently in embryonic skeletal muscle and at the mature neuromuscular junction (Valenzuela et al., 1995, Neuron 15:573-584). These investigators showed that MuSK expression is induced dramatically throughout the adult myofiber after denervation, blockade of electrical activity, or physical immobilization. Subsequent studies indicate that MuSK is activated by proteins structurally unrelated to the TGFβ superfamily in a complex temporal-spatial manner to promote and maintain clustering of acetylcholine receptors on the postsynaptic side of the neuromuscular junction and to induce differentiation of the presynaptic nerve terminal (Hubbard et al., 2013, Biochim Biophys Acta 1834:2166-2169). Surprisingly, recent studies have revealed that MuSK also serves as a BMP co-receptor which is capable of binding BMPs and type I receptors (ALK3, ALK6) and stimulating BMP signaling by a mechanism independent of MuSK tyrosine kinase activity (Yilmaz et al., 2016, Sci Signal 9:ra87).
The terms used in this specification generally have their ordinary meanings in the art, within the context of this disclosure and in the specific context where each term is used. Certain terms are discussed below or elsewhere in the specification to provide additional guidance to the practitioner in describing the compositions and methods of the disclosure and how to make and use them. The scope or meaning of any use of a term will be apparent from the specific context in which it is used.
The terms “heteromultimer complex”, “heteromer”, or “heteromultimer” is a complex comprising at least a first polypeptide and a second polypeptide, wherein the second polypeptide differs in amino acid sequence from the first polypeptide by at least one amino acid residue. The heteromer can comprise a “heterodimer” formed by the first and second polypeptide or can form higher order structures where polypeptides in addition to the first and second polypeptide are present. Exemplary structures for the heteromultimer include heterodimers, heterotrimers, heterotetramers and further oligomeric structures. Heterodimers are designated herein as X:Y or equivalently as X-Y, where X represents a first polypeptide and Y represents a second polypeptide. Higher-order heteromers and oligomeric structures are designated herein in a corresponding manner. In certain embodiments a heteromultimer is recombinant (e.g., one or more polypeptide components may be a recombinant protein), isolated and/or purified.
“Homologous,” in all its grammatical forms and spelling variations, refers to the relationship between two proteins that possess a “common evolutionary origin,” including proteins from superfamilies in the same species of organism, as well as homologous proteins from different species of organism. Such proteins (and their encoding nucleic acids) have sequence homology, as reflected by their sequence similarity, whether in terms of percent identity or by the presence of specific residues or motifs and conserved positions. However, in common usage and in the instant application, the term “homologous,” when modified with an adverb such as “highly,” may refer to sequence similarity and may or may not relate to a common evolutionary origin.
The term “sequence similarity,” in all its grammatical forms, refers to the degree of identity or correspondence between nucleic acid or amino acid sequences that may or may not share a common evolutionary origin.
“Percent (%) sequence identity” with respect to a reference polypeptide (or nucleotide) sequence is defined as the percentage of amino acid residues (or nucleic acids) in a candidate sequence that are identical to the amino acid residues (or nucleic acids) in the reference polypeptide (nucleotide) sequence, after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent sequence identity, and not considering any conservative substitutions as part of the sequence identity. Alignment for purposes of determining percent amino acid sequence identity can be achieved in various ways that are within the skill in the art, for instance, using publicly available computer software such as BLAST, BLAST-2, ALIGN or Megalign (DNASTAR) software. Those skilled in the art can determine appropriate parameters for aligning sequences, including any algorithms needed to achieve maximal alignment over the full length of the sequences being compared. For purposes herein, however, % amino acid (nucleic acid) sequence identity values are generated using the sequence comparison computer program ALIGN-2. The ALIGN-2 sequence comparison computer program was authored by Genentech, Inc., and the source code has been filed with user documentation in the U.S. Copyright Office, Washington D.C., 20559, where it is registered under U.S. Copyright Registration No. TXU510087. The ALIGN-2 program is publicly available from Genentech, Inc., South San Francisco, Calif., or may be compiled from the source code. The ALIGN-2 program should be compiled for use on a UNIX operating system, including digital UNIX V4.0D. All sequence comparison parameters are set by the ALIGN-2 program and do not vary.
“Agonize”, in all its grammatical forms, refers to the process of activating a protein and/or gene (e.g., by activating or amplifying that protein's gene expression or by inducing an inactive protein to enter an active state) or increasing a protein's and/or gene's activity.
“Antagonize”, in all its grammatical forms, refers to the process of inhibiting a protein and/or gene (e.g., by inhibiting or decreasing that protein's gene expression or by inducing an active protein to enter an inactive state) or decreasing a protein's and/or gene's activity.
The terms “about” and “approximately” as used in connection with a numerical value throughout the specification and the claims denotes an interval of accuracy, familiar and acceptable to a person skilled in the art. In general, such interval of accuracy is ±10%, Alternatively, and particularly in biological systems, the terms “about” and “approximately” may mean values that are within an order of magnitude, preferably ≤5-fold and more preferably ≤2-fold of a given value.
Numeric ranges disclosed herein are inclusive of the numbers defining the ranges.
The terms “a” and “an” include plural referents unless the context in which the term is used clearly dictates otherwise. The terms “a” (or “an”), as well as the terms “one or more,” and “at least one” can be used interchangeably herein. Furthermore, “and/or” where used herein is to be taken as specific disclosure of each of the two or more specified features or components with or without the other. Thus, the term “and/or” as used in a phrase such as “A and/or B” herein is intended to include “A and B,” “A or B,” “A” (alone), and “B” (alone). Likewise, the term “and/or” as used in a phrase such as “A, B, and/or C” is intended to encompass each of the following aspects: A, B, and C; A, B, or C; A or C; A or B; B or C; A and C; A and B; B and C; A (alone); B (alone); and C (alone).
In part, the disclosure provides recombinant TGF-beta superfamily heteromultimers (heteromultimers) comprising at least one TGF-beta superfamily co-receptor polypeptide, including fragments and variants thereof. In some embodiments, the disclosure relates to a recombinant heteromultimer comprising a TGF-beta superfamily co-receptor polypeptide selected from the group consisting of: endoglin, betaglycan, Cripto-1, Cryptic, Cryptic family protein 1B, Crim1, Crim2, BAMBI, BMPER, RGM-A, RGM-B, hemojuvelin, and MuSK including fragments and variants thereof. Preferably, TGF-beta superfamily co-receptor polypeptides as described herein comprise a ligand-binding domain of the receptor. In some preferred embodiments, polypeptides and heteromultimers of the disclosure are soluble. In certain preferred embodiments, heteromultimers of the disclosure bind to one or more TGF-beta superfamily ligands (e.g., BMP2, BMP2/7, BMP3, BMP4, BMP4/7, BMP5, BMP6, BMP7, BMP8a, BMP8b, BMP9, BMP10, GDF3, GDF5, GDF6/BMP13, GDF7, GDF8, GDF9b/BMP15, GDF11/BMP11, GDF15/MIC1, TGF-β1, TGF-β2, TGF-β3, activin A, activin B, activin C, activin E, activin AB, activin AC, activin AE, activin BC, activin BE, nodal, glial cell-derived neurotrophic factor (GDNF), neurturin, artemin, persephin, Müllerian-inhibiting substance (MIS), and Lefty). In some embodiments, a heteromultimer may bind to one or more TGF-beta superfamily ligands with a KD of at least 1×10−7 M (e.g., KD of greater than or equal to 10−7, 10−8, 10−9, 10−10, 10−11, or 10−12). In some embodiments, a heteromultimer of the disclosure has a different TGF-beta superfamily ligand binding and/or inhibition profile (specificity) compared to a corresponding homomultimer. In some embodiments, a heteromultimer of the disclosure may inhibit one or more TGF-beta superfamily ligands (e.g., BMP2, BMP2/7, BMP3, BMP4, BMP4/7, BMP5, BMP6, BMP7, BMP8a, BMP8b, BMP9, BMP10, GDF3, GDF5, GDF6/BMP13, GDF7, GDF8, GDF9b/BMP15, GDF11/BMP11, GDF15/MIC1, TGF-β1, TGF-β2, TGF-β3, activin A, activin B, activin C, activin E, activin AB, activin AC, activin AE, activin BC, activin BE, nodal, glial cell-derived neurotrophic factor (GDNF), neurturin, artemin, persephin, Millerian-inhibiting substance (MIS), and Lefty). In some embodiments, a heteromultimer of the disclosure may inhibit signaling of one or more TGF-beta superfamily ligands. For example, in some embodiments, a heteromultimer of the disclosure may inhibit signaling of one or more TGF-beta superfamily ligands in a cell-based assay (e.g., cell-based signaling assays as described herein). In some embodiments, heteromultimers of the disclosure are heterodimers.
The term “endoglin polypeptide” includes polypeptides comprising any naturally occurring endoglin protein (encoded by ENG or one of its nonhuman orthologs) as well as any variants thereof (including mutants, fragments, fusions, and peptidomimetic forms) that retain a useful activity.
The human endoglin isoform 1 precursor protein sequence (NCBI Ref Seq NP_001108225.1) is as follows:
MDRGTLPLAV ALLLASCSLS PTSLA
ETVHC DLQPVGPERG EVTYTTSQVS KGCVAQAPNA
ILEVHVLFLE FPTGPSQLEL TLQASKQNGT WPREVLLVLS VNSSVFLHLQ ALGIPLHLAY
NSSLVTFQEP PGVNTTELPS FPKTQILEWA AERGPITSAA ELNDPQSILL RLGQAQGSLS
FCMLEASQDM GRTLEWRPRT PALVRGCHLE GVAGHKEAHI LRVLPGHSAG PRTVTVKVEL
SCAPGDLDAV LILQGPPYVS WLIDANHNMQ IWTTGEYSFK IFPEKNIRGF KLPDTPQGLL
GEARMLNASI VASFVELPLA SIVSLHASSC GGRLQTSPAP IQTTPPKDTC SPELLMSLIQ
TKCADDAMTL VLKKELVAHL KCTITGLTFW DPSCEAEDRG DKFVLRSAYS SCGMQVSASM
ISNEAVVNIL SSSSPQRKKV HCLNMDSLSF QLGLYLSPHF LQASNTIEPG QQSFVQVRVS
PSVSEFLLQL DSCHLDLGPE GGTVELIQGR AAKGNCVSLL SPSPEGDPRF SFLLHFYTVP
The signal peptide is indicated by single underline, the extracellular domain is indicated in bold font, and the transmembrane domain is indicated by dotted underline.
A processed extracellular endoglin polypeptide sequence (isoform 1) is as follows:
A nucleic acid sequence encoding unprocessed human ENG isoform 1 precursor protein is shown below (SEQ ID NO: 3), corresponding to nucleotides 419-2392 of NCBI Reference Sequence NM_001114753.2. The signal sequence is underlined.
A nucleic acid sequence encoding a processed extracellular ENG isoform1 polypeptide is as follows (SEQ ID NO: 4):
The human endoglin isoform 2 precursor protein sequence (NCBI Ref Seq NP_000109.1) is as follows:
MDRGTLPLAV ALLLASCSLS PTSLA
ETVHC DLQPVGPERG EVTYTTSQVS KGCVAQAPNA
ILEVHVLFLE FPTGPSQLEL TLQASKQNGT WPREVLLVLS VNSSVFLHLQ ALGIPLHLAY
NSSLVTFQEP PGVNTTELPS FPKTQILEWA AERGPITSAA ELNDPQSILL RLGQAQGSLS
FCMLEASQDM GRTLEWRPRT PALVRGCHLE GVAGHKEAHI LRVLPGHSAG PRTVTVKVEL
SCAPGDLDAV LILQGPPYVS WLIDANHNMQ IWTTGEYSFK IFPEKNIRGF KLPDTPQGLL
GEARMLNASI VASFVELPLA SIVSLHASSC GGRLQTSPAP IQTTPPKDTC SPELLMSLIQ
TKCADDAMTL VLKKELVAHL KCTITGLTFW DPSCEAEDRG DKFVLRSAYS SCGMQVSASM
ISNEAVVNIL SSSSPQRKKV HCLNMDSLSF QLGLYLSPHF LQASNTIEPG QQSFVQVRVS
PSVSEFLLQL DSCHLDLGPE GGTVELIQGR AAKGNCVSLL SPSPEGDPRF SFLLHFYTVP
The signal peptide is indicated by single underline, the extracellular domain is indicated in bold font, and the transmembrane domain is indicated by doedunderline. The endoglin isoform 2 has a shortened and distinct intracellular domain compared to endoglin isoform 1 and an unchanged extracellular domain compared to endoglin isoform 1.
A processed extracellular endoglin polypeptide sequence (isoform 2) is as follows:
A nucleic acid sequence encoding unprocessed human ENG isoform 2 precursor protein is shown below (SEQ ID NO: 7), corresponding to nucleotides 419-2293 of NCBI Reference Sequence NM_000118.3. The signal sequence is underlined.
ATGGACCGCGGCACGCTCCCTCTGGCTGTTGCCCTGCTGCTGGCCAGC
TGCAGCCTCAGCCCCACAAGTCTTGCAGAAACAGTCCATTGTGACCTT
A nucleic acid sequence encoding a processed extracellular ENG isoform 2 polypeptide is as follows (SEQ ID NO: 8):
An alternative processed extracellular endoglin polypeptide sequence (from either isoform 1 or isoform 2) is as follows:
A nucleic acid sequence encoding this alternative processed extracellular ENG polypeptide is as follows (SEQ ID NO: 94):
The human endoglin isoform 3 protein sequence (NCBI Ref Seq NP_001265067.1) is as follows:
MLEASQDMGR TLEWRPRTPA LVRGCHLEGV AGHKEAHILR VLPGHSAGPR TVTVKVELSC
APGDLDAVLI LQGPPYVSWL IDANHNMQIW TTGEYSFKIF PEKNIRGFKL PDTPQGLLGE
ARMLNASIVA SFVELPLASI VSLHASSCGG RLQTSPAPIQ TTPPKDTCSP ELLMSLIQTK
CADDAMTLVL KKELVAHLKC TITGLTFWDP SCEAEDRGDK FVLRSAYSSC GMQVSASMIS
NEAVVNILSS SSPQRKKVHC LNMDSLSFQL GLYLSPHFLQ ASNTIEPGQQ SFVQVRVSPS
VSEFLLQLDS CHLDLGPEGG TVELIQGRAA KGNCVSLLSP SPEGDPRFSF LLHFYTVPIP
The extracellular domain is indicated in bold font, and the transmembrane domain is indicated by dotted underline. The endoglin isoform 3 has a distinct 5′ untranslated region, lacks a portion of the 5′ coding region, and uses a downstream start codon compared to endoglin isoform 1.
A processed extracellular endoglin polypeptide sequence (isoform 3) is as follows:
A nucleic acid sequence encoding human ENG isoform 3 protein is shown below (SEQ ID NO: 11), corresponding to nucleotides 705-2132 of NCBI Reference Sequence NM_001278138.1. The transmembrane region is indicated by dotted underline.
A nucleic acid sequence encoding a processed extracellular ENG isoform 3 polypeptide is as follows (SEQ ID NO: 12):
In certain embodiments, the disclosure relates to heteromultimers that comprise at least one endoglin polypeptide, which includes fragments, functional variants, and modified forms thereof. Preferably, endoglin polypeptides for use in accordance with the disclosure (e.g., heteromultimers comprising an endoglin polypeptide and uses thereof) are soluble (e.g., an extracellular domain of endoglin). In other preferred embodiments, endoglin polypeptides for use in accordance with the disclosure bind to and/or inhibit (antagonize) activity (e.g., Smad signaling) of one or more TGF-beta superfamily ligands. In some embodiments, heteromultimers of the disclosure comprise at least one endoglin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NOs: 1, 2, 5, 6, 9, 10, or 93. In some embodiments, heteromultimers of the disclosure comprise at least one endoglin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 26-30 (e.g., amino acid residues 26, 27, 28, 29, or 30) of SEQ ID NO: 1, and ends at any one of amino acids 330-346 (e.g., amino acid residues 330, 331, 332, 333, 334, 335, 336, 337, 338, 339, 340, 341, 342, 343, 344, 345, or 346) of SEQ ID NO: 1. In some embodiments, heteromultimers of the disclosure comprise at least one endoglin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 26-346 of SEQ ID NO: 1. In some embodiments, heteromultimers of the disclosure comprise at least one endoglin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 30-330 of SEQ ID NO: 1. In some embodiments, heteromultimers of the disclosure comprise at least one endoglin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 26-330 of SEQ ID NO: 1. In some embodiments, heteromultimers of the disclosure comprise at least one endoglin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 30-346 of SEQ ID NO: 1. In some embodiments, heteromultimers of the disclosure comprise at least one endoglin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 26-30 (e.g., amino acid residues 26, 27, 28, 29, or 30) of SEQ ID NO: 5, and ends at any one of amino acids 330-346 (e.g., amino acid residues 330, 331, 332, 333, 334, 335, 336, 337, 338, 339, 340, 341, 342, 343, 344, 345, or 346) of SEQ ID NO: 5. In some embodiments, heteromultimers of the disclosure comprise at least one endoglin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 26-346 of SEQ ID NO: 5. In some embodiments, heteromultimers of the disclosure comprise least one endoglin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 30-330 of SEQ ID NO: 5. In some embodiments, heteromultimers of the disclosure comprise at least one endoglin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 26-330 of SEQ ID NO: 5. In some embodiments, heteromultimers of the disclosure comprise least one endoglin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 30-346 of SEQ ID NO: 5. In some embodiments, heteromultimers of the disclosure comprise at least one endoglin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 1-25 (e.g., amino acid residues 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25) of SEQ ID NO: 9, and ends at any one of amino acids 148-164 (e.g., amino acid residues 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, or 164) of SEQ ID NO: 9. In some embodiments, heteromultimers of the disclosure comprise at least one endoglin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 1-164 of SEQ ID NO: 9. In some embodiments, heteromultimers of the disclosure comprise at least one endoglin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 25-148 of SEQ ID NO: 9. In some embodiments, heteromultimers of the disclosure comprise at least one endoglin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 1-148 of SEQ ID NO: 9. In some embodiments, heteromultimers of the disclosure comprise at least one endoglin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 25-164 of SEQ ID NO: 9. In some embodiments, heteromultimers of the disclosure comprise at least one endoglin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 26-30 (e.g., amino acid residues 26, 27, 28, 29, or 30) of SEQ ID NO: 1, and ends at any one of amino acids 582-586 (e.g., amino acid residues 582, 583, 584, 585, or 586) of SEQ ID NO: 1. In some embodiments, heteromultimers of the disclosure comprise at least one endoglin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 26-586 of SEQ ID NO: 501. In some embodiments, heteromultimers of the disclosure comprise at least one endoglin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 30-582 of SEQ ID NO: 1. In some embodiments, heteromultimers of the disclosure comprise at least one endoglin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 26-30 (e.g., amino acid residues 26, 27, 28, 29, or 30) of SEQ ID NO: 5, and ends at any one of amino acids 582-586 (e.g., amino acid residues 582, 583, 584, 585, or 586) of SEQ ID NO: 5. In some embodiments, heteromultimers of the disclosure comprise at least one endoglin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 26-586 of SEQ ID NO: 5. In some embodiments, heteromultimers of the disclosure comprise at least one endoglin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 30-582 of SEQ ID NO: 5. In some embodiments, heteromultimers of the disclosure comprise at least one endoglin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 1-25 (e.g., amino acid residues 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25) of SEQ ID NO: 9, and ends at any one of amino acids 400-404 (e.g., amino acid residues 400, 401, 402, or 403) of SEQ ID NO: 9. In some embodiments, heteromultimers of the disclosure comprise at least one endoglin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 1-404 of SEQ ID NO: 9. In some embodiments, heteromultimers of the disclosure comprise at least one endoglin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 25-400 of SEQ ID NO: 9.
The term “Cripto-1 polypeptide” includes polypeptides comprising any naturally occurring Cripto-1 protein (encoded by TDGF1 or one of its nonhuman orthologs) as well as any variants thereof (including mutants, fragments, fusions, and peptidomimetic forms) that retain a useful activity.
The human Cripto-1 isoform 1 precursor protein sequence (NCBI Ref Seq NP_003203.1) is as follows:
The signal peptide is indicated by single underline.
A processed Cripto-1 isoform 1 polypeptide sequence is as follows:
A nucleic acid sequence encoding unprocessed human Cripto-1 isoform 1 precursor protein is shown below (SEQ ID NO: 15), corresponding to nucleotides 385-948 of NCBI Reference Sequence NM_003212.3. The signal sequence is underlined.
ATGGACTGCAGGAAGATGGCCCGCTTCTCTTACAGTGTGATTTGGATCA
TGGCCATTTCTAAAGTCTTTGAACTGGGATTAGTTGCCGGGCTGGGCCA
A nucleic acid sequence encoding a processed Cripto-1 isoform 1 is shown below (SEQ ID NO: 16):
The human Cripto-1 isoform 2 protein sequence (NCBI Ref Seq NP_001167607.1) is as follows:
A mature Cripto-1 polypeptide sequence (isoform 2) is as follows:
A nucleic acid sequence encoding unprocessed human Cripto-1 isoform 2 precursor protein is shown below (SEQ ID NO: 19), corresponding to nucleotides 43-558 of NCBI Reference Sequence NM_001174136.1.
A nucleic acid sequence encoding a processed human Cripto-1 isoform 2 is shown below (SEQ ID NO: 20):
In certain embodiments, the disclosure relates to heteromultimers that comprise at least one Cripto-1 polypeptide, which includes fragments, functional variants, and modified forms thereof. Preferably, Cripto-1 polypeptides for use in accordance with the disclosure (e.g., heteromultimers comprising a Cripto-1 polypeptide and uses thereof) are soluble (e.g., an extracellular domain of Cripto-1). In other preferred embodiments, Cripto-1 polypeptides for use in accordance with the disclosure bind to and/or inhibit (antagonize) activity (e.g., Smad signaling) of one or more TGF-beta superfamily ligands. In some embodiments, heteromultimers of the disclosure comprise at least one Cripto-1 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NOs: 13, 14, 17, or 18. In some embodiments, heteromultimers of the disclosure comprise at least one Cripto-1 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 31-82 (e.g., amino acid residues 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, or 82) of SEQ ID NO: 13, and ends at any one of amino acids 172-188 (e.g., amino acid residues 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, or 188) of SEQ ID NO: 13. In some embodiments, heteromultimers of the disclosure comprise at least one Cripto-1 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 31-188 of SEQ ID NO: 13. In some embodiments, heteromultimers of the disclosure comprise at least one Cripto-1 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 63-172 of SEQ ID NO: 13. In some embodiments, heteromultimers of the disclosure comprise at least one Cripto-1 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 82-172 of SEQ ID NO: 13. In some embodiments, heteromultimers of the disclosure comprise at least one Cripto-1 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 82-188 of SEQ ID NO: 13. In some embodiments, heteromultimers of the disclosure comprise at least one Cripto-1 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 31-172 of SEQ ID NO: 13. In some embodiments, heteromultimers of the disclosure comprise at least one Cripto-1 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 63-188 of SEQ ID NO: 13. In some embodiments, heteromultimers of the disclosure comprise at least one Cripto-1 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 15-66 (e.g., amino acid residues 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, or 66) of SEQ ID NO: 17, and ends at any one of amino acids 156-172 (e.g., amino acid residues 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, or 172) of SEQ ID NO: 17. In some embodiments, heteromultimers of the disclosure comprise at least one Cripto-1 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 15-172 of SEQ ID NO: 17. In some embodiments, heteromultimers of the disclosure comprise at least one Cripto-1 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 47-172 of SEQ ID NO: 17. In some embodiments, heteromultimers of the disclosure comprise at least one Cripto-1 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 47-156 of SEQ ID NO: 17. In some embodiments, heteromultimers of the disclosure comprise at least one Cripto-1 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 66-165 of SEQ ID NO: 17. In some embodiments, heteromultimers of the disclosure comprise at least one Cripto-1 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 15-156 of SEQ ID NO: 17. In some embodiments, heteromultimers of the disclosure comprise at least one Cripto-1 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 66-172 of SEQ ID NO: 17. In some embodiments, heteromultimers of the disclosure comprise at least one Cripto-1 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 31-82 (e.g., amino acid residues 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, or 82) of SEQ ID NO: 13, and ends at any one of amino acids 181-188 (e.g., amino acid residues 181, 182, 183, 184, 185, 185, 187, or 188) of SEQ ID NO: 13. In some embodiments, heteromultimers of the disclosure comprise at least one Cripto-1 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 31-188 of SEQ ID NO: 13. In some embodiments, heteromultimers of the disclosure comprise at least one Cripto-1 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 82-181 of SEQ ID NO: 13. In some embodiments, heteromultimers of the disclosure comprise at least one Cripto-1 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 1-66 (e.g., amino acid residues 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, or 66) of SEQ ID NO: 17, and ends at any one of amino acids 165-172 (e.g., amino acid residues 165, 166, 167, 168, 169, 170, 171, or 172) of SEQ ID NO: 17. In some embodiments, heteromultimers of the disclosure comprise at least one Cripto-1 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 1-172 of SEQ ID NO: 17. In some embodiments, heteromultimers of the disclosure comprise at least one Cripto-1 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 66-165 of SEQ ID NO: 17. In some embodiments, heteromultimers of the disclosure comprise at least one Cripto-1 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 31-61 of SEQ ID NO: 13. In some embodiments, heteromultimers of the disclosure comprise at least one Cripto-1 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 63-161 of SEQ ID NO: 13. In some embodiments, heteromultimers of the disclosure comprise at least one Cripto-1 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 1-145 of SEQ ID NO: 17.
The term “Cryptic polypeptide” includes polypeptides comprising any naturally occurring Cryptic protein (encoded by CFC1 or one of its nonhuman orthologs) as well as any variants thereof (including mutants, fragments, fusions, and peptidomimetic forms) that retain a useful activity.
The human Cryptic isoform 1 precursor protein sequence (NCBI Ref Seq NP_115934.1) is as follows:
The signal peptide is indicated by single underline.
A processed Cryptic isoform 1 polypeptide sequence is as follows:
A nucleic acid sequence encoding unprocessed human Cryptic isoform 1 precursor protein is shown below (SEQ ID NO: 23), corresponding to nucleotides 289-957 of NCBI Reference Sequence NM_032545.3. The signal sequence is underlined.
ATGACCTGGAGGCACCATGTCAGGCTTCTGTTTACGGTCAGTTTGGCAT
TACAGATCATCAATTTGGGAAACAGCTATCAAAGAGAGAAACATAACGG
A nucleic acid sequence encoding a processed human Cryptic isoform 1 is shown below (SEQ ID NO: 24):
The human Cryptic isoform 2 precursor protein sequence (NCBI Ref Seq NP_001257349.1) is as follows:
The signal peptide is indicated by single underline.
A processed Cryptic isoform 2 polypeptide sequence is as follows:
A nucleic acid sequence encoding unprocessed human Cryptic isoform 2 precursor protein is shown below (SEQ ID NO: 27), corresponding to nucleotides 289-861 of NCBI Reference Sequence NM_001270420.1. The signal sequence is underlined.
ATGACCTGGAGGCACCATGTCAGGCTTCTGTTTACGGTCAGTTTGGCA
TTACAGATCATCAATTTGGGAAACAGCTATCAAAGAGAGAAACATAAC
A nucleic acid sequence encoding processed Cryptic isoform 2 is shown below (SEQ ID NO: 28):
The human Cryptic isoform 3 precursor protein sequence (NCBI Ref Seq NP_001257350.1) is as follows:
The signal peptide is indicated by single underline.
A processed Cryptic isoform 3 polypeptide sequence is as follows:
A nucleic acid sequence encoding unprocessed human Cryptic isoform 3 precursor protein is shown below (SEQ ID NO: 31), corresponding to nucleotides 289-732 of NCBI Reference Sequence NM_001270421.1. The signal sequence is underlined.
ATGACCTGGAGGCACCATGTCAGGCTTCTGTTTACGGTCAGTTTGGC
ATTACAGATCATCAATTTGGGAAACAGCTATCAAAGAGAGAAACATA
A nucleic acid sequence encoding a processed Cryptic isoform 3 is shown below (SEQ ID NO: 32):
In certain embodiments, the disclosure relates to heteromultimers that comprise at least one Cryptic polypeptide, which includes fragments, functional variants, and modified forms thereof. Preferably, Cryptic polypeptides for use in accordance with the disclosure (e.g., heteromultimers comprising a Cryptic polypeptide and uses thereof) are soluble (e.g., an extracellular domain of Cryptic). In other preferred embodiments, Cryptic polypeptides for use in accordance with the disclosure bind to and/or inhibit (antagonize) activity (e.g., Smad signaling) of one or more TGF-beta superfamily ligands. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NOs: 21, 22, 25, 26, 29, or 30. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 26-90 (e.g., amino acid residues 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, or 90) of SEQ ID NO: 21, and ends at any one of amino acids 157-223 (e.g., amino acid residues 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 126, 217, 218, 219, 220, 221, 222, or 223) of SEQ ID NO: 21. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 26-223 of SEQ ID NO: 21. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 26-157 of SEQ ID NO: 21. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 90-157 of SEQ ID NO: 21. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 26-169 of SEQ ID NO: 21. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 90-169 of SEQ ID NO: 21. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 90-223 of SEQ ID NO: 21. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 26-82 of SEQ ID NO: 21. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 26-30 (e.g., amino acid residues 26, 27, 28, 29, or 30) of SEQ ID NO: 25, and ends at any one of amino acids 82-191 (e.g., amino acid residues 82, 83, 84, 85, 86, 57, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, or 191) of SEQ ID NO: 25. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 26-82 of SEQ ID NO: 25. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 26-191 of SEQ ID NO: 25. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 30-82 of SEQ ID NO: 25. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 30-191 of SEQ ID NO: 25. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 26-30 (e.g., amino acid residues 26, 27, 28, 29, or 30) of SEQ ID NO: 29, and ends at any one of amino acids 82-148 (e.g., amino acid residues 82, 83, 84, 85, 86, 57, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, or 148) of SEQ ID NO: 29. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 26-148 of SEQ ID NO: 29. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 26-82 of SEQ ID NO: 29. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 30-148 of SEQ ID NO: 29. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 30-82 of SEQ ID NO: 29. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 26-90 (e.g., amino acid residues 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, or 90) of SEQ ID NO: 21, and ends at any one of amino acids 214-223 (e.g., amino acid residues 214, 215, 126, 217, 218, 219, 220, 221, 222, or 223) of SEQ ID NO: 21. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 26-223 of SEQ ID NO: 21. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 109-223 of SEQ ID NO: 21. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 26-108 (e.g., amino acid residues 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, or 108) of SEQ ID NO: 25, and ends at any one of amino acids 189-191 (e.g., amino acid residues 189, 190, or 191) of SEQ ID NO: 25. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 26-191 of SEQ ID NO: 25. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 108-189 of SEQ ID NO: 25. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 26-109 (e.g., amino acid residues 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108 or 109) of SEQ ID NO: 29, and ends at any one of amino acids 139-148 (e.g., amino acid residues 139, 140, 141, 142, 143, 144, 145, 146, 147, or 148) of SEQ ID NO: 29. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 26-148 of SEQ ID NO: 29. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 109-139 of SEQ ID NO: 29. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 26-94 of SEQ ID NO: 29.
The term “Cryptic family protein 1B polypeptide” includes polypeptides comprising any naturally occurring Cryptic family protein 1B protein (encoded by CFC1B or one of its nonhuman orthologs) as well as any variants thereof (including mutants, fragments, fusions, and peptidomimetic forms) that retain a useful activity.
The human Cryptic family protein 1B precursor protein sequence (NCBI Ref Seq NP_001072998.1) is as follows:
The signal peptide is indicated by single underline.
A processed Cryptic family protein 1B polypeptide sequence is as follows:
A nucleic acid sequence encoding unprocessed human Cryptic family protein 1B precursor protein is shown below (SEQ ID NO: 35), corresponding to nucleotides 392-1060 of NCBI Reference Sequence NM_001079530.1. The signal sequence is underlined.
ATGACCTGGAGGCACCATGTCAGGCTTCTGTTTACGGTCAGTTTGGC
ATTACAGATCATCAATTTGGGAAACAGCTATCAAAGAGAGAAACATA
A nucleic acid sequence encoding a processed Cryptic family protein 1B is shown below (SEQ ID NO: 36):
In certain embodiments, the disclosure relates to heteromultimers that comprise at least one Cryptic family protein 1B polypeptide, which includes fragments, functional variants, and modified forms thereof. Preferably, Cryptic family protein 1B polypeptides for use in accordance with the disclosure (e.g., heteromultimers comprising a Cryptic family protein 1B polypeptide and uses thereof) are soluble (e.g., an extracellular domain of Cryptic family protein 1B). In other preferred embodiments, Cryptic family protein 1B polypeptides for use in accordance with the disclosure bind to and/or inhibit (antagonize) activity (e.g., Smad signaling) of one or more TGF-beta superfamily ligands. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic family protein 1B polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NOs: 33 or 34. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic family protein 1B polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 26-30 (e.g., amino acid residues 26, 27, 28, 29, or 30) of SEQ ID NO: 33, and ends at any one of amino acids 82-223 (e.g., amino acid residues 82, 83, 84, 85, 86, 57, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 126, 217, 218, 219, 220, 221, 222, or 223) of SEQ ID NO: 33. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic family protein 1B polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 26-223 of SEQ ID NO: 33. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic family protein 1B polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 26-82 of SEQ ID NO: 33. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic family protein 1B polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 30-82 of SEQ ID NO: 33. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic family protein 1B polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 30-223 of SEQ ID NO: 33. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic family protein 1B polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 26-169 of SEQ ID NO: 33. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic family protein 1B polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 30-169 of SEQ ID NO: 33. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic family protein 1B polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 26-90 (e.g., amino acid residues 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, or 90) of SEQ ID NO: 33, and ends at any one of amino acids 214-223 (e.g., amino acid residues 214, 215, 126, 217, 218, 219, 220, 221, 222, or 223) of SEQ ID NO: 33. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic family protein 1B polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 26-223 of SEQ ID NO: 33. In some embodiments, heteromultimers of the disclosure comprise at least one Cryptic family protein 1B polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 90-214 of SEQ ID NO: 33.
The term “CRIM1 polypeptide” includes polypeptides comprising any naturally occurring polypeptide of a CRIM1 protein (encoded by CRIM1 or one of its nonhuman orthologs) as well as any variants thereof (including mutants, fragments, fusions, and peptidomimetic forms) that retain a useful activity.
The human CRIM1 precursor protein sequence (NCBI Ref Seq NP_057525.1) is as follows:
MYLVAGDRGL AGCGHLLVSL LGLLLLLARS GTRA
LVCLPC DESKCEEPRN CPGSIVQGVC
GCCYTCASQR NESCGGTFGI YGTCDRGLRC VIRPPLNGDS LTEYEAGVCE DENWTDDQLL
GFKPCNENLI AGCNIINGKC ECNTIRTCSN PFEFPSQDMC LSALKRIEEE KPDCSKARCE
VQFSPRCPED SVLIEGYAPP GECCPLPSRC VCNPAGCLRK VCQPGNLNIL VSKASGKPGE
CCDLYECKPV FGVDCRTVEC PPVQQTACPP DSYETQVRLT ADGCCTLPTR CECLSGLCGF
PVCEVGSTPR IVSRGDGTPG KCCDVFECVN DTKPACVFNN VEYYDGDMFR MDNCRFCRCQ
GGVAICFTAQ CGEINCERYY VPEGECCPVC EDPVYPFNNP AGCYANGLIL AHGDRWREDD
CTFCQCVNGE RHCVATVCGQ TCTNPVKVPG ECCPVCEEPT IITVDPPACG ELSNCTLTGK
DCINGFKRDH NGCRTCQCIN TEELCSERKQ GCTLNCPFGF LTDAQNCEIC ECRPRPKKCR
PIICDKYCPL GLLKNKHGCD ICRCKKCPEL SCSKICPLGF QQDSHGCLIC KCREASASAG
PPILSGTCLT VDGHHHKNEE SWHDGCRECY CLNGREMCAL ITCPVPACGN PTIHPGQCCP
SCADDFVVQK PELSTPSICH APGGEYFVEG ETWNIDSCTQ CTCHSGRVLC ETEVCPPLLC
QNPSRTQDSC CPQCTDQPFR PSLSRNNSVP NYCKNDEGDI FLAAESWKPD VCTSCICIDS
VISCFSESCP SVSCERPVLR KGQCCPYCIE DTIPKKVVCH FSGKAYADEE RWDLDSCTHC
YCLQGQTLCS TVSCPPLPCV EPINVEGSCC PMCPEMYVPE PTNIPIEKTN HRGEVDLEVP
The signal peptide is indicated by a single underline, the extracellular domain is indicated by bold, and the transmembrane domain is indicated by dotted underline.
A mature CRIM1 sequence is as follows:
A nucleic acid sequence encoding unprocessed human CRIM1 precursor protein is shown below (SEQ ID NO: 39), corresponding to nucleotides 67-3174 of NCBI Reference Sequence NM_016441.2. The signal sequence is indicated by solid underline and the transmembrane region by dotted underline.
ATGTACTTGGTGGCGGGGGACAGGGGGTTGGCCGGCTGCGGGCACCTCCTGGTCTCGCTGCTGGGGCTGCTGCTG
CTGCTGGCGCGCTCCGGCACCCGGGCGCTGGTCTGCCTGCCCTGTGACGAGTCCAAGTGCGAGGAGCCCAGGAAC
A nucleic acid sequence encoding processed extracellular human CRIM1 is shown below (SEQ ID NO: 40):
In certain embodiments, the disclosure relates to heteromultimers that comprise at least one CRIM1 polypeptide, which includes fragments, functional variants, and modified forms thereof. Preferably, CRIM1 polypeptides for use in accordance with the disclosure (e.g., heteromultimers comprising a CRIM1 polypeptide and uses thereof) are soluble (e.g., an extracellular domain of CRIM1). In other preferred embodiments, CRIM1 polypeptides for use in accordance with the disclosure bind to and/or inhibit (antagonize) activity (e.g., Smad signaling) of one or more TGF-beta superfamily ligands. In some embodiments, heteromultimers of the disclosure comprise at least one CRIM1 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NOs: 37 or 38. In some embodiments, heteromultimers of the disclosure comprise at least one CRIM1 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 35-37 (e.g., amino acid residues 35, 36, or 37) of SEQ ID NO: 37, and ends at any one of amino acids 873-939 (e.g., amino acid residues 873, 874, 875, 876, 877, 878, 879, 880, 881, 882, 883, 884, 885, 886, 887, 888, 889, 890, 891, 892, 893, 894, 895, 896, 897, 898, 899, 900, 901, 902, 903, 904, 905, 906, 907, 908, 909, 910, 911, 912, 913, 914, 915, 916, 917, 918, 919, 920, 921, 922, 923, 924, 925, 926, 927, 928, 929, 930, 931, 932, 933, 934, 935, 936, 937, 938, or 939) of SEQ ID NO: 37. In some embodiments, heteromultimers of the disclosure comprise at least one CRIM1 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 35-939 of SEQ ID NO: 37. In some embodiments, heteromultimers of the disclosure comprise at least one CRIM1 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 37-939 of SEQ ID NO: 37. In some embodiments, heteromultimers of the disclosure comprise at least one CRIM1 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 35-873 of SEQ ID NO: 37. In some embodiments, heteromultimers of the disclosure comprise at least one CRIM1 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 37-939 of SEQ ID NO: 37.
The term “CRIM2 polypeptide” includes polypeptides comprising any naturally occurring CRIM2 protein (encoded by KCP or one of its nonhuman orthologs) as well as any variants thereof (including mutants, fragments, fusions, and peptidomimetic forms) that retain a useful activity.
A human CRIM2 isoform 1 precursor protein sequence (NCBI Ref Seq NP_001129386.1) is as follows:
The signal peptide is indicated by single underline.
A processed CRIM2 isoform 1 polypeptide sequence is as follows:
A nucleic acid sequence encoding unprocessed human CRIM2 isoform 1 precursor protein is shown below (SEQ ID NO: 43), corresponding to nucleotides 44-4552 of NCBI Reference Sequence NM_001135914.1. The signal sequence is underlined.
ATGGCCGGGGTCGGGGCCGCTGCGCTGTCCCTTCTCCTGCACCTCGGGG
CCCTGGCGCTGGCCGCGGGCGCGGAAGGTGGGGCTGTCCCCAGGGAGCC
A nucleic acid sequence encoding a processed human CRIM2 isoform 1 is shown below (SEQ ID NO: 44):
A human CRIM2 isoform 2 precursor protein sequence (NCBI Ref Seq NP_955381.2) is as follows:
MAGVGAAALS LLLHLGALAL AAGAEGGAVP REPPGQQTTA HSSVLAGNSQ EQWHPLREWL
A processed CRIM2 isoform 2 sequence is as follows:
A nucleic acid sequence encoding an unprocessed human CRIM2 isoform 2 precursor protein is shown below (SEQ ID NO: 47), corresponding to nucleotides 44-2485 of NCBI Reference Sequence NM_199349.2. The signal sequence is underlined.
ATGGCCGGGGTCGGGGCCGCTGCGCTGTCCCTTCTCCTGCACCTCGGGG
CCCTGGCGCTGGCCGCGGGCGCGGAAGGTGGGGCTGTCCCCAGGGAGCC
A nucleic acid sequence encoding a processed CRIM2 isoform 2 is shown below (SEQ ID NO: 48):
In certain embodiments, the disclosure relates to heteromultimers that comprise at least one CRIM2 polypeptide, which includes fragments, functional variants, and modified forms thereof. Preferably, CRIM2 polypeptides for use in accordance with the disclosure (e.g., heteromultimers comprising a CRIM2 polypeptide and uses thereof) are soluble (e.g., an extracellular domain of CRIM2). In other preferred embodiments, CRIM2 polypeptides for use in accordance with the disclosure bind to and/or inhibit (antagonize) activity (e.g., Smad signaling) of one or more TGF-beta superfamily ligands. In some embodiments, heteromultimers of the disclosure at least one CRIM2 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NOs: 41, 42, 45, or 46. In some embodiments, heteromultimers of the disclosure comprise at least one CRIM2 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 26-138 (e.g., amino acid residues 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, and 138) of SEQ ID NO: 41, and ends at any one of amino acids 1298-1503 (e.g., amino acid residues 1298, 1299, 1300, 1301, 1302, 1303, 1304, 1305, 1306, 1307, 1308, 1309, 1310, 1311, 1312, 1313, 1314, 1315, 1316, 1317, 1318, 1319, 1320, 1321, 1322, 1323, 1324, 1325, 1326, 1327, 1328, 1329, 1330, 1331, 1332, 1333, 1334, 1335, 1335, 1336, 1337, 1338, 1339, 1340, 1341, 1342, 1343, 1344, 1345, 1346, 1347, 1348, 1349, 1350, 1351, 1352, 1353, 1354, 1355, 1356, 1357, 1358, 1359, 1360, 1361, 1362, 1363, 1364, 1365, 1366, 1367, 1368, 1369, 1370, 1371, 1372, 1373, 1374, 1375, 1376, 1377, 1378, 1379, 1380, 1381, 1382, 1383, 1384, 1385, 1386, 1387, 1388, 1389, 1390, 1391, 1392, 1393, 1394, 1395, 1396, 1397, 1398, 1399, 1400, 1401, 1402, 1403, 1404, 1405, 1406, 1407, 1408, 1409, 1410, 1411, 1412, 1413, 1414, 1415, 1416, 1417, 1418, 1419, 1420, 1421, 1422, 1423, 1424, 1425, 1426, 1427, 1428, 1429, 1430, 1431, 1432, 1433, 1434, 1435, 1435, 1436, 1437, 1438, 1439, 1440, 1441, 1442, 1443, 1444, 1445, 1446, 1447, 1448, 1349, 1450, 1451, 1452, 1453, 1454, 1455, 1456, 1457, 1458, 1459, 1460, 1461, 1462, 1463, 1464, 1465, 1466, 1467, 1468, 1469, 1470, 1471, 1472, 1473, 1474, 1475, 1476, 1477, 1478, 1479, 1480, 1481, 1482, 1483, 1484, 1485, 1486, 1487, 1488, 1489, 1490, 1491, 1492, 1493, 1494, 1495, 1496, 1497, 1498, 1499, 1500, 1501, 1502, or 1503) of SEQ ID NO: 41. In some embodiments, heteromultimers of the disclosure comprise at least one CRIM2 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 26-1298 of SEQ ID NO: 41. In some embodiments, heteromultimers of the disclosure comprise at least one CRIM2 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 26-1503 of SEQ ID NO: 41. In some embodiments, heteromultimers of the disclosure comprise at least one CRIM2 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 138-1298 of SEQ ID NO: 41. In some embodiments, heteromultimers of the disclosure comprise at least one CRIM2 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 138-1503 of SEQ ID NO: 41. In some embodiments, heteromultimers of the disclosure comprise at least one CRIM2 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 24-138 (e.g., amino acid residues 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, or 138) of SEQ ID NO: 45, and ends at any one of amino acids 539-814 (e.g., amino acid residues 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554, 555, 556, 557, 558, 559, 560, 561, 562, 563, 564, 565, 566, 567, 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 414, 615, 616, 617, 618, 619, 620, 621, 622, 623, 624, 625, 626, 627, 628, 629, 630, 631, 632, 633, 634, 635, 635, 636, 637, 638, 639, 640, 641, 642, 643, 644, 645, 646, 647, 648, 649, 650, 651, 652, 653, 654, 655, 656, 657, 658, 659, 660, 661, 662, 663, 664, 665, 666, 667, 668, 669, 670, 671, 672, 673, 674, 675, 676, 677, 678, 679, 680, 681, 682, 683, 684, 685, 686, 687, 688, 689, 690, 691, 692, 693, 694, 695, 696, 697, 698, 699, 700, 701, 702, 703, 704, 405, 706, 707, 708, 709, 710, 711, 712, 713, 714, 715, 716, 717, 718, 719, 720, 721, 722, 723, 724, 725, 726, 727, 728, 729, 730, 731, 732, 733, 734, 735, 735, 736, 737, 738, 739, 740, 741, 742, 743, 744, 745, 746, 747, 748, 749, 750, 751, 752, 753, 754, 755, 756, 757, 758, 759, 760, 761, 762, 763, 764, 765, 766, 767, 768, 769, 770, 771, 772, 773, 774, 775, 776, 777, 778, 779, 780, 781, 782, 783, 784, 785, 786, 787, 788, 789, 790, 791, 792, 793, 794, 795, 796, 797, 798, 799, 800, 801, 802, 803, 804, 805, 806, 807, 808, 809, 810, 811, 812, 813, or 814) of SEQ ID NO: 45. In some embodiments, heteromultimers of the disclosure comprise at least one CRIM2 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 24-539 of SEQ ID NO: 45. In some embodiments, heteromultimers of the disclosure comprise at least one CRIM2 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 24-814 of SEQ ID NO: 45. In some embodiments, heteromultimers of the disclosure comprise at least one CRIM2 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 138-539 of SEQ ID NO: 45. In some embodiments, heteromultimers of the disclosure comprise at least one CRIM2 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 138-814 of SEQ ID NO: 45. In some embodiments, heteromultimers of the disclosure comprise at least one CRIM2 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 27-87 (e.g., amino acid residues 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, and 87) of SEQ ID NO: 41, and ends at any one of amino acids 1478-1503 (e.g., amino acid residues 1479, 1480, 1481, 1482, 1483, 1484, 1485, 1486, 1487, 1488, 1489, 1490, 1491, 1492, 1493, 1494, 1495, 1496, 1497, 1498, 1499, 1500, 1501, 1502, or 1503) of SEQ ID NO: 41. In some embodiments, heteromultimers of the disclosure comprise at least one CRIM2 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 27-1503 of SEQ ID NO: 41. In some embodiments, heteromultimers of the disclosure comprise at least one CRIM2 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 87-1478 of SEQ ID NO: 41. In some embodiments, heteromultimers of the disclosure comprise at least one CRIM2 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 24-87 (e.g., amino acid residues 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, and 87) of SEQ ID NO: 45, and ends at any one of amino acids 804-814 (e.g., amino acid residues 804, 805, 806, 807, 808, 809, 810, 811, 812, 813, or 814) of SEQ ID NO: 45. In some embodiments, heteromultimers of the disclosure comprise at least one CRIM2 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 24-814 of SEQ ID NO: 45. In some embodiments, heteromultimers of the disclosure comprise at least one CRIM2 polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 87-804 of SEQ ID NO: 45.
The term “BAMBI polypeptide” includes polypeptides comprising any naturally occurring BAMBI protein (encoded by BAMBI or one of its nonhuman orthologs) as well as any variants thereof (including mutants, fragments, fusions, and peptidomimetic forms) that retain a useful activity.
The human BAMBI precursor protein sequence (NCBI Ref Seq NP_036474.1) is as follows:
MDRHSSYIFI WLQLELCAMA VLLTKGEIRC YCDAAHCVAT GYMCKSELSA CFSRLLDPQN
61
SNSPLTHGCL DSLASTTDIC QAKQARNHSG TTIPTLECCH EDMCNYRGLH DVLSPPRGEA
121
The signal peptide is indicated by single underline, the extracellular domain is indicated in bold font, and the transmembrane domain is indicated by dotted underline.
A processed BAMBI polypeptide sequence is as follows:
A nucleic acid sequence encoding unprocessed human BAMBI precursor protein is shown below (SEQ ID NO: 51), corresponding to nucleotides 404-1183 of NCBI Reference Sequence NM_012342.2. The signal sequence is indicated by solid underline and the transmembrane domain by dotted underline.
ATGGATCGCCACTCCAGCTACATCTTCATCTGGCTGCAGCTGGAGCTCTGCGCCATGGCCGTGCTGCTCACCAAA
A nucleic acid sequence encoding a processed extracellular BAMBI is shown below (SEQ ID NO: 52):
In certain embodiments, the disclosure relates to heteromultimers that comprise at least one BAMBI polypeptide, which includes fragments, functional variants, and modified forms thereof. Preferably, BAMBI polypeptides for use in accordance with the disclosure (e.g., heteromultimers comprising a BAMBI polypeptide and uses thereof) are soluble (e.g., an extracellular domain of BAMBI). In other preferred embodiments, BAMBI polypeptides for use in accordance with disclosure bind to and/or inhibit (antagonize) activity (e.g., Smad signaling) of one or more TGF-beta superfamily ligands. In some embodiments, heteromultimers of the disclosure comprise at least one BAMBI polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NOs: 49 or 50. In some embodiments, heteromultimers of the disclosure comprise at least one BAMBI polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 21-30 (e.g., amino acid residues 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30) of SEQ ID NO: 49, and ends at any one of amino acids 104-152 (e.g., amino acid residues 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, or 152) of SEQ ID NO: 49. In some embodiments, heteromultimers of the disclosure comprise at least one BAMBI polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 21-104 of SEQ ID NO: 49. In some embodiments, heteromultimers of the disclosure comprise at least one BAMBI polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 21-152 of SEQ ID NO: 49. In some embodiments, heteromultimers of the disclosure comprise at least one BAMBI polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 30-104 of SEQ ID NO: 49. In some embodiments, heteromultimers of the disclosure comprise at least one BAMBI polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 30-152 of SEQ ID NO: 49. In some embodiments, heteromultimers of the disclosure comprise at least one BAMBI polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 27-152 of SEQ ID NO: 49.
The term “BMPER polypeptide” includes polypeptides comprising any naturally occurring BMPER protein (encoded by BMPER or one of its nonhuman orthologs) as well as any variants thereof (including mutants, fragments, fusions, and peptidomimetic forms) that retain a useful activity.
A human BMPER precursor protein sequence (NCBI Ref Seq NP_597725.1) is as follows:
MLWFSGVGAL AERYCRRSPG ITCCVLLLLN CSGVPMSLAS SFLTGSVAKC ENEGEVLQIP
The signal peptide is indicated by a single underline.
A mature BMPER polypeptide sequence is as follows:
A nucleic acid sequence encoding unprocessed human BMPER precursor protein is shown below (SEQ ID NO: 55), corresponding to nucleotides 375-2429 of NCBI Reference Sequence NM_133468.4. The signal sequence is underlined.
ATGCTCTGGTTCTCCGGCGTCGGGGCTCTGGCTGAGCGTTACTGCCGCC
GCTCGCCTGGGATTACGTGCTGCGTCTTGCTGCTACTCAATTGCTCGGG
GGTCCCCATGTCTCTGGCTTCCTCCTTCTTGACAGGTTCTGTTGCAAAA
A nucleic acid sequence encoding a processed BMPER is shown below (SEQ ID NO: 56):
In certain embodiments, the disclosure relates to heteromultimers that comprise at least one BMPER polypeptide, which includes fragments, functional variants, and modified forms thereof. Preferably, BMPER polypeptides for use in accordance with the disclosure (e.g., heteromultimers comprising a BMPER polypeptide and uses thereof) are soluble (e.g., an extracellular domain of BMPER). In other preferred embodiments, BMPER polypeptides for use in accordance with the disclosure bind to and/or inhibit (antagonize) activity (e.g., Smad signaling) of one or more TGF-beta superfamily ligands. In some embodiments, heteromultimers of the disclosure comprise at least one BMPER polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NOs: 53 or 54. In some embodiments, heteromultimers of the disclosure comprise at least one BMPER polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 40-50 (e.g., amino acid residues 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50) of SEQ ID NO: 53, and ends at any one of amino acids 364-369 (e.g., amino acid residues 364, 365, 366, 367, 368, or 369) of SEQ ID NO: 53. In some embodiments, heteromultimers of the disclosure comprise at least one BMPER polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 370-386 (e.g., amino acid residues 370, 371, 372, 373, 374, 375, 376, 377, 378, 379, 380, 381, 382, 383, 284, 385, or 386) of SEQ ID NO: 53, and ends at any one of amino acids 682-685 (e.g., amino acid residues 682, 683, 684, or 685) of SEQ ID NO: 53. In some embodiments, heteromultimers of the disclosure comprise at least one BMPER polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 39-50 (e.g., amino acid residues 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50) of SEQ ID NO: 53, and ends at any one of amino acids 682-685 (e.g., amino acid residues 682, 683, 684, or 685) of SEQ ID NO: 53. In some embodiments, heteromultimers of the disclosure comprise at least one BMPER polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 39-364 of SEQ ID NO: 53. In some embodiments, heteromultimers of the disclosure comprise at least one BMPER polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 39-369 of SEQ ID NO: 53. In some embodiments, heteromultimers of the disclosure comprise at least one BMPER polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 39-682 of SEQ ID NO: 53. In some embodiments, heteromultimers of the disclosure comprise at least one BMPER polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 39-685 of SEQ ID NO: 53. In some embodiments, heteromultimers of the disclosure comprise at least one BMPER polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 50-364 of SEQ ID NO: 53. In some embodiments, heteromultimers of the disclosure comprise at least one BMPER polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 50-369 of SEQ ID NO: 53. In some embodiments, heteromultimers of the disclosure comprise at least one BMPER polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 50-682 of SEQ ID NO: 53. In some embodiments, heteromultimers of the disclosure comprise at least one BMPER polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 50-685 of SEQ ID NO: 53. In some embodiments, heteromultimers of the disclosure comprise at least one BMPER polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 370-682 of SEQ ID NO: 53. In some embodiments, heteromultimers of the disclosure comprise at least one BMPER polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 370-685 of SEQ ID NO: 53. In some embodiments, heteromultimers of the disclosure comprise at least one BMPER polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 386-682 of SEQ ID NO: 53. In some embodiments, heteromultimers of the disclosure comprise at least one BMPER polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 386-685 of SEQ ID NO: 53. In some embodiments, heteromultimers of the disclosure comprise at least a BMPER protein, wherein the BMPER protein is a dimer comprising a first polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 39-50 (e.g., amino acid residues 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50) of SEQ ID NO: 53, and ends at any one of amino acids 364-369 (e.g., amino acid residues 364, 365, 366, 367, 368, or 369) of SEQ ID NO: 53, and second polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 370-386 (e.g., amino acid residues 370, 371, 372, 373, 374, 375, 376, 377, 378, 379, 380, 381, 382, 383, 284, 385, or 386) of SEQ ID NO: 53, and ends at any one of amino acids 682-685 (e.g., amino acid residues 682, 683, 684, or 685) of SEQ ID NO: 53. In some embodiments, heteromultimers of the disclosure comprise at least one single chain ligand trap that comprises a first BMPER polypeptide domain that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 39-50 (e.g., amino acid residues 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50) of SEQ ID NO: 53, and ends at any one of amino acids 364-369 (e.g., amino acid residues 364, 365, 366, 367, 368, or 369) of SEQ ID NO: 53, and second BMPER polypeptide domain that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 370-386 (e.g., amino acid residues 370, 371, 372, 373, 374, 375, 376, 377, 378, 379, 380, 381, 382, 383, 284, 385, or 386) of SEQ ID NO: 53, and ends at any one of amino acids 682-685 (e.g., amino acid residues 682, 683, 684, or 685) of SEQ ID NO: 53.
The term “RGM-B polypeptide” includes polypeptides comprising any naturally occurring RGM-B protein (encoded by RGMB or one of its nonhuman orthologs) as well as any variants thereof (including mutants, fragments, fusions, and peptidomimetic forms) that retain a useful activity.
A human RGM-B precursor protein sequence (NCBI Ref Seq NP_001012779.2) is as follows:
MIRKKRKRSA PPGPCRSHGP RPATAPAPPP SPEPTRPAWT GMGLRAAPSS AAAAAAEVEQ
RRSPGLCPPP LELLLLLLFS LGLLHAGDCQ QPAQCRIQKC TTDFVSLTSH LNSAVDGFDS
The signal peptide is indicated by single underline.
A processed RGM-B polypeptide sequence is as follows:
A nucleic acid sequence encoding unprocessed human RGM-B precursor protein is shown below (SEQ ID NO: 59), corresponding to nucleotides 403-1836 of NCBI Reference Sequence NM_001012761.2. The signal sequence is underlined.
ATGATAAGGAAGAAGAGGAAGCGAAGCGCGCCCCCCGGCCCATGCCGCA
GCCACGGGCCCAGACCCGCCACGGCGCCCGCGCCGCCGCCCTCGCCGGA
GCCCACGAGACCTGCATGGACGGGCATGGGCTTGAGAGCAGCACCTTCC
AGCGCCGCCGCTGCCGCCGCCGAGGTTGAGCAGCGCCGCAGCCCCGGGC
TCTGCCCCCCGCCGCTGGAGCTGCTGCTGCTGCTGCTGTTCAGCCTCGG
GCTGCTCCACGCAGGTGACTGCCAACAGCCAGCCCAATGTCGAATCCAG
A nucleic acid sequence encoding a processed RGM-B is shown below (SEQ ID NO: 60):
In certain embodiments, the disclosure relates to heteromultimers that comprise at least one RGM-B polypeptide, which includes fragments, functional variants, and modified forms thereof. Preferably, RGM-B polypeptides for use in accordance with the disclosure (e.g., heteromultimers comprising a RGM-B polypeptide and uses thereof) are soluble (e.g., an extracellular domain of RGM-B). In other preferred embodiments, RGM-B polypeptides for use in accordance with the disclosure bind to and/or inhibit (antagonize) activity (e.g., Smad signaling) of one or more TGF-beta superfamily ligands. In some embodiments, heteromultimers of the disclosure comprise at least one RGM-B polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NOs: 57 or 58. In some embodiments, heteromultimers of the disclosure comprise at least one RGM-B polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 1-87 (e.g., amino acid residues 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, or 87) of SEQ ID NO: 57, and ends at any one of amino acids 452-478 (e.g., amino acid residues 452, 453, 454, 455, 456, 457, 458, 459, 460, 461, 462, 463, 464, 465, 466, 467, 468, 469, 470, 471, 472, 473, 474, 475, 476, 477, or 478) of SEQ ID NO: 57. In some embodiments, heteromultimers of the disclosure comprise at least one RGM-B polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 210-222 (e.g., amino acid residues 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, or 222) of SEQ ID NO: 57, and ends at any one of amino acids 413-452 (e.g., amino acid residues 413, 414, 415, 416, 417, 418, 419, 420, 421, 422, 423, 424, 425, 426, 427, 428, 429, 430, 431, 432, 433, 434, 435, 435, 436, 437, 438, 439, 440, 441, 442, 443, 444, 445, 446, 447, 448, 449, 450, 451, or 452) of SEQ ID NO: 57. In some embodiments, heteromultimers of the disclosure comprise at least one RGM-B polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 87-95 (e.g., amino acid residues 87, 88, 89, 90, 91, 92, 93, 94 or 95) of SEQ ID NO: 57, and ends at any one of amino acids 204-209 (e.g., amino acid residues 204, 205, 206, 207, 208, or 209) of SEQ ID NO: 57. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-B polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 1-452 of SEQ ID NO: 57. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-B polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 87-204 of SEQ ID NO: 57. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-B polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 87-209 of SEQ ID NO: 57. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-B polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 95-204 of SEQ ID NO: 57. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-B polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 95-209 of SEQ ID NO: 57. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-B polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 210-413 of SEQ ID NO: 57. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-B polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 210-452 of SEQ ID NO: 57. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-B polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 222-413 of SEQ ID NO: 57. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-B polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 222-452 of SEQ ID NO: 57. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-B polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 87-413 of SEQ ID NO: 57. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-B polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 87-452 of SEQ ID NO: 57. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-B polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 95-413 of SEQ ID NO: 57. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-B polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 95-452 of SEQ ID NO: 57. In some embodiments, heteromultimers of the disclosure comprise at least a RGM-B protein, wherein the RGM-B protein is a dimer comprising a first polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 87-95 (e.g., amino acid residues 87, 88, 89, 90, 91, 92, 93, 94 or 95) of SEQ ID NO: 57, and ends at any one of amino acids 204-209 (e.g., amino acid residues 204, 205, 206, 207, 208, or 209) of SEQ ID NO: 57, and second polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 210-222 (e.g., amino acid residues 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, or 222) of SEQ ID NO: 57, and ends at any one of amino acids 413-452 (e.g., amino acid residues 413, 414, 415, 416, 417, 418, 419, 420, 421, 422, 423, 424, 425, 426, 427, 428, 429, 430, 431, 432, 433, 434, 435, 435, 436, 437, 438, 439, 440, 441, 442, 443, 444, 445, 446, 447, 448, 449, 450, 451, or 452) of SEQ ID NO: 57. In some embodiments, heteromultimers of the disclosure comprise at least one single chain ligand trap that comprises a first RGM-B polypeptide domain that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 87-95 (e.g., amino acid residues 87, 88, 89, 90, 91, 92, 93, 94 or 95) of SEQ ID NO: 57, and ends at any one of amino acids 204-209 (e.g., amino acid residues 204, 205, 206, 207, 208, or 209) of SEQ ID NO: 57, and second RGM-B polypeptide domain that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 210-222 (e.g., amino acid residues 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, or 222) of SEQ ID NO: 57, and ends at any one of amino acids 413-452 (e.g., amino acid residues 413, 414, 415, 416, 417, 418, 419, 420, 421, 422, 423, 424, 425, 426, 427, 428, 429, 430, 431, 432, 433, 434, 435, 435, 436, 437, 438, 439, 440, 441, 442, 443, 444, 445, 446, 447, 448, 449, 450, 451, or 452) of SEQ ID NO: 57. In some embodiments, heteromultimers of the disclosure comprise at least one RGM-B polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 87-89 (e.g., amino acid residues 87, 88, or 89) of SEQ ID NO: 57, and ends at any one of amino acids 471-478 (e.g., amino acid residues 471, 472, 473, 474, 475, 476, 477, or 478) of SEQ ID NO: 57. In some embodiments, heteromultimers of the disclosure comprise at least one RGM-B polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids 87-478 of SEQ ID NO: 57. In some embodiments, heteromultimers of the disclosure comprise at least one RGM-B polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids 89-471 of SEQ ID NO: 57.
The term “RGM-A polypeptide” includes polypeptides comprising any naturally occurring RGM-A protein (encoded by RGMA or one of its nonhuman orthologs) as well as any variants thereof (including mutants, fragments, fusions, and peptidomimetic forms) that retain a useful activity.
A human RGM-A isoform 1 precursor protein sequence (NCBI Ref Seq NP_001159755.1) is as follows:
MGGLGPRRAG TSRERLVVTG RAGWMGMGRG AGRSALGFWP TLAFLLCSFP AATSPCKILK
The signal peptide is indicated by solid underline.
A processed RGM-A isoform 1 polypeptide sequence is as follows:
A nucleic acid sequence encoding unprocessed human RGM-A isoform 1 precursor protein is shown below (SEQ ID NO: 63), corresponding to nucleotides 232-1605 of NCBI Reference Sequence NM_001166283.1. The signal sequence is underlined.
ATGGGTGGCCTGGGGCCACGACGGGCGGGAACCTCGAGGGAGAGGCTA
GTGGTAACAGGCCGAGCTGGATGGATGGGTATGGGGAGAGGGGCAGGA
CGTTCAGCCCTGGGATTCTGGCCGACCCTCGCCTTCCTTCTCTGCAGC
TTCCCCGCAGCCACCTCCCCGTGCAAGATCCTCAAGTGCAACTCTGAG
A nucleic acid sequence encoding a processed RGM-A isoform 1 is shown below (SEQ ID NO: 64):
A human RGM-A isoform 2 precursor protein sequence (NCBI Ref Seq NP_001159758.1) is as follows:
MGMGRGAGRS ALGFWPTLAF LLCSFPAATS PCKILKCNSE FWSATSGSHA PASDDTPEFC
The signal peptide is indicated by solid underline.
A mature RGM-A isoform 2 sequence is as follows:
A nucleic acid sequence encoding unprocessed human RGM-A isoform 2 precursor protein is shown below (SEQ ID NO: 67), corresponding to nucleotides 164-1465 of NCBI Reference Sequence NM_001166286.1. The signal sequence is underlined.
ATGGGTATGGGGAGAGGGGCAGGACGTTCAGCCCTGGGATTCTGGCC
GACCCTCGCCTTCCTTCTCTGCAGCTTCCCCGCAGCCACCTCCCCGT
A nucleic acid sequence encoding a processed RGM-A isoform 2 is shown below (SEQ ID NO: 68):
A human RGM-A isoform 3 precursor protein sequence (NCBI Ref Seq NP_064596.2) is as follows:
MQPPRERLVV TGRAGWMGMG RGAGRSALGF WPTLAFLLCS FPAATSPCKI LKCNSEFWSA
The signal peptide is indicated by solid underline.
A mature RGM-A isoform 3 sequence is as follows:
A nucleic acid sequence encoding unprocessed RGM-A isoform 3 precursor protein is shown below (SEQ ID NO: 71), corresponding to nucleotides 283-1632 of NCBI Reference Sequence NM_020211.2. The signal sequence is underlined.
ATGCAGCCGCCAAGGGAGAGGCTAGTGGTAACAGGCCGAGCTGGATGGA
TGGGTATGGGGAGAGGGGCAGGACGTTCAGCCCTGGGATTCTGGCCGAC
CCTCGCCTTCCTTCTCTGCAGCTTCCCCGCAGCCACCTCCCCGTGCAAG
A nucleic acid sequence encoding processed RGM-A isoform 3 is shown below (SEQ ID NO: 72):
In certain embodiments, the disclosure relates to heteromultimers that comprise at least one RGM-A polypeptide, which includes fragments, functional variants, and modified forms thereof. Preferably, RGM-A polypeptides for use in accordance with the disclosure (e.g., heteromultimers comprising a RGM-A polypeptide and uses thereof) are soluble (e.g., an extracellular domain of RGM-A). In other preferred embodiments, RGM-A polypeptides for use in accordance with the disclosure bind to and/or inhibit (antagonize) activity (e.g., Smad signaling) of one or more TGF-beta superfamily ligands. In some embodiments, heteromultimers of the disclosure comprise at least one RGM-A polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NOs: 61, 62, 65, 66, 69, or 70. In some embodiments, heteromultimers of the disclosure comprise at least one RGM-A polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 1-177 (e.g., amino acid residues 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, or 177) of SEQ ID NO: 61, and ends at any one of amino acids 430-458 (e.g., amino acid residues 430, 431, 432, 433, 434, 435, 435, 436, 437, 438, 439, 440, 441, 442, 443, 444, 445, 446, 447, 448, 449, 450, 451, 452, 453, 454, 455, 456, 457, or 458) of SEQ ID NO: 61. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-A polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 1-430 of SEQ ID NO: 61. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-A polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 1-458 of SEQ ID NO: 61. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-A polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 177-430 of SEQ ID NO: 61. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-A polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 177-458 of SEQ ID NO: 61. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-A polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 56-430 of SEQ ID NO: 61. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-A polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 56-458 of SEQ ID NO: 61. In some embodiments, heteromultimers of the disclosure comprise at least one RGM-A polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 1-153 (e.g., amino acid residues 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, or 153) of SEQ ID NO: 65, and ends at any one of amino acids 406-434 (e.g., amino acid residues 406, 407, 408, 409, 410, 411, 412, 413, 414, 415, 416, 417, 418, 419, 420, 421, 422, 423, 424, 425, 426, 427, 428, 429, 430, 431, 432, 433, 434) of SEQ ID NO: 65. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-A polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 1-406 of SEQ ID NO: 65. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-A polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 153-406 of SEQ ID NO: 65. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-A polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 1-434 of SEQ ID NO: 65. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-A polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 153-434 of SEQ ID NO: 65. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-A polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 32-406 of SEQ ID NO: 65. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-A polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 32-434 of SEQ ID NO: 65. In some embodiments, heteromultimers of the disclosure comprise at least one RGM-A polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 1-169 (e.g., amino acid residues 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169) of SEQ ID NO: 69, and ends at any one of amino acids 422-450 (e.g., amino acid residues 422, 423, 424, 425, 426, 427, 428, 429, 430, 431, 432, 433, 434, 435, 435, 436, 437, 438, 439, 440, 441, 442, 443, 444, 445, 446, 447, 448, 449, 450) of SEQ ID NO: 69. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-A polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 1-422 of SEQ ID NO: 69. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-A polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 169-422 of SEQ ID NO: 69. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-A polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 1-450 of SEQ ID NO: 69. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-A polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 169-450 of SEQ ID NO: 69. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-A polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 48-422 of SEQ ID NO: 69. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-A polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 48-450 of SEQ ID NO: 69. In some embodiments, heteromultimers of the disclosure comprise at least one RGM-A polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 56-61 (e.g., amino acid residues 56, 57, 58, 59, 60, or 61) of SEQ ID NO: 61, and ends at any one of amino acids 366-458 (e.g., amino acid residues 366, 367, 368, 369, 370, 371, 372, 373, 374, 375, 376, 377, 378, 379, 380, 381, 382, 383, 384, 385, 386, 387, 388, 389, 390, 391, 392, 393, 394, 395, 396, 397, 398, 399, 400, 401, 402, 403, 404, 405, 406, 407, 408, 409, 410, 411, 412, 413, 414, 415, 416, 417, 418, 419, 420, 421, 422, 423, 424, 425, 426, 427, 428, 429, 430, 431, 432, 433, 434, 435, 436, 437, 438, 439, 440, 441, 442, 443, 444, 445, 446, 447, 448, 449, 450, 451, 452, 453, 454, 455, 456, 457, or 458) of SEQ ID NO: 61. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-A polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 56-458 of SEQ ID NO: 61. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-A polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 61-366 of SEQ ID NO: 61. In some embodiments, heteromultimers of the disclosure comprise at least one RGM-A polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 32-37 (e.g., amino acid residues 32, 33, 34, 35, 36, or 37) of SEQ ID NO: 65, and ends at any one of amino acids 362-434 (e.g., amino acid residues 362, 363, 364, 365, 366, 367, 368, 369, 370, 371, 372, 373, 374, 375, 376, 377, 378, 379, 380, 381, 382, 383, 384, 385, 386, 387, 388, 389, 390, 391, 392, 393, 394, 395, 396, 397, 398, 399, 400, 401, 402, 403, 404, 405, 406, 407, 408, 409, 410, 411, 412, 413, 414, 415, 416, 417, 418, 419, 420, 421, 422, 423, 424, 425, 426, 427, 428, 429, 430, 431, 432, 433, or 434) of SEQ ID NO: 65. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-A polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 32-434 of SEQ ID NO: 65. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-A polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 37-362 of SEQ ID NO: 65. In some embodiments, heteromultimers of the disclosure comprise at least one RGM-A polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 48-53 (e.g., amino acid residues 48, 49, 50, 51, 52, or 53) of SEQ ID NO: 69, and ends at any one of amino acids 378-450 (e.g., amino acid residues 378, 379, 380, 381, 382, 383, 384, 385, 386, 387, 388, 389, 390, 391, 392, 393, 394, 395, 396, 397, 398, 399, 400, 401, 402, 403, 404, 405, 406, 407, 408, 409, 410, 411, 412, 413, 414, 415, 416, 417, 418, 419, 420, 421, 422, 423, 424, 425, 426, 427, 428, 429, 430, 431, 432, 433, 434, 435, 436, 437, 438, 439, 440, 441, 442, 443, 444, 445, 446, 447, 448, 449, 450) of SEQ ID NO: 69. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-A polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 48-450 of SEQ ID NO: 69. In some embodiments, heteromultimers of the disclosure comprise of at least one RGM-A polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 53-378 of SEQ ID NO: 69.
The term “hemojuvelin polypeptide” includes polypeptides comprising any naturally occurring hemojuvelin protein (encoded by HFE2 or one of its nonhuman orthologs) as well as any variants thereof (including mutants, fragments, fusions, and peptidomimetic forms) that retain a useful activity.
The human hemojuvelin isoform A precursor protein sequence (NCBI Ref Seq NP_998818.1) is as follows:
MGEPGQSPSP RSSHGSPPTL STLTLLLLLC GHAHSQCKIL RCNAEYVSST LSLRGGGSSG
The signal peptide is indicated by single underline.
A processed hemojuvelin isoform A polypeptide sequence is as follows:
A nucleic acid sequence encoding unprocessed human hemojuvelin isoform A precursor protein is shown below (SEQ ID NO: 75), corresponding to nucleotides 326-1603 of NCBI Reference Sequence NM_213653.3. The signal sequence is underlined.
ATGGGGGAGCCAGGCCAGTCCCCTAGTCCCAGGTCCTCCCATGGCAGT
CCCCCAACTCTAAGCACTCTCACTCTCCTGCTGCTCCTCTGTGGACAT
GCTCATTCTCAATGCAAGATCCTCCGCTGCAATGCTGAGTACGTATCG
A nucleic acid sequence encoding a processed hemojuvelin isoform A is shown below (SEQ ID NO: 76):
A human hemojuvelin isoform B protein sequence (NCBI Ref Seq NP_660320.3) is as follows:
A processed hemojuvelin isoform B polypeptide sequence is as follows:
MIQHNCSRQGPTAPPPPRGPALPGAGSGLPAPDPCDYEGRFSRLHGRPPGFLHCASFGDPHVRSFHHHFHTCRVQ GAWPLLDNDFLFVQATSSPMALGANATATRKLTIIFKNMQECIDQKVYQAEVDNLPVAFEDGSINGGDRPGGSSL SIQTANPGNHVEIQAAYIGTTIIIRQTAGQLSFSIKVAEDVAMAFSAEQDLQLCVGGCPPSQRLSRSERNRRGAI TIDTARRLCKEGLPVEDAYFHSCVFDVLISGDPNFTVAAQAALEDARAFLPDLEKLHLFPSD (SEQ ID NO: 78)
A nucleic acid sequence encoding human hemojuvelin isoform B precursor protein is shown below (SEQ ID NO: 79), corresponding to nucleotides 479-1417 of NCBI Reference Sequence NM_145277.4.
A nucleic acid sequence encoding a processed hemojuvelin isoform B is shown below (SEQ ID NO: 80):
A human hemojuvelin isoform C protein sequence (NCBI Ref Seq NP_973733.1) is as follows:
A processed hemojuvelin isoform C polypeptide sequence is as follows:
A nucleic acid sequence encoding human hemojuvelin isoform C protein is shown below (SEQ ID NO: 83), corresponding to nucleotides 295-894 of NCBI Reference Sequence NM_202004.3.
A nucleic acid sequence encoding a processed hemojuvelin isoform C is shown below (SEQ ID NO: 84):
In certain embodiments, the disclosure relates to heteromultimers that comprise at least one hemojuvelin polypeptide, which includes fragments, functional variants, and modified forms thereof. Preferably, hemojuvelin polypeptides for use in accordance with the disclosure (e.g., heteromultimers comprising a hemojuvelin polypeptide and uses thereof) are soluble (e.g., an extracellular domain of hemojuvelin). In other preferred embodiments, hemojuvelin polypeptides for use in accordance with disclosure bind to and/or inhibit (antagonize) activity (e.g., Smad signaling) of one or more TGF-beta superfamily ligands. In some embodiments, heteromultimers of the disclosure comprise at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NOs: 73, 74, 77, 78, 81, or 82. In some embodiments, heteromultimers of the disclosure comprise at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 1-36 (e.g., amino acid residues 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 35, or 36) of SEQ ID NO: 73, and ends at any one of amino acids 400-426 (e.g., amino acid residues 400, 401, 402, 403, 404, 405, 406, 407, 408, 409, 410, 411, 412, 413, 414, 415, 416, 417, 418, 419, 420, 421, 422, 423, 424, 425, or 426) of SEQ ID NO: 73. In some embodiments, heteromultimers of the disclosure comprise at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 36-42 (e.g., amino acid residues 36, 37, 38, 39, 40, 41, or 42) of SEQ ID NO: 73, and ends at any one of amino acids 167-172 (e.g., amino acid residues 167, 168, 169, 170, 171, or 172) of SEQ ID NO: 73. In some embodiments, heteromultimers of the disclosure comprise at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 173-185 (e.g., amino acid residues 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, or 185) of SEQ ID NO: 73, and ends at any one of amino acids 361-400 (e.g., amino acid residues 361, 362, 363, 364, 365, 366, 367, 368, 369, 370, 371, 372, 373, 374, 375, 376, 377, 378, 379, 380, 381, 382, 383, 384, 385, 386, 387, 388, 389, 390, 391, 392, 393, 394, 395, 396, 397, 398, 399, 400) of SEQ ID NO: 73. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 1-400 of SEQ ID NO: 73. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 1-426 of SEQ ID NO: 73. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 36-400 of SEQ ID NO: 73. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 36-426 of SEQ ID NO: 73. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 36-167 of SEQ ID NO: 73. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 36-172 of SEQ ID NO: 73. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 42-167 of SEQ ID NO: 73. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 42-172 of SEQ ID NO: 73. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 173-361 of SEQ ID NO: 73. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 173-400 of SEQ ID NO: 73. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 185-361 of SEQ ID NO: 73. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 185-400 of SEQ ID NO: 73. In some embodiments, heteromultimers of the disclosure comprise at least one hemojuvelin protein, wherein the hemojuvelin protein is a dimer comprising a first polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 36-42 (e.g., amino acid residues 36, 37, 38, 39, 40, 41, or 42) of SEQ ID NO: 73, and ends at any one of amino acids 167-172 (e.g., amino acid residues 167, 168, 169, 170, 171, or 172) of SEQ ID NO: 73, and second polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 173-185 (e.g., amino acid residues 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, or 185) of SEQ ID NO: 73, and ends at any one of amino acids 361-400 (e.g., amino acid residues 361, 362, 363, 364, 365, 366, 367, 368, 369, 370, 371, 372, 373, 374, 375, 376, 377, 378, 379, 380, 381, 382, 383, 384, 385, 386, 387, 388, 389, 390, 391, 392, 393, 394, 395, 396, 397, 398, 399, 400) of SEQ ID NO: 73. In some embodiments, heteromultimers of the disclosure comprise at least one single chain ligand trap that comprises a first hemojuvelin polypeptide domain that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 36-42 (e.g., amino acid residues 36, 37, 38, 39, 40, 41, or 42) of SEQ ID NO: 73, and ends at any one of amino acids 167-172 (e.g., amino acid residues 167, 168, 169, 170, 171, or 172) of SEQ ID NO: 73, and second hemojuvelin polypeptide domain that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 173-185 (e.g., amino acid residues 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, or 185) of SEQ ID NO: 73, and ends at any one of amino acids 361-400 (e.g., amino acid residues 361, 362, 363, 364, 365, 366, 367, 368, 369, 370, 371, 372, 373, 374, 375, 376, 377, 378, 379, 380, 381, 382, 383, 384, 385, 386, 387, 388, 389, 390, 391, 392, 393, 394, 395, 396, 397, 398, 399, 400) of SEQ ID NO: 73. In some embodiments, heteromultimers of the disclosure comprise at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 1-6 (e.g., amino acid residues 1, 2, 3, 4, 5, or 6) of SEQ ID NO: 77, and ends at any one of amino acids 287-313 (e.g., amino acid residues 287, 288, 289, 300, 301, 302, 303, 304, 305, 306, 307, 308, 309, 310, 311, 312, or 313) of SEQ ID NO: 77. In some embodiments, heteromultimers of the disclosure comprise at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 1-6 (e.g., amino acid residues 1, 2, 3, 4, 5, or 6) of SEQ ID NO: 77, and ends at any one of amino acids 54-59 (e.g., amino acid residues 54, 55, 56, 57, 58, or 59) of SEQ ID NO: 77. In some embodiments, heteromultimers of the disclosure comprise at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 60-72 (e.g., amino acid residues 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, or 72) of SEQ ID NO: 77, and ends at any one of amino acids 248-287 (e.g., amino acid residues 248, 249, 250, 251, 252, 253, 254, 255, 256, 257, 258, 259, 260, 261, 262, 263, 264, 265, 266, 267, 268, 269, 270, 271, 272, 273, 274, 275, 276, 277, 278, 279, 280, 281, 282, 283, 284, 285, 286, or 287) of SEQ ID NO: 77. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 1-287 of SEQ ID NO: 77. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 1-313 of SEQ ID NO: 77. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 6-287 of SEQ ID NO: 77. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 6-313 of SEQ ID NO: 77. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 1-54 of SEQ ID NO: 77. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 1-59 of SEQ ID NO: 77. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 6-54 of SEQ ID NO: 77. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 6-59 of SEQ ID NO: 77. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 60-248 of SEQ ID NO: 77. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 60-287 of SEQ ID NO: 77. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 72-248 of SEQ ID NO: 77. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 72-287 of SEQ ID NO: 77. In some embodiments, heteromultimers of the disclosure comprise at least one hemojuvelin protein, wherein the hemojuvelin protein is a dimer comprising a first polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 1-6 (e.g., amino acid residues 1, 2, 3, 4, 5, or 6) of SEQ ID NO: 77, and ends at any one of amino acids 54-59 (e.g., amino acid residues 54, 55, 56, 57, 58, or 59) of SEQ ID NO: 77, and second polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 60-72 (e.g., amino acid residues 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, or 72) of SEQ ID NO: 77, and ends at any one of amino acids 248-287 (e.g., amino acid residues 248, 249, 250, 251, 252, 253, 254, 255, 256, 257, 258, 259, 260, 261, 262, 263, 264, 265, 266, 267, 268, 269, 270, 271, 272, 273, 274, 275, 276, 277, 278, 279, 280, 281, 282, 283, 284, 285, 286, or 287) of SEQ ID NO: 77. In some embodiments, heteromultimers of the disclosure comprise at least one single chain ligand trap that comprises a first hemojuvelin polypeptide domain that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 1-6 (e.g., amino acid residues 1, 2, 3, 4, 5, or 6) of SEQ ID NO: 77, and ends at any one of amino acids 54-59 (e.g., amino acid residues 54, 55, 56, 57, 58, or 59) of SEQ ID NO: 77, and second hemojuvelin polypeptide domain that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 60-72 (e.g., amino acid residues 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, or 72) of SEQ ID NO: 77, and ends at any one of amino acids 248-287 (e.g., amino acid residues 248, 249, 250, 251, 252, 253, 254, 255, 256, 257, 258, 259, 260, 261, 262, 263, 264, 265, 266, 267, 268, 269, 270, 271, 272, 273, 274, 275, 276, 277, 278, 279, 280, 281, 282, 283, 284, 285, 286, or 287) of SEQ ID NO: 77. In some embodiments, heteromultimers of the disclosure comprise at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 1-4 (e.g., amino acid residues 1, 2, 3, or 4) of SEQ ID NO: 81, and ends at any one of amino acids 135-200 (e.g., amino acid residues 135, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200) of SEQ ID NO: 81. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 1-135 of SEQ ID NO: 81. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 1-200 of SEQ ID NO: 81. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 4-135 of SEQ ID NO: 81. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 4-200 of SEQ ID NO: 81. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 1-174 of SEQ ID NO: 81. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 4-174 of SEQ ID NO: 81. In some embodiments, heteromultimers of the disclosure comprise at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 36-37 (e.g., amino acid residues 36 or 37) of SEQ ID NO: 73, and ends at any one of amino acids 424-426 (e.g., amino acid residues 424, 425, or 426) of SEQ ID NO: 73. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 36-426 of SEQ ID NO: 73. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 37-424 of SEQ ID NO: 73. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 36-400 of SEQ ID NO: 73. In some embodiments, heteromultimers of the disclosure comprise at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at anyone of amino acids of 1-4 (e.g., amino acid residues 1, 2, 3, or 4) of SEQ ID NO: 82, and ends at any one of amino acids 135-174 (e.g., amino acid residues 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, or 174) of SEQ ID NO: 82. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 1-174 of SEQ ID NO: 82. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 4-135 of SEQ ID NO: 82. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 1-174 of SEQ ID NO: 82. In some embodiments, heteromultimers of the disclosure comprise at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at anyone of amino acids of 1-6 (e.g., amino acid residues 1, 2, 3, 4, 5, or 6) of SEQ ID NO: 77, and ends at any one of amino acids 311-313 (e.g., amino acid residues 311, 312, or 313) of SEQ ID NO: 77. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 1-313 of SEQ ID NO: 77. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 6-311 of SEQ ID NO: 77. In some embodiments, heteromultimers of the disclosure comprise of at least one hemojuvelin polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 1-127 of SEQ ID NO: 77.
The term “betaglycan polypeptide” includes polypeptides comprising any naturally is occurring betaglycan protein (encoded by TGFBR3 or one of its nonhuman orthologs) as well as any variants thereof (including mutants, fragments, fusions, and peptidomimetic forms) that retain a useful activity.
The human betaglycan isoform A precursor protein sequence (NCBI Ref Seq NP_003234.2) is as follows:
MTSHYVIAIF ALMSSCLATA GPEPGALCEL SPVSASHPVQ ALMESFTVLS GCASRGTTGL
PQEVHVLNLR TAGQGPGQLQ REVTLHLNPI SSVHIHHKSV VFLLNSPHPL VWHLKTERLA
TGVSRLFLVS EGSVVQFSSA NFSLTAETEE RNFPHGNEHL LNWAREEYGA VISFTELKIA
RNIYIKVGED QVFPPKCNIG KNFLSLNYLA EYLQPKAAEG CVMSSQPQNE EVHIIELITP
NSNPYSAFQV DITIDIRPSQ EDLEVVKNLI LILKCKKSVN WVIKSFDVKG SLKIIAPNSI
GFGKESERSM TMTKSIRDDI PSTQGNLVKW ALDNGYSPIT SYTMAPVANR FHLRLENNAE
EMGDEEVHTI PPELRILLDP GALPALQNPP IRGGEGQNGG LPFPFPDISR RVWNEEGEDG
LPRPKDPVIP SIQLFPGLRE PEEVQGSVDI ALSVKCDNEK MIVAVEKDSF QASGYSGMDV
TLLDPTCKAK MNGTHFVLES PLNGCGTRPR WSALDGVVYY NSIVIQVPAL GDSSGWPDGY
EDLESGDNGF PGDMDEGDAS LFTRPEIVVF NCSLQQVRNP SSFQEQPHGN ITFNMELYNT
DLFLVPSQGV FSVPENGHVY VEVSVTKAEQ ELGFAIQTCF ISPYSNPDRM SHYTIIENIC
PKDESVKFYS PKRVHFPIPQ ADMDKKRFSF VFKPVFNTSL LFLQCELTLC TKMEKHPQKL
PKCVPPDEAC TSLDASIIWA MMQNKKTFTK PLAVIHHEAE SKEKGPSMEE PNPISPPIFH
The signal peptide is indicated by single underline, the extracellular domain is indicated in bold font, and the transmembrane domain is indicated by dotted-underline. This isoform differs from betaglycan isoform B by insertion of a single alanine indicated above by double underline.
A processed betaglycan isoform A polypeptide sequence is as follows:
A nucleic acid sequence encoding the unprocessed precursor protein of human betaglycan isoform A is shown below (SEQ ID NO: 87), corresponding to nucleotides 516-3068 of NCBI Reference Sequence NM_003243.4. The signal sequence is indicated by solid underline and the transmembrane region by dotted underline.
ATGACTTCCCATTATGTGATTGCCATCTTTGCCCTGATGAGCTCCTGTTTAGCCACTGCAGGTCCAGAGCCTGGT
A nucleic acid sequence encoding a processed extracellular domain of betaglycan isoform A is shown below (SEQ ID NO: 88):
A human betaglycan isoform B precursor protein sequence (NCBI Ref Seq NP_001182612.1) is as follows:
MTSHYVIAIF ALMSSCLATA GPEPGALCEL SPVSASHPVQ ALMESFTVLS GCASRGTTGL
PQEVHVLNLR TAGQGPGQLQ REVTLHLNPI SSVHIHHKSV VFLLNSPHPL VWHLKTERLA
TGVSRLFLVS EGSVVQFSSA NFSLTAETEE RNFPHGNEHL LNWAREEYGA VISFTELKIA
RNIYIKVGED QVFPPKCNIG KNFLSLNYLA EYLQPKAAEG CVMSSQPQNE EVHIIELITP
NSNPYSAFQV DITIDIRPSQ EDLEVVKNLI LILKCKKSVN WVIKSFDVKG SLKIIAPNSI
GFGKESERSM TMTKSIRDDI PSTQGNLVKW ALDNGYSPIT SYTMAPVANR FHLRLENNEE
MGDEEVHTIP PELRILLDPG ALPALQNPPI RGGEGQNGGL PFPFPDISRR VWNEEGEDGL
PRPKDPVIPS IQLFPGLREP EEVQGSVDIA LSVKCDNEKM IVAVEKDSFQ ASGYSGMDVT
LLDPTCKAKM NGTHFVLESP LNGCGTRPRW SALDGVVYYN SIVIQVPALG DSSGWPDGYE
DLESGDNGFP GDMDEGDASL FTRPEIVVFN CSLQQVRNPS SFQEQPHGNI TFNMELYNTD
LFLVPSQGVF SVPENGHVYV EVSVTKAEQE LGFAIQTCFI SPYSNPDRMS HYTIIENICP
KDESVKFYSP KRVHFPIPQA DMDKKRFSFV FKFVFNTSLL FLQCELTLCT KMEKHPQKLP
KCVPPDEACT SLDASIIWAM MQNKKTFTKP LAVIHHEAES KEKGPSMKEP NPISPPIFHG
The signal peptide is indicated by single underline, the extracellular domain is indicated in bold font, and the transmembrane domain is indicated by dotted underline.
A processed betaglycan isoform B polypeptide sequence is as follows:
A nucleic acid sequence encoding the unprocessed precursor protein of human betaglycan isoform B is shown below (SEQ ID NO: 91), corresponding to nucleotides 516-3065 of NCBI Reference Sequence NM_001195683.1. The signal sequence is indicated by solid underline and the transmembrane region by dotted underline.
ATGACTTCCCATTATGTGATTGCCATCTTTGCCCTGATGAGCTCCTGTTTAGCCACTGCAGGTCCAGAGCCTGGT
A nucleic acid sequence encoding a processed extracellular domain of betaglycan isoform B is shown below (SEQ ID NO: 92):
In certain embodiments, the disclosure relates to heteromultimers that comprise at least one betaglycan polypeptide, which includes fragments, functional variants, and modified forms thereof. Preferably, betaglycan polypeptides for use in accordance with inventions of the disclosure (e.g., heteromultimers comprising a betaglycan polypeptide and uses thereof) are soluble (e.g., an extracellular domain of betaglycan). In other preferred embodiments, betaglycan polypeptides for use in accordance with the inventions of the disclosure bind to and/or inhibit (antagonize) activity (e.g., Smad signaling) of one or more TGF-beta superfamily ligands. In some embodiments, heteromultimers of the disclosure comprise of at least one betaglycan polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NOs: 85, 86, 89, or 90. In some embodiments, heteromultimers of the disclosure comprise at least one betaglycan polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 21-28 (e.g., amino acid residues 21, 22, 23, 24, 25, 26, 27, or 28) of SEQ ID NO: 85, and ends at any one of amino acids 381-787 (e.g., amino acid residues 381, 382, 383, 384, 385, 386, 387, 388, 389, 390, 391, 392, 393, 394, 395, 396, 397, 398, 399, 400, 401, 402, 403, 404, 405, 406, 407, 408, 409, 410, 411, 412, 413, 414, 415, 416, 417, 418, 419, 420, 421, 422, 423, 424, 425, 426, 427, 428, 429, 430, 431, 432, 433, 434, 435, 436, 437, 438, 439, 440, 441, 442, 443, 444, 445, 446, 447, 448, 449, 450, 451, 452, 453, 454, 455, 456, 457, 458, 459, 460, 461, 462, 463, 464, 465, 466, 467, 468, 469, 470, 471, 472, 473, 474, 475, 476, 477, 478, 479, 480, 481, 482, 483, 484, 485, 486, 487, 488, 489, 490, 491, 492, 493, 494, 495, 496, 497, 498, 499, 500, 501, 502, 503, 504, 505, 506, 507, 508, 509, 510, 511, 512, 513, 514, 515, 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554, 555, 556, 557, 558, 559, 560, 561, 562, 563, 564, 565, 566, 567, 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 620, 621, 622, 623, 624, 625, 626, 627, 628, 629, 630, 631, 632, 633, 634, 635, 635, 636, 637, 638, 639, 640, 641, 642, 643, 644, 645, 646, 647, 648, 649, 650, 651, 652, 653, 654, 655, 656, 657, 658, 659, 660, 661, 662, 663, 664, 665, 666, 667, 668, 669, 670, 671, 672, 673, 674, 675, 676, 677, 678, 679, 680, 681, 682, 683, 684, 685, 686, 687, 688, 689, 690, 691, 692, 693, 694, 695, 696, 697, 698, 699, 700, 701, 702, 703, 704, 705, 706, 707, 708, 709, 710, 711, 712, 713, 714, 715, 716, 717, 718, 719, 720, 721, 722, 723, 724, 725, 726, 727, 728, 729, 730, 731, 732, 733, 734, 735, 736, 737, 738, 739, 740, 741, 742, 743, 744, 745, 746, 747, 748, 749, 750, 751, 752, 753, 754, 755, 756, 757, 758, 759, 760, 761, 762, 763, 764, 765, 766, 767, 768, 769, 770, 771, 772, 773, 774, 775, 776, 777, 778, 779, 780, 781, 782, 783, 784, 785, 786, or 787) of SEQ ID NO: 85. In some embodiments, heteromultimers of the disclosure comprise at least one betaglycan polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 21-381 of SEQ ID NO: 85. In some embodiments, heteromultimers of the disclosure comprise at least one betaglycan polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 21-787 of SEQ ID NO: 85. In some embodiments, heteromultimers of the disclosure comprise at least one betaglycan polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 28-381 of SEQ ID NO: 85. In some embodiments, heteromultimers of the disclosure comprise at least one betaglycan polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 28-787 of SEQ ID NO: 85. In some embodiments, heteromultimers of the disclosure comprise of at least one betaglycan polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 21-781 of SEQ ID NO: 85. In some embodiments, heteromultimers of the disclosure comprise at least one betaglycan polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 28-781 of SEQ ID NO: 85. In some embodiments, heteromultimers of the disclosure comprise at least one betaglycan polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 21-28 (e.g., amino acid residues 21, 22, 23, 24, 25, 26, or 27) of SEQ ID NO: 89, and ends at any one of amino acids 380-786 (e.g., amino acid residues 380, 381, 382, 383, 384, 385, 386, 387, 388, 389, 390, 391, 392, 393, 394, 395, 396, 397, 398, 399, 400, 401, 402, 403, 404, 405, 406, 407, 408, 409, 410, 411, 412, 413, 414, 415, 416, 417, 418, 419, 420, 421, 422, 423, 424, 425, 426, 427, 428, 429, 430, 431, 432, 433, 434, 435, 436, 437, 438, 439, 440, 441, 442, 443, 444, 445, 446, 447, 448, 449, 450, 451, 452, 453, 454, 455, 456, 457, 458, 459, 460, 461, 462, 463, 464, 465, 466, 467, 468, 469, 470, 471, 472, 473, 474, 475, 476, 477, 478, 479, 480, 481, 482, 483, 484, 485, 486, 487, 488, 489, 490, 491, 492, 493, 494, 495, 496, 497, 498, 499, 500, 501, 502, 503, 504, 505, 506, 507, 508, 509, 510, 511, 512, 513, 514, 515, 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554, 555, 556, 557, 558, 559, 560, 561, 562, 563, 564, 565, 566, 567, 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 620, 621, 622, 623, 624, 625, 626, 627, 628, 629, 630, 631, 632, 633, 634, 635, 635, 636, 637, 638, 639, 640, 641, 642, 643, 644, 645, 646, 647, 648, 649, 650, 651, 652, 653, 654, 655, 656, 657, 658, 659, 660, 661, 662, 663, 664, 665, 666, 667, 668, 669, 670, 671, 672, 673, 674, 675, 676, 677, 678, 679, 680, 681, 682, 683, 684, 685, 686, 687, 688, 689, 690, 691, 692, 693, 694, 695, 696, 697, 698, 699, 700, 701, 702, 703, 704, 705, 706, 707, 708, 709, 710, 711, 712, 713, 714, 715, 716, 717, 718, 719, 720, 721, 722, 723, 724, 725, 726, 727, 728, 729, 730, 731, 732, 733, 734, 735, 736, 737, 738, 739, 740, 741, 742, 743, 744, 745, 746, 747, 748, 749, 750, 751, 752, 753, 754, 755, 756, 757, 758, 759, 760, 761, 762, 763, 764, 765, 766, 767, 768, 769, 770, 771, 772, 773, 774, 775, 776, 777, 778, 779, 780, 781, 782, 783, 784, 785, or 786) of SEQ ID NO: 89. In some embodiments, heteromultimers of the disclosure comprise at least one betaglycan polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 21-380 of SEQ ID NO: 89. In some embodiments, heteromultimers of the disclosure comprise at least one betaglycan polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 21-786 of SEQ ID NO: 89. In some embodiments, heteromultimers of the disclosure comprise at least one betaglycan polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 28-380 of SEQ ID NO: 89. In some embodiments, heteromultimers of the disclosure comprise at least one betaglycan polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 28-786 of SEQ ID NO: 89. In some embodiments, heteromultimers of the disclosure comprise at least one betaglycan polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 21-780 of SEQ ID NO: 89. In some embodiments, heteromultimers of the disclosure comprise at least one betaglycan polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 28-780 of SEQ ID NO: 89. In some embodiments, heteromultimers of the disclosure comprise at least one betaglycan polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 21-28 (e.g., amino acid residues 21, 22, 23, 24, 25, 26, 27, or 28) of SEQ ID NO: 85, and ends at any one of amino acids 730-787 (e.g., amino acid residues 730, 731, 732, 733, 734, 735, 736, 737, 738, 739, 740, 741, 742, 743, 744, 745, 746, 747, 748, 749, 750, 751, 752, 753, 754, 755, 756, 757, 758, 759, 760, 761, 762, 763, 764, 765, 766, 767, 768, 769, 770, 771, 772, 773, 774, 775, 776, 777, 778, 779, 780, 781, 782, 783, 784, 785, 786, or 787) of SEQ ID NO: 85. In some embodiments, heteromultimers of the disclosure comprise at least one betaglycan polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 21-787 of SEQ ID NO: 85. In some embodiments, heteromultimers of the disclosure comprise at least one betaglycan polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 28-730 of SEQ ID NO: 85. In some embodiments, heteromultimers of the disclosure comprise at least one betaglycan polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 21-28 (e.g., amino acid residues 21, 22, 23, 24, 25, 26, 27, or 28) of SEQ ID NO: 85, and ends at any one of amino acids 730-787 (e.g., amino acid residues 730, 731, 732, 733, 734, 735, 736, 737, 738, 739, 740, 741, 742, 743, 744, 745, 746, 747, 748, 749, 750, 751, 752, 753, 754, 755, 756, 757, 758, 759, 760, 761, 762, 763, 764, 765, 766, 767, 768, 769, 770, 771, 772, 773, 774, 775, 776, 777, 778, 779, 780, 781, 782, 783, 784, 785, 786, or 787) of SEQ ID NO: 85. In some embodiments, heteromultimers of the disclosure comprise at least one betaglycan polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 21-787 of SEQ ID NO: 85. In some embodiments, heteromultimers of the disclosure comprise at least one betaglycan polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 28-730 of SEQ ID NO: 85. In some embodiments, heteromultimers of the disclosure comprise at least one betaglycan polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 21-28 (e.g., amino acid residues 21, 22, 23, 24, 25, 26, 27, or 28) of SEQ ID NO: 85, and ends at any one of amino acids 730-787 (e.g., amino acid residues 729, 730, 731, 732, 733, 734, 735, 736, 737, 738, 739, 740, 741, 742, 743, 744, 745, 746, 747, 748, 749, 750, 751, 752, 753, 754, 755, 756, 757, 758, 759, 760, 761, 762, 763, 764, 765, 766, 767, 768, 769, 770, 771, 772, 773, 774, 775, 776, 777, 778, 779, 780, 781, 782, 783, 784, 785, or 786) of SEQ ID NO: 85. In some embodiments, heteromultimers of the disclosure comprise at least one betaglycan polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 21-786 of SEQ ID NO: 85. In some embodiments, heteromultimers of the disclosure comprise at least one betaglycan polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 28-729 of SEQ ID NO: 85.
The term “MuSK polypeptide” includes polypeptides comprising any naturally occurring MuSK protein (encoded by MUSK or one of its nonhuman orthologs) as well as any variants thereof (including mutants, fragments, fusions, and peptidomimetic forms) that retain a useful activity.
A human MuSK isoform 1 precursor protein sequence (NCBI Reference Sequence NP_005583.1) is as follows:
MRELVNIPLV HILTLVAFS
G TEKLPKAPVI TTPLETVDAL VEEVATFMCA
VESYPQPEIS WTRNKILIKL FDTRYSIREN GQLLTILSVE DSDDGIYCCT
ANNGVGGAVE SCGALQVKMK PKITRPPINV KIIEGLKAVL PCTTMGNPKP
SVSWIKGDSP LRENSRIAVL ESGSLRIHNV QKEDAGQYRC VAKNSLGTAY
SKVVKLEVEV FARILRAPES HNVTFGSFVT LHCTATGIPV PTITWIENGN
AVSSGSIQES VKDRVIDSRL QLFITKPGLY TCIATNKHGE KFSTAKAAAT
ISIAEWSKPQ KDNKGYCAQY RGEVCNAVLA KDALVFLNTS YADPEEAQEL
LVHTAWNELK VVSPVCRPAA EALLCNHIFQ ECSPGVVPTP IPICREYCLA
VKELFCAKEW LVMEEKTHRG LYRSEMHLLS VPECSKLPSM HWDPTACARL
The signal peptide is indicated by single underline, the extracellular domain is indicated in bold font, and the transmembrane domain is indicated by dotted underline. This isoform is the longest of human MuSK isoforms 1, 2, and 3.
A processed MuSK isoform 1 polypeptide sequence (SEQ ID NO: 96) is as follows:
A nucleic acid sequence encoding the unprocessed precursor protein of human MuSK isoform 1 is shown below (SEQ ID NO: 97), corresponding to nucleotides 135-2744 of NCBI Reference Sequence NM_005592.3. The signal sequence is indicated by solid underline and the transmembrane region by dotted underline.
ATGAGAGAGCTCGTCAACATTCCACTGGTACATATTCTTACTCTGGTTGCCTTCAGCGGAACTGAGAAACTTCCA
A nucleic acid sequence encoding a processed extracellular domain of MuSK isoform 1 is shown below (SEQ ID NO: 98):
A human MuSK isoform 2 precursor protein sequence (NCBI Reference Sequence NP_001159752.1) is as follows:
MRELVNIPLV HILTLVAFS
G TEKLPKAPVI TTPLETVDAL VEEVATFMCA
VESYPQPEIS WTRNKILIKL FDTRYSIREN GQLLTILSVE DSDDGIYCCT
ANNGVGGAVE SCGALQVKMK PKITRPPINV KIIEGLKAVL PCTTMGNPKP
SVSWIKGDSP LRENSRIAVL ESGSLRIHNV QKEDAGQYRC VAKNSLGTAY
SKVVKLEVEE ESEPEQDTKV FARILRAPES HNVITGSFVT LHCTATGIPV
PTITWIENGN AVSSGSIQES VKDRVIDSRL QLFITKPGLY TCIATNKHGE
KFSTAKAAAT ISIAEWREYC LAVKELFCAK EWLVMEEKTH RGLYRSEMHL
LSVPECSKLP SMHWDPTACA RLPHLAFPPM TSSKPSVDIP NLPSSSSSSF
The signal peptide is indicated by single underline, the extracellular domain is indicated in bold font, and the transmembrane domain is indicated by dotted underline. This variant contains an alternate in-frame exon and lacks an alternate in-frame exon in the middle portion of the coding region compared to variant 1. The encoded isoform 2 is shorter than isoform 1.
A mature MuSK isoform 2 polypeptide sequence (SEQ ID NO: 100) is as follows:
A nucleic acid sequence encoding the unprocessed precursor protein of human MuSK isoform 2 is shown below (SEQ ID NO: 101), corresponding to nucleotides 135-2483 of NCBI Reference Sequence NM_001166280.1. The signal sequence is indicated by solid underline and the transmembrane region by dotted underline.
ATGAGAGAGCTCGTCAACATTCCACTGGTACATATTCTTACTCTGGTTGCCTTCAGCGGAACTGAGAAACTTCCA
A nucleic acid sequence encoding a processed extracellular domain of MuSK isoform 2 is shown below (SEQ ID NO: 102):
A human MuSK isoform 3 precursor protein sequence (NCBI Reference Sequence NP_001159753.1) is as follows:
MRELVNIPLV HILTLVAFS
G TEKLPKAPVI TTPLETVDAL VEEVATFMCA
VESYPQPEIS WTRNKILIKL FDTRYSIREN GQLLTILSVE DSDDGIYCCT
ANNGVGGAVE SCGALQVKMK PKITRPPINV KIIEGLKAVL PCTTMGNPKP
SVSWIKGDSP LRENSRIAVL ESGSLRIHNV QKEDAGQYRC VAKNSLGTAY
SKVVKLEVEV FARILRAPES HNVITGSFVT LHCTATGIPV PTITWIENGN
AVSSGSIQES VKDRVIDSRL QLFITKPGLY TCIATNKHGE KFSTAKAAAT
ISIAEWREYC LAVKELFCAK EWLVMEEKTH RGLYRSEMHL LSVPECSKLP
SMHWDPTACA RLPHLAFPPM TSSKPSVDIP NLPSSSSSSF SVSPTYSMT
V
The signal peptide is indicated by single underline, the extracellular domain is indicated in bold font, and the transmembrane domain is indicated by dotted underline. This variant lacks an alternate in-frame exon in the middle portion of the coding region compared to variant 1. The encoded isoform 3 is shorter than isoform 1.
A processed MuSK isoform 3 polypeptide sequence (SEQ ID NO: 104) is as follows:
A nucleic acid sequence encoding the unprocessed precursor protein of human MuSK isoform 3 is shown below (SEQ ID NO: 105), corresponding to nucleotides 135-2453 of NCBI Reference Sequence NM_001166281.1. The signal sequence is indicated by solid underline and the transmembrane region by dotted underline.
ATGAGAGAGCTCGTCAACATTCCACTGGTACATATTCTTACTCTGGTTGCCTTCAGCGGAACTGAGAAACTTCCA
A nucleic acid sequence encoding a processed extracellular domain of MuSK isoform 3 is shown below (SEQ ID NO: 106):
In certain embodiments, the disclosure relates to heteromultimers that comprise at least one MuSK polypeptide, which includes fragments, functional variants, and modified forms thereof. Preferably, MuSK polypeptides for use in accordance with the disclosure (e.g., heteromultimers comprising a MuSK polypeptide and uses thereof) are soluble (e.g., an extracellular domain of MuSK). In other preferred embodiments, MuSK polypeptides for use in accordance with disclosure bind to and/or inhibit (antagonize) activity (e.g., Smad signaling) of one or more TGF-beta superfamily ligands. In some embodiments, heteromultimers of the disclosure comprise at least one MuSK polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NOs: 95, 96, 99, 100, 103, and 104. In some embodiments, heteromultimers of the disclosure comprise at least one MuSK polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 21-49 (e.g., amino acid residues 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, or 49) of SEQ ID NO: 95, and ends at any one of amino acids 447-495 (e.g., amino acid residues 447, 448, 449, 450, 451, 452, 453, 454, 455, 456, 457, 458, 459, 460, 461, 462, 463, 464, 465, 466, 467, 468, 469, 470, 471, 472, 473, 474, 475, 476, 477, 478, 479, 480, 481, 482, 483, 484, 485, 486, 487, 488, 489, 490, 491, 492, 493, 494, or 495) of SEQ ID NO: 95. In some embodiments, heteromultimers of the disclosure comprise of at least one MuSK polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 21-495 of SEQ ID NO: 95. In some embodiments, heteromultimers of the disclosure comprise of at least one MuSK polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 49-447 of SEQ ID NO: 95. In some embodiments, heteromultimers of the disclosure comprise of at least one MuSK polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 210-495 of SEQ ID NO: 95. In some embodiments, heteromultimers of the disclosure comprise at least one MuSK polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 20-49 (e.g., amino acid residues 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, or 49) of SEQ ID NO: 99, and ends at any one of amino acids 369-409 (e.g., amino acid residues 369, 370, 371, 372, 373, 374, 375, 376, 377, 378, 379, 380, 381, 382, 383, 384, 385, 386, 387, 388, 389, 390, 391, 392, 393, 394, 395, 396, 397, 398, 399, 400, 401, 402, 403, 404, 405, 406, 407, 408, or 409) of SEQ ID NO: 99. In some embodiments, heteromultimers of the disclosure comprise of at least one MuSK polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 20-409 of SEQ ID NO: 99. In some embodiments, heteromultimers of the disclosure comprise of at least one MuSK polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 49-369 of SEQ ID NO: 99. In some embodiments, heteromultimers of the disclosure comprise of at least one MuSK polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 210-409 of SEQ ID NO: 99. In some embodiments, heteromultimers of the disclosure comprise at least one MuSK polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to a polypeptide that begins at any one of amino acids of 20-49 (e.g., amino acid residues 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, or 49) of SEQ ID NO: 103, and ends at any one of amino acids 359-399 (e.g., amino acid residues 359, 360, 361, 362, 363, 364, 365, 366, 367, 368, 369, 370, 371, 372, 373, 374, 375, 376, 377, 378, 379, 380, 381, 382, 383, 384, 385, 386, 387, 388, 389, 390, 391, 392, 393, 394, 395, 396, 397, 398, or 399) of SEQ ID NO: 103. In some embodiments, heteromultimers of the disclosure comprise of at least one MuSK polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 20-399 of SEQ ID NO: 103. In some embodiments, heteromultimers of the disclosure comprise of at least one MuSK polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 49-359 of SEQ ID NO: 103. In some embodiments, heteromultimers of the disclosure comprise of at least one MuSK polypeptide that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acids of 210-399 of SEQ ID NO: 103.
In some embodiments, the present disclosure contemplates making functional variants by modifying the structure of a TGF-beta superfamily co-receptor (e.g., endoglin, betaglycan, Cripto-1, Cryptic, Cryptic family protein 1B, CRIM1, CRIM2, BAMBI, BMPER, RGM-A, RGM-B, hemojuvelin, and MuSK) for such purposes as enhancing therapeutic efficacy or stability (e.g., shelf-life and resistance to proteolytic degradation in vivo). Variants can be produced by amino acid substitution, deletion, addition, or combinations thereof. For instance, it is reasonable to expect that an isolated replacement of a leucine with an isoleucine or valine, an aspartate with a glutamate, a threonine with a serine, or a similar replacement of an amino acid with a structurally related amino acid (e.g., conservative mutations) will not have a major effect on the biological activity of the resulting molecule. Conservative replacements are those that take place within a family of amino acids that are related in their side chains. Whether a change in the amino acid sequence of a polypeptide of the disclosure results in a functional homolog can be readily determined by assessing the ability of the variant polypeptide to produce a response in cells in a fashion similar to the wild-type polypeptide, or to bind to one or more TGF-beta superfamily ligands including, for example, BMP2, BMP2/7, BMP3, BMP4, BMP4/7, BMP5, BMP6, BMP7, BMP8a, BMP8b, BMP9, BMP10, GDF3, GDF5, GDF6/BMP13, GDF7, GDF8, GDF9b/BMP15, GDF11/BMP11, GDF15/MIC1, TGF-β1, TGF-β2, TGF-β3, activin A, activin B, activin C, activin E, activin AB, activin AC, activin AE, activin BC, activin BE, nodal, glial cell-derived neurotrophic factor (GDNF), neurturin, artemin, persephin, MIS, and Lefty. In some embodiments, the present disclosure contemplates making functional variants by modifying the structure of the TGF-beta superfamily co-receptor polypeptide for such purposes as enhancing therapeutic efficacy or stability (e.g., increased shelf-life and/or increased resistance to proteolytic degradation).
In certain embodiments, the present disclosure contemplates specific mutations of a TGF-beta superfamily co-receptor polypeptide (e.g., endoglin, betaglycan, Cripto-1, Cryptic, Cryptic family protein 1B, CRIM1, CRIM2, BAMBI, BMPER, RGM-A, RGM-B, MuSK, and hemojuvelin) of the disclosure so as to alter the glycosylation of the polypeptide. Such mutations may be selected so as to introduce or eliminate one or more glycosylation sites, such as O-linked or N-linked glycosylation sites. Asparagine-linked glycosylation recognition sites generally comprise a tripeptide sequence, asparagine-X-threonine or asparagine-X-serine (where “X” is any amino acid) which is specifically recognized by appropriate cellular glycosylation enzymes. The alteration may also be made by the addition of, or substitution by, one or more serine or threonine residues to the sequence of the polypeptide (for O-linked glycosylation sites). A variety of amino acid substitutions or deletions at one or both of the first or third amino acid positions of a glycosylation recognition site (and/or amino acid deletion at the second position) results in non-glycosylation at the modified tripeptide sequence. Another means of increasing the number of carbohydrate moieties on a polypeptide is by chemical or enzymatic coupling of glycosides to the polypeptide. Depending on the coupling mode used, the sugar(s) may be attached to (a) arginine and histidine; (b) free carboxyl groups; (c) free sulfhydryl groups such as those of cysteine; (d) free hydroxyl groups such as those of serine, threonine, or hydroxyproline; (e) aromatic residues such as those of phenylalanine, tyrosine, or tryptophan; or (f) the amide group of glutamine. Removal of one or more carbohydrate moieties present on a polypeptide may be accomplished chemically and/or enzymatically. Chemical deglycosylation may involve, for example, exposure of a polypeptide to the compound trifluoromethanesulfonic acid, or an equivalent compound. This treatment results in the cleavage of most or all sugars except the linking sugar (N-acetylglucosamine or N-acetylgalactosamine), while leaving the amino acid sequence intact. Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the use of a variety of endo- and exo-glycosidases as described by Thotakura et al. [Meth. Enzymol. (1987) 138:350]. The sequence of a polypeptide may be adjusted, as appropriate, depending on the type of expression system used, as mammalian, yeast, insect, and plant cells may all introduce differing glycosylation patterns that can be affected by the amino acid sequence of the peptide. In general, heteromultimers of the disclosure for use in humans may be expressed in a mammalian cell line that provides proper glycosylation, such as HEK293 or CHO cell lines, although other mammalian expression cell lines are expected to be useful as well.
The present disclosure further contemplates a method of generating mutants, particularly sets of combinatorial mutants of a TGF-beta superfamily co-receptor polypeptide (e.g., endoglin, betaglycan, Cripto-1, Cryptic, Cryptic family protein 1B, CRIM1, CRIM2, BAMBI, BMPER, RGM-A, RGM-B, MuSK, and hemojuvelin) of the present disclosure, as well as truncation mutants. Pools of combinatorial mutants are especially useful for identifying functionally active (e.g., ligand binding) TGF-beta superfamily co-receptor sequences. The purpose of screening such combinatorial libraries may be to generate, for example, polypeptides variants which have altered properties, such as altered pharmacokinetic or altered ligand binding. A variety of screening assays are provided below, and such assays may be used to evaluate variants. For example, TGF-beta co-receptor variants may be screened for ability to bind to a TGF-beta superfamily ligand (e.g., BMP2, BMP2/7, BMP3, BMP4, BMP4/7, BMP5, BMP6, BMP7, BMP8a, BMP8b, BMP9, BMP10, GDF3, GDF5, GDF6/BMP13, GDF7, GDF8, GDF9b/BMP15, GDF11/BMP11, GDF15/MIC1, TGF-β1, TGF-β2, TGF-β3, activin A, activin B, activin C, activin E, activin AB, activin AC, activin AE, activin BC, activin BE, nodal, glial cell-derived neurotrophic factor (GDNF), neurturin, artemin, persephin, MIS, and Lefty), to prevent binding of a TGF-beta superfamily ligand to a TGF-beta superfamily co-receptor, and/or to interfere with signaling caused by an TGF-beta superfamily ligand.
The activity of a TGF-beta superfamily heteromultimers of the disclosure also may be tested, for example in a cell-based or in vivo assay. For example, the effect of a heteromultimer on the expression of genes or the activity of proteins involved in muscle production in a muscle cell may be assessed. This may, as needed, be performed in the presence of one or more recombinant TGF-beta superfamily ligand proteins (e.g., BMP2, BMP2/7, BMP3, BMP4, BMP4/7, BMP5, BMP6, BMP7, BMP8a, BMP8b, BMP9, BMP10, GDF3, GDF5, GDF6/BMP13, GDF7, GDF8, GDF9b/BMP15, GDF11/BMP11, GDF15/MIC1, TGF-β1, TGF-β2, TGF-β3, activin A, activin B, activin C, activin E, activin AB, activin AC, activin AE, activin BC, activin BE, nodal, glial cell-derived neurotrophic factor (GDNF), neurturin, artemin, persephin, MIS, and Lefty), and cells may be transfected so as to produce a TGF-beta superfamily heteromultimer, and optionally, a TGF-beta superfamily ligand. Likewise, a heteromultimer of the disclosure may be administered to a mouse or other animal, and one or more measurements, such as muscle formation and strength may be assessed using art-recognized methods. Similarly, the activity of a heteromultimer, or variants thereof, may be tested in osteoblasts, adipocytes, and/or neuronal cells for any effect on growth of these cells, for example, by the assays as described herein and those of common knowledge in the art. A SMAD-responsive reporter gene may be used in such cell lines to monitor effects on downstream signaling.
Combinatorial-derived variants can be generated which have increased selectivity or generally increased potency relative to a reference TGF-beta superfamily heteromultimer. Such variants, when expressed from recombinant DNA constructs, can be used in gene therapy protocols. Likewise, mutagenesis can give rise to variants which have intracellular half-lives dramatically different than the corresponding unmodified TGF-beta superfamily heteromultimer. For example, the altered protein can be rendered either more stable or less stable to proteolytic degradation or other cellular processes which result in destruction, or otherwise inactivation, of an unmodified polypeptide. Such variants, and the genes which encode them, can be utilized to alter polypeptide complex levels by modulating the half-life of the polypeptide. For instance, a short half-life can give rise to more transient biological effects and, when part of an inducible expression system, can allow tighter control of recombinant polypeptide complex levels within the cell. In an Fc fusion protein, mutations may be made in the linker (if any) and/or the Fc portion to alter one or more activities of the TGF-beta superfamily heteromultimer including, for example, immunogenicity, half-life, and solubility.
A combinatorial library may be produced by way of a degenerate library of genes encoding a library of polypeptides which each include at least a portion of potential TGF-beta superfamily or co-receptor polypeptide sequences. For instance, a mixture of synthetic oligonucleotides can be enzymatically ligated into gene sequences such that the degenerate set of potential TGF-beta superfamily co-receptor encoding nucleotide sequences are expressible as individual polypeptides, or alternatively, as a set of larger fusion proteins (e.g., for phage display).
There are many ways by which the library of potential homologs can be generated from a degenerate oligonucleotide sequence. Chemical synthesis of a degenerate gene sequence can be carried out in an automatic DNA synthesizer, and the synthetic genes can then be ligated into an appropriate vector for expression. The synthesis of degenerate oligonucleotides is well known in the art. See, e.g., Narang, S A (1983) Tetrahedron 39:3; Itakura et al. (1981) Recombinant DNA, Proc. 3rd Cleveland Sympos. Macromolecules, ed. AG Walton, Amsterdam: Elsevier pp 273-289; Itakura et al. (1984) Annu. Rev. Biochem. 53:323; Itakura et al. (1984) Science 198:1056; Ike et al. (1983) Nucleic Acid Res. 11:477. Such techniques have been employed in the directed evolution of other proteins. See, e.g., Scott et al., (1990) Science 249:386-390; Roberts et al. (1992) PNAS USA 89:2429-2433; Devlin et al. (1990) Science 249: 404-406; Cwirla et al., (1990) PNAS USA 87: 6378-6382; as well as U.S. Pat. Nos. 5,223,409, 5,198,346, and 5,096,815.
Alternatively, other forms of mutagenesis can be utilized to generate a combinatorial library. For example, heteromultimers of the disclosure can be generated and isolated from a library by screening using, for example, alanine scanning mutagenesis [see, e.g., Ruf et al. (1994) Biochemistry 33:1565-1572; Wang et al. (1994) J. Biol. Chem. 269:3095-3099; Balint et al. (1993) Gene 137:109-118; Grodberg et al. (1993) Eur. J. Biochem. 218:597-601; Nagashima et al. (1993) J. Biol. Chem. 268:2888-2892; Lowman et al. (1991) Biochemistry 30:10832-10838; and Cunningham et al. (1989) Science 244:1081-1085], by linker scanning mutagenesis [see, e.g., Gustin et al. (1993) Virology 193:653-660; and Brown et al. (1992) Mol. Cell Biol. 12:2644-2652; McKnight et al. (1982) Science 232:316], by saturation mutagenesis [see, e.g., Meyers et al., (1986) Science 232:613]; by PCR mutagenesis [see, e.g., Leung et al. (1989) Method Cell Mol Biol 1:11-19]; or by random mutagenesis, including chemical mutagenesis [see, e.g., Miller et al. (1992) A Short Course in Bacterial Genetics, CSHL Press, Cold Spring Harbor, N.Y.; and Greener et al. (1994) Strategies in Mol Biol 7:32-34]. Linker scanning mutagenesis, particularly in a combinatorial setting, is an attractive method for identifying truncated (bioactive) forms of TGF-beta superfamily co-receptor polypeptides.
A wide range of techniques are known in the art for screening gene products of combinatorial libraries made by point mutations and truncations, and, for that matter, for screening cDNA libraries for gene products having a certain property. Such techniques will be generally adaptable for rapid screening of the gene libraries generated by the combinatorial mutagenesis of heteromultimers of the disclosure. The most widely used techniques for screening large gene libraries typically comprise cloning the gene library into replicable expression vectors, transforming appropriate cells with the resulting library of vectors, and expressing the combinatorial genes under conditions in which detection of a desired activity facilitates relatively easy isolation of the vector encoding the gene whose product was detected. Preferred assays include TGF-beta superfamily ligand (e.g., BMP2, BMP2/7, BMP3, BMP4, BMP4/7, BMP5, BMP6, BMP7, BMP8a, BMP8b, BMP9, BMP10, GDF3, GDF5, GDF6/BMP13, GDF7, GDF8, GDF9b/BMP15, GDF11/BMP11, GDF15/MIC1, TGF-β1, TGF-β2, TGF-β3, activin A, activin B, activin C, activin E, activin AB, activin AC, activin AE, activin BC, activin BE, nodal, glial cell-derived neurotrophic factor (GDNF), neurturin, artemin, persephin, MIS, and Lefty) binding assays and/or TGF-beta superfamily ligand-mediated cell signaling assays.
In certain embodiments, heteromultimers of the disclosure may further comprise post-translational modifications in addition to any that are naturally present in the TGF-beta superfamily co-receptor polypeptide. Such modifications include, but are not limited to, acetylation, carboxylation, glycosylation, phosphorylation, lipidation, and acylation. As a result, the heteromultimers may comprise non-amino acid elements, such as polyethylene glycols, lipids, polysaccharide or monosaccharide, and phosphates. Effects of such non-amino acid elements on the functionality of a heteromultimer may be tested as described herein for other heteromultimer variants. When a polypeptide of the disclosure is produced in cells by cleaving a nascent form of the polypeptide, post-translational processing may also be important for correct folding and/or function of the protein. Different cells (e.g., CHO, HeLa, MDCK, 293, WI38, NIH-3T3 or HEK293) have specific cellular machinery and characteristic mechanisms for such post-translational activities and may be chosen to ensure the correct modification and processing of the TGF-beta superfamily co-receptor polypeptides as well as heteromultimers comprising the same.
In certain aspects, the polypeptides disclosed herein may form heteromultimers comprising at least one TGF-beta superfamily co-receptor polypeptide. Preferably, polypeptides disclosed herein form heterodimers, although higher order heteromultimers are also included such as, but not limited to, heterotrimers, heterotetramers, and further oligomeric structures (see, e.g.,
Many methods known in the art can be used to generate heteromultimers of the disclosure. For example, non-naturally occurring disulfide bonds may be constructed by replacing on a first polypeptide a naturally occurring amino acid with a free thiol-containing residue, such as cysteine, such that the free thiol interacts with another free thiol-containing residue on a second polypeptide such that a disulfide bond is formed between the first and second polypeptides. Additional examples of interactions to promote heteromultimer formation include, but are not limited to, ionic interactions such as described in Kjaergaard et al., WO2007147901; electrostatic steering effects such as described in Kannan et al., U.S. Pat. No. 8,592,562; coiled-coil interactions such as described in Christensen et al., U.S.20120302737; leucine zippers such as described in Pack & Plueckthun, (1992) Biochemistry 31: 1579-1584; and helix-turn-helix motifs such as described in Pack et al., (1993) Bio/Technology 11: 1271-1277. Linkage of the various segments may be obtained via, e.g., covalent binding such as by chemical cross-linking, peptide linkers, disulfide bridges, etc., or affinity interactions such as by avidin-biotin or leucine zipper technology.
In certain aspects, a multimerization domain may comprise one component of an interaction pair. In some embodiments, the polypeptides disclosed herein may form protein complexes comprising a first polypeptide covalently or non-covalently associated with a second polypeptide, wherein the first polypeptide comprises the amino acid sequence of a TGF-beta superfamily co-receptor polypeptide and the amino acid sequence of a first member of an interaction pair; and the second polypeptide comprises an amino acid sequence of a second member of an interaction pair. The interaction pair may be any two polypeptide sequences that interact to form a complex, particularly a heterodimeric complex although operative embodiments may also employ an interaction pair that can form a homodimeric complex. One member of the interaction pair may be fused to a TGF-beta superfamily co-receptor polypeptide as described herein, including for example, a polypeptide sequence comprising an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the sequence of any one of SEQ ID NOs: 1, 2, 5, 6, 9, 10, 13, 14, 17, 18, 21, 22, 25, 26, 29, 30, 33, 34, 37, 38, 41, 42, 45, 46, 49, 50, 53, 54, 57, 58, 61, 62, 65, 66, 69, 70, 73, 74, 77, 78, 81, 82, 85, 86, 89, 90, 93, 95, 96, 99, 100, 103, and 104. An interaction pair may be selected to confer an improved property/activity such as increased serum half-life, or to act as an adaptor on to which another moiety is attached to provide an improved property/activity. For example, a polyethylene glycol moiety may be attached to one or both components of an interaction pair to provide an improved property/activity such as improved serum half-life.
The first and second members of the interaction pair may be an asymmetric pair, meaning that the members of the pair preferentially associate with each other rather than self-associate. Accordingly, first and second members of an asymmetric interaction pair may associate to form a heterodimeric complex (see, e.g.,
As specific examples, the present disclosure provides fusion proteins comprising TGF-beta superfamily co-receptor polypeptides fused to a polypeptide comprising a constant domain of an immunoglobulin, such as a CH1, CH2, or CH3 domain of an immunoglobulin or an Fc domain. Fc domains derived from human IgG1, IgG2, IgG3, and IgG4 are provided herein. Other mutations are known that decrease either CDC or ADCC activity, and collectively, any of these variants are included in the disclosure and may be used as advantageous components of a heteromultimers of the disclosure. Optionally, the IgG1 Fc domain of SEQ ID NO: 208 has one or more mutations at residues such as Asp-265, Lys-322, and Asn-434 (numbered in accordance with the corresponding full-length IgG1). In certain cases, the mutant Fc domain having one or more of these mutations (e.g., Asp-265 mutation) has reduced ability of binding to the Fcγ receptor relative to a wildtype Fc domain. In other cases, the mutant Fc domain having one or more of these mutations (e.g., Asn-434 mutation) has increased ability of binding to the MHC class I-related Fc-receptor (FcRN) relative to a wildtype Fc domain.
An example of a native amino acid sequence that may be used for the Fc portion of human IgG1 (G1Fc) is shown below (SEQ ID NO: 208). Dotted underline indicates the hinge region, and solid underline indicates positions with naturally occurring variants. In part, the disclosure provides polypeptides comprising, consisting of, or consisting essentially of an amino acid sequence with 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identity to SEQ ID NO: 208. Naturally occurring variants in G1Fc would include E134D and M136L according to the numbering system used in SEQ ID NO: 208 (see Uniprot P01857).
An example of a native amino acid sequence that may be used for the Fc portion of human IgG2 (G2Fc) is shown below (SEQ ID NO: 209). Dotted underline indicates the hinge region and double underline indicates positions where there are data base conflicts in the sequence (according to UniProt P01859). In part, the disclosure provides polypeptides comprising, consisting of, or consisting essentially of an amino acid sequence with 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identity to SEQ ID NO: 209.
Two examples of amino acid sequences that may be used for the Fc portion of human IgG3 (G3Fc) are shown below. The hinge region in G3Fc can be up to four times as long as in other Fc chains and contains three identical 15-residue segments preceded by a similar 17-residue segment. The first G3Fc sequence shown below (SEQ ID NO: 210) contains a short hinge region consisting of a single 15-residue segment, whereas the second G3Fc sequence (SEQ ID NO: 211) contains a full-length hinge region. In each case, dotted underline indicates the hinge region, and solid underline indicates positions with naturally occurring variants according to UniProt P01859. In part, the disclosure provides polypeptides comprising, consisting of, or consisting essentially of an amino acid sequence with 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identity to SEQ ID NOs: 210 and 211.
Naturally occurring variants in G3Fc (for example, see Uniprot P01860) include E68Q, P76L, E79Q, Y81F, D97N, N100D, T124A, S169N, S169del, F221Y when converted to the numbering system used in SEQ ID NO: 210, and the present disclosure provides fusion proteins comprising G3Fc domains containing one or more of these variations. In addition, the human immunoglobulin IgG3 gene (IGHG3) shows a structural polymorphism characterized by different hinge lengths [see Uniprot P01859]. Specifically, variant WIS is lacking most of the V region and all of the CH1 region. It has an extra interchain disulfide bond at position 7 in addition to the 11 normally present in the hinge region. Variant ZUC lacks most of the V region, all of the CH1 region, and part of the hinge. Variant OMM may represent an allelic form or another gamma chain subclass. The present disclosure provides additional fusion proteins comprising G3Fc domains containing one or more of these variants.
An example of a native amino acid sequence that may be used for the Fc portion of human IgG4 (G4Fc) is shown below (SEQ ID NO: 212). Dotted underline indicates the hinge region. In part, the disclosure provides polypeptides comprising, consisting of, or consisting essentially of an amino acid sequence with 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identity to SEQ ID NO: 212.
A variety of engineered mutations in the Fc domain are presented herein with respect to the G1Fc sequence (SEQ ID NO: 208), and analogous mutations in G2Fc, G3Fc, and G4Fc can be derived from their alignment with G1Fc in
A problem that arises in large-scale production of asymmetric immunoglobulin-based proteins from a single cell line is known as the “chain association issue”. As confronted prominently in the production of bispecific antibodies, the chain-association issue concerns 12′7 the challenge of efficiently producing a desired multichain protein from among the multiple combinations that inherently result when different heavy chains and/or light chains are produced in a single cell line [see, for example, Klein et al (2012) mAbs 4:653-663]. This problem is most acute when two different heavy chains and two different light chains are produced in the same cell, in which case there are a total of 16 possible chain combinations (although some of these are identical) when only one is typically desired. Nevertheless, the same principle accounts for diminished yield of a desired multichain fusion protein that incorporates only two different (asymmetric) heavy chains.
Various methods are known in the art that increase desired pairing of Fc-containing fusion polypeptide chains in a single cell line to produce a preferred asymmetric fusion protein at acceptable yields [see, for example, Klein et al (2012) mAbs 4:653-663; and Spiess et al (2015) Molecular Immunology 67(2A): 95-106]. Methods to obtain desired pairing of Fc-containing chains include, but are not limited to, charge-based pairing (electrostatic steering), “knobs-into-holes” steric pairing, SEEDbody pairing, and leucine zipper-based pairing. See, for example, Ridgway et al (1996) Protein Eng 9:617-621; Merchant et al (1998) Nat Biotech 16:677-681; Davis et al (2010) Protein Eng Des Sel 23:195-202; Gunasekaran et al (2010); 285:19637-19646; Wranik et al (2012) J Biol Chem 287:43331-43339; U.S. Pat. No. 5,932,448; WO 1993/011162; WO 2009/089004, and WO 2011/034605. As described herein, these methods may be used to generate heterodimers comprising a TGF-beta superfamily co-receptor. See
For example, one means by which interaction between specific polypeptides may be promoted is by engineering protuberance-into-cavity (knob-into-holes) complementary regions such as described in Arathoon et al., U.S. Pat. No. 7,183,076 and Carter et al., U.S. Pat. No. 5,731,168. “Protuberances” are constructed by replacing small amino acid side chains from the interface of the first polypeptide (e.g., a first interaction pair) with larger side chains (e.g., tyrosine or tryptophan). Complementary “cavities” of identical or similar size to the protuberances are optionally created on the interface of the second polypeptide (e.g., a second interaction pair) by replacing large amino acid side chains with smaller ones (e.g., alanine or threonine). Where a suitably positioned and dimensioned protuberance or cavity exists at the interface of either the first or second polypeptide, it is only necessary to engineer a corresponding cavity or protuberance, respectively, at the adjacent interface.
At neutral pH (7.0), aspartic acid and glutamic acid are negatively charged and lysine, arginine, and histidine are positively charged. These charged residues can be used to promote heterodimer formation and at the same time hinder homodimer formation. Attractive interactions take place between opposite charges and repulsive interactions occur between like charges. In part, protein complexes disclosed herein make use of the attractive interactions for promoting heteromultimer formation (e.g., heterodimer formation), and optionally repulsive interactions for hindering homodimer formation (e.g., homodimer formation) by carrying out site directed mutagenesis of charged interface residues.
For example, the IgG1 CH3 domain interface comprises four unique charge residue pairs involved in domain-domain interactions: Asp356-Lys439′, Glu357-Lys370′, Lys392-Asp399′, and Asp399-Lys409′ [residue numbering in the second chain is indicated by (′)]. It should be noted that the numbering scheme used here to designate residues in the IgG1 CH3 domain conforms to the EU numbering scheme of Kabat. Due to the 2-fold symmetry present in the CH3-CH3 domain interactions, each unique interaction will represented twice in the structure (e.g., Asp-399-Lys409′ and Lys409-Asp399′). In the wild-type sequence, K409-D399′ favors both heterodimer and homodimer formation. A single mutation switching the charge polarity (e.g., K409E; positive to negative charge) in the first chain leads to unfavorable interactions for the formation of the first chain homodimer. The unfavorable interactions arise due to the repulsive interactions occurring between the same charges (negative-negative; K409E-D399′ and D399-K409E′). A similar mutation switching the charge polarity (D399K′; negative to positive) in the second chain leads to unfavorable interactions (K409′-D399K′ and D399K-K409′) for the second chain homodimer formation. But, at the same time, these two mutations (K409E and D399K′) lead to favorable interactions (K409E-D399K′ and D399-K409′) for the heterodimer formation.
The electrostatic steering effect on heterodimer formation and homodimer discouragement can be further enhanced by mutation of additional charge residues which may or may not be paired with an oppositely charged residue in the second chain including, for example, Arg355 and Lys360. The table below lists possible charge change mutations that can be used, alone or in combination, to enhance heteromultimer formation of the heteromultimers disclosed herein.
In some embodiments, one or more residues that make up the CH3-CH3 interface in a fusion protein of the instant application are replaced with a charged amino acid such that the interaction becomes electrostatically unfavorable. For example, a positive-charged amino acid in the interface (e.g., a lysine, arginine, or histidine) is replaced with a negatively charged amino acid (e.g., aspartic acid or glutamic acid). Alternatively, or in combination with the forgoing substitution, a negative-charged amino acid in the interface is replaced with a positive-charged amino acid. In certain embodiments, the amino acid is replaced with a non-naturally occurring amino acid having the desired charge characteristic. It should be noted that mutating negatively charged residues (Asp or Glu) to His will lead to increase in side chain volume, which may cause steric issues. Furthermore, His proton donor- and acceptor-form depends on the localized environment. These issues should be taken into consideration with the design strategy. Because the interface residues are highly conserved in human and mouse IgG subclasses, electrostatic steering effects disclosed herein can be applied to human and mouse IgG1, IgG2, IgG3, and IgG4. This strategy can also be extended to modifying uncharged residues to charged residues at the CH3 domain interface.
In part, the disclosure provides desired pairing of asymmetric Fc-containing polypeptide chains using Fc sequences engineered to be complementary on the basis of charge pairing (electrostatic steering). One of a pair of Fc sequences with electrostatic complementarity can be arbitrarily fused to the co-receptor polypeptide of the construct, with or without an optional linker, to generate a TGF-beta superfamily co-receptor receptor fusion polypeptide. This single chain can be coexpressed in a cell of choice along with the Fc sequence complementary to the first Fe to favor generation of the desired multichain construct (e.g., a TGF-beta superfamily heteromultimer). In this example based on electrostatic steering, SEQ ID NO: 200 [human G1Fc(E134K/D177K)] and SEQ ID NO: 201 [human G1Fc(K170D/K187D)] are examples of complementary Fc sequences in which the engineered amino acid substitutions are double underlined, and the TGF-beta superfamily co-receptor polypeptide of the construct can be fused to either SEQ ID NO: 200 or SEQ ID NO: 201, but not both. Given the high degree of amino acid sequence identity between native hG1Fc, native hG2Fc, native hG3Fc, and native hG4Fc, it can be appreciated that amino acid substitutions at corresponding positions in hG2Fc, hG3Fc, or hG4Fc (see
In part, the disclosure provides desired pairing of asymmetric Fc-containing polypeptide chains using Fc sequences engineered for steric complementarity. In part, the disclosure provides knobs-into-holes pairing as an example of steric complementarity. One of a pair of Fc sequences with steric complementarity can be arbitrarily fused to the TGF-beta superfamily co-receptor polypeptide of the construct, with or without an optional linker, to generate a TGF-beta superfamily co-receptor fusion polypeptide. This single chain can be coexpressed in a cell of choice along with the Fc sequence complementary to the first Fc to favor generation of the desired multichain construct. In this example based on knobs-into-holes pairing, SEQ ID NO: 202 [human G1Fc(T144Y)] and SEQ ID NO: 203 [human G1Fc(Y185T)] are examples of complementary Fc sequences in which the engineered amino acid substitutions are double underlined, and the co-receptor polypeptide of the construct can be fused to either SEQ ID NO: 202 or SEQ ID NO: 203, but not both. Given the high degree of amino acid sequence identity between native hG1Fc, native hG2Fc, native hG3Fc, and native hG4Fc, it can be appreciated that amino acid substitutions at corresponding positions in hG2Fc, hG3Fc, or hG4Fc (see
An example of Fc complementarity based on knobs-into-holes pairing combined with an engineered disulfide bond is disclosed in SEQ ID NO: 204 [hG1Fc(S132C/T144W)] and SEQ ID NO: 205 [hGlFc(Y127C/T144S/L146A/Y185V)]. The engineered amino acid substitutions in these sequences are double underlined, and the TGF-beta superfamily co-receptor of the construct can be fused to either SEQ ID NO: 204 or SEQ ID NO: 205, but not both. Given the high degree of amino acid sequence identity between native hG1Fc, native hG2Fc, native hG3Fc, and native hG4Fc, it can be appreciated that amino acid substitutions at corresponding positions in hG2Fc, hG3Fc, or hG4Fc (see
In part, the disclosure provides desired pairing of asymmetric Fc-containing polypeptide chains using Fc sequences engineered to generate interdigitating β-strand segments of human IgG and IgA CH3 domains. Such methods include the use of strand-exchange engineered domain (SEED) CH3 heterodimers allowing the formation of SEEDbody fusion proteins [see, for example, Davis et al (2010) Protein Eng Design Sel 23:195-202]. One of a pair of Fc sequences with SEEDbody complementarity can be arbitrarily fused to the TGF-beta superfamily type co-receptor polypeptide of the construct, with or without an optional linker, to generate a TGF-beta superfamily fusion polypeptide. This single chain can be coexpressed in a cell of choice along with the Fc sequence complementary to the first Fc to favor generation of the desired multichain construct. In this example based on SEEDbody (Sb) pairing, SEQ ID NO: 206 [hG1Fc(SbAG)] and SEQ ID NO: 207 [hG1Fc(SbGA)] are examples of complementary IgG Fc sequences in which the engineered amino acid substitutions from IgA Fc are double underlined, and the TGF-beta superfamily co-receptor polypeptide of the construct can be fused to either SEQ ID NO: 206 or SEQ ID NO: 207, but not both. Given the high degree of amino acid sequence identity between native hG1Fc, native hG2Fc, native hG3Fc, and native hG4Fc, it can be appreciated that amino acid substitutions at corresponding positions in hG1Fc, hG2Fc, hG3Fc, or hG4Fc (see
In part, the disclosure provides desired pairing of asymmetric Fc-containing polypeptide chains with a cleavable leucine zipper domain attached at the C-terminus of the Fc CH3 domains. Attachment of a leucine zipper is sufficient to cause preferential assembly of heterodimeric antibody heavy chains. See, e.g., Wranik et al (2012) J Biol Chem 287:43331-43339. As disclosed herein, one of a pair of Fc sequences attached to a leucine zipper-forming strand can be arbitrarily fused to the TGF-beta superfamily co-receptor polypeptide of the construct, with or without an optional linker, to generate a TGF-beta superfamily fusion polypeptide. This single chain can be coexpressed in a cell of choice along with the Fe sequence attached to a complementary leucine zipper-forming strand to favor generation of the desired multichain construct. Proteolytic digestion of the construct with the bacterial endoproteinase Lys-C post purification can release the leucine zipper domain, resulting in an Fc construct whose structure is identical to that of native Fc. In this example based on leucine zipper pairing, SEQ ID NO: 213 [hG1Fc-Ap1 (acidic)] and SEQ ID NO: 214 [hG1Fc-Bp1 (basic)] are examples of complementary IgG Fc sequences in which the engineered complimentary leucine zipper sequences are underlined, and the co-receptor polypeptide of the construct can be fused to either SEQ ID NO: 213 or SEQ ID NO: 214, but not both. Given the high degree of amino acid sequence identity between native hG1Fc, native hG2Fc, native hG3Fc, and native hG4Fc, it can be appreciated that leucine zipper-forming sequences attached, with or without an optional linker, to hG1Fc, hG2Fc, hG3Fc, or hG4Fc will generate an Fc monomer which may be used in the complementary leucine zipper-forming pair below (SEQ ID NOs: 213 and 214).
ALEKELAQGA T
ALKKKLAQGA T
In part, the disclosure provides desired pairing of asymmetric Fc-containing polypeptide chains by methods described above in combination with additional mutations in the Fc domain which facilitate purification of the desired heteromeric species. An example is complementarity of Fc domains based on knobs-into-holes pairing combined with an engineered disulfide bond, as disclosed in SEQ ID NOs: 204-205, plus additional substitution of two negatively charged amino acids (aspartic acid or glutamic acid) in one Fc-containing polypeptide chain and two positively charged amino acids (e.g., arginine) in the complementary Fc-containing polypeptide chain (SEQ ID NOs: 215-216). These four amino acid substitutions facilitate selective purification of the desired heteromeric fusion protein from a heterogeneous polypeptide mixture based on differences in isoelectric point or net molecular charge. The engineered amino acid substitutions in these sequences are double underlined below, and the TGFβ superfamily type I receptor polypeptide, type II receptor polypeptide, or co-receptor polypeptide of the construct can be fused to either SEQ ID NO: 215 or SEQ ID NO: 216, but not both. Given the high degree of amino acid sequence identity between native hG1Fc, native hG2Fc, native hG3Fc, and native hG4Fc, it can be appreciated that amino acid substitutions at corresponding positions in hG2Fc, hG3Fc, or hG4Fc (see
Another example involves complementarity of Fc domains based on knobs-into-holes pairing combined with an engineered disulfide bond, as disclosed in SEQ ID NOs: 204-205, plus a histidine-to-arginine substitution at position 213 in one Fc-containing polypeptide chain (SEQ ID NO: 217). This substitution (denoted H435R in the numbering system of Kabat et al.) facilitates separation of desired heteromer from undesirable homodimer based on differences in affinity for protein A. The engineered amino acid substitution is indicated by double underline, and the TGFβ superfamily co-receptor polypeptide of the construct can be fused to either SEQ ID NO: 217 or SEQ ID NO: 205, but not both. Given the high degree of amino acid sequence identity between native hG1Fc, native hG2Fc, native hG3Fc, and native hG4Fc, it can be appreciated that amino acid substitutions at corresponding positions in hG2Fc, hG3Fc, or hG4Fc (see
A variety of engineered mutations in the Fc domain are presented above with respect to the G1Fc sequence (SEQ ID NO: 208). Analogous mutations in G2Fc, G3Fc, and G4Fc can be derived from their alignment with G1Fc in
It is understood that different elements of the fusion proteins (e.g., immunoglobulin Fc fusion proteins) may be arranged in any manner that is consistent with desired functionality. For example, a TGF-beta co-receptor polypeptide domain may be placed C-terminal to a heterologous domain, or alternatively, a heterologous domain may be placed C-terminal to a TGF-beta superfamily co-receptor polypeptide domain. The TGF-beta superfamily co-receptor domain and the heterologous domain need not be adjacent in a fusion protein, and additional domains or amino acid sequences may be included C- or N-terminal to either domain or between the domains.
For example, a TGF-beta superfamily co-receptor fusion protein may comprise an amino acid sequence as set forth in the formula A-B-C. The B portion corresponds to a TGF-beta superfamily co-receptor polypeptide domain. The A and C portions may be independently zero, one, or more than one amino acid, and both the A and C portions when present are heterologous to B. The A and/or C portions may be attached to the B portion via a linker sequence. A linker may be rich in glycine (e.g., 2-10, 2-5, 2-4, 2-3 glycine residues) or glycine and proline residues and may, for example, contain a single sequence of threonine/serine and glycines or repeating sequences of threonine/serine and/or glycines, e.g., GGG (SEQ ID NO: 158), GGGG (SEQ ID NO: 159), TGGGG (SEQ ID NO: 160), SGGGG (SEQ ID NO: 161), TGGG (SEQ ID NO: 162), or SGGG (SEQ ID NO: 163) singlets, or repeats. In certain embodiments, a TGF-beta superfamily co-receptor fusion protein comprises an amino acid sequence as set forth in the formula A-B-C, wherein A is a leader (signal) sequence, B consists of a TGF-beta superfamily co-receptor polypeptide domain, and C is a polypeptide portion that enhances one or more of in vivo stability, in vivo half-life, uptake/administration, tissue localization or distribution, formation of protein complexes, and/or purification. In certain embodiments, a TGF-beta superfamily co-receptor fusion protein comprises an amino acid sequence as set forth in the formula A-B-C, wherein A is a TPA leader sequence, B consists of a TGF-beta superfamily co-receptor polypeptide domain, and C is an immunoglobulin Fc domain. Preferred fusion proteins comprise the amino acid sequence set forth in any one of SEQ ID NOs: 500, 501, 504, 505, and 508-555.
In some embodiments, heteromultimers of the present disclosure further comprise one or more heterologous portions (domains) so as to confer a desired property. For example, some fusion domains are particularly useful for isolation of the fusion proteins by affinity chromatography. Well-known examples of such fusion domains include, but are not limited to, polyhistidine, Glu-Glu, glutathione S-transferase (GST), thioredoxin, protein A, protein G, an immunoglobulin heavy-chain constant region (Fc), maltose binding protein (MBP), or human serum albumin. For the purpose of affinity purification, relevant matrices for affinity chromatography, such as glutathione-, amylase-, and nickel- or cobalt-conjugated resins are used. Many of such matrices are available in “kit” form, such as the Pharmacia GST purification system and the QIAexpress™ system (Qiagen) useful with (HIS6) fusion partners. As another example, a fusion domain may be selected so as to facilitate detection of the ligand trap polypeptides. Examples of such detection domains include the various fluorescent proteins (e.g., GFP) as well as “epitope tags,” which are usually short peptide sequences for which a specific antibody is available. Well-known epitope tags for which specific monoclonal antibodies are readily available include FLAG, influenza virus haemagglutinin (HA), and c-myc tags. In some cases, the fusion domains have a protease cleavage site, such as for factor Xa or thrombin, which allows the relevant protease to partially digest the fusion proteins and thereby liberate the recombinant proteins therefrom. The liberated proteins can then be isolated from the fusion domain by subsequent chromatographic separation.
In certain embodiments, TGF-beta superfamily co-receptor polypeptides of the present disclosure contain one or more modifications that are capable of stabilizing the polypeptides. For example, such modifications enhance the in vitro half-life of the polypeptides, enhance circulatory half-life of the polypeptides, and/or reduce proteolytic degradation of the polypeptides. Such stabilizing modifications include, but are not limited to, fusion proteins (including, for example, fusion proteins comprising a co-receptor polypeptide domain and a stabilizer domain), modifications of a glycosylation site (including, for example, addition of a glycosylation site to a polypeptide of the disclosure), and modifications of carbohydrate moiety (including, for example, removal of carbohydrate moieties from a polypeptide of the disclosure). As used herein, the term “stabilizer domain” not only refers to a fusion domain (e.g., an immunoglobulin Fc domain) as in the case of fusion proteins, but also includes nonproteinaceous modifications such as a carbohydrate moiety, or nonproteinaceous moiety, such as polyethylene glycol.
In preferred embodiments, heteromultimers to be used in accordance with the methods described herein are isolated polypeptide complexes. As used herein, an isolated protein (or protein complex) or polypeptide (or polypeptide complex) is one which has been separated from a component of its natural environment. In some embodiments, a heteromultimer complex of the disclosure is purified to greater than 95%, 96%, 97%, 98%, or 99% purity as determined by, for example, electrophoretic (e.g., SDS-PAGE, isoelectric focusing (IEF), capillary electrophoresis) or chromatographic (e.g., ion exchange or reverse phase HPLC). Methods for assessment of antibody purity are well known in the art [See, e.g., Flatman et al., (2007) J. Chromatogr. B 848:79-87]. In some embodiments, heteromultimer preparations of the disclosure are substantially free of TGF-beta superfamily co-receptor polypeptide homomultimers. For example, in some embodiments, heteromultimer preparations comprise less than about 10%, 9%, 8%, 7%, 5%, 4%, 3%, 2%, or less than 1% of TGF-beta superfamily co-receptor polypeptide homomultimers.
In certain embodiments, TGFβ superfamily co-receptor polypeptides as well as heteromultimer complexes thereof, of the disclosure can be produced by a variety of art-known techniques. For example, polypeptides of the disclosure can be synthesized using standard protein chemistry techniques such as those described in Bodansky, M. Principles of Peptide Synthesis, Springer Verlag, Berlin (1993) and Grant G. A. (ed.), Synthetic Peptides: A User's Guide, W. H. Freeman and Company, New York (1992). In addition, automated peptide synthesizers are commercially available (see, e.g., Advanced ChemTech Model 396; Milligen/Biosearch 9600). Alternatively, the polypeptides and complexes of the disclosure, including fragments or variants thereof, may be recombinantly produced using various expression systems [e.g., E. coli, Chinese Hamster Ovary (CHO) cells, COS cells, baculovirus] as is well known in the art. In a further embodiment, the modified or unmodified polypeptides of the disclosure may be produced by digestion of recombinantly produced full-length TGFβ superfamily co-receptor polypeptides by using, for example, a protease, e.g., trypsin, thermolysin, chymotrypsin, pepsin, or paired basic amino acid converting enzyme (PACE). Computer analysis (using a commercially available software, e.g., MacVector, Omega, PCGene, Molecular Simulation, Inc.) can be used to identify proteolytic cleavage sites.
In certain embodiments, the present disclosure provides isolated and/or recombinant nucleic acids encoding TGF superfamily co-receptors (including fragments, functional variants, and fusion proteins thereof) disclosed herein. For example, SEQ ID NO: 3 encodes a naturally occurring human endoglin isoform 1 precursor polypeptide, while SEQ ID NO: 4 encodes a mature, extracellular domain of endoglin isoform 1. The subject nucleic acids may be single-stranded or double stranded. Such nucleic acids may be DNA or RNA molecules. These nucleic acids may be used, for example, in methods for making TGF-beta superfamily heteromultimers of the present disclosure.
In certain embodiments, nucleic acids encoding TGFβ superfamily-receptor polypeptides of the present disclosure are understood to include nucleic acids of any one of SEQ ID NOs: 3, 4, 7, 8, 11, 12, 15, 16, 19, 20, 23, 24, 27, 28, 31, 32, 35, 36, 39, 40, 43, 44, 47, 48, 51, 52, 55, 56, 59, 60, 63, 64, 67, 68, 71, 72, 75, 76, 79, 80, 83, 84, 87, 88, 91, 92, 94, 97, 98, 101, 102, 105, and 106 as well as variants thereof. Variant nucleotide sequences include sequences that differ by one or more nucleotide substitutions, additions, or deletions including allelic variants, and therefore, will include coding sequences that differ from the nucleotide sequence designated in any one of SEQ ID NOs: 3, 4, 7, 8, 11, 12, 15, 16, 19, 20, 23, 24, 27, 28, 31, 32, 35, 36, 39, 40, 43, 44, 47, 48, 51, 52, 55, 56, 59, 60, 63, 64, 67, 68, 71, 72, 75, 76, 79, 80, 83, 84, 87, 88, 91, 92, 94, 97, 98, 101, 102, 105, and 106.
In certain embodiments, TGFβ superfamily co-receptor polypeptides of the present disclosure are encoded by isolated or recombinant nucleic acid sequences that are at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to SEQ ID NOs: 3, 4, 7, 8, 11, 12, 15, 16, 19, 20, 23, 24, 27, 28, 31, 32, 35, 36, 39, 40, 43, 44, 47, 48, 51, 52, 55, 56, 59, 60, 63, 64, 67, 68, 71, 72, 75, 76, 79, 80, 83, 84, 87, 88, 91, 92, 94, 97, 98, 101, 102, 105, and 106. One of ordinary skill in the art will appreciate that nucleic acid sequences that are at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the sequences complementary to SEQ ID NOs: 3, 4, 7, 8, 11, 12, 15, 16, 19, 20, 23, 24, 27, 28, 31, 32, 35, 36, 39, 40, 43, 44, 47, 48, 51, 52, 55, 56, 59, 60, 63, 64, 67, 68, 71, 72, 75, 76, 79, 80, 83, 84, 87, 88, 91, 92, 94, 97, 98, 101, 102, 105, and 106 are also within the scope of the present disclosure. In further embodiments, the nucleic acid sequences of the disclosure can be isolated, recombinant, and/or fused with a heterologous nucleotide sequence or in a DNA library.
In other embodiments, nucleic acids of the present disclosure also include nucleotide sequences that hybridize under highly stringent conditions to the nucleotide sequence designated in SEQ ID NOs: 3, 4, 7, 8, 11, 12, 15, 16, 19, 20, 23, 24, 27, 28, 31, 32, 35, 36, 39, 40, 43, 44, 47, 48, 51, 52, 55, 56, 59, 60, 63, 64, 67, 68, 71, 72, 75, 76, 79, 80, 83, 84, 87, 88, 91, 92, 94, 97, 98, 101, 102, 105, and 106, the complement sequence of SEQ ID NOs: 3, 4, 7, 8, 11, 12, 15, 16, 19, 20, 23, 24, 27, 28, 31, 32, 35, 36, 39, 40, 43, 44, 47, 48, 51, 52, 55, 56, 59, 60, 63, 64, 67, 68, 71, 72, 75, 76, 79, 80, 83, 84, 87, 88, 91, 92, 94, 97, 98, 101, 102, 105, and 106, or fragments thereof. One of ordinary skill in the art will understand readily that appropriate stringency conditions which promote DNA hybridization can be varied. For example, one could perform the hybridization at 6.0 x sodium chloride/sodium citrate (SSC) at about 45° C., followed by a wash of 2.0×SSC at 50° C. For example, the salt concentration in the wash step can be selected from a low stringency of about 2.0×SSC at 50° C. to a high stringency of about 0.2×SSC at 50° C. In addition, the temperature in the wash step can be increased from low stringency conditions at room temperature, about 22° C., to high stringency conditions at about 65° C. Both temperature and salt may be varied, or temperature or salt concentration may be held constant while the other variable is changed. In one embodiment, the disclosure provides nucleic acids which hybridize under low stringency conditions of 6×SSC at room temperature followed by a wash at 2×SSC at room temperature.
Isolated nucleic acids which differ from the nucleic acids as set forth in SEQ ID NOs: 3, 4, 7, 8, 11, 12, 15, 16, 19, 20, 23, 24, 27, 28, 31, 32, 35, 36, 39, 40, 43, 44, 47, 48, 51, 52, 55, 56, 59, 60, 63, 64, 67, 68, 71, 72, 75, 76, 79, 80, 83, 84, 87, 88, 91, 92, 94, 97, 98, 101, 102, 105, and 106 due to degeneracy in the genetic code are also within the scope of the disclosure. For example, a number of amino acids are designated by more than one triplet. Codons that specify the same amino acid, or synonyms (for example, CAU and CAC are synonyms for histidine) may result in “silent” mutations which do not affect the amino acid sequence of the protein. However, it is expected that DNA sequence polymorphisms that do lead to changes in the amino acid sequences of the subject proteins will exist among mammalian cells. One skilled in the art will appreciate that these variations in one or more nucleotides (up to about 3-5% of the nucleotides) of the nucleic acids encoding a particular protein may exist among individuals of a given species due to natural allelic variation. Any and all such nucleotide variations and resulting amino acid polymorphisms are within the scope of this disclosure.
In certain embodiments, the recombinant nucleic acids of the present disclosure may be operably linked to one or more regulatory nucleotide sequences in an expression construct. Regulatory nucleotide sequences will generally be appropriate to the host cell used for expression. Numerous types of appropriate expression vectors and suitable regulatory sequences are known in the art for a variety of host cells. Typically, said one or more regulatory nucleotide sequences may include, but are not limited to, promoter sequences, leader or signal sequences, ribosomal binding sites, transcriptional start and termination sequences, translational start and termination sequences, and enhancer or activator sequences. Constitutive or inducible promoters as known in the art are contemplated by the disclosure. The promoters may be either naturally occurring promoters, or hybrid promoters that combine elements of more than one promoter. An expression construct may be present in a cell on an episome, such as a plasmid, or the expression construct may be inserted in a chromosome. In some embodiments, the expression vector contains a selectable marker gene to allow the selection of transformed host cells. Selectable marker genes are well known in the art and will vary with the host cell used.
In certain aspects of the present disclosure, the subject nucleic acid is provided in an expression vector comprising a nucleotide sequence encoding a TGFβ superfamily co-receptor polypeptide and operably linked to at least one regulatory sequence. Regulatory sequences are art-recognized and are selected to direct expression of the TGFβ superfamily co-receptor polypeptide. Accordingly, the term regulatory sequence includes promoters, enhancers, and other expression control elements. Exemplary regulatory sequences are described in Goeddel; Gene Expression Technology: Methods in Enzymology, Academic Press, San Diego, Calif. (1990). For instance, any of a wide variety of expression control sequences that control the expression of a DNA sequence when operatively linked to it may be used in these vectors to express DNA sequences encoding a TGFβ superfamily co-receptor polypeptide. Such useful expression control sequences, include, for example, the early and late promoters of SV40, tet promoter, adenovirus or cytomegalovirus immediate early promoter, RSV promoters, the lac system, the trp system, the TAC or TRC system, T7 promoter whose expression is directed by T7 RNA polymerase, the major operator and promoter regions of phage lambda, the control regions for fd coat protein, the promoter for 3-phosphoglycerate kinase or other glycolytic enzymes, the promoters of acid phosphatase, e.g., Pho5, the promoters of the yeast α-mating factors, the polyhedron promoter of the baculovirus system and other sequences known to control the expression of genes of prokaryotic or eukaryotic cells or their viruses, and various combinations thereof. It should be understood that the design of the expression vector may depend on such factors as the choice of the host cell to be transformed and/or the type of protein desired to be expressed. Moreover, the vector's copy number, the ability to control that copy number and the expression of any other protein encoded by the vector, such as antibiotic markers, should also be considered.
A recombinant nucleic acid of the present disclosure can be produced by ligating the cloned gene, or a portion thereof, into a vector suitable for expression in either prokaryotic cells, eukaryotic cells (yeast, avian, insect or mammalian), or both. Expression vehicles for production of a recombinant TGFβ superfamily co-receptor polypeptide include plasmids and other vectors. For instance, suitable vectors include plasmids of the following types: pBR322-derived plasmids, pEMBL-derived plasmids, pEX-derived plasmids, pBTac-derived plasmids and pUC-derived plasmids for expression in prokaryotic cells, such as E. coli.
Some mammalian expression vectors contain both prokaryotic sequences to facilitate the propagation of the vector in bacteria, and one or more eukaryotic transcription units that are expressed in eukaryotic cells. The pcDNAI/amp, pcDNAI/neo, pRc/CMV, pSV2gpt, pSV2neo, pSV2-dhfr, pTk2, pRSVneo, pMSG, pSVT7, pko-neo and pHyg derived vectors are examples of mammalian expression vectors suitable for transfection of eukaryotic cells. Some of these vectors are modified with sequences from bacterial plasmids, such as pBR322, to facilitate replication and drug resistance selection in both prokaryotic and eukaryotic cells. Alternatively, derivatives of viruses such as the bovine papilloma virus (BPV-1), or Epstein-Barr virus (pHEBo, pREP-derived and p205) can be used for transient expression of proteins in eukaryotic cells. Examples of other viral (including retroviral) expression systems can be found below in the description of gene therapy delivery systems. The various methods employed in the preparation of the plasmids and in transformation of host organisms are well known in the art. For other suitable expression systems for both prokaryotic and eukaryotic cells, as well as general recombinant procedures, see, e.g., Molecular Cloning A Laboratory Manual, 3rd Ed., ed. by Sambrook, Fritsch and Maniatis (Cold Spring Harbor Laboratory Press, 2001). In some instances, it may be desirable to express the recombinant polypeptides by the use of a baculovirus expression system. Examples of such baculovirus expression systems include pVL-derived vectors (such as pVL1392, pVL1393 and pVL941), pAcUW-derived vectors (such as pAcUW1), and pBlueBac-derived vectors (such as the ß-gal containing pBlueBac III).
In a preferred embodiment, a vector will be designed for production of the subject TGFβ superfamily co-receptor polypeptides in CHO cells, such as a Pcmv-Script vector (Stratagene, La Jolla, Calif.), pcDNA4 vectors (Invitrogen, Carlsbad, Calif.) and pCI-neo vectors (Promega, Madison, Wis.). As will be apparent, the subject gene constructs can be used to cause expression of the subject TGFβ superfamily co-receptor polypeptides in cells propagated in culture, e.g., to produce proteins, including fusion proteins or variant proteins, for purification.
This disclosure also pertains to a host cell transfected with a recombinant gene including a coding sequence for one or more of the subject TGF superfamily co-receptor polypeptides. The host cell may be any prokaryotic or eukaryotic cell. For example, a TGFβ superfamily co-receptor polypeptide of the disclosure may be expressed in bacterial cells such as E. coli, insect cells (e.g., using a baculovirus expression system), yeast, or mammalian cells [e.g. a Chinese hamster ovary (CHO) cell line]. Other suitable host cells are known to those skilled in the art.
Accordingly, the present disclosure further pertains to methods of producing the subject TGFβ superfamily co-receptor polypeptides. For example, a host cell transfected with an expression vector encoding a TGF superfamily co-receptor polypeptide can be cultured under appropriate conditions to allow expression of the TGFβ superfamily co-receptor polypeptide to occur. The polypeptide may be secreted and isolated from a mixture of cells and medium containing the polypeptide. Alternatively, the TGFβ superfamily co-receptor polypeptide may be isolated from a cytoplasmic or membrane fraction obtained from harvested and lysed cells. A cell culture includes host cells, media and other byproducts. Suitable media for cell culture are well known in the art. The subject polypeptides can be isolated from cell culture medium, host cells, or both, using techniques known in the art for purifying proteins, including ion-exchange chromatography, gel filtration chromatography, ultrafiltration, electrophoresis, immunoaffinity purification with antibodies specific for particular epitopes of the TGFβ superfamily co-receptor polypeptides and affinity purification with an agent that binds to a domain fused to TGFβ superfamily co-receptor polypeptides (e.g., a protein A column may be used to purify a TGFβ superfamily co-receptor-Fc fusion protein). In some embodiments, the TGFβ superfamily co-receptor polypeptide is a fusion protein containing a domain which facilitates its purification.
In some embodiments, purification is achieved by a series of column chromatography steps, including, for example, three or more of the following, in any order: protein A chromatography, Q sepharose chromatography, phenylsepharose chromatography, size exclusion chromatography, and cation exchange chromatography. The purification could be completed with viral filtration and buffer exchange. A TGFβ superfamily co-receptor-Fc fusion protein may be purified to a purity of >90%, >95%, >96%, >98%, or >99% as determined by size exclusion chromatography and >90%, >95%, >96%, >98%, or >99% as determined by SDS PAGE. The target level of purity should be one that is sufficient to achieve desirable results in mammalian systems, particularly non-human primates, rodents (mice), and humans.
In another embodiment, a fusion gene coding for a purification leader sequence, such as a poly-(His)/enterokinase cleavage site sequence at the N-terminus of the desired portion of the recombinant TGFβ co-receptor polypeptide, can allow purification of the expressed fusion protein by affinity chromatography using a Ni2+ metal resin. The purification leader sequence can then be subsequently removed by treatment with enterokinase to provide the purified TGFβ superfamily co-receptor polypeptide. See, e.g., Hochuli et al. (1987) J Chromatography 411:177; and Janknecht et al. (1991) PNAS USA 88:8972.
Techniques for making fusion genes are well known. Essentially, the joining of various DNA fragments coding for different polypeptide sequences is performed in accordance with conventional techniques, employing blunt-ended or stagger-ended termini for ligation, restriction enzyme digestion to provide for appropriate termini, filling-in of cohesive ends as appropriate, alkaline phosphatase treatment to avoid undesirable joining, and enzymatic ligation. In another embodiment, the fusion gene can be synthesized by conventional techniques including automated DNA synthesizers. Alternatively, PCR amplification of gene fragments can be carried out using anchor primers which give rise to complementary overhangs between two consecutive gene fragments which can subsequently be annealed to generate a chimeric gene sequence. See, e.g., Current Protocols in Molecular Biology, eds. Ausubel et al., John Wiley & Sons: 1992.
In certain aspects, the present disclosure relates to the use of TGFβ superfamily co-receptor heteromultimers which are agonists or antagonists of TGFβ superfamily receptors. Compounds identified through this screening can be tested to assess their ability to modulate tissues such as bone, cartilage, muscle, fat, and/or neurons, to assess their ability to modulate tissue growth in vivo or in vitro. These compounds can be tested, for example, in animal models.
There are numerous approaches to screening for therapeutic agents for modulating tissue growth by targeting TGFβ superfamily ligand signaling (e.g., SMAD signaling). In certain embodiments, high-throughput screening of compounds can be carried out to identify agents that perturb TGFβ superfamily receptor-mediated effects on a selected cell line. In certain embodiments, the assay is carried out to screen and identify compounds that specifically inhibit or reduce binding of a TGF-beta superfamily co-receptor heteromultimer to its binding partner, such as a TGFβ superfamily ligand (e.g., BMP2, BMP2/7, BMP3, BMP4, BMP4/7, BMP5, BMP6, BMP7, BMP8a, BMP8b, BMP9, BMP10, GDF3, GDF5, GDF6/BMP13, GDF7, GDF8, GDF9b/BMP15, GDF11/BMP11, GDF15/MIC1, TGF-β1, TGF-β2, TGF-β3, activin A, activin B, activin C, activin E, activin AB, activin AC, activin AE, activin BC, activin BE, nodal, glial cell-derived neurotrophic factor (GDNF), neurturin, artemin, persephin, MIS, and Lefty). Alternatively, the assay can be used to identify compounds that enhance binding of a TGF-beta superfamily co-receptor heteromultimer to its binding partner such as a TGFβ superfamily ligand. In a further embodiment, the compounds can be identified by their ability to interact with a TGF-beta superfamily co-receptor heteromultimer of the disclosure.
A variety of assay formats will suffice and, in light of the present disclosure, those not expressly described herein will nevertheless be comprehended by one of ordinary skill in the art. As described herein, the test compounds (agents) of the invention may be created by any combinatorial chemical method. Alternatively, the subject compounds may be naturally occurring biomolecules synthesized in vivo or in vitro. Compounds (agents) to be tested for their ability to act as modulators of tissue growth can be produced, for example, by bacteria, yeast, plants or other organisms (e.g., natural products), produced chemically (e.g., small molecules, including peptidomimetics), or produced recombinantly. Test compounds contemplated by the present invention include non-peptidyl organic molecules, peptides, polypeptides, peptidomimetics, sugars, hormones, and nucleic acid molecules. In certain embodiments, the test agent is a small organic molecule having a molecular weight of less than about 2,000 Daltons.
The test compounds of the disclosure can be provided as single, discrete entities, or provided in libraries of greater complexity, such as made by combinatorial chemistry. These libraries can comprise, for example, alcohols, alkyl halides, amines, amides, esters, aldehydes, ethers and other classes of organic compounds. Presentation of test compounds to the test system can be in either an isolated form or as mixtures of compounds, especially in initial screening steps. Optionally, the compounds may be optionally derivatized with other compounds and have derivatizing groups that facilitate isolation of the compounds. Non-limiting examples of derivatizing groups include biotin, fluorescein, digoxygenin, green fluorescent protein, isotopes, polyhistidine, magnetic beads, glutathione S-transferase (GST), photoactivatible crosslinkers or any combinations thereof.
In many drug-screening programs which test libraries of compounds and natural extracts, high-throughput assays are desirable in order to maximize the number of compounds surveyed in a given period of time. Assays which are performed in cell-free systems, such as may be derived with purified or semi-purified proteins, are often preferred as “primary” screens in that they can be generated to permit rapid development and relatively easy detection of an alteration in a molecular target which is mediated by a test compound. Moreover, the effects of cellular toxicity or bioavailability of the test compound can be generally ignored in the in vitro system, the assay instead being focused primarily on the effect of the drug on the molecular target as may be manifest in an alteration of binding affinity between a TGF-beta superfamily co-receptor heteromultimer and its binding partner (e.g., BMP2, BMP2/7, BMP3, BMP4, BMP4/7, BMP5, BMP6, BMP7, BMP8a, BMP8b, BMP9, BMP10, GDF3, GDF5, GDF6/BMP13, GDF7, GDF8, GDF9b/BMP15, GDF11/BMP11, GDF15/MIC1, TGF-β1, TGF-β2, TGF-β3, activin A, activin B, activin C, activin E, activin AB, activin AC, activin AE, activin BC, activin BE, nodal, glial cell-derived neurotrophic factor (GDNF), neurturin, artemin, persephin, MIS, and Lefty).
Merely to illustrate, in an exemplary screening assay of the present disclosure, the compound of interest is contacted with an isolated and purified TGF-beta superfamily co-receptor heteromultimer which is ordinarily capable of binding to a TGF-beta superfamily ligand, as appropriate for the intention of the assay. To the mixture of the compound and TGF-beta superfamily co-receptor heteromultimer is then added to a composition containing the appropriate TGF-beta superfamily ligand (e.g., BMP2, BMP2/7, BMP3, BMP4, BMP4/7, BMP5, BMP6, BMP7, BMP8a, BMP8b, BMP9, BMP10, GDF3, GDF5, GDF6/BMP13, GDF7, GDF8, GDF9b/BMP15, GDF11/BMP11, GDF15/MIC1, TGF-β1, TGF-β2, TGF-β3, activin A, activin B, activin C, activin E, activin AB, activin AC, activin AE, activin BC, activin BE, nodal, glial cell-derived neurotrophic factor (GDNF), neurturin, artemin, persephin, MIS, and Lefty). Detection and quantification of heteromultimer-superfamily ligand complexes provides a means for determining the compound's efficacy at inhibiting (or potentiating) complex formation between the TGF-beta superfamily co-receptor heteromultimer and its binding protein. The efficacy of the compound can be assessed by generating dose-response curves from data obtained using various concentrations of the test compound. Moreover, a control assay can also be performed to provide a baseline for comparison. For example, in a control assay, isolated and purified TGF-beta superfamily ligand is added to a composition containing the TGF-beta superfamily co-receptor heteromultimer, and the formation of heteromultimer-ligand complex is quantitated in the absence of the test compound. It will be understood that, in general, the order in which the reactants may be admixed can be varied, and can be admixed simultaneously. Moreover, in place of purified proteins, cellular extracts and lysates may be used to render a suitable cell-free assay system.
Binding of a TGF-beta superfamily co-receptor heteromultimer to another protein may be detected by a variety of techniques. For instance, modulation of the formation of complexes can be quantitated using, for example, detectably labeled proteins such as radiolabeled (e.g., 32P, 35 S, 14C or 3H), fluorescently labeled (e.g., FITC), or enzymatically labeled TGF-beta superfamily co-receptor heteromultimer and/or its binding protein, by immunoassay, or by chromatographic detection.
In certain embodiments, the present disclosure contemplates the use of fluorescence polarization assays and fluorescence resonance energy transfer (FRET) assays in measuring, either directly or indirectly, the degree of interaction between a TGF-beta superfamily co-receptor heteromultimer and its binding protein. Further, other modes of detection, such as those based on optical waveguides (see, e.g., PCT Publication WO 96/26432 and U.S. Pat. No. 5,677,196), surface plasmon resonance (SPR), surface charge sensors, and surface force sensors, are compatible with many embodiments of the disclosure.
Moreover, the present disclosure contemplates the use of an interaction trap assay, also known as the “two-hybrid assay,” for identifying agents that disrupt or potentiate interaction between a TGF-beta superfamily heteromultimer and its binding partner. See, e.g., U.S. Pat. No. 5,283,317; Zervos et al. (1993) Cell 72:223-232; Madura et al. (1993) J Biol Chem 268:12046-12054; Bartel et al. (1993) Biotechniques 14:920-924; and Iwabuchi et al. (1993) Oncogene 8:1693-1696). In a specific embodiment, the present disclosure contemplates the use of reverse two-hybrid systems to identify compounds (e.g., small molecules or peptides) that dissociate interactions between a TGF-beta superfamily heteromultimer and its binding protein [see, e.g., Vidal and Legrain, (1999) Nucleic Acids Res 27:919-29; Vidal and Legrain, (1999) Trends Biotechnol 17:374-81; and U.S. Pat. Nos. 5,525,490; 5,955,280; and 5,965,368].
In certain embodiments, the subject compounds are identified by their ability to interact with a TGF-beta superfamily co-receptor heteromultimer of the disclosure. The interaction between the compound and the TGF-beta superfamily co-receptor heteromultimer may be covalent or non-covalent. For example, such interaction can be identified at the protein level using in vitro biochemical methods, including photo-crosslinking, radiolabeled ligand binding, and affinity chromatography. See, e.g., Jakoby W B et al. (1974) Methods in Enzymology 46:1. In certain cases, the compounds may be screened in a mechanism-based assay, such as an assay to detect compounds which bind to a TGF-beta superfamily co-receptor heteromultimer. This may include a solid-phase or fluid-phase binding event. Alternatively, the gene encoding a TGF-beta superfamily co-receptor heteromultimer can be transfected with a reporter system (e.g., β-galactosidase, luciferase, or green fluorescent protein) into a cell and screened against the library preferably by high-throughput screening or with individual members of the library. Other mechanism-based binding assays may be used; for example, binding assays which detect changes in free energy. Binding assays can be performed with the target fixed to a well, bead or chip or captured by an immobilized antibody or resolved by capillary electrophoresis. The bound compounds may be detected usually using colorimetric endpoints or fluorescence or surface plasmon resonance.
In aspects embodiments, a TGF-beta superfamily co-receptor heteromultimer, or combination of TGF-beta superfamily co-receptor heteromultimers, of the present disclosure can be administered to a patient in need thereof, particularly to treat or prevent a TGF-beta superfamily-associated disorder or condition. In some embodiments, the present invention provides methods of treating a disorder or condition in a patient in need thereof by administering to the patient a therapeutically effective amount of a TGF-beta superfamily co-receptor heteromultimer, or combination of TGF-beta superfamily co-receptor heteromultimers, as described herein. In some embodiments, the present invention provides methods of preventing a disorder or condition in a patient in need thereof by administering to the patient a therapeutically effective amount of a TGF-beta superfamily co-receptor heteromultimer, or combination of TGF-beta superfamily co-receptor heteromultimers, as described herein. In some embodiments, the present invention provides methods of delaying the progression or onset a disorder or condition in a patient in need thereof by administering to the patient a therapeutically effective amount of a TGF-beta superfamily co-receptor heteromultimer, or combination of TGF-beta superfamily co-receptor heteromultimers, as described herein. In some embodiments, the present invention provides methods of treating one or more complications of a disorder or condition in a patient in need thereof by administering to the patient a therapeutically effective amount of a TGF-beta superfamily co-receptor heteromultimer, or combination of TGF-beta superfamily co-receptor heteromultimers, as described herein. In some embodiments, the disorder or condition is one or more of anemia, a thalassemia, myelodysplastic syndrome (MDS), sickle cell disease, and a bone-related disorder (e.g., a bone-related disorder associated with one or more of low bone density, low bone strength, and/or low bone growth). In some embodiments, the methods of the disclosure relate to increasing bone growth in a patient in need thereof. In some embodiments, the methods of the disclosure relate to increasing bone strength in a patient in need thereof. In some embodiments, the methods of the disclosure relate to increasing bone density (e.g., bone mineral density) in a patient in need thereof. In some embodiments, the methods of the disclosure relate to increasing red blood cell levels in a patient in need thereof. In some embodiments, the methods of the disclosure relate to increasing hemoglobin levels in a patient in need thereof. Optionally, any of the TGF-beta superfamily co-receptor heteromultimers of the present disclosure can potentially be employed individually or in combination for therapeutic uses disclosed herein. These methods are particularly aimed at therapeutic and prophylactic treatments of mammals including, for example, rodents, primates, and humans.
As used herein, a therapeutic that “prevents” a disorder or condition refers to a compound that, in a statistical sample, reduces the occurrence of the disorder or condition in the treated sample relative to an untreated control sample, or delays the onset or reduces the severity of one or more symptoms of the disorder or condition relative to the untreated control sample. The term “treating” as used herein includes amelioration or elimination of the condition once it has been established. In either case, prevention or treatment may be discerned in the diagnosis provided by a physician or other health care provider and the intended result of administration of the therapeutic agent.
In certain embodiments, a TGF-beta superfamily co-receptor heteromultimer, or combinations of TGF-beta superfamily co-receptor heteromultimers, of the present disclosure may be used in methods of inducing bone and/or cartilage formation, preventing bone loss, increasing bone mineralization, preventing the demineralization of bone, and/or increasing bone density. TGF-beta superfamily co-receptor heteromultimers may be useful in patients who are diagnosed with subclinical low bone density, as a protective measure against the development of osteoporosis.
In some embodiments, a TGF-beta superfamily co-receptor heteromultimer, or combinations of TGF-beta superfamily co-receptor heteromultimers, of the present disclosure may find medical utility in the healing of bone fractures and cartilage defects in humans and other animals. The subject methods and compositions may also have prophylactic use in closed as well as open fracture reduction and also in the improved fixation of artificial joints. De novo bone formation induced by an osteogenic agent is useful for repair of craniofacial defects that are congenital, trauma-induced, or caused by oncologic resection, and is also useful in cosmetic plastic surgery. Further, methods and compositions of the invention may be used in the treatment of periodontal disease and in other tooth repair processes. In certain cases, a TGF-beta superfamily co-receptor heteromultimer, or combinations of TGF-beta superfamily co-receptor heteromultimers, may provide an environment to attract bone-forming cells, stimulate growth of bone-forming cells, or induce differentiation of progenitors of bone-forming cells. TGF-beta superfamily co-receptor heteromultimers of the disclosure may also be useful in the treatment of osteoporosis. Further, TGF-beta superfamily co-receptor heteromultimers may be used in repair of cartilage defects and prevention/reversal of osteoarthritis.
In some embodiments, methods and compositions of the disclosure can be applied to conditions characterized by or causing bone loss, such as osteoporosis (including secondary osteoporosis), hyperparathyroidism, mineral bone disorder, sex hormone deprivation or ablation (e.g. androgen and/or estrogen), glucocorticoid treatment, rheumatoid arthritis, severe burns, hyperparathyroidism, hypercalcemia, hypocalcemia, hypophosphatemia, osteomalacia (including tumor-induced osteomalacia), hyperphosphatemia, vitamin D deficiency, hyperparathyroidism (including familial hyperparathyroidism) and pseudohypoparathyroidism, tumor metastases to bone, bone loss as a consequence of a tumor or chemotherapy, tumors of the bone and bone marrow (e.g., multiple myeloma), ischemic bone disorders, periodontal disease and oral bone loss, Cushing's disease, Paget's disease, thyrotoxicosis, chronic diarrheal state or malabsorption, renal tubular acidosis, or anorexia nervosa. Methods and compositions of the invention may also be applied to conditions characterized by a failure of bone formation or healing, including non-union fractures, fractures that are otherwise slow to heal, fetal and neonatal bone dysplasias (e.g., hypocalcemia, hypercalcemia, calcium receptor defects and vitamin D deficiency), osteonecrosis (including osteonecrosis of the jaw) and osteogenesis imperfecta. Additionally, the anabolic effects will cause such antagonists to diminish bone pain associated with bone damage or erosion. As a consequence of the anti-resorptive effects, such antagonists may be useful to treat disorders of abnormal bone formation, such as osteoblastic tumor metastases (e.g., associated with primary prostate or breast cancer), osteogenic osteosarcoma, osteopetrosis, progressive diaphyseal dysplasia, endosteal hyperostosis, osteopoikilosis, and melorheostosis. Other disorders that may be treated include fibrous dysplasia and chondrodysplasias.
In another specific embodiment, the disclosure provides a therapeutic method and composition for repairing fractures and other conditions related to cartilage and/or bone defects or periodontal diseases. The invention further provides therapeutic methods and compositions for wound healing and tissue repair. The types of wounds include, but are not limited to, burns, incisions and ulcers. See, e.g., PCT Publication No. WO 84/01106. Such compositions comprise a therapeutically effective amount of at least one of the TGF-beta superfamily co-receptor heteromultimers of the disclosure in admixture with a pharmaceutically acceptable vehicle, carrier, or matrix.
In some embodiments, a TGF-beta superfamily co-receptor heteromultimer, or combinations of TGF-beta superfamily co-receptor heteromultimers, of the disclosure can be applied to conditions causing bone loss such as osteoporosis, hyperparathyroidism, Cushing's disease, thyrotoxicosis, chronic diarrheal state or malabsorption, renal tubular acidosis, or anorexia nervosa. It is commonly appreciated that being female, having a low body weight, and leading a sedentary lifestyle are risk factors for osteoporosis (loss of bone mineral density, leading to fracture risk). However, osteoporosis can also result from the long-term use of certain medications. Osteoporosis resulting from drugs or another medical condition is known as secondary osteoporosis. In Cushing's disease, the excess amount of cortisol produced by the body results in osteoporosis and fractures. The most common medications associated with secondary osteoporosis are the corticosteroids, a class of drugs that act like cortisol, a hormone produced naturally by the adrenal glands. Although adequate levels of thyroid hormones are needed for the development of the skeleton, excess thyroid hormone can decrease bone mass over time. Antacids that contain aluminum can lead to bone loss when taken in high doses. Other medications that can cause secondary osteoporosis include phenytoin (Dilantin) and barbiturates that are used to prevent seizures; methotrexate (Rheumatrex, Immunex, Folex PFS), a drug for some forms of arthritis, cancer, and immune disorders; cyclosporine (Sandimmune, Neoral), a drug used to treat some autoimmune diseases and to suppress the immune system in organ transplant patients; luteinizing hormone-releasing hormone agonists (Lupron, Zoladex), used to treat prostate cancer and endometriosis; heparin (Calciparine, Liquaemin), an anticlotting medication; and cholestyramine (Questran) and colestipol (Colestid), used to treat high cholesterol. Bone loss resulting from cancer therapy is widely recognized and termed cancer therapy-induced bone loss (CTIBL). Bone metastases can create cavities in the bone that may be corrected by treatment with a TGF-beta superfamily co-receptor heteromultimer. Bone loss can also be caused by gum disease, a chronic infection in which bacteria located in gum recesses produce toxins and harmful enzymes.
In a further embodiment, the present disclosure provides methods and therapeutic agents for treating diseases or disorders associated with abnormal or unwanted bone growth. For example, patients with the congenital disorder fibrodysplasia ossificans progressiva (FOP) are afflicted by progressive ectopic bone growth in soft tissues spontaneously or in response to tissue trauma, with a major impact on quality of life. Additionally, abnormal bone growth can occur after hip replacement surgery and thus ruin the surgical outcome. This is a more common example of pathological bone growth and a situation in which the subject methods and compositions may be therapeutically useful. The same methods and compositions may also be useful for treating other forms of abnormal bone growth (e.g., pathological growth of bone following trauma, burns or spinal cord injury), and for treating or preventing the undesirable conditions associated with the abnormal bone growth seen in connection with metastatic prostate cancer or osteosarcoma.
In certain embodiments, a TGF-beta superfamily co-receptor heteromultimer, or combinations of TGF-beta superfamily co-receptor heteromultimers, of the disclosure may be used to promote bone formation in patients with cancer. Patients having certain tumors are at high risk for bone loss due to tumor-induced bone loss, bone metastases, and therapeutic agents. Generally, DEXA scans are employed to assess changes in bone density, while indicators of bone remodeling may be used to assess the likelihood of bone metastases. Serum markers may be monitored. Bone specific alkaline phosphatase (BSAP) is an enzyme that is present in osteoblasts. Blood levels of BSAP are increased in patients with bone metastasis and other conditions that result in increased bone remodeling. Osteocalcin and procollagen peptides are also associated with bone formation and bone metastases. Increases in BSAP have been detected in patients with bone metastasis caused by prostate cancer, and to a lesser degree, in bone metastases from breast cancer. BMP7 levels are high in prostate cancer that has metastasized to bone, but not in bone metastases due to bladder, skin, liver, or lung cancer. Type I carboxy-terminal telopeptide (ICTP) is a crosslink found in collagen that is formed during to the resorption of bone. Since bone is constantly being broken down and reformed, ICTP will be found throughout the body. However, at the site of bone metastasis, the level will be significantly higher than in an area of normal bone. ICTP has been found in high levels in bone metastasis due to prostate, lung, and breast cancer. Another collagen crosslink, Type I N-terminal telopeptide (NTx), is produced along with ICTP during bone turnover. The amount of NTx is increased in bone metastasis caused by many different types of cancer including lung, prostate, and breast cancer. Also, the levels of NTx increase with the progression of the bone metastasis. Therefore, this marker can be used to both detect metastasis as well as measure the extent of the disease. Other markers of resorption include pyridinoline and deoxypyridinoline. Any increase in resorption markers or markers of bone metastases indicate the need for therapy with a TGF-beta superfamily co-receptor heteromultimer, or combinations of TGF-beta superfamily co-receptor heteromultimers, in a patient.
A TGF-beta superfamily co-receptor heteromultimer, or combinations of TGF-beta superfamily co-receptor heteromultimers, of the disclosure may be conjointly administered with other bone-active pharmaceutical agents. Conjoint administration may be accomplished by administration of a single co-formulation, by simultaneous administration, or by administration at separate times. TGF-beta superfamily co-receptor heteromultimer complexes may be particularly advantageous if administered with other bone-active agents. A patient may benefit from conjointly receiving a TGF-beta superfamily co-receptor heteromultimer complex and taking calcium supplements, vitamin D, appropriate exercise and/or, in some cases, other medication. Examples of other medications incude, bisphosphonates (alendronate, ibandronate and risedronate), calcitonin, estrogens, parathyroid hormone and raloxifene. The bisphosphonates (alendronate, ibandronate and risedronate), calcitonin, estrogens and raloxifene affect the bone remodeling cycle and are classified as anti-resorptive medications. Bone remodeling consists of two distinct stages: bone resorption and bone formation. Anti-resorptive medications slow or stop the bone-resorbing portion of the bone-remodeling cycle but do not slow the bone-forming portion of the cycle. As a result, new formation continues at a greater rate than bone resorption, and bone density may increase over time. Teriparatide, a form of parathyroid hormone, increases the rate of bone formation in the bone remodeling cycle. Alendronate is approved for both the prevention (5 mg per day or 35 mg once a week) and treatment (10 mg per day or 70 mg once a week) of postmenopausal osteoporosis. Alendronate reduces bone loss, increases bone density and reduces the risk of spine, wrist and hip fractures. Alendronate also is approved for treatment of glucocorticoid-induced osteoporosis in men and women as a result of long-term use of these medications (i.e., prednisone and cortisone) and for the treatment of osteoporosis in men. Alendronate plus vitamin D is approved for the treatment of osteoporosis in postmenopausal women (70 mg once a week plus vitamin D), and for treatment to improve bone mass in men with osteoporosis. Ibandronate is approved for the prevention and treatment of postmenopausal osteoporosis. Taken as a once-a-month pill (150 mg), ibandronate should be taken on the same day each month. Ibandronate reduces bone loss, increases bone density and reduces the risk of spine fractures. Risedronate is approved for the prevention and treatment of postmenopausal osteoporosis. Taken daily (5 mg dose) or weekly (35 mg dose or 35 mg dose with calcium), risedronate slows bone loss, increases bone density and reduces the risk of spine and non-spine fractures. Risedronate also is approved for use by men and women to prevent and/or treat glucocorticoid-induced osteoporosis that results from long-term use of these medications (i.e., prednisone or cortisone). Calcitonin is a naturally occurring hormone involved in calcium regulation and bone metabolism. In women who are more than 5 years beyond menopause, calcitonin slows bone loss, increases spinal bone density, and may relieve the pain associated with bone fractures. Calcitonin reduces the risk of spinal fractures. Calcitonin is available as an injection (50-100 IU daily) or nasal spray (200 IU daily).
A patient may also benefit from conjointly receiving a TGF-beta superfamily co-receptor heteromultimer, or combinations of TGF-beta superfamily co-receptor heteromultimers, and additional bone-active medications. Estrogen therapy (ET)/hormone therapy (HT) is approved for the prevention of osteoporosis. ET has been shown to reduce bone loss, increase bone density in both the spine and hip, and reduce the risk of hip and spinal fractures in postmenopausal women. ET is administered most commonly in the form of a pill or skin patch that delivers a low dose of approximately 0.3 mg daily or a standard dose of approximately 0.625 mg daily and is effective even when started after age 70. When estrogen is taken alone, it can increase a woman's risk of developing cancer of the uterine lining (endometrial cancer). To eliminate this risk, healthcare providers prescribe the hormone progestin in combination with estrogen (hormone replacement therapy or HT) for those women who have an intact uterus. ET/HT relieves menopause symptoms and has been shown to have a beneficial effect on bone health. Side effects may include vaginal bleeding, breast tenderness, mood disturbances and gallbladder disease. Raloxifene, 60 mg a day, is approved for the prevention and treatment of postmenopausal osteoporosis. It is from a class of drugs called Selective Estrogen Receptor Modulators (SERMs) that have been developed to provide the beneficial effects of estrogens without their potential disadvantages. Raloxifene increases bone mass and reduces the risk of spine fractures. Data are not yet available to demonstrate that raloxifene can reduce the risk of hip and other non-spine fractures. Teriparatide, a form of parathyroid hormone, is approved for the treatment of osteoporosis in postmenopausal women and men who are at high risk for a fracture. This medication stimulates new bone formation and significantly increases bone mineral density. In postmenopausal women, fracture reduction was noted in the spine, hip, foot, ribs and wrist. In men, fracture reduction was noted in the spine, but there were insufficient data to evaluate fracture reduction at other sites. Teriparatide is self-administered as a daily injection for up to 24 months.
In certain aspects, a TGF-beta superfamily co-receptor heteromultimer, or combinations of TGF-beta superfamily co-receptor heteromultimers, of the present disclosure can be used to increase red blood cell levels, treat or prevent an anemia, and/or treat or prevent ineffective erythropoiesis in a subject in need thereof. In certain aspects, a TGF-beta superfamily co-receptor heteromultimer, or combinations of TGF-beta superfamily co-receptor heteromultimers, of the present disclosure may be used in combination with conventional therapeutic approaches for increasing red blood cell levels, particularly those used to treat anemias of multifactorial origin. Conventional therapeutic approaches for increasing red blood cell levels include, for example, red blood cell transfusion, administration of one or more EPO receptor activators, hematopoietic stem cell transplantation, immunosuppressive biologics and drugs (e.g., corticosteroids). In certain embodiments, a TGF-beta superfamily co-receptor heteromultimer, or combinations of TGF-beta superfamily co-receptor heteromultimers, of the present disclosure can be used to treat or prevent ineffective erythropoiesis and/or the disorders associated with ineffective erythropoiesis in a subject in need thereof. In certain aspects, a TGF-beta superfamily co-receptor heteromultimer, or combinations of TGF-beta superfamily co-receptor heteromultimers, of the present disclosure can be used in combination with conventional therapeutic approaches for treating or preventing an anemia or ineffective erythropoiesis disorder, particularly those used to treat anemias of multifactorial origin.
In certain embodiments, a TGF-beta superfamily co-receptor heteromultimer, or combinations of TGF-beta superfamily co-receptor heteromultimers, optionally combined with an EPO receptor activator, may be used to increase red blood cell, hemoglobin, or reticulocyte levels in healthy individuals and selected patient populations. Examples of appropriate patient populations include those with undesirably low red blood cell or hemoglobin levels, such as patients having an anemia, and those that are at risk for developing undesirably low red blood cell or hemoglobin levels, such as those patients who are about to undergo major surgery or other procedures that may result in substantial blood loss. In one embodiment, a patient with adequate red blood cell levels is treated with a TGF-beta superfamily co-receptor heteromultimer, or combinations of TGF-beta superfamily co-receptor heteromultimers, to increase red blood cell levels, and then blood is drawn and stored for later use in transfusions.
One or more TGF-beta superfamily co-receptor heteromultimers of the disclosure, optionally combined with an EPO receptor activator, may be used to increase red blood cell levels, hemoglobin levels, and/or hematocrit levels in a patient having an anemia. When observing hemoglobin and/or hematocrit levels in humans, a level of less than normal for the appropriate age and gender category may be indicative of anemia, although individual variations are taken into account. For example, a hemoglobin level from 10-12.5 g/dl, and typically about 11.0 g/dl is considered to be within the normal range in health adults, although, in terms of therapy, a lower target level may cause fewer cardiovascular side effects [see, e.g., Jacobs et al. (2000) Nephrol Dial Transplant 15, 15-19]. Alternatively, hematocrit levels (percentage of the volume of a blood sample occupied by the cells) can be used as a measure for anemia. Hematocrit levels for healthy individuals range from about 41-51% for adult males and from 35-45% for adult females. In certain embodiments, a patient may be treated with a dosing regimen intended to restore the patient to a target level of red blood cells, hemoglobin, and/or hematocrit. As hemoglobin and hematocrit levels vary from person to person, optimally, the target hemoglobin and/or hematocrit level can be individualized for each patient.
Anemia is frequently observed in patients having a tissue injury, an infection, and/or a chronic disease, particularly cancer. In some subjects, anemia is distinguished by low erythropoietin levels and/or an inadequate response to erythropoietin in the bone marrow [see, e.g., Adamson (2008) Harrison's Principles of Internal Medicine, 17th ed.; McGraw Hill, N.Y., pp 628-634]. Potential causes of anemia include, for example, blood loss, nutritional deficits (e.g. reduced dietary intake of protein), medication reaction, various problems associated with the bone marrow, and many diseases. More particularly, anemia has been associated with a variety of disorders and conditions that include, for example, bone marrow transplantation; solid tumors (e.g., breast cancer, lung cancer, and colon cancer); tumors of the lymphatic system (e.g., chronic lymphocyte leukemia, non-Hodgkins lymphoma, and Hodgkins lymphoma); tumors of the hematopoietic system (e.g., leukemia, a myelodysplastic syndrome and multiple myeloma); radiation therapy; chemotherapy (e.g., platinum containing regimens); inflammatory and autoimmune diseases, including, but not limited to, rheumatoid arthritis, other inflammatory arthritides, systemic lupus erythematosis (SLE), acute or chronic skin diseases (e.g., psoriasis), inflammatory bowel disease (e.g., Crohn's disease and ulcerative colitis); acute or chronic renal disease or failure, including idiopathic or congenital conditions; acute or chronic liver disease; acute or chronic bleeding; situations where transfusion of red blood cells is not possible due to patient allo- or auto-antibodies and/or for religious reasons (e.g., some Jehovah's Witnesses); infections (e.g., malaria and osteomyelitis); hemoglobinopathies including, for example, sickle cell disease (anemia), thalassemias; drug use or abuse (e.g., alcohol misuse); pediatric patients with anemia from any cause to avoid transfusion; and elderly patients or patients with underlying cardiopulmonary disease with anemia who cannot receive transfusions due to concerns about circulatory overload [see, e.g., Adamson (2008) Harrison's Principles of Internal Medicine, 17th ed.; McGraw Hill, N.Y., pp 628-634]. In some embodiments, one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure could be used to treat or prevent anemia associated with one or more of the disorders or conditions disclosed herein.
Many factors can contribute to cancer-related anemia. Some are associated with the disease process itself and the generation of inflammatory cytokines such as interleukin-1, interferon-gamma, and tumor necrosis factor [Bron et al. (2001) Semin Oncol 28(Suppl 8):1-6]. Among its effects, inflammation induces the key iron-regulatory peptide hepcidin, thereby inhibiting iron export from macrophages and generally limiting iron availability for erythropoiesis [see, e.g., Ganz (2007) J Am Soc Nephrol 18:394-400]. Blood loss through various routes can also contribute to cancer-related anemia. The prevalence of anemia due to cancer progression varies with cancer type, ranging from 5% in prostate cancer up to 90% in multiple myeloma. Cancer-related anemia has profound consequences for patients, including fatigue and reduced quality of life, reduced treatment efficacy, and increased mortality. In some embodiments, one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure, optionally combined with an EPO receptor activator, could be used to treat a cancer-related anemia.
A hypoproliferative anemia can result from primary dysfunction or failure of the bone marrow. Hypoproliferative anemias include: anemia of chronic disease, anemia associated with hypometabolic states, and anemia associated with cancer. In each of these types, endogenous erythropoietin levels are inappropriately low for the degree of anemia observed. Other hypoproliferative anemias include: early-stage iron-deficient anemia, and anemia caused by damage to the bone marrow. In these types, endogenous erythropoietin levels are appropriately elevated for the degree of anemia observed. Prominent examples would be myelosuppression caused by cancer and/or chemotherapeutic drugs or cancer radiation therapy. A broad review of clinical trials found that mild anemia can occur in 100% of patients after chemotherapy, while more severe anemia can occur in up to 80% of such patients [see, e.g., Groopman et al. (1999) J Natl Cancer Inst 91:1616-1634]. Myelosuppressive drugs include, for example: 1) alkylating agents such as nitrogen mustards (e.g., melphalan) and nitrosoureas (e.g., streptozocin); 2) antimetabolites such as folic acid antagonists (e.g., methotrexate), purine analogs (e.g., thioguanine), and pyrimidine analogs (e.g., gemcitabine); 3) cytotoxic antibiotics such as anthracyclines (e.g., doxorubicin); 4) kinase inhibitors (e.g., gefitinib); 5) mitotic inhibitors such as taxanes (e.g., paclitaxel) and vinca alkaloids (e.g., vinorelbine); 6) monoclonal antibodies (e.g., rituximab); and 7) topoisomerase inhibitors (e.g., topotecan and etoposide). In addition, conditions resulting in a hypometabolic rate can produce a mild-to-moderate hypoproliferative anemia. Among such conditions are endocrine deficiency states. For example, anemia can occur in Addison's disease, hypothyroidism, hyperparathyroidism, or males who are castrated or treated with estrogen. In some embodiments, one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure, optionally combined with an EPO receptor activator, could be used to treat a hyperproliferative anemia.
Anemia resulting from acute blood loss of sufficient volume, such as from trauma or postpartum hemorrhage, is known as acute post-hemorrhagic anemia. Acute blood loss initially causes hypovolemia without anemia since there is proportional depletion of RBCs along with other blood constituents. However, hypovolemia will rapidly trigger physiologic mechanisms that shift fluid from the extravascular to the vascular compartment, which results in hemodilution and anemia. If chronic, blood loss gradually depletes body iron stores and eventually leads to iron deficiency. In some embodiments, one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure, optionally combined with an EPO receptor activator, could be used to treat anemia resulting from acute blood loss.
Iron-deficiency anemia is the final stage in a graded progression of increasing iron deficiency which includes negative iron balance and iron-deficient erythropoiesis as intermediate stages. Iron deficiency can result from increased iron demand, decreased iron intake, or increased iron loss, as exemplified in conditions such as pregnancy, inadequate diet, intestinal malabsorption, acute or chronic inflammation, and acute or chronic blood loss. With mild-to-moderate anemia of this type, the bone marrow remains hypoproliferative, and RBC morphology is largely normal; however, even mild anemia can result in some microcytic hypochromic RBCs, and the transition to severe iron-deficient anemia is accompanied by hyperproliferation of the bone marrow and increasingly prevalent microcytic and hypochromic RBCs [see, e.g., Adamson (2008) Harrison's Principles of Internal Medicine, 17th ed.; McGraw Hill, N.Y., pp 628-634]. Appropriate therapy for iron-deficiency anemia depends on its cause and severity, with oral iron preparations, parenteral iron formulations, and RBC transfusion as major conventional options. In some embodiments, one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure, optionally combined with an EPO receptor activator, could be used to treat a chronic iron-deficiency.
Myelodysplastic syndrome (MDS) is a diverse collection of hematological conditions characterized by ineffective production of myeloid blood cells and risk of transformation to acute myelogenous leukemia. In MDS patients, blood stem cells do not mature into healthy red blood cells, white blood cells, or platelets. MDS disorders include, for example, refractory anemia, refractory anemia with ringed sideroblasts, refractory anemia with excess blasts, refractory anemia with excess blasts in transformation, refractory cytopenia with multilineage dysplasia, and myelodysplastic syndrome associated with an isolated 5q chromosome abnormality. As these disorders manifest as irreversible defects in both quantity and quality of hematopoietic cells, most MDS patients are afflicted with chronic anemia. Therefore, MDS patients eventually require blood transfusions and/or treatment with growth factors (e.g., erythropoietin or G-CSF) to increase red blood cell levels. However, many MDS patients develop side-effects due to frequency of such therapies. For example, patients who receive frequent red blood cell transfusion can exhibit tissue and organ damage from the buildup of extra iron. Accordingly, one or more TGF-beta superfamily heteromultimer complexes of the disclosure, may be used to treat patients having MDS. In certain embodiments, patients suffering from MDS may be treated using one or more TGF-beta superfamily heteromultimers of the disclosure, optionally in combination with an EPO receptor activator. In other embodiments, patients suffering from MDS may be treated using a combination of one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure and one or more additional therapeutic agents for treating MDS including, for example, thalidomide, lenalidomide, azacitadine, decitabine, erythropoietins, deferoxamine, antithymocyte globulin, and filgrastrim (G-CSF).
Originally distinguished from aplastic anemia, hemorrhage, or peripheral hemolysis on the basis of ferrokinetic studies [see, e.g., Ricketts et al. (1978) Clin Nucl Med 3:159-164], ineffective erythropoiesis describes a diverse group of anemias in which production of mature RBCs is less than would be expected given the number of erythroid precursors (erythroblasts) present in the bone marrow [Tanno et al. (2010) Adv Hematol 2010:358283]. In such anemias, tissue hypoxia persists despite elevated erythropoietin levels due to ineffective production of mature RBCs. A vicious cycle eventually develops in which elevated erythropoietin levels drive massive expansion of erythroblasts, potentially leading to splenomegaly (spleen enlargement) due to extramedullary erythropoiesis [see, e.g., Aizawa et al. (2003) Am J Hematol 74:68-72], erythroblast-induced bone pathology [see, e.g., Di Matteo et al. (2008) J Biol Regul Homeost Agents 22:211-216], and tissue iron overload, even in the absence of therapeutic RBC transfusions [see, e.g., Pippard et al. (1979) Lancet 2:819-821]. Thus, by boosting erythropoietic effectiveness, one or more TGF-beta superfamily heteromultimers of the present disclosure may break the aforementioned cycle and thus alleviate not only the underlying anemia but also the associated complications of elevated erythropoietin levels, splenomegaly, bone pathology, and tissue iron overload. In some embodiments, one or more TGF-beta superfamily co-receptor heteromultimers of the present disclosure can be used to treat or prevent ineffective erythropoiesis, including anemia and elevated EPO levels as well as complications such as splenomegaly, erythroblast-induced bone pathology, iron overload, and their attendant pathologies. With splenomegaly, such pathologies include thoracic or abdominal pain and reticuloendothelial hyperplasia. Extramedullary hematopoiesis can occur not only in the spleen but potentially in other tissues in the form of extramedullary hematopoietic pseudotumors [see, e.g., Musallam et al. (2012) Cold Spring Harb Perspect Med 2:a013482]. With erythroblast-induced bone pathology, attendant pathologies include low bone mineral density, osteoporosis, and bone pain [see, e.g., Haidar et al. (2011) Bone 48:425-432]. With iron overload, attendant pathologies include hepcidin suppression and hyperabsorption of dietary iron [see, e.g., Musallam et al. (2012) Blood Rev 26(Suppl 1):S16-S19], multiple endocrinopathies and liver fibrosis/cirrhosis [see, e.g., Galanello et al. (2010) Orphanet J Rare Dis 5:11], and iron-overload cardiomyopathy [Lekawanvijit et al., 2009, Can J Cardiol 25:213-218].
The most common causes of ineffective erythropoiesis are the thalassemia syndromes, hereditary hemoglobinopathies in which imbalances in the production of intact alpha- and beta-hemoglobin chains lead to increased apoptosis during erythroblast maturation [see, e.g., Schrier (2002) Curr Opin Hematol 9:123-126]. Thalassemias are collectively among the most frequent genetic disorders worldwide, with changing epidemiologic patterns predicted to contribute to a growing public health problem in both the U.S. and globally [Vichinsky (2005) Ann NY Acad Sci 1054:18-24]. Thalassemia syndromes are named according to their severity. Thus, α-thalassemias include α-thalassemia minor (also known as α-thalassemia trait; two affected α-globin genes), hemoglobin H disease (three affected α-globin genes), and α-thalassemia major (also known as hydrops fetalis; four affected α-globin genes). β-Thalassemias include β-thalassemia minor (also known as β-thalassemia trait; one affected β-globin gene), β-thalassemia intermedia (two affected β-globin genes), hemoglobin E thalassemia (two affected β-globin genes), and β-thalassemia major (also known as Cooley's anemia; two affected β-globin genes resulting in a complete absence of β-globin protein). β-Thalassemia impacts multiple organs, is associated with considerable morbidity and mortality, and currently requires life-long care. Although life expectancy in patients with β-thalassemia has increased in recent years due to use of regular blood transfusions in combination with iron chelation, iron overload resulting both from transfusions and from excessive gastrointestinal absorption of iron can cause serious complications such as heart disease, thrombosis, hypogonadism, hypothyroidism, diabetes, osteoporosis, and osteopenia [see, e.g., Rund et al. (2005) N Engl J Med 353:1135-1146]. In certain embodiments, one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure, optionally combined with an EPO receptor activator, can be used to treat or prevent a thalassemia syndrome.
In some embodiments, one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure, optionally combined with an EPO receptor activator, can be used for treating disorders of ineffective erythropoiesis besides thalassemia syndromes. Such disorders include siderblastic anemia (inherited or acquired); dyserythropoietic anemia (types I and II); sickle cell anemia; hereditary spherocytosis; pyruvate kinase deficiency; megaloblastic anemias, potentially caused by conditions such as folate deficiency (due to congenital diseases, decreased intake, or increased requirements), cobalamin deficiency (due to congenital diseases, pernicious anemia, impaired absorption, pancreatic insufficiency, or decreased intake), certain drugs, or unexplained causes (congenital dyserythropoietic anemia, refractory megaloblastic anemia, or erythroleukemia); myelophthisic anemias including; congenital erythropoietic porphyria; and lead poisoning.
In certain embodiments, one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure may be used in combination with supportive therapies for ineffective erythropoiesis. Such therapies include transfusion with either red blood cells or whole blood to treat anemia. In chronic or hereditary anemias, normal mechanisms for iron homeostasis are overwhelmed by repeated transfusions, eventually leading to toxic and potentially fatal accumulation of iron in vital tissues such as heart, liver, and endocrine glands. Thus, supportive therapies for patients chronically afflicted with ineffective erythropoiesis also include treatment with one or more iron-chelating molecules to promote iron excretion in the urine and/or stool and thereby prevent, or reverse, tissue iron overload [see, e.g., Hershko (2006) Haematologica 91:1307-1312; Cao et al. (2011), Pediatr Rep 3(2):e17]. Effective iron-chelating agents should be able to selectively bind and neutralize ferric iron, the oxidized form of non-transferrin bound iron which likely accounts for most iron toxicity through catalytic production of hydroxyl radicals and oxidation products [see, e.g., Esposito et al. (2003) Blood 102:2670-2677]. These agents are structurally diverse, but all possess oxygen or nitrogen donor atoms able to form neutralizing octahedral coordination complexes with individual iron atoms in stoichiometries of 1:1 (hexadentate agents), 2:1 (tridentate), or 3:1 (bidentate) [Kalinowski et al. (2005) Pharmacol Rev 57:547-583]. In general, effective iron-chelating agents also are relatively low molecular weight (e.g., less than 700 daltons), with solubility in both water and lipids to enable access to affected tissues. Specific examples of iron-chelating molecules include deferoxamine, a hexadentate agent of bacterial origin requiring daily parenteral administration, and the orally active synthetic agents deferiprone (bidentate) and deferasirox (tridentate). Combination therapy consisting of same-day administration of two iron-chelating agents shows promise in patients unresponsive to chelation monotherapy and also in overcoming issues of poor patient compliance with dereroxamine alone [Cao et al. (2011) Pediatr Rep 3(2):e17; Galanello et al. (2010) Ann NY Acad Sci 1202:79-86].
As used herein, “combination”, “in combination with” or “conjoint administration” refers to any form of administration such that the second therapy is still effective in the body (e.g., the two compounds are simultaneously effective in the patient, which may include synergistic effects of the two compounds). Effectiveness may not correlate to measurable concentration of the agent in blood, serum, or plasma. For example, the different therapeutic compounds can be administered either in the same formulation or in separate formulations, either concomitantly or sequentially, and on different schedules. Thus, an individual who receives such treatment can benefit from a combined effect of different therapies. One or more TGF-beta superfamily co-receptor heteromultimers of the disclosure can be administered concurrently with, prior to, or subsequent to, one or more other additional agents or supportive therapies. In general, each therapeutic agent will be administered at a dose and/or on a time schedule determined for that particular agent. The particular combination to employ in a regimen will take into account compatibility of the antagonist of the present disclosure with the therapy and/or the desired therapeutic effect to be achieved.
In certain embodiments, one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure may be used in combination with hepcidin or a hepcidin agonist for ineffective erythropoiesis. A circulating polypeptide produced mainly in the liver, hepcidin is considered a master regulator of iron metabolism by virtue of its ability to induce the degradation of ferroportin, an iron-export protein localized on absorptive enterocytes, hepatocytes, and macrophages. Broadly speaking, hepcidin reduces availability of extracellular iron, so hepcidin agonists may be beneficial in the treatment of ineffective erythropoiesis [see, e.g., Nemeth (2010) Adv Hematol 2010:750643]. This view is supported by beneficial effects of increased hepcidin expression in a mouse model of β-thalassemia [Gardenghi et al. (2010) J Clin Invest 120:4466-4477].
One or more TGF-beta superfamily co-receptor heteromultimers of the disclosure, optionally combined with an EPO receptor activator, would also be appropriate for treating anemias of disordered RBC maturation, which are characterized in part by undersized (microcytic), oversized (macrocytic), misshapen, or abnormally colored (hypochromic) RBCs.
In certain embodiments, the present disclosure provides methods of treating or preventing anemia in an individual in need thereof by administering to the individual a therapeutically effective amount of one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure and an EPO receptor activator. In certain embodiments, one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure may be used in combination with EPO receptor activators to reduce the required dose of these activators in patients that are susceptible to adverse effects of EPO. These methods may be used for therapeutic and prophylactic treatments of a patient.
One or more TGF-beta superfamily co-receptor heteromultimers of the disclosure may be used in combination with EPO receptor activators to achieve an increase in red blood cells, particularly at lower dose ranges of EPO receptor activators. This may be beneficial in reducing the known off-target effects and risks associated with high doses of EPO receptor activators. The primary adverse effects of EPO include, for example, an excessive increase in the hematocrit or hemoglobin levels and polycythemia. Elevated hematocrit levels can lead to hypertension (more particularly aggravation of hypertension) and vascular thrombosis. Other adverse effects of EPO which have been reported, some of which relate to hypertension, are headaches, influenza-like syndrome, obstruction of shunts, myocardial infarctions and cerebral convulsions due to thrombosis, hypertensive encephalopathy, and red cell blood cell aplasia. See, e.g., Singibarti (1994) J. Clin Investig 72(suppl 6), S36-S43; Horl et al. (2000) Nephrol Dial Transplant 15(suppl 4), 51-56; Delanty et al. (1997) Neurology 49, 686-689; and Bunn (2002) N Engl J Med 346(7), 522-523).
Provided that TGF-beta superfamily co-receptor heteromultimers of the present disclosure act by a different mechanism than EPO, these antagonists may be useful for increasing red blood cell and hemoglobin levels in patients that do not respond well to EPO. For example, a TGF-beta superfamily co-receptor heteromultimer of the present disclosure may be beneficial for a patient in which administration of a normal-to-increased dose of EPO (>300 IU/kg/week) does not result in the increase of hemoglobin level up to the target level. Patients with an inadequate EPO response are found in all types of anemia, but higher numbers of non-responders have been observed particularly frequently in patients with cancers and patients with end-stage renal disease. An inadequate response to EPO can be either constitutive (observed upon the first treatment with EPO) or acquired (observed upon repeated treatment with EPO).
In certain embodiments, the present disclosure provides methods for managing a patient that has been treated with, or is a candidate to be treated with, one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure by measuring one or more hematologic parameters in the patient. The hematologic parameters may be used to evaluate appropriate dosing for a patient who is a candidate to be treated with the antagonist of the present disclosure, to monitor the hematologic parameters during treatment, to evaluate whether to adjust the dosage during treatment with one or more antagonist of the disclosure, and/or to evaluate an appropriate maintenance dose of one or more antagonists of the disclosure. If one or more of the hematologic parameters are outside the normal level, dosing with one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure may be reduced, delayed or terminated.
Hematologic parameters that may be measured in accordance with the methods provided herein include, for example, red blood cell levels, blood pressure, iron stores, and other agents found in bodily fluids that correlate with increased red blood cell levels, using art-recognized methods. Such parameters may be determined using a blood sample from a patient. Increases in red blood cell levels, hemoglobin levels, and/or hematocrit levels may cause increases in blood pressure.
In one embodiment, if one or more hematologic parameters are outside the normal range or on the high side of normal in a patient who is a candidate to be treated with one or more TGF-beta co-receptor superfamily heteromultimers of the disclosure, then onset of administration of the one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure may be delayed until the hematologic parameters have returned to a normal or acceptable level either naturally or via therapeutic intervention. For example, if a candidate patient is hypertensive or pre-hypertensive, then the patient may be treated with a blood pressure lowering agent in order to reduce the patient's blood pressure. Any blood pressure lowering agent appropriate for the individual patient's condition may be used including, for example, diuretics, adrenergic inhibitors (including alpha blockers and beta blockers), vasodilators, calcium channel blockers, angiotensin-converting enzyme (ACE) inhibitors, or angiotensin II receptor blockers. Blood pressure may alternatively be treated using a diet and exercise regimen. Similarly, if a candidate patient has iron stores that are lower than normal, or on the low side of normal, then the patient may be treated with an appropriate regimen of diet and/or iron supplements until the patient's iron stores have returned to a normal or acceptable level. For patients having higher than normal red blood cell levels and/or hemoglobin levels, then administration of the one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure may be delayed until the levels have returned to a normal or acceptable level.
In certain embodiments, if one or more hematologic parameters are outside the normal range or on the high side of normal in a patient who is a candidate to be treated with one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure, then the onset of administration may not be delayed. However, the dosage amount or frequency of dosing of the one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure may be set at an amount that would reduce the risk of an unacceptable increase in the hematologic parameters arising upon administration of the one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure. Alternatively, a therapeutic regimen may be developed for the patient that combines one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure with a therapeutic agent that addresses the undesirable level of the hematologic parameter. For example, if the patient has elevated blood pressure, then a therapeutic regimen involving administration of one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure and a blood pressure-lowering agent may be designed. For a patient having lower than desired iron stores, a therapeutic regimen of one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure and iron supplementation may be developed.
In one embodiment, baseline parameter(s) for one or more hematologic parameters may be established for a patient who is a candidate to be treated with one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure and an appropriate dosing regimen established for that patient based on the baseline value(s). Alternatively, established baseline parameters based on a patient's medical history could be used to inform an appropriate dosing regimen for a patient. For example, if a healthy patient has an established baseline blood pressure reading that is above the defined normal range it may not be necessary to bring the patient's blood pressure into the range that is considered normal for the general population prior to treatment with the one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure. A patient's baseline values for one or more hematologic parameters prior to treatment with one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure may also be used as the relevant comparative values for monitoring any changes to the hematologic parameters during treatment with the one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure.
In certain embodiments, one or more hematologic parameters are measured in patients who are being treated with a one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure. The hematologic parameters may be used to monitor the patient during treatment and permit adjustment or termination of the dosing with the one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure or additional dosing with another therapeutic agent. For example, if administration of one or more TGF-beta superfamily co-receptor heteromultimer complexes of the disclosure of the disclosure results in an increase in blood pressure, red blood cell level, or hemoglobin level, or a reduction in iron stores, then the dose of the one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure may be reduced in amount or frequency in order to decrease the effects of the one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure on the one or more hematologic parameters. If administration of one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure results in a change in one or more hematologic parameters that is adverse to the patient, then the dosing of the one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure may be terminated either temporarily, until the hematologic parameter(s) return to an acceptable level, or permanently. Similarly, if one or more hematologic parameters are not brought within an acceptable range after reducing the dose or frequency of administration of the one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure, then the dosing may be terminated. As an alternative, or in addition to, reducing or terminating the dosing with the one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure, the patient may be dosed with an additional therapeutic agent that addresses the undesirable level in the hematologic parameter(s), such as, for example, a blood pressure-lowering agent or an iron supplement. For example, if a patient being treated with one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure has elevated blood pressure, then dosing with the one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure may continue at the same level and a blood pressure-lowering agent is added to the treatment regimen, dosing with the one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure may be reduced (e.g., in amount and/or frequency) and a blood pressure-lowering agent is added to the treatment regimen, or dosing with the one or more TGF-beta superfamily co-receptor heteromultimers of the disclosure may be terminated and the patient may be treated with a blood pressure-lowering agent.
In certain aspects, TGF-beta superfamily co-receptor single-arm heteromultimer complexes of the present disclosure can be administered alone or as a component of a pharmaceutical formulation (also referred to as a therapeutic composition or pharmaceutical composition). A pharmaceutical formation refers to a preparation which is in such form as to permit the biological activity of an active ingredient (e.g., an agent of the present disclosure) contained therein to be effective and which contains no additional components which are unacceptably toxic to a subject to which the formulation would be administered. The subject compounds may be formulated for administration in any convenient way for use in human or veterinary medicine. For example, one or more agents of the present disclosure may be formulated with a pharmaceutically acceptable carrier. A pharmaceutically acceptable carrier refers to an ingredient in a pharmaceutical formulation, other than an active ingredient, which is generally nontoxic to a subject. A pharmaceutically acceptable carrier includes, but is not limited to, a buffer, excipient, stabilizer, and/or preservative. In general, pharmaceutical formulations for use in the present disclosure are in a pyrogen-free, physiologically-acceptable form when administered to a subject. Therapeutically useful agents other than those described herein, which may optionally be included in the formulation as described above, may be administered in combination with the subject agents in the methods of the present disclosure.
In certain embodiments, compositions will be administered parenterally [e.g., by intravenous (I.V.) injection, intraarterial injection, intraosseous injection, intramuscular injection, intrathecal injection, subcutaneous injection, or intradermal injection]. Pharmaceutical compositions suitable for parenteral administration may comprise one or more agents of the disclosure in combination with one or more pharmaceutically acceptable sterile isotonic aqueous or nonaqueous solutions, dispersions, suspensions or emulsions, or sterile powders which may be reconstituted into sterile injectable solutions or dispersions just prior to use. Injectable solutions or dispersions may contain antioxidants, buffers, bacteriostats, suspending agents, thickening agents, or solutes which render the formulation isotonic with the blood of the intended recipient. Examples of suitable aqueous and nonaqueous carriers which may be employed in the pharmaceutical formulations of the present disclosure include water, ethanol, polyols (e.g., glycerol, propylene glycol, polyethylene glycol, etc.), vegetable oils (e.g., olive oil), injectable organic esters (e.g., ethyl oleate), and suitable mixtures thereof. Proper fluidity can be maintained, for example, by the use of coating materials (e.g., lecithin), by the maintenance of the required particle size in the case of dispersions, and by the use of surfactants.
In some embodiments, a therapeutic method of the present disclosure includes administering the pharmaceutical composition systemically, or locally, from an implant or device. Further, the pharmaceutical composition may be encapsulated or injected in a form for delivery to a target tissue site (e.g., bone marrow or muscle). In certain embodiments, compositions of the present disclosure may include a matrix capable of delivering one or more of the agents of the present disclosure to a target tissue site (e.g., bone marrow or muscle), providing a structure for the developing tissue and optimally capable of being resorbed into the body. For example, the matrix may provide slow release of one or more agents of the present disclosure. Such matrices may be formed of materials presently in use for other implanted medical applications.
The choice of matrix material may be based on one or more of biocompatibility, biodegradability, mechanical properties, cosmetic appearance, and interface properties. The particular application of the subject compositions will define the appropriate formulation. Potential matrices for the compositions may be biodegradable and chemically defined calcium sulfate, tricalciumphosphate, hydroxyapatite, polylactic acid, and polyanhydrides. Other potential materials are biodegradable and biologically well-defined including, for example, bone or dermal collagen. Further matrices are comprised of pure proteins or extracellular matrix components. Other potential matrices are non-biodegradable and chemically defined including, for example, sintered hydroxyapatite, bioglass, aluminates, or other ceramics. Matrices may be comprised of combinations of any of the above mentioned types of material including, for example, polylactic acid and hydroxyapatite or collagen and tricalciumphosphate. The bioceramics may be altered in composition (e.g., calcium-aluminate-phosphate) and processing to alter one or more of pore size, particle size, particle shape, and biodegradability.
In certain embodiments, pharmaceutical compositions of the present disclosure can be administered topically. “Topical application” or “topically” means contact of the pharmaceutical composition with body surfaces including, for example, the skin, wound sites, and mucous membranes. The topical pharmaceutical compositions can have various application forms and typically comprises a drug-containing layer, which is adapted to be placed near to or in direct contact with the tissue upon topically administering the composition. Pharmaceutical compositions suitable for topical administration may comprise one or more TGFβ superfamily co-receptor single-arm heteromultimer complexes of the disclosure in combination formulated as a liquid, a gel, a cream, a lotion, an ointment, a foam, a paste, a putty, a semi-solid, or a solid. Compositions in the liquid, gel, cream, lotion, ointment, foam, paste, or putty form can be applied by spreading, spraying, smearing, dabbing or rolling the composition on the target tissue. The compositions also may be impregnated into sterile dressings, transdermal patches, plasters, and bandages. Compositions of the putty, semi-solid or solid forms may be deformable. They may be elastic or non-elastic (e.g., flexible or rigid). In certain aspects, the composition forms part of a composite and can include fibers, particulates, or multiple layers with the same or different compositions.
Topical compositions in the liquid form may include pharmaceutically acceptable solutions, emulsions, microemulsions, and suspensions. In addition to the active ingredient(s), the liquid dosage form may contain an inert diluent commonly used in the art including, for example, water or other solvent, a solubilizing agent and/or emulsifier [e.g., ethyl alcohol, isopropyl alcohol, ethyl carbonate, ethyl acetate, benzyl alcohol, benzyl benzoate, propylene glycol, or 1,3-butylene glycol, an oil (e.g., cottonseed, groundnut, corn, germ, olive, castor, and sesame oil), glycerol, tetrahydrofuryl alcohol, a polyethylene glycol, a fatty acid ester of sorbitan, and mixtures thereof].
Topical gel, cream, lotion, ointment, semi-solid or solid compositions may include one or more thickening agents, such as a polysaccharide, synthetic polymer or protein-based polymer. In one embodiment of the invention, the gelling agent herein is one that is suitably nontoxic and gives the desired viscosity. The thickening agents may include polymers, copolymers, and monomers of vinylpyrrolidones, methacrylamides, acrylamides N-vinylimidazoles, carboxy vinyls, vinyl esters, vinyl ethers, silicones, polyethyleneoxides, polyethyleneglycols, vinylalcohols, sodium acrylates, acrylates, maleic acids, NN-dimethylacrylamides, diacetone acrylamides, acrylamides, acryloyl morpholine, pluronic, collagens, polyacrylamides, polyacrylates, polyvinyl alcohols, polyvinylenes, polyvinyl silicates, polyacrylates substituted with a sugar (e.g., sucrose, glucose, glucosamines, galactose, trehalose, mannose, or lactose), acylamidopropane sulfonic acids, tetramethoxyorthosilicates, methyltrimethoxyorthosilicates, tetraalkoxyorthosilicates, trialkoxyorthosilicates, glycols, propylene glycol, glycerine, polysaccharides, alginates, dextrans, cyclodextrin, celluloses, modified celluloses, oxidized celluloses, chitosans, chitins, guars, carrageenans, hyaluronic acids, inulin, starches, modified starches, agarose, methylcelluloses, plant gums, hylaronans, hydrogels, gelatins, glycosaminoglycans, carboxymethyl celluloses, hydroxyethyl celluloses, hydroxy propyl methyl celluloses, pectins, low-methoxy pectins, cross-linked dextrans, starch-acrylonitrile graft copolymers, starch sodium polyacrylate, hydroxyethyl methacrylates, hydroxyl ethyl acrylates, polyvinylene, polyethylvinylethers, polymethyl methacrylates, polystyrenes, polyurethanes, polyalkanoates, polylactic acids, polylactates, poly(3-hydroxybutyrate), sulfonated hydrogels, AMPS (2-acrylamido-2-methyl-1-propanesulfonic acid), SEM (sulfoethylmethacrylate), SPM (sulfopropyl methacrylate), SPA (sulfopropyl acrylate), N,N-dimethyl-N-methacryloxyethyl-N-(3-sulfopropyl)ammonium betaine, methacryllic acid amidopropyl-dimethyl ammonium sulfobetaine, SPI (itaconic acid-bis(1-propyl sulfonizacid-3) ester di-potassium salt), itaconic acids, AMBC (3-acrylamido-3-methylbutanoic acid), beta-carboxyethyl acrylate (acrylic acid dimers), and maleic anhydride-methylvinyl ether polymers, derivatives thereof, salts thereof, acids thereof, and combinations thereof. In certain embodiments, pharmaceutical compositions of present disclosure can be administered orally, for example, in the form of capsules, cachets, pills, tablets, lozenges (using a flavored basis such as sucrose and acacia or tragacanth), powders, granules, a solution or a suspension in an aqueous or non-aqueous liquid, an oil-in-water or water-in-oil liquid emulsion, or an elixir or syrup, or pastille (using an inert base, such as gelatin and glycerin, or sucrose and acacia), and/or a mouth wash, each containing a predetermined amount of a compound of the present disclosure and optionally one or more other active ingredients. A compound of the present disclosure and optionally one or more other active ingredients may also be administered as a bolus, electuary, or paste.
In solid dosage forms for oral administration (e.g., capsules, tablets, pills, dragees, powders, and granules), one or more compounds of the present disclosure may be mixed with one or more pharmaceutically acceptable carriers including, for example, sodium citrate, dicalcium phosphate, a filler or extender (e.g., a starch, lactose, sucrose, glucose, mannitol, and silicic acid), a binder (e.g. carboxymethylcellulose, an alginate, gelatin, polyvinyl pyrrolidone, sucrose, and acacia), a humectant (e.g., glycerol), a disintegrating agent (e.g., agar-agar, calcium carbonate, potato or tapioca starch, alginic acid, a silicate, and sodium carbonate), a solution retarding agent (e.g. paraffin), an absorption accelerator (e.g. a quaternary ammonium compound), a wetting agent (e.g., cetyl alcohol and glycerol monostearate), an absorbent (e.g., kaolin and bentonite clay), a lubricant (e.g., a talc, calcium stearate, magnesium stearate, solid polyethylene glycols, sodium lauryl sulfate), a coloring agent, and mixtures thereof. In the case of capsules, tablets, and pills, the pharmaceutical formulation (composition) may also comprise a buffering agent. Solid compositions of a similar type may also be employed as fillers in soft and hard-filled gelatin capsules using one or more excipients including, e.g., lactose or a milk sugar as well as a high molecular-weight polyethylene glycol.
Liquid dosage forms for oral administration of the pharmaceutical composition may include pharmaceutically acceptable emulsions, microemulsions, solutions, suspensions, syrups, and elixirs. In addition to the active ingredient(s), the liquid dosage form may contain an inert diluent commonly used in the art including, for example, water or other solvent, a solubilizing agent and/or emulsifier [e.g., ethyl alcohol, isopropyl alcohol, ethyl carbonate, ethyl acetate, benzyl alcohol, benzyl benzoate, propylene glycol, or 1,3-butylene glycol, an oil (e.g., cottonseed, groundnut, corn, germ, olive, castor, and sesame oil), glycerol, tetrahydrofuryl alcohol, a polyethylene glycol, a fatty acid ester of sorbitan, and mixtures thereof]. Besides inert diluents, the oral formulation can also include an adjuvant including, for example, a wetting agent, an emulsifying and suspending agent, a sweetening agent, a flavoring agent, a coloring agent, a perfuming agent, a preservative agent, and combinations thereof.
Suspensions, in addition to the active compounds, may contain suspending agents including, for example, an ethoxylated isostearyl alcohol, polyoxyethylene sorbitol, a sorbitan ester, microcrystalline cellulose, aluminum metahydroxide, bentonite, agar-agar, tragacanth, and combinations thereof.
Prevention of the action and/or growth of microorganisms may be ensured by the inclusion of various antibacterial and antifungal agents including, for example, paraben, chlorobutanol, and phenol sorbic acid.
In certain embodiments, it may be desirable to include an isotonic agent including, for example, a sugar or sodium chloride into the compositions. In addition, prolonged absorption of an injectable pharmaceutical form may be brought about by the inclusion of an agent that delay absorption including, for example, aluminum monostearate and gelatin.
It is understood that the dosage regimen will be determined by the attending physician considering various factors which modify the action of the one or more of the agents of the present disclosure. In the case of a TGF-beta superfamily co-receptor single-arm heteromultimer complex that promotes red blood cell formation, various factors may include, but are not limited to, the patient's red blood cell count, hemoglobin level, the desired target red blood cell count, the patient's age, the patient's sex, the patient's diet, the severity of any disease that may be contributing to a depressed red blood cell level, the time of administration, and other clinical factors. The addition of other known active agents to the final composition may also affect the dosage. Progress can be monitored by periodic assessment of one or more of red blood cell levels, hemoglobin levels, reticulocyte levels, and other indicators of the hematopoietic process.
In certain embodiments, the present disclosure also provides gene therapy for the in vivo production of one or more of the agents of the present disclosure. Such therapy would achieve its therapeutic effect by introduction of the agent sequences into cells or tissues having one or more of the disorders as listed above. Delivery of the agent sequences can be achieved, for example, by using a recombinant expression vector such as a chimeric virus or a colloidal dispersion system. Preferred therapeutic delivery of one or more of agent sequences of the disclosure is the use of targeted liposomes.
Various viral vectors which can be utilized for gene therapy as taught herein include adenovirus, herpes virus, vaccinia, or an RNA virus (e.g., a retrovirus). The retroviral vector may be a derivative of a murine or avian retrovirus. Examples of retroviral vectors in which a single foreign gene can be inserted include, but are not limited to: Moloney murine leukemia virus (MoMuLV), Harvey murine sarcoma virus (HaMuSV), murine mammary tumor virus (MuMTV), and Rous Sarcoma Virus (RSV). A number of additional retroviral vectors can incorporate multiple genes. All of these vectors can transfer or incorporate a gene for a selectable marker so that transduced cells can be identified and generated. Retroviral vectors can be made target-specific by attaching, for example, a sugar, a glycolipid, or a protein. Preferred targeting is accomplished by using an antibody. Those of skill in the art will recognize that specific polynucleotide sequences can be inserted into the retroviral genome or attached to a viral envelope to allow target specific delivery of the retroviral vector containing one or more of the agents of the present disclosure.
Alternatively, tissue culture cells can be directly transfected with plasmids encoding the retroviral structural genes (gag, pol, and env), by conventional calcium phosphate transfection. These cells are then transfected with the vector plasmid containing the genes of interest. The resulting cells release the retroviral vector into the culture medium.
Another targeted delivery system for one or more of the agents of the present disclosure is a colloidal dispersion system. Colloidal dispersion systems include, for example, macromolecule complexes, nanocapsules, microspheres, beads, and lipid-based systems including oil-in-water emulsions, micelles, mixed micelles, and liposomes. In certain embodiments, the preferred colloidal system of this disclosure is a liposome. Liposomes are artificial membrane vesicles which are useful as delivery vehicles in vitro and in vivo. RNA, DNA, and intact virions can be encapsulated within the aqueous interior and be delivered to cells in a biologically active form. See, e.g., Fraley, et al. (1981) Trends Biochem. Sci., 6:77. Methods for efficient gene transfer using a liposome vehicle are known in the art. See, e.g., Mannino, et al. (1988) Biotechniques, 6:682, 1988.
The composition of the liposome is usually a combination of phospholipids, which may include a steroid (e.g.cholesterol). The physical characteristics of liposomes depend on pH, ionic strength, and the presence of divalent cations. Other phospholipids or other lipids may also be used including, for example a phosphatidyl compound (e.g., phosphatidylglycerol, phosphatidylcholine, phosphatidylserine, phosphatidylethanolamine, a sphingolipid, a cerebroside, and a ganglioside), egg phosphatidylcholine, dipalmitoylphosphatidylcholine, and distearoylphosphatidylcholine. The targeting of liposomes is also possible based on, for example, organ-specificity, cell-specificity, and organelle-specificity and is known in the art.
The invention now being generally described, it will be more readily understood by reference to the following examples, which are included merely for purposes of illustration of certain embodiments of the present invention, and are not intended to limit the invention.
Applicants envision construction of a soluble single-arm endoglin-Fc heterodimeric complex comprising a monomeric Fc polypeptide with a short N-terminal extension and a second polypeptide in which a ligand-binding domain of human endoglin is fused to a separate Fc domain with a linker positioned between the ligand-binding domain and this second Fc domain. The individual constructs are referred to as monomeric Fc polypeptide and endoglin-Fc fusion polypeptide, respectively, and the sequences for each are provided below. Applicants also envision similar single-arm endoglin-Fc heterodimeric complexes comprising ligand-binding domains of endoglin isoforms 2 or 3 (SEQ ID Nos: 6 or 10).
A methodology for promoting formation of endoglin-Fc:Fc heteromeric complexes rather than endoglin-Fc: endoglin-Fc or Fc:Fc homodimeric complexes is to introduce alterations in the amino acid sequence of the Fc domains to guide the formation of asymmetric heteromeric complexes. Many different approaches to making asymmetric interaction pairs using Fc domains are described in this disclosure.
In one approach, illustrated in the endoglin-Fc and monomeric Fc polypeptide sequences of SEQ ID NOs: 500-501 and 502-503, respectively, one Fc domain is altered to introduce cationic amino acids at the interaction face, while the other Fc domain is altered to introduce anionic amino acids at the interaction face. The endoglin-Fc fusion polypeptide and monomeric Fe polypeptide each employ the tissue plasminogen activator (TPA) leader:
The endoglin-Fc polypeptide sequence (SEQ ID NO: 500) is shown below:
MDAMKRGLCC VLLLCGAVFV SPGASETVHC DLQPVGPERG EVTYTTSQVS KGCVAQAPNA
The leader (signal) sequence and linker are underlined. To promote formation of the endoglin-Fc:Fc heterodimer rather than either of the possible homodimeric complexes (endoglin-Fc:endoglin-Fc or Fc:Fc), two amino acid substitutions (replacing acidic amino acids with lysine) can be introduced into the Fc domain of the endoglin fusion protein as indicated by double underline above. The amino acid sequence of SEQ ID NO: 500 may optionally be provided with the C-terminal lysine (K) removed.
The mature endoglin-Fc fusion polypeptide (SEQ ID NO: 501) is as follows and may optionally be provided with the C-terminal lysine removed.
The complementary human G1Fc polypeptide (SEQ ID NO: 502) employs the TPA leader and is as follows:
The leader sequence is underlined, and an optional N-terminal extension of the Fc polypeptide is indicated by double underline. To promote formation of the endoglin-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, two amino acid substitutions (replacing lysines with anionic residues) can be introduced into the monomeric Fc polypeptide as indicated by double underline above. The amino acid sequence of SEQ ID NO: 502 may optionally be provided with the C-terminal lysine removed.
The sequence of the mature monomeric Fc polypeptide is as follows (SEQ ID NO: 503) and may optionally be provided with the C-terminal lysine removed.
The endoglin-Fc fusion polypeptide and monomeric Fc polypeptide of SEQ ID NO: 501 and SEQ ID NO: 503, respectively, may be co-expressed and purified from a CHO cell line to give rise to a single-arm heteromeric protein complex comprising endoglin-Fc:Fc.
In another approach to promote the formation of heteromultimer complexes using asymmetric Fc fusion polypeptides, the Fc domains are altered to introduce complementary hydrophobic interactions and an additional intermolecular disulfide bond, as illustrated in the endoglin-Fc and monomeric Fc polypeptide sequences of SEQ ID NOs: 504-505 and 506-507, respectively.
The endoglin-Fc polypeptide sequence (SEQ ID NO: 504) employs the TPA leader and is shown below:
The leader sequence and linker are underlined. To promote formation of the endoglin-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, two amino acid substitutions (replacing a serine with a cysteine and a threonine with a trytophan) can be introduced into the Fe domain of the fusion protein as indicated by double underline above. The amino acid sequence of SEQ ID NO: 504 may optionally be provided with the C-terminal lysine removed.
The mature endoglin-Fc fusion polypeptide is as follows:
The complementary form of monomeric Fc polypeptide (SEQ ID NO: 506) uses the TPA leader and is as follows.
The leader sequence is underlined, and an optional N-terminal extension of the Fc polypeptide is indicated by double underline. To promote formation of the endoglin-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, four amino acid substitutions can be introduced into the monomeric Fc polypeptide as indicated by double underline above. The amino acid sequence of SEQ ID NO: 506 may optionally be provided with the C-terminal lysine removed.
The mature monomeric Fe polypeptide sequence (SEQ ID NO: 507) is as follows and may optionally be provided with the C-terminal lysine removed.
The endoglin-Fc fusion polypeptide and monomeric Fc polypeptide of SEQ ID NO: 505 and SEQ ID NO: 507, respectively, may be co-expressed and purified from a CHO cell line to give rise to a single-arm heteromeric protein complex comprising endoglin-Fc:Fc.
Purification of various endoglin-Fc:Fc complexes could be achieved by a series of column chromatography steps, including, for example, three or more of the following, in any order: protein A chromatography, Q sepharose chromatography, phenylsepharose chromatography, size exclusion chromatography, and cation exchange chromatography. The purification could be completed with viral filtration and buffer exchange.
Applicants envision construction of a soluble single-arm Cripto-Fc heterodimeric complex comprising a monomeric Fc polypeptide with a short N-terminal extension and a second polypeptide in which a ligand-binding domain of human Cripto-1 is fused to a separate Fc domain with a linker positioned between a ligand-binding domain and this second Fc domain. The individual constructs are referred to as monomeric Fc polypeptide and Cripto-Fc fusion polypeptide, respectively, and the sequences for each are provided below. Applicants also envision additional single-arm Cripto-Fc heterodimeric complexes comprising a ligand-binding domain of Cripto-1 isoform 2 (SEQ ID NO: 18).
Formation of a single-arm Cripto-Fc heterodimer may be guided by approaches similar to those described for single-arm endoglin-Fc heterodimer in Example 1. In a first approach, illustrated in the Cripto-Fc and monomeric Fc polypeptide sequences of SEQ ID NOs: 508-509 and 502-503, respectively, one Fc domain is altered to introduce cationic amino acids at the interaction face, while the other Fc domain is altered to introduce anionic amino acids at the interaction face.
The Cripto-Fc fusion polypeptide employs the TPA leader and is as follows:
MDAMKRGLCC VLLLCGAVFV SPGASPPNPR TCVFFEAPGV RGSTKTLGEL LDTGTELPRA
The leader and linker sequences are underlined. To promote formation of the Cripto-Fc:Fc heterodimer rather than either of the possible homodimeric complexes (Cripto-Fc:Cripto-Fc or Fc:Fc), two amino acid substitutions (replacing anionic residues with lysines) can be introduced into the Fc domain of the fusion polypeptide as indicated by double underline above. The amino acid sequence of SEQ ID NO: 508 may optionally be provided with the C-terminal lysine removed.
The mature Cripto-Fc fusion polypeptide sequence is as follows (SEQ ID NO: 509) and may optionally be provided with the C-terminal lysine removed.
As described in Example 1, the complementary form of monomeric human G1Fc polypeptide (SEQ ID NO: 502) employs the TPA leader and incorporates an optional N-terminal extension. To promote formation of the Cripto-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, two amino acid substitutions (replacing lysines with anionic residues) can be introduced into the monomeric Fc polypeptide as indicated. The amino acid sequence of SEQ ID NO: 502 may optionally be provided with the C-terminal lysine removed. The mature monomeric Fc polypeptide (SEQ ID NO: 503) may optionally be provided with the C-terminal lysine removed.
The Cripto-Fc fusion polypeptide and monomeric Fe polypeptide of SEQ ID NO: 509 and SEQ ID NO: 503, respectively, may be co-expressed and purified from a CHO cell line to give rise to a single-arm heteromeric protein complex comprising Cripto-Fc:Fc.
In another approach to promoting the formation of heteromultimer complexes using asymmetric Fc fusion polypeptides, the Fc domains are altered to introduce complementary hydrophobic interactions and an additional intermolecular disulfide bond as illustrated in the Cripto-Fc and Fc polypeptide sequences of SEQ ID NOs: 510-511 and 506-507, respectively.
The Cripto-Fc fusion polypeptide (SEQ ID NO: 510) uses the TPA leader and is as follows:
MDAMKRGLCC VLLLCGAVFV SPGASPPNRR TCVFFEADGV RGSTKTLGEL LDTGTELPRA
The leader sequence and linker are underlined. To promote formation of the Cripto-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, two amino acid substitutions (replacing a serine with a cysteine and a threonine with a tryptophan) can be introduced into the Fc domain of the Cripto fusion polypeptide as indicated by double underline above. The amino acid sequence of SEQ ID NO: 510 may optionally be provided with the C-terminal lysine removed.
The mature Cripto-Fc fusion polypeptide (SEQ ID NO: 511) is as follows and may optionally be provided with the C-terminal lysine removed.
As described in Example 1, the complementary form of monomeric G1Fc polypeptide (SEQ ID NO: 506) employs the TPA leader and incorporates an optional N-terminal extension. To promote formation of the Cripto-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, four amino acid substitutions can be introduced into the monomeric Fc polypeptide as indicated. The amino acid sequence of SEQ ID NO: 506 and the mature Fc polypeptide (SEQ ID NO: 507) may optionally be provided with the C-terminal lysine removed.
The Cripto-Fc fusion polypeptide and monomeric Fc polypeptide of SEQ ID NO: 511 and SEQ ID NO: 507, respectively, may be co-expressed and purified from a CHO cell line to give rise to a single-arm heteromeric protein complex comprising Cripto-Fc:Fc.
Purification of various Cripto-Fc:Fc complexes could be achieved by a series of column chromatography steps, including, for example, three or more of the following, in any order: protein A chromatography, Q sepharose chromatography, phenylsepharose chromatography, size exclusion chromatography, and cation exchange chromatography. The purification could be completed with viral filtration and buffer exchange.
Applicants envision construction of a soluble single-arm Cryptic-Fc heterodimeric complex comprising a monomeric Fc polypeptide with a short N-terminal extension and a second polypeptide in which a ligand-binding domain of human Cryptic is fused to a separate Fc domain with a linker positioned between a ligand-binding domain and this second Fc domain. The individual constructs are referred to as monomeric Fc polypeptide and Cryptic-Fc fusion polypeptide, respectively, and the sequences for each are provided below. Applicants also envision additional single-arm Cryptic-Fc heterodimeric complexes comprising a ligand-binding domain of Cryptic isoforms 2 or 3 (SEQ ID NOs: 26 or 30).
Formation of a single-arm Cryptic-Fc heterodimer may be guided by approaches similar to those described for single-arm endoglin-Fc heterodimer in Example 1. In a first approach, illustrated in the Cryptic-Fc and monomeric Fc polypeptide sequences of SEQ ID NOs: 512-513 and 502-503, respectively, one Fc domain is altered to introduce cationic amino acids at the interaction face, while the other Fc domain is altered to introduce anionic amino acids at the interaction face.
The Cryptic-Fc fusion polypeptide employs the TPA leader and is as follows:
The leader and linker sequences are underlined. To promote formation of the Cryptic-Fc:Fc heterodimer rather than either of the possible homodimeric complexes (Cryptic-Fc:Cryptic-Fc or Fc:Fc), two amino acid substitutions (replacing anionic residues with lysines) can be introduced into the Fc domain of the fusion polypeptide as indicated by double underline above. The amino acid sequence of SEQ ID NO: 512 may optionally be provided with the C-terminal lysine removed.
The mature Cryptic-Fc fusion polypeptide sequence is as follows (SEQ ID NO: 513) and may optionally be provided with the C-terminal lysine removed.
As described in Example 1, the complementary form of monomeric human G1Fc polypeptide (SEQ ID NO: 502) employs the TPA leader and incorporates an optional N-terminal extension. To promote formation of the Cryptic-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, two amino acid substitutions (replacing lysines with anionic residues) can be introduced into the monomeric Fc polypeptide as indicated. The amino acid sequence of SEQ ID NO: 502 may optionally be provided with the C-terminal lysine removed. The mature monomeric Fc polypeptide (SEQ ID NO: 503) may optionally be provided with the C-terminal lysine removed.
The Cryptic-Fc fusion polypeptide and monomeric Fc polypeptide of SEQ ID NO: 513 and SEQ ID NO: 503, respectively, may be co-expressed and purified from a CHO cell line to give rise to a single-arm heteromeric protein complex comprising Cryptic-Fc:Fc.
In another approach to promoting the formation of heteromultimer complexes using asymmetric Fc fusion polypeptides, the Fc domains are altered to introduce complementary hydrophobic interactions and an additional intermolecular disulfide bond as illustrated in the Cryptic-Fc and Fc polypeptide sequences of SEQ ID NOs: 514-515 and 506-507, respectively.
The Cryptic-Fc fusion polypeptide (SEQ ID NO: 514) uses the TPA leader and is as follows:
MDAMKRGLCC VLLLCGAVFV SPGASYQREK HNGGREEVTK VATQKHRQSP LVWTSSEFGE
The leader sequence and linker are underlined. To promote formation of the Cryptic-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, two amino acid substitutions (replacing a serine with a cysteine and a threonine with a tryptophan) can be introduced into the Fc domain of the Cryptic fusion polypeptide as indicated by double underline above. The amino acid sequence of SEQ ID NO: 514 may optionally be provided with the C-terminal lysine removed.
The mature Cryptic-Fc fusion polypeptide (SEQ ID NO: 515) is as follows and may optionally be provided with the C-terminal lysine removed.
As described in Example 1, the complementary form of monomeric G1Fc polypeptide (SEQ ID NO: 506) employs the TPA leader and incorporates an optional N-terminal extension. To promote formation of the Cryptic-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, four amino acid substitutions can be introduced into the monomeric Fc polypeptide as indicated. The amino acid sequence of SEQ ID NO: 506 and the mature Fc polypeptide (SEQ ID NO: 507) may optionally be provided with the C-terminal lysine removed.
The Cryptic-Fc fusion polypeptide and monomeric Fc polypeptide of SEQ ID NO: 515 and SEQ ID NO: 507, respectively, may be co-expressed and purified from a CHO cell line to give rise to a single-arm heteromeric protein complex comprising Cryptic-Fc:Fc.
Purification of various Cryptic-Fc:Fc complexes could be achieved by a series of column chromatography steps, including, for example, three or more of the following, in any order: protein A chromatography, Q sepharose chromatography, phenylsepharose chromatography, size exclusion chromatography, and cation exchange chromatography. The purification could be completed with viral filtration and buffer exchange.
Applicants envision construction of a soluble single-arm Cryptic1B-Fc heterodimeric complex comprising a monomeric Fc polypeptide with a short N-terminal extension and a second polypeptide in which a ligand-binding domain of human Cryptic family protein 1B is fused to a separate Fc domain with a linker positioned between a ligand-binding domain and this second Fc domain. The individual constructs are referred to as monomeric Fc polypeptide and Cryptic1B-Fc fusion polypeptide, respectively, and the sequences for each are provided below.
Formation of a single-arm Cryptic1B-Fc heterodimer may be guided by approaches similar to those described for single-arm endoglin-Fc heterodimer in Example 1. In a first approach, illustrated in the Cryptic1B-Fc and monomeric Fc polypeptide sequences of SEQ ID NOs: 516-517 and 502-503, respectively, one Fc domain is altered to introduce cationic amino acids at the interaction face, while the other Fc domain is altered to introduce anionic amino acids at the interaction face.
The Cryptic1B-Fc fusion polypeptide employs the TPA leader and is as follows:
MDAMKRGLCC VLLLCGAVFV SPGASYQREK HNGGREEVTK VATQKHRQSP LNWTSSHFGE
The leader and linker sequences are underlined. To promote formation of the Cryptic1B-Fc:Fc heterodimer rather than either of the possible homodimeric complexes (Cryptic1B-Fc:Cryptic1B-Fc or Fc:Fc), two amino acid substitutions (replacing anionic residues with lysines) can be introduced into the Fc domain of the fusion polypeptide as indicated by double underline above. The amino acid sequence of SEQ ID NO: 516 may optionally be provided with the C-terminal lysine removed.
The mature Cryptic1B-Fc fusion polypeptide sequence is as follows (SEQ ID NO: 517) and may optionally be provided with the C-terminal lysine removed.
As described in Example 1, the complementary form of monomeric human G1F polypeptide (SEQ ID NO: 502) employs the TPA leader and incorporates an optional N-terminal extension. To promote formation of the Cryptic1B-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, two amino acid substitutions (replacing lysines with anionic residues) can be introduced into the monomeric Fc polypeptide as indicated. The amino acid sequence of SEQ ID NO: 502 may optionally be provided with the C-terminal lysine removed. The mature monomeric Fc polypeptide (SEQ ID NO: 503) may optionally be provided with the C-terminal lysine removed.
The Cryptic1B-Fc fusion polypeptide and monomeric Fc polypeptide of SEQ ID NO: 517 and SEQ ID NO: 503, respectively, may be co-expressed and purified from a CHO cell line to give rise to a single-arm heteromeric protein complex comprising CrypticB-Fc:Fc.
In another approach to promoting the formation of heteromultimer complexes using asymmetric Fc fusion polypeptides, the Fc domains are altered to introduce complementary hydrophobic interactions and an additional intermolecular disulfide bond as illustrated in the Cryptic1B-Fc and Fc polypeptide sequences of SEQ ID NOs: 518-519 and 506-507, respectively.
The Cryptic1B-Fc fusion polypeptide (SEQ ID NO: 518) uses the TPA leader and is as follows:
The leader sequence and linker are underlined. To promote formation of the Cryptic1B-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, two amino acid substitutions (replacing a seine with a cysteine and a threonine with a tryptophan) can be introduced into the Fc domain of the Cryptic1B fusion polypeptide as indicated by double underline above. The amino acid sequence of SEQ ID NO: 518 may optionally be provided with the C-terminal lysine removed.
The mature Cryptic1B-Fc fusion polypeptide (SEQ ID NO: 519) is as follows and may optionally be provided with the C-terminal lysine removed.
As described in Example 1, the complementary form of monomeric G1Fc polypeptide (SEQ ID NO: 506) employs the TPA leader and incorporates an optional N-terminal extension. To promote formation of the Cryptic1B-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, four amino acid substitutions can be introduced into the monomeric Fc polypeptide as indicated. The amino acid sequence of SEQ ID NO: 506 and the mature Fc polypeptide (SEQ ID NO: 507) may optionally be provided with the C-terminal lysine removed.
The Cryptic1B-Fc fusion polypeptide and monomeric Fc polypeptide of SEQ ID NO: 519 and SEQ ID NO: 507, respectively, may be co-expressed and purified from a CHO cell line to give rise to a single-arm heteromeric protein complex comprising CrypticB-Fc:Fc.
Purification of various Cryptic1B-Fc:Fc complexes could be achieved by a series of column chromatography steps, including, for example, three or more of the following, in any order: protein A chromatography, Q sepharose chromatography, phenylsepharose chromatography, size exclusion chromatography, and cation exchange chromatography. The purification could be completed with viral filtration and buffer exchange.
Applicants envision construction of a soluble single-arm CRIM1-Fc heterodimeric complex comprising a monomeric Fc polypeptide with a short N-terminal extension and a second polypeptide in which a ligand-binding domain of human CRIM1 is fused to a separate Fc domain with a linker positioned between a ligand-binding domain and this second Fc domain. The individual constructs are referred to as monomeric Fc polypeptide and CRIM1-Fc fusion polypeptide, respectively, and the sequences for each are provided below.
Formation of a single-arm CRIM1-Fc heterodimer may be guided by approaches similar to those described for single-arm endoglin-Fc heterodimer in Example 1. In a first approach, illustrated in the CRIM1-Fc and monomeric Fc polypeptide sequences of SEQ ID NOs: 520-521 and 502-503, respectively, one Fc domain is altered to introduce cationic amino acids at the interaction face, while the other Fc domain is altered to introduce anionic PGP211,TRE amino acids at the interaction face.
The CRIM1-Fc fusion polypeptide employs the TPA leader and is as follows:
MDAMKRGLCC VLLLCGAVFV SPGASLVCLP CDESKCEEPR NCPGSIVQGV CGCCYTCASQ
The leader and linker sequences are underlined. To promote formation of the CRIM1-Fc:Fc heterodimer rather than either of the possible homodimeric complexes (CRIM1-Fc:CRIM1-Fc or Fc:Fc), two amino acid substitutions (replacing anionic residues with lysines) can be introduced into the Fc domain of the fusion polypeptide as indicated by double underline above. The amino acid sequence of SEQ ID NO: 520 may optionally be provided with the C-terminal lysine removed.
The mature CRIM1-Fc fusion polypeptide sequence is as follows (SEQ ID NO: 521) and may optionally be provided with the C-terminal lysine removed.
As described in Example 1, the complementary form of monomeric human G1Fc polypeptide (SEQ ID NO: 502) employs the TPA leader and incorporates an optional N-terminal extension. To promote formation of the CRIM1-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, two amino acid substitutions (replacing lysines with anionic residues) can be introduced into the monomeric Fc polypeptide as indicated. The amino acid sequence of SEQ ID NO: 502 may optionally be provided with the C-terminal lysine removed. The mature monomeric Fc polypeptide (SEQ ID NO: 503) may optionally be provided with the C-terminal lysine removed.
The CRIM1-Fc fusion polypeptide and monomeric Fc polypeptide of SEQ ID NO: 521 and SEQ ID NO: 503, respectively, may be co-expressed and purified from a CHO cell line to give rise to a single-arm heteromeric protein complex comprising CRIM1-Fc:Fc.
In another approach to promoting the formation of heteromultimer complexes using asymmetric Fc fusion polypeptides, the Fc domains are altered to introduce complementary hydrophobic interactions and an additional intermolecular disulfide bond as illustrated in the CRIM1-Fc and Fc polypeptide sequences of SEQ ID NOs: 522-523 and 506-507, respectively.
The CRIM1-Fc fusion polypeptide (SEQ ID NO: 522) uses the TPA leader and is as follows:
MDAMKRGLCC VLLLCGAVFV SPGASLVCLP CDESKCEEPR NCPGSIVQGV CGCCYTCASQ
The leader sequence and linker are underlined. To promote formation of the CRIM1-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, two amino acid substitutions (replacing a serine with a cysteine and a threonine with a tryptophan) can be introduced into the Fc domain of the CRIM1 fusion polypeptide as indicated by double underline above. The amino acid sequence of SEQ ID NO: 522 may optionally be provided with the C-terminal lysine removed.
The mature CRIM1-Fc fusion polypeptide (SEQ ID NO: 523) is as follows and may optionally be provided with the C-terminal lysine removed.
As described in Example 1, the complementary form of monomeric G1Fc polypeptide (SEQ ID NO: 506) employs the TPA leader and incorporates an optional N-terminal extension. To promote formation of the CRIM1-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, four amino acid substitutions can be introduced into the monomeric Fc polypeptide as indicated. The amino acid sequence of SEQ ID NO: 506 and the mature Fc polypeptide (SEQ ID NO: 507) may optionally be provided with the C-terminal lysine removed.
The CRIM1-Fc fusion polypeptide and monomeric Fc polypeptide of SEQ ID NO: 523 and SEQ ID NO: 507, respectively, may be co-expressed and purified from a CHO cell line to give rise to a single-arm heteromeric protein complex comprising CRIM1-Fc:Fc.
Purification of various CRIM1-Fc:Fc complexes could be achieved by a series of column chromatography steps, including, for example, three or more of the following, in any order: protein A chromatography, Q sepharose chromatography, phenylsepharose chromatography, size exclusion chromatography, and cation exchange chromatography. The purification could be completed with viral filtration and buffer exchange.
Applicants envision construction of a soluble single-arm CRIM2-Fc heterodimeric complex comprising a monomeric Fc polypeptide with a short N-terminal extension and a second polypeptide in which a ligand-binding domain of human CRIM2 is fused to a separate Fc domain with a linker positioned between a ligand-binding domain and this second Fc domain. The individual constructs are referred to as monomeric Fc polypeptide and CRIM2-Fc fusion polypeptide, respectively, and the sequences for each are provided below. Applicants also envision additional single-arm CRIM2-Fc heterodimeric complexes comprising a ligand-binding domain of CRIM2 isoform 2 (SEQ ID NO: 46).
Formation of a single-arm CRIM2-Fc heterodimer may be guided by approaches similar to those described for single-arm endoglin-Fc heterodimer in Example 1. In a first approach, illustrated in the CRIM2-Fc and monomeric Fc polypeptide sequences of SEQ ID NOs: 524-525 and 502-503, respectively, one Fc domain is altered to introduce cationic amino acids at the interaction face, while the other Fc domain is altered to introduce anionic amino acids at the interaction face.
The CRIM2-Fc fusion polypeptide employs the TPA leader and is as follows:
The leader and linker sequences are underlined. To promote formation of the CRIM2-Fc:Fc heterodimer rather than either of the possible homodimeric complexes (CRIM2-Fc:CRIM2-Fc or Fc:Fc), two amino acid substitutions (replacing anionic residues with lysines) can be introduced into the Fc domain of the fusion polypeptide as indicated by double underline above. The amino acid sequence of SEQ ID NO: 524 may optionally be provided with the C-terminal lysine removed.
The mature CRIM2-Fc fusion polypeptide sequence is as follows (SEQ ID NO: 525) and may optionally be provided with the C-terminal lysine removed.
As described in Example 1, the complementary form of monomeric human G1Fc polypeptide (SEQ ID NO: 502) employs the TPA leader and incorporates an optional N-terminal extension. To promote formation of the CRIM2-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, two amino acid substitutions (replacing lysines with anionic residues) can be introduced into the monomeric Fc polypeptide as indicated. The amino acid sequence of SEQ ID NO: 502 may optionally be provided with the C-terminal lysine removed. The mature monomeric Fc polypeptide (SEQ ID NO: 503) may optionally be provided with the C-terminal lysine removed.
The CRIM2-Fc fusion polypeptide and monomeric Fc polypeptide of SEQ ID NO: 525 and SEQ ID NO: 503, respectively, may be co-expressed and purified from a CHO cell line to give rise to a single-arm heteromeric protein complex comprising CRIM2-Fc:Fc.
In another approach to promoting the formation of heteromultimer complexes using asymmetric Fc fusion polypeptides, the Fc domains are altered to introduce complementary hydrophobic interactions and an additional intermolecular disulfide bond as illustrated in the CRIM2-Fc and Fc polypeptide sequences of SEQ ID NOs: 526-527 and 506-507, respectively.
The CRIM2-Fc fusion polypeptide (SEQ ID NO: 526) uses the TPA leader and is as follows:
MDAMKRGLCC VLLLCGAVFV SPGASGAVPR EPPGQQTTAH SSVLAGNSQE QWHPLREWLG
The leader sequence and linker are underlined. To promote formation of the CRIM2-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, two amino acid substitutions (replacing a serine with a cysteine and a threonine with a tryptophan) can be introduced into the Fc domain of the CRIM2 fusion polypeptide as indicated by double underline above. The amino acid sequence of SEQ ID NO: 526 may optionally be provided with the C-terminal lysine removed.
The mature CRIM2-Fc fusion polypeptide (SEQ ID NO: 527) is as follows and may optionally be provided with the C-terminal lysine removed.
As described in Example 1, the complementary form of monomeric G1Fe polypeptide (SEQ ID NO: 506) employs the TPA leader and incorporates an optional N-terminal extension. To promote formation of the CRIM2-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, four amino acid substitutions can be introduced into the monomeric Fe polypeptide as indicated. The amino acid sequence of SEQ ID NO: 506 and the mature Fe polypeptide (SEQ ID NO: 507) may optionally be provided with the C-terminal lysine removed.
The CRIM2-Fc fusion polypeptide and monomeric Fe polypeptide of SEQ ID NO: 527 and SEQ ID NO: 507, respectively, may be co-expressed and purified from a CHO cell line to give rise to a single-arm heteromeric protein complex comprising CRIM2-Fc:Fc.
Purification of various CRIM2-Fc:Fc complexes could be achieved by a series of column chromatography steps, including, for example, three or more of the following, in any order: protein A chromatography, Q sepharose chromatography, phenylsepharose chromatography, size exclusion chromatography, and cation exchange chromatography. The purification could be completed with viral filtration and buffer exchange.
Applicants envision construction of a soluble single-arm BAMBI-Fc heterodimeric complex comprising a monomeric Fc polypeptide with a short N-terminal extension and a second polypeptide in which a ligand-binding domain of human BAMBI is fused to a separate Fc domain with a linker positioned between a ligand-binding domain and this second Fc domain. The individual constructs are referred to as monomeric Fc polypeptide and BAMBI-Fc fusion polypeptide, respectively, and the sequences for each are provided below.
Formation of a single-arm BAMBI-Fc heterodimer may be guided by approaches similar to those described for single-arm endoglin-Fc heterodimer in Example 1. In a first approach, illustrated in the BAMBI-Fc and monomeric Fc polypeptide sequences of SEQ ID NOs: 528-529 and 502-503, respectively, one Fc domain is altered to introduce cationic amino acids at the interaction face, while the other Fc domain is altered to introduce anionic amino acids at the interaction face.
The BAMBI-Fc fusion polypeptide employs the TPA leader and is as follows:
MDAMKRGLCC VLLLCGAVFV SPGASVLLTK GEIRCYCDAA HCVATGYMCK SELSACFSRL
The leader and linker sequences are underlined. To promote formation of the BAMBI-Fc:Fc heterodimer rather than either of the possible homodimeric complexes (BAMBI-Fc:BAMBI-Fc or Fc:Fc), two amino acid substitutions (replacing anionic residues with lysines) can be introduced into the Fc domain of the fusion polypeptide as indicated by double underline above. The amino acid sequence of SEQ ID NO: 528 may optionally be provided with the C-terminal lysine removed.
The mature BAMBI-Fc fusion polypeptide sequence is as follows (SEQ ID NO: 529) and may optionally be provided with the C-terminal lysine removed.
As described in Example 1, the complementary form of monomeric human G1Fc polypeptide (SEQ ID NO: 502) employs the TPA leader and incorporates an optional N-terminal extension. To promote formation of the BAMBI-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, two amino acid substitutions (replacing lysines with anionic residues) can be introduced into the monomeric Fc polypeptide as indicated. The amino acid sequence of SEQ ID NO: 502 may optionally be provided with the C-terminal lysine removed. The mature monomeric Fc polypeptide (SEQ ID NO: 503) may optionally be provided with the C-terminal lysine removed.
The BAMBI-Fc fusion polypeptide and monomeric Fc polypeptide of SEQ ID NO: 529 and SEQ ID NO: 503, respectively, may be co-expressed and purified from a CHO cell line to give rise to a single-arm heteromeric protein complex comprising BAMBI-Fc:Fc.
In another approach to promoting the formation of heteromultimer complexes using asymmetric Fc fusion polypeptides, the Fc domains are altered to introduce complementary hydrophobic interactions and an additional intermolecular disulfide bond as illustrated in the BAMBI-Fc and Fc polypeptide sequences of SEQ ID NOs: 530-531 and 506-507, respectively.
The BAMBI-Fc fusion polypeptide (SEQ ID NO: 530) uses the TPA leader and is as follows:
MDAMKRGLCC VLLLCGAVFV SPGASVLLTK GEIRCYCDAA HCVATGYMCK SELSACFSRL
The leader sequence and linker are underlined. To promote formation of the BAMBI-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, two amino acid substitutions (replacing a serine with a cysteine and a threonine with a tryptophan) can be introduced into the Fc domain of the BAMBI fusion polypeptide as indicated by double underline above. The amino acid sequence of SEQ ID NO: 530 may optionally be provided with the C-terminal lysine removed.
The mature BAMBI-Fc fusion polypeptide (SEQ ID NO: 531) is as follows and may optionally be provided with the C-terminal lysine removed.
As described in Example 1, the complementary form of monomeric G1Fc polypeptide (SEQ ID NO: 506) employs the TPA leader and incorporates an optional N-terminal extension. To promote formation of the BAMBI-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, four amino acid substitutions can be introduced into the monomeric Fc polypeptide as indicated. The amino acid sequence of SEQ ID NO: 506 and the mature Fc polypeptide (SEQ ID NO: 507) may optionally be provided with the C-terminal lysine removed.
The BAMBI-Fc fusion polypeptide and monomeric Fc polypeptide of SEQ ID NO: 531 and SEQ ID NO: 507, respectively, may be co-expressed and purified from a CHO cell line to give rise to a single-arm heteromeric protein complex comprising BAMBI-Fc:Fc.
Purification of various BAMBI-Fc:Fc complexes could be achieved by a series of column chromatography steps, including, for example, three or more of the following, in any order: protein A chromatography, Q sepharose chromatography, phenylsepharose chromatography, size exclusion chromatography, and cation exchange chromatography. The purification could be completed with viral filtration and buffer exchange.
Applicants envision construction of a soluble single-arm BMPER-Fc heterodimeric complex comprising a monomeric Fc polypeptide with a short N-terminal extension and a second polypeptide in which a ligand-binding domain of human BMPER is fused to a separate Fc domain with a linker positioned between a ligand-binding domain and this second Fc domain. The individual constructs are referred to as monomeric Fc polypeptide and BMPER-Fc fusion polypeptide, respectively, and the sequences for each are provided below.
Formation of a single-arm BMPER-Fc heterodimer may be guided by approaches similar to those described for single-arm endoglin-Fc heterodimer in Example 1. In a first approach, illustrated in the BMPER-Fc and monomeric Fc polypeptide sequences of SEQ ID NOs: 532-533 and 502-503, respectively, one Fc domain is altered to introduce cationic amino acids at the interaction face, while the other Fc domain is altered to introduce anionic amino acids at the interaction face.
The BMPER-Fc fusion polypeptide employs the TPA leader and is as follows:
MDAMKRGLCC VLLLCGAVFV SPGASSSFLT GSVAKCENEG EVLQIPFITD NPCIMCVCLN
The leader and linker sequences are underlined. To promote formation of the BMPER-Fc:Fc heterodimer rather than either of the possible homodimeric complexes (BMPER-Fc:BMPER-Fc or Fc:Fc), two amino acid substitutions (replacing anionic residues with lysines) can be introduced into the Fe domain of the fusion polypeptide as indicated by double underline above. The amino acid sequence of SEQ ID NO: 532 may optionally be provided with the C-terminal lysine removed.
The mature BMPER-Fc fusion polypeptide sequence is as follows (SEQ ID NO: 533) and may optionally be provided with the C-terminal lysine removed.
As described in Example 1, the complementary form of monomeric human G1Fe polypeptide (SEQ ID NO: 502) employs the TPA leader and incorporates an optional N-terminal extension. To promote formation of the BMPER-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, two amino acid substitutions (replacing lysines with anionic residues) can be introduced into the monomeric Fe polypeptide as indicated. The amino acid sequence of SEQ ID NO: 502 may optionally be provided with the C-terminal lysine removed. The mature monomeric Fe polypeptide (SEQ ID NO: 503) may optionally be provided with the C-terminal lysine removed.
The BMPER-Fc fusion polypeptide and monomeric Fc polypeptide of SEQ ID NO: 533 and SEQ ID NO: 503, respectively, may be co-expressed and purified from a CHO cell line to give rise to a single-arm heteromeric protein complex comprising BMPER-Fc:Fc.
In another approach to promoting the formation of heteromultimer complexes using asymmetric Fc fusion polypeptides, the Fc domains are altered to introduce complementary hydrophobic interactions and an additional intermolecular disulfide bond as illustrated in the BMPER-Fc and Fc polypeptide sequences of SEQ ID NOs: 534-535 and 506-507, respectively.
The BMPER-Fc fusion polypeptide (SEQ ID NO: 534) uses the TPA leader and is as follows:
MDAMKRGLCC VLLLCGAVFV SPGASSSFLT GSVAKCENEG EVLQIPFITD NPCIMCVCLN
The leader sequence and linker are underlined. To promote formation of the BMPER-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, two amino acid substitutions (replacing a serine with a cysteine and a threonine with a tryptophan) can be introduced into the Fc domain of the BMPER fusion polypeptide as indicated by double underline above. The amino acid sequence of SEQ ID NO: 534 may optionally be provided with the C-terminal lysine removed.
The mature BMPER-Fc fusion polypeptide (SEQ ID NO: 535) is as follows and may optionally be provided with the C-terminal lysine removed.
As described in Example 1, the complementary form of monomeric G1Fc polypeptide (SEQ ID NO: 506) employs the TPA leader and incorporates an optional N-terminal extension. To promote formation of the BMPER-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, four amino acid substitutions can be introduced into the monomeric Fc polypeptide as indicated. The amino acid sequence of SEQ ID NO: 506 and the mature Fc polypeptide (SEQ ID NO: 507) may optionally be provided with the C-terminal lysine removed.
The BMPER-Fc fusion polypeptide and monomeric Fc polypeptide of SEQ ID NO: 535 and SEQ ID NO: 507, respectively, may be co-expressed and purified from a CHO cell line to give rise to a single-arm heteromeric protein complex comprising BMPER-Fc:Fc.
Purification of various BMPER-Fc:Fc complexes could be achieved by a series of column chromatography steps, including, for example, three or more of the following, in any order: protein A chromatography, Q sepharose chromatography, phenylsepharose chromatography, size exclusion chromatography, and cation exchange chromatography. The purification could be completed with viral filtration and buffer exchange.
Applicants envision construction of a soluble single-arm RGMB-Fc heterodimeric complex comprising a monomeric Fc polypeptide with a short N-terminal extension and a second polypeptide in which a ligand-binding domain of human RGM-B is fused to a separate Fc domain with a linker positioned between a ligand-binding domain and this second Fc domain. The individual constructs are referred to as monomeric Fc polypeptide and RGMB-Fc fusion polypeptide, respectively, and the sequences for each are provided below.
Formation of a single-arm RGMB-Fc heterodimer may be guided by approaches similar to those described for single-arm endoglin-Fc heterodimer in Example 1. In a first approach, illustrated in the RGMB-Fc and monomeric Fc polypeptide sequences of SEQ ID NOs: 536-537 and 502-503, respectively, one Fc domain is altered to introduce cationic amino acids at the interaction face, while the other Fc domain is altered to introduce anionic amino acids at the interaction face.
The RGMB-Fc fusion polypeptide employs the TPA leader and is as follows:
MDAMKRGLCC VLLLCGAVFV SPGASGDCQQ PAQCRIQKCT TDFVSLTSHL NSAVDGFDSE
The leader and linker sequences are underlined. To promote formation of the RGMB-Fc:Fc heterodimer rather than either of the possible homodimeric complexes (RGMB-Fc:RGMB-Fc or Fc:Fc), two amino acid substitutions (replacing anionic residues with lysines) can be introduced into the Fc domain of the fusion polypeptide as indicated by double underline above. The amino acid sequence of SEQ ID NO: 536 may optionally be provided with the C-terminal lysine removed.
The mature RGMB-Fc fusion polypeptide sequence is as follows (SEQ ID NO: 537) and may optionally be provided with the C-terminal lysine removed.
As described in Example 1, the complementary form of monomeric human G1Fc polypeptide (SEQ ID NO: 502) employs the TPA leader and incorporates an optional N-terminal extension. To promote formation of the RGMB-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, two amino acid substitutions (replacing lysines with anionic residues) can be introduced into the monomeric Fc polypeptide as indicated. The amino acid sequence of SEQ ID NO: 502 may optionally be provided with the C-terminal lysine removed. The mature monomeric Fc polypeptide (SEQ ID NO: 503) may optionally be provided with the C-terminal lysine removed.
The RGMB-Fc fusion polypeptide and monomeric Fc polypeptide of SEQ ID NO: 537 and SEQ ID NO: 503, respectively, may be co-expressed and purified from a CHO cell line to give rise to a single-arm heteromeric protein complex comprising RGMB-Fc:Fc.
In another approach to promoting the formation of heteromultimer complexes using asymmetric Fc fusion polypeptides, the Fc domains are altered to introduce complementary hydrophobic interactions and an additional intermolecular disulfide bond as illustrated in the RGMB-Fc and Fc polypeptide sequences of SEQ ID NOs: 538-539 and 506-507, respectively.
The RGMB-Fc fusion polypeptide (SEQ ID NO: 538) uses the TPA leader and is as follows:
MDAMKRGLCC VLLLCGAVFV SPGASGDCQQ PAQCRIQKCT TDFVSLTSHL NSAVDGFDSE
The leader sequence and linker are underlined. To promote formation of the RGMB-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, two amino acid substitutions (replacing a serine with a cysteine and a threonine with a tryptophan) can be introduced into the Fc domain of the RGMB fusion polypeptide as indicated by double underline above. The amino acid sequence of SEQ ID NO: 538 may optionally be provided with the C-terminal lysine removed.
The mature RGMB-Fc fusion polypeptide (SEQ ID NO: 539) is as follows and may optionally be provided with the C-terminal lysine removed.
As described in Example 1, the complementary form of monomeric G1Fc polypeptide (SEQ ID NO: 506) employs the TPA leader and incorporates an optional N-terminal extension. To promote formation of the RGMB-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, four amino acid substitutions can be introduced into the monomeric Fc polypeptide as indicated. The amino acid sequence of SEQ ID NO: 506 and the mature Fc polypeptide (SEQ ID NO: 507) may optionally be provided with the C-terminal lysine removed.
The RGMB-Fc fusion polypeptide and monomeric Fc polypeptide of SEQ ID NO: 539 and SEQ ID NO: 507, respectively, may be co-expressed and purified from a CHO cell line to give rise to a single-arm heteromeric protein complex comprising RGMB-Fc:Fc.
Purification of various RGMB-Fc:Fc complexes could be achieved by a series of column chromatography steps, including, for example, three or more of the following, in any order: protein A chromatography, Q sepharose chromatography, phenylsepharose chromatography, size exclusion chromatography, and cation exchange chromatography. The purification could be completed with viral filtration and buffer exchange.
Applicants envision construction of a soluble single-arm RGMA-Fc heterodimeric complex comprising a monomeric Fc polypeptide with a short N-terminal extension and a second polypeptide in which a ligand-binding domain of human RGM-A is fused to a separate Fc domain with a linker positioned between a ligand-binding domain and this second Fc domain. The individual constructs are referred to as monomeric Fc polypeptide and RGMA-Fc fusion polypeptide, respectively, and the sequences for each are provided below. Applicants also envision additional single-arm RGMA-Fc heterodimeric complexes comprising a ligand-binding domain of RGM-A isoforms 2 or 3 (SEQ ID NOs: 66 or 70).
Formation of a single-arm RGMA-Fc heterodimer may be guided by approaches similar to those described for single-arm endoglin-Fc heterodimer in Example 1. In a first approach, illustrated in the RGMA-Fc and monomeric Fc polypeptide sequences of SEQ ID NOs: 540-541 and 502-503, respectively, one Fc domain is altered to introduce cationic amino acids at the interaction face, while the other Fc domain is altered to introduce anionic amino acids at the interaction face.
The RGMA-Fc fusion polypeptide employs the TPA leader and is as follows:
MDAMKRGLCC VLLLCGAVFV SPGASCKILK CNSEFWSATS GSHAPASDDT PEFCAALRSY
The leader and linker sequences are underlined. To promote formation of the RGMA-Fc:Fc heterodimer rather than either of the possible homodimeric complexes (RGMA-Fc:RGMA-Fc or Fc:Fc), two amino acid substitutions (replacing anionic residues with lysines) can be introduced into the Fc domain of the fusion polypeptide as indicated by double underline above. The amino acid sequence of SEQ ID NO: 540 may optionally be provided with the C-terminal lysine removed.
The mature RGMA-Fc fusion polypeptide sequence is as follows (SEQ ID NO: 541) and may optionally be provided with the C-terminal lysine removed.
As described in Example 1, the complementary form of monomeric human G1Fc polypeptide (SEQ ID NO: 502) employs the TPA leader and incorporates an optional N-terminal extension. To promote formation of the RGMA-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, two amino acid substitutions (replacing lysines with anionic residues) can be introduced into the monomeric Fc polypeptide as indicated. The amino acid sequence of SEQ ID NO: 502 may optionally be provided with the C-terminal lysine removed. The mature monomeric Fc polypeptide (SEQ ID NO: 503) may optionally be provided with the C-terminal lysine removed.
The RGMA-Fc fusion polypeptide and monomeric Fc polypeptide of SEQ ID NO: 541 and SEQ ID NO: 503, respectively, may be co-expressed and purified from a CHO cell line to give rise to a single-arm heteromeric protein complex comprising RGMA-Fc:Fc.
In another approach to promoting the formation of heteromultimer complexes using asymmetric Fc fusion polypeptides, the Fc domains are altered to introduce complementary hydrophobic interactions and an additional intermolecular disulfide bond as illustrated in the RGMA-Fc and Fc polypeptide sequences of SEQ ID NOs: 542-543 and 506-507, respectively.
The RGMA-Fc fusion polypeptide (SEQ ID NO: 542) uses the TPA leader and is as follows:
MDAMKRGLCC VLLLCGAVFV SPGASCKILK CNSEFWSATS GSHAPASDDT PEFCAALRSY
The leader sequence and linker are underlined. To promote formation of the RGMA-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, two amino acid substitutions (replacing a serine with a cysteine and a threonine with a tryptophan) can be introduced into the Fc domain of the RGMA fusion polypeptide as indicated by double underline above. The amino acid sequence of SEQ ID NO: 542 may optionally be provided with the C-terminal lysine removed.
The mature RGMA-Fc fusion polypeptide (SEQ ID NO: 543) is as follows and may optionally be provided with the C-terminal lysine removed.
As described in Example 1, the complementary form of monomeric G1Fc polypeptide (SEQ ID NO: 506) employs the TPA leader and incorporates an optional N-terminal extension. To promote formation of the RGMA-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, four amino acid substitutions can be introduced into the monomeric Fc polypeptide as indicated. The amino acid sequence of SEQ ID NO: 506 and the mature Fc polypeptide (SEQ ID NO: 507) may optionally be provided with the C-terminal lysine removed.
The RGMA-Fc fusion polypeptide and monomeric Fc polypeptide of SEQ ID NO: 543 and SEQ ID NO: 507, respectively, may be co-expressed and purified from a CHO cell line to give rise to a single-arm heteromeric protein complex comprising RGMA-Fc:Fc.
Purification of various RGMA-Fc:Fc complexes could be achieved by a series of column chromatography steps, including, for example, three or more of the following, in any order: protein A chromatography, Q sepharose chromatography, phenylsepharose chromatography, size exclusion chromatography, and cation exchange chromatography. The purification could be completed with viral filtration and buffer exchange.
Applicants envision construction of a soluble single-arm HEMO-Fc heterodimeric complex comprising a monomeric Fc polypeptide with a short N-terminal extension and a second polypeptide in which a ligand-binding domain of human hemojuvelin is fused to a separate Fc domain with a linker positioned between a ligand-binding domain and this second Fc domain. The individual constructs are referred to as monomeric Fc polypeptide and HEMO-Fc fusion polypeptide, respectively, and the sequences for each are provided below. Applicants also envision additional single-arm HEMO-Fc heterodimeric complexes comprising a ligand-binding domain of hemojuvelin isoforms 2 or 3 (SEQ ID NOs: 78 or 82).
Formation of a single-arm HEMO-Fc heterodimer may be guided by approaches similar to those described for single-arm endoglin-Fc heterodimer in Example 1. In a first approach, illustrated in the HEMO-Fc and monomeric Fc polypeptide sequences of SEQ ID NOs: 544-545 and 502-503, respectively, one Fc domain is altered to introduce cationic amino acids at the interaction face, while the other Fc domain is altered to introduce anionic amino acids at the interaction face.
The HEMO-Fc fusion polypeptide employs the TPA leader and is as follows:
MDAMKRGLCC VLLLCGAVFV SPGASQCKIL RCNAEYVSST LSLRGGGSSG ALRGGGGGGR
The leader and linker sequences are underlined. To promote formation of the HEMO-Fc:Fc heterodimer rather than either of the possible homodimeric complexes (HEMO-Fc:HEMO-Fc or Fc:Fc), two amino acid substitutions (replacing anionic residues with lysines) can be introduced into the Fe domain of the fusion polypeptide as indicated by double underline above. The amino acid sequence of SEQ ID NO: 544 may optionally be provided with the C-terminal lysine removed.
The mature HEMO-Fc fusion polypeptide sequence is as follows (SEQ ID NO: 545) and may optionally be provided with the C-terminal lysine removed.
As described in Example 1, the complementary form of monomeric human G1Fc polypeptide (SEQ ID NO: 502) employs the TPA leader and incorporates an optional N-terminal extension. To promote formation of the HEMO-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, two amino acid substitutions (replacing lysines with anionic residues) can be introduced into the monomeric Fc polypeptide as indicated. The amino acid sequence of SEQ ID NO: 502 may optionally be provided with the C-terminal lysine removed. The mature monomeric Fc polypeptide (SEQ ID NO: 503) may optionally be provided with the C-terminal lysine removed.
The HEMO-Fc fusion polypeptide and monomeric Fc polypeptide of SEQ ID NO: 545 and SEQ ID NO: 503, respectively, may be co-expressed and purified from a CHO cell line to give rise to a single-arm heteromeric protein complex comprising HEMO-Fc:Fc.
In another approach to promoting the formation of heteromultimer complexes using asymmetric Fc fusion polypeptides, the Fc domains are altered to introduce complementary hydrophobic interactions and an additional intermolecular disulfide bond as illustrated in the HEMO-Fc and Fc polypeptide sequences of SEQ ID NOs: 546-547 and 506-507, respectively.
The HEMO-Fc fusion polypeptide (SEQ ID NO: 546) uses the TPA leader and is as follows:
MDAMKRGLCC VLLLCGAVFV SPGASQCKIL RCNAEYVSST LSLRGGGSSG ALRGGGGGGR
The leader sequence and linker are underlined. To promote formation of the HEMO-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, two amino acid substitutions (replacing a serine with a cysteine and a threonine with a tryptophan) can be introduced into the Fc domain of the hemojuvelin fusion polypeptide as indicated by double underline above. The amino acid sequence of SEQ ID NO: 546 may optionally be provided with the C-terminal lysine removed.
The mature HEMO-Fc fusion polypeptide (SEQ ID NO: 547) is as follows and may optionally be provided with the C-terminal lysine removed.
As described in Example 1, the complementary form of monomeric G1Fc polypeptide (SEQ ID NO: 506) employs the TPA leader and incorporates an optional N-terminal extension. To promote formation of the HEMO-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, four amino acid substitutions can be introduced into the monomeric Fc polypeptide as indicated. The amino acid sequence of SEQ ID NO: 506 and the mature Fc polypeptide (SEQ ID NO: 507) may optionally be provided with the C-terminal lysine removed.
The HEMO-Fc fusion polypeptide and monomeric Fc polypeptide of SEQ ID NO: 547 and SEQ ID NO: 507, respectively, may be co-expressed and purified from a CHO cell line to give rise to a single-arm heteromeric protein complex comprising HEMO-Fc:Fc.
Purification of various HEMO-Fc:Fc complexes could be achieved by a series of column chromatography steps, including, for example, three or more of the following, in any order: protein A chromatography, Q sepharose chromatography, phenylsepharose chromatography, size exclusion chromatography, and cation exchange chromatography. The purification could be completed with viral filtration and buffer exchange.
Applicants envision construction of a soluble single-arm BG-Fc heterodimeric complex comprising a monomeric Fc polypeptide with a short N-terminal extension and a second polypeptide in which a ligand-binding domain of human betaglycan is fused to a separate Fc domain with a linker positioned between a ligand-binding domain and this second Fc domain. The individual constructs are referred to as monomeric Fc polypeptide and BG-Fc fusion polypeptide, respectively, and the sequences for each are provided below. Applicants also envision additional single-arm BG-Fc heterodimeric complexes comprising a ligand-binding domain of betaglycan isoform 2 (SEQ ID NO: 90).
Formation of a single-arm BG-Fc heterodimer may be guided by approaches similar to those described for single-arm endoglin-Fc heterodimer in Example 1. In a first approach, illustrated in the BG-Fc and monomeric Fc polypeptide sequences of SEQ ID NOs: 548-549 and 502-503, respectively, one Fc domain is altered to introduce cationic amino acids at the interaction face, while the other Fc domain is altered to introduce anionic amino acids at the interaction face.
The BG-Fc fusion polypeptide employs the TPA leader and is as follows:
MDAMKRGLCC VLLLCGAVFV SPGASGPEPG ALCELSPVSA SHPVQALMES FTVLSGCASR
The leader and linker sequences are underlined. To promote formation of the BG-Fc:Fc heterodimer rather than either of the possible homodimeric complexes (BG-Fc:BG-Fc or Fc:Fc), two amino acid substitutions (replacing anionic residues with lysines) can be introduced into the Fc domain of the fusion polypeptide as indicated by double underline above. The amino acid sequence of SEQ ID NO: 548 may optionally be provided with the C-terminal lysine removed.
The mature BG-Fc fusion polypeptide sequence is as follows (SEQ ID NO: 549) and may optionally be provided with the C-terminal lysine removed.
As described in Example 1, the complementary form of monomeric human G1Fc polypeptide (SEQ ID NO: 502) employs the TPA leader and incorporates an optional N-terminal extension. To promote formation of the BG-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, two amino acid substitutions (replacing lysines with anionic residues) can be introduced into the monomeric Fc polypeptide as indicated. The amino acid sequence of SEQ ID NO: 502 may optionally be provided with the C-terminal lysine removed. The mature monomeric Fc polypeptide (SEQ ID NO: 503) may optionally be provided with the C-terminal lysine removed.
The BG-Fc fusion polypeptide and monomeric Fc polypeptide of SEQ ID NO: 549 and SEQ ID NO: 503, respectively, may be co-expressed and purified from a CHO cell line to give rise to a single-arm heteromeric protein complex comprising BG-Fc:Fc.
In another approach to promoting the formation of heteromultimer complexes using asymmetric Fc fusion polypeptides, the Fc domains are altered to introduce complementary hydrophobic interactions and an additional intermolecular disulfide bond as illustrated in the BG-Fc and Fc polypeptide sequences of SEQ ID NOs: 550-551 and 506-507, respectively.
The BG-Fc fusion polypeptide (SEQ ID NO: 550) uses the TPA leader and is as follows:
MDAMKRGLCC VLLLCGAVFV SPGASGPEPG ALCELSPVSA SHPVQALMES FTVLSGCASR
The leader sequence and linker are underlined. To promote formation of the BG-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, two amino acid substitutions (replacing a serine with a cysteine and a threonine with a tryptophan) can be introduced into the Fe domain of the betaglycan fusion polypeptide as indicated by double underline above. The amino acid sequence of SEQ ID NO: 550 may optionally be provided with the C-terminal lysine removed.
The mature BG-Fc fusion polypeptide (SEQ ID NO: 551) is as follows and may optionally be provided with the C-terminal lysine removed.
As described in Example 1, the complementary form of monomeric G1Fc polypeptide (SEQ ID NO: 506) employs the TPA leader and incorporates an optional N-terminal extension. To promote formation of the BG-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, four amino acid substitutions can be introduced into the monomeric Fc polypeptide as indicated. The amino acid sequence of SEQ ID NO: 506 and the mature Fc polypeptide (SEQ ID NO: 507) may optionally be provided with the C-terminal lysine removed.
The BG-Fc fusion polypeptide and monomeric Fc polypeptide of SEQ ID NO: 551 and SEQ ID NO: 507, respectively, may be co-expressed and purified from a CHO cell line to give rise to a single-arm heteromeric protein complex comprising BG-Fc:Fc.
Purification of various BG-Fc:Fc complexes could be achieved by a series of column chromatography steps, including, for example, three or more of the following, in any order: protein A chromatography, Q sepharose chromatography, phenylsepharose chromatography, size exclusion chromatography, and cation exchange chromatography. The purification could be completed with viral filtration and buffer exchange.
Applicants envision construction of a soluble single-arm MUSK-Fc heterodimeric complex comprising a monomeric Fc polypeptide with a short N-terminal extension and a second polypeptide in which a ligand-binding domain of human MuSK is fused to a separate Fc domain with a linker positioned between a ligand-binding domain and this second Fc domain. The individual constructs are referred to as monomeric Fc polypeptide and MUSK-Fc fusion polypeptide, respectively, and the sequences for each are provided below. Applicants also envision additional single-arm MUSK-Fc heterodimeric complexes comprising a ligand-binding domain of MuSK isoforms 2 or 3 (SEQ ID NO: 100 or 104).
Formation of a single-arm MUSK-Fc heterodimer may be guided by approaches similar to those described for single-arm endoglin-Fc heterodimer in Example 1. In a first approach, illustrated in the MUSK-Fc and monomeric Fc polypeptide sequences of SEQ ID NOs: 552-553 and 502-503, respectively, one Fc domain is altered to introduce cationic amino acids at the interaction face, while the other Fc domain is altered to introduce anionic amino acids at the interaction face.
The MUSK-Fc fusion polypeptide employs the TPA leader and is as follows:
MDAMKRGLCC VLLLCGAVFV SPGASGTEKL PKAPVITTPL ETVDALVEEV ATFMCAVESY
The leader and linker sequences are underlined. To promote formation of the MUSK-Fc:Fc heterodimer rather than either of the possible homodimeric complexes (MUSK-Fc:MUSK-Fc or Fc:Fc), two amino acid substitutions (replacing anionic residues with lysines) can be introduced into the Fc domain of the fusion polypeptide as indicated by double underline above. The amino acid sequence of SEQ ID NO: 552 may optionally be provided with the C-terminal lysine removed.
The mature MUSK-Fc fusion polypeptide sequence is as follows (SEQ ID NO: 553) and may optionally be provided with the C-terminal lysine removed.
As described in Example 1, the complementary form of monomeric human G1Fc polypeptide (SEQ ID NO: 502) employs the TPA leader and incorporates an optional N-terminal extension. To promote formation of the MUSK-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, two amino acid substitutions (replacing lysines with anionic residues) can be introduced into the monomeric Fc polypeptide as indicated. The amino acid sequence of SEQ ID NO: 502 may optionally be provided with the C-terminal lysine removed. The mature monomeric Fc polypeptide (SEQ ID NO: 503) may optionally be provided with the C-terminal lysine removed.
The MUSK-Fc fusion polypeptide and monomeric Fc polypeptide of SEQ ID NO: 553 and SEQ ID NO: 503, respectively, may be co-expressed and purified from a CHO cell line to give rise to a single-arm heteromeric protein complex comprising MUSK-Fc:Fc.
In another approach to promoting the formation of heteromultimer complexes using asymmetric Fc fusion polypeptides, the Fc domains are altered to introduce complementary hydrophobic interactions and an additional intermolecular disulfide bond as illustrated in the MUSK-Fc and Fc polypeptide sequences of SEQ ID NOs: 554-555 and 506-507, respectively.
The MUSK-Fc fusion polypeptide (SEQ ID NO: 554) uses the TPA leader and is as follows:
MDAMKRGLCC VLLLCGAVFV SPGASGTEKL PKAPVITTPL ETVDALVEEV ATFMCAVESY
The leader sequence and linker are underlined. To promote formation of the MUSK-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, two amino acid substitutions (replacing a serine with a cysteine and a threonine with a tryptophan) can be introduced into the Fc domain of the MuSK fusion polypeptide as indicated by double underline above. The amino acid sequence of SEQ ID NO: 554 may optionally be provided with the C-terminal lysine removed.
The mature MUSK-Fc fusion polypeptide (SEQ ID NO: 555) is as follows and may optionally be provided with the C-terminal lysine removed.
As described in Example 1, the complementary form of monomeric G1Fc polypeptide (SEQ ID NO: 506) employs the TPA leader and incorporates an optional N-terminal extension. To promote formation of the MUSK-Fc:Fc heterodimer rather than either of the possible homodimeric complexes, four amino acid substitutions can be introduced into the monomeric Fc polypeptide as indicated. The amino acid sequence of SEQ ID NO: 506 and the mature Fc polypeptide (SEQ ID NO: 507) may optionally be provided with the C-terminal lysine removed.
The MUSK-Fc fusion polypeptide and monomeric Fe polypeptide of SEQ ID NO: 555 and SEQ ID NO: 507, respectively, may be co-expressed and purified from a CHO cell line to give rise to a single-arm heteromeric protein complex comprising MUSK-Fc:Fc.
Purification of various MUSK-Fc:Fc complexes could be achieved by a series of column chromatography steps, including, for example, three or more of the following, in any order: protein A chromatography, Q sepharose chromatography, phenylsepharose chromatography, size exclusion chromatography, and cation exchange chromatography. The purification could be completed with viral filtration and buffer exchange.
This application claims the benefit of priority to U.S. Provisional Patent Application No. 62/613,340, filed Jan. 3, 2018, which application is hereby incorporated by reference in its entirety.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2019/012020 | 1/2/2019 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62613340 | Jan 2018 | US |