COMPOSITIONS AND METHODS FOR THE TREATMENT OF HUMAN IMMUNODEFICIENCY VIRUS

Information

  • Patent Application
  • 20230079107
  • Publication Number
    20230079107
  • Date Filed
    June 12, 2020
    4 years ago
  • Date Published
    March 16, 2023
    a year ago
Abstract
Compositions and methods for the treatment of viral infections include conjugates containing inhibitors of viral gp120 receptor (e.g., temsavir, BMS-818251, DMJ-ll-121, BNM-IV-147, or analogs thereof) linked to an Fc monomer, an Fc domain, and Fc-binding peptide, an albumin protein, or albumin-binding peptide. In particular, conjugates can be used in the treatment of viral infections (e.g., HIV infections).
Description
BACKGROUND

The need for novel antiviral treatments for human immunodeficiency virus (HIV) is significant and especially critical in the medical field. Since the first case of HIV was identified over 30 years ago, 78 million people have become infected with HIV and 35 million have died from acquired immune deficiency syndrome (AIDS)-related illnesses. Currently, 36.9 million people worldwide are living with HIV or AIDS, including 1.8 million children under the age of 15. An estimated 1.8 million individuals worldwide became newly infected with HIV in 2017.


The development of antiviral treatments for HIV has been a continuing challenge. Since the first U.S. FDA approved anti-HIV drugs in 1987, a series of antiretroviral therapies have been developed. However, drug-resistant strains have emerged limiting the number of patients that can use these antiretroviral therapies.


HIV antiviral inhibitors come in many several classes targeting distinct steps of the HIV cycle. One class of antivirals, nucleoside reverse transcriptase inhibitors (NRTIs) inhibit viral replication by chain termination after being incorporated into growing DNA strands by HIV reverse transcriptase. Another class, non-nucleoside reverse transcription inhibitors (NNRTIs), similarly target reverse transcription, however at a different site than nucleoside reverse transcription inhibitors. A different class of antivirals, integrase inhibitors inhibit viral DNA insertion into the host cellular genome. Protease inhibitors are agents that inhibit the protease enzyme, a key enzyme in the assembly of new virus particles. One class of antivirals, known as viral entry inhibitors, contains agents that interfere in viral entry into the cell by binding to HIV envelope (Env) glycoprotein. In particular, viral entry inhibitors target the surface subunit gp120 receptor of the HIV virus.


However, many of these agents are secreted or cleared by the kidney, requiring dose adjustments in those with compromised kidney function, and they have drug-drug interactions that may increase the effect of adverse reactions, particularly in HIV positive individuals undergoing organ transplant. Furthermore, many of these agents have been shown to be directly nephrotoxic, inducing a variety of kidney disorders. New, more effective therapies for treating HIV are needed.


SUMMARY

The disclosure relates to conjugates, compositions, and methods for inhibiting viral growth, and methods for the treatment of viral infections. In particular, such conjugates contain monomers or dimers of a moiety that inhibits human immunodeficiency virus, for example by binding to the gp120 glycoprotein (e.g., a gp120 binder such as temsavir, BMS-818251, DMJ-II-121, BNM-IV-147, or analogs thereof), conjugated to Fc monomers, Fc domains, Fc-binding peptides, albumin proteins, or albumin protein-binding peptides. In preferred embodiments, the HIV targeting moiety (e.g., temsavir, BMS-818251, DMJ-II-121, BNM-IV-147, or analogs thereof) in the conjugate targets a protein encoded by the HIV Env gene, in particular gp120 glycoprotein on the surface of the viral particle, thereby preventing viral attachment to the host CD4+ T cell and entry into the host immune cell. The Fc monomers or Fc domains in the conjugates bind to FcγRs (e.g., FcRn, FcγRI, FcγRIIa, FcγRIIc, FcγRIIIa, and FcγRIIIb) on immune cells, e.g., neutrophils, to activate phagocytosis and effector functions, such as antibody-dependent cell-mediated cytotoxicity (ADCC), thus leading to the engulfment and destruction of viral particles by immune cells and further enhancing the antiviral activity of the conjugates. The albumin or albumin-binding peptide may extend the half-life of the conjugate, for example, by binding of albumin to the recycling neonatal Fc receptor. Such compositions are useful in methods for the inhibition of viral growth and in methods for the treatment of viral infections, such as those caused by an HIV-1 and HIV-2.


In a first aspect, the invention features a conjugate described by any one of formulas (D-I), (M-I), (1), or (2):




embedded image


wherein each A1 and each A2 is independently described by formula (A-I) or (A-II):




embedded image


wherein Q is selected from the group consisting of:




text missing or illegible when filed


S is selected from the group consisting of:




embedded image


R1, R2, R3, are each independently selected from H, OH, halogen, nitrile, nitro, optionally substituted amine, optionally substituted sulfhydryl, optionally substituted carboxyl, optionally substituted C1-C20 alkyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C3-C20 cycloalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C5-C20 aryl, optionally substituted C3-C15 heteroaryl, and optionally substituted C1-C20 alkoxy;


R4 is selected from optionally substituted C1-C20 alkyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, optionally substituted C3-C15 heteroaryl, and a bond;


R5 is selected from H or optionally substituted C1-C6 alkyl;


R6 is selected from optionally substituted C1-C20 alkyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and optionally substituted C3-C15 heteroaryl;


R7 and Y are each independently selected from




embedded image


embedded image


embedded image


each R8 is independently selected from H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 alkylene, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and optionally substituted C2-C15 heteroaryl;


each R9 is independently selected from optionally substituted C1-C20 alkylene, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and optionally substituted C2-C15 heteroaryl;


x is 1 or 2;


k is 0, 1, 2, 3, 4, or 5;


Ar is selected from the group consisting of optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and optionally substituted C3-C15 heteroaryl;


n is 1 or 2;


In some embodiments, n is 1 and each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95), an albumin protein (e.g., an albumin protein having the sequence of any one of SEQ ID NOs: 96-98), an albumin protein-binding peptide, or an Fc-binding peptide;


In some embodiments, n is 2 and each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95), wherein the Fc domain monomers dimerize to form and Fc domain;


L is a linker covalently attached to each E and to each Y of each A1 and/or A2;


T is an integer from 1 to 20 (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), and each squiggly line in formulas (D-I), (M-I), (1), or (2) indicates that L is covalently attached (e.g., by way of a covalent bond or linker) to each E; or a pharmaceutically acceptable salt thereof. When T is greater than 1 (e.g., T is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), each A1-L or each A1-L-A2 may be independently selected (e.g., independently selected from any of the A1-L or A1-L-A2 structures described herein).


In preferred embodiments of any of the aspects described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)), when A1 and/or A2 are selected from a structure described by (A-I), x is 2.


In preferred embodiments of any of the aspects described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)), when A1 and/or A2 are selected from a structure described by (A-II), x is 2.


In preferred embodiments of any of the aspects described herein, n is 2 and each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95). In a conjugate having two Fc domain monomers (e.g., a conjugate of formula (1), formula (2), formula (D-I) where n equals 2, or (M-I) where n equals 2), the Fc domain monomers dimerize to form an Fc domain.


In certain embodiments, each A1 and each A2 is independently described by any one of formulas (A-Ia)-(A-Ih):




embedded image


embedded image


wherein each X is independently C or N;


or a pharmaceutically acceptable salt thereof.


In further embodiments, each A1 and each A2 is independently described by any one of formulas (A-Ia-i)-(A-Ih-i):




embedded image


embedded image


or a pharmaceutically acceptable salt thereof.

    • In some embodiments, each A1 and each A2 is independently described by any one of formulas (A-Ii)-(A-Ip):




embedded image


embedded image


wherein each X is independently C or N;


or a pharmaceutically acceptable salt thereof.


In another embodiment, each A1 and each A2 is independently described by any one of formulas (A-Iq)-(A-Ix):




embedded image


embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, each A1 and each A2 is independently described by any one of formulas (A-Iq-i)-(A-Ix-i)




embedded image


embedded image


or a pharmaceutically acceptable salt thereof.

    • In yet another embodiment, each A1 and each A2 is independently described by any one of formulas (A-Iaa)-(A-Ihh):




embedded image


embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, each A1 and each A2 is independently described by any one of formulas (A-Iii)-(A-Ipp):




embedded image


embedded image


wherein each X is independently C or N;


or a pharmaceutically acceptable salt thereof.


In another embodiment, each A1 and each A2 is independently described by any one of formulas (A-Iii-i)-(A-Ipp-i):




embedded image


embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, R1 is H. In certain embodiments, R2 is H. In preferred embodiments, R2 is —OCH3. In particular embodiments, R3 is H. In another embodiment, R4 is H. In some embodiments, R5 is H. In particular embodiments, R7 is a carbonyl. In certain embodiments, X is N. In other embodiments, X is C.


In certain embodiments, each A1 and each A2 is independently described by any one of formulas (A-IIa)-(A-IId):




embedded image


or a pharmaceutically acceptable salt thereof.


In another embodiment, each A1 and each A2 is independently described by any one of formulas (A-IIa-i)-(A-IId-i):




embedded image


wherein U5 is C1-C10 alkyl;


or a pharmaceutically acceptable salt thereof.


In another aspect, the invention features a conjugate described by formula (D-I):




embedded image


wherein each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95); L in each A1-L-A2 is a linker covalently attached to a sulfur atom of a hinge cysteine in E and to each of A1 and A2; n is 1 or 2 (e.g., when n is 2, the two Fc domain monomers dimerize to form and Fc domain); T is an integer from 1 to 20 (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), and the squiggly line connected to the E indicates that each A1-L-A2 is covalently attached (e.g., by way of a covalent bond or linker) to a sulfur atom of a hinge cysteine in E, or a pharmaceutically acceptable salt thereof. When T is greater than 1 (e.g., T is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), each A1-L-A2 may be independently selected (e.g., independently selected from any of the A1-L-A2 structures described herein).


In another aspect, the invention features a conjugate described by formula (D-I):




embedded image


wherein each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95); L in each A1-L-A2 is a linker covalently attached to a nitrogen atom of a surface exposed lysine in E and to each of A1 and A2; n is 1 or 2 (e.g., when n is 2, the two Fc domain monomers dimerize to form and Fc domain); T is an integer from 1 to 20 (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), and the squiggly line connected to the E indicates that each A1-L-A2 is covalently attached (e.g., by way of a covalent bond or linker) to the nitrogen atom of a surface exposed lysine in E, or a pharmaceutically acceptable salt thereof. When T is greater than 1 (e.g., T is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), each A1-L-A2 may be independently selected (e.g., independently selected from any of the A1-L-A2 structures described herein).


In another aspect, the invention features a conjugate described by formula (M-I):




embedded image


wherein each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95); L in each L-A1 is a linker covalently attached to a sulfur atom of a hinge cysteine in E and to A1; n is 1 or 2 (e.g., when n is 2, the two Fc domain monomers dimerize to form and Fc domain); T is an integer from 1 to 20 (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20); and the squiggly line connected to E indicates that each L-A1 is covalently attached (e.g., by way of a covalent bond or linker) to the sulfur atom of the hinge cysteine in E, or a pharmaceutically acceptable salt thereof. When T is greater than 1 (e.g., T is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), each A1 may be independently selected from any structure described by formula (A-I).


In another aspect, the invention features a conjugate described by formula (M-I):




embedded image


wherein each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95); L in each L-A1 is a linker covalently attached to a nitrogen atom of a surface exposed lysine in E and to A1; n is 1 or 2 (e.g., when n is 2, the two Fc domain monomers dimerize to form and Fc domain); T is an integer from 1 to 20 (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), the squiggly line connected to E indicates that each L-A1 is covalently attached (e.g., by way of a covalent bond or linker) to the nitrogen atom of a surface exposed lysine in E, or a pharmaceutically acceptable salt thereof. When T is greater than 1 (e.g., T is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), each A1 may be independently selected from any structure described by formula (A-I).


In another aspect, the invention features a conjugate described by formula (1):




embedded image


wherein each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95); L in each A1-L-A2 is a linker covalently attached to a sulfur atom of a hinge cysteine in each E and to each of A1 and A2; T is an integer from 1 to 20 (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), and the two squiggly lines connected to the two Es indicate that each A1-L-A2 is covalently attached (e.g., by way of a covalent bond or linker) to a pair of sulfur atoms of two hinge cysteines in the two Es, or a pharmaceutically acceptable salt thereof. When T is greater than 1 (e.g., T is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), each A1-L-A2 may be independently selected (e.g., independently selected from any of the A1-L-A2 structures described herein).


In another aspect, the invention features a conjugate described by formula (2):




embedded image


wherein each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95); L in each L-A1 is a linker covalently attached to a sulfur atom in a hinge cysteine in E and to A1; T is an integer from 1 to 20 (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), and the two squiggly lines connected to the two sulfur atoms indicate that each L-A1 is covalently attached to a pair of sulfur atoms of two hinge cysteines in the two Es, or a pharmaceutically acceptable salt thereof. When T is greater than 1 (e.g., T is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), each A may be independently selected from a structure described by formula (A-I).


In some embodiments of any of the aspects described herein, each E includes an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95.


In some embodiments, at least one of the pair of sulfur atoms is the sulfur atom corresponding to (e.g., the sulfur atom of) a hinge cysteine of SEQ ID NO: 10 or SEQ ID NO: 11, i.e., Cys10, Cys13, Cys16, or Cys18 of SEQ ID NO: 10 or SEQ ID NO: 11. In some embodiments, the pair of sulfur atoms are the sulfur atoms corresponding to (e.g., the sulfur atoms of) Cys10 and Cys13 in SEQ ID NO: 10 or SEQ ID NO: 11, Cys10 and Cys16 in SEQ ID NO: 10 or SEQ ID NO: 11, Cys 30 and Cys18 in SEQ ID NO: 10 or SEQ ID NO: 11, Cys13 and Cys 36 in SEQ ID NO: 10 or SEQ ID NO: 11, Cys13 and Cys 38 in SEQ ID NO: 10 or SEQ ID NO: 11, and/or Cys 36 and Cys 38 in SEQ ID NO: 10 or SEQ ID NO: 11.


In some embodiments, when T is 2, the pair of sulfur atoms are (e.g., the sulfur atoms corresponding to) Cys10 and Cys13 in SEQ ID NO: 10 or SEQ ID NO: 11 or Cys 36 and Cys 38 in SEQ ID NO: 10 or SEQ ID NO: 11.


In some embodiments, the pair of sulfur atoms include one sulfur atom of a cysteine from each E, i.e., L-A1 along with the sulfur atoms to which it is attached forms a bridge between two Fc domains (e.g., two Fc domains comprising the sequence of SEQ ID NO: 10 or SEQ ID NO: 11). In some embodiments, the pair of sulfur atoms are the sulfur atom corresponding to (e.g., the sulfur atom of) Cys10 of SEQ ID NO: 10 or SEQ ID NO: 11 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys10 of SEQ ID NO: 10 or SEQ ID NO: 11 from another E. In some embodiments, the pair of sulfur atoms are the sulfur atom corresponding to (e.g., the sulfur atom of) Cys13 of SEQ ID NO: 10 or SEQ ID NO: 11 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys13 of SEQ ID NO: 10 or SEQ ID NO: 11 from another E. In some embodiments, the pair of sulfur atoms are the sulfur atom corresponding to (e.g., the sulfur atom of) Cys16 of SEQ ID NO: 10 or SEQ ID NO: 11 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys16 of SEQ ID NO: 10 or SEQ ID NO: 11 from another E. In some embodiments, the pair of sulfur atoms are the sulfur atom corresponding to (e.g., the sulfur atom of) Cys18 of SEQ ID NO: 10 or SEQ ID NO: 11 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys18 of SEQ ID NO: 10 or SEQ ID NO: 11 from another E.


In some embodiments, when T is 2, the pairs of sulfur atoms are the sulfur atom corresponding to (e.g., the sulfur atom of) Cys10 of SEQ ID NO: 10 or SEQ ID NO: 11 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys10 of SEQ ID NO: 10 or SEQ ID NO: 11 from another E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys13 of SEQ ID NO: 10 or SEQ ID NO: 11 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys13 of SEQ ID NO: 10 or SEQ ID NO: 11 from another E. In some embodiments, when T is 2, the pairs of sulfur atoms are the sulfur atom corresponding to (e.g., the sulfur atom of) Cys10 of SEQ ID NO: 10 or SEQ ID NO: 11 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys10 of SEQ ID NO: 10 or SEQ ID NO: 11 from another E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys16 of SEQ ID NO: 10 or SEQ ID NO: 11 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys16 of SEQ ID NO: 10 or SEQ ID NO: 11 from another E. In some embodiments, when T is 2, the pairs of sulfur atoms are the sulfur atom corresponding to (e.g., the sulfur atom of) Cys10 of SEQ ID NO: 10 or SEQ ID NO: 11 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys10 of SEQ ID NO: 10 or SEQ ID NO: 11 from another E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys18 of SEQ ID NO: 10 or SEQ ID NO: 11 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys18 of SEQ ID NO: 10 or SEQ ID NO: 11 from another E.


In some embodiments, when T is 2, the pairs of sulfur atoms are the sulfur atom corresponding to (e.g., the sulfur atom of) Cys13 of SEQ ID NO: 10 or SEQ ID NO: 11 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys13 of SEQ ID NO: 10 or SEQ ID NO: 11 from another E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys16 of SEQ ID NO: 10 or SEQ ID NO: 11 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys16 of SEQ ID NO: 10 or SEQ ID NO: 11 from another E. In some embodiments, when T is 2, the pairs of sulfur atoms are the sulfur atom corresponding to (e.g., the sulfur atom of) Cys13 of SEQ ID NO: 10 or SEQ ID NO: 11 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys13 of SEQ ID NO: 10 or SEQ ID NO: 11 from another E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys18 of SEQ ID NO: 10 or SEQ ID NO: 11 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys18 of SEQ ID NO: 10 or SEQ ID NO: 11 from another E.


In some embodiments, when T is 2, the pairs of sulfur atoms are the sulfur atom corresponding to (e.g., the sulfur atom of) Cys16 of SEQ ID NO: 10 or SEQ ID NO: 11 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys16 of SEQ ID NO: 10 or SEQ ID NO: 11 from another E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys18 of SEQ ID NO: 10 or SEQ ID NO: 11 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys18 of SEQ ID NO: 10 or SEQ ID NO: 11 from another E.


In some embodiments, when T is 3, the pairs of sulfur atoms are the sulfur atom corresponding to (e.g., the sulfur atom of) Cys10 of SEQ ID NO: 10 or SEQ ID NO: 11 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys10 of SEQ ID NO: 10 or SEQ ID NO: 11 from another E; the sulfur atom corresponding to (e.g., the sulfur atom of) Cys13 of SEQ ID NO: 10 or SEQ ID NO: 11 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys13 of SEQ ID NO: 10 or SEQ ID NO: 11 from another E; and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys16 of SEQ ID NO: 10 or SEQ ID NO: 11 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys16 of SEQ ID NO: 10 or SEQ ID NO: 11 from another E. In some embodiments, when T is 3, the pairs of sulfur atoms are the sulfur atom corresponding to (e.g., the sulfur atom of) Cys10 of SEQ ID NO: 10 or SEQ ID NO: 11 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys10 of SEQ ID NO: 10 or SEQ ID NO: 11 from another E; the sulfur atom corresponding to (e.g., the sulfur atom of) Cys13 of SEQ ID NO: 10 or SEQ ID NO: 11 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys13 of SEQ ID NO: 10 or SEQ ID NO: 11 from another E; and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys18 of SEQ ID NO: 10 or SEQ ID NO: 11 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys18 of SEQ ID NO: 10 or SEQ ID NO: 11 from another E. In some embodiments, when T is 3, the pairs of sulfur atoms are the sulfur atom corresponding to (e.g., the sulfur atom of) Cys10 of SEQ ID NO: 10 or SEQ ID NO: 11 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys10 of SEQ ID NO: 10 or SEQ ID NO: 11 from another E; the sulfur atom corresponding to (e.g., the sulfur atom of) Cys18 of SEQ ID NO: 10 or SEQ ID NO: 11 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys18 of SEQ ID NO: 10 or SEQ ID NO: 11 from another E; and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys16 of SEQ ID NO: 10 or SEQ ID NO: 11 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys16 of SEQ ID NO: 10 or SEQ ID NO: 11 from another E. In some embodiments, when T is 3, the pairs of sulfur atoms are the sulfur atom corresponding to (e.g., the sulfur atom of) Cys13 of SEQ ID NO: 10 or SEQ ID NO: 11 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys13 of SEQ ID NO: 10 or SEQ ID NO: 11 from another E; the sulfur atom corresponding to (e.g., the sulfur atom of) Cys18 of SEQ ID NO: 10 or SEQ ID NO: 11 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys18 of SEQ ID NO: 10 or SEQ ID NO: 11 from another E; and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys16 of SEQ ID NO: 10 or SEQ ID NO: 11 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys16 of SEQ ID NO: 10 or SEQ ID NO: 11 from another E.


In some embodiments, when T is 3, the pairs of sulfur atoms are the sulfur atom corresponding to (e.g., the sulfur atom of) Cys10 of SEQ ID NO: 10 or SEQ ID NO: 11 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys10 of SEQ ID NO: 10 or SEQ ID NO: 11 from another E; the sulfur atom corresponding to (e.g., the sulfur atom of) Cys13 of SEQ ID NO: 10 or SEQ ID NO: 11 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys13 of SEQ ID NO: 10 or SEQ ID NO: 11 from another E; the sulfur atom corresponding to (e.g., the sulfur atom of) Cys16 of SEQ ID NO: 10 or SEQ ID NO: 11 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys16 of SEQ ID NO: 10 or SEQ ID NO: 11 from another E; and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys18 of SEQ ID NO: 10 or SEQ ID NO: 11 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys18 of SEQ ID NO: 10 or SEQ ID NO: 11 from another E.


In some embodiments, the conjugate has the structure:




embedded image


wherein each of a, b, c, and d is, independently, 0 or 1 and wherein when a, b, c, or d is 0, the two sulfur atoms form a disulfide bond.


In some embodiments, a is 1 and b, c, and d are 0. In some embodiments, a and b are 1 and c and d are 0. In some embodiments, a and c are 1 and b and d are 0. In some embodiments, a and d are 1 and b and c are 0. In some embodiments, a, b, and c are 1 and d is 0. In some embodiments, a, b, and d are 1 and c is 0. In some embodiments, a, c, and d are 1 and b is 0. In some embodiments, b and c are 1 and a and d are 0. In some embodiments, b and d are 1 and a and c are 0. In some embodiments, b, c, and d are 1 and a is 0. In some embodiments, c and d are 1 and a and b are 0. In some embodiments, a, b, c, and d are 1.


In some embodiments, at least one of the pair of sulfur atoms is the sulfur atom corresponding to (e.g., the sulfur atom of) a hinge cysteine of SEQ ID NO: 4 or SEQ ID NO: 33, i.e., Cys10 and/or Cys13. In some embodiments, the pair of sulfur atoms are the sulfur atoms corresponding to (e.g., the sulfur atoms of) Cys10 and Cys13 in SEQ ID NO: 4 or SEQ ID NO: 33.


In some embodiments, the pair of sulfur atoms include one sulfur atom of a cysteine from each E, i.e., L-A1 along with the sulfur atoms to which it is attached forms a bridge between two Fc domains (e.g., two Fc domains comprising the sequence of SEQ ID NO: 4 or SEQ ID NO: 33). In some embodiments, the pair of sulfur atoms are the sulfur atom corresponding to (e.g., the sulfur atom of) Cys10 of SEQ ID NO: 4 or SEQ ID NO: 33 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys10 of SEQ ID NO: 4 or SEQ ID NO: 33 from another E. In some embodiments, the pair of sulfur atoms are the sulfur atom corresponding to (e.g., the sulfur atom of) Cys13 of SEQ ID NO: 4 or SEQ ID NO: 33 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys13 of SEQ ID NO: 4 or SEQ ID NO: 33 from another E. In some embodiments, when T is 2, the pairs of sulfur atoms are the sulfur atom corresponding to (e.g., the sulfur atom of) Cys10 of SEQ ID NO: 4 or SEQ ID NO: 33 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys10 of SEQ ID NO: 4 or SEQ ID NO: 33 from another E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys13 of SEQ ID NO: 4 or SEQ ID NO: 33 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys13 of SEQ ID NO: 4 or SEQ ID NO: 33 from another E.


In some embodiments, the conjugate has the structure:




embedded image


wherein each of a and b is, independently, 0 or 1 and wherein when a or b is 0, the two sulfur atoms form a disulfide bond. In some embodiments, a is 1 and b is 0. In some embodiments, a is 0 and b is 1. In some embodiments, a and b are 1.


In some embodiments, at least one of the pair of sulfur atoms is the sulfur atom corresponding to (e.g., the sulfur atom of) a hinge cysteine of SEQ ID NO: 8, i.e., Cys10 and/or Cys13. In some embodiments, the pair of sulfur atoms are the sulfur atoms corresponding to (e.g., the sulfur atoms of) Cys10 and Cys13 in SEQ ID NO: 8.


In some embodiments, the pair of sulfur atoms include one sulfur atom of a cysteine from each E, i.e., L-A1 along with the sulfur atoms to which it is attached forms a bridge between two Fc domains (e.g., two Fc domains comprising the sequence of SEQ ID NO: 8). In some embodiments, the pair of sulfur atoms are the sulfur atom corresponding to (e.g., the sulfur atom of) Cys10 of SEQ ID NO: 8 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys10 of SEQ ID NO: 8 from another E. In some embodiments, the pair of sulfur atoms are the sulfur atom corresponding to (e.g., the sulfur atom of) Cys13 of SEQ ID NO: 8 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys13 of SEQ ID NO: 8 from another E. In some embodiments, when T is 2, the pairs of sulfur atoms are the sulfur atom corresponding to (e.g., the sulfur atom of) Cys10 of SEQ ID NO: 8 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys10 of SEQ ID NO: 8 from another E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys13 of SEQ ID NO: 8 from one E and the sulfur atom corresponding to (e.g., the sulfur atom of) Cys13 of SEQ ID NO: 8 from another E.


In some embodiments, the conjugate has the structure:




embedded image


wherein each of a and b is, independently, 0 or 1 and wherein when a or b is 0, the two sulfur atoms form a disulfide bond. In some embodiments, a is 1 and b is 0. In some embodiments, a is 0 and b is 1. In some embodiments, a and b are 1.


In some embodiments, the conjugate has the structure:




embedded image


wherein each of a and b is, independently, 0 or 1 and wherein when a or b is 0, the two sulfur atoms form a disulfide bond. In some embodiments, a is 1 and b is 0. In some embodiments, a is 0 and b is 1. In some embodiments, a and b are 1.


In some embodiments, the conjugate has the structure:




embedded image


wherein each of a and b is, independently, 0 or 1 and wherein when a or b is 0, the two sulfur atoms form a disulfide bond. In some embodiments, a is 1 and b is 0. In some embodiments, a is 0 and b is 1. In some embodiments, a and b are 1.


In some embodiments, the conjugate has the structure:




embedded image


wherein each of a and b is, independently, 0 or 1 and wherein when a or b is 0, the sulfur atoms is a thiol. In some embodiments, a is 1 and b is 0. In some embodiments, a is 0 and b is 1. In some embodiments, a and b are 1.


In some embodiments, the nitrogen atom is the nitrogen of a surface exposed lysine, e.g., the nitrogen atom corresponding to (e.g., the nitrogen atom of) Lys35, Lys63, Lys77, Lys79, Lys106, Lys123, Lys129, Lys181, Lys203, Lys228, or Lys236 of SEQ ID NO: 10 or SEQ ID NO: 11. In some embodiments, the nitrogen atom is the nitrogen atom corresponding to (e.g., the nitrogen atom of) Lys65, Lys79, Lys108, Lys230, and/or Lys238 of SEQ ID NO: 10 or SEQ ID NO: 11.


In some embodiments, the conjugate has the structure:




embedded image


wherein each of a, b, c, d, and e is, independently, 0 or 1 and wherein when a, b, c, d, or e is 0, the two nitrogen atom is NH2. In some embodiments, a is 1 and b, c, d, and e are 0. In some embodiments, b is 1 and a, c, d, and e are 0. In some embodiments, c is 1 and a, b, d, and e are 0. In some embodiments, d is 1 and a, b, c, and e are 0. In some embodiments, e is 1 and a, b, c, and d are 0. In some embodiments, a and b are 1 and c, d, and e are 0. In some embodiments, a and c are 1 and b, d, and e are 0. In some embodiments, a and d are 1 and b, c, and e are 0. In some embodiments, a and e are 1 and b, c, and d are 0. In some embodiments, b and c are 1 and a, d, and e are 0. In some embodiments, b and d are 1 and a, c, and e are 0. In some embodiments, b and e are 1 and a, c, and d are 0. In some embodiments, c and d are 1 and a, b, and e are 0. In some embodiments, c and e are 1 and a, b, and d are 0. In some embodiments, d and e are 1 and a, b, and c are 0. In some embodiments, a, b, and c are 1 and d and e are 0. In some embodiments, a, b, and d are 1 and c and e are 0. In some embodiments, a, b, and e are 1 and c and d are 0. In some embodiments, a, c, and d are 1 and b and e are 0. In some embodiments, a, c, and e are 1 and b and d are 0. In some embodiments, a, d, and e are 1 and b and c are 0. In some embodiments, b, c, and d are 1 and a and e are 0. In some embodiments, b, d, and e are 1 and a and c are 0. In some embodiments, c, d, and e are 1 and a and b are 0.


In some embodiments of any of the conjugates described herein, the conjugate forms a homodimer including an Fc domain. In some embodiments of the conjugates described herein, E homodimerizes with another E to form an Fc domain.


In another aspect, the invention features a conjugate described by (D-I):




embedded image


wherein E includes an albumin protein (e.g., an albumin protein having the sequence of any one of SEQ ID NOs: 96-98), an albumin protein-binding peptide, or an Fc-binding peptide; L in each A1-L-A2 is a linker independently covalently attached to a sulfur atom of a surface exposed cysteine or a nitrogen atom of a surface exposed lysine in E and to each of A1 and A2; n is 1; T is an integer from 1 to 20 (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), and the squiggly line connected to the E indicates that each A1-L-A2 is independently covalently attached to the sulfur atom of a solvent-exposed cysteine or the nitrogen atom of a solvent-exposed lysine in E, or a pharmaceutically acceptable salt thereof. When T is greater than 1 (e.g., T is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), each A1-L-A2 may be independently selected (e.g., independently selected from any of the A1-L-A2 structures described herein). In a preferred embodiment of the above, x is 2.


In another aspect, the invention features a conjugate described by formula (M-I):




embedded image


wherein E includes an albumin protein (e.g., an albumin protein having the sequence of any one of SEQ ID NOs: 96-98), an albumin protein-binding peptide, or an Fc-binding peptide; L in each L-A1 is a linker independently covalently attached to a sulfur atom of a surface exposed cysteine or a nitrogen atom of a surface exposed lysine in E and to A1; n is 1; T is an integer from 1 to 20 (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20); and the squiggly line connected to E indicates that each L-A1 is independently covalently attached to the sulfur atom of the solvent-exposed cysteine or the nitrogen atom of the solvent-exposed lysine in E, or a pharmaceutically acceptable salt thereof. When T is greater than 1 (e.g., T is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), each A1 may be independently selected from any structure described by formula (A-I). In a preferred embodiment of the above, x is 2.


In some embodiments, each E includes an albumin protein having the sequence of any one of SEQ ID NOs: 96-98.


In some embodiments, T is 1 and L-A1 is covalently attached to the sulfur atom corresponding to Cys34 of SEQ ID NO: 96.


In another aspect, the invention features an intermediate (Int) of Table 1. These intermediates comprise one or more gp120 binders and a linker (e.g., a PEG2-PEG20 linker) and may be used in the synthesis of a conjugate described herein. Intermediates of Table 1 may be conjugated to, for example, an Fc domain or Fc domain monomer, albumin protein, albumin protein-binding peptide, or Fc-binding peptide (e.g., by way of a linker) by any suitable methods known to those of skill in the art, including any of the methods described or exemplified herein. In some embodiments, the conjugate (e.g., a conjugate described by any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)) includes E, wherein E is an Fc domain monomer or an Fc domain (e.g., an Fc domain monomer or an Fc domain, each Fc domain monomer having, independently, the sequence of any one of SEQ ID NOs: 1-95). In preferred embodiments, one or more nitrogen atoms of one or more surface exposed lysine residues of E or one or more sulfur atoms of one or more surface exposed cysteines in E is covalently conjugated to a linker (e.g., a PEG2-PEG20 linker). The linker conjugated to E may be functionalized such that it may react to form a covalent bond with any of the Ints described herein (e.g., an Int of Table 1). In preferred embodiments, E is conjugated to a linker functionalized with an azido group and the Int (e.g., an Int of Table 1) is functionalized with an alkyne group. Conjugation (e.g., by click chemistry) of the linker-azido of E and linker-alkyne of the Int forms a conjugate of the invention, for example a conjugate described by any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII). In yet other embodiments, E is conjugated to a linker functionalized with an alkyne group and the Int (e.g., an Int of Table 1) is functionalized with an azido group. Conjugation (e.g., by click chemistry) of the linker-alkyne of E and the linker-azido of the Int forms a conjugate of the invention, for example a conjugate described by any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII).









TABLE 1







Intermediates








Intermediate
Structure





Int-1 


embedded image







Int-2 


embedded image







Int-3 


embedded image







Int-4 


embedded image







Int-5 


embedded image







Int-6 


embedded image







Int-7 


embedded image







Int-8 


embedded image







Int-9 


embedded image







Int-10


embedded image







Int-11


embedded image







Int-12


embedded image







Int-13


embedded image







Int-14


embedded image







Int-15


embedded image







Int-16


embedded image







Int-17


embedded image







Int-18


embedded image







Int-19


embedded image







Int-20


embedded image







Int-21


embedded image







Int-22


embedded image







Int-23


embedded image







Int-24


embedded image







Int-25


embedded image







Int-26


embedded image







Int-27


embedded image











In another aspect, the invention features a conjugate of Table 2. Each conjugate of Table 2 corresponds to a conjugate of either formula (M-I) or formula (D-I), as indicated. Conjugates of Table 2 include conjugates formed by the covalent reaction of an Int of Table 1 with a linker which is in turn conjugated to E (e.g., an Fc domain monomer, an albumin protein, an albumin protein-binding peptide, or an Fc-binding peptide). In some embodiments, the reactive moiety of the Int (e.g., the alkyne or azido group) reacts with a corresponding reactive group (e.g., an alkyne or azido group) of a linker (represented by L′) covalently attached to E, such that an Int of Table 1 is covalently attached to E. As represented in Table 2, L′ corresponds to the remainder of L as defined in (M-I) or (D-I) (e.g., L′ is a linker that covalently joins the Int and E). For example, L′ may include a triazole (formed by the click chemistry reaction between the Int and a linker conjugated to E) and a linker (e.g., a PEG2-PEG20 linker) which in turn is conjugated to an amino acid side chain of E.


In some embodiments in any conjugate of Table 2, n is 1 or 2. When n is 1, each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95), an albumin protein (e.g., an albumin protein having the sequence of any one of SEQ ID NOs: 96-98), an albumin protein-binding peptide, or an Fc-binding peptide. When n is 2, each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95), and the Fc domain monomers dimerize to form and Fc domain.


In some embodiments in any conjugate of Table 2, T is an integer from 1 to 20 (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20). The disclosure also provides a population of any of the conjugates of Table 2 wherein the average value of T is 1 to 20 (e.g., the average value of T is 1 to 2, 1 to 3, 1 to 4, 1 to 5, 5 to 10, 10 to 15, or 15 to 20). In some embodiments, the average value of T is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments, the average T is 1 to 10 (e.g., 1.5, 2, 2.5, 3, 3.5, 4, 4.5, 5, 5.5, 6, 6.5, 7, 7.5, 8, 8.5, 9, 9.5, or 10). In certain embodiments, the average T is 1 to 5 (e.g., 1, 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, 2, 2.1, 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 2.8, 2.9, 3, 3.1, 3.2, 3.3, 3.4, 3.5, 3.6, 3.7, 3.8, 3.9, 4, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, or 5). In some embodiment, the average T is 5 to 10 (e.g., 5, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, 7, 7.1, 7.2, 7.3, 7.4, 7.5, 7.6, 7.7, 7.8, 7.9, 8, 8.1, 8.2, 8.3, 8.4, 8.5, 8.6, 8.7, 8.8, 8.9, 9, 9.1, 9.2, 9.3, 9.4, 9.5, 9.6, 9.7, 9.8, 9.9, or 10). In some embodiments, the average T is 2.5 to 7.5 (e.g., 2.5, 2.6, 2.7, 2.8, 2.9, 3, 3.1, 3.2, 3.3, 3.4, 3.5, 3.6, 3.7, 3.8, 3.9, 4, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, 5, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, 7, 7.1, 7.2, 7.3, 7.4, or 7.5).


The squiggly line in the conjugates of Table 2 indicates that each L′-Int is covalently attached to an amino acid side chain in E (e.g., the nitrogen atom of a surface exposed lysine or the sulfur atom of a surface exposed cysteine in E), or a pharmaceutically acceptable salt thereof.









TABLE 2







Conjugates Corresponding to Intermediates of Table 1










Corre-




sponding




Intermediate




of Table 1
Conjugate Structure







Int-1 


embedded image









Int-2 


embedded image









Int-3 


embedded image









Int-4 


embedded image









Int-5 


embedded image









Int-6 


embedded image









Int-7 


embedded image









Int-8 


embedded image









Int-9 


embedded image









Int-10


embedded image









Int-11


embedded image









Int-12


embedded image









Int-13


embedded image









Int-14


embedded image









Int-15


embedded image









Int-16


embedded image









Int-17


embedded image









Int-18


embedded image









Int-19


embedded image









Int-20


embedded image









Int-21


embedded image









Int-22


embedded image









Int-23


embedded image









Int-24


embedded image









Int-25


embedded image









Int-26


embedded image









Int-27


embedded image












In another aspect, the invention features a conjugate including (i) a first moiety, A1; (ii) a second moiety, A2; (iii) an Fc domain monomer or an Fc domain; and (iv) a linker covalently attached to A1 and A2, and to the Fc domain monomer or the Fc domain; wherein each A1 and each A2 is independently selected from any structure described by formula (A-I) or (A-II). In a preferred embodiment of the above, x is 2.


In another aspect, the invention features a conjugate including (i) a first moiety, Int; (ii) an Fc domain monomer or an Fc domain; and (iv) a linker covalently attached to Int, and to the Fc domain monomer or the Fc domain; wherein each Int is independently selected from any one of the intermediates of Table 1.


In another aspect, the invention features a conjugate including (i) a first moiety, A1; (ii) a second moiety, A2; (iii) an albumin protein, an albumin protein-binding peptide, or an Fc-binding peptide; and (iv) a linker covalently attached to A1 and A2, and to the Fc domain monomer or the Fc domain; wherein each A1 and each A2 is independently selected from any structure described by formula (A-I) or (A-II). In a preferred embodiment of the above, x is 2.


In another aspect, the invention features a conjugate described by formula (D-I):




embedded image


wherein each A1 and each A2 is independently described by formula (A-I) or (A-II); each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95), an albumin protein (e.g., an albumin protein having the sequence of any one of SEQ ID NOs: 96-98), an albumin protein-binding peptide, or an Fc-binding peptide; n is 1 or 2; T is an integer from 1 to 20 (e.g., T is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20); and L is a linker covalently attached to each of E, A1, and A2, or a pharmaceutically acceptable salt thereof. When T is greater than 1 (e.g., T is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), each A1-L-A2 may be independently selected (e.g., independently selected from any of the A1-L-A2 structures described herein). In a preferred embodiment of the above, x is 2.


In some embodiments, the conjugate is described by formula (D-II):




embedded image


wherein X is C, O, or N, or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-III):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-III-1):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-III-2):




embedded image


wherein L′ is the remainder of L, and y1 and y2 are each independently an integer from 1-20 (e.g., y1 and y2 are each independently 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof. In some embodiments, L′ is a nitrogen atom.


In some embodiments, the conjugate is described by formula (D-III-3):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-III-4):




embedded image


wherein L′ is the remainder of L, and y1 and y2 are each independently an integer from 1-20 (e.g., y1 and y2 are each independently 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof. In some embodiments, L′ is a nitrogen atom.


In some embodiments, the conjugate is described by formula (D-III-5):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-III-6):




embedded image


wherein L′ is the remainder of L, and y1 and y2 are each independently an integer from 1-20 (e.g., y1 and y2 are each independently 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof. In some embodiments, L′ is a nitrogen atom.


In some embodiments, the conjugate is described by formula (D-IV):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-IV-1):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-IV-2):




embedded image


wherein L′ is the remainder of L, and y1 and y2 are each independently an integer from 1-20 (e.g., y1 and y2 are each independently 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof. In some embodiments, L′ is a nitrogen atom.


In some embodiments, the conjugate is described by formula (D-IV-3):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-IV-4):




embedded image


wherein L′ is the remainder of L, and y1 and y2 are each independently an integer from 1-20 (e.g., y1 and y2 are each independently 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof. In some embodiments, L′ is a nitrogen atom.


In some embodiments, the conjugate is described by formula (D-IV-5):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-IV-6):




embedded image


wherein L′ is the remainder of L, and y1 and y2 are each independently an integer from 1-20 (e.g., y1 and y2 are each independently 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof. In some embodiments, L′ is a nitrogen atom.


In some embodiments, the conjugate is described by formula (D-V):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-V-1):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-V-2):




embedded image


wherein L′ is the remainder of L, and y1 and y2 are each independently an integer from 1-20 (e.g., y1 and y2 are each independently 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof. In some embodiments, L′ is a nitrogen atom.


In some embodiments, the conjugate is described by formula (D-V-3):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-V-4):




embedded image


wherein L′ is the remainder of L, and y1 and y2 are each independently an integer from 1-20 (e.g., y1 and y2 are each independently 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof. In some embodiments, L′ is a nitrogen atom.


In some embodiments, the conjugate is described by formula (D-V-5):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-V-6):




embedded image


wherein L′ is the remainder of L, and y1 and y2 are each independently an integer from 1-20 (e.g., y1 and y2 are each independently 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof. In some embodiments, L′ is a nitrogen atom.


In some embodiments, the conjugate is described by formula (D-VI):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-VI-1):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-VI-2):




embedded image


wherein L′ is the remainder of L, and y1 and y2 are each independently an integer from 1-20 (e.g., y1 and y2 are each independently 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof. In some embodiments, L′ is a nitrogen atom.


In some embodiments, the conjugate is described by formula (D-VI-3):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-VI-4):




embedded image


wherein L′ is the remainder of L, and y1 and y2 are each independently an integer from 1-20 (e.g., y1 and y2 are each independently 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof. In some embodiments, L′ is a nitrogen atom.


In some embodiments, the conjugate is described by formula (D-VI-5):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-VI-6):




embedded image


wherein L′ is the remainder of L, and y1 and y2 are each independently an integer from 1-20 (e.g., y1 and y2 are each independently 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof. In some embodiments, L′ is a nitrogen atom.


In some embodiments, the conjugate is described by formula (D-VII):




embedded image


wherein X is C, O, or N, or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-VIII):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-VIII-1):




embedded image


wherein L′ is the remainder of L, and y1 and y2 are each independently an integer from 1-20 (e.g., y1 and y2 are each independently 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof. In some embodiments, L′ is a nitrogen atom.


In some embodiments, the conjugate is described by formula (D-IX):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-IX-1):




embedded image


wherein L′ is the remainder of L, and y1 and y2 are each independently an integer from 1-20 (e.g., y1 and y2 are each independently 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof. In some embodiments, L′ is a nitrogen atom.


In some embodiments, the conjugate is described by formula (D-X):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-X-1):




embedded image


wherein L′ is the remainder of L, and y1 and y2 are each independently an integer from 1-20 (e.g., y1 and y2 are each independently 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof. In some embodiments, L′ is a nitrogen atom.


In some embodiments, the conjugate is described by formula (D-XI):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-XI-1):




embedded image


wherein L′ is the remainder of L, and y1 and y2 are each independently an integer from 1-20 (e.g., y1 and y2 are each independently 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof. In some embodiments, L′ is a nitrogen atom.


In some embodiments, the conjugate is described by formula (D-XII):




embedded image


wherein X is C, O, or N,


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-XII-1):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-XII-2):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-XIII):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-XIII-1):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-XIII-2):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the invention features a conjugate described by formula (D-I):




embedded image


wherein each A1 and each A2 is independently described by formula (A-II);


each E comprises an Fc domain monomer;


the squiggly line connected to the E indicates that each A1-L-A2 is covalently attached to E;


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-XIV):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-XIV-1):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-XIV-2):




embedded image


wherein L′ is the remainder of L,


y1 and y2 are each independently an integer from 1-20


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-XIV-3):




embedded image


wherein L′ is the remainder of L, and


e1 and e2 are each independently an integer from 1-10


y1 and y2 are each independently an integer from 1-20


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-XIV-4):




embedded image


wherein L′ is the remainder of L, and


e1, e2, e3 and e4 are each independently an integer from 1-10


y1 and y2 are each independently an integer from 1-20


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-XIV-5):




embedded image


wherein L′ is the remainder of L, and


e1, e2, e3, and e4 are each independently an integer from 1-10


y1 and y2 are each independently an integer from 1-20


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-XV):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-XV-1):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-XV-2):




embedded image


wherein L′ is the remainder of L,


y1 and y2 are each independently an integer from 1-20


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-XV-3):




embedded image


wherein L′ is the remainder of L, and


e1 and e2 are each independently an integer from 1-10


y1 and y2 are each independently an integer from 1-20


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-XV-4):




embedded image


wherein L′ is the remainder of L, and


e1, e2, e3 and e4 are each independently an integer from 1-10


y1 and y2 are each independently an integer from 1-20


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-XV-5):




embedded image


wherein L′ is the remainder of L, and


e1, e2, e3, and e4 are each independently an integer from 1-10


y1 and y2 are each independently an integer from 1-20


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-XVI):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-XVI-1):




embedded image


wherein U5 is C1-C10 alkyl;


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-XVI-2):




embedded image


wherein L′ is the remainder of L,


y1 and y2 are each independently an integer from 1-20


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-XVI-3):




embedded image


wherein L′ is the remainder of L, and


e1 and e2 are each independently an integer from 1-10


y1 and y2 are each independently an integer from 1-20


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-XVI-4):




embedded image


wherein L′ is the remainder of L, and


e1, e2, e3 and e4 are each independently an integer from 1-10


y1 and y2 are each independently an integer from 1-20


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-XVI-5):




embedded image


wherein L′ is the remainder of L, and


e1, e2, e3, and e4 are each independently an integer from 1-10


y1 and y2 are each independently an integer from 1-20


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-XVII):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-XVII-1):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-XVII-2):




embedded image


wherein L′ is the remainder of L,


y1 and y2 are each independently an integer from 1-20


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-XVII-3):




embedded image


wherein L′ is the remainder of L, and


e1 and e2 are each independently an integer from 1-10


y1 and y2 are each independently an integer from 1-20


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-XVII-4):




embedded image


wherein L′ is the remainder of L, and


e1, e2, e3 and e4 are each independently an integer from 1-10


y1 and y2 are each independently an integer from 1-20


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (D-XVII-5):




embedded image


wherein L′ is the remainder of L, and


e1, e2, e3 and e4 are each independently an integer from 1-10


y1 and y2 are each independently an integer from 1-20


or a pharmaceutically acceptable salt thereof.


In some embodiments of any of the aspects described herein, L or L′ includes one or more optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, optionally substituted C3-C15 heteroarylene, O, S, NRi, P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino, wherein Ri is H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C3-C15 heteroaryl.


In some embodiments of any of the aspects described herein, the backbone of L or L′ consists of one or more optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, optionally substituted C3-C15 heteroarylene, O, S, NRi, P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino, wherein Ri is H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C3-C15 heteroaryl.


In some embodiments of any of the aspects described herein, L or L′ is oxo substituted. In some embodiments, the backbone of L or L′ includes no more than 250 atoms. In some embodiments, L or L′ is capable of forming an amide, a carbamate, a sulfonyl, or a urea linkage. In some embodiments L or L′ is a bond. In some embodiments, L or L′ is an atom.


In some embodiments of any of the aspects described herein, each L is described by formula (D-L-I):




embedded image


wherein LA is described by formula GA1-(ZA1)g1—(YA1)h1—(ZA2)i1—(YA2)j1—(ZA3)k1—(YA3)l1—(ZA4)m1—(YA4)n1—(ZA5)o1-GA2; LB is described by formula GB1-(ZB1)g2—(YB1)h2—(ZB2)i2—(YB2)j2—(ZB3)k2—(YB3)l2—(ZB4)m2—(YB4)n2—(ZB5)o2-GB2; LC is described by formula GC1-(ZC1)g3—(YC1)h3—(ZC2)i3—(YC2)j3—(ZC3)k3—(YC3)l3—(ZC4)m3—(YC4)n3—(ZC5)o3-GC2; GA1 is a bond attached to Qi; GA2 is a bond attached to A1; GB1 is a bond attached to Qi); GB2 is a bond attached to A2; GC1 is a bond attached to Qi; GC2 is a bond attached to E or a functional group capable of reacting with a functional group conjugated to E (e.g., maleimide and cysteine, amine and activated carboxylic acid, thiol and maleimide, activated sulfonic acid and amine, isocyanate and amine, azide and alkyne, and alkene and tetrazine); each of ZA1, ZA2, ZA3, ZA4, ZA5, ZB1, ZB2, ZB3, ZB4, ZB5, ZC1, ZC2, ZC3, ZC4, and ZC5 is, independently, optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, or optionally substituted C3-C15 heteroarylene; each of YA1, YA2, YA3, YA4, YB1, YB2, YB3, YB4, YC1, YC2, YC3 and YC4 is, independently, O, S, NRi, P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino; Ri is H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C3-C15 heteroaryl; each of g1, h1, i1, j1, k1, l1, m1, n1, o1, g2, h2, i2, j2, k2, l2, m2, n2, o2, g3, h3, i3, j3, k3, l3, m3, n3, and o3 is, independently, 0 or 1; Q is a nitrogen atom, optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, or optionally substituted C3-C15 heteroarylene.


In some embodiments, optionally substituted includes substitution with a polyethylene glycol (PEG). A PEG has a repeating unit structure (—CH2CH2O—)n, wherein n is an integer from 2 to 100. A polyethylene glycol may be selected any one of PEG2 to PEG100 (e.g., PEG2, PEG3, PEG4, PEG5, PEG5-PEG10, PEG10-PEG20, PEG20-PEG30, PEG30-PEG40, PEG50-PEG60, PEG60-PEG70, PEG70-PEG80, PEG80-PEG90, PEG90-PEG100).


In some embodiments, LC may have two points of attachment to the Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide (e.g., two GC2).


In some embodiments of any of the aspects described herein, L includes a polyethylene glycol (PEG) linker. A PEG linker includes a linker having the repeating unit structure (—CH2CH2O—)n, wherein n is an integer from 2 to 100. A polyethylene glycol linker may covalently join a gp120 binder and E (e.g., in a conjugate of any one of formulas (M-I)-(M-XVII)). A polyethylene glycol linker may covalently join a first gp120 binder and a second gp120 binder (e.g., in a conjugate of any one of formulas (D-I)-(D-XVII)). A polyethylene glycol linker may covalently join a gp120 binder dimer and E (e.g., in a conjugate of any one of formulas (D-I)-(D-XVII)). A polyethylene glycol linker may be selected from any one of PEG2 to PEG100 (e.g., PEG2, PEG3, PEG4, PEG5, PEG5-PEG10, PEG10-PEG20, PEG20-PEG30, PEG30-PEG40, PEG50-PEG60, PEG60-PEG70, PEG70-PEG80, PEG80-PEG90, PEG90-PEG100). In some embodiments, LC includes a PEG linker, where LC is covalently attached to each of Qi and E.


In some embodiments, L is




embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


wherein z1, z2, y1, y2, y3, and y4 are each, independently, and integer from 1 to 20; and R9 is selected from H, C1-C20 alkyl, C3-C20cycloalkyl, C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and C3-C15 heteroaryl.


In some embodiments, L is




embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


wherein R* is a bond or includes one or more of optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, optionally substituted C3-C15 heteroarylene, O, S, NRi, P, carbonyl, thiocarbonyl, sulfonyl, phosphate, and imino, and wherein Ri is H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C3-C15 heteroaryl.


In another aspect, the invention features a conjugate described by formula (M-I):




embedded image


wherein each A1 is independently described by formula (A-I);


each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95), an albumin protein (e.g., an albumin protein having the sequence of any one of SEQ ID NOs: 96-98), an albumin protein-binding peptide, or an Fc-binding peptide; n is 1 or 2; T is an integer from 1 to 20 (e.g., T is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20); and L is a linker covalently attached to each of E and A1, or a pharmaceutically acceptable salt thereof. When T is greater than 1 (e.g., T is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), each A1 may be independently selected from any structure described by formula (A-I). In a preferred embodiment of the above, x is 2.


In some embodiments, the conjugate is described by formula (M-II):




embedded image


wherein X is C, O, or N, or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-III):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-III-1):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-III-2):




embedded image


wherein L′ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-III-3):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-III-4):




embedded image


wherein L′ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-III-5):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-III-6):




embedded image


wherein L′ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-IV):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-IV-1):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-IV-2):




embedded image


wherein L′ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-IV-3):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-IV-4):




embedded image


wherein L′ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-IV-5):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-IV-6):




embedded image


wherein L′ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-V):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-V-1):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-V-2):




embedded image


wherein L′ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-V-3):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-V-4):




embedded image


wherein L′ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-V-5):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-V-6):




embedded image


wherein L′ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-VI):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-VI-1):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-VI-2):




embedded image


wherein L′ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-VI-3):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-VI-4):




embedded image


wherein L′ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-VI-5):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-VI-6):




embedded image


wherein L′ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-VII):




embedded image


wherein X is C, O, or N, or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-VIII):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-VIII-1):




embedded image


wherein L′ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-IX):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-IX-1):




embedded image


wherein L′ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-X):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-X-1):




embedded image


wherein L′ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-XI):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-XI-1):




embedded image


wherein L′ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-XII):




embedded image


wherein X is C, O, or N,


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-XII-1):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-XII-2):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-XIII):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-XIII-1):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-XIII-2):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the invention features a conjugate described by formula (M-I):




embedded image


wherein each A1 is independently described by formula (A-II);


each E comprises an Fc domain monomer;


the squiggly line connected to the E indicates that each A1-L-A2 is covalently attached to E;


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-XIV):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-XIV-1):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-XIV-2):




embedded image


wherein L′ is the remainder of L;


y1 is an integer from 1-20;


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-XIV-3):




embedded image


wherein L′ is the remainder of L;


e1 is an integer from 1-10; and


y1 is an integer from 1-20;


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-XIV-4):




embedded image


wherein L′ is the remainder of L;


e1 and e3 are each independently an integer from 1-10; and


y1 is an integer from 1-20;


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-XIV-5):




embedded image


wherein L′ is the remainder of L;


e1 and e3 are each independently an integer from 1-10; and


y1 is an integer from 1-20;


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-XV):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-XV-1):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-XV-2):




embedded image


wherein L′ is the remainder of L;


y1 is an integer from 1-20;


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-XV-3):




embedded image


wherein L′ is the remainder of L;


e1 is an integer from 1-10; and


y1 is an integer from 1-20;


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-XV-4):




embedded image


wherein L′ is the remainder of L;


e1 and e3 are each independently an integer from 1-10; and


y1 is an integer from 1-20;


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-XV-5):




embedded image


wherein L′ is the remainder of L;


e1 and e3 are each independently an integer from 1-10; and


y1 is an integer from 1-20;


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-XVI):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-XVI-1):




embedded image


wherein U5 is C1-C10 alkyl;


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-XVI-2):




embedded image


wherein L′ is the remainder of L;


y1 is an integer from 1-20;


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-XVI-3):




embedded image


wherein L′ is the remainder of L;


e1 is an integer from 1-10; and


y1 is an integer from 1-20;


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-XVI-4):




embedded image


wherein L′ is the remainder of L;


e1 and e3 are each independently an integer from 1-10;


y1 is an integer from 1-20;


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-XVI-5):




embedded image


wherein L′ is the remainder of L;


e1 and e3 are each independently an integer from 1-10;


y1 is an integer from 1-20;


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-XVII):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-XVII-1):




embedded image


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-XVII-2):




embedded image


wherein L′ is the remainder of L,


y1 is an integer from 1-20,


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-XVII-3):




embedded image


wherein L′ is the remainder of L;


e1 is an integer from 1-10; and


y1 is an integer from 1-20;


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-XVII-4):




embedded image


wherein L′ is the remainder of L;


e1 and e3 are each independently an integer from 1-10; and


y1 is an integer from 1-20;


or a pharmaceutically acceptable salt thereof.


In some embodiments, the conjugate is described by formula (M-XVII-5):




embedded image


wherein L′ is the remainder of L;


e1 and e3 are each independently an integer from 1-10; and


y1 is an integer from 1-20;


or a pharmaceutically acceptable salt thereof.


In some embodiments of any of the aspects described herein, L or L′ includes one or more optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, optionally substituted C3-C15 heteroarylene, O, S, NRi, P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino, wherein Ri is H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C3-C15 heteroaryl.


In some embodiments of any of the aspects described herein, the backbone of L or L′ consists of one or more optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, optionally substituted C3-C15 heteroarylene, O, S, NRi, P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino, wherein Ri is H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C3-C15 heteroaryl.


In some embodiments of any of the aspects described herein, L or L′ is oxo substituted. In some embodiments, the backbone of L or L′ includes no more than 250 atoms. In some embodiments, L or L′ is capable of forming an amide, a carbamate, a sulfonyl, or a urea linkage. In some embodiments, L or L′ is a bond. In some embodiments, L or L′ is an atom. In some embodiments, L′ is a nitrogen atom.


In some embodiments, each L is described by formula (M-L-I):





J1-(Q1)g-(T1)h-(Q2)i-(T2)j-(Q3)k-(T3)l-(Q4)m-(T4)n-(Q5)o-J2


wherein: J1 is a bond attached to A1; J2 is a bond attached to E or a functional group capable of reacting with a functional group conjugated to E (e.g., maleimide and cysteine, amine and activated carboxylic acid, thiol and maleimide, activated sulfonic acid and amine, isocyanate and amine, azide and alkyne, and alkene and tetrazine); each of Q1, Q2, Q3, Q4 and Q5 is, independently, optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, or optionally substituted C3-C15 heteroarylene; each of T1, T2, T3, T4 is, independently, O, S, NRi, P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino; Ri is H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C3-C15 heteroaryl; and each of g, h, i, j, k, l, m, n, and o is, independently, 0 or 1; or a pharmaceutically acceptable salt thereof.


In some embodiments, optionally substituted includes substitution with a polyethylene glycol (PEG). A PEG has a repeating unit structure (—CH2CH2O—)n, wherein n is an integer from 2 to 100. A polyethylene glycol may be selected from any one of PEG2 to PEG100 (e.g., PEG2, PEG3, PEG4, PEG5, PEG5-PEG10, PEG10-PEG20, PEG20-PEG30, PEG30-PEG40, PEG50-PEG60, PEG60-PEG70, PEG70-PEG80, PEG80-PEG90, PEG90-PEG100).


In some embodiments, J2 may have two points of attachment to the Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide (e.g., two J2).


In some embodiments, L is




embedded image


wherein d is an integer from 1 to 20 (e.g., d is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20).


In some embodiments, L is




embedded image


embedded image


embedded image


embedded image


wherein each Y is independently selected from (—O—), (—S—), (—R8—), (—O(C═O)NR8—), (—O(C═S)NR8—), (—O(C═O)O—), (—O(C═O)—), (—NH(C═O)O—), (—NH(C═O)—), (—NH(C═NH)—), (—NH(C═O)NR8—), (—NH(C═NH)NR8—), (—NH(C═S)NR8—), (—NH(C═S)—), (—OCH2(C═O)NR8—), (—NH(SO2)—), (—NH(SO2)NR8—), (—OR9—), (—NR9—), (—SR9—), (—R9NH(C═O)—), (—R9OR9C(═O)NH—), (—CH2NH(C═O)—), (—CH2OCH2(C═O)NH—), (—(C═NR8)NH—), (—NH(SO2)—), (—(C═O)NH—), (—C(═O)—), (—C(NR8)—), or (—R9C(═O)—);


each R8 is independently selected from H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 alkylene, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and optionally substituted C2-C15 heteroaryl;


each R9 is independently selected from optionally substituted C1-C20 alkylene, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and optionally substituted C2-C15 heteroaryl; and


each of d, e, y1, and x1 is, independently, an integer from 1 to 26 (e.g., d is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, or 26).


In some embodiments of any of the aspects described herein, L includes a polyethylene glycol (PEG) linker. A PEG linker includes a linker having the repeating unit structure (—CH2CH2O—)n, wherein n is an integer from 2 to 100. A polyethylene glycol linker may covalently join a gp120 binder and E (e.g., in a conjugate of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)). A polyethylene glycol linker may covalently join a first gp120 binder and a second gp120 binder (e.g., in a conjugate of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)). A polyethylene glycol linker may covalently join a gp120 binder dimer and E (e.g., in a conjugate of any one of formulas). A polyethylene glycol linker may be selected from any one of PEG2 to PEG100 (e.g., PEG2, PEG3, PEG4, PEG5, PEG5-PEG10, PEG10-PEG20, PEG20-PEG30, PEG30-PEG40, PEG50-PEG60, PEG60-PEG70, PEG70-PEG80, PEG80-PEG90, PEG90-PEG100). In some embodiments, LC includes a PEG linker, where LC is covalently attached to each of Qi and E.


In some embodiments of any of the aspects described herein, L is covalently attached to the nitrogen atom of a surface exposed lysine of E or L is covalently attached to the sulfur atom of a surface exposed cysteine of E.


In some embodiments of any of the aspects described herein, E is an Fc domain monomer. In some embodiments, n is 2 and each E dimerizes to form an Fc domain.


In some embodiments, n is 2, each E is an Fc domain monomer, each E dimerizes to form an Fc domain, and the conjugate is described by formula (D-I-1):




embedded image


wherein J is an Fc domain; and T is an integer from 1 to 20 (e.g., T is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.


In some embodiments, n is 2, each E is an Fc domain monomer, each E dimerizes to form an Fc domain, and the conjugate is described by formula (M-I-1):




embedded image


wherein J is an Fc domain; and T is an integer from 1 to 20 (e.g., T is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.


In some embodiments of any of the aspects described herein, E has the sequence of any one of SEQ ID NOs: 1-95.


In some embodiments of any of the aspects described herein, E is an albumin protein, an albumin protein-binding peptide, or an Fc-binding peptide. In some embodiments, where E is an albumin protein, an albumin protein-binding peptide, or an Fc-binding peptide, n is 1.


In some embodiments, n is 1, E is an albumin protein, an albumin protein-binding peptide, or an Fc-binding peptide and the conjugate is described by formula (D-I-2):




embedded image


wherein E is an albumin protein, an albumin protein-binding peptide, or Fc-binding peptide; and T is an integer from 1 to 20, or a pharmaceutically acceptable salt thereof.


In some embodiments, n is 1, E is an albumin protein, an albumin protein-binding peptide, or an Fc-binding peptide, and the conjugate is described by formula (M-I-2):




embedded image


wherein E is an albumin protein, an albumin protein-binding peptide, or an Fc-binding peptide; and T is an integer from 1 to 20, or a pharmaceutically acceptable salt thereof.


In some embodiments of any of the aspects described herein, E is an albumin protein having the sequence of any one of SEQ ID NOs: 96-98.


In some embodiments of any of the aspects described herein, T is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20.


In another aspect, the invention provides a population of conjugates having the structure of any of the conjugates described herein (e.g., a population of conjugates having the formula of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)), wherein the average value of T is 1 to 20 (e.g., the average value of T is 1 to 2, 1 to 3, 1 to 4, 1 to 5, 5 to 10, 10 to 15, or 15 to 20). In some embodiments, the average value of T is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20.


In another aspect, the invention provides a pharmaceutical composition comprising any of the conjugates described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)), or a pharmaceutically acceptable salt thereof, and a pharmaceutically acceptable excipient.


In another aspect, the invention provides a method for the treatment of a subject having a viral infection or presumed to have a viral infection, the method comprising administering to the subject an effective amount of any of the conjugates or compositions described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)).


In another aspect, the invention provides a method for the prophylactic treatment of a viral infection in a subject in need thereof, the method comprising administering to the subject an effective amount of any of the conjugates or compositions described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)).


In some embodiments, the viral infection is caused by HIV. In some embodiments, the viral infection is HIV-1 or HIV-2.


In some embodiments, the subject is immunocompromised.


In some embodiments, the subject has been diagnosed with humoral immune deficiency, T cell deficiency, neutropenia, asplenia, or complement deficiency.


In some embodiments, the subject is being treated or is about to be treated with an immunosuppressive therapy.


In some embodiments, the subject has been diagnosed with a disease which causes immunosuppression. In some embodiments, the disease is cancer or acquired immunodeficiency syndrome. In some embodiments, the cancer is leukemia, lymphoma, or multiple myeloma.


In some embodiments, the subject has undergone or is about to undergo hematopoietic stem cell transplantation.


In some embodiments, wherein the subject has undergone or is about to undergo an organ transplant.


In some embodiments, the conjugate of composition is administered intramuscularly, intravenously, intradermally, intraarterially, intraperitoneally, intralesionally, intracranially, intraarticularly, intraprostatically, intrapleurally, intratracheally, intranasally, intravitreally, intravaginally, intrarectally, topically, intratumorally, peritoneally, subcutaneously, subconjunctival, intravesicularlly, mucosally, intrapericardially, intraumbilically, intraocularally, orally, locally, by inhalation, by injection, or by infusion.


In some embodiments, the subject is treated with a second therapeutic agent. In some embodiments, the second therapeutic agent is an antiviral agent. In some embodiments, the second therapeutic agent is a viral vaccine. In some embodiments, the viral vaccine elicits an immune response in the subject against HIV (e.g., HIV-1 or HIV-2).


In some embodiments, an Fc-domain-containing composition may be substituted for an Fc domain and an Fc-domain-monomer-containing composition may be substituted for an Fc domain monomer in any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII) (e.g., any one of formulas (1), (2), (D-I), (D-II), (D-III), (D-III-1), (D-III-2), (D-III-3), (D-III-4), (D-III-5), (D-III-6), (D-IV), (D-IV-1), (D-IV-2), (D-IV-3), (D-IV-4), (D-IV-5), (D-IV-6), (D-V), (D-V-1), (D-V-2), (D-V-3), (D-V-4), (D-V-5), (D-V-6), (D-VI), (D-VI-1), (D-VI-2), (D-VI-3), (D-VI-4), (D-VI-5), (D-VI-6), (D-VII), (D-VIII), (D-VIII-1), (D-IX), (D-IX-1), (D-X), (D-X-1), (D-XI), (D-XI-1), (D-XII), (D-XII-1), (D-XII-2), (D-XIII), (D-XIII-1), (D-XIII-2), (D-XIV), (D-XIV-1), (D-XIV-2), (D-XIV-3), (D-XIV-4), (D-XIV-5), (D-XV), (D-XV-1), (D-XV-2), (D-XV-3), (D-XV-4), (D-XV-5), (D-XVI), (D-XVI-1), (D-XVI-2), (D-XVI-3), (D-XVI-4), (D-XVI-5), (D-XV), (D-XVI-1), (D-XVI-2), (D-XVI-3), (D-XVII-4), (D-XVII-5), (M-I), (M-II), (M-III), (M-III-1), (M-III-2), (M-III-3), (M-III-4), (M-III-5), (M-III-6), (M-IV), (M-IV-1), (M-IV-2), (M-IV-3), (M-IV-4), (M-IV-5), (M-IV-6), (M-V), (M-V-1), (M-V-2), (M-V-3), (M-V-4), (M-V-5), (M-V-6), (M-VI), (M-VI-1), (M-VI-2), (M-VI-3), (M-VI-4), (M-VI-5), (M-VI-6), (M-VII), (M-VIII), (M-VIII-1), (M-IX), (M-IX-1), (M-X), (M-X-1), (M-XI), or (M-XI-1), (M-XII), (M-XII-1), (M-XII-2), (M-XIII), (M-XIII-1), (M-XIII-2), (M-XIV), (M-XIV-1), (M-XIV-2), (M-XIV-3), (M-XIV-4), (M-XIV-5), (M-XV), (M-XV-1), (M-XV-2), (M-XV-3), (M-XV-4), (M-XV-5), (M-XVI), (M-XVI-1), (M-XVI-2), (M-XVI-3), (M-XVI-4), (M-XVI-5), (M-XV), (M-XVII-1), (M-XVII-2), (M-XVII-3), (M-XVII-4), (M-XVII-5)). In any of the formulas described herein (e.g., any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)), when n is 1, E is an Fc-domain-monomer-containing composition. In any of the formulas described herein (e.g., any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)), when n is 2, E is an Fc-domain-containing composition.


In certain embodiments, the Fc-domain-containing composition is an antibody or an antibody fragment. An antibody may include any form of immunoglobulin, heavy chain antibody, light chain antibody, LRR-based antibody, or other protein scaffold with antibody-like properties, as well as any other immunological binding moiety known in the art, including antibody fragments (e.g., a Fab, Fab′, Fab′2, F(ab)2, Fd, Fv, Feb, scFv, or SMIP). The subunit structures and three-dimensional configurations of different classes of antibodies are known in the art. An antibody fragment may include a binding moiety that includes a portion derived from or having significant homology to an antibody, such as the antigen-determining region of an antibody. Exemplary antibody fragments include Fab, Fab′, Fab′2, F(ab′)2, Fd, Fv, Feb, scFv, and SMIP.


In particular embodiments, the antibody or antibody fragment is a human, mouse, camelid (e.g., llama, alpaca, or camel), goat, sheep, rabbit, chicken, guinea pig, hamster, horse, or rat antibody or antibody fragment. In specific embodiments, the antibody is an IgG, IgA, IgD, IgE, IgM, or intrabody. In certain embodiments, the antibody fragment includes an scFv, sdAb, dAb, Fab, Fab′, Fab′2, F(ab′)2, Fd, Fv, Feb, or SMIP.


In some embodiments, the Fc-domain-containing composition (e.g., an antibody or antibody fragment) confers binding specificity to a one or more targets (e.g., an antigen, such as an antigen associated with HIV). HIV-targeting antibodies are known in the art, for example, as described in Wibmer et al. Curr. Opin. HIV AIDS, 10(3): 135-143 (2015), which is incorporated herein by reference in its entirety.


In some embodiments, the one or more targets (e.g., an antigen) bound by the Fc-domain-containing composition (e.g., an antibody or antibody fragment) is a viral (e.g., HIV) protein such as gp41 or gp120 receptor. In some embodiments, the antibody or antibody fragment recognizes a viral surface antigen.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 1. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 1.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 2. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 2.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 3. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 3.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 4. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 4.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 5. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 5.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 6. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 6.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 7. In some embodiments, E includes an amino acid sequence that is at least 70% 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 7.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 8. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 8.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 9. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 9.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 10. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 10.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 11. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 11.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 12. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 12.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 13. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 13.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 14. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 14.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 15. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 15.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 16. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 16.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 17. In some embodiments, E includes an amino acid sequence that is at least 70% 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 17.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 18. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 18.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 19. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 19.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 20. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 20.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 21. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 21.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 22. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 22.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 23. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 23.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 24. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 24.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 25. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 25.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 26. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 26.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 27. In some embodiments, E includes an amino acid sequence that is at least 70% 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 27.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 28. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 28.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 29. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 29.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 30. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 30.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 31. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 31.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 32. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 32.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 33. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 33.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 34. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 34.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 35. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 35.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 36. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 36.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 37. In some embodiments, E includes an amino acid sequence that is at least 70% 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 37.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 38. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 38.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 39. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 39.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 40. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 40.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 41. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 41.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 42. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 42.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 43. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 43.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 44. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 45.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 46. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 46.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 47. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 47.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 48. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 48.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 49. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 49.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 50. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 50.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 51. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 51.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 52. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 52.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 53. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 53.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 54. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 54.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 55. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 55.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 56. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 56.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 57. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 57.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 58. In some embodiments, E includes an amino acid sequence that is at least 70% 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 58.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 59. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 59.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 60. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 60.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 61. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 61.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 62. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 62.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 63. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 63.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 64. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 64.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 65. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 65.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 66. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 66.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 67. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 67.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 68. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 68.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 69. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 69.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 70. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 70.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 71. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 71.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 72. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 72.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 73. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 73.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 74. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 74.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 75. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 75.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 76. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 76.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 77. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 77.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 78. In some embodiments, E includes an amino acid sequence that is at least 70% 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 78.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 79. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 79.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 80. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 80.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 81. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 81.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 82. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 82.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 83. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 83.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 84. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 84.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 85. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 85.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 86. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 86.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 87. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 87.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 88. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 88.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 89. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 89.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 90. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 90.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 91. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 91.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 92. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 92.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 93. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 93.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 94. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 94.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 95. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 95.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 96. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 96.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 97. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 97.


In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 98. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 98.


In some embodiments of any of the aspects described herein, wherein E includes an Fc domain monomer, the Fc domain monomer (e.g., the Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95) includes a triple mutation corresponding to M252Y/S254T/T256E (YTE). As used herein, an amino acid “corresponding to” a particular amino acid residue (e.g., of a particular SEQ ID NO.) should be understood to include any amino acid residue that one of skill in the art would understand to align to the particular residue (e.g., of the particular sequence). For example, any one of SEQ ID NOs: 1-95 may be mutated to include a YTE mutation.


In some embodiments of any of the aspects described herein, wherein E includes an Fc domain monomer, the Fc domain monomer (e.g., the Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95) includes a double mutant corresponding to M428L/N434S (LS). As used herein, an amino acid “corresponding to” a particular amino acid residue (e.g., or a particular SEQ ID NO.) should be understood to include any amino acid residue that one of skill in the art would understand to align to the particular residue (e.g., of the particular sequence). For example, any one of SEQ ID NOs: 1-95 may be mutated to include a LS mutation.


In some embodiments of any of the aspects described herein, wherein E includes an Fc domain monomer, the Fc domain monomer (e.g., the Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95) includes a mutant corresponding to N434H. As used herein, an amino acid “corresponding to” a particular amino acid residue (e.g., of a particular SEQ ID NO.) should be understood to include any amino acid residue that one of skill in the art would understand to align to the particular residue (e.g., of the particular sequence). For example, any one of SEQ ID NOs: 1-95 may be mutated to include an N434H mutation.


In some embodiments of any of the aspects described herein, wherein E includes an Fc domain monomer, the Fc domain monomer (e.g., the Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95) includes a mutant corresponding to C220S. As used herein, an amino acid “corresponding to” a particular amino acid residue (e.g., or a particular SEQ ID NO.) should be understood to include any amino acid residue that one of skill in the art would understand to align to the particular residue (e.g., of the particular sequence). For example, any one of SEQ ID NOs: 1-95 may be mutated to include a C220S mutation.


In some embodiments of any of the aspects described herein, wherein E includes an Fc domain monomer, the Fc domain monomer (e.g., the Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95) is a fragment of the Fc domain monomer (e.g., a fragment of at least 25 (e.g., 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50 or more), at least 50 (e.g., 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75 or more), at least 75 (e.g., 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 or more) consecutive amino acids in length from SEQ ID NOs: 1-95.


In some embodiments of any of the aspects described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)), one or more nitrogen atoms of one or more surface exposed lysine residues of E or one or more sulfur atoms of one or more surface exposed cysteines in E is covalently conjugated to a linker (e.g., a PEG2-PEG20 linker). The linker conjugated to E may be functionalized such that it may react to form a covalent bond with the L of any A1-L or any A2-L-A1 described herein. In preferred embodiments, E is conjugated to a linker functionalized with an azido group and the L of A1-L or any A2-L-A1 is functionalized with an alkyne group. Conjugation (e.g., by click chemistry) of the linker-azido of E and the linker-alkyne of A1-L or A2-L-A1 forms a conjugate of the invention, for example a conjugate described by any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII). In yet other embodiments, E is conjugated to a linker functionalized with an alkyne group and L of an A1-L or of any A2-L-A1 is functionalized with an azido group. Conjugation (e.g., by click chemistry) of the linker-alkyne of E and linker-azido of A1-L or of any A2-L-A1 forms a conjugate of the invention, for example a conjugate described by any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII).


In some embodiments of any of the aspects described herein, the squiggly line of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII) represents a covalent bond between the L of A1-L or A2-L-A1 or L′ of A1-L′ or A1-L′-A2.


In some embodiments of any of the aspects described herein, the squiggly line of any one of formulas (1), (2), (D-I)-(D-X), (D′-1), (M-I)-(M-X), or (M′-I) represents that one or more amino acid side chains of E (e.g., one or more nitrogen atoms of one or more surface exposed lysine residues of E or one or more sulfur atoms of one or more surface exposed cysteines in E) have been conjugated to a linker (e.g., a PEG2-PEG20 linker) wherein the linker has been functionalized with a reactive moiety, such that the reactive moiety forms a covalent bond with the L of any A1-L or any A2-L-A1 described herein (e.g., by click chemistry between an azido functionalized linker and an alkyne functionalized linker, as described above).


In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by (A-I):




embedded image


In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:




embedded image


In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:




embedded image


In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:




embedded image


In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:




embedded image


In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:




embedded image


In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:




embedded image


In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:




embedded image


In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:




embedded image


In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:




embedded image


In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:




embedded image


In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:




embedded image


In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:




embedded image


In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:




embedded image


In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:




embedded image


In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:




embedded image


In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:




embedded image


In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:




embedded image


In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:




embedded image


In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:




embedded image


In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:




embedded image


In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by (A-II):




embedded image


In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by (A-IIaa):




embedded image


In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by (A-IIbb):




embedded image


In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by (A-IIcc):




embedded image


In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by (A-IIdd):




embedded image


In some embodiments, the conjugate is conjugate 1, or any regioisomer thereof, and the drug-to-antibody ratio (DAR) (e.g., T) is between 0.5 and 10.0, e.g., 0.5, 0.6, 0.7, 0.8, 0.9, 1, 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, 2, 2.1, 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 2.8, 2.9, 3, 3.1, 3.2, 3.3, 3.4, 3.5, 3.6, 3.7, 3.8, 3.9, 4, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, 5.0, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6.0, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, 7.0, 7.1, 7.2, 7.3, 7.4, 7.5, 7.6, 7.7, 7.8.0, 7.9, 8, 8.1, 8.2, 8.3, 8.4, 8.5, 8.6, 8.7, 8.8, 8.9, 9.0, 9.1, 9.2, 9.3, 9.4, 9.5, 9.6, 9.7, 9.8, 9.9, or 10.0. In some embodiments the DAR is between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.


In some embodiments, the conjugate is conjugate 2, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., 0.5, 0.6, 0.7, 0.8, 0.9, 1, 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, 2, 2.1, 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 2.8, 2.9, 3, 3.1, 3.2, 3.3, 3.4, 3.5, 3.6, 3.7, 3.8, 3.9, 4, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, 5.0, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6.0, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, 7.0, 7.1, 7.2, 7.3, 7.4, 7.5, 7.6, 7.7, 7.8.0, 7.9, 8, 8.1, 8.2, 8.3, 8.4, 8.5, 8.6, 8.7, 8.8, 8.9, 9.0, 9.1, 9.2, 9.3, 9.4, 9.5, 9.6, 9.7, 9.8, 9.9, or 10.0. In some embodiments the DAR is between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.


In some embodiments, the conjugate is conjugate 3, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., 0.5, 0.6, 0.7, 0.8, 0.9, 1, 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, 2, 2.1, 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 2.8, 2.9, 3, 3.1, 3.2, 3.3, 3.4, 3.5, 3.6, 3.7, 3.8, 3.9, 4, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, 5.0, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6.0, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, 7.0, 7.1, 7.2, 7.3, 7.4, 7.5, 7.6, 7.7, 7.8.0, 7.9, 8, 8.1, 8.2, 8.3, 8.4, 8.5, 8.6, 8.7, 8.8, 8.9, 9.0, 9.1, 9.2, 9.3, 9.4, 9.5, 9.6, 9.7, 9.8, 9.9, or 10.0. In some embodiments the DAR is between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.


In some embodiments, the conjugate is conjugate 4, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., 0.5, 0.6, 0.7, 0.8, 0.9, 1, 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, 2, 2.1, 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 2.8, 2.9, 3, 3.1, 3.2, 3.3, 3.4, 3.5, 3.6, 3.7, 3.8, 3.9, 4, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, 5.0, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6.0, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, 7.0, 7.1, 7.2, 7.3, 7.4, 7.5, 7.6, 7.7, 7.8.0, 7.9, 8, 8.1, 8.2, 8.3, 8.4, 8.5, 8.6, 8.7, 8.8, 8.9, 9.0, 9.1, 9.2, 9.3, 9.4, 9.5, 9.6, 9.7, 9.8, 9.9, or 10.0. In some embodiments the DAR is between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.


In some embodiments, the conjugate is conjugate 5, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., 0.5, 0.6, 0.7, 0.8, 0.9, 1, 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, 2, 2.1, 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 2.8, 2.9, 3, 3.1, 3.2, 3.3, 3.4, 3.5, 3.6, 3.7, 3.8, 3.9, 4, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, 5.0, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6.0, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, 7.0, 7.1, 7.2, 7.3, 7.4, 7.5, 7.6, 7.7, 7.8.0, 7.9, 8, 8.1, 8.2, 8.3, 8.4, 8.5, 8.6, 8.7, 8.8, 8.9, 9.0, 9.1, 9.2, 9.3, 9.4, 9.5, 9.6, 9.7, 9.8, 9.9, or 10.0. In some embodiments the DAR is between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.


In some embodiments, the conjugate is conjugate 6, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., 0.5, 0.6, 0.7, 0.8, 0.9, 1, 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, 2, 2.1, 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 2.8, 2.9, 3, 3.1, 3.2, 3.3, 3.4, 3.5, 3.6, 3.7, 3.8, 3.9, 4, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, 5.0, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6.0, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, 7.0, 7.1, 7.2, 7.3, 7.4, 7.5, 7.6, 7.7, 7.8.0, 7.9, 8, 8.1, 8.2, 8.3, 8.4, 8.5, 8.6, 8.7, 8.8, 8.9, 9.0, 9.1, 9.2, 9.3, 9.4, 9.5, 9.6, 9.7, 9.8, 9.9, or 10.0. In some embodiments the DAR is between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.


In some embodiments, the conjugate is conjugate 7, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., 0.5, 0.6, 0.7, 0.8, 0.9, 1, 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, 2, 2.1, 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 2.8, 2.9, 3, 3.1, 3.2, 3.3, 3.4, 3.5, 3.6, 3.7, 3.8, 3.9, 4, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, 5.0, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6.0, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, 7.0, 7.1, 7.2, 7.3, 7.4, 7.5, 7.6, 7.7, 7.8.0, 7.9, 8, 8.1, 8.2, 8.3, 8.4, 8.5, 8.6, 8.7, 8.8, 8.9, 9.0, 9.1, 9.2, 9.3, 9.4, 9.5, 9.6, 9.7, 9.8, 9.9, or 10.0. In some embodiments the DAR is between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.


In some embodiments, the conjugate is conjugate 8, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., 0.5, 0.6, 0.7, 0.8, 0.9, 1, 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, 2, 2.1, 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 2.8, 2.9, 3, 3.1, 3.2, 3.3, 3.4, 3.5, 3.6, 3.7, 3.8, 3.9, 4, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, 5.0, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6.0, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, 7.0, 7.1, 7.2, 7.3, 7.4, 7.5, 7.6, 7.7, 7.8.0, 7.9, 8, 8.1, 8.2, 8.3, 8.4, 8.5, 8.6, 8.7, 8.8, 8.9, 9.0, 9.1, 9.2, 9.3, 9.4, 9.5, 9.6, 9.7, 9.8, 9.9, or 10.0. In some embodiments the DAR is between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.


In some embodiments, the conjugate is conjugate 9, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., 0.5, 0.6, 0.7, 0.8, 0.9, 1, 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, 2, 2.1, 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 2.8, 2.9, 3, 3.1, 3.2, 3.3, 3.4, 3.5, 3.6, 3.7, 3.8, 3.9, 4, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, 5.0, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6.0, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, 7.0, 7.1, 7.2, 7.3, 7.4, 7.5, 7.6, 7.7, 7.8.0, 7.9, 8, 8.1, 8.2, 8.3, 8.4, 8.5, 8.6, 8.7, 8.8, 8.9, 9.0, 9.1, 9.2, 9.3, 9.4, 9.5, 9.6, 9.7, 9.8, 9.9, or 10.0. In some embodiments the DAR is between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.


In some embodiments, the conjugate is conjugate 10, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., 0.5, 0.6, 0.7, 0.8, 0.9, 1, 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, 2, 2.1, 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 2.8, 2.9, 3, 3.1, 3.2, 3.3, 3.4, 3.5, 3.6, 3.7, 3.8, 3.9, 4, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, 5.0, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6.0, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, 7.0, 7.1, 7.2, 7.3, 7.4, 7.5, 7.6, 7.7, 7.8.0, 7.9, 8, 8.1, 8.2, 8.3, 8.4, 8.5, 8.6, 8.7, 8.8, 8.9, 9.0, 9.1, 9.2, 9.3, 9.4, 9.5, 9.6, 9.7, 9.8, 9.9, or 10.0. In some embodiments the DAR is between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.


In some embodiments, the conjugate is conjugate 11, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., 0.5, 0.6, 0.7, 0.8, 0.9, 1, 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, 2, 2.1, 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 2.8, 2.9, 3, 3.1, 3.2, 3.3, 3.4, 3.5, 3.6, 3.7, 3.8, 3.9, 4, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, 5.0, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6.0, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, 7.0, 7.1, 7.2, 7.3, 7.4, 7.5, 7.6, 7.7, 7.8.0, 7.9, 8, 8.1, 8.2, 8.3, 8.4, 8.5, 8.6, 8.7, 8.8, 8.9, 9.0, 9.1, 9.2, 9.3, 9.4, 9.5, 9.6, 9.7, 9.8, 9.9, or 10.0. In some embodiments the DAR is between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.


In some embodiments, the conjugate is conjugate 12, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., 0.5, 0.6, 0.7, 0.8, 0.9, 1, 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, 2, 2.1, 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 2.8, 2.9, 3, 3.1, 3.2, 3.3, 3.4, 3.5, 3.6, 3.7, 3.8, 3.9, 4, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, 5.0, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6.0, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, 7.0, 7.1, 7.2, 7.3, 7.4, 7.5, 7.6, 7.7, 7.8.0, 7.9, 8, 8.1, 8.2, 8.3, 8.4, 8.5, 8.6, 8.7, 8.8, 8.9, 9.0, 9.1, 9.2, 9.3, 9.4, 9.5, 9.6, 9.7, 9.8, 9.9, or 10.0. In some embodiments the DAR is between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.


In some embodiments, the conjugate is conjugate 13, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., 0.5, 0.6, 0.7, 0.8, 0.9, 1, 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, 2, 2.1, 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 2.8, 2.9, 3, 3.1, 3.2, 3.3, 3.4, 3.5, 3.6, 3.7, 3.8, 3.9, 4, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, 5.0, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6.0, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, 7.0, 7.1, 7.2, 7.3, 7.4, 7.5, 7.6, 7.7, 7.8.0, 7.9, 8, 8.1, 8.2, 8.3, 8.4, 8.5, 8.6, 8.7, 8.8, 8.9, 9.0, 9.1, 9.2, 9.3, 9.4, 9.5, 9.6, 9.7, 9.8, 9.9, or 10.0. In some embodiments the DAR is between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.


In some embodiments, the conjugate is conjugate 14, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., 0.5, 0.6, 0.7, 0.8, 0.9, 1, 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, 2, 2.1, 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 2.8, 2.9, 3, 3.1, 3.2, 3.3, 3.4, 3.5, 3.6, 3.7, 3.8, 3.9, 4, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, 5.0, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6.0, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, 7.0, 7.1, 7.2, 7.3, 7.4, 7.5, 7.6, 7.7, 7.8.0, 7.9, 8, 8.1, 8.2, 8.3, 8.4, 8.5, 8.6, 8.7, 8.8, 8.9, 9.0, 9.1, 9.2, 9.3, 9.4, 9.5, 9.6, 9.7, 9.8, 9.9, or 10.0. In some embodiments the DAR is between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.


In some embodiments, a population of conjugates described herein has a DAR (e.g., T) of between 1 and 2, 2 and 4, 4 and 6, 6 and 8, 8 and 10, 1 and 10, 1 and 20, 1 and 5, 3 and 7, 5 and 10, or 10 and 20.


In some embodiments, the Fc domain monomer includes less than about 300 amino acid residues (e.g., less than about 300, less than about 295, less than about 290, less than about 285, less than about 280, less than about 275, less than about 270, less than about 265, less than about 260, less than about 255, less than about 250, less than about 245, less than about 240, less than about 235, less than about 230, less than about 225, or less than about 220 amino acid residues). In some embodiments, the Fc domain monomer is less than about 40 kDa (e.g., less than about 35 kDa, less than about 30 kDa, less than about 25 kDa).


In some embodiments, the Fc domain monomer includes at least 200 amino acid residues (e.g., at least 210, at least 220, at least 230, at least 240, at least 250, at least 260, at least 270, at least 280, at least 290, or at least 300 amino residues). In some embodiments, the Fc domain monomer is at least 20 kDa (e.g., at least 25 kDa, at least 30 kDa, or at least 35 kDa).


In some embodiments, the Fc domain monomer includes 200 to 400 amino acid residues (e.g., 200 to 250, 250 to 300, 300 to 350, 350 to 400, 200 to 300, 250 to 350, or 300 to 400 amino acid residues). In some embodiments, the Fc domain monomer is 20 to 40 kDa (e.g., 20 to 25 kDa, 25 to 30 kDa, 35 to 40 kDa, 20 to 30 kDa, 25 to 35 kDa, or 30 to 40 KDa).


In some embodiments, the Fc domain monomer includes an amino acid sequence at least 90% identical (e.g., at least 95%, at least 98%) to the sequence of any one of SEQ ID NOs: 1-95, or a region thereof. In some embodiments, the Fc domain monomer includes the amino acid sequence of any one of SEQ ID NOs: 1-95, or a region thereof.


In some embodiments, the Fc domain monomer includes a region of any one of SEQ ID NOs: 1-95, wherein the region includes positions 220, 252, 254, and 256. In some embodiments, the region includes at least 40 amino acid residues, at least 50 amino acid residues, at least 60 amino acid residues, at least 70 amino acids residues, at least 80 amino acids residues, at least 90 amino acid residues, at least 100 amino acid residues, at least 110 amino acid residues, at least 120 amino residues, at least 130 amino acid residues, at least 140 amino acid residues, at least 150 amino acid residues, at least 160 amino acid residues, at least 170 amino acid residues, at least 180 amino acid residues, at least 190 amino acid residues, or at least 200 amino acid residues.


Definitions

To facilitate the understanding of this invention, a number of terms are defined below. Terms defined herein have meanings as commonly understood by a person of ordinary skill in the areas relevant to the present invention. Terms such as “a”, “an,” and “the” are not intended to refer to only a singular entity, but include the general class of which a specific example may be used for illustration. The terminology herein is used to describe specific embodiments of the invention, but their usage does not delimit the invention, except as outlined in the claims.


By “viral infection” is meant the pathogenic growth of a virus (e.g., the human immunodeficiency virus) in a host organism (e.g., a human subject). A viral infection can be any situation in which the presence of a viral population(s) is damaging to a host body. Thus, a subject is “suffering” from a viral infection when an excessive amount of a viral population is present in or on the subject's body, or when the presence of a viral population(s) is damaging the cells or other tissue of the subject.


As used herein, the term “Fc domain monomer” refers to a polypeptide chain that includes at least a hinge domain and second and third antibody constant domains (CH2 and CH3) or functional fragments thereof (e.g., fragments that that capable of (i) dimerizing with another Fc domain monomer to form an Fc domain, and (ii) binding to an Fc receptor. The Fc domain monomer can be any immunoglobulin antibody isotype, including IgG, IgE, IgM, IgA, or IgD (e.g., IgG). Additionally, the Fc domain monomer can be an IgG subtype (e.g., IgG1, IgG2a, IgG2b, IgG3, or IgG4) (e.g., IgG1). An Fc domain monomer does not include any portion of an immunoglobulin that is capable of acting as an antigen-recognition region, e.g., a variable domain or a complementarity determining region (CDR). Fc domain monomers in the conjugates as described herein can contain one or more changes from a wild-type Fc domain monomer sequence (e.g., 1-10, 1-8, 1-6, 1-4 amino acid substitutions, additions, or deletions) that alter the interaction between an Fc domain and an Fc receptor. Examples of suitable changes are known in the art. In certain embodiments, a human Fc domain monomer (e.g., an IgG heavy chain, such as IgG1) includes a region that extends from any of Asn208, Glu216, Asp221, Lys222, or Cys226 to the carboxyl-terminus of the heavy chain at Lys447. C-terminal Lys447 of the Fc region may or may not be present, without affecting the structure or stability of the Fc region. Unless otherwise specified herein, numbering of amino acid residues in the IgG or Fc domain monomer is according to the EU numbering system for antibodies, also called the Kabat EU index, as described, for example, in Kabat et al., Sequences of Proteins of Immunological Interest, 5th Ed. Public Health Service, National Institutes of Health, Bethesda, Md., 1991.


As used herein, the term “Fc domain” refers to a dimer of two Fc domain monomers that is capable of binding an Fc receptor. In the wild-type Fc domain, the two Fc domain monomers dimerize by the interaction between the two CH3 antibody constant domains, in some embodiments, one or more disulfide bonds form between the hinge domains of the two dimerizing Fc domain monomers.


The term “covalently attached” refers to two parts of a conjugate that are linked to each other by a covalent bond formed between two atoms in the two parts of the conjugate.


As used herein, the term “Fc-binding peptide” refers to refers to a polypeptide having an amino acid sequence of 5 to 50 (e.g., 5 to 40, 5 to 30, 5 to 20, 5 to 15, 5 to 10, 10 to 50, 10 to 30, or 10 to 20) amino acid residues that has affinity for and functions to bind an Fc domain, such as any of the Fc domain described herein. An Fc-binding peptide can be of different origins, e.g., synthetic, human, mouse, or rat. Fc-binding peptides of the invention include Fc-binding peptides which have been engineered to include one or more (e.g., two, three, four, or five) solvent-exposed cysteine or lysine residues, which may provide a site for conjugation to a compound of the invention (e.g., conjugation to a gp120 binder monomer or dimer, including by way of a linker). Most preferably, the Fc-binding peptide will contain a single solvent-exposed cysteine or lysine, thus enabling site-specific conjugation of a compound of the invention. Fc-binding peptides may include only naturally occurring amino acid residues, or may include one or more non-naturally occurring amino acid residues. Where included, a non-naturally occurring amino acid residue (e.g., the side chain of a non-naturally occurring amino acid residue) may be used as the point of attachment for a compound of the invention (e.g., a gp120 binder monomer or dimer, including by way of a linker). Fc-binding peptides of the invention may be linear or cyclic. Fc-binding peptides of the invention include any Fc-binding peptides known to one of skill in the art.


As used here, the term “albumin protein” refers to a polypeptide comprising an amino acid sequence corresponding to a naturally-occurring albumin protein (e.g., human serum albumin) or a variant thereof, such as an engineered variant of a naturally-occurring albumin protein. Variants of albumin proteins include polymorphisms, fragments such as domains and sub-domains, and fusion proteins (e.g., an albumin protein having a C-terminal or N-terminal fusion, such as a polypeptide linker). Preferably the albumin protein has the amino acid sequence of human serum albumin (HSA) or a variant or fragment thereof, most preferably a functional variant or fragment thereof. Albumin proteins of the invention include proteins having at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity to any one of SEQ ID NOs: 96-98. Albumin proteins of the invention include albumin proteins which have been engineered to include one or more (e.g., two, three, four, or five) solvent-exposed cysteine or lysine residues, which may provide a site for conjugation to a compound of the invention (e.g., conjugation to a gp120 binder monomer or dimer, including by way of a linker). Most preferably, the albumin protein will contain a single solvent-exposed cysteine or lysine, thus enabling site-specific conjugation of a compound of the invention. Albumin proteins may include only naturally occurring amino acid residues, or may include one or more non-naturally occurring amino acid residues. Where included, a non-naturally occurring amino acid residue (e.g., the side chain of a non-naturally occurring amino acid residue) may be used as the point of attachment for a compound of the invention (e.g., a gp120 binder monomer or dimer, including by way of a linker).


As used herein, the term “albumin protein-binding peptide” refers to a polypeptide having an amino acid sequence of 5 to 50 (e.g., 5 to 40, 5 to 30, 5 to 20, 5 to 15, 5 to 10, 10 to 50, 10 to 30, or 10 to 20) amino acid residues that has affinity for and functions to bind an albumin protein, such as any of the albumin proteins described herein. Preferably, the albumin protein-binding peptide binds to a naturally-occurring serum albumin, most preferably human serum albumin. An albumin protein-binding peptide can be of different origins, e.g., synthetic, human, mouse, or rat. Albumin protein-binding peptides of the invention include albumin protein-binding peptides which have been engineered to include one or more (e.g., two, three, four, or five) solvent-exposed cysteine or lysine residues, which may provide a site for conjugation to a compound of the invention (e.g., conjugation to a gp120 binder monomer or dimer, including by way of a linker). Most preferably, the albumin protein-binding peptide will contain a single solvent-exposed cysteine or lysine, thus enabling site-specific conjugation of a compound of the invention. Albumin protein-binding peptides may include only naturally occurring amino acid residues, or may include one or more non-naturally occurring amino acid residues. Where included, a non-naturally occurring amino acid residue (e.g., the side chain of a non-naturally occurring amino acid residue) may be used as the point of attachment for a compound of the invention (e.g., a gp120 binder monomer or dimer, including by way of a linker). Albumin protein-binding peptides of the invention may be linear or cyclic. Albumin protein-binding peptide of the invention include any albumin protein-binding peptides known to one of skill in the art, examples of which, are provided herein. Further exemplary albumin protein-binding peptides are provided in U.S. Patent Application No. 2005/0287153, which is incorporated herein by reference in its entirety.


As used-herein, a “surface exposed amino acid” or “solvent-exposed amino acid,” such as a surface exposed cysteine or a surface exposed lysine refers to an amino acid that is accessible to the solvent surrounding the protein. A surface exposed amino acid may be a naturally-occurring or an engineered variant (e.g., a substitution or insertion) of the protein. In some embodiments, a surface exposed amino acid is an amino acid that when substituted does not substantially change the three-dimensional structure of the protein.


The terms “linker,” “L,” and “L′,” as used herein, refer to a covalent linkage or connection between two or more components in a conjugate (e.g., between two gp120 binders in a conjugate described herein, between a gp120 binder and an Fc domain or albumin protein in a conjugate described herein, and between a dimer of two gp120 binders and an Fc domain or an albumin protein in a conjugate described herein). In some embodiments, a conjugate described herein may contain a linker that has a trivalent structure (e.g., a trivalent linker). A trivalent linker has three arms, in which each arm is covalently linked to a component of the conjugate (e.g., a first arm conjugated to a first gp120 binder, a second arm conjugated to a second gp120 binder, and a third arm conjugated to an Fc domain or an albumin protein).


Molecules that may be used as linkers include at least two functional groups, which may be the same or different, e.g., two carboxylic acid groups, two amine groups, two sulfonic acid groups, a carboxylic acid group and a maleimide group, a carboxylic acid group and an alkyne group, a carboxylic acid group and an amine group, a carboxylic acid group and a sulfonic acid group, an amine group and a maleimide group, an amine group and an alkyne group, or an amine group and a sulfonic acid group. The first functional group may form a covalent linkage with a first component in the conjugate and the second functional group may form a covalent linkage with the second component in the conjugate. In some embodiments of a trivalent linker, two arms of a linker may contain two dicarboxylic acids, in which the first carboxylic acid may form a covalent linkage with the first gp120 binder in the conjugate and the second carboxylic acid may form a covalent linkage with the second gp120 binder in the conjugate, and the third arm of the linker may for a covalent linkage with an Fc domain or albumin protein in the conjugate. Examples of dicarboxylic acids are described further herein. In some embodiments, a molecule containing one or more maleimide groups may be used as a linker, in which the maleimide group may form a carbon-sulfur linkage with a cysteine in a component (e.g., an Fc domain monomer, an Fc domain, or an albumin protein) in the conjugate. In some embodiments, a molecule containing one or more alkyne groups may be used as a linker, in which the alkyne group may form a 1,2,3-triazole linkage with an azide in a component (e.g., an Fc domain monomer, an Fc domain, or an albumin protein) in the conjugate. In some embodiments, a molecule containing one or more azide groups may be used as a linker, in which the azide group may form a 1,2,3-triazole linkage with an alkyne in a component (e.g., an Fc domain monomer, an Fc domain, or an albumin protein) in the conjugate. In some embodiments, a molecule containing one or more bis-sulfone groups may be used as a linker, in which the bis-sulfone group may form a linkage with an amine group a component (e.g., an Fc domain monomer, an Fc domain, or an albumin protein) in the conjugate. In some embodiments, a molecule containing one or more sulfonic acid groups may be used as a linker, in which the sulfonic acid group may form a sulfonamide linkage with a component in the conjugate. In some embodiments, a molecule containing one or more isocyanate groups may be used as a linker, in which the isocyanate group may form a urea linkage with a component in the conjugate. In some embodiments, a molecule containing one or more haloalkyl groups may be used as a linker, in which the haloalkyl group may form a covalent linkage, e.g., C—N and C—O linkages, with a component in the conjugate.


In some embodiments, a linker provides space, rigidity, and/or flexibility between the two or more components. In some embodiments, a linker may be a bond, e.g., a covalent bond. The term “bond” refers to a chemical bond, e.g., an amide bond, a disulfide bond, a C—O bond, a C—N bond, a N—N bond, a C—S bond, or any kind of bond created from a chemical reaction, e.g., chemical conjugation. In some embodiments, a linker includes no more than 250 atoms. In some embodiments, a linker includes no more than 250 non-hydrogen atoms. In some embodiments, the backbone of a linker includes no more than 250 atoms. The “backbone” of a linker refers to the atoms in the linker that together form the shortest path from one part of a conjugate to another part of the conjugate (e.g., the shortest path linking a first gp120 binder and a second gp120 binder). The atoms in the backbone of the linker are directly involved in linking one part of a conjugate to another part of the conjugate (e.g., linking a first gp120 binder and a second gp120 binder). For examples, hydrogen atoms attached to carbons in the backbone of the linker are not considered as directly involved in linking one part of the conjugate to another part of the conjugate.


In some embodiments, a linker may comprise a synthetic group derived from, e.g., a synthetic polymer (e.g., a polyethylene glycol (PEG) polymer). In some embodiments, a linker may comprise one or more amino acid residues, such as D- or L-amino acid residues. In some embodiments, a linker may be a residue of an amino acid sequence (e.g., a 1-25 amino acid, 1-10 amino acid, 1-9 amino acid, 1-8 amino acid, 1-7 amino acid, 1-6 amino acid, 1-5 amino acid, 1-4 amino acid, 1-3 amino acid, 1-2 amino acid, or 1 amino acid sequence). In some embodiments, a linker may comprise one or more, e.g., 1-100, 1-50, 1-25, 1-10, 1-5, or 1-3, optionally substituted alkylene, optionally substituted heteroalkylene (e.g., a PEG unit), optionally substituted alkenylene, optionally substituted heteroalkenylene, optionally substituted alkynylene, optionally substituted heteroalkynylene, optionally substituted cycloalkylene, optionally substituted heterocycloalkylene, optionally substituted cycloalkenylene, optionally substituted heterocycloalkenylene, optionally substituted cycloalkynylene, optionally substituted heterocycloalkynylene, optionally substituted arylene, optionally substituted heteroarylene (e.g., pyridine), O, S, NRi (Ri is H, optionally substituted alkyl, optionally substituted heteroalkyl, optionally substituted alkenyl, optionally substituted heteroalkenyl, optionally substituted alkynyl, optionally substituted heteroalkynyl, optionally substituted cycloalkyl, optionally substituted heterocycloalkyl, optionally substituted cycloalkenyl, optionally substituted heterocycloalkenyl, optionally substituted cycloalkynyl, optionally substituted heterocycloalkynyl, optionally substituted aryl, or optionally substituted heteroaryl), P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino. For example, a linker may comprise one or more optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene (e.g., a PEG unit), optionally substituted C2-C20 alkenylene (e.g., C2 alkenylene), optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene (e.g., cyclopropylene, cyclobutylene), optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene (e.g., C6 arylene), optionally substituted C3-C15 heteroarylene (e.g., imidazole, pyridine), O, S, NRi (Ri is H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C3-C15 heteroaryl), P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino.


The terms “alkyl,” “alkenyl,” and “alkynyl,” as used herein, include straight-chain and branched-chain monovalent substituents, as well as combinations of these, containing only C and H when unsubstituted. When the alkyl group includes at least one carbon-carbon double bond or carbon-carbon triple bond, the alkyl group can be referred to as an “alkenyl” or “alkynyl” group respectively. The monovalency of an alkyl, alkenyl, or alkynyl group does not include the optional substituents on the alkyl, alkenyl, or alkynyl group. For example, if an alkyl, alkenyl, or alkynyl group is attached to a compound, monovalency of the alkyl, alkenyl, or alkynyl group refers to its attachment to the compound and does not include any additional substituents that may be present on the alkyl, alkenyl, or alkynyl group. In some embodiments, the alkyl or heteroalkyl group may contain, e.g., 1-20. 1-18, 1-16, 1-14, 1-12, 1-10, 1-8, 1-6, 1-4, or 1-2 carbon atoms (e.g., C1-C20, C1-C18, C1-C16, C1-C14, C1-C12, C1-C10, C1-C8, C1-C6, C1-C4, or C1-C2). In some embodiments, the alkenyl, heteroalkenyl, alkynyl, or heteroalkynyl group may contain, e.g., 2-20, 2-18, 2-16, 2-14, 2-12, 2-10, 2-8, 2-6, or 2-4 carbon atoms (e.g., C2-C20, C2-C18, C2-C16, C2-C14, C2-C12, C2-C10, C2-C8, C2-C6, or C2-C4). Examples include, but are not limited to, methyl, ethyl, isobutyl, sec-butyl, tert-butyl, 2-propenyl, and 3-butynyl.


The term “cycloalkyl,” as used herein, represents a monovalent saturated or unsaturated non-aromatic cyclic alkyl group. A cycloalkyl may have, e.g., three to twenty carbons (e.g., a C3-C7, C3-C8, C3-C9, C3-C10, C3-C11, C3-C12, C3-C14, C3-C16, C3-C18, or C3-C20 cycloalkyl). Examples of cycloalkyls include, but are not limited to, cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl, and cycloheptyl. When the cycloalkyl group includes at least one carbon-carbon double bond, the cycloalkyl group can be referred to as a “cycloalkenyl” group. A cycloalkenyl may have, e.g., four to twenty carbons (e.g., a C4-C7, C4-C8, C4-C9, C4-C10, C4-C11, C4-C12, C4-C14, C4-C16, C4-C18, or C4-C20 cycloalkenyl). Exemplary cycloalkenyl groups include, but are not limited to, cyclopentenyl, cyclohexenyl, and cycloheptenyl. When the cycloalkyl group includes at least one carbon-carbon triple bond, the cycloalkyl group can be referred to as a “cycloalkynyl” group. A cycloalkynyl may have, e.g., eight to twenty carbons (e.g., a C8-C9, C8-C10, C8-C11, C8-C12, C8-C14, C8-C16, C8-C18, or C8-C20 cycloalkynyl). The term “cycloalkyl” also includes a cyclic compound having a bridged multicyclic structure in which one or more carbons bridges two non-adjacent members of a monocyclic ring, e.g., bicyclo[2.2.1.]heptyl and adamantane. The term “cycloalkyl” also includes bicyclic, tricyclic, and tetracyclic fused ring structures, e.g., decalin and spiro cyclic compounds. A “heterocycloalkyl,” “heterocycloalkenyl,” or “heterocycloalkynyl” group refers to a cycloalkyl, cycloalkenyl, or cycloalkynyl group having one or more rings (e.g., 1, 2, 3, 4 or more rings) that has one or more heteroatoms independently selected from, e.g., N, O, and S. Exemplary heterocycloalkyl groups include pyrrolidine, thiophene, thiolane, tetrahydrofuran, piperidine, tetrahydropyran, quinoline, isoquinoline, cinnoline, phthalazine, quinazoline, quinoxaline, indole, benzothiophene, benzofuran, isoindole, benzo[c]thiophene, isobenzofuran, benzimidazole, benzoxazole, benzothiazole, 1H-indazole, 1,2,benzisoxazole, 1,2-benzisothiazole, 2,1-benzisothiazole, 2,1-benzisoxazole, purine, pyrrolizidine, indene, fluorene, carbazole, dibenzofuran, acridine, phenazine, and phenoxazine.


The term “aryl,” as used herein, refers to any monocyclic or fused ring bicyclic or tricyclic system which has the characteristics of aromaticity in terms of electron distribution throughout the ring system, e.g., phenyl, naphthyl, or phenanthrene. In some embodiments, a ring system contains 5-15 ring member atoms or 5-10 ring member atoms. An aryl group may have, e.g., five to fifteen carbons (e.g., a C5-C6, C5-C7, C5-C8, C5-C9, C5-C10, C5-C11, C5-C12, C5-C13, C5-C14, or C5-C15 aryl). The term “heteroaryl” also refers to such monocyclic or fused bicyclic ring systems containing one or more, e.g., 1-4, 1-3, 1, 2, 3, or 4, heteroatoms selected from O, S and N. A heteroaryl group may have, e.g., two to fifteen ring member atoms (e.g., a C2-C3, C2-C4, C2-C5, C2-C6, C2-C7, C2-C8, C2-C9, C2-C10, C2-C11, C2-C12, C2-C13, C2-C14, or C3-C15 heteroaryl). The inclusion of a heteroatom permits inclusion of 5-membered rings to be considered aromatic as well as 6-membered rings. Thus, typical heteroaryl systems include, e.g., pyridyl, pyrimidyl, indolyl, benzimidazolyl, benzotriazolyl, isoquinolyl, quinolyl, benzothiazolyl, benzofuranyl, thienyl, furyl, pyrrolyl, thiazolyl, triazolyl (e.g., 1,2,3- or 1,2,4-triazolyl) oxazolyl, isoxazolyl, benzoxazolyl, benzoisoxazolyl, and imidazolyl. Because tautomers are possible, a group such as phthalimido is also considered heteroaryl. In some embodiments, the aryl or heteroaryl group is a 5- or 6-membered aromatic rings system optionally containing 1-2 nitrogen atoms. In some embodiments, the aryl or heteroaryl group is an optionally substituted phenyl, pyridyl, indolyl, pyrimidyl, pyridazinyl, benzothiazolyl, benzimidazolyl, pyrazolyl, imidazolyl, isoxazolyl, thiazolyl, or imidazopyridinyl. In some embodiments, the aryl group is phenyl. In some embodiments, an aryl group may be optionally substituted with a substituent such an aryl substituent, e.g., biphenyl.


The term “alkaryl,” refers to an aryl group that is connected to an alkylene, alkenylene, or alkynylene group. In general, if a compound is attached to an alkaryl group, the alkylene, alkenylene, or alkynylene portion of the alkaryl is attached to the compound. In some embodiments, an alkaryl is C6-C35 alkaryl (e.g., C6-C16, C6-C14, C6-C12, C6-C10, C6-C9, C6-C8, C7, or C6 alkaryl), in which the number of carbons indicates the total number of carbons in both the aryl portion and the alkylene, alkenylene, or alkynylene portion of the alkaryl. Examples of alkaryls include, but are not limited to, (C1-C8)alkylene(C6-C12)aryl, (C2-C8)alkenylene(C6-C12)aryl, or (C2-C8)alkynylene(C6-C12)aryl. In some embodiments, an alkaryl is benzyl or phenethyl. In a heteroalkaryl, one or more heteroatoms selected from N, O, and S may be present in the alkylene, alkenylene, or alkynylene portion of the alkaryl group and/or may be present in the aryl portion of the alkaryl group. In an optionally substituted alkaryl, the substituent may be present on the alkylene, alkenylene, or alkynylene portion of the alkaryl group and/or may be present on the aryl portion of the alkaryl group.


The term “amino,” as used herein, represents —N(Rx)2 or —N+(Rx)3, where each Rx is, independently, H, alkyl, alkenyl, alkynyl, aryl, alkaryl, cycloalkyl, or two Rx combine to form a heterocycloalkyl. In some embodiment, the amino group is —NH2.


The term “alkamino,” as used herein, refers to an amino group, described herein, that is attached to an alkylene (e.g., C1-C5 alkylene), alkenylene (e.g., C2-C5 alkenylene), or alkynylene group (e.g., C2-C5 alkenylene). In general, if a compound is attached to an alkamino group, the alkylene, alkenylene, or alkynylene portion of the alkamino is attached to the compound. The amino portion of an alkamino refers to —N(Rx)2 or —N+(Rx)3, where each Rx is, independently, H, alkyl, alkenyl, alkynyl, aryl, alkaryl, cycloalkyl, or two Rx combine to form a heterocycloalkyl. In some embodiment, the amino portion of an alkamino is —NH2. An example of an alkamino group is C1-C5 alkamino, e.g., C2 alkamino (e.g., CH2CH2NH2 or CH2CH2N(CH3)2). In a heteroalkamino group, one or more, e.g., 1-4, 1-3, 1, 2, 3, or 4, heteroatoms selected from N, O, and S may be present in the alkylene, alkenylene, or alkynylene portion of the heteroalkamino group. In some embodiments, an alkamino group may be optionally substituted. In a substituted alkamino group, the substituent may be present on the alkylene, alkenylene, or alkynylene portion of the alkamino group and/or may be present on the amino portion of the alkamino group.


The term “alkamide,” as used herein, refers to an amide group that is attached to an alkylene (e.g., C1-C5 alkylene), alkenylene (e.g., C2-C5 alkenylene), or alkynylene (e.g., C2-C5 alkenylene) group. In general, if a compound is attached to an alkamide group, the alkylene, alkenylene, or alkynylene portion of the alkamide is attached to the compound. The amide portion of an alkamide refers to —C(O)—N(Rx)2, where each Rx is, independently, H, alkyl, alkenyl, alkynyl, aryl, alkaryl, cycloalkyl, or two Rx combine to form a heterocycloalkyl. In some embodiment, the amide portion of an alkamide is —C(O)NH2. An alkamide group may be —(CH2)2—C(O)NH2 or —CH2—C(O)NH2. In a heteroalkamide group, one or more, e.g., 1-4, 1-3, 1, 2, 3, or 4, heteroatoms selected from N, O, and S may be present in the alkylene, alkenylene, or alkynylene portion of the heteroalkamide group. In some embodiments, an alkamide group may be optionally substituted. In a substituted alkamide group, the substituent may be present on the alkylene, alkenylene, or alkynylene portion of the alkamide group and/or may be present on the amide portion of the alkamide group.


The terms “alkylene,” “alkenylene,” and “alkynylene,” as used herein, refer to divalent groups having a specified size. In some embodiments, an alkylene may contain, e.g., 1-20, 1-18, 1-16, 1-14, 1-12, 1-10, 1-8, 1-6, 1-4, or 1-2 carbon atoms (e.g., C1-C20, C1-C18, C1-C16, C1-C14, C1-C12, C1-C10, C1-C8, C1-C6, C1-C4, or C1-C2). In some embodiments, an alkenylene or alkynylene may contain, e.g., 2-20, 2-18, 2-16, 2-14, 2-12, 2-10, 2-8, 2-6, or 2-4 carbon atoms (e.g., C2-C20, C2-C18, C2-C16, C2-C14, C2-C12, C2-C10, C2-C8, C2-C6, or C2-C4). Alkylene, alkenylene, and/or alkynylene includes straight-chain and branched-chain forms, as well as combinations of these. The divalency of an alkylene, alkenylene, or alkynylene group does not include the optional substituents on the alkylene, alkenylene, or alkynylene group. For example, two gp120 binders may be attached to each other by way of a linker that includes alkylene, alkenylene, and/or alkynylene, or combinations thereof. Each of the alkylene, alkenylene, and/or alkynylene groups in the linker is considered divalent with respect to the two attachments on either end of alkylene, alkenylene, and/or alkynylene group. For example, if a linker includes -(optionally substituted alkylene)-(optionally substituted alkenylene)-(optionally substituted alkylene)-, the alkenylene is considered divalent with respect to its attachments to the two alkylenes at the ends of the linker. The optional substituents on the alkenylene are not included in the divalency of the alkenylene. The divalent nature of an alkylene, alkenylene, or alkynylene group (e.g., an alkylene, alkenylene, or alkynylene group in a linker) refers to both of the ends of the group and does not include optional substituents that may be present in an alkylene, alkenylene, or alkynylene group. Because they are divalent, they can link together multiple (e.g., two) parts of a conjugate, e.g., a first gp120 binder and a second gp120 binder. Alkylene, alkenylene, and/or alkynylene groups can be substituted by the groups typically suitable as substituents for alkyl, alkenyl and alkynyl groups as set forth herein. For example, C═O is a C1 alkylene that is substituted by an oxo (═O). For example, —HCR—C═C— may be considered as an optionally substituted alkynylene and is considered a divalent group even though it has an optional substituent, R. Heteroalkylene, heteroalkenylene, and/or heteroalkynylene groups refer to alkylene, alkenylene, and/or alkynylene groups including one or more, e.g., 1-4, 1-3, 1, 2, 3, or 4, heteroatoms, e.g., N, O, and S. For example, a polyethylene glycol (PEG) polymer or a PEG unit —(CH2)2—O— in a PEG polymer is considered a heteroalkylene containing one or more oxygen atoms.


The term “cycloalkylene,” as used herein, refers to a divalent cyclic group linking together two parts of a compound. For example, one carbon within the cycloalkylene group may be linked to one part of the compound, while another carbon within the cycloalkylene group may be linked to another part of the compound. A cycloalkylene group may include saturated or unsaturated non-aromatic cyclic groups. A cycloalkylene may have, e.g., three to twenty carbons in the cyclic portion of the cycloalkylene (e.g., a C3-C7, C3-C8, C3-C9, C3-C10, C3-C11, C3-C12, C3-C14, C3-C16, C3-C18, or C3-C20 cycloalkylene). When the cycloalkylene group includes at least one carbon-carbon double bond, the cycloalkylene group can be referred to as a “cycloalkenylene” group. A cycloalkenylene may have, e.g., four to twenty carbons in the cyclic portion of the cycloalkenylene (e.g., a C4-C7, C4-C8, C4-C9. C4-C10, C4-C11, C4-C12, C4-C14, C4-C16, C4-C18, or C4-C20 cycloalkenylene). When the cycloalkylene group includes at least one carbon-carbon triple bond, the cycloalkylene group can be referred to as a “cycloalkynylene” group. A cycloalkynylene may have, e.g., four to twenty carbons in the cyclic portion of the cycloalkynylene (e.g., a C4-C7, C4-C8, C4-C9, C4-C10, C4-C11, C4-C12, C4-C14, C4-C16, C4-C18, or C8-C20 cycloalkynylene). A cycloalkylene group can be substituted by the groups typically suitable as substituents for alkyl, alkenyl and alkynyl groups as set forth herein. Heterocycloalkylene refers to a cycloalkylene group including one or more, e.g., 1-4, 1-3, 1, 2, 3, or 4, heteroatoms, e.g., N, O, and S. Examples of cycloalkylenes include, but are not limited to, cyclopropylene and cyclobutylene. A tetrahydrofuran may be considered as a heterocycloalkylene.


The term “arylene,” as used herein, refers to a multivalent (e.g., divalent or trivalent) aryl group linking together multiple (e.g., two or three) parts of a compound. For example, one carbon within the arylene group may be linked to one part of the compound, while another carbon within the arylene group may be linked to another part of the compound. An arylene may have, e.g., five to fifteen carbons in the aryl portion of the arylene (e.g., a C5-C6, C5-C7, C5-C8, C5-C9, C5-C10, C5-C11, C5-C12, C5-C13, C5-C14, or 05-C15 arylene). An arylene group can be substituted by the groups typically suitable as substituents for alkyl, alkenyl and alkynyl groups as set forth herein. Heteroarylene refers to an aromatic group including one or more, e.g., 1-4, 1-3, 1, 2, 3, or 4, heteroatoms, e.g., N, O, and S. A heteroarylene group may have, e.g., two to fifteen carbons (e.g., a C2-C3, C2-C4, C2-C5, C2-C6, C2-C7, C2-C8, C2-C9, C2-C10, C2-C11, C2-C12, C2-C13, C2-C14, or C3-C15 heteroarylene).


The term “optionally substituted,” as used herein, refers to having 0, 1, or more substituents, such as 0-25, 0-20, 0-10 or 0-5 substituents. Substituents include, but are not limited to, alkyl, alkenyl, alkynyl, aryl, alkaryl, acyl, heteroaryl, heteroalkyl, heteroalkenyl, heteroalkynyl, heteroalkaryl, halogen, oxo, cyano, nitro, amino, alkamino, hydroxy, alkoxy, alkanoyl, carbonyl, carbamoyl, guanidinyl, ureido, amidinyl, any of the groups or moieties described above, and hetero versions of any of the groups or moieties described above. Substituents include, but are not limited to, F, Cl, methyl, phenyl, benzyl, OR, NR2, SR, SOR, SO2R, OCOR, NRCOR, NRCONR2, NRCOOR, OCONR2, RCO, COOR, alkyl-OOCR, SO3R, CONR2, SO2NR2, NRSO2NR2, CN, CF3, OCF3, SiR3, and NO2, wherein each R is, independently, H, alkyl, alkenyl, aryl, heteroalkyl, heteroalkenyl, or heteroaryl, and wherein two of the optional substituents on the same or adjacent atoms can be joined to form a fused, optionally substituted aromatic or nonaromatic, saturated or unsaturated ring which contains 3-8 members, or two of the optional substituents on the same atom can be joined to form an optionally substituted aromatic or nonaromatic, saturated or unsaturated ring which contains 3-8 members.


An optionally substituted group or moiety refers to a group or moiety (e.g., any one of the groups or moieties described above) in which one of the atoms (e.g., a hydrogen atom) is optionally replaced with another substituent. For example, an optionally substituted alkyl may be an optionally substituted methyl, in which a hydrogen atom of the methyl group is replaced by, e.g., OH. As another example, a substituent on a heteroalkyl or its divalent counterpart, heteroalkylene, may replace a hydrogen on a carbon or a hydrogen on a heteroatom such as N. For example, the hydrogen atom in the group —R—NH—R— may be substituted with an alkamide substituent, e.g., —R—N[(CH2C(O)N(CH3)2]—R.


Generally, an optional substituent is a noninterfering substituent. A “noninterfering substituent” refers to a substituent that leaves the ability of the conjugates described herein (e.g., conjugates of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)) to either bind to viral gp41 or gp120 receptor or to inhibit the proliferation of HIV. Thus, in some embodiments, the substituent may alter the degree of such activity. However, as long as the conjugate retains the ability to bind to viral gp41 or gp120 receptor or to inhibit HIV proliferation, the substituent will be classified as “noninterfering.” For example, the noninterfering substituent would leave the ability of the compound to provide antiviral efficacy based on an IC50 value of 10 μM or less in a viral plaque reduction assay. Thus, the substituent may alter the degree of inhibition based on plaque reduction or gp120 receptor inhibition. However, as long as the compound described herein, such as any compound of formula (A-I), retains the ability to inhibit gp120 receptor, the substituent will be classified as “noninterfering.” A number of assays for determining viral plaque reduction or the ability of any compound to inhibit gp120 receptor are available in the art, and some are exemplified in the Examples below.


The term “hetero,” when used to describe a chemical group or moiety, refers to having at least one heteroatom that is not a carbon or a hydrogen, e.g., N, O, and S. Any one of the groups or moieties described above may be referred to as hetero if it contains at least one heteroatom. For example, a heterocycloalkyl, heterocycloalkenyl, or heterocycloalkynyl group refers to a cycloalkyl, cycloalkenyl, or cycloalkynyl group that has one or more heteroatoms independently selected from, e.g., N, O, and S. An example of a heterocycloalkenyl group is a maleimido. For example, a heteroaryl group refers to an aromatic group that has one or more heteroatoms independently selected from, e.g., N, O, and S. One or more heteroatoms may also be included in a substituent that replaced a hydrogen atom in a group or moiety as described herein. For example, in an optionally substituted heteroaryl group, if one of the hydrogen atoms in the heteroaryl group is replaced with a substituent (e.g., methyl), the substituent may also contain one or more heteroatoms (e.g., methanol).


The term “acyl,” as used herein, refers to a group having the structure:




embedded image


Wherein Rz is an optionally substituted alkyl, alkenyl, alkynyl, cycloalkyl, cycloalkenyl, cycloalkynyl, aryl, alkaryl, alkamino, heteroalkyl, heteroalkenyl, heteroalkynyl, heterocycloalkyl, heterocycloalkenyl, heterocycloalkynyl, heteroaryl, heteroalkaryl, or heteroalkamino.


The term “halo” or “halogen,” as used herein, refers to any halogen atom, e.g., F, Cl, Br, or I. Any one of the groups or moieties described herein may be referred to as a “halo moiety” if it contains at least one halogen atom, such as haloalkyl.


The term “hydroxyl,” as used herein, represents an —OH group.


The term “oxo,” as used herein, refers to a substituent having the structure ═O, where there is a double bond between an atom and an oxygen atom.


The term “carbonyl,” as used herein, refers to a group having the structure:




embedded image


The term “thiocarbonyl,” as used herein, refers to a group having the structure:




embedded image


The term “phosphate,” as used herein, represents the group having the structure:




embedded image


The term “phosphoryl,” as used herein, represents the group having the structure:




embedded image


The term “sulfonyl,” as used herein, represents the group having the structure:




embedded image


The term “imino,” as used herein, represents the group having the structure:




embedded image


wherein R is an optional substituent.


The term “N-protecting group,” as used herein, represents those groups intended to protect an amino group against undesirable reactions during synthetic procedures. Commonly used N-protecting groups are disclosed in Greene, “Protective Groups in Organic Synthesis,” 5th Edition (John Wiley & Sons, New York, 2014), which is incorporated herein by reference. N-protecting groups include, e.g., acyl, aryloyl, and carbamyl groups such as formyl, acetyl, propionyl, pivaloyl, t-butylacetyl, 2-chloroacetyl, 2-bromoacetyl, trifluoroacetyl, trichloroacetyl, phthaloyl, o-nitrophenoxyacetyl, α-chlorobutyryl, benzoyl, carboxybenzyl (CBz), 4-chlorobenzoyl, 4-bromobenzoyl, 4-nitrobenzoyl, and chiral auxiliaries such as protected or unprotected D, L or D, L-amino acid residues such as alanine, leucine, phenylalanine; sulfonyl-containing groups such as benzenesulfonyl and p-toluenesulfonyl; carbamate forming groups such as benzyloxycarbonyl, p-chlorobenzyloxycarbonyl, p-methoxybenzyloxycarbonyl, p-nitrobenzyloxycarbonyl, 2-nitrobenzyloxycarbonyl, p-bromobenzyloxycarbonyl, 3,4-dimethoxybenzyloxycarbonyl, 3,5-dimethoxybenzyl oxycarbonyl, 2,4-dimethoxybenzyloxycarbonyl, 4-methoxybenzyloxycarbonyl, 2-nitro-4,5-dimethoxybenzyloxycarbonyl, 3,4,5-trimethoxybenzyloxycarbonyl, 1-(p-biphenylyl)-1-methylethoxycarbonyl, α,α-dimethyl-3,5-dimethoxybenzyloxycarbonyl, benzhydryloxy carbonyl, t-butyloxycarbonyl (BOC), diisopropylmethoxycarbonyl, isopropyloxycarbonyl, ethoxycarbonyl, methoxycarbonyl, allyloxycarbonyl, 2,2,2,-trichloroethoxycarbonyl, phenoxycarbonyl, 4-nitrophenoxy carbonyl, fluorenyl-9-methoxycarbonyl (Fmoc), cyclopentyloxycarbonyl, adamantyloxycarbonyl, cyclohexyloxycarbonyl, and phenylthiocarbonyl; alkaryl groups such as benzyl, triphenylmethyl, and benzyloxymethyl; and silyl groups such as trimethylsilyl.


The term “amino acid,” as used herein, means naturally occurring amino acids and non-naturally occurring amino acids.


The term “naturally occurring amino acids,” as used herein, means amino acids including Ala, Arg, Asn, Asp, Cys, Gln, Glu, Gly, His, lie, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr, and Val.


The term “non-naturally occurring amino acid,” as used herein, means an alpha amino acid that is not naturally produced or found in a mammal. Examples of non-naturally occurring amino acids include D-amino acids; an amino acid having an acetylaminomethyl group attached to a sulfur atom of a cysteine; a pegylated amino acid; the omega amino acids of the formula NH2(CH2)nCOOH where n is 2-6, neutral nonpolar amino acids, such as sarcosine, t-butyl alanine, t-butyl glycine, N-methyl isoleucine, and norleucine; oxomethionine; phenylglycine; citrulline; methionine sulfoxide; cystic acid; ornithine; diaminobutyric acid; 3-aminoalanine; 3-hydroxy-D-proline; 2,4-diaminobutyric acid; 2-aminopentanoic acid; 2-aminooctanoic acid, 2-carboxy piperazine; piperazine-2-carboxylic acid, 2-amino-4-phenylbutanoic acid; 3-(2-naphthyl)alanine, and hydroxyproline. Other amino acids are α-aminobutyric acid, α-amino-α-methylbutyrate, aminocyclopropane-carboxylate, aminoisobutyric acid, aminonorbornyl-carboxylate, L-cyclohexylalanine, cyclopentylalanine, L-N-methylleucine, L-N-methylmethionine, L-N-methylnorvaline, L-N-methylphenylalanine, L-N-methylproline, L-N-methylserine, L-N-methyltryptophan, D-ornithine, L-N-methylethylglycine, L-norleucine, α-methyl-aminoisobutyrate, α-methylcyclohexylalanine, D-α-methylalanine, D-α-methylarginine, D-α-methylasparagine, D-α-methylaspartate, D-α-methylcysteine, D-α-methylglutamine, D-α-methylhistidine, D-α-methylisoleucine, D-α-methylleucine, D-α-methyllysine, D-α-methylmethionine, D-α-methylornithine, D-α-methylphenylalanine, D-α-methylproline, D-α-methylserine, D-N-methylserine, D-α-methylthreonine, D-α-methyltryptophan, D-α-methyltyrosine, D-α-methylvaline, D-N-methylalanine, D-N-methylarginine, D-N-methylasparagine, D-N-methylaspartate, D-N-methylcysteine, D-N-methylglutamine, D-N-methylglutamate, D-N-methylhistidine, D-N-methylisoleucine, D-N-methylleucine, D-N-methyllysine, N-methylcyclohexylalanine, D-N-methylornithine, N-methylglycine, N-methylaminoisobutyrate, N-(1-methylpropyl)glycine, N-(2-methylpropyl)glycine, D-N-methyltryptophan, D-N-methyltyrosine, D-N-methylvaline, γ-aminobutyric acid, L-t-butylglycine, L-ethylglycine, L-homophenylalanine, L-α-methylarginine, L-α-methylaspartate, L-α-methylcysteine, L-α-methylglutamine, L-α-methylhistidine, L-α-methylisoleucine, L-α-methylleucine, L-α-methylmethionine, L-α-methylnorvaline, L-α-methylphenylalanine, L-α-methylserine, L-α-methyltryptophan, L-α-methylvaline, N—(N-(2,2-diphenylethyl) carbamylmethylglycine, 1-carboxy-1-(2,2-diphenyl-ethylamino) cyclopropane, 4-hydroxyproline, ornithine, 2-aminobenzoyl (anthraniloyl), D-cyclohexylalanine, 4-phenyl-phenylalanine, L-citrulline, α-cyclohexylglycine, L-1,2,3,4-tetrahydroisoquinoline-3-carboxylic acid, L-thiazolidine-4-carboxylic acid, L-homotyrosine, L-2-furylalanine, L-histidine (3-methyl), N-(3-guanidinopropyl)glycine, O-methyl-L-tyrosine, O-glycan-serine, meta-tyrosine, nor-tyrosine, L-N,N′,N″-trimethyllysine, homolysine, norlysine, N-glycan asparagine, 7-hydroxy-1,2,3,4-tetrahydro-4-fluorophenylalanine, 4-methylphenylalanine, bis-(2-picolyl)amine, pentafluorophenylalanine, indoline-2-carboxylic acid, 2-aminobenzoic acid, 3-amino-2-naphthoic acid, asymmetric dimethylarginine, L-tetrahydroisoquinoline-1-carboxylic acid, D-tetrahydroisoquinoline-1-carboxylic acid, 1-amino-cyclohexane acetic acid, D/L-allylglycine, 4-aminobenzoic acid, 1-amino-cyclobutane carboxylic acid, 2 or 3 or 4-aminocyclohexane carboxylic acid, 1-amino-1-cyclopentane carboxylic acid, 1-aminoindane-1-carboxylic acid, 4-amino-pyrrolidine-2-carboxylic acid, 2-aminotetraline-2-carboxylic acid, azetidine-3-carboxylic acid, 4-benzyl-pyrolidine-2-carboxylic acid, tert-butylglycine, b-(benzothiazolyl-2-yl)-alanine, b-cyclopropyl alanine, 5,5-dimethyl-1,3-thiazolidine-4-carboxylic acid, (2R,4S)4-hydroxypiperidine-2-carboxylic acid, (2S,4S) and (2S,4R)-4-(2-naphthylmethoxy)-pyrolidine-2-carboxylic acid, (2S,4S) and (2S,4R)4-phenoxy-pyrrolidine-2-carboxylic acid, (2R,5S) and (2S,5R)-5-phenyl-pyrrolidine-2-carboxylic acid, (2S,4S)-4-amino-1-benzoyl-pyrrolidine-2-carboxylic acid, t-butylalanine, (2S,5R)-5-phenyl-pyrrolidine-2-carboxylic acid, 1-aminomethyl-cyclohexane-acetic acid, 3,5-bis-(2-amino)ethoxy-benzoic acid, 3,5-diamino-benzoic acid, 2-methylamino-benzoic acid, N-methylanthranylic acid, L-N-methylalanine, L-N-methylarginine, L-N-methylasparagine, L-N-methylaspartic acid, L-N-methylcysteine, L-N-methylglutamine, L-N-methylglutamic acid, L-N-methylhistidine, L-N-methylisoleucine, L-N-methyllysine, L-N-methylnorleucine, L-N-methylornithine, L-N-methylthreonine, L-N-methyltyrosine, L-N-methylvaline, L-N-methyl-t-butylglycine, L-norvaline, α-methyl-γ-aminobutyrate, 4,4′-biphenylalanine, α-methylcylcopentylalanine, α-methyl-α-napthylalanine, α-methylpenicillamine, N-(4-aminobutyl)glycine, N-(2-aminoethyl)glycine, N-(3-aminopropyl)glycine, N-amino-α-methylbutyrate, α-napthylalanine, N-benzylglycine, N-(2-carbamylethyl)glycine, N-(carbamylmethyl)glycine, N-(2-carboxyethyl)glycine, N-(carboxymethyl)glycine, N-cyclobutylglycine, N-cyclodecylglycine, N-cycloheptylglycine, N-cyclohexylglycine, N-cyclodecylglycine, N-cylcododecylglycine, N-cyclooctylglycine, N-cyclopropylglycine, N-cycloundecylglycine, N-(2,2-diphenylethyl)glycine, N-(3,3-diphenylpropyl)glycine, N-(3-guanidinopropyl)glycine, N-(1-hydroxyethyl)glycine, N-(hydroxyethyl))glycine, N-(imidazolylethyl))glycine, N-(3-indolylyethyl)glycine, N-methyl-γ-aminobutyrate, D-N-methylmethionine, N-methylcyclopentylalanine, D-N-methylphenylalanine, D-N-methylproline, D-N-methylthreonine, N-(1-methylethyl)glycine, N-methyl-napthylalanine, N-methylpenicillamine, N-(p-hydroxyphenyl)glycine, N-(thiomethyl)glycine, penicillamine, L-α-methylalanine, L-α-methylasparagine, L-α-methyl-t-butylglycine, L-methylethylglycine, L-α-methylglutamate, L-α-methylhomophenylalanine, N-(2-methylthioethyl)glycine, L-α-methyllysine, L-α-methylnorleucine, L-α-methylornithine, L-α-methylproline, L-α-methylthreonine, L-α-methyltyrosine, L-N-methylhomophenylalanine, N—(N-(3,3-diphenylpropyl) carbamylmethylglycine, L-pyroglutamic acid, D-pyroglutamic acid, O-methyl-L-serine, O-methyl-L-homoserine, 5-hydroxylysine, α-carboxyglutamate, phenylglycine, L-pipecolic acid (homoproline), L-homoleucine, L-lysine (dimethyl), L-2-naphthylalanine, L-dimethyldopa or L-dimethoxy-phenylalanine, L-3-pyridylalanine, L-histidine (benzoyloxymethyl), N-cycloheptylglycine, L-diphenylalanine, O-methyl-L-homotyrosine, L-β-homolysine, O-glycan-threoine, Ortho-tyrosine, L-N,N′-dimethyllysine, L-homoarginine, neotryptophan, 3-benzothienylalanine, isoquinoline-3-carboxylic acid, diaminopropionic acid, homocysteine, 3,4-dimethoxyphenylalanine, 4-chlorophenylalanine, L-1,2,3,4-tetrahydronorharman-3-carboxylic acid, adamantylalanine, symmetrical dimethylarginine, 3-carboxythiomorpholine, D-1,2,3,4-tetrahydronorharman-3-carboxylic acid, 3-aminobenzoic acid, 3-amino-1-carboxymethyl-pyridin-2-one, 1-amino-1-cyclohexane carboxylic acid, 2-aminocyclopentane carboxylic acid, 1-amino-1-cyclopropane carboxylic acid, 2-aminoindane-2-carboxylic acid, 4-amino-tetrahydrothiopyran-4-carboxylic acid, azetidine-2-carboxylic acid, b-(benzothiazol-2-yl)-alanine, neopentylglycine, 2-carboxymethyl piperidine, b-cyclobutyl alanine, allylglycine, diaminopropionic acid, homo-cyclohexyl alanine, (2S,4R)-4-hydroxypiperidine-2-carboxylic acid, octahydroindole-2-carboxylic acid, (2S,4R) and (2S,4R)-4-(2-naphthyl), pyrrolidine-2-carboxylic acid, nipecotic acid, (2S,4R) and (2S,4S)-4-(4-phenylbenzyl) pyrrolidine-2-carboxylic acid, (3S)-1-pyrrolidine-3-carboxylic acid, (2S,4S)-4-tritylmercapto-pyrrolidine-2-carboxylic acid, (2S,4S)-4-mercaptoproline, t-butylglycine, N,N-bis(3-aminopropyl)glycine, 1-amino-cyclohexane-1-carboxylic acid, N-mercaptoethylglycine, and selenocysteine. In some embodiments, amino acid residues may be charged or polar. Charged amino acids include alanine, lysine, aspartic acid, or glutamic acid, or non-naturally occurring analogs thereof. Polar amino acids include glutamine, asparagine, histidine, serine, threonine, tyrosine, methionine, or tryptophan, or non-naturally occurring analogs thereof. It is specifically contemplated that in some embodiments, a terminal amino group in the amino acid may be an amido group or a carbamate group.


As used herein, the term “percent (%) identity” refers to the percentage of amino acid residues of a candidate sequence, e.g., an Fc-IgG, or fragment thereof, that are identical to the amino acid residues of a reference sequence after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent identity (i.e., gaps can be introduced in one or both of the candidate and reference sequences for optimal alignment and non-homologous sequences can be disregarded for comparison purposes). Alignment for purposes of determining percent identity can be achieved in various ways that are within the skill in the art, for instance, using publicly available computer software such as BLAST, ALIGN, or Megalign (DNASTAR) software. Those skilled in the art can determine appropriate parameters for measuring alignment, including any algorithms needed to achieve maximal alignment over the full length of the sequences being compared. In some embodiments, the percent amino acid sequence identity of a given candidate sequence to, with, or against a given reference sequence (which can alternatively be phrased as a given candidate sequence that has or includes a certain percent amino acid sequence identity to, with, or against a given reference sequence) is calculated as follows:





100×(fraction of A/B)


where A is the number of amino acid residues scored as identical in the alignment of the candidate sequence and the reference sequence, and where B is the total number of amino acid residues in the reference sequence. In some embodiments where the length of the candidate sequence does not equal to the length of the reference sequence, the percent amino acid sequence identity of the candidate sequence to the reference sequence would not equal to the percent amino acid sequence identity of the reference sequence to the candidate sequence.


Two polynucleotide or polypeptide sequences are said to be “identical” if the sequence of nucleotides or amino acids in the two sequences is the same when aligned for maximum correspondence as described above. Comparisons between two sequences are typically performed by comparing the sequences over a comparison window to identify and compare local regions of sequence similarity. A “comparison window” as used herein, refers to a segment of at least about 15 contiguous positions, about 20 contiguous positions, about 25 contiguous positions, or more (e.g., about 30 to about 75 contiguous positions, or about 40 to about 50 contiguous positions), in which a sequence may be compared to a reference sequence of the same number of contiguous positions after the two sequences are optimally aligned.


The term “treating” or “to treat,” as used herein, refers to a therapeutic treatment of a viral infection (e.g., a viral infection such as an HIV infection) in a subject. In some embodiments, a therapeutic treatment may slow the progression of the viral infection, improve the subject's outcome, and/or eliminate the infection. In some embodiments, a therapeutic treatment of a viral infection in a subject may alleviate or ameliorate of one or more symptoms or conditions associated with the viral infection, diminish the extent of the viral, stabilize (i.e., not worsening) the state of the viral infection, prevent the spread of the viral infection, and/or delay or slow the progress of the viral infection, as compare the state and/or the condition of the viral infection in the absence of the therapeutic treatment.


The term “average value of T,” as used herein, refers to the mean number of monomers of gp120 binder or dimers of gp120 binders conjugated to an Fc domain or an albumin protein within a population of conjugates. In some embodiments, within a population of conjugates, the average number of monomers of gp120 binder or dimers of gp120 binders conjugated to an Fc domain monomer may be from 1 to 20 (e.g., the average value of T is 1 to 2, 1 to 3, 1 to 4, 1 to 5, 5 to 10, 10 to 15, or 15 to 20). In some embodiments, the average value of T is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20.


The term “subject,” as used herein, can be a human or non-human primate, or other mammal, such as but not limited to dog, cat, horse, cow, pig, turkey, goat, fish, monkey, chicken, rat, mouse, or sheep.


The term “therapeutically effective amount,” as used herein, refers to an amount, e.g., pharmaceutical dose, effective in inducing a desired effect in a subject or in treating a subject having a condition or disorder described herein (e.g., a viral infection, such as an HIV infection). It is also to be understood herein that a “therapeutically effective amount” may be interpreted as an amount giving a desired therapeutic and/or preventative effect, taken in one or more doses or in any dosage or route, and/or taken alone or in combination with other therapeutic agents (e.g., an antiviral agent described herein). For example, in the context of administering a conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)) that is used for the treatment of a viral infection, an effective amount of a conjugate is, for example, an amount sufficient to prevent, slow down, or reverse the progression of the viral infection as compared to the response obtained without administration of the conjugate.


As used herein, the term “pharmaceutical composition” refers to a medicinal or pharmaceutical formulation that contains at least one active ingredient (e.g., a conjugate of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)) as well as one or more excipients and diluents to enable the active ingredient suitable for the method of administration. The pharmaceutical composition of the present disclosure includes pharmaceutically acceptable components that are compatible with a conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)).


As used herein, the term “pharmaceutically acceptable carrier” refers to an excipient or diluent in a pharmaceutical composition. For example, a pharmaceutically acceptable carrier may be a vehicle capable of suspending or dissolving the active conjugate (e.g., a conjugate of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)). The pharmaceutically acceptable carrier must be compatible with the other ingredients of the formulation and not deleterious to the recipient. In the present disclosure, the pharmaceutically acceptable carrier must provide adequate pharmaceutical stability to a conjugate described herein. The nature of the carrier differs with the mode of administration. For example, for oral administration, a solid carrier is preferred; for intravenous administration, an aqueous solution carrier (e.g., WFI, and/or a buffered solution) is generally used.


The term “pharmaceutically acceptable salt,” as used herein, represents salts of the conjugates described herein (e.g., conjugates of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)) that are, within the scope of sound medical judgment, suitable for use in methods described herein without undue toxicity, irritation, and/or allergic response. Pharmaceutically acceptable salts are well known in the art. For example, pharmaceutically acceptable salts are described in: Pharmaceutical Salts: Properties, Selection, and Use (Eds. P. H. Stahl and C. G. Wermuth), Wiley-VCH, 2008. The salts can be prepared in situ during the final isolation and purification of the conjugates described herein or separately by reacting the free base group with a suitable organic acid.


The term “gp120 binder,” as used herein, refers to a moiety, such as a small molecule (e.g., temsavir, BMS-818251, DMJ-II-121, BNM-IV-147 or analogs thereof) that binds to the HIV gp120 glycoprotein. By blocking the gp120 glycoprotein of the virus, a gp120 binder prevents viral attachment to the host CD4+ T cell and entry into the host immune cell. Gp120 binders of the invention include compounds described by formula (A-I), preferably temsavir, BMS-818251, DMJ-II-121, BNM-IV-147, or an analog thereof.


The term “about,” as used herein, indicates a deviation of ±5%. For example, about 10% refers to from 9.5% to 10.5%.


Any values provided in a range of values include both the upper and lower bounds, and any values contained within the upper and lower bounds.


The term “(1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)”, as used herein, represents the formulas of any one of (1), (2), (D-I), (D-II), (D-III), (D-III-1), (D-III-2), (D-III-3), (D-III-4), (D-III-5), (D-III-6), (D-IV), (D-IV-1), (D-IV-2), (D-IV-3), (D-IV-4), (D-IV-5), (D-IV-6), (D-V), (D-V-1), (D-V-2), (D-V-3), (D-V-4), (D-V-5), (D-V-6), (D-VI), (D-VI-1), (D-VI-2), (D-VI-3), (D-VI-4), (D-VI-5), (D-VI-6), (D-VII), (D-VIII), (D-VIII-1), (D-IX), (D-IX-1), (D-X), (D-X-1), (D-XI), (D-XI-1), (D-XII), (D-XII-1), (D-XII-2), (D-XIII), (D-XIII-1), (D-XIII-2), (D-XIV), (D-XIV-1), (D-XIV-2), (D-XIV-3), (D-XIV-4), (D-XIV-5), (D-XV), (D-XV-1), (D-XV-2), (D-XV-3), (D-XV-4), (D-XV-5), (D-XVI), (D-XVI-1), (D-XVI-2), (D-XVI-3), (D-XVI-4), (D-XVI-5), (D-XV), (D-XVII-1), (D-XVII-2), (D-XVII-3), (D-XVII-4), (D-XVII-5), (M-I), (M-II), (M-III), (M-III-1), (M-III-2), (M-III-3), (M-III-4), (M-III-5), (M-III-6), (M-IV), (M-IV-1), (M-IV-2), (M-IV-3), (M-IV-4), (M-IV-5), (M-IV-6), (M-V), (M-V-1), (M-V-2), (M-V-3), (M-V-4), (M-V-5), (M-V-6), (M-VI), (M-VI-1), (M-VI-2), (M-VI-3), (M-VI-4), (M-VI-5), (M-VI-6), (M-VII), (M-VIII), (M-VIII-1), (M-IX), (M-IX-1), (M-X), (M-X-1), (M-XI), or (M-XI-1), (M-XII), (M-XII-1), (M-XII-2), (M-XIII), (M-XIII-1), (M-XIII-2), (M-XIV), (M-XIV-1), (M-XIV-2), (M-XIV-3), (M-XIV-4), (M-XIV-5), (M-XV), (M-XV-1), (M-XV-2), (M-XV-3), (M-XV-4), (M-XV-5), (M-XVI), (M-XVI-1), (M-XVI-2), (M-XVI-3), (M-XVI-4), (M-XVI-5), (M-XV), (M-XVII-1), (M-XVII-2), (M-XVII-3), (M-XVII-4), (M-XVII-5).


Other features and advantages of the conjugates described herein will be apparent from the following Detailed Description and the claims.





DESCRIPTION OF THE DRAWINGS


FIG. 1 is an image depicting exemplary methods of conjugating a gp120 receptor inhibitor monomer or dimer, e.g., by way of a linker, to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide.



FIG. 2 is an image depicting a method of conjugating a gp120 binder monomer or dimer, e.g., by way of a linker, to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide by oxime conjugation to an amino acid residue, e.g., a nitrogen atom of a surface exposed lysine.



FIG. 3 is an image depicting a method of conjugating a gp120 binder monomer or dimer, e.g., by way of a linker, to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide by thioether conjugation to an amino acid residue, e.g., a nitrogen atom of a surface exposed lysine.



FIG. 4 is an image depicting a method of conjugating a gp120 binder monomer or dimer, e.g., by way of a linker, to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide by rebridged cysteine conjugation, e.g., rebridged cysteine conjugation to a pair of sulfur atoms of two hinge cysteines in an Fc domain monomer or Fc domain.



FIG. 5 shows non-reducing and reducing SDS-PAGE and a schematic illustration of an Fc domain formed from Fc domain monomers having the sequence of SEQ ID NO: 1.



FIG. 6 shows non-reducing and reducing SDS-PAGE and a schematic illustration of an Fc domain formed from Fc domain monomers having the sequence of SEQ ID NO: 3.



FIG. 7 shows non-reducing and reducing SDS-PAGE and a schematic illustration of an Fc domain formed from Fc domain monomers having the sequence of SEQ ID NO: 5.



FIG. 8 shows non-reducing and reducing SDS-PAGE and a schematic illustration of an Fc domain formed from Fc domain monomers having the sequence of SEQ ID NO: 7.



FIG. 9 shows non-reducing and reducing SDS-PAGE and a schematic illustration of an Fc domain formed from Fc domain monomers having the sequence of SEQ ID NO: 9.



FIG. 10 shows non-reducing and reducing SDS-PAGE and a schematic illustration of an Fc domain formed from Fc domain monomers having the sequence of SEQ ID NO: 12.



FIG. 11 shows non-reducing and reducing SDS-PAGE and a schematic illustration of an Fc domain formed from Fc domain monomers having the sequence of SEQ ID NO: 14.



FIG. 12 is a graph showing the binding of conjugates containing gp120 binders to the gp120 protein compared to a polyclonal goat anti-gp120 HRP (PA1-73097, Invitrogen) positive control and an unconjugated Fc molecule negative control.



FIG. 13 is a graph showing plasma levels of a conjugate including an Fc domain having a C220S mutation (SEQ ID NO: 64) (2 mpk IV) compared to a conjugate including an Fc domain having a C220S mutation and a YTE triple mutation (SEQ ID NO: 67) (2 mpk IV) in non-human primate PK studies determined by Fc capture. This study was performed as described in Example 35.



FIG. 14 is an image depicting exemplary conjugates including a gp120 binder monomer or dimer and an Fc domain monomer or an Fc domain. “T” is representative of the drug-to-antibody ratio (DAR) and depicts that multiple monomers or dimers can be conjugated to an Fc domain monomer or an Fc domain.





DETAILED DESCRIPTION

The disclosure features conjugates, compositions, and methods for the treatment of viral infections (e.g., human immunodeficiency viral infections). The conjugates disclosed herein include monomers or dimers of viral gp120 binders (e.g., temsavir, BMS-818251, DMJ-II-121, BNM-IV-147, or analogs thereof) conjugated to Fc monomers, Fc domains, Fc-binding peptides, albumin proteins, or albumin protein-binding peptides. The gp120 binder (e.g., temsavir, BMS-818251, DMJ-II-121, BNM-IV-147, or analogs thereof) in the conjugates targets the gp120 receptor on the surface of the viral particle. The Fc monomers or Fc domains in the conjugates bind to FcγRs (e.g., FcRn, FcγRI, FcγRIIa, FcγRIIc, FcγRIIIa, and FcγRIIIb) on immune cells, e.g., neutrophils, to activate phagocytosis and effector functions, such as antibody-dependent cell-mediated cytotoxicity (ADCC), thus leading to the engulfment and destruction of viral particles by immune cells and further enhancing the antiviral activity of the conjugates. The albumin or albumin-binding peptide may extend the half-life of the conjugate, for example, by binding of albumin to the recycling neonatal Fc receptor. Such compositions are useful in methods for the inhibition of viral growth and in methods for the treatment of viral infections, such as those caused by an HIV-1 or HIV-2.


I. Viral Infections

The compounds and pharmaceutical compositions described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)) can be used to treat a viral infection (e.g., an HIV-1 or HIV-2 viral infection).


Viral infection refers to the pathogenic growth of a virus (e.g., the human immunodeficiency virus) in a host organism (e.g., a human subject). A viral infection can be any situation in which the presence of a viral population(s) is damaging to a host body. Thus, a subject is suffering from a viral infection when an excessive amount of a viral population is present in or on the subject's body, or when the presence of a viral population(s) is damaging the cells or other tissue of the subject.


The human immunodeficiency viruses (HIV) are two species of Lentivirus (a subgroup of retrovirus) that causes HIV infection and overtime acquired immunodeficiency syndrome (AIDS). AIDS is a condition in humans in which progressive failure of the immune system allows life-threatening opportunistic infections and cancers to thrive. Without treatment, average survival time after infection with HIV is estimated to be 9 to 11 years, depending on the HIV subtype. In most cases, HIV is a sexually transmitted infection and occurs by contact with or transfer of blood, pre-ejaculate, semen, and vaginal fluids.


Two types of HIV have been characterized: HIV-1 and HIV-2. HIV infects vital cells in the human immune system, such as helper T cells (specifically CD4+ T cells), macrophages, and dendritic cells. HIV infection leads to low levels of CD4+ T cells through a number of mechanisms, including pyroptosis of abortively infected T cells, apoptosis of uninfected bystander cells, direct viral killing of infected cells, and killing of infected CD4+ T cells by CD8+cytotoxic lymphocytes that recognize infected cells. When CD4+ T cell numbers decline below a critical level, cell-mediated immunity is lost, and the body becomes progressively more susceptible to opportunistic infections, leading to the development of AIDS.


II. Conjugates of the Disclosure

Provided herein are synthetic conjugates useful in the treatment of viral infections (e.g., HIV infections). The conjugates disclosed herein include an Fc domain monomer, an Fc domain, or an albumin protein conjugated to one or more monomers gp120 binders or one or more dimers of two gp120 binders (e.g., gp120 binders selected from temsavir, BMS-818251, DMJ-II-121, BNM-IV-147, or analogs thereof). The dimers of two gp120 binders include a gp120 binder (e.g., a first gp120 binder of formula (A-I) or (A-II)) and a second gp120 binder (e.g., a second gp120 binder of formula (A-I) or (A-II)). The first and second gp120 binders are linked to each other by way of a linker.


Without being bound by theory, in some aspects, conjugates described herein bind to the surface of a viral particle (e.g., bind to viral gp120 receptor on the surface on an human immunodeficiency virus particle) through the interactions between the gp120 binder moieties in the conjugates and proteins on the surface of the viral particle. The gp120 binder disrupts gp120, an envelope glycoprotein that binds with the CD4 receptor, particularly on helper T cells. Binding to CD4 initiates a cascade of conformational changes in gp120 and gp41 that lead to the fusion of the viral membrane with the host cell membrane, allowing the spread of the virus.


Conjugates of the invention include gp120 binder monomers and dimers conjugated to an Fc domain, Fc monomer, or Fc-binding peptide. The Fc domain in the conjugates described herein binds to the FcγRs (e.g., FcRn, FcγRI, FcγRIIa, FcγRIIc, FcγRIIIa, and FcγRIIIb) on immune cells. The binding of the Fc domain in the conjugates described herein to the FcγRs on immune cells activates phagocytosis and effector functions, such as antibody-dependent cell-mediated cytotoxicity (ADCC), thus leading to the engulfment and destruction of viral particles by immune cells and further enhancing the antiviral activity of the conjugates.


Conjugates of the invention include gp120 binder monomers and dimers conjugated to an albumin protein or an albumin protein-binding peptide. The albumin protein or albumin protein-binding peptide may extend the half-life of the conjugate, for example, by binding of albumin to the recycling neonatal Fc receptor.


Conjugates provided herein are described by any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII). In some embodiments, the conjugates described herein include one or more monomers of gp120 binders conjugated to an Fc domain or an albumin protein. In some embodiments, the conjugates described herein include one or more dimers of gp120 binders conjugated to an Fc domain monomer, an Fc domain, or an albumin protein. In some embodiments, when n is 2, E (an Fc domain monomer) dimerizes to form an Fc domain.


Conjugates described herein may be synthesized using available chemical synthesis techniques in the art. In cases where a functional group is not available for conjugation, a molecule may be derivatized using conventional chemical synthesis techniques that are well known in the art. In some embodiments, the conjugates described herein contain one or more chiral centers. The conjugates include each of the isolated stereoisomeric forms as well as mixtures of stereoisomers in varying degrees of chiral purity, including racemic mixtures. It also encompasses the various diastereomers, enantiomers, and tautomers that can be formed.


Gp120 Binders

A component of the conjugates described herein is an HIV gp120 binder moiety. The gp120 binder disrupts gp120, an envelope glycoprotein that binds with the CD4 receptor, particularly on helper T-Cells. Binding to CD4 initiates a cascade of conformational changes in gp120 and gp41 that lead to the fusion of the viral membrane with the host cell membrane, allowing the spread of the virus. Examples of gp120 binders include temsavir, BMS-818251, DMJ-II-121, and BNM-IV-147. In addition, derivatives of temsavir, BMS-818251, DMJ-II-121, and BNM-IV-147, such as those found in the literature, have gp120 binder activity and are useful as gp120 inhibitor moieties of the compounds herein (see, for example, Lu et al. Curr. Top. Med. Chem. 16(10): 1074-1090).


Conjugates described herein are separated into two types: (1) one or more dimers of gp120 binders conjugated to an Fc domain monomer, an Fc domain, or an albumin protein and (2) one or more monomers of gp120 binders conjugated to an Fc domain monomer, an Fc domain, or an albumin protein.


The dimers of gp120 binders are linked to each other by way of a linker, such as the linkers described herein.


Viral gp120 binders of the invention include temsavir, BMS-818251, DMJ-II-121, BNM-IV-147, and analogs thereof, such as the viral gp120 binders of formula (A-I) and (A-II):




embedded image


wherein Q is selected from the group consisting of:




embedded image


S is selected from the group consisting of:




embedded image


R1, R2, R3, are each independently selected from H, OH, halogen, nitrile, nitro, optionally substituted amine, optionally substituted sulfhydryl, optionally substituted carboxyl, optionally substituted C1-C20 alkyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C3-C20 cycloalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C5-C20 aryl, optionally substituted C3-C15 heteroaryl, and optionally substituted C1-C20 alkoxy;


R4 is selected from optionally substituted C1-C20 alkyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, optionally substituted C3-C15 heteroaryl, and a bond;


R5 is selected from H or optionally substituted C1-C6 alkyl;


R6 is selected from optionally substituted C1-C20 alkyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and optionally substituted C3-C15 heteroaryl;


R7 and Y are each independently selected from




embedded image


embedded image


embedded image


each R is independently selected from H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 alkylene, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and optionally substituted C2-C15 heteroaryl;


each R9 is independently selected from optionally substituted C1-C20 alkylene, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and optionally substituted C2-C15 heteroaryl;


x is 1 or 2;


k is 0, 1, 2, 3, 4, or 5;


Ar is selected from the group consisting of optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and optionally substituted C3-C15 heteroaryl. In a preferred embodiment of the above, x is 2.


Preferably the gp120 inhibitor is selected from temsavir, BMS-818251, DMJ-II-121, or BNM-IV-147:




embedded image


Conjugates of Dimers of Gp120 Binders Linked to an Fc Domain or an Albumin Protein

The conjugates described herein include an Fc domain monomer, an Fc domain, an Fc-binding peptide, and albumin protein, or an albumin protein-binding peptide covalently linked to one or more dimers of gp120 binders. The dimers of two gp120 binders include a first gp120 binder (e.g., a first viral gp120 binder of formula (A-I) or (A-II)) and a second gp120 binder (e.g., a second viral gp120 binder of formula (A-I) or (A-II)). The first and second gp120 binders are linked to each other by way of a linker, such as a linker described herein. In some embodiments of the dimers of gp120 binders, the first and second gp120 binders are the same. In some embodiments, the first and second gp120 binders are different.


In some embodiments, when T is greater than 1 (e.g., T is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), each A1-L-A2 may be independently selected (e.g., independently selected from any of the A1-L-A2 structures described herein). In some embodiments, E may be conjugated to 2, 3, 4, 5, 6, 7, 8, 9, 10, or more different A1-L-A2 moieties. In some embodiments, E is conjugated to a first A1-L-A2 moiety, and a second A1-L-A2, moiety. In some embodiments, each of A1 and A2 of the first A1-L-A2 moiety and of the second A1-L-A2 moiety are independently selected from any structure described by formula (A-I) and (A-II):




embedded image


In a preferred embodiment of the above, x is 2.


In some embodiments, the first A1-L-A2 moiety is conjugated specifically to lysine residues of E (e.g., the nitrogen atoms of surface exposed lysine residues of E), and the second A1-L-A2 moiety is conjugated specifically to cysteine residues of E (e.g., the sulfur atoms of surface exposed cysteine residues of E). In some embodiments, the first A1-L-A2 moiety is conjugated specifically to cysteine residues of E (e.g., the sulfur atoms of surface exposed cysteine residues of E), and the second A1-L-A2 moiety is conjugated specifically to lysine residues of E (e.g., the nitrogen atoms of surface exposed lysine residues of E).


In some embodiments, the disclosure provides a conjugate, or a pharmaceutically acceptable salt thereof, described by the formulae below:




embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


or a pharmaceutically acceptable salt thereof.


In the conjugates described herein, the squiggly line connected to E indicates that one or more (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20) dimers of gp120 binders may be attached to an Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide. In some embodiments, when n is 1, one or more (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10) dimers of gp120 binders may be attached to an Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide. In some embodiments, when n is 2, one or more (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20) dimers of gp120 binders may be attached to an Fc domain. The squiggly line in the conjugates described herein is not to be construed as a single bond between one or more dimers of gp120 binders and an atom in the Fc domain monomer, Fc domain, or albumin protein. In some embodiments, when T is 1, one dimer of gp120 binders may be attached to an atom in the Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide. In some embodiments, when T is 2, two dimers of gp120 binders may be attached to an atom in the Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide.


As described further herein, a linker in a conjugate described herein (e.g., L or L′) may be a branched structure. As described further herein, a linker in a conjugate described herein (e.g., L or L′) may be a multivalent structure, e.g., a divalent or trivalent structure having two or three arms, respectively. In some embodiments when the linker has three arms, two of the arms may be attached to the first and second gp120 binders and the third arm may be attached to the Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide.


In conjugates having an Fc domain covalently linked to one or more dimers of gp120 binders, as represented by the formulae above, when n is 2, two Fc domain monomers (each Fc domain monomer is represented by E) dimerize to form an Fc domain.


Conjugates of Monomers of Gp120 Binders Linked to an Fc Domain Monomer, an Fc Domain, or an Albumin Protein

In some embodiments, the conjugates described herein include an Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide covalently linked to one or more monomers of gp120 binders. Conjugates of an Fc domain monomer or albumin protein and one or more monomers of gp120 binders may be formed by linking the Fc domain monomer, Fc domain, or albumin protein to each of the monomers of gp120 binders through a linker, such as any of the linkers described herein.


In the conjugates having an Fc domain or albumin protein covalently linked to one or more monomers of gp120 binders described herein, the squiggly line connected to E indicates that one or more (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20) monomers of gp120 binders may be attached to an Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide. In some embodiments, when n is 1, one or more (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10) monomers of gp120 binders may be attached to an Fc domain monomer, Fc domain, or an albumin protein. In some embodiments, when n is 2, one or more (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20) monomers of gp120 binders may be attached to an Fc domain. The squiggly line in the conjugates described herein is not to be construed as a single bond between one or more monomers of gp120 binders and an atom in the Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide. In some embodiments, when T is 1, one monomer of gp120 binder may be attached to an atom in the Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide. In some embodiments, when T is 2, two monomers of gp120 binders may be attached to an atom in the Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide.


In some embodiments, when T is greater than 1 (e.g., T is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), each A1-L may be independently selected (e.g., independently selected from any of the A1-L structures described herein). In some embodiments, E may be conjugated to 2, 3, 4, 5, 6, 7, 8, 9, 10, or more different A1-L moieties. In some embodiments, E is conjugated to a first A1-L moiety, and a second A1-L, moiety. In some embodiments, A1 of each of the first A1-L moiety and of the second A1-L moiety is independently selected from any structure described by formula (A-I) or (A-II):




embedded image


In a preferred embodiment, x is 2.


In some embodiments, the first A1-L moiety is conjugated specifically to lysine residues of E (e.g., the nitrogen atoms of surface exposed lysine residues of E), and the second A1-L moiety is conjugated specifically to cysteine residues of E (e.g., the sulfur atoms of surface exposed cysteine residues of E). In some embodiments, the first A1-L moiety is conjugated specifically to cysteine residues of E (e.g., the sulfur atoms of surface exposed cysteine residues of E), and the second A1-L moiety is conjugated specifically to lysine residues of E (e.g., the nitrogen atoms of surface exposed lysine residues of E).


As described further herein, a linker in a conjugate having an Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide covalently linked to one or more monomers of the gp120 binders described herein (e.g., L or L′) may be a divalent structure having two arms. One arm in a divalent linker may be attached to the monomer of the gp120 binder and the other arm may be attached to the Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide.


In some embodiments, a conjugate containing an Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide covalently linked to one or more monomers of gp120 binders provided herein is described by any one of formulae below:




embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


or a pharmaceutically acceptable salt thereof.


In conjugates having an Fc domain covalently linked to one or more monomers of gp120 binders, as represented by the formulae above, when n is 2, two Fc domain monomers (each Fc domain monomer is represented by E) dimerize to form an Fc domain.


III. Fc Domain Monomers and Fc Domains

An Fc domain monomer includes a hinge domain, a CH2 antibody constant domain, and a CH3 antibody constant domain. The Fc domain monomer can be of immunoglobulin antibody isotype IgG, IgE, IgM, IgA, or IgD. The Fc domain monomer can also be of any immunoglobulin antibody isotype (e.g., IgG1, IgG2a, IgG2b, IgG3, or IgG4). The Fc domain monomer can be of any immunoglobulin antibody allotype (e.g., IGHG1*01 (i.e., G1m(za)), IGHG1*07 (i.e., G1m(zax)), IGHG1*04 (i.e., G1m(zav)), IGHG1*03 (G1m(f)), IGHG1*08 (i.e., G1m(fa)), IGHG2*01, IGHG2*06, IGHG2*02, IGHG3*01, IGHG3*05, IGHG3*10, IGHG3*04, IGHG3*09, IGHG3*11, IGHG3*12, IGHG3*06, IGHG3*07, IGHG3*08, IGHG3*13, IGHG3*03, IGHG3*14, IGHG3*15, IGHG3*16, IGHG3*17, IGHG3*18, IGHG3*19, IGHG2*04, IGHG4*01, IGHG4*03, or IGHG4*02) (as described in, for example, in Vidarsson et al. IgG subclasses and allotypes: from structure to effector function. Frontiers in Immunology. 5(520):1-17 (2014)). The Fc domain monomer can also be of any species, e.g., human, murine, or mouse. A dimer of Fc domain monomers is an Fc domain that can bind to an Fc receptor, which is a receptor located on the surface of leukocytes.


In some embodiments, an Fc domain monomer in the conjugates described herein may contain one or more amino acid substitutions, additions, and/or deletion relative to an Fc domain monomer having a sequence of any one of SEQ ID NOs: 1-95. In some embodiments, an Asn in an Fc domain monomer in the conjugates as described herein may be replaced by Ala in order to prevent N-linked glycosylation (see, e.g., SEQ ID NOs: 12-15, where Asn to Ala substitution is labeled with *). In some embodiments, an Fc domain monomer in the conjugates described herein may also containing additional Cys additions (see, e.g., SEQ ID NOs: 9, 10, and 11, where Cys additions are labeled with *).


In some embodiments, an Fc domain monomer in the conjugates as described herein includes an additional moiety, e.g., an albumin-binding peptide, a purification peptide (e.g., a hexa-histidine peptide (HHHHHH (SEQ ID NO: 99)), or a signal sequence (e.g., IL2 signal sequence MYRMQLLSCIALSLALVTNS (SEQ ID NO: 100)) attached to the N- or C-terminus of the Fc domain monomer. In some embodiments, an Fc domain monomer in the conjugate does not contain any type of antibody variable region, e.g., VH, VL, a complementarity determining region (CDR), or a hypervariable region (HVR).


In some embodiments, an Fc domain monomer in the conjugates as described herein may have a sequence that is at least 95% identical (e.g., 97%, 99%, or 99.5% identical) to the sequence of any one of SEQ ID NOs: 1-95 shown below. In some embodiments, an Fc domain monomer in the conjugates as described herein may have a sequence of any one of SEQ ID NOs: 1-95 shown below.










SEQ ID NO: 1: murine Fc-IgG2a with IL2 signal sequence at the N-terminus



(bold)



MYRMQLLSCIALSLALVTNSPRGPTIKPCPPCKCPAPNLLGGPSVFIFPPKIKDVLMISLSPIVTCVVVDVS






EDDPDVQISWFVNNVEVHTAQTQTHREDYNSTLRVVSALPIQHQDWMSGKEFKCKVNNKDLPAPIERTI





SKPKGSVRAPQVYVLPPPEEEMTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGSYF





MYSKLRVEKKNWVERNSYSCSVVHEGLHNHHTTKSFSRTPGK





SEQ ID NO: 2: mature murine Fc-IgG2a


PRGPTIKPCPPCKCPAPNLLGGPSVFIFPPKIKDVLMISLSPIVTCVVVDVSEDDPDVQISWFVNNVEVHTA





QTQTHREDYNSTLRVVSALPIQHQDWMSGKEFKCKVNNKDLPAPIERTISKPKGSVRAPQVYVLPPPEEE





MTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGSYFMYSKLRVEKKNWVERNSYSC





SVVHEGLHNHHTTKSFSRTPGK





SEQ ID NO: 3: human Fc-IgG1 with IL2 signal sequence at the N-terminus


(bold) and N-terminal MVRS amino acid residues added (underlined)



MYRMQLLSCIALSLALVTNS
MVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDV






SHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKT





ISKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF





FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK





SEQ ID NO: 4: mature human Fc-IgG1 with N-terminal MVRS amino acid


residues added (underlined)



MVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNVVYVDGVEVH






NAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSR





EEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVF





SCSVMHEALHNHYTQKSLSLSPGK





SEQ ID NO: 5: murine Fc-IgG2a with IL2 signal sequence (bold) at the


N-terminus and hexa-histidine peptide (italicized) at the C-terminus



MYRMQLLSCIALSLALVTNSPRGPTIKPCPPCKCPAPNLLGGPSVFIFPPKIKDVLMISLSPIVTCVVVDVS






EDDPDVQISWFVNNVEVHTAQTQTHREDYNSTLRVVSALPIQHQDWMSGKEFKCKVNNKDLPAPIERTI





SKPKGSVRAPQVYVLPPPEEEMTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGSYF





MYSKLRVEKKNWVERNSYSCSVVHEGLHNHHTTKSFSRTPGKHHHHHH





SEQ ID NO: 6: mature murine Fc-IgG2a with hexa-histidine peptide


(italicized) at the C-terminus


PRGPTIKPCPPCKCPAPNLLGGPSVFIFPPKIKDVLMISLSPIVTCVVVDVSEDDPDVQ1SWFVNNVEVHTA





QTQTHREDYNSTLRVVSALPIQHQDWMSGKEFKCKVNNKDLPAPIERTISKPKGSVRAPQVYVLPPPEEE





MTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGSYFMYSKLRVEKKNWVERNSYSC





SVVHEGLHNHHTTKSFSRTPGKHHHHHH





SEQ ID NO: 7: human Fc-IgG1 with IL2 signal sequence (bold) at the


N-terminus, N-terminal MVRS amino acid residues added (underlined),


and hexa-histidine peptide (italicized) at the C-terminus



MYRMQLLSCIALSLALVTNS
MVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDV






SHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKT





ISKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF





FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGKHHHHHH





SEQ ID NO: 8: mature human Fc-IgG1 with hexa-histidine peptide


(italicized) at the C-terminus and N-terminal MVRS amino acid residues


added (underlined)



MVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNVVYVDGVEVH






NAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSR





EEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVF





SCSVMHEALHNHYTQKSLSLSPGKHHHHHH





SEQ ID NO: 9: human Fc-IgG1 with IL2 signal sequence (bold) at the


N-terminus, N-terminal MVRS amino acid residues added (underlined),


two additional cysteines in the hinge region (*), and hexa-histidine


peptide (italicized) at the C-terminus



MYRMQLLSCIALSLALVTNS
MVRSDKTHTCPPCPPC*KC*PAPELLGGPSVFLFPPKPKDTLMISRTPEVT






CVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKAL





PAPIEKTISKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVL





DSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGKHHHHHH





SEQ ID NO: 10: mature human Fc-IgG1 with N-terminal MVRS amino acid


residues added (underlined), two additional cysteines in the hinge


region (*), and hexa-histidine peptide (italicized) at the C-terminus



MVRSDKTHTCPPCPPC*KC*PAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV






DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQV





YTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRW





QQGNVFSCSVMHEALHNHYTQKSLSLSPGKHHHHHH





SEQ ID NO: 11: mature human Fc-IgG1 with N-terminal MVRS amino


acid residues added (underlined) and two additional cysteines in the


hinge region (*)



MVRSDKTHTCPPCPPC*KC*PAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNVVYV






DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQV





YTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRW





QQGNVFSCSVMHEALHNHYTQKSLSLSPGK





SEQ ID NO: 12: murine Fc-IgG2a with IL2 signal sequence (bold) at the


N-terminus, Asn to Ala substitution (*), and hexa-histidine peptide


(italicized) at the C-terminus



MYRMQLLSCIALSLALVTNSPRGPTIKPCPPCKCPAPNLLGGPSVFIFPPKIKDVLMISLSPIVTCVVVDVS






EDDPDVQISWFVNNVEVHTAQTQTHREDYA*STLRVVSALPIQHQDWMSGKEFKCKVNNKDLPAPIERTI





SKPKGSVRAPQVYVLPPPEEEMTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGSYF





MYSKLRVEKKNWVERNSYSCSVVHEGLHNHHTTKSFSRTPGKHHHHHH





SEQ ID NO: 13: mature murine Fc-IgG2a with Asn to Ala substitution


(*) and hexa-histidine peptide (italicized) at the C-terminus


PRGPTIKPCPPCKCPAPNLLGGPSVFIFPPKIKDVLMISLSPIVTCVVVDVSEDDPDVQ1SWFVNNVEVHTA





QTQTHREDYA*STLRVVSALPIQHQDWMSGKEFKCKVNNKDLPAPIERTISKPKGSVRAPQVYVLPPPEE





EMTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGSYFMYSKLRVEKKNWVERNSYS





CSVVHEGLHNHHTTKSFSRTPGKHHHHHH





SEQ ID NO: 14: human Fc-IgG1 with IL2 signal sequence (bold) at the


N-terminus, N-terminal MVRS amino acid residues added (underlined),


Asn to Ala substitution (*), and hexa-histidine peptide (italicized)


at the C-terminus



MYRMQLLSCIALSLALVTNS
MVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDV






SHEDPEVKFNWYVDGVEVHNAKTKPREEQYA*STYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEK





TISKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS





FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGKHHHHHH





SEQ ID NO: 15: mature human Fc-IgG1 with Asn to Ala substitution (*),


N-terminal MVRS amino acid residues added (underlined), and hexa-


histidine peptide (italicized) at the C-terminus



MVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNVVYVDGVEVH






NAKTKPREEQYA*STYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPS





REEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNV





FSCSVMHEALHNHYTQKSLSLSPGKHHHHHH





SEQ ID NO: 16: human IgG1 Fc with Human Serum Albumin Signal Sequence


(bold) at the N-terminus and N-terminal ISAMVRS amino acid residues


added (underlined)



MKWVTFISLLFLFSSAYS
ISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDV






SHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKT





ISKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF





FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK





SEQ ID NO: 17: human IgG1 Fc with Human Serum Albumin Signal Sequence


(bold) at the N-terminus, N-terminal ISAMVRS amino acid residues added


(underlined), C-terminal G4S linker (italicized), and C-terminal c-Myc


tag (underlined, italicized)



MKWVTFISLLFLFSSAYS
ISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDV






SHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKT





ISKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF





FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGGGGGSEQKLISEEDL





SEQ ID NO: 18: mature human IgG1 Fc with N-terminal ISAMVRS amino acid


residues added (underlined), C-terminal G4S linker (italicized), and


C-terminal c-Myc tag (underlined, italicized)



ISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNVVYVDGVE






VHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPP





SREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGN





VFSCSVMHEALHNHYTQKSLSLSPGGGGGSEQKLISEEDL





SEQ ID NO: 19: human IgG1 Fc with Human Serum Albumin Signal Sequence


(bold), N-terminal ISAMVRS amino acid residues added (underlined),


and lysine to serine modification (*) to prevent lysine conjugation


at this site



MKWVTFISLLFLFSSAYS
ISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPS*DTLMISRTPEVTCVVVD






VSHEDPEVKFNVVYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEK





TISKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS





FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK





SEQ ID NO: 20: mature human IgG1 Fc with N-terminal ISAMVRS amino


acid residues added (underlined) and lysine to serine modification (*)


to prevent lysine conjugation at this site



ISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPS*DTLMISRTPEVTCVVVDVSHEDPEVKFNVVYVDGV






EVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLP





PSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQG





NVFSCSVMHEALHNHYTQKSLSLSPGK





SEQ ID NO: 21: human IgG1 Fc with Human Serum Albumin Signal Sequence


(bold) at the N-terminus, N-terminal ISAMVRS amino acid residues added


(underlined), lysine to serine modification (*) to prevent lysine


conjugation at this site, C-terminal G4S linker (italicized), and


C-terminal C-Myc tag (underlined, italicized)



MKWVTFISLLFLFSSAYS
ISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPS(*)DTLMISRTPEVTCVVV






DVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPI





EKTISKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSD





GSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGGGGGSEQKLISEEDL





SEQ ID NO: 22: mature human IgG1 Fc with N-terminal ISAMVRS amino


acid residues added (underlined), lysine to serine modification (*)


to prevent lysine conjugation at this site, C-terminal G4S linker


(italicized), and C-terminal C-Myc tag (underlined, italicized)



ISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPS(*)DTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDG






VEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTL





PPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQ





GNVFSCSVMHEALHNHYTQKSLSLSPGGGGGSEQKLISEEDL





SEQ ID NO: 23: human IgG1 Fc with Human Serum Albumin Signal Sequence


(bold) at the N-terminus, N-terminal ISAMVRS amino acid residues added


(underlined), Asn to Ala substitution (*), C-terminal G4S linker


(italicized), and C-terminal C-myc tag (underlined, italicized)



MKWVTFISLLFLFSSAYS
ISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDV






SHEDPEVKFNWYVDGVEVHNAKTKPREEQYA(*)STYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIE





KTISKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDG





SFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGGGGGSEQKLISEEDL





SEQ ID NO: 24: mature human IgG1 Fc with N-terminal ISAMVRS amino acid


residues added (underlined), Asn to Ala substitution (*), C-terminal


G4S linker (italicized), and C-terminal C-myc tag (underlined, italicized)



ISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNVVYVDGVE






VHNAKTKPREEQYA(*)STYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTL





PPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQ





GNVFSCSVMHEALHNHYTQKSLSLSPGGGGGSEQKLISEEDL





SEQ ID NO: 25: human IgG1 Fc with Human Serum Albumin Signal Sequence


(bold) at the N-terminus, N-terminal ISAMVRS amino acid residues added


(underlined), H310A (*) and H435A (*) mutations to impede FcRn binding,


C-terminal G4S (italicized), and C-terminal C-myc tag (underlined,


italicized)



MKWVTFISLLFLFSSAYS
ISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDV






SHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLA(*)QDWLNGKEYKCKVSNKALPAPIE





KTISKA(*)KGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSD





GSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNAYTQKSLSLSPGGGGGSEQKLISEEDL





SEQ ID NO: 26: mature human IgG1 Fc with Human Serum Albumin Signal


Sequence (bold) at the N-terminus, N-terminal ISAMVRS amino acid residues


added (underlined), with H310A (*) and H435A (*) mutations to impede


FcRn binding, C-terminal G4S (italicized), and C-terminal C-myc tag


(underlined, italicized)



ISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNVVYVDGVE






VHNAKTKPREEQYNSTYRVVSVLTVLA(*)QDWLNGKEYKCKVSNKALPAPIEKTISKA(*)KGQPREPQVY





TLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQ





QGNVFSCSVMHEALHNAYTQKSLSLSPGGGGGSEQKLISEEDL





SEQ ID NO: 27: human IgG1 Fc with Human Serum Albumin Signal Sequence


(bold) at the N-terminus, N-terminal ISAMVRS amino acid residues added


(underlined), C-terminal G4S linker (italicized), and C-terminal


mutated (lysine to phenylalanine, bold) C-myc tag (underlined, italicized)



MKWVTFISLLFLFSSAYS
ISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDV






SHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKT





ISKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF





FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGGGGGScustom-character





SEQ ID NO: 28: mature human IgG1 Fc with N-terminal ISAMVRS amino acid


residues added (underlined), C-terminal G4S linker (italicized), and


C-terminal mutated (lysine to phenylalanine, bold) C-myc tag (underlined,


italicized)



ISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNVVYVDGVE






VHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPP





SREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGN





VFSCSVMHEALHNHYTQKSLSLSPGGGGGScustom-character





SEQ ID NO: 29: human IgG1 Fc with Human Serum Albumin Signal Sequence


(bold) at the N-terminus, N-terminal ISAMVRS amino acid residues


added (underlined), Asn to Ala substitution (*), C-terminal G4S


linker (italicized), and C-terminal mutated (lysine to phenylalanine,


bold) C-myc tag (underlined, italicized)



MKWVTFISLLFLFSSAYS
ISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDV






SHEDPEVKFNWYVDGVEVHNAKTKPREEQYA(*)STYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIE





KTISKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDG





SFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGGGGGSEQFLISEEDL





SEQ ID NO: 30: mature human IgG1 Fc with N-terminal MVRS amino acid


residues added (underlined), Asn to Ala substitution (*), C-terminal


G4S linker (italicized), and C-terminal mutated (lysine to


phenylalanine, bold) C-myc tag (underlined, italicized)



ISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNVVYVDGVE






VHNAKTKPREEQYA(*)STYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTL





PPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQ





GNVFSCSVMHEALHNHYTQKSLSLSPGGGGGScustom-character





SEQ ID NO: 31: human IgG1 Fc with Human Serum Albumin Signal Sequence


(bold) at the N-terminus, allotype G1m(fa) (bold italics), C-terminal


G4S linker (italicized), and C-terminal mutated (lysine to


phenylalanine, bold) C-myc tag (underlined)



MKWVTFISLLFLFSSAYSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPE






VKFNVVYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKG





QPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLT





VDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGGGGGSEQFLISEEDL





SEQ ID NO: 32: human IgG1 Fc with Human Serum Albumin Signal Sequence


(bold) at the N-terminus, allotype G1m(fa) (bold italics)



MKWVTFISLLFLFSSAYSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPE






VKFNVVYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKG





QPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLT





VDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK





SEQ ID NO: 33: mature human IgG1 Fc with a YTE triple mutation (bold and


underlined) with N-terminal MVRS amino acid residues added (underlined)



MVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFNVVYVDGVEVH






NAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSR





EEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVF





SCSVMHEALHNHYTQKSLSLSPGK





SEQ ID NO: 34: human IgG1 Fc with Human Serum Albumin Signal Sequence


(bold) at the N-terminus, contains residues EPKSS comprising the full


hinge region on the N-terminus of mature human IgG1 Fc (underlined),


Cys to Ser substitution (#), allotype G1m(fa) (bold italics)



MKWVTFISLLFLFSSAYS
EPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDV






SHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKT





ISKAKGQPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF





LYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK





SEQ ID NO: 35: human IgG1 Fc with murine IgG signal sequence (bold)


at the N-terminus, with removal of EPKSSD hinge residues from the


N-terminus of the mature human IgG1 Fc, allotype G1m(fa) (bold italics)



MGWSCIILFLVATATGVHSKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPE






VKFNVVYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKG





QPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLT





VDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK





SEQ ID NO: 36: mature human IgG1 Fc with a YTE triple mutation (bold


and underlined), with removal of EPKSSD hinge residues from the


N-terminus of the mature human IgG1 Fc, allotype G1m(fa) (bold italics)


KTHTCPPCPAPELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKP





REEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRcustom-character E






custom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCS






VMHEALHNHYTQKSLSLSPGK





SEQ ID NO: 37: mature human IgG1 Fc with an LS double mutation (bold


and underlined), with removal of EPKSSD hinge residues from the


N-terminus of the mature human IgG1 Fc, allotype G1m(fa) (bold


italics)


KTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTK





PREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRcustom-character E






custom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCS






VLHEALHSHYTQKSLSLSPGK





SEQ ID NO: 38: mature human IgG1 Fc with Human Serum Albumin Signal


Sequence (bold) at the N-terminus, a YTE triple mutation (bold and


underlined), allotype G1m(fa) (bold italics), C-terminal G4S


linker (italicized), and C-terminal C-myc tag (underlined)



MKWVTFISLLFLFSSAYSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEV






KFNVVYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQ





PREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTV





DKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGGGGGSEQKLISEEDL





SEQ ID NO: 39: mature human Fc IgG1, wherein X1 is Met or Trp, X2 is


Ser or Thr, X3 is Thr or Glu, X4 is Asp or Glu, and X5 is Leu or Met,


X6 is Met or Leu, and X7 is Asn or Ser


DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLX1IX2RX3PEVTCVVVDVSHEDPEVKFNVVYVDGVEVHNAK





TKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRX4E






X
5TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSC






SVX6HEALHX7HYTQKSLSLSPG





SEQ ID NO: 40: mature human Fc IgG1 wherein X4 is Asp or Glu, and X5 is


Leu or Met


DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNVVYVDGVEVHNAKT





KPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRX4EX5





TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSV





MHEALHNHYTQKSLSLSPG





SEQ ID NO: 41: mature human Fc IgG1 with a YTE triple mutation (bold


and underlined), and wherein X4 is Asp or Glu, and X5 is Leu or Met


DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFNVVYVDGVEVHNAKTK





PREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRX4EX5T





KNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSV





MHEALHNHYTQKSLSLSPG





SEQ ID NO: 42: mature human Fc IgG1 with a YTE triple mutation (bold


and underlined), allotype G1m(fa) (bold italics)


DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFNVVYVDGVEVHNAKTK





PREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRcustom-character Ecustom-character TK





NQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVM





HEALHNHYTQKSLSLSPG





SEQ ID NO: 43: mature human Fc IgG1 with a YTE triple mutation (bold


and underlined), allotype G1m(f) (bold italics)


DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFNVVYVDGVEVHNAKTK





PREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRcustom-character Ecustom-character TK





NQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVM





HEALHNHYTQKSLSLSPG





SEQ ID NO: 44: mature human Fc IgG1 with a LS double mutation (bold


and underlined), and wherein X4 is Asp or Glu, and X5 is Leu or Met


DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNVVYVDGVEVHNAKT





KPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRX4EX5





TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSV







L
HEALHSHYTQKSLSLSPG






SEQ ID NO: 45: mature human Fc IgG1 with a LS double mutation (bold


and underlined), allotype G1m(fa) (bold italics)


DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNVVYVDGVEVHNAKT





KPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRcustom-character Ecustom-character T





KNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVL





HEALHSHYTQKSLSLSPG





SEQ ID NO: 46: mature human Fc IgG1 with a LS double mutation (bold


and underlined), allotype G1m(f) (bold italics)


DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNVVYVDGVEVHNAKT





KPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRcustom-character Ecustom-character T





KNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVL





HEALHSHYTQKSLSLSPG





SEQ ID NO: 47: mature human Fc IgG1 with mouse heavy chain MIgG Vh


signal sequence (bold), deletion of Asp ([D]) Cys to Ser substitution


(#), and wherein X1 is Met or Trp, X2 is Ser or Thr, X3 is Thr or


Glu, X4 is Asp or Glu, and X5 is Leu or Met, X6 is Met or Leu, and X7


is Asn or Ser


MGWSCIILFLVATATGVHSNVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKD





TLX1IX2RX3PEVTCVVVDVSHEDPEVKFNVVYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNG





KEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRX4EX5TKNQVSLTCLVKGFYPSDIAVEWESNG





QPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVX6HEALHX7HYTQKSLSLSPG





SEQ ID NO: 48: mature human IgG1 Fc with mouse heavy chain MIgG Vh


signal sequence (bold), Cys to Ser substitution (#), allotype G1m(fa)


(bold italics)



MGWSCIILFLVATATGVHSNVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKD






TLMISRTPEVTCVVVDVSHEDPEVKFNVVYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGK





EYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRcustom-characterEcustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQP





ENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK





SEQ ID NO: 49: mature human IgG1 Fc with mouse heavy chain MIgG Vh


signal sequence (bold), Cys to Ser substitution (#), allotype G1m(f)


(bold italics)



MGWSCIILFLVATATGVHSNVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKD






TLMISRTPEVTCVVVDVSHEDPEVKFNVVYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGK





EYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRcustom-characterEcustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQP





ENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK





SEQ ID NO: 50: mature human IgG1 Fc with mouse heavy chain MIgG Vh


signal sequence (bold), Cys to Ser substitution (#), M428L, N4345


mutations (Bold/Underlined), allotype G1m(fa) (bold italics)



MGWSCIILFLVATATGVHSNVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKD






TLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGK





EYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRcustom-characterEcustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQP





ENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVLHEALHSHYTQKSLSLSPGK





SEQ ID NO: 51: mature human IgG1 Fc with mouse heavy chain MIgG Vh


signal sequence (bold), Cys to Ser substitution (#), M428L, N4345


mutations (Bold/Underlined), allotype G1m(f) (bold italics)



MGWSCIILFLVATATGVHSNVNHKPSNTKVDKKVEPKSSMDKTHTCPPCPAPELLGGPSVFLFPPKPKD






TLMISRTPEVTCVVVDVSHEDPEVKFNVVYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGK





EYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRcustom-characterEcustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQP





ENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVLHEALHSHYTQKSLSLSPGK





SEQ ID NO: 52: mature human IgG1 Fc with mouse heavy chain MIgG Vh


signal sequence (bold), Cys to Ser substitution (#), YTE triple


mutation (bold and underlined), allotype G1m(fa) (bold italics)



MGWSCIILFLVATATGVHSNVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKD






TLYITREPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKE





YKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRcustom-characterEcustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPE





NNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK





SEQ ID NO: 53: mature human IgG1 Fc with mouse heavy chain MIgG Vh


signal sequence (bold), Cys to Ser substitution (#), YTE triple


mutation (bold and underlined), allotype G1m(f) (bold italics)



MGWSCIILFLVATATGVHSNVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKD






TLYITREPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKE





YKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPE





NNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK





SEQ ID NO: 54: mature human IgG1 Fc with mouse heavy chain MIgG Vh


signal sequence (bold), N-terminal ISAMVRS amino acid residues added


(italicized), M428L, N4345 mutations (bold/underlined), G4S linker


(italicized), and C-terminal C-myc-tag (underlined), allotype G1m(f)


(bold italics)



MGWSCIILFLVATATGVHS
ISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVD






VSHEDPEVKFNVVYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEK





TISKAKGQPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS





FFLYSKLTVDKSRWQQGNVFSCSVLHEALHSHYTQKSLSLSPGGGGGSEQKLISEEDL





SEQ ID NO: 55: mature human IgG1 Fc with mouse heavy chain MIgG Vh


signal sequence (bold), N-terminal ISAMVRS amino acid residues added


(italicized), M428L, N4345 mutations (bold/underlined), G4S linker


(italicized), C-terminal C-myc-tag (underlined), allotype G1m(fa)


(bold italics)



MGWSCIILFLVATATGVHS
ISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVD






VSHEDPEVKFNVVYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEK





TISKAKGQPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF





FLYSKLTVDKSRWQQGNVFSCSLHEALHSHYTQKSLSLSPGGGGGSEQKLISEEDL





SEQ ID NO: 56: mature human IgG1 Fc with mouse heavy chain MIgG Vh


signal sequence (bold), N-terminal ISAMVRS amino acid residues added


(italicized), YTE triple mutant (bold/underlined), G4S linker


(italicized), and C-terminal C-myc-tag (underlined), allotype G1m(f)


(bold italics)



MGWSCIILFLVATATGVHS
ISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLYITREPEVTCVVVD






VSHEDPEVKFNVVYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEK





TISKAKGQPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS





FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGGGGGSEQKLISEEDL





SEQ ID NO: 57: mature human IgG1 Fc with mouse heavy chain MIgG Vh


signal sequence (bold), N-terminal ISAMVRS amino acid residues added


(italicized), YTE triple mutant (bold/underlined), G4S linker


(italicized), C-terminal C-myc-tag (underlined), allotype G1m(fa) (bold


italics)



MGWSCIILFLVATATGVHS
ISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLYITREPEVTCVVVD






VSHEDPEVKFNVVYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEK





TISKAKGQPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF





FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGGGGGSEQKLISEEDL





SEQ ID NO: 58: mature human IgG1 with mouse heavy chain MIgG1 signal


sequence (bold), Cys to Ser substitution (#), C-terminal G4S (italics),


and C-terminal IgA peptide (underline), allotype G1m(fa) (bold


italics)



MGWSCIILFLVATATGVHSEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVD






VSHEDPEVKFNVVYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEK





TISKAKGQPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF





FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGGGGGSQRNPRLRLIRRHPTLRIPPI





SEQ ID NO: 59: mature human IgG1 with mouse heavy chain MIgG1 signal


sequence (bold), Cys to Ser substitution (#), M428L, N4345 mutations


(bold/underlined), C-terminal G4S (italics), and C-terminal IgA


peptide (underline), allotype G1m(fa) (bold italics)



MGWSCIILFLVATATGVHSEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVD






VSHEDPEVKFNVVYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEK





TISKAKGQPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF





FLYSKLTVDKSRWQQGNVFSCSVLHEALHSHYTQKSLSLSPGGGGGSQRNPRLRLIRRHPTLRIPPI





SEQ ID NO: 60: mature human Fc IgG1, Z1 is Cys or Ser, and wherein X1


is Met or Trp, X2 is Ser or Thr, X3 is Thr or Glu, X4 is Asp or Glu,


and X5 is Leu or Met, X6 is Met or Leu, and X7 is Asn or Ser


NVNHKPSNTKVDKKVEPKSZ1DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLX1IX2RX3PEVTCVVVDVSH





EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS





KAKGQPREPQVYTLPPSRX4EX5TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF





LYSKLTVDKSRWQQGNVFSCSVX6HEALHX7HYTQKSLSLSPGK





SEQ ID NO: 61: mature human Fc IgG1, Cys to Ser substitution (#),


and wherein X1 is Met or Trp, X2 is Ser or Thr, X3 is Thr or Glu, X4 is


Asp or Glu, and X5 is Leu or Met, X6 is Met or Leu, and X7 is Asn or Ser


NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLX1IX2RX3PEVTCVVVDVS





HEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTI





SKAKGQPREPQVYTLPPSRX4EX5TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF





FLYSKLTVDKSRWQQGNVFSCSVX6HEALHX7HYTQKSLSLSPGK





SEQ ID NO: 62: mature human IgG1 Fc, Cys to Ser substitution (#), X4


is Asp or Glu, and X5 is Leu or Met


NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSH





EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS





KAKGQPREPQVYTLPPSRX4EX5TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF





LYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK





SEQ ID NO: 63: mature human IgG1 Fc, Cys to Ser substitution (#),


allotype G1m(f) (bold italics)


NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSH





EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS





KAKGQPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL





YSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK





SEQ ID NO: 64: mature human IgG1 Fc, Cys to Ser substitution (#),


allotype G1m(fa) (bold italics)


NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSH





EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS





KAKGQPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL





YSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK





SEQ ID NO: 65: mature human IgG1 Fc, Cys to Ser substitution (#),


M428L, N4345 mutations (Bold/Underlined), allotype G1m(fa) (bold italics)


NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSH





EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS





KAKGQPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL





YSKLTVDKSRWQQGNVFSCSVLHEALHSHYTQKSLSLSPGK





SEQ ID NO: 66: mature human IgG1 Fc, Cys to Ser substitution (#),


M428L, N4345 mutations (Bold/Underlined), allotype G1m(f) (bold italics)


NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSH





EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS





KAKGQPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL





YSKLTVDKSRWQQGNVFSCSVLHEALHSHYTQKSLSLSPGK





SEQ ID NO: 67: mature human IgG1 Fc, Cys to Ser substitution (#),


YTE triple mutation (bold and underlined), allotype G1m(fa) (bold italics)


NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSH





EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS





KAKGQPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL





YSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK





SEQ ID NO: 68: mature human IgG1 Fc, Cys to Ser substitution (#),


YTE triple mutation (bold and underlined), allotype G1m(f) (bold italics)


NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSH





EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS





KAKGQPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL





YSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK





SEQ ID NO: 69: mature human Fc IgG1, Z1 is Cys or Ser, and wherein X1


is Met or Trp, X2 is Ser or Thr, X3 is Thr or Glu, X4 is Asp or Glu,


and X5 is Leu or Met, X6 is Met or Leu, and X7 is Asn or Ser


NVNHKPSNTKVDKKVEPKSZ1DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLX1IX2RX3PEVTCVVVDVSH





EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS





KAKGQPREPQVYTLPPSRX4EX5TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF





LYSKLTVDKSRWQQGNVFSCSVX6HEALHX7HYTQKSLSLSPG





SEQ ID NO: 70: mature human Fc IgG1, Cys to Ser substitution (#),


and wherein X1 is Met or Trp, X2 is Ser or Thr, X3 is Thr or Glu, X4


is Asp or Glu, and X5 is Leu or Met, X6 is Met or Leu, and X7 is Asn or Ser


NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLX1IX2RX3PEVTCVVVDVS





HEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTI





SKAKGQPREPQVYTLPPSRX4EX5TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF





FLYSKLTVDKSRWQQGNVFSCSVX6HEALHX7HYTQKSLSLSPG





SEQ ID NO: 71: mature human IgG1 Fc, Cys to Ser substitution (#),


X4 is Asp or Glu, and X5 is Leu or Met


NVNHKPSNTKVDKKVEPKSSMDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSH





EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS





KAKGQPREPQVYTLPPSRX4EX5TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF





LYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG





SEQ ID NO: 72: mature human IgG1 Fc, Cys to Ser substitution (#),


allotype G1m(f) (bold italics)


NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSH





EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS





KAKGQPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL





YSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG





SEQ ID NO: 73: mature human IgG1 Fc, Cys to Ser substitution (#),


allotype G1m(fa) (bold italics)


NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSH





EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS





KAKGQPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL





YSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG





SEQ ID NO: 74: mature human IgG1 Fc, Cys to Ser substitution (#),


M428L, N4345 mutations (Bold/Underlined), allotype G1m(fa) (bold italics)


NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSH





EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS





KAKGQPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL





YSKLTVDKSRWQQGNVFSCSVLHEALHSHYTQKSLSLSPG





SEQ ID NO: 75: mature human IgG1 Fc, Cys to Ser substitution (#),


M428L, N434S mutations (Bold/Underlined), allotype G1m(f) (bold italics)


NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSH





EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS





KAKGQPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL





YSKLTVDKSRWQQGNVFSCSVLHEALHSHYTQKSLSLSPG





SEQ ID NO: 76: mature human IgG1 Fc, Cys to Ser substitution (#),


YTE triple mutation (bold and underlined), allotype G1m(fa) (bold italics)


NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSH





EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS





KAKGQPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL





YSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG





SEQ ID NO: 77: mature human IgG1 Fc, Cys to Ser substitution (#),


YTE triple mutation (bold and underlined), allotype G1m(f) (bold italics)


NVNHKPSNTKVDKKVEPKSSMDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSH





EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS





KAKGQPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL





YSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG





SEQ ID NO: 78: mature human Fc IgG1, Z1 is Cys or Ser, and wherein X1


is Met or Trp, X2 is Ser or Thr, X3 is Thr or Glu, X4 is Asp or Glu,


and X5 is Leu or Met, X6 is Met or Leu, and X7 is Asn or Ser


VNHKPSNTKVDKKVEPKSZ1DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLX1IX2RX3PEVTCVVVDVSHE





DPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISK





AKGQPREPQVYTLPPSRX4EX5TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL





YSKLTVDKSRWQQGNVFSCSVX6HEALHX7HYTQKSLSLSPGK





SEQ ID NO: 79: mature human Fc IgG1, Cys to Ser substitution (#),


and wherein X1 is Met or Trp, X2 is Ser or Thr, X3 is Thr or Glu, X4


is Asp or Glu, and X5 is Leu or Met, X6 is Met or Leu, and X7 is Asn or Ser


VNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLX1IX2RX3PEVTCVVVDVSH





EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS





KAKGQPREPQVYTLPPSRX4EX5TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF





LYSKLTVDKSRWQQGNVFSCSVX6HEALHX7HYTQKSLSLSPGK





SEQ ID NO: 80: mature human IgG1 Fc, Cys to Ser substitution (#),


X4 is Asp or Glu, and X5 is Leu or Met


VNHKPSNTKVDKKVEPKSSMDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHE





DPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISK





AKGQPREPQVYTLPPSRX4EX5TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL





YSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK





SEQ ID NO: 81: mature human IgG1 Fc, Cys to Ser substitution (#),


allotype G1m(f) (bold italics)


VNHKPSNTKVDKKVEPKSSMDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHE





DPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISK





AKGQPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLY





SKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK





SEQ ID NO: 82: mature human IgG1 Fc, Cys to Ser substitution (#),


allotype G1m(fa) (bold italics)


VNHKPSNTKVDKKVEPKSSMDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHE





DPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISK





AKGQPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLY





SKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK





SEQ ID NO: 83: mature human IgG1 Fc, Cys to Ser substitution (#),


M428L, N434S mutations (Bold/Underlined), allotype G1m(fa) (bold italics)


VNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHE





DPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISK





AKGQPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLY





SKLTVDKSRWQQGNVFSCSVLHEALHSHYTQKSLSLSPGK





SEQ ID NO: 84: mature human IgG1 Fc, Cys to Ser substitution (#),


M428L, N434S mutations (Bold/Underlined), allotype G1m(f) (bold italics)


VNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHE





DPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISK





AKGQPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLY





SKLTVDKSRWQQGNVFSCSVLHEALHSHYTQKSLSLSPGK





SEQ ID NO: 85: mature human IgG1 Fc, Cys to Ser substitution (#),


YTE triple mutation (bold and underlined), allotype G1m(fa) (bold italics)


VNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHE





DPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISK





AKGQPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLY





SKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK





SEQ ID NO: 86: mature human IgG1 Fc, Cys to Ser substitution (#),


YTE triple mutation (bold and underlined), allotype G1m(f) (bold italics)


VNHKPSNTKVDKKVEPKSSMDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHE





DPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISK





AKGQPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLY





SKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK





SEQ ID NO: 87: mature human Fc IgG1, Z1 is Cys or Ser, and wherein X1


is Met or Trp, X2 is Ser or Thr, X3 is Thr or Glu, X4 is Asp or Glu,


and X5 is Leu or Met, X6 is Met or Leu, and X7 is Asn or Ser


VNHKPSNTKVDKKVEPKSZ1DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLX1IX2RX3PEVTCVVVDVSHE





DPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISK





AKGQPREPQVYTLPPSRX4EX5TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL





YSKLTVDKSRWQQGNVFSCSVX6HEALHX7HYTQKSLSLSPG





SEQ ID NO: 88: mature human Fc IgG1, Cys to Ser substitution (#),


and wherein X1 is Met or Trp, X2 is Ser or Thr, X3 is Thr or Glu, X4


is Asp or Glu, and X5 is Leu or Met, X6 is Met or Leu, and X7 is Asn or Ser


VNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLX1IX2RX3PEVTCVVVDVSH





EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS





KAKGQPREPQVYTLPPSRX4EX5TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF





LYSKLTVDKSRWQQGNVFSCSVX6HEALHX7HYTQKSLSLSPG





SEQ ID NO: 89: mature human IgG1 Fc, Cys to Ser substitution (#),


X4 is Asp or Glu, and X5 is Leu or Met


VNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHE





DPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISK





AKGQPREPQVYTLPPSRX4EX5TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL





YSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG





SEQ ID NO: 90: mature human IgG1 Fc, Cys to Ser substitution (#),


allotype G1m(f) (bold italics)


VNHKPSNTKVDKKVEPKSSMDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHE





DPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISK





AKGQPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLY





SKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG





SEQ ID NO: 91: mature human IgG1 Fc, Cys to Ser substitution (#),


allotype G1m(fa) (bold italics)


VNHKPSNTKVDKKVEPKSSMDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHE





DPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISK





AKGQPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLY





SKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG





SEQ ID NO: 92: mature human IgG1 Fc, Cys to Ser substitution (#),


M428L, N434S mutations (Bold/Underlined), allotype G1m(fa) (bold italics)


VNHKPSNTKVDKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHE





DPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISK





AKGQPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLY





SKLTVDKSRWQQGNVFSCSVLHEALHSHYTQKSLSLSPG





SEQ ID NO: 93: mature human IgG1 Fc, Cys to Ser substitution (#),


M428L, N434S mutations (Bold/Underlined), allotype G1m(f) (bold italics)


VNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHE





DPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISK





AKGQPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLY





SKLTVDKSRWQQGNVFSCSVLHEALHSHYTQKSLSLSPG





SEQ ID NO: 94: mature human IgG1 Fc, Cys to Ser substitution (#),


YTE triple mutation (bold and underlined), allotype G1m(fa) (bold italics)


VNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHE





DPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISK





AKGQPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLY





SKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG





SEQ ID NO: 95: mature human IgG1 Fc, Cys to Ser substitution (#),


YTE triple mutation (bold and underlined), allotype G1m(f) (bold italics)


VNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHE





DPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISK





AKGQPREPQVYTLPPSRcustom-character Ecustom-character TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLY





SKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG






As defined herein, an Fc domain includes two Fc domain monomers that are dimerized by the interaction between the CH3 antibody constant domains, as well as one or more disulfide bonds that form between the hinge domains of the two dimerizing Fc domain monomers. An Fc domain forms the minimum structure that binds to an Fc receptor, e.g., Fc-gamma receptors (i.e., Fcγ receptors (FcγR)), Fc-alpha receptors (i.e., Fcα receptors (FcαR)), Fc-epsilon receptors (i.e., Fcs receptors (FcεR)), and/or the neonatal Fc receptor (FcRn). In some embodiments, an Fc domain of the present invention binds to an Fcγ receptor (e.g., FcRn, FcγRI (CD64), FcγRIIa (CD32), FcγRIIb (CD32), FcγRIIIa (CD16a), FcγRIIIb (CD16b)), and/or FcγRIV and/or the neonatal Fc receptor (FcRn).


In some embodiments, the Fc domain monomer or Fc domain of the invention is an aglycosylated Fc domain monomer or Fc domain (e.g., an Fc domain monomer or an Fc domain that maintains engagement to an Fc receptor (e.g., FcRn). For example, the Fc domain is an aglycosylated IgG1 variants that maintains engagement to an Fc receptor (e.g., an IgG1 having an amino acid substitution at N297 and/or T299 of the glycosylation motif). Exemplary aglycosylated Fc domains and methods for making aglycosylated Fc domains are known in the art, for example, as described in Sazinsky S. L. et al., Aglycosylated immunoglobulin G1 variants productively engage activating Fc receptors, PNAS, 2008, 105(51):20167-20172, which is incorporated herein in its entirety.


In some embodiments, the Fc domain or Fc domain monomer of the invention is engineered to enhance binding to the neonatal Fc receptor (FcRn). For example, the Fc domain may include the triple mutation corresponding to M252Y/S254T/T256E (YTE) (e.g., an IgG1, such as a human or humanized IgG1 having a YTE mutation, for example SEQ ID NO: 33, SEQ ID NO: 36, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 56, or SEQ ID NO: 57). The Fc domain may include the double mutant corresponding to M428L/N434S (LS) (e.g., an IgG1, such as a human or humanized IgG1 having an LS mutation, such as SEQ ID NO: 37, SEQ ID NO: 39, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 54, SEQ ID NO: 55, or SEQ ID NO: 59). The Fc domain may include the single mutant corresponding to N434H (e.g., an IgG1, such as a human or humanized IgG1 having an N434H mutation). The Fc domain may include the single mutant corresponding to C220S (e.g., and IgG1, such as a human or humanized IgG1 having a C220S mutation, such as SEQ ID NO: 34, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 58, SEQ ID NO: 59, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, SEQ ID NO: 65, SEQ ID NO: 66, SEQ ID NO: 67, or SEQ ID NO: 68). The Fc domain may include a combination of one or more of the above-described mutations that enhance binding to the FcRn. Enhanced binding to the FcRn may increase the half-life Fc domain-containing conjugate. For example, incorporation of one or more amino acid mutations that increase binding to the FcRn (e.g., a YTE mutation, an LS mutation, or an N434H mutation) may increase the half-life of the conjugate by 5%, 10%, 15%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%. 100%, 200%, 300%, 400%, 500% or more relative to a conjugate having the corresponding Fc domain without the mutation that enhances FcRn binding. Exemplary Fc domains with enhanced binding to the FcRN and methods for making Fc domains having enhanced binding to the FcRN are known in the art, for example, as described in Maeda, A. et al., Identification of human IgG1 variant with enhanced FcRn binding and without increased binding to rheumatoid factor autoantibody, MABS, 2017, 9(5):844-853, which is incorporated herein in its entirety. As used herein, an amino acid “corresponding to” a particular amino acid residue (e.g., of a particular SEQ ID NO.) should be understood to include any amino acid residue that one of skill in the art would understand to align to the particular residue (e.g., of the particular sequence). For example, any one of SEQ ID NOs: 1-95 may be mutated to include a YTE mutation, an LS mutation, and/or an N434H mutation by mutating the “corresponding residues” of the amino acid sequence.


As used herein, a sulfur atom “corresponding to” a particular cysteine residue of a particular SEQ ID NO. should be understood to include the sulfur atom of any cysteine residue that one of skill in the art would understand to align to the particular cysteine of the particular sequence. The protein sequence alignment of human IgG1 (UniProtKB: P01857; SEQ ID NO: 121), human IgG2 (UniProtKB: P01859; SEQ ID NO: 122), human IgG3 (UniProtKB: P01860; SEQ ID NO: 123), and human IgG4 (UniProtKB: P01861; SEQ ID NO: 124) is provided below (aligned with Clustal Omega Multiple Pairwise Alignment). The alignment indicates cysteine residues (e.g., sulfur atoms of cysteine residues) that “correspond to” one another (in boxes and indicated by the • symbol). One of skill in the art would readily be able to perform such an alignment with any IgG variant of the invention to determine the sulfur atom of a cysteine that corresponds to any sulfur atom of a particular cysteine of a particular SEQ ID NO. described herein (e.g., any one of SEQ ID NOs: 1-95). For example, one of skill in the art would readily be able to determine that Cys10 of SEQ ID NO: 10 (the first cysteine of the conserved CPPC motif of the hinge region of the Fc domain) corresponds to, for example, Cys109 of IgG1, Cys106 of IgG2, Cys156 of IgG3, Cys29 of SEQ ID NO: 1, Cys9 of SEQ ID NO: 2, Cys30 of SEQ ID NO: 3, or Cys10 of SEQ ID NO: 10.


In some embodiments, the Fc domain or Fc domain monomer of the invention has the sequence of any one of SEQ ID NOs: 39-95 may further include additional amino acids at the N-terminus (Xaa)x and/or additional amino acids at the C-terminus (Xaa)z, wherein Xaa is any amino acid and x and z are a whole number greater than or equal to zero, generally less than 100, preferably less than 10 and more preferably 0, 1, 2, 3, 4, or 5. In some embodiments, the additional amino acids are least 70% (e.g., 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100%) identical to one or more consecutive amino acids of SEQ ID NO: 103. For example, the additional amino acids may be a single amino acid on the C-terminus corresponding to Lys330 of IgG1 (SEQ ID NO: 121).


As used herein, a nitrogen atom “corresponding to” a particular lysine residue of a particular SEQ ID NO. should be understood to include the nitrogen atom of any lysine residue that one of skill in the art would understand to align to the particular lysine of the particular sequence. The protein sequence alignment of human IgG1 (UniProtKB: P01857; SEQ ID NO: 121), human IgG2 (UniProtKB: P01859; SEQ ID NO: 122), human IgG3 (UniProtKB: P01860; SEQ ID NO: 123), and human IgG4 (UniProtKB: P01861; SEQ ID NO: 124) is provided below (aligned with Clustal Omega Multiple Pairwise Alignment). The alignment indicates lysine residues (e.g., nitrogen atoms of lysine residues) that “correspond to” one another (in boxes and indicated by the * symbol). One of skill in the art would readily be able to perform such an alignment with any IgG variant of the invention to determine the nitrogen atom of a lysine that corresponds to any nitrogen atom of a particular lysine of a particular SEQ ID NO. described herein (e.g., any one of SEQ ID NOs: 1-95). For example, one of skill in the art would readily be able to determine that Lys35 of SEQ ID NO: 10 corresponds to, for example, Lys129 of IgG1, Lys126 of IgG2, Lys176 of IgG3, Lys51 of SEQ ID NO: 1, Lys31 of SEQ ID NO: 2, Lys50 of SEQ ID NO: 3, or Lys30 of SEQ ID NO: 10.


Protein Sequence Alignment of IgG1 (SEQ ID NO: 121), IgG2 (SEQ ID NO: 122), IgG3 (SEQ ID NO: 123), and IgG4 (SEQ ID NO: 124)

















embedded image




embedded image









embedded image




embedded image









embedded image




embedded image









embedded image




embedded image









embedded image




embedded image









embedded image




embedded image









embedded image




embedded image











In some embodiments, the Fc domain monomer includes less than about 300 amino acid residues (e.g., less than about 300, less than about 295, less than about 290, less than about 285, less than about 280, less than about 275, less than about 270, less than about 265, less than about 260, less than about 255, less than about 250, less than about 245, less than about 240, less than about 235, less than about 230, less than about 225, or less than about 220 amino acid residues). In some embodiments, the Fc domain monomer is less than about 40 kDa (e.g., less than about 35 kDa, less than about 30 kDa, less than about 25 kDa).


In some embodiments, the Fc domain monomer includes at least 200 amino acid residues (e.g., at least 210, at least 220, at least 230, at least 240, at least 250, at least 260, at least 270, at least 280, at least 290, or at least 300 amino residues). In some embodiments, the Fc domain monomer is at least 20 kDa (e.g., at least 25 kDa, at least 30 kDa, or at least 35 kDa).


In some embodiments, the Fc domain monomer includes 200 to 400 amino acid residues (e.g., 200 to 250, 250 to 300, 300 to 350, 350 to 400, 200 to 300, 250 to 350, or 300 to 400 amino acid residues). In some embodiments, the Fc domain monomer is 20 to 40 kDa (e.g., 20 to 25 kDa, 25 to 30 kDa, 35 to 40 kDa, 20 to 30 kDa, 25 to 35 kDa, or 30 to 40 KDa).


In some embodiments, the Fc domain monomer includes an amino acid sequence at least 90% identical (e.g., at least 95%, at least 98%) to the sequence of any one of SEQ ID NOs: 1-95, or a region thereof. In some embodiments, the Fc domain monomer includes the amino acid sequence of any one of SEQ ID NOs: 1-95, or a region thereof.


In some embodiments, the Fc domain monomer includes a region of any one of SEQ ID NOs: 1-95, wherein the region includes positions 220, 252, 254, and 256. In some embodiments, the region includes at least 40 amino acid residues, at least 50 amino acid residues, at least 60 amino acid residues, at least 70 amino acids residues, at least 80 amino acids residues, at least 90 amino acid residues, at least 100 amino acid residues, at least 110 amino acid residues, at least 120 amino residues, at least 130 amino acid residues, at least 140 amino acid residues, at least 150 amino acid residues, at least 160 amino acid residues, at least 170 amino acid residues, at least 180 amino acid residues, at least 190 amino acid residues, or at least 200 amino acid residues.


Activation of Immune Cells

Fc-gamma receptors (FcγRs) bind the Fc portion of immunoglobulin G (IgG) and play important roles in immune activation and regulation. For example, the IgG Fc domains in immune complexes (ICs) engage FcγRs with high avidity, thus triggering signaling cascades that regulate immune cell activation. The human FcγR family contains several activating receptors (FcγRI, FcγRIIa, FcγRIIc, FcγRIIIa, and FcγRIIIb) and one inhibitory receptor (FcγRIIb). FcγR signaling is mediated by intracellular domains that contain immune tyrosine activating motifs (ITAMs) for activating FcγRs and immune tyrosine inhibitory motifs (ITIM) for inhibitory receptor FcγRIIb. In some embodiments, FcγR binding by Fc domains results in ITAM phosphorylation by Src family kinases; this activates Syk family kinases and induces downstream signaling networks, which include PI3K and Ras pathways.


In the conjugates described herein, the portion of the conjugates including monomers or dimers of gp120 binders bind to and inhibits viral gp120 receptor leading to inhibition of viral replication, while the Fc domain portion of the conjugates bind to FcγRs (e.g., FcRn, FcγRI, FcγRIIa, FcγRIIc, FcγRIIIa, and FcγRIIIb) on immune cells and activate phagocytosis and effector functions, such as antibody-dependent cell-mediated cytotoxicity (ADCC), thus leading to the engulfment and destruction of viral particles by immune cells and further enhancing the antiviral activity of the conjugates. Examples of immune cells that may be activated by the conjugates described herein include, but are not limited to, macrophages, neutrophils, eosinophils, basophils, lymphocytes, follicular dendritic cells, natural killer cells, and mast cells.


Tissue Distribution

After a therapeutic enters the systemic circulation, it is distributed to the body's tissues. Distribution is generally uneven because of different in blood perfusion, tissue binding, regional pH, and permeability of cell membranes. The entry rate of a drug into a tissue depends on the rate of blood flow to the tissue, tissue mass, and partition characteristics between blood and tissue. Distribution equilibrium (when the entry and exit rates are the same) between blood and tissue is reached more rapidly in richly vascularized areas, unless diffusion across cell membranes is the rate-limiting step. The size, shape, charge, target binding, FcRn and target binding mechanisms, route of administration, and formulation affect tissue distribution.


In some instances, the conjugates described herein may be optimized to distribute to lung tissue. In some instances, the conjugates have a concentration ratio of distribution in epithelial lining fluid of at least 30% the concentration of the conjugate in plasma within 2 hours after administration. In certain embodiments, ratio of the concentration is at least 45% within 2 hours after administration. In some embodiments, the ratio of concentration is at least 55% within 2 hours after administration. In particular, the ratio of concentration is at least 60% within 2 hours after administration. As shown in Example 35 and FIG. 13, by 2 hours post injection, a conjugate having an Fc domain (SEQ ID NO: 64) decorated with one or more small molecule antiviral inhibitors ELF levels are surprisingly ˜60% of plasma exposure levels as measured by AUC across the rest of the time course indicating nearly immediate partitioning of the conjugate from plasma to the ELF in the lung. This demonstrates that an Fc containing conjugate rapidly distributes to lung, and maintains high concentrations in lung relative to levels in plasma.


IV. Albumin Proteins and Albumin Protein-Binding Peptides
Albumin Proteins

An albumin protein of the invention may be a naturally-occurring albumin or a variant thereof, such as an engineered variant of a naturally-occurring albumin protein. Variants include polymorphisms, fragments such as domains and sub-domains, and fusion proteins. An albumin protein may include the sequence of an albumin protein obtained from any source. Preferably the source is mammalian, such as human or bovine. Most preferably, the albumin protein is human serum albumin (HSA), or a variant thereof. Human serum albumins include any albumin protein having an amino acid sequence naturally occurring in humans, and variants thereof. An albumin protein coding sequence is obtainable by methods know to those of skill in the art for isolating and sequencing cDNA corresponding to human genes. An albumin protein of the invention may include the amino acid sequence of human serum albumin (HSA), provided in SEQ ID NO: 96 or SEQ ID NO: 97, or the amino acid sequence of mouse serum albumin (MSA), provided in SEQ ID NO: 98, or a variant or fragment thereof, preferably a functional variant or fragment thereof. A fragment or variant may or may not be functional, or may retain the function of albumin to some degree. For example, a fragment or variant may retain the ability to bind to an albumin receptor, such as HSA or MSA, by at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, or 105% of the ability of the parent albumin (e.g., the parent albumin from which the fragment or variant is derived). Relative binding ability may be determined by methods known in the art, such as by surface plasmon resonance.


The albumin protein may be a naturally-occurring polymorphic variant of an albumin protein, such as human serum albumin. Generally, variants or fragments of human serum albumin will have at least 5%, 10%, 15%, 20%, 30%, 40%, 50%, 60%, or 70%, and preferably 80%, 90%, 95%, 100%, or 105% or more of human serum albumin or mouse serum albumin's ligand binding activity.


The albumin protein may include the amino acid sequence of bovine serum albumin. Bovine serum albumin proteins include any albumin having an amino acid sequence naturally occurring in cows, for example, as described by Swissprot accession number P02769, and variants thereof as defined herein. Bovine serum albumin proteins also includes fragments of full-length bovine serum albumin or variants thereof, as defined herein.


The albumin protein may comprise the sequence of an albumin derived from one of serum albumin from dog (e.g., Swissprot accession number P49822-1), pig (e.g., Swissprot accession number P08835-1), goat (e.g., Sigma product no. A2514 or A4164), cat (e.g., Swissprot accession number P49064-1), chicken (e.g., Swissprot accession number P19121-1), ovalbumin (e.g., chicken ovalbumin) (e.g., Swissprot accession number P01012-1), turkey ovalbumin (e.g., Swissprot accession number O73860-1), donkey (e.g., Swissprot accession number Q5XLE4-1), guinea pig (e.g., Swissprot accession number Q6WDN9-1), hamster (e.g., as described in DeMarco et al. International Journal for Parasitology 37(11): 1201-1208 (2007)), horse (e.g., Swissprot accession number P35747-1), rhesus monkey (e.g., Swissprot accession number Q28522-1), mouse (e.g., Swissprot accession number P07724-1), pigeon (e.g., as defined by Khan et al. Int. J. Biol. Macromol. 30(3-4), 171-8 (2002)), rabbit (e.g., Swissprot accession number P49065-1), rat (e.g., Swissprot accession number P02770-1) or sheep (e.g., Swissprot accession number P14639-1), and includes variants and fragments thereof as defined herein.


Many naturally-occurring mutant forms of albumin are known to those skilled in the art. Naturally-occurring mutant forms of albumin are described in, for example, Peters, et al. All About Albumin: Biochemistry, Genetics and Medical Applications, Academic Press, Inc., San Diego, Calif., p. 170-181 (1996).


Albumin proteins of the invention include variants of naturally-occurring albumin proteins. A variant albumin refers to an albumin protein having at least one amino acid mutation, such as an amino acid mutation generated by an insertion, deletion, or substitution, either conservative or non-conservative, provided that such changes result in an albumin protein for which at least one basic property has not been significantly altered (e.g., has not been altered by more than 5%, 10%, 15%, 20%, 25%, 30%, 35%, or 40%). Exemplary properties which may define the activity of an albumin protein include binding activity (e.g., including binding specificity or affinity to bilirubin, or a fatty acid such as a long-chain fatty acid), osmolarity, or behavior in a certain pH-range.


Typically an albumin protein variant will have at least 40%, at least 50%, at least 60%, and preferably at least 70%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% amino acid sequence identity with a naturally-occurring albumin protein, such as the albumin protein of any one of SEQ ID NOs: 96-98.


Methods for the production and purification of recombinant human albumins are well-established (Sleep et al. Biotechnology, 8(1):42-6 (1990)), and include the production of recombinant human albumin for pharmaceutical applications (Bosse et al. J Clin Pharmacol 45(1):57-67 (2005)). The three-dimensional structure of HSA has been elucidated by X-ray crystallography (Carter et al. Science. 244(4909): 1195-8(1998)); Sugio et al. Protein Eng. 12(6):439-46 (1999)). The HSA polypeptide chain has 35 cysteine residues, which form 17 disulfide bonds, and one unpaired (e.g., free) cysteine at position 34 of the mature protein. Cys-34 of HSA has been used for conjugation of molecules to albumin (Leger et al. Bioorg Med Chem Lett 14(17):4395-8 (2004); Thibaudeau et al. Bioconjug Chem 16(4):1000-8 (2005)), and provides a site for site-specific conjugation.









(Human serum albumin (HSA), variant 1)


SEQ ID NO: 96


DAHKSEVAHRFKDLGEENFKALVLIAFAQYLQQCPFEDHVKLVNEVTEFA





KTCVADESAENCDKSLHTLFGDKLCTVATLRETYGEMADCCAKQEPERNE





CFLQHKDDNPNLPRLVRPEVDVMCTAFHDNEETFLKKYLYEIARRHPYFY





APELLFFAKRYKAAFTECCQAADKAACLLPKLDELRDEGKASSAKQRLKC





ASLQKFGERAFKAWAVARLSQRFPKAEFAEVSKLVTDLTKVHTECCHGDL





LECADDRADLAKYICENQDSISSKLKECCEKPLLEKSHCIAEVENDEMPA





DLPSLAADFVESKDVCKNYAEAKDVFLGMFLYEYARRHPDYSVVLLLRLA





KTYETTLEKCCAAADPHECYAKVFDEFKPLVEEPQNLIKQNCELFEQLGE





YKFQNALLVRYTKKVPQVSTPTLVEVSRNLGKVGSKCCKHPEAKRMPCAE





DYLSVVLNQLCVLHEKTPVSDRVTKCCTESLVNRRPCFSALEVDETYVPK





EFNAETFTFHADICTLSEKERQIKKQTALVELVKHKPKATKEQLKAVMDD





FAAFVEKCCKADDKETCFAEEGKKLVAASQAALGL





(Human serum albumin (HSA), variant 2)


SEQ ID NO: 97


RGVFRRDAHKSEVAHRFKDLGEENFKALVLIAFAQYLQQCPFEDHVKLVN





EVTEFAKTCVADESAENCDKSLHTLFGDKLCTVATLRETYGEMADCCAKQ





EPERNECFLQHKDDNPNLPRLVRPEVDVMCTAFHDNEETFLKKYLYEIAR





RHPYFYAPELLFFAKRYKAAFTECCQAADKAACLLPKLDELRDEGKASSA





KQRLKCASLQKFGERAFKAWAVARLSQRFPKAEFAEVSKLVTDLTKVHTE





CCHGDLLECADDRADLAKYICENQDSISSKLKECCEKPLLEKSHCIAEVE





NDEMPADLPSLAADFVESKDVCKNYAEAKDVFLGMFLYEYARRHPDYSVV





LLLRLAKTYETTLEKCCAAADPHECYAKVFDEFKPLVEEPQNLIKQNCEL





FEQLGEYKFQNALLVRYTKKVPQVSTPTLVEVSRNLGKVGSKCCKHPEAK





RMPCAEDYLSVVLNQLCVLHEKTPVSDRVTKCCTESLVNRRPCFSALEVD





ETYVPKEFNAETFTFHADICTLSEKERQIKKQTALVELVKHKPKATKEQL





KAVMDDFAAFVEKCCKADDKETCFAEEGKKLVAASQAALGL





(Mouse serum albumin (MSA))


SEQ ID NO: 98


RGVFRREAHKSEIAHRYNDLGEQHFKGLVLIAFSQYLQKCSYDEHAKLVQ





EVTDFAKTCVADESAANCDKSLHTLFGDKLCAIPNLRENYGELADCCTKQ





EPERNECFLQHKDDNPSLPPFERPEAEAMCTSFKENPTTFMGHYLHEVAR





RHPYFYAPELLYYAEQYNEILTQCCAEADKESCLTPKLDGVKEKALVSSV





RQRMKCSSMQKFGERAFKAWAVARLSQTFPNADFAEITKLATDLTKVNKE





CCHGDLLECADDRAELAKYMCENQATISSKLQTCCDKPLLKKAHCLSEVE





HDTMPADLPAIAADFVEDQEVCKNYAEAKDVFLGTFLYEYSRRHPDYSVS





LLLRLAKKYEATLEKCCAEANPPACYGTVLAEFQPLVEEPKNLVKTNCDL





YEKLGEYGFQNAILVRYTQKAPQVSTPTLVEAARNLGRVGTKCCTLPEDQ





RLPCVEDYLSAILNRVCLLHEKTPVSEHVTKCCSGSLVERRPCFSALTVD





ETYVPKEFKAETFTFHSDICTLPEKEKQIKKQTALAELVKHKPKATAEQL





KTVMDDFAQFLDTCCKAADKDTCFSTEGPNLVTRCKDALA






Conjugation of Albumin Proteins

An albumin protein of the invention may be conjugated to (e.g., by way of a covalent bond) to any compound of the invention (e.g., by way of the linker portion of a gp120 binder monomer or dimer). The albumin protein may be conjugated to any compound of the invention by any method well-known to those of skill in the art for producing small-molecule-protein conjugates. This may include covalent conjugation to a solvent-exposed amino acid, such as a solvent exposed cysteine or lysine. For example, human serum albumin may be conjugated to a compound of the invention by covalent linkage to the sulfur atom corresponding to Cys34 of SEQ ID NO: 96 or Cys40 of SEQ ID NO: 97.


An albumin protein of the invention may be conjugated to any compound of the invention by way of an amino acid located within 10 amino acid residues of the C-terminal or N-terminal end of the albumin protein. An albumin protein may include a C-terminal or N-terminal polypeptide fusion of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, or 20 or more amino acid. The C-terminal or N-terminal polypeptide fusion may include one or more solvent-exposed cysteine or lysine residues, which may be used for covalent conjugation of a compound of the invention (e.g., conjugation to a gp120 binder monomer or dimer, including by way of a linker).


Albumin proteins of the invention include any albumin protein which has been engineered to include one or more solvent-exposed cysteine or lysine residues, which may provide a site for conjugation to a compound of the invention (e.g., conjugation to a gp120 binder monomer or dimer, including by way of a linker). Most preferably, the albumin protein will contain a single solvent-exposed cysteine or lysine, thus enabling site-specific conjugation of a compound of the invention.


Exemplary methods for the production of engineered variants of albumin proteins that include one or more conjugation-competent cysteine residues are provided in U.S. Patent Application No. 2017/0081389, which is incorporated herein by reference in its entirety. Briefly, preferred albumin protein variants are those comprising a single, solvent-exposed, unpaired (e.g., free) cysteine residue, thus enabling site-specific conjugation of a linker to the cysteine residue.


Albumin proteins which have been engineered to enable chemical conjugation to a solvent-exposed, unpaired cysteine residue include the following albumin protein variants:

    • (a) an albumin protein having a substitution of a non-cysteine amino acid residue with a cysteine at an amino acid residue corresponding to any of L585, D1, A2, D562, A364, A504, E505, T79, E86, D129, D549, A581, D121, E82, S270, Q397, and A578 of SEQ ID NO: 96;
    • (b) an albumin protein having an insertion of a cysteine at a position adjacent the N- or C-terminal side of an amino acid residue corresponding to any of L585, D1, A2, D562, A364, A504, E505, T79, E86, D129, D549, A581, D121, E82, S270, Q397, and A578 of SEQ ID NO: 96;
    • (c) an albumin protein engineered to have an unpaired cysteine having a free thiol group at a residue corresponding to any of C369, C361, C91, C177, C567, C316, C75, C169, C124, or C558 of SEQ ID NO: 96, and which may or may not be generated by deletion or substitution of a residue corresponding to C360, C316, C75, C168, C558, C361, C91, C124, C169, or C567 of SEQ ID NO: 96; and/or
    • (d) addition of a cysteine to the N- or C-terminus of an albumin protein.


In some embodiments of the invention, the net result of the substitution, deletion, addition, or insertion events of (a), (b), (c) and/or (d) is that the number of conjugation competent cysteine residues of the polypeptide sequence is increased relative to the parent albumin sequence. In some embodiments of the invention, the net result of the substitution, deletion, addition, or insertion events of (a), (b), (c) and/or (d) is that the number of conjugation competent-cysteine residues of the polypeptide sequence is one, thus enabling site-specific conjugation.


Preferred albumin protein variants also include albumin proteins having a single solvent-exposed lysine residue, thus enabling site-specific conjugation of a linker to the lysine residue. Such variants may be generated by engineering an albumin protein, including any of the methods previously described (e.g., insertion, deletion, substitution, or C-terminal or N-terminal fusion).


Albumin Protein-Binding Peptides

Conjugation of a biologically-active compound to an albumin protein-binding peptide can alter the pharmacodynamics of the biologically-active compound, including the alteration of tissue uptake, penetration, and diffusion. In a preferred embodiment, conjugation of an albumin protein-binding peptide to a compound of the invention (e.g., a gp120 binder monomer or dimer, by way of a linker) increases the efficacy or decreases the toxicity of the compound, as compared to the compound alone.


Albumin protein-binding peptides of the invention include any polypeptide having an amino acid sequence of 5 to 50 (e.g., 5 to 40, 5 to 30, 5 to 20, 5 to 15, 5 to 10, 10 to 50, 10 to 30, or 10 to 20) amino acid residues that has affinity for and functions to bind an albumin protein, such as any of the albumin proteins described herein. Preferably, the albumin protein-binding peptide binds to a naturally occurring serum albumin, most preferably human serum albumin. An albumin protein-binding peptide can be of different origins, e.g., synthetic, human, mouse, or rat. Albumin protein-binding peptides of the invention include albumin protein-binding peptides which have been engineered to include one or more (e.g., two, three, four, or five) solvent-exposed cysteine or lysine residues, which may provide a site for conjugation to a compound of the invention (e.g., conjugation to a gp120 binder monomer or dimer, including by way of a linker). Most preferably, the albumin protein-binding peptide will contain a single solvent-exposed cysteine or lysine, thus enabling site-specific conjugation of a compound of the invention. Albumin protein-binding peptides may include only naturally occurring amino acid residues, or may include one or more non-naturally occurring amino acid residues. Where included, a non-naturally occurring amino acid residue (e.g., the side chain of a non-naturally occurring amino acid residue) may be used as the point of attachment for a compound of the invention (e.g., a gp120 binder monomer or dimer, including by way of a linker). Albumin protein-binding peptides of the invention may be linear or cyclic. Albumin protein-binding peptides of the invention include any albumin protein-binding peptides known to one of skill in the art, examples of which, are provided herein.


Albumin protein-binding peptide, and conjugates including an albumin protein-binding peptide, preferably bind an albumin protein (e.g., human serum albumin) with an affinity characterized by a dissociation constant, Kd, that is less than about 100 μM, preferably less than about 100 nM, and most preferably do not substantially bind other plasma proteins. Specific examples of such compounds are linear or cyclic peptides, preferably between about 10 and 20 amino acid residues in length, optionally modified at the N-terminus or C-terminus or both.


Albumin protein-binding peptides include linear and cyclic peptides comprising the following general formulae, wherein Xaa is any amino acid:









SEQ ID NO: 101


Xaa-Xaa-Cys-Xaa-Xaa-Xaa-Xaa-Xaa-Cys-Xaa-Xaa-Phe-





Cys-Xaa-Asp-Trp-Pro-Xaa-Xaa-Xaa-Ser-Cys





SEQ ID NO: 102


Val-Cys-Tyr-Xaa-Xaa-Xaa-Ile-Cys-Phe





SEQ ID NO: 103


Cys-Tyr-Xaa-Pro-Gly-Xaa-Cys





SEQ ID NO: 104


Asp-Xaa-Cys-Leu-Pro-Xaa-Trp-Gly-Cys-Leu-Trp





SEQ ID NO: 105


Trp-Cys-Asp-Xaa-Xaa-Leu-Xaa-Ala-Xaa-Asp-Leu-Cys





SEQ ID NO: 106


Asp-Leu-Val-Xaa-Leu-Gly-Leu-Glu-Cys-Trp






Albumin protein-binding peptides of the invention further include any of the following peptide sequences, which may be linear or cyclic:











SEQ ID NO: 107



DLCLRDWGCLW







SEQ ID NO: 108



DICLPRWGCLW







SEQ ID NO: 109



MEDICLPRWGCLWGD







SEQ ID NO: 110



QRLMEDICLPRWGCLWEDDE







SEQ ID NO: 111



QGLIGDICLPRWGCLWGRSV







SEQ ID NO: 112



QGLIGDICLPRWGCLWGRSVK







SEQ ID NO: 113



EDICLPRWGCLWEDD







SEQ ID NO: 114



RLMEDICLPRWGCLWEDD







SEQ ID NO: 115



MEDICLPRWGCLWEDD







SEQ ID NO: 116



MEDICLPRWGCLWED







SEQ ID NO: 117



RLMEDICLARWGCLWEDD







SEQ ID NO: 118



EVRSFCTRWPAEKSCKPLRG







SEQ ID NO: 119



RAPESFVCYVVETICFERSEQ







SEQ ID NO: 120



EMCYFPGICWM






Albumin protein-binding peptides of SEQ ID NOs: 101-120 may further include additional amino acids at the N-terminus (Xaa)x and/or additional amino acids at the C-terminus (Xaa)z, wherein Xaa is any amino acid and x and z are a whole number greater or equal to zero, generally less than 100, preferably less than 10, and more preferably 0, 1, 2, 3, 4 or 5.


Further exemplary albumin protein-binding peptides are provided in U.S. Patent Application No. 2005/0287153, which is incorporated herein by reference in its entirety.


Conjugation of Albumin Protein-Binding Peptides

An albumin protein-binding peptide of the invention may be conjugated to (e.g., by way of a covalent bond) to any compound of the invention (e.g., by way of the linker portion of a gp120 binder monomer or dimer). The albumin protein-binding peptide may be conjugated to any compound of the invention by any method known to those of skill in the art for producing peptide-small molecule conjugates. This may include covalent conjugation to the side chain group of an amino acid residue, such as a cysteine, a lysine, or a non-natural amino acid. Alternately, covalent conjugation may occur at the C-terminus (e.g., to the C-terminal carboxylic acid, or to the side chain group of the C-terminal residue) or at the N-terminus (e.g., to the N-terminal amino group, or to the side chain group of the N-terminal amino acid).


V. Linkers

A linker refers to a linkage or connection between two or more components in a conjugate described herein (e.g., between two gp120 binders in a conjugate described herein, between a gp120 binder and an Fc domain monomer, an Fc domain, or an albumin protein in a conjugate described herein, and between a dimer of two gp120 binders and an Fc domain monomer, an Fc domain or an albumin protein in a conjugate described herein).


Linkers in Conjugates Having an Fc Domain Monomer, an Fc Domain, or an Albumin Protein Covalently Linked to Dimers of Gp120 Binders

In a conjugate containing an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide covalently linked to one or more dimers of gp120 binders as described herein, a linker in the conjugate (e.g., L or L′) may be a branched structure. As described further herein, a linker in a conjugate described herein (e.g., L or L′) may be a multivalent structure, e.g., a divalent or trivalent structure having two or three arms, respectively. In some embodiments when the linker has three arms, two of the arms may be attached to the first and second gp120 binders and the third arm may be attached to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide. In some embodiments when the linker has two arms, one arm may be attached to an Fc domain monomer, an Fc domain, or an albumin protein and the other arm may be attached to one of the two gp120 binders. In other embodiments, a linker with three arms may be used to attach the two gp120 binders on a conjugate containing an Fc domain monomer, an Fc domain, or albumin protein covalently linked to one or more dimers of gp120 binders.


In some embodiments, a linker in a conjugate having an Fc domain monomer, an Fc domain, or an albumin protein covalently linked to one or more dimers of gp120 binders is described by formula (D-L-I):




embedded image


wherein LA is described by formula GA1-(ZA1)g1—(YA1)h1—(ZA2)i1—(YA2)j1—(ZA3)k1—(YA3)l1-(ZA4)m1- (YA4)n1-(ZA5)O1-GA2; LB is described by formula GB1-(ZB1)g2-(YB1)h2-(ZB2)i2-(YB2)j2-(ZB3)k2-(YB3)l2-(ZB4)m2-(YB4)n2-(ZB5)o2-GB2; LC is described by formula GC1-(ZC1)g3-(YC1)h3-(ZC2)i3-(YC2)j3-(ZC3)k3-(YC3)l3-(ZC4)m3-(YC4)n3-(ZC5)o3-GC2; GA1 is a bond attached to Qi in formula (D-L-I); GA2 is a bond attached to the first gp120 binder (e.g., A1); GB1 is a bond attached to Qi in formula (D-L-I); GB2 is a bond attached to the second gp120 binder (e.g., A2); GC1 is a bond attached to Qi in formula (D-L-I); GC2 is a bond attached to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide or a functional group capable of reacting with a functional group conjugated to E (e.g., maleimide and cysteine, amine and activated carboxylic acid, thiol and maleimide, activated sulfonic acid and amine, isocyanate and amine, azide and alkyne, and alkene and tetrazine); each of ZA1, ZA2, ZA3, ZA4, ZA5, ZB1, ZB2, ZB3, ZB4, ZB5 ZC1, ZC2, ZC3, ZC4, and ZC5 is, independently, optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, or optionally substituted C3-C15 heteroarylene; each of YA1, YA2, YA3, YA4, YB1, YB2, YB3, YB4, YC1, YC2, YC3, and YC4 is, independently, O, S, NRi, P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino; Ri is H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C3-C15 heteroaryl; each of g1, h1, i1, j1, k1, l1, m1, n1, o1, g2, h2, i2, j2, k2, l2, m2, n2, o2, g3, h3, i3, j3, k3, l3, m3, n3, and o3 is, independently, 0 or 1; Q is a nitrogen atom, optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, or optionally substituted C3-C15 heteroarylene.


In some embodiments, optionally substituted includes substitution with a polyethylene glycol (PEG). A PEG has a repeating unit structure (—CH2CH2O—)n, wherein n is an integer from 2 to 100. A polyethylene glycol may be selected from any one of PEG2 to PEG100 (e.g., PEG2, PEG3, PEG4, PEG5, PEG5-PEG10, PEG10-PEG20, PEG20-PEG30, PEG30-PEG40, PEG50-PEG60, PEG60-PEG70, PEG70-PEG80, PEG80-PEG90, PEG90-PEG100).


In some embodiments, LC may have two points of attachment to the Fc domain (e.g., two GC2).


In some embodiments, L includes a polyethylene glycol (PEG) linker. A PEG linker includes a linker having the repeating unit structure (—CH2CH2O—)n, where n is an integer from 2 to 100. A polyethylene glycol linker may covalently join a gp120 binder and E (e.g., in a conjugate of any one of formulas (M-I)-(M-X)). A polyethylene glycol linker may covalently join a first gp120 binder and a second gp120 binder (e.g., in a conjugate of any one of formulas (D-I)-(D-X)). A polyethylene glycol linker may covalently join a gp120 binder dimer and E (e.g., in a conjugate of any one of formulas (D-I)-(D-X)). A polyethylene glycol linker may be selected from any one of PEG2 to PEG100 (e.g., PEG2, PEG3, PEG4, PEG5, PEG5-PEG10, PEG10-PEG20, PEG20-PEG30, PEG30-PEG40, PEG50-PEG60, PEG60-PEG70, PEG70-PEG80, PEG80-PEG90, PEG90-PEG100). In some embodiments, LC includes a PEG linker, where LC is covalently attached to each of Qi and E.


Linkers of formula (D-L-I) that may be used in conjugates described herein include, but are not limited to




embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


embedded image


where z1, z2, y1, y2, y3, and y4 are each, independently, and integer from 1 to 20; and R9 is selected from H, C1-C20 alkyl, C3-C20cycloalkyl, C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and C3-C15 heteroaryl.


Linkers of the formula (D-L-I) may also include any of




embedded image


embedded image


embedded image


embedded image


embedded image


Linkers in Conjugates Having an Fc Domain Monomer, an Fc Domain, or an Albumin Protein Covalently Linked to Monomers of Gp120 Binders

In a conjugate containing an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide covalently linked to one or more monomers of gp120 binders as described herein, a linker in the conjugate (e.g., L, or L′) may be a divalent structure having two arms. One arm in a divalent linker may be attached to the monomer of gp120 binder and the other arm may be attached to the Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide. In some embodiments, the one or more monomers of gp120 binders in the conjugates described herein may each be, independently, connected to an atom in the Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide.


In some embodiments, a linker is described by formula (M-L-I):





J1-(Q1)g-(T1)h-(Q2)i-(T2)j-(Q3)k-(T3)l-(Q4)m-(T4)n-(Q5)o-J2


wherein J1 is a bond attached to a gp120 binder; J2 is a bond attached to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide, or a functional group capable of reacting with a functional group conjugated to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide (e.g., maleimide and cysteine, amine and activated carboxylic acid, thiol and maleimide, activated sulfonic acid and amine, isocyanate and amine, azide and alkyne, and alkene and tetrazene); each of Q1, Q2, Q3, Q4 and Q5 is, independently, optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, or optionally substituted C3-C15 heteroarylene; each of T1, T2, T3, T4 is, independently, O, S, NRi, P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino; Ri is H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C3-C15 heteroaryl; and each of g, h, i, j, k, l, m, n, and o is, independently, 0 or 1.


In some embodiments, optionally substituted includes substitution with a polyethylene glycol (PEG). A PEG has a repeating unit structure (—CH2CH2O—)n, wherein n is an integer from 2 to 100. A polyethylene glycol may be selected from any one of PEG2 to PEG100 (e.g., PEG2, PEG3, PEG4, PEG5, PEG5-PEG10, PEG10-PEG20, PEG20-PEG30, PEG30-PEG40, PEG50-PEG60, PEG60-PEG70, PEG70-PEG80, PEG80-PEG90, PEG90-PEG100).


In some embodiments, J2 may have two points of attachment to the Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide (e.g., two J2).


Linkers of formula (M-L-1) that may be used in conjugates described herein include, but are not limited to,




embedded image


wherein d is an integer from 1 to 20 (e.g., d is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20).


Linkers of formula (M-L-1) that may be used in conjugates described herein include, but are not limited to,




embedded image


embedded image


embedded image


wherein each Y is independently selected from (—O—), (—S—), (—R8—), (—O(C═O)NR8—), (—O(C═S)NR8—), (—O(C═O)O—), (—O(C═O)—), (—NH(C═O)O—), (—NH(C═O)—), (—NH(C═NH)—), (—NH(C═O)NR8—), (—NH(C═NH)NR8—), (—NH(C═S)NR8—), (—NH(C═S)—), (—OCH2(C═O)NR8—), (—NH(SO2)—), (—NH(SO2)NR8—), (—OR9—), (—NR9—), (—SR9—), (—R9NH(C═O)—), (—R9OR9C(═O)NH—), (—CH2NH(C═O)—), (—CH2OCH2(C═O)NH—), (—(C═NR8)NH—), (—NH(SO2)—), (—(C═O)NH—), (—C(═O)—), (—C(NR8)—), or (—R9C(═O)—);


each R8 is independently selected from H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 alkylene, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and optionally substituted C2-C15 heteroaryl;


each R9 is independently selected from optionally substituted C1-C20 alkylene, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and optionally substituted C2-C15 heteroaryl; and


each of d, e, y1, and x1 is, independently, an integer from 1 to 26 (e.g., d is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, or 26).


Linking Groups

In some embodiments, a linker provides space, rigidity, and/or flexibility between the gp120 binders and the Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide in the conjugates described here or between two gp120 binders in the conjugates described herein. In some embodiments, a linker may be a bond, e.g., a covalent bond, e.g., an amide bond, a disulfide bond, a C—O bond, a C—N bond, a N—N bond, a C—S bond, or any kind of bond created from a chemical reaction, e.g., chemical conjugation. In some embodiments, a linker (L or L′ as shown in any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII) includes no more than 250 atoms (e.g., 1-2, 1-4, 1-6, 1-8, 1-10, 1-12, 1-14, 1-16, 1-18, 1-20, 1-25, 1-30, 1-35, 1-40, 1-45, 1-50, 1-55, 1-60, 1-65, 1-70, 1-75, 1-80, 1-85, 1-90, 1-95, 1-100, 1-110, 1-120, 1-130, 1-140, 1-150, 1-160, 1-170, 1-180, 1-190, 1-200, 1-210, 1-220, 1-230, 1-240, or 1-250 atom(s); 250, 240, 230, 220, 210, 200, 190, 180, 170, 160, 150, 140, 130, 120, 110, 100, 95, 90, 85, 80, 75, 70, 65, 60, 55, 50, 45, 40, 35, 30, 28, 26, 24, 22, 20, 18, 16, 14, 12, 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 atom(s)). In some embodiments, a linker (L or L) includes no more than 250 non-hydrogen atoms (e.g., 1-2, 1-4, 1-6, 1-8, 1-10, 1-12, 1-14, 1-16, 1-18, 1-20, 1-25, 1-30, 1-35, 1-40, 1-45, 1-50, 1-55, 1-60, 1-65, 1-70, 1-75, 1-80, 1-85, 1-90, 1-95, 1-100, 1-110, 1-120, 1-130, 1-140, 1-150, 1-160, 1-170, 1-180, 1-190, 1-200, 1-210, 1-220, 1-230, 1-240, or 1-250 non-hydrogen atom(s); 250, 240, 230, 220, 210, 200, 190, 180, 170, 160, 150, 140, 130, 120, 110, 100, 95, 90, 85, 80, 75, 70, 65, 60, 55, 50, 45, 40, 35, 30, 28, 26, 24, 22, 20, 18, 16, 14, 12, 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 non-hydrogen atom(s)). In some embodiments, the backbone of a linker (L or L) includes no more than 250 atoms (e.g., 1-2, 1-4, 1-6, 1-8, 1-10, 1-12, 1-14, 1-16, 1-18, 1-20, 1-25, 1-30, 1-35, 1-40, 1-45, 1-50, 1-55, 1-60, 1-65, 1-70, 1-75, 1-80, 1-85, 1-90, 1-95, 1-100, 1-110, 1-120, 1-130, 1-140, 1-150, 1-160, 1-170, 1-180, 1-190, 1-200, 1-210, 1-220, 1-230, 1-240, or 1-250 atom(s); 250, 240, 230, 220, 210, 200, 190, 180, 170, 160, 150, 140, 130, 120, 110, 100, 95, 90, 85, 80, 75, 70, 65, 60, 55, 50, 45, 40, 35, 30, 28, 26, 24, 22, 20, 18, 16, 14, 12, 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 atom(s)). The “backbone” of a linker refers to the atoms in the linker that together form the shortest path from one part of the conjugate to another part of the conjugate. The atoms in the backbone of the linker are directly involved in linking one part of the conjugate to another part of the conjugate. For examples, hydrogen atoms attached to carbons in the backbone of the linker are not considered as directly involved in linking one part of the conjugate to another part of the conjugate.


Molecules that may be used to make linkers (L or L′) include at least two functional groups, e.g., two carboxylic acid groups. In some embodiments of a trivalent linker, two arms of a linker may contain two dicarboxylic acids, in which the first carboxylic acid may form a covalent linkage with the first gp120 binder in the conjugate and the second carboxylic acid may form a covalent linkage with the second gp120 binder in the conjugate, and the third arm of the linker may for a covalent linkage (e.g., a C—O bond) with an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide in the conjugate. In some embodiments of a divalent linker, the divalent linker may contain two carboxylic acids, in which the first carboxylic acid may form a covalent linkage with one component (e.g., a gp120 binder) in the conjugate and the second carboxylic acid may form a covalent linkage (e.g., a C—S bond or a C—N bond) with another component (e.g., an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide) in the conjugate.


In some embodiments, dicarboxylic acid molecules may be used as linkers (e.g., a dicarboxylic acid linker). For example, in a conjugate containing an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide covalently linked to one or more dimers of gp120 binders, the first carboxylic acid in a dicarboxylic acid molecule may form a covalent linkage with a hydroxyl or amine group of the first gp120 binder and the second carboxylic acid may form a covalent linkage with a hydroxyl or amine group of the second gp120 binder.


Examples of dicarboxylic acids molecules that may be used to form linkers include, but are not limited to,




embedded image


embedded image


embedded image


embedded image


wherein n is an integer from 1 to 20 (e.g., n is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20).


Other examples of dicarboxylic acids molecules that may be used to form linkers include, but are not limited to,




embedded image


embedded image


embedded image


embedded image


In some embodiments, dicarboxylic acid molecules, such as the ones described herein, may be further functionalized to contain one or more additional functional groups. Dicarboxylic acids may be further functionalized, for example, to provide an attachment point to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide (e.g., by way of a linker, such as a PEG linker).


In some embodiments, when the gp120 binder is attached to Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide, the linking group may comprise a moiety comprising a carboxylic acid moiety and an amino moiety that are spaced by from 1 to 25 atoms. Examples of such linking groups include, but are not limited to,




embedded image


embedded image


wherein n is an integer from 1 to 20 (e.g., n is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20).


In some embodiments, a linking group may include a moiety including a carboxylic acid moiety and an amino moiety, such as the ones described herein, may be further functionalized to contain one or more additional functional groups. Such linking groups may be further functionalized, for example, to provide an attachment point to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide (e.g., by way of a linker, such as a PEG linker).


In some embodiments, when the gp120 binder is attached to Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide, the linking group may comprise a moiety comprising two or amino moieties (e.g., a diamino moiety) that are spaced by from 1 to 25 atoms. Examples of such linking groups include, but are not limited to,




embedded image


embedded image


wherein n is an integer from 1 to 20 (e.g., n is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20).


In some embodiments, a linking group may include a diamino moiety, such as the ones described herein, may be further functionalized to contain one or more additional functional groups. Such diamino linking groups may be further functionalized, for example, to provide an attachment point to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide (e.g., by way of a linker, such as a PEG linker).


In some embodiments, a molecule containing an azide group may be used to form a linker, in which the azide group may undergo cycloaddition with an alkyne to form a 1,2,3-triazole linkage. In some embodiments, a molecule containing an alkyne group may be used to form a linker, in which the alkyne group may undergo cycloaddition with an azide to form a 1,2,3-triazole linkage. In some embodiments, a molecule containing a maleimide group may be used to form a linker, in which the maleimide group may react with a cysteine to form a C—S linkage. In some embodiments, a molecule containing one or more haloalkyl groups may be used to form a linker, in which the haloalkyl group may form a covalent linkage, e.g., C—N and C—O linkages, with a gp120 binder.


In some embodiments, a linker (L or L′) may comprise a synthetic group derived from, e.g., a synthetic polymer (e.g., a polyethylene glycol (PEG) polymer). In some embodiments, a linker may comprise one or more amino acid residues. In some embodiments, a linker may be an amino acid sequence (e.g., a 1-25 amino acid, 1-10 amino acid, 1-9 amino acid, 1-8 amino acid, 1-7 amino acid, 1-6 amino acid, 1-5 amino acid, 1-4 amino acid, 1-3 amino acid, 1-2 amino acid, or 1 amino acid sequence). In some embodiments, a linker (L or L′) may include one or more optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene (e.g., a PEG unit), optionally substituted C2-C20 alkenylene (e.g., C2 alkenylene), optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene (e.g., cyclopropylene, cyclobutylene), optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene (e.g., C6 arylene), optionally substituted C3-C15 heteroarylene (e.g., imidazole, pyridine), O, S, NRi (Ri is H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C3-C15 heteroaryl), P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino.


Conjugation Chemistries

Gp120 binder monomers or dimers (e.g., in a conjugate of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)) may be conjugated to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide, e.g., by way of a linker, by any standard conjugation chemistries known to those of skill in the art. The following conjugation chemistries are specifically contemplated, e.g., for conjugation of a PEG linker (e.g., a functionalized PEG linker) to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide.


Covalent conjugation of two or more components in a conjugate using a linker may be accomplished using well-known organic chemical synthesis techniques and methods. Complementary functional groups on two components may react with each other to form a covalent bond. Examples of complementary reactive functional groups include, but are not limited to, e.g., maleimide and cysteine, amine and activated carboxylic acid, thiol and maleimide, activated sulfonic acid and amine, isocyanate and amine, azide and alkyne, and alkene and tetrazine. Site-specific conjugation to a polypeptide (e.g., an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide) may accomplished using techniques known in the art. Exemplary techniques for site-specific conjugation of a small molecule to an Fc domain are provided in Agarwall. P., et al. Bioconjugate Chem. 26:176-192 (2015).


Other examples of functional groups capable of reacting with amino groups include, e.g., alkylating and acylating agents. Representative alkylating agents include: (i) an α-haloacetyl group, e.g., XCH2CO— (where X=Br, Cl, or I); (ii) a N-maleimide group, which may react with amino groups either through a Michael type reaction or through acylation by addition to the ring carbonyl group; (iii) an aryl halide, e.g., a nitrohaloaromatic group; (iv) an alkyl halide; (v) an aldehyde or ketone capable of Schiff's base formation with amino groups; (vi) an epoxide, e.g., an epichlorohydrin and a bisoxirane, which may react with amino, sulfhydryl, or phenolic hydroxyl groups; (vii) a chlorine-containing of s-triazine, which is reactive towards nucleophiles such as amino, sulfhydryl, and hydroxyl groups; (viii) an aziridine, which is reactive towards nucleophiles such as amino groups by ring opening; (ix) a squaric acid diethyl ester; and (x) an α-haloalkyl ether.


Examples of amino-reactive acylating groups include, e.g., (i) an isocyanate and an isothiocyanate; (ii) a sulfonyl chloride; (iii) an acid halide; (iv) an active ester, e.g., a nitrophenylester or N-hydroxysuccinimidyl ester; (v) an acid anhydride, e.g., a mixed, symmetrical, or N-carboxyanhydride; (vi) an acylazide; and (vii) an imidoester. Aldehydes and ketones may be reacted with amines to form Schiffs bases, which may be stabilized through reductive amination.


It will be appreciated that certain functional groups may be converted to other functional groups prior to reaction, for example, to confer additional reactivity or selectivity. Examples of methods useful for this purpose include conversion of amines to carboxyls using reagents such as dicarboxylic anhydrides; conversion of amines to thiols using reagents such as N-acetylhomocysteine thiolactone, S-acetylmercaptosuccinic anhydride, 2-iminothiolane, or thiol-containing succinimidyl derivatives; conversion of thiols to carboxyls using reagents such as α-haloacetates; conversion of thiols to amines using reagents such as ethylenimine or 2-bromoethylamine; conversion of carboxyls to amines using reagents such as carbodiimides followed by diamines; and conversion of alcohols to thiols using reagents such as tosyl chloride followed by transesterification with thioacetate and hydrolysis to the thiol with sodium acetate.


In some embodiments, a linker of the invention (e.g., L or L′, such as LC of D-L-I), is conjugated (e.g., by any of the methods described herein) to E (e.g., an Fc domain monomer, an Fc domain, or albumin protein). In preferred embodiments of the invention, the linker is conjugated by way of: (a) a thiourea linkage (i.e., —NH(C═S)NH—) to a lysine of E; (b) a carbamate linkage (i.e., —NH(C═O)—O) to a lysine of E; (c) an amine linkage by reductive amination (i.e., —NHCH2) between a lysine and E; (d) an amide (i.e., —NH—(C═O)CH2) to a lysine of E; (e) a cysteine-maleimide conjugate between a maleimide of the linker to a cysteine of E; (f) an amine linkage by reductive amination (i.e., —NHCH2) between the linker and a carbohydrate of E (e.g., a glycosyl group of an Fc domain monomer or an Fc domain); (g) a rebridged cysteine conjugate, wherein the linker is conjugated to two cysteines of E; (h) an oxime linkage between the linker and a carbohydrate of E (e.g., a glycosyl group of an Fc domain monomer or an Fc domain); (i) an oxime linkage between the linker and an amino acid residue of E; (j) an azido linkage between the linker and E; (k) direct acylation of a linker to E; or (I) a thioether linkage between the linker and E.


In some embodiments, a linker is conjugated to E, wherein the linkage includes the structure —NH(C═NH)X—, wherein X is O, HN, or a bond. In some embodiments, a linker is conjugated to E, wherein the linkage between the remainder of the linker and E includes the structure —NH(C═O)NH—.


In some embodiments, a linker is conjugated to E, wherein the linkage includes the structure —R9OR9C(═O)NH—, wherein R9 is H, optionally substituted C1-C20 alkyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, or optionally substituted C2-C15 heteroaryl. In some embodiments, the linker is conjugated to E, wherein the linkage between the remainder of the linker and E includes the structure —CH2OCH2C(═O)NH—.


Exemplary linking strategies (e.g., methods for linking a monomer or a dimer of a gp120 binder to E, such as, by way of a linker) are further depicted in FIGS. 1-4 and 14.


In some embodiments, a linker (e.g., an active ester, e.g., a nitrophenylester or N-hydroxysuccinimidyl ester, or derivatives thereof (e.g., a functionalized PEG linker (e.g., azido-PEG2-PEG40-NHS ester), is conjugated to E, with a T of (e.g., DAR) of between 0.5 and 10.0, e.g., about 0.5, 0.6, 0.7, 0.8, 0.9, 1, 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, 2, 2.1, 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 2.8, 2.9, 3, 3.1, 3.2, 3.3, 3.4, 3.5, 3.6, 3.7, 3.8, 3.9, 4, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, 5.0, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6.0, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, 7.0, 7.1, 7.2, 7.3, 7.4, 7.5, 7.6, 7.7, 7.8.0, 7.9, 8, 8.1, 8.2, 8.3, 8.4, 8.5, 8.6, 8.7, 8.8, 8.9, 9.0, 9.1, 9.2, 9.3, 9.4, 9.5, 9.6, 9.7, 9.8, 9.9, or 10.0. In these instances, the E-(PEG2-PEG40)-azide can react with an Int having a terminal alkyne linker (e.g., L, or L′, such as LC of D-L-I) through click conjugation. During click conjugation, the copper-catalyzed reaction of the an azide (e.g., the Fc-(PEG2-PEG40)-azide) with the alkyne (e.g., the Int having a terminal alkyne linker (e.g., L or L′, such as LC of D-L-I) forming a 5-membered heteroatom ring. In some embodiments, the linker conjugated to E is a terminal alkyne and is conjugated to an Int having a terminal azide. Exemplary preparations of preparations of E-(PEG2-PEG40)-azide are described in Examples 2, 3, and 12. One of skill in the art would readily understand the final product from a click chemistry conjugation.


Exemplary linking strategies (e.g., methods for linking a monomer or a dimer of a neuraminidase inhibitor to E, such as, byway of a linker) are further depicted in FIGS. 1-4 and 14.


VI. Combination Therapies
Antiviral Agents

In some embodiments, one or more antiviral agents may be administered in combination (e.g., administered substantially simultaneously (e.g., in the same pharmaceutical composition or in separate pharmaceutical compositions) or administered separately at different times) with a conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)).


In some embodiments, the antiviral agent is an antiviral agent for the treatment of HIV. For example, the antiviral agent may be a nucleoside/nucleotide reverse transcriptase inhibitor, a gp120 inhibitor, a polymerase inhibitor, or a fusion protein inhibitor. The antiviral agent may target either the virus or the host subject. The antiviral agent for the treatment of HIV used in combination with a conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)) may be selected from an integrase inhibitor (e.g., dolutegravir, elvitegravir, or raltegravir), a nucleoside reverse transcriptase inhibitor (NRTI) (e.g., abacavir, lamivudine, zidovudine, emtricitabine, tenofovir, emtricitabine, didanosine, or stavudine), a non-nucleoside reverse transcriptase inhibitor (NNRTI) (e.g., efavirenz, etravirine, nevirapine, rilpivirine, or delavirdine), a protease inhibitor (e.g., atazanavir, cobicistat, darunavir, cobicistat, lopinavir, ritonavir, fosamprenavir, tipranavir, nelfinavir, indinavir, or saquinavir), an inhibitor of viral entry (e.g., enfuviritide), a CCR5 antagonist (e.g., maraviroc), or a CYP3A inhibitor (e.g., cobicistat or ritonavir), or an siRNA targeting a host or viral gene, or prodrugs thereof, or pharmaceutically acceptable salts thereof.


Antiviral Vaccines

In some embodiments, any one of conjugates described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)) is administered in combination with an antiviral vaccine (e.g., a composition that elicits an immune response in a subject directed against a virus). The antiviral vaccine may be administered substantially simultaneously (e.g., in the same pharmaceutical composition or in separate pharmaceutical compositions) as the conjugates, or may be administered prior to or following the conjugates (e.g., within a period of 1 day, 2, days, 5, days, 1 week, 2 weeks, 3 weeks, 1 month, 2 months, 6 months, or 12 months, or more).


In some embodiments the viral vaccine includes an immunogen that elicits an immune response in the subject against HIV-1 or HIV-2. In some embodiments the vaccine is administered as a nasal spray.


VII. Methods

Methods described herein include, e.g., methods of protecting against or treating a viral infection (e.g., an HIV infection) in a subject and methods of preventing, stabilizing, or inhibiting the growth of viral particles. A method of treating a viral infection (e.g., an HIV infection) in a subject includes administering to the subject a conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)) or a pharmaceutical composition thereof. In some embodiments, the viral infection is cause by the human immunodeficiency virus (e.g., HIV-1 or HIV-2). In some embodiments, the viral infection is caused by a resistant strain of virus. A method of preventing, stabilizing, or inhibiting the growth of viral particles or preventing the replication and spread of the virus includes contacting the virus or a site susceptible to viral growth with a conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)) or a pharmaceutical composition thereof.


Moreover, methods described herein also include methods of protecting against or treating viral infection in a subject by administering to the subject a conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)). In some embodiments, the method further includes administering to the subject an antiviral agent or an antiviral vaccine.


Methods described herein also include methods of protecting against or treating a viral infection in a subject by administering to said subject (1) a conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)) and (2) an antiviral agent or an antiviral vaccine. Methods described herein also include methods of preventing, stabilizing, or inhibiting the growth of viral particles or preventing the replication or spread of a virus, by contacting the virus or a site susceptible to viral growth with (1) a conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)) and (2) an antiviral agent or an antiviral vaccine.


In some embodiments, the conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)) is administered first, followed by administering of the antiviral agent or antiviral vaccine alone. In some embodiments, the antiviral agent or antiviral vaccine is administered first, followed by administering of the conjugate described herein alone. In some embodiments, the conjugate described herein and the antiviral agent or antiviral vaccine are administered substantially simultaneously (e.g., in the same pharmaceutical composition or in separate pharmaceutical compositions). In some embodiments, the conjugate described herein or the antiviral agent or antiviral vaccine is administered first, followed by administering of the conjugate described herein and the antiviral agent or antiviral vaccine substantially simultaneously (e.g., in the same pharmaceutical composition or in separate pharmaceutical compositions). In some embodiments, the conjugate described herein and the antiviral agent or antiviral vaccine are administered first substantially simultaneously (e.g., in the same pharmaceutical composition or in separate pharmaceutical compositions), followed by administering of the conjugate described herein or the antiviral agent or antiviral vaccine alone. In some embodiments, when a conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)) and an antiviral agent or antiviral vaccine are administered together (e.g., substantially simultaneously in the same or separate pharmaceutical compositions, or separately in the same treatment regimen), inhibition of viral replication of each of the conjugate and the antiviral agent or antiviral vaccine may be greater (e.g., occur at a lower concentration) than inhibition of viral replication of each of the conjugate and the antiviral agent or antiviral vaccine when each is used alone in a treatment regimen.


VIII. Pharmaceutical Compositions and Preparations

A conjugate described herein may be formulated in a pharmaceutical composition for use in the methods described herein. In some embodiments, a conjugate described herein may be formulated in a pharmaceutical composition alone. In some embodiments, a conjugate described herein may be formulated in combination with an antiviral agent or antiviral vaccine in a pharmaceutical composition. In some embodiments, the pharmaceutical composition includes a conjugate described herein (e.g., a conjugate described by any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)) and pharmaceutically acceptable carriers and excipients.


Acceptable carriers and excipients in the pharmaceutical compositions are nontoxic to recipients at the dosages and concentrations employed. Acceptable carriers and excipients may include buffers such as phosphate, citrate, HEPES, and TAE, antioxidants such as ascorbic acid and methionine, preservatives such as hexamethonium chloride, octadecyldimethylbenzyl ammonium chloride, resorcinol, and benzalkonium chloride, proteins such as human serum albumin, gelatin, dextran, and immunoglobulins, hydrophilic polymers such as polyvinylpyrrolidone, amino acid residues such as glycine, glutamine, histidine, and lysine, and carbohydrates such as glucose, mannose, sucrose, and sorbitol.


Examples of other excipients include, but are not limited to, antiadherents, binders, coatings, compression aids, disintegrants, dyes, emollients, emulsifiers, fillers (diluents), film formers or coatings, flavors, fragrances, glidants (flow enhancers), lubricants, sorbents, suspensing or dispersing agents, or sweeteners. Exemplary excipients include, but are not limited to: butylated hydroxytoluene (BHT), calcium carbonate, calcium phosphate (dibasic), calcium stearate, croscarmellose, crosslinked polyvinyl pyrrolidone, citric acid, crospovidone, cysteine, ethylcellulose, gelatin, hydroxypropyl cellulose, hydroxypropyl methylcellulose, lactose, magnesium stearate, maltitol, mannitol, methionine, methylcellulose, methyl paraben, microcrystalline cellulose, polyethylene glycol, povidone, pregelatinized starch, propyl paraben, retinyl palmitate, shellac, silicon dioxide, sodium carboxymethyl cellulose, sodium citrate, sodium starch glycolate, sorbitol, starch (corn), stearic acid, stearic acid, sucrose, talc, titanium dioxide, vitamin A, vitamin E, vitamin C, and xylitol.


The conjugates herein may have ionizable groups so as to be capable of preparation as pharmaceutically acceptable salts. These salts may be acid addition salts involving inorganic or organic acids or the salts may, in the case of acidic forms of the conjugates herein be prepared from inorganic or organic bases. Frequently, the conjugates are prepared or used as pharmaceutically acceptable salts prepared as addition products of pharmaceutically acceptable acids or bases. Suitable pharmaceutically acceptable acids and bases are well-known in the art, such as hydrochloric, sulphuric, hydrobromic, acetic, lactic, citric, or tartaric acids for forming acid addition salts, and potassium hydroxide, sodium hydroxide, ammonium hydroxide, caffeine, various amines, and the like for forming basic salts. Methods for preparation of the appropriate salts are well-established in the art.


Representative acid addition salts include, but are not limited to, acetate, adipate, alginate, ascorbate, aspartate, benzenesulfonate, benzoate, bisulfate, borate, butyrate, camphorate, camphorsulfonate, citrate, cyclopentanepropionate, digluconate, dodecylsulfate, ethanesulfonate, fumarate, glucoheptonate, glycerophosphate, hemisulfate, heptonate, hexanoate, hydrobromide, hydrochloride, hydroiodide, 2-hydroxy-ethanesulfonate, lactobionate, lactate, laurate, lauryl sulfate, malate, maleate, malonate, methanesulfonate, 2-naphthalenesulfonate, nicotinate, nitrate, oleate, oxalate, palmitate, pamoate, pectinate, persulfate, 3-phenylpropionate, phosphate, picrate, pivalate, propionate, stearate, succinate, sulfate, tartrate, thiocyanate, toluenesulfonate, undecanoate, and valerate salts. Representative alkali or alkaline earth metal salts include, but are not limited to, sodium, lithium, potassium, calcium, and magnesium, as well as nontoxic ammonium, quaternary ammonium, and amine cations, including, but not limited to ammonium, tetramethylammonium, tetraethylammonium, methylamine, dimethylamine, trimethylamine, triethylamine, and ethylamine.


Depending on the route of administration and the dosage, a conjugate herein or a pharmaceutical composition thereof used in the methods described herein will be formulated into suitable pharmaceutical compositions to permit facile delivery. A conjugate (e.g., a conjugate of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)) or a pharmaceutical composition thereof may be formulated to be administered intramuscularly, intravenously (e.g., as a sterile solution and in a solvent system suitable for intravenous use), intradermally, intraarterially, intraperitoneally, intralesionally, intracranially, intraarticularly, intraprostatically, intrapleurally, intratracheally, intranasally, intravitreally, intravaginally, intrarectally, topically, intratumorally, peritoneally, subcutaneously, subconjunctival, intravesicularlly, mucosally, intrapericardially, intraumbilically, intraocularally, orally (e.g., a tablet, capsule, caplet, gelcap, or syrup), topically (e.g., as a cream, gel, lotion, or ointment), locally, by inhalation, by injection, or by infusion (e.g., continuous infusion, localized perfusion bathing target cells directly, catheter, lavage, in cremes, or lipid compositions). Depending on the route of administration, a conjugate herein or a pharmaceutical composition thereof may be in the form of, e.g., tablets, capsules, pills, powders, granulates, suspensions, emulsions, solutions, gels including hydrogels, pastes, ointments, creams, plasters, drenches, osmotic delivery devices, suppositories, enemas, injectables, implants, sprays, preparations suitable for iontophoretic delivery, or aerosols. The compositions may be formulated according to conventional pharmaceutical practice.


A conjugate described herein may be formulated in a variety of ways that are known in the art. For use as treatment of human and animal subjects, a conjugate described herein can be formulated as pharmaceutical or veterinary compositions. Depending on the subject (e.g., a human) to be treated, the mode of administration, and the type of treatment desired, e.g., prophylaxis or therapy, a conjugate described herein is formulated in ways consonant with these parameters. A summary of such techniques is found in Remington: The Science and Practice of Pharmacy, 22nd Edition, Lippincott Williams & Wilkins (2012); and Encyclopedia of Pharmaceutical Technology, 4th Edition, J. Swarbrick and J. C. Boylan, Marcel Dekker, New York (2013), each of which is incorporated herein by reference.


Formulations may be prepared in a manner suitable for systemic administration or topical or local administration. Systemic formulations include those designed for injection (e.g., intramuscular, intravenous or subcutaneous injection) or may be prepared for transdermal, transmucosal, or oral administration. The formulation will generally include a diluent as well as, in some cases, adjuvants, buffers, and preservatives. The conjugates can be administered also in liposomal compositions or as microemulsions. Systemic administration may also include relatively noninvasive methods such as the use of suppositories, transdermal patches, transmucosal delivery and intranasal administration. Oral administration is also suitable for conjugates herein. Suitable forms include syrups, capsules, and tablets, as is understood in the art.


The pharmaceutical compositions can be administered parenterally in the form of an injectable formulation. Pharmaceutical compositions for injection can be formulated using a sterile solution or any pharmaceutically acceptable liquid as a vehicle. Formulations may be prepared as solid forms suitable for solution or suspension in liquid prior to injection or as emulsions. Pharmaceutically acceptable vehicles include, but are not limited to, sterile water, physiological saline, and cell culture media (e.g., Dulbecco's Modified Eagle Medium (DMEM), α-Modified Eagles Medium (α-MEM), F-12 medium). Such injectable compositions may also contain amounts of nontoxic auxiliary substances such as wetting or emulsifying agents, pH buffering agents, such as sodium acetate and sorbitan monolaurate. Formulation methods are known in the art, see e.g., Pharmaceutical Preformulation and Formulation, 2nd Edition, M. Gibson, Taylor & Francis Group, CRC Press (2009).


The pharmaceutical compositions can be prepared in the form of an oral formulation. Formulations for oral use include tablets containing the active ingredient(s) in a mixture with non-toxic pharmaceutically acceptable excipients. These excipients may be, for example, inert diluents or fillers (e.g., sucrose, sorbitol, sugar, mannitol, microcrystalline cellulose, starches including potato starch, calcium carbonate, sodium chloride, lactose, calcium phosphate, calcium sulfate, or sodium phosphate); granulating and disintegrating agents (e.g., cellulose derivatives including microcrystalline cellulose, starches including potato starch, croscarmellose sodium, alginates, or alginic acid); binding agents (e.g., sucrose, glucose, sorbitol, acacia, alginic acid, sodium alginate, gelatin, starch, pregelatinized starch, microcrystalline cellulose, magnesium aluminum silicate, carboxymethylcellulose sodium, methylcellulose, hydroxypropyl methylcellulose, ethylcellulose, polyvinylpyrrolidone, or polyethylene glycol); and lubricating agents, glidants, and antiadhesives (e.g., magnesium stearate, zinc stearate, stearic acid, silicas, hydrogenated vegetable oils, or talc). Formulations for oral use may also be provided as chewable tablets, or as hard gelatin capsules wherein the active ingredient is mixed with an inert solid diluent (e.g., potato starch, lactose, microcrystalline cellulose, calcium carbonate, calcium phosphate or kaolin), or as soft gelatin capsules wherein the active ingredient is mixed with water or an oil medium, for example, peanut oil, liquid paraffin, or olive oil. Powders, granulates, and pellets may be prepared using the ingredients mentioned above under tablets and capsules in a conventional manner using, e.g., a mixer, a fluid bed apparatus, or a spray drying equipment.


Other pharmaceutically acceptable excipients for oral formulations include, but are not limited to, colorants, flavoring agents, plasticizers, humectants, and buffering agents. Formulations for oral use may also be provided as chewable tablets, or as hard gelatin capsules wherein the active ingredient is mixed with an inert solid diluent (e.g., potato starch, lactose, microcrystalline cellulose, calcium carbonate, calcium phosphate or kaolin), or as soft gelatin capsules wherein the active ingredient is mixed with water or an oil medium, for example, peanut oil, liquid paraffin, or olive oil. Powders, granulates, and pellets may be prepared using the ingredients mentioned above under tablets and capsules in a conventional manner using, e.g., a mixer, a fluid bed apparatus or a spray drying equipment.


Dissolution or diffusion controlled release of a conjugate described herein (e.g., a conjugate of any one of (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)) or a pharmaceutical composition thereof can be achieved by appropriate coating of a tablet, capsule, pellet, or granulate formulation of the conjugate, or by incorporating the conjugate into an appropriate matrix. A controlled release coating may include one or more of the coating substances mentioned above and/or, e.g., shellac, beeswax, glycowax, castor wax, carnauba wax, stearyl alcohol, glyceryl monostearate, glyceryl distearate, glycerol palmitostearate, ethylcellulose, acrylic resins, dl-polylactic acid, cellulose acetate butyrate, polyvinyl chloride, polyvinyl acetate, vinyl pyrrolidone, polyethylene, polymethacrylate, methylmethacrylate, 2-hydroxymethacrylate, methacrylate hydrogels, 1,3 butylene glycol, ethylene glycol methacrylate, and/or polyethylene glycols. In a controlled release matrix formulation, the matrix material may also include, e.g., hydrated methylcellulose, carnauba wax and stearyl alcohol, carbopol 934, silicone, glyceryl tristearate, methyl acrylate-methyl methacrylate, polyvinyl chloride, polyethylene, and/or halogenated fluorocarbon.


The pharmaceutical composition may be formed in a unit dose form as needed. The amount of active component, e.g., a conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)), included in the pharmaceutical compositions are such that a suitable dose within the designated range is provided (e.g., a dose within the range of 0.01-100 mg/kg of body weight).


IX. Routes of Administration and Dosages

In any of the methods described herein, conjugates herein may be administered by any appropriate route for treating or protecting against a viral infection (e.g., an HIV infection), or for preventing, stabilizing, or inhibiting the proliferation or spread of a virus (e.g., an HIV virus). Conjugates described herein may be administered to humans, domestic pets, livestock, or other animals with a pharmaceutically acceptable diluent, carrier, or excipient. In some embodiments, administering includes administration of any of the conjugates described herein (e.g., conjugates of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)) or compositions intramuscularly, intravenously (e.g., as a sterile solution and in a solvent system suitable for intravenous use), intradermally, intraarterially, intraperitoneally, intralesionally, intracranially, intraarticularly, intraprostatically, intrapleurally, intratracheally, intranasally, intravitreally, intravaginally, intrarectally, topically, intratumorally, peritoneally, subcutaneously, subconjunctival, intravesicularlly, mucosally, intrapericardially, intraumbilically, intraocularally, orally (e.g., a tablet, capsule, caplet, gelcap, or syrup), topically (e.g., as a cream, gel, lotion, or ointment), locally, by inhalation, by injection, or by infusion (e.g., continuous infusion, localized perfusion bathing target cells directly, catheter, lavage, in cremes, or lipid compositions). In some embodiments, if an antiviral agent is also administered in addition to a conjugate described herein, the antiviral agent or a pharmaceutical composition thereof may also be administered in any of the routes of administration described herein.


The dosage of a conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)) or pharmaceutical compositions thereof depends on factors including the route of administration, the disease to be treated (e.g., the extent and/or condition of the viral infection), and physical characteristics, e.g., age, weight, general health, of the subject. Typically, the amount of the conjugate or the pharmaceutical composition thereof contained within a single dose may be an amount that effectively prevents, delays, or treats the viral infection without inducing significant toxicity. A pharmaceutical composition may include a dosage of a conjugate described herein ranging from 0.01 to 500 mg/kg (e.g., 0.01, 0.1, 0.2, 0.3, 0.4, 0.5, 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 100, 150, 200, 250, 300, 350, 400, 450, or 500 mg/kg) and, in a more specific embodiment, about 0.1 to about 30 mg/kg and, in a more specific embodiment, about 1 to about 30 mg/kg. In some embodiments, when a conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)) and an antiviral agent or antiviral vaccine are administered in combination (e.g., substantially simultaneously in the same or separate pharmaceutical compositions, or separately in the same treatment regimen), the dosage needed of the conjugate described herein may be lower than the dosage needed of the conjugate if the conjugate was used alone in a treatment regimen.


A conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I)-(D-XVII), or (M-I)-(M-XVII)) or a pharmaceutical composition thereof may be administered to a subject in need thereof, for example, one or more times (e.g., 1-10 times or more; 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 times) daily, weekly, monthly, biannually, annually, or as medically necessary. Dosages may be provided in either a single or multiple dosage regimens. The timing between administrations may decrease as the medical condition improves or increase as the health of the patient declines. The dosage and frequency of administration may be adapted by the physician in accordance with conventional factors such as the extent of the infection and different parameters of the subject.


EXAMPLES

The following examples are put forth so as to provide those of ordinary skill in the art with a description of how the compositions and methods described herein may be used, made, and evaluated, and are intended to be purely exemplary of the invention and are not intended to limit the scope of what the inventors regard as their invention.


Example 1: Preparation of Fc Constructs

Reverse translations of the amino acids comprising the protein constructs (SEQ ID NOs: 1, 3, 5, 7, 9, 12, and 14) were synthesized by solid-phase synthesis. The oligonucleotide templates were cloned into pcDNA3.1 (Life Technologies, Carlsbad, Calif., USA) at the cloning sites BamHI and Xhol (New England Biolabs, Ipswich, Mass., USA) and included signal sequences derived from the human Interleukin-2 or human albumin. The pcDNA3.1 plasmids were transformed into Top10 E. coli cells (LifeTech). DNA was amplified, extracted, and purified using the PURELINK® HiPure Plasmid Filter Maxiprep Kit (LifeTech). The plasmid DNA is delivered, using the EXPIFECTAMINE™ 293 Transfection Kit (LifeTech), into HEK-293 cells per the manufacturer's protocol. Cells were centrifuged, filtered, and the supernatants were purified using MabSelect Sure Resin (GE Healthcare, Chicago, Ill., USA). Purified molecules were analyzed using 4-12% Bis Tris SDS PAGE gels by loading 1-2 μg of each molecule into the gel, and staining using instant Blue staining. Each gel included a molecular weight ladder with the indicated molecular weight standards. Reduced and non-reduced lanes are denoted by “R” and “NR”. FIGS. 5-11 show non-reducing and reducing SDS-PAGE of an Fc domain formed from Fc domain monomers having the sequences of SEQ ID NOs: 1, 3, 5, 7, 9, 12, and 14, respectively.


Example 2. Synthesis of h-IgG1 Fc-PEG4-azide

Preparation of 0.05M PEG4-azido NHS ester solution in DMF/PBS: 195.8 mg of PEG4-azido NHS ester was dissolved in 0.500 mL of DMF at 0° C. and diluted to 9.88 mL by adding PBS buffer at 0° C. This solution was used for preparing other PEG4-azido Fc with variety of DAR values by adjusting the equivalents of this PEG4-azido NHS ester solution.


Preparation of PEG4-azido Fc: 0.05M PEG4-azidoNHS ester PBS buffer solution (9.88 mL, 494.0 μmol, 9.5 equivalents) was added to a solution of h-IgG1 Fc (SEQ ID NO: 4) (3027 mg in 213.0 mL of pH 7.4 PBS, MW-58,200 Da, 16.5 μmol) and the mixture was shaken gently for 2 hours at ambient temperature. The solution was concentrated by using 10 centrifugal concentrators (30,000 MWCO, 15 mL) to a volume of ˜1.5 mL. The crude mixture was diluted 1:10 in PBS pH 7.4, and concentrated again. This wash procedure was repeated for total of three times. The small molecule reagent was removed with this wash procedure. The concentrated Fc-PEG4-azide (SEQ ID NO: 4) was diluted to 213.0 mL with pH 7.4 PBS 1× buffer and ready for Click conjugation. The purified material was quantified using a NANODROP™ UV visible spectrophotometer (using a calculated extinction coefficient based on the amino acid sequence of h-IgG1). Yield is quantitative after purification.


The Fc-PEG4-azide (SEQ ID NO: 35) was prepared analogously.


Example 3. Synthesis of Recombinant Mouse Serum Albumin (MSA)-PEG4-Azide

PEG4-azidoNHS ester (98%, 81.7 μmol, 4.5 equivalents, 32.4 mg in 0.3 mL of DMF and diluted to 1.63 mL with pH 7.4 PBS 1× buffer solution) was added to a solution of recombinant mouse serum albumin (SEQ ID NO: 71) (1200 mg in 75.0 mL of pH 7.4 PBS, MW-66,000 Da, 18.2 μmol) and the mixture was shaken gently for 12 hours at ambient temperature. The solution was concentrated using a centrifugal concentrator (30,000 MWCO) to a volume of ˜1.5 mL. The crude mixture was diluted 1:10 in PBS pH 7.4, and concentrated again. This wash procedure was repeated for total of three times. The small molecule reagent was removed with this wash procedure. The concentrated MSA-PEG4-azide was diluted to 75.0 mL with pH 7.4 PBS 1× buffer and ready for Click conjugation. The purified material was quantified using a NANODROP™ UV visible spectrophotometer (using a calculated extinction coefficient based on the amino acid sequence of h-IgG1). Yield is quantitative after purification. DAR=3.5 determined by MALDI. The DAR value can be adjusted by altering the equivalents of PEG4-azido NHS ester similar to h-IgG1 Fc (Example 2).


Example 4. Synthesis of Int-1



embedded image


Step a.




embedded image


To a solution of (7-bromo-4-methoxy-1H-pyrrolo[2,3-c]pyridin-3-yl)(oxo)acetic acid (2.5 g, 8.6 mmol, described in J. Med. Chem. 2018, 61(1):62-80) and phenyl(piperidin-4-ylidene)acetonitrile (1.90 g, 9.5 mmol, 1.1 eq) in DMF (40 ml) was added HATU (3.6 g, 9.5 mmol, 10.5 mmol), and N-methylmorpholine (2.1 g, 20 mmol) at room temperature. The resulting solution was stirred for 1 hour at room temperature, then concentrated and purified by reverse phase liquid chromatography (RPLC) using an Isco CombiFlash liquid chromatograph eluted with 20% to 80% acetonitrile and water with 0.1% TFA as modifier. Yield of products 2.43 g, 59%. Ion(s) found by LCMS: M+H=479.1.


Step b.




embedded image


To a solution of product from the previous step (0.24 g, 0.5 mmol) and methyl 4-(4,4,5,5-tetramethyl-1,3-dioxolan-2-yl)-1,3-thiazole-2-carboxylate (0.27 g, 1 mmol) in dioxane (10 ml) was added potassium carbonate (2 M, 2 ml), tetrakis(triphenylphosphine)paladium (0.065 g, 0.05 mmol) at room temperature. The resulting solution was degassed with nitrogen gas for 5 min, and heated at 100° C. overnight in an oil bath. The solution was concentrated and purified by reverse phase liquid chromatography (RPLC) using an Isco CombiFlash liquid chromatograph eluted with 20% to 80% acetonitrile and water with 0.1% TFA as modifier. Yield 0.11 g, 43.0%. Ion(s) found by LCMS: M+H=528.1.


Step c.




embedded image


To a solution of product from the previous step (26 mg, 0.050 mmol) and propargyl-PEG4-amine (23 mg, 0.10 mmol) in DMF (2 ml) was added HATU (38 mg, 0.10 mmol), and N-methylmorpholine (0.07 ml, 0.50 mmol) at room temperature. The resulting solution was stirred for 1 hour at room temperature then concentrated and purified by reverse phase liquid chromatography (RPLC) using an Isco CombiFlash liquid chromatograph eluted with 20% to 80% acetonitrile and water with 0.1% TFA as modifier. Yield of product 18 mg, 48%. Ion(s) found by LCMS: M+H=741.3.


Example 5. Synthesis of Int-2



embedded image


To a solution of 7-bromo-4-methoxy-1H-pyrrolo[2,3-c]pyridine (480 mg, 1.0 mmol), described in Example 4 (Int-1), potassium carbonate (457 mg, 3.30 mmol), copper(I) iodide (210 mg, 1.1 mmol), 1H-1,2,4-triazol-3-carboxylate methyl ester (254 mg, 2 mmol) and (1R,2R)—N1,N2-dimethylcyclohexane-1,2-diamine (160 mg, 1.1 mmol) in 1,4-dioxane (10 mL) were heated up at 110° C. for 13 h. The reaction solution was treated with water (0.5 mL) for 15 minutes then concentrated and purified by reverse phase liquid chromatography (RPLC) using an Isco CombiFlash liquid chromatograph eluted with 20% to 80% acetonitrile and water using 0.1% TFA as the modifier. Yield of product 270 mg, 51%. Ion(s) found by LCMS: M+H=512.2.


Step b.




embedded image


To a solution of product from the previous step (1-{3-[{4-[cyano(phenyl)methylidene]piperidin-1-yl}(oxo)acetyl]-4-methoxy-1H-pyrrolo[2,3-c]pyridin-7-yl}-1H-1,2,4-triazole-3-carboxylic acid, 50 mg, 0.1 mmol) and propargyl-PEG4-amine (23 mg, 0.1 mmol) in DMF (2 ml) was added HATU (38 mg, 0.1 mmol), and N-methylmorpholine (0.07 ml, 0.5 mmol) at room temperature, and the resulting solution was stirred for 1 hour at room temperature. The solution was concentrated and purified by and purified by reverse phase liquid chromatography (RPLC) using an Isco CombiFlash liquid chromatograph eluted with 2% to 100% acetonitrile and water with 0.1% TFA as modifier. Yield of products 21 mg, 29.6%. Ion(s) found by LCMS: M+H=725.3.


Example 6. Synthesis of Int-3



embedded image


Step a.




embedded image


A solution of propargyl-Peg2-alcohol (1.9 g, 13.2 mmol), N,N′-Di-Boc-1H-pyrazole-1-carboxamidine (3.4 g, 11.0 mmol), and triphenylphosphine (3.5 g, 13.2 mmol) dissolved in THE (20 mL) was cooled to 0 C, and treated with DIAD (in three portions, 2.58 mL) over 30 minutes. LCMS shows complete conversion after 2 h at room temperature. The product was purified by RPLC (10% acetonitrile/water to 90% acetonitrile/water). Yield 3.66 g, 76%. Ion found by LCMS: M+H+=437.2.


Step b.




embedded image


To a solution of product from the previous step (1.2 g, 2.75 mmol) and tert-butyl (2-aminoethyl)carbamate (0.44 g, 2.75 mmol) in 20 ml THF was added 4-dimethylaminopyridine (120 mg, 1 mmol) and triethylamine (0.7 ml, 5 mmol), and heated to 60° C. for 2 hours. The resulting solution was concentrated and purified by reverse phase liquid chromatography (RPLC) using an Isco CombiFlash liquid chromatograph eluted with 0% to 50% acetonitrile and water with no modifier. Yield of 1.1 g, 76%. Ion(s) found by LCMS: M+H=529.3.


Step c.




embedded image


The of product from the previous step (1.00 g, 2.00 mmol) was treated with 10 ml TFA at room temperature for 0.5 hour, then concentrated to dry and used for next step without any further purification. Yield is quantitative for this step. Ion(s) found by LCMS: M/2+H=229.2.


Step d.




embedded image


The title compound was prepared analogously to Example 4, where the propargyl-PEG4 amine was substituted with N-(2-aminoethyl)-N′-(2-{2-[(prop-2-yn-1-yl)oxy]-ethoxy}ethyl)guanidine from previous step. Yield of product 15 mg, 36%. Ion(s) found by LCMS: M+H=738.3.


Example 7. Synthesis of Int-4



embedded image


Step a.




embedded image


To a solution of N-(2,3-epoxypropyl)phthalimide (0.5 g, 2.5 mmol) and propargyl-PEG4 alcohol (0.7 g, 3.6 mmol) in DCM (20 ml) was added boron trifluoride diethyl etherate (BF3.Et2O) (1.42 g, 1.23 ml, 10 mmol), and the resulting solution was stirred at 40° C. for 12 hours. The crude reaction was concentrated and purified by reverse phase liquid chromatography (RPLC) using an Isco CombiFlash liquid chromatograph eluted with 10% to 80% acetonitrile and water with no modifier. Yield of 0.35 g, 36%. Ion(s) found by LCMS: M+H=392.2.


Step b.




embedded image


To a solution of product from the previous step (0.48 g, 1.2 mmol) in methanol (5 ml) was added hydrazine (0.20 g, 6 mmol), and then it was stirred at room temperature for overnight. The resulting solution was concentrated and purified by reverse phase liquid chromatography (RPLC) using an Isco CombiFlash liquid chromatograph eluted with 0% to 50% acetonitrile and water with no modifier. Yield of 0.22 g, 68.5%. Ion(s) found by LCMS: M+H=262.2.


Step c.




embedded image


The title compound was prepared analogously to Example 4, where the PEG-amine component was substituted with the 1-amino-4,7,10,13-tetraoxahexadec-15-yn-2-ol from the previous step. Yield of products 8 mg, 19.0%. Ion(s) found by LCMS: M+H=771.3.


Example 8. Synthesis of Conjugates

A preparation of 0.0050M CuSO4 in PBS buffer solution Click reagent was performed. Briefly, 10.0 mg CuSO4 was dissolved in 12.53 mL PBS, next 6.00 mL of the CuSO4 solution and added 51.7 mg BTTAA (CAS #1334179-85-9) and 297.2 mg sodium ascorbate to give the Click reagent solution (0.0050M CuSO4, 0.020M BTTAA and 0.25M sodium ascorbate). This Click reagent solution was used for all subsequent conjugations.


General procedure for Click conjugation of payload: a solution of azido functionalized Fc was added to a 15 mL centrifuge tube containing alkyne derivatized small molecule (2 equivalents for each DAR). After gently shaking to dissolve all solids, the mixture was treated with the Click reagent solution of (L-ascorbic acid sodium, 0.25 M, 400 equivalents, copper (II) sulfate 0.0050M, 8 equivalents, and BTTAA 0.020M, 32 equivalents). The resulting mixture was gently rotated for 6 hours at ambient temperature. It was purified by affinity chromatography over a protein A column, followed size exclusion chromatography (as described herein). Maldi TOF analysis of the purified final product gave an average mass, average DAR and Yield listed in Table 3 below.









TABLE 3







Conjugates and properties

















MALDI






Fc carrier

mass Da
Fc amount


Conjugate
Intermediate
SEQ ID NO
DAR (Average)
(Average)
(μmol)
Yield (%)





Conjugate 1
Int-3
35
4.1
57617
0.754
 1%


Conjugate 2
Int-4
35
4.4
58059
0.824
 2%


Conjugate 3
Int-2
17
3.5
62128
0.860
13%


Conjugate 4
Int-1
35
4.2
57738
N.D.
N.D.


Conjugate 5
Int-17
35
3.2
57258
N.D.
N.D.


Conjugate 6
Int-5
17
4.8
63211
N.D.
N.D.


Conjugate 7
Int-7
17
6.4
64614
N.D.
N.D.


Conjugate 8
Int-15
 64*
2.5
60593
0.862
25%


Conjugate 9
Int-22
 64*
3.4
65947
21.2  
41%


Conjugate 10
Int-20
 64*
3.8
61490
N.D.
N.D.


Conjugate 11
Int-21
 64*
3.9
61999
N.D.
N.D.


Conjugate 12
Int-25
 64*
4.2
65946
N.D.
N.D.


Conjugate 13
Int-26
 64*
2.9
60940
N.D.
N.D.


Conjugate 14
Int-27
 64*
3.0
61064
N.D.
N.D.





*The terminal Lys residue of the Fc domain may be cleaved upon expression and purification, e.g., SEQ ID NO: 64 coverts to SEQ ID NO: 73






Example 9. General Procedure for Purification of Conjugates

The crude mixture was diluted 1:10 in PBS pH 7.4, and purified using MabSelect Sure Resin (GE Healthcare, Chicago, Ill., USA), followed by size exclusion chromatography. (HiLoad 26/600 Superdex200 μg, GE Healthcare, Chicago, Ill., USA). Fractions containing purified conjugate were pooled and concentrated to approximately 20 mg/mL using a centrifugal concentrator (30,000 MWCO). Purified material was quantified using a NANODROP™ UV visible spectrophotometer using a calculated extinction coefficient based on the amino acid sequence of hIgG1 Fc(myc). Purified molecules were analyzed using 4-12% Bis Tris SIDS PAGE gels by loading 1 μg of each molecule into the gel, and staining using Instant Blue (Expedeon, San Diego, Calif., USA). Each gel included a molecular weight ladder with the indicated molecular weight standards. Yields were calculated and purity determined by Agilent Analytical HPLC. Product peak and MW were found by MALDI MS and a final DAR calculated.


Example 10. Gp120 Glycoprotein Binding Assay

Nunc MaxiSorp flat-bottom 96-well plates (12-565-136, Fisher Scientific) were coated with recombinant HIV-1 GP120 (SAE0071, Sigma) at 2 μg/mL in PBS (pH 7.4) (10-010-049, Fisher Scientific) overnight at 4° C. (100 μL, 0.2 μg/well). Plates were washed (5×300 μL) with wash buffer (PBS 0.05% Tween 20) and blocked with 1% BSA (A5611-10G, Sigma; 200 μL/well) in wash buffer for 1 h at room temp on an orbital microplate shaker at 500 rpm (BT908, BT LabSystems). The blocking agent was removed and wells incubated with 3-fold serial dilutions of conjugate in sample diluent (0.5% BSA in PBS 0.025% Tween 20) starting at 1 μM for 1 h with shaking at room temp. After 5×300 μL washes, the plates were incubated with HRP conjugated donkey anti-human IgG Fc F(ab′)2 (709-036-098, Jackson ImmunoResearch) secondary antibody diluted 1:1,000 in sample diluent for 1 h with shaking at room temp. Plates were then washed (8×300 μL) and developed with TMB substrate (BD555214, Fisher Scientific) for 3-5 minutes at room temp. The reaction was stopped with 1N H2SO4 and the absorbance read at 450 nm using the EnSpire multimode plate reader (PerkinElmer). Half maximal effective concentration (EC50) was calculated with GraphPad Prism version 8 using nonlinear regression analysis (Sigmoidal, 4PL) of binding curves. Polyclonal goat anti-GP120 HRP (PA1-73097, Invitrogen) and unconjugated Fc molecule were run as the positive and negative binding controls, respectively. The results are provided in FIG. 12 and in Table 4.









TABLE 4







GP120 protein binding EC50 (nM)











Fc carrier

EC50 (nM)














Conjugate
Intermediate
SEQ ID NO
DAR
Run 1
Run 2
Run 3
Average

















GP120 antibody
n/a
n/a
n/a
10.7
9.1
21.5
13.8


Conjugate 1
Int-3
35
4.1
3.6
2.3
 2.8
2.9


Conjugate 2
Int-4
35
4.4
47.9
n.a
n/a
47.9


Conjugate 3 (batch 1)
Int-2
4
3.5
n/a
57.2 
22.5
39.9


Conjugate 3 (batch 2)
Int-2
4
4.8
n/a
n/a
15.9
15.9









Example 11. Activity of Pre-Conjugation Intermediate (Int) Compounds in an In Vitro Cell Fusion Assay

Activity of HIV compounds was determined in an assay designed to measure the inhibition of cell-cell fusion which is an important step in the HIV infection process. Briefly, this assay measures the fusion of two cell lines, HeLa-CD4-LTR-β-Gal (catalog #1294) and HL2/3 cells (catalog #1294), obtained from the AIDS Research Reagent and Reference Program (Rockville, Md.). HeLa-CD4-LTR-β-Gal cells were plated at a density of 5×103 cells per well in a volume of 50 μL with 50 μL of nine serial logarithmic dilutions of compound in triplicate for one hour at 37° C./5% CO2. Following the incubation, 100 μL of HL2/3 cells were added to the plates. The cultures were incubated for an additional 48 hours at 37° C./5% CO2. Following the incubation, efficacy plates were evaluated for β-galactosidase production using a chemiluminescent substrate and toxicity plates were stained with XTT to evaluate cell viability.


In these studies, cytotoxicity was also evaluated (TC50). Test materials were derived by measuring the reduction of the tetrazolium dye XTT (2,3-bis(2-methoxy-4-nitro-5-sulfophenyl)-5-[(phenylamino)carbonyl]-2H-tetrazolium hydroxide). XTT in metabolically active cells is metabolized by the mitochondrial enzyme NADPH oxidase to a soluble formazan product. XTT solution was prepared daily as a stock of 1 mg/mL in RPMI-1640 without additives. Phenazine methosulfate (PMS) solution was prepared at 0.15 mg/mL in DPBS and stored in the dark at −20° C. XTT/PMS stock was prepared immediately before use by adding 40 μL of PMS per mL of XTT solution. 50 μL of XTT/PMS was added to each well of the plate and the plate incubated for 4 hours at 37° C. The 4 hour incubation has been empirically determined to be within the linear response range for XTT dye reduction with the indicated numbers of cells for each assay. The plates were sealed and inverted several times to mix the soluble formazan product and the plate was read at 450 nm (650 nm reference wavelength) with a Molecular Devices SpectraMax Plus 384 96 well plate format spectrophotometer.


This assay identified several compounds with EC50 values within 10-fold of the benchmark compound (Temsavir) (Table 5). Particularly active was Int-1 with an EC50 of less than 4 nM. Importantly, no cytotoxicity was evident for any compound at concentration tested. The combination of nM inhibition and no detectable cytotoxicity indicates this is a potent series with significant therapeutic potential.









TABLE 5







Fusion inhibition activity of pre-conjugation


intermediate (Int) compounds











Compound
EC50 (μM)
TC50 (μM)















CSB control
0.646
>10.0



(Chicago Sky Blue, Sigma-Aldrich)



T20 control
0.156
>1.0



(Enfuvirtide, Medchem Express)



Temsavir
0.00284
>10.0



Int-1
0.00383
>10.0



Int-3
0.0282
>10.0



Int-5
>0.5
>0.5



Int-6
>0.5
>0.5



Int-7
0.18
>0.5



Int-8
5.16
>10.0



Int-9
0.198
>10.0



Int- 10
0.529
>10.0










Example 12. General Procedure for Synthesis of Azido Fc

Preparation of PEG4-azido NHS ester solution (0.050 M) in DMF/PBS: 16.75 mg of PEG4-azido NHS ester was dissolved in 0.100 mL of DMF at 0° C. and diluted to 0.837 mL by adding PBS 1× buffer at 0° C. This solution was used for preparing other PEG4-azido Fc with a variety of DAR values by adjusting the equivalents of this PEG4-azido NHS ester PBS solution.


Pretreatment of h-IgG1 Fc, SEQ ID NO: 48 (107.2 mg in 8.800 mL of pH 7.4 PBS, MW-57891 Da, 1.852 μmol): The Fc solution was transferred into four centrifugal concentrators (30,000 MWCO, 15 mL) and diluted to 15 mL with PBS×1 buffer and concentrated to a volume of ˜1.5 mL. The residue was diluted 1:10 in PBS pH 7.4, and concentrated again. This wash procedure was repeated for total of four times followed by dilution to 8.80 mL.


Preparation of PEG4-azido Fc: 0.050M PEG4-azidoNHS ester PBS buffer solution (0.593 mL, 29.6 μmol, 16 equivalents) was added to above solution of h-IgG1 Fc (SEQ ID NO: 48) and the mixture was shaken rotated for 2 hours at ambient temperature. The solution was concentrated by using four centrifugal concentrators (30,000 MWCO, 15 mL) to a volume of ˜1.5 mL. The crude mixture was diluted 1:10 in PBS pH 7.4, and concentrated again. This wash procedure was repeated for total of three times. The concentrated Fc-PEG4-azide was diluted to 8.80 mL with pH 7.4 PBS buffer and ready for Click conjugation. The purified material was quantified using a NANODROP™ UV visible spectrophotometer (using a calculated extinction coefficient based on the amino acid sequence of h-IgG1). Yield was quantitative after purification.


Example 13. Synthesis of Int-12



embedded image


The title compound was prepared analogously to Example 18 (Int-17) as shown in the scheme above. Ion(s) found by LCMS: M+H=1030.5.


Example 14. Synthesis of Int-13



embedded image


The title compound was prepared analogously to Example 5 (Int-2), where the 1H-i1,2,4-triazol-3-carboxylate methyl ester was substituted with the 1H-i1,2,3-triazol-4-carboxylate methyl ester in the step a. Ion(s) found by LCMS: M+H=725.3.


Example 15. Synthesis of Int-14



embedded image


Step a.




embedded image


Benzyl chloroformate (2.4 g, 14.2 mmol) was added dropwise to a stirring mixture of (L)-cystic acid (2 g, 11.8 mmol) and triethylamine (3.6 g, 35.5 mmol) in a (1/1) mixture of acetonitrile/aqueous sodium bicarbonate (40 mL) cooled to 0° C. The reaction was stirred at 0° C. for 40 minutes then the solvent was removed by rotary evaporation. The crude material was purified by reverse phase liquid chromatography (RPLC) using an Isco CombiFlash liquid chromatograph eluted with 5% to 95% acetonitrile/water with 0.1% TFA as the modifier. The pure fractions were pooled and concentrated to afford the crude product as a clear, viscous oil. Yield 2.1 g, 58%. LC/MS [M−H]=302.2.


Step b.




embedded image


HATU (451 mg, 1.2 mmol) in DMF (1 mL) was added, dropwise to a stirring mixture of the product from step a of this example (300 mg, 0.99 mmol), 1((N-Boc-amino)ethyl)piperazine (272 mg, 1.89 mmol) and triethylamine (500 mg, 4.95 mmol) in DMF (5 mL). The mixture was stirred for 45 minutes at ambient temperature and concentrated via rotary evaporation. The crude material was purified by reverse phase liquid chromatography (RPLC) using an Isco CombiFlash liquid chromatograph eluted with 5% to 95% acetonitrile/water with 0.1% TFA as the modifier, 30 minute gradient. The pure fractions were pooled and concentrated to afford the product as a clear, viscous oil. Yield 270 mg, 53%. LC/MS [M+H]+=515.2.


Step c.




embedded image


The CBZ protected intermediate from step b (270 mg, 0.53 mmol) was stirred in methanol (20 mL) in the presence of 5% Pd/C (75 mg) under 1 atmosphere of hydrogen gas for 1 hour. The mixture was filtered through Celite and concentrated to afford the product as a clear, viscous oil which was taken forward without purification. Yield 200 mg, ˜99%. LC/MS [M+H]+=381.2.


Step d.




embedded image


HATU (240 mg, 0.63 mmol) in DMF (1 mL) was added, dropwise to a stirring mixture of the product from step c (200 mg, 0.52 mmol), propargyl-Peg4-carboxylic acid (164 mg, 0.63 mmol) and triethylamine (265 mg, 2.62 mmol) in DMF (3 mL). The mixture was stirred for 45 minutes at ambient temperature then concentrated on a rotary evaporator. The crude material was purified by reverse phase liquid chromatography (RPLC) using an Isco CombiFlash liquid chromatography eluted with 5% to 75% acetonitrile/water with 0.1% TFA as the modifier. The pure fractions were pooled and concentrated to afford the product TFA salt as a clear, viscous oil. Yield 238 mg, 73%. LC/MS [M+H]+=623.4.


The Boc protected intermediate (238 mg, 0.53 mmol) was stirred in 4H HCl (gas in Dioxane, 10 mL) for 1 hour. The solvent was removed by rotary evaporation. The resulting oil was dissolved in DI water then lyophilized to afford the HCl salt as a clear, viscous oil. Yield 215 mg, ˜99%. LC/MS [M+H]+=522.4.


Step e.




embedded image


To a solution of amine HCl salt from Step d (66.1 mg, 0.12 mmol), acid (described in Example 5 (Int-2), 60.5 mg, 0.12 mmol) and HATU (69.0 mg, 0.18 mmol) in DMF (3 mL) at room temperature was added DIEA (0.11 mL, 0.62 mmol). The reaction mixture was stirred at room temperature for 1 hr, then purified by semi-preparative HPLC (ACCQ, 5 to 50% acetonitrile and water, using 0.1% TFA as modifier). Yielded 2.5 mg, 1.7%. Ion found by LCMS [M+2+H]/2=509.2.


Example 16. Synthesis of Int-15



embedded image


Step a.




embedded image


A solution of benzylaldehyde (2.80 g, 25.83 mmol) and 1-N-Boc-1,3-diaminopropane (2.99 g, 16.81 mmol) in methanol (30 mL) was heated at 60 C for 6 hours. After cooling to room temperature, NaBH4 (1.91 g, 49.58 mmol) was added portion-wise, then the resulting solution was stirred for 30 minutes. The reaction was concentrated and quenched with NH4Cl aqueous solution then extracted with CH2Cl2. The organic layer was separated and dried over Na2SO4, filtered then concentrated. The residue was purified by normal phase liquid chromatography (Isco, 0 to 10% methanol and methylene chloride). Yield 2.99 g, 67.5% as colorless oil. Ion found by LCMS [M+H]+=265.2.


Step b.




embedded image


A solution of product from step a. (0.86 g, 3.26 mmol) 2-bromoethane sulphonic acid (1.40 g, 7.32 mmol) and K2CO3 (1.35 g, 9.78 mmol) in DMF:H2O (10 mL:1 mL) was heated in the microwave at 70 degree Celsius for 3 hours. The reaction mixture was filtered and purified by reverse phase liquid chromatography (Isco, 0 to 25% Acetonitrile and water using 0.1% TFA as modifier). Yielded 0.94 g, 77.4%. Ion found by LCMS [M-Boc+H]+=273.2.


Step c.




embedded image


A solution of the product from Step b. (0.94 g, 2.52 mmol) in methanol (25 mL), was charged with Pd(OH)2 (0.18 g, 0.25 mmol) and H2 from a balloon. The reaction mixture was stirred at room temperature overnight. After the reaction was completed, it was filtered through a pad of Celite and washed with methanol then concentrated. The white foam solid was obtained and carried on to the next step without purification. Yielded 0.71 g, 100%. Ion found by LCMS [M+H]+=283.1.


Step d.




embedded image


To a solution of the product from step c. (0.72 g, 2.55 mmol), propargyl-PEG-4-acid (0.75 g, 2.81 mmol) and HATU (1.48 g, 3.83 mmol) in DMF (4 mL) at room temperature was added DIEA (1.36 mL, 7.65 mmol). The resulting solution was stirred at room temperature for 2 hours then purified by reverse phase liquid chromatography (Isco, 0 to 30% acetonitrile and water with no modifier). Yielded 1.04 g, 69.5%. Ion found by LCMS [M-Boc+H]+=425.2.


Step e.




embedded image


A solution of the product from step e (1.04 g, 1.77 mmol) in dichloromethane (5 mL) was treated with HCl (4N in Dioxane, 2.22 mL, 8.86 mmol). The resulting solution was stirred at room temperature until deprotection was complete by LCMS, then solvents were removed by rotary evaporation to yield the desired product as HCl salt. Yielded 0.94 g, 95%. Ion found by LCMS [M+H]+=425.2.


Step f.




embedded image


To a solution of the product from step e (0.37 g, 0.79 mmol), the triazole acid (described in Example 5 (Int-2), 0.31 g, 0.50 mmol) and HATU (0.29 g, 0.75 mmol) in DMF (4 mL) was added DIEA (0.41 mL, 2.29 mmol). The resulting solution was stirred at room temperature for 16 hours then purified by semipreparative HPLC (5 to 35% acetonitrile and water, using 0.1% TFA as modifier). Yielded 0.12 g as TFA salt, 26%. Ion found by LCMS [M+H]+=918.1, [M+2+H]/2+=459.7.


Example 17. Synthesis of Int-16



embedded image


Step a.




embedded image


CBZ-piperazine (3.8 g, 17.4 mmol), N-Boc-bromo-ethylamine (3 g, 13.4 mmol), and diisopropylethylamine (3.5 g, 26.8 mmol) were stirred in acetonitrile (30 mL) at 65° C. for 18 hours. The mixture was cooled to room temperature and concentrated, diluted with DI water (100 mL) and extracted with ethyl acetate (3×75 mL). The combined organic extracts were washed with brine and dried over sodium sulfate. The crude material was purified by silica gel chromatography (0-5% methanol in DCM, 30 minutes). The pure fractions were combined and concentrated to afford the product as a clear oil. Yield 3.3, 67%. LC/MS [M+H]+=364.2.


Step b.




embedded image


The product from step a (3.3 g, 9.1 mmol) of this example was stirred in methanol (30 mL) in the presence of 5% Pd/C (200 mg) under 1 atm of hydrogen gas for 2 hours. The mixture was filtered through celite and taken forward without further purification. Yield 2 g, ˜99%. LC/MS [M+H]+=230.2.


Step c.




embedded image


The product from step b of this example (0.74 g, 3.2 mmol), propargyl-peg4-tosylate (1.9 g, 4.8 mmol), and diisopropylethylamine (0.83 g, 6.5 mmol) were stirred in acetonitrile (20 ml) at reflux for 8 hours. The mixture was cooled and purified by reverse phase liquid chromatography (RPLC) using an Isco CombiFlash liquid chromatograph eluted with 5% to 80% acetonitrile/water with 0.1% TFA as the modifier, 30 minute gradient. The pure fractions were pooled and concentrated to afford the product as a thick light yellow oil. Yield 0.71 g, 50%. LC/MS [M+H]+=444.2.


Step d.




embedded image


The product from step c of this example (0.71 g, 1.6 mmol) was stirred in 4N HCl in dioxane (8 mL) for 1 hour. The mixture was concentrated on the rotary evaporator, dissolved in DI water (20 mL), frozen and lyophilized to afford the product as an HCl salt. Yield 0.75 g, 99%. LC/MS [M+H]+=344.2.


Step e.




embedded image


HATU (162 mg, 0.43 mmol) in DMF (1 mL) was added, dropwise over 5 minutes to a stirring mixture of the product from step d (265 mg, 0.58 mmol), triazole carboxylic acid scaffold (200 mg, 0.38 mmol, described in Example 5 (Int-2)), and diisopropylethylamine (200 mg, 2.3 mmol) in DMF (5 mL). The mixture was stirred at ambient temperature for 2 hours and then concentrated on the rotary evaporator. The crude material was purified on the ACQ semi-prep HPLC eluted with 5% to 50% acetonitrile/water with 0.1% TFA as the modifier, 30 minute gradient. The pure fractions were pooled and concentrated to afford the product as a thick light yellow oil. Yield 50 mg, 15%. LC/MS [M+H]+=837.4.


Example 18. Synthesis of Int-17



embedded image


Step a.




embedded image


To a solution of Tris(hydroxymethyl)-aminomethane (1.22 g, 10 mmol) and 3-[(Benzyloxycarbonyl)amino]-1-propanal (2.1 g, 10 mmol) in DCM (20 mL) and methanol (10 ml) was added acetic acid (1 ml). The resulting solution was stirred for 1 hour at room temperature, then treated under vigorous stirring with sodium triacetoxyborohydride (4.2 g, 20 mmol). This mixture was stirred overnight, then concentrated and purified by reverse phase liquid chromatography (RPLC) using an Isco CombiFlash liquid chromatograph eluted with 5% to 80% acetonitrile and water with 0.1% TFA as modifier. Yield of the products 2.3 g, 72.0%. Ion(s) found by LCMS: M+H=313.2.


Step b.




embedded image


To a solution of the product from the previous step (0.1 g, 0.32 mmol) and propargyl-PEG4-acid (130 mg, 0.5 mmol) in DMF (5 ml) was added HATU (38 mg, 0.1 mmol), and N-methylmorpholine (0.14 ml, 1 mmol) at room temperature, and the resulting solution was stirred for 1 hour at room temperature. The solution was concentrated and purified by and purified by reverse phase liquid chromatography (RPLC) using an Isco CombiFlash liquid chromatograph eluted with 10% to 100% acetonitrile and water with 0.1% TFA as modifier. Yield of products 120 mg, 68%. Ion(s) found by LCMS: M+H=554.3.


Step c.




embedded image


The product from the previous step (0.2 g, 32 mmol) was treated with TFA (3 mL) and thioanisole (0.2 ml), and the resulted solution was heated to 45° C. for overnight. The solution was concentrated and purified by and purified by reverse phase liquid chromatography (RPLC) using an Isco CombiFlash liquid chromatograph eluted with 10% to 100% acetonitrile and water with 0.1% TFA as modifier. Yield was quantitative for this step. Ion(s) found by LCMS: M+H=421.3.


Step d.




embedded image


To a solution of 1-{3-[{4-[cyano(phenyl)methylidene]piperidin-1-yl}(oxo)acetyl]-4-methoxy-1H-pyrrolo[2,3-c]pyridine-7-yl}-1H-1,2,4-triazole-3-carboxylic acid (50 mg, 0.1 mmol, described in Example 5, Int-2 and the product from previous step (41 mg, 0.1 mmol) in DMF (2 ml) was added HATU (38 mg, 0.1 mmol), and N-Methylmorpholine (0.07 ml, 0.5 mmol) at room temperature, and the resulting solution was stirred for 1 hour at room temperature. The solution was concentrated and purified by and purified by reverse phase liquid chromatography (RPLC) using an Isco CombiFlash liquid chromatograph eluted with 10% to 100% acetonitrile and water with 0.1% TFA as modifier. Yield of product 21 mg, 24.0%. Ion(s) found by LCMS: M+H=914.4.


Example 19. Screening of Ints and Conjugates in an In Vitro Cell Fusion Assay

Activity of Ints was determined in an assay designed to measure the inhibition of cell-cell fusion which is an important step in the HIV infection process. Briefly, this assay measures the fusion of two cell lines, HeLa-CD4-LTR-β-Gal (catalog #1294) and HL2/3 cells (catalog #1294), obtained from the AIDS Research Reagent and Reference Program (Rockville, Md.). HeLa-CD4-LTR-β-Gal cells were plated at a density of 5×103 cells per well in a volume of 50 μL, with 50 μL of nine serial half-logarithmic dilutions of compound in triplicate for one hour at 37° C./5% CO2. Following the incubation, 100 μL of HL2/3 cells were added to the plates. The cultures were incubated for an additional 48 hours at 37° C./5% CO2. Following the incubation, efficacy plates were evaluated for β-galactosidase production using a chemiluminescent substrate and toxicity plates were stained with XTT to evaluate cell viability.


In these studies, cytotoxicity was also evaluated (TC50). Test materials were derived by measuring the reduction of the tetrazolium dye XTT (2,3-bis(2-methoxy-4-nitro-5-sulfophenyl)-5-[(phenylamino)carbonyl]-2H-tetrazolium hydroxide). XTT in metabolically active cells is metabolized by the mitochondrial enzyme NADPH oxidase to a soluble formazan product. XTT solution was prepared daily as a stock of 1 mg/mL in RPMI-1640 without additives. Phenazine methosulfate (PMS) solution was prepared at 0.15 mg/mL in DPBS and stored in the dark at −20° C. XTT/PMS stock was prepared immediately before use by adding 40 μL of PMS per mL of XTT solution. Fifty L (50 μL) of XTT/PMS was added to each well of the plate and the plate incubated for 4 hours at 37° C. The 4 hour incubation has been empirically determined to be within the linear response range for XTT dye reduction with the indicated numbers of cells for each assay. The plates were sealed and inverted several times to mix the soluble formazan product and the plate was read at 450 nm (650 nm reference wavelength) with a Molecular Devices SpectraMax Plus 384 96 well plate format spectrophotometer.


This assay identified four compounds with EC50 values approximately equal to the benchmark compound (Temsavir) (Table 6). These compounds were highly potent at inhibiting cell fusion with EC50 values of less than 0.9 nM. One of these compounds, Int-17, also demonstrated no apparent loss of activity upon conjugation to an hIgG1 Fc (conjugate 5); this was an important finding. Lastly, no compounds showed cytotoxicity at the concentrations tested in this study. Therefore, for the most active compounds the difference between EC50 and cytotoxicity is greater than 10,000-fold. A prior fusion inhibition study also identified several highly active compounds (Int-2 and Int-4). However, both compounds lost significant potency upon conjugation (conjugates 2 and 3, respectively), further emphasizing the significance of conjugate 4.









TABLE 6







Fusion inhibition activity of HIV inhibitor compounds










EC50 [μM]
TC50 [μM]













Run
1
2
3
1
2
3
















CSB
0.646
0.53
0.785
>10.0
>10.0
>10.0


(control)


Temsavir
0.00284
0.000231
<0.0009
>10.0
>10.0
>10.0


(control)


Int-1
0.00383


>10.0


Int-2

0.0000014
<0.0009

>10.0
>10.0


Int-3
0.0282


>10.0


Int-4

0.000829


>10.0


Int-5
>0.5


>0.5


Int-6
>0.5


>0.5


Int-7
0.18


>0.5


Int-8
5.16


>10.0


Int-9
0.198


>10.0


Int-10
0.529


>10.0


Int-11
0.209


>10.0


Int-12


<0.0009


>10.0


Int-13


0.003


>10.0


Int-14


0.0044


>10.0


Int-15


<0.0009


>10.0


Int-16


0.0018


>10.0


Int-17


<0.0009


>10.0


Conjugate 2

0.162


>0.6


Conjugate 3

0.249


>10


Conjugate 4

0.206


>0.8


Conjugate 5

<0.0005


>5


Conjugate 6
>2


>2


Conjugate 7
1.05


>2









Example 20. Synthesis of DMJ-II-121



embedded image


embedded image


Step a.



embedded image


(1S, 2R)-(−)-Cis-1-amino-2-indanol (2.98 g, 20 mmol) was dissolved in anhydrous DMF (10 ml) by heating with a heat gun. After the solution was cooled to room temperature, anhydrous THF (20 ml) and DIPEA (2.59 g, 20 mmol) were added. To this well-stirred solution was slowly added di-tert-butyl-dicarbonate (5.46 g, 25 mmol). Upon the addition, a white gel was formed and it was manually broken into small pieces. The reaction became a clear solution after 2-hours stirring at room temperature. It was then extracted with water (50 ml) and hexane (50 ml). The aqueous layer was back-extracted with EtOAc (50 ml). The combined organic layers were dried over Na2SO4 and concentrated by rotary evaporation. The residue was purified by silica gel column chromatography (120 g column, 10% to 45% EtOAc/hexane). Yield 4.97 g, 99.7%. Ion found by LCMS: [M−Boc]+=150, [M−H]=248.9.


Step b.




embedded image


The step-a product (4.97 g, 19.94 mmol) was dissolved anhydrous DCM (30 ml). After cooling in an ice-water bath, the solution was treated with DIPEA (5.17 g, 40 mmol) followed by slow addition of methanesulfonyl chloride (2.75 g, 24 mmol). The reaction mixture was stirred at 0° C. to room temperature overnight, then extracted with water (30 ml). The organic layer was dried over Na2SO4, concentrated by rotary evaporation, and further dried under high vacuum. The crude product (6.24 g) was carried to the subsequent step without further purification. Ion found by LCMS: [M−Boc]+=228.


Step c.




embedded image


The crude product in step-b (assumed 19.94 mmol) was dissolved in anhydrous DMSO (20 ml). Potassium cyanide (6.51 g, 100 mmol) was added, and the resulting mixture was heated at 80° C. for 20 hours. It was then cooled to room temperature and diluted with EtOAc (70 ml), and hexane (100 ml). The solid was filtered off and washed with EtOAc. The filtrate was then extracted with water (50 ml×4). The combined organic layers were concentrated by rotary evaporation, then purified by silica gel column chromatography (220 g column, 5% to 30% EtOAc/hexane). Yield 2.34 g, 45.4% over two steps. Ion found by LCMS: [M+H]+=259.2.


Step d.




embedded image


A flame-dried reaction flask was purged with nitrogen and charged with the step-c product (2.18 g, 8.44 mmol) and anhydrous THF (12 ml). The solution was cooled in an ice-water bath, then treated dropwise with LiAlH4 (1.0 M in THF, 8.5 ml, 8.5 mmol). The resulting mixture was stirred at 0° C. to room temperature for 1.5 hours. It was then cooled back in an ice-water bath and slowly treated with a solution of KOH (940 mg, 16.8 mmol) in water (15 ml). The solid was filtered off and washed with EtOAc. The filtrate was extracted with water (30 ml). The organic layer was dried over Na2SO4, concentrated by rotary evaporation and further dried under high vacuum. 2.27 g of the crude product was carried to the subsequent step without further purification. Ion found by LCMS: [M+H]+=263.


Step e.




embedded image


The crude product from step-d (2.27 g, assumed 8.44 mmol) was dissolved in anhydrous DCM (20 ml), treated with DIPEA (1.16 g, 8.44 mmol) and N-(benzyloxycarbonyloxy) succinimide (4.1 g, 16.5 mmol). The resulting mixture was stirred at room temperature overnight. It was then extracted with water (30 ml). The organic layer was dried over Na2SO4 and concentrated by rotary evaporation. The residue was purified by silica gel column chromatography (220 g column, 5% to 40% EtOAc and hexane). Yield 2.2 g, 65.7% for two steps. Ion found by LCMS: [M−Boc+H]+=297.2, [M−BocNH3+H]+=280.1.


Step f.




embedded image


To a solution of the step-e product (2.07 g, 5.22 mmol) in THF (5 ml) was added 4N HCl in dioxane (10 ml, 40 mmol). The reaction mixture was stirred for 1 hour, then extracted with hexane (15 ml) and water (5 ml×3). The combined aqueous layers were lyophilized to yield a light yellow solid. The material was carried to the subsequent step without further purification. Yield 1.7 g, 92%. Ion found by LCMS: [M+H]+=297.2, [M−NH4+H]+=280.1.


Step g.




embedded image


4-Chloro-3-fluoroaniline (4.368 g, 30 mmol) was dissolved in anhydrous DCM (30 ml). After the solution was cooled in an ice-water bath, DIEPA (4.265 g, 33 mmol) was added followed by drop-wise addition of methyl oxalylchloride (3.92 g, 32 mmol). The reaction was stirred for 1 hour in the ice-water bath, then at room temperature for 5 hours. The white precipitate was filtered and was re-dissolved in hot THF (50 ml). It was extracted with water (100 ml). The filtrate from the filtration was extracted with water. The combined organic layers from the two extractions were dried over Na2SO4, concentrated by rotary evaporation and further dried in high vacuum. The crude product was carried to the subsequent step without further purification. Yield 6.98 g, quantitative yield. Ion found by LCMS: [M+H]+=232.0, [M+Na]+=254.


Step h.




embedded image


The step-g product (2.31 g, 10 mmol) was dissolved in MeOH (50 ml) by heating at 60° C. A solution of KOH (2.25 g, 40 mmol) in water (40 ml) was added. White gel was formed upon the addition of KOH. The reaction was continued at 60° C. for 30 minutes, then slowly acidified with 6N HCl aqueous solution (15 ml). The reaction was heated up to 100° C. until all gel was dissolved (It took about 10 minutes). MeOH was removed by rotary evaporation, and the white product was filtered, washed with water then dried under high vacuum. Yield 2.2 g, quantitative yield. Ion found by LCMS: [M−H]=216.0.


Step i.




embedded image


To a mixture of the step-h product (255.7 mg, 1 mmol) and the step-f product (332.8 mg, 1 mmol) in anhydrous DMF (1 ml) was added HATU (437 mg, 1.15 mmol) and stirred for 5 minutes. DIPEA (258.5 mg, 2 mmol) was added, and reaction was continued for 1 hour. It was then purified by silica gel column chromatography (80 g column, 5% to 50% EtOAc/hexane). Yield 480.7 mg, 96.9%. Ion found by LCMS: [M+Na]+=518.0.


Step j.




embedded image


The step i product (360.5 mg, 0.728 mmol) was dissolved in TFA (2 ml). Thioanisole (169 mg, 1.36 mmol) was added, and the resulting mixture was heated at 70° C. for 1 hour. It was then cooled to room temperature and directly purified by RPLC (150 g column, 5% to 70% acetonitrile and water, using 0.1% TFA as the modifier). Yield 331.2 mg, 95.6%. Ion found by LCMS: [M+H]+=362.0.


Step k.




embedded image


To a solution of the step-j product (32.4 mg, 0.0682 mmol) in anhydrous THF (0.5 ml) was added DIPEA (31 mg, 0.3 mmol) and N,N′-Bis-Boc-1-guanylpyrazole (31 mg, 0.1 mmol). The reaction mixture was heated at 50° C. overnight. It was then purified by RPLC (50 g column, 30% to 100% acetonitrile and water, using 0.1% TFA as the modifier). Yield 40.3 mg, 97.9%. Ion found by LCMS: [M+H]+=604.2.


Step l.




embedded image


The step-k product (40.3 mg, 0.0667 mmol) was dissolved in DCM/TFA (1:1, 1 ml), then heated at 40° C. for 1 hour. The crude reaction was concentrated and purified by RPLC (50 g, 5 to 60% acetonitrile and water, using 0.1% TFA as modifier). Yield 20 mg, 57.9%. Ion found by LCMS: [M+H]+=404.0.


Example 21. Synthesis of Conjugate 8

A solution of azido functionalized Fc (50 mg, 28.43 mL, 0.862 μmol, 1.76 mg/mL; SEQ ID NO: 64, Example 2) was added to a 50 mL centrifuge tube following by addition of alkyne derivatized small molecule (15.83 mg, 0.012 mmol, Int-15, Example 16) in EPPES at pH 8.5, and a solution of copper (II) sulfate (1.1 mg, 0.0043 mmol) in water mixed with THTPA (0.43 mL, 0.0216 mmol, 50 nM in water), aminoguanidine HCl (2.16 mL, 100 mM in water), and sodium ascorbate (2.16 mL, 100 mM in water). The resulting solution was gently shaken for 4 hours. It was purified by affinity chromatography over a protein A column, followed by size exclusion chromatography (as described in Example 8). Maldi TOF analysis of the purified final product gave an average mass of 60593 Da (DAR 2.5). Yield 12.71 mg, 25%.


Example 22. Synthesis of Int-18



embedded image


Step a.




embedded image


HATU (164 mg, 0.43 mmol) was added to a stirring mixture of the triazole acid starting material (described in Example 05, Int-2)), (185 mg, 0.36) tert-Butyl 4-(3-aminopropyl)piperazine-1-carboxylate (96 mg, 0.39 mmol), and diisopropylethylamine (186 mg, 1.44 mmol) in DMF (3 L) and stirred for 12 hours. The solvent was removed on a rotary evaporator and the resulting oil was purified by RPLC Isco COMBIFLASH® (20-95% ACN in DI water, 0.1% TFA, 30 minute gradient). The pure fractions were pooled and lyophilized and taken on to the next step. Ion found by LC/MS [M+H]+=737.4. The Boc protected intermediate was stirred in a 1/1 mixture of DCM/TFA (10 ml) at ambient temperature for 2 hours, then concentrated on the rotary evaporator and purified by RPLC Isco COMBIFLASH® (20-95% ACN in DI water, 0.1% TFA, 30 minute gradient). The pure fractions were pooled and lyophilized. Yield 120 mg, 44%, 2 steps. Ion found by LC/MS [M+H]+=637.2.


Step b.




embedded image


The intermediate from the previous step (90 mg, 0.41 mmol), propargyl peg4 mesylate (57 mg, 0.18 mmol), and diisopropylethylamine (36 mg, 0.28 mmol) were stirred together in DMF (3 mL) at 80° C. for 12 hours. The solvent was removed by rotary evaporator and purified by RPLC Isco COMBIFLASH® (20-95% ACN in DI water, 0.1% TFA, 30 minute gradient). The pure fractions were pooled and lyophilized. Yield 120 mg, 44%. Ion found by LC/MS [M+H]+=851.2.


Example 23. Synthesis of Int-19



embedded image


Step a.




embedded image


A mixture of Z-piperazine (29.62 g, 131.8 mmol), 2-(Boc-amino)ethyl bromide (24.87 g, 105.4 mmol), KI (8.75 g, 52.7 mmol) and potassium carbonate (21.86 g, 158.1 mmol) in 1,4-dioxane (300 mL) was stirred at 75° C. for 24 hrs. The crude reaction mixture was filtered and concentrated. The residue was purified by normal phase chromatography, eluting with 0% to 10% methanol/dichloromethane to give the pure product as an oil (32.0 g, 83% isolated yield). Ions found by LCMS: M+H+: 364.2. H1 NMR (300 MHz): 7.26-7.45 (m, 5H), 5.10-5.20 (m, 2H), 3.47-3.60 (m, 4H), 3.18-3.30 (m, 2H), 2.32-2.51 (m, 4H), 1.60-1.75 (m, 2H) and 1.47 (s, 9H).


This product (5.38 g, 14.8 mmol) was treated with 5% Pd/C (1.57 g, 0.74 mmol) in methanol (100 mL) under hydrogen from a balloon for 3 hrs. After Celite filtration and solvent removal, the product was obtained as a white foam in quantitative yield and used for next step without further purification. Ions found by LCMS: M+H+: 230.2.


Step b.




embedded image


A solution of 2-(Boc-amino)-ethyl-1-piperazine (3.707 g, 15.23 mmol), Z-piperazinyl-propyl bromide (5.728 g, 16.76 mmol, ACS MEdChem Lett, 2018, 446), K2CO3 (3.158 g, 22.85 mmol) and KI (1.264 g, 7.619 mmol) in 1,4-dioxane (100 mL) was heated in an oil bath at 75 C for 24 h. The mixture was filtered, concentrated and purified by normal phase chromatography, eluting with 0% to 10% methanol/dichloromethane to give the product as a foam (5.45 g, 75% isolated yield). Ions found by LCMS: M+H++: 490.2


This product (5.45 g, 11.13 mmol) was treated with 5% Pd(OH)2/C (3.91 g, 5.57 mmol) in methanol (100 mL) under hydrogen from a balloon overnight. After Celite filtration and solvent removal, the desired product was obtained as a white foam in quantitative yield, and used for next step without further purification. Ions found by LCMS: M+H+: 356.2; M-Boc+H*: 256.2.


Step c.




embedded image


A solution of 2-(Boc-amino)-ethyl-1-piperazinyl-propyl-piperazine (1.31 g, 3.68 mmol), propargyl-PEG4 mesylate (1.721 g, 5.53 mmol), K2CO3 (0.764 g, 5.53 mmol) and KI (0.306 g, 1.84 mmol) in 1,4-dioxane (50 mL) was heated in an oil bath at 75° C. for 24 h. The mixture was filtered, concentrated and purified by reverse phase chromatography, eluting with 5% to 45% ACN/water (0.1% TFA) to give the pure product as an oil (0.890 g, 42% isolated yield). Ions found by LCMS: M+H++: 570.4; (M+2H+)/2: 284.8.


This product (0.104 g, 0.183 mmol) was treated with 4M HCl in dioxane (10 mL) for 2 hrs. After solvent removal, the product was obtained as a white foam in quantitative yield, and used in the next step without further purification.


Step d.




embedded image


A solution of propargyl-PEG4-1-piperazinyl-propyl-piperazinyl-ethylamine (0.0857 g, 0.182 mmol), triazole acid core (0.121 g, 0.237 mmol, described in Example 05, Int-2), NaHCO3 (0.0613 g, 0.730 mmol), NMM (0.030 mL, 0.274 mmol) and HATU (0.138 g, 0.365 mmol) in DMF (5 mL) was stirred for 4 hrs. The solvent was removed and the residue was dissolved in minimal amount of NMP/water (1:1, 0.1% TFA) and purified by reverse phase chromatography, eluting with 5% to 45% ACN/water (0.1% TFA) to give the desired product as an oil. Ions found by LCMS: M+H+: 963.2; (M+2H+)/2: 482.2.


Example 24. Synthesis of Int-20



embedded image


Step a.




embedded image


To a 0° C. stirring solution of 1,2,4-triazole-3-carboxylic acid (500 mg, 4.422 mmol), propargyl-PEG4-amine (1.227 g, 5.306 mmol), N,N-diisopropylethylamine (3.466 mL, 19.90 mmol) in dichloromethane (1.0 mL) and DMF (5.0 mL), was added a solution of propylphosphonic anhydride solution (2.711 mL, 4.643 mmol, ˜50% in DMF). The temperature was raised to ambient after 10 minutes, and upon completion of the reaction as determined by LCMS, all volatiles were removed per rotatory evaporation. The residue was stirred in water under until a suspension was obtained. The mixture was filtered, and the solids were washed with water (3×30 mL). The solids were collected and dried per vacuum techniques. The oil was used in the without further purification. Yield 968 mg, 67%. Ions found by LCMS: [(M+H)+Na]+=349.2; [(M+H]]+=327.2.


Step b.




embedded image


To a 0° C. stirring solution of the heterocyclic acid (200 mg, 0.669 mmol, described in Org. Process Res. Dev. 2017, 21, 1145-1155), 1-benzoylpiperazine (178 mg, 0.936 mmol), N,N-diisopropylethylamine (524 uL, 3.009 mmol), dissolved in dichloromethane (0.250 mL) and N,N-dimethylformamide (2.5 mL), was added a 50% solution of propylphosphonic anhydride in DMF (390 uL, 0.669 mmol). Upon completion of the reaction as determined by LCMS, all the volatiles were evaporated per vacuum techniques. The thick crude was taken up with vigorous stirring in water (30 mL). Stirring was continued until a suspension formed. The mixture was filtered and the solids were washed with additional water (3×30 mL), then collected and dried per vacuum techniques. This material was used in the next step without additional purification. Yield 0.268 mg, 85%. Ions found by LCMS: [(M+H)+Na]+=494.9, 492.9; [(M+H]]+=473.0, 471.0.


Step c.




embedded image


Strictly under nitrogen in a sealed tube, a stirring mixture of step b product (268 mg, 0.569 mmol), N-(propargyl-PEG4)-1H-1,2,4-triazole-3-carboxamide (278 mg, 0.853 mmol), KOH (60 mg, 1.080 mmol), water (512 mg, 28.43 mmol), trans-N,N′-dimethylcyclohexane-1,2-diamine (64 mg, 0.455 mmol) and CuI (32 mg, 0.170 mmol), was heated at 100° C. for 2 days. Upon cooling, the crude reaction was mixed with copper scavenging resin SiliaMetS TAAcONa (800 mg, loading 0.45 mmol/g) and stirred for 1 hour. The mixture was filtered and the filtrate was evaporated per vacuum techniques. The residue was purified by RPLC using an Isco COMBIFLASH® liquid chromatography eluted with 0% to 100% water and methanol. Yield 93 mg, 23%. Ions found by LCMS: [(M+H)+Na]+=739.2; [(M+H]]+=717.2.


Example 25. Synthesis of Int-21



embedded image


Step a.


A mixture of tert-Butyl [3-(piperazin-1-yl)propyl]carbamate (1.73 g, 7.09 mmol), triethyleneglycolmethanesulfonate propargylether (2.00 g, 6.44 mmol), potassium carbonate (1.96 g, 14.18 mmol), and acetonitrile (20 mL) were heated in a 70° C. oil bath for 24 hr, at which time LCMS indicated that most starting materials had been consumed. The resulting mixture was filtered to remove salt and potassium carbonate. The filtrate was concentrated and purified by RPLC (10% to 100% ACN/water containing 0.1% TFA) giving 2.60 g of double TFA salt (59% yield).


Step b.


Product from the previous step (2.60 g, 5.68 mmol) was treated with HCl/dioxane (4M, 15 mL) for 1 hr at room temperature, and then concentrated to dryness and used without further purification (Yield: quantitative).


Step c.


Strictly under nitrogen in a sealed tube, a stirring suspension of previously described intermediate (145 mg, 0.308 mmol, see Int-20, Example 24 step b), methyl 1,2,4-triazole-3-carboxylate (59 mg, 0.461 mmol), KOH (50 mg, 0.892 mmol), water (166 mg, 9.230 mmol), trans-N,N′-dimethylcyclohexane-1,2-diamine (35 mg, 0.246 mmol) and CuI (18 mg, 0.092 mmol), was heated at 100° C. for 2 days. Upon cooling, volatiles were evaporated via rotary evaporator. The solid residue was washed with ethyl acetate (3×12 mL) and then dichloromethane (3×12 mL). The obtained solids were dissolved in water and treated for 3 h under stirring with SiliaMetS TAAcONa (300 mg, loading 0.45 mmol/g). The suspension was filtered and all the volatiles were removed via rotary evaporation. To a 0° C. stirring solution of the obtained residue, propargyl-PEG4 piperazine linker from step b (HCl salt, 144 mg, 0.308 mmol), DIPEA (241 uL, 1.384 mmol) in DMF (2.0 mL), was added a 50% solution of propylphosphonic anhydride in DMF (196 uL, 0.338 mmol). Upon completion, all the volatiles were evaporated per vacuum techniques. The residue was purified by RPLC using an Isco COMBIFLASH® liquid chromatography eluted with 0% to 100% water and methanol. Yield 29 mg, 11% for 2 steps. Ions found by LCMS: [(M+H)+Na]+=865.2; [(M+H]]+=843.2; [(M+2H)/2]+=422.2.


Example 26. Synthesis of Int-22



embedded image


Step a.




embedded image


A mixture of Z-piperazine (14.16 g, 63.0 mmol), 3-(Boc-amino)propyl bromide (12.50 g, 50.4 mmol), and potassium carbonate (10.45 g, 75.9 mmol) in 1,4-dioxane (150 mL) was stirred at 75° C. for 24 hrs. The crude reaction mixture was filtered and concentrated, then purified by normal phase chromatography, eluting with 0% to 10% methanol/dichloromethane to give the pure product as an oil (17.23 g, 90% isolated yield). Ions found by LCMS: M+H+: 378.2. H1 NMR (300 MHz): 7.26-7.40 (m, 5H), 5.10-5.20 (m, 2H), 3.47-3.60 (m, 4H), 3.14-3.30 (m, 2H), 2.32-2.50 (m, 6H), 1.60-1.75 (m, 2H) and 1.45 (s, 9H).


The Cbz-protected product (4.84 g, 12.8 mmol) was treated with 5% Pd/C (1.34 g, 0.64 mmol) in methanol (100 mL) under hydrogen balloon for 3 hrs. After Celite filtration and solvent removal, the pure product was obtained as a white foam in quantitative yield and used for next step without further purification. Ions found by LCMS: M+H+: 244.2.


Step b.




embedded image


A solution of 3-(Boc-amino)-propyl-1-piperazine from the previous step (3.707 g, 15.23 mmol), Z-piperazinyl-propyl bromide (5.728 g, 16.76 mmol, described in ACS MedChem Lett, 2018, 446), K2CO3 (3.158 g, 22.85 mmol) and KI (1.264 g, 7.619 mmol) in 1,4-dioxane (100 mL) was heated in an oil bath at 75° C. for 24 h. The mixture was filtered, concentrated and purified by normal phase chromatography, eluting with 0% to 10% methanol/dichloromethane to give the desired product as a foam (6.59 g, 85% isolated yield). LC/MS mass: M+H+: 504.4


This product (5.45 g, 11.13 mmol) was treated with 5% Pd(OH)2/C (1.34 g, 0.64 mmol) in methanol (100 mL) under hydrogen balloon for 12 h. After Celite filtration and solvent removal, the pure product was obtained as a white foam in quantitative yield and used in the next step without further purification. LC/MS mass: M+H++: 370.2.


Step c.




embedded image


A solution of 3-(Boc-amino)-propyl-1-piperazinyl-propyl-piperazine (1.98 g, 5.36 mmol), propargyl-PEG4 bromide (2.421 g, 8.037 mmol), K2CO3 (1.851 g, 14.0 mmol) and KI (0.445 g, 2.68 mmol) in 1,4-dioxane (50 mL) was heated in an oil bath at 75° C. for 24 h. The mixture was filtered, concentrated and purified by reverse phase chromatography, eluting with 5% to 45% ACN/water (0.1% TFA) to give the pure product as an oil (1.61 g, 51% isolated yield). Ions found by LCMS: M+H+: 584.4; (M+2H+)/2: 292.6.


This product (0.107 g, 0.183 mmol) was treated with 4M HCl in dioxane (10 mL) for 2 hrs. After solvent removal, the product was obtained as a white foam and used in the next step without further purification.


Step d.




embedded image


A solution of propargyl-PEG4-1-piperazinyl-propyl-piperazinyl-propylamine HCl salt (0.0886 g, 0.163 mmol), Triazole acid core (0.122 g, 0.238 mmol, described in Example 05, Int-2), NaHCO3 (0.0616 g, 0.733 mmol), N-methyl morpholine (0.031 mL, 0.274 mmol) and HATU (0.139 g, 0.366 mmol) in DMF (5 mL) was stirred for 4 hrs. The solvent was removed and the residue was dissolved in minimal amount of NMP/water (1:1, w/0.1% TFA) and purified by reverse phase chromatography, eluting with 5% to 45% ACN/water (w/0.1% TFA) to give the desired product as an oil. Ions found by LCMS: M+H+: 977.3; (M+2H+)/2: 489.4.


Example 27. Synthesis of Conjugate 9

Prepared the Click reagent solution: 0.0050M CuSO4 in PBS buffer solution: 10.0 mg CuSO4 was dissolved in 12.53 mL PBS, then took 6.00 mL this CuSO4 solution and added 64.8 mg BTTAA (CAS #1334179-85-9) and 297 mg sodium ascorbate to give the Click reagent solution (0.0050M CuSO4, 0.025M BTTAA and 0.25M sodium ascorbate).


To a solution of azido functionalized Fc (122.1 mg, 8.55 mL, 21.1 μmol, SEQ ID NO: 64, Example 2, DAR=3.9, in 25 mM MES, 150 mM NaCl, pH6.0 buffer) in a 15 mL centrifuge tube was added an alkyne derivatized small molecule (25.0 mg, 19.0 mmol, 3.0 equivalents for each azido on the Fc, described in Example 26, Int-22) in 1.5 mL of MES buffer. After gently agitating, the mixture was treated with the Click reagent solution (5.05 mL). The resulting mixture was gently rotated for 4 hours at ambient temperature. It was then purified by affinity chromatography over a protein A column, followed size exclusion chromatography (see general conjugate purification protocol). Maldi TOF analysis of the purified final product gave an average mass of 65,947 Da (DAR=3.4). Yield 50 mg/41%.


Example 28. Synthesis of Int-23



embedded image


Step a.




embedded image


The aryl bromide starting material (previously described in Example 05, Int-2) (350 mg, 1.36 mmol) was added to the triazole ethyl ester (421 mg, 2.71 mmol) in DMF (10 mL) and stirred under nitrogen until fully dissolved, then dioxane (20 mL) was added followed by potassium carbonate (561, 4.07 mmol) and then trans-N,N-dimethylcyclohexane-diamine (38 mg, 027 mmol). The mixture was evacuated and purged with nitrogen (3×), then CuI (129 mg, 0.68 mmol) was added and the mixture was vacuum/purged again (3×) with nitrogen and stirred at 100° C. under 1 atm of nitrogen for 4 hours. The mixture was filtered, and concentrated. The crude product was purified by RPLC Isco COMBIFLASH® (15-95% ACN in DI water, 0.1% TFA, 30 min). The pure fractions were pooled and lyophilized to afford the ethyl ester product as a light brown solid. Ion found by LC/MS [M+H]+=554.2


The ester from the previous reaction was stirred in a 4/1 mixture of methanol/di water (5 mL) containing LiOH (97 mg, 4.1 mmol) for 1 hour. The pH was raised to pH-5 with glacial acetic acid and concentrated. The residue was dissolved in DMF (2 mL) and purified by RPLC Isco COMBIFLASH® (20-95% ACN in DI water, 0.1% TFA, 30 minute gradient). The pure fractions were pooled and lyophilized. Yield 375 mg, 52% for two steps. Ion found by LC/MS [M+H]+=525.8.


Step b.




embedded image


HATU (71 mg, 0.19 mmol) was added to a stirring mixture of the triazole acid (90 mg, 0.17), described in step a of this example, tert-Butyl 4-(3-aminopropyl)piperazine-1-carboxylate (46 mg, 0.19 mmol), and diisopropylethylamine (110 mg, 0.86 mmol) in DMF (3 mL) and the reaction was stirred for 12 hours. The solvent was removed on the rotary evaporator and the mixture was purified reversed phase HPLC Isco COMBIFLASH® (20-95% ACN in DI water, 0.1% TFA, 30 minute gradient). The pure fractions were pooled and lyophilized. 85 mg were isolated. Ion found by LC/MS [M+H]+=751.4.


The Boc protected intermediate was stirred in a 1/1 mixture of DCM/TFA (10 ml) at ambient temperature for 2 hours, concentrated on a rotary evaporator and purified by reversed phase HPLC Isco COMBIFLASH® (20-95% ACN in DI water, 0.1% TFA, 30 minute gradient). The pure fractions were pooled and lyophilized. Yield 67 mg, 45%, 2 steps Ion found by LC/MS [M+H]+=651.2.


Step c.




embedded image


The intermediate from step b. of this example (90 mg, 0.41 mmol), propargyl peg4 mesylate (57 mg, 0.18 mmol), and diisopropylethylamine (36 mg, 0.28 mmol) were stirred together in DMF (3 mL) at 80° C. for 12 hours. The solvent was removed by rotary evaporator and purified by reversed phase ACQ semi prep (20-95% ACN in DI water, 0.1% TFA, 30 minute gradient). The pure fractions were pooled and lyophilized. Yield 120 mg, 44%. Ion found by LC/MS [M+H]+=865.4.


Example 29. Synthesis of Int-24



embedded image


Step a.




embedded image


To the free base of 2-phenyl-2-(4-piperidylidene)ethanenitrile (see Example 05, Int-2, 0.28 g, 1.16 mmol) in methanol (12 mL) was added 5% Pd/C and H2 from a balloon. The reaction was stirred at room temperature for 16 hours then filtered through a pad of Celite. The solvent was removed under reduced pressure to give a light yellow solid. This material was used in the next step without further purification. Yield 0.12 g, 50.9%. Ion found by LCMS: [M+H]+=201.2


Step b.




embedded image


A solution of product from the previous step (0.88 g, 2.93 mmol, synthesis described in Org. Process Res. Dev. 2017, 21, 1145-1155), 2-phenyl-2-(4-piperidyl)ethanenitrile (0.65 g, 3.23 mmol), HATU (1.71 g, 4.40 mmol) in DMF (8.4 mL) was stirred at room temperature under nitrogen for 10 minutes followed by addition of DIEA (1.56 mL, 8.80 mmol). The resulting mixture was stirred at room temperature for 16 hours; it was quenched with water. The aqueous layer was extracted with ethyl acetate (2×100 mL). The combined organic layer was washed with brine, dried over Na2SO4, and filtered and then concentrated under reduced pressure. The residue was purified by normal phase liquid chromatography (Isco COMBIFLASH®, 0 to 100% ethyl acetate and hexane) to yield desired product as yellow foam. Yield 1.23 g, 87.3%. Ion found by LCMS: [M+H]+=481.0.


Step c.




embedded image


A solution of the product from the previous step (0.12 g, 0.25 mmol), PEG4 triazole (0.12 g, 0.37 mmol, described in Example 25, Int-20), K2CO3 (0.10 g, 0.75 mmol), water (0.22 mL, 12.47 mmol), CuI (14.5 mg, 0.075 mmol), trans-dimethylcyclohexyldiamine (28.9 mg, 0.20 mmol) in DMF (3 mL) was degassed 6 times with N2, then the mixture was heated at 100° C. for 4 hours. The reaction mixture was filtered and purified by reverse phase liquid chromatography (ACCQ, 10 to 45% of acetonitrile and water with 0.1% TFA as modifier). Yield 11.8 mg, 5.6%. Ion found by LCMS: [M+H]+=727.2.


Example 30. Synthesis of Int-25



embedded image


embedded image


Step a.


To a 0° C. stirring solution of Fmoc-N-(tert-butyloxycarbonylmethyl)-glycine (2.00 g, 4.861 mmol), 2-azidoethan-1-amine hydrochloride (626 mg, 5.104 mmol) and DIPEA (3.387 mL, 19.44 mmol) in DMF (10 mL) and DCM (10 mL), was added HATU (1.885 mg, 4.958 mmol). The temperature was raised to ambient and stirring was continued until complete as determined by LCMS. All the volatiles were removed per vacuum techniques. The residue was purified by silica column using an Isco COMBIFLASH® liquid chromatography eluted with 0% to 100% ethyl acetate in hexanes. Yield 2.26 g, 97% yield. Ions found by LCMS: [(M+H−t-Bu)]+=424.2.


Step b.


The product from step a (2.26 g, 4.713 mmol) was taken up in TEA (10 mL) and stirring was continued until the reaction was complete by LCMS. Volatiles were evaporated per vacuum techniques. The residue was purified by RP-C18 column using an Isco COMBIFLASH® liquid chromatography eluted with 0% to 100% water and methanol. Yield 990 mg, 45%. Ions found by LCMS: [(M+H)+Na]+=446.2; [(M+H]]+=424.2.


Step c.


To a 0° C. stirring solution of compound from step b (990 mg, 2.338 mmol), methyl-PEG12-amine (1.335 g, 2.385 mmol) and DIPEA (1.018 mL, 5.845 mmol) in DMF (5.0 mL) and DCM (5.0 mL), was added HATU (907 mg, 2.385 mmol). The temperature warmed to ambient and stirring was continued until the reaction was complete by LCMS. All volatiles were removed per vacuum techniques. The residue was purified by RP-C18 column using an Isco COMBIFLASH® liquid chromatography eluted with 0% to 100% water and methanol. Yield 1.704 g, 76%. Ions found by LCMS: [(M+H)]+=965.2, [(M+2H)/2]+=483.2.


Step d.


To a stirring solution of alkyne functionalized compound from Example 5, Int-2 (668 mg, 0.922 mmol), compound from step c (907 mg, 0.940 mmol), TBTA (51 mg, 0.097 mmol) and cupric sulfate (15 mg, 0.092 mmol) in ethanol (10 mL) and water (5 mL), was added sodium ascorbate (91 mg, 0.460 mmol). The desired product was formed, and was confirmed by LCMS data: {[(M+H)+2Na]+=867.8; [(M+H)+Na]+=856.5; [(M+H]]+=845.4, [(M+2H)/2]+=563.8}. Upon completion, copper scavenger SiliaMetS TAAcONa (205 mg, loading 0.45 mmol/g) was added and stirring was continued overnight.


The mixture was filtered and rinsed with ethanol. The resulting solution (about 20 mL) was treated with piperidine (1.0 mL) to remove the Fmoc group. Upon completion, all the volatiles were removed per vacuum techniques. The residue was purified by RP-C18 column using an Isco COMBIFLASH® liquid chromatography eluted with 0% to 100% water and methanol. Yield 696 mg, 51%. Ions found by LCMS: [(M+H]]+=734.4, [(M+2H)/2]+=490.0.


Step e.


To a 0° C. stirring solution of product from step d (255 mg, 0.174 mmol), propargyl-PEG4-acid (68 mg, 0.261 mmol) and DIPEA (121 uL, 0.695 mmol) in DMF (5.0 mL) and DCM (0.5 mL), was added HATU (67 mg, 0.177 mmol). The temperature was raised to ambient and stirring was continued until complete by LCMS. All the volatiles were removed per vacuum techniques. The residue was purified by RP-C18 column using an Isco COMBIFLASH® liquid chromatography eluted with 0% to 100% water and methanol. Yield 157 mg, 53% yield. Ions found by LCMS: [(M+2H)/2]+=855.8; [(M+3H)/3]+=570.8.


Example 31. Synthesis of Int-26



embedded image


Step a.




embedded image


Dibenzylpropylamine (2.0 g, 7.86 mmol), 3-bromopropanol (2.73 g, 19.66 mmol), and DIEA (4.11 mL, 23.6 mmol), were dissolved in ACN (10 mL), then heated in a 75° C. oil bath for 12 h. LCMS after 12 h shows mostly product. The crude reaction was concentrated and purified by RPLC 10% to 100% ACN/water with 0.1% TFA. Separation was poor. Yield of bis-TFA salt 4.57 g, 97%. Ion(s) observed by LCMS: (M+H)+ 371.2.


Step b.




embedded image


Product from the previous step (5.16 g, 8.62 mmol), Pd(OH)2 (2.0 g), and H2 from a balloon, were stirred for 2 h at room temperature, at which time LCMS showed complete conversion. The crude mixture was filtered through Celite, concentrated, and used in the next step without further purification. Yield 3.48 g, 96%. Ion(s) observed by LCMS: (M+H)+ 191.2.


Step c.




embedded image


Product from the previous step (3.48 g, 8.33 mmol), Cbz-protected aldehyde (1.73 g, 8.33 mmol), STAB (1.76 g, 8.33 mmol), and DIEA (1.45 mL, 8.33 mmol), were stirred in THF (20 mL) and MeOH (5 mL) at room temperature for 12 h. The crude reaction was concentrated, and purified by RPLC 5% to 100% ACN/water with 0.1% TFA as a modifier. Yield 2.09 g. 42% for two steps. Ion(s) observed by LCMS: (M+H)+ 382.3.


Step d.




embedded image


Product from the previous step (2.09 g, 3.43 mmol), propargyl-PEG4-mesylate (1.17 g, 3.77 mmol), and K2CO3 (1.89 g, 13.7 mmol), were stirred in acetonitrile (10 mL) at 70° C. for 72 h. LCMS at 72 h showed desired product and a significant amount of starting material. The crude reaction was filtered, acidified with TFA to pH 4-5, concentrated, and purified by RPLC 5% to 100% acetonitrile/water containing 0.1% TFA. 1.14 g of unreacted starting material was isolated along with 0.746 g of product (26% yield).


Step e.




embedded image


Cbz protected product from the previous step (0.700 g, 0.850 mmol), and thioanisole (2.51 mL, 21.2 mmol), dissolved in TFA (10 mL) were treated with TMS bromide at room temperature while stirring. LCMS after 2 minutes shows complete deprotection. The crude reaction was stripped of TFA, washed with hexanes (3×5 mL), and used without further purification. Crude yield of triple TFA salt was quantitative.


Step f.




embedded image


To a solution of triazole-acid intermediate (511 mg, 0.1 mmol, described in example 5, Int-2) and HATU (45.6 mg, 0.12 mmol) in anhydrous DMF (0.5 ml) was added 4-methylmorpholine (20.2 mg, 0.2 mmol). After stirring for 5 minutes, the solution was treated with a solution of propargyl-PEG linker from step e (160.7 mg, 0.2 mmol) in DMF (0.5 ml). The reaction mixture was stirred for 1 hour, then directly purified by HPLC (5% to 50% acetonitrile and water, 0.1% TFA). Yield 16.4 mg, 13.9%. Ions found by LCMS: [M+H]+=954.8, [(M+2H)/2]+=478.2.


Example 32. Synthesis of Int-27



embedded image


Step a.




embedded image


A flame-dried reaction flask was purged with nitrogen and charged with 2-pyridylacetonitrile (590.5 mg, 5 mmol) and anhydrous THF (5 ml). After cooling in a −78° C. bath, NaHMDS (1.0 M in THF, 10 mmol) was added slowly. The resulting mixture was stirred for 5 minutes under nitrogen, and then treated with 1-Boc-piperidine-4-one (996 mg, 5 mmol). The −78° C. bath was removed, and the reaction was stirred for 2.5 hours. It was then quenched by 10% NH4Cl (50 ml) and extracted with EtOAc (50 ml)/hexane (20 ml). The aqueous layer was back extracted by EtOAc (30 ml). The combined organic layers were dried over Na2SO4 and concentrated by rotary evaporation. The residue was purified through silica gel column chromatography (120 g, 5% to 60% EtOAc and hexane). Yield 970 mg, 64.8%. Ion found by LCMS: [M+H]+=300.2.


Step b.




embedded image


The product from step a (970 mg, 3.24 mmol) was dissolved in THF (5 ml) and treated with 4M HCl solution in dioxane (5 ml). The mixture was heated at 40° C. for 5 hours. The mixture was then concentrated by rotary evaporation, and excess HCl was further removed by precipitating the product in EtOAc (50 ml). The crude product was used without further purification. Ion found by LCMS: [M+H]+=200.2.


Step c.




embedded image


To a solution of bromo aza-indole (299.1 mg, 1 mmol, described in J. Med. Chem. 2018, 61(1):62-80) in anhydrous DMF (1 ml) was added dioxane (2 ml), methyl 1,2,4-triazole-3-carboxylate (381.3 mg, 3 mmol), K2CO3 (414.6 mg, 4 mmol), and anhydrous EtOH (4 ml). The resulting mixture was heated at 75° C. with nitrogen was bubbling through slowly. Trans-N,N′-dimethylcyclohexane-1,2-diamine (284.5 mg, 2 mmol) was added. Nitrogen was continued bubbling until all forming gas disappeared (˜3 minutes). CuI (390 mg, 2 mmol) was added, and the reaction was heated under nitrogen overnight. The solution was then cooled to room temperature and filtered through Celite into HCl solution (2 ml of 6N HCl and 20 ml water). The solid was washed with MeOH (20 ml). The filtrate was concentrated by rotary evaporation to solid. The residue was re-dissolved in MeOH, and the salt was filtered off. After concentrating, the crude product was purified by prep-HPLC (0% to 40% acetonitrile and water, using 0.1% TFA as modifier). Yield 64.7 mg, 18%. Ion found by LCMS″ [M+H]+=360.0


Step d.




embedded image


To a solution of the product from step c (64.7 mg, 0.18 mmol) and the product from step c (63.6 mg, 0.27 mmol) in anhydrous DMF (1 ml), was added HATU (83.6 mg, 0.22 mmol). After stirring to dissolve all HATU reagent, DIPEA (48.5 mg, 0.375 mmol) was added, and the reaction was stirred for 30 minutes. It was then directly purified by RPLC (50 g column, 10% to 100% acetonitrile and water, using, 0.1% TFA as a modifier). Yield 80 mg, 82.2%. Ion found by LCMS: {M+H]+=540.8.


Step e.




embedded image


The product from step d (80 mg, 0.148 mmol) was dissolved in MeOH:DCM (1:1, 2 ml) and cooled in an ice-water bath. 1 M LiOH solution (0.592 ml) and water (1 ml) were added. The reaction was stirred for 4 hours, then acidified by 4N HCl solution in dioxane (0.148 ml). The organic solvents were removed by rotary evaporation. The remaining aqueous layer was frozen and lyophilized. The crude product containing 4 eq NaCl was carried to the next step without further purification. Ion found by LCMS: [M+H]+=513.2.


Step f.




embedded image


To a solution of the step-e product (33.5 mg, 0.0449 mmol) and HATU (21.4 mg, 0.0562 mmol) in anhydrous DMF (1 ml) was added the PEG-piperazine linker (27.6 mg, 0.07 mmol, described in Example 5, Int-2) and 4-methylmorpholine (50.5 mg, 0.5 mmol). The resulting mixture was stirred at room temperature for 30 minutes, then directly purified by HPLC: 0% to 40% acetonitrile and water, using 0.1% TFA as a modifier. Yield 26.3 mg, 54.2%. Ions found by LCMS: [M+H]+=851.8, [(M+2H)/2]+=426.4.


Example 33. Screening of HIV Lead Compounds in an In Vitro Cell Fusion Assay

Activity of HIV compounds was determined in an assay designed to measure the inhibition of cell-cell fusion mediated by gp120 and CD4 interaction which is an important step in the HIV infection process. Briefly, this assay measures the fusion of two cell lines, HeLa-CD4-LTR-β-Gal (catalog #1294) and HL2/3 cells (catalog #1299), obtained from the AIDS Research Reagent and Reference Program (Rockville, Md.). HeLa-CD4-LTR-β-Gal cells were plated at a density of 5×103 cells per well in a volume of 50 μL, with 50 μL of nine serial half-logarithmic dilutions of compound in triplicate for one hour at 37° C./5% CO2. Following the incubation, 100 μL of HL2/3 cells were added to the plates. The cultures were incubated for an additional 48 hours at 37° C./5% CO2. Following the incubation, efficacy plates were evaluated for β-galactosidase production using a chemiluminescent substrate and toxicity plates were stained with XTT to evaluate cell viability.


In these studies cytotoxicity was also evaluated (TC50). Test materials were derived by measuring the reduction of the tetrazolium dye XTT (2,3-bis(2-methoxy-4-nitro-5-sulfophenyl)-5-[(phenylamino)carbonyl]-2H-tetrazolium hydroxide). XTT in metabolically active cells is metabolized by the mitochondrial enzyme NADPH oxidase to a soluble formazan product. XTT solution was prepared daily as a stock of 1 mg/mL in RPMI-1640 without additives. Phenazine methosulfate (PMS) solution was prepared at 0.15 mg/mL in DPBS and stored in the dark at −20° C. XTT/PMS stock was prepared immediately before use by adding 40 μL of PMS per mL of XTT solution. Fifty L (50 μL) of XTT/PMS was added to each well of the plate and the plate incubated for 4 hours at 37° C. The 4 hour incubation has been empirically determined to be within the linear response range for XTT dye reduction with the indicated numbers of cells for each assay. The plates were sealed and inverted several times to mix the soluble formazan product and the plate was read at 450 nm (650 nm reference wavelength) with a Molecular Devices SpectraMax Plus 384 96 well plate format spectrophotometer.


In this assay 7 Ints and 5 conjugates were run in with 2 control compounds (Chicago Sky Blue (CSB) and Enfuvirtide) know to prevent cell fusion mediated by binding of viral gp120 to the host cell receptor (CD4). CSB and Enfuvirtide had EC50 values of 201 and 409 nM, indicating moderate inhibition of cell fusion. In contrast, the 7 Ints ranged from 1.56 to 72.8 nM. Collectively, the Ints tested in this assay have significantly more activity than Enfuvirtide, an approved HIV therapeutic. Importantly, conjugate 5 was highly potent with an EC50 value of 3.62 nM. With the exception of conjugate 014 (EC50 value of 406 nM) the other AVCs were >500 nM. However, it is worth noting that although the EC50 for these compounds was greater than 500 nM a signal was detected at this concentration, suggesting the true value is not much higher than 500 nM. Most critically is the observation that at least one of these chemical series could be conjugated to an hIgG1 Fc and retain potent activity (Int-17 conjugate, conjugate 5). Lastly, no test articles showed cytotoxicity at the concentrations tested in this study.









TABLE 7







Activity of lead compounds in a cell fusion


assay (EC50) and cytotoxicity (TC50).











Test article
EC50 (nM)
TC50 (nM) (Tox)















Chicago Sky Blue
201
>10,000



Enfuvirtide
409
>1,000



Int-17
1.7
>500



Int-18
1.56
>500



Int-19
8.03
>500



Int-20
72.8
>500



Int-22
4.94
>500



Int-25
43.1
>500



Conjugate 5
3.62
498



Conjugate 9
>500
>500



Conjugate 10
>500
>500



Conjugate 11
>500
>500



Conjugate 12
406
>500










Example 34. Synthesis of a Conjugate Including an Fc Domain Having a C220S/YTE Quadruple Mutation

Preparation of the Click reagent solution: 0.0050M CuSO4 in PBS buffer solution: 10.0 mg CuSO4 was dissolved in 12.53 mL PBS, then took 5.00 mL this CuSO4 solution and added 43.1 mg BTTAA (CAS #1334179-85-9) and 247.5 mg sodium ascorbate to give the Click reagent solution (0.0050M CuSO4, 0.020M BTTAA and 0.25M sodium ascorbate).


To a solution of azido functionalized Fc having a C220S mutation and a YTE mutation (65.5 mg, 10.0 mL, 1.13 μmol, azido DAR-5.9, SEQ ID NO: 67) in a 15 mL centrifuge tube was added to an alkyne derivatized small molecule (3.0 equivalents per each azido of the Fc). After gently agitating to dissolve all solids, the mixture was treated with the Click reagent solution (1.80 mL). The resulting mixture was gently rotated for 12 hours at ambient temperature. It was purified by affinity chromatography over a protein A column, followed size exclusion chromatography (see general conjugate purification protocol). Maldi TOF analysis of the purified final product gave an average mass of 66,420 Da (DAR=5.8). Yield 57 mg with 98% purity.


Example 35. 30-Day Comparative Non-Human Primate PK Study Following IV Administration of a Conjugate Including an Fc Domain Having a C220S/YTE Quadruple Mutant

A conjugate including an Fc domain having a C220S mutation and a YTE mutation (SEQ ID NO: 67) was synthesized as described in Example 34. A non-human primate PK study was performed to compare IV administration of the C220S/YTE Fc conjugate (SEQ ID NO: 67) to a conjugate including an Fc domain having a C220S mutation alone (SEQ ID NO: 64).


Non-human primate (NHP) PK studies were performed by BTS Research (San Diego, Calif.) using male and female cynomolgus monkeys 5-9 years old with body weights ranging from 3.5-8.5 kg. NHPs were injected IV with 2 mg/kg of test article (0.4 mL/kg dose volume). Animals were housed under standard IACUC approved housing conditions. At appropriate times animals were non-terminally bled (via femoral or cephalic veins) with blood collected in K2EDTA tubes to prevent coagulation. Collected blood was centrifuged (2,000×g, for 10 minutes) and plasma withdrawn for analysis of test article concentrations over time. The plasma concentrations for the C220S/YTE Fc conjugate and the C220S conjugate at each time point were measured by sandwich ELISA. Briefly, test articles were captured on Fc-coated plates and then detected using a HRP-conjugated anti-human IgG-Fc antibody. Protein concentrations were calculated in GraphPad Prism using 4PL non-linear regression of the C220S/YTE Fc conjugate or C220S conjugate standard curves. A more detailed method description is provided above. The corresponding curves are shown in FIG. 13. The C220S/YTE Fc conjugate demonstrates a significantly improved terminal half-life of −45 days compared with −10 days for the C220S Fc conjugate. AUCs for the C220S/YTE Fc conjugate are 2× greater than the AUCs for The C220S conjugate (Table 8).









TABLE 8





Monkey PK, C220S/YTE Fc conjugate vs. C220S Fc conjugate

















Time (hr)














0.25
4
8
24
72
120











Dose



Conc


(mg/kg)
Route
Conjugate

(ug/mL)



















2
IV
C220S
Mean
32.6
24.8
20.1
14.1
9.97
7.61


2
IV
C220S/YTE
Mean
35.4
29
25.7
20.5
15.1
13
















Time (hr)

















168
240
336
672














Conc
Tmax
Cmax
AUClast
Half-life



(ug/mL)
(hr)
(ug/mL)
(hr*ug/mL)
(hr)




















6.33
4.47
3.62
1.47
0.25
32.6
3450
249



11.2
10.4
8.71
7.97
0.25
35.4
7210
1080










Example 36. Screening of HIV Compounds in an In Vitro Cell Fusion Assay

Activity of HIV compounds was determined in an assay designed to measure the inhibition of cell-cell fusion mediated by gp120 and CD4 interaction which is an important step in the HIV infection process. Briefly, this assay measures the fusion of two cell lines, HeLa-CD4-LTR-β-Gal (catalog #1294) and HL2/3 cells (catalog #1299), obtained from the AIDS Research Reagent and Reference Program (Rockville, Md.). HeLa-CD4-LTR-β-Gal cells were plated at a density of 5×103 cells per well in a volume of 50 μL, with 50 μL of nine serial half-logarithmic dilutions of compound in triplicate for one hour at 37° C./5% CO2. Following the incubation, 100 μL of HL2/3 cells were added to the plates. The cultures were incubated for an additional 48 hours at 37° C./5% CO2. Following the incubation, efficacy plates were evaluated for β-galactosidase production using a chemiluminescent substrate and toxicity plates were stained with XTT to evaluate cell viability.


In these studies, cytotoxicity was also evaluated (TC50). Test materials were derived by measuring the reduction of the tetrazolium dye XTT (2,3-bis(2-methoxy-4-nitro-5-sulfophenyl)-5-[(phenylamino)carbonyl]-2H-tetrazolium hydroxide). XTT in metabolically active cells is metabolized by the mitochondrial enzyme NADPH oxidase to a soluble formazan product. XTT solution was prepared daily as a stock of 1 mg/mL in RPMI-1640 without additives. Phenazine methosulfate (PMS) solution was prepared at 0.15 mg/mL in DPBS and stored in the dark at −20° C. XTT/PMS stock was prepared immediately before use by adding 40 μL of PMS per mL of XTT solution. Fifty μL (50 μL) of XTT/PMS was added to each well of the plate and the plate incubated for 4 hours at 37° C. The 4 hour incubation has been empirically determined to be within the linear response range for XTT dye reduction with the indicated numbers of cells for each assay. The plates were sealed and inverted several times to mix the soluble formazan product and the plate was read at 450 nm (650 nm reference wavelength) with a Molecular Devices SpectraMax Plus 384 96 well plate format spectrophotometer.


In this assay 5 Ints and 4 conjugates were run with 2 control compounds (Chicago Sky Blue (CSB) and Enfuvirtide) know to prevent cell fusion mediated by binding of viral gp120 to the host cell receptor (CD4). CSB and Enfuvirtide had EC50 values of 781 and 358 nM, indicating moderate inhibition of cell fusion. In contrast, the 5 Ints tested ranged from <0.0051 to 70.1 nM. Collectively, the Ints tested in this assay show significantly more activity than Enfuvirtide, an approved HIV therapeutic. The four conjugates also demonstrated acceptable activity with the exception of Conjugate 14.


Lastly, no test articles showed cytotoxicity at the concentrations tested in this study.









TABLE 9







Activity of lead compounds in a cell fusion


assay (EC50) and cytotoxicity (TC50)















Therapeutic



COMPOUND
EC50 (nM)
TC50
Index (TI)
















Chicago Sky Blue
781
>10,000
>12.8



(ng/mL)



Enfuvirtide
358
>1,000
>2.8



Int-18
<0.0051
>2000
>392,157



Int-22
0.732
>2000
>2732



Int-25
70.1
>2000
>28.5



Int-26
7.6
>2000
>263



Int-27
20.8
>2000
>96.2



Conjugate 9
795
>2000
>2.5



Conjugate 12
746
>2000
>2.7



Conjugate 13
345
>2000
>5.8



Conjugate 14
>2000
>2000
N.D.









Claims
  • 1. A conjugate described by any one of formulas (D-I), (M-I), (1), or (2):
  • 2. The conjugate of claim 1, wherein each A1 and each A2 is independently described by formula (A-I).
  • 3. The conjugate of claim 2, wherein each A1 and each A2 is independently described by any one of formulas (A-Ia)-(A-Ih):
  • 4. The conjugate of claim 3, wherein each A1 and each A2 is independently described by any one of formulas (A-Ia-i)-(A-Ih-i):
  • 5. The conjugate of claim 1, wherein each A1 and each A2 is independently described by any one of formulas (A-Ii)-(A-Ip):
  • 6. The conjugate of claim 1, wherein each A1 and each A2 is independently described by any one of formulas (A-Iq)-(A-Ix):
  • 7. The conjugate of claim 6, wherein each A1 and each A2 is independently described by any one of formulas (A-Iq-i)-(A-Ix-i):
  • 8. The conjugate of claim 1, wherein each A1 and each A2 is independently described by any one of formulas (A-Iaa)-(A-Ihh):
  • 9. The conjugate of claim 1, wherein each A1 and each A2 is independently described by any one of formulas (A-Iii)-(A-Ipp):
  • 10. The conjugate of claim 9, wherein each A1 and each A2 is independently described by any one of formulas (A-Iaa-i)-(A-Ihh-i):
  • 11. The conjugate of claim 1, wherein each A1 and each A2 is independently described by any one of formulas (A-IIa)-(A-IId):
  • 12. The conjugate of claim 11, wherein each A1 and each A2 is independently described by any one of formulas (A-IIa-i)-(A-IId-i):
  • 13. The conjugate of claim 1, wherein the conjugate is described by formula (D-I):
  • 14. The conjugate of claim 13, wherein the conjugate is described by formula (D-II):
  • 15. The conjugate of claim 14, wherein the conjugate is described by formula (D-III):
  • 16. The conjugate of claim 15, wherein the conjugate is described by formula (D-III-1):
  • 17. The conjugate of claim 16, wherein the conjugate is described by formula (D-III-2):
  • 18. The conjugate of claim 17, wherein L′ is a nitrogen atom.
  • 19. The conjugate of claim 15, wherein the conjugate is described by formula (D-III-3):
  • 20. The conjugate of claim 19, wherein the conjugate is described by formula (D-III-4):
  • 21. The conjugate of claim 20, wherein L′ is a nitrogen atom.
  • 22. The conjugate of claim 15, wherein the conjugate is described by formula (D-III-5):
  • 23. The conjugate of claim 22, wherein the conjugate is described by formula (D-III-6):
  • 24. The conjugate of claim 23, wherein L′ is a nitrogen atom.
  • 25. The conjugate of claim 14, wherein the conjugate is described by formula (D-IV):
  • 26. The conjugate of claim 25, wherein the conjugate is described by formula (D-V-1):
  • 27. The conjugate of claim 26, wherein the conjugate is described by formula (D-IV-2):
  • 28. The conjugate of claim 27, wherein L′ is a nitrogen atom.
  • 29. The conjugate of claim 25, wherein the conjugate is described by formula (D-IV-3):
  • 30. The conjugate of claim 29, wherein the conjugate is described by formula (D-IV-4):
  • 31. The conjugate of claim 30, wherein L′ is a nitrogen atom.
  • 32. The conjugate of claim 25, wherein the conjugate is described by formula (D-IV-5):
  • 33. The conjugate of claim 32, wherein the conjugate is described by formula (D-IV-6):
  • 34. The conjugate of claim 33, wherein L′ is a nitrogen atom.
  • 35. The conjugate of claim 14, wherein the conjugate is described by formula (D-V):
  • 36. The conjugate of claim 35, wherein the conjugate is described by formula (D-V-1):
  • 37. The conjugate of claim 36, wherein the conjugate is described by formula (D-V-2):
  • 38. The conjugate of claim 37, wherein L′ is a nitrogen atom.
  • 39. The conjugate of claim 35, wherein the conjugate is described by formula (D-V-3):
  • 40. The conjugate of claim 39, wherein the conjugate is described by formula (D-V-4):
  • 41. The conjugate of claim 40, wherein L′ is a nitrogen atom.
  • 42. The conjugate of claim 35, wherein the conjugate is described by formula (D-V-5):
  • 43. The conjugate of claim 42, wherein the conjugate is described by formula (D-V-6):
  • 44. The conjugate of claim 43, wherein L′ is a nitrogen atom.
  • 45. The conjugate of claim 14, wherein the conjugate is described by formula (D-VI):
  • 46. The conjugate of claim 45, wherein the conjugate is described by formula (D-VI-1):
  • 47. The conjugate of claim 46, wherein the conjugate is described by formula (D-VI-2):
  • 48. The conjugate of claim 47, wherein L′ is a nitrogen atom.
  • 49. The conjugate of claim 45, wherein the conjugate is described by formula (D-VI-3):
  • 50. The conjugate of claim 49, wherein the conjugate is described by formula (D-VI-4):
  • 51. The conjugate of claim 50, wherein L′ is a nitrogen atom.
  • 52. The conjugate of claim 45, wherein the conjugate is described by formula (D-VI-5):
  • 53. The conjugate of claim 52, wherein the conjugate is described by formula (D-VI-6):
  • 54. The conjugate of claim 53, wherein L′ is a nitrogen atom.
  • 55. The conjugate of claim 13, wherein the conjugate is described by formula (D-VII):
  • 56. The conjugate of claim 55, wherein the conjugate is described by formula (D-VIII):
  • 57. The conjugate of claim 56, wherein the conjugate is described by formula (D-VIII-1):
  • 58. The conjugate of claim 57, wherein L′ is a nitrogen atom.
  • 59. The conjugate of claim 55, wherein the conjugate is described by formula (D-IX):
  • 60. The conjugate of claim 59, wherein the conjugate is described by formula (D-IX-1):
  • 61. The conjugate of claim 60, wherein L′ is a nitrogen atom.
  • 62. The conjugate of claim 55, wherein the conjugate is described by formula (D-X):
  • 63. The conjugate of claim 62, wherein the conjugate is described by formula (D-X-1):
  • 64. The conjugate of claim 63, wherein L′ is a nitrogen atom.
  • 65. The conjugate of claim 55, wherein the conjugate is described by formula (D-XI):
  • 66. The conjugate of claim 65, wherein the conjugate is described by formula (D-XI-1):
  • 67. The conjugate of claim 66, wherein L′ is a nitrogen atom.
  • 68. The conjugate of claim 13, wherein the conjugate is described by formula (D-XII):
  • 69. The conjugate of claim 68, wherein the conjugate is described by formula (D-XII-1):
  • 70. The conjugate of claim 69, wherein the conjugate is described by formula (D-XII-2):
  • 71. The conjugate of claim 13, wherein the conjugate is described by formula (D-XIII):
  • 72. The conjugate of claim 71, wherein the conjugate is described by formula (D-XIII-1):
  • 73. The conjugate of claim 72, wherein the conjugate is described by formula (D-XIII-2):
  • 74. The conjugate of claim 1, wherein the conjugate is described by formula (D-I):
  • 75. The conjugate of claim 74, wherein the conjugate is described by formula (D-XIV):
  • 76. The conjugate of claim 75, wherein the conjugate is described by formula (D-XIV-1):
  • 77. The conjugate of claim 76, wherein the conjugate is described by formula (D-XIV-2):
  • 78. The conjugate of claim 76, wherein the conjugate is described by formula (D-XIV-3):
  • 79. The conjugate of claim 76, wherein the conjugate is described by formula (D-XIV-4):
  • 80. The conjugate of claim 76, wherein the conjugate is described by formula (D-XIV-5):
  • 81. The conjugate of claim 74, wherein the conjugate is described by formula (D-XV):
  • 82. The conjugate of claim 81, wherein the conjugate is described by formula (D-XV-1):
  • 83. The conjugate of claim 82, wherein the conjugate is described by formula (D-XV-2):
  • 84. The conjugate of claim 82, wherein the conjugate is described by formula (D-XV-3):
  • 85. The conjugate of claim 82, wherein the conjugate is described by formula (D-XV-4):
  • 86. The conjugate of claim 82, wherein the conjugate is described by formula (D-XV-5):
  • 87. The conjugate of claim 74, wherein the conjugate is described by formula (D-XVI):
  • 88. The conjugate of claim 86, wherein the conjugate is described by formula (D-XVI-1):
  • 89. The conjugate of claim 88, wherein the conjugate is described by formula (D-XVI-2):
  • 90. The conjugate of claim 88, wherein the conjugate is described by formula (D-XVI-3):
  • 91. The conjugate of claim 88, wherein the conjugate is described by formula (D-XVI-4):
  • 92. The conjugate of claim 88, wherein the conjugate is described by formula (D-XVI-5):
  • 93. The conjugate of claim 74, wherein the conjugate is described by formula (D-XVII):
  • 94. The conjugate of claim 93, wherein the conjugate is described by formula (D-XVII-1):
  • 95. The conjugate of claim 94, wherein the conjugate is described by formula (D-XVII-2):
  • 96. The conjugate of claim 94, wherein the conjugate is described by formula (D-XVII-3):
  • 97. The conjugate of claim 94, wherein the conjugate is described by formula (D-XVII-4):
  • 98. The conjugate of claim 94, wherein the conjugate is described by formula (D-XVII-5):
  • 99. The conjugate of any one of claims 1-98, wherein L or L′ comprises one or more optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C6-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, optionally substituted C3-C15 heteroarylene, O, S, NRi, P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino, wherein Ri is H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C3-C15 heteroaryl.
  • 100. The conjugate of claim 99, wherein the backbone of L or L′ consists of one or more optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, optionally substituted C3-C15 heteroarylene, O, S, NRi, P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino, wherein Ri is H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C3-C15 heteroaryl.
  • 101. The conjugate of claim 99 or 100, wherein L or L′ is oxo substituted.
  • 102. The conjugate of any one of claims 1-101, wherein the backbone of L or L′ comprises no more than 250 atoms.
  • 103. The conjugate of any one of claims 1-102, wherein L or L′ is capable of forming an amide, a carbamate, a sulfonyl, or a urea linkage.
  • 104. The conjugate of any one of claims 1-98, wherein L or L′ is a bond.
  • 105. The conjugate of any one of claims 1-98, wherein L or L′ is an atom.
  • 106. The conjugate of any one of claims 1-105 wherein each L is described by formula (D-L-I):
  • 107. The conjugate of claim 106, wherein L is selected from
  • 108. The conjugate of claim 1, wherein the conjugate is described by formula (M-I):
  • 109. The conjugate of claim 108, wherein the conjugate is described by formula (M-II):
  • 110. The conjugate of claim 109, wherein the conjugate is described by formula (M-III):
  • 111. The conjugate of claim 110, wherein the conjugate is described by formula (M-III-1):
  • 112. The conjugate of claim 111, wherein the conjugate is described by formula (M-III-2):
  • 113. The conjugate of claim 112, wherein L′ is a nitrogen atom.
  • 114. The conjugate of claim 110, wherein the conjugate is described by formula (M-III-3):
  • 115. The conjugate of claim 114, wherein the conjugate is described by formula (M-III-4):
  • 116. The conjugate of claim 115, wherein L′ is a nitrogen atom.
  • 117. The conjugate of claim 110, wherein the conjugate is described by formula (M-III-5):
  • 118. The conjugate of claim 117, wherein the conjugate is described by formula (M-III-6):
  • 119. The conjugate of claim 118, wherein L′ is a nitrogen atom.
  • 120. The conjugate of claim 109, wherein the conjugate is described by formula (M-IV):
  • 121. The conjugate of claim 120, wherein the conjugate is described by formula (M-IV-1):
  • 122. The conjugate of claim 121, wherein the conjugate is described by formula (M-IV-2):
  • 123. The conjugate of claim 122, wherein L′ is a nitrogen atom.
  • 124. The conjugate of claim 120, wherein the conjugate is described by formula (M-IV-3):
  • 125. The conjugate of claim 124, wherein the conjugate is described by formula (M-IV-4):
  • 126. The conjugate of claim 125, wherein L′ is a nitrogen atom.
  • 127. The conjugate of claim 120, wherein the conjugate is described by formula (M-IV-5):
  • 128. The conjugate of claim 127, wherein the conjugate is described by formula (M-IV-6):
  • 129. The conjugate of claim 128, wherein L′ is a nitrogen atom.
  • 130. The conjugate of claim 109, wherein the conjugate is described by formula (M-V):
  • 131. The conjugate of claim 130, wherein the conjugate is described by formula (M-V-1):
  • 132. The conjugate of claim 131, wherein the conjugate is described by formula (M-V-2):
  • 133. The conjugate of claim 132, wherein L′ is a nitrogen atom.
  • 134. The conjugate of claim 130, wherein the conjugate is described by formula (M-V-3):
  • 135. The conjugate of claim 132, wherein the conjugate is described by formula (M-V-4):
  • 136. The conjugate of claim 135, wherein L′ is a nitrogen atom.
  • 137. The conjugate of claim 130, wherein the conjugate is described by formula (M-V-5):
  • 138. The conjugate of claim 137, wherein the conjugate is described by formula (M-V-6):
  • 139. The conjugate of claim 138, wherein L′ is a nitrogen atom.
  • 140. The conjugate of claim 109, wherein the conjugate is described by formula (M-VI):
  • 141. The conjugate of claim 140, wherein the conjugate is described by formula (M-VI-1):
  • 142. The conjugate of claim 141, wherein the conjugate is described by formula (M-VI-2):
  • 143. The conjugate of claim 142, wherein L′ is a nitrogen atom.
  • 144. The conjugate of claim 140, wherein the conjugate is described by formula (M-VI-3):
  • 145. The conjugate of claim 144, wherein the conjugate is described by formula (M-VI-4):
  • 146. The conjugate of claim 145, wherein L′ is a nitrogen atom.
  • 147. The conjugate of claim 140, wherein the conjugate is described by formula (M-VI-5):
  • 148. The conjugate of claim 147, wherein the conjugate is described by formula (M-VI-6):
  • 149. The conjugate of claim 148, wherein L′ is a nitrogen atom.
  • 150. The conjugate of claim 108, wherein the conjugate is described by formula (M-VII):
  • 151. The conjugate of claim 150, wherein the conjugate is described by formula (M-VIII):
  • 152. The conjugate of claim 151, wherein the conjugate is described by formula M-VIII-1):
  • 153. The conjugate of claim 152, wherein L′ is a nitrogen atom.
  • 154. The conjugate of claim 150, wherein the conjugate is described by formula (M-IX):
  • 155. The conjugate of claim 154, wherein the conjugate is described by formula (M-IX-1):
  • 156. The conjugate of claim 155, wherein L′ is a nitrogen atom.
  • 157. The conjugate of claim 150, wherein the conjugate is described by formula (M-X):
  • 158. The conjugate of claim 157, wherein the conjugate is described by formula (M-X-1):
  • 159. The conjugate of claim 158, wherein L′ is a nitrogen atom.
  • 160. The conjugate of claim 150, wherein the conjugate is described by formula (M-XI):
  • 161. The conjugate of claim 160, wherein the conjugate is described by formula (M-XI-1):
  • 162. The conjugate of claim 161, wherein L′ is a nitrogen atom.
  • 163. The conjugate of claim 108, wherein the conjugate is described by formula (M-XII):
  • 164. The conjugate of claim 163, wherein the conjugate is described by formula (M-XII-1):
  • 165. The conjugate of claim 164, wherein the conjugate is described by formula (M-XII-2):
  • 166. The conjugate of claim 108, wherein the conjugate is described by formula (M-XIII):
  • 167. The conjugate of claim 166, wherein the conjugate is described by formula (M-XIII-1):
  • 168. The conjugate of claim 167, wherein the conjugate is described by formula (M-XIII-2):
  • 169. The conjugate of claim 1, wherein the conjugate is described by formula (M-I):
  • 170. The conjugate of claim 169, wherein the conjugate is described by formula (M-XIV):
  • 171. The conjugate of claim 170, wherein the conjugate is described by formula (M-XIV-1):
  • 172. The conjugate of claim 171, wherein the conjugate is described by formula (M-XIV-2):
  • 173. The conjugate of claim 171, wherein the conjugate is described by formula (M-XIV-3):
  • 174. The conjugate of claim 171, wherein the conjugate is described by formula (M-XIV-4):
  • 175. The conjugate of claim 171, wherein the conjugate is described by formula (M-XIV-5):
  • 176. The conjugate of claim 169, wherein the conjugate is described by formula (M-XV):
  • 177. The conjugate of claim 176, wherein the conjugate is described by formula (M-XV-1):
  • 178. The conjugate of claim 177, wherein the conjugate is described by formula (M-XV-2):
  • 179. The conjugate of claim 177, wherein the conjugate is described by formula (M-XV-3):
  • 180. The conjugate of claim 177, wherein the conjugate is described by formula (M-XV-4):
  • 181. The conjugate of claim 177, wherein the conjugate is described by formula (M-XV-5):
  • 182. The conjugate of claim 169, wherein the conjugate is described by formula (M-XVI):
  • 183. The conjugate of claim 182, wherein the conjugate is described by formula (M-XVI-1):
  • 184. The conjugate of claim 183, wherein the conjugate is described by formula (M-XVI-2):
  • 185. The conjugate of claim 183, wherein the conjugate is described by formula (M-XVI-3):
  • 186. The conjugate of claim 183, wherein the conjugate is described by formula (M-XVI-4):
  • 187. The conjugate of claim 183, wherein the conjugate is described by formula (M-XVI-5):
  • 188. The conjugate of claim 169, wherein the conjugate is described by formula (M-XVII):
  • 189. The conjugate of claim 188, wherein the conjugate is described by formula (M-XVII-1):
  • 190. The conjugate of claim 189, wherein the conjugate is described by formula (M-XVII-2):
  • 191. The conjugate of claim 189, wherein the conjugate is described by formula (M-XVII-3):
  • 192. The conjugate of claim 189, wherein the conjugate is described by formula (M-XVII-4):
  • 193. The conjugate of claim 189, wherein the conjugate is described by formula (M-XVII-5):
  • 194. The conjugate of any one of claims 108-193, wherein L or L′ comprises one or more optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, optionally substituted C3-C15 heteroarylene, O, S, NRi, P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino, wherein Ri is H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C3-C15 heteroaryl.
  • 195. The conjugate of claim 194, wherein the backbone of L or L′ consists of one or more optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, optionally substituted C3-C15 heteroarylene, O, S, NRi, P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino, wherein Ri is H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C3-C15 heteroaryl.
  • 196. The conjugate of claim 194 or 195, wherein L or L′ is oxo substituted.
  • 197. The conjugate of any one of claims 108-196, wherein the backbone of L or L′ comprises no more than 250 atoms.
  • 198. The conjugate of any one of claims 108-197, wherein L or L′ is capable of forming an amide, a carbamate, a sulfonyl, or a urea linkage.
  • 199. The conjugate of any one of claims 108-197, wherein L or L′ is a bond.
  • 200. The conjugate of any one of claims 108-197, wherein L or L′ is an atom.
  • 201. The conjugate of any one of claims 108-200, wherein each L is described by formula (M-L-1): J1-(Q1)g-(T1)h-(Q2)i-(T2)j-(Q3)k-(T3)l-(Q4)m-(T4)n-(Q5)o-J2 wherein J1 is a bond attached A1;J2 is a bond attached to E;each of Q1, Q2, Q3, Q4 and Q5 is, independently, optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, or optionally substituted C3-C15 heteroarylene;each of T1, T2, T3, T4 is, independently, O, S, NRi, P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino;Ri is H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C3-C15 heteroaryl; and each of g, h, i, j, k, l, m, n, and o is, independently, 0 or 1.
  • 202. The conjugate of any one of claims 1-201, wherein the squiggly line connected to E indicates that the L of each A1-L or each A1-L-A2 is covalently attached to a nitrogen atom of a solvent-exposed lysine of E.
  • 203. The conjugate of any one of claims 1-201, wherein the squiggly line connected to E indicates that the L of each A1-L or each A1-L-A2 is covalently attached to the sulfur atom of a solvent-exposed cysteine of E.
  • 204. The conjugate of any one of claims 1-203, wherein each E is an Fc domain monomer.
  • 205. The conjugate of claim 204, wherein n is 2, and each E dimerizes to form an Fc domain.
  • 206. The conjugate of claim 13, wherein n is 2, each E is an Fc domain monomer, each E dimerizes to form an Fc domain, and the conjugate is described by formula (D-I-1):
  • 207. The conjugate of claim 108, wherein n is 2, each E is an Fc domain monomer, each E dimerizes to form an Fc domain, and the conjugate is described by formula (M-I-1):
  • 208. The conjugate of any one of claims 1-207, wherein each E has the sequence of any one of SEQ ID NOs: 1-95.
  • 209. The conjugate of any one of claims 1-208, wherein T is 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10.
  • 210. A population of conjugates of any one of claims 1-208, wherein the average value of T is 1 to 10.
  • 211. A population of conjugates of claim 210, wherein the average value of T is 1 to 5.
  • 212. A pharmaceutical composition comprising a conjugate of any of claims 1-209, or a population of conjugates of claim 210 or 211, or a pharmaceutically acceptable salt thereof, and a pharmaceutically acceptable excipient.
  • 213. A method for the treatment of a subject having a viral infection or presumed to have a viral infection, the method comprising administering to the subject an effective amount of a conjugate of any of claims 1-209, a population of conjugates of claim 210 or 211, or a composition of claim 212.
  • 214. A method for the prophylactic treatment of a viral infection in a subject in need thereof, the method comprising administering to the subject an effective amount of a conjugate of any of claims 1-209, a population of conjugates of claim 210 or 211, or a composition of claim 212.
  • 215. The method of claim 213 or 214, wherein the viral infection is caused by human immunodeficiency virus (HIV).
  • 216. The method of claim 215, wherein the HIV is HIV-1 or HIV-2.
  • 217. The method of any one of claims 213-216, wherein the subject is immunocompromised.
  • 218. The method of any one of claims 213-217, wherein the subject has been diagnosed with humoral immune deficiency, T cell deficiency, neutropenia, asplenia, or complement deficiency.
  • 219. The method of any one of claims 213-218, wherein the subject is being treated or is about to be treated with an immunosuppressive therapy.
  • 220. The method of any one of claims 213-219, wherein said subject has been diagnosed with a disease which causes immunosuppression.
  • 221. The method of claim 220, wherein the disease is cancer.
  • 222. The method of claim 221, wherein the cancer is leukemia, lymphoma, or multiple myeloma.
  • 223. The method of any one of claims 213-222, wherein the subject has undergone or is about to undergo hematopoietic stem cell transplantation.
  • 224. The method of any one of claims 213-223, wherein the subject has undergone or is about to undergo an organ transplant.
  • 225. The method of any one of claims 213-224, wherein the conjugate of composition is administered intramuscularly, intravenously, intradermally, intraarterially, intraperitoneally, intralesionally, intracranially, intraarticularly, intraprostatically, intrapleurally, intratracheally, intranasally, intravitreally, intravaginally, intrarectally, topically, intratumorally, peritoneally, subcutaneously, subconjunctival, intravesicularlly, mucosally, intrapericardially, intraumbilically, intraocularally, orally, locally, by inhalation, by injection, or by infusion.
  • 226. The method of any one of claims 213-225, wherein the subject is treated with a second therapeutic agent.
  • 227. The method of claim 226, wherein the second therapeutic agent is an antiviral agent.
  • 228. The method of claim 227, wherein the antiviral agent is selected from an integrase inhibitor, a nucleoside reverse transcriptase inhibitor (NRTI), a non-nucleoside reverse transcriptase inhibitor (NNRTI), a protease inhibitor, an inhibitor of viral entry, a CCR5 antagonist, or a CYP3A inhibitor.
  • 229. The method of claim 228, wherein the integrase inhibitor is selected from dolutegravir, elvitegravir, or raltegravir.
  • 230. The method of claim 228, wherein the nucleoside reverse transcriptase inhibitor (NRTI) is selected from abacavir, lamivudine, zidovudine, emtricitabine, tenofovir, emtricitabine, didanosine, or stavudine.
  • 231. The method of claim 228, wherein the non-nucleoside reverse transcriptase inhibitor (NNRTI) is selected from efavirenz, etravirine, nevirapine, rilpivirine, or delavirdine.
  • 232. The method of claim 228, wherein the protease inhibitor is selected from atazanavir, cobicistat, darunavir, cobicistat, lopinavir, ritonavir, fosamprenavir, tipranavir, nelfinavir, indinavir, or saquinavir.
  • 233. The method of claim 228, wherein the inhibitor of viral entry is enfuvirtide.
  • 234. The method of claim 228, wherein the CCR5 antagonist is maraviroc.
  • 235. The method of claim 228, wherein the CYP3A inhibitor is cobicistat or ritonavir.
PCT Information
Filing Document Filing Date Country Kind
PCT/US2020/037601 6/12/2020 WO
Provisional Applications (6)
Number Date Country
62984557 Mar 2020 US
62970549 Feb 2020 US
62959557 Jan 2020 US
62973790 Oct 2019 US
62897952 Sep 2019 US
62861148 Jun 2019 US