The instant application contains a Sequence Listing which has been submitted electronically in ASCII format and is hereby incorporated by reference in its entirety. Said ASCII copy, created on Sep. 3, 2018, is named MTV-145_01_Sequence_listing.txt and is 10,384 bytes in size.
Post-translational modifications greatly expand the function of proteins. The diversity of potentially reactive functional groups present in biomolecules (e.g., amides, acids, alcohols, amines) combined with the requirement for fast kinetics and mild reaction conditions (e.g., aqueous solvent, pH 6-8, T<37° C.) sets a high bar for the development of new techniques to functionalize proteins. While certain methods have emerged for bioconjugation of natural and unnatural amino acids in protein molecules, functionalization of cysteine residues has remained a challenge. Cysteine is a key residue for the chemical modification of proteins owing to (1) the unique reactivity of the thiol functional group and (2) the low abundance of cysteine residues in naturally occurring proteins.
Cysteine functionalization, and more generally, thiol modification, is an important tool in the chemical, biological, medical, and material sciences. As the only thiol-containing amino acid, cysteine is typically exploited for protein modification using thiol-based reactions. There currently exist several chemical modification techniques allowing for cysteine functionalization in biomolecules. One chemical functionalization, arylation, enables formation of robust arylthioether conjugates with superior stability properties. However, current state of the art arylation methods suffer from several disadvantages. These arylation methods rely on SNAr chemistry and are fundamentally limited to electron-deficient aromatic reagents, such as, for example, perfluorinated arylation agents. Further, these reagents generate complex mixtures of products, reacting non-specifically with nitrogen-based nucleophiles widely present in biomolecules. Worse still, these current methods exhibit slow reaction rates and require harsh pH and/or solvent conditions. Therefore, there exists a need to develop methods of cysteine functionalization, particularly methods that can tolerate various functional groups, reaction conditions, and that can generate stable products.
In certain embodiments, the invention provides a method of functionalizing a thiol or selenol, wherein said method is represented by Scheme 1:
wherein:
A1 is H, an amine protecting group, alkyl, arylalkyl, acyl, aryl, alkoxycarbonyl, aryloxycarbonyl, a natural or unnatural amino acid, a plurality of natural amino acids or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, a protein, an antibody, or an antibody fragment;
A2 is NH2, NH(amide protecting group), N(amide protecting group), OH, O(carboxylate protecting group), a natural or unnatural amino acid, a plurality of natural amino acids or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, a protein, an antibody, or an antibody fragment;
Y is S or Se;
R1 is H, alkyl, arylalkyl, acyl, aryl, alkoxycarbonyl, aryloxycarbonyl, a natural or unnatural amino acid, a plurality of natural amino acids or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, a protein, an antibody, or an antibody fragment;
M is Ni, Pd, Pt, Cu, or Au;
Ar1 is optionally substituted aryl, heteroaryl, alkenyl, or cycloalkenyl;
X is a halide, triflate, tetrafluoroborate, tetraarylborate, hexafluoroantimonate, bis(alkylsulfonyl)amide, tetrafluorophosphate, hexafluorophosphate, alkylsulfonate, haloalkylsulfonate, arylsulfonate, perchlorate, bis(fluoroalkylsulfonyl)amide, bis(arylsulfonyl)amide, (fluoroalkylsulfonyl)(fluoroalkyl-carbonyl)amide, nitrate, nitrite, sulfate, hydrogensulfate, alkyl sulfate, aryl sulfate, carbonate, bicarbonate, carboxylate, phosphate, hydrogen phosphate, dihydrogen phosphate, phosphinate, or hypochlorite;
L is independently for each occurrence a trialkylphosphine, a triarylphosphine, a dialkylarylphosphine, an alkyldiarylphosphine, an (alkenyl)(alkyl)(aryl)phosphine, an alkenyldiarylphosphine, an alkenyldialkylphosphine, a phosphine oxide, a bis(phosphine), a phosphoramide, a triarylphosphonate, an N-heterocyclic carbene, an optionally substituted phenanthroline, an optionally substituted iminopyridine, an optionally substituted 2,2′-bipyridine, an optionally substituted diimine, an optionally substituted triazolylpyridine, or an optionally substituted pyrazolyl pyridine;
n is an integer from 1-5;
m is 1 or 2; and
solvent is a polar protic solvent, a polar aprotic solvent, or a non-polar solvent.
In certain embodiments, the invention relates to a method, wherein said method is represented by Scheme 4:
wherein, independently for each occurrence:
A1 is H, an amine protecting group, alkyl, arylalkyl, acyl, aryl, alkoxycarbonyl, aryloxycarbonyl, a natural or unnatural amino acid, a plurality of natural amino acids or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, a protein, an antibody, or an antibody fragment;
A2 is NH2, NH(amide protecting group), N(amide protecting group), OH, O(carboxylate protecting group), a natural or unnatural amino acid, a plurality of natural amino acids or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, a protein, an antibody, or an antibody fragment;
A3, A4, and A5 are selected from the group consisting of a natural amino acid, an unnatural amino acid, and a plurality of natural amino acids or unnatural amino acids;
Y is S or Se;
R1 is H, alkyl, arylalkyl, acyl, aryl, alkoxycarbonyl, aryloxycarbonyl, a natural or unnatural amino acid, a plurality of natural amino acids or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, a protein, an antibody, or an antibody fragment;
M is Ni, Pd, Pt, Cu, or Au;
Ry is an optionally substituted bridging moiety, comprising an aromatic group, a heteroaromatic group, an alkene group, or a cycloalkene group;
y is 2, 3, 4, 5, or 6;
X is a halide, triflate, tetrafluoroborate, tetraarylborate, hexafluoroantimonate, bis(alkylsulfonyl)amide, tetrafluorophosphate, hexafluorophosphate, alkylsulfonate, haloalkylsulfonate, arylsulfonate, perchlorate, bis(fluoroalkylsulfonyl)amide, bis(arylsulfonyl)amide, (fluoroalkylsulfonyl)(fluoroalkyl-carbonyl)amide, nitrate, nitrite, sulfate, hydrogensulfate, alkyl sulfate, aryl sulfate, carbonate, bicarbonate, carboxylate, phosphate, hydrogen phosphate, dihydrogen phosphate, phosphinate, or hypochlorite;
L is independently for each occurrence a trialkylphosphine, a triarylphosphine, a dialkylarylphosphine, an alkyldiarylphosphine, an (alkenyl)(alkyl)(aryl)phosphine, an alkenyldiarylphosphine, an alkenyldialkylphosphine, a phosphine oxide, a bis(phosphine), a phosphoramide, a triarylphosphonate, an N-heterocyclic carbene, an optionally substituted phenanthroline, an optionally substituted iminopyridine, an optionally substituted 2,2′-bipyridine, an optionally substituted diimine, an optionally substituted triazolylpyridine, or an optionally substituted pyrazolyl pyridine;
n is an integer from 1-5;
m is 1 or 2;
each Z is independently
—S-alkyl, —SH, —S—(CH2)n—CO2H, —SCH(CH3)—CO2H, or —SCH(CO2H)—CH2CO2H; and
solvent is a polar protic solvent, a polar aprotic solvent, or a non-polar solvent.
The invention also provides methods according to Scheme 5:
wherein, independently for each occurrence:
A1 is H, an amine protecting group, alkyl, arylalkyl, acyl, aryl, alkoxycarbonyl, aryloxycarbonyl, a natural or unnatural amino acid, a plurality of natural amino acids or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, a protein, an antibody, or an antibody fragment;
A2 is NH2, NH(amide protecting group), N(amide protecting group), OH, O(carboxylate protecting group), a natural or unnatural amino acid, a plurality of natural amino acids or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, a protein, an antibody, or an antibody fragment;
A3, A4, and A5 are selected from the group consisting of a natural amino acid, an unnatural amino acid, and a plurality of natural amino acids or unnatural amino acids;
Y is S or Se;
R1 is H, alkyl, arylalkyl, acyl, aryl, alkoxycarbonyl, aryloxycarbonyl, a natural or unnatural amino acid, a plurality of natural amino acids or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, a protein, an antibody, or an antibody fragment;
M is Ni, Pd, Pt, Cu, or Au;
X is a halide, triflate, tetrafluoroborate, tetraarylborate, hexafluoroantimonate, bis(alkylsulfonyl)amide, tetrafluorophosphate, hexafluorophosphate, alkylsulfonate, haloalkylsulfonate, arylsulfonate, perchlorate, bis(fluoroalkylsulfonyl)amide, bis(arylsulfonyl)amide, (fluoroalkylsulfonyl)(fluoroalkyl-carbonyl)amide, nitrate, nitrite, sulfate, hydrogensulfate, alkyl sulfate, aryl sulfate, carbonate, bicarbonate, carboxylate, phosphate, hydrogen phosphate, dihydrogen phosphate, phosphinate, or hypochlorite;
L is independently for each occurrence a trialkylphosphine, a triarylphosphine, a dialkylarylphosphine, an alkyldiarylphosphine, an (alkenyl)(alkyl)(aryl)phosphine, an alkenyldiarylphosphine, an alkenyldialkylphosphine, a phosphine oxide, a bis(phosphine), a phosphoramide, a triarylphosphonate, an N-heterocyclic carbene, an optionally substituted phenanthroline, an optionally substituted iminopyridine, an optionally substituted 2,2′-bipyridine, an optionally substituted diimine, an optionally substituted triazolylpyridine, or an optionally substituted pyrazolyl pyridine;
is aryl, heteroaryl, alkenyl, or cycloalkenyl, wherein
is optionally further substituted by one or more substituents selected from halide, acyl, azide, isothiocyanate, alkyl, aralkyl, alkenyl, alkynyl or protected alkynyl, alkoxyl, arylcarbonyl, cycloalkyl, formyl, haloalkyl, hydroxyl, amino, nitro, sulfhydryl, amido, phosphonate, phosphinate, alkylthio, sulfonyl, sulfonamido, heterocyclyl, aryl, heteroaryl, —CF3, —CF2R7, —CFR72, —CN, polyethylene glycol, polyethylene imine, —(CH2)p-FG-R7, and Z;
Z is
—S-alkyl, —SH, —S—(CH2)n—CO2H, —SCH(CH3)—CO2H, or —SCH(CO2H)—CH2CO2H;
p is independently for each occurrence an integer from 0-10;
FG is independently for each occurrence selected from the group consisting of C(O), CO2, O(CO), C(O)NR7, NR7C(O), O, Si(R7)2, C(NR7), (R7)2N(CO)N(R7)2, OC(O)NR7, NR7C(O)O, and C(N═N);
R7 is independently for each occurrence selected from the group consisting of H, alkyl, cycloalkyl, aryl, aralkyl, alkenyl, and alkynyl;
n is an integer from 1-5;
m is 1 or 2; and
solvent is a polar protic solvent, a polar aprotic solvent, or a non-polar solvent.
In certain embodiments, L is selected from the group consisting of PPh3, Ph2P—CH3, PhP(CH3)2, P(o-tol)3, PCy3, P(tBu)3, BINAP, dppb, dppe, dppf, dppp,
Rx is independently for each occurrence alkyl, aralkyl, cycloalkyl, or aryl;
X1 is CH or N;
R2 is H or alkyl;
R3 is H or alkyl;
R4 is H, alkoxy, or alkyl;
R5 is alkyl or aryl;
R6 is alkyl or aryl; and
q is 1, 2, 3, or 4.
In certain embodiments, M is Ni or Pd.
In certain embodiments, X is triflate or halide.
In certain embodiments, Ar1 is (C6-C10)carbocyclic aryl, (C3-C12)heteroaryl, (C3-C14)polycyclic aryl, or alkenyl; and Ar1 is optionally substituted by one or more substituents independently selected from the group consisting of halide, acyl, azide, isothiocyanate, alkyl, aralkyl, alkenyl, alkynyl or protected alkynyl, alkoxyl, arylcarbonyl, cycloalkyl, formyl, haloalkyl, hydroxyl, amino, nitro, sulfhydryl, amido, phosphonate, phosphinate, alkylthio, sulfonyl, sulfonamido, heterocyclyl, aryl, heteroaryl, —CF3, —CF2R7, —CFR72, —CN, polyethylene glycol, polyethylene imine, and —(CH2)p-FG-R7;
p is independently for each occurrence an integer from 0-10;
FG is independently for each occurrence selected from the group consisting of C(O), CO2, O(CO), C(O)NR7, NR7C(O), O, Si(R7)2, C(NR7), (R7)2N(CO)N(R7)2, OC(O)NR7, NR7C(O)O, and C(N═N);
R7 is independently for each occurrence selected from the group consisting of H, alkyl, cycloalkyl, aryl, aralkyl, alkenyl, and alkynyl; and
if two or more substituents are present on Ar1, then two of said substituents taken together may form a ring.
In certain embodiments, Ar1 is covalently linked to a fluorophore, an imaging agent, a detection agent, a biomolecule, a therapeutic agent, a lipophilic moiety, a member of a high-affinity binding pair, or a cell-receptor targeting agent. In one embodiment, Ar1 is linked to biotin. In another embodiment, Ar1 is linked to fluorescein. In one embodiment, the therapeutic agent is trametinib, topotecan, abiraterone, dabrafenib, or vandetanib.
In certain embodiments, Ar1 is comprised by a fluorophore.
In certain embodiments, Ar1 is comprised by a therapeutic agent.
In certain embodiments, A1 and A2 are independently a natural or unnatural amino acid, a plurality of natural or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, or a protein.
In certain embodiments, A1 or A2 comprises arginine, histidine, lysine, aspartic acid, glutamic acid, serine, threonine, asparagine, glutamine, proline, tyrosine, or tryptophan.
In certain embodiments, the invention is a method of functionalizing a thiol or selenol, wherein the limiting reagent is
In certain embodiments, when A1 or A2 comprises an —SH or —SeH moiety, the molar ratio of the amount of
to the amount of
multiplied by the aggregate number of —SH and —SeH moieties in
is greater than 1:1.
In certain embodiments of the method of the invention, A1 and A2 are covalently linked.
In certain embodiments, the solvent used in the methods of the invention comprises water.
In certain embodiments, the solvent used in the methods of the invention comprises an aqueous buffer.
In other embodiments, the invention relates to a method of functionalizing a thiol or selenol in a biopolymer, comprising contacting a biopolymer comprising a thiol or selenol moiety with a reagent of structural formula II, thereby generating a functionalized biopolymer, wherein the thiol or selenol moiety has been transformed to —S—Ar1 or —Se—Ar1.
In certain embodiments, the biopolymer is an oligonucleotide, a polynucleotide, an oligosaccharide, or a polysaccharide.
In certain embodiments, the invention relates to a method of functionalizing a thiol or selenol, wherein the method is represented by Scheme 1:
wherein:
A1 is H, an amine protecting group, alkyl, arylalkyl, acyl, aryl, alkoxycarbonyl, aryloxycarbonyl, a natural or unnatural amino acid, a plurality of natural amino acids or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, a protein, an antibody, or an antibody fragment;
A2 is NH2, NH(amide protecting group), N(amide protecting group), OH, O(carboxylate protecting group), a natural or unnatural amino acid, a plurality of natural amino acids or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, a protein, an antibody, or an antibody fragment;
Y is S or Se;
R1 is H, alkyl, arylalkyl, acyl, aryl, alkoxycarbonyl, aryloxycarbonyl, a natural or unnatural amino acid, a plurality of natural amino acids or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, a protein, an antibody, or an antibody fragment;
M is Ni, Pd, Pt, Cu, or Au;
Ar1 is optionally substituted aryl, heteroaryl, alkenyl, or cycloalkenyl;
X is a halide, triflate, tetrafluoroborate, tetraarylborate, hexafluoroantimonate, bis(alkylsulfonyl)amide, tetrafluorophosphate, hexafluorophosphate, alkylsulfonate, haloalkylsulfonate, arylsulfonate, perchlorate, bis(fluoroalkylsulfonyl)amide, bis(arylsulfonyl)amide, (fluoroalkylsulfonyl)(fluoroalkyl-carbonyl)amide, nitrate, nitrite, sulfate, hydrogensulfate, alkyl sulfate, aryl sulfate, carbonate, bicarbonate, carboxylate, phosphate, hydrogen phosphate, dihydrogen phosphate, phosphinate, or hypochlorite; L is independently for each occurrence a trialkylphosphine, a triarylphosphine, a dialkylarylphosphine, an alkyldiarylphosphine, an (alkenyl)(alkyl)(aryl)phosphine, an alkenyldiarylphosphine, an alkenyldialkylphosphine, a phosphine oxide, a bis(phosphine), a phosphoramide, a triarylphosphonate, an N-heterocyclic carbene, an optionally substituted phenanthroline, an optionally substituted iminopyridine, an optionally substituted 2,2′-bipyridine, an optionally substituted diimine, an optionally substituted triazolylpyridine, or an optionally substituted pyrazolyl pyridine;
n is an integer from 1-5;
m is 1 or 2; and
solvent is a polar protic solvent, a polar aprotic solvent, or a non-polar solvent.
This method features several significant advantages over existing functionalization methods, such as specificity for functionalization of thiols and selenols over other reactive functional groups (e.g., hydroxyls, amines), excellent functional group tolerance, and mild reaction conditions in both polar organic and buffered aqueous solvent media. Furthermore, kinetic studies demonstrate that the methods of the invention are fast, resulting in complete labeling at micromolar concentrations of biomolecules within minutes. The methods presented herein are widely applicable for modifications of biomolecules containing amino acids bearing thiol or selenol moieties. The ability to selectively chemically modify biomolecules is an important application relevant to research and development in the pharmaceutical and biotechnology industries.
In certain embodiments, the invention relates to selective cysteine and selenocysteine modification on unprotected peptide/protein molecules under physiologically relevant conditions. This process exhibits specificity towards cysteine (Cys) and selenocysteine (Sec) over other competing nucleophilic amino acids (e.g., serine, threonine, lysine), excellent functional group tolerance, and mild reaction conditions.
In certain embodiments, the invention is a method according to Scheme 1, wherein m is an integer from 0-3.
In certain embodiments, the thiol or selenol that is functionalized in the methods of the invention is an alpha amino acid having the structure of formula (I):
wherein A1, A2, Y, n, and R1 are defined as above. In certain embodiments, the thiol is cysteine and the selenol is selenocysteine. In certain embodiments, n is 1 or 2.
In certain embodiments, the invention relates to a method of functionalizing (e.g., arylating) a thiol or selenol according to Scheme 1, wherein the functionalization agent is a compound of formula (II):
wherein L is a ligand, X is a halide or a triflate, m is 1 or 2, and Ar1 is optionally substituted aryl, heteroaryl, alkenyl, or cycloalkenyl.
In certain embodiments, the invention relates to a method according to Scheme 1, wherein the functionalization agent is a compound of formula (II), wherein m is an integer from 0-3. In certain embodiments, m is an integer from 1-3. In certain embodiments, m is 1 or 2. In more particular embodiments, m is 1. In certain embodiments in which m is 2 or 3, one instance of L is covalently connected via a linker moiety to one or more other instances of L. In such certain embodiments, M, taken together with two or three instances of ligand, is a cyclic or bicyclic structure.
In certain embodiments, the ligand L of formula (II) is a ligand described in U.S. Pat. No. 7,858,784, which is hereby incorporated by reference in its entirety.
In certain embodiments, the ligand L of formula (II) is a ligand described in U.S. Patent Application Publication No. 2011/0015401, which is hereby incorporated by reference in its entirety.
In certain embodiments, the ligand L of formula (II) is a trialkylphosphine, a triarylphosphine, a dialkylarylphosphine, an alkyldiarylphosphine, an (alkenyl)(alkyl)(aryl)phosphine, an alkenyldiarylphosphine, an alkenyldialkylphosphine, a phosphine oxide, a bis(phosphine), a phosphoramide, a triarylphosphonate, an N-heterocyclic carbene, an optionally substituted phenanthroline, an optionally substituted iminopyridine, an optionally substituted 2,2′-bipyridine, an optionally substituted diimine, an optionally substituted triazolylpyridine, or an optionally substituted pyrazolyl pyridine. In certain embodiments, the ligand L of formula (II) is a trialkylphosphine, a triarylphosphine, a dialkylarylphosphine, an alkyldiarylphosphine, an (alkenyl)(alkyl)(aryl)phosphine, an alkenyldiarylphosphine, an alkenyldialkylphosphine, a phosphine oxide, a bis(phosphine), a phosphoramide, or a triarylphosphonate.
In certain embodiments, the ligand L of formula (II) is selected from the group consisting of PPh3, Ph2P—CH3, PhP(CH3)2, P(o-tol)3, PCy3, P(tBu)3, BINAP, dppb, dppe,
or its salt,
or its salt,
Rx is alkyl, aralkyl, cycloalkyl, or aryl;
X1 is CH or N;
R2 is H or alkyl;
R3 is H or alkyl;
R4 is H, alkoxy, or alkyl;
R5 is alkyl or aryl;
R6 is alkyl or aryl; and
q is 1, 2, 3, or 4.
In certain embodiments, X of formula (II) is X is a halide (e.g., fluoride, chloride, bromide, iodide) or a triflate.
In certain embodiments, X of formula (II) is selected from the group consisting of boron tetrafluoride, tetraarylborates (such as B(C6F5)4− and (B[3,5-(CF3)2C6H3]4)−), hexafluoroantimonate, phosphorus tetrafluoride, phosphorus hexafluoride, alkylsulfonate, haloalkylsulfonate, arylsulfonate, perchlorate, bis(alkylsulfonyl)amide, halide, bis(fluoroalkylsulfonyl)amide, bis(arylsulfonyl)amide, (fluoroalkylsulfonyl)(fluoroalkyl-carbonyl)amide, nitrate, nitrite, sulfate, hydrogensulfate, alkyl sulfate, aryl sulfate, carbonate, bicarbonate, carboxylate, phosphate, hydrogen phosphate, dihydrogen phosphate, phosphinate, and hypochlorite.
In certain embodiments, X of formula (II) is alkylsulfonate; and the alkyl is substituted alkyl. In certain embodiments, X of formula (II) is alkylsulfonate; and the alkyl is unsubstituted alkyl.
In certain embodiments, X of formula (II) is alkylsulfonate; and the alkyl is methyl, ethyl, propyl, or butyl. In certain embodiments, X of formula (II) is alkylsulfonate; and the alkyl is methyl or ethyl.
In certain embodiments, X of formula (II) is haloalkylsulfonate. In certain embodiments, X of formula (II) is fluoroalkylsulfonate.
In certain embodiments, X of formula (II) is fluoromethylsulfonate. In certain embodiments, X is trifluoromethylsulfonate.
In certain embodiments, X of formula (II) is cycloalkylalkylsulfonate. In certain embodiments, X is
or its enantiomer.
In certain embodiments, m of formula (II) is 1 or 2. In certain embodiments, m is 1.
In certain embodiments, Ar1 of formula (II) is optionally substituted aryl, heteroaryl, alkenyl, or cycloalkenyl. In certain embodiments, Ar1 is optionally substituted aryl or heteroaryl group.
In certain embodiments, Ar1 of formula (II) is (C6-C10)carbocyclic aryl, (C3-C12)heteroaryl, (C3-C14)polycyclic aryl, or alkenyl; and Ar1 is optionally substituted by one or more substituents independently selected from the group consisting of halide, acyl, azide, isothiocyanate, alkyl, aralkyl, alkenyl, alkynyl or protected alkynyl, alkoxyl, arylcarbonyl, cycloalkyl, formyl, haloalkyl, hydroxyl, amino, nitro, sulfhydryl, amido, phosphonate, phosphinate, alkylthio, sulfonyl, sulfonamido, heterocyclyl, aryl, heteroaryl, —CF3, —CF2R7, —CFR72, —CN, polyethylene glycol, polyethylene imide, and —(CH2)n-FG-R7;
n is independently for each occurrence an integer from 0-10;
FG is independently for each occurrence selected from the group consisting of C(O), CO2, O(CO), C(O)NR7, NR7C(O), O, Si(R7)2, C(NR7), (R7)2N(CO)N(R7)2, OC(O)NR7, NR7C(O)O, and C(N═N);
R7 is independently for each occurrence selected from the group consisting of H, alkyl, cycloalkyl, aryl, aralkyl, alkenyl, and alkynyl; and
if two or more substituents are present on Ar1, then two of said substituents taken together may form a ring.
In certain embodiments, Ar1 of formula (II) is covalently linked to a fluorophore, an imaging agent, a detection agent, a biomolecule, a therapeutic agent, a lipophilic moiety, a member of a high-affinity binding pair, or a cell-receptor targeting agent. In certain embodiments, the invention relates to any one of the aforementioned compounds, wherein Ar1 is covalently linked to biotin. In certain embodiments, the invention relates to any one of the aforementioned compounds, wherein Ar1 is covalently linked to fluorescein. In certain embodiments, the invention relates to any of the aforementioned compounds, wherein Ar1 is covalently linked to a therapeutic agent; and the therapeutic agent is trametinib, topotecan, abiraterone, dabrafenib, or vandetanib.
In certain other embodiments, Ar1 of formula (II) is comprised by a fluorophore. In certain embodiments, the invention relates to any of the aforementioned compounds, wherein Ar1 is comprised by a therapeutic agent. In certain embodiments, the therapeutic agent is the trametinib, topotecan, abiraterone, dabrafenib, or vandetanib.
In certain embodiments, the fluorophore is a derivative of xanthene, fluorescein, rhodamine, coumarin, naphthalene, anathracene, oxadiazole, pyrene, acridine, tetrapyrrole, arylmethine, boron-dipyrromethene (BODIPY), or a cyanine dye. In certain other embodiments, the fluorophore is a fluorescent protein. In certain embodiments, the detection agent is for example, a nanoparticle, an MRI contrast agent, a dye moiety, or a radionuclide. In certain other embodiments, a biomolecule is a protein, a peptide, a monosaccharide, a disaccharide, an oligosaccharide, a polysaccharide, a lipid, a glycolipid, a glycerolipid, a phospholipid, a hormone, a neurotransmitter, a nucleic acid, a nucleotide, a nucleoside, a sterol, a metabolite, a vitamin, or a natural product.
In certain embodiments, a therapeutic agent is a compound or substructure of a compound that brings about a therapeutic effect in a subject to which the agent is administered. In certain embodiments, the therapeutic agent is toxic to certain cells. Exemplary therapeutic agents that are covalently linked to Ar1 of formula (II) include trametinib, topotecan, abiraterone, dabrafenib, or vandetanib.
In certain embodiments, the lipophilic moiety enables the compound bearing Ar1 to have an affinity for, or be soluble in, lipids, fats, oils, ad non-polar solvents, as described herein. Exemplary lipophilic moieties include amphiphilic surfactants, such as cinnamic acid.
In certain embodiments, the cell-receptor targeting agent is a ligand such as an epitope, a peptide, an antibody, a small organic compound, a neurotransmitter. High-affinity binding pairs include biotin-avidin, biotin-streptavidin, ligand-cell receptor, S-Peptide and Ribonuclease A, digoxigenin and its receptor, and complementary oligonucleotide pairs.
In certain embodiments, the invention relates to a method of Scheme 1:
wherein,
A1 is H, an amine protecting group, alkyl, arylalkyl, acyl, aryl, alkoxycarbonyl, aryloxycarbonyl, a natural or unnatural amino acid, a plurality of natural amino acids or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, a protein, an antibody, or an antibody fragment;
A2 is NH2, NH(amide protecting group), N(amide protecting group), OH, O(carboxylate protecting group), a natural or unnatural amino acid, a plurality of natural amino acids or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, a protein, an antibody, or an antibody fragment;
Y is S or Se;
R1 is H, alkyl, arylalkyl, acyl, aryl, alkoxycarbonyl, aryloxycarbonyl, a natural or unnatural amino acid, a plurality of natural amino acids or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, a protein, an antibody, or an antibody fragment;
M is Ni, Pd, Pt, Cu, or Au;
Ar1 is optionally substituted aryl, heteroaryl, alkenyl, or cycloalkenyl;
X is a halide, triflate, tetrafluoroborate, tetraarylborate, hexafluoroantimonate, bis(alkylsulfonyl)amide, tetrafluorophosphate, hexafluorophosphate, alkylsulfonate, haloalkylsulfonate, arylsulfonate, perchlorate, bis(fluoroalkylsulfonyl)amide, bis(arylsulfonyl)amide, (fluoroalkylsulfonyl)(fluoroalkyl-carbonyl)amide, nitrate, nitrite, sulfate, hydrogensulfate, alkyl sulfate, aryl sulfate, carbonate, bicarbonate, carboxylate, phosphate, hydrogen phosphate, dihydrogen phosphate, phosphinate, or hypochlorite;
L is independently for each occurrence a trialkylphosphine, a triarylphosphine, a dialkylarylphosphine, an alkyldiarylphosphine, an (alkenyl)(alkyl)(aryl)phosphine, an alkenyldiarylphosphine, an alkenyldialkylphosphine, a phosphine oxide, a bis(phosphine), a phosphoramide, a triarylphosphonate, an N-heterocyclic carbene, an optionally substituted phenanthroline, an optionally substituted iminopyridine, an optionally substituted 2,2′-bipyridine, an optionally substituted diimine, an optionally substituted triazolylpyridine, or an optionally substituted pyrazolyl pyridine;
n is an integer from 1-5;
m is 1 or 2; and
solvent is a polar protic solvent, a polar aprotic solvent, or a non-polar solvent.
In certain embodiments, the invention relates to a method, wherein said method is represented by Scheme 4:
wherein, independently for each occurrence:
A1 is H, an amine protecting group, alkyl, arylalkyl, acyl, aryl, alkoxycarbonyl, aryloxycarbonyl, a natural or unnatural amino acid, a plurality of natural amino acids or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, a protein, an antibody, or an antibody fragment;
A2 is NH2, NH(amide protecting group), N(amide protecting group), OH, O(carboxylate protecting group), a natural or unnatural amino acid, a plurality of natural amino acids or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, a protein, an antibody, or an antibody fragment;
A3, A4, and A5 are selected from the group consisting of a natural amino acid, an unnatural amino acid, and a plurality of natural amino acids or unnatural amino acids;
Y is S or Se;
R1 is H, alkyl, arylalkyl, acyl, aryl, alkoxycarbonyl, aryloxycarbonyl, a natural or unnatural amino acid, a plurality of natural amino acids or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, a protein, an antibody, or an antibody fragment;
M is Ni, Pd, Pt, Cu, or Au;
Ry is an optionally substituted bridging moiety, comprising an aromatic group, a heteroaromatic group, an alkene group, or a cycloalkene group;
y is 2, 3, 4, 5, or 6;
X is a halide, triflate, tetrafluoroborate, tetraarylborate, hexafluoroantimonate, bis(alkylsulfonyl)amide, tetrafluorophosphate, hexafluorophosphate, alkylsulfonate, haloalkylsulfonate, arylsulfonate, perchlorate, bis(fluoroalkylsulfonyl)amide, bis(arylsulfonyl)amide, (fluoroalkylsulfonyl)(fluoroalkyl-carbonyl)amide, nitrate, nitrite, sulfate, hydrogensulfate, alkyl sulfate, aryl sulfate, carbonate, bicarbonate, carboxylate, phosphate, hydrogen phosphate, dihydrogen phosphate, phosphinate, or hypochlorite;
L is independently for each occurrence a trialkylphosphine, a triarylphosphine, a dialkylarylphosphine, an alkyldiarylphosphine, an (alkenyl)(alkyl)(aryl)phosphine, an alkenyldiarylphosphine, an alkenyldialkylphosphine, a phosphine oxide, a bis(phosphine), a phosphoramide, a triarylphosphonate, an N-heterocyclic carbene, an optionally substituted phenanthroline, an optionally substituted iminopyridine, an optionally substituted 2,2′-bipyridine, an optionally substituted diimine, an optionally substituted triazolylpyridine, or an optionally substituted pyrazolyl pyridine;
n is an integer from 1-5;
m is 1 or 2;
each Z is independently
—S-alkyl, —SH, —S—(CH2)n—CO2H, —SCH(CH3)—CO2H, or —SCH(CO2H)—CH2CO2H; and
solvent is a polar protic solvent, a polar aprotic solvent, or a non-polar solvent.
The invention described herein also provides methods for generating a stapled peptide using a mono-metallated catalyst bearing a haloaryl group. Such methods provide an alternative non-symmetric synthesis of a stapled peptide. For example, such synthesis can occur in a stepwise manner, in which a first bond forming step occurs between a first cysteine residue in a peptide and a mono-metallated haloarylation reagent. A second cross-coupling step may then occur between a second cysteine residue and the aryl halide, yielding the target stapled peptide product.
In certain embodiments, the invention relates to a method, wherein said method is represented by Scheme 5:
wherein, independently for each occurrence:
A1 is H, an amine protecting group, alkyl, arylalkyl, acyl, aryl, alkoxycarbonyl, aryloxycarbonyl, a natural or unnatural amino acid, a plurality of natural amino acids or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, a protein, an antibody, or an antibody fragment;
A2 is NH2, NH(amide protecting group), N(amide protecting group), OH, O(carboxylate protecting group), a natural or unnatural amino acid, a plurality of natural amino acids or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, a protein, an antibody, or an antibody fragment;
A3, A4, and A5 are selected from the group consisting of a natural amino acid, an unnatural amino acid, and a plurality of natural amino acids or unnatural amino acids;
Y is S or Se;
R1 is H, alkyl, arylalkyl, acyl, aryl, alkoxycarbonyl, aryloxycarbonyl, a natural or unnatural amino acid, a plurality of natural amino acids or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, a protein, an antibody, or an antibody fragment;
M is Ni, Pd, Pt, Cu, or Au;
X is a halide, triflate, tetrafluoroborate, tetraarylborate, hexafluoroantimonate, bis(alkylsulfonyl)amide, tetrafluorophosphate, hexafluorophosphate, alkylsulfonate, haloalkylsulfonate, arylsulfonate, perchlorate, bis(fluoroalkylsulfonyl)amide, bis(arylsulfonyl)amide, (fluoroalkylsulfonyl)(fluoroalkyl-carbonyl)amide, nitrate, nitrite, sulfate, hydrogensulfate, alkyl sulfate, aryl sulfate, carbonate, bicarbonate, carboxylate, phosphate, hydrogen phosphate, dihydrogen phosphate, phosphinate, or hypochlorite;
L is independently for each occurrence a trialkylphosphine, a triarylphosphine, a dialkylarylphosphine, an alkyldiarylphosphine, an (alkenyl)(alkyl)(aryl)phosphine, an alkenyldiarylphosphine, an alkenyldialkylphosphine, a phosphine oxide, a bis(phosphine), a phosphoramide, a triarylphosphonate, an N-heterocyclic carbene, an optionally substituted phenanthroline, an optionally substituted iminopyridine, an optionally substituted 2,2′-bipyridine, an optionally substituted diimine, an optionally substituted triazolylpyridine, or an optionally substituted pyrazolyl pyridine;
is aryl, heteroaryl, alkenyl, or cycloalkenyl, wherein
is optionally further substituted by one or more substituents selected from halide, acyl, azide, isothiocyanate, alkyl, aralkyl, alkenyl, alkynyl or protected alkynyl, alkoxyl, arylcarbonyl, cycloalkyl, formyl, haloalkyl, hydroxyl, amino, nitro, sulfhydryl, amido, phosphonate, phosphinate, alkylthio, sulfonyl, sulfonamido, heterocyclyl, aryl, heteroaryl, —CF3, —CF2R7, —CFR72, —CN, polyethylene glycol, polyethylene imine, —(CH2)p-FG-R7, and Z;
Z is
—S-alkyl, —SH, —S—(CH2)n—CO2H, —SCH(CH3)—CO2H, or —SCH(CO2H)—CH2CO2H;
p is independently for each occurrence an integer from 0-10;
FG is independently for each occurrence selected from the group consisting of C(O), CO2, O(CO), C(O)NR7, NR7C(O), O, Si(R7)2, C(NR7), (R7)2N(CO)N(R7)2, OC(O)NR7, NR7C(O)O, and C(N═N);
R7 is independently for each occurrence selected from the group consisting of H, alkyl, cycloalkyl, aryl, aralkyl, alkenyl, and alkynyl;
n is an integer from 1-5;
m is 1 or 2; and
solvent is a polar protic solvent, a polar aprotic solvent, or a non-polar solvent.
In certain embodiments, the invention relates to any one of the aforementioned methods, wherein the solvent is an inert solvent, preferably one in which the reaction ingredients, including the catalyst, are substantially soluble. Suitable solvents include ethers such as diethyl ether, 1,2-dimethoxyethane, diglyme, t-butyl methyl ether, tetrahydrofuran, water and the like; halogenated solvents such as chloroform, dichloromethane, dichloroethane, chlorobenzene, and the like; aliphatic or aromatic hydrocarbon solvents such as benzene, xylene, toluene, hexane, pentane and the like; esters and ketones, such as ethyl acetate, acetone, and 2-butanone; polar aprotic solvents, such as acetonitrile, dimethylsulfoxide, dimethylformamide and the like; or combinations of two or more solvents.
In certain embodiments, the invention relates to any one of the aforementioned methods, wherein the solvent is a solvent mixture. In certain embodiments, the solvent mixture is an aqueous solvent mixture including a polar aprotic solvent. In certain embodiments, the invention relates to any one of the aforementioned methods, wherein the solvent comprises water and a polar protic solvent such as acetonitrile, dimethylsulfoxide, or dimethylformamide. In certain embodiments, the solvent is a solvent mixture comprising water and acetonitrile. In certain embodiments, the invention relates to any one of the aforementioned methods, wherein the solvent is a solvent mixture comprising water and dimethylformamide. In certain embodiments, the solvent mixture comprises from about 20:1 water to polar aprotic solvent to about 1:20 water to polar aprotic solvent, about 19:1 water to polar aprotic solvent to about 1:19 water to polar aprotic solvent, or about 18:1 water to polar aprotic solvent to about 1:18 water to polar aprotic solvent. In certain embodiments, the solvent mixture comprises from about 5:1 water to polar aprotic solvent to about 1:5 water to polar aprotic solvent. In certain embodiments, the solvent mixture further comprises a buffer. For example, the buffer may be Tris, HEPES, MOPS, MES, or Na2HPO4:NaH2PO4. In certain embodiments, the concentration of the buffer is from about 0.01 M to about 1 M, for example, about 25 mM or about 0.1 M.
In certain embodiments, the invention relates to any one of the aforementioned methods, wherein the reaction takes place at from about 4° C. to about 40° C. In certain embodiments, the invention relates to any one of the aforementioned methods, wherein the reaction takes place at about 10° C., about 15° C., about 20° C., about 25° C., about 30° C., or about 35° C.
In certain embodiments, the invention relates to any one of the aforementioned methods, wherein the reaction is substantially complete after about 10 s, about 20 s, about 30 s, about 40 s, about 50 s, about 1 min, about 2 min, about 3 min, about 4 min, about 5 min, about 10 min, about 15 min, about 20 min, about 25 min, about 30 min, about 35 min, about 40 min, about 45 min, about 50 min, about 55 min, about 60 min, about 65 min, about 70 min, about 75 min, about 80 min, about 85 min, or about 90 min. In certain embodiments, the invention relates to any one of the aforementioned methods, wherein the reaction is substantially complete after about 2 h, about 3 h, about 4 h, about 5 h, about 6 h, about 7 h, about 8 h, about 9 h, about 10 h, about 11 h, or about 12 h.
The reactions of the present invention may be performed under a wide range of conditions, though it will be understood that the solvents and temperature ranges recited herein are not limitative and only correspond to exemplary modes of the processes of the invention.
In general, it will be desirable that reactions are run using mild conditions which will not adversely affect the reactants, the precatalyst, or the product. For example, the reaction temperature influences the speed of the reaction, as well as the stability of the reactants and catalyst. The reactions will usually be run at temperatures in the range of 20° C. to 300° C., more preferably in the range 20° C. to 150° C. In certain embodiments, the reactions will be run at room temperature (i.e., about 20° C. to about 25° C.). In certain embodiments, the pH of the reaction mixture may be about 8.5. In certain embodiments, the pH of the reaction mixture may be about 8.0, about 7.5, about 7.0, about 6.5, about 6.0, about 5.5, about 5.0, about 4.5, about 4.0, about 3.5, about 3.0, about 2.5, about 2.0, or about 1.5.
Another aspect of the invention relates to a method of functionalizing a thiol or selenol in a biopolymer, comprising contacting a biopolymer comprising a thiol or selenol moiety with a reagent of structural formula II, as defined above. The conditions under which the biopolymer and II come into contact with one another are sufficient to generate the functionalized biopolymer, in which Ar1 is installed at the thiol or selenol moiety of the biopolymer. In certain embodiments, the biopolymer is an oligonucleotide, a polynucleotide, an oligosaccharide, or a polysaccharide.
In certain embodiments, the invention relates to a method of functionalizing a thiol or selenol in a biopolymer, wherein the functionalization reagent is a compound of formula (II) as described herein.
Another aspect of the invention relates to a method, comprising contacting a biopolymer comprising a first thiol moiety or a first selenol moiety and a second thiol or a second selenol moiety with a reagent of formula IV as defined herein, thereby generating a functionalized biopolymer, wherein the first thiol moiety or the first selenol moiety has been covalently bound to the second thiol moiety or the second selenol moiety by Ry. The conditions under which the biopolymer and IV come into contact with one another are sufficient to generate the functionalized biopolymer. In certain embodiments, the biopolymer is an oligonucleotide, a polynucleotide, an oligosaccharide, or a polysaccharide.
In certain embodiments, the invention relates to a method of functionalizing a thiol or selenol in a biopolymer, wherein the functionalization reagent is a compound of formula (IV) as described herein.
In certain embodiments of the method represented by Scheme 1, Ar1 is (C6-C10)carbocyclic aryl, (C3-C12)heteroaryl, (C3-C14)polycyclic aryl, or alkenyl, substituted by one or more substituents independently selected from the group consisting of halide, acyl, azide, isothiocyanate, alkyl, aralkyl, alkenyl, alkynyl or protected alkynyl, alkoxyl, arylcarbonyl, cycloalkyl, formyl, haloalkyl, hydroxyl, amino, nitro, sulfhydryl, amido, phosphonate, phosphinate, alkylthio, sulfonyl, sulfonamido, heterocyclyl, aryl, heteroaryl, —CF3, —CF2R7, —CFR72, —CN, polyethylene glycol, polyethylene imine, and —(CH2)p-FG-R7;
p is independently for each occurrence an integer from 0-10;
FG is independently for each occurrence selected from the group consisting of C(O), CO2, O(CO), C(O)NR7, NR7C(O), O, Si(R7)2, C(NR7), (R7)2N(CO)N(R7)2, OC(O)NR7, NR7C(O)O, and C(N═N);
R7 is independently for each occurrence selected from the group consisting of H, alkyl, cycloalkyl, aryl, aralkyl, alkenyl, and alkynyl;
wherein at least one of the one or more substituents is halide.
Certain arylated products contain functional groups that allow for further functionalization of the product. In certain embodiments, an aryl-halide bond provides a useful handle for such further functionalization. For example, the aryl-halide bond can undergo a metal-catalyzed or metal-mediated cross-coupling reaction with an additional thiol-containing reagent.
Accordingly, in certain embodiments wherein Ar1 is (C6-C10)carbocyclic aryl, (C3-C12)heteroaryl, (C3-C14)polycyclic aryl, or alkenyl substituted by at least one halide, the method represented by Scheme 1 further comprises contacting compound III,
with a compound containing a thiol moiety or a selenol moiety; thereby yielding a coupling product.
In certain embodiments, the compound containing a thiol moiety or a selenol moiety is a small molecule having a molecular weight below about 500 g/mol.
In certain embodiments, the compound containing a thiol moiety or a selenol moiety is a biomolecule such as a natural or unnatural amino acid, a plurality of natural or unnatural amino acids, peptide, oligopeptide, polypeptide, or protein.
In certain embodiments, the step of contacting compound III with a compound containing a thiol moiety or a selenol moiety occurs in the presence of a Pd byproduct from the reaction depicted in Scheme 1.
In certain embodiments, the invention relates to a compound comprising substructure III:
wherein,
A1 is H, an amine protecting group, alkyl, arylalkyl, acyl, aryl, alkoxycarbonyl, aryloxycarbonyl, a natural or unnatural amino acid, a plurality of natural amino acids or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, a protein, an antibody, or an antibody fragment;
A2 is NH2, NH(amide protecting group), N(amide protecting group), OH, O(carboxylate protecting group), a natural or unnatural amino acid, a plurality of natural amino acids or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, a protein, an antibody, or an antibody fragment;
Y is S or Se;
R1 is H, alkyl, arylalkyl, acyl, aryl, alkoxycarbonyl, or aryloxycarbonyl, a natural or unnatural amino acid, a plurality of natural amino acids or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, a protein, an antibody, or an antibody fragment;
n is an integer from 1-5; and
Ar1 is optionally substituted aryl, heteroaryl, alkenyl, or cycloalkenyl.
In certain embodiments, the invention relates to a compound comprising substructure III, wherein Ar1 is covalently linked to a fluorophore, an imaging agent, a detection agent, a biomolecule, a therapeutic agent, a lipophilic moiety, a member of a high-affinity binding pair, or a cell-receptor targeting agent. In certain embodiments, the invention relates to any one of the aforementioned compounds, wherein Ar1 is covalently linked to biotin. In certain embodiments, the invention relates to any one of the aforementioned compounds, wherein Ar1 is covalently linked to fluorescein. In certain embodiments, the invention relates to any of the aforementioned compounds, wherein Ar1 is covalently linked to a therapeutic agent; and the therapeutic agent is trametinib, topotecan, abiraterone, dabrafenib, or vandetanib.
In certain other embodiments, the invention relates to a compound comprising substructure III, wherein Ar1 is comprised by a fluorophore. In certain embodiments, the invention relates to any of the aforementioned compounds, wherein Ar1 is comprised by a therapeutic agent. In certain embodiments, the therapeutic agent is the trametinib, topotecan, abiraterone, dabrafenib, or vandetanib.
In certain embodiments, the fluorophore is a derivative of xanthene, fluorescein, rhodamine, coumarin, naphthalene, anathracene, oxadiazole, pyrene, acridine, tetrapyrrole, arylmethine, boron-dipyrromethene (BODIPY), or a cyanine dye. In certain other embodiments, the fluorophore is a fluorescent protein. In certain embodiments, the detection agent is for example, a nanoparticle, an MRI contrast agent, a dye moiety, or a radionuclide. In certain other embodiments, a biomolecule is a protein, a peptide, a monosaccharide, a disaccharide, a polysaccharide, a lipid, a glycolipid, a glycerolipid, a phospholipid, a hormone, a neurotransmitter, a nucleic acid, a nucleotide, a nucleoside, a sterol, a metabolite, a vitamin, or a natural product.
In certain embodiments, a therapeutic agent is a compound or substructure of a compound that brings about a therapeutic effect in a subject to which the agent is administered. In certain embodiments, the therapeutic agent is toxic to certain cells. Exemplary therapeutic agents that are covalently linked to Ar1 in substructure III include trametinib, topotecan, abiraterone, dabrafenib, or vandetanib.
In certain embodiments, the lipophilic moiety enables the compound of substructure III to which the lipophilic moiety is conjugated to have an affinity for, or be soluble in, lipids, fats, oils, ad non-polar solvents, as described herein. Exemplary lipophilic moieties include amphiphilic surfactants, such as cinnamic acid.
In certain embodiments, the cell-receptor targeting agent is a ligand such as an epitope, a peptide, an antibody, a small organic compound, a neurotransmitter. High-affinity binding pairs include biotin-avidin, biotin-streptavidin, ligand-cell receptor, S-Peptide and Ribonuclease A, digoxigenin and its receptor, and complementary oligonucleotide pairs.
In certain embodiments, the invention relates to a compound comprising substructure III, wherein A1 and A2 are independently a natural or unnatural amino acid, a plurality of natural or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, or a protein.
In certain embodiments, A1 and A2 of substructure III each independently comprise arginine, histidine, lysine, aspartic acid, glutamic acid, serine, threonine, asparagine, glutamine, proline, tyrosine, or tryptophan. In certain embodiments, A1 and A2 do not comprise cysteine or selenocysteine. In certain embodiments, A1 and A2 do not comprise any amino acids that contain —SH or —SeH moieties.
In certain embodiments, the invention relates to a compound comprising substructure III, wherein R1 is H. In certain embodiments, the invention relates to a compound comprising substructure III, wherein X is halide, such as chloride. In certain embodiments, X is triflate.
In certain embodiments, the invention relates to a compound comprising substructure III, wherein A1 and A2 are covalently linked. In certain embodiments, substructure III comprises a cyclic peptide having an functionalized S moiety or a functionalized Se moiety. In certain embodiments, the functionalized S moiety or functionalized Se moiety is an arylated S moiety or an arylated Se moiety, respectively.
In certain embodiments, A1 or A2 comprises an antibody or an antibody fragment. In certain embodiments, the antibody is intact and comprises a single-point mutation with functionalized (e.g., arylated) Cys, Sec, or an artificial amino acid comprising —S(functional group) or —Se(functional group) on its main chain terminus. In alternative embodiments, A1 or A2 comprises an antibody fragment after partial antibody reduction.
In certain embodiments, the invention relates to any one of the compounds described herein.
In certain embodiments, the invention relates to a compound comprising substructure V:
wherein, independently for each occurrence,
A1 is H, an amine protecting group, alkyl, arylalkyl, acyl, aryl, alkoxycarbonyl, aryloxycarbonyl, a natural or unnatural amino acid, a plurality of natural amino acids or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, a protein, an antibody, or an antibody fragment;
A2 is NH2, NH(amide protecting group), N(amide protecting group), OH, O(carboxylate protecting group), a natural or unnatural amino acid, a plurality of natural amino acids or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, a protein, an antibody, or an antibody fragment;
A3, A4, and A5 are selected from the group consisting of a natural amino acid, an unnatural amino acid, and a plurality of natural amino acids or unnatural amino acids;
Y is S or Se;
n is 1-5;
Ry is an optionally substituted bridging moiety, comprising an aromatic group, a heteroaromatic group, an alkene group, or a cycloalkene group;
y is 2, 3, 4, 5, or 6;
each Z is independently
—S-alkyl, —SH, —S—(CH2)n—CO2H, —SCH(CH3)—CO2H, or —SCH(CO2H)—CH2CO2H; and
R1 is H, alkyl, arylalkyl, acyl, aryl, alkoxycarbonyl, or aryloxycarbonyl, a natural or unnatural amino acid, a plurality of natural amino acids or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, a protein, an antibody, or an antibody fragment.
In certain embodiments, the invention relates to any of the compounds described herein, wherein none of A1, A2, A3, A4, and A5 comprises cysteine.
In certain embodiments, the invention relates to any of the compounds described herein, wherein one or more of A1, A2, A3, A4, and A5 comprises arginine, histidine, lysine, aspartic acid, glutamic acid, serine, threonine, asparagine, glutamine, glycine, proline, alanine, valine, isoleucine, leucine, methionine, phenylalanine, tyrosine, or tryptophan.
In certain embodiments, the invention relates to any of the compounds described herein, wherein Ry is an optionally substituted bifunctional bridging moiety or an optionally substituted trifunctional bridging moiety.
In certain embodiments, the invention relates to any of the compounds described herein, wherein Ry comprises an aromatic group.
In certain embodiments, the invention relates to any of the compounds described herein, wherein Ry is optionally substituted
In certain embodiments, the invention relates to any of the compounds described herein, wherein Ry is not a perfluorinated aryl para-substituted diradical.
In certain embodiments, the invention relates to any one of the compounds described herein, wherein y is 2; and Ry is selected from the group consisting of
wherein any of the bifunctional bridging moieties may be optionally substituted.
In certain embodiments, the invention relates to a compound comprising substructure VI:
wherein, independently for each occurrence:
A1 is H, an amine protecting group, alkyl, arylalkyl, acyl, aryl, alkoxycarbonyl, aryloxycarbonyl, a natural or unnatural amino acid, a plurality of natural amino acids or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, a protein, an antibody, or an antibody fragment;
A2 is NH2, NH(amide protecting group), N(amide protecting group), OH, O(carboxylate protecting group), a natural or unnatural amino acid, a plurality of natural amino acids or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, a protein, an antibody, or an antibody fragment;
A3, A4, and A5 are selected from the group consisting of a natural amino acid, an unnatural amino acid, and a plurality of natural amino acids or unnatural amino acids;
Y is S or Se;
R1 is H, alkyl, arylalkyl, acyl, aryl, alkoxycarbonyl, aryloxycarbonyl, a natural or unnatural amino acid, a plurality of natural amino acids or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, a protein, an antibody, or an antibody fragment;
M is Ni, Pd, Pt, Cu, or Au;
X is a halide, triflate, tetrafluoroborate, tetraarylborate, hexafluoroantimonate, bis(alkylsulfonyl)amide, tetrafluorophosphate, hexafluorophosphate, alkylsulfonate, haloalkylsulfonate, arylsulfonate, perchlorate, bis(fluoroalkylsulfonyl)amide, bis(arylsulfonyl)amide, (fluoroalkylsulfonyl)(fluoroalkyl-carbonyl)amide, nitrate, nitrite, sulfate, hydrogensulfate, alkyl sulfate, aryl sulfate, carbonate, bicarbonate, carboxylate, phosphate, hydrogen phosphate, dihydrogen phosphate, phosphinate, or hypochlorite;
L is independently for each occurrence a trialkylphosphine, a triarylphosphine, a dialkylarylphosphine, an alkyldiarylphosphine, an (alkenyl)(alkyl)(aryl)phosphine, an alkenyldiarylphosphine, an alkenyldialkylphosphine, a phosphine oxide, a bis(phosphine), a phosphoramide, a triarylphosphonate, an N-heterocyclic carbene, an optionally substituted phenanthroline, an optionally substituted iminopyridine, an optionally substituted 2,2′-bipyridine, an optionally substituted diimine, an optionally substituted triazolylpyridine, or an optionally substituted pyrazolyl pyridine;
is aryl, heteroaryl, alkenyl, or cycloalkenyl, wherein
is optionally further substituted by one or more substituents selected from halide, acyl, azide, isothiocyanate, alkyl, aralkyl, alkenyl, alkynyl or protected alkynyl, alkoxyl, arylcarbonyl, cycloalkyl, formyl, haloalkyl, hydroxyl, amino, nitro, sulfhydryl, amido, phosphonate, phosphinate, alkylthio, sulfonyl, sulfonamido, heterocyclyl, aryl, heteroaryl, —CF3, —CF2R7, —CFR72, —CN, polyethylene glycol, polyethylene imine, —(CH2)p-FG-R7, and Z;
Z is
—S-alkyl, —SH, —S—(CH2)n—CO2H, —SCH(CH3)—CO2H, or —SCH(CO2H)—CH2CO2H;
p is independently for each occurrence an integer from 0-10;
FG is independently for each occurrence selected from the group consisting of C(O), CO2, O(CO), C(O)NR7, NR7C(O), O, Si(R7)2, C(NR7), (R7)2N(CO)N(R7)2, OC(O)NR7, NR7C(O)O, and C(N═N);
R7 is independently for each occurrence selected from the group consisting of H, alkyl, cycloalkyl, aryl, aralkyl, alkenyl, and alkynyl;
n is an integer from 1-5; and
m is 1 or 2;
In certain embodiments of the compound comprising substructure VI,
is selected from the group consisting of
In certain embodiments, wherein A1 and A2 are independently a natural or unnatural amino acid, a plurality of natural or unnatural amino acids, a peptide, an oligopeptide, a polypeptide, or a protein.
In certain embodiments, A1 comprises arginine, histidine, lysine, aspartic acid, glutamic acid, serine, threonine, asparagine, glutamine, proline, tyrosine, or tryptophan.
In certain embodiments, A2 comprises arginine, histidine, lysine, aspartic acid, glutamic acid, serine, threonine, asparagine, glutamine, proline, tyrosine, or tryptophan.
In certain embodiments, A1 and A2 do not comprise cysteine or selenocysteine.
In certain embodiments, R1 is H.
In certain embodiments, the invention relates to a compound of formula IV:
wherein, independently for each occurrence,
M is Ni, Pd, Pt, Cu, or Au;
Ry is an optionally substituted bridging moiety, comprising an aromatic group, a heteroaromatic group, an alkene group, or a cycloalkene group;
y is 2, 3, 4, 5, or 6;
X is a halide, triflate, tetrafluoroborate, tetraarylborate, hexafluoroantimonate, bis(alkylsulfonyl)amide, tetrafluorophosphate, hexafluorophosphate, alkylsulfonate, haloalkylsulfonate, arylsulfonate, perchlorate, bis(fluoroalkylsulfonyl)amide, bis(arylsulfonyl)amide, (fluoroalkylsulfonyl)(fluoroalkyl-carbonyl)amide, nitrate, nitrite, sulfate, hydrogensulfate, alkyl sulfate, aryl sulfate, carbonate, bicarbonate, carboxylate, phosphate, hydrogen phosphate, dihydrogen phosphate, phosphinate, or hypochlorite;
L is independently for each occurrence a trialkylphosphine, a triarylphosphine, a dialkylarylphosphine, an alkyldiarylphosphine, an (alkenyl)(alkyl)(aryl)phosphine, an alkenyldiarylphosphine, an alkenyldialkylphosphine, a phosphine oxide, a bis(phosphine), a phosphoramide, a triarylphosphonate, an N-heterocyclic carbene, an optionally substituted phenanthroline, an optionally substituted iminopyridine, an optionally substituted 2,2′-bipyridine, an optionally substituted diimine, an optionally substituted triazolylpyridine, or an optionally substituted pyrazolyl pyridine; and
m is 1 or 2.
In certain embodiments, the invention relates to any one of the compounds described herein, wherein Ry is an optionally substituted bifunctional bridging moiety or an optionally substituted trifunctional bridging moiety.
In certain embodiments, the invention relates to any one of the compounds described herein, wherein Ry comprises an aromatic group.
In certain embodiments, the invention relates to any one of the compounds described herein, wherein Ry is optionally substituted
In certain embodiments, the invention relates to any one of the compounds described herein, wherein y is 2; and Ry is selected from the group consisting of
wherein any of the bifunctional bridging moieties may be optionally substituted.
In certain embodiments, the invention relates to any one of the compounds described herein, wherein y is 3; and Ry is selected from the group consisting of
wherein any of the trifunctional bridging moieties may be optionally substituted.
The invention also provides methods of functionalizing (e.g., arylating) a thiol or selenol (e.g., as in the representative reaction represented by Scheme 6) using a metal precatalyst in conjunction with Ar1X (e.g., an aryl halide) reagent.
In certain embodiments, precatalysts exhibit the advantageous property of air stability. Exemplary precatalysts include Ph-mesylate palladium precatalysts (e.g., 2-amino biphenyl Pd species, such as the second generation Buchwald catalyst).
In embodiments of the reaction represented by Scheme 6,
Ar1 is optionally substituted aryl, heteroaryl, alkenyl, or cycloalkenyl;
X is a halide, triflate, tetrafluoroborate, tetraarylborate, hexafluoroantimonate, bis(alkylsulfonyl)amide, tetrafluorophosphate, hexafluorophosphate, alkylsulfonate, haloalkylsulfonate, arylsulfonate, perchlorate, bis(fluoroalkylsulfonyl)amide, bis(arylsulfonyl)amide, (fluoroalkylsulfonyl)(fluoroalkyl-carbonyl)amide, nitrate, nitrite, sulfate, hydrogensulfate, alkyl sulfate, aryl sulfate, carbonate, bicarbonate, carboxylate, phosphate, hydrogen phosphate, dihydrogen phosphate, phosphinate, or hypochlorite;
R10 represents H, alkyl, alkenyl, alkynyl, cycloalkyl, cycloalkylalkyl, aralkyl, aryl, heteroaralkyl, or heteroaryl; and
A1, A2, R1, Y, L, M, m, and n are as defined for Scheme 1.
In certain embodiments, the invention relates to a hybrid composition, wherein the hybrid composition comprises a linker, a compound of substructure III, and a detectable moiety; and the linker links the compound to the detectable moiety.
In certain embodiments, the invention relates to any one of the aforementioned hybrid compositions, wherein the detectable moiety is a fluorescent moiety, a dye moiety, a radionuclide, a drug molecule, an epitope, or an MRI contrast agent.
In certain embodiments, the invention relates to a hybrid composition, wherein the hybrid composition comprises a linker, a compound of substructure III, and a biomolecule; and the linker links the compound to the biomolecule.
In certain embodiments, the invention relates to any one of the aforementioned hybrid compositions, wherein the biomolecule is a protein.
In certain embodiments, the invention relates to any one of the aforementioned hybrid compositions, wherein the protein is an antibody.
In certain embodiments, the invention relates to any one of the aforementioned hybrid compositions, wherein the biomolecule is DNA, RNA, or peptide nucleic acid (PNA).
In certain embodiments, the invention relates to any one of the aforementioned hybrid compositions, wherein the biomolecule is siRNA.
In certain embodiments, the invention relates to a hybrid composition, wherein the hybrid composition comprises a linker, a compound of substructure III, and a polymer; and the linker links the compound to the polymer.
In certain embodiments, the invention relates to any one of the aforementioned hybrid compositions, wherein the polymer is polyethylene glycol.
In certain embodiments, the invention relates to any one of the hybrid compositions described herein.
In certain embodiments, the invention relates to a method to generate a peptide, an oligopeptide, a polypeptide, or a protein, wherein the peptide, oligopeptide, polypeptide, or protein comprises substructure III.
In certain embodiments, the invention relates to a peptide, an oligopeptide, a polypeptide, or a protein, wherein the peptide, oligopeptide, polypeptide, or protein comprises a plurality of substructures comprising substructure III.
In certain embodiments, the invention relates to any one of the peptides, oligopeptides, polypeptides, or proteins described herein.
In certain embodiments, the invention relates to a method to generate a peptide, an oligopeptide, a polypeptide, or a protein, wherein the peptide, oligopeptide, polypeptide, or protein comprises substructure V.
In certain embodiments, the invention relates to a peptide, an oligopeptide, a polypeptide, or a protein, wherein the peptide, oligopeptide, polypeptide, or protein comprises a plurality of substructures comprising substructure V.
In certain embodiments, the invention relates to a method to generate a peptide, an oligopeptide, a polypeptide, or a protein, wherein the peptide, oligopeptide, polypeptide, or protein comprises substructure VI.
In certain embodiments, the invention relates to a peptide, an oligopeptide, a polypeptide, or a protein, wherein the peptide, oligopeptide, polypeptide, or protein comprises a plurality of substructures comprising substructure VI.
In certain embodiments, the invention relates to a peptide, an oligopeptide, a polypeptide, or a protein, or a method involving the peptide, oligopeptides, polypeptide, or protein, described in US published patent application publication number US 2014/0113871, which is hereby incorporated by reference in its entirety.
Antibody-drug conjugates (ADCs) are an emerging class of anti-cancer therapeutics. Highly cytotoxic small molecule drugs are conjugated to antibodies to create a single molecular entity. ADCs combine the high efficacy of small molecules with the target specificity of antibodies to enable the selective delivery of drug payloads to cancerous tissues, which reduces the systematic toxicity of conventional small molecule drugs.
Traditionally, ADCs are prepared by conjugating small molecule drugs to either cysteines generated from reducing an internal disulfide bond or surface-exposed lysines. Because multiple lysines and cysteines are present in antibodies, these conventional approaches usually lead to heterogeneous products with undefined drug-antibody ratio, which might cause difficulty for manufacturing and characterization. Furthermore, each individual antibody-drug conjugate may exhibit different pharmacokinetics, efficacy, and safety profiles, hindering a rational approach to optimizing ADC-based cancer treatment.
Recent studies showed that ADCs prepared using site-specific conjugation techniques exhibited improved pharmacological profiles.
So, in certain embodiments, the invention relates to an ADC with defined position of drug-attachment and defined drug to antibody ratio. In certain embodiments, the ADCs of the invention permit rational optimization of ADC-based therapies. In certain embodiments, the ADC comprises a structure of any one of the compounds generated by the methods described herein. In certain embodiments, the drug-to-antibody ratio is about 2:1, about 3:1, about 4:1, about 5:1, about 6:1, about 7:1, about 8:1, about 9:1, about 10:1, about 11:1, or about 12:1.
In certain embodiments, the invention relates to any one of the ADCs mentioned herein, comprising monomethyl auristatin E (MMAE) covalently conjugated to an antibody, wherein the antibody targets a cell surface receptor that is over-expressed in a cancer cell. MMAE is a highly toxic antimitotic agent that inhibits cell division by blocking tubulin polymerization. MMAE has been successfully conjugated to antibodies targeting human CD30 to create ADCs that have been approved by FDA to treat Hodgkin lymphoma as well as anaplastic large-cell lymphoma. In certain embodiments, the invention relates to a method for the selective synthesis of an ADC comprising MMAE covalently conjugated to an antibody.
In certain embodiments, the invention relates to any one of the ADCs mentioned herein, wherein the antibody targets cell receptors CD30, CD22, CD33, human epidermal growth factor receptor 2 (HER2), or epidermal growth factor receptor (EGFR). It should be noted that by conjugating drugs to antibodies targeting different receptors, the ADCs prepared should be useful for treating different cancers.
For convenience, before further description of the present invention, certain terms employed in the specification, examples, and appended claims are collected here.
The articles “a” and “an” are used herein to refer to one or to more than one (i.e., to at least one) of the grammatical object of the article. By way of example, “an element” means one element or more than one element.
The term “heteroatom” is art-recognized and refers to an atom of any element other than carbon or hydrogen. Illustrative heteroatoms include boron, nitrogen, oxygen, phosphorus, sulfur and selenium.
The term “alkyl” refers to the radical of saturated aliphatic groups, including straight-chain alkyl groups, branched-chain alkyl groups, cycloalkyl (alicyclic) groups, alkyl substituted cycloalkyl groups, and cycloalkyl substituted alkyl groups. In preferred embodiments, a straight chain or branched chain alkyl has 30 or fewer carbon atoms in its backbone (e.g., C1-C30 for straight chain, C3-C30 for branched chain), and more preferably 20 or fewer. Likewise, preferred cycloalkyls have from 3-10 carbon atoms in their ring structure, and more preferably have 5, 6 or 7 carbons in the ring structure.
Unless the number of carbons is otherwise specified, “lower alkyl” as used herein means an alkyl group, as defined above, but having from one to ten carbons, more preferably from one to six carbon atoms in its backbone structure. Likewise, “lower alkenyl” and “lower alkynyl” have similar chain lengths but with at least two carbon atoms. Preferred alkyl groups are lower alkyls. In preferred embodiments, a substituent designated herein as alkyl is a lower alkyl.
The term “aralkyl”, as used herein, means an aryl group, as defined herein, appended to the parent molecular moiety through an alkyl group, as defined herein. Representative examples of arylalkyl include, but are not limited to, benzyl, 2-phenylethyl, 3-phenylpropyl, and 2-naphth-2-ylethyl.
The term “alkoxy” means an alkyl group, as defined herein, appended to the parent molecular moiety through an oxygen atom. Representative examples of alkoxy include, but are not limited to, methoxy, ethoxy, propoxy, 2-propoxy, butoxy, tert-butoxy, pentyloxy, and hexyloxy.
The term “alkoxycarbonyl” means an alkoxy group, as defined herein, appended to the parent molecular moiety through a carbonyl group, represented by —C(═O)—, as defined herein. Representative examples of alkoxycarbonyl include, but are not limited to, methoxycarbonyl, ethoxycarbonyl, and tert-butoxycarbonyl.
The term “carboxy” as used herein, means a —CO2H group.
The term “alkylthio” as used herein, means an alkyl group, as defined herein, appended to the parent molecular moiety through a sulfur atom. Representative examples of alkylthio include, but are not limited, methylthio, ethylthio, tert-butylthio, and hexylthio. The terms “arylthio,” “alkenylthio” and “arylakylthio,” for example, are likewise defined.
The term “amido” as used herein, means —NHC(═O)—, wherein the amido group is bound to the parent molecular moiety through the nitrogen. Examples of amido include alkylamido such as CH3C(═O)N(H)— and CH3CH2C(═O)N(H)—.
The term “aryl” as used herein includes 5-, 6- and 7-membered aromatic groups that may include from zero to four heteroatoms, for example, benzene, naphthalene, anthracene, pyrene, pyrrole, furan, thiophene, imidazole, oxazole, thiazole, triazole, pyrazole, pyridine, pyrazine, pyridazine and pyrimidine, and the like. Those aryl groups having heteroatoms in the ring structure may also be referred to as “aryl heterocycles” or “heteroaromatics”. The aromatic ring can be substituted at one or more ring positions with such substituents as described above, for example, halogen, azide, alkyl, aralkyl, alkenyl, alkynyl, cycloalkyl, hydroxyl, amino, nitro, sulfhydryl, imino, amido, phosphonate, phosphinate, carbonyl, carboxyl, silyl, ether, alkylthio, sulfonyl, sulfonamido, ketone, aldehyde, ester, heterocyclyl, aromatic or heteroaromatic moieties, —CF3, —CN, or the like. The term “aryl” also includes polycyclic ring systems having two or more cyclic rings in which two or more carbons are common to two adjoining rings (the rings are “fused rings”) wherein at least one of the rings is aromatic, e.g., the other cyclic rings can be cycloalkyls, cycloalkenyls, cycloalkynyls, aryls and/or heterocyclyls.
The abbreviations Me, Et, Ph, Tf, Nf, Ts, Ms, and dba represent methyl, ethyl, phenyl, trifluoromethanesulfonyl, nonafluorobutanesulfonyl, p-toluenesulfonyl, methanesulfonyl, and dibenzylideneacetone, respectively. Also, “DCM” stands for dichloromethane; “rt” stands for room temperature, and may mean about 20° C., about 21° C., about 22° C., about 23° C., about 24° C., about 25° C., or about 26° C.; “THF” stands for tetrahydrofuran; “BINAP” stands for 2,2′-bis(diphenylphosphino)-1,1′-binaphthyl; “dppf” stands for 1,1′-bis(diphenylphosphino)ferrocene; “dppb” stands for 1,4-bis(diphenylphosphinobutane; “dppp” stands for 1,3-bis(diphenylphosphino)propane; “dppe” stands for 1,2-bis(diphenylphosphino)ethane. A more comprehensive list of the abbreviations utilized by organic chemists of ordinary skill in the art appears in the first issue of each volume of the Journal of Organic Chemistry; this list is typically presented in a table entitled Standard List of Abbreviations. The abbreviations contained in said list, and all abbreviations utilized by organic chemists of ordinary skill in the art are hereby incorporated by reference.
The terms ortho, meta and para apply to 1,2-, 1,3- and 1,4-disubstituted benzenes, respectively. For example, the names 1,2-dimethylbenzene and ortho-dimethylbenzene are synonymous.
The terms “heterocyclyl” or “heterocyclic group” refer to 3- to 10-membered ring structures, more preferably 3- to 7-membered rings, whose ring structures include one to four heteroatoms. Heterocycles can also be polycycles. Heterocyclyl groups include, for example, thiophene, thianthrene, furan, pyran, isobenzofuran, chromene, xanthene, phenoxathiin, pyrrole, imidazole, pyrazole, isothiazole, isoxazole, pyridine, pyrazine, pyrimidine, pyridazine, indolizine, isoindole, indole, indazole, purine, quinolizine, isoquinoline, quinoline, phthalazine, naphthyridine, quinoxaline, quinazoline, cinnoline, pteridine, carbazole, carboline, phenanthridine, acridine, pyrimidine, phenanthroline, phenazine, phenarsazine, phenothiazine, furazan, phenoxazine, pyrrolidine, oxolane, thiolane, oxazole, piperidine, piperazine, morpholine, lactones, lactams such as azetidinones and pyrrolidinones, sultams, sultones, and the like. The heterocyclic ring can be substituted at one or more positions with such substituents as described above, as for example, halogen, alkyl, aralkyl, alkenyl, alkynyl, cycloalkyl, hydroxyl, amino, nitro, sulfhydryl, imino, amido, phosphonate, phosphinate, carbonyl, carboxyl, silyl, ether, alkylthio, sulfonyl, ketone, aldehyde, ester, a heterocyclyl, an aromatic or heteroaromatic moiety, —CF3, —CN, or the like.
The term “non-coordinating anion” relates to a negatively charged moiety that interacts weakly with cations. Non-coordinating anions are useful in studying the reactivity of electrophilic cations, and are commonly found as counterions for cationic metal complexes with an unsaturated coordination sphere. In many cases, non-coordinating anions have a negative charge that is distributed symmetrically over a number of electronegative atoms. Salts of these anions are often soluble non-polar organic solvents, such as dichloromethane, toluene, or alkanes.
The terms “polycyclyl” or “polycyclic group” refer to two or more rings (e.g., cycloalkyls, cycloalkenyls, cycloalkynyls, aryls and/or heterocyclyls) in which two or more carbons are common to two adjoining rings, e.g., the rings are “fused rings”. Rings that are joined through non-adjacent atoms are termed “bridged” rings. Each of the rings of the polycycle can be substituted with such substituents as described above, as for example, halogen, alkyl, aralkyl, alkenyl, alkynyl, cycloalkyl, hydroxyl, amino, nitro, sulfhydryl, imino, amido, phosphonate, phosphinate, carbonyl, carboxyl, silyl, ether, alkylthio, sulfonyl, ketone, aldehyde, ester, a heterocyclyl, an aromatic or heteroaromatic moiety, —CF3, —CN, or the like.
The term “heteroatom” as used herein means an atom of any element other than carbon or hydrogen. Preferred heteroatoms are nitrogen, oxygen, sulfur and phosphorous.
As used herein, the term “nitro” means —NO2; the term “halogen” or “halo” designates —F, —Cl, —Br or —I; the term “sulfhydryl” means —SH; the term “hydroxyl” means —OH; the term “sulfonyl” means —SO2—; and the term “cyano” as used herein, means a —CN group.
The term “haloalkyl” means at least one halogen, as defined herein, appended to the parent molecular moiety through an alkyl group, as defined herein. Representative examples of haloalkyl include, but are not limited to, chloromethyl, 2-fluoroethyl, trifluoromethyl, pentafluoroethyl, and 2-chloro-3-fluoropentyl.
The terms “amine” and “amino” are art recognized and refer to both unsubstituted and substituted amines, e.g., a moiety that can be represented by the general formula:
wherein R9, R10 and R′10 each independently represent a hydrogen, an alkyl, an alkenyl, —(CH2)m—R8, or R9 and R10 taken together with the N atom to which they are attached complete a heterocycle having from 4 to 8 atoms in the ring structure; R8 represents an aryl, a cycloalkyl, a cycloalkenyl, a heterocycle or a polycycle; and m is zero or an integer in the range of 1 to 8. In preferred embodiments, only one of R9 or R10 can be a carbonyl, e.g., R9, R10 and the nitrogen together do not form an imide. In even more preferred embodiments, R9 and R10 (and optionally R′10) each independently represent a hydrogen, an alkyl, an alkenyl, or —(CH2)m—R8. Thus, the term “alkylamine” as used herein means an amine group, as defined above, having a substituted or unsubstituted alkyl attached thereto, i.e., at least one of R9 and R10 is an alkyl group.
The definition of each expression, e.g., alkyl, m, n, and the like, when it occurs more than once in any structure, is intended to be independent of its definition elsewhere in the same structure.
The terms triflyl (-Tf), tosyl (-Ts), mesyl (-Ms), and nonaflyl are art-recognized and refer to trifluoromethanesulfonyl, p-toluenesulfonyl, methanesulfonyl, and nonafluorobutanesulfonyl groups, respectively. The terms triflate (-OTf), tosylate (-OTs), mesylate (-OMs), and nonaflate are art-recognized and refer to trifluoromethanesulfonate ester, p-toluenesulfonate ester, methanesulfonate ester, and nonafluorobutanesulfonate ester functional groups and molecules that contain said groups, respectively.
The phrase “protecting group” as used herein means temporary modifications of a potentially reactive functional group which protect it from undesired chemical transformations. Examples of such protecting groups include silyl ethers of alcohols, and acetals and ketals of aldehydes and ketones, respectively. In embodiments of the invention, a carboxylate protecting group masks a carboxylic acid as an ester. In certain other embodiments, an amide is protected by an amide protecting group, masking the —NH2 of the amide as, for example, —NH(alkyl), or —N(alkyl)2. The field of protecting group chemistry has been reviewed (Greene, T. W.; Wuts, P. G. M. Protective Groups in Organic Synthesis, 2nd ed.; Wiley: New York, 1991).
It will be understood that “substitution” or “substituted with” includes the implicit proviso that such substitution is in accordance with permitted valence of the substituted atom and the substituent, and that the substitution results in a stable compound, e.g., which does not spontaneously undergo transformation such as by rearrangement, cyclization, elimination, etc.
As used herein, the term “substituted” is contemplated to include all permissible substituents of organic compounds. In a broad aspect, the permissible substituents include acyclic and cyclic, branched and unbranched, carbocyclic and heterocyclic, aromatic and nonaromatic substituents of organic compounds. Illustrative substituents include, for example, those described hereinabove. The permissible substituents can be one or more and the same or different for appropriate organic compounds. For purposes of this invention, the heteroatoms, such as nitrogen, may have hydrogen substituents and/or any permissible substituents of organic compounds described herein which satisfy the valencies of the heteroatoms.
A “polar protic solvent” as used herein is a solvent having a dipole moment of about 1.4 to 4.0 D, and comprising a chemical moiety that participates in hydrogen bonding, such as an O—H bond or an N—H bond. Exemplary polar protic solvents include methanol, ethanol, n-propanol, isopropanol, n-butanol, isobutanol, ammonia, water, and acetic acid.
A “polar aprotic solvent” as used herein means a solvent having a dipole moment of about 1.4 to 4.0 D that lacks a hydrogen bonding group such as O—H or N—H. Exemplary polar aprotic solvents include acetone, N,N-dimethylformamide, acetonitrile, ethyl acetate, dichloromethane, tetrahydrofuran, and dimethylsulfoxide.
A “non-polar solvent” as used herein means a solvent having a low dielectric constant (<5) and low dipole moment of about 0.0 to about 1.2. Exemplary nonpolar solvents include pentane, hexane, cyclohexane, benzene, toluene, chloroform, and diethyl ether.
For purposes of this invention, the chemical elements are identified in accordance with the Periodic Table of the Elements, CAS version, Handbook of Chemistry and Physics, 67th Ed., 1986-87, inside cover.
The invention may be understood with reference to the following examples, which are presented for illustrative purposes only and which are non-limiting. The substrates utilized in these examples were either commercially available, or were prepared from commercially available reagents.
Tris(2-carboxyethyl)phosphine hydrochloride (TCEP.HCl) was purchased from Hampton Research (Aliso Viejo, Calif.). 1-[Bis(dimethylamino)methylene]-1H-1,2,3-triazolo[4,5-b]pyridinium 3-oxid hexafluorophosphate (HATU), D-Biotin, Fmoc-Rink amide linker, Fmoc-L-Gly-OH, Fmoc-L-Leu-OH, Fmoc-L-Lys(Boc)-OH, Fmoc-L-Ala-OH, Fmoc-L-Cys(Trt)-OH, Fmoc-L-Gln(Trt)-OH, Fmoc-L-Asn(Trt)-OH, Fmoc-L-Glu(OtBu)-OH, Fmoc-L-Arg(Pbf)-OH, Fmoc-L-Phe-OH, Fmoc-L-Ser(tBu)-OH, Fmoc-L-Thr(tBu)-OH, Fmoc-L-Tyr(tBu)-OH, and Fmoc-L-His(Trt)-OH were purchased from Chem-Impex International (Wood Dale, Ill.). Aminomethyl polystyrene resin was prepared according to an in-house protocol.1 Peptide synthesis-grade N,N-dimethylformamide (DMF), dichloromethane (DCM), diethyl ether, HPLC-grade acetonitrile, and guanidine hydrochloride were obtained from VWR International (Philadelphia, Pa.). Aryl halides and aryl trifluoromethanesulfonates were purchased from Aldrich Chemical Co., Alfa Aesar, or Matrix Scientific and were used without additional purification. All deuterated solvents were purchased from Cambridge Isotopes and used without further purification. All other reagents were purchased from Sigma-Aldrich and used as received. Trastuzumab was a kind gift from Prof K. Dane Wittrup at MIT.
All reactions with peptides, proteins, and antibodies were set up on the bench top and carried out under ambient conditions. For procedures carried out in the nitrogen-filled glovebox, the dry degassed THF was obtained by passage through activated alumina columns followed by purging with argon. Anhydrous pentane, cyclohexane, and acetonitrile were purchased from Aldrich Chemical Company in Sureseal® bottles and were purged with argon before use.
All small-molecule organic and organometallic compounds were characterized by 1H, 13C NMR, and IR spectroscopy, as well as elemental analysis (unless otherwise noted). 19F NMR spectroscopy was used for organometallic complexes containing a trifluoromethanesulfonate counterion. 31P NMR spectroscopy was used for characterization of palladium complexes. Copies of the 1H, 13C, 31P, and 19F NMR spectra can be found at the end of the Supporting Information. Nuclear Magnetic Resonance spectra were recorded on a Bruker 400 MHz instrument and a Varian 300 MHz instrument. Unless otherwise stated, all 1H NMR experiments are reported in 6 units, parts per million (ppm), and were measured relative to the signals of the residual proton resonances CH2Cl2 (5.32 ppm) or CH3CN (1.94 ppm) in the deuterated solvents. All 13C NMR spectra are measured decoupled from 1H nuclei and are reported in δ units (ppm) relative to CD2Cl2 (54.00 ppm) or CD3CN (118.69 ppm), unless otherwise stated. All 31P NMR spectra are measured decoupled from 1H nuclei and are reported relative to H3PO4 (0.00 ppm). 19F NMR spectra are measured decoupled from 1H nuclei and are reported in ppm relative to CFCl3 (0.00 ppm) or α,α,α-trifluorotoluene (˜63.72 ppm). All FT-IR spectra were recorded on a Thermo Scientific—Nicolet iS5 spectrometer (iD5 ATR—diamond). Elemental analyses were performed by Atlantic Microlabs Inc., Norcross, Ga.
LC-MS chromatograms and associated mass spectra were acquired using Agilent 6520 ESI-Q-TOF mass spectrometer. Solvent compositions used in the majority of experiments are 0.1% TFA in H2O (solvent A) and 0.1% TFA in acetonitrile (solvent B). The following LC-MS methods were used:
Method A
LC conditions: Zorbax SB C3 column: 2.1×150 mm, 5 m, column temperature: 40° C., gradient: 0-3 min 5% B, 3-22 min 5-95% B, 22-24 min 95% B, flow rate: 0.8 mL/min. MS conditions: positive electrospray ionization (ESI) extended dynamic mode in mass range 300-3000 m/z, temperature of drying gas=350° C., flow rate of drying gas=11 L/min, pressure of nebulizer gas=60 psi, the capillary, fragmentor, and octupole rf voltages were set at 4000, 175, and 750, respectively.
Method B
LC conditions: Zorbax SB C3 column: 2.1×150 mm, 5 m, column temperature: 40° C., gradient: 0-2 min 5% B, 2-11 min 5-65% B, 11-12 min 65% B, flow rate: 0.8 mL/min. MS conditions are same as Method A.
Method C
LC conditions: Zorbax SB C3 column: 2.1×150 mm, 5 m, column temperature: 40° C., gradient: gradient: 0-2 min 5% B, 2-10 min 5-95% B, 10-11 min 95% B, flow rate: 0.8 mL/min. MS conditions are same as Method A.
Data were processed using Agilent MassHunter software package. Deconvoluted masses of proteins were obtained using maximum entropy algorithm.
LC-MS data shown were acquired using Method A, unless otherwise noted; Y-axis in all chromatograms shown in supplementary figures represents total ion current (TIC); mass spectrum insets correspond to the integration of the TIC peak unless otherwise noted.
All reported yields were determined by integrating TIC spectra. First, the peak areas for all relevant peptide-containing species on the chromatogram were integrated using Agilent MassHunter software package. Since no peptide-based side products were generated in the experiments, the yields shown in Table 2 were determined as follows: % yield=Spr/Stotal where Spr is the peak area of the product and Stotal is the peak area of combined peptide-containing species (product and starting material). The yield of the stapled peptide (Example 19) was calculated as follows: % yield=k·Spr/Sst where Spr is the peak area of the reaction product, Sst is the peak area of a known amount of purified product, and k equals to the ratio of the known amount of standard divided by the initial amount of starting material. For peptide stability experiments the conversion was calculated as following: % remaining peptide=St/S0 where St is the peak area of the corresponding cysteine conjugate at time t, and S0 is the peak area of the cysteine conjugate at time 0.
A series of Pd(II) reagents were designed that were capable of selectively recognizing a Cys moiety and transferring an aryl group. These reagents feature a biaryl phosphine ligand, which confers a bulky steric and electron-rich environment at the metal center favoring facile oxidative addition of electrophilic substrates. For example, complexes 1a-b were isolated as air-stable solids and were conveniently synthesized from the Pd(0) precursor and a phosphine ligand in the presence of aryl triflate or chloride electrophiles, respectively (
Reaction between 1a and a unprotected model polypeptide 2 (
Control peptides lacking Cys residue or Sec residues were submitted to arylation conditions. For example, AKLTGF-NH(CH2C6F5) and VTLPSTF*GAS showed no conversion and/or decomposition, indicating that arylation occurs exclusively on the Cys or Sec residue. The LCMS trace for AKLTGF-NH(CH2C6F5) under arylation conditions is shown in
The Cys arylation described herein also operates in solvent mixtures containing water. Arylation experiments were conducted between a model peptide 4 (γ-Glu-Cys-Gly-Pro-Leu-Leu) and reagent 1a in 1:1 DMF:H2O and 2:1 H2O:MeCN mixtures, respectively. In both cases, selective transformation producing S-arylated peptide 5 (γ-Glu-CysTol-Gly-Pro-Leu-Leu) occurred within minutes suggesting very fast reaction kinetics (
To address functional group tolerance of this transformation, studies were conducted between several Pd-based triflate reagents and the unprotected peptide 6 (FRSNLYGCEKHKAT-NH2) featuring other common nucleophilic amino-acid residues such as OH (e.g., Tyr, Ser, Thr) and NH/NH2 (e.g., His, Lys, Arg). Arylation reactions were conducted in the presence of 0.1 M Tris at a pH of 8.5, a solvent system of 1:2 CH3CN:H2O for 5 minutes, unless noted otherwise. For all arylation agents examined (6a-f), selective and nearly quantitative S-arylation was observed irrespective of the nature of the Pd(II) reagent used (
The arylation chemistry was next evaluated using a model protein species containing a single Cys residue. DARPin protein with a single-point mutation incorporating a Cys residue on the N-terminus of the sequence chain was designed for these studies and expressed in E. coli (final amino-acid sequence: GGCGGSDLGKKLLEAARAGQDDEVRILMANGADVNAY DDNGVTPLHLAAFLGHLEIVEVLLKYGADVNAADSWGTTPLHLAATWGHLEIVEV LLKHGADVNAQDKFGKTAFDISIDNGNEDLAEILQKLN). A reaction between 50 uM protein with 5 equivalents of 1a resulted in a complete consumption of the starting material within 5 minutes. The resulting product mixture was analyzed by LC-MS confirming quantitative monoarylation of the protein.
Trypsin digestion followed by MS/MS analysis of the product mixture indicated that the modification occurred exclusively on the Cys residue further corroborating results obtained with the peptide substrates (vide supra). In addition to reagent 1a, Cys arylation was successfully performed using other reagents, including biotinylated and fluorescein-based species. For example, reaction between DARPin and the Pd(II) reagent containing fluorescein resulted in a quantitative formation of an S-labeled protein species (
Further studies were aimed at functionalization of native and non-native Cys residues in IgG antibodies. Specifically, two independent approaches were examined, where one can either functionalize native Cys moieties after partial antibody reduction or perform functionalization on the intact antibody containing single-point mutation with Cys or selenocysteine moieties on the main-chain terminus (
In a nitrogen-filled glovebox, an oven-dried scintillation vial (10 mL), which was equipped with a magnetic stir bar and fitted with a Teflon screwcap septum, was charged with RuPhos (66 mg, 0.14 mmol), 4-bromotoluene (24.2 mg, 0.14 mmol), and cyclohexane (1.0 mL). Solid (COD)Pd(CH2SiMe3)2 (50.0 mg, 0.13 mmol) was added rapidly in one portion and the resulting solution was stirred for 16 h at rt. After this time, pentane (3 mL) was added and the resulting mixture was placed into a −20° C. freezer for 3 h. The vial was then taken outside of the glovebox, and the resulting precipitate was filtered, washed with pentane (3×3 mL), and dried under reduced pressure to afford the oxidative addition complex (78.4 mg, 82%).
1H NMR (400 MHz, CD2Cl2) δ 7.61 (m, 2H), 7.43 (tt, J=7.5, 1.6 Hz, 1H), 7.37 (m, 1H), 6.91 (dd, J=8.2, 2.3 Hz, 2H), 6.86 (ddd, J=7.8, 3.1, 1.5 Hz, 1H), 6.76 (d, J=8.0 Hz, 2H), 6.64 (d, J=8.4 Hz, 2H), 4.60 (hept, J=6.1 Hz, 2H), 2.22 (s, 3H), 2.14 (m, 2H), 1.77 (m, 6H), 1.60 (m, 6H), 1.38 (d, J=6.0 Hz, 6H), 1.17 (m, 6H), 1.01 (d, J=6.0 Hz, 6H), 0.78 (m, 2H).
13C NMR (101 MHz, CD2Cl2) δ 159.42, 145.38, 145.20, 137.74, 137.70, 134.88, 134.18, 133.84, 133.11, 133.01, 132.94, 131.62, 131.56, 130.99, 130.97, 128.20, 126.81, 126.76, 112.44, 112.41, 107.88, 71.44, 34.40, 34.14, 28.73, 28.17, 28.15, 27.82, 27.69, 27.49, 27.46, 27.35, 26.60, 22.46, 21.93, 20.79 (observed complexity is due to C—P coupling).
31P NMR (121 MHz, CD2Cl2) δ 29.89.
Peptide P1 (4 μL, 150 μM, above), H2O (47 μL), organic solvent (1 μL) and the buffer (6 μL, 1 M) were combined in a 0.6 mL plastic Eppendorf tube and the resulting solution was mixed using a vortexer. A stock solution of the palladium complex (2 μL, 600 μM) in organic solvent was added in one portion, the reaction tube was vortexed to ensure proper reagent mixing and left at room temperature for 5 min. The reaction was quenched by the addition of 3-mercaptopropionic acid (6.3 μL, 0.05 μL/mL solution). After an additional 5 min the LCMS solution (60 μL) was added to the Eppendorf and the reaction mixture was analyzed by LCMS.
Final concentration of the reaction before quenching:
Peptide—10 μM,
Pd-complex—20 μM,
Tris buffer—100 mM;
CH3CN:H2O=5:95.
In a nitrogen-filled glovebox, an oven-dried scintillation vial (10 mL), which was equipped with a magnetic stir bar and fitted with a Teflon screwcap septum, was charged with RuPhos (139.4 mg, 0.30 mmol, 2.5 equiv), 4,4′-dichlorobenzophenone (30.0 mg, 0.12 mmol, 1 equiv) and cyclohexane (1.2 mL). Solid (COD)Pd(CH2SiMe3)2 (116.2 mg, 0.30 mmol, 2.5 equiv) was added rapidly in one portion and the resulting solution was stirred for 16 h at rt. After this time, pentane (3 mL) was added and the resulting mixture was placed into a −20° C. freezer for 3 h. The vial was then taken outside of the glovebox, and the resulting precipitate was filtered, washed with pentane (3×3 mL), and dried under reduced pressure to afford the oxidative addition complex.
1H NMR (400 MHz, CD2Cl2) δ 7.64 (m, 4H), 7.45 (m, 2H), 7.39 (m, 2H), 7.32 (d, J=8.0 Hz, 4H), 7.25 (dd, J=8.4, 2.1 Hz, 4H), 6.88 (ddd, J=7.7, 3.1, 1.3 Hz, 2H), 6.65 (d, J=8.5 Hz, 4H), 4.64 (hept, J=6.1 Hz, 4H), 2.14 (m, 4H), 1.70 (m, 24H), 1.39 (d, J=6.0 Hz, 12H), 1.20 (m, 12H), 1.02 (d, J=6.0 Hz, 12H), 0.75 (m, 4H).
13C NMR (101 MHz, CD2Cl2) δ 197.01, 159.78, 149.09, 145.47, 145.30, 137.25, 137.21, 135.49, 134.06, 133.94, 133.58, 133.06, 132.95, 131.55, 131.23, 131.21, 128.34, 126.98, 126.92, 111.50, 107.69, 71.53, 34.39, 34.12, 28.78, 28.32, 27.73, 27.59, 27.38, 27.27, 26.59, 22.44, 21.89 (observed complexity is due to C—P coupling).
31P NMR (121 MHz, CD2Cl2) δ 33.27.
Peptide (4 μL, 150 μM), H2O (23 μL), and Tris buffer (3 μL, 1 M, pH=7.5) were combined in a 0.6 mL plastic Eppendorf tube and the resulting solution was mixed using a vortexer. A stock solution of the palladium complex (30 μL, 40 μM) in CH3CN was added in one portion, the reaction tube was vortexed to ensure proper reagent mixing and left at room temperature for 10 min. The reaction was quenched by the addition of 3-mercaptopropionic acid (6.3 μL, 0.1 μL/mL solution). After an additional 5 min the LCMS solution (60 μL) was added to the Eppendorf and the reaction mixture was analyzed by LCMS.
Final concentration of the reaction before quenching:
peptide—10 μM,
OA—20 μM,
Tris buffer—100 mM;
CH3CN:H2O=1:1.
Conjugation protocol: Trastuzumab was partially reduced with TCEP on a 20-μL scale. Reaction conditions: 10 μM trastuzumab (˜1.5 mg/mL), 30 μM TCEP, 0.1 M Tris, pH 8.0, 37° C., 2 hours.
1 μL of 0.4 mM palladium-vandetanib complex dissolved in DMF was added to 20 μL of partially reduced antibody, the resulting mixture was left at room temperature for 30 minutes. See
LC-MS analysis: 20 μL of crude reaction mixture was quenched by addition of 1 μL of 4 mM mercaptopropionic acid. The resulting solution was left at room temperature for 5 minutes, and was then buffer exchanged into buffer P (20 mM Tris, 150 mM NaCl, pH 7.5) using a 10K spin concentrator. N-linked glycans were removed by addition of 1 μL of PNGase F (New England Biolabs) was added to 100 μg of antibody and incubation at 45° C. for 1 hour. The resulting solution was completely reduced by addition of 1/10 volume of 200 mM TCEP solution (pH 7.5) and incubation at 37° C. for 30 minutes before subjecting to LC-MS analysis. Based on this analysis, the drug-to-antibody ratio (DAR) was calculated to be about 5.5 (data not shown).
In a nitrogen-filled glovebox, an oven-dried scintillation vial (10 mL), which was equipped with a magnetic stir bar, was charged with RuPhos (1.1 equiv), Ar—X (1.1 equiv), and cyclohexane. Solid (COD)Pd(CH2SiMe3)2 (McAtee, J. R. Angew. Chem., Int. Ed. 51, 3663-3667 (2012)) (1 equiv) was added rapidly in one portion and the resulting solution was stirred for 16 h at rt. After this time, pentane (3 mL) was added and the resulting mixture was placed into a −20° C. freezer for 3 h. The vial was then taken outside of the glovebox, and the resulting precipitate was filtered, washed with pentane (3×3 mL), and dried under reduced pressure to afford the oxidative addition complex.
Following the general procedure, a mixture containing 4-chlorotoluene (17 μL, 0.14 mmol), RuPhos (66 mg, 0.14 mmol), and (COD)Pd(CH2SiMe3)2 (50 mg, 0.13 mmol) was stirred at rt in cyclohexane (1.5 mL) for 16 h. General work up afforded 1A-CI as a white solid (68.7 mg, 77%).
Following the general procedure, a mixture containing 4-bromotoluene (24.2 mg, 0.14 mmol), RuPhos (66.0 mg, 0.14 mmol), and (COD)Pd(CH2SiMe3)2 (50.0 mg, 0.13 mmol) was stirred at rt in cyclohexane (1 mL) for 16 h. General work up afforded 1A-Br as an off-white solid (78.4 mg, 82%).
Following the general procedure, a mixture containing 4-iodotoluene (61.7 mg, 0.28 mmol), RuPhos (131.9 mg, 0.28 mmol), and (COD)Pd(CH2SiMe3)2 (100.0 mg, 0.26 mmol) was stirred at rt in cyclohexane (1.5 mL) for 16 h. General work up afforded 1A-I as a bright yellow solid (180.0 mg, 89%).
Following the general procedure, a mixture containing 4-tolyl trifluoromethanesulfonate (100.0 mg, 0.42 mmol), RuPhos (194.0 mg, 0.42 mmol), and (COD)Pd(CH2SiMe3)2 (147.0 mg, 0.38 mmol) was stirred at rt in cyclohexane (1.5 mL) for 16 h. General work up afforded 1A-OTf as an off-white solid (270.0 mg, 88%).
Following the general procedure, a mixture containing 2-ethyl-6-methylpyridin-3-yl trifluoromethanesulfonate (76.0 mg, 0.28 mmol, Note: 2.2 equiv was used), RuPhos (66.0 mg, 0.141 mmol), and (COD)Pd(CH2SiMe3)2 (50.0 mg, 0.129 mmol) was stirred at rt in cyclohexane (0.75 mL) for 16 h. General work up afforded 1B as a light yellow solid (95.0 mg, 88%).
Following the general procedure, a mixture containing fluorescein monotrifluoromethanesulfonate (52.5 mg, 0.11 mmol, Note: used as the limiting reagent), RuPhos (66.0 mg, 0.14 mmol), and (COD)Pd(CH2SiMe3)2 (50.0 mg, 0.13 mmol) was stirred in THF (0.75 mL) at rt for 16 h using aluminum foil for light exclusion. General work up afforded 1C as a bright orange precipitate (107.5 mg, 92%).
Following the general procedure, a mixture containing 2-oxo-2H-chromen-6-yl trifluoromethanesulfonate (38.2 mg, 0.13 mmol, Note: 1.01 equiv was used), RuPhos (66.0 mg, 0.14 mmol), and (COD)Pd(CH2SiMe3)2 (50.0 mg, 0.13 mmol) was stirred at rt in THF (0.75 mL) for 16 h. General work up afforded 1D as a light yellow solid (103.3 mg, 93%).
Following the general procedure, a mixture containing aryl trifluoromethanesulfonate Si (100.0 mg, 0.21 mmol, Note: 1 equiv was used), RuPhos (109.8 mg, 0.24 mmol), and (COD)Pd(CH2SiMe3)2 (83.2 mg, 0.21 mmol) was stirred in THF (1.5 mL) at rt for 16 h. General work up afforded 1E as a light orange solid (179.0 mg, 80%).
Following the general procedure, a mixture containing 4-chlorobenzaldehyde (39.7 mg, 0.28 mmol), RuPhos (131.9 mg, 0.28 mmol), and (COD)Pd(CH2SiMe3)2 (100.0 mg, 0.26 mmol) was stirred in cyclohexane (1.5 mL) at rt for 16 h. General work up afforded 1F as a white solid (166.0 mg, 91%).
Following the general procedure, a mixture containing 4-chloroacetophenone (36.7 L, 0.28 mmol), RuPhos (131.9 mg, 0.28 mmol), and (COD)Pd(CH2SiMe3)2 (100.0 mg, 0.26 mmol) was stirred in cyclohexane (1.5 mL) at rt for 16 h. General work up afforded 1G as a white solid (187.1 mg, 80%).
Following the general procedure, a mixture containing 4-chlorobenzophenone (61.2 mg, 0.28 mmol), RuPhos (131.9 mg, 0.28 mmol), and (COD)Pd(CH2SiMe3)2 (100.0 mg, 0.26 mmol) was stirred in cyclohexane (1.5 mL) at rt for 16 h. General work up afforded 1H as a white solid (170.3 mg, 84%).
Following the general procedure, a mixture containing (4-chlorophenylethynyl)trimethylsilane (71.6 mg, 0.34 mmol), RuPhos (131.9 mg, 0.28 mmol), and (COD)Pd(CH2SiMe3)2 (100.0 mg, 0.26 mmol) was stirred in cyclohexane (1.5 mL) at rt for 16 h. General work up afforded 1I as a white solid (157.7 mg, 78%).
Following the general procedure, a mixture containing Vandetanib (61.7 mg, 0.13 mmol, Note: 1.01 equiv was used), RuPhos (66.0 mg, 0.14 mmol), and (COD)Pd(CH2SiMe3)2 (50.0 mg, 0.13 mmol) was stirred in THF (1.5 mL) at rt for 16 h.
General work up afforded 1J as an off-white solid (119.0 mg, 88%).
Following a slightly modified general procedure, a mixture of 4,4′-dichlorobenzophenone (30.0 mg, 0.12 mmol, 1 equiv), RuPhos (139.4 mg, 0.30 mmol, 2.5 equiv), and (COD)Pd(CH2SiMe3)2 (116.2 mg, 0.30 mmol, 2.5 equiv) was stirred in cyclohexane (1.2 mL) at rt for 16 h. General work up afforded 2A as a beige solid (146.8 mg, 88%).
Following the general procedure, a mixture of 4-chlorobenzonitrile (42.4 mg, 0.31 mmol), RuPhos (144.0 mg, 0.31 mmol), and (COD)Pd(CH2SiMe3)2 (100.0 mg, 0.26 mmol) was stirred in cyclohexane (1.5 mL) at rt for 16 h. General work up afforded 1-Benzonitrile as a white solid (186.4 mg, 99%).
Following the general procedure, a mixture containing 4-bromo-1,2,3,6-tetrahydro-1,1′-biphenyl (30.0 mg, 0.127 mmol), RuPhos (59.0 mg, 0.127 mmol), and (COD)Pd(CH2SiMe3)2 (44.7 mg, 0.115 mmol) was stirred in cyclohexane (0.75 mL) at rt for 16 h. General work up afforded 1-Vinyl as a yellow solid (80.0 mg, 86%).
Many of the exemplified cysteine conjugation reactions operate at nearly neutral to slightly basic pH values. Further evaluation of the reaction conditions using palladium reagents revealed quantitative conversion of the starting peptide to the corresponding S-aryl cysteine conjugate within a broad pH range (5.5-8.5) using common organic cosolvents (5% of DMF, DMSO, CH3CN) in various buffers. Remarkably, even in 0.1% TFA solution (pH 2.0) the reaction yielded 59% of the S-arylated product after 7 hours. The process was also compatible with the protein disulfide reducing agent tris(2-carboxyethyl)phosphine (TCEP) that has been shown to hamper bioconjugations by reacting with maleimide and α-haloacyl groups
100 μM
10 μM
8.5
8
7.5
Na
2
HPO
4/
NaH
2
PO
4
25 mM Tris
7
6.5
5.5
5.5
2.0
2.0
aOptimal conditions used for further substrate scope evaluation are highlighted in grey;
bReaction time: 10 min;
cReaction time: 7 h 20 min;
dReaction performed in the presence of TCEP (20 μM);
ePeptide P1-Ser was used as the control.
The palladium mediated conjugation is fast, with complete product formation occurring within 15 seconds at 4° C. The reaction rate was estimated by competition experiments against the commonly used N-methyl maleimide cysteine ligation. (Gorin, G., et al. Arch. Biochem. Biophys. 115, 593-597 (1966)). At pH 7.5, the rate of the palladium-mediated reaction was comparable to that of the maleimide ligation, where 70% of the products resulted from the reaction with palladium-tolyl complex (1A-OTf). Notably, the palladium-mediated conjugation outperformed the maleimide ligation at pH 5.5, at which only the arylated product was formed.
The optimized conditions (0.1 M Tris buffer, 5% CH3CN, pH 7.5, room temperature) were used for further evaluation of the substrate scope (General Arylation Procedure A). Palladium complexes containing chloride, bromide and iodide counterions were all found to produce the desired product (1A-CI, 1A-Br, and 1A-I). This method can be used to functionalize unprotected peptides with a variety of important groups including fluorescent tags (1C, 1D), affinity labels (1E), bioconjugation handles (aldehyde 1F, ketone 1G, and alkyne 1H), photochemical crosslinkers (1I), as well as complex drug molecules (1J). Importantly, the palladium(II) complexes are stable under ambient conditions, and can be stored in closed vials under air at 4° C. for over four months. The “aged” reagents still exhibited reactivity comparable to the freshly made complexes.
General Arylation Procedure A.
Peptide P1 (4 μL, 150 μM in water), H2O (47 μL), organic solvent (1 μL), and the buffer (6 μL, 1 M) were combined in a 0.6 mL plastic Eppendorf tube and the resulting solution was mixed by vortexing for 10 s. A stock solution of the palladium complex (2 μL, 600 μM) in organic solvent was added in one portion, the reaction tube was vortexed to ensure proper reagent mixing and left at room temperature for 5 min. The reaction was quenched by the addition of 3-mercaptopropionic acid (6.3 μL, 0.05 μL/mL solution in water, 3 equiv to the palladium complex). After an additional 5 min, a solvent mixture of (e.g., 50% A:50% B (v/v, 60 μL)) was added to the Eppendorf and the reaction mixture was analyzed by LC-MS.
Final concentrations of the reaction before quenching: peptide P1—10 μM, Pd-complex—20 μM, Buffer—100 mM; organic solvent: H2O=5:95.
The arylated peptide P1-A was synthesized according to general procedure A. Final conditions before quenching: peptide—10 μM, 1A-OTf—20 μM, 0.1 M Tris (pH 7.5), CH3CN:H2O=5:95.
Me Et
The arylated peptide P1-B was synthesized according to general procedure A. Final conditions before quenching: peptide—10 μM, 1B—20 μM, 0.1 M Tris (pH 7.5), CH3CN:H2O=5:95.
The arylated peptide P1-C was synthesized according to general procedure A. The reaction was quenched by the addition of 3-mercaptopropionic acid (12.5 μL, 0.05 μL/mL solution in water, 2 equiv to 1C). Final conditions before quenching: peptide—10 μM, 1C—30 μM, 0.1 M Tris (pH 7.5), CH3CN:H2O=5:95.
The arylated peptide P1-D was synthesized according to general procedure A. The reaction was quenched by the addition of 3-mercaptopropionic acid (6.3 μL, 0.05 μL/mL solution in water, 2 equiv to 1D). Final conditions before quenching: peptide—10 μM, 1D—30 μM, 0.1 M Tris (pH 7.5), CH3CN:H2O=5:95.
The arylated peptide P1-E was synthesized according to general procedure A. Final conditions before quenching: peptide—10 μM, 1E—20 μM, 0.1 M Tris (pH 7.5), CH3CN:H2O=5:95.
The arylated peptide P1-A was synthesized according to general procedure A. Final conditions before quenching: peptide—10 μM, 1A-X (X=Cl, Br, I)—20 μM, 0.1 M Tris (pH 7.5), CH3CN:H2O=5:95.
The arylated peptide P1-F was synthesized according to general procedure A. Final conditions before quenching: peptide—10 μM, 1F—20 μM, 0.1 M Tris (pH 7.5), CH3CN:H2O=5:95.
The arylated peptide P1-G was synthesized according to general procedure A. Final conditions before quenching: peptide—10 μM, 1G—20 μM, 0.1 M Tris (pH 7.5), CH3CN:H2O=5:95.
The arylated peptide P1-H was synthesized according to general procedure A. Final conditions before quenching: peptide—10 μM, 1H—20 μM, 0.1 M Tris (pH 7.5), CH3CN:H2O=5:95.
The arylated peptide P1-I was synthesized according to general procedure A. The reaction was quenched by the addition of 3-mercaptopropionic acid (6.3 μL, 0.05 μL/mL solution in water, 1 equiv to 1I). Final conditions before quenching: peptide—10 μM, 1I—60 μM, 0.1 M Tris (pH 7.5), CH3CN:H2O=5:95.
The arylated peptide P1-J was synthesized according to general procedure A. Final conditions before quenching: peptide—10 μM, 1J—20 μM, 0.1 M Tris (pH 7.5), CH3CN:H2O=5:95.
The vinylated peptide P1-Vinyl was synthesized according to general procedure A. The reaction was quenched by the addition of 3-mercaptopropionic acid (6.3 μL, 0.05 μL/mL solution in water, 1.5 equiv to 1-Vinyl). Final conditions before quenching: peptide—10 μM, 1-Vinyl—40 μM, 0.1 M Tris (pH 7.5), CH3CN:H2O=5:95.
The stability of the arylated peptides was compared to that of conjugates formed from reactions with reagents including N-ethyl maleimide, 2-bromoacetamide, and benzyl bromide. The S-arylated peptide was shown to be stable toward acids, bases, and external thiol nucleophiles. In contrast, the corresponding acetamide derivative was unstable under acidic and basic conditions and the maleimide conjugate decomposed in the presence of base and exogenous thiol. Finally, comparable stability of both aryl and benzyl conjugates to treatment with the periodic acid oxidant at 37° C. was observed.
Peptide P1 conjugates were pre-dissolved in water in plastic Eppendorfs to afford the 1.11 mM stock solutions used in the stability evaluation experiments. For each experiment, the corresponding cysteine conjugate (1.11 mM; 18 μL) and stability test reagent (2 μL, 50 mM in H2O or 50 mM in 1M Tris, pH 7.4) were combined in a plastic Eppendorf and left at rt for 2 days, followed by 4 days at 37° C. After this time, individual reactions were quenched with a solution of 50% A:50% B (v/v, 200 μL) and the resulting samples were analyzed by LC-MS.
Stability test reagent: K2CO3 (2 μL, 50 mM in H2O);
Final conditions before quenching: 1 mM peptide, 5 mM K2CO3; 2 d at rt, then 4 d at 37° C.
Stability test reagent: HCl (2 μL, 1 M in H2O);
Final conditions before quenching: 1 mM peptide, 0.1 M HCl; 2 d at rt, then 4 d at 37° C.
Stability test reagent: Glutathione (2 μL, 50 mM in 1 M Tris; pH 7.4);
Final conditions before quenching: 1 mM peptide, 5 mM GSH, 0.1M Tris, pH 7.4; 2 d at rt, then 4 d at 37° C.
Additional tuning of the electronic properties of the aromatic ring of the arylated peptide by installing a para-electron withdrawing cyano-group could be achieved. This modification significantly decreased the amount of oxidation producing the most stable peptides across all the evaluated conjugates. Notably, installing the para cyano-group in the benzyl conjugates did not have any effect toward oxidation.
Peptide P2 conjugates were pre-dissolved in water in plastic Eppendorfs to afford the 111.1 μM stock solutions used in the oxidation stability evaluation experiments. The corresponding cysteine conjugates (18 μL, 111.1 μM in H2O) and H5IO6 (2 μL, 4 mM in H2O) were then combined in a plastic Eppendorf, mixed using a vortexer and transferred into a pre-heated water bath at 37° C. Individual reactions were quenched with Na2SO3 (20 L, 4 mM in H2O) after 10 min, 30 min, 1 h, 2 h, 4 h, and 6 h, and the resulting mixtures were kept at rt for an additional 10 min. Subsequently, a solution of 50% A:50% B (v/v, 160 μL) was added and the resulting samples were analyzed by LC-MS (
This reaction was explored with proteins. Three antibody mimetic proteins (P4-P6) were expressed that contained a cysteine at structurally distinct positions including the N-terminus, C-terminus, and a loop. The same proteins without cysteine were used as controls to confirm the selectivity of the reaction (P7-P9). All three proteins (P4-P6) were quantitatively tagged with either coumarin (
Protein Labeling
To a solution of protein (500 pmoles) in 475 μL of 20 mM Tris and 150 mM NaCl buffer (pH 7.5) was added palladium-coumarin complex 1D or palladium-drug complex 1J (25 μL, 200 μM) in DMF. The solution was pipetted up and down 20 times to ensure proper reagent mixing. The reaction mixture was left at room temperature for 30 min. After this time, the reaction was quenched by the addition of 3-mercaptopropionic acid (25 μL, 2 mM) dissolved in 20 mM Tris and 150 mM NaCl buffer (pH 7.5). After an additional 5 min at rt, 500 μL of 1:1 CH3CN/H2O (v/v) containing 0.2% TFA was added and the resulting mixture was analyzed by LC-MS.
Haloarylated peptides (i.e., containing an aryl-halide bond) can undergo further cross-coupling reaction with external thiols to generate arylated peptides with additional complexity. As demonstrated in
The stapled peptides discussed herein can also be generated by an alternative non-symmetric process. A monopalladium haloarylation reagent (i.e., a reagent containing an aryl halide bond) has undergone reaction with a cysteine-containing peptide. After this first cross coupling reaction step, a secondary cross coupling reaction with the catalyst at a second cysteine residue in the peptide yielded the target stapled peptide product (
Air-stable Ph-mesylate palladium precatalysts (e.g., 2-amino biphenyl Pd species such as the second generation Buchwald catalyst) can also be used as catalysts for the biomolecule arylation reaction. When used in conjunction with an aryl halide reagent, these precatalysts generated arylated peptide products (
All of the U.S. patents and U.S. patent application publications cited herein are hereby incorporated by reference.
Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents to the specific embodiments of the invention described herein. Such equivalents are intended to be encompassed by the following claims.
This application is the U.S. national phase of International Patent Application No. PCT/US2015/040495, filed Jul. 15, 2015, which claims the benefit of priority to U.S. Patent Application Ser. Nos. 62/024,769, filed Jul. 15, 2014; and 62/091,720, filed Dec. 15, 2014, the contents of which are hereby incorporated by reference in their entities.
This invention was made with Government support under Grant Nos. GM046059 and GM101762 awarded by the National Institutes of Health. The government has certain rights in the invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US15/40495 | 7/15/2015 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62091720 | Dec 2014 | US | |
62024769 | Jul 2014 | US |