Mucus is one of the human body's primary defenses against pathogens. Mucus is built from proteins with hydrophobic and hydrophilic domains, where the hydrophobic domains associate together to form physical gels. Size exclusion and ionic repulsion restrict most molecules such as viruses from penetrating the gel, see
However, acquiring large quantities of mucus, such as mucins, can be challenging. A common source of mucins is obtaining them by scraping pig stomachs; however, this process typically yields mucins on the order of micrograms. Further, mucins are high molecular weight polymers that range from several hundred thousand to several million Daltons. The high molecular weight and glycosylation of mucins make them very challenging to synthesize via molecular biology. In nature, mucins are synthesized as a lightly glycosylated, thiol-reduction-resistant precursor in the golgi apparatus. This precursor is subsequently glycosylated in the endoplasmic reticulum and golgi apparatus and then further modified after secretion outside of the cell. Known mucin mimics are typically synthesized either as short glycosylated oligomers that are polymerized resulting in low-molecular weight mimics or expressed as proteins that are glycosylated with expensive enzymes resulting in a low degree of glycosylation.
Provided herein is a series of brush proteins that mimics the variable number of tandem repeats (VNTR) of respiratory mucins that have the capability to be chain-extended through disulfide coupling and the use of a bioconjugation technique to post-translationally mass functionalize proteins. In nature, enzymes functionalize threonines and serines via glycosylation. As reproducing this process in vitro is both expensive and challenging, we used diazonium coupling based tyrosine modification chemistry that is orthogonal to cysteine based chain extension functionalization. Diazonium coupling is typically used to bioconjugate proteins on a single location. Here, we describe the use of this chemistry to mass functionalize a protein. This method of economically mimicking post-translational modification enables the production of high molecular weight and densely functionalized mucin mimetic materials.
There exists a need for mimics of mucins that retain their physical and functional characteristics. The proteins described herein provide a series of mucin mimics that are useful in developing and testing, for example, pharmaceutical drug metabolic properties.
Disclosed herein are polypeptides comprising a plurality of tandem repeats of a sequence comprised by a mucin selected from the group consisting of MUC1, MUC2, MUC4, MUC7, MUC5AC and MUC5B, wherein about 20% to about 50% of the serine and threonine amino acids in the tandem repeats have been replaced by tyrosine residues. In one embodiment, the mucin is MUC5AC or MUC5B. In another embodiment, about 33% to about 50% of the serine and threonine amino acids have been replaced by tyrosine residues. In some embodiments, the N-terminal and C-terminal amino acids are cysteine.
The polypeptide can be selected from the group consisting of MUC5ACL, MUC5ACH, MUC5BL, MUC5BH, MUC5ACL-S, MUC5ACL-LT, MUC5ACL+D, MUC5ACLS-15, pCoil-MUC5ACL-S, pCoil-MUC5ACL-LT, pCoil-MUC5ACL+D, MUC5ACL-S-Cold, GST-MUC5ACL-S, MBP-MUC5ACL-S, ELP 1:1 Y:S 10k, ELP 3:1 Y:S 10k, and MUC5ACLSS. In some embodiments, the polypeptide is selected from the group consisting of MUC5ACL, MUC5ACH, MUC5BL, MUC5BH, MUC5ACL-S, MUC5ACL-LT, and MUC5ACL+D. In some embodiments, the polypeptide is selected from the group consisting of MUC5ACLS-15, pCoil-MUC5ACL-S, pCoil-MUC5ACL-LT, and pCoil-MUC5ACL+D. In some embodiments, the polypeptide is selected from the group consisting of MUC5ACL-S-Cold, GST-MUC5ACL-S, and MBP-MUC5ACL-S. In some embodiments, the polypeptide is selected from the group consisting of ELP 1:1 Y:S 10k, ELP 3:1 Y:S 10k, and MUC5ACLSS. In one embodiment, the polypeptide is MUC5ACL.
In some embodiments, the number of tandem repeat sequences ranges from about 15 to about 70. The length of the repeat sequence can range from about 700 Da to about 2 kDa. In some embodiments, a plurality of the tyrosine residues have been modified to include a substituent selected from the group consisting of alkyl, alkynyl, aryl, amino, carboxyl, heteroaryl, nitro, sulfate, and polyethylene oxide polymer. In one embodiment, the plurality is at least 20% of the tyrosine residues. In another embodiment, the plurality is at least 50% of the tyrosine residues. Also provided are pharmaceutical compositions comprising a polypeptide as described herein and a pharmaceutically acceptable carrier.
Disclosed herein are protein oligomers comprising at least two polypeptide units, wherein the polypeptide units comprise a plurality of tandem repeats of a sequence comprised by a mucin selected from the group consisting of MUC1, MUC2, MUC4, MUC7, MUC5AC and MUC5B; and about 20% to about 50% of the serine and threonine amino acids in the protein oligomer have been replaced by tyrosine residues. All of the polypeptide embodiments disclosed above also describe the polypeptide units in the protein oligomers. Also provided are pharmaceutical compositions comprising a protein oligomer as described herein and a pharmaceutically acceptable carrier.
Disclosed herein are processes for preparing a polypeptide comprising a plurality of tandem repeats of a sequence comprised by a mucin selected from the group consisting of MUC1, MUC2, MUC4, MUC7, MUC5AC and MUC5B, wherein about 20% to about 50% of the serine and threonine amino acids in the tandem repeats have been replaced by tyrosine residues, comprising
a. expressing the polypeptide through use of a plasmid in a host cell; and
b. isolating the polypeptide from the cell.
All of the polypeptide embodiments disclosed above also describe the polypeptide prepared by this process.
Disclosed herein are processes for preparing a polypeptide comprising a plurality of tandem repeats of a sequence comprised by a mucin selected from the group consisting of MUC1, MUC2, MUC4, MUC7, MUC5AC and MUC5B, wherein about 20% to about 50% of the serine and threonine amino acids in the tandem repeats have been replaced by tyrosine residues,
comprising modifying a plurality of the tyrosine residues by
or
wherein X is selected from the group consisting of alkyl, alkynyl, aryl, amino, carboxyl, heteroaryl, nitro, sulfate, and polyethylene oxide polymer; and
R is selected from the group consisting of H or alkyl;
or a pharmaceutically acceptable salt thereof.
In one embodiment, the mucin is MUC5AC or MUC5B. In another embodiment, X is selected from the group consisting of ethynyl, phenyl, carboxyl, triazolyl, nitro, sulfate and polyethylene oxide. In some embodiments, X is triazolyl, and the triazolyl is linked to a galactosyl group via a polyol linker. In other embodiments, X is phenyl, and the phenyl is substituted with an amido group. In some embodiments, R is H. In some embodiments, the plurality is at least 20% of the tyrosine residues, while in other embodiments, the plurality is at least 50% of the tyrosine residues.
Disclosed herein are processes for preparing a protein oligomer comprising at least two polypeptide units, wherein the polypeptide units comprise a plurality of tandem repeats of a sequence comprised by a mucin selected from the group consisting of MUC1, MUC2, MUC4, MUC7, MUC5AC and MUC5B; and about 20% to about 50% of the serine and threonine amino acids in the protein oligomer have been replaced by tyrosine residues, comprising linking at least two polypeptide units together through a disulfide bond between the N-terminus of one protein and the C-terminus of the other protein.
All of the polypeptide embodiments disclosed above also describe the polypeptide units used in this process for preparing a protein oligomer.
Disclosed herein are processes for a protein oligomer comprising at least two polypeptide units, wherein the polypeptide units comprise a plurality of tandem repeats of a sequence comprised by a mucin selected from the group consisting of MUC1, MUC2, MUC4, MUC7, MUC5AC and MUC5B; and about 20% to about 50% of the serine and threonine amino acids in the protein oligomer have been replaced by tyrosine residues,
comprising modifying a plurality of the tyrosine residues by
or
wherein X is selected from the group consisting of alkyl, alkynyl, aryl, amino, carboxyl, heteroaryl, nitro, sulfate, and polyethylene oxide polymer; and
R is selected from the group consisting of H or alkyl;
or a pharmaceutically acceptable salt thereof.
All of the embodiments described above for processes for preparing a polypeptide comprising modifying a plurality of the tyrosine residues by Process A or Process B also describe the above processes for preparing a protein oligomer comprising modifying a plurality of the tyrosine residues by Process A or Process B.
Unless defined otherwise, all technical and scientific terms used herein have the meaning commonly understood by a person skilled in the art of the present disclosure. The following references provide one of skill with a general definition of many of the terms used in this disclosure: Singleton et al., Dictionary of Microbiology and Molecular Biology (2nd ed. 1994); The Cambridge Dictionary of Science and Technology (Walker ed., 1988); The Glossary of Genetics, 5th Ed., R. Rieger et al. (eds.), Springer Verlag (1991); and Hale & Marham, The Harper Collins Dictionary of Biology (1991). As used herein, the following terms have the meanings ascribed to them below, unless specified otherwise.
In this disclosure, “comprises,” “comprising,” “containing” and “having” and the like can have the meaning ascribed to them in U.S. Patent law and can mean “includes,” “including,” and the like; “consisting essentially of” or “consists essentially” likewise has the meaning ascribed in U.S. Patent law and the term is open-ended, allowing for the presence of more than that which is recited so long as basic or novel characteristics of that which is recited is not changed by the presence of more than that which is recited, but excludes prior art embodiments.
Ranges provided herein are understood to be shorthand for all of the values within the range. For example, a range of 1 to 50 is understood to include any number, combination of numbers, or sub-range from the group consisting 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 13, 14, 15, 16, 17, 18, 19, 11, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50.
Unless specifically stated or obvious from context, as used herein, the term “or” is understood to be inclusive. Unless specifically stated or obvious from context, as used herein, the terms “a”, “an”, and “the” are understood to be singular or plural.
Unless specifically stated or obvious from context, as used herein, the term “about” is understood as within a range of normal tolerance in the art, for example within 2 standard deviations of the mean. About can be understood as within 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1%, 0.5%, 0.1%, 0.05%, or 0.01% of the stated value. Unless otherwise clear from context, all numerical values provided herein are modified by the term about.
As used herein, the term “alkyl”, by itself or as part of another substituent means, unless otherwise stated, a branched or unbranched saturated hydrocarbon group. The term “n-alkyl” refers to an unbranched alkyl group. The term “Cx-Cy alkyl” refers to an alkyl group having between x and y carbon atoms, inclusively, in the branched or unbranched hydrocarbon group. By way of illustration, but without limitation, the term “C1-C8 alkyl” refers to a straight chain or branched hydrocarbon moiety having from 1, 2, 3, 4, 5, 6, 7, or 8 carbon atoms. “C1-C6” refers to a straight chain or branched hydrocarbon moiety having from 1, 2, 3, 4, 5, or 6 carbon atoms. “C1-C4 alkyl” refers to a straight chain or branched hydrocarbon moiety having from 1, 2, 3, or 4 carbon atoms, including methyl, ethyl, n-propyl, iso-propyl, n-butyl, iso-butyl, sec-butyl, and tert-butyl. The term “C1-C4 n-alkyl” refers to straight chain hydrocarbon moieties that have 1, 2, 3, or 4 carbon atoms including methyl, ethyl, n-propyl, and n-butyl.
As used herein, the term “alkenyl” by itself or as part of another substituent means, unless otherwise stated, a straight or branched hydrocarbon chain radical group consisting solely of carbon and hydrogen atoms, containing at least one double bond, and having from two to ten carbon atoms (i.e., C2-10 alkenyl). Whenever it appears herein, a numerical range such as “2 to 10” refers to each integer in the given range; e.g., “2 to 10 carbon atoms” means that the alkenyl group can consist of 2 carbon atoms, 3 carbon atoms, etc., up to and including 10 carbon atoms. In certain embodiments, an alkenyl comprises two to eight carbon atoms. In other embodiments, an alkenyl comprises two to six carbon atoms (e.g., C2-6 alkenyl). The alkenyl is attached to the parent molecular structure by a single bond, for example, ethenyl (i.e., vinyl), prop-1-enyl (i.e., allyl), but-1-enyl, pent-1-enyl, penta-1,4-dienyl, and the like. The one or more carbon-carbon double bonds can be internal (such as in 2-butenyl) or terminal (such as in 1-butenyl). Examples of C2-4 alkenyl groups include ethenyl (C2), 1-propenyl (C3), 2-propenyl (C3), 1-butenyl (C4), 2-butenyl (C4), 2-methylprop-2-enyl (C4), butadienyl (C4) and the like. Examples of C2-6 alkenyl groups include the aforementioned C2-4 alkenyl groups as well as pentenyl (C5), pentadienyl (C5), hexenyl (C6), 2,3-dimethyl-2-butenyl (C6) and the like, and the higher homologs and isomers. A non-limiting functional group representing an alkene is exemplified by —CH2—CH═CH2.
As used herein, the term “alkynyl” employed alone or in combination with other terms means, unless otherwise stated, a stable straight chain or branched chain hydrocarbon group with a triple carbon-carbon bond, having the stated number of carbon atoms (i.e., C2-C10 means two to ten carbon atoms, C2-C6 means two to six carbon atoms). Non-limiting examples include ethynyl and propynyl, and the higher homologs and isomers. The term “propargylic” refers to a group exemplified by —CH2—C≡CH. The term “homopropargylic” refers to a group exemplified by —CH2CH2—C≡CH. The term “substituted propargylic” refers to a group exemplified by —CR2—C≡CR, wherein each occurrence of R is independently H, alkyl, substituted alkyl, alkenyl or substituted alkenyl, with the proviso that at least one R group is not hydrogen. The term “substituted homopropargylic” refers to a group exemplified by —CR2CR2—C≡CR, wherein each occurrence of R is independently H, alkyl, substituted alkyl, alkenyl or substituted alkenyl, with the proviso that at least one R group is not hydrogen.
As used herein, the term “amino acid” refers to a molecule H2N—CHR—COOH, where R is known as the side-chain and can be selected from hydrogen, unsubstituted alkyl or alkyl substituted with alkenyl, alkynyl, alkoxy, alkylaryl, cycloalkyl, aralkyl, aryl, aryloxy, amino, amido, amidino, imino, azide, carbonate, carbamate, carbonyl, heteroalkyl, heteroaryl, heteroarylalkyl, heterocycloalkyl, hydroxy, cyano, halo, haloalkoxy, haloalkyl, ester, ether, mercapto, thio, alkylthio, arylthio, thiocarbonyl, nitro, oxo, phosphate, phosphonate, phosphinate, silyl, sulfinyl, sulfonyl, sulfonamidyl, sulfoxyl, sulfonate, and urea. In some embodiments, the amino acid is a naturally occurring proteogenic amino acid in the L- or S-configuration (with the exception of cysteine which is R). These amino acids are as follows with the full name, abbreviation and letter code:
alanine—ala—A, arginine—arg—R, asparagine—asn—N, aspartic acid—asp—D, cysteine—cys—C, glutamine—gln—Q, glutamic acid—glu—E, glycine—gly—G, histidine—his—H, isoleucine—ile—I, leucine—leu—L, lysine—lys—K, methionine—met—M, phenylalanine—phe—F, proline—pro—P, serine—ser—S, threonine—thr—T, tryptophan—trp—W, tyrosine—tyr—Y, and valine—val—V.
The term “amino acid” also refers to unnatural or man-made amino acids. In some cases, the unnatural amino acid is the D- or R-configuration of a naturally occurring amino acid (with the exception of cysteine which is S). In other embodiments, the unnatural amino acid is a non-coded or non-proteogenic amino acid, such as selenocysteine, pyrrolysine, and N-formylmethionine. In other embodiments, an amino acid contains a synthetically derived side-chain optionally substituted as described above. An amino acid can be in neutral form or as a zwitterion: H3N+—CHR—COO—. The ratio of neutral to zwitterionic forms can be altered by changes in pH of the medium in which they are dissolved.
As used herein, the term “aryl” or “arene” employed alone or in combination with other terms means, unless otherwise stated, a radical with 6 to 14 ring atoms (e.g., C6-14 aromatic or C6-14 aryl) which has at least one ring having a conjugated pi electron system which is carbocyclic (e.g., phenyl, fluorenyl, and naphthyl). In some embodiments, the aryl is a C6-10 aryl group. Whenever it appears herein, a numerical range such as “6 to 14 aryl” refers to each integer in the given range; e.g., “6 to 14 ring atoms” means that the aryl group can consist of 6 ring atoms, 7 ring atoms, etc., up to and including 14 ring atoms. The term includes monocyclic or fused-ring polycyclic (i.e., rings which share adjacent pairs of ring atoms) groups. In a multi-ring group, only one ring is required to be aromatic, so groups such as indanyl are encompassed by the aryl definition. Non-limiting examples of aryl groups include phenyl, phenalenyl, naphthalenyl, tetrahydronaphthyl, phenanthrenyl, anthracenyl, fluorenyl, indolyl, indanyl, and the like.
“Carboxyl” refers to a —(C═O)OH radical.
“Galactose” is a monosaccharide of the formula below:
In its cyclic form, galactose can exist as a 6-membered pyranose as shown above, or as a 5-membered furanose, each having an anomer in the α or β configuration.
As used herein, the term “heterocycle”, by itself or as part of another substituent means, unless otherwise stated, an unsubstituted or substituted, stable, mono- or multi-cyclic heterocyclic ring system that consists of carbon atoms and at least one heteroatom. A heterocycle refers to any 3- to 18-membered non-aromatic radical monocyclic or polycyclic moiety comprising at least one heteroatom selected from nitrogen, oxygen, phosphorous and sulfur. In some aspects, the heteroatom(s) are chosen from N, O, and S. A heterocyclyl group can be a monocyclic, bicyclic, tricyclic or tetracyclic ring system, wherein the polycyclic ring systems can be a fused, bridged or spiro ring system. Heterocyclyl polycyclic ring systems can include one or more heteroatoms in one or both rings. A heterocyclyl group can be saturated or partially unsaturated. Partially unsaturated heterocycloalkyl groups can be termed “heterocycloalkenyl” if the heterocyclyl contains at least one double bond, or “heterocycloalkynyl” if the heterocyclyl contains at least one triple bond. Whenever it appears herein, a numerical range such as “5 to 18” refers to each integer in the given range; e.g., “5 to 18 ring atoms” means that the heterocyclyl group can consist of 5 ring atoms, 6 ring atoms, etc., up to and including 18 ring atoms.
An N-containing heterocyclyl moiety refers to an non-aromatic group in which at least one of the ring atoms is a nitrogen atom. The heteroatom(s) in the heterocyclyl radical can be optionally oxidized. One or more nitrogen atoms, if present, can be optionally quaternized. Heterocyclyl also includes ring systems substituted with one or more nitrogen oxide (—O—) substituents, such as piperidinyl N-oxides. The heterocyclyl is attached to the parent molecular structure through any atom of any of the ring(s).
“Heterocyclyl” also includes ring systems wherein the heterocyclyl ring, as defined above, is fused with one or more carbocyclyl groups wherein the point of attachment is either on the carbocyclyl or heterocyclyl ring, or ring systems wherein the heterocyclyl ring, as defined above, is fused with one or more aryl or heteroaryl groups, wherein the point of attachment to the parent molecular structure is on the heterocyclyl ring. In some embodiments, a heterocyclyl group is a 5-14 membered non-aromatic ring system having ring carbon atoms and 1-4 ring heteroatoms, wherein each heteroatom is independently selected from nitrogen, oxygen, phosphorous and sulfur (“5-14 membered heterocyclyl”). In some embodiments, a heterocyclyl group is a 3-10 membered non-aromatic ring system having ring carbon atoms and 1-4 ring heteroatoms, wherein each heteroatom is independently selected from nitrogen, oxygen, phosphorous and sulfur (“3-10 membered heterocyclyl”). In some embodiments, a heterocyclyl group is a 5-8 membered non-aromatic ring system having ring carbon atoms and 1-4 ring heteroatoms, wherein each heteroatom is independently selected from nitrogen, oxygen, phosphorous and sulfur (“5-8 membered heterocyclyl”). In some embodiments, a heterocyclyl group is a 5-6 membered non-aromatic ring system having ring carbon atoms and 1-4 ring heteroatoms, wherein each heteroatom is independently selected from nitrogen, oxygen, phosphorous and sulfur (“5-6 membered heterocyclyl”). In some embodiments, the 5-6 membered heterocyclyl has 1-3 ring heteroatoms selected from nitrogen, oxygen phosphorous and sulfur. In some embodiments, the 5-6 membered heterocyclyl has 1-2 ring heteroatoms selected from nitrogen, oxygen, phosphorous and sulfur. In some embodiments, the 5-6 membered heterocyclyl has 1 ring heteroatom selected from nitrogen, oxygen, phosphorous and sulfur.
Exemplary 3-membered heterocyclyls containing 1 heteroatom include, without limitation, azirdinyl, oxiranyl, and thiorenyl. Exemplary 4-membered heterocyclyls containing 1 heteroatom include, without limitation, azetidinyl, oxetanyl and thietanyl. Exemplary 5-membered heterocyclyls containing 1 heteroatom include, without limitation, tetrahydrofuranyl, dihydrofuranyl, tetrahydrothiophenyl, dihydrothiophenyl, pyrrolidinyl, dihydropyrrolyl and pyrrolyl-2,5-dione. Exemplary 5-membered heterocyclyls containing 2 heteroatoms include, without limitation, dioxolanyl, oxathiolanyl, thiazolidinyl, and dithiolanyl. Exemplary 5-membered heterocyclyls containing 3 heteroatoms include, without limitation, triazolinyl, diazolonyl, oxadiazolinyl, and thiadiazolinyl. Exemplary 6-membered heterocyclyl groups containing 1 heteroatom include, without limitation, piperidinyl, tetrahydropyranyl, dihydropyridinyl, and thianyl. Exemplary 6 membered heterocyclyl groups containing 2 heteroatoms include, without limitation, piperazinyl, morpholinyl, thiomorpholinyl, dithianyl, dioxanyl, and triazinanyl. Exemplary 7-membered heterocyclyl groups containing 1 heteroatom include, without limitation, azepanyl, oxepanyl and thiepanyl. Exemplary 8-membered heterocyclyl groups containing 1 heteroatom include, without limitation, azocanyl, oxecanyl and thiocanyl. Exemplary bicyclic heterocyclyl groups include, without limitation, indolinyl, isoindolinyl, dihydrobenzofuranyl, dihydrobenzothienyl, tetrahydrobenzothienyl, tetrahydrobenzofuranyl, benzoxanyl, benzopyrrolidinyl, benzopiperidinyl, benzoxolanyl, benzothiolanyl, benzothianyl, tetrahydroindolyl, tetrahydroquinolinyl, tetrahydroisoquinolinyl, decahydroquinolinyl, decahydroisoquinolinyl, 3-1H-benzimidazol-2-one, (1-sub stituted)-2-oxo-benzimidazol-3-yl, octahydrochromenyl, octahydroisochromenyl, decahydronaphthyridinyl, decahydro-1,8-naphthyridinyl, octahydropyrrolo[3,2-b]pyrrole, phenanthridinyl, indolinyl, phthalimidyl, naphthalimidyl, chromanyl, chromenyl, 1H-benzo[e][1,4]diazepinyl, 1,4,5,7-tetrahydropyrano[3,4-b]pyrrolyl, 5,6-dihydro-4H-furo[3,2-b]pyrrolyl, 6,7-dihydro-5H-furo[3,2-b]pyranyl, 5,7-dihydro-4H-thieno[2,3-c]pyranyl, 2,3-dihydro-1H-pyrrolo[2,3-b]pyridinyl, hydrofuro[2,3-b]pyridinyl, 4,5,6,7 tetrahydro-1H-pyrrolo[2,3-b]pyridinyl, 4,5,6,7-tetrahydrofuro[3,2-c]pyridinyl, 4,5,6,7-tetrahydrothieno[3,2-b]pyridinyl, 1,2,3,4-tetrahydro-1,6-naphthyridinyl, and the like.
Examples of polycyclic heterocycles include indolyl (such as, but not limited to, 3-, 4-, 5-, 6- and 7-indolyl), indolinyl, quinolyl, tetrahydroquinolyl, isoquinolyl (such as, but not limited to, 1- and 5-isoquinolyl), 1,2,3,4-tetrahydroisoquinolyl, cinnolinyl, quinoxalinyl (such as, but not limited to, 2- and 5-quinoxalinyl), quinazolinyl, phthalazinyl, 1,8-naphthyridinyl, 1,4-benzodioxanyl, coumarin, dihydrocoumarin, 1,5-naphthyridinyl, benzofuryl (such as, but not limited to, 3-, 4-, 5-, 6- and 7-benzofuryl), 2,3-dihydrobenzofuryl, 1,2-benzisoxazolyl, benzothienyl (such as, but not limited to, 3-, 4-, 5-, 6-, and 7-benzothienyl), benzoxazolyl, benzothiazolyl (such as, but not limited to, 2-benzothiazolyl and 5-benzothiazolyl), purinyl, benzimidazolyl, benztriazolyl, thioxanthinyl, carbazolyl, carbolinyl, acridinyl, pyrrolizidinyl, and quinolizidinyl.
As used herein, the term “heteroaryl” or “heteroaromatic”, by itself or as part of another substituent means, unless otherwise stated, a 5-18 membered monocyclic or polycyclic (e.g., bicyclic or tricyclic) aromatic ring system (e.g., having 6, 10 or 14π electrons shared in a cyclic array) having ring carbon atoms and 1-6 ring heteroatoms provided in the aromatic ring system, wherein each heteroatom is independently selected from nitrogen, oxygen, phosphorous and sulfur (“5-18 membered heteroaryl”). Heteroaryl polycyclic ring systems can include one or more heteroatoms in one or both rings. Whenever it appears herein, a numerical range such as “5 to 18” refers to each integer in the given range; e.g., “5 to 18 ring atoms” means that the heteroaryl group can consist of 5 ring atoms, 6 ring atoms, etc., up to and including 18 ring atoms. In some instances, a heteroaryl can have 5 to 14 ring atoms.
For example, an N-containing “heteroaryl” or “heteroaromatic” moiety refers to an aromatic group in which at least one of the skeletal atoms of the ring is a nitrogen atom. One or more heteroatom(s) in the heteroaryl radical can be optionally oxidized. One or more nitrogen atoms, if present, can also be optionally quaternized. Heteroaryl also includes ring systems substituted with one or more nitrogen oxide (—O—) substituents, such as pyridinyl N-oxides. The heteroaryl is attached to the parent molecular structure through any atom of the ring(s).
“Heteroaryl” also includes ring systems wherein the heteroaryl ring, as defined above, is fused with one or more aryl groups wherein the point of attachment to the parent molecular structure is either on the aryl or on the heteroaryl ring, or wherein the heteroaryl ring, as defined above, is fused with one or more cycloalkyl or heterocyclyl groups wherein the point of attachment to the parent molecular structure is on the heteroaryl ring. For polycyclic heteroaryl groups wherein one ring does not contain a heteroatom (e.g., indolyl, quinolinyl, carbazolyl and the like), the point of attachment to the parent molecular structure can be on either ring, i.e., either the ring bearing a heteroatom (e.g., 2-indolyl) or the ring that does not contain a heteroatom (e.g., 5-indolyl). In some embodiments, a heteroaryl group is a 5-10 membered aromatic ring system having ring carbon atoms and 1-4 ring heteroatoms provided in the aromatic ring system, wherein each heteroatom is independently selected from nitrogen, oxygen, phosphorous, and sulfur (“5-10 membered heteroaryl”). In some embodiments, a heteroaryl group is a 5-8 membered aromatic ring system having ring carbon atoms and 1-4 ring heteroatoms provided in the aromatic ring system, wherein each heteroatom is independently selected from nitrogen, oxygen, phosphorous, and sulfur (“5-8 membered heteroaryl”). In some embodiments, a heteroaryl group is a 5-6 membered aromatic ring system having ring carbon atoms and 1-4 ring heteroatoms provided in the aromatic ring system, wherein each heteroatom is independently selected from nitrogen, oxygen, phosphorous, and sulfur (“5-6 membered heteroaryl”). In some embodiments, the 5-6 membered heteroaryl has 1-3 ring heteroatoms selected from nitrogen, oxygen, phosphorous, and sulfur. In some embodiments, the 5-6 membered heteroaryl has 1-2 ring heteroatoms selected from nitrogen, oxygen, phosphorous, and sulfur. In some embodiments, the 5-6 membered heteroaryl has 1 ring heteroatom selected from nitrogen, oxygen, phosphorous, and sulfur.
Examples of heteroaryls include, but are not limited to, azepinyl, acridinyl, benzimidazolyl, benzindolyl, 1,3-benzodioxolyl, benzofuranyl, benzooxazolyl, benzo[d]thiazolyl, benzothiadiazolyl, benzo[b][1,4]dioxepinyl, benzo[b][1,4]oxazinyl, 1,4-benzodioxanyl, benzonaphthofuranyl, benzoxazolyl, benzodioxolyl, benzodioxinyl, benzoxazolyl, benzopyranyl, benzopyranonyl, benzofuranyl, benzopyranonyl, benzofurazanyl, benzothiazolyl, benzothienyl (benzothiophenyl), benzothieno[3,2-d]pyrimidinyl, benzotriazolyl, benzo[4,6]imidazo[1,2-a]pyridinyl, carbazolyl, cinnolinyl, cyclopenta[d]pyrimidinyl, 6,7-dihydro-5H-cyclopenta[4,5]thieno[2,3-d]pyrimidinyl, 5,6-dihydrobenzo[h]quinazolinyl, 5,6-dihydrobenzo[h]cinnolinyl, 6,7-dihydro-5H benzo[6,7]cyclohepta[1,2-c]pyridazinyl, dibenzofuranyl, dibenzothiophenyl, furanyl, furazanyl, furanonyl, furo[3,2-c]pyridinyl, 5,6,7,8,9,10-hexahydrocycloocta[d]pyrimidinyl, 5,6,7,8,9,10-hexahydrocycloocta[d]pyridazinyl, 5,6,7,8,9,10 hexahydrocycloocta[d]pyridinyl, isothiazolyl, imidazolyl, indazolyl, indolyl, indazolyl, isoindolyl, indolinyl, isoindolinyl, isoquinolyl, indolizinyl, isoxazolyl, 5,8-methano-5,6,7,8-tetrahydroquinazolinyl, naphthyridinyl, 1,6-naphthyridinonyl, oxadiazolyl, 2-oxoazepinyl, oxazolyl, oxiranyl, 5,6,6a,7,8,9,10,10a-octahydrobenzo[h]quinazolinyl, 1-phenyl-1H-pyrrolyl, phenazinyl, phenothiazinyl, phenoxazinyl, phthalazinyl, pteridinyl, purinyl, pyranyl, pyrrolyl, pyrazolyl, pyrazolo[3,4-d]pyrimidinyl, pyridinyl, pyrido[3,2-d]pyrimidinyl, pyrido[3,4-d]pyrimidinyl, pyrazinyl, pyrimidinyl, pyridazinyl, pyrrolyl, quinazolinyl, quinoxalinyl, quinolinyl, isoquinolinyl, tetrahydroquinolinyl, 5,6,7,8-tetrahydroquinazolinyl, 5,6,7,8-tetrahydrobenzo[4,5]thieno[2,3-d]pyrimdinyl, 6,7,8,9-tetrahydro-5H-cyclohepta[4,5]thieno[2,3-d]pyrimidinyl, 5,6,7,8-tetrahydropyrido[4,5-c]pyridazinyl, thiazolyl, thiadiazolyl, thiapyranyl, triazolyl, tetrazolyl, triazinyl, thieno[2,3-d]pyrimidinyl, thieno[3,2-d]pyrimidinyl, thieno[2,3-c]pridinyl, and thiophenyl (i.e., thienyl).
Further examples of heteroaryl groups include pyridyl, pyrazinyl, pyrimidinyl (such as, but not limited to, 2- and 4-pyrimidinyl), pyridazinyl, thienyl, furyl, pyrrolyl, imidazolyl, thiazolyl, oxazolyl, pyrazolyl, isothiazolyl, 1,2,3-triazolyl, 1,2,4-triazolyl, 1,3,4-triazolyl, tetrazolyl, 1,2,3-thiadiazolyl, 1,2,3-oxadiazolyl, 1,3,4-thiadiazolyl and 1,3,4-oxadiazolyl. The aforementioned listings of heterocyclyl and heteroaryl moieties are intended to be representative and not limiting.
“Nitro” refers to the —NO2 radical.
As used herein, the term “PEO2000” refers to polyethylene oxide polymer with an average weight of 2,000 kDa. PEO2000 is also known as PEG2000 (polyethylene glycol 2000).
As used herein, the term “peptide” refers to two or more amino acids that are linked together in an amide bond between the carbonyl of 1 amino acid and the amine of another amino acid: H2C—(CHR)—[CONH—(CHR′)]n—COOH where R and R′ can be individually selected from any side-chain as described above and n can range from 1 to 50, such as 3-40, such as 5-30, such as 3-20, such as 5-15, such as 3-10, and further such as 3-6. In some embodiments, a peptide can vary in length between 2 and 100 amino acid monomers or 2 to 200 amino acid monomers. A peptide can also be described by its average molecular weight, such as ranging from about 500 Da to about 10 kDa, such as about 700 Da to about 7 kDa, such as about 700 Da to about 2 kDa, such as about 1 kDa to about 10 kDa, such as about 2 kDa to about 7 kDa, further such as about 5 kDa to about 10 kDa.
In some cases, an amino acid side-chain can be referred to as a “residue”. In some embodiments, the peptide is unbranched, while in others, an R or R′ group contains a side-chain that is a peptide itself giving rise to a branched peptide. A peptide can be in neutral form, zwitterionic form, or positively or negatively charged form. Peptides are referred to by the number of amino acids they contain, such as dipeptide, tripeptide, tetrapeptide, etc. These smaller length peptides are known as oligopeptides. In some instances, peptides having more than about 100 amino acids are termed polypeptides. They can be endogenously created within an organism or synthetically made ex vivo using amide bond forming reactions. In some embodiments, an amino acid side-chain can be chemically modified after incorporation into a peptide. Peptides have a wide variety of applications in biological and chemical fields that are described, for example, in Kastin, A. Ed. Handbook of Bioloigcally Active Peptides, 2nd Ed. Academic Press 2013; and Jakubke, H.-D. et al. Peptides from A to Z: A Consise Encyclopedia, 1st Ed. Wiley 2008.
As used herein, the term “protein” refers to a peptide H2C—(CHR)—[CONH—(CHR′)]n—COOH where n is greater than 50, such as greater than 1000, such as greater than 200, such as greater than 500 and higher. Some proteins can contain over 1000 amino acids. A protein can contain any of the amino acids described above or have sections that correspond to peptides as described above. A peptide can also be described by its average molecular weight, such as ranging from about 10 kDa to about 100 kDa, such as about 20 kDa to about 80 kDa, such as about 10 kDa to about 50 kDa, such as 30 kDa to about 50 kDa.
Some proteins contain sections of repeating sequences, usually at least 5-10 residues, while others do not repeat such sections. Proteins can be synthesized as described for peptides or through linking two or more peptides together. Proteins have a wide variety of functions in vivo and ex vivo that are described, for example, in Whitford, Proteins: Structure and Function 1st Ed. Wiley 2005 and Buxbaum, Fundamentals of Protein Structure and Function, 2nd Ed. Springer 2015.
The terms “substituted” or “substitution” mean that at least one hydrogen present on a group atom (e.g., a carbon or nitrogen atom) is replaced with a permissible substituent, e.g., a substituent which upon substitution for the hydrogen results in a stable compound, e.g., a compound which does not spontaneously undergo transformation such as by rearrangement, cyclization, elimination, or other reaction. Unless otherwise indicated, a “substituted” group can have a substituent at one or more substitutable positions of the group, and when more than one position in any given structure is substituted, the substituent is either the same or different at each position. Exemplary substituents include, but are not limited to, acyl, alkyl, alkenyl, alkynyl, alkoxy, alkylaryl, cycloalkyl, aralkyl, aryl, aryloxy, amino, amido, amidino, imino, azide, carbonate, carbamate, carbonyl, heteroalkyl, heteroaryl, heteroarylalkyl, heterocycloalkyl, hydroxy, cyano, halo, haloalkoxy, haloalkyl, ester, ether, mercapto, thio, alkylthio, arylthio, thiocarbonyl, nitro, oxo, phosphate, phosphonate, phosphinate, silyl, sulfinyl, sulfonyl, sulfonamidyl, sulfoxyl, sulfonate, and urea.
“Sulfanyl”, “sulfide”, and “thio” each refer to the radical —S—Rb, wherein Rb is selected from alkyl, alkenyl, alkynyl, haloalkyl, heteroalkyl (bonded through a chain carbon), cycloalkyl, cycloalkylalkyl, aryl, aralkyl, heterocycloalkyl (bonded through a ring carbon), heterocycloalkylalkyl, heteroaryl (bonded through a ring carbon) or heteroarylalkyl, unless stated otherwise in the specification, each of which moiety can itself be optionally substituted as described herein. For instance, an “alkylthio” refers to the “alkyl-S—” radical, and “arylthio” refers to the “aryl-S—” radical, each of which are bound to the parent molecular group through the S atom. The terms “sulfide”, “thiol”, “mercapto”, and “mercaptan” can also each refer to the group —RbSH. The term “disulfide” refers to an —S—S— single bond between two sulfur atoms.
As used herein, the term “sulfonate” refers to a —S(═O)2—ORb radical, wherein Rb is selected from alkyl, alkenyl, alkynyl, haloalkyl, heteroalkyl (bonded through a chain carbon), cycloalkyl, cycloalkylalkyl, aryl, aralkyl, heterocycloalkyl (bonded through a ring carbon), heterocycloalkylalkyl, heteroaryl (bonded through a ring carbon) or heteroarylalkyl, unless stated otherwise in the specification, each of which moiety can itself be optionally substituted as described herein.
The recitation of a listing of chemical groups in any definition of a variable herein includes definitions of that variable as any single group or combination of listed groups. The recitation of an embodiment for a variable or aspect herein includes that embodiment as any single embodiment or in combination with any other embodiments or portions thereof.
Any compositions or methods provided herein can be combined with one or more of any of the other compositions and methods provided herein.
Provided herein are high molecular weight, post-translationally modified protein brushes that mimic the structure and function of mucin. Mucins are a class of glycoproteins that are categorized by their amino acid backbone composition, glycosylation pattern, and typical location within the body. Mucins compose about 0.5 to 1% of the non-water component of bodily tissues as shown in
To reach the lengths of mucin, the proteins have terminal cysteine residues that are reacted to form disulfide bonds, shown in
The disclosed protein brushes are based on artificially engineered respiratory mucin mimetics, where about 25% to about 50%, such as about 33% to about 50%, of their serine and thereonie residues have been replaced by tyrosine residues. Mucin is about 25% glycosylated so adding this level of tyrosine substitution allows for close resemblance to the natural level of functionalization. The tyrosine 4-hydroxybenzyl side chain enables mass modification of the protein backbone to create the brush-like feature around the core protein backbone. Thus, the natural process of mucin glycosylation is replaced by post-translational use of several synthetic reactions to form the disclosed protein brushes. One transformation occurs using diazonium coupling of substituents to the tyrosine residues, as shown in
Disclosed herein are polypeptides comprising a plurality of tandem repeats of a sequence comprised by a mucin selected from the group consisting of MUC1, MUC2, MUC4, MUC7, MUC5AC and MUC5B, wherein about 20% to about 50% of the serine and threonine amino acids in the tandem repeats have been replaced by tyrosine residues. In one embodiment, the mucin is MUC5AC or MUC5B. In another embodiment, about 33% to about 50% of the serine and threonine amino acids have been replaced by tyrosine residues. In some embodiments, the N-terminal and C-terminal amino acids are cysteine.
The polypeptide can be selected from the group consisting of MUC5ACL, MUC5ACH, MUC5BL, MUC5BH, MUC5ACL-S, MUC5ACL-LT, MUC5ACL+D, MUC5ACLS-15, pCoil-MUC5ACL-S, pCoil-MUC5ACL-LT, pCoil-MUC5ACL+D, MUC5ACL-S-Cold, GST-MUC5ACL-S, MBP-MUC5ACL-S, ELP 1:1 Y:S 10k, ELP 3:1 Y:S 10k, and MUC5ACLSS. In some embodiments, the polypeptide is selected from the group consisting of MUC5ACL, MUC5ACH, MUC5BL, MUC5BH, MUC5ACL-S, MUC5ACL-LT, and MUC5ACL+D. In some embodiments, the polypeptide is selected from the group consisting of MUC5ACLS-15, pCoil-MUC5ACL-S, pCoil-MUC5ACL-LT, and pCoil-MUC5ACL+D. In some embodiments, the polypeptide is selected from the group consisting of MUC5ACL-S-Cold, GST-MUC5ACL-S, and MBP-MUC5ACL-S. In some embodiments, the polypeptide is selected from the group consisting of ELP 1:1 Y:S 10k, ELP 3:1 Y:S 10k, and MUC5ACLSS. In one embodiment, the polypeptide is MUC5ACL.
In some embodiments, the number of tandem repeat sequences ranges from about 15 to about 70. The length of the repeat sequence can range from about 700 Da to about 2 kDa. In some embodiments, a plurality of the tyrosine residues have been modified to include a substituent selected from the group consisting of alkyl, alkynyl, aryl, amino, carboxyl, heteroaryl, nitro, sulfate, and polyethylene oxide polymer. In one embodiment, the plurality is at least 20% of the tyrosine residues. In another embodiment, the plurality is at least 50% of the tyrosine residues. Also provided are pharmaceutical compositions comprising a polypeptide as described herein and a pharmaceutically acceptable carrier.
Disclosed herein are protein oligomers comprising at least two polypeptide units, wherein the polypeptide units comprise a plurality of tandem repeats of a sequence comprised by a mucin selected from the group consisting of MUC1, MUC2, MUC4, MUC7, MUC5AC and MUC5B; and about 20% to about 50% of the serine and threonine amino acids in the protein oligomer have been replaced by tyrosine residues. All of the polypeptide embodiments disclosed above also describe the polypeptide units in the protein oligomers. Also provided are pharmaceutical compositions comprising a protein oligomer as described herein and a pharmaceutically acceptable carrier.
In preparing the disclosed protein brushes, the protein backbone mimics the VNTR of MUC5AC and MUC5B, well-studied respiratory mucins. The most frequent sequence of the VNTR was extracted through the consensus sequence approach. From the consensus sequence, e.g., about 33% to about 50% of the hydroxylated amino acids (serine and threonine) were replaced with tyrosine in preparation for diazonium-coupling. Replacing those percentages of the amino acids in the consensus sequence mimics natural mucins where about 25% of the amino acids are glycosylated. The consensus sequence was repeated in a modular fashion and flanked with cysteine residues on both termini of the protein. The protein sequences are presented in
Disclosed herein are processes for preparing a polypeptide comprising a plurality of tandem repeats of a sequence comprised by a mucin selected from the group consisting of MUC1, MUC2, MUC4, MUC7, MUC5AC and MUC5B, wherein about 20% to about 50% of the serine and threonine amino acids in the tandem repeats have been replaced by tyrosine residues, comprising
a. expressing the polypeptide through use of a plasmid in a host cell; and
b. isolating the polypeptide from the cell.
All of the polypeptide embodiments disclosed above also describe the polypeptide prepared by this process.
Disclosed herein are processes for preparing a protein oligomer comprising at least two polypeptide units, wherein the polypeptide units comprise a plurality of tandem repeats of a sequence comprised by a mucin selected from the group consisting of MUC1, MUC2, MUC4, MUC7, MUC5AC and MUC5B; and about 20% to about 50% of the serine and threonine amino acids in the protein oligomer have been replaced by tyrosine residues,
comprising linking at least two polypeptide units together through a disulfide bond between the N-terminus of one protein and the C-terminus of the other protein.
All of the polypeptide embodiments disclosed above also describe the polypeptide units used in this process for preparing a protein oligomer.
Diazonium coupling of an anionic azido-substituted compound to an aryl ring at the position ortho to a hydroxyl group affords a diazine linker as shown in
Disclosed herein are processes for preparing a polypeptide comprising a plurality of tandem repeats of a sequence comprised by a mucin selected from the group consisting of MUC1, MUC2, MUC4, MUC7, MUC5AC and MUC5B, wherein about 20% to about 50% of the serine and threonine amino acids in the tandem repeats have been replaced by tyrosine residues, comprising modifying a plurality of the tyrosine residues by
or
wherein X is selected from the group consisting of alkyl, alkynyl, aryl, amino, carboxyl, heteroaryl, nitro, sulfate, and polyethylene oxide polymer; and
R is selected from the group consisting of H or alkyl;
or a pharmaceutically acceptable salt thereof.
In one embodiment, the mucin is MUC5AC or MUC5B. In another embodiment, X is selected from the group consisting of ethynyl, phenyl, carboxyl, triazolyl, nitro, sulfate and polyethylene oxide. In some embodiments, X is triazolyl, and the triazolyl is linked to a galactosyl group via a polyol linker. In other embodiments, X is phenyl, and the phenyl is substituted with an amido group. In some embodiments, R is H. In some embodiments, the plurality is at least 20% of the tyrosine residues, while in other embodiments, the plurality is at least 50% of the tyrosine residues.
Disclosed herein are processes for a protein oligomer comprising at least two polypeptide units, wherein the polypeptide units comprise a plurality of tandem repeats of a sequence comprised by a mucin selected from the group consisting of MUC1, MUC2, MUC4, MUC7, MUC5AC and MUC5B; and about 20% to about 50% of the serine and threonine amino acids in the protein oligomer have been replaced by tyrosine residues,
comprising modifying a plurality of the tyrosine residues by
or
wherein X is selected from the group consisting of alkyl, alkynyl, aryl, amino, carboxyl, heteroaryl, nitro, sulfate, and polyethylene oxide polymer; and
R is selected from the group consisting of H or alkyl;
or a pharmaceutically acceptable salt thereof.
All of the embodiments described above for processes for preparing a polypeptide comprising modifying a plurality of the tyrosine residues by Process A or Process B also describe the above processes for preparing a protein oligomer comprising modifying a plurality of the tyrosine residues by Process A or Process B.
In some embodiments, disclosed proteins can be in the form of a pharmaceutically acceptable composition. Disclosed herein are pharmaceutical compositions comprising a polypeptide as described herein and a pharmaceutically acceptable carrier. Also disclosed herein are pharmaceutical compositions comprising a protein oligomer as described herein and a pharmaceutically acceptable carrier.
Pharmaceutically acceptable carriers and excipients include inert solid diluents and fillers, diluents, including sterile aqueous solution and various organic solvents, permeation enhancers, solubilizers and adjuvants. Other components of a pharmaceutical composition as described herein include dispersion or suspension aids, surface active agents, isotonic agents, thickening or emulsifying agents, preservatives, solid binders, lubricants and the like.
Examples of suitable aqueous and nonaqueous carriers which can be employed in pharmaceutical compositions include water, ethanol, polyols (such as glycerol, propylene glycol, polyethylene glycol, and the like), and suitable mixtures thereof, vegetable oils, such as olive oil, and injectable organic esters, such as ethyl oleate. Proper fluidity can be maintained, for example, by the use of coating materials, such as lecithin, by the maintenance of the required particle size in the case of dispersions, and by the use of surfactants.
These compositions can also contain adjuvants such as preservatives, wetting agents, emulsifying agents, dispersing agents, lubricants, and/or antioxidants. Prevention of the action of microorganisms upon the compounds described herein can be ensured by the inclusion of various antibacterial and antifungal agents, for example, paraben, chlorobutanol, phenol sorbic acid, and the like. It can also be desirable to include isotonic agents, such as sugars, sodium chloride, and the like into the compositions.
Methods of preparing these formulations or compositions include the step of bringing into association a compound described herein with the carrier and, optionally, one or more accessory ingredients. In general, the formulations are prepared by uniformly and intimately bringing into association a compound as disclosed herein with liquid carriers, or finely divided solid carriers, or both, and then, if necessary, shaping the product.
Preparations for such pharmaceutical compositions are well-known in the art. See, e.g., Anderson, Philip O.; Knoben, James E.; Troutman, William G, eds., Handbook of Clinical Drug Data, Tenth Edition, McGraw-Hill, 2002; Pratt and Taylor, eds., Principles of Drug Action, Third Edition, Churchill Livingston, N.Y., 1990; Katzung, ed., Basic and Clinical Pharmacology, Ninth Edition, McGraw Hill, 2003; Goodman and Gilman, eds., The Pharmacological Basis of Therapeutics, Tenth Edition, McGraw Hill, 2001; Remington's Pharmaceutical Sciences, 20th Ed., Lippincott Williams & Wilkins., 2000; Martindale, The Extra Pharmacopoeia, Thirty-Second Edition (The Pharmaceutical Press, London, 1999); all of which are incorporated by reference herein in their entirety. Except insofar as any conventional excipient medium is incompatible with the compounds provided herein, such as by producing any undesirable biological effect or otherwise interacting in a deleterious manner with any other component(s) of the pharmaceutically acceptable composition, the excipient's use is contemplated to be within the scope of this disclosure.
Provided herein are pharmaceutically acceptable salts which refer to those salts which are, within the scope of sound medical judgment, suitable for use in contact with the tissues of subjects without undue toxicity, irritation, allergic response and the like, and are commensurate with a reasonable benefit/risk ratio. Pharmaceutically acceptable salts are well known in the art. For example, Berge et al. describes pharmaceutically acceptable salts in detail in J. Pharmaceutical Sciences (1977) 66:1-19. Pharmaceutically acceptable salts of the compounds provided herein include those derived from suitable inorganic and organic acids and bases. Examples of pharmaceutically acceptable, nontoxic acid addition salts are salts of an amino group formed with inorganic acids such as hydrochloric acid, hydrobromic acid, phosphoric acid, sulfuric acid and perchloric acid or with organic acids such as acetic acid, oxalic acid, maleic acid, tartaric acid, citric acid, succinic acid or malonic acid or by using other methods used in the art such as ion exchange. Other pharmaceutically acceptable salts include adipate, alginate, ascorbate, aspartate, benzenesulfonate, besylate, benzoate, bisulfate, borate, butyrate, camphorate, camphorsulfonate, citrate, cyclopentanepropionate, digluconate, dodecylsulfate, ethanesulfonate, formate, fumarate, glucoheptonate, glycerophosphate, gluconate, hemisulfate, heptanoate, hexanoate, hydroiodide, 2-hydroxy-ethanesulfonate, lactobionate, lactate, laurate, lauryl sulfate, malate, maleate, malonate, methanesulfonate, 2-naphthalenesulfonate, nicotinate, nitrate, oleate, oxalate, palmitate, pamoate, pectinate, persulfate, 3-phenylpropionate, phosphate, picrate, pivalate, propionate, stearate, succinate, sulfate, tartrate, thiocyanate, p-toluenesulfonate, undecanoate, valerate salts, and the like. In some embodiments, organic acids from which salts can be derived include, for example, acetic acid, propionic acid, glycolic acid, pyruvic acid, oxalic acid, lactic acid, trifluoracetic acid, maleic acid, malonic acid, succinic acid, fumaric acid, tartaric acid, citric acid, benzoic acid, cinnamic acid, mandelic acid, methanesulfonic acid, ethanesulfonic acid, p-toluenesulfonic acid, salicylic acid, and the like.
The salts can be prepared in situ during the isolation and purification of the disclosed compounds, or separately, such as by reacting the free base or free acid of the compound with a suitable base or acid, respectively. Pharmaceutically acceptable salts derived from appropriate bases include alkali metal, alkaline earth metal, ammonium and N+(C1-4alkyl)4 salts. Representative alkali or alkaline earth metal salts include sodium, lithium, potassium, calcium, magnesium, iron, zinc, copper, manganese, aluminum, and the like. Further pharmaceutically acceptable salts include, when appropriate, nontoxic ammonium, quaternary ammonium, and amine cations formed using counterions such as halide, hydroxide, carboxylate, sulfate, phosphate, nitrate, lower alkyl sulfonate and aryl sulfonate. Organic bases from which salts can be derived include, for example, primary, secondary, and tertiary amines, substituted amines, including naturally occurring substituted amines, cyclic amines, basic ion exchange resins, and the like, such as isopropylamine, trimethylamine, diethylamine, triethylamine, tripropylamine, and ethanolamine. In some embodiments, the pharmaceutically acceptable base addition salt is chosen from ammonium, potassium, sodium, calcium, and magnesium salts.
Naturally occurring mucin in the body acts as one of the primary barriers to pathogens reaching cells. Any toxin or pathogen that reaches the lungs, the gut, the reproductive tract or the eye is immersed in mucus, and its performance will be defined by this interaction. Mucins provide a physical barrier around cells that serve as a trap for microbes and a matrix for antimicrobial molecules. Given the difficulty of isolating natural mucins from cellular tissues, inexpensive and effective mucin mimetics as described herein are of interest to the defense industry in developing new protection methods against deleterious biological agents, such as those used in biological warfare. These types of barrier systems would easily form air and vapor-permeable membranes that could be used for protection of personnel and other critical assets during a biological threat. Such selective tents, suits, or air barriers would allow personnel to maintain functional capability during a biological incident for extended periods of time.
In another application, mucin mimetics as described herein are of interest to the pharmaceutical industry, such as in pharmacokinetic and pharmacodynamics testing of drug candidates. Mucin coatings on cells can identify a drug as a potential toxin, rather than a nutrient, and prevent drugs from reaching their intended cellular targets. To assess a drug candidate's ability to permeate mucin, testing on the disclosed mucin mimetics provide important insights into how that drug would interact in vivo. The process of mass functionalization of proteins, rather than step-wise or individual residue approaches, as described herein will find many applications in producing mucin mimetics and other functionalized proteins for creating new protein-based materials.
Unless otherwise indicated, all numbers expressing quantities of ingredients, properties such as molecular weight, reaction conditions, and so forth used in the specification and claims are to be understood as being modified in all instances by the term “about.” Accordingly, unless indicated to the contrary, the numerical parameters set forth in the following specification and attached claims are approximations that may vary depending upon the desired properties sought to be obtained by the present invention. At the very least, and not as an attempt to limit the application of the doctrine of equivalents to the scope of the claims, each numerical parameter should at least be construed in light of the number of reported significant digits and by applying ordinary rounding techniques.
It is to be understood that wherever values and ranges are provided herein, all values and ranges encompassed by these values and ranges, are meant to be encompassed within the scope of the present invention. Moreover, all values that fall within these ranges, as well as the upper or lower limits of a range of values, are also contemplated by the present application.
All U.S. patents and U.S. and PCT published patent applications mentioned in this specification are herein incorporated by reference to the same extent as if each independent patent and publication was specifically and individually indicated to be incorporated by reference.
Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, numerous equivalents to the specific procedures, embodiments, claims, and examples described herein. Such equivalents were considered to be within the scope of this invention and covered by the claims appended hereto. For example, it should be understood, that modifications in reaction conditions, including but not limited to reaction times, reaction size/volume, and experimental reagents, such as solvents, catalysts, pressures, atmospheric conditions, e.g., nitrogen atmosphere, and reducing/oxidizing agents, with art-recognized alternatives and using no more than routine experimentation, are within the scope of the present application.
The following examples are put forth so as to provide those of ordinary skill in the art with a complete disclosure and description of how to make and use the compounds and methods of the invention, and are not intended to limit the scope of what the inventor(s) regard(s) as the invention.
The recitation of a listing of elements in any definition of a variable herein includes definitions of that variable as any single element or combination (or subcombination) of listed elements. The recitation of an embodiment herein includes that embodiment as any single embodiment or in combination with any other embodiments or portions thereof.
Unless noted otherwise, the starting materials for the synthesis described herein were obtained from commercial sources or known synthetic procedures and were used without further purification.
The protein backbones were cloned in the pET-15b vector as shown in
The MUC5ACL protein was prepared as described above in 25 mg protein/L of culture media. This artificially created protein based on MUC5 had about 33% of its serine and threonine residue replaced by tyrosine. The protein repeating sequence was (YTSTYSAP)46 and flanked by cysteine residues giving CAS(YTSTYSAP)46TSC with a total of 374 amino acids. The protein product was characterized by SDS-PAGE and amino acid analysis.
As shown in
The procedure for synthesizing pcoilcoil-mini intein proteins MUC5ACL-S, MUC5ACL-LT and MUC5ACL+D was similar to that of MUC5ACL described in Example 1A, except the pET-21A vector was used and the isolation procedure was as follows. Their sequences are detailed in
The expression results were determined by SDS-PAGE. Low levels of protein expression led to co-expressing these proteins with P-coiled-coil-mini-Intenin that was expressed using a pET-21a vector as shown in
The sequences of the MUC5ACL-S-Cold, GST-MUC5ACL-S, and MBP-MUC5ACL-S proteins are given in
The sequences of the ELP 1:1 Y:S 10k, ELP 3:1 Y:S 10k, and MUC5ACLSS proteins are given in
As shown in
A second method of tyrosine modification involved using Boc-protected tyrosine as a starting material, which gave increased solubility. The reaction conditions were as described above for unprotected tyrosine as shown in
The coupling products were purified by first lyophilizing the reaction mixture. The solids were dissolved in ethyl acetate, filtered, and then passed through a silica gel column (ethyl acetate:methanol). The solvent was removed via reduced pressure. The solids were then dissolved in methanol and purified by HPLC. The products were characterized with 1H NMR and LRMS. The extinction coefficients were determined by UV-Vis spectroscopy of solutions in 150 mM PBS at pH 9 as shown in
The effectiveness of diazonium coupling at functionalizing proteins was tested on MUC5ACL. A 1.2 mg/ml sample of protein was dissolved in buffer solution (150 mM Na2HPO4, pH 9, MilliQ water). 100 μL of nitro-diazonium salt solution was added to the protein solution. The mixture was stirred at 4° C. for 15 minutes. The UV-Vis absorbance of the reaction mixture at 344 nm was used to calculate the degree of modification of the proteins. 69% of all of the tyrosines present in solution were successfully modified with nitro groups as shown in
In another set of experiments, the following additional substituents were installed on the mucin MUC5ACL backbone using the above procedure: sulfate (sulf), ethynyl (alk), triazolyl-galactosyl (Gal), and triazolyl-PEO (Peo2000). The degree of modification was determined by UV-VIS spectroscopy and shown in
Further modification experiments analyzed by UV-VIS spectroscopy shown in
While the invention has been disclosed with reference to specific embodiments, it is apparent that other embodiments and variations of this invention may be devised by others skilled in the art without departing from the true spirit and scope of the invention. The appended claims are intended to be construed to include all such embodiments and equivalent variations.
This application claims the benefit of priority to U.S. Provisional Patent Application Ser. No. 62/161,334, filed May 14, 2015.
This invention was made with Government support under Contract No. HDTRA1-13-1-0038 awarded by the Defense Threat Reduction Agency. The Government has certain rights in the invention.
Number | Date | Country | |
---|---|---|---|
62161334 | May 2015 | US |