The contents of the text file named 48538-524001US_ST25.TXT, which was created on Sep. 25, 2016, and is 3,307 bytes in size, are hereby incorporated by reference in their entireties.
M13 bacteriophage is type of virus that infects only bacteria, and can be genetically modified to present ligands on its surface. Phage have also been incorporated into nanomedicine platforms for targeted drug delivery and imaging. Such applications require low background binding by phage to cell surfaces. Phage typically adheres to cell surfaces with high affinity. Such non-specific adhesion complicates the design of phage-based sensors for the detection of tumor cells.
There are provided herein solutions to these and others problems in the art.
In one aspect, provided herein is a virial composition. The virial composition includes (a) a whole viral particle comprising a charged protein coat that has a plurality of charged coat proteins; (b) a first polymer electrostatically bound to the plurality of charged coat proteins; and (c) a covalent linker linking the first polymer to a recognition moiety.
In another aspect, there is provided a complex that includes any virial composition described herein and a cell, where the recognition moiety of the virial composition is bound to the cell.
In another aspect, there is provided a pharmaceutical composition that includes any virial composition described herein and a pharmaceutically acceptable carrier, diluent or excipient.
In another aspect, there is provided a method for detecting a cancer cell in a subject. The method includes (a) contacting a biological sample of the subject with one or more virial compositions described herein, where the recognition moiety of one or more virial compositions is a cell surface marker binding moiety, and (b) detecting a cell-virial composition complex, and presence of the complex indicates presence of a cancer cell in the subject.
Unless defined otherwise, technical and scientific terms used herein have the same meaning as commonly understood by a person of ordinary skill in the art. See, e.g., Singleton et al., DICTIONARY OF MICROBIOLOGY AND MOLECULAR BIOLOGY 2nd ed., J. Wiley & Sons (New York, N.Y. 1994); Sambrook et al., MOLECULAR CLONING, A LABORATORY MANUAL, Cold Springs Harbor Press (Cold Springs Harbor, NY 1989). Any methods, devices and materials similar or equivalent to those described herein can be used in the practice of this invention. The following definitions are provided to facilitate understanding of certain terms used frequently herein and are not meant to limit the scope of the present disclosure.
The abbreviations used herein have their conventional meaning within the chemical and biological arts. The chemical structures and formulae set forth herein are constructed according to the standard rules of chemical valency known in the chemical arts.
Where substituent groups are specified by their conventional chemical formulae, written from left to right, they equally encompass the chemically identical substituents that would result from writing the structure from right to left, e.g., —CH2O— is equivalent to —OCH2—.
The term “alkyl,” by itself or as part of another substituent, means, unless otherwise stated, a straight (i.e., unbranched) or branched non-cyclic carbon chain (or carbon), or combination thereof, which may be fully saturated, mono- or polyunsaturated and can include di- and multivalent radicals, having the number of carbon atoms designated (i.e., C1-C10 means one to ten carbons). Examples of saturated hydrocarbon radicals include, but are not limited to, groups such as methyl, ethyl, n-propyl, isopropyl, n-butyl, t-butyl, isobutyl, sec-butyl, (cyclohexyl)methyl, homologs and isomers of, for example, n-pentyl, n-hexyl, n-heptyl, n-octyl, and the like. An unsaturated alkyl group is one having one or more double bonds or triple bonds. Examples of unsaturated alkyl groups include, but are not limited to, vinyl, 2-propenyl, crotyl, 2-isopentenyl, 2-(butadienyl), 2,4-pentadienyl, 3-(1,4-pentadienyl), ethynyl, 1- and 3-propynyl, 3-butynyl, and the higher homologs and isomers. An alkoxy is an alkyl attached to the remainder of the molecule via an oxygen linker (—O—). An alkyl moiety may be an alkenyl moiety. An alkyl moiety may be an alkynyl moiety. An alkyl moiety may be fully saturated.
The term “alkylene,” by itself or as part of another substituent, means, unless otherwise stated, a divalent radical derived from an alkyl, as exemplified, but not limited by, —CH2CH2CH2CH2—. Typically, an alkyl (or alkylene) group will have from 1 to 24 carbon atoms, with those groups having 10 or fewer carbon atoms being preferred in the present invention. A “lower alkyl” or “lower alkylene” is a shorter chain alkyl or alkylene group, generally having eight or fewer carbon atoms. The term “alkenylene,” by itself or as part of another substituent, means, unless otherwise stated, a divalent radical derived from an alkene.
The term “heteroalkyl,” by itself or in combination with another term, means, unless otherwise stated, a stable straight or branched non-cyclic chain, or combinations thereof, including at least one carbon atom and at least one heteroatom selected from the group consisting of O, N, P, Si, and S, and wherein the nitrogen and sulfur atoms may optionally be oxidized, and the nitrogen heteroatom may optionally be quaternized. The heteroatom(s) O, N, P, S, and Si may be placed at any interior position of the heteroalkyl group or at the position at which the alkyl group is attached to the remainder of the molecule. Examples include, but are not limited to: —CH2—CH2—O—CH3, —CH2—CH2—NH—CH3, —CH2—CH2—N(CH3)—CH3, —CH2—S—CH2—CH3, —CH2—CH2, —S(O)—CH3, —CH2—CH2—S(O)2—CH3, —CH═CH—O—CH3, —Si(CH3)3, —CH2—CH═N—OCH3, —CH═CH—N(CH3)—CH3, —O—CH3, —O—CH2—CH3, and —CN. Up to two or three heteroatoms may be consecutive, such as, for example, —CH2—NH—OCH3 and —CH2—O—Si(CH3)3. A heteroalkyl moiety may include one heteroatom (e.g., O, N, S, Si, or P). A heteroalkyl moiety may include two optionally different heteroatoms (e.g., O, N, S, Si, or P). A heteroalkyl moiety may include three optionally different heteroatoms (e.g., O, N, S, Si, or P). A heteroalkyl moiety may include four optionally different heteroatoms (e.g., O, N, S, Si, or P). A heteroalkyl moiety may include five optionally different heteroatoms (e.g., O, N, S, Si, or P). A heteroalkyl moiety may include up to 8 optionally different heteroatoms (e.g., O, N, S, Si, or P).
Similarly, the term “heteroalkylene,” by itself or as part of another substituent, means, unless otherwise stated, a divalent radical derived from heteroalkyl, as exemplified, but not limited by, —CH2—CH2—S—CH2—CH2— and —CH2—S—CH2—CH2—NH—CH2—. For heteroalkylene groups, heteroatoms can also occupy either or both of the chain termini (e.g., alkyleneoxy, alkylenedioxy, alkyleneamino, alkylenediamino, and the like). Still further, for alkylene and heteroalkylene linking groups, no orientation of the linking group is implied by the direction in which the formula of the linking group is written. For example, the formula —C(O)2R′— represents both —C(O)2R′— and —R′C(O)2—. As described above, heteroalkyl groups, as used herein, include those groups that are attached to the remainder of the molecule through a heteroatom, such as —C(O)R′, —C(O)NR′, —NR′R″, —OR′, —SR′, and/or —SO2R′. Where “heteroalkyl” is recited, followed by recitations of specific heteroalkyl groups, such as —NR′R″ or the like, it will be understood that the terms heteroalkyl and —NR′R″ are not redundant or mutually exclusive. Rather, the specific heteroalkyl groups are recited to add clarity. Thus, the term “heteroalkyl” should not be interpreted herein as excluding specific heteroalkyl groups, such as —NR′R″ or the like.
The terms “cycloalkyl” and “heterocycloalkyl,” by themselves or in combination with other terms, mean, unless otherwise stated, non-aromatic cyclic versions of “alkyl” and “heteroalkyl,” respectively, wherein the carbons making up the ring or rings do not necessarily need to be bonded to a hydrogen due to all carbon valencies participating in bonds with non-hydrogen atoms. Additionally, for heterocycloalkyl, a heteroatom can occupy the position at which the heterocycle is attached to the remainder of the molecule. Examples of cycloalkyl include, but are not limited to, cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl, 1-cyclohexenyl, 3-cyclohexenyl, cycloheptyl, 3-hydroxy-cyclobut-3-enyl-1,2, dione, 1H-1,2,4-triazolyl-5(4H)-one, 4H-1,2,4-triazolyl, and the like. Examples of heterocycloalkyl include, but are not limited to, 1-(1,2,5,6-tetrahydropyridyl), 1-piperidinyl, 2-piperidinyl, 3-piperidinyl, 4-morpholinyl, 3-morpholinyl, tetrahydrofuran-2-yl, tetrahydrofuran-3-yl, tetrahydrothien-2-yl, tetrahydrothien-3-yl, 1-piperazinyl, 2-piperazinyl, and the like. A “cycloalkylene” and a “heterocycloalkylene,” alone or as part of another substituent, means a divalent radical derived from a cycloalkyl and heterocycloalkyl, respectively. A heterocycloalkyl moiety may include one ring heteroatom (e.g., O, N, S, Si, or P). A heterocycloalkyl moiety may include two optionally different ring heteroatoms (e.g., O, N, S, Si, or P). A heterocycloalkyl moiety may include three optionally different ring heteroatoms (e.g., O, N, S, Si, or P). A heterocycloalkyl moiety may include four optionally different ring heteroatoms (e.g., O, N, S, Si, or P). A heterocycloalkyl moiety may include five optionally different ring heteroatoms (e.g., O, N, S, Si, or P). A heterocycloalkyl moiety may include up to 8 optionally different ring heteroatoms (e.g., O, N, S, Si, or P).
The terms “halo” or “halogen,” by themselves or as part of another substituent, mean, unless otherwise stated, a fluorine, chlorine, bromine, or iodine atom. Additionally, terms such as “haloalkyl” are meant to include monohaloalkyl and polyhaloalkyl. For example, the term “halo(C1-C4)alkyl” includes, but is not limited to, fluoromethyl, difluoromethyl, trifluoromethyl, 2,2,2-trifluoroethyl, 4-chlorobutyl, 3-bromopropyl, and the like.
The term “acyl” means, unless otherwise stated, —C(O)R where R is a substituted or unsubstituted alkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl, or substituted or unsubstituted heteroaryl.
The term “aryl” means, unless otherwise stated, a polyunsaturated, aromatic, hydrocarbon substituent, which can be a single ring or multiple rings (preferably from 1 to 3 rings) that are fused together (i.e., a fused ring aryl) or linked covalently. A fused ring aryl refers to multiple rings fused together wherein at least one of the fused rings is an aryl ring. The term “heteroaryl” refers to aryl groups (or rings) that contain at least one heteroatom such as N, O, or S, wherein the nitrogen and sulfur atoms are optionally oxidized, and the nitrogen atom(s) are optionally quaternized. Thus, the term “heteroaryl” includes fused ring heteroaryl groups (i.e., multiple rings fused together wherein at least one of the fused rings is a heteroaromatic ring). A 5,6-fused ring heteroarylene refers to two rings fused together, wherein one ring has 5 members and the other ring has 6 members, and wherein at least one ring is a heteroaryl ring. Likewise, a 6,6-fused ring heteroarylene refers to two rings fused together, wherein one ring has 6 members and the other ring has 6 members, and wherein at least one ring is a heteroaryl ring. And a 6,5-fused ring heteroarylene refers to two rings fused together, wherein one ring has 6 members and the other ring has 5 members, and wherein at least one ring is a heteroaryl ring. A heteroaryl group can be attached to the remainder of the molecule through a carbon or heteroatom. Non-limiting examples of aryl and heteroaryl groups include phenyl, 1-naphthyl, 2-naphthyl, 4-biphenyl, 1-pyrrolyl, 2-pyrrolyl, 3-pyrrolyl, 3-pyrazolyl, 2-imidazolyl, 4-imidazolyl, pyrazinyl, 2-oxazolyl, 4-oxazolyl, 2-phenyl-4-oxazolyl, 5-oxazolyl, 3-isoxazolyl, 4-isoxazolyl, 5-isoxazolyl, 2-thiazolyl, 4-thiazolyl, 5-thiazolyl, 2-furyl, 3-furyl, 2-thienyl, 3-thienyl, 2-pyridyl, 3-pyridyl, 4-pyridyl, 2-pyrimidyl, 4-pyrimidyl, 5-benzothiazolyl, purinyl, 2-benzimidazolyl, 5-indolyl, 1-isoquinolyl, 5-isoquinolyl, 2-quinoxalinyl, 5-quinoxalinyl, 3-quinolyl, and 6-quinolyl. Substituents for each of the above noted aryl and heteroaryl ring systems are selected from the group of acceptable substituents described below. An “arylene” and a “heteroarylene,” alone or as part of another substituent, mean a divalent radical derived from an aryl and heteroaryl, respectively. Non-limiting examples of aryl and heteroaryl groups include pyridinyl, pyrimidinyl, thiophenyl, thienyl, furanyl, indolyl, benzoxadiazolyl, benzodioxolyl, benzodioxanyl, thianaphthanyl, pyrrolopyridinyl, indazolyl, quinolinyl, quinoxalinyl, pyridopyrazinyl, quinazolinonyl, benzoisoxazolyl, imidazopyridinyl, benzofuranyl, benzothienyl, benzothiophenyl, phenyl, naphthyl, biphenyl, pyrrolyl, pyrazolyl, imidazolyl, pyrazinyl, oxazolyl, isoxazolyl, thiazolyl, furylthienyl, pyridyl, pyrimidyl, benzothiazolyl, purinyl, benzimidazolyl, isoquinolyl, thiadiazolyl, oxadiazolyl, pyrrolyl, diazolyl, triazolyl, tetrazolyl, benzothiadiazolyl, isothiazolyl, pyrazolopyrimidinyl, pyrrolopyrimidinyl, benzotriazolyl, benzoxazolyl, or quinolyl. The examples above may be substituted or unsubstituted and divalent radicals of each heteroaryl example above are non-limiting examples of heteroarylene. A heteroaryl moiety may include one ring heteroatom (e.g., O, N, or S). A heteroaryl moiety may include two optionally different ring heteroatoms (e.g., O, N, or S). A heteroaryl moiety may include three optionally different ring heteroatoms (e.g., O, N, or S). A heteroaryl moiety may include four optionally different ring heteroatoms (e.g., O, N, or S). A heteroaryl moiety may include five optionally different ring heteroatoms (e.g., O, N, or S). An aryl moiety may have a single ring. An aryl moiety may have two optionally different rings. An aryl moiety may have three optionally different rings. An aryl moiety may have four optionally different rings. A heteroaryl moiety may have one ring. A heteroaryl moiety may have two optionally different rings. A heteroaryl moiety may have three optionally different rings. A heteroaryl moiety may have four optionally different rings. A heteroaryl moiety may have five optionally different rings.
A fused ring heterocycloalkyl-aryl is an aryl fused to a heterocycloalkyl. A fused ring heterocycloalkyl-heteroaryl is a heteroaryl fused to a heterocycloalkyl. A fused ring heterocycloalkyl-cycloalkyl is a heterocycloalkyl fused to a cycloalkyl. A fused ring heterocycloalkyl-heterocycloalkyl is a heterocycloalkyl fused to another heterocycloalkyl. Fused ring heterocycloalkyl-aryl, fused ring heterocycloalkyl-heteroaryl, fused ring heterocycloalkyl-cycloalkyl, or fused ring heterocycloalkyl-heterocycloalkyl may each independently be unsubstituted or substituted with one or more of the substitutents described herein.
For brevity, the term “aryl” when used in combination with other terms (e.g., aryloxy, arylthioxy, arylalkyl) includes both aryl and heteroaryl rings as defined above. Thus, the term “arylalkyl” is meant to include those radicals in which an aryl group is attached to an alkyl group (e.g., benzyl, phenethyl, pyridylmethyl, and the like) including those alkyl groups in which a carbon atom (e.g., a methylene group) has been replaced by, for example, an oxygen atom (e.g., phenoxymethyl, 2-pyridyloxymethyl, 3-(1-naphthyloxy)propyl, and the like).
The term “oxo,” as used herein, means an oxygen that is double bonded to a carbon atom.
The term “alkylsulfonyl,” as used herein, means a moiety having the formula —S(O2)—R′, where R′ is a substituted or unsubstituted alkyl group as defined above. R′ may have a specified number of carbons (e.g., “C1-C4 alkylsulfonyl”).
Each of the above terms (e.g., “alkyl,” “heteroalkyl,”, “cycloalkyl”, “heterocycloalkyl”, “aryl,” and “heteroaryl”) includes both substituted and unsubstituted forms of the indicated radical. Preferred substituents for each type of radical are provided below.
Substituents for the alkyl and heteroalkyl radicals (including those groups often referred to as alkylene, alkenyl, heteroalkylene, heteroalkenyl, alkynyl, cycloalkyl, heterocycloalkyl, cycloalkenyl, and heterocycloalkenyl) can be one or more of a variety of groups selected from, but not limited to, —OR′, ═O, ═NR′, ═N—OR′, —NR′R″, —SR′, -halogen, —SiR′R″R′″, —OC(O)R′, —C(O)R′, —CO2R′, —CONR′R″, —OC(O)NR′R″, —NR″C(O)R′, —NR′—C(O)NR″R′″, —NR″C(O)2R′, —NR—C(NR′R″R′″)═NR″, —NR—C(NR′R″)═NR′″, —S(O)R′, —S(O)2R′, —S(O)2NR′R″, —NRSO2R′, —NR′NR″R′″, —ONR′R″, —NR′C═(O)NR″NR′″R″″, —CN, —NO2, in a number ranging from zero to (2m′+1), where m′ is the total number of carbon atoms in such radical. R, R′, R″, R′″, and R″″ each preferably independently refer to hydrogen, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl (e.g., aryl substituted with 1-3 halogens), substituted or unsubstituted heteroaryl, substituted or unsubstituted alkyl, alkoxy, or thioalkoxy groups, or arylalkyl groups. When a compound of the invention includes more than one R group, for example, each of the R groups is independently selected as are each R′, R″, R′″, and R″″ group when more than one of these groups is present. When R′ and R″ are attached to the same nitrogen atom, they can be combined with the nitrogen atom to form a 4-, 5-, 6-, or 7-membered ring. For example, —NR′R″ includes, but is not limited to, 1-pyrrolidinyl and 4-morpholinyl. From the above discussion of substituents, one of skill in the art will understand that the term “alkyl” is meant to include groups including carbon atoms bound to groups other than hydrogen groups, such as haloalkyl (e.g., —CF3 and —CH2CF3) and acyl (e.g., —C(O)CH3, —C(O)CF3, —C(O)CH2OCH3, and the like).
Similar to the substituents described for the alkyl radical, substituents for the aryl and heteroaryl groups are varied and are selected from, for example: —OR′, —NR′R″, —SR′, -halogen, —SiR′R″R′″, —OC(O)R′, —C(O)R′, —CO2R′, —CONR′R″, —OC(O) NR′R″, —NR″C(O)R′, —NR′—C(O)NR″R′″, —NR″C(O)2R′, —NR—C(NR′R″R′″)═NR″″, —NR—C(NR′R″)═N R′″, —S(O)R′, —S(O)2R′, —S(O)2NR′R″, —NRSO2R′, —NR′NR″R′″, —ONR′R″, —NR′C═(O)NR″NR′″R″″, —CN, —NO2, —R′, —N3, —CH(Ph)2, fluoro(C1-C4)alkoxy, and fluoro(C1-C4)alkyl, in a number ranging from zero to the total number of open valences on the aromatic ring system; and where R′, R″, R′″, and R″″ are preferably independently selected from hydrogen, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl, and substituted or unsubstituted heteroaryl. When a compound of the invention includes more than one R group, for example, each of the R groups is independently selected as are each R′, R″, R′″, and R″″ groups when more than one of these groups is present.
Two or more substituents may optionally be joined to form aryl, heteroaryl, cycloalkyl, or heterocycloalkyl groups. Such so-called ring-forming substituents are typically, though not necessarily, found attached to a cyclic base structure. In one embodiment, the ring-forming substituents are attached to adjacent members of the base structure. For example, two ring-forming substituents attached to adjacent members of a cyclic base structure create a fused ring structure. In another embodiment, the ring-forming substituents are attached to a single member of the base structure. For example, two ring-forming substituents attached to a single member of a cyclic base structure create a spirocyclic structure. In yet another embodiment, the ring-forming substituents are attached to non-adjacent members of the base structure.
Two of the substituents on adjacent atoms of the aryl or heteroaryl ring may optionally form a ring of the formula -T-C(O)—(CRR′)q—U—, wherein T and U are independently —NR—, —O—, —CRR′—, or a single bond, and q is an integer of from 0 to 3. Alternatively, two of the substituents on adjacent atoms of the aryl or heteroaryl ring may optionally be replaced with a substituent of the formula -A-(CH2)r—B—, wherein A and B are independently —CRR′—, —O—, —NR—, —S—, —S(O)—, —S(O)2—, —S(O)2NR′—, or a single bond, and r is an integer of from 1 to 4. One of the single bonds of the new ring so formed may optionally be replaced with a double bond. Alternatively, two of the substituents on adjacent atoms of the aryl or heteroaryl ring may optionally be replaced with a substituent of the formula —(CRR′)s—X′—(C″R″R′″)d—, where s and d are independently integers of from 0 to 3, and X′ is —O—, —NR′—, —S—, —S(O)—, —S(O)2—, or —S(O)2NR′—. The substituents R, R′, R″, and R′″ are preferably independently selected from hydrogen, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl, and substituted or unsubstituted heteroaryl.
As used herein, the terms “heteroatom” or “ring heteroatom” are meant to include, oxygen (O), nitrogen (N), sulfur (S), phosphorus (P), and silicon (Si).
A “substituent group,” as used herein, means a group selected from the following moieties:
A “size-limited substituent” or “size-limited substituent group,” as used herein, means a group selected from all of the substituents described above for a “substituent group,” wherein each substituted or unsubstituted alkyl is a substituted or unsubstituted C1-C20 alkyl, each substituted or unsubstituted heteroalkyl is a substituted or unsubstituted 2 to 20 membered heteroalkyl, each substituted or unsubstituted cycloalkyl is a substituted or unsubstituted C3-C8 cycloalkyl, each substituted or unsubstituted heterocycloalkyl is a substituted or unsubstituted 3 to 8 membered heterocycloalkyl, each substituted or unsubstituted aryl is a substituted or unsubstituted C6-C10 aryl, and each substituted or unsubstituted heteroaryl is a substituted or unsubstituted 5 to 10 membered heteroaryl.
A “lower substituent” or “lower substituent group,” as used herein, means a group selected from all of the substituents described above for a “substituent group,” wherein each substituted or unsubstituted alkyl is a substituted or unsubstituted C1-C8 alkyl, each substituted or unsubstituted heteroalkyl is a substituted or unsubstituted 2 to 8 membered heteroalkyl, each substituted or unsubstituted cycloalkyl is a substituted or unsubstituted C3-C7 cycloalkyl, each substituted or unsubstituted heterocycloalkyl is a substituted or unsubstituted 3 to 7 membered heterocycloalkyl, each substituted or unsubstituted aryl is a substituted or unsubstituted C6-C10 aryl, and each substituted or unsubstituted heteroaryl is a substituted or unsubstituted 5 to 9 membered heteroaryl.
In some embodiments, each substituted group described in the compounds herein is substituted with at least one substituent group. More specifically, in some embodiments, each substituted alkyl, substituted heteroalkyl, substituted cycloalkyl, substituted heterocycloalkyl, substituted aryl, substituted heteroaryl, substituted alkylene, substituted heteroalkylene, substituted cycloalkylene, substituted heterocycloalkylene, substituted arylene, and/or substituted heteroarylene described in the compounds herein are substituted with at least one substituent group. In other embodiments, at least one or all of these groups are substituted with at least one size-limited substituent group. In other embodiments, at least one or all of these groups are substituted with at least one lower substituent group.
In other embodiments of the compounds herein, each substituted or unsubstituted alkyl may be a substituted or unsubstituted C1-C20 alkyl, each substituted or unsubstituted heteroalkyl is a substituted or unsubstituted 2 to 20 membered heteroalkyl, each substituted or unsubstituted cycloalkyl is a substituted or unsubstituted C3-C8 cycloalkyl, each substituted or unsubstituted heterocycloalkyl is a substituted or unsubstituted 3 to 8 membered heterocycloalkyl, each substituted or unsubstituted aryl is a substituted or unsubstituted C6-C10 aryl, and/or each substituted or unsubstituted heteroaryl is a substituted or unsubstituted 5 to 10 membered heteroaryl. In some embodiments of the compounds herein, each substituted or unsubstituted alkylene is a substituted or unsubstituted C1-C20 alkylene, each substituted or unsubstituted heteroalkylene is a substituted or unsubstituted 2 to 20 membered heteroalkylene, each substituted or unsubstituted cycloalkylene is a substituted or unsubstituted C3-C8 cycloalkylene, each substituted or unsubstituted heterocycloalkylene is a substituted or unsubstituted 3 to 8 membered heterocycloalkylene, each substituted or unsubstituted arylene is a substituted or unsubstituted C6-C10 arylene, and/or each substituted or unsubstituted heteroarylene is a substituted or unsubstituted 5 to 10 membered heteroarylene.
In some embodiments, each substituted or unsubstituted alkyl is a substituted or unsubstituted C1-C8 alkyl, each substituted or unsubstituted heteroalkyl is a substituted or unsubstituted 2 to 8 membered heteroalkyl, each substituted or unsubstituted cycloalkyl is a substituted or unsubstituted C3-C7 cycloalkyl, each substituted or unsubstituted heterocycloalkyl is a substituted or unsubstituted 3 to 7 membered heterocycloalkyl, each substituted or unsubstituted aryl is a substituted or unsubstituted C6-C10 aryl, and/or each substituted or unsubstituted heteroaryl is a substituted or unsubstituted 5 to 9 membered heteroaryl. In some embodiments, each substituted or unsubstituted alkylene is a substituted or unsubstituted C1-C8 alkylene, each substituted or unsubstituted heteroalkylene is a substituted or unsubstituted 2 to 8 membered heteroalkylene, each substituted or unsubstituted cycloalkylene is a substituted or unsubstituted C3-C7 cycloalkylene, each substituted or unsubstituted heterocycloalkylene is a substituted or unsubstituted 3 to 7 membered heterocycloalkylene, each substituted or unsubstituted arylene is a substituted or unsubstituted C6-C10 arylene, and/or each substituted or unsubstituted heteroarylene is a substituted or unsubstituted 5 to 9 membered heteroarylene. In some embodiments, the compound is a chemical species set forth in the Examples section, figures, or tables below.
Unless otherwise stated, structures depicted herein are also meant to include all stereochemical forms of the structure; i.e., the R and S configurations for each asymmetric center. Therefore, single stereochemical isomers as well as enantiomeric and diastereomeric mixtures of the present compounds are within the scope disclosed herein.
The symbol “” denotes the point of attachment of a chemical moiety to the remainder of a molecule or chemical formula.
The terms “a” or “an,” as used in herein means one or more. In addition, the phrase “substituted with a[n],” as used herein, means the specified group may be substituted with one or more of any or all of the named substituents. For example, where a group, such as an alkyl or heteroaryl group, is “substituted with an unsubstituted C1-C20 alkyl, or unsubstituted 2 to 20 membered heteroalkyl,” the group may contain one or more unsubstituted C1-C20 alkyls, and/or one or more unsubstituted 2 to 20 membered heteroalkyls. Moreover, where a moiety is substituted with an R substituent, the group may be referred to as “R-substituted.” Where a moiety is R-substituted, the moiety is substituted with at least one R substituent and each R substituent is optionally different.
Descriptions of compounds of the present invention are limited by principles of chemical bonding known to those skilled in the art. Accordingly, where a group may be substituted by one or more of a number of substituents, such substitutions are selected so as to comply with principles of chemical bonding and to give compounds which are not inherently unstable and/or would be known to one of ordinary skill in the art as likely to be unstable under ambient conditions, such as aqueous, neutral, and several known physiological conditions. For example, a heterocycloalkyl or heteroaryl is attached to the remainder of the molecule via a ring heteroatom in compliance with principles of chemical bonding known to those skilled in the art thereby avoiding inherently unstable compounds.
“Nucleic acid” refers to deoxyribonucleotides or ribonucleotides and polymers thereof in either single-, double- or multiple-stranded form, or complements thereof. The term “polynucleotide” refers to a linear sequence of nucleotides. The term “nucleotide” typically refers to a single unit of a polynucleotide, i.e., a monomer. Nucleotides can be ribonucleotides, deoxyribonucleotides, or modified versions thereof. Examples of polynucleotides contemplated herein include single and double stranded DNA, single and double stranded RNA (including siRNA), and hybrid molecules having mixtures of single and double stranded DNA and RNA. Nucleic acids can be linear or branched. For example, nucleic acids can be a linear chain of nucleotides or the nucleic acids can be branched, e.g., such that the nucleic acids comprise one or more arms or branches of nucleotides. Optionally, the branched nucleic acids are repetitively branched to form higher ordered structures such as dendrimers and the like.
Nucleic acids, including nucleic acids with a phosphothioate backbone can include one or more reactive moieties. As used herein, the term reactive moiety includes any group capable of reacting with another molecule, e.g., a nucleic acid or polypeptide through covalent, non-covalent or other interactions. By way of example, the nucleic acid can include an amino acid reactive moiety that reacts with an amino acid on a protein or polypeptide through a covalent, non-covalent or other interaction.
The terms also encompass nucleic acids containing known nucleotide analogs or modified backbone residues or linkages, which are synthetic, naturally occurring, and non-naturally occurring, which have similar binding properties as the reference nucleic acid, and which are metabolized in a manner similar to the reference nucleotides. Examples of such analogs include, without limitation, phosphodiester derivatives including, e.g., phosphoramidate, phosphorodiamidate, phosphorothioate (also known as phosphothioate), phosphorodithioate, phosphonocarboxylic acids, phosphonocarboxylates, phosphonoacetic acid, phosphonoformic acid, methyl phosphonate, boron phosphonate, or O-methylphosphoroamidite linkages (see Eckstein, Oligonucleotides and Analogues: A Practical Approach, Oxford University Press); and peptide nucleic acid backbones and linkages. Other analog nucleic acids include those with positive backbones; non-ionic backbones, modified sugars, and non-ribose backbones (e.g. phosphorodiamidate morpholino oligos or locked nucleic acids (LNA)), including those described in U.S. Pat. Nos. 5,235,033 and 5,034,506, and Chapters 6 and 7, ASC Symposium Series 580, Carbohydrate Modifications in Antisense Research, Sanghui & Cook, eds. Nucleic acids containing one or more carbocyclic sugars are also included within one definition of nucleic acids. Modifications of the ribose-phosphate backbone may be done for a variety of reasons, e.g., to increase the stability and half-life of such molecules in physiological environments or as probes on a biochip. Mixtures of naturally occurring nucleic acids and analogs can be made; alternatively, mixtures of different nucleic acid analogs, and mixtures of naturally occurring nucleic acids and analogs may be made. In embodiments, the internucleotide linkages in DNA are phosphodiester, phosphodiester derivatives, or a combination of both.
Nucleic acids can include nonspecific sequences. As used herein, the term “nonspecific sequence” refers to a nucleic acid sequence that contains a series of residues that are not designed to be complementary to or are only partially complementary to any other nucleic acid sequence. By way of example, a nonspecific nucleic acid sequence is a sequence of nucleic acid residues that does not function as an inhibitory nucleic acid when contacted with a cell or organism. An “inhibitory nucleic acid” is a nucleic acid (e.g. DNA, RNA, polymer of nucleotide analogs) that is capable of binding to a target nucleic acid (e.g. an mRNA translatable into a protein) and reducing transcription of the target nucleic acid (e.g. mRNA from DNA) or reducing the translation of the target nucleic acid (e.g. mRNA) or altering transcript splicing (e.g. single stranded morpholino oligo).
A “labeled nucleic acid or oligonucleotide” is one that is bound, either covalently, through a linker or a chemical bond, or noncovalently, through ionic, van der Waals, electrostatic, or hydrogen bonds to a label such that the presence of the nucleic acid may be detected by detecting the presence of the detectable label bound to the nucleic acid. Alternatively, a method using high affinity interactions may achieve the same results where one of a pair of binding partners binds to the other, e.g., biotin, streptavidin. In embodiments, the phosphorothioate nucleic acid or phosphorothioate polymer backbone includes a detectable label, as disclosed herein and generally known in the art.
The term “probe” or “primer”, as used herein, is defined to be one or more nucleic acid fragments whose specific hybridization to a sample can be detected. A probe or primer can be of any length depending on the particular technique it will be used for. For example, PCR primers are generally between 10 and 40 nucleotides in length, while nucleic acid probes for, e.g., a Southern blot, can be more than a hundred nucleotides in length. The probe may be unlabeled or labeled as described below so that its binding to the target or sample can be detected. The probe can be produced from a source of nucleic acids from one or more particular (preselected) portions of a chromosome, e.g., one or more clones, an isolated whole chromosome or chromosome fragment, or a collection of polymerase chain reaction (PCR) amplification products. The length and complexity of the nucleic acid fixed onto the target element is not critical to the invention. One of skill can adjust these factors to provide optimum hybridization and signal production for a given hybridization procedure, and to provide the required resolution among different genes or genomic locations.
The probe may also be isolated nucleic acids immobilized on a solid surface (e.g., nitrocellulose, glass, quartz, fused silica slides), as in an array. In some embodiments, the probe may be a member of an array of nucleic acids as described, for instance, in WO 96/17958. Techniques capable of producing high density arrays can also be used for this purpose (see, e.g., Fodor (1991) Science 767-773; Johnston (1998) Curr. Biol. 8: R171-R174; Schummer (1997) Biotechniques 23: 1087-1092; Kern (1997) Biotechniques 23: 120-124; U.S. Pat. No. 5,143,854).
The words “complementary” or “complementarity” refer to the ability of a nucleic acid in a polynucleotide to form a base pair with another nucleic acid in a second polynucleotide. For example, the sequence A-G-T is complementary to the sequence T-C-A. Complementarity may be partial, in which only some of the nucleic acids match according to base pairing, or complete, where all the nucleic acids match according to base pairing.
The term “isolated”, when applied to a nucleic acid or protein, denotes that the nucleic acid or protein is essentially free of other cellular components with which it is associated in the natural state. It can be, for example, in a homogeneous state and may be in either a dry or aqueous solution. Purity and homogeneity are typically determined using analytical chemistry techniques such as polyacrylamide gel electrophoresis or high performance liquid chromatography. A protein that is the predominant species present in a preparation is substantially purified.
The term “purified” denotes that a nucleic acid or protein gives rise to essentially one band in an electrophoretic gel. In some embodiments, the nucleic acid or protein is at least 50% pure, optionally at least 65% pure, optionally at least 75% pure, optionally at least 85% pure, optionally at least 95% pure, and optionally at least 99% pure.
The term “isolated” may also refer to a cell or sample cells. An isolated cell or sample cells are a single cell type that is substantially free of many of the components which normally accompany the cells when they are in their native state or when they are initially removed from their native state. In certain embodiments, an isolated cell sample retains those components from its natural state that are required to maintain the cell in a desired state. In some embodiments, an isolated (e.g. purified, separated) cell or isolated cells, are cells that are substantially the only cell type in a sample. A purified cell sample may contain at least 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% of one type of cell. An isolated cell sample may be obtained through the use of a cell marker or a combination of cell markers, either of which is unique to one cell type in an unpurified cell sample. In some embodiments, the cells are isolated through the use of a cell sorter. In some embodiments, antibodies against cell proteins are used to isolate cells.
As used herein, the term “conjugate” refers to the association between atoms or molecules. The association can be direct or indirect. For example, a conjugate between a polymer and a ligand or recognition moiety provided herein can be direct, e.g., by covalent bond, or indirect, e.g., by non-covalent bond (e.g. electrostatic interactions (e.g. ionic bond, hydrogen bond, halogen bond), van der Waals interactions (e.g. dipole-dipole, dipole-induced dipole, London dispersion), ring stacking (pi effects), hydrophobic interactions and the like). In embodiments, conjugates are formed using conjugate chemistry including, but are not limited to nucleophilic substitutions (e.g., reactions of amines and alcohols with acyl halides, active esters), electrophilic substitutions (e.g., enamine reactions) and additions to carbon-carbon and carbon-heteroatom multiple bonds (e.g., Michael reaction, Diels-Alder addition). These and other useful reactions are discussed in, for example, March, ADVANCED ORGANIC CHEMISTRY, 3rd Ed., John Wiley & Sons, New York, 1985; Hermanson, BIOCONJUGATE TECHNIQUES, Academic Press, San Diego, 1996; and Feeney et al., MODIFICATION OF PROTEINS; Advances in Chemistry Series, Vol. 198, American Chemical Society, Washington, D.C., 1982. In embodiments, the polymer is non-covalently attached to the ligand through a non-covalent chemical reaction between a component of the polymer and a component of the ligand. In other embodiments, the polymer is covalently bound to the ligand or recognition moiety using a covalent linker, wherein the covalent linker is attached to the polymer at one end and to the ligand or recognition moiety at the other end. The linker attachment to the polymer or to the ligand or recognition moiety may be accomplished using one or more reactive moieties, e.g., bioconjugate techniques, a covalent reactive moiety, as described herein (e.g., alkyne, azide, maleimide or thiol reactive moiety).
Useful reactive moieties or functional groups (chemical reactive functional groups) used for conjugate chemistries (click chemistries) herein include, for example:
(a) carboxyl groups and various derivatives thereof including, but not limited to, N-hydroxysuccinimide esters, N-hydroxybenztriazole esters, acid halides, acyl imidazoles, thioesters, p-nitrophenyl esters, alkyl, alkenyl, alkynyl and aromatic esters;
(b) hydroxyl groups which can be converted to esters, ethers, aldehydes, etc.
(c) haloalkyl groups wherein the halide can be later displaced with a nucleophilic group such as, for example, an amine, a carboxylate anion, thiol anion, carbanion, or an alkoxide ion, thereby resulting in the covalent attachment of a new group at the site of the halogen atom;
(d) dienophile groups which are capable of participating in Diels-Alder reactions such as, for example, maleimido groups;
(e) aldehyde or ketone groups such that subsequent derivatization is possible via formation of carbonyl derivatives such as, for example, imines, hydrazones, semicarbazones or oximes, or via such mechanisms as Grignard addition or alkyllithium addition;
(f) sulfonyl halide groups for subsequent reaction with amines, for example, to form sulfonamides;
(g) thiol groups, which can be converted to disulfides, reacted with acyl halides, or bonded to metals such as gold;
(h) amine or sulfhydryl groups, which can be, for example, acylated, alkylated or oxidized;
(i) alkenes, which can undergo, for example, cycloadditions, acylation, Michael addition, and the like;
(j) epoxides, which can react with, for example, amines and hydroxyl compounds;
(k) phosphoramidites and other standard functional groups useful in nucleic acid synthesis;
(l) metal silicon oxide bonding;
(m) metal bonding to reactive phosphorus groups (e.g. phosphines) to form, for example, phosphate diester bonds; and
(n) sulfones, for example, vinyl sulfone.
Chemical synthesis of compositions by joining modular units using conjugate (click) chemistry may also be sued to attach the covalent linker to the polymer and/or to the ligand or recognition moiety, which is well known in the art and described, for example, in H. C. Kolb, M. G. Finn and K. B. Sharpless ((2001). “Click Chemistry: Diverse Chemical Function from a Few Good Reactions”. Angewandte Chemie International Edition 40 (11): 2004-2021); R. A. Evans ((2007). “The Rise of Azide-Alkyne 1,3-Dipolar ‘Click’ Cycloaddition and its Application to Polymer Science and Surface Modification”. Australian Journal of Chemistry 60 (6): 384-395; W. C. Guida et al. Med. Res. Rev. p 3 1996; Spiteri, Christian and Moses, John E. ((2010). “Copper-Catalyzed Azide-Alkyne Cycloaddition: Regioselective Synthesis of 1,4,5-Trisubstituted 1,2,3-Triazoles”. Angewandte Chemie International Edition 49 (1): 31-33); Hoyle, Charles E. and Bowman, Christopher N. ((2010). “Thiol-Ene Click Chemistry”. Angewandte Chemie International Edition 49 (9): 1540-1573); Blackman, Melissa L. and Royzen, Maksim and Fox, Joseph M. ((2008). “Tetrazine Ligation: Fast Bioconjugation Based on Inverse-Electron-Demand Diels-Alder Reactivity”. Journal of the American Chemical Society 130 (41): 13518-13519); Devaraj, Neal K. and Weissleder, Ralph and Hilderbrand, Scott A. ((2008). “Tetrazine Based Cycloadditions: Application to Pretargeted Live Cell Labeling”. Bioconjugate Chemistry 19 (12): 2297-2299); Stöckmann, Henning; Neves, Andre; Stairs, Shaun; Brindle, Kevin; Leeper, Finian ((2011). “Exploring isonitrile-based click chemistry for ligation with biomolecules”. Organic & Biomolecular Chemistry), all of which are hereby incorporated by reference in their entirety and for all purposes.
The reactive functional groups can be chosen such that they do not participate in, or interfere with, the chemical stability of the proteins described herein. By way of example, the polymer or ligand/recognition moiety can include a vinyl sulfone or other reactive moiety (e.g., maleimide). Optionally, the polymer or ligand can include a reactive moiety having the formula S—S—R. R can be, for example, a protecting group. Optionally, R is hexanol. As used herein, the term hexanol includes compounds with the formula C6H13OH and includes, 1-hexanol, 2-hexanol, 3-hexanol, 2-methyl-1-pentanol, 3-methyl-1-pentanol, 4-methyl-1-pentanol, 2-methyl-2-pentanol, 3-methyl-2-pentanol, 4-methyl-2-pentanol, 2-methyl-3-pentanol, 3-methyl-3-pentanol, 2,2-dimethyl-1-butanol, 2,3-dimethyl-1-butanol, 3,3-dimethyl-1-butanol, 2,3-dimethyl-2-butanol, 3,3-dimethyl-2-butanol, and 2-ethyl-1-butanol. Optionally, R is 1-hexanol.
As used herein, the term “about” means a range of values including the specified value, which a person of ordinary skill in the art would consider reasonably similar to the specified value. In embodiments, the term “about” means within a standard deviation using measurements generally acceptable in the art. In embodiments, about means a range extending to +/−10% of the specified value. In embodiments, about means the specified value.
The terms “polypeptide,” “peptide” and “protein” are used interchangeably herein to refer to a polymer of amino acid residues, wherein the polymer may be conjugated to a moiety that does not consist of amino acids. The terms apply to amino acid polymers in which one or more amino acid residue is an artificial chemical mimetic of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers and non-naturally occurring amino acid polymers. The terms apply to macrocyclic peptides, peptides that have been modified with non-peptide functionality, peptidomimetics, polyamides, and macrolactams. A “fusion protein” refers to a chimeric protein encoding two or more separate protein sequences that are recombinantly expressed as a single moiety.
The term “peptidyl” and “peptidyl moiety” means a monovalent peptide.
The term “amino acid” refers to naturally occurring and synthetic amino acids, as well as amino acid analogs and amino acid mimetics that function in a manner similar to the naturally occurring amino acids. Naturally occurring amino acids are those encoded by the genetic code, as well as those amino acids that are later modified, e.g., hydroxyproline, γ-carboxyglutamate, and O-phosphoserine. Amino acid analogs refers to compounds that have the same basic chemical structure as a naturally occurring amino acid, i.e., an α carbon that is bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine, norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs have modified R groups (e.g., norleucine) or modified peptide backbones, but retain the same basic chemical structure as a naturally occurring amino acid. Amino acid mimetics refers to chemical compounds that have a structure that is different from the general chemical structure of an amino acid, but that functions in a manner similar to a naturally occurring amino acid. The terms “non-naturally occurring amino acid” and “unnatural amino acid” refer to amino acid analogs, synthetic amino acids, and amino acid mimetics which are not found in nature.
Amino acids may be referred to herein by either their commonly known three letter symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly accepted single-letter codes.
An amino acid or nucleotide base “position” is denoted by a number that sequentially identifies each amino acid (or nucleotide base) in the reference sequence based on its position relative to the N-terminus (or 5′-end). Due to deletions, insertions, truncations, fusions, and the like that must be taken into account when determining an optimal alignment, in general the amino acid residue number in a test sequence determined by simply counting from the N-terminus will not necessarily be the same as the number of its corresponding position in the reference sequence. For example, in a case where a variant has a deletion relative to an aligned reference sequence, there will be no amino acid in the variant that corresponds to a position in the reference sequence at the site of deletion. Where there is an insertion in an aligned reference sequence, that insertion will not correspond to a numbered amino acid position in the reference sequence. In the case of truncations or fusions there can be stretches of amino acids in either the reference or aligned sequence that do not correspond to any amino acid in the corresponding sequence.
The terms “numbered with reference to” or “corresponding to,” when used in the context of the numbering of a given amino acid or polynucleotide sequence, refers to the numbering of the residues of a specified reference sequence when the given amino acid or polynucleotide sequence is compared to the reference sequence.
“Conservatively modified variants” applies to both amino acid and nucleic acid sequences. With respect to particular nucleic acid sequences, conservatively modified variants refers to those nucleic acids which encode identical or essentially identical amino acid sequences, or where the nucleic acid does not encode an amino acid sequence, to essentially identical sequences. Because of the degeneracy of the genetic code, a large number of functionally identical nucleic acids encode any given protein. For instance, the codons GCA, GCC, GCG and GCU all encode the amino acid alanine. Thus, at every position where an alanine is specified by a codon, the codon can be altered to any of the corresponding codons described without altering the encoded polypeptide. Such nucleic acid variations are “silent variations,” which are one species of conservatively modified variations. Every nucleic acid sequence herein which encodes a polypeptide also describes every possible silent variation of the nucleic acid. One of skill will recognize that each codon in a nucleic acid (except AUG, which is ordinarily the only codon for methionine, and TGG, which is ordinarily the only codon for tryptophan) can be modified to yield a functionally identical molecule. Accordingly, each silent variation of a nucleic acid which encodes a polypeptide is implicit in each described sequence with respect to the expression product, but not with respect to actual probe sequences.
As to amino acid sequences, one of skill will recognize that individual substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein sequence which alters, adds or deletes a single amino acid or a small percentage of amino acids in the encoded sequence is a “conservatively modified variant” where the alteration results in the substitution of an amino acid with a chemically similar amino acid. Conservative substitution tables providing functionally similar amino acids are well known in the art. Such conservatively modified variants are in addition to and do not exclude polymorphic variants, interspecies homologs, and alleles of the invention.
The following eight groups each contain amino acids that are conservative substitutions for one another: 1) Alanine (A), Glycine (G); 2) Aspartic acid (D), Glutamic acid (E); 3) Asparagine (N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5) Isoleucine (I), Leucine (L), Methionine (M), Valine (V); 6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W); 7) Serine (S), Threonine (T); and 8) Cysteine (C), Methionine (M) (see, e.g., Creighton, Proteins (1984)).
“Percentage of sequence identity” is determined by comparing two optimally aligned sequences over a comparison window, wherein the portion of the polynucleotide or polypeptide sequence in the comparison window may comprise additions or deletions (i.e., gaps) as compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which the identical nucleic acid base or amino acid residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison and multiplying the result by 100 to yield the percentage of sequence identity.
The terms “identical” or percent “identity,” in the context of two or more nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that are the same or have a specified percentage of amino acid residues or nucleotides that are the same (i.e., 60% identity, optionally 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99% identity over a specified region, e.g., of the entire polypeptide sequences of the invention or individual domains of the polypeptides of the invention), when compared and aligned for maximum correspondence over a comparison window, or designated region as measured using one of the following sequence comparison algorithms or by manual alignment and visual inspection. Such sequences are then said to be “substantially identical.” This definition also refers to the complement of a test sequence. Optionally, the identity exists over a region that is at least about 50 nucleotides in length, or more preferably over a region that is 100 to 500 or 1000 or more nucleotides in length.
For sequence comparison, typically one sequence acts as a reference sequence, to which test sequences are compared. When using a sequence comparison algorithm, test and reference sequences are entered into a computer, subsequence coordinates are designated, if necessary, and sequence algorithm program parameters are designated. Default program parameters can be used, or alternative parameters can be designated. The sequence comparison algorithm then calculates the percent sequence identities for the test sequences relative to the reference sequence, based on the program parameters.
A “comparison window”, as used herein, includes reference to a segment of any one of the number of contiguous positions selected from the group consisting of, e.g., a full length sequence or from 20 to 600, about 50 to about 200, or about 100 to about 150 amino acids or nucleotides in which a sequence may be compared to a reference sequence of the same number of contiguous positions after the two sequences are optimally aligned. Methods of alignment of sequences for comparison are well known in the art. Optimal alignment of sequences for comparison can be conducted, e.g., by the local homology algorithm of Smith and Waterman (1970) Adv. Appl. Math. 2:482c, by the homology alignment algorithm of Needleman and Wunsch (1970) J. Mol. Biol. 48:443, by the search for similarity method of Pearson and Lipman (1988) Proc. Nat'l. Acad. Sci. USA 85:2444, by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, Wis.), or by manual alignment and visual inspection (see, e.g., Ausubel et al., Current Protocols in Molecular Biology (1995 supplement)).
An example of an algorithm that is suitable for determining percent sequence identity and sequence similarity are the BLAST and BLAST 2.0 algorithms, which are described in Altschul et al. (1977) Nuc. Acids Res. 25:3389-3402, and Altschul et al. (1990) J. Mol. Biol. 215:403-410, respectively. Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short words of length W in the query sequence, which either match or satisfy some positive-valued threshold score T when aligned with a word of the same length in a database sequence. T is referred to as the neighborhood word score threshold (Altschul et al., supra). These initial neighborhood word hits act as seeds for initiating searches to find longer HSPs containing them. The word hits are extended in both directions along each sequence for as far as the cumulative alignment score can be increased. Cumulative scores are calculated using, for nucleotide sequences, the parameters M (reward score for a pair of matching residues; always >0) and N (penalty score for mismatching residues; always <0). For amino acid sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word hits in each direction are halted when: the cumulative alignment score falls off by the quantity X from its maximum achieved value; the cumulative score goes to zero or below, due to the accumulation of one or more negative-scoring residue alignments; or the end of either sequence is reached. The BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) uses as defaults a wordlength (W) of 11, an expectation (E) or 10, M=5, N=−4 and a comparison of both strands. For amino acid sequences, the BLASTP program uses as defaults a wordlength of 3, and expectation (E) of 10, and the BLOSUM62 scoring matrix (see Henikoff and Henikoff (1989) Proc. Natl. Acad. Sci. USA 89:10915) alignments (B) of 50, expectation (E) of 10, M=5, N=−4, and a comparison of both strands.
The BLAST algorithm also performs a statistical analysis of the similarity between two sequences (see, e.g., Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873-5787). One measure of similarity provided by the BLAST algorithm is the smallest sum probability (P(N)), which provides an indication of the probability by which a match between two nucleotide or amino acid sequences would occur by chance. For example, a nucleic acid is considered similar to a reference sequence if the smallest sum probability in a comparison of the test nucleic acid to the reference nucleic acid is less than about 0.2, more preferably less than about 0.01, and most preferably less than about 0.001.
An indication that two nucleic acid sequences or polypeptides are substantially identical is that the polypeptide encoded by the first nucleic acid is immunologically cross-reactive with the antibodies raised against the polypeptide encoded by the second nucleic acid, as described below. Thus, a polypeptide is typically substantially identical to a second polypeptide, for example, where the two peptides differ only by conservative substitutions. Another indication that two nucleic acid sequences are substantially identical is that the two molecules or their complements hybridize to each other under stringent conditions, as described below. Yet another indication that two nucleic acid sequences are substantially identical is that the same primers can be used to amplify the sequence.
“Contacting” is used in accordance with its plain ordinary meaning and refers to the process of allowing at least two distinct species (e.g. chemical compounds including biomolecules or cells) to become sufficiently proximal to react, interact or physically touch. It should be appreciated; however, the resulting reaction product can be produced directly from a reaction between the added reagents or from an intermediate from one or more of the added reagents which can be produced in the reaction mixture.
A “control” sample or value refers to a sample that serves as a reference, usually a known reference, for comparison to a test sample. For example, a test sample can be taken from a test condition, e.g., in the presence of a test compound, and compared to samples from known conditions, e.g., in the absence of the test compound (negative control), or in the presence of a known compound (positive control). A control can also represent an average value gathered from a number of tests or results. One of skill in the art will recognize that controls can be designed for assessment of any number of parameters. For example, a control can be devised to compare therapeutic benefit based on pharmacological data (e.g., half-life) or therapeutic measures (e.g., comparison of side effects). One of skill in the art will understand which standard controls are most appropriate in a given situation and be able to analyze data based on comparisons to standard control values. Standard controls are also valuable for determining the significance (e.g. statistical significance) of data. For example, if values for a given parameter are widely variant in standard controls, variation in test samples will not be considered as significant.
A “label” or a “detectable moiety” is a composition detectable by spectroscopic, photochemical, biochemical, immunochemical, chemical, or other physical means. For example, useful labels include 32P, fluorescent dyes, electron-dense reagents, enzymes (e.g., as commonly used in an ELISA), biotin, digoxigenin, or haptens and proteins or other entities which can be made detectable, e.g., by incorporating a radiolabel into a peptide or antibody specifically reactive with a target peptide. Any appropriate method known in the art for conjugating an antibody to the label may be employed, e.g., using methods described in Hermanson, Bioconjugate Techniques 1996, Academic Press, Inc., San Diego.
A “labeled protein or polypeptide” is one that is bound, either covalently, through a linker or a chemical bond, or noncovalently, through ionic, van der Waals, electrostatic, or hydrogen bonds to a label such that the presence of the labeled protein or polypeptide may be detected by detecting the presence of the label bound to the labeled protein or polypeptide. Alternatively, methods using high affinity interactions may achieve the same results where one of a pair of binding partners binds to the other, e.g., biotin, streptavidin.
“Biological sample” or “sample” refer to materials obtained from or derived from a subject or patient. A biological sample includes sections of tissues such as biopsy and autopsy samples, and frozen sections taken for histological purposes. Such samples include bodily fluids such as blood and blood fractions or products (e.g., serum, plasma, platelets, red blood cells, and the like), sputum, tissue, cultured cells (e.g., primary cultures, explants, and transformed cells) stool, urine, synovial fluid, joint tissue, synovial tissue, synoviocytes, fibroblast-like synoviocytes, macrophage-like synoviocytes, immune cells, hematopoietic cells, fibroblasts, macrophages, T cells, etc. A biological sample is typically obtained from a eukaryotic organism, such as a mammal such as a primate e.g., chimpanzee or human; cow; dog; cat; a rodent, e.g., guinea pig, rat, mouse; rabbit; or a bird; reptile; or fish.
A “cell” as used herein, refers to a cell carrying out metabolic or other function sufficient to preserve or replicate its genomic DNA. A cell can be identified by well-known methods in the art including, for example, presence of an intact membrane, staining by a particular dye, ability to produce progeny or, in the case of a gamete, ability to combine with a second gamete to produce a viable offspring. Cells may include prokaryotic and eukaryotic cells. Prokaryotic cells include but are not limited to bacteria. Eukaryotic cells include but are not limited to yeast cells and cells derived from plants and animals, for example mammalian, insect (e.g., spodoptera) and human cells.
The term “cell surface marker” as used herein, refers to a protein or a group of proteins expressed on the surface of cells that serve as markers of specific cell types.
The term “polymer” or “polymers” as provided herein refers to synthetic or natural molecules, or macromolecules, composed of multiple repeated subunits (monomers). Synthetic polymers (e.g., synthetic plastics such as polystyrene) and natural biopolymers (e.g., DNA, proteins) may be distinguished. Polymers, both natural and synthetic, are created via polymerization of many small molecules, known as monomers. In embodiments, polymers have a large molecular mass relative to small molecule compounds and, therefore, produce unique physical properties (e.g., toughness, viscoelasticity, tendency to form glasses and semicrystalline structures). In embodiments, the polymers are charged (charged polymers). The charged polymers provided herein may include a positive charge or a negative charge. Thus, in embodiments, the charged polymer is an anionic polymer. In embodiments, the charged polymer is a cationic polymer. Non-limiting examples of polymers useful for the compositions and methods provided herein include gum arabic, gum acacia, gum tragacanth, locust bean gum, guar gum, hydroxypropyl guar, xanthan gum, talc, cellulose gum, sclerotium gum, carageenan gum, karaya gum, cellulose gum, rosin, methylcellulose, hydroxyethylcellulose, hydroxypropylcellulose, hydroxymethylcellulose, hydroxypropylmethylcellulose, methylhydroxyethylcellulose, cetyl hydroxyethylcellulose, carboxymethylcellulose, corn starch, hydroxypropyl starch phosphate, distarch phosphate, distarch dimethylene urea, aluminum starch octenyl succinate, maltodextrin, dextran, poly(acrylamide), PEG-150 distearate, PEG-150/decyl alcohol/SMDI copolymer, PEG-150/stearyl alcohol/SMDI copolymer, PEG-180/Laureth-50/TMMG copolymer, Polyether 1, acrylic acid/acrylamidomethyl propane sulfonic acid copolymer, acrylate/C10-30 alkyl acrylate cross polymer, acrylate/beheneth-25 methacrylate copolymer, acrylate/steareth-20 methacrylate copolymer, acrylate/steareth-20 copolymer, acrylate/VA cross polymer, acrylic acid/acrylonitrogen copolymer, ammonium acryloyldimethyltaurate/beheneth-25 methacrylate copolymer, ammonium acryloyldimethyltaurate/VP copolymer, sodium acrylate copolymer, PVM/MA decadiene cross polymer, alginic acid, propylene glycol alginate, dimethicone, silica dimethyl silylate, a dimethylacrylamide/acrylic acid/polystyrene ethyl methacrylate copolymer, PLGA polymer, polylactide, polyethylene glycol, carbomer, trolamine, derivatives thereof, and mixtures thereof. In embodiments, the polyethylene glycol is PEG3380. PEG3380 refers, in the customary sense, to CAS Registry No. 71767-64-1. In embodiments, the carbomer is CARBOPOL® 980. The term “carbomer” refers to cross linked polyacrylate polymers as known in the art and, for example, to CARBOPOL® 980 or CARBOPOL® 980 polymer, which are defined by CAS Registry Nos. 9063-87-0, 9003-01-4, or 600-07-7, respectively. The polyacrylate polymer may be, but is not limited to, poly-2-methylbutanoic acid, poly-prop-2-enoic acid, polyacrylic acid.
In embodiments, the polymer is a block polymer. In embodiments, the block polymer is a Lysine14 block polymer. A Lysine14 block polymer (“K14”) as provided herein refers to a polymer derived from lysine homopolymer subunits (monomers), which are linked by covalent bonds.
A “solid support” as provided herein refers to any appropriate material that can be modified to contain discrete individual sites for the attachment or association of an electronically conductive polymer as provided herein including embodiments thereof and is amenable to the methods provided herein including embodiments thereof. Examples of solid supports include without limitation, glass and modified or functionalized glass (e.g., carboxymethyldextran functionalized glass), plastics (including acrylics, polystyrene and copolymers of styrene and other materials, polypropylene, polyethylene, polybutylene, polyurethanes, Teflon™, etc.), polysaccharides, nylon or nitrocellulose, composite materials, ceramics, and plastic resins, silica or silica-based materials including silicon and modified silicon (e.g., patterned silicon), carbon, metals, quartz (e.g., patterned quartz), inorganic glasses, plastics, optical fiber bundles, and a variety of other polymers (e.g., electronically conductive polymers such as poly-3,4-ethylenedioxythiophene, PEDOT). In general, the solid support allows optical detection and do not appreciably fluoresce. The solid support may be planar (e.g., flat planar substrates such as glass, polystyrene and other plastics and acrylics). Although it will be appreciated by a person of ordinary skill in the art that other configurations of solid supports may be used as well; for example, three dimensional configurations can be used. The solid support may be modified to contain discrete, individual sites (also referred to herein as “wells”) for polymer binding. These sites generally include physically altered sites, i.e. physical configurations such as wells or small depressions in the substrate that can retain the polymers. The wells may be formed using a variety of techniques well known in the art, including, but not limited to, photolithography, stamping techniques, molding techniques and microetching techniques. It will be appreciated by a person of ordinary skill in the art that the technique used will depend on the composition and shape of the solid support. In embodiments, physical alterations are made in a surface of the solid support to produce wells. In embodiments, the solid support is a microtiter plate.
The term “recognition moiety” or “ligand” (also referred to herein as a ligand domain) refers to a composition (e.g., atom, molecule, ion, molecular ion, compound, particle, protein, peptide, nucleic acid, oligosaccharide, polysaccharide, or small molecule) capable of binding (e.g. specifically binding) to a second complementary ligand-binding composition (e.g., analyte, polymer, protein, marker, small molecule, ligand, polysaccharide, aptamer, or other binder) to form a complex. A recognition moiety as provided herein may without limitation bind to biomolecules (e.g., hormones, cytokines, proteins, nucleic acids, lipids, carbohydrates, cellular membrane antigens and receptors (neural, hormonal, nutrient, and cell surface receptors or their ligands)); whole cells or lysates thereof (e.g., prokaryotic (e.g., pathogenic bacteria), eukaryotic cells (e.g., mammalian tumor cells); viruses (e.g., retroviruses, herpesviruses, adenoviruses, lentiviruses and spores); chemicals (e.g., solvents, polymers, organic materials, small molecules); therapeutic molecules (e.g., therapeutic drugs, abused drugs, antibiotics); environmental pollutants (e.g., pesticides, insecticides, toxins). In embodiments, the recognition moiety is a cell surface marker binding moiety (i.e., a composition that recognizes and binds to a cell surface marker). In embodiments, the recognition moiety is a polypeptide. In embodiments, the recognition moiety is an antibody or a fragment thereof.
As used herein, the terms “specific binding” or “specifically binds” refer to two molecules forming a complex that is relatively stable under physiologic conditions.
Methods for determining whether a ligand binds to a protein and/or the affinity for a ligand to a protein are known in the art. For example, the binding of a ligand to a protein can be detected and/or quantified using a variety of techniques such as, but not limited to, Western blot, dot blot, surface plasmon resonance method (e.g., BIAcore system; Pharmacia Biosensor AB, Uppsala, Sweden and Piscataway, N.J.), isothermal titration calorimetry (ITC), or enzyme-linked immunosorbent assays (ELISA).
Immunoassays which can be used to analyze immunospecific binding and cross-reactivity of the ligand include, but are not limited to, competitive and non-competitive assay systems using techniques such as Western blots, RIA, ELISA (enzyme linked immunosorbent assay), “sandwich” immunoassays, immunoprecipitation assays, immunodiffusion assays, agglutination assays, complement-fixation assays, immunoradiometric assays, and fluorescent immunoassays. Such assays are routine and well known in the art.
The term “antibody” refers to a polypeptide encoded by an immunoglobulin gene or functional fragments thereof that specifically binds and recognizes an antigen. The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, epsilon, and mu constant region genes, as well as the myriad immunoglobulin variable region genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, IgM, IgA, IgD and IgE, respectively.
An exemplary immunoglobulin (antibody) structural unit comprises a tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, each pair having one “light” (about 25 kDa) and one “heavy” chain (about 50-70 kDa). The N-terminus of each chain defines a variable region of about 100 to 110 or more amino acids primarily responsible for antigen recognition. The terms “variable heavy chain,” “VH,” or “VH” refer to the variable region of an immunoglobulin heavy chain, including an Fv, scFv, dsFv or Fab; while the terms “variable light chain,” “VL” or “VL” refer to the variable region of an immunoglobulin light chain, including of an Fv, scFv, dsFv or Fab.
Examples of antibody functional fragments include, but are not limited to, complete antibody molecules, antibody fragments, such as Fv, single chain Fv (scFv), complementarity determining regions (CDRs), VL (light chain variable region), VH (heavy chain variable region), Fab, F(ab)2′ and any combination of those or any other functional portion of an immunoglobulin peptide capable of binding to target antigen (see, e.g., F
The term “diagnosis” refers to a relative probability that a disease (e.g. cancer, urinary tract infection, infection, or other disease) is present in the subject. Similarly, the term “prognosis” refers to a relative probability that a certain future outcome may occur in the subject with respect to a disease state. For example, in the context of the present invention, prognosis can refer to the likelihood that an individual will develop a disease (e.g. cancer, urinary tract infection, infection, or other disease), or the likely severity of the disease (e.g., duration of disease). The terms are not intended to be absolute, as will be appreciated by any one of skill in the field of medical diagnostics.
As used herein, a “diagnostically effective amount” of a composition described herein is an amount sufficient to produce a clinically useful characterization or measurement of a disease state, such as an infection or cancer, (e.g. in an individual, patient, human, mammal, clinical sample, tissue, biopsy). A clinically useful characterization or measurement of a disease state, such as an infection or cancer, (e.g. in an individual, patient, human, mammal, clinical sample, tissue, biopsy) is one containing sufficient detail to enable an experienced clinician to assess the degree and/or extent of disease for purposes of diagnosis, monitoring the efficacy of a therapeutic intervention, and the like.
The terms “disease” or “condition” refer to a state of being or health status of a patient or subject capable of being treated with a compound, pharmaceutical composition, or method provided herein. In embodiments, the disease is cancer (e.g. lung cancer, ovarian cancer, osteosarcoma, bladder cancer, cervical cancer, liver cancer, kidney cancer, skin cancer (e.g., Merkel cell carcinoma), testicular cancer, leukemia, lymphoma, head and neck cancer, colorectal cancer, prostate cancer, pancreatic cancer, melanoma, breast cancer, neuroblastoma).
The compounds disclosed herein may also contain unnatural proportions of atomic isotopes at one or more of the atoms that constitute such compounds. For example, the compounds may be radiolabeled with radioactive isotopes, such as for example tritium (3H), iodine-125 (125I), or carbon-14 (14C). All isotopic variations of the compounds disclosed herein, whether radioactive or not, are encompassed within the scope disclosed herein.
The terms “treating” or “treatment” refers to any indicia of success in the treatment or amelioration of an injury, disease, pathology or condition, including any objective or subjective parameter such as abatement; remission; diminishing of symptoms or making the injury, pathology or condition more tolerable to the patient; slowing in the rate of degeneration or decline; making the final point of degeneration less debilitating; improving a patient's physical or mental well-being. The treatment or amelioration of symptoms can be based on objective or subjective parameters; including the results of a physical examination, neuropsychiatric exams, and/or a psychiatric evaluation. The term “treating,” and conjugations thereof, include prevention of an injury, pathology, condition, or disease.
An “effective amount” is an amount sufficient to accomplish a stated purpose (e.g., achieve the effect for which it is administered, treat a disease, reduce tumor size, and the like). An example of an “effective amount” is an amount sufficient to contribute to the treatment, prevention, or reduction of a symptom or symptoms of a disease, which could also be referred to as a “therapeutically effective amount.” A “reduction” of a symptom or symptoms (and grammatical equivalents of this phrase) means decreasing of the severity or frequency of the symptom(s), or elimination of the symptom(s). A “prophylactically effective amount” of a drug is an amount of a drug that, when administered to a subject, will have the intended prophylactic effect, e.g., preventing or delaying the onset (or reoccurrence) of an injury, disease, pathology or condition, or reducing the likelihood of the onset (or reoccurrence) of an injury, disease, pathology, or condition, or their symptoms. The full prophylactic effect does not necessarily occur by administration of one dose, and may occur only after administration of a series of doses. Thus, a prophylactically effective amount may be administered in one or more administrations. The exact amounts will depend on the purpose of the treatment, and will be ascertainable by one skilled in the art using known techniques (see, e.g., Lieberman, Pharmaceutical Dosage Forms (vols. 1-3, 1992); Lloyd, The Art, Science and Technology of Pharmaceutical Compounding (1999); Pickar, Dosage Calculations (1999); and Remington: The Science and Practice of Pharmacy, 20th Edition, 2003, Gennaro, Ed., Lippincott, Williams & Wilkins).
“Subject,” “patient,” “subject in need thereof” and the like refer to a living organism suffering from or prone to a disease or condition that can be treated by administration of a compound or pharmaceutical composition, as provided herein. Non-limiting examples include humans, other mammals, bovines, rats, mice, dogs, monkeys, goat, sheep, cows, deer, and other non-mammalian animals. In embodiments, a subject is human.
“Pharmaceutically acceptable excipient” and “pharmaceutically acceptable carrier” refer to a substance that aids the administration of an active agent to and absorption by a subject and can be included in the compositions of the present invention without causing a significant adverse toxicological effect on the patient. Unless indicated to the contrary, the terms “active agent,” “active ingredient,” “therapeutically active agent,” “therapeutic agent” and like are used synonymously. Non-limiting examples of pharmaceutically acceptable excipients include water, NaCl, normal saline solutions, lactated Ringer's, normal sucrose, normal glucose, binders, fillers, disintegrants, lubricants, coatings, sweeteners, flavors, salt solutions (such as Ringer's solution), alcohols, oils, gelatins, carbohydrates such as lactose, amylose or starch, fatty acid esters, hydroxymethycellulose, polyvinyl pyrrolidine, polyethylene glycol, and colors, and the like. Such preparations can be sterilized and, if desired, mixed with auxiliary agents such as lubricants, preservatives, stabilizers, wetting agents, emulsifiers, salts for influencing osmotic pressure, buffers, coloring, and/or aromatic substances and the like that do not deleteriously react with the compounds of the invention. One of skill in the art will recognize that other pharmaceutical excipients are useful in the present invention.
As used herein, the term “administering” means oral administration, administration as an inhaled aerosol or as an inhaled dry powder, suppository, topical contact, intravenous, parenteral, intraperitoneal, intramuscular, intralesional, intrathecal, intranasal or subcutaneous administration, or the implantation of a slow-release device, e.g., a mini-osmotic pump, to a subject. Administration is by any route, including parenteral and transmucosal (e.g., buccal, sublingual, palatal, gingival, nasal, vaginal, rectal, or transdermal). Parenteral administration includes, e.g., intravenous, intramuscular, intra-arteriole, intradermal, subcutaneous, intraperitoneal, intraventricular, and intracranial. Other modes of delivery include, but are not limited to, the use of liposomal formulations, intravenous infusion, transdermal patches, etc. By “co-administer” it is meant that a composition described herein is administered at the same time, just prior to, or just after the administration of one or more additional therapies, for example corticosteroids, antibiotics, cancer therapies such as chemotherapy, hormonal therapy, radiotherapy, or immunotherapy. The compound of the invention can be administered alone or can be coadministered to the patient. Co-administration is meant to include simultaneous or sequential administration of the compound individually or in combination (more than one compound or agent). Thus, the preparations can also be combined, when desired, with other active substances (e.g., to reduce metabolic degradation, to promote the penetration of tissues, or the like). The compositions of the present invention can be delivered transdermally, by a topical route, formulated as applicator sticks, solutions, suspensions, emulsions, gels, creams, ointments, nanoparticles, pastes, jellies, paints, powders, and aerosols. Oral preparations include tablets, pills, powder, dragees, capsules, liquids, lozenges, cachets, gels, syrups, slurries, suspensions, etc., suitable for ingestion by the patient. Solid form preparations include powders, tablets, pills, capsules, cachets, suppositories, and dispersible granules. Liquid form preparations include solutions, suspensions, and emulsions, for example, water or water/propylene glycol solutions. The compositions of the present invention may additionally include components to provide sustained release and/or comfort. Such components include high molecular weight, anionic mucomimetic polymers, gelling polysaccharides and finely-divided drug carrier substrates. These components are discussed in greater detail in U.S. Pat. Nos. 4,911,920; 5,403,841; 5,212,162; and 4,861,760. The entire contents of these patents are incorporated herein by reference in their entirety for all purposes. The compositions of the present invention can also be delivered as microspheres for slow release in the body. For example, microspheres can be administered via intradermal injection of drug-containing microspheres, which slowly release subcutaneously (see Rao, J. Biomater Sci. Polym. Ed. 7:623-645, 1995; as biodegradable and injectable gel formulations (see, e.g., Gao Pharm. Res. 12:857-863, 1995); or, as microspheres for oral administration (see, e.g., Eyles, J. Pharm. Pharmacol. 49:669-674, 1997). In another embodiment, the formulations of the compositions of the present invention can be delivered by the use of liposomes which fuse with the cellular membrane or are endocytosed, i.e., by employing receptor ligands attached to the liposome, that bind to surface membrane protein receptors of the cell resulting in endocytosis. By using liposomes, particularly where the liposome surface carries receptor ligands specific for target cells, or are otherwise preferentially directed to a specific organ, one can focus the delivery of the compositions of the present invention into the target cells in vivo. (See, e.g., Al-Muhammed, J. Microencapsul. 13:293-306, 1996; Chonn, Curr. Opin. Biotechnol. 6:698-708, 1995; Ostro, Am. J. Hosp. Pharm. 46:1576-1587, 1989).
Pharmaceutical compositions provided by the present invention include compositions wherein the active ingredient (e.g., conjugated wrapping phages described herein, including embodiments or examples) is contained in a therapeutically effective amount, i.e., in an amount effective to achieve its intended purpose. The actual amount effective for a particular application will depend, inter alia, on the condition being treated. Determination of a therapeutically effective amount of a compound of the invention is well within the capabilities of those skilled in the art, especially in light of the detailed disclosure herein.
The dosage and frequency (single or multiple doses) administered to a mammal can vary depending upon a variety of factors, for example, whether the mammal suffers from another disease, and its route of administration; size, age, sex, health, body weight, body mass index, and diet of the recipient; nature and extent of symptoms of the disease being treated, kind of concurrent treatment, complications from the disease being treated or other health-related problems. Other therapeutic regimens or agents can be used in conjunction with the methods and compounds of Applicants' invention. Adjustment and manipulation of established dosages (e.g., frequency and duration) are well within the ability of those skilled in the art.
As used herein, the term “cancer” refers to all types of cancer, neoplasm or malignant tumors found in mammals, including glioblastomas, leukemias, lymphomas, melanomas, neuroendocrine tumors, carcinomas and sarcomas. Exemplary cancers that may be treated with a compound, pharmaceutical composition, or method provided herein include glioblastoma, lymphoma, sarcoma, bladder cancer, bone cancer, brain tumor, cervical cancer, colon cancer, esophageal cancer, gastric cancer, head and neck cancer, kidney cancer, myeloma, thyroid cancer, leukemia, prostate cancer, breast cancer (e.g. triple negative, ER positive, ER negative, chemotherapy resistant, herceptin resistant, HER2 positive, doxorubicin resistant, tamoxifen resistant, ductal carcinoma, lobular carcinoma, primary, metastatic), ovarian cancer, pancreatic cancer, liver cancer (e.g., hepatocellular carcinoma), lung cancer (e.g. non-small cell lung carcinoma, squamous cell lung carcinoma, adenocarcinoma, large cell lung carcinoma, small cell lung carcinoma, carcinoid, sarcoma), glioblastoma multiforme, glioma, melanoma, prostate cancer, castration-resistant prostate cancer, breast cancer, triple negative breast cancer, glioblastoma, ovarian cancer, lung cancer, squamous cell carcinoma (e.g., head, neck, or esophagus), colorectal cancer, leukemia, acute myeloid leukemia, lymphoma, B cell lymphoma, or multiple myeloma. Additional examples include, cancer of the thyroid, endocrine system, brain, breast, cervix, colon, head & neck, esophagus, liver, kidney, lung, non-small cell lung, melanoma, mesothelioma, ovary, sarcoma, stomach, uterus or Medulloblastoma, Hodgkin's Disease, Non-Hodgkin's Lymphoma, multiple myeloma, neuroblastoma, glioma, glioblastoma multiforme, ovarian cancer, rhabdomyosarcoma, primary thrombocytosis, primary macroglobulinemia, primary brain tumors, cancer, malignant pancreatic insulanoma, malignant carcinoid, urinary bladder cancer, premalignant skin lesions, testicular cancer, lymphomas, thyroid cancer, neuroblastoma, esophageal cancer, genitourinary tract cancer, malignant hypercalcemia, endometrial cancer, adrenal cortical cancer, neoplasms of the endocrine or exocrine pancreas, medullary thyroid cancer, medullary thyroid carcinoma, melanoma, colorectal cancer, papillary thyroid cancer, hepatocellular carcinoma, Paget's Disease of the Nipple, Phyllodes Tumors, Lobular Carcinoma, Ductal Carcinoma, cancer of the pancreatic stellate cells, cancer of the hepatic stellate cells, or prostate cancer.
The term “leukemia” refers broadly to progressive, malignant diseases of the blood-forming organs and is generally characterized by a distorted proliferation and development of leukocytes and their precursors in the blood and bone marrow. Leukemia is generally clinically classified on the basis of (1) the duration and character of the disease-acute or chronic; (2) the type of cell involved; myeloid (myelogenous), lymphoid (lymphogenous), or monocytic; and (3) the increase or non-increase in the number abnormal cells in the blood-leukemic or aleukemic (subleukemic). Exemplary leukemias that may be treated with a compound, pharmaceutical composition, or method provided herein include, for example, acute nonlymphocytic leukemia, chronic lymphocytic leukemia, acute granulocytic leukemia, chronic granulocytic leukemia, acute promyelocytic leukemia, adult T-cell leukemia, aleukemic leukemia, a leukocythemic leukemia, basophylic leukemia, blast cell leukemia, bovine leukemia, chronic myelocytic leukemia, leukemia cutis, embryonal leukemia, eosinophilic leukemia, Gross' leukemia, hairy-cell leukemia, hemoblastic leukemia, hemocytoblastic leukemia, histiocytic leukemia, stem cell leukemia, acute monocytic leukemia, leukopenic leukemia, lymphatic leukemia, lymphoblastic leukemia, lymphocytic leukemia, lymphogenous leukemia, lymphoid leukemia, lymphosarcoma cell leukemia, mast cell leukemia, megakaryocytic leukemia, micromyeloblastic leukemia, monocytic leukemia, myeloblastic leukemia, myelocytic leukemia, myeloid granulocytic leukemia, myelomonocytic leukemia, Naegeli leukemia, plasma cell leukemia, multiple myeloma, plasmacytic leukemia, promyelocytic leukemia, Rieder cell leukemia, Schilling's leukemia, stem cell leukemia, subleukemic leukemia, or undifferentiated cell leukemia.
The term “sarcoma” generally refers to a tumor which is made up of a substance like the embryonic connective tissue and is generally composed of closely packed cells embedded in a fibrillar or homogeneous substance. Sarcomas that may be treated with a compound, pharmaceutical composition, or method provided herein include a chondrosarcoma, fibrosarcoma, lymphosarcoma, melanosarcoma, myxosarcoma, osteosarcoma, Abemethy's sarcoma, adipose sarcoma, liposarcoma, alveolar soft part sarcoma, ameloblastic sarcoma, botryoid sarcoma, chloroma sarcoma, chorio carcinoma, embryonal sarcoma, Wilms' tumor sarcoma, endometrial sarcoma, stromal sarcoma, Ewing's sarcoma, fascial sarcoma, fibroblastic sarcoma, giant cell sarcoma, granulocytic sarcoma, Hodgkin's sarcoma, idiopathic multiple pigmented hemorrhagic sarcoma, immunoblastic sarcoma of B cells, lymphoma, immunoblastic sarcoma of T-cells, Jensen's sarcoma, Kaposi's sarcoma, Kupffer cell sarcoma, angiosarcoma, leukosarcoma, malignant mesenchymoma sarcoma, parosteal sarcoma, reticulocytic sarcoma, Rous sarcoma, serocystic sarcoma, synovial sarcoma, or telangiectaltic sarcoma.
The term “melanoma” is taken to mean a tumor arising from the melanocytic system of the skin and other organs. Melanomas that may be treated with a compound, pharmaceutical composition, or method provided herein include, for example, acral-lentiginous melanoma, amelanotic melanoma, benign juvenile melanoma, Cloudman's melanoma, S91 melanoma, Harding-Passey melanoma, juvenile melanoma, lentigo maligna melanoma, malignant melanoma, nodular melanoma, subungal melanoma, or superficial spreading melanoma.
The term “carcinoma” refers to a malignant new growth made up of epithelial cells tending to infiltrate the surrounding tissues and give rise to metastases. Exemplary carcinomas that may be treated with a compound, pharmaceutical composition, or method provided herein include, for example, medullary thyroid carcinoma, familial medullary thyroid carcinoma, acinar carcinoma, acinous carcinoma, adenocystic carcinoma, adenoid cystic carcinoma, carcinoma adenomatosum, carcinoma of adrenal cortex, alveolar carcinoma, alveolar cell carcinoma, basal cell carcinoma, carcinoma basocellulare, basaloid carcinoma, basosquamous cell carcinoma, bronchioalveolar carcinoma, bronchiolar carcinoma, bronchogenic carcinoma, cerebriform carcinoma, cholangiocellular carcinoma, chorionic carcinoma, colloid carcinoma, comedo carcinoma, corpus carcinoma, cribriform carcinoma, carcinoma en cuirasse, carcinoma cutaneum, cylindrical carcinoma, cylindrical cell carcinoma, duct carcinoma, ductal carcinoma, carcinoma durum, embryonal carcinoma, encephaloid carcinoma, epiermoid carcinoma, carcinoma epitheliale adenoides, exophytic carcinoma, carcinoma ex ulcere, carcinoma fibrosum, gelatiniforni carcinoma, gelatinous carcinoma, giant cell carcinoma, carcinoma gigantocellulare, glandular carcinoma, granulosa cell carcinoma, hair-matrix carcinoma, hematoid carcinoma, hepatocellular carcinoma, Hurthle cell carcinoma, hyaline carcinoma, hypernephroid carcinoma, infantile embryonal carcinoma, carcinoma in situ, intraepidermal carcinoma, intraepithelial carcinoma, Krompecher's carcinoma, Kulchitzky-cell carcinoma, large-cell carcinoma, lenticular carcinoma, carcinoma lenticulare, lipomatous carcinoma, lobular carcinoma, lymphoepithelial carcinoma, carcinoma medullare, medullary carcinoma, melanotic carcinoma, carcinoma molle, mucinous carcinoma, carcinoma muciparum, carcinoma mucocellulare, mucoepidermoid carcinoma, carcinoma mucosum, mucous carcinoma, carcinoma myxomatodes, nasopharyngeal carcinoma, oat cell carcinoma, carcinoma ossificans, osteoid carcinoma, papillary carcinoma, periportal carcinoma, preinvasive carcinoma, prickle cell carcinoma, pultaceous carcinoma, renal cell carcinoma of kidney, reserve cell carcinoma, carcinoma sarcomatodes, schneiderian carcinoma, scirrhous carcinoma, carcinoma scroti, signet-ring cell carcinoma, carcinoma simplex, small-cell carcinoma, solanoid carcinoma, spheroidal cell carcinoma, spindle cell carcinoma, carcinoma spongiosum, squamous carcinoma, squamous cell carcinoma, string carcinoma, carcinoma telangiectaticum, carcinoma telangiectodes, transitional cell carcinoma, carcinoma tuberosum, tubular carcinoma, tuberous carcinoma, verrucous carcinoma, or carcinoma villosum.
As used herein, the terms “metastasis,” “metastatic,” and “metastatic cancer” can be used interchangeably and refer to the spread of a proliferative disease or disorder, e.g., cancer, from one organ or another non-adjacent organ or body part. Cancer occurs at an originating site, e.g., breast, which site is referred to as a primary tumor, e.g., primary breast cancer. Some cancer cells in the primary tumor or originating site acquire the ability to penetrate and infiltrate surrounding normal tissue in the local area and/or the ability to penetrate the walls of the lymphatic system or vascular system circulating through the system to other sites and tissues in the body. A second clinically detectable tumor formed from cancer cells of a primary tumor is referred to as a metastatic or secondary tumor. When cancer cells metastasize, the metastatic tumor and its cells are presumed to be similar to those of the original tumor. Thus, if lung cancer metastasizes to the breast, the secondary tumor at the site of the breast consists of abnormal lung cells and not abnormal breast cells. The secondary tumor in the breast is referred to a metastatic lung cancer. Thus, the phrase metastatic cancer refers to a disease in which a subject has or had a primary tumor and has one or more secondary tumors. The phrases non-metastatic cancer or subjects with cancer that is not metastatic refers to diseases in which subjects have a primary tumor but not one or more secondary tumors. For example, metastatic lung cancer refers to a disease in a subject with or with a history of a primary lung tumor and with one or more secondary tumors at a second location or multiple locations, e.g., in the breast.
“Anti-cancer agent” is used in accordance with its plain ordinary meaning and refers to a composition (e.g. compound, drug, antagonist, inhibitor, modulator) having antineoplastic properties or the ability to inhibit the growth or proliferation of cells.
The term “associated” or “associated with” in the context of a substance or substance activity or function associated with a disease (e.g., diabetes, cancer (e.g. prostate cancer, renal cancer, metastatic cancer, melanoma, castration-resistant prostate cancer, breast cancer, triple negative breast cancer, glioblastoma, ovarian cancer, lung cancer, squamous cell carcinoma (e.g., head, neck, or esophagus), colorectal cancer, leukemia, acute myeloid leukemia, lymphoma, B cell lymphoma, or multiple myeloma)) means that the disease (e.g. lung cancer, ovarian cancer, osteosarcoma, bladder cancer, cervical cancer, liver cancer, kidney cancer, skin cancer (e.g., Merkel cell carcinoma), testicular cancer, leukemia, lymphoma, head and neck cancer, colorectal cancer, prostate cancer, pancreatic cancer, melanoma, breast cancer, neuroblastoma) is caused by (in whole or in part), or a symptom of the disease is caused by (in whole or in part) the substance or substance activity or function.
Overview
The migration and dissemination of tumor cells, termed metastasis, causes about 90% of cancer deaths. Metastasis requires loss of apoptotic regulation, and such cells respond poorly to conventional anti-cancer treatments. Specific detection of circulating tumor cells and characterization of their aggressiveness could improve cancer diagnostics and treatment. Chemically modified viruses (such as phage) could provide an inexpensive and efficient approach to detect tumor cells and quantitate their cell surface biomarkers.
The M13 virus consists of a circular, single-stranded DNA genome surrounded by a protein coat composed of approximately 2700 copies of the major coat protein, P8, an α-helical protein of 50 amino acid residues with an unstructured N-terminus. One Glu and two Asp residues near the N-terminus of P8 impart a high negative charge to the outer surface of the virus at physiological pH. Selections with phage-displayed libraries of peptides and proteins can target tissue-cultured cells and even organs in living organisms. Phage has also been incorporated into nanomedicine platforms for targeted drug delivery and imaging. Such applications require low background binding by phage to cell surfaces.
Phage typically adheres to cell surfaces with high affinity. Unfortunately, such non-specific adhesion complicates the design of phage-based sensors for the detection of tumor cells: the non-specific background can reduce the signal to noise ratios and the ability to distinguish tumor from non-tumor cells.
Provided herein are compositions and methods that solve this and other problems. In embodiments, a viral composition (e.g., a conjugated wrapped phage) is provided having a surface (e.g., an engineered surface) wrapped with a polymer. The polymer may be a variety of different covalent linker architectures (e.g., oligolysine, linkers, spacers, and recognition moieties). As demonstrated herein, the viral composition described herein may be used to specifically detect cancer cells expressing cancer biomarkers. In embodiments, this approach also allows quantification of biomarker levels on the cell surface, and can distinguish more aggressive forms of the disease. In embodiments, the viral composition described herein results in more than 75% reduction of the non-specific adhesion of virus to cell surfaces.
Compositions
In one aspect, provided herein is a viral composition. The viral composition includes (a) a whole viral particle comprising a charged protein coat that has a charged coat protein; (b) a first polymer electrostatically bound to the charged coat protein; and (c) a covalent linker linking the first polymer to a recognition moiety. In embodiments, the viral composition is a conjugated wrapped phage.
The term “conjugated wrapped phage” and the like as used herein means a bacteriophage (also referred to herein as a phage) which is in contact with a charged polymer which at least partially encircles or enfolds the phage charged protein coat as disclosed herein. In embodiments, the polymer in this context binds the phage non-covalently via e.g., electrostatic attraction between the charged protein coat of the phage and charges on the polymer. The charged protein coat includes a plurality of charged coat proteins in contact with the charged polymer. The plurality of charged coat proteins has the opposite charge of the charged polymer. In embodiments, a phage described herein is in contact with a charged polymer (e.g., a first polymer) that fully encircles or enfolds the phage charged protein coat as disclosed herein. In embodiments, the charged polymer (e.g., a first polymer) encircles 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or more times the charged protein coat.
The term “whole viral particle” as used herein, refers to a complete viral particle that includes the genetic material made from either DNA or RNA and a protein coat, also called the capsid, which surrounds and protects the genetic material. In embodiments, where appropriate, the whole viral particle includes; an envelope of lipids that surrounds the protein coat (e.g. when the viral particle is outside a cell). A protein coat or a capsid is the protein shell of a virus. A charged protein coat, as used herein, refers to a protein coat having either a net positive or a net negative electric charge. In embodiments, a charged protein coat has a net negative electric charge.
In embodiments, the whole viral particle is a whole bacteriophage (or phage) that includes the genetic material made from either DNA or RNA and a protein coat, also called the capsid, which surrounds and protects the genetic material. A protein coat or a capsid is the protein shell of the phage. A charged protein coat, as used herein, refers to a protein coat having either a net positive or a net negative electric charge. In embodiments, a charged protein coat has a net negative electric charge.
A coat protein, as used herein, refers to a protein within the capsid (or protein coat). A charged coat protein refers to a coat protein having either a net positive or a net negative electric charge. In embodiments, a charged coat protein has a net negative electric charge.
The term “covalent linker,” “linker,” “spacer” are used herein interchangeably and refer to a divalent chemical moiety attached at each end to the remainder of the compound. In embodiments, the covalent linker is -L1-L2-L3-L4-L5-L6-.
In embodiments, L1 is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl or -L1A-L1B-L1C-L1D-L1E-L1F-L1G-L1H-L1I-L1J-.
In embodiments, L1A is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L1B is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L1C is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L1d is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L1E is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L1F is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L1G is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L1H is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L1I is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L1J is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, at least one (e.g. 1, 2, 3 or 4) of L1A, L1B, L1C, L1D, L1E, L1F, L1G, L1H, L1I, and L1J is not a bond
In embodiments, at least one (e.g. 1, 2, 3 or 4) of L1A, L1B, L1C, L1D, L1E, L1F, L1G, L1H, L1I, and L1J is a cleavable linker.
In embodiments, L2 is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl or -L2A-L2B-L2C-L2D-L2E-L2F-L2G-L2H-L2I-L2J-.
In embodiments, L2A is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L2B is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L2C is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L2D is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L2E is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L2F is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L2G is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L2H is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L2I is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L2J is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, at least one (e.g. 1, 2, 3 or 4) of L2A, L2B, L2C, L2D, L2E, L2F, L2G, L2H, L2I, and L2J is not a bond.
In embodiments, at least one (e.g. 1, 2, 3 or 4) of L2A, L2B, L2C, L2D, L2E, L2F, L2G, L2H, L2I, and L2J is a cleavable linker.
In embodiments, L3 is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl or -L3A-L3B-L3C-L3D-L3E-L3F-L3G-L3H-L3I-L3J.
In embodiments, L3A is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L3B is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L3C is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L3D is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L3E is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L3F is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L3G is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L3H is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L3I is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L3J is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, at least one (e.g. 1, 2, 3 or 4) of L3A, L3B, L3C, L3D, L3E, L3F, L3G, L3H, L3I, and L3J is not a bond.
In embodiments, at least one (e.g. 1, 2, 3 or 4) of L3A, L3B, L3C, L3D, L3E, L3F, L3G, L3H, L3I, and L3J is a cleavable linker.
In embodiments, L4 is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl or -L4A-L4B-L4C-L4D-L4E-L4F-L4G-L4H-L4I-L4J-.
In embodiments, L4A is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L4B is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L4C is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L4D is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L4E is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L4F is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L4G is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L4H is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L4I is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L4J is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, at least one (e.g. 1, 2, 3 or 4) of L4A, L4B, L4C, L4D, L4E, L4F, L4G, L4H, L4I, and L4J is not a bond.
In embodiments, at least one (e.g. 1, 2, 3 or 4) of L4A, L4B, L4C, L4D, L4E, L4F, L4G, L4H, L4I, and L4J is a cleavable linker.
In embodiments, L5 is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl or -L5A-L5B-L5C-L5D-L5E-L5F-L5G-L5H-L5I-L5J-.
In embodiments, L5A is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L5B is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L5C is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L5D is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L5E is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L5F is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L5G is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L5H is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L5I is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L5J is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, at least one (e.g. 1, 2, 3 or 4) of L5A, L5B, L5C, L5D, L5E, L5F, L5G, L5H, L5I, and L5J is not a bond.
In embodiments, at least one (e.g. 1, 2, 3 or 4) of L5A, L5B, L5C, L5D, L5E, L5F, L5G, L5H, L5I, and L5J is a cleavable linker.
In embodiments, L6 is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl or -L6A-L6B-L6C-L6D-L6E-L6F-L6G-L6H-L6I-L6J-.
In embodiments, L6A is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L6B is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L6C is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L6D is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L6E is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L6F is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L6G is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L6H is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L6I is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, L6J is a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl.
In embodiments, at least one (e.g. 1, 2, 3 or 4) of L6A, L6B, L6C, L6D, L6E, L6F, L6G, L6H, L6I, and L6J is not a bond.
In embodiments, at least one (e.g. 1, 2, 3 or 4) of L6A, L6B, L6C, L6D, L6E, L6F, L6G, L6H, L6I, and L6J is a cleavable linker.
In embodiments, L1 is substituted or unsubstituted heteroalkyl, L2 is substituted or unsubstituted heteroaryl, L3 is substituted or unsubstituted heteroalkyl, L4 is substituted or unsubstituted heterocycloalkyl, L5 is a substituted or unsubstituted heteroalkyl, and L6 is a bond.
In embodiments, L4 is
where the carbon at the 3 position is covalently attached to L5.
In embodiments, L5 is —S—CH2—CH(NH2)—C(O)— or —S—CH2—CH(C(O)OH)—NH—, wherein the sulfur of L5 is attached to L4.
In embodiments, L3 comprises a polyethylene glycol linker. In embodiments, polyethylene glycol linker comprises 2 to 150 oxyethylene units (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149 or 150 oxyethylene units).
In some embodiments, each substituted group described for L1, L2, L3, L4, L5, L6, L1A, L1B, L1C, L1D, L1E, L1F, L1G, L1H, L1I, L1J, L2A, L2B, L2C, L2D, L2E, L2F, L2G, L2H, L2I, L2J, L3A, L3B, L3C, L3D, L3E, L3F, L3G, L3H, L3I, L3J, L4A, L4B, L4C, L4D, L4E, L4F, L4G, L4H, L4I, L4J, L5A, L5B, L5C, L5D, L5E, L5F, L5G, L5H, L5I, L5J, L6A, L6B, L6C, L6D, L6E, L6F, L6G, L6H, L6I, and L6J is substituted with at least one substituent group. More specifically, in some embodiments, each substituted alkyl, substituted heteroalkyl, substituted cycloalkyl, substituted heterocycloalkyl, substituted aryl, substituted heteroaryl, substituted alkylene, substituted heteroalkylene, substituted cycloalkylene, substituted heterocycloalkylene, substituted arylene, and/or substituted heteroarylene described for L1, L2, L3, L4, L5, L6, L1A, L1B, L1C, L1D, L1E, L1F, L1G, L1H, L1I, L1J, L2A, L2B, L2C, L2D, L2E, L2F, L2G, L2H, L2I, L2J, L3A, L3B, L3C, L3D, L3E, L3F, L3G, L3H, L3I, L3J, L4A, L4B, L4C, L4D, L4E, L4F, L4G, L4H, L4I, L4J, L5A, L5B, L5C, L5D, L5E, L5F, L5G, L5H, L5I, L5J, L6A, L6B, L6C, L6D, L6E, L6F, L6G, L6H, L6I, and L6J is substituted with at least one substituent group.
In some embodiments, each substituted group described for L1, L2, L3, L4, L5, L6, L1A, L1B, L1C, L1D, L1E, L1F, L1G, L1H, L1I, L1J, L2A, L2B, L2C, L2D, L2E, L2F, L2G, L2H, L2I, L2J, L3A, L3B, L3C, L3D, L3E, L3F, L3G, L3H, L3I, L3J, L4A, L4B, L4C, L4D, L4E, L4F, L4G, L4H, L4I, L4J, L5A, L5B, L5C, L5D, L5E, L5F, L5G, L5H, L5I, L5J, L6A, L6B, L6C, L6D, L6E, L6F, L6G, L6H, L6I, and L6J is substituted with at least one lower substituent group. More specifically, in some embodiments, each substituted alkyl, substituted heteroalkyl, substituted cycloalkyl, substituted heterocycloalkyl, substituted aryl, substituted heteroaryl, substituted alkylene, substituted heteroalkylene, substituted cycloalkylene, substituted heterocycloalkylene, substituted arylene, and/or substituted heteroarylene described for L1, L2, L3, L4, L5, L6, L1A, L1B, L1C, L1D, L1E, L1F, L1G, L1H, L1I, L1J, L2A, L2B, L2C, L2D, L2E, L2F, L2G, L2H, L2I, L2J, L3A, L3B, L3C, L3D, L3E, L3F, L3G, L3H, L3I, L3J, L4A, L4B, L4C, L4D, L4E, L4F, L4G, L4H, L4I, L4J, L5A, L5B, L5C, L5D, L5E, L5F, L5G, L5H, L5I, L5J, L6A, L6B, L6C, L6D, L6E, L6F, L6G, L6H, L6I, and L6J is substituted with at least one lower substituent group.
In some embodiments, each substituted group described for L1, L2, L3, L4, L5, L6, L1A, L1B, L1C, L1D, L1E, L1F, L1G, L1H, L1I, L1J, L2A, L2B, L2C, L2D, L2E, L2F, L2G, L2H, L2I, L2J, L3A, L3B, L3C, L3D, L3E, L3F, L3G, L3H, L3I, L3J, L4A, L4B, L4C, L4D, L4E, L4F, L4G, L4H, L4I, L4J, L5A, L5B, L5C, L5D, L5E, L5F, L5G, L5H, L5I, L5J, L6A, L6B, L6C, L6D, L6E, L6F, L6G, L6H, L6I, and L6J is substituted with at least one size-limited substituent group. More specifically, in some embodiments, each substituted alkyl, substituted heteroalkyl, substituted cycloalkyl, substituted heterocycloalkyl, substituted aryl, substituted heteroaryl, substituted alkylene, substituted heteroalkylene, substituted cycloalkylene, substituted heterocycloalkylene, substituted arylene, and/or substituted heteroarylene described for L1, L2, L3, L4, L5, L6, L1A, L1B, L1C, L1D, L1E, L1F, L1G, L1H, L1I, L1J, L2A, L2B, L2C, L2D, L2E, L2F, L2G, L2H, L2I, L2J, L3A, L3B, L3C, L3D, L3E, L3F, L3G, L3H, L3I, L3J, L4A, L4B, L4C, L4D, L4E, L4F, L4G, L4H, L4I, L4J, L5A, L5B, L5C, L5D, L5E, L5F, L5G, L5H, L5I, L5J, L6A, L6B, L6C, L6D, L6E, L6F, L6G, L6H, L6I, and L6J is substituted with at least one size-limited substituent group.
In embodiments, each of the plurality of charged coat proteins is a negatively charged coat protein. In embodiments, each of the plurality of charged coat proteins is a positively charged coat protein. In embodiments, each of the plurality of charged coat proteins includes one or more negatively charged amino acid residues. In embodiments, each of the plurality of charged coat proteins includes one or more Glu or one or more Asp residues. In embodiments, each of the plurality of charged coat proteins includes one or more Glu and one or more Asp residues. In embodiments, the one or more Glu or one or more Asp residues form part of the N-terminus of the charged coat protein. In embodiments, the charged coat protein is P8.
In embodiments, a plurality of charged coat proteins includes at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149 or 150 or more of charged coat proteins. In embodiments, the charged proteins of the plurality of charged coat proteins are the same. In embodiments, the charged coat proteins plurality of charged coat proteins are not the same.
The term “P8” or “P8 protein” as provided herein includes any of the recombinant or naturally-occurring forms of the viral coat protein P8 or variants or homologs thereof that maintain P8 protein activity (e.g. within at least 50%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or 100% activity compared to P8). In some aspects, the variants or homologs have at least 90%, 95%, 96%, 97%, 98%, 99% or 100% amino acid sequence identity across the whole sequence or a portion of the sequence (e.g. a 50, 100, 150 or 200 continuous amino acid portion) compared to a naturally occurring P8 polypeptide. In embodiments, P8 is the protein as identified by the NCBI sequence reference GI:402239556, homolog or functional fragment thereof.
In embodiments, the whole viral particle is a whole bacteriophage viral particle. In embodiments, the whole viral particle is an M13 filamentous phage.
In embodiments, the first polymer is a cationic polymer. In embodiments, the first polymer is an anionic polymer. Where the charged coat protein is electrostatically bound to a first polymer, the charged coat protein and the first polymer are connected through an ionic bond.
In embodiments, the first polymer is or includes a polypeptide. In embodiments, the polypeptide has a net positive charge. In embodiments, the polypeptide encompasses a polymer of lysine (aka oligolysine, e.g., 3 to 20 lysine, 4 to 19 lysine, 5 to 18 lysine, 6 to 17 lysine, 7 to 17 lysine, 8 to 16 lysine, 9 to 16 lysine, 10 to 15 lysine, 11 to 14 lysine, 12 to 14 lysine, 13 to 14 lysine, 14 lysine). In embodiments, the polypeptide includes a polymer of lysine. In embodiments, the polymer of lysine is K2, K3, K4, K5, K6, K7, K8, K9, K10, K11, K12, K13, K14, K15, K16, K17, K18, K19, or K20 (SEQ ID NO:4). The term “AAx” refers in the usual and customary sense to a polymer of amino acid “AA” having “x” repeating amino acid units. Thus, “K2” refers to a lysine-lysine polymer or polymer portion (e.g. KK), “K3” a lysine-lysine-lysine polymer or polymer portion (e.g. KKK), and so forth. In embodiments, the polymer of lysine includes 2 lysine residues (K2), 3 lysine residues (K3), 4 lysine residues (K4), 5 lysine residues (K5), 6 lysine residues (K6), 7 lysine residues (K7), 8 lysine residues (K8), 9 lysine residues (K9), 10 lysine residues (K10), 11 lysine residues (K11), 12 lysine residues (K12), 13 lysine residues (K13), 14 lysine residues (K14), 15 lysine residues (K15), 16 lysine residues (K16), 17 lysine residues (K17), 18 lysine residues (K18), 19 lysine residues (K19) or 20 lysine residues (K20). In embodiments, the polymer of lysine includes 14 lysine residues (K14).
In embodiments, the recognition moiety or ligand is a composition (e.g., atom, molecule, ion, molecular ion, compound, particle, protein, peptide, nucleic acid, oligosaccharide, polysaccharide, small molecule) capable of binding (e.g. specifically binding) to another complementary composition (e.g., analyte, polymer, protein, marker, small molecule, ligand, polysaccharide, aptamer, or other binder) to form a complex. In embodiments, a recognition moiety as provided herein may without limitation bind to biomolecules (e.g., hormones, cytokines, proteins, nucleic acids, lipids, carbohydrates, cellular membrane antigens and receptors (neural, hormonal, nutrient, and cell surface receptors or their ligands)); whole cells or lysates thereof (e.g., prokaryotic (e.g., pathogenic bacteria), eukaryotic cells (e.g., mammalian tumor cells); viruses (e.g., retroviruses, herpesviruses, adenoviruses, lentiviruses and spores); chemicals (e.g., solvents, polymers, organic materials, small molecules); therapeutic molecules (e.g., therapeutic drugs, abused drugs, antibiotics); environmental pollutants (e.g., pesticides, insecticides, toxins).
In embodiments, the recognition moiety or ligand is a cell surface marker binding moiety. In embodiments, the recognition moiety is a cancer cell surface marker binding moiety. In embodiments, the recognition moiety is a prostate-specific membrane antigen (PMSA) binding moiety.
The term “cancer cell marker” as used herein, refers to a protein or a polypeptide derived from a cancer cell or tumor that can be used to identify the cancer cell. A number of cancer cell markers have been established. The recognition moiety of the viral composition described herein can be designed to bind any cancer cell marker that is located on the surface of the cancer cell (i.e., any cancer cell surface marker) known in the art. Exemplary cancer cell markers that can be recognized and bound by the recognition moiety of the viral composition described herein include, but are not limited to,
In embodiments, the recognition moiety is a polypeptide. In embodiments, the recognition moiety is a peptide having the sequence of CALCEFLG (SEQ ID NO:5). In embodiments, the recognition moiety is a peptide having the sequence of SECVEVFQNSCDW (SEQ ID NO:6). In embodiments, the recognition moiety is a polypeptide that is selected through a phage display library and this polypeptide selectively recognizes and binds to a cell surface marker. In embodiments, the recognition moiety encompasses an antibody, a variant or a fragment thereof, where the antibody (a variant or a fragment thereof) specifically recognizes and binds to a surface marker on a cell (e.g., a cancer cell).
In embodiments, cell is a target cell. The term “target cell” and the like refer, in the usual and customary sense, to a cell which can indicate a pathological condition or the potential for a pathological condition, e.g., a disease. In embodiments, the target cell expresses a surface marker for a disease, as disclosed herein. In embodiments, the target cell is a non-pathological cell, e.g., a normal cell, the identification of which is desired, e.g., within a biological sample.
In embodiments, the linker is attached to one end of the main backbone of the recognition moiety (e.g., —COOH group at the C-terminus of a peptide recognition moiety). In embodiments, the linker is attached to a side chain of the recognition moiety (e.g., a —COOH group of a side chain of a peptide recognition moiety).
In another aspect, there is provided a complex including a viral composition as disclosed herein and a phospholipid vesicle. In embodiments, the phospholipid vesicle is a cell. It is well understood that living cells include a membrane structure defining the surface of the cell, which membrane structure includes a phospholipid bilayer. Within the phospholipid bilayer can be found a variety of biomolecules including proteins, lipids, small molecules (e.g., cholesterol and esters thereof), attachment points for cytoskeletal structures, and other chemical structures which provide recognition signals for the cell. In embodiments, proteins attached to the cellular membrane provide a charged protein coat for the cell or vesicle. In embodiments, the phospholipid vesicle is a cell-derived vesicle. In embodiments, the phospholipid vesicle is an exosome. The term “exosome” and the like refer, in the usual and customary sense, to a cell-derived vesicle that is present in many biological fluids, typically having a size on the order to 30 to 100 nm, which can be released from the cell. Without wishing to be bound by any theory, it is believed that exosomes function in intercellular signaling, cellular functioning (e.g., coagulation), waste management, and other cellular functions. Exosomes can contain molecular constituents including proteins (e.g., recognition proteins), and nucleic acid (e.g., RNA). Indeed, exosomes are implicated in cellular transfer reactions from one cell to another via membrane vesicle trafficking, as known in the art, thereby providing pathways for intercellular communication in e.g., the immune system.
Also provided herein is a complex that includes any viral composition described herein and a cell, where the recognition moiety of the viral composition is bound to the cell. In embodiments, the cell is a cancer cell. In embodiments, the cancer cell has a cancer cell surface marker (i.e., a tumor cell antigen) to which the recognition moiety of the conjugated wrapped phage binds.
In embodiments, the complex includes one or more (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50 or more) viral compositions described herein, where the recognition moiety of each viral composition is bound to a cell (e.g., a cancer cell). In embodiments, the recognition moiety of each viral composition is the same. In embodiments, the recognition moiety of each viral composition is not the same. In embodiments, the recognition moiety of each viral composition is a same cell surface marker binding moiety. In embodiments, the recognition moiety of each viral composition is not a same cell surface marker binding moiety. In embodiments, the recognition moieties of the 2 or more viral compositions are not the same cell surface marker binding moieties, but they recognize and bind to the same cell surface marker. For example, they recognize and bind to different sites of the same cell surface marker. In embodiments, the recognition moieties of the 2 or more viral compositions recognize and bind to 2 or more different cell surface markers (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50 or more cell surface markers) of a cell (e.g., cancer cell).
Also provided is a pharmaceutical composition that includes any viral composition described herein and a pharmaceutically acceptable carrier, diluent or excipient.
In embodiments, the pharmaceutical composition includes one or more (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50 or more) viral compositions described herein, where the recognition moiety of each viral composition is capable of binding to a cell surface marker (e.g., a cancer cell surface marker). In embodiments, the recognition moiety of each viral composition is the same. In embodiments, the recognition moiety of each viral composition is not the same. In embodiments, the recognition moiety of each viral composition is a same cell surface marker binding moiety. In embodiments, the recognition moiety of each viral composition is not a same cell surface marker binding moiety. In embodiments, the recognition moieties of the 2 or more viral compositions are not the same cell surface marker binding moieties, but they recognize and bind to the same cell surface marker. For example, they recognize and bind to different sites of the same cell surface marker. In embodiments, the recognition moieties of the 2 or more viral compositions recognize and bind to 2 or more different cell surface markers (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50 or more cell surface markers) of a cell (e.g., cancer cell).
In embodiments, provided herein are articles of manufacture or kits containing any compositions described herein (e.g. viral particle, phage, phage wrappers or linkers, ligands or recognition moieties, first polymer) and instructions for their use in the methods described herein.
Methods
In another aspect, there is provided a method for detecting a cancer cell in a subject. The method includes (a) contacting a biological sample of the subject with one or more viral compositions described herein, where the recognition moiety of one or more viral compositions is a cancer cell surface marker binding moiety, and (b) detecting a cell-viral composition complex, and presence of the complex indicates presence of a cancer cell in the subject.
In another aspect, there is provided a method for diagnosing a cancer in a subject. The method includes (a) contacting a biological sample of the subject with one or more viral compositions described herein, where the recognition moiety of one or more viral compositions is a cancer cell surface marker binding moiety, and (b) detecting a cell-viral composition complex, and presence of the complex indicates presence of a cancer cell in the subject, thereby diagnosing a cancer in the subject.
In another aspect, there is provided a method for detecting aggressiveness of a disease in a subject. The method includes (a) contacting a biological sample of the subject with one or more viral compositions described herein, where the recognition moiety of one or more viral compositions is a cancer cell surface marker binding moiety, (b) detecting a cell-viral composition complex, and (c) quantitating the complex (total number or concentration of the complex), thereby detecting aggressiveness of the disease in the subject. In embodiments, the higher total number or concentration of the complex indicates a higher level of aggressiveness of the disease. In embodiments, the disease is a cancer.
In embodiments, the recognition moiety of each viral composition used in the method described herein is the same. In embodiments, the recognition moiety of each viral composition used in the method described herein is not the same. In embodiments, the recognition moiety of each viral composition is a same cell surface marker binding moiety. In embodiments, the recognition moiety of each viral composition is not a same cell surface marker binding moiety. In embodiments, the recognition moieties of the 2 or more viral compositions are not the same cell surface marker binding moieties, but they recognize and bind to the same cell surface marker. For example, they recognize and bind to different sites of the same cell surface marker. In embodiments, the recognition moieties of the 2 or more viral compositions recognize and bind to 2 or more different cell surface markers (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50 or more cell surface markers) of a cell (e.g., cancer cell).
In embodiments, the molar ratio of different types of recognition moieties used in the compositions/methods described herein is optimized. In embodiments, the optimization leads to a synergistic binding between the recognition moieties and the cell surface marker. In embodiments, the synergistic binding between the recognition moieties and the cell surface marker results in higher sensitivity and/or higher specificity of the method described herein.
In embodiments, methods described herein utilize two types of viral compositions that each includes one unique recognition moiety. In embodiments, the molar ratio of these two types of viral compositions used in the methods described herein is optimized. In embodiments, the ratio of two recognition moieties or two types of viral compositions is, for example, 1:1, 1:2, 1:3, 1:4, 1:5, 1:6, 1:7, 1:8, 1:9, 1:10, 10:1, 9:1, 8:1, 7:1, 6:1, 5:1, 4:1, 3:1, 2:1.
In embodiments, the one or more viral compositions used in the methods described herein are immobilized to a solid support.
Methods for detecting a cell-viral composition complex are known in the art. In embodiments, the detecting includes an antibody based reaction. In embodiments, the binding of a viral composition to a cell (i.e., the binding of a recognition moiety to a cell surface marker) can be detected and/or quantified using a variety of techniques such as, but not limited to, Western blot, dot blot, surface plasmon resonance method (e.g., BIAcore system; Pharmacia Biosensor AB, Uppsala, Sweden and Piscataway, N.J.), isothermal titration calorimetry (ITC), or enzyme-linked immunosorbent assays (ELISA).
Immunoassays which can be used to analyze immunospecific binding and cross-reactivity of the recognition moiety include, but are not limited to, competitive and non-competitive assay systems using techniques such as Western blots, RIA, ELISA (enzyme linked immunosorbent assay), “sandwich” immunoassays, immunoprecipitation assays, immunodiffusion assays, agglutination assays, complement-fixation assays, immunoradiometric assays, and fluorescent immunoassays. Such assays are routine and well known in the art.
In another aspect, there is provided a method for generating a viral composition described herein. The method includes (a) synthesizing a covalent linker comprising -L1-L2-L3-L4-L5-L6-, where L1, L2, L3, L4, L5 and L6 are independently a bond, —O—, —C(O)O—, —C(O)—, —C(O)NH—, —NH—, —S—, —S(O)2NH—, substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or unsubstituted cycloalkyl, substituted or unsubstituted heterocycloalkyl, substituted or unsubstituted aryl or substituted or unsubstituted heteroaryl; (b) contacting the linker with a recognition moiety thereby forming an intermediate linker-recognition moiety conjugate; and (c) contacting the intermediate linker-recognition moiety conjugate with an intermediate wrapped phage thereby forming said viral composition, and the intermediate wrapped phage includes a whole viral particle having a charged protein coat that encompasses a charged coat protein and a first polymer electrostatically bound to the charged coat protein.
Exemplary reagents and steps for generating a viral composition described herein are provided in the figures (e.g.,
There are provided methods and compositions for the selective detection of circulating tumor cells. M13 bacteriophage (termed ‘phage’) are used as the scaffold for the selective capture of target biomarker. But, phage typically adheres to cell surfaces with high affinity, which presents a huge problem. A similar problem arises with phage-based sensors for the detection of tumor cells; the non-specific background can reduce signal to noise, and the ability to distinguish tumor from non-tumor cells.
Phage typically adheres to cell surfaces with high affinity. Such non-specific adhesion complicates the design of phage-based sensors for the detection of tumor cells; the non-specific background can reduce the signal to noise ratios and the ability to distinguish tumor from non-tumor cells. The approach described here, applies non-covalent attachment to the phage surface to access additional architectures, e.g., for biosensor applications. Non-covalent attachment offers comparable stability to covalent modification of the virus surface; for technical reasons, non-covalent attachment offers easier execution than covalent bond formation to the surface of the virus. The high negative charge on the phage surface allows non-covalent wrapping with cationic peptides and polymers. Linking these wrappers to recognition ligands opens new routes to greater sensitivity and specificity for target analytes. The peptide ligands can be chemically synthesized and fused to an oligolysine peptide (K14), which ‘wraps’ around the virus particle through complementary electrostatic interactions. Previously, this strategy allowed maximization of ligand density on the phage surface for sensitive detection of biomarkers in complex biofluids, such as synthetic urine. Here, the overall design incorporates PEG polymers in conjunction with this wrapping strategy to address the problem of non-specific adhesion between phage and cells. Then, we optimize various architectures for the specific detection of PCa cells.
Among prostate cancer cell lines, LNCaP cells provide the most commonly used in vitro model for early stage PCa. Derived from the lymph node adenocarcinoma of the human prostate, LNCaP expresses most of the important PCa biomarkers including Prostate Specific Membrane Antigen (PSMA), Prostate Specific Antigen (PSA) and Androgen Receptor (AR). Attempts to recognize cell surfaces with conventional phage-displayed ligands resulted in unacceptably high, non-specific adhesion by control phage, which lack a displayed peptide. As shown by ELISA, phage-displayed PSMA ligand 2 and control phage produced similar high levels of binding to LNCaP cells. To overcome this non-specific adhesion, we focused on eliminating such interactions by control phage by using polyethylene glycol.
The water-soluble polymer PEG is commonly bioconjugated to proteins to reduce non-specific adhesion to cells and other surfaces. PEG has been shown to broadly adopt two distinct conformations—descriptively termed ‘mushroom’ and ‘brush.’ The transition from the mushroom conformation, a more random orientation, to the brush conformation is dependent upon the polymer length and packing densities; longer PEG lengths and higher packing densities favor formation of the brush conformation. This transition can result in a significant drop in non-specific adsorption. In many systems, a mole fraction of 0.15 PEG-modified to -unmodified sites yields significantly reduced non-specific adhesion. High packing densities with such mole fractions can force the polymer to adopt a more stretched, and extended brush conformation to more effectively suppress non-specific adhesion.
Initial attempts to block non-specific cell adhesion applied PEG variants with different MWs as phage wrappers. Azide-functionalized, polydispersed PEGs with size distributions centered around 7, 22 or 45 ethylene glycol units (providing average MWs of 300, 1K or 2K, respectively) were conjugated to K14-alkyne using the CuI-catalyzed cycloaddition (‘click’) reaction. The conjugated peptides were then purified by reverse-phase HPLC and characterized by MALDI-TOF mass spectrometry. The relative adhesion levels of unwrapped and PEG-wrapped phage targeting immobilized LNCaP cells were compared by phage-based ELISA. Since the phage lacked a displayed peptide, adhesion could only result from non-specific interactions by the phage coat proteins. In theory, phage wrapped with PEG should bind to LNCaP cells with much lower affinity due to decreased non-specific adhesion. However, no such reduction was observed for the different MW PEGs used. The ineffectiveness of this initial approach likely resulted from interaction between PEG and the K14 sidechains used to wrap the phage. A crown ether-like encapsulation can form between the primary amine of the Lys sidechains and ethylene glycols of PEG, thereby could render K14 incapable of wrapping the phage surface. Without the PEG wrapping the phage surface, the results merely compare phage in different assay wells, as is evident from the overlapping responses. The lack of wrapping by PEGylated oligolysine was further verified by dynamic light scattering measurements, which revealed no significant change in the cross-sectional diameter of the treated phage.
On-Phage Cycloaddition Reaction to Generate PEGylated Phage.
To overcome K14 encapsulation by PEG, phage with PEG wrappers were generated in two steps,
PEG reduces non-specific binding largely by surrounding the attached surface with a hydration sphere. The non-specific binding partners could adhere to the outer surface of the PEG layer, termed secondary adsorption. For non-specific adhesion to the larger surfaces of cells, such secondary adsorption is likely a more pronounced effect.
Dynamic Light Scattering (DLS) Analysis of PEGylation.
To characterize the PEG-wrapped phage, DLS measurements were conducted,
Synthesis of PEGylated Ligands.
Towards the goal of specific recognition of a cell surface receptor, different scaffolds for the display of ligands on phage were explored. First, heterobifunctional PEG, Mal-PEG-NH2, provided reactive groups for selective attachment of oligolysine at the maleimide end and PSMA binding ligands to the amine end, Scheme 1. As the scaffold, PEG100 was chosen to provide a longer polymer brush to reduce non-specific secondary adsorption. Azide-functionalized PSMA ligands 1 and 2 were synthesized by conventional solid-phase peptide synthesis (SPPS), and coupled to pentynoic acid via the click reaction. The resultant N-terminal carboxylic acid group was then coupled to Mal-PEG100-NH2 using HBTU as an amide bond forming agent in water. Since this reaction non-specifically couples amine and carboxylate functionalities, the attachment sites could vary as both ligands have sidechain carboxylic acids. The resultant ligand is termed P100NSP-1/2 for ‘PEG100, non-specific attachment to ligand 1 or 2.’ as described in Table 1. Subsequently, phage wrapped with K14-Cys were coupled to the maleimide terminus of P100NSP-1/2, as described above. Preliminary validation of binding to cell surface PSMA by the PEGylated ligand wrappers was performed by phage ELISA as before,
Bidentate Binding Mode of PEGylated Ligands.
Dual display of ligands 1 and 2 can enable synergistic, high affinity binding to PSMA due to a bidentate binding mode and a hook and loop-like avidity effect. A phage ELISA targeting LNCaP cells with different ratios of the two PEGylated, phage-wrapped ligands examined relative binding affinities. First, the effectiveness of the bidentate binding mode (
Optimizing the Attachment Site for PEGylated Ligands.
Further optimization explored the size, geometry and attachment site of the ligands fused to the PEG wrapper. Such variables can be crucial to the pharmacokinetic properties of PEGylated drugs, which demonstrates the sensitivity of biological recognition to such factors. For example, the attachment sites of the peptide ligand to PEG100 dictates the ligand orientation and the potential availability of peptide sidechains. An alternative synthesis scheme was designed to control ligand orientation. Mal-PEG100-NH2 was first coupled to pentynoic acid. The resultant Mal-PEG100-alkyne was then coupled to the azide-functionalized peptide ligands using click chemistry, providing a specific site of attachment to the ligand. The resulting PEGylated ligand is termed P100SP-1/2 for ‘PEG100, specific attachment to ligand 1 or 2,’ as described in Table 1.
Insertion of a PEG4 linker to reduce steric effects on the attached ligands. Heterobifunctional linkers between PEG and a molecule of interest can enhance activity through flexible additional spacing. We envisioned the incorporation of an average 175 MW PEG4 linker between the peptide ligand and the triazole generated by the click reaction could enhance the binding affinity of the peptide ligands. With only four ethylene glycol units, this highly flexible linker can disconnect the peptide ligand from any steric constraints dictated by PEG100 or the triazole. Thus, the peptide ligands were resynthesized via SPPS, and coupled to azido-PEG4-carboxylic acid (15-Azido-4,7,10,13-tetraoxapentadecanoic acid), thereby inserting a PEG4 linker before the azide functionality. Azido-PEG4-ligands were further linked to PEG100 following the two synthesis routes described above, specific and non-specific addition. The resultant PEGylated ligands are termed P100SP-P4-1/2 and P100NSP-P4-1/2, P4 to indicate the insertion of the PEG4 linker, Table 1. The P100SP-P4-1 and P100SP-P4-2 conjugated peptides were shown to have the expected sizes by gel permeation chromatography (Supplementary Information) and DLS. A further increase of 10 nm in cross-sectional diameter was observed for the addition of the PEG4-fused ligand (
An ELISA compared the relative binding affinities of the four PEGylated ligand 2 variants—specific (solid) and non-specific (patterned) attachment with and without the PEG4 linker,
PEG Spacers to Control Relative Ligand Spacing.
The relative spacing between ligands governs the synergy of the chelate-based avidity effect. To achieve optimal geometry of the two ligands, the relative spacing was systematically engineered by interspersing long PEGylated ligands with smaller PEG wrappers on the phage surface. The smaller PEG wrappers could provide spacers to push apart the PEG-fused ligands on the phage surface. Generating ligands and spacers required the two wrapping modes described above, click chemistry and cysteine-maleimide reaction, on the same phage. K14-alkyne and K14-Cys were pre-mixed to an estimated mole fraction of 0.19, and then used to wrap the phage surface. K14-alkyne was linked to short PEG polymers to provide spacers. Different concentrations of the PEG polymers were explored. The ratio of ligands to spacers was empirically optimized, and a ratio of 1.5:1 provided the best levels of PSMA recognition. The concentration of the PEGylated ligands remained unchanged, and a 2:1 molar mixture of the two ligands was reacted with the K14-Cys wrapped on the phage surface. A higher net concentration of wrappers could be accommodated by the phage as the spacers allowed higher packing density. The dual PSMA ligand combination described above, P100SP-P4-2+P100SP-P4-1, without (green) or with spacers (brown) of either PEG 7, 22 or 45, wrapped around the phage were assayed for binding to LNCaP cells,
Selective Recognition of PSMA Positive Cells.
To demonstrate specificity for PCa cells by these chemically modified phage, binding to different prostate cancer cell lines was compared. LNCaP cells can model early or late stage cancer cells, through variation in their culture conditions. The majority of PCa cases gain resistance to therapies based on androgen ablation. The LNCaP cell line, a model for early stage PCa, is androgen sensitive but gradually loses the androgen requirement, providing a model for late stage PCa, which also mimics androgen ablation. The latter can be simulated by culturing LNCaP cells in androgen-depleted media, referred to as LNCaP CSS (for charcoal-stripped serum). Increased levels of PSMA are associated with androgen independent PCa. Thus, both LNCaP and LNCaP CSS cell lines were assayed. The third cell line, PC3 cells, do not express PSMA, and were used as the negative control. The following assays validate the dual ligand system for cell line discrimination and quantification of cell surface receptors.
The optimized dual ligand combination of P100SP-P4-2+P100SP-P4-1 and the PEG7 spacer was assayed for binding to LNCaP (red), LNCaP CSS (blue), and PC3 cell lines (gold),
Detecting PSMA on Suspended Cells and in Culture Media.
The tailored phage could also capture cells from solution, which is critical for future analytical applications in circulating tumor cell detection and characterization. In this experiment unlike other ELISAs described here, the phage were immobilized on the microtiter plate before applying a solution of cells,
Protocols
Synthesis of PEGylated Ligands—Specific Attachment Mode.
The protocol for the synthesis of the PEGylated ligands was adapted from the solid phase peptide synthesis and click chemistry reaction described above, and also described in
For the next step of the synthesis, ˜50 μL of the alkyne-functionalized-PEG100-Mal, obtained as described above, was conjugated to azide-functionalized peptide ligands (final concentration of 40 μM) by click chemistry as before [1,5]. The reaction mixture was stirred overnight at room temperature. Four identical reactions were run in parallel. To remove unreacted starting materials, the four reaction mixtures were then combined and concentrated to ¼th volume using 3K MWCO concentrators. Next, the resultant solution was diluted with an equal volume of HPLC grade water before concentrating to ˜½ volume using 5K MWCO concentrators. The concentrated reaction mixture was purified using reverse-phase analytical HPLC (
For the non-specific attachment mode described in the text and
Cell Growth.
The cell lines were grown as monolayers in media supplemented with 10% fetal bovine serum (Cellgro), 1 mM sodium pyruvate [7] and 1% penicillin-streptomycin-glutamine in a 5% CO2 and 95% air-humidified atmosphere at 37° C. LNCaP cells were cultured in RPMI 1640 media. For studies with LNCaP cells cultured in charcoal-stripped serum (LNCaP CSS cells), LNCaP cells were washed with PBS and then incubated with phenol free RPMI 1640 media supplemented with 10% charcoal-stripped serum (Cellgro) for five minutes. The cells were again washed with PBS, and provided fresh media [8]. PC3 cells were grown in Ham's F-12 media.
Cell-Based Phage Enzyme-Linked Immunosorbent Assay (ELISA).
Day 1: The cell-based ELISA was performed as previously described by Watanabe et. al. with the following modifications [9]. Cells were detached with Trypsin-EDTA, resuspended in PBS, and then collected by centrifuging at 1200 rpm for 5 min. The cells were further washed with PBS and then concentrated as in the previous step. The concentration of the cells was adjusted to 4.5×106 cells/mL in PBS using a hemocytometer, and 100 μL was aliquoted to specific wells of a 96-well microtiter plate (Maxisorp plates from Nunc). The Maxisorp plates used here have a high protein-binding capacity. Thus, the plates can be used to run assays with either the cells or the phage immobilized in the wells. Next, 50 μL of a 0.15% glutaraldehyde in PBS solution was added to the wells at 4° C., and the solution was gently mixed by pipetting. The ELISA plate was then centrifuged at 1200 rpm for 10 min at 4° C., followed by overnight incubation at 4° C.
Day 2: The cell solution was gently removed, and the wells were blocked with 200 μL/well of blocking buffer containing 100 mM glycine, 1% gelatin and 0.1% w/v BSA (bovine serum albumin) in PBS. The plate was incubated overnight (˜20-22 h) at room temperature.
Separately, phage were prepared before attachment to PEG and PEGylated ligands. Phage (10 nM in 100 μL of PBS) and 1 μL of K14-alkyne (525 μM in water) were thoroughly mixed by pipetting ˜25 times. For phage wrapped with PEGylated ligands, phage were mixed with 0.75 μL of K14-Cys (525 μM in water). For mixed wrapping on the phage surface, 0.5 μL of K14-alkyne was pre-mixed with 0.75 μL of K14-Cys, and then mixed with 100 μL of 10 nM phage. The solution was shaken at room temperature for 15 min on an orbital shaker. Next, 2 μL of PEGylated ligand (625 μM in water) was added to the appropriate wells. For the dual ligand combinations, the PEGylated ligands were pre-mixed in the desired ratio (a 2:1 molar ratio for example), and then 2 μL of the mixture was added to the appropriate wells. The solutions were gently mixed by pipetting, and incubated overnight at 4° C.
Day 3: Next, the click reaction was performed, as previously described, but with the following modifications [1,2,5]. To buffer the pH, triethylammonium acetate was added to a final concentration of 50 mM, followed by the addition of 1.5 μL of 1 mM azide-functionalized PEG. The solutions were mixed by pipetting. Next, ascorbic acid was added to a final concentration of 1 mM and the solutions were mixed by gently pipetting. Then, copper sulfate was added to a final concentration of 1.5 mM, followed by pipetting to mix the solutions. Water was added to the other wells to maintain consistent phage concentrations. The plate was incubated at room temperature for 30 min.
The wells of the ELISA plate were then incubated with the phage samples. The blocking buffer was removed and the wells were gently washed two times with PBS. Next, the phage solution was added to the respective wells and incubated for 45 min. The phage solution was removed, and the wells were washed three times with 300 μL/well of wash buffer PT (0.05% Tween-20 in PBS), once with PBS, and then incubated with horseradish peroxidase-conjugated anti-M13 antibody (100 μL/well, 1:5000 dilution in PBS) for 40 min. The wells were washed three times with PT and once with PBS. The plate was then developed by incubating with HRP substrate solution (100 μL/well; 1 mg/mL o-phenylenediamine dihydrochloride and 0.02% w/v H2O2) in citric acid buffer (50 mM citric acid, 50 mM Na2HPO4, pH 5.0). The HRP activity was measured spectrophotometrically at 450 nm using a microtiter plate reader (Bio-Tek). The absorbance at 630 nm was subtracted from the absorbance at 450 nm to eliminate background.
Phage-Based Sandwich ELISA for Cell Capture.
To demonstrate cell capture by the PEGylated-ligand phage, the phage were coated on the plate, and cells added before quantifying binding. This assay setup inverts other cell-based phage ELISAs reported here. This experiment is a significant step towards establishing the relevance of this phage architecture for biosensing assays planned in the future. In this assay, the PEGylated phage architecture is immobilized on the plate as demonstrated in
Day 1: In this phage capture ELISA, specific wells of a 96-well microtiter plate were coated with 100 μL/well of a solution of 10 nM phage pre-wrapped with oligolysine wrappers, as described above. The plate was incubated for 1 h on a shaker at room temperature. The coating solution was removed, and the wells were blocked with 200 μL/well of 0.2% w/v solution of BSA in PBS for 30 min, and washed two times with PT. Next, 98 μL PBS was added per well, followed by PEGylated ligands and incubated overnight at 4° C.
Day 2: Azide-functionalized PEG variants were then conjugated as described above. Separately, the cells were collected and the concentration adjusted as described above; the ELISA plate was then incubated with 100 μL/well of the cell solution or media for 1 h. The wells were washed with PBS and incubated with 100 μL/well of the anti-PSMA antibody, YPSMA antibody (Abcam) at 1:1000 dilution. The wells were then washed with PBS, followed by incubation with horseradish-peroxidase-conjugated anti-mouse antibody (Sigma) at a 1:1000 dilution. The levels of phage binding were quantified as described above.
Abstract.
Specific detection of circulating tumor cells and characterization of their aggressiveness could improve cancer diagnostics and treatment. Metastasis results from such tumor cells, and causes the majority of cancer deaths. Chemically modified viruses could provide an inexpensive and efficient approach to detect tumor cells and quantitate their cell surface biomarkers. However, non-specific adhesion between the cell surface receptors and the virus surface presents a challenge. This report describes wrapping the virus surface with different PEG architectures, including as fusions to oligolysine, linkers, spacers and scaffolded ligands. The reported PEG wrappers can reduce by >75% the non-specific adhesion of phage to cell surfaces. Dynamic light scattering verified the non-covalent attachment by the reported wrappers as increased sizes of the virus particles. Further modifications resulted in specific detection of prostate cancer cells expressing PSMA, a key prostate cancer biomarker. The approach allowed quantification of PSMA levels on the cell surface, and could distinguish more aggressive forms of the disease.
Introduction.
The migration and dissemination of tumor cells, termed metastasis, causes ≈90% of cancer deaths.[1,2] Metastasis requires loss of apoptotic regulation, and such cells respond poorly to conventional anti-cancer treatments. With a majority of the estimated 27,540 deaths from prostate cancer (PCa) in the US for 2015[3] resulting from metastasis[2], new methods for efficient detection and characterization of metastatic cells could impact clinical care and patient prognosis. Previously, we reported the sensitive detection of soluble prostate-specific membrane antigen (PSMA), a PCa biomarker, at 100 pM concentrations using viruses incorporated into an electrically conductive polymer[4]. Here, we engineer similar bacteriophage, termed ‘phage,’ with polymers and ligands for direct binding to PSMA found on the surface of PCa cells.
PSMA, a 750 residue, 90 kD glycoprotein, is overexpressed on the surface of tumor cells as a non-covalent homodimer in both primary and metastatic cancers [5,6]. Differential splicing during tumorigenesis leads to expression of PSMA as a type II integral membrane protein [7]. Elevated PSMA levels have also been observed in seminal fluid and urine of PCa patients [8]. To detect the protein in urine, we reported viruses with both genetically displayed and chemically synthesized ligands for the sensitive detection of PSMA [4,9]. These ligands, selected from phage-displayed peptide libraries had the following amino acid sequences: ligand-1 (CALCEFLG) (SEQ ID NO:5) and ligand-2 (SECVEVFQNSCDW) (SEQ ID NO:6). Genetically encoded, phage-displayed ligand-2 binds with >100-fold higher affinity to PSMA than ligand-1 [4,10].
Used ubiquitously for molecular display applications, the M13 filamentous phage applied here infects E. coli, and can be manipulated to present genetically encoded peptides on the phage surface [11-13]. The M13 virus consists of a circular, single-stranded DNA genome surrounded by a protein coat composed of approximately 2700 copies of the major coat protein, P8, an α-helical protein of 50 amino acid residues with an unstructured N-terminus. One Glu and two Asp residues near the N-terminus of P8 impart a high negative charge to the outer surface of the virus at physiological pH [14]. Selections with phage-displayed libraries of peptides and proteins can target tissue-cultured cells and even organs in living organisms [15-17]. Phage have also been incorporated into nanomedicine platforms for targeted drug delivery [18-20] and imaging [21,22]. Such applications require low background binding by phage to cell surfaces.
Phage typically adhere to cell surfaces with high affinity, however. Such non-specific adhesion complicates the design of phage-based sensors for the detection of tumor cells; the non-specific background can reduce the signal to noise ratios and the ability to distinguish tumor from non-tumor cells. Francis and co-authors have reported covalently linking the coat proteins of fd phage with both polyethylene glycol (PEG) and imaging agents through a two-step reaction [23]. M13 and fd phage are closely homologous with similar sizes, structural features and sequences [24]. An alternative approach described here, applies non-covalent attachment to the phage surface to access additional architectures for biosensor applications.
Non-covalent attachment offers comparable stability to covalent modification of the virus surface. The high negative charge on the phage surface allows non-covalent wrapping with cationic peptides and polymers [25,26]. Linking these wrappers to recognition ligands opens new routes to greater sensitivity and specificity for target analytes. The peptide ligands can be chemically synthesized and fused to an oligolysine peptide (K14), which ‘wraps’ around the virus particle through complementary electrostatic interactions. Previously, this strategy allowed maximization of ligand density on the phage surface for sensitive detection of biomarkers in complex biofluids, such as synthetic urine v[4]. Here, the overall design incorporates PEG polymers in conjunction with this wrapping strategy to address the problem of non-specific adhesion between phage and cells. Then, we optimize various architectures for the specific detection of PCa cells.
Results and Discussion.
Non-Specific Adhesion of Viruses to Cells.
Among prostate cancer cell lines, LNCaP cells provide the most commonly used in vitro model for early stage PCa [27,28]. Derived from the lymph node adenocarcinoma of the human prostate, LNCaP expresses most of the important PCa biomarkers including PSMA, PSA and AR [29]. Attempts to recognize cell surfaces with conventional phage-displayed ligands resulted in unacceptably high, non-specific adhesion by control phage, which lack a displayed peptide. As shown by ELISA, phage-displayed PSMA ligand 2 and control phage produced similar high levels of binding to LNCaP cells (
Wrapping Phage with PEG to Prevent Non-Specific Adhesion.
The water soluble polymer PEG is commonly bioconjugated to proteins to reduce non-specific adhesion to cells and other surfaces [30-33]. In addition, PEG can increase the solubility of attached therapeutic proteins, prolong circulation times, and decrease proteolysis [34]. Furthermore, the activities of proteins conjugated to PEG typically remain unaffected [35,36]. PEG has been shown to broadly adopt two distinct conformations—descriptively termed ‘mushroom’ and ‘brush.’ [30,37,38] The transition from the mushroom conformation, a more random orientation, to the brush conformation is dependent upon the polymer length and packing densities; longer PEG lengths and higher packing densities favor formation of the brush conformation. This transition can result in a significant drop in non-specific adsorption. In many systems, a mole fraction of 0.15 PEG-modified to -unmodified sites yields significantly reduced non-specific adhesion. High packing densities with such mole fractions can force the polymer to adopt a more stretched, and extended brush conformation to more effectively suppress non-specific adhesion [30]. To provide a framework for experimental design and data interpretation, the reported PEG polymers are assumed to form mushroom and brush conformations based on PEG lengths and packing densities, as has been reported previously [30,37,38].
Initial attempts to block non-specific cell adhesion applied PEG variants with different MWs as phage wrappers. Azide-functionalized, polydispersed PEGs with size distributions centered around 7, 22 or 45 ethylene glycol units (providing average MWs of 300, 1K or 2K, respectively) were conjugated to K14-alkyne using the CuI-catalyzed cycloaddition (‘click’) reaction,
In theory, phage wrapped with PEG should bind to LNCaP cells with much lower affinity due to decreased non-specific adhesion. However, no such reduction was observed for the different MW PEGs used,
On-Phage Cycloaddition Reaction to Generate PEGylated Phage.
To overcome K14 encapsulation by PEG, phage with PEG wrappers were generated in two steps,
PEG reduces non-specific binding largely by surrounding the attached surface with a hydration sphere [40]. Direct contact to the phage surface, termed primary adsorption, requires smaller non-specific binding partners to penetrate the PEG layer. Alternatively, the non-specific binding partners could adhere to the outer surface of the PEG layer, termed secondary adsorption. For non-specific adhesion to the larger surfaces of cells, such secondary adsorption is likely a more pronounced effect. To minimize secondary adsorption, the wrappers were applied at 0.15 mole fraction [30]. Here, we estimate the mole fraction as the stoichiometry of PEG molecules added to P8 coat proteins; this analysis is analogous to the calculations for PEG grafted in lipid membranes [41]. Additionally, we assumed that at the concentration used, PEG22, 45 and 100 adopt brush conformations due to their high packing densities [30,37], which were fixed by maximization of oligolysine wrappers as previously described [4].
Based on published precedent, PEG7 presumably adopts a mushroom conformation [30], and fails to suppress non-specific adhesion to the same levels. The brush conformation of the larger PEGs can more efficiently reduce non-specific secondary adsorption due to the hydration sphere extending further from the virus surface. Beyond a certain height of the polymer brush, the effect of secondary adsorption remains constant as shown by the nominal difference obtained between PEG45 and 100 in
Dynamic Light Scattering (DLS) Analysis of PEGylation.
To characterize the PEG-wrapped phage, DLS measurements were conducted,
Next, the change in average cross-sectional diameter was measured for different samples from each step of the phage wrapping process,
Synthesis of PEGylated Ligands.
Towards the goal of specific recognition of a cell surface receptor, different scaffolds for the display of ligands on phage were explored. First, heterobifunctional PEG, Mal-PEG-NH2, provided reactive groups for selective attachment of oligolysine at the maleimide end and PSMA binding ligands to the amine end, Scheme 1. As the scaffold, PEG100 was chosen to provide a longer polymer brush to reduce non-specific secondary adsorption. Azide-functionalized PSMA ligands 1 and 2 were synthesized by conventional solid-phase peptide synthesis (SPPS), and coupled to pentynoic acid via the click reaction. The resultant N-terminal carboxylic acid group was then coupled to Mal-PEG100-NH2 using HBTU as an amide bond forming agent in water. Since this reaction non-specifically couples amine and carboxylate functionalities, the attachment sites could vary as both ligands have sidechain carboxylic acids. The resultant ligand is termed P100NSP-1/2 for ‘PEG100, non-specific attachment to ligand 1 or 2.’ as described in Table 1.
Subsequently, phage wrapped with K14-Cys were coupled to the maleimide terminus of P100NSP-1/2, as described above. Preliminary validation of binding to cell surface PSMA by the PEGylated ligand wrappers was performed by phage ELISA as before,
Bidentate Binding Mode of PEGylated Ligands.
Dual display of ligands 1 and 2 can enable synergistic, high affinity binding to PSMA due to a bidentate binding mode and a hook and loop-like avidity effect [4]. A phage ELISA targeting LNCaP cells with different ratios of the two PEGylated, phage-wrapped ligands examined relative binding affinities. First, the effectiveness of the bidentate binding mode (
Optimizing the Attachment Site for PEGylated Ligands.
Further optimization explored the size, geometry and attachment site of the ligands fused to the PEG wrapper. Such variables can be crucial to the pharmacokinetic properties of PEGylated drugs, which demonstrates the sensitivity of biological recognition to such factors [46]. For example, the attachment sites of the peptide ligand to PEG100 dictates the ligand orientation and the potential availability of peptide sidechains. An alternative synthesis scheme was designed to control ligand orientation. Mal-PEG100-NH2 was first coupled to pentynoic acid,
Specific attachment of PEG to the wrapped ligands could improve binding affinity by removing attachment through the ligands' sidechains and also altering their orientation on the phage surface. The significance of ligand orientation is apparent through the higher binding affinity observed for genetically encoded, phage-displayed ligand 2 (dashed red line) relative to phage wrapped with chemically synthesized ligand 2 (solid line),
Insertion of a PEG4 Linker to Reduce Steric Effects on the Attached Ligands.
Heterobifunctional linkers between PEG and a molecule of interest can enhance activity through flexible additional spacing [40]. We envisioned the incorporation of an average 175 MW PEG4 linker between the peptide ligand and the triazole generated by the click reaction could enhance the binding affinity of the peptide ligands. With only four ethylene glycol units, this highly flexible linker can disconnect the peptide ligand from any steric constraints dictated by PEG100 or the triazole,
An ELISA compared the relative binding affinities of the four PEGylated ligand 2 variants—specific (solid) and non-specific (patterned) attachment with and without the PEG4 linker,
The dual ligand combinations of peptides 1 and 2 were expected to further provide higher affinity through bidentate binding. However, only a modest improvement was observed for the combination of P100NSP-2+P100NSP-1 versus the best individual ligand, P100SP-P4-2,
Controlling the geometry of the PEG4 linker could prevent masking of the Lys sidechains of ligand 1. Sandwiching PEG4 between PEG100 and the peptide ligand through the specific attachment mode, eliminates such debilitating effects, as shown by a significant increase in binding affinity for the dual ligand system P100SP-P4-2+P100SP-P4-1 (
PEG Spacers to Control Relative Ligand Spacing.
The relative spacing between ligands governs the synergy of the chelate-based avidity effect. To achieve optimal geometry of the two ligands, the relative spacing was systematically engineered by interspersing long PEGylated ligands with smaller PEG wrappers on the phage surface. The smaller PEG wrappers could provide spacers to push apart the PEG-fused ligands on the phage surface. Generating ligands and spacers required the two wrapping modes described above, click chemistry and cysteine-maleimide reaction, on the same phage. K14-alkyne and K14-Cys were pre-mixed to an estimated mole fraction of 0.19 (as described above), and then used to wrap the phage surface. K14-alkyne was linked to short PEG polymers to provide spacers. Different concentrations of the PEG polymers were explored. The ratio of ligands to spacers was empirically optimized, and a ratio of 1.5:1 provided the best levels of PSMA recognition (data not shown). The concentration of the PEGylated ligands remained unchanged, and a 2:1 molar mixture of the two ligands was reacted with the K14-Cys wrapped on the phage surface, A higher net concentration of wrappers could be accommodated by the phage as the spacers allowed higher packing density.
The dual PSMA ligand combination described above, P100SP-P4-2+P100SP-P4-1, without (green) or with spacers (brown) of either PEG 7, 22 or 45, wrapped around the phage were assayed for binding to LNCaP cells,
Selective Recognition of PSMA Positive Cells.
To demonstrate specificity for PCa cells by these chemically modified phage, binding to different prostate cancer cell lines was compared. LNCaP cells can model early or late stage cancer cells, through variation in their culture conditions. The majority of PCa cases gain resistance to therapies based on androgen ablation [47]. The LNCaP cell line, a model for early stage PCa, is androgen sensitive but gradually loses the androgen requirement, providing a model for late stage PCa, which also mimics androgen ablation [47,48]. The latter can be simulated by culturing LNCaP cells in androgen-depleted media, referred to as LNCaP CSS (for charcoal-stripped serum) [49,50]. Increased levels of PSMA are associated with androgen independent PCa [48]. Thus, both LNCaP and LNCaP CSS cell lines were assayed. The third cell line, PC3 cells, do not express PSMA, and were used as the negative control [29,51]. The following assays validate the dual ligand system for cell line discrimination and quantification of cell surface receptors.
The optimized dual ligand combination of P100SP-P4-2+P100SP-P4-1 and the PEG7 spacer was assayed for binding to LNCaP, LNCaP CSS, and PC3 cell lines,
Detecting PSMA on Suspended Cells and in Culture Media.
The tailored phage could also capture cells from solution, which is critical for future analytical applications in circulating tumor cell detection and characterization. In this experiment unlike other ELISAs described here, the phage were immobilized on the microtiter plate before applying a solution of cells,
PSMA levels are elevated in the urine samples of PCa patients, and levels of this biomarker correlate with the aggressiveness of the disease [8,52]. Therefore, cultured PCa cells should release PSMA into their culture media. Thus, PSMA detection was also performed with cell culture supernatant, normalized to the volume and the number of cells (
Materials and Methods.
All chemicals and reagents were purchased from Sigma-Aldrich, and used as received unless otherwise noted. PSMA and the cell lines LNCaP and PC3 were generous gifts from Drs. William Ernst and Gary Fuji (Molecular Express). Maleimide-PEG100-amine and 15-azido-4,7,10,13-tetraoxapentadecanoic acid (azido-PEG4-carboxylic acid) were purchased from Alfa Aesar. N,N-Diisopropylethylamine (DIPEA) and azide-functionalized PEG7, 22 and 45 were purchased from Sigma, and 4-Azidobutanoic acid was purchased from Synthonix. 4-Pentynoic acid (GFS Chemicals, Inc.), O-benzotiazole-N,N,N′,N′-tetramethyl-uronium-hexafluoro-phosphate, HBTU (GL Biochem Ltd.), triethyammonium acetate buffer (Fluka Biochemika) and Tween-20 (EMD Science) were used as received. HPLC-grade water was used for the preparation of solutions.
M13 Bacteriophage Propagation.
Phage propagation and isolation was performed as previously described [1,2]. Briefly, the phagemid DNA was transformed into CaCl2 competent E. coli XL-1 Blue cells. The cells were grown at 37° C. in 2 mL 2YT media supplemented with carbenicillin and tetracycline until the culture reached log-phase growth. The culture was then infected with KO7 helper phage with a multiplicity of infection of 4.5:1. The starting culture was then transferred to 75 mL 2YT media supplemented with carbenicillin and kanamycin. The phage culture was incubated for 16 h at 37° C. with shaking. The phage were isolated from the culture supernatant by centrifugation at 10 krpm after precipitation through addition of ⅕th volume of PEG-NaCl (2.5 M NaCl, 20% PEG-8000). After second precipitation, the phage were resuspended in phosphate-buffered saline (PBS, 135 mM NaCl, 2.50 mM KCl, 8.00 mM Na2HPO4, 30.0 mM KH2PO4, pH 7.2). Phage concentration was determined by UV absorbance at 268 nm (OD268 of 1.0=8.31 nM phage).
Solid Phase Peptide Synthesis.
The peptides were synthesized by conventional solid-phase peptide synthesis with Fmoc-protected amino acids on Rink-amide resin (Novabiochem), as previously described [1,3,4]. The peptide N-terminus was coupled to 4-azido butanoic acid or 4-pentynoic acid to yield the azide- or alkyne-functionalized peptides, respectively. For incorporation of the PEG4 linker, the last coupling step was performed with 15-azido-4,7,10,13-tetraoxapentadecanoic acid. The reported peptides were purified by reverse-phase HPLC purification with a C18 column. Fractions containing the purified peptides were combined and concentrated using rotary evaporation, followed by lyophilization and characterization by MALDI-TOF mass spectrometry. The calculated m/z for peptide-1 [M+] 1349.67, found 1349.77. The calculated m/z for peptide-2 [M+] 2040.28, found 2040.23. The calculated m/z for peptide-1 fused to the azido-PEG4 linker is [M+] 1510.80, found 1511.78. The calculated m/z for peptide-2 fused to the azido-PEG4 linker is [M+] 2201.02, found 2202.06. The calculated m/z for alkyne-functionalized K14 peptide [M+Na]+1914.37, found 1914.18. The calculated m/z for K14-Cys peptide [M3+] 638.79, found 638.79.
Click chemistry reaction for the synthesis of PEGylated oligolysines. The protocol for the synthesis of PEGylated oligolysines was adapted from Lumiprobe Corporation's protocol, as described previously [1,2,5,6]. Briefly, the reaction was performed at a final concentration of 100 μM azide-functionalized PEG. The product obtained was purified using reverse-phase analytical HPLC and characterized by MALDI-TOF mass spectrometry. The calculated m/z for alkyne-functionalized K14 fused to azide-functionalized PEG7 [M+] 2285.61, found 2286.93. The mass spectrometry data obtained for PEGylated oligolysine showed a shift in the characteristically polydispersed PEG spectra by the expected mass of K14-alkyne,
Synthesis of PEGylated Ligands—Specific Attachment Mode.
The protocol for the synthesis of the PEGylated ligands was adapted from the solid phase peptide synthesis and click chemistry reaction described above, and also described in
For the next step of the synthesis, ˜50 μL of the alkyne-functionalized-PEG100-Mal, obtained as described above, was conjugated to azide-functionalized peptide ligands (final concentration of 40 μM) by click chemistry as before [1,5]. The reaction mixture was stirred overnight at room temperature. Four identical reactions were run in parallel. To remove unreacted starting materials, the four reaction mixtures were then combined and concentrated to ¼th volume using 3K MWCO concentrators. Next, the resultant solution was diluted with an equal volume of HPLC grade water before concentrating to ˜½ volume using 5K MWCO concentrators. The concentrated reaction mixture was purified using reverse-phase analytical HPLC (
For the non-specific attachment mode described in the text and
Gel Permeation Chromatography (GPC).
GPC was used to characterize the MW's of the PEGylated peptides. The molecular weights of the polymers, calibrated with PEG MW standards, were obtained with an Agilent 1100 series GPC system (Agilent Technologies, Santa Clara, Calif.) using 0.1% (v/v) LiBr/DMF solution (1.0 mL/min) as the eluent. The commercially purchased, unmodified Mal-PEG100-NH2 was found to have an average molecular weight of Mn=3690, and this polymer eluted as a broad peak consistent with its size distribution. The calculated molecular weights for PEGylated-1 and -2 with the PEG4 linker, based on the molecular weight of Mal-PEG100-NH2, were estimated as 5300 and 5990, respectively. The corresponding molecular weights observed by GPC were 5310 and 6060, respectively.
Dynamic Light Scattering (DLS).
To demonstrate wrapping by PEGylated ligands and other materials on the phage surface, DLS measurements were obtained using Nano ZetaSizer ZS series. For determination of size, 1 mL of each sample was measured at the same concentration as used for the biological assay. Each sample was measured at least three times at 25° C., with each individual size measurement being the average of 10 runs.
Cell Growth.
The cell lines were grown as monolayers in media supplemented with 10% fetal bovine serum (Cellgro), 1 mM sodium pyruvate [7] and 1% penicillin-streptomycin-glutamine in a 5% CO2 and 95% air-humidified atmosphere at 37° C. LNCaP cells were cultured in RPMI 1640 media. For studies with LNCaP cells cultured in charcoal-stripped serum (LNCaP CSS cells), LNCaP cells were washed with PBS and then incubated with phenol free RPMI 1640 media supplemented with 10% charcoal-stripped serum (Cellgro) for five minutes. The cells were again washed with PBS, and provided fresh media [8]. PC3 cells were grown in Ham's F-12 media.
Cell-Based Phage Enzyme-Linked Immunosorbent Assay (ELISA):
Day 1: The cell-based ELISA was performed as previously described by Watanabe et. al. with the following modifications [9]. Cells were detached with Trypsin-EDTA, resuspended in PBS, and then collected by centrifuging at 1200 rpm for 5 min. The cells were further washed with PBS and then concentrated as in the previous step. The concentration of the cells was adjusted to 4.5×106 cells/mL in PBS using a hemocytometer, and 100 μL was aliquoted to specific wells of a 96-well microtiter plate (Maxisorp plates from Nunc). The Maxisorp plates used here have a high protein-binding capacity. Thus, the plates can be used to run assays with either the cells or the phage immobilized in the wells. Next, 50 μL of a 0.15% glutaraldehyde in PBS solution was added to the wells at 4° C., and the solution was gently mixed by pipetting. The ELISA plate was then centrifuged at 1200 rpm for 10 min at 4° C., followed by overnight incubation at 4° C.
Day 2: The cell solution was gently removed, and the wells were blocked with 200 μL/well of blocking buffer containing 100 mM glycine, 1% gelatin and 0.1% w/v BSA (bovine serum albumin) in PBS. The plate was incubated overnight (˜20-22 h) at room temperature.
Separately, phage were prepared before attachment to PEG and PEGylated ligands. Phage (10 nM in 100 μL of PBS) and 1 μL of K14-alkyne (525 μM in water) were thoroughly mixed by pipetting ˜25 times. For phage wrapped with PEGylated ligands, phage were mixed with 0.75 μL of K14-Cys (525 μM in water). For mixed wrapping on the phage surface, 0.5 μL of K14-alkyne was pre-mixed with 0.75 μL of K14-Cys, and then mixed with 100 μL of 10 nM phage. The solution was shaken at room temperature for 15 min on an orbital shaker. Next, 2 μL of PEGylated ligand (625 μM in water) was added to the appropriate wells. For the dual ligand combinations, the PEGylated ligands were pre-mixed in the desired ratio (a 2:1 molar ratio for example), and then 2 μL of the mixture was added to the appropriate wells. The solutions were gently mixed by pipetting, and incubated overnight at 4° C.
Day 3: Next, the click reaction was performed, as previously described, but with the following modifications [1,2,5]. To buffer the pH, triethylammonium acetate was added to a final concentration of 50 mM, followed by the addition of 1.5 μL of 1 mM azide-functionalized PEG. The solutions were mixed by pipetting. Next, ascorbic acid was added to a final concentration of 1 mM and the solutions were mixed by gently pipetting. Then, copper sulfate was added to a final concentration of 1.5 mM, followed by pipetting to mix the solutions. Water was added to the other wells to maintain consistent phage concentrations. The plate was incubated at room temperature for 30 min.
The wells of the ELISA plate were then incubated with the phage samples. The blocking buffer was removed and the wells were gently washed two times with PBS. Next, the phage solution was added to the respective wells and incubated for 45 min. The phage solution was removed, and the wells were washed three times with 300 μL/well of wash buffer PT (0.05% Tween-20 in PBS), once with PBS, and then incubated with horseradish peroxidase-conjugated anti-M13 antibody (100 μL/well, 1:5000 dilution in PBS) for 40 min. The wells were washed three times with PT and once with PBS. The plate was then developed by incubating with HRP substrate solution (100 μL/well; 1 mg/mL o-phenylenediamine dihydrochloride and 0.02% w/v H2O2) in citric acid buffer (50 mM citric acid, 50 mM Na2HPO4, pH 5.0). The HRP activity was measured spectrophotometrically at 450 nm using a microtiter plate reader (Bio-Tek). The absorbance at 630 nm was subtracted from the absorbance at 450 nm to eliminate background.
Phage-Based Sandwich ELISA for Cell Capture.
To demonstrate cell capture by the PEGylated-ligand phage, the phage were coated on the plate, and cells added before quantifying binding. This assay setup inverts other cell-based phage ELISAs reported here. This experiment is a significant step towards establishing the relevance of this phage architecture for biosensing assays planned in the future. In this assay, the PEGylated phage architecture is immobilized on the plate as demonstrated in
Day 1: In this phage capture ELISA, specific wells of a 96-well microtiter plate were coated with 100 μL/well of a solution of 10 nM phage pre-wrapped with oligolysine wrappers, as described above. The plate was incubated for 1 h on a shaker at room temperature. The coating solution was removed, and the wells were blocked with 200 μL/well of 0.2% w/v solution of BSA in PBS for 30 min, and washed two times with PT. Next, 98 μL PBS was added per well, followed by PEGylated ligands and incubated overnight at 4° C.
Day 2: Azide-functionalized PEG variants were then conjugated as described above. Separately, the cells were collected and the concentration adjusted as described above; the ELISA plate was then incubated with 100 μL/well of the cell solution or media for 1 h. The wells were washed with PBS and incubated with 100 μL/well of the anti-PSMA antibody, YPSMA antibody (Abcam) at 1:1000 dilution. The wells were then washed with PBS, followed by incubation with horseradish-peroxidase-conjugated anti-mouse antibody (Sigma) at a 1:1000 dilution. The levels of phage binding were quantified as described above.
Conclusions.
In conclusion, this study demonstrates a systematic approach to engineering the phage surface through chemical tailoring. Chemically modifying viruses with PEG addresses a major issue of non-specific adhesion to cellular surfaces, and further engineering allowed specific detection using PEGylated ligands. The reported PEGylated dual ligand combination provides a foundation for applying the phage to cell-based analysis, where highly specific molecular recognition of cells is essential. Optimization of binding affinity required optimization of the PEG length, packing density, point of attachment, linkers and spacers. The versatility of PEG allows such multivariate optimization. This biocompatible polymer is widely available with diverse functionalities for bioconjugation and also has moderately predictable conformations to guide engineering. Furthermore, we demonstrate control over the relative spatial configuration of the ligands using small PEG polymers interdigitated with larger PEG brushes in a general approach applicable to many binding optimization studies. Most importantly, these chemically modified phage could readily distinguish PSMA-positive from PSMA-negative cells, and also identify more aggressive PCa tumor cells. In the future, we will apply such phage to the capture and detection of circulating tumor cells for use in cell-based detectors.
This application claims the benefit of U.S. Provisional Application No. 62/237,781, filed Oct. 6, 2015, the content of which is incorporated herein by reference in its entirety and for all purposes.
This invention was made with Government support under grant numbers 1 R43 AI074163 and 1 R43 CA050779, awarded by The National Institutes of Health. The Government has certain rights in this invention.
Number | Date | Country | |
---|---|---|---|
62237781 | Oct 2015 | US |