PHOTOACTIVATABLE COMPOUNDS AND USES THEREOF

FIELD

Provided herein are compounds, compositions, systems, and methods for photoactivatable labeling, which can be actuated within biological systems. In particular, compounds disclosed herein include vinyl-extended-aryl azide moieties that undergo photoactivation to generate reactive intermediates, which can form covalent linkages with biomolecules. Photoactivation can be conducted by a variety of mechanisms including ultraviolet (UV) irradiation, visible light irradiation, or energy transfer (e.g., from a photocatalyst). The compounds also include functional moieties that provide useful functionalities, for example detection and/or enrichment of biomolecules, such as fluorophores, capture elements (e.g., biotin), reactive moieties (e.g., click handles), and bifunctional moieties (e.g., a moiety comprising a bioactive compound and either a fluorophore or a capture element or a reactive moiety).

BACKGROUND

The need to study and map dynamic microenvironments, molecular networks, and small molecules interactions as well as protein-protein and protein-nucleic acid interactions in physiologically relevant contexts create a demand for new functional biological tools to enable such analyses in live cells and complex models in a nondestructive fashion.

SUMMARY

In one aspect, disclosed herein is a compound of formula (I):

embedded image

- or a salt thereof, wherein:
- A is selected from:

embedded image

- - wherein:
  - each n is independently 1, 2, 3, or 4; and
  - each R is independently selected from hydrogen, halo, C₁-C₄alkyl, C₂-C₄alkenyl, hydroxy, mercapto, amino, cyano, C₁-C₄-alkoxy, halo-C₁-C₄-alkyl, hydroxy-C₁-C₄-alkyl, amino-C₁-C₄-alkyl, mercapto-C₁-C₄-alkyl, cyano-C₁-C₄-alkyl, —C(O)—C₁-C₄-alkyl, —C(O)OH, and —C(O)NH₂;
- R′ is hydrogen or C₁-C₄alkyl;
- L is a linker; and
- Y is a functional moiety.

In some embodiments, A is:

embedded image

- wherein:
- R¹, R², R³, and R⁴are each independently selected from hydrogen, halo, hydroxy, cyano, and C₁-C₄alkoxy.

In some embodiments: R¹is hydrogen, hydroxy, or C₁-C₄alkoxy; R²is hydrogen, halo, cyano, or C₁-C₄alkoxy; R³is hydrogen or halo; and R⁴is hydrogen or halo.

In some embodiments, A is:

embedded image

- wherein each R is independently selected from hydrogen, halo, cyano, and C₁-C₄alkoxy.

In some embodiments, A is:

embedded image

- wherein R is selected from hydrogen, halo, cyano, and C₁-C₄alkoxy.

In some embodiments, A is:

embedded image

- wherein R is selected from hydrogen, halo, cyano, and C₁-C₄alkoxy.

In some embodiments, A has a formula selected from:

embedded image

In some embodiments, R′ is selected from hydrogen and methyl.

In some embodiments, the linker comprises one or more moieties selected from straight or branched chain alkylene, ether (—O—), amine (—NH—), ester (—C(O)O—), amide (—C(O)NH—), carbamate (—NHC(O)O—), urea (—NHC(O)NH—), and phenylene groups.

In some embodiments, the linker has a formula:

—NHCH₂CH₂(OCH₂CH₂)_nNH—

- wherein n is 1, 2, 3, 4, 5, 6, 7, or 8.

In some embodiments, Y is a functional moiety selected from a capture element, a detectable moiety, a reactive moiety, and a bifunctional moiety.

In some embodiments, Y is a capture element selected from biotin and a haloalkane group. In some embodiments, Y has a formula:

embedded image

In some embodiments, Y has a formula —(CH₂)_n—X, wherein n is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, or 12, and X is a halogen.

In some embodiments, Y is a detectable moiety. In some embodiments, Y is a fluorescent functional group. In some embodiments, Y is a fluorescent functional group selected from selected from a xanthene, a cyanine, a naphthalene, an oxadiazole, a pyrene, an oxazine, an acridine, an arylmethine, a tetrapyrrole, a coumarin, a squaraine, and a boron-dipyrromethene. In some embodiments, Y is a fluorogenic functional group.

In some embodiments, Y is a reactive functional group. In some embodiments, Y is a reactive functional group comprising an azide, alkyne, alkene, or 1,2,4,5-tetrazinyl moiety.

In some embodiments, Y is a bifunctional moiety comprising: (i) a bioactive compound; and (ii) a capture element or a fluorescent moiety or a reactive moiety.

In some embodiments, the compound of formula (I) is a compound selected from the group consisting of:

embedded image

and salts thereof.

In another aspect, disclosed herein is a system for photocatalytic labeling of a biomolecule, comprising:

- (a) a compound of formula (I) (e.g., any compound of formula (I) disclosed herein); and
- (b) a photocatalyst.

In some embodiments, the photocatalyst has a structure

embedded image

- wherein:
- each set of dashed lines (------) represents the presence or absence of a fused 6-membered ring;
- M is a transition metal;
- m1, m2, m3, n1, n2, n3, p1, p2, and p3 are each independently 0, 1, or 2;
- R^1a, R^1b, R^1c, R^2a, R^2b, R^2c, R^3a, R^3b, and R^3care each independently selected from halo, alkyl, haloalkyl, amino, and heteroalkyl;
- X^1a, X^1b, X^2a, X^2b, X^3a, and X^3bare each independently selected from N and C, wherein at least one of X^1aand X^1bis N, at least one of X^2aand X^2bis N, and at least one of X^3aand X^3bis N;
- X^1c, X^1d, X^2e, X^2d, X^3c, and X^3dare each independently selected from CH and N;
- Z is an anion; and
- q is 0, 1, or 2.

In some embodiments, the transition metal is selected from Ru and Ir.

In some embodiments, the photocatalyst is an iridium photocatalyst selected from:

embedded image

In some embodiments, the photocatalyst is a ruthenium-based photocatalyst selected from:

embedded image

In some embodiments, the photocatalyst is of the formula:

embedded image

In one aspect, disclosed herein is a method of labeling a biomolecule in a sample, comprising:

- (a) contacting the sample with a compound of formula (I) (e.g., any compound of formula (I) disclosed herein); and
- (b) exposing the sample to light.

In some embodiments, the light is selected from ultraviolet light and visible light. In some embodiments, the light is visible light from a light-emitting diode. In some embodiments, the light is bioluminescent light. In some embodiments, the method further comprises contacting the sample with a photocatalyst in step (a).

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows a cartoon depiction of covalent labeling using a probe comprising a vinyl-extended-aryl-azide-based photoreactive group, which upon activation generates a short-lived reactive intermediate that is capable of forming a covalent linkage with neighboring biomolecules. Activation modes includes: (A) UV irradiation, (B) Visible 455 nm LED irradiation, (C) LED-triggered activation of a light sensitive catalyst, which further engages in energy transfer events with the vinyl-extended-aryl-azide-based photoreactive group, and (D) bioluminescence-triggered activation of a light sensitive catalyst, which further engages in energy transfer events with the vinyl-extended-aryl-azide-based photoreactive group.

FIGS. 2A-2B show: (2A) a schematic representation of a probe comprising a vinyl-extended-aryl-azide-based photoreactive group linked to a functional moiety via a linker; (2B) structures of the vinyl-extended-aryl-azide-based photoreactive groups.

FIGS. 3A-3C show absorbance profiles for a range of vinyl-extended-aryl-azide-based photoreactive groups.

FIG. 4 shows data for evaluation of crosslinking efficiencies for a range of vinyl-extended-aryl-azide-based photoreactive groups; in particular, the data show slot blot analyses of covalent protein labeling induced by either direct irradiation with UV or visible light (455 nm LED) or LED-triggered activation of an iridium catalyst, which further engages in energy transfer events with the vinyl-extended-aryl-azide-based photoreactive group.

FIG. 5 shows data evaluating covalent labeling efficiencies for a range of vinyl-extended aryl-azide-based photoreactive groups; in particular, the data shows quantitation of the slot blot analyses from FIG. 4.

FIGS. 6A-6C show data for evaluation of a range of vinyl-extended-aryl-azide-based photoreactive groups for their capacity to undergo bioluminescence-triggered photocatalytic covalent protein labeling; in particular, the data show Western analyses of covalent protein labeling induced by bioluminescence-triggered activation of an iridium catalyst, which further engages in energy transfer events with the vinyl-extended-aryl-azide-based photoreactive group.

DEFINITIONS

Although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of embodiments described herein, some preferred methods, compositions, devices, and materials are described herein. However, before the present materials and methods are described, it is to be understood that this invention is not limited to the particular molecules, compositions, methodologies, or protocols herein described, as these may vary in accordance with routine experimentation and optimization. It is also to be understood that the terminology used in the description is for the purpose of describing the particular versions or embodiments only, and is not intended to limit the scope of the embodiments described herein.

Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. However, in case of conflict, the present specification, including definitions, will control. Accordingly, in the context of the embodiments described herein, the following definitions apply.

As used herein and in the appended claims, the singular forms “a,” “an,” and “the” include plural reference unless the context clearly dictates otherwise. Thus, for example, reference to “a peptide” is a reference to one or more peptides and equivalents thereof known to those skilled in the art, and so forth.

As used herein, the term “and/or” includes any and all combinations of listed items, including any of the listed items individually. For example, “A, B, and/or C” encompasses A, B, C, AB, AC, BC, and ABC, each of which is to be considered separately described by the statement “A, B, and/or C.”

As used herein, the term “comprise” and linguistic variations thereof denote the presence of recited feature(s), element(s), method step(s), etc., without the exclusion of the presence of additional feature(s), element(s), method step(s), etc. Conversely, the term “consisting of” and linguistic variations thereof, denotes the presence of recited feature(s), element(s), method step(s), etc., and excludes any unrecited feature(s), element(s), method step(s), etc., except for ordinarily-associated impurities. The phrase “consisting essentially of” denotes the recited feature(s), element(s), method step(s), etc. and any additional feature(s), element(s), method step(s), etc., that do not materially affect the basic nature of the composition, system, or method. Many embodiments herein are described using open “comprising” language. Such embodiments encompass multiple closed “consisting of” and/or “consisting essentially of” embodiments, which may alternatively be claimed or described using such language.

As used herein, the term “substantially” means that the recited characteristic, parameter, and/or value need not be achieved exactly, but that deviations or variations, including for example, tolerances, measurement error, measurement accuracy limitations and other factors known to skill in the art, may occur in amounts that do not preclude the effect the characteristic was intended to provide. A characteristic or feature that is substantially absent (e.g., substantially non-luminescent) may be one that is within the noise, beneath background, below the detection capabilities of the assay being used, or a small fraction (e.g., <1%, <0.1%, <0.01%, <0.001%, <0.00001%, <0.000001%, <0.0000001%) of the significant characteristic (e.g., luminescent intensity of a bioluminescent protein or bioluminescent complex).

As used herein, the term “biomolecule” or “biological molecule” refers to molecules and ions that are present in organisms and are essential to a biological process(es) such as cell division, morphogenesis, or development. Biomolecules include large macromolecules (or polyanions) such as proteins, carbohydrates, lipids, and nucleic acids as well as small molecules such as primary metabolites, secondary metabolites, and natural products. A more general name for this class of material is biological materials. Biomolecules are usually endogenous, but may also be exogenous. For example, pharmaceutical drugs may be natural products or semisynthetic (biopharmaceuticals), or they may be totally synthetic.

Definitions of specific functional groups and chemical terms are described in more detail below. For purposes of this disclosure, the chemical elements are identified in accordance with the Periodic Table of the Elements, CAS version, Handbook of Chemistry and Physics, 75^thEd., inside cover, and specific functional groups are generally defined as described therein. Additionally, general principles of organic chemistry, as well as specific functional moieties and reactivity, are described in Sorrell, Organic Chemistry, 2^ndedition, University Science Books, Sausalito, 2006; Smith, March's Advanced Organic Chemistry: Reactions, Mechanism, and Structure, 7^thEdition, John Wiley & Sons, Inc., New York, 2013; Larock, Comprehensive Organic Transformations, 3^rdEdition, John Wiley & Sons, Inc., New York, 2018; and Carruthers, Some Modern Methods of Organic Synthesis, 3^rdEdition, Cambridge University Press, Cambridge, 1987; the entire contents of each of which are incorporated herein by reference.

As used herein, the term “alkyl” means a straight or branched saturated hydrocarbon chain containing from 1 to 30 carbon atoms, for example 1 to 16 carbon atoms (C₁-C₁₆alkyl), 1 to 14 carbon atoms (C₁-C₁₄alkyl), 1 to 12 carbon atoms (C₁-C₁₂alkyl), 1 to 10 carbon atoms (C₁-C₁₀alkyl), 1 to 8 carbon atoms (C₁-C₈alkyl), 1 to 6 carbon atoms (C₁-C₆alkyl), 1 to 4 carbon atoms (C₁-C₄alkyl), 6 to 20 carbon atoms (C₆-C₂₀alkyl), or 8 to 14 carbon atoms (C₅-C₁₄alkyl). Representative examples of alkyl include, but are not limited to, methyl, ethyl, n-propyl, iso-propyl, n-butyl, sec-butyl, iso-butyl, tert-butyl, n-pentyl, isopentyl, neopentyl, n-hexyl, 3-methylhexyl, 2,2-dimethylpentyl, 2,3-dimethylpentyl, n-heptyl, n-octyl, n-nonyl, n-decyl, n-undecyl, and n-dodecyl.

As used herein, the term “alkenyl” means a straight or branched hydrocarbon chain containing at least one carbon-carbon double bond. The double bond(s) may be located at any positions with the hydrocarbon chain. Representative examples of alkenyl include, but are not limited to, ethenyl, 2-propenyl, 2-methyl-2-propenyl, 3-butenyl, 4-pentenyl, 5-hexenyl, 2-heptenyl, 2-methyl-1-heptenyl, and 3-decenyl.

As used herein, the term “alkynyl” means a straight or branched hydrocarbon chain containing at least one carbon-carbon triple bond. The triple bond(s) may be located at any position within the hydrocarbon chain. Representative examples of alkynyl include, but are not limited to, ethynyl, propynyl, and butynyl.

As used herein, the term “alkylene” means a divalent alkyl radical (e.g., —CH₂CH₂—). As used herein, the term “alkenylene” means a divalent alkenyl radical (e.g., —CH═CH—). As used herein, the term “alkynylene” means a divalent alkynyl radical (e.g., —C≡C—).

As used herein, the term “alkoxy” refers to an alkyl group, as defined herein, appended to the parent molecular moiety through an oxygen atom. Representative examples of alkoxy include, but are not limited to, methoxy, ethoxy, propoxy, 2-propoxy, butoxy, and tert-butoxy.

As used herein, the term “amino” means a —NH₂group.

As used herein, the term “aminoalkyl” means an alkyl group, as defined herein, in which at least one hydrogen atom is replaced with an amino group, as defined herein. Representative examples of aminoalkyl include, but are not limited to, aminomethyl, 2-aminoethyl, 2-aminopropyl, 3-aminopropyl, and 4-aminobutyl.

As used herein, the term “cyano” means a —CN group.

As used herein, the term “cyanoalkyl” means an alkyl group, as defined herein, in which at least one hydrogen atom is replaced with a cyano group, as defined herein. Representative examples of cyanoalkyl include, but are not limited to, cyanomethyl, 2-cyanoethyl, 2-cyanopropyl, 3-cyanopropyl, and 4-cyanobutyl.

As used herein, the term “halogen” or “halo” means F, Cl, Br, or I.

As used herein, the term “haloalkyl” means an alkyl group, as defined herein, in which at least one hydrogen atom (e.g., one, two, three, four, five, six, seven or eight hydrogen atoms) is replaced with a halogen. In some embodiments, each hydrogen atom of the alkyl group is replaced with a halogen. Representative examples of haloalkyl include, but are not limited to, fluoromethyl, difluoromethyl, trifluoromethyl, 2,2,2-trifluoroethyl, and 3,3,3-trifluoropropyl.

As used herein, the term “heteroalkyl” means an alkyl group, as defined herein, in which one or more of the carbon atoms (and any associated hydrogen atoms) are each independently replaced with a heteroatom group such as —NR—, —O—, —S—, —S(O)—, —S(O)₂—, and the like, where R is H, alkyl, aryl, cycloalkyl, heteroalkyl, heteroaryl, or heterocyclyl, each of which may be optionally substituted. By way of example, 1, 2, or 3 carbon atoms may be independently replaced with the same or different heteroatomic group. Examples of heteroalkyl groups include, but are not limited to, —OCH₃, —CH₂OCH₃, —SCH₃, —CH₂SCH₃, —NRCH₃, and —CH₂NRCH₃, where R is hydrogen, alkyl, aryl, arylalkyl, heteroalkyl, or heteroaryl, each of which may be optionally substituted. Heteroalkyl also includes groups in which a carbon atom of the alkyl is oxidized (i.e., is —C(O)—).

As used herein, the term “hydroxy” means a —OH group.

As used herein, the term “hydroxyalkyl” means an alkyl group, as defined herein, in which at least one hydrogen atom is replaced with a hydroxy group. Representative examples of hydroxyalkyl include, but are not limited to, hydroxymethyl, 2-hydroxyethyl, 2-hydroxypropyl, 3-hydroxypropyl, and 4-hydroxybutyl.

As used herein, the term “mercapto” means a —SH group.

As used herein, the term “mercaptoalkyl” means an alkyl group, as defined herein, in which at least one hydrogen atom is replaced with a mercapto group. Representative examples of mercaptoalkyl include, but are not limited to, mercaptomethyl, 2-mercaptoethyl, 2-mercaptopropyl, 3-mercaptopropyl, and 4-mercaptobutyl.

As used herein, in chemical structures the indication:

embedded image

represents a point of attachment of one moiety to another moiety (e.g., a substituent group to the rest of the compound).

For compounds described herein, groups and substituents thereof may be selected in accordance with permitted valence of the atoms and the substituents, such that the selections and substitutions result in a stable compound, e.g., which does not spontaneously undergo transformation such as by rearrangement, cyclization, elimination, etc.

When substituent groups are specified by their conventional chemical formula, written from left to right, such indication also encompass substituent groups resulting from writing the structure from right to left. For example, if a bivalent group is shown as —CH₂O—, such indication also encompasses —OCH₂—; similarly, —OC(O)NH— also encompasses —NHC(O)O—. When linker moieties are shown, the linkers can be attached to other moieties of the compound in either direction.

As used herein, the term “bioactive compound” refers generally to any physiologically or pharmacologically active substance. In some embodiments, a bioactive agent is a potential therapeutic compound (e.g., small molecule, peptide, nucleic acid, etc.) or drug-like molecule.

As used herein, the term “capture protein” refers to a protein or other molecular entity that forms a stable interaction (e.g., a covalent bond or a stable non-covalent interaction) with its substrate, ligand, or other molecular element upon interaction therewith. A capture protein may be a receptor that forms a covalent bond upon binding its ligand or an enzyme that forms a covalent bond with its substrate. An example of a suitable capture protein for use in embodiments of the present invention is the HALOTAG protein described in U.S. Pat. No. 7,425,436 (herein incorporated by reference in its entirety). A capture protein may also be a protein that has a strong non-covalent interaction with its corresponding capture element, such as streptavidin.

As used herein, the term “capture element” refers to a ligand, substrate, etc., that interacts with a corresponding capture protein (e.g., via a covalent bond or a stable non-covalent interaction). An example of a suitable capture element for use in embodiments of the present invention is the HALOTAG ligand described, for example, in U.S. Pat. No. 7,425,436 (herein incorporated by reference in its entirety). Moieties that find use as HALOTAG ligands include haloalkane (HA) groups (e.g., chloroalkane (CA) groups). In embodiments described herein that specify an HA or CA capture element, other suitable capture elements may be substituted unless otherwise specified. Another capture element is biotin.

DETAILED DESCRIPTION

The need to study and map dynamic microenvironments, molecular networks, small molecule interactions as well as protein-protein and protein-nucleic acid interactions in physiologically relevant contexts create a demand for new functional biological tools to enable such analyses in live cells and complex models in a nondestructive fashion. Photoactivatable compounds comprising a functional moiety linked to photoreactive group that can undergo activation upon covalent crosslinking with biomolecules offers a solution to this need by enabling labeling of biomolecules with fluorophores for detection, capture elements for enrichment and identification as well as labeling with a bifunctional moiety comprising a bioactive compound and either a fluorophore or a capture element or a reactive moiety for photoaffinity labeling and subsequent detection and/or enrichment, etc.

Provided herein are compounds, compositions, systems, and methods for photoactivated labeling of biomolecules, which can be actuated within biological systems. In particular, compounds disclosed herein include photoactivatable moieties that generate reactive intermediates upon exposure to light, and subsequently form covalent linkages with biomolecules. The photoactivation can be conducted by a variety of mechanisms including ultraviolet (UV) irradiation, visible light irradiation, or energy transfer. The compounds also include functional moieties that provide useful functionalities, for example detection and/or enrichment of biomolecules, such as fluorophores, capture elements (e.g., biotin), reactive moieties (e.g., click handles), or bifunctional moiety comprising a bioactive compound and either a fluorophore or a capture element or a reactive moiety. These bioorthogonal labeling chemistries can be leveraged for a broad range of phenotypic, proteomic, and genomic analyses.

Compounds

Disclosed herein is a compound of formula (I):

embedded image

- or a salt thereof, wherein:
- A is selected from:

embedded image

- - wherein:
  - each n is independently 1, 2, 3, or 4; and
  - each R is independently selected from hydrogen, halo, C₁-C₄alkyl, C₂-C₄alkenyl, hydroxy, mercapto, amino, cyano, C₁-C₄-alkoxy, halo-C₁-C₄-alkyl, hydroxy-C₁-C₄-alkyl, amino-C₁-C₄-alkyl, mercapto-C₁-C₄-alkyl, cyano-C₁-C₄-alkyl, —C(O)—C₁-C₄-alkyl, —C(O)OH, and —C(O)NH₂;
- R′ is hydrogen or C₁-C₄alkyl;
- L is a linker; and
- Y is a functional moiety.