CHEMICAL TOOLS FOR DRUG TARGET IDENTIFICATION AND CHARACTERIZATION

BACKGROUND

Covalent inhibitors have re-emerged as compelling alternatives to reversible, small molecule drugs (Singh et al., Nat. Rev. Drug Discov., 10(4):307-317 (2011); Kalgutkar et al., Expert Opin. Drug Discov., 7(7):561-581 (2012); Lagoutte et al., Curr. Opin. Chem. Biol. 39:54-63 (2007)). Structure-guided design enables improved selectivity through a combination of a moderate-affinity scaffold which positions an electrophilic warhead in proximity to a non-conserved nucleophilic amino acid (e.g., cysteine) on the protein target. Other advantages of covalent inhibitors compared to their reversible counterparts include high efficacy at lower concentration and less frequent dosing, complete target inhibition, with activity restored only after de novo synthesis of new protein, and wider tolerance for pharmacokinetic parameters. FDA-approved covalent drugs span several clinical indications (Singh et al., Nat. Rev. Drug Discov., 10(4):307-317 (2011); Mah et al., Bioorg. Med. Chem. Lett., 24(1):33-39 (2014); De Cesco et al., Eur. J. Med. Chem., 138:96-114 (2017); Bauer, Drug Discov. Today, 20(9):1061-1073 (2015)). The successes of neratinib (targeting HER2/EGFR, breast cancer) (Mundhenke et al., Breast Care (Basel), 4(6):373-378 (2009)), afatinib (targeting EGFR, non-small cell lung cancer, NSCLC) (Engle et al., Am. J. Health Syst. Pharm., 71(22):1933-8193 (2014)), osimertinib (targeting mutant-EGFR, NSCLC) (Cross et al., Cancer Discov., 4(9):1046-1061 (2014)), ibrutinib and acalabrutinib (targeting BTK, chronic lymphocytic leukemia, CLL) (Herman et al., Blood, 117(23):6287-6296 (2011); Byrd et al., N. Engl. J. Med., 374(4):323-332 (2016)) have driven renewed enthusiasm for new covalent drugs for cancer therapy.

The pharmacologic benefits of covalent drugs are tempered by concerns that modification of off-target proteins may result in toxicity. An elegant chemoproteomic study classified over 1,000 cysteine residues in the proteome as ‘hyper-reactive’ (Weerapana et al., Nature, 468(7325):790-795 (2010)), or prone to spurious modification by covalent compounds. In principle, chemoproteomic methods provide a powerful approach to identify potential off-target liabilities by quantifying covalent inhibitor binding across the proteome. However, in practice these approaches are typically reserved for late-stage inhibitors, meaning that off-target liabilities may not be discovered until much later in the development pipeline.

SUMMARY

A first aspect of the present invention is directed to compounds represented by formulas I and II:

embedded image

wherein R₁, R₂, R₃, X₁, X₂, A, m, and n are as defined herein, or a pharmaceutically acceptable salt or stereoisomer thereof.

Other aspects of the present invention are directed to compounds represented by formulas III and IV:

embedded image

wherein R₁, R₂, R₃, R₄, R₅, X₁, X₂, A, L, m, and n are as defined herein, or a pharmaceutically acceptable salt or stereoisomer thereof.

Further aspects of the present invention are directed to processes of preparing compounds of formulas III and IV. Processes for making compounds of formula III entail reacting a compound of formula I with a compound of formula V,

embedded image

Processes for making compounds of formula IV entail reacting a compound of formula II with a compound of formula V.

Another aspect of the present invention is directed to a composition that includes a compound of formula I-IV or a pharmaceutically acceptable salt or stereoisomer thereof, and a carrier.

Further aspects of the present invention are directed to methods of identifying cysteine residues on a polypeptide that may be targeted by a compound, comprising:

- reacting a compound of formula I-IV (“probe”) with a polypeptide, thereby alkylating the polypeptide at cysteine residues therein;
- digesting the alkylated polypeptide with at least one proteolytic enzyme, thereby producing probe-labeled peptide fragments of the alkylated polypeptide;
- isolating the probe-labeled peptide fragments on a solid phase support;
- contacting the thus-isolated probe-labeled peptide fragments with a diboron reagent, thereby releasing/eluting probe-labeled peptide fragments at cysteine residues thereof, and identifying the cysteine residues on the polypeptide.

Another aspect of the present invention is directed to a method of quantifying the number of cysteine residues on a polypeptide that are targeted by a compound, comprising:

- (i) reacting a compound of formula I-IV (“probe”) at a fixed concentration with a first mixture comprising one or more polypeptides to form a second mixture comprising one or more compounds of formula I-IV-polypeptide conjugates, wherein each of the compounds of formula I-IV-polypeptide conjugates comprise one or more thioether bonds;
- (ii) repeating (i) for a first labelled compound of formula I-IV containing one stable isotope to form a third mixture comprising one or more isotopically labeled compounds of formula I-IV-polypeptide conjugates, wherein each of the isotopically labeled compounds of formula I-IV-polypeptide conjugates comprise one or more thioether bonds;
  - a) repeating (ii) for a second labelled compound of formula I-IV containing two stable isotopes to form a fourth mixture comprising one or more isotopically labeled compounds of formula I-IV-polypeptide conjugates, wherein each of the isotopically labeled compounds of formula I-IV-polypeptide conjugates comprise one or more thioether bonds;
  - b) repeating (ii) up to 6 more times for each successive compound of formula I-IV containing more than two stable isotopes;
- (iii) combining the individual mixtures formed in (i) and (ii) to form a combined mixture;
- (iv) enzymatically digesting the combined mixture to form a mixture of peptides comprising a combination of (a) one or more compounds of formula I-IV-polypeptide conjugates and (b) one or more isotopically labeled compounds of formula I-IV-polypeptide conjugates, whereby each conjugate is formed through one or more thioether bonds;
- (v) capturing the polypeptide conjugates on a solid phase support;
- (vi) contacting the thus-isolated polypeptide conjugates with a diboron reagent, thereby releasing the polypeptides;
- (vii) analyzing the polypeptides via a targeted mass spectrometry assay;
- (viii) detecting one or more thiolated ions, or derivatives ions thereof produced in the targeted mass spectrometry assay; and
- (ix) determining target engagement stoichiometry for the compound of formula I-IV-polypeptide conjugate based on the ratio of thiolated ions, or derivative ions thereof, derived from the isotopically labeled to the unlabeled compounds of formula I-IV-peptide conjugates produced in the targeted mass spectrometry assay.

Another aspect of the present invention is directed to a method of quantifying the number of cysteine residues on a polypeptide that are targeted by a compound, comprising:

- (i) reacting a compound of formula I-IV (“probe”) at a fixed concentration with a first mixture comprising one or more polypeptides to form a second mixture comprising one or more compounds of formula I-IV-polypeptide conjugates, wherein each of the compounds of formula I-IV-polypeptide conjugates comprise one or more thioether bonds;
- (ii) repeating (i)×3 to form a third, fourth, and fifth mixtures;
- (iii) enzymatically digesting the second, third, fourth, and fifth mixtures;
- (iv) capturing the polypeptides on a solid phase support;
- (v) isotopically labeling the captured polypeptides from the third, fourth, and fifth mixtures;
- (vi) combining the individually captured polypeptides to form a combined mixture of captured polypeptides;
- (vi) contacting the captured polypeptides with a diboron reagent, thereby releasing the polypeptides;
- (vii) analyzing the polypeptides via a targeted mass spectrometry assay;
- (viii) detecting one or more thiolated ions, or derivatives ions thereof produced in the targeted mass spectrometry assay; and
- (ix) determining target engagement stoichiometry for the compound of formula I-IV-polypeptide conjugate based on the ratio of thiolated ions, or derivative ions thereof, derived from the isotopically labeled to the unlabeled compounds of formula I-IV-peptide conjugates produced in the targeted mass spectrometry assay.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a schematic depicting the modular assembly of chemoproteomic tools.

FIG. 2A is a schematic showing that thioether bonds exhibit unique tandem mass spectrometry (MS/MS) fragmentation behavior. FIG. 2B illustrates structures of small molecule alkylating scaffolds.

FIG. 3 is an image of capillary electrophoresis-mass spectrometry (CE-MS) demonstrating >95% efficiency at each step with new reagents. Scale of y-axis is the same for extracted ion chromatography shown on upper and lower plots.

FIG. 4 is a Western blot of live K562 cells that were treated with iodoacetamide (IAA) or with iodo-methyl imidazole (IMIA) for indicated times. Additional cultures were treated with DMSO or THZ1 for 6 hours. Cell extracts were treated with THZ1-DTB to label any remaining CDK7-Cys312, followed by streptavidin PD and CDK7 Western blot.

FIG. 5 is a Western blot and target occupancy (TO) assay for THZ1-CDK7. Position of isotopes are indicated by stars overlaying each IMIA 4plex reagent. Red traces show extracted ion chromatograms for thiolated ions detected in high energy MS/MS. Samples were processed in parallel for THZ1-DTB PD and CDK7 Western blot.

FIG. 6 is a schematic showing alkylation and release of cyclooctyne (CO)-caged scaffolds.

FIG. 7A-FIG. 7B show an alternative route for functionalizing reporter scaffolds. FIG. 7A schematically shows different modes of cyclooctyne attachment. FIG. 7B shows that cyclooctynes can be attached at a secondary site on the reporter, should the single attachment point be synthetically inaccessible.

FIG. 8 shows representative reporter scaffolds. Each base scaffold circled in dashed line is modified at the sites marked with an asterisk to optimize the yield of thiolated reporter ions.

FIG. 9 is a schematic showing structurally distinct compounds as chemical bar codes to encode dose-response of different covalent inhibitors or electrophilic fragments and the read out of data generated from a single high-content chemical proteomics assay.

FIG. 10 is a bar graph and heatmap showing the binding activity of a broad covalent kinase inhibitor, a broad covalent DUB inhibitor, and the covalent clinical drug Ibrutinib using the chemoproteomic covalent screen.

DETAILED DESCRIPTION

Unless defined otherwise, all technical and scientific terms used herein have the same meaning as is commonly understood by one of skill in the art to which the subject matter herein belongs. As used in the specification and the appended claims, unless specified to the contrary, the following terms have the meaning indicated in order to facilitate the understanding of the present invention.

As used in the description and the appended claims, the singular forms “a”, “an”, and “the” include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to “a composition” includes mixtures of two or more such compositions, reference to “an inhibitor” includes mixtures of two or more such inhibitors, and the like.

Unless specifically stated or obvious from context, as used herein, the term “about” is understood as within a range of normal tolerance in the art, for example within 2 standard deviations of the mean. “About” can be understood as within 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1%, 0.5%, 0.1%, 0.05%, or 0.01% of the stated value. Unless otherwise clear from context, all numerical values provided herein are modified by the term “about”.

The transitional term “comprising,” which is synonymous with “including,” “containing,” or “characterized by,” is inclusive or open-ended and does not exclude additional, unrecited elements or method steps. When used in the context of the number of heteroatoms in a heterocyclic structure, it means that the heterocyclic group that that minimum number of heteroatoms. By contrast, the transitional phrase “consisting of” excludes any element, step, or ingredient not specified in the claim. The transitional phrase “consisting essentially of” limits the scope of a claim to the specified materials or steps “and those that do not materially affect the basic and novel characteristic(s)” of the claimed invention.

With respect to compounds of the present invention, and to the extent the following terms are used herein to further describe them, the following definitions apply.

As used herein, the term “alkyl” refers to a saturated linear or branched-chain monovalent hydrocarbon radical. In one embodiment, the alkyl radical is a C₁-C₁₈group. In other embodiments, the alkyl radical is a C₀-C₆, C₀-C₅, C₀-C₃, C₁-C₁₂, C₁-C₈, C₁-C₆, C₁-C₅, C₁-C₄or C₁-C₃group (wherein C₀alkyl refers to a bond). Examples of alkyl groups include methyl, ethyl, 1-propyl, 2-propyl, i-propyl, 1-butyl, 2-methyl-1-propyl, 2-butyl, 2-methyl-2-propyl, 1-pentyl, n-pentyl, 2-pentyl, 3-pentyl, 2-methyl-2-butyl, 3-methyl-2-butyl, 3-methyl-1-butyl, 2-methyl-1-butyl, 1-hexyl, 2-hexyl, 3-hexyl, 2-methyl-2-pentyl, 3-methyl-2-pentyl, 4-methyl-2-pentyl, 3-methyl-3-pentyl, 2-methyl-3-pentyl, 2,3-dimethyl-2-butyl, 3,3-dimethyl-2-butyl, heptyl, octyl, nonyl, decyl, undecyl and dodecyl. In some embodiments, an alkyl group is a C₁-C₃alkyl group. In some embodiments, an alkyl group is a C₁-C₂alkyl group, or a methyl group.

As used herein, the term “alkylene” refers to a straight or branched divalent hydrocarbon chain linking the rest of the molecule to a radical group, consisting solely of carbon and hydrogen, containing no unsaturation and having from one to 12 carbon atoms, for example, methylene, ethylene, propylene, n-butylene, and the like. The alkylene chain may be attached to the rest of the molecule through a single bond and to the radical group through a single bond. In some embodiments, the alkylene group contains one to 8 carbon atoms (C₁-C₈alkylene). In other embodiments, an alkylene group contains one to 5 carbon atoms (C₁-C₅alkylene). In other embodiments, an alkylene group contains one to 4 carbon atoms (C₁-C₄alkylene). In other embodiments, an alkylene contains one to three carbon atoms (C₁-C₃alkylene). In other embodiments, an alkylene group contains one to two carbon atoms (C₁-C₂alkylene). In other embodiments, an alkylene group contains one carbon atom (C₁alkylene).

As used herein, the term “alkenyl” refers to a linear or branched-chain monovalent hydrocarbon radical with at least one carbon-carbon double bond. An alkenyl includes radicals having “cis” and “trans” orientations, or alternatively, “E” and “Z” orientations. In one example, the alkenyl radical is a C₂-C₁₈group. In other embodiments, the alkenyl radical is a C₂-C₁₂, C₂-C₁₀, C₂-C₈, C₂-C₆or C₂-C₃group. Examples include ethenyl or vinyl, prop-1-enyl, prop-2-enyl, 2-methylprop-1-enyl, but-1-enyl, but-2-enyl, but-3-enyl, buta-1,3-dienyl, 2-methylbuta-1,3-diene, hex-1-enyl, hex-2-enyl, hex-3-enyl, hex-4-enyl and hexa-1,3-dienyl.

As used herein, the term “alkynyl” refers to a linear or branched monovalent hydrocarbon radical with at least one carbon-carbon triple bond. In one example, the alkynyl radical is a C₂-C₁₈group. In other examples, the alkynyl radical is C₂-C₁₂, C₂-C₁₀, C₂-C₈, C₂-C₆or C₂-C₃. Examples include ethynyl prop-1-ynyl, prop-2-ynyl, but-1-ynyl, but-2-ynyl and but-3-ynyl.

The terms “alkoxyl” or “alkoxy” as used herein refer to an alkyl group, as defined above, having an oxygen radical attached thereto, and which is the point of attachment. Representative alkoxyl groups include methoxy, ethoxy, propyloxy, tert-butoxy and the like. An “ether” is two hydrocarbyl groups covalently linked by an oxygen. Accordingly, the substituent of an alkyl that renders that alkyl an ether is or resembles an alkoxyl, such as can be represented by one of —O-alkyl, —O-alkenyl, and —O-alkynyl.

As used herein, the term “halogen” (or “halo” or “halide”) refers to fluorine, chlorine, bromine, or iodine.

As used herein, the term “cyclic group” broadly refers to any group that used alone or as part of a larger moiety, contains a saturated, partially saturated or aromatic ring system e.g., carbocyclic (cycloalkyl, cycloalkenyl), heterocyclic (heterocycloalkyl, heterocycloalkenyl), aryl and heteroaryl groups. Cyclic groups may have one or more (e.g., fused) ring systems. Thus, for example, a cyclic group can contain one or more carbocyclic, heterocyclic, aryl or heteroaryl groups.

As used herein, the term “carbocyclic” (also “carbocyclyl”) refers to a group that used alone or as part of a larger moiety, contains a saturated, partially unsaturated, or aromatic ring system having 3 to 20 carbon atoms, that is alone or part of a larger moiety (e.g., an alkcarbocyclic group). The term carbocyclyl includes mono-, bi-, tri-, fused, bridged, and spiro-ring systems, and combinations thereof. In one embodiment, carbocyclyl includes 3 to 15 carbon atoms (C₃-C₁₅). In one embodiment, carbocyclyl includes 3 to 12 carbon atoms (C₃-C₁₂). In another embodiment, carbocyclyl includes C₃-C₈, C₃-C₁₀or C₅-C₁₀. In another embodiment, carbocyclyl, as a monocycle, includes C₃-C₈, C₃-C₆or C₅-C₆. In some embodiments, carbocyclyl, as a bicycle, includes C₇-C₁₂. In another embodiment, carbocyclyl, as a spiro system, includes C₅-C₁₂. Representative examples of monocyclic carbocyclyls include cyclopropyl, cyclobutyl, cyclopentyl, 1-cyclopent-1-enyl, 1-cyclopent-2-enyl, 1-cyclopent-3-enyl, cyclohexyl, perdeuteriocyclohexyl, 1-cyclohex-1-enyl, 1-cyclohex-2-enyl, 1-cyclohex-3-enyl, cyclohexadienyl, cycloheptyl, cyclooctyl, cyclononyl, cyclodecyl, cycloundecyl, phenyl, and cyclododecyl; bicyclic carbocyclyls having 7 to 12 ring atoms include [4,3], [4,4], [4,5], [5,5], [5,6] or [6,6] ring systems, such as for example bicyclo[2.2.1]heptane, bicyclo[2.2.2]octane, naphthalene, and bicyclo[3.2.2]nonane. Representative examples of spiro carbocyclyls include spiro[2.2]pentane, spiro[2.3]hexane, spiro[2.4]heptane, spiro[2.5]octane and spiro[4.5]decane. The term carbocyclyl includes aryl ring systems as defined herein. The term carbocyclyl also includes cycloalkyl rings (e.g., saturated or partially unsaturated mono-, bi-, or spiro-carbocycles). The term carbocyclic group also includes a carbocyclic ring fused to one or more (e.g., 1, 2 or 3) different cyclic groups (e.g., aryl or heterocyclic rings), where the radical or point of attachment is on the carbocyclic ring.

Thus, the term carbocyclic also embraces carbocyclylalkyl groups which as used herein refer to a group of the formula —R^c-carbocyclyl where R^cis an alkylene chain. The term carbocyclic also embraces carbocyclylalkoxy groups which as used herein refer to a group bonded through an oxygen atom of the formula —O—R^c-carbocyclyl where R^cis an alkylene chain.

As used herein, the term “aryl” used alone or as part of a larger moiety (e.g., “aralkyl”, wherein the terminal carbon atom on the alkyl group is the point of attachment, e.g., a benzyl group), “aralkoxy” wherein the oxygen atom is the point of attachment, or “aroxyalkyl” wherein the point of attachment is on the aryl group) refers to a group that includes monocyclic, bicyclic or tricyclic, carbon ring system, that includes fused rings, wherein at least one ring in the system is aromatic. In some embodiments, the aralkoxy group is a benzoxy group. The term “aryl” may be used interchangeably with the term “aryl ring”. In one embodiment, aryl includes groups having 6-18 carbon atoms. In another embodiment, aryl includes groups having 6-10 carbon atoms. Examples of aryl groups include phenyl, naphthyl, anthracyl, biphenyl, phenanthrenyl, naphthacenyl, 1,2,3,4-tetrahydronaphthalenyl, 1H-indenyl, 2,3-dihydro-1H-indenyl, naphthyridinyl, and the like, which may be substituted or independently substituted by one or more substituents described herein. A particular aryl is phenyl. In some embodiments, an aryl group includes an aryl ring fused to one or more (e.g., 1, 2 or 3) different cyclic groups (e.g., carbocyclic rings or heterocyclic rings), where the radical or point of attachment is on the aryl ring.

Thus, the term aryl embraces aralkyl groups (e.g., benzyl) which as disclosed above refer to a group of the formula —R^c-aryl where R^cis an alkylene chain such as methylene or ethylene. In some embodiments, the aralkyl group is an optionally substituted benzyl group. The term aryl also embraces aralkoxy groups which as used herein refer to a group bonded through an oxygen atom of the formula —O—R^c-aryl where R^cis an alkylene chain such as methylene or ethylene.

As used herein, the term “heterocyclyl” refers to a “carbocyclyl” that used alone or as part of a larger moiety, contains a saturated, partially unsaturated or aromatic ring system, wherein one or more (e.g., 1, 2, 3, or 4) carbon atoms have been replaced with a heteroatom (e.g., 0, N, N(O), S, S(O), or S(O)₂). The term heterocyclyl includes mono-, bi-, tri-, fused, bridged, and spiro-ring systems, and combinations thereof. In some embodiments, a heterocyclyl refers to a 3 to 15 membered heterocyclyl ring system. In some embodiments, a heterocyclyl refers to a 3 to 12 membered heterocyclyl ring system. In some embodiments, a heterocyclyl refers to a saturated ring system, such as a 3 to 12 membered saturated heterocyclyl ring system. In some embodiments, a heterocyclyl refers to a heteroaryl ring system, such as a 5 to 14 membered heteroaryl ring system. The term heterocyclyl also includes C₃-C₈heterocycloalkyl, which is a saturated or partially unsaturated mono-, bi-, or spiro-ring system containing 3-8 carbons and one or more (1, 2, 3 or 4) heteroatoms.

In some embodiments, a heterocyclyl group includes 3-12 ring atoms and includes monocycles, bicycles, tricycles and spiro ring systems, wherein the ring atoms are carbon, and one to 5 ring atoms is a heteroatom such as nitrogen, sulfur or oxygen. In some embodiments, heterocyclyl includes 3- to 7-membered monocycles having one or more heteroatoms selected from nitrogen, sulfur or oxygen. In some embodiments, heterocyclyl includes 4- to 6-membered monocycles having one or more heteroatoms selected from nitrogen, sulfur or oxygen. In some embodiments, heterocyclyl includes 3-membered monocycles. In some embodiments, heterocyclyl includes 4-membered monocycles. In some embodiments, heterocyclyl includes 5-6 membered monocycles. In some embodiments, the heterocyclyl group includes 0 to 3 double bonds. In any of the foregoing embodiments, heterocyclyl includes 1, 2, 3 or 4 heteroatoms. Any nitrogen or sulfur heteroatom may optionally be oxidized (e.g., NO, SO, SO₂), and any nitrogen heteroatom may optionally be quaternized (e.g., [NR₄]⁺Cl⁻, [NR₄]⁺OH⁻). Representative examples of heterocyclyls include oxiranyl, aziridinyl, thiiranyl, azetidinyl, oxetanyl, thietanyl, 1,2-dithietanyl, 1,3-dithietanyl, pyrrolidinyl, dihydro-1H-pyrrolyl, dihydrofuranyl, tetrahydropyranyl, dihydrothienyl, tetrahydrothienyl, imidazolidinyl, piperidinyl, piperazinyl, morpholinyl, thiomorpholinyl, 1,1-dioxo-thiomorpholinyl, dihydropyranyl, tetrahydropyranyl, hexahydrothiopyranyl, hexahydropyrimidinyl, oxazinanyl, thiazinanyl, thioxanyl, homopiperazinyl, homopiperidinyl, azepanyl, oxepanyl, thiepanyl, oxazepinyl, oxazepanyl, diazepanyl, 1,4-diazepanyl, diazepinyl, thiazepinyl, thiazepanyl, tetrahydrothiopyranyl, oxazolidinyl, thiazolidinyl, isothiazolidinyl, 1,1-dioxoisothiazolidinonyl, oxazolidinonyl, imidazolidinonyl, 4,5,6,7-tetrahydro[2H]indazolyl, tetrahydrobenzoimidazolyl, 4,5,6,7-tetrahydrobenzo[d]imidazolyl, 1,6-dihydroimidazol[4,5-d]pyrrolo[2,3-b]pyridinyl, thiazinyl, thiophenyl, oxazinyl, thiadiazinyl, oxadiazinyl, dithiazinyl, dioxazinyl, oxathiazinyl, thiatriazinyl, oxatriazinyl, dithiadiazinyl, imidazolinyl, dihydropyrimidyl, tetrahydropyrimidyl, 1-pyrrolinyl, 2-pyrrolinyl, 3-pyrrolinyl, indolinyl, thiapyranyl, 2H-pyranyl, 4H-pyranyl, dioxanyl, 1,3-dioxolanyl, pyrazolinyl, pyrazolidinyl, dithianyl, dithiolanyl, pyrimidinonyl, pyrimidindionyl, pyrimidin-2,4-dionyl, piperazinonyl, piperazindionyl, pyrazolidinylimidazolinyl, 3-azabicyclo[3.1.0]hexanyl, 3,6-diazabicyclo[3.1.1]heptanyl, 6-azabicyclo[3.1.1]heptanyl, 3-azabicyclo[3.1.1]heptanyl, 3-azabicyclo[4.1.0]heptanyl, azabicyclo[2.2.2]hexanyl, 2-azabicyclo[3.2.1]octanyl, 8-azabicyclo[3.2.1]octanyl, 2-azabicyclo[2.2.2]octanyl, 8-azabicyclo[2.2.2]octanyl, 7-oxabicyclo[2.2.1]heptane, azaspiro[3.5]nonanyl, azaspiro[2.5]octanyl, azaspiro[4.5]decanyl, 1-azaspiro[4.5]decan-2-only, azaspiro[5.5]undecanyl, tetrahydroindolyl, octahydroindolyl, tetrahydroisoindolyl, tetrahydroindazolyl, 1,1-dioxohexahydrothiopyranyl. Examples of 5-membered heterocyclyls containing a sulfur or oxygen atom and one to three nitrogen atoms are thiazolyl, including thiazol-2-yl and thiazol-2-yl N-oxide, thiadiazolyl, including 1,3,4-thiadiazol-5-yl and 1,2,4-thiadiazol-5-yl, oxazolyl, for example oxazol-2-yl, and oxadiazolyl, such as 1,3,4-oxadiazol-5-yl, and 1,2,4-oxadiazol-5-yl. Example 5-membered ring heterocyclyls containing 2 to 4 nitrogen atoms include imidazolyl, such as imidazol-2-yl; triazolyl, such as 1,3,4-triazol-5-yl; 1,2,3-triazol-5-yl, 1,2,4-triazol-5-yl, and tetrazolyl, such as 1H-tetrazol-5-yl. Representative examples of benzo-fused 5-membered heterocyclyls are benzoxazol-2-yl, benzthiazol-2-yl and benzimidazol-2-yl. Example 6-membered heterocyclyls contain one to three nitrogen atoms and optionally a sulfur or oxygen atom, for example pyridyl, such as pyrid-2-yl, pyrid-3-yl, and pyrid-4-yl; pyrimidyl, such as pyrimid-2-yl and pyrimid-4-yl; triazinyl, such as 1,3,4-triazin-2-yl and 1,3,5-triazin-4-yl; pyridazinyl, in particular pyridazin-3-yl, and pyrazinyl. The pyridine N-oxides and pyridazine N-oxides and the pyridyl, pyrimid-2-yl, pyrimid-4-yl, pyridazinyl and the 1,3,4-triazin-2-yl groups, are yet other examples of heterocyclyl groups. In some embodiments, a heterocyclic group includes a heterocyclic ring fused to one or more (e.g., 1, 2 or 3) different cyclic groups (e.g., carbocyclic rings or heterocyclic rings), where the radical or point of attachment is on the heterocyclic ring, and in some embodiments wherein the point of attachment is a heteroatom contained in the heterocyclic ring.

Thus, the term heterocyclic embraces N-heterocyclyl groups which as used herein refer to a heterocyclyl group containing at least one nitrogen and where the point of attachment of the heterocyclyl group to the rest of the molecule is through a nitrogen atom in the heterocyclyl group. Representative examples of N-heterocyclyl groups include 1-morpholinyl, 1-piperidinyl, 1-piperazinyl, 1-pyrrolidinyl, pyrazolidinyl, imidazolinyl and imidazolidinyl. The term heterocyclic also embraces C-heterocyclyl groups which as used herein refer to a heterocyclyl group containing at least one heteroatom and where the point of attachment of the heterocyclyl group to the rest of the molecule is through a carbon atom in the heterocyclyl group. Representative examples of C-heterocyclyl radicals include 2-morpholinyl, 2- or 3- or 4-piperidinyl, 2-piperazinyl, and 2- or 3-pyrrolidinyl. The term heterocyclic also embraces heterocyclylalkyl groups which as disclosed above refer to a group of the formula —R^c-heterocyclyl where R^cis an alkylene chain. The term heterocyclic also embraces heterocyclylalkoxy groups which as used herein refer to a radical bonded through an oxygen atom of the formula —O—R^c-heterocyclyl where R^cis an alkylene chain.

As used herein, the term “heteroaryl” used alone or as part of a larger moiety (e.g., “heteroarylalkyl” (also “heteroaralkyl”), or “heteroarylalkoxy” (also “heteroaralkoxy”), refers to a monocyclic, bicyclic or tricyclic ring system having 5 to 14 ring atoms, wherein at least one ring is aromatic and contains at least one heteroatom. In one embodiment, heteroaryl includes 5-6 membered monocyclic aromatic groups where one or more ring atoms is nitrogen, sulfur or oxygen. Representative examples of heteroaryl groups include thienyl, furyl, imidazolyl, pyrazolyl, thiazolyl, isothiazolyl, oxazolyl, isoxazolyl, triazolyl, thiadiazolyl, oxadiazolyl, tetrazolyl, thiatriazolyl, oxatriazolyl, pyridyl, pyrimidyl, imidazopyridyl, pyrazinyl, pyridazinyl, triazinyl, tetrazinyl, tetrazolo[1,5-b]pyridazinyl, purinyl, deazapurinyl, benzoxazolyl, benzofuryl, benzothiazolyl, benzothiadiazolyl, benzotriazolyl, benzoimidazolyl, indolyl, 1,3-thiazol-2-yl, 1,3,4-triazol-5-yl, 1,3-oxazol-2-yl, 1,3,4-oxadiazol-5-yl, 1,2,4-oxadiazol-5-yl, 1,3,4-thiadiazol-5-yl, 1H-tetrazol-5-yl, 1,2,3-triazol-5-yl, and pyrid-2-yl N-oxide. The term “heteroaryl” also includes groups in which a heteroaryl is fused to one or more cyclic (e.g., carbocyclyl, or heterocyclyl) rings, where the radical or point of attachment is on the heteroaryl ring. Nonlimiting examples include indolyl, indolizinyl, isoindolyl, benzothienyl, benzothiophenyl, methylenedioxyphenyl, benzofuranyl, dibenzofuranyl, indazolyl, benzimidazolyl, benzodioxazolyl, benzthiazolyl, quinolyl, isoquinolyl, cinnolinyl, phthalazinyl, quinazolinyl, quinoxalinyl, 4H-quinolizinyl, carbazolyl, acridinyl, phenazinyl, phenothiazinyl, phenoxazinyl, tetrahydroquinolinyl, tetrahydroisoquinolinyl and pyrido[2,3-b]-1,4-oxazin-3(4H)-one. A heteroaryl group may be mono-, bi- or tri-cyclic. In some embodiments, a heteroaryl group includes a heteroaryl ring fused to one or more (e.g., 1, 2 or 3) different cyclic groups (e.g., carbocyclic rings or heterocyclic rings), where the radical or point of attachment is on the heteroaryl ring, and in some embodiments wherein the point of attachment is a heteroatom contained in the heterocyclic ring.

Thus, the term heteroaryl embraces N-heteroaryl groups which as used herein refer to a heteroaryl group as defined above containing at least one nitrogen and where the point of attachment of the heteroaryl group to the rest of the molecule is through a nitrogen atom in the heteroaryl group. The term heteroaryl also embraces C-heteroaryl groups which as used herein refer to a heteroaryl group as defined above and where the point of attachment of the heteroaryl group to the rest of the molecule is through a carbon atom in the heteroaryl group. The term heteroaryl also embraces heteroarylalkyl groups which as disclosed above refer to a group of the formula —R^c-heteroaryl, wherein R^cis an alkylene chain as defined above. The term heteroaryl also embraces heteroaralkoxy (or heteroarylalkoxy) groups which as used herein refer to a group bonded through an oxygen atom of the formula —O—R^c-heteroaryl, where R^cis an alkylene group as defined above.

As used herein, the term “arene” refers to a bivalent aryl radical which may be optionally substituted.

As used herein, the term “heterocyclene” refers to a bivalent heterocyclyl radical which may be optionally substituted.

As used herein, the term “heteroarylene” refers to a bivalent heteroaryl radical which may be optionally substituted.

Unless stated otherwise, and to the extent not further defined for any particular group(s), any of the groups described herein may be substituted or unsubstituted. As used herein, the term “substituted” broadly refers to all permissible substituents with the implicit proviso that such substitution is in accordance with permitted valence of the substituted atom and the substituent, and that the substitution results in a stable compound, i.e. a compound that does not spontaneously undergo transformation such as by rearrangement, cyclization, elimination, etc. Representative substituents include halogens, hydroxyl groups, and any other organic groupings containing any number of carbon atoms, e.g., 1-14 carbon atoms, and which may include one or more (e.g., 1, 2, 3, or 4) heteroatoms such as oxygen, sulfur, and nitrogen grouped in a linear, branched, or cyclic structural format.

To the extent not disclosed otherwise for any particular group(s), representative examples of substituents may include alkyl, substituted alkyl (e.g., C₁-C₆, C₁-C₅, C₁-C₄, C₁-C₃, C₁-C₂, C₁), alkoxy (e.g., C₁-C₆, C₁-C₅, C₁-C₄, C₁-C₃, C₁-C₂, C₁), substituted alkoxy (e.g., C₁-C₆, C₁-C₅, C₁-C₄, C₁-C₃, C₁-C₂, C₁), haloalkyl (e.g., CF₃), alkenyl (e.g., C₂-C₆, C₂-C₅, C₂-C₄, C₂-C₃, C₂), substituted alkenyl (e.g., C₂-C₆, C₂-C₅, C₂-C₄, C₂-C₃, C₂), alkynyl (e.g., C₂-C₆, C₂-C₅, C₂-C₄, C₂-C₃, C₂), substituted alkynyl (e.g., C₂-C₆, C₂-C₅, C₂-C₄, C₂-C₃, C₂), cyclic (e.g., C₃-C₁₂, C₅-C₆), substituted cyclic (e.g., C₃-C₁₂, C₅-C₆), carbocyclic (e.g., C₃-C₁₂, C₅-C₆), substituted carbocyclic (e.g., C₃-C₁₂, C₅-C₆), heterocyclic (e.g., C₃-C₁₂, C₅-C₆), substituted heterocyclic (e.g., C₃-C₁₂, C₅-C₆), aryl (e.g., benzyl and phenyl), substituted aryl (e.g., substituted benzyl or phenyl), heteroaryl (e.g., pyridyl or pyrimidyl), substituted heteroaryl (e.g., substituted pyridyl or pyrimidyl), aralkyl (e.g., benzyl), substituted aralkyl (e.g., substituted benzyl), halo, hydroxyl, aryloxy (e.g., C₆-C₁₂, C₆), substituted aryloxy (e.g., C₆-C₁₂, C₆), alkylthio (e.g., C₁-C₆), substituted alkylthio (e.g., C₁-C₆), arylthio (e.g., C₆-C₁₂, C₆), substituted arylthio (e.g., C₆-C₁₂, C₆), cyano, carbonyl, substituted carbonyl, carboxyl, substituted carboxyl, amino, substituted amino, amido, substituted amido, thio, substituted thio, sulfinyl, substituted sulfinyl, sulfonyl, substituted sulfonyl, sulfinamide, substituted sulfinamide, sulfonamide, substituted sulfonamide, urea, substituted urea, carbamate, substituted carbamate, amino acid, and peptide groups.

As used herein, the term “electron donating group” refers to an atom or functional group that releases electron density to neighboring atoms from itself, usually by resonance or inductive effects.

As used herein, the term “electron withdrawing group” refers to an atom or functional group that draws electron density from neighboring atoms to itself, usually by resonance or inductive effects.

As used herein, the term “ionizable group” refers to any uncharged group in a molecular entity that is capable of dissociating by yielding an ion (usually an H⁺ ion) or an electron and itself becoming oppositely charged.

As used herein, the term “small molecule” refers to a molecule, whether naturally-occurring or artificially created (e.g., via chemical synthesis) that has a relatively low molecular weight. Typically, a small molecule is an organic compound (i.e., it contains carbon). The small molecule may contain multiple carbon-carbon bonds, stereocenters, and other functional groups (e.g., amines, hydroxyl, carbonyls, and heterocyclic rings, etc.).

In one aspect, compounds of the invention are represented by formula I or II:

embedded image

wherein,

- X₁is NR₁, O, S, S(O), or S(O)₂;
- each X₂is independently C(R₁)₂, NR₁, O, C(O), C₆-C₁₀aryl, or —OCH₂CH₂—;
- each R₁is independently hydrogen, C₁-C₆alkyl, C₆-C₁₀aryl, or 5- to 10-membered heteroaryl, wherein said alkyl, aryl, or heteroaryl is optionally substituted;
- R₂is absent or NH;
- R₃is absent, C(O), or C₁-C₃alkylene;
- A is C₁-C₆alkyl, C₆-C₁₀aryl, C₆-C₁₀arene, 5- to 10-membered heteroaryl, 5- to 10-membered heteroarylene, or a small molecule, wherein said alkyl, aryl, or heteroaryl is optionally substituted;
- m is an integer from 0-5; and
- n is an integer from 0-10;
- or a pharmaceutically acceptable salt or stereoisomer thereof.

In some embodiments, X₁is O. In some embodiments, X₁is S. In some embodiments, X₁is S(O) or S(O)₂. In some embodiments, X₁is NR₁and R₁is H.

In some embodiments, each X₂is independently C(R₁)₂, NR₁, O, C(O), or —OCH₂CH₂—. In some embodiments, each X₂is independently CHR₁, CH₂, NR₁, O, or C(O). In some embodiments, each X₂is independently CHR₁, CH₂, NR₁, C(O), or —OCH₂CH₂—. In some embodiments, each X₂is independently CH₂, NR₁, or C(O). In some embodiments, each X₂is independently NR₁, C(O), or —OCH₂CH₂—. In some embodiments, each X₂is independently CH₂, NR₁, or C(O).

In some embodiments, R₃is absent. In some embodiments, R₃is C₂alkylene.

In some embodiments, n is 0. In some embodiments, n is 1. In some embodiments, n is 2. In some embodiments, n is 3. In some embodiments, n is 4. In some embodiments, n is 5.

In some embodiments, m is 0. In some embodiments, m is 1. In some embodiments, m is 2. In some embodiments, m is 3. In some embodiments, m is 5.

In some embodiments, each carbon, nitrogen, and oxygen of the compound of formula I or II is substituted with a stable isotope thereof, wherein the isotope is selected from ¹³C, ¹⁵N, and ¹⁸O.

In some embodiments, the compound of formula I or II contains 1-8 isotopes. In some embodiments, the compound of formula I or II contains 1-4 isotopes. In some embodiments, the compound of formula I or II contains 3 isotopes. In some embodiments, the compound of formula I or II contains 2 isotopes. In some embodiments, the isotope is ²H, ¹³C, ¹⁵N, or ¹⁸O, or a combination of two or more thereof. Possible sites of the compound of formula I or II that can contain isotope(s) are indicated by enclosure in dashed line:

embedded image

In some embodiments, A is imidazolyl, thiazolyl, furanyl, pyridinyl, triazolyl, or phenyl, and wherein A is optionally substituted.

In some embodiments, A is substituted with one or more electron donating groups. In some embodiments, the electron donating group is —OH, C₁-C₆alkyl, or C₁-C₆alkoxyl.

In some embodiments, A is substituted with one or more electron withdrawing groups.

In some embodiments, the electron withdrawing group is —CN, —COOH, or NO₂.

In some embodiments, A is substituted with one or more ionizable groups. In some embodiments, the ionizable group is NH₂.

In some embodiments, A is a small molecule. In certain embodiments, the molecular weight of the small molecule is not more than about 1,000 g/mol, not more than about 900 g/mol, not more than about 800 g/mol, not more than about 700 g/mol, not more than about 600 g/mol, not more than about 500 g/mol, not more than about 400 g/mol, not more than about 300 g/mol, not more than about 200 g/mol, or not more than about 100 g/mol. In certain embodiments, the molecular weight of the small molecule is at least about 100 g/mol, at least about 200 g/mol, at least about 300 g/mol, at least about 400 g/mol, at least about 500 g/mol, at least about 600 g/mol, at least about 700 g/mol, at least about 800 g/mol, or at least about 900 g/mol, or at least about 1,000 g/mol.

In some embodiments, A is an optionally substituted C₃-C₁₂carbocyclyl.

In some embodiments, A is an optionally substituted C₆-C₁₄aryl.

In some embodiments, A is an optionally substituted C₆-C₁₄arene

In some embodiments, A is an optionally substituted 3- to 10-membered heterocyclyl.

In some embodiments, A is an optionally substituted 3- to 10-membered heterocyclene.

In some embodiments, A is an optionally substituted 5- to 10-membered heteroaryl.

In some embodiments, A is an optionally substituted 5- to 10-membered heteroarylene.

In some embodiments, A contains one or more substituents and each substituent for a compound of formula (I or II) is independently alkyl, alkenyl, alkynyl, halo, haloalkyl, cycloalkyl, heterocycloalkyl, hydroxy, alkoxy, cycloalkoxy, heterocycloalkoxy, haloalkoxy, aryloxy, heteroaryloxy, aralkyloxy, alkyenyloxy, alkynyloxy, amino, alkylamino, cycloalkylamino, heterocycloalkylamino, arylamino, heteroarylamino, aralkylamino, N-alkyl-N-arylamino, N-alkyl-N-heteroarylamino, N-alkyl-N-aralkylamino, hydroxyalkyl, aminoalkyl, alkylthio, haloalkylthio, alkylsulfonyl, haloalkylsulfonyl, cycloalkylsulfonyl, heterocycloalkylsulfonyl, arylsulfonyl, heteroarylsulfonyl, aminosulfonyl, alkylaminosulfonyl, cycloalkylaminosulfonyl, heterocycloalkylaminosulfonyl, arylaminosulfonyl, heteroarylaminosulfonyl, N-alkyl-N-arylaminosulfonyl, N-alkyl-N-heteroarylaminosulfonyl, formyl, alkylcarbonyl, haloalkylcarbonyl, alkenylcarbonyl, alkynylcarbonyl, carboxy, alkoxycarbonyl, alkylcarbonyloxy, amino, alkylsulfonylamino, haloalkylsulfonylamino, cycloalkylsulfonylamino, heterocycloalkylsulfonylamino, arylsulfonylamino, heteroarylsulfonylamino, aralkylsulfonylamino, alkylcarbonylamino, haloalkylcarbonylamino, cycloalkylcarbonylamino, heterocycloalkylcarbonylamino, arylcarbonylamino, heteroarylcarbonylamino, aralkylsulfonylamino, aminocarbonyl, alkylaminocarbonyl, cycloalkylaminocarbonyl, heterocycloalkylaminocarbonyl, arylaminocarbonyl, heteroarylaminocarbonyl, N-alkyl-N-arylaminocarbonyl, N-alkyl-N-heteroarylaminocarbonyl, cyano, nitro, or azido.

Representative examples of compounds of formula II include:

embedded image

Other inventive compounds of the invention are represented by formulas III or IV:

embedded image

- wherein,
- X₁is NR₁, O, S, S(O), or S(O)₂;
- each X₂is independently C(R₁)₂, NR₁, O, C(O), C₆-C₁₀aryl, or —OCH₂CH₂—;
- each R₁is independently hydrogen, C₁-C₆alkyl, C₆-C₁₀aryl, or 5- to 10-membered heteroaryl,
- wherein said alkyl, aryl, or heteroaryl is optionally substituted;
- R₂is absent or NH;
- R₃is absent, C(O), or C₁-C₃alkylene;
- R₄is an affinity handle, a bead, or a combination of these;
- R₅is C₁-C₆alkyl;
- L is an alkylene chain or a PEG chain;
- A is C₁-C₆alkyl, C₆-C₁₀aryl, C₆-C₁₀arene, 5- to 10-membered heteroaryl, 5- to 10-membered heteroarylene, or a small molecule, wherein said alkyl, aryl, or heteroaryl is optionally substituted;
- m is an integer from 0-5; and
- n is an integer from 0-10,
- or a pharmaceutically acceptable salt or stereoisomer thereof.

In some embodiments, X₁is O. In some embodiments, X₁is S. In some embodiments, X₁is S(O) or S(O)₂. In some embodiments, X₁is NR₁and R₁is H.

In some embodiments, R₃is absent. In some embodiments, R₃is C₂alkylene.

In some embodiments, n is 0. In some embodiments, n is 1. In some embodiments, n is 2. In some embodiments, n is 3. In some embodiments, n is 4. In some embodiments, n is 5.

In some embodiments, m is 0. In some embodiments, m is 1. In some embodiments, m is 2. In some embodiments, m is 3. In some embodiments, m is 5.

In some embodiments, R₄is an affinity handle. The term “affinity handle” refers to a portion of a compound that targets it to an appropriate site of action, e.g., a targeted polypeptide. Representative examples of affinity handles that may be useful include small chemical compounds (such as biotin and derivatives thereof, e.g., desthiobiotin), amino acid (e.g., His and Leu) tags, typically ranging 2 to 20 amino acids in length, and in some embodiments, from 4 to 12 amino acids in length, such as the (His)₆tag, (His)₄tag, (His)₃tag, (His)₂tag, (Leu)₄tag, (Leu)₃tag, (Leu)₂tag, human influenza hemagglutinin (HA) tag, FLAG® tag, vesicular stomatitis virus glycoprotein (VSV-G) tag, herpes simplex virus (HSV) tag, and V5 tag, human O⁶-alkylguanine-DNA-alkyltransferase (hAGT), chitin binding protein (CBP), maltose binding protein (MBP), Strep-tag, glutathione-S-transferase (GST), SNAP-tag, and CLIP-tag.

In some embodiments, the affinity handle is chloroalkane (HaloTag).

In some embodiments, the affinity handle is biotin or a biotin derivative. Biotin and its derivatives have been widely used as molecular labels in the biotechnology industry for many years. Biotin derivatives that may be suitable for use in the present invention are disclosed in U.S. Pat. No. 8,318,696 and U.S. Patent Application Publication No. 2007/0020206, each of which is incorporated by reference.

In some embodiments, the affinity handle is a protein. In some embodiments, the protein is SNAP-tag or CLIP-tag.

Biotin and its derivatives have been widely used as molecular labels in the biotechnology industry for many years. Biotin derivatives that may be suitable for use in the present invention are disclosed in U.S. Pat. No. 8,318,696 and U.S. Patent Application Publication No. 2007/0020206, each of which is incorporated by reference.

In some embodiments, R₄is a bead. In some embodiments, the bead is a magnetic bead, polystyrene bead, or agarose bead.

In some embodiments, each carbon, nitrogen, and oxygen of the compound of formula III or IV is substituted with a stable isotope thereof, wherein the isotope is selected from ¹³C, ¹⁵N, and ¹⁸O.

In some embodiments, the compound of formula III or IV contains 1-8 isotopes. In some embodiments, the compound of formula III or IV contains 1-4 isotopes. In some embodiments, the compound of formula III or IV contains 3 isotopes. In some embodiments, the compound of formula III or IV contains 2 isotopes. In some embodiments, the isotope is ²H, ¹³C, ¹⁵N, or ¹⁸O, or a combination of two or more thereof. Possible sites of the compound of formula III or IV that can contain isotope(s) are indicated by enclosure in dashed line: