The present disclosure relates to compounds and methods of identifying covalent ligands for protein targets as used in medicinal chemistry.
Availability of various biological assays in high throughput format and access to large chemical libraries have enabled the rapid discovery of small molecules that elicit interesting phenotypes to cells and organisms. However, it is challenging to determine the mechanism of action of the hit compounds from such phenotypic screens. In particular, the cellular targets of the hits are often difficult to identify even with the aid of affinity chromatography, because the complex formed between the small molecule and its protein targets can fall apart during the enrichment and wash steps. The difficulty of target identification has severely limited the therapeutic potential of small molecules with interesting biological activities.
Small molecules that contain electrophilic groups can form covalent linkage with their target proteins, which can be exploited to facilitate the identification of their target proteins. In particular, the covalent modification of proteins at cysteine residues by small molecules has widespread applications in drug discovery. These applications rely on the availability of an arsenal of electrophiles with tunable reactivity and selectivity. Common electrophiles having high selectivity for reaction with the thiol group include halomethylketones, haloacetamides, maleimides, and α,β-unsaturated ketones.
Accordingly, because of the increasing interest in the therapeutic development of electrophilic compounds, there is a strong need for the discovery and preparation of novel electrophiles that exhibit distinct stereo-electronic properties from the ones that are commonly used now.
The present invention solves one or more problems of the prior art by providing a compound that is useful in identifying covalent ligands for target proteins. The compound is described by formula I or formula II:
wherein:
Rf1 is an alkyne-containing moiety or azido-containing moiety;
X1 is NR or O or X1 is absent with LK being directly bonded to Pc where R is H or C1-6 alkyl;
LK1, LK2 are each independently a hydrocarbon-containing linking group;
PC1 is aryl or heteroaryl;
PC2 is an alkyl, aryl, heteroaryl, cycloalkyl, or heteroatom-containing ring system; and
Lm is a leaving group-containing moiety or an electrophilic moiety.
In another embodiment of the present invention, a method of finding covalent ligands is provided. The method comprises contacting a protein-containing sample with any of the above-mentioned compounds to form a modified protein; and contacting the modified protein with a probe compound that includes an alkyne-containing moiety Rf1 or azido-containing moiety Rf2, with the proviso that when Rf1 is an alkyne-containing moiety, Rf2 is an azido-containing moiety or when Rf2 is an alkyne-containing moiety, Rf1 is an azido-containing moiety wherein Rf1 reacts with Rf2 to form a ring.
In another embodiment, a method of preparing one class of electrophilic compounds, 2-chloromethylquinolines, is provided. A condensation reaction was discovered to facilitate a single-step conversion of simple starting materials, chloroacetamides, into target molecules, 2-chloromethylquinolines. This method can be readily adapted to parallel synthesis in generating enormous chemical diversity.
In another embodiment, a novel chemoproteomic approach for rapid identification of covalent ligands for target proteins is provided. A library of tripartite probes that contain a heterocyclic core, an electrophilic ‘warhead’ and an alkyne tag, are designed, synthesized and used as a “bait” to covalently catch “prey” proteins in the proteome. This is a rather general approach that can be exercised for any combinations of heterocycles and electrophiles for target-ligand discovery.
In another embodiment, protein targets of 2-chloromethylquinolines are provided. A number of proteins were identified as proteomic targets of 2-chloromethylquinolines with submicromolar or micromolar affinity. They include glutathione S-transferase omega-1, heme oxygenase 2, and prostaglandin E synthase 2, which are involved in human diseases such as cancer, inflammation and immune disorders. These 2-chloromethylquinolines can thus serve as lead compounds for the development of novel therapeutics by modulating the activity of these target proteins under disease settings.
Reference will now be made in detail to presently preferred compositions, embodiments and methods of the present invention, which constitute the best modes of practicing the invention presently known to the inventors. The Figures are not necessarily to scale. However, it is to be understood that the disclosed embodiments are merely exemplary of the invention that may be embodied in various and alternative forms. Therefore, specific details disclosed herein are not to be interpreted as limiting, but merely as a representative basis for any aspect of the invention and/or as a representative basis for teaching one skilled in the art to variously employ the present invention.
Except in the examples, or where otherwise expressly indicated, all numerical quantities in this description indicating amounts of material or conditions of reaction and/or use are to be understood as modified by the word “about” in describing the broadest scope of the invention. Practice within the numerical limits stated is generally preferred. Also, unless expressly stated to the contrary: all R groups (e.g. Ri where i is an integer) include alkyl, lower alkyl, C1-6 alkyl, C6-10 aryl, or C6-10 heteroaryl; single letters (e.g., “n” or “o”) are 1, 2, 3, 4, or 5; percent, “parts of,” and ratio values are by weight; the term “polymer” includes “oligomer,” “copolymer,” “terpolymer,” and the like; molecular weights provided for any polymers refers to weight average molecular weight unless otherwise indicated; the description of a group or class of materials as suitable or preferred for a given purpose in connection with the invention implies that mixtures of any two or more of the members of the group or class are equally suitable or preferred; description of constituents in chemical terms refers to the constituents at the time of addition to any combination specified in the description, and does not necessarily preclude chemical interactions among the constituents of a mixture once mixed; the first definition of an acronym or other abbreviation applies to all subsequent uses herein of the same abbreviation and applies mutatis mutandis to normal grammatical variations of the initially defined abbreviation; and, unless expressly stated to the contrary, measurement of a property is determined by the same technique as previously or later referenced for the same property.
It is also to be understood that this invention is not limited to the specific embodiments and methods described below, as specific components and/or conditions may, of course, vary. Furthermore, the terminology used herein is used only for the purpose of describing particular embodiments of the present invention and is not intended to be limiting in any way.
It must also be noted that, as used in the specification and the appended claims, the singular form “a,” “an,” and “the” comprise plural referents unless the context clearly indicates otherwise. For example, reference to a component in the singular is intended to comprise a plurality of components.
The term “comprising” is synonymous with “including,” “having,” “containing,” or “characterized by.” These terms are inclusive and open-ended and do not exclude additional, unrecited elements or method steps.
The phrase “consisting of” excludes any element, step, or ingredient not specified in the claim. When this phrase appears in a clause of the body of a claim, rather than immediately following the preamble, it limits only the element set forth in that clause; other elements are not excluded from the claim as a whole.
The phrase “consisting essentially of” limits the scope of a claim to the specified materials or steps, plus those that do not materially affect the basic and novel characteristic(s) of the claimed subject matter.
With respect to the terms “comprising,” “consisting of,” and “consisting essentially of,” where one of these three terms is used herein, the presently disclosed and claimed subject matter can include the use of either of the other two terms.
As used herein “alkyl” refers to C1-20 inclusive, linear (i.e., “straight-chain”), branched, saturated or at least partially and in some cases fully unsaturated (i.e., alkenyl and alkynyl) hydrocarbon chains, including for example, methyl, ethyl, propyl, isopropyl, butyl, isobutyl, tert-butyl, pentyl, hexyl, octyl, ethenyl, propenyl, butenyl, pentenyl, hexenyl, octenyl, butadienyl, propynyl, butynyl, pentynyl, hexynyl, heptynyl, and allenyl groups. “Branched” refers to an alkyl group in which a lower alkyl group, such as methyl, ethyl or propyl, is attached to a linear alkyl chain. “Lower alkyl” refers to an alkyl group having 1 to about 8 carbon atoms (i.e., a C1-8 alkyl), e.g., 1, 2, 3, 4, 5, 6, 7, or 8 carbon atoms. “Higher alkyl” refers to an alkyl group having about 10 to about 20 carbon atoms, e.g., 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 carbon atoms. The alkyl group can be optionally substituted (i.e., a “substituted alkyl”) with another atom or functional group such as alkyl, substituted alkyl, halogen, aryl, substituted aryl, alkoxyl, hydroxyl, nitro, amino, alkylamino, dialkylamino, sulfate, mercapto, and the like.
As used herein “alkenyl” means a substituted or unsubstituted monovalent unsaturated hydrocarbon group that has at least one carbon to carbon double bond. The alkenyl can be linear or branched. In a refinement, the alkenyl can have 1, 2 or 3, carbon-carbon double bonds. In a refinement, the alkenyl groups have from 2 to 10 carbon atoms. Examples of alkenyl groups include, but are not limited to, ethenyl, n-propenyl, isopropenyl, n-but-2-enyl, n-hex-3-enyl and the like. The term “alkenylene” refers a divalent alkenyl group.
As used herein “alkynyl” means a monovalent unsaturated hydrocarbon group which may be linear or branched and which has a carbon-carbon triple bond. In a refinement, such alkynyl groups have from 2 to 10 carbon atoms. Representative alkynyl groups include, by way of example, ethynyl, n-propynyl, n-but-2-ynyl, n-hex-3-ynyl and the like. The term “alkynylene” means a divalent alkynyl group. Therefore, the term alkyne-containing moiety includes C2-10 alkynyl groups or C2-10 alkynylene groups. In context of the present invention, alkyne-containing moiety preferably have one carbon-carbon triple bond.
As used herein “aryl” means a monovalent aromatic hydrocarbon having a single ring (i.e., phenyl) or fused rings (i.e., naphthalene). In a refinement, such aryl groups include from 6 to 12 carbon ring atoms. In another refinement, such aryl groups include 6 to 10 carbon ring atoms. Representative aryl groups include, by way of example, phenyl biphenyl, naphthyl, anthranyl, and naphthalene-1-yl, naphthalene-2-yl, and the like. The term “arylene” means a divalent aryl group. Each aryl can be substituted with a functional group such as alkyl, substituted alkyl, halogen, aryl, substituted aryl, alkoxyl, hydroxyl, nitro, amino, alkylamino, dialkylamino, sulfate, mercapto, and the like.
As used herein “heteroaryl” means a monovalent aromatic group having a single ring or two fused rings and containing in the ring at least one heteroatom (typically 1 to 3 heteroatoms) selected from nitrogen, oxygen or sulfur. In a refinement, heteroaryl groups typically contain from 5 to 10 total ring atoms. In a refinement, heteroaryl groups have from 6 to 16 total ring atoms. In a refinement, the heteroaryl is a C5-12 heteroaryl. Examples of heteroaryl include, but are not limited to, monovalent species of pyrrole, imidazole, thiazole, oxazole, furan, thiophene, triazole, pyrazole, isoxazole, isothiazole, pyridine, pyrazine, pyridazine, pyrimidine, triazine, indole, benzofuran, benzothiophene, benzimidazole, benzthiazole, quinoline, isoquinoline, quinazoline, quinoxaline and the like, where the point of attachment is at any available carbon or nitrogen ring atom. Additional examples heteroaryl groups include, but are not limited to, furanyl, thienyl, and pridinyl group. The term “heteroarylene” means a divalent heteroaryl group. Each heteroaryl can be substituted with a functional group such as alkyl, substituted alkyl, halogen, aryl, substituted aryl, alkoxyl, hydroxyl, nitro, amino, alkylamino, dialkylamino, sulfate, mercapto, and the like.
As used herein, “polycyclic heteroaryl” means heteroaryl groups having at least 2 total rings (e.g., fused rings). In a refinement, polycyclic heteroaryl groups have from 9 to 18 total ring atoms. In another refinement, polycyclic heteroaryl groups have from 9 to 15 total ring atoms. In some refinements, the polycyclic heteroaryl groups are C6-16 polycyclic heteroaryl groups.
As used herein “alkyl aryl” means a substituted or unsubstituted functional group that includes an aromatic ring (e.g., phenyl, biphenyl, naphthyl, etc) with an attached C1-8 alkyl group.
As used herein “alkyl heteroaryl” means a substituted or unsubstituted heteroaryl group (typically, C4-15) with an attached C1-8 alkyl group.
As used herein “heteroatom-containing ring system” means a substituted or unsubstituted functional group that includes aromatic or aliphatic ring systems containing 1 to 3 heteroatoms such as nitrogen, sulfur, and oxygen. In a refinement, the heteroatom-containing ring system has from 5 to 10 total ring atoms. In a refinement, heteroaryl groups have from 6 to 16 total ring atoms. In a refinement, the heteroatom-containing ring system is a C5-12 heteroatom-containing ring system.
As used herein “leaving group” means a functional group or atom which can be displaced by another functional group or atom in a substitution reaction (e.g., a nucleophilic substitution reaction). Examples of leaving groups include, but are not limited to, chloro, bromo and iodo groups; sulfonic ester groups (e.g., tosylate, mesylate, brosylate, nosylate, etc.); and acyloxy groups (e.g., trifluoroacetoxy, etc).
Throughout this application, when a group is described as substituted, examples of substituents include, but are not limited to, nitro, cyano, halo (e.g., F, Cl, Br, I), hydroxyl, ester, carboxylate, and the like.
In an embodiment of the present invention, a compound that is useful for identifying covalent ligands is provided. The compound of this embodiment is described by formula I or formula II:
wherein:
Rf1 is an alkyne-containing moiety (e.g., an ethynyl-containing moiety) or azido-containing moiety;
X1 is NR or O or X1 is absent with LK being directly bonded to Pc where R is H or C1-6 alkyl;
LK1, LK2 are each independently a hydrocarbon-containing linking group;
PC1 is aryl or heteroaryl;
PC2 is an alkyl, aryl, heteroaryl, cycloalkyl, or heteroatom-containing ring system; and
Lm is a leaving group-containing moiety or an electrophilic moiety.
In a refinement of the present embodiment, PC1 and/or PC2 are each independently a C6-12 aryl, or C5-12 heteroaryl. In another refinement, PC1 and/or PC2 are each independently a polycyclic heteroaryl (e.g., a C7-12 polycyclic heteroaryl). In another refinement of the present embodiment, PC1 and/or PC2 are each independently:
where hydrogens are replaced with Lm, Lk1, Lk2, and/or Rf1.
In some variations, LK1 or LK2 are each independently a substituted or unsubstituted alkyl, a substituted or unsubstituted aryl, a substituted or unsubstituted heteroaryl. In a refinement, LK1 or LK2 are each independently a substituted or unsubstituted C1-12 alkyl, substituted or unsubstituted C6-12 aryl, or substituted or unsubstituted C5-12 heteroaryl.
In most variations, Lm is Cl, CH═CH2, or N═C═S. In a refinement, Rf1 is:
where n is 1-6 and R1 is H or a C1-6 alkyl. In another variation, Rf1 is N3 or
and n is 1-6. Lm can be halo or CH(X2)R1 where X2 is a leaving group and R1 is H or C1-6 alkyl.
In a variation of the present embodiment, the compound that is useful for identifying covalent ligands is described by formula III:
wherein:
R2 is H, halo, C1-6 alkyl, nitro, or cyano;
R3 is halo, C1-6 alkyl, nitro, cyano or aryl; and
p is 0, 1, 2, 3, or 4.
In another variation of the present embodiment, the compound that is useful for identifying covalent ligands is described by formula IV:
In another variation of the compound that is useful for identifying covalent ligands, is described by formula V:
wherein X2 is a leaving group or an electrophilic functional group.
In another variation of the compound, that is useful for identifying covalent ligands, is described by formula VI:
wherein R2 is halo, C1-6 alkyl, nitro, or cyano;
R3 is C1-12 alkyl, C6-12 aryl, C5-12 heteroaryl, C2-24 alkyl ether groups, C12-24 aryl ether groups, or C12-24 aryl alkyl ether groups; and
p is 0, 1, 2, or 3.
In a refinement, Rf1 is
n is 1-6, and R5 is H or C1-6 alkyl. In another refinement of the present embodiment, Rf1 is N3 or
and n is 1-6.
In a variation of the present embodiment, the compound that is useful for identifying covalent ligands is described by formula VII:
wherein R5, R6, R7 are each independently halo, C1-6 alkyl, nitro, or cyano.
In a variation of the present embodiment, the compound is described by formula VIII:
wherein:
n is 1-6; and
R5 is H or C1-6 alkyl.
In another variation of the compound, that is useful for identifying covalent ligands, is described by formula IX:
wherein R5, R6, R7 are each independently halo, C1-6 alkyl, nitro, or cyano.
In a variation of the present embodiment, the compound that is useful for identifying covalent ligands is described by formula X:
wherein:
n is 1-6; and
R5 is H or C1-6 alkyl.
In another embodiment of the present invention, a method of finding covalent ligands is provided. The method comprises contacting a protein-containing sample with any of the above-mentioned compounds to form a modified protein; and contacting the modified protein with a probe compound that includes an alkyne-containing moiety Rf1 or azido-containing moiety Rf2, with the proviso that when Rf1 is an alkyne-containing moiety, Rf2 is an azido-containing moiety or when Rf2 is an alkyne-containing moiety, Rf1 is an azido-containing moiety wherein Rf1 reacts with Rf2 to form a ring:
In a refinement of the method of finding covalent ligands, the compound includes a fluorophore.
The following examples illustrate the various embodiments of the present invention. Those skilled in the art will recognize many variations that are within the spirit of the present invention and scope of the claims.
Quinolines and their derivatives are one of the most important classes of heterocyclic compounds. According to the DrugBank, 47 FDA-approved drugs contain a quinoline core. These quinoline-containing drugs exhibit a broad range of medicinal activities including antimalarial, antibacterial, antihypertensive, anti-HIV, antitubercular, anticancer, and antiasthmatic activity. The quinoline thus represents a “privileged” pharmacophore in medicine. 2-Chloromethylquinolines (2-CMQ) contain an electrophile of chloromethyl group attached at the 2 position of the quinoline ring. Herein, we developed a novel condensation reaction to synthesize a panel of tripartite 2-CMQs that contain a quinoline core, an electrophilic ‘warhead’ and an alkyne reporter. These 2-CMQs were evaluated for their covalent interactions with the whole proteome. The alkyne tag facilitated the use of copper-assisted azide-alkyne cycloaddition (CuAAC) for in-gel fluorescence analysis and mass-spectrometry-based identification of protein targets (
A limited number of methods were reported for the preparation of 2-CMQ in the past. For example, 2-CMQs can be synthesized from quinaldines by converting them to their N-oxides, followed by the reaction with sulfonyl chlorides. A second method is the radical-mediated chlorination reaction of 2-methylquinolines with N-chlorosuccinimide. However, this method suffers from the formation of multiple side products including dichlorinated, trichlorinated, and other undesired side products. Chlorination of 2-methylquinolines using chlorine as the halogen source has benefits of low cost and high yield, but chlorine is an irritating toxic gas. Although 2-CMQ can be prepared from 2-quinolinemethanols by an alternative chlorination approach, the starting materials 2-quinolinemethanols are usually not commercially available. Chlorination of 2-methylquinolines into 2-(chloromethyl)quinolines in the tetrabutylammonium iodide and 1,2-dichloroethane system, via in situ generated ICl, has been developed recently. Unfortunately, all of the above methods are either incompatible with alkyne functional group or require multi-step synthesis to produce the tripartite quinolines (
To fill in the gap, we have developed a one-step condensation method for the preparation of 2-chloromethylquinolines. In this approach, PCl5 is used to condense two molecules of arylacetamide, a readily available material with tremendous choices of structural diversity, to construct the quinoline core, and meantime installs both chloromethyl electrophile and alkyne tag. By changing the substitution position and pattern, we can attach the alkyne tag to various positions in the N-arylacetamide and alter geometries in the probes. In a similar manner, a non-alkyne substituent can also be introduced to broaden the diversity and functionality in the probes.
We first examined the proteome reactivity of a panel of 2-CMQs (
We then chose probe 6 and probe 15 as exemplary probes for further investigation of their proteomic targets. These two probes produced different labeling patterns as band C was labeled by probe 15 but not probe 6 (
We then set out to identify the target proteins that were covalently labeled by the 2-CMQ probes. HEK293H cells were incubated with mock or 10 μM of 15c for 1 hr before further incubation with 3 μM of probe 15 for 1 more hr at 37° C. The cells were then lysed, conjugated with biotin-azo-azide via CuAAC, and pulled down using streptavidin agarose beads. Addition of sodium dithionite cleaved the target proteins off the beads, which were resolved by SDS-PAGAE and visualized by coomassie staining (
Transient transfection of FLAG-tagged wild-type PTGES2 soluble fragment led to enhanced fluorescent band at ˜37 kDa, confirming PTGES2 as the target protein responsible for band D (
In conclusion, we have developed a novel condensation reaction to prepare 2-chloromethylquinoline, a new electrophile for chemoproteomic studies. These chloromethylquinolines covalently and potently bind to a number of cellular protein targets including Prostaglandin E Synthase 2, a critical regulator of cell proliferation, apoptosis, angiogenesis, inflammation, and immune surveillance. The novel PTGES2 inhibitors that we discovered using chemoproteomics have the potential to serve as novel therapies for the treatment of human diseases such as inflammation. Beyond this case study, our methodology can be extended to other electrophiles and scaffolds for discovering covalent ligands for diverse targets in the whole proteome.
While exemplary embodiments are described above, it is not intended that these embodiments describe all possible forms of the invention. Rather, the words used in the specification are words of description rather than limitation, and it is understood that various changes may be made without departing from the spirit and scope of the invention. Additionally, the features of various implementing embodiments may be combined to form further embodiments of the invention.
This application claims the benefit of U.S. provisional application Ser. No. 62/403,925 filed Oct. 4, 2016, the disclosure of which is hereby incorporated in its entirety by reference herein.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2017/055074 | 10/4/2017 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62403925 | Oct 2016 | US |